-
Notifications
You must be signed in to change notification settings - Fork 97
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
8 new BA.2 sequences with private mutations from south africa (September-October 2023) #2484
Comments
Thx , First spotted by @Outpfmance yesterday. |
@dikeledik in the rbd it has also S:K417T , S:K444N, S:V445G and S:L452M not mentioned above please add them,. |
9 now, |
Interesting it has Orf1b:G662S as the other saltation ( defining mutation of Delta BA.2.75 XBB) |
Note, there is no reversion T9866C as this one very likely stems from a ba.2 without C9866T, this can be seen in a lot of the early south african ba.2 samples. Same goes for ba.2.86 and the ba.2.15 saltation singlet. |
@corneliusroemer the most recent sample is misplaced due to dropouts , so they all have Orf1b:G662S, S:S691P and M:A104V and none of them as T28306C so the whole branch is mistakenly put there in this flip flop branch (@AngieHinrichs ) |
Thx! yes exactly. |
Recommending for designation - even if it doesnt grow any further its an interetsing lineage from an evolutionary perspective and hopefully designating will make it easier to pick up in ongoing surveillence data (particularly outside of South Africa) |
Thanks everyone, I'll designate this |
From first look : C21618T and A23040Gthat are part of BA.2 been reversed but for the greater branch before it G23948T that is in BA.2 was here changed to C instead So - this might be BA.6 ? |
Good remark from @thomasppeacock - All are Non Synonymous. most in S1. so less likely BA.6 |
https://twitter.com/Tuliodna/status/1752996494905414026 Dr. de Oliveira mentioned on twitter that NICD has been aware of 8 sequences with the lated one sampled on Nov 29th 2023, and all those 8 sequences are labeled 'baseline surveillance' on GISAID. Also they are the only sequences with such a label from South Africa since October 2023. Meanwhile, EPI_ISL_18845398, the latest sequence, seems a bit different. It's not among the 8 sequences Dr. de Oliveira mentioned; it's labeled as pneumonia surveillance instead of baseline surveillance; and it's uploaded in a different batch. There have been only 5 pneumonia surveillance sequences samped since December 2023. So I'd guess it could still be in circulation? |
full charecterization of this clade + comparison to BA.2. based on our Mutation prevalence and % tools and the tree in USHER. |
Totally unique E mutation (Again !!) E:F23V never before seen anywhere as far as I know |
Yes, two of the 9 were sampled through pneumonia surveillance, the latest one sampled in December. |
This is not the first time we've seen S:V642G, S:S691P, S:T791I together: |
The only two previous variants I'm aware of that were infectious and also deleted (not just shifted) the disulfide bond between S:C15 and S:C136 are C.1.2 and B.1.640. S:R190S isn't rare in general, but it's rare enough that I'm surprised to see it happened 3/3 times here. |
BA.2.87.1 convergence analysis.xlsx Here's the convergence analysis for BA.2.87.1 This data could not be calculated without the data of Usher from @AngieHinrichs. Thanks Angie, from me and evefryone in my team. |
I'd say it is very uncommon, but not completely new: there are around 60 sequences in GISAID showing the E:F23V mutation over the past years. |
thx to heads up from @Over-There-Is now the most recent sample is separated from the other not just by the three dropouts but also by several nucleotides cc @corneliusroemer : |
I think the US traveller surveillance picked up BA.2.87.1 already in October 2023:
This sample is spotty, it has a lot of missing segments (~70% coverage). But it's remarkable that despite the low coverage it shares 9 SNPs (vs. BA.2) with BA.2.87.1:
This is very unlikely due to chance. It looks like the US sample was at least a coinfection of EG.5 and BA.2.87.1 One additional SNP is shared only with one of the 9 SA samples ( On top of the 9/10 SNPs shared with BA.2.87.1, there are 5 reversion compared to EG.5 making it even less unlikely to be due to chance:
The annotation for the sample is: This shows how valuable traveler surveillance is. Maybe Gingko Bioworks can share the raw reads so we can see whether there's further evidence in the N stretches. The spike in particular doesn't look very much like BA.2.87 and much more like EG.5, so the sample wasn't notable until BA.2.87.1 popped up as a distinctive cluster in high quality South African samples. But in retrospect, it seems pretty clear that this US travel surveillance sample is the earliest known BA.2.87.1 (thus far). I found this sequence by doing an advanced covSpectrum search using the "5-of" operator on the list of nuc mutations unique to BA.2.87.1 among all BA.2 descendants. Previous searches I had done didn't specifically use only the "unique" mutations meaning the searches were less discriminative. Here's a query similar to the one I used: https://cov-spectrum.org/explore/World/AllSamples/AllTimes/variants?variantQuery=%5B5-of%3AC66T%2C+A309G%2C+G542A%2C+C4012T%2C+G4354T%2C+T4379G%2C+C7296T%2C+T7480C%2C+G9190T%2C+A10037G%2C+C10702T%2C+C12008T%2C+A12791C%2C+C12896T%2C+C15925T%2C+C17644A%2C+C18744T%2C+T19364C%2C+C19698A%2C+G21604A%2C+G21786A%2C+T21939C%2C+T22552C%2C+T22896G%2C+T23005G%2C+T23487G%2C+T23633C%2C+G23761A%2C+C23934T%2C+C25487A%2C+T26311G%2C+C28256T%2C+T28474C%2C+A28715G%5D& Nextclade picture showing spottiness of the sample and nuc mutations: |
Oh, wow, I'd noticed that sequence had N:T148A, ORF1a:K4176Q, and ORF3a:T32N when no other sequence before BA.2.87.1 had ever had any two of those three. ORF1a:K4176Q had only been in about 90 sequences ever, while N:T148A and ORF3a:T32N have only ever been in ~1000 sequences each. |
EPI_ISL_18849985 has a collection date of 20 September. |
You are totally right! I had missed the September sample! |
Given that GBW does a lot of pooled sequencing—which they then upload to GISAID against database guidelines—this sample is almost certainly a badly contaminated one. I think you've proven beyond doubt that BA.2.87.1 was present in this sequence. But I doubt if a company that clearly has no interest in competently doing the job they're being paid tens of millions of dollars to do will be willing or capable of helping us learn more about one of their embarrassingly bad sequences. |
Accessions and direct link from the SPHERES call chat (IIRC from Cindy Friedman, CDC):
|
This is what the raw reads look like in IGV after some minimal processing (trimming, mapping). Note the coverage is on a log scale going up to 300k. Some observations:
These leads me to the conclusion that what we're seeing is a sample that's clearly BA.2.87.1, but where amplification failed for almost all amplicons. The host was either coinfected with XBB or there was some low-coverage contamination in the regions of the failed amplicons. I'd be curious what other people conclude from the raw reads. It might be helpful to have a list of all mutations in BA.2 (the most recent common ancestor of both XBB and BA.2.87.1) and the mutations that distinguish BA.2.87.1 from BA.2, and those that distinguish XBB from BA.2 All BA.2 mutations (SNPs):
BA.2.87.1 mutations on top of BA.2:
XBB mutations on top of BA.2:
I got these from the prototypical pango sequence tree I maintain here: https://nextstrain.org/staging/nextclade/sars-cov-2 |
i think @shay671 has it |
Well... 3 wastewater samples are uploaded today from Bangkok, Thailand, collected from Dec 2023-Jan 2024. |
@corneliusroemer, I was looking back on this sequence today, and the GISAID file now says it was from a female traveler from India. Do we know when or why this was changed? Has the metadata from the SRA sequence also been changed? |
originally:
|
Good day everyone.
9 new sequences of a BA.2 strain were detected. Samples were collected between September and November 2023 from Gauteng, Mpumalanga and Limpopo, South Africa. NextClade assigns to clade 21L. Pangolin assigns to lineage BA.2. Sequences have two long deletions on the spike (del_15-27 and del_136-146) as well as mutations in the RBD: S:K417T, S:K444N, S:V445G and S:L452M. Relative to BA.2, this cluster has >30 non-synonymous substitutions (concentrated in spike) and 7 deletions (3 in spike). The new BA.2 is basal and branches off directly from the ancestral BA.2.
Accession numbers:
hCoV-19/SouthAfrica/NICD-R13200/2023 EPI_ISL_18849984
hCoV-19/SouthAfrica/NICD-N56614/2023 EPI_ISL_18849985
hCoV-19/SouthAfrica/NICD-N56836/2023 EPI_ISL_18849986
hCoV-19/SouthAfrica/NICD-N57176/2023 EPI_ISL_18849987
hCoV-19/SouthAfrica/NICD-N57208/2023 EPI_ISL_18849988
hCoV-19/SouthAfrica/NICD-N57216/2023 EPI_ISL_18849989
hCoV-19/SouthAfrica/NICD-N57440/2023 EPI_ISL_18849990
hCoV-19/SouthAfrica/NICD-N57469/2023 EPI_ISL_18849991
hCoV-19/South Africa/NICD-R13515/2023 EPI_ISL_18845398
Mutations observed on branch:
Nt
Unique (43):C541T, G542A, C2536T, C4012T, G4354T, T4379G, C5184T, C7296T, T7480C, C7764T, G9190T, T9866C, A10037G, C10702T, C12008T, A12791C, C12896T, G15451A, C16575T, C17644A, C18744T, T19364C, C19698A, G21786A, C21855T, T21939C, G22017T, G22132T, T22552C, T23005G, T23487G, A23598G, T23633C, G23761A, C23934T, T23948C, A24369G, C25487A, T26311G, G26529A, C26833T, C28045T, A28715G
Homoplasies (3):A11782G, A22206G, C23423T
Reversions to root (1):T9866C
ORF1ab
Unique (14):E93K, E1363D, S1372A, P1640L, A2344V, S2500F, F3201L, T3258A, L3915F, K4176Q, P4211S, G5063S, Q5794K, L6367S
Reversions to root (1):F3201L
S
Unique (12):G75D, S98F, V126A, W152L, R190S, N481K, V642G, K679R, S691P, T791I, Y796H, D936G
Homoplasies (2):D215G, P621S
ORF3a
Unique (1):T32N
E
Unique (1):F23V
M
Unique (2):D3N, A104V
ORF8
Unique (1):A51V
N
Unique (1):T148A
Usher places it on a branch next to BA.2.83: (Mutations observed on branch: Homoplasies (1):A22995C, Reversions to root (1):A22995C, Homoplasies (1):S:K478T, Reversions to root (1):S:K478T)
Usher tree below: https://nextstrain.org/fetch/genome.ucsc.edu/trash/ct/subtreeAuspice1_genome_24faa_9de150.json
Downsampled global tree:
https://nextstrain.org/fetch/genome.ucsc.edu/trash/ct/singleSubtreeAuspice_genome_24faa_9de150.json
Nextclade
Deletions in the spike
The text was updated successfully, but these errors were encountered: