Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Where can I find the relabel mapping files? #25

Open
raytaoliu opened this issue Sep 16, 2024 · 10 comments
Open

Where can I find the relabel mapping files? #25

raytaoliu opened this issue Sep 16, 2024 · 10 comments

Comments

@raytaoliu
Copy link

Hi,
I am trying to compare wgs and 16s from the same samples. I use qiime2 with silva to process 16S data and I use Woltka with WoL to process WGS data, but when I generate trees, the labels are not the same, can I relabel one of them to the same format? Thanks!

@wasade
Copy link
Member

wasade commented Sep 16, 2024

Hi @lrt666, the SILVA 16S taxonomy and phylogeny is not compatible with the Web of Life. I recommend processing the data using a common reference and phylogeny like Greengenes2.

@raytaoliu
Copy link
Author

Thanks for your quick response! I tried to process 16S with Greengenes2 using QIIME2 but can we use QIIME2 to process WGS data with Greengenes2 or we need to assign Greengenes2 database to other tools like Woltka?

@wasade
Copy link
Member

wasade commented Sep 16, 2024

Greengenes2 includes a subset of the Web of Life version 2. We've only applied Woltka, via the SHOGUN parameter set with bowtie2 (see https://github.com/qiita-spots/qp-woltka/blob/e62a5550c96b0656cee853d96a593a2c43555f8e/qp_woltka/woltka.py#L179-L183), for use with Greengenes2. That said, it's plausible other methods of mapping to the Web of Life would work -- what's needed is to produce a feature table expressed using the Web of Life version 2 OGU identifiers. That table can then be filtered using q2-greengenes2.

Please redirect questions about QIIME 2 though to its forum.

@raytaoliu
Copy link
Author

Hi, thanks very much for your patience, I still have one more question: Is there any guidance for us how to use Woltka with Greengenes2?

@raytaoliu
Copy link
Author

Hi,
I successfully processed Woltka ogu table with greengenes, but when I processed V4 data, can I relabel asv to ogu identifier or id to asv?

@wasade
Copy link
Member

wasade commented Sep 24, 2024

With woltka, and assuming the OGU table is mapped against WoLr2, it's just a filter-features action.

For V4 ASVs, they already exist in the phylogeny. The only trick is to use the correct taxonomy and/or phylogeny artifact -- if the ASVs are expressed as sequences, then use the .asv files. If they are hashed then use the .md5

@raytaoliu
Copy link
Author

Thanks very much for your quick response and patience! I would like to use V4 region ASVs from my dataset and output a feature table where the feature IDs are formatted like GXXXXX, similar to the format used in Woltka, is that possible?

@wasade
Copy link
Member

wasade commented Sep 24, 2024

The IDs associated with the Web of Life r2 database are derived from the Genbank/RefSeq accessions. There is no way to construct those from ASVs, but there also isn't a need as at the end of the day they are just identifiers

@raytaoliu
Copy link
Author

Thanks for your clarification, I am processing the same dataset with both 16S and Shotgun data, I want to compare them from the perspective of phylogenetic tree so I want to align their features id.

@wasade
Copy link
Member

wasade commented Oct 8, 2024

The feature ID namespace for 16S and shotgun will be different. But as long as the IDs map onto Greengenes2 phylogeny tips, you should be able to proceed with a phylogenetic evaluation.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants