zoubohao/TaCoGPT

TaCoGPT

PyTorch

Description

TaCoGPT is a large language model for the taxonomic classification of microbial sequences.

TODO:

Write interface functions for training and inference.

How to run

Install the requirements listed in the requirements.txt file.
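A minimal sketch of the install step, assuming the commands are run from the repository root (where requirements.txt is expected to live) and that pip is available for the active Python interpreter. The REQ_FILE variable is only an illustrative name, not something the repository defines.

```shell
# Assumption: the current directory is the repository root.
REQ_FILE="requirements.txt"

if [ -f "$REQ_FILE" ]; then
    # Install the pinned dependencies into the active Python environment.
    python -m pip install -r "$REQ_FILE"
else
    # Give a hint instead of failing silently when run from the wrong directory.
    echo "$REQ_FILE not found; run this from the repository root" >&2
fi
```

Installing into a dedicated virtual environment (for example one created with python -m venv) keeps these dependencies separate from other projects, but that is a general practice rather than something this README requires.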


Pre-train the model:

python script_pretrain_TaCoGPT_ddp.py

Fine-tune the model on all data:

python script_train_TaCoGPT_ddp_All.py

Fine-tune the model on data with clades removed:

python script_train_TaCoGPT_ddp_Rm.py
