Official PyTorch implementation of ZiRa, a method for Incremental Vision-Language Object Detection (IVLOD).
If you have a CUDA environment, please make sure the environment variable CUDA_HOME
is set. We recommend using Python 3.9.
```sh
pip install -e .
```
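For reference, a minimal end-to-end setup might look like the sketch below. The conda environment name and the CUDA path are assumptions; adapt them to your machine.

```sh
# Sketch of an assumed setup; the env name "zira" and the CUDA path are illustrative.
conda create -n zira python=3.9 -y
conda activate zira

# Point CUDA_HOME at your CUDA toolkit (the path varies by system).
export CUDA_HOME=/usr/local/cuda

# Install this repo in editable mode.
pip install -e .
```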
Download the COCO dataset and all ODinW sub-datasets into a folder named "dataset". Then download the GroundingDINO pre-trained model from this repo. The ODinW sub-datasets can be downloaded by running this script:
```sh
python download.py
```
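After downloading, a layout along the following lines should be in place; the exact sub-folder names are assumptions, so check them against the configs in this repo.

```
dataset/
├── coco/     # COCO images and annotations
└── odinw/    # ODinW sub-datasets fetched by download.py
```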
Run the training script:
```sh
sh train_odinw13_zira.sh
```
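To keep a record of the run, you can tee the console output to a file; the log file name below is illustrative.

```sh
# Train on the ODinW-13 benchmark and save a copy of the console output.
sh train_odinw13_zira.sh 2>&1 | tee train_odinw13_zira.log
```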
If you find our work helpful for your research, please consider citing the following BibTeX entries.
```bibtex
@article{DBLP:journals/corr/abs-2403-01680,
  author     = {Jieren Deng and Haojian Zhang and Kun Ding and Jianhua Hu and Xingxuan Zhang and Yunkuan Wang},
  title      = {Zero-shot Generalizable Incremental Learning for Vision-Language Object Detection},
  journal    = {CoRR},
  volume     = {abs/2403.01680},
  year       = {2024},
  url        = {https://doi.org/10.48550/arXiv.2403.01680},
  doi        = {10.48550/ARXIV.2403.01680},
  eprinttype = {arXiv},
  eprint     = {2403.01680},
  timestamp  = {Tue, 02 Apr 2024 16:35:34 +0200},
  biburl     = {https://dblp.org/rec/journals/corr/abs-2403-01680.bib},
  bibsource  = {dblp computer science bibliography, https://dblp.org}
}

@article{liu2023grounding,
  title   = {Grounding DINO: Marrying DINO with Grounded Pre-training for Open-Set Object Detection},
  author  = {Liu, Shilong and Zeng, Zhaoyang and Ren, Tianhe and Li, Feng and Zhang, Hao and Yang, Jie and Li, Chunyuan and Yang, Jianwei and Su, Hang and Zhu, Jun and others},
  journal = {arXiv preprint arXiv:2303.05499},
  year    = {2023}
}
```