-
Sichuan University
- Chengdu, Sichuan
Pinned Loading
-
DAMO-NLP-SG/VCD
DAMO-NLP-SG/VCD Public[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
-
DAMO-NLP-SG/Video-LLaMA
DAMO-NLP-SG/Video-LLaMA Public[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
-
DAMO-NLP-SG/VideoLLaMA2
DAMO-NLP-SG/VideoLLaMA2 PublicVideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
-
DAMO-NLP-SG/Inf-CLIP
DAMO-NLP-SG/Inf-CLIP PublicThe official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss". A super memory-efficiency CLIP training scheme.
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.