Popular repositories

  1. Tune-A-Video Public

    [ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

    Python, 4.3k stars, 386 forks

  2. Awesome-Video-Diffusion Public

    A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

    3.5k stars, 203 forks

  3. Show-1 Public

    [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

    Python, 1.1k stars, 62 forks

  4. Show-o Public

    Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

    Python, 1k stars, 44 forks

  5. MotionDirector Public

    [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

    Python, 850 stars, 54 forks

  6. Image2Paragraph Public

    [A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.

    Python, 791 stars, 54 forks

Repositories

Showing 10 of 71 repositories
  • Awesome-Video-Diffusion Public

    A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

    3,470 stars, 203 forks, 1 open issue, 1 open pull request, updated Nov 21, 2024
  • computer_use_ootb Public

    An out-of-the-box (OOTB) version of Anthropic Claude Computer Use for Windows and macOS

    Python, 554 stars, MIT license, 49 forks, 5 open issues, 3 open pull requests, updated Nov 21, 2024
  • VideoLISA Public

    [NeurIPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos

    Python, 51 stars, Apache-2.0 license, 1 fork, 2 open issues, 0 open pull requests, updated Nov 21, 2024
  • Awesome-GUI-Agent Public

    💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

    243 stars, 11 forks, 0 open issues, 0 open pull requests, updated Nov 20, 2024
  • Show-o Public

    Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

    Python, 1,030 stars, Apache-2.0 license, 44 forks, 32 open issues, 0 open pull requests, updated Nov 19, 2024
  • ShowUI Public
    10 stars, 0 forks, 0 open issues, 0 open pull requests, updated Nov 17, 2024
  • Show-1 Public

    [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

    Python, 1,103 stars, 62 forks, 8 open issues, 7 open pull requests, updated Nov 15, 2024
  • BoxDiff Public

    [ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion

    Python, 253 stars, 17 forks, 7 open issues, 0 open pull requests, updated Nov 12, 2024
  • sparseformer Public

    (ICLR 2024, CVPR 2024) SparseFormer

    Python, 63 stars, MIT license, 2 forks, 1 open issue, 0 open pull requests, updated Nov 10, 2024
  • LOVA3 Public

    (NeurIPS 2024) Learning to Visual Question Answering, Asking and Assessment

    Python, 64 stars, 1 fork, 0 open issues, 0 open pull requests, updated Nov 7, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.
