cxcscmu

Popular repositories

  1. Craw4LLM Public

    Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"

    Python 588 52

  2. RAGViz Public

    Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]

    TypeScript 82 11

  3. MATES Public

    Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]

    Python 61 7

  4. Montessori-Instruct Public

    Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]

    Python 42 3

  5. ED-Copilot Public

    Python 6 1

  6. FactMM-RAG Public

    Official repository for FactMM-RAG: Fact-Aware Multimodal Retrieval Augmentation for Accurate Medical Radiology Report Generation [NAACL 2025]

    Python 4

Repositories

Showing 10 of 11 repositories
  • FactMM-RAG Public

    Official repository for FactMM-RAG: Fact-Aware Multimodal Retrieval Augmentation for Accurate Medical Radiology Report Generation [NAACL 2025]

    Python 4 MIT 0 0 1 Updated Mar 14, 2025
  • embedding-scope Public

    Interpret and control dense embedding via sparse autoencoder.

    Python 3 MIT 0 0 0 Updated Mar 5, 2025
  • Craw4LLM Public

    Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"

    Python 588 MIT 52 3 0 Updated Feb 24, 2025
  • CLUE-LLM-site Public
    TypeScript 0 0 0 0 Updated Feb 11, 2025
  • Montessori-Instruct Public

    Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]

    Python 42 MIT 3 0 0 Updated Jan 24, 2025
  • RAGViz Public

    Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]

    TypeScript 82 MIT 11 1 0 Updated Jan 18, 2025
  • MATES Public

    Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]

    Python 61 MIT 7 3 0 Updated Nov 14, 2024
  • esae Public
    Python 0 0 0 0 Updated Oct 29, 2024
  • InContextDataAttribution
    Python 1 0 0 0 Updated Oct 23, 2024
  • ED-Copilot Public
    Python 6 1 0 0 Updated Aug 23, 2024
