Skip to content
Change the repository type filter

All

    Repositories list

    • fastmlx

      Public
      FastMLX is a high performance production ready API to host MLX models.
      Python
      Other
      23219171Updated Nov 20, 2024Nov 20, 2024
    • Developer resources to work with Arcee models on AWS
      Jupyter Notebook
      Apache License 2.0
      1700Updated Nov 19, 2024Nov 19, 2024
    • mergekit

      Public
      Tools for merging pretrained large language models.
      Python
      GNU Lesser General Public License v3.0
      4394.8k17716Updated Nov 19, 2024Nov 19, 2024
    • Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
      TypeScript
      Other
      7.6k202Updated Nov 11, 2024Nov 11, 2024
    • DALM

      Public
      Domain Adapted Language Modeling Toolkit - E2E RAG
      Python
      Apache License 2.0
      4031165Updated Nov 8, 2024Nov 8, 2024
    • DAM

      Public
      Python
      64111Updated Nov 6, 2024Nov 6, 2024
    • optillm

      Public
      Optimizing inference proxy for LLMs
      Python
      Apache License 2.0
      128200Updated Nov 5, 2024Nov 5, 2024
    • Open-WebUI adaptation for Arcee model deployments
      Svelte
      MIT License
      5.8k002Updated Nov 5, 2024Nov 5, 2024
    • EvolKit

      Public
      EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language Models (LLMs).
      Jupyter Notebook
      MIT License
      2218102Updated Oct 30, 2024Oct 30, 2024
    • A framework for few-shot evaluation of language models.
      Python
      MIT License
      1.9k000Updated Oct 28, 2024Oct 28, 2024
    • Optimizing inference proxy for LLMs
      Python
      Apache License 2.0
      128000Updated Oct 25, 2024Oct 25, 2024
    • tau-bench

      Public
      Code and Data for Tau-Bench
      Python
      MIT License
      25000Updated Oct 22, 2024Oct 22, 2024
    • entropix

      Public
      Entropy Based Sampling and Parallel CoT Decoding
      TypeScript
      Apache License 2.0
      311300Updated Oct 16, 2024Oct 16, 2024
    • The Arcee client for executing domain-adpated language model routines https://pypi.org/project/arcee-py/
      Python
      52572Updated Oct 8, 2024Oct 8, 2024
    • Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
      Python
      Apache License 2.0
      202001Updated Sep 23, 2024Sep 23, 2024
    • An Open Source Toolkit For LLM Distillation
      Python
      GNU Affero General Public License v3.0
      3835951Updated Sep 17, 2024Sep 17, 2024
    • Shell
      1000Updated Sep 10, 2024Sep 10, 2024
    • chat-ui

      Public
      TypeScript
      Apache License 2.0
      1.1k001Updated Aug 30, 2024Aug 30, 2024
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      4.6k001Updated Jul 31, 2024Jul 31, 2024
    • Ongoing research training transformer models at scale
      Python
      Other
      2.4k000Updated Jul 19, 2024Jul 19, 2024
    • axolotl

      Public
      Go ahead and axolotl questions
      Python
      Apache License 2.0
      876001Updated Jul 18, 2024Jul 18, 2024
    • The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
      Python
      Apache License 2.0
      103000Updated Jul 12, 2024Jul 12, 2024
    • domain adapted MOE training
      Python
      Other
      2.4k002Updated Jul 1, 2024Jul 1, 2024
    • A block pruning framework for LLMs.
      Python
      2100Updated Jun 20, 2024Jun 20, 2024
    • The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
      Python
      Apache License 2.0
      103100Updated May 24, 2024May 24, 2024
    • Python
      0500Updated May 6, 2024May 6, 2024
    • PruneMe

      Public
      Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
      Python
      2619600Updated Apr 23, 2024Apr 23, 2024
    • Automatically evaluate your LLMs in Google Colab
      Python
      MIT License
      91200Updated Apr 15, 2024Apr 15, 2024
    • The repository contains all the set-up required to execute trainium training jobs.
      Python
      2400Updated Mar 22, 2024Mar 22, 2024
    • Arcee docs repository
      0000Updated Feb 15, 2024Feb 15, 2024