A high-throughput and memory-efficient inference and serving engine for LLMs
-
Updated
Jul 5, 2024 - Python
A high-throughput and memory-efficient inference and serving engine for LLMs
☁️ Build multimodal AI applications with cloud-native stack
<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint in the cloud.
The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!
🔮 SuperDuperDB: Bring AI to your database! Build, deploy and manage any AI application directly with your existing data infrastructure, without moving your data. Including streaming inference, scalable model training and vector search.
ZenML 🙏: Build portable, production-ready MLOps pipelines. https://zenml.io.
🐢 Open-Source Evaluation & Testing for LLMs and ML models
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
🤖Self-Modifying Framework from the Future 🔮 World's First AMS
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Open-source observability for your LLM application, based on OpenTelemetry
No-code multi-agent framework to build LLM Agents, workflows and applications with your data
Composio equips agents with well-crafted tools empowering them to tackle complex tasks
Add a description, image, and links to the llmops topic page so that developers can more easily learn about it.
To associate your repository with the llmops topic, visit your repo's landing page and select "manage topics."