Skip to content
@deepinfra

Deep Infra

Inference cloud

Popular repositories Loading

  1. deepctl deepctl Public

    Command line tool for Deep Infra cloud ML inference service

    Rust 33 3

  2. deepinfra-node deepinfra-node Public

    Official TypeScript wrapper for DeepInfra Inference API

    TypeScript 17 3

  3. text-generation-inference text-generation-inference Public

    Forked from huggingface/text-generation-inference

    Large Language Model Text Generation Inference

    Python 9 2

  4. ocr-tools ocr-tools Public

    Python 5 2

  5. langchain langchain Public

    Forked from langchain-ai/langchain

    ⚡ Building applications with LLMs through composability ⚡

    Python 1

  6. deepinfra-chat deepinfra-chat Public

    Sample Next.js ai chat app using Deep Infra inference and Vercel ai sdk

    TypeScript 1 2

Repositories

Showing 10 of 36 repositories
  • TensorRT-LLM Public Forked from NVIDIA/TensorRT-LLM

    TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

    deepinfra/TensorRT-LLM’s past year of commit activity
    C++ 0 1,921 0 1 Updated Dec 1, 2025
  • docs Public
    deepinfra/docs’s past year of commit activity
    MDX 0 MIT 0 0 0 Updated Nov 18, 2025
  • openbench Public Forked from groq/openbench

    Provider-agnostic, open-source evaluation infrastructure for language models

    deepinfra/openbench’s past year of commit activity
    Python 0 MIT 92 0 0 Updated Nov 13, 2025
  • huggingface.js Public Forked from huggingface/huggingface.js

    Use Hugging Face with JavaScript

    deepinfra/huggingface.js’s past year of commit activity
    TypeScript 0 MIT 567 0 0 Updated Oct 30, 2025
  • vllm Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    deepinfra/vllm’s past year of commit activity
    Python 0 Apache-2.0 11,796 0 0 Updated Oct 22, 2025
  • sglang Public Forked from sgl-project/sglang

    SGLang is a fast serving framework for large language models and vision language models.

    deepinfra/sglang’s past year of commit activity
    Python 0 Apache-2.0 3,592 0 0 Updated Oct 14, 2025
  • SpecForge Public Forked from sgl-project/SpecForge

    Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

    deepinfra/SpecForge’s past year of commit activity
    Python 0 MIT 113 0 0 Updated Oct 8, 2025
  • Roo-Code Public Forked from RooCodeInc/Roo-Code

    Roo Code gives you a whole dev team of AI agents in your code editor.

    deepinfra/Roo-Code’s past year of commit activity
    TypeScript 0 Apache-2.0 2,609 0 0 Updated Sep 4, 2025
  • kilocode Public Forked from Kilo-Org/kilocode

    Open Source AI coding assistant for planning, building, and fixing code. We're a superset of Roo, Cline, and our own features. Follow us: kilocode.ai/social

    deepinfra/kilocode’s past year of commit activity
    TypeScript 0 Apache-2.0 1,435 0 0 Updated Aug 28, 2025
  • ocr-tools Public
    deepinfra/ocr-tools’s past year of commit activity
    Python 5 2 1 0 Updated Aug 2, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…