mlsys
Here are 31 public repositories matching this topic...
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑💻 Video Tutorials.
Updated Aug 14, 2024
Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.
Updated Jan 30, 2025 - Cuda
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Updated Feb 14, 2025 - Cuda
FedScale is a scalable and extensible open-source federated learning (FL) platform.
Updated Dec 18, 2023 - Python
Machine Learning Framework for Operating Systems - Brings ML to Linux kernel
Updated Dec 13, 2021 - C
Deep Learning Energy Measurement and Optimization
Updated Feb 5, 2025 - Python
A scalable & efficient active learning/data selection system for everyone.
Updated Jul 8, 2024 - Python
An acceleration library that supports arbitrary bit-width combinatorial quantization operations
Updated Sep 30, 2024 - C++
A curated collection of noteworthy MLSys bloggers (Algorithms/Systems)
Updated Jan 5, 2025 - HTML
A ChatGPT (GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems
Updated Oct 15, 2024 - Python
Optimal Sparse Decision Trees
Updated Apr 27, 2023 - Python
📚FFPA: Yet another Faster Flash Prefill Attention with O(1)⚡️SRAM complexity for headdim > 256, 1.8x~3x↑🎉faster than SDPA EA.
Updated Feb 13, 2025 - Cuda
Federated Learning Systems Paper List
Updated Feb 7, 2024
sensAI: ConvNets Decomposition via Class Parallelism for Fast Inference on Live Data
Updated Jul 25, 2024 - Python
NAACL '24 (Best Demo Paper Runner-Up) / MLSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference
Updated Dec 9, 2024 - Python
GraphSnapShot: Caching Local Structure for Fast Graph Learning [Efficient ML System]
Updated Nov 27, 2024 - Python
[ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention
Updated Feb 14, 2025 - Python