Active Learning for Text Classification in Python
Updated Sep 11, 2025 - Python
Local LLM Powered Recursive Search & Smart Knowledge Explorer
Survey of Small Language Models from Penn State, ...
Doge Family of Small Language Models
"Generative AI in Action" book's code repository
Wonderful Matrices to Build Small Language Models
Remma-O1: An open-source language model with 1.17B parameters, built from scratch in PyTorch. Work in progress; open for collaboration.
SQL AI Agent - Talk to your DB in Natural Language
Ollama RAG using SQL Database
A Python library to train task-specific LLMs without training data, for offline NLP and Text Classification tasks, such as Guardrail Models and Intent Classification 🤖🚀
Build a Conversational AI System that can answer questions by retrieving the answers from a document.
Repository for the companion Colab notebook of the Domain-Specific Small Language Models book.
LeCarnet is a corpus of 2M+ simple French stories
Dataset Generation Code for SimpleStories
A lightweight voice companion, optimized for macOS.
Kurtis is a fine-tuning, inference, and evaluation tool built for SLMs (Small Language Models), such as Hugging Face's SmolLM2.
This project fine-tunes Unsloth's Gemma-3 4B IT (4-bit) model to translate natural language into Cypher queries for the Neo4j graph database.
Code for the paper "Deep neural networks and humans both benefit from compositional structure"
This repository provides a Jupyter Notebook for building a small language model from scratch using the TinyStories dataset. It covers data preprocessing, BPE tokenization, binary storage, GPU memory management, and training a Transformer in PyTorch. Generate sample stories to test your model. Ideal for learning NLP and PyTorch.