szrlee

Yingru Li szrlee

79 followers · 99 following

richardli.xyz

Achievements

Highlights

Starred repositories

tensorgi / T6

The official implementation of Tensor ProducT ATTenTion Transformer (T6)

Python 336 31 Updated Feb 20, 2025

RLHFlow / Online-DPO-R1

Codebase for Iterative DPO Using Rule-based Rewards

Python 225 30 Updated Feb 25, 2025

wizardlancet / diagnosis_zero

Forked from volcengine/verl

diagnosis_zero, R1 Zero reproduce on disease diagnosis

Python 11 Updated Feb 8, 2025

hkust-nlp / simpleRL-reason

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,193 236 Updated Mar 17, 2025

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 42,201 5,771 Updated Mar 17, 2025

bytedance / UI-TARS-desktop

A GUI Agent application based on UI-TARS(Vision-Lanuage Model) that allows you to control your computer using natural language.

TypeScript 3,139 232 Updated Mar 19, 2025

bytedance / UI-TARS

2,881 175 Updated Feb 17, 2025

RAGEN-AI / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 1,182 84 Updated Mar 19, 2025

mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 48,247 5,151 Updated Jan 22, 2025

google-deepmind / mujoco_playground

An open-source library for GPU-accelerated robot learning and sim-to-real transfer.

Jupyter Notebook 810 86 Updated Mar 18, 2025

PRIME-RL / PRIME

Scalable RL solution for advanced reasoning of language models

Python 1,408 86 Updated Mar 18, 2025

unslothai / unsloth

Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 35,235 2,695 Updated Mar 19, 2025

The-Run-Philosophy-Organization / run

润学全球官方指定GITHUB，整理润学宗旨、纲领、理论和各类润之实例；解决为什么润，润去哪里，怎么润三大问题；并成为新中国人的核心宗教，核心信念。

31,935 2,619 Updated Jul 31, 2024

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 24,445 2,132 Updated Mar 19, 2025

SylphAI-Inc / LLM-engineer-handbook

A curated list of Large Language Model resources, covering model training, serving, fine-tuning, and building LLM applications.

2,855 349 Updated Jan 30, 2025

prs-eth / LoRA-Ensemble

LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks

Python 47 3 Updated Oct 1, 2024

opendilab / awesome-exploration-rl

A curated list of awesome exploration RL resources (continually updated)

454 14 Updated Feb 7, 2025

deepseek-ai / DeepSeek-LLM

DeepSeek LLM: Let there be answers

Makefile 6,201 954 Updated Feb 4, 2024

thunlp / ProactiveAgent

A LLM-based Agent that predict its tasks proactively.

Python 329 29 Updated Mar 7, 2025

jxzhangjhu / Awesome-LLM-Uncertainty-Reliability-Robustness

Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models

731 48 Updated Feb 28, 2025

richards199999 / Thinking-Claude

Let your Claude able to think

TypeScript 14,746 1,714 Updated Mar 10, 2025

srush / awesome-o1

A bibliography and survey of the papers surrounding o1

TeX 1,181 50 Updated Nov 16, 2024

sail-sg / oat

🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.

Python 224 13 Updated Mar 10, 2025

tencent-ailab / persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 1,086 80 Updated Feb 19, 2025

flowersteam / Grounding_LLMs_with_online_RL

We perform functional grounding of LLMs' knowledge in BabyAI-Text

Python 249 28 Updated Aug 23, 2024

KhoomeiK / LlamaGym

Fine-tune LLM agents with online reinforcement learning

Python 1,091 50 Updated Mar 19, 2024

liziniu / GEM

Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)

Python 14 Updated Mar 4, 2025

GAIR-NLP / O1-Journey

O1 Replication Journey

1,977 65 Updated Jan 14, 2025

diagram-of-thought / diagram-of-thought

Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)

176 11 Updated Mar 13, 2025

tmgthb / Autonomous-Agents

Autonomous Agents (LLMs) research papers. Updated Daily.

720 39 Updated Mar 19, 2025

Starred topics

$latex logo$

Yingru Li szrlee

Highlights

Starred repositories

LaTeX