Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
116 commits
Select commit Hold shift + click to select a range
b23765c
initial proposal
garrett4wade Jun 11, 2025
d389cd8
add arealite
garrett4wade Jun 13, 2025
577f72c
.
garrett4wade Jun 13, 2025
989bc8a
change api
garrett4wade Jun 13, 2025
fd7103f
.
garrett4wade Jun 13, 2025
ad76c83
remove LOG_ROOT
garrett4wade Jun 14, 2025
7a51668
remove MODEL_SAVE_PATH
garrett4wade Jun 14, 2025
7adfedd
remove PARAM_REALLOC_PATH, DATASET_CACHE
garrett4wade Jun 14, 2025
170ec3a
prepare for testing
garrett4wade Jun 14, 2025
a433a29
prepare for testing
garrett4wade Jun 14, 2025
d15b091
ready for run
garrett4wade Jun 14, 2025
0b0aa9c
local run
garrett4wade Jun 14, 2025
cafc602
tests mainly pass
garrett4wade Jun 15, 2025
2f17696
format
garrett4wade Jun 15, 2025
b591015
.
garrett4wade Jun 16, 2025
7a81f55
Merge branch 'fw/gh/fix-init-constants' of https://github.com/inclusi…
garrett4wade Jun 16, 2025
0d07566
amend cluster.py
garrett4wade Jun 16, 2025
cd771ad
.
garrett4wade Jun 16, 2025
8f4370d
.
garrett4wade Jun 16, 2025
29b3d50
client test pass
garrett4wade Jun 16, 2025
dbb2703
pass rollout test
garrett4wade Jun 16, 2025
b0d0026
remove unused imports
garrett4wade Jun 16, 2025
da41bf1
add arealite readme
garrett4wade Jun 16, 2025
90cc896
change api
garrett4wade Jun 18, 2025
7de1863
.
garrett4wade Jun 18, 2025
5441e87
.
garrett4wade Jun 18, 2025
2a01a5a
.
garrett4wade Jun 18, 2025
1183bc4
.
garrett4wade Jun 19, 2025
1a63361
.
nuzant Jun 19, 2025
f9390da
.
garrett4wade Jun 19, 2025
a5e82f2
.
nuzant Jun 19, 2025
20c7cd8
.
nuzant Jun 19, 2025
1424e7a
.
garrett4wade Jun 19, 2025
8bf6dd1
Merge branch 'mzy/gh/fsdp2-engine' of https://github.com/inclusionAI/…
garrett4wade Jun 19, 2025
e4921d9
format
garrett4wade Jun 19, 2025
7fbe7d9
.
garrett4wade Jun 19, 2025
a218692
implement iteraptable generation (#112)
zhaochenyang20 Jun 19, 2025
4fc6e2c
update code
garrett4wade Jun 19, 2025
d5484f9
.
garrett4wade Jun 19, 2025
88bae72
fix
garrett4wade Jun 20, 2025
eeda029
Merge branch 'fw/gh/fix-init-constants' of code.alipay.com:inclusionA…
garrett4wade Jun 20, 2025
06829c6
Merge branch 'fw/refactor' of code.alipay.com:inclusionAI/AReaL into …
garrett4wade Jun 20, 2025
6ce5ec3
.
garrett4wade Jun 20, 2025
92d6364
.
garrett4wade Jun 20, 2025
866ceac
.
garrett4wade Jun 20, 2025
91c7de2
pass controller generate batch test
garrett4wade Jun 20, 2025
38e3cac
.
garrett4wade Jun 21, 2025
816e115
refactor rollout controller into worker and controller
garrett4wade Jun 21, 2025
211d461
.
garrett4wade Jun 21, 2025
8b87b23
.
garrett4wade Jun 21, 2025
6f1370c
.
garrett4wade Jun 22, 2025
331ca7c
change to async rollout
garrett4wade Jun 22, 2025
9bf9b18
pass rollout controller test
garrett4wade Jun 22, 2025
18aa285
pass test
garrett4wade Jun 22, 2025
e9d97f3
.
garrett4wade Jun 22, 2025
394a0ff
update readme
garrett4wade Jun 23, 2025
5ffce54
.
garrett4wade Jun 23, 2025
d3c2f15
sft debug
nuzant Jun 23, 2025
9129dd7
merge
nuzant Jun 23, 2025
6ebcbc5
.
garrett4wade Jun 23, 2025
7f1397e
Merge branch 'main' of https://github.com/inclusionAI/AReaL into fw/r…
garrett4wade Jun 23, 2025
302e876
add lisence
garrett4wade Jun 23, 2025
6ed10c9
remove unused files
garrett4wade Jun 23, 2025
7695179
remove unsed args in ppo
garrett4wade Jun 23, 2025
b4766bd
add hf engine wrapper (#116)
Jayon02 Jun 24, 2025
8d2bd4e
format
garrett4wade Jun 24, 2025
06060cf
format
nuzant Jun 24, 2025
b112d83
.
nuzant Jun 24, 2025
49a31c4
refine hf engine
garrett4wade Jun 24, 2025
5d5ac78
.
nuzant Jun 24, 2025
e7163ea
merge fw/refactor
nuzant Jun 24, 2025
ccdf037
fix
nuzant Jun 24, 2025
92e3a3d
add fsdp engine and sft tests
nuzant Jun 25, 2025
9b8306c
.
garrett4wade Jun 25, 2025
8019335
merge
garrett4wade Jun 25, 2025
f9643b7
.
garrett4wade Jun 25, 2025
aa8f4ef
.
garrett4wade Jun 25, 2025
d07f595
pass ppo unittest
garrett4wade Jun 25, 2025
242243b
pass ppo and rollout controller tests
garrett4wade Jun 26, 2025
3059640
clear unused imports
garrett4wade Jun 26, 2025
59d288b
rename ppo to grpo
garrett4wade Jun 26, 2025
1be260e
change reward function organization
garrett4wade Jun 26, 2025
eb431c1
reorganize code
garrett4wade Jun 26, 2025
63cd942
add dataset api
garrett4wade Jun 26, 2025
7e7240d
.
garrett4wade Jun 26, 2025
05a2df0
.
garrett4wade Jun 26, 2025
6ec4493
.
Jun 26, 2025
84ff759
format
Jun 26, 2025
f099bbd
chmod fix
nuzant Jun 26, 2025
15537cb
.
garrett4wade Jun 26, 2025
8c338e9
Merge branch 'fw/refactor' of https://code.alipay.com/inclusionAI/ARe…
garrett4wade Jun 26, 2025
9724c8a
rename workflow to collector
garrett4wade Jun 27, 2025
77a557c
refactor llm_client location
garrett4wade Jun 27, 2025
73b5b3e
.
garrett4wade Jun 27, 2025
4320da8
.
garrett4wade Jun 27, 2025
b424176
fix llm server api
garrett4wade Jun 27, 2025
d2a317d
refactor config structure
garrett4wade Jun 28, 2025
a2ade35
.
garrett4wade Jun 30, 2025
8612932
fix tests
garrett4wade Jun 30, 2025
91d6399
.
garrett4wade Jun 30, 2025
c66ed17
.
garrett4wade Jun 30, 2025
2ce1ece
.
garrett4wade Jul 1, 2025
09f339f
Fix unresolved issue in SFTTrainer PR (#139)
nuzant Jul 1, 2025
df5ee49
Merge branch 'fw/refactor' of https://github.com/inclusionAI/AReaL in…
garrett4wade Jul 1, 2025
d1f863c
Merge branch 'fw/refactor2' of https://code.alipay.com/inclusionAI/AR…
garrett4wade Jul 1, 2025
ab7503a
.
garrett4wade Jul 2, 2025
a5299b1
.
garrett4wade Jul 2, 2025
3a8796b
.
garrett4wade Jul 2, 2025
89a8d8c
.
garrett4wade Jul 4, 2025
078d3e1
Add CI for testing AReaLite (#150)
futrime Jul 7, 2025
9a06675
.
garrett4wade Jul 7, 2025
4afa422
Merge branch 'lite' of https://github.com/inclusionAI/AReaL into fw/r…
garrett4wade Jul 10, 2025
e05c9cc
merge fw/refactor
garrett4wade Jul 10, 2025
4c50652
revert some changes
garrett4wade Jul 10, 2025
09e8d71
fix
garrett4wade Jul 10, 2025
6c4c402
Merge branch 'lite' of https://github.com/inclusionAI/AReaL into fw/r…
garrett4wade Jul 10, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion arealite/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -737,4 +737,4 @@ dataloader = StatefulDataLoader(
)
for data in dataloader:
assert isinstance(data, list)
```
```
3 changes: 3 additions & 0 deletions arealite/api/cli_args.py
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,9 @@
from pathlib import Path
from typing import Dict, List, Optional, Tuple

import uvloop

uvloop.install()
from hydra import compose as hydra_compose
from hydra import initialize as hydra_init
from omegaconf import MISSING, OmegaConf
Expand Down
2 changes: 1 addition & 1 deletion pyproject.toml
Original file line number Diff line number Diff line change
Expand Up @@ -53,9 +53,9 @@ dependencies = [
"hydra-core==1.4.0.dev1",
"packaging",
"tabulate",
"gymnasium>=1.1.1",
"torchdata",
"autoflake",
"gymnasium",
"tensordict",

# Monitoring and logging
Expand Down
2 changes: 2 additions & 0 deletions realhf/api/core/data_api.py
Original file line number Diff line number Diff line change
Expand Up @@ -8,6 +8,7 @@
import random
import time
from contextlib import contextmanager
from functools import lru_cache

# NOTE: We don't sue wildcard importing here because the type
# `Sequence` has a very similar name to `SequenceSample`.
Expand Down Expand Up @@ -47,6 +48,7 @@
RL_TASKS = ["math", "code", "rlhf", "stem"]


@lru_cache(maxsize=8)
def load_hf_tokenizer(
model_name_or_path: str,
fast_tokenizer=True,
Expand Down
4 changes: 2 additions & 2 deletions requirements.txt
Original file line number Diff line number Diff line change
Expand Up @@ -69,8 +69,8 @@ word2number
Pebble
timeout-decorator
prettytable
gymnasium>=1.1.1
swanlab[dashboard]
torchdata
autoflake
gymnasium
tensordict
tensordict