Skip to content

Activity

fix: timing_raw

tongyx361pushed 1 commit to gm-tyx/puffin/main • 69d57f3…ccf178e • 
1 hour ago

feat: return_dict

tongyx361pushed 1 commit to gm-tyx/puffin/main • 2ec29d9…69d57f3 • 
1 hour ago

refactor: remove redundant response_mask computation (#792)

Pull request merge
tongyx361pushed 1 commit to main • 1b8b0e8…f09e365 • 
1 hour ago

feat: loss_agg_mode

tongyx361pushed 1 commit to gm-tyx/puffin/main • 13e8b3d…2ec29d9 • 
2 hours ago

Deleted branch

Deleted branch

vermouth1992deleted yaowei/vllm_upd • 
2 hours ago

doc: upgrade to vllm 0.8.2 (#769)

Pull request merge
vermouth1992pushed 1 commit to main • aab0176…1b8b0e8 • 
2 hours ago

chore: improve news

tongyx361pushed 2 commits to gm-tyx/puffin/main • 992d1fc…13e8b3d • 
2 hours ago

fix: top p for full version

tongyx361pushed 1 commit to gm-tyx/puffin/main • c01b097…992d1fc • 
2 hours ago

fix: validation top p as paper

tongyx361pushed 1 commit to gm-tyx/puffin/main • 74b7edb…c01b097 • 
2 hours ago

fix: remove uncessary verify from naive

tongyx361pushed 2 commits to gm-tyx/puffin/main • 3b5ef9c…74b7edb • 
2 hours ago

fix: config

tongyx361pushed 1 commit to gm-tyx/puffin/main • bd2059c…3b5ef9c • 
2 hours ago

fix: algo job name

tongyx361pushed 1 commit to gm-tyx/puffin/main • a3c0f9e…bd2059c • 
3 hours ago

fix: config

tongyx361pushed 1 commit to gm-tyx/puffin/main • 2045493…a3c0f9e • 
3 hours ago

Deleted branch

[recipe] refactor: decouple DAPO (#790)

Pull request merge
tongyx361pushed 1 commit to gm-tyx/puffin/main • 66686b4…2045493 • 
3 hours ago

chore: format

tongyx361pushed 1 commit to gm-tyx/puffin/refactor/decouple • 08f4e90…b45c417 • 
3 hours ago

feat: CI for DAPO

tongyx361pushed 5 commits to gm-tyx/puffin/refactor/decouple • 03150d4…08f4e90 • 
3 hours ago

feat: decouple entrypoint

tongyx361created gm-tyx/puffin/refactor/decouple • 03150d4 • 
6 hours ago

fix ut

hiyougapushed 1 commit to yaowei/vllm_upd • 814ed56…075df7e • 
8 hours ago

In the GRPO example scripts, modify the entropy_coeff to 0 to ensur…

Pull request merge
vermouth1992pushed 1 commit to main • 076acdf…aab0176 • 
10 hours ago

prevent division by zero in SFT (#786)

Pull request merge
vermouth1992pushed 1 commit to main • e9bcd2b…076acdf • 
10 hours ago

fix: prevent NaN when all items are intentionally masked by adding ep…

Pull request merge
vermouth1992pushed 1 commit to main • a1dd922…e9bcd2b • 
10 hours ago

chore: wandb run of an early version

tongyx361pushed 1 commit to gm-tyx/puffin/main • 88cf46d…66686b4 • 
11 hours ago

fix: set torch dtype to auto (#749)

Pull request merge
vermouth1992pushed 1 commit to main • fc27caf…a1dd922 • 
13 hours ago

fix: prompt_token_ids should be list[int] instead of np.array (#772)

Pull request merge
hiyougapushed 1 commit to main • 591a476…fc27caf • 
22 hours ago

e2e

hiyougapushed 1 commit to yaowei/vllm_upd • e073c4f…814ed56 • 
22 hours ago

Update math_dataset.py to fix typo in the annotation (#765)

Pull request merge
vermouth1992pushed 1 commit to main • 7f10e57…591a476 • 
yesterday

fix ci

hiyougapushed 1 commit to yaowei/vllm_upd • 5fdb8e4…e073c4f • 
yesterday

upd test script

hiyougapushed 1 commit to yaowei/vllm_upd • cd5c7ef…5fdb8e4 • 
yesterday