Actions: pytorch/torchtitan

Unit Test

102 workflow runs

add model num params display, gpu memory metrics
Unit Test #27: Pull request #56 synchronize by lessw2020
February 13, 2024 00:53 3m 26s lessw2020:add_metrics
add model num params display, gpu memory metrics
Unit Test #26: Pull request #56 opened by lessw2020
February 13, 2024 00:20 3m 52s lessw2020:add_metrics
Add truncated llama style model init via reset parameters()
Unit Test #25: Pull request #54 synchronize by lessw2020
February 12, 2024 17:59 3m 24s lessw2020:llama-init
Add truncated llama style model init via reset parameters()
Unit Test #24: Pull request #54 synchronize by lessw2020
February 12, 2024 17:48 3m 39s lessw2020:llama-init
Add truncated llama style model init via reset parameters()
Unit Test #23: Pull request #54 opened by lessw2020
February 12, 2024 17:44 3m 31s lessw2020:llama-init
Improve run_llama_train.sh args and add local-ranks-filter (#51)
Unit Test #22: Commit da50d34 pushed by wconstab
February 10, 2024 00:51 3m 21s main
Move to cuda unconditionally so pp-only run works (#50)
Unit Test #21: Commit 1a0b9fd pushed by wconstab
February 10, 2024 00:50 3m 25s main
Improve run_llama_train.sh args and add local-ranks-filter
Unit Test #20: Pull request #51 synchronize by wconstab
February 10, 2024 00:48 3m 27s whc/logrank
Move to cuda unconditionally so pp-only run works
Unit Test #19: Pull request #50 synchronize by wconstab
February 10, 2024 00:46 3m 32s whc/fixes
Improve run_llama_train.sh args and add local-ranks-filter
Unit Test #18: Pull request #51 opened by wconstab
February 9, 2024 23:38 3m 23s whc/logrank
Move to cuda unconditionally so pp-only run works
Unit Test #17: Pull request #50 synchronize by wconstab
February 9, 2024 21:57 3m 28s whc/fixes
enable data loading for data parallel training
Unit Test #16: Commit e1b61c3 pushed by tianyu-l
February 8, 2024 01:12 3m 23s main
enable data loading for data parallel training
Unit Test #15: Pull request #49 synchronize by tianyu-l
February 8, 2024 01:11 3m 30s gh/tianyu-l/1/head
enable data loading for data parallel training
Unit Test #14: Pull request #49 opened by tianyu-l
February 7, 2024 23:25 3m 23s gh/tianyu-l/1/head
Add Sequence Parallelism to llama
Unit Test #13: Commit 7a73979 pushed by wanchaol
February 7, 2024 07:20 3m 20s main
Add Sequence Parallelism to llama
Unit Test #12: Pull request #32 synchronize by wanchaol
February 7, 2024 07:15 4m 17s gh/wanchaol/2/head
Remove the accidentally added --compile
Unit Test #11: Pull request #42 opened by fegin
February 6, 2024 21:43 3m 29s chienchin_remove_compile_flag
Enable checkpointing with DCP (#26)
Unit Test #10: Commit 6bd9082 pushed by fegin
February 6, 2024 16:55 3m 21s main
Enable checkpointing with DCP
Unit Test #9: Pull request #26 synchronize by fegin
February 5, 2024 19:41 3m 57s chienchin_enable_checkpoint
Enable checkpointing with DCP
Unit Test #8: Pull request #26 synchronize by fegin
February 5, 2024 19:39 2m 34s chienchin_enable_checkpoint
Enable checkpointing with DCP
Unit Test #7: Pull request #26 synchronize by fegin
February 5, 2024 19:34 3m 28s chienchin_enable_checkpoint
Fix assertion of dp mesh and add slice helper
Unit Test #6: Pull request #41 opened by wconstab
February 3, 2024 01:18 3m 43s whc/fix_mesh
Add pytest and CI workflow from torchtune (#35)
Unit Test #5: Commit 7a8a9ec pushed by wconstab
February 2, 2024 19:24 3m 57s main
Add pytest and CI workflow from torchtune
Unit Test #4: Pull request #35 synchronize by wconstab
February 2, 2024 19:19 3m 40s whc/test
Add pytest and CI workflow from torchtune
Unit Test #3: Pull request #35 synchronize by wconstab
February 2, 2024 19:18 1m 10s whc/test