Actions: pytorch/torchtitan

CPU Unit Test

1,499 workflow run results

Enable FSDP2 cpu offloading
CPU Unit Test #1718: Pull request #624 synchronize by mori360
October 24, 2024 23:27 7m 12s mori360:cpu_offload
Enable FSDP2 cpu offloading
CPU Unit Test #1717: Pull request #624 synchronize by mori360
October 24, 2024 23:26 2m 10s mori360:cpu_offload
Enable FSDP2 cpu offloading
CPU Unit Test #1716: Pull request #624 synchronize by mori360
October 24, 2024 20:05 6m 0s mori360:cpu_offload
Add script to convert pickled Llama weights to DCP
CPU Unit Test #1715: Pull request #634 synchronize by rlrs
October 24, 2024 15:41 5m 41s rlrs:main
Fix PP clip_grad_norm
CPU Unit Test #1711: Pull request #649 synchronize by zijian-hu
October 24, 2024 08:46 6m 30s zijian-hu:zijian-hu/fix_pp_clip_grad_norm
Enable FSDP2 cpu offloading
CPU Unit Test #1707: Pull request #624 synchronize by mori360
October 24, 2024 01:31 6m 16s mori360:cpu_offload
Enable FSDP2 cpu offloading
CPU Unit Test #1706: Pull request #624 synchronize by mori360
October 24, 2024 01:10 6m 17s mori360:cpu_offload
Enable FSDP2 cpu offloading
CPU Unit Test #1705: Pull request #624 synchronize by mori360
October 24, 2024 01:03 6m 38s mori360:cpu_offload
Enable FSDP2 cpu offloading
CPU Unit Test #1704: Pull request #624 synchronize by mori360
October 23, 2024 21:17 6m 55s mori360:cpu_offload
Workaround for pytorch/pytorch#138575 distributed checkpoint loading …
CPU Unit Test #1703: Commit 1060fea pushed by awgu
October 23, 2024 16:02 6m 12s main
Add script to convert pickled Llama weights to DCP
CPU Unit Test #1702: Pull request #634 synchronize by rlrs
October 23, 2024 08:08 5m 41s rlrs:main
enable Context Parallel (#592)
CPU Unit Test #1700: Commit b19456a pushed by XilunWu
October 23, 2024 01:01 5m 50s main
enable Context Parallel
CPU Unit Test #1698: Pull request #592 synchronize by XilunWu
October 22, 2024 23:37 5m 34s gh/XilunWu/6/head
enable Context Parallel
CPU Unit Test #1697: Pull request #592 synchronize by XilunWu
October 22, 2024 22:43 5m 1s gh/XilunWu/6/head
enable Context Parallel
CPU Unit Test #1696: Pull request #592 synchronize by XilunWu
October 22, 2024 21:53 5m 15s gh/XilunWu/6/head
enable Context Parallel
CPU Unit Test #1695: Pull request #592 synchronize by XilunWu
October 22, 2024 20:51 5m 35s gh/XilunWu/6/head
enable Context Parallel
CPU Unit Test #1694: Pull request #592 synchronize by XilunWu
October 22, 2024 20:47 4m 41s gh/XilunWu/6/head
enable Context Parallel
CPU Unit Test #1693: Pull request #592 synchronize by XilunWu
October 22, 2024 20:45 2m 24s gh/XilunWu/6/head
Use expandable segments in run_llama_train.sh
CPU Unit Test #1688: Commit a5e57e4 pushed by awgu
October 22, 2024 20:28 6m 50s main
Various cleanups around distributed setup
CPU Unit Test #1687: Pull request #645 synchronize by awgu
October 22, 2024 20:04 7m 25s gh/awgu/21/head
Use expandable segments in run_llama_train.sh
CPU Unit Test #1686: Pull request #643 synchronize by awgu
October 22, 2024 20:04 7m 40s gh/awgu/20/head
Move output .float() to loss fn and compile it if compiling
CPU Unit Test #1685: Commit e10cb94 pushed by awgu
October 22, 2024 20:02 4m 50s main