Actions: pytorch/torchtitan

8 GPU Integration Test

834 workflow run results

Enable FSDP2 cpu offloading
8 GPU Integration Test #859: Pull request #624 synchronize by mori360
October 24, 2024 23:27 15m 32s mori360:cpu_offload
Enable FSDP2 cpu offloading
8 GPU Integration Test #858: Pull request #624 synchronize by mori360
October 24, 2024 23:26 1m 9s mori360:cpu_offload
Enable FSDP2 cpu offloading
8 GPU Integration Test #857: Pull request #624 synchronize by mori360
October 24, 2024 20:05 12m 48s mori360:cpu_offload
Add script to convert pickled Llama weights to DCP
8 GPU Integration Test #856: Pull request #634 synchronize by rlrs
October 24, 2024 15:41 17m 4s rlrs:main
Fix PP clip_grad_norm
8 GPU Integration Test #852: Pull request #649 synchronize by zijian-hu
October 24, 2024 08:46 16m 23s zijian-hu:zijian-hu/fix_pp_clip_grad_norm
single-host generation for integration testing
8 GPU Integration Test #850: Pull request #640 synchronize by jaysonfrancis
October 24, 2024 03:42 17m 9s jaysonfrancis:feature/simple_generation
Enable FSDP2 cpu offloading
8 GPU Integration Test #848: Pull request #624 synchronize by mori360
October 24, 2024 01:31 7m 55s mori360:cpu_offload
Enable FSDP2 cpu offloading
8 GPU Integration Test #847: Pull request #624 synchronize by mori360
October 24, 2024 01:10 7m 44s mori360:cpu_offload
Enable FSDP2 cpu offloading
8 GPU Integration Test #846: Pull request #624 synchronize by mori360
October 24, 2024 01:03 8m 6s mori360:cpu_offload
8 GPU Integration Test
8 GPU Integration Test #845: Scheduled
October 24, 2024 00:29 14m 29s main
Enable FSDP2 cpu offloading
8 GPU Integration Test #844: Pull request #624 synchronize by mori360
October 23, 2024 21:17 11m 56s mori360:cpu_offload
Workaround for pytorch/pytorch#138575 distributed checkpoint loading …
8 GPU Integration Test #843: Commit 1060fea pushed by awgu
October 23, 2024 16:02 12m 11s main
Add script to convert pickled Llama weights to DCP
8 GPU Integration Test #842: Pull request #634 synchronize by rlrs
October 23, 2024 08:08 12m 14s rlrs:main
enable Context Parallel (#592)
8 GPU Integration Test #840: Commit b19456a pushed by XilunWu
October 23, 2024 01:01 18m 9s main
8 GPU Integration Test
8 GPU Integration Test #839: Scheduled
October 23, 2024 00:29 16m 37s main
enable Context Parallel
8 GPU Integration Test #837: Pull request #592 synchronize by XilunWu
October 22, 2024 23:37 16m 47s gh/XilunWu/6/head
enable Context Parallel
8 GPU Integration Test #836: Pull request #592 synchronize by XilunWu
October 22, 2024 22:43 17m 48s gh/XilunWu/6/head
enable Context Parallel
8 GPU Integration Test #835: Pull request #592 synchronize by XilunWu
October 22, 2024 21:53 16m 0s gh/XilunWu/6/head
enable Context Parallel
8 GPU Integration Test #834: Pull request #592 synchronize by XilunWu
October 22, 2024 20:51 11m 34s gh/XilunWu/6/head
enable Context Parallel
8 GPU Integration Test #833: Pull request #592 synchronize by XilunWu
October 22, 2024 20:47 5m 1s gh/XilunWu/6/head
enable Context Parallel
8 GPU Integration Test #832: Pull request #592 synchronize by XilunWu
October 22, 2024 20:45 2m 36s gh/XilunWu/6/head
Use expandable segments in run_llama_train.sh
8 GPU Integration Test #827: Commit a5e57e4 pushed by awgu
October 22, 2024 20:28 12m 2s main
Various cleanups around distributed setup
8 GPU Integration Test #826: Pull request #645 synchronize by awgu
October 22, 2024 20:04 11m 2s gh/awgu/21/head
Use expandable segments in run_llama_train.sh
8 GPU Integration Test #825: Pull request #643 synchronize by awgu
October 22, 2024 20:04 10m 9s gh/awgu/20/head