Actions: ROCm/vllm

All workflows

Showing runs from all workflows
5,231 workflow runs

[Feature] Faster Custom Paged Attention kernels
Cleanup PR Body #136: Pull request #385 opened by tjtanaa
January 24, 2025 01:23 19s
Using pytorch commit past the point when rowwise PR (https://github.c…
pre-commit #48: Commit 84f5d47 pushed by gshtras
January 23, 2025 22:30 4m 33s main
Pytorch rowwise scaled_mm
pre-commit #47: Pull request #384 synchronize by gshtras
January 23, 2025 18:30 4m 41s rowwise_torch_support
Pytorch rowwise scaled_mm
pre-commit #46: Pull request #384 opened by gshtras
January 23, 2025 18:29 4m 45s rowwise_torch_support
Pytorch rowwise scaled_mm
Cleanup PR Body #135: Pull request #384 opened by gshtras
January 23, 2025 18:29 16s
[Bugfix]: Fix paged attention unit tests
pre-commit #45: Pull request #383 synchronize by tjtanaa
January 23, 2025 15:22 4m 40s EmbeddedLLM:pa-attn-test
Faster Custom Paged Attention kernels
pre-commit #44: Pull request #372 synchronize by sanyalington
January 23, 2025 15:15 4m 28s shsanyal_cpa_main_integration
[Bugfix]: Fix paged attention unit tests
Cleanup PR Body #134: Pull request #383 edited by tjtanaa
January 23, 2025 15:15 23s
[Bugfix]: Fix paged attention unit tests
Cleanup PR Body #133: Pull request #383 edited by tjtanaa
January 23, 2025 15:15 26s
[Bugfix]: Fix paged attention unit tests
pre-commit #43: Pull request #383 opened by tjtanaa
January 23, 2025 15:14 4m 30s EmbeddedLLM:pa-attn-test
[Bugfix]: Fix paged attention unit tests
Cleanup PR Body #132: Pull request #383 opened by tjtanaa
January 23, 2025 15:14 26s
Faster Custom Paged Attention kernels
pre-commit #42: Pull request #372 synchronize by sanyalington
January 23, 2025 15:06 4m 28s shsanyal_cpa_main_integration
Close inactive issues and PRs
Close inactive issues and PRs #86: Scheduled
January 23, 2025 01:56 20s main
Faster Custom Paged Attention kernels
pre-commit #41: Pull request #372 synchronize by gshtras
January 23, 2025 00:44 4m 38s shsanyal_cpa_main_integration
Returning the use of the proper stream in allreduce (#382)
pre-commit #40: Commit 5f9b40b pushed by gshtras
January 23, 2025 00:35 4m 31s main
Returning the use of the proper stream in allreduce
pre-commit #39: Pull request #382 synchronize by gshtras
January 23, 2025 00:32 4m 32s bring_stream_back
FP8 FA fixes (#381)
pre-commit #38: Commit a600e9f pushed by gshtras
January 23, 2025 00:32 4m 26s main
Returning the use of the proper stream in allreduce
pre-commit #37: Pull request #382 opened by gshtras
January 23, 2025 00:28 4m 35s bring_stream_back
Returning the use of the proper stream in allreduce
Cleanup PR Body #131: Pull request #382 opened by gshtras
January 23, 2025 00:28 19s
Switching building to MI300.
pre-commit #36: Pull request #380 synchronize by Alexei-V-Ivanov-AMD
January 22, 2025 23:05 4m 26s mi300_agent_building
FP8 FA fixes
pre-commit #35: Pull request #381 synchronize by ilia-cher
January 22, 2025 23:00 4m 32s fp8_fix