Skip to content

[Refactor][MLA]: Independently pass q_nope & q_rope #26567

@vnadathur

Description

@vnadathur

🚀 The feature, motivation and pitch

This PR: #25103 shows custom op for MLA

Original issue (#24620) wants to

pass q_nope and q_rope independently instead of concatenated.

This will require a sizable refactor in order to split the two in every backend, so that both are passed truly independently.

This issue tracks progress towards this task, the current pr is this: #28368

cc @ProExpertProg

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

Status

To triage

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions