Conversation

@kunal-vaishnavi
Contributor

Description

This PR moves the CUDA memcpy for the QK output when type `T` is equal to type `QK` from `attention_impl.cu` into `attention_qk.cu`.

Motivation and Context

This PR fixes a linkage error in `attention_qk.cu` that occurs when type `T` and type `QK` are the same.

@yuslepukhin
Member

Looks reasonable

@yuslepukhin
Member

The linkage error is due to the fact that the multihead attention code attempts to indirectly instantiate `CopyQK` with `<float, float>`, but that instantiation does not exist.

@kunal-vaishnavi kunal-vaishnavi merged commit 2b3d7fb into main Mar 24, 2025
87 of 91 checks passed
@kunal-vaishnavi kunal-vaishnavi deleted the kvaishnavi/attention-qk branch March 24, 2025 02:43
zhaoxul-qti pushed a commit to CodeLinaro/onnxruntime that referenced this pull request Apr 17, 2025

4 participants