Skip to content

Conversation

@sunjiweiswift
Copy link

…cache

Motivation

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

@gemini-code-assist
Copy link
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@sunjiweiswift sunjiweiswift marked this pull request as draft October 24, 2025 03:13
@sunjiweiswift sunjiweiswift changed the title add flash_attn_decode, flash_attn_extend intest of flash_attn_with_kv… [Intel XPU]add flash_attn_decode, flash_attn_extend intest of flash_attn_with_kv… Oct 24, 2025
@sunjiweiswift sunjiweiswift force-pushed the intel_attention_pure_deocde_extend branch from 68fbbec to 7e809ea Compare October 28, 2025 06:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant