Skip to content

Conversation

@kylesayrs
Copy link
Collaborator

Purpose

  • Support FP4 attention quantization

Postrequisites

Changes

  • Update observer mocking to better mirror LLM Compressor implementation
  • Update expected_shape of tensor_group to only use the second-to-last dimension, which does not change behavior but avoids division error.

Testing

Signed-off-by: Kyle Sayers <[email protected]>
Signed-off-by: Kyle Sayers <[email protected]>
Signed-off-by: Kyle Sayers <[email protected]>
Base automatically changed from kylesayrs/r3-only to main October 23, 2025 14:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant