NNCF Compress PT2E Support #14

anzr299 · 2025-10-15T09:10:05Z

Summary

Add compress pt2e to the openvino llama example flow. This change allows users to employ algorithms like AWQ, scale estimation etc.

Openvino llama support

cavusmustafa · 2025-11-03T20:11:17Z

examples/models/llama/export_llama_lib.py

+            raise ImportError(
+                "Please install nncf via backends/openvino/requirements.txt"
+            )
+        tokenizer = get_tokenizer(builder_exported.tokenizer_path)


I wonder if we can use helper functions to reduce this part of the code inside export_llama_lib.py maybe?

anzr299 added 3 commits September 24, 2025 16:56

init

a4c8327

update quantizer for PT2e

af502fe

Merge pull request #8 from cavusmustafa/openvino_llama_support

a63a894

Openvino llama support

anzr299 marked this pull request as draft October 15, 2025 09:10

anzr299 added 2 commits October 20, 2025 19:31

add awq and scale estimation

e0010bb

install nncf from the branch

b7f0abe

cavusmustafa reviewed Nov 3, 2025

View reviewed changes

anzr299 added 3 commits November 4, 2025 10:49

update local fixes

be2d7d5

review changes

adfe687

lintrunner

9a2e80c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

NNCF Compress PT2E Support #14

NNCF Compress PT2E Support #14

Uh oh!

anzr299 commented Oct 15, 2025

Uh oh!

cavusmustafa Nov 3, 2025

Uh oh!

anzr299 Nov 5, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

NNCF Compress PT2E Support #14

Are you sure you want to change the base?

NNCF Compress PT2E Support #14

Uh oh!

Conversation

anzr299 commented Oct 15, 2025

Summary

Uh oh!

cavusmustafa Nov 3, 2025

Choose a reason for hiding this comment

Uh oh!

anzr299 Nov 5, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants