3 changes: 2 additions & 1 deletion pyproject.toml
@@ -27,7 +27,8 @@ requires = [
"numba",
"fastapi<0.124.0",
"opencv-python-headless<=4.11.0.86", # Required to avoid numpy version conflict with vllm
-"compressed_tensors>=0.11.0"
+"compressed_tensors>=0.11.0",
+"arctic-inference==0.1.1"
]
build-backend = "setuptools.build_meta"

1 change: 1 addition & 0 deletions requirements.txt
@@ -30,5 +30,6 @@ numba
#--extra-index-url https://mirrors.huaweicloud.com/ascend/repos/pypi
torch-npu==2.8.0

+arctic-inference==0.1.1
Contributor

high

To follow the existing convention in this file and keep it maintainable, add a comment explaining why this new dependency is needed. According to the PR title, it supports speculative decoding with suffix_decode.

# Required for speculative decoding with suffix_decode
arctic-inference==0.1.1
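
Since the same pin now lives in both pyproject.toml and requirements.txt, a small check could help keep the two in sync. The following is only a sketch under stated assumptions: `find_pin` is a hypothetical helper, the sample strings stand in for the real files, and it relies on simple pattern matching rather than a full requirements parser.

```python
import re


def find_pin(text, package):
    """Return the version specifier for `package` in requirement-style text, or None.

    Hypothetical helper: handles bare lines (requirements.txt) and
    double-quoted entries (pyproject.toml `requires` list) only.
    """
    pattern = re.compile(
        r'^\s*"?' + re.escape(package) + r'\s*([<>=!~]=?[^",#\s]+)'
    )
    for line in text.splitlines():
        match = pattern.match(line)
        if match:
            return match.group(1)
    return None


# Inline samples standing in for the two files touched by this PR.
PYPROJECT_SAMPLE = '''
requires = [
    "compressed_tensors>=0.11.0",
    "arctic-inference==0.1.1"
]
'''

REQUIREMENTS_SAMPLE = '''
torch-npu==2.8.0
# Required for speculative decoding with suffix_decode
arctic-inference==0.1.1
'''

pyproject_pin = find_pin(PYPROJECT_SAMPLE, "arctic-inference")
requirements_pin = find_pin(REQUIREMENTS_SAMPLE, "arctic-inference")

if pyproject_pin != requirements_pin:
    raise SystemExit(
        f"Pin mismatch: pyproject={pyproject_pin}, requirements={requirements_pin}"
    )
```

In practice the sample strings would be replaced by the contents of the two files, and the check could run in CI.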

transformers>=4.57.3
fastapi<0.124.0