Skip to content

Remove marlin warning#4918

Merged
robertgshaw2-redhat merged 1 commit intovllm-project:mainfrom
neuralmagic:remove_marlin_warning
May 20, 2024
Merged

Remove marlin warning#4918
robertgshaw2-redhat merged 1 commit intovllm-project:mainfrom
neuralmagic:remove_marlin_warning

Conversation

@alexm-redhat
Copy link
Copy Markdown
Collaborator

Removes marlin warning about reduction of m_block size. We found out that the warning is not indicative of performance, since Marlin actually performs very well on smaller GPUs anyways.

@robertgshaw2-redhat robertgshaw2-redhat enabled auto-merge (squash) May 20, 2024 13:57
@robertgshaw2-redhat robertgshaw2-redhat merged commit da5a0b5 into vllm-project:main May 20, 2024
@robertgshaw2-redhat robertgshaw2-redhat deleted the remove_marlin_warning branch May 20, 2024 14:55
dtrifiro pushed a commit to dtrifiro/vllm that referenced this pull request May 21, 2024
robertgshaw2-redhat pushed a commit to neuralmagic/nm-vllm that referenced this pull request Jun 8, 2024
robertgshaw2-redhat pushed a commit to neuralmagic/nm-vllm that referenced this pull request Jul 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants