-
Notifications
You must be signed in to change notification settings - Fork 125
Issues: predibase/lorax
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
RuntimeError: CUDA error: no kernel image is available for execution on the device
#535
opened Jul 3, 2024 by
nethi
3 of 4 tasks
Important: In latest main, the server can not serve more than 1 user
#512
opened Jun 12, 2024 by
prd-tuong-nguyen
1 of 4 tasks
can't start my local llama3 model server with docker
#511
opened Jun 12, 2024 by
cheney369
3 of 4 tasks
AssertionError when using model "google/gemma-2b" with multi-gpus
#500
opened Jun 6, 2024 by
tritct
2 of 4 tasks
make install
insufficient for running llama3-8B-Instruct
documentation
#484
opened May 22, 2024 by
fozziethebeat
2 of 4 tasks
Add HTTP status codes to docs
documentation
Improvements or additions to documentation
good first issue
Good for newcomers
#481
opened May 20, 2024 by
noyoshi
When caching adapters, cache the adapter ID + the API token pair
enhancement
New feature or request
good first issue
Good for newcomers
#479
opened May 20, 2024 by
noyoshi
Reject unknown fields from API requests
enhancement
New feature or request
good first issue
Good for newcomers
#478
opened May 20, 2024 by
noyoshi
Support inference on INF2 instance
enhancement
New feature or request
#477
opened May 20, 2024 by
prd-tuong-nguyen
Improve warmup checking for max new tokens when using speculative decoding
bug
Something isn't working
good first issue
Good for newcomers
#474
opened May 17, 2024 by
tgaddair
Bug Report: lorax-launcher failed with --source "s3" for model_id "mistralai/Mistral-7B-Instruct-v0.2"
bug
Something isn't working
#473
opened May 17, 2024 by
donjing
1 of 4 tasks
Ensure api_token is not included in the response on error
bug
Something isn't working
#469
opened May 15, 2024 by
tgaddair
Add all launcher args as optional in the Helm charts
enhancement
New feature or request
#465
opened May 9, 2024 by
tgaddair
Retrieve all lora models from Huggingface hub by base model setting.
enhancement
New feature or request
good first issue
Good for newcomers
#463
opened May 8, 2024 by
svjack
Improve async load for adapters to avoid main thread lockups in server
enhancement
New feature or request
#457
opened May 3, 2024 by
tgaddair
Batch inference endpoint (OpenAI compatible)
enhancement
New feature or request
#448
opened Apr 30, 2024 by
tgaddair
Llama3-8b-Instruct won't stop generating
bug
Something isn't working
#442
opened Apr 27, 2024 by
ekim322
4 tasks
Idefics2 and LLaVA
enhancement
New feature or request
#439
opened Apr 26, 2024 by
joaomsimoes
2 tasks done
Previous Next
ProTip!
What’s not been updated in a month: updated:<2024-06-07.