Issues: abetlen/llama-cpp-python
CMake Error: CMAKE_CXX_COMPILER not set, after EnableLanguage (#1749), opened Sep 19, 2024 by 1431551850
How to get output from a fine-tuned Llama 3 model (trained on an Alpaca-format dataset) in JSON format? (#1744), opened Sep 18, 2024 by ApurvPujari
LlamaDiskCache: needs a read-only / 'static' disk cache for RAG use cases (#1737), opened Sep 11, 2024 by tc-wolf
[Draft Issue] system crash on exit (after inference is done) (#1735), opened Sep 11, 2024 by Mrw33554432
Scores are stored in a 32-bit NumPy array even when K and V are quantized (#1732), opened Sep 5, 2024 by EthanZoneCoding
GGML_CUDA_ENABLE_UNIFIED_MEMORY=1 behavior is strange (#1720), opened Aug 31, 2024 by Enchante503
No matter how many times I build it, it won't start (#1719) [bug, build], opened Aug 31, 2024 by Enchante503
Allow python packages to contribute to LlamaChatCompletionHandlerRegistry (#1715), opened Aug 29, 2024 by axel7083
flash attention on Nvidia Tesla P100s results in CUDA error: unspecified launch failure (CUDA kernel flash_attn_tile_ext_f16 has no device code compatible with CUDA arch 520) (#1710) [bug, build], opened Aug 26, 2024 by AlHering
Empty output when running Q4_K_M quantization of Llama-3-8B-Instruct with llama-cpp-python (#1696), opened Aug 22, 2024 by smolraccoon