Synchronize LLAMA_API with ggml-org/llama.cpp and update cuda workflow for windows #1966
The JamePeng:main branch was force-pushed and no longer has any new commits.
Pushing new commits will allow the pull request to be re-opened.