common : reimplement logging #9418

Merged: 8 commits into master from gg/log on Sep 15, 2024

Conversation

ggerganov (Owner) commented Sep 10, 2024

ref #8566

Merge ETA: Sep 15

Overhaul common/log.h. The main goal is to offload print IO to a separate thread so that logging does not affect the performance of the examples. Also add convenience options for timestamps, colors based on the log type, and output to a file.

By default, the logs should look the same as they do on master.

Adding the following options will make them look like this:

```sh
# set once in your env
export LLAMA_LOG_COLORS=1
export LLAMA_LOG_PREFIX=1
export LLAMA_LOG_TIMESTAMPS=1

# or pass CLI args
./llama-cli ... --log-colors --log-prefix --log-timestamps
```

[screenshot: example log output with colors, prefixes, and timestamps enabled]
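The description also mentions output to a file. A minimal sketch of how that might be combined with the options above, assuming a --log-file flag that takes a path (check --help for the exact name and behavior in your build):

```sh
# send the log to a file as well; exact flag behavior may vary by version
./llama-cli ... --log-prefix --log-timestamps --log-file llama.log
```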

Another important change is that the logs in llama-server have been significantly reformatted. I've always had trouble reading the output, so I changed the text in a way that is IMO easier to read. Also removed the option to output JSON logs, as I don't think it has any practical value.

github-actions bot added the examples, ggml, and testing labels Sep 10, 2024
github-actions bot added the python and devops labels Sep 12, 2024
bviksoe (Contributor) commented Sep 13, 2024

Please don't take away the optional JSON formatting for logs.
This has been an important step in automating the sending of warnings and errors to the front-end when hosting the server process. Parsing the often very unstructured text output is tedious and error-prone, while having a structured format like JSON makes it a whizz. Some logs can even function as progress notifications for a front-end, such as LOG_INFO("model loaded").
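For context, this is roughly how such structured output gets consumed; a minimal sketch, assuming the pre-PR --log-format json server flag and illustrative field names (the actual log schema may differ):

```sh
# illustrative only: the "level" and "msg" fields are assumptions about the old JSON log schema
./llama-server -m model.gguf --log-format json 2>&1 \
  | jq -c 'select(.level == "WARN" or .level == "ERR") | .msg'
```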

ggerganov (Owner, Author) commented

@bviksoe The server logs were never meant to be used in such a way. These messages can be disabled or changed (even when in JSON format) without notice, and therefore 3rd-party code should never rely on them. Instead, your front-end can query the server through the available endpoints. If you have specific functionality in mind that is currently missing, submit a feature request and it will get implemented. Model loading and server status can already be queried properly through the existing API.
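For example, readiness can be polled through the server's health endpoint instead of watching the log (a minimal sketch; the host/port and the exact response body depend on your setup):

```sh
# returns HTTP 200 once the model is loaded and the server is ready to accept requests
curl -s -o /dev/null -w "%{http_code}\n" http://localhost:8080/health
```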

ggerganov added the merge ready label Sep 14, 2024
bviksoe (Contributor) commented Sep 14, 2024

Thanks. I understand your desire to clean up and streamline the various parts of the project.
My argument is specifically about actual error and warning messages produced during loading and even during streaming inference.
In the server there are many such messages that are conveyed only in the log, for example: LOG_ERROR("failed to get embeddings", ...). If you want to create a user-friendly front-end, these should be accessible to the website/API user.

ggerganov (Owner, Author) commented

If you provide sample curl requests, combined with the output that you would expect the server to return for each of them, we can extend the API and the responses. Feel free to open a feature request and list the missing functionality.
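As a hypothetical illustration of the kind of request/expected-output pair being asked for here (the payload and error body below are assumptions for illustration, not the server's documented behavior):

```sh
# sample request: ask the server for embeddings
curl -s http://localhost:8080/embedding -d '{"content": "Hello world"}'
# expected output when embeddings are not enabled (illustrative only):
# {"error": {"code": 501, "message": "embeddings are not supported", "type": "not_supported_error"}}
```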

ggerganov merged commit 6262d13 into master Sep 15, 2024
59 checks passed
ggerganov deleted the gg/log branch September 15, 2024 17:46
ggerganov added a commit to ggerganov/ggml that referenced this pull request Sep 20, 2024
kyx0r commented Sep 21, 2024

Prior to this, it was possible to use --log-disable to stop llama-cli from printing all of the model-loading debug logs. It would only print the prompt loaded with the -p flag and, of course, the model output plus user input when in interactive mode. Now this is broken: it prints neither the prompt nor the model output. As far as I can see, the verbosity options are not yet implemented completely enough to facilitate the old behavior.

I hope this gets fixed in the future, as right now it only works if --log-disable is removed, but that makes it dump everything instead of just the prompt. Also, --log-enable was removed, which was useful too, but my guess is that once the verbosity level settings work as expected, they will cover that aspect.

ggerganov (Owner, Author) commented

All messages with non-zero log level are output to stderr, so you can redirect stderr to /dev/null to get the old behavior.
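For example (a minimal sketch; the model path and prompt are placeholders):

```sh
# keep the generated text on stdout and silence the log messages, which go to stderr
./llama-cli -m model.gguf -p "Hello" 2> /dev/null
```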

kyx0r commented Sep 23, 2024

Hi Georgi,
Understood, I did not check whether it was still using stderr or not; it just wasn't obvious at a glance. I suppose users can continue using the stderr-redirection hack for the time being. My hope is that in the future more logging verbosity levels get implemented, so that it is more obvious and easier for end users to control.

Also, even if you redirect stderr, it doesn't remove everything. For example, you still get this message:
"== Running in interactive mode. =="
At least it's not too bad to just edit the source code and change the logging function it uses.

Kind Regards,
Kyryl

ggerganov added a commit to ggerganov/whisper.cpp that referenced this pull request Sep 24, 2024
ericcurtin (Contributor) commented

@ggerganov, I noticed the same thing: --log-disable now breaks conversation mode.

ggerganov (Owner, Author) commented

Don't use --log-disable. Instead redirect stderr to /dev/null if you don't need it.
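The same applies to conversation mode; a minimal sketch, assuming llama-cli's -cnv/--conversation flag:

```sh
# interactive conversation without the loading/debug logs (stderr discarded)
./llama-cli -m model.gguf -cnv 2> /dev/null
```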

ericcurtin added a commit to containers/ramalama that referenced this pull request Sep 25, 2024
Now instead of --log-disable we need to redirect stderr to /dev/null:

ggerganov/llama.cpp#9418

Signed-off-by: Eric Curtin <[email protected]>
dsx1986 pushed a commit to dsx1986/llama.cpp that referenced this pull request Oct 29, 2024
lyapple2008 pushed a commit to lyapple2008/whisper.cpp.mars that referenced this pull request Nov 2, 2024