-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
in_tail: Fluentd process goes unresponsive/hung and not tailing any of the log files #3882
Comments
@ashie request to please let me know if any workaround or remediations available for this unresponsive nature of td-agent issue. Thanks in advance. |
Could you try to get stack trace of Fluentd's worker process by sending |
|
@ashie Thanks for the response. here are the extracted sigdmp files from two instances. Please verify and let me know if you find any issues Please let me know if any information needed for further investigation. |
@ashie did you get any chance to go thorough the above sigdmp files? Please let me know if you need any further details here to identify the issue. |
It seems that your log is supervisor's one, not worker's one.
We need woker's log. |
sigdump-L2G.log |
@ashie Can you verify these files and let me know if you find any trace of issue in these files? Thanks in advance. |
It seems that fluentd process isn't stuck, probably it's just waiting next line. |
@ashie I tried placing the multiline_flush_interval 5s, but still unresponsiveness of td-agent.log continues as shown in these snippets: The log files shows this one line: Do I need to modify any thing in the output config as well? <match *.log> |
@ashie any further I can try to see if that can resolve the issue. |
@ashie Any update on this to mitigate the issue? |
This issue has been automatically marked as stale because it has been open 30 days with no activity. Remove stale label or comment or this issue will be closed in 7 days |
@ashie can we get any update on this.. is there any fix from the product end.. we still face this issue and seem the log rotate file issue.. |
Do you mean that |
It's been a while, so I close this. |
Describe the bug
As commented before in the #3614 (comment), the fluentd goes unresponsive or hung.
We tried upgrading the fluentd version from v1.11.5 to v1.15.2 on one of the server where we had a issue, but still we face the issue with the latest build.
[root@aau-lgiu06fms1 06:36:28]:/var/log/td-agent> ls -ltr
total 10024
drwxr-xr-x 2 td-agent td-agent 6 Aug 22 23:54 buffer
-rw-r--r-- 1 td-agent td-agent 59796 Aug 25 03:17 td-agent.log-20220825.gz
-rw-r----- 1 td-agent td-agent 35586 Aug 25 11:00 td-agent.log-20220826.gz
-rw-r----- 1 td-agent td-agent 129378 Aug 26 19:33 td-agent.log-20220827.gz
-rw-r----- 1 td-agent td-agent 878069 Aug 29 20:38 td-agent.log-20220830.gz
-rw-r----- 1 td-agent td-agent 53933 Aug 31 03:47 td-agent.log-20220831.gz
-rw-r--r-- 1 td-agent td-agent 306 Aug 31 15:05 t1.pos
-rw-r--r-- 1 td-agent td-agent 89 Aug 31 15:13 o1.pos
-rw-r--r-- 1 td-agent td-agent 73 Aug 31 15:14 s3.pos
-rw-r--r-- 1 td-agent td-agent 73 Aug 31 15:14 s2.pos
-rw-r--r-- 1 td-agent td-agent 96 Aug 31 15:14 b1.pos
-rw-r--r-- 1 td-agent td-agent 198 Aug 31 15:14 a1.pos
-rw-r--r-- 1 td-agent td-agent 19583 Aug 31 15:14 a2.pos
-rw-r--r-- 1 td-agent td-agent 13033 Aug 31 15:14 a3.pos
-rw-r--r-- 1 td-agent td-agent 68 Aug 31 15:14 s1.pos
-rw-r----- 1 td-agent td-agent 9027630 Aug 31 18:25 td-agent.log-20220901
-rw-r----- 1 td-agent td-agent 0 Sep 1 03:23 td-agent.log
To Reproduce
No error log message found in the logs.
Expected behavior
Logs to be pushed as usual after file rotation as fluentd
Your Environment
Your Configuration
Your Error Log
Additional context
We are seeing this issue on both 1.11.5 and 1.15.2 versions.
The text was updated successfully, but these errors were encountered: