Fluent Bit crashed with SIGSEGV error #6897

@ezienecker

Description

We recently updated to fluent-bit 2.0.9 (Helm Chart version 0.24.0). Since then, it has regularly crashed with exit code 139 (SIGSEGV).
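For readers unfamiliar with the number: by the common shell convention, an exit code above 128 means the process was killed by signal `code - 128`, so 139 corresponds to signal 11. A minimal sketch of that arithmetic follows; `decode_exit_code` is a hypothetical helper name and is not part of Fluent Bit or Helm.

```python
import signal


def decode_exit_code(code: int) -> str:
    """Map a shell-style exit code to a signal name (hypothetical helper).

    Assumes the usual convention that codes above 128 mean
    "terminated by signal (code - 128)".
    """
    if code > 128:
        try:
            return signal.Signals(code - 128).name
        except ValueError:
            # The number does not correspond to a signal known on this platform.
            return f"unknown signal {code - 128}"
    return f"exit {code}"


print(decode_exit_code(139))
```

On Linux, signal 11 is SIGSEGV, which matches the `caught signal (SIGSEGV)` line in the logs.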

The following error message appears in the logs:

[2023/02/22 07:27:34] [ info] [filter:kubernetes:kubernetes.0]  token updated
[2023/02/22 07:29:06] [engine] caught signal (SIGSEGV)
#0  0x55c299f28772      in  edata_arena_ind_get() at lib/jemalloc-5.3.0/include/jemalloc/internal/edata.h:258
#1  0x55c299f28772      in  tcache_bin_flush_impl() at lib/jemalloc-5.3.0/src/tcache.c:350
#2  0x55c299f28772      in  tcache_bin_flush_bottom() at lib/jemalloc-5.3.0/src/tcache.c:519
#3  0x55c299f28772      in  je_tcache_bin_flush_small() at lib/jemalloc-5.3.0/src/tcache.c:529
#4  0x55c299f29cb9      in  tcache_gc_small() at lib/jemalloc-5.3.0/src/tcache.c:148
#5  0x55c299f2bd71      in  ???() at lib/jemalloc-5.3.0/src/tcache.c:414
#6  0x55c299f2e62f      in  je_te_event_trigger() at lib/jemalloc-5.3.0/src/thread_event.c:299
#7  0x55c299ebf6ac      in  te_event_advance() at lib/jemalloc-5.3.0/include/jemalloc/internal/thread_event.h:287
#8  0x55c299ebf6ac      in  thread_dalloc_event() at lib/jemalloc-5.3.0/include/jemalloc/internal/thread_event.h:293
#9  0x55c299ebf6ac      in  ifree() at lib/jemalloc-5.3.0/src/jemalloc.c:2896
#10 0x55c299ebf6ac      in  je_free_default() at lib/jemalloc-5.3.0/src/jemalloc.c:3021
#11 0x55c29a497053      in  map_metric_destroy() at lib/cmetrics/src/cmt_map.c:160
#12 0x55c29a4973f3      in  cmt_map_destroy() at lib/cmetrics/src/cmt_map.c:273
#13 0x55c29a480110      in  cmt_counter_destroy() at lib/cmetrics/src/cmt_counter.c:94
#14 0x55c29a4a57ff      in  cmt_destroy() at lib/cmetrics/src/cmetrics.c:101
#15 0x55c29a016e23      in  collect_metrics() at src/flb_metrics_exporter.c:201
#16 0x55c29a016f57      in  flb_me_fd_event() at src/flb_metrics_exporter.c:253
#17 0x55c299f9d7d0      in  flb_engine_handle_event() at src/flb_engine.c:497
#18 0x55c299f9d7d0      in  flb_engine_start() at src/flb_engine.c:853
#19 0x55c299f44b24      in  flb_lib_worker() at src/flb_lib.c:629
#20 0x7f181e43bea6      in  ???() at ???:0
#21 0x7f181dcefa2e      in  ???() at ???:0
#22 0xffffffffffffffff  in  ???() at ???:0

In other instances, the following error message appears:

[2023/02/21 20:33:54] [ info] [filter:kubernetes:kubernetes.0]  token updated
[2023/02/21 20:43:54] [engine] caught signal (SIGSEGV)
#0  0x55c4622aae03      in  atomic_load_p() at lib/jemalloc-5.3.0/include/jemalloc/internal/atomic.h:83
#1  0x55c4622aae03      in  arena_get_from_edata() at lib/jemalloc-5.3.0/include/jemalloc/internal/arena_inlines_b.h:16
#2  0x55c4622aae03      in  je_large_dalloc() at lib/jemalloc-5.3.0/src/large.c:271
#3  0x55c46224e700      in  arena_dalloc_large() at lib/jemalloc-5.3.0/include/jemalloc/internal/arena_inlines_b.h:297
#4  0x55c46224e700      in  arena_dalloc() at lib/jemalloc-5.3.0/include/jemalloc/internal/arena_inlines_b.h:334
#5  0x55c46224e700      in  idalloctm() at lib/jemalloc-5.3.0/include/jemalloc/internal/jemalloc_internal_inlines_c.h:120
#6  0x55c46224e700      in  ifree() at lib/jemalloc-5.3.0/src/jemalloc.c:2887
#7  0x55c46224e700      in  je_free_default() at lib/jemalloc-5.3.0/src/jemalloc.c:3014
#8  0x55c4622e967a      in  flb_free() at include/fluent-bit/flb_mem.h:121
#9  0x55c4622ea913      in  flb_sds_destroy() at src/flb_sds.c:470
#10 0x55c46261b220      in  pack_record() at plugins/out_loki/loki.c:1233
#11 0x55c46261b6de      in  loki_compose_payload() at plugins/out_loki/loki.c:1381
#12 0x55c46261b7bd      in  cb_loki_flush() at plugins/out_loki/loki.c:1408
#13 0x55c4623079ae      in  output_pre_cb_flush() at include/fluent-bit/flb_output.h:528
#14 0x55c462d6e3a6      in  co_init() at lib/monkey/deps/flb_libco/amd64.c:117
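Both traces end inside jemalloc's free path, but they are reached from different callers (the metrics exporter in the first, the Loki output plugin in the second). As a triage aid, here is a rough sketch that parses frames in the format shown above and reports the first frame outside jemalloc. The names `parse_frames`, `first_caller_outside`, and `SAMPLE` are made up for illustration, and the regular expression only assumes the line layout seen in these logs.

```python
import re

# Matches lines such as:
# "#8  0x55c4622e967a      in  flb_free() at include/fluent-bit/flb_mem.h:121"
FRAME_RE = re.compile(
    r"^#(\d+)\s+0x[0-9a-f]+\s+in\s+(\S+)\(\) at (\S+?):(\d+)$"
)

# A few frames copied from the second trace in this report.
SAMPLE = """\
#6  0x55c46224e700      in  ifree() at lib/jemalloc-5.3.0/src/jemalloc.c:2887
#7  0x55c46224e700      in  je_free_default() at lib/jemalloc-5.3.0/src/jemalloc.c:3014
#8  0x55c4622e967a      in  flb_free() at include/fluent-bit/flb_mem.h:121
#9  0x55c4622ea913      in  flb_sds_destroy() at src/flb_sds.c:470
#10 0x55c46261b220      in  pack_record() at plugins/out_loki/loki.c:1233
"""


def parse_frames(trace: str):
    """Return (index, function, file, line) tuples for each recognizable frame."""
    frames = []
    for raw in trace.splitlines():
        match = FRAME_RE.match(raw.strip())
        if match:
            index, func, path, line = match.groups()
            frames.append((int(index), func, path, int(line)))
    return frames


def first_caller_outside(frames, prefix="lib/jemalloc"):
    """Return the first frame whose source path does not start with prefix."""
    for frame in frames:
        if not frame[2].startswith(prefix):
            return frame
    return None


print(first_caller_outside(parse_frames(SAMPLE)))
```

For the sample frames this points at `flb_free()`, called from `flb_sds_destroy()` in `pack_record()`. It only narrows down where the free was issued; it does not by itself show where the memory was corrupted.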

This error also occurs in the following versions:

  • Helm Chart version 0.23.0 (fluent-bit version 2.0.8)
  • Helm Chart version 0.22.0 (fluent-bit version 2.0.8)
  • Helm Chart version 0.21.0 (fluent-bit version not checked)

As a temporary workaround, I have downgraded to version 1.9.9 (Helm Chart version 0.20.11). Everything seems to work so far.

Maybe a bug was introduced in version 2.x?

    Labels

    Stale, waiting-for-release (This has been fixed/merged but it's waiting to be included in a release.)