Summary counter breaks under high load #189
Comments
This issue is stale because it has been open 60 days with no activity. Remove stale label or comment or this will be closed in 30 days.
It seems that the problem no longer reproduces after #241.
Unfortunately, it's still relevant. See https://github.com/tarantool/grafana-dashboard/tree/DifferentialOrange/crud-report. Start a cluster with

Setup:

    local DEFAULT_QUANTILES = {
        [0.99] = 1e-2,
    }
    local DEFAULT_AGE_PARAMS = {
        age_buckets_count = 2,
        max_age_time = 60,
    }

If changed to

    local DEFAULT_QUANTILES = {
        [0.99] = 1e-3,
    }

everything seems fine.
Make metrics quantile collector tolerated error [1] configurable. Change metrics quantile collector default tolerated error from 1e-2 to 1e-3. The motivation of this patch is a tarantool/metrics bug [2]. Sometimes quantile values turn to `-Inf` under high load when observations are small. It was reproduced in process of developing Grafana dashboard panels for CRUD stats. Quantile tolerated error could be changed with crud.cfg:

    crud.cfg{stats_quantile_tolerated_error = 1e-4}

1. https://www.tarantool.io/ru/doc/latest/book/monitoring/api_reference/#summary
2. tarantool/metrics#189
3. https://github.com/tarantool/grafana-dashboard/tree/DifferentialOrange/crud-report

Revert "stats: make quantile tolerated error configurable"

This reverts commit 32c6f5eabecc907ef570b66e15029dc9b4d6debf.
Make metrics quantile collector tolerated error [1] configurable. Change metrics quantile collector default tolerated error from 1e-2 to 1e-3. The motivation of this patch is a tarantool/metrics bug [2]. Sometimes quantile values turn to `-Inf` under high load when observations are small. It was reproduced in process of developing Grafana dashboard panels for CRUD stats [3]. Quantile tolerated error could be changed with crud.cfg:

    crud.cfg{stats_quantile_tolerated_error = 1e-4}

1. https://www.tarantool.io/ru/doc/latest/book/monitoring/api_reference/#summary
2. tarantool/metrics#189
3. https://github.com/tarantool/grafana-dashboard/tree/DifferentialOrange/crud-report
repro

    local fiber = require('fiber')
    local clock = require('clock')
    local log = require('log')

    local function monitor(collector)
        local time_before
        while true do
            time_before = clock.monotonic()
            fiber.yield()
            collector:observe(clock.monotonic() - time_before)
        end
    end

    local function init()
        local collector = require('metrics').summary('tnt_fiber_event_loop', 'event loop time',
            { [0.5] = 0.01, [0.9] = 0.01, [0.99] = 0.01, })
        fiber.create(function() monitor(collector) end)
    end

    init()
    require('console').start()
    os.exit(1)

And manually:
I tried to monitor event loop with
after half a minute I've got
Also, the observation count in the collector wasn't equal to the one in the structure.
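
Not part of the original report: a minimal sketch, in plain Lua, of how the `-Inf` quantile values described in this issue could be detected when checking a reproduction. `is_finite`, `find_non_finite`, and the `sample` table are hypothetical names and hand-written data, not tarantool/metrics API or real output.

```lua
-- Hypothetical helper, not part of tarantool/metrics.
-- Returns true for ordinary numbers, false for NaN, +Inf and -Inf.
local function is_finite(x)
    return type(x) == 'number'
        and x == x              -- NaN is the only value not equal to itself
        and x ~= math.huge
        and x ~= -math.huge
end

-- Scans a list of observations shaped like
-- { metric_name = ..., label_pairs = ..., value = ... } (assumed shape)
-- and returns the entries whose value is not finite.
local function find_non_finite(observations)
    local bad = {}
    for _, obs in ipairs(observations) do
        if not is_finite(obs.value) then
            table.insert(bad, obs)
        end
    end
    return bad
end

-- Hand-written sample for illustration only; not output from a real run.
local sample = {
    { metric_name = 'tnt_fiber_event_loop', label_pairs = { quantile = 0.5 },
      value = 1e-5 },
    { metric_name = 'tnt_fiber_event_loop', label_pairs = { quantile = 0.99 },
      value = -math.huge },
}

local bad = find_non_finite(sample)
print(#bad) -- prints 1
```

In Tarantool, the input could presumably be the list returned by `collector:collect()`, assuming its entries expose a numeric `value` field. That shape is an assumption here and should be checked against the installed metrics version.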