Implement C-level ping? #74

mperham · 2021-10-14T22:13:03Z

Hi, I've been trying to monitor Redis network latency within Sidekiq by using PING but I've learned that a process pegged at 100% CPU will dramatically overstate latency due to thread scheduling latency around the GVL. If you have 10 jobs crunching numbers, it may take 50-100ms to get a Ruby thread scheduled to process the PONG. Would you be interested in a special PING impl which is designed only to calculate round trip time in C, so as to avoid Ruby VM overhead?

I'm thinking something as simple as:

> redis.rtt_us
=> 267

where the result is the calculated RTT in µs.

See also sidekiq/sidekiq#5025

nateberkopec · 2021-10-14T23:55:19Z

> it may take 50-100ms to get a Ruby thread scheduled to process the PONG

More than that, even. Threads are only interrupted every 100ms, so the worst-case scenario is NUMBER_OF_THREADS * 100ms. Ouch!
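To put a number on that bound, here is a minimal sketch of the arithmetic. The 100ms time slice is taken from the comment above; `worst_case_wait_ms` is a hypothetical helper name used only for illustration, and it assumes every busy thread uses its full slice before yielding.

```ruby
# Time slice after which a running Ruby thread is interrupted,
# as stated in the comment above.
QUANTUM_MS = 100

# Hypothetical helper: upper bound on how long a thread may wait for the GVL
# when `busy_threads` CPU-bound threads each consume a full time slice first.
def worst_case_wait_ms(busy_threads, quantum_ms: QUANTUM_MS)
  busy_threads * quantum_ms
end

puts worst_case_wait_ms(10) # 10 CPU-bound threads => 1000
```

With 10 CPU-bound threads the bound is 1000ms, roughly 1 second of scheduling delay before the PONG can be processed.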

mperham · 2023-04-26T18:02:57Z

@byroot is this something you would be interested in providing in hiredis-client?

byroot · 2023-04-26T18:07:57Z

Hmm, perhaps. I'd need to have a look at how doable it would be, as I'd need to skip the reader code because it needs the GVL.

I'll have a quick look tomorrow.

Latency is returned as a Float in milliseconds. For the Ruby driver, it's equivalent to measuring how long `call("PING")` takes. With the Hiredis driver, however, the latency is measured without ever holding the GVL, which gives a much more accurate measurement that isn't impacted by GVL contention.

Ref: redis/hiredis-rb#74

Test script:

```ruby
def fibonacci( n )
  return n if ( 0..1 ).include? n
  ( fibonacci( n - 1 ) + fibonacci( n - 2 ) )
end

require "redis-client"
if ENV["DRIVER"] == "hiredis"
  require "hiredis-client"
end

client = RedisClient.new

threads = 10.times.map do
  Thread.new do
    loop do
      fibonacci(30)
    end
  end
end

5.times do
  puts "latency: #{client.measure_round_trip_delay}ms"
  sleep 1
end
```

```
$ DRIVER=ruby bundle exec ruby -I hiredis-client/lib/ /tmp/measure-latency.rb
latency: 1033.9850000143051ms
latency: 1039.7799999713898ms
latency: 1040.0930000543594ms
latency: 1050.2749999761581ms
latency: 1044.6280000209808ms
```

```
$ DRIVER=hiredis bundle exec ruby -I hiredis-client/lib/ /tmp/measure-latency.rb
latency: 0.307ms
latency: 0.351ms
latency: 0.236ms
latency: 0.221ms
latency: 0.321ms
```

mperham · 2025-01-23T16:22:08Z

If I want to use this number to distinguish between Redis latency issues and GVL latency issues, how can I do that? Right now I can't tell if the returned value is from Ruby (and includes GVL latency) or from C (and does not). Is there a way to capture both types of latency?

I guess I could do something like:

start = Time.now
rtt = redis.measure_round_trip_delay # Float, in milliseconds
stop = Time.now
gvl = ((stop - start) * 1000) - rtt # convert elapsed seconds to ms before subtracting

Any thoughts or ideas?

byroot · 2025-01-23T16:30:41Z

> I guess I could do something like:

Yes, that's one way. That is assuming hiredis-client is used. If you're using pure ruby redis-client, then that won't make any difference.

Alternatively, you can look at the various gems that hook into the GVL instrumentation API: https://github.com/byroot/byroot.github.io/pull/8/files#diff-32ffcd1b6c378d7bdbf5debfa717cfff28a6a806d12857d951c0e250371800ecR64
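The subtraction approach discussed above could be sketched roughly as follows. This is only an illustration under stated assumptions: `estimate_gvl_delay_ms` and `FakeClient` are hypothetical names that are not part of redis-client, and the result is only meaningful when the hiredis driver is in use, since the pure Ruby driver's measurement already includes GVL wait time. Both values are kept in milliseconds, because `measure_round_trip_delay` returns milliseconds while Ruby time differences default to seconds.

```ruby
# Hypothetical helper (not part of redis-client): compares wall-clock time
# around the call with the round-trip delay reported by the client.
# `client` is any object responding to #measure_round_trip_delay (ms Float).
def estimate_gvl_delay_ms(client)
  started = Process.clock_gettime(Process::CLOCK_MONOTONIC, :float_millisecond)
  rtt_ms = client.measure_round_trip_delay
  elapsed_ms = Process.clock_gettime(Process::CLOCK_MONOTONIC, :float_millisecond) - started

  # Whatever wall-clock time is not explained by the network round trip is
  # attributed to waiting inside the Ruby VM; clamp at zero to absorb jitter.
  { rtt_ms: rtt_ms, gvl_ms: [elapsed_ms - rtt_ms, 0.0].max }
end

# Stand-in for a real client so the sketch runs without a Redis server:
# it reports a 0.3ms round trip but takes about 5ms to return, simulating
# time spent waiting to be rescheduled.
class FakeClient
  def measure_round_trip_delay
    sleep 0.005
    0.3
  end
end

result = estimate_gvl_delay_ms(FakeClient.new)
puts format("rtt: %.3fms, other delay: %.3fms", result[:rtt_ms], result[:gvl_ms])
```

A monotonic clock is used instead of `Time.now` so that system clock adjustments cannot skew the elapsed time.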

mperham · 2025-01-23T16:57:44Z

Speedshop's middleware (and really all of those tools/gems) looks really promising but suffers from the same problem as all advanced tools: the people who need it most don't know about it or understand it. I can document its usage with Sidekiq but 95% of people won't use it.

One of my dependency rules for Sidekiq is to never depend on native code/extensions. hiredis is fine because the user opts in but I can't directly pull in any of those native gems. I might need to wait for Ruby itself to offer better GVL visibility.

Does rediss: imply hiredis usage? If so, that means most people will be using hiredis in production and the logic above is usable. I could apply the above logic only when rediss: is in use.

byroot · 2025-01-23T16:59:05Z

> Does rediss: imply hiredis usage?

No, rediss: implies SSL.

mperham · 2025-01-23T17:38:59Z

Ok, an OpenSSL socket. Got it. I presume this latency API will be in 0.23.3?

byroot · 2025-01-23T17:40:56Z

> I presume this latency API will be in 0.23.3?

It was released two years ago in 0.15.0

mperham · 2025-01-23T17:49:51Z

Oops, my client compatibility layer was hiding it. It's working great now.

nateberkopec · 2025-01-23T21:06:55Z

> the people that need it most don't know about it or understand it. I can document its usage with Sidekiq but 95% of people won't use it.

The middleware is a stepping stone to an automatic adaptive concurrency tool. Our end goal is that the user will not need to configure anything.

mperham mentioned this issue Oct 31, 2021

Redis::ConnectionError with { reconnect_attempts: 0 } caused (?) the leader to miss cron jobs sidekiq/sidekiq#4950

Closed

casperisfine mentioned this issue Apr 27, 2023

Implement RedisClient#measure_round_trip_delay redis-rb/redis-client#113

Merged

byroot closed this as completed Jan 23, 2025
