Performance of `triomphe::Arc` is sometimes worse than `std::sync::Arc` #74

orium · 2023-11-02T19:14:38Z

Hi,

@michaelsproul and I ran some benchmarks in rpds . Most results improved when switching from std::sync::Arc to triomphe::Arc, but I was surprised that some benchmarks regressed significantly (> 30%). I'm wondering if there are optimizations on std::sync::Arc that were never ported to triomphe and that could improve triomphe and all projects that use it.

This is the PR with the benchmark results: orium/rpds#88

The text was updated successfully, but these errors were encountered:

Manishearth · 2023-11-02T19:19:45Z

No idea; this crate was written quite a while ago.

orium · 2023-11-03T11:41:37Z

If I take a look at the std::sync::Arc and port any potential optimizations to triomphe will you be available to review and merge them? (Not promising anything as I am busier that usual lately.)

michaelsproul · 2023-11-03T12:29:50Z

I was browsing std::sync::Arc's recent changes and this one stood out as something we could maybe backport: rust-lang/rust#115546. Although I'm not familiar enough with the memory model to know if this would make a big difference.

michaelsproul · 2023-11-03T12:37:56Z

This comment suggests is_unique relies on the Acquire semantics:

triomphe/src/arc.rs

Lines 546 to 549 in f78899c

    
           // See the extensive discussion in [1] for why this needs to be Acquire. 
        
           // 
        
           // [1] https://github.com/servo/servo/issues/21186 
        
           Self::count(self) == 1

Manishearth · 2023-11-03T15:04:36Z

I would be happy to get patches, I can't guarantee I'll be able to review and merge them in a timely manner but if they map closely to upstream code that would make it easier. Would also love help reviewing them

Dherse · 2023-11-28T13:03:26Z

I concur, I am working on Typst (as a contributor not owner/creator) and specifically on performance, we don't use weak refs to I figured swapping out std arcs for triomphe would help but performance is around 30% worse using triomphe so there is something here, perhaps an optimization that exists in the std lib and not here? As someone said, they have (apparently) relaxed the ordering in the std lib, perhaps we could do the same in Triomphe?
Thanks, Dherse

jmspiewak · 2024-05-21T20:52:31Z

The two differences I can see are:

In make_mut std uses Acquire only when the ref count is 1, triomphe always acquires. This shouldn't matter on x86.
Triomphe does an acquire load in drop, std an acquire fence. This is very minor.

Are you benchmarking heavily contended accesses or mostly single threaded?

michaelsproul · 2024-05-22T00:24:51Z

@jmspiewak The rpds benchmarks are mostly single-threaded

GnomedDev · 2024-09-08T10:51:05Z

Something to keep in mind: although orderings don't lead to different ASM on x86 in isolation, LLVM will still use them when optimising when functions are inlined, so weaker orderings can allow LLVM to reorder and generate better ASM.

orium mentioned this issue Nov 2, 2023

Use triomphe::Arc by default. orium/rpds#88

Merged

orium changed the title ~~Performance of triomphe::Arc is sometime worse than std::sync::Arc~~ Performance of triomphe::Arc is sometimes worse than std::sync::Arc Nov 9, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance of `triomphe::Arc` is sometimes worse than `std::sync::Arc` #74

Performance of `triomphe::Arc` is sometimes worse than `std::sync::Arc` #74

orium commented Nov 2, 2023 •

edited

Loading

Manishearth commented Nov 2, 2023

orium commented Nov 3, 2023

michaelsproul commented Nov 3, 2023

michaelsproul commented Nov 3, 2023

Manishearth commented Nov 3, 2023

Dherse commented Nov 28, 2023

jmspiewak commented May 21, 2024

michaelsproul commented May 22, 2024

GnomedDev commented Sep 8, 2024

Performance of triomphe::Arc is sometimes worse than std::sync::Arc #74

Performance of triomphe::Arc is sometimes worse than std::sync::Arc #74

Comments

orium commented Nov 2, 2023 • edited Loading

Manishearth commented Nov 2, 2023

orium commented Nov 3, 2023

michaelsproul commented Nov 3, 2023

michaelsproul commented Nov 3, 2023

Manishearth commented Nov 3, 2023

Dherse commented Nov 28, 2023

jmspiewak commented May 21, 2024

michaelsproul commented May 22, 2024

GnomedDev commented Sep 8, 2024

Performance of `triomphe::Arc` is sometimes worse than `std::sync::Arc` #74

Performance of `triomphe::Arc` is sometimes worse than `std::sync::Arc` #74

orium commented Nov 2, 2023 •

edited

Loading