Use the object crate for metadata reading #83640

bjorn3 · 2021-03-29T09:50:02Z

This allows sharing the metadata reader between cg_llvm, cg_clif and other codegen backends.

This is not currently useful for rlib reading with cg_spirv (rust-gpu) as it uses tar rather than ar as .rlib format, but it is useful for dylib reading required for loading proc macros. (cc @eddyb)

The object crate is already trusted as a dependency of libstd through backtrace. As far as I know it supports reading all object file formats used by targets for which we support Rust dylibs with crate metadata, but I am not certain. If this happens to not be the case, I could keep using LLVM for reading dylib metadata.

Marked as WIP for a perf run and as it is based on #83637.

rust-highfive · 2021-03-29T09:50:05Z

Some changes occurred to rustc_codegen_cranelift

cc @bjorn3

rust-highfive · 2021-03-29T09:50:06Z

r? @matthewjasper

(rust-highfive has picked a reviewer for you, use r? to override)

bjorn3 · 2021-03-29T09:51:07Z

@bors try @rust-timer queue

rust-timer · 2021-03-29T09:51:08Z

Awaiting bors try build completion.

@rustbot label: +S-waiting-on-perf

bors · 2021-03-29T09:51:17Z

⌛ Trying commit aeee8d87bffdd7370b2c2bf149bf1bfaf7bc18e0 with merge 652060f31a2669f8467ef860c7b663be75b0bd44...

bjorn3 · 2021-03-29T09:54:39Z

@bors r-

bjorn3 · 2021-03-29T09:58:39Z

@bors try @rust-timer queue

rust-timer · 2021-03-29T09:58:40Z

Awaiting bors try build completion.

@rustbot label: +S-waiting-on-perf

bors · 2021-03-29T09:58:47Z

⌛ Trying commit d2b63d8cd784c90b2c96853d5acea0d6c8ab9976 with merge a7a4d9f098502325534a9e8e22b59dcec45582bc...

bjorn3 · 2021-03-29T12:16:48Z

@bors try @rust-timer queue

rust-timer · 2021-03-29T12:16:50Z

Awaiting bors try build completion.

@rustbot label: +S-waiting-on-perf

bors · 2021-03-29T12:17:01Z

⌛ Trying commit 29f195cc7bb5fbcaa7abfab7a1a9fff63ebcbcb0 with merge dfb3f56d49b7f86c1163b25c8f7d06ca511df6e1...

bors · 2021-03-29T13:05:01Z

☀️ Try build successful - checks-actions
Build commit: dfb3f56d49b7f86c1163b25c8f7d06ca511df6e1 (dfb3f56d49b7f86c1163b25c8f7d06ca511df6e1)

rust-timer · 2021-03-29T13:05:04Z

Queued dfb3f56d49b7f86c1163b25c8f7d06ca511df6e1 with parent 40334da, future comparison URL.

bjorn3 · 2021-05-07T17:12:43Z

Spurious error. Re-running PR checks.

nagisa · 2021-05-08T18:59:34Z

compiler/rustc_codegen_ssa/src/back/metadata.rs

+            let archive = object::read::archive::ArchiveFile::parse(&*data)
+                .map_err(|e| format!("{:?}", e))?;
+
+            for entry_result in archive.members() {


So, I think this (and the use of mmap) may be one plausible reason for the slight performance regression. If my memory of the ar format serves me well, obtaining the list of members in an ar has to process the file effectively as if it were a jump list. I suspect that such a read pattern may be pathological for mmap-based I/O: the kernel would try loading more data (a page?) into memory only for us to inspect the file name and length before we jump to the next entry(-ies), discarding the rest of the data that the kernel spent time loading.

Without digging into LLVM's ArchiveRO implementation I can imagine that more precise reads could be more effective here.
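To make the access pattern concrete, here is a minimal sketch of walking an ar archive with explicit reads and seeks instead of a memory map. It is not the code from this PR or from the object crate; `list_members` is a hypothetical helper, and it assumes the common ar layout (an 8-byte `!<arch>\n` magic, then 60-byte member headers with the name in bytes 0..16 and the decimal size in bytes 48..58, with member data padded to an even offset). Special members such as the GNU symbol table and long-name table are reported like any other entry.

```rust
use std::io::{self, Read, Seek, SeekFrom};

const GLOBAL_MAGIC: &[u8; 8] = b"!<arch>\n";
const HEADER_LEN: u64 = 60;

/// Hypothetical helper: returns the raw (name, size) pair of every member,
/// reading only the 60-byte headers and seeking over the member data.
fn list_members<R: Read + Seek>(mut r: R) -> io::Result<Vec<(String, u64)>> {
    let invalid = |msg: &'static str| io::Error::new(io::ErrorKind::InvalidData, msg);

    // Find the total length so the loop knows where the archive ends.
    let end = r.seek(SeekFrom::End(0))?;
    r.seek(SeekFrom::Start(0))?;

    let mut magic = [0u8; 8];
    r.read_exact(&mut magic)?;
    if &magic != GLOBAL_MAGIC {
        return Err(invalid("not an ar archive"));
    }

    let mut members = Vec::new();
    let mut header = [0u8; HEADER_LEN as usize];
    let mut pos = GLOBAL_MAGIC.len() as u64;
    while pos < end {
        r.read_exact(&mut header)?;
        // Every header ends with the two-byte terminator "`\n".
        if header[58..60] != b"`\n"[..] {
            return Err(invalid("bad member header terminator"));
        }
        // The name is space-padded; it is kept raw (e.g. with a trailing '/').
        let name = String::from_utf8_lossy(&header[0..16]).trim_end().to_string();
        // The size is a space-padded decimal number.
        let size: u64 = std::str::from_utf8(&header[48..58])
            .ok()
            .and_then(|s| s.trim().parse().ok())
            .ok_or_else(|| invalid("bad member size field"))?;
        members.push((name, size));

        // Jump straight to the next header; data is padded to an even offset.
        pos += HEADER_LEN + size + (size & 1);
        r.seek(SeekFrom::Start(pos))?;
    }
    Ok(members)
}
```

Passing a `std::fs::File` to this function reads 60 bytes per member and never touches the member data, which is the "more precise reads" idea above. Whether that actually beats mmap for rlib-sized archives would need to be measured.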

ArchiveRO::open also uses mmap for header reading I think:

rust/compiler/rustc_llvm/llvm-wrapper/ArchiveWrapper.cpp

Line 66 in 6e17a5c

ErrorOr<std::unique_ptr<MemoryBuffer>> BufOr =

MemoryBufferRef doesn't export any method allowing read calls on the mapped file.

Cargo.lock

nagisa · 2021-05-08T19:15:18Z

The implementation is very reasonable and a huge code quality improvement over the LLVM-based version IMO. I'm especially happy with the unification of metadata reading code between backends. The largest regressions introduce an instruction count hit of 0.7%, in benchmarks in line with those I'd expect given the workflows that this change would affect most (check, debug).

While the hit is not trivially ignorable, it's also small enough, I think, that the maintainability improvements would justify it. And it sounds like there may be a number of low-hanging fruit in the archive parsing implementation too.

compiler/rustc_codegen_llvm/src/base.rs

alexcrichton · 2021-05-11T14:49:50Z

There's a segfault in #84449 and what I think is memory corruption (probably the same thing), and I would personally love to not have to track it down to some weird interaction with the LLVM C API here. I suspect it will "magically go away" if this were all Rust-based!

nagisa · 2021-05-11T14:58:07Z

Note that @philipc and I found a couple of obvious places to optimize the archive reading code. Some of that has landed and will eventually make its way over to rustc through the natural passage of time. Some of that would need changes on the rustc side AFAICT.

AFAICT the code with ReadCache would also be somewhat simpler, too. Care to try that out? r=me regardless.

bjorn3 · 2021-05-11T16:28:45Z

> AFAICT the code with ReadCache would also be somewhat simpler, too. Care to try that out?

I am not sure what you are referring to.

nagisa · 2021-05-14T10:26:01Z

@bors r+ Thanks.

> I am not sure what you are referring to.

I was referring to the experiment made in this branch (linked to by one of the comments I linked above), and in particular this commit. But it's fine anyway.

bors · 2021-05-14T10:26:04Z

📌 Commit 6381aaf has been approved by nagisa

bors · 2021-05-14T12:59:03Z

⌛ Testing commit 6381aaf with merge 75da570...

bors · 2021-05-14T15:40:20Z

☀️ Test successful - checks-actions
Approved by: nagisa
Pushing 75da570 to master...

rust-highfive assigned matthewjasper Mar 29, 2021

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Mar 29, 2021

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Mar 29, 2021

bors added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Mar 29, 2021

bjorn3 force-pushed the shared_metadata_reader branch from aeee8d8 to d2b63d8 Compare March 29, 2021 09:56

bjorn3 force-pushed the shared_metadata_reader branch from d2b63d8 to 570f9a0 Compare March 29, 2021 10:02

Disable wasm feature of object in cg_ssa

802fe17

The version 1 resolver unifies enabled features across the whole workspace. This includes libstd which isn't allowed to depend on wasmparser.

bjorn3 force-pushed the shared_metadata_reader branch from 9865eb5 to 802fe17 Compare May 7, 2021 16:57

nagisa reviewed May 8, 2021

compiler/rustc_codegen_ssa/src/back/metadata.rs

nagisa reviewed May 8, 2021

Cargo.lock

bjorn3 added 2 commits May 10, 2021 09:46

Remove wasmparser

b65a92f

Better error messages

487427f

mati865 reviewed May 10, 2021

compiler/rustc_codegen_llvm/src/base.rs

Add link to historic note

537e814

Use DefaultMetadataLoader in the hotplug_codegen_backend test

6381aaf

bors added the merged-by-bors This PR was explicitly merged by bors. label May 14, 2021

bors merged commit 75da570 into rust-lang:master May 14, 2021

rustbot added this to the 1.54.0 milestone May 14, 2021

bjorn3 deleted the shared_metadata_reader branch May 14, 2021 16:11

klensy mentioned this pull request Jun 2, 2021

global_asm expands arguments in comments #85944

Closed

petrochenkov mentioned this pull request Aug 4, 2022

change rlib format to distinguish native dependencies #100101

Merged
