Reword or even allow-by-default box_vec #2404

killercup · 2018-01-27T16:24:10Z

In #2394 (comment) (a typical "only 20%-on-topic comment" of mine), I wrote

But thinking about the box_vec lint I'm not sure I agree with its reasoning:

Vec already keeps its contents in a separate area on the heap. So if you Box it, you just add another level of indirection without any benefit whatsoever.

There is a benefit I can think of: Boxing a Vec or String reduces the size from 3 words to 1. I'm sure in some cases this may be a good idea, and, more importantly, it is most likely done on purpose. But maybe this is just a good idea in my imagination. Or maybe there is a crate that gives you a Vec-like API but stores length and capacity as part of the heap data, and we should suggest using that?
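To make the "3 words to 1" claim concrete, here is a minimal sketch that measures the inline size of each type with `std::mem::size_of`. The helper `words` is a hypothetical name introduced only for this illustration; the numbers it reports depend on the target and compiler, though on current Rust a `Vec` or `String` is expected to take three words and a `Box` of either to take one.

```rust
use std::mem::size_of;

/// Size of `T` measured in machine words (`usize`-sized units).
/// Hypothetical helper, only for this illustration.
fn words<T>() -> usize {
    size_of::<T>() / size_of::<usize>()
}

fn main() {
    // A Vec stores pointer, capacity and length inline.
    println!("Vec<u8>:      {} words", words::<Vec<u8>>());
    println!("String:       {} words", words::<String>());
    // Boxing moves that triple to the heap and leaves a single pointer inline.
    println!("Box<Vec<u8>>: {} words", words::<Box<Vec<u8>>>());
    println!("Box<String>:  {} words", words::<Box<String>>());
}
```

The saving is only in the inline size: every access to the elements now goes through two pointers instead of one, which is the extra indirection the lint warns about.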

And indeed—there is a crate that does just that, and it is @gankro's thin-vec. I'll allow the fact that it is not published to crates.io and has no readme to count for "this is an obscure use case".

Update: But it may be used in Gecko in the future (source).

So, I'm totally fine if nobody cares about this (I'll most likely never use this myself), but can I suggest changing the box_vec description to this? :)

Vec already keeps its contents in a separate area on the heap. So if you Box it, you add another level of indirection. This is often not the programmer's intention.

In case you want to reduce the size of an item on the stack, you should consider using a special Vec-like data structure that stores its length and capacity on the heap.

oli-obk · 2018-01-27T16:32:55Z

I'd go with the docs case, and if there's ever a crate on crates.io, we can add a suggestion to use it to the lint message

Gankra · 2018-01-29T15:16:48Z

Since I'm cited in this I should note that ThinVec is only reasonable by virtue of the boxing being "builtin" -- Box<Vec> is indeed a suspicious thing outside of very degenerate situations.

MoSal · 2019-10-22T14:01:47Z

So I have a use-case where large (but not huge) data is deserialized, and the deserialized data must be kept in memory as long as the app/service is running.

struct ChildA {
 id: String,
 foo: Option<Vec<Foo>>,
 bar: Option<Vec<Bar>>,
 // tens of other fields that are either
 // Option<Vec<T>> or Option<T>, most of which
 // are None most of the time
}

struct ChildB {
 // similar to ChildA
}

struct Root {
 // some general info elements
 // ----------
 // the number of elements in these vectors is large
 a: Option<Vec<ChildA>>,
 b: Option<Vec<ChildB>>,
}

Boxifying vectors (and other large types) in ChildA/ChildB makes a huge difference.
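A rough sketch of why this can matter, assuming a cut-down version of `ChildA` with only four optional vectors: `Foo`, `Bar`, the extra field names, and the two struct names are hypothetical stand-ins, not the real types from this use case. Each `Option<Vec<T>>` field occupies the full inline size of a `Vec` even when it is `None`, while `Option<Box<Vec<T>>>` is a single pointer.

```rust
use std::mem::size_of;

// Hypothetical stand-ins for the element types.
#[allow(dead_code)]
struct Foo(u64);
#[allow(dead_code)]
struct Bar(u64);

// Every optional vector is stored inline.
#[allow(dead_code)]
struct ChildPlain {
    id: String,
    foo: Option<Vec<Foo>>,
    bar: Option<Vec<Bar>>,
    baz: Option<Vec<Foo>>,
    qux: Option<Vec<Bar>>,
}

// Every optional vector is boxed; `None` is just a null pointer.
// `clippy::box_vec` is the lint discussed in this issue.
#[allow(dead_code, clippy::box_vec)]
struct ChildBoxed {
    id: String,
    foo: Option<Box<Vec<Foo>>>,
    bar: Option<Box<Vec<Bar>>>,
    baz: Option<Box<Vec<Foo>>>,
    qux: Option<Box<Vec<Bar>>>,
}

fn plain_size() -> usize {
    size_of::<ChildPlain>()
}

fn boxed_size() -> usize {
    size_of::<ChildBoxed>()
}

fn main() {
    println!("ChildPlain: {} bytes", plain_size());
    println!("ChildBoxed: {} bytes", boxed_size());
    println!("saved per element: {} bytes", plain_size() - boxed_size());
}
```

The per-element saving is multiplied by the length of the outer `Vec<ChildA>`, and it grows with the number of optional fields. The cost is one extra allocation and one extra pointer hop for each field that is actually `Some`.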

I think a useful heuristic would be to special-case if the boxed vec belongs to an enum variant, and allow the boxification if it reduces the enum size, in other words, letting large_enum_variant take over. If you think the boxed vector is going to be empty most of the time, then maybe you should put it inside an Option anyway.
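The enum case can be illustrated the same way. This is only a sketch of the size effect the heuristic would look at, not of how Clippy implements any lint; the enums `Inline` and `Boxed` and their variants are hypothetical.

```rust
use std::mem::size_of;

// The Vec variant dominates the size of the whole enum.
#[allow(dead_code)]
enum Inline {
    Empty,
    Number(u32),
    List(Vec<u32>),
}

// Boxing the Vec shrinks the largest variant to one pointer.
// `clippy::box_vec` is the lint discussed in this issue.
#[allow(dead_code, clippy::box_vec)]
enum Boxed {
    Empty,
    Number(u32),
    List(Box<Vec<u32>>),
}

fn inline_size() -> usize {
    size_of::<Inline>()
}

fn boxed_size() -> usize {
    size_of::<Boxed>()
}

fn main() {
    println!("Inline: {} bytes", inline_size());
    println!("Boxed:  {} bytes", boxed_size());
}
```

Under the proposed heuristic, a `Box<Vec<_>>` like the one in `Boxed::List` would not be flagged, because removing the box would make every value of the enum larger, including the `Empty` and `Number` variants that never hold a vector.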

oli-obk added the A-documentation (Area: Adding or improving documentation) label on Jan 27, 2018

flip1995 mentioned this issue Jan 15, 2020

Implement issue finder for lint names #5049

Closed
