Override Cycle::try_fold #62737

timvermeulen · 2019-07-16T23:30:22Z

It's not very pretty, but I believe this is the simplest way to correctly implement Cycle::try_fold. The following may seem correct:

loop {
    acc = self.iter.try_fold(acc, &mut f)?;
    self.iter = self.orig.clone();
}

...but this loops infinitely in case self.orig is empty, as opposed to returning acc. So we first have to fully iterate self.orig to check whether it is empty or not, and before that, we have to iterate the remaining elements of self.iter.

This should always call self.orig.clone() the same amount of times as repeated next() calls would.

r? @scottmcm

Alexendoo · 2019-07-24T12:51:43Z

Ping from triage, any updates? @scottmcm

scottmcm · 2019-07-27T21:59:52Z

I'm staring at the loop in here, and as the code it totally makes sense to me, but it has me wondering how much we actually trust the Clone implementation on the iterator we're getting.

The .next() implementation will exit if the cloned iterator ends up being empty somehow. Is it reasonable for a clone of an iterator to return a different number of things? For example, to have an iterator that reads from a &File. (I guess this is a libs questions...)

Could it make sense to just loop over the version that checks for empty, to avoid spinning on the clone if the iterator does end up empty somehow? Or maybe I'm worried for nothing...

timvermeulen · 2019-08-03T17:08:16Z

@scottmcm Even if .clone() erroneously returns an empty iterator, wouldn't that be caught by the emptiness check here? If the Clone implementation is incorrect I would expect .clone() to already return an empty iterator on line 400, as the original iterator will have been iterated at least once before that point.

The Clone implementation could produce an empty iterator only after multiple cycles, of course, but that seems a lot less likely to me. Or did you have a specific situation in mind where this could reasonably happen?

scottmcm · 2019-08-09T04:05:25Z

I was thinking of something like this: https://play.rust-lang.org/?version=stable&mode=debug&edition=2018&gist=71cd4c47937fc4f290ec9b1c65867435

But even that doesn't have two-or-more non-empty cycles followed by an empty one.

So I have to go all the way to this:

use std::cell::Cell;

fn main() {
    let n = &Cell::new(5);
    let c = &Cell::new(0);

    let it = std::iter::from_fn(|| {
        let x = c.get();
        if x == 0 {
            let x = n.get();
            n.set(x - 1);
            c.set(x);
            None
        } else {
            c.set(x - 1);
            Some(x)
        }
    });
    let it = it.cycle();
    it.for_each(|x| {
        dbg!(x);
    });
}

https://play.rust-lang.org/?edition=2018&gist=cd0b0f3e7187e8fa5aecb042f547483a

That's certainly a bit of a stretch, but it's easy to write and kindof an interesting way to do a triangular iterator. So maybe it'd be safer to just put the loop around the empty-checked version? And it feels like that outer loop isn't the place where this is improving perf, so it shouldn't cause any of the benchmarks to get materially worse. (I normally am fine with people getting slightly-odd behaviour from mutable closures, but introducing non-termination is scary to me.)

I guess I should also ping @rust-lang/libs to see if they have conventions for how to handle people doing unusual things in FnMut closures (like Iterator::next) passed to the library...

timvermeulen · 2019-08-09T10:20:28Z

Ah, so intentional misuse of cycle? I wouldn't really feel bad for breaking that code... But I also wouldn't mind changing it, if it doesn't impact performance. I'll come up with some benchmarks.

SimonSapin · 2019-08-16T14:15:18Z

Based only on the Self: Clone bound on the signature and the doc-comment, I feel it’s pretty clear that the cycle method expects that cloning the iterator will produce "the same" iterator.

Going out of one’s way to share mutable state between clones of the iterator (through Cell) breaks this expectation. I feel that it’s not necessary a bug for cycle to have arbitrary behavior (as long as it’s memory-safe) in that case, or for that behavior to change across versions.

scottmcm · 2019-08-17T07:08:29Z

Ok, I'll consider that libs approval for the somewhat-contrived behaviour change here. We have a good 10 weeks to revert if someone else has a strong opinion the other way.

@bors r+

I wonder if Iterator should document this expectation on Clone, like how Hash does?

bors · 2019-08-17T07:08:31Z

📌 Commit 688c112 has been approved by scottmcm

@scottmcm

…tmcm Override Cycle::try_fold It's not very pretty, but I believe this is the simplest way to correctly implement `Cycle::try_fold`. The following may seem correct: ```rust loop { acc = self.iter.try_fold(acc, &mut f)?; self.iter = self.orig.clone(); } ``` ...but this loops infinitely in case `self.orig` is empty, as opposed to returning `acc`. So we first have to fully iterate `self.orig` to check whether it is empty or not, and before _that_, we have to iterate the remaining elements of `self.iter`. This should always call `self.orig.clone()` the same amount of times as repeated `next()` calls would. r? @scottmcm

@scottmcm

…tmcm Override Cycle::try_fold It's not very pretty, but I believe this is the simplest way to correctly implement `Cycle::try_fold`. The following may seem correct: ```rust loop { acc = self.iter.try_fold(acc, &mut f)?; self.iter = self.orig.clone(); } ``` ...but this loops infinitely in case `self.orig` is empty, as opposed to returning `acc`. So we first have to fully iterate `self.orig` to check whether it is empty or not, and before _that_, we have to iterate the remaining elements of `self.iter`. This should always call `self.orig.clone()` the same amount of times as repeated `next()` calls would. r? @scottmcm

@ghost

Rollup of 4 pull requests Successful merges: - #62737 (Override Cycle::try_fold) - #63505 (Hash the remapped sysroot instead of the original.) - #63559 (rustc_codegen_utils: account for 1-indexed anonymous lifetimes in v0 mangling.) - #63621 (Modify librustc_llvm to pass -DNDEBUG while compiling.) Failed merges: r? @ghost

Override Cycle::try_fold

688c112

rust-highfive assigned scottmcm Jul 16, 2019

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Jul 16, 2019

scottmcm added T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. I-nominated S-waiting-on-team Status: Awaiting decision from the relevant subteam (see the T-<team> label). labels Aug 9, 2019

Dylan-DPC-zz removed the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Aug 16, 2019

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-team Status: Awaiting decision from the relevant subteam (see the T-<team> label). labels Aug 17, 2019

Centril mentioned this pull request Aug 17, 2019

Rollup of 5 pull requests #63653

Closed

Centril mentioned this pull request Aug 17, 2019

Rollup of 4 pull requests #63655

Merged

bors merged commit 688c112 into rust-lang:master Aug 17, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Override Cycle::try_fold #62737

Override Cycle::try_fold #62737

timvermeulen commented Jul 16, 2019

Alexendoo commented Jul 24, 2019

scottmcm commented Jul 27, 2019

timvermeulen commented Aug 3, 2019

scottmcm commented Aug 9, 2019 •

edited

Loading

timvermeulen commented Aug 9, 2019

SimonSapin commented Aug 16, 2019

scottmcm commented Aug 17, 2019

bors commented Aug 17, 2019

Override Cycle::try_fold #62737

Override Cycle::try_fold #62737

Conversation

timvermeulen commented Jul 16, 2019

Alexendoo commented Jul 24, 2019

scottmcm commented Jul 27, 2019

timvermeulen commented Aug 3, 2019

scottmcm commented Aug 9, 2019 • edited Loading

timvermeulen commented Aug 9, 2019

SimonSapin commented Aug 16, 2019

scottmcm commented Aug 17, 2019

bors commented Aug 17, 2019

scottmcm commented Aug 9, 2019 •

edited

Loading