Iterable vs Iterator distinction #2942

pkch · 2017-03-02T10:42:08Z

I think many Python programmers think of an iterable as a container of items that can be traversed several times. In other words, they would expect the following code to be correct:

from typing import Iterable

def count_max_values(iterable: Iterable) -> int:
    """Count the number of times the maximum value appears in iterable"""
    max_value = max(iterable, default=None)
    counter = 0
    for item in iterable:
        if item == max_value:
            counter += 1
    return counter

Obviously, if the argument were annotated as an Iterator, there would be no doubt that the above implementation is incorrect: the call to max() would exhaust it before the for loop runs.
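To make the failure concrete, here is a minimal sketch that runs the function above on a list and on a generator built from the same values. The sample data and the names `list_result` and `gen_result` are illustrative only.

```python
from typing import Iterable


def count_max_values(iterable: Iterable) -> int:
    """Count the number of times the maximum value appears in iterable."""
    max_value = max(iterable, default=None)  # first pass over the argument
    counter = 0
    for item in iterable:  # second pass: empty if the argument was an iterator
        if item == max_value:
            counter += 1
    return counter


# A list can be traversed twice, so both passes see all four items.
list_result = count_max_values([1, 3, 3, 2])

# A generator is consumed by max(), so the for loop sees nothing.
gen_result = count_max_values(x for x in [1, 3, 3, 2])

print(list_result, gen_result)  # prints "2 0"
```

No exception is raised in the generator case; the function simply returns a wrong answer.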

However, at the moment, the following code would pass the type check:

gen = (i for i in range(5))  # renamed from "iter" to avoid shadowing the builtin
count_max_values(gen)

The reason is that the definition of Iterable is currently only concerned with the presence of __getitem__() or __iter__() methods, and so every Iterator is automatically an Iterable.

Would it be worth redefining Iterator and Iterable? For example, the rule could be that if an object defines both __iter__ and __next__ methods, it is not an Iterable (since it's very weird for an iterable to have a __next__ method); otherwise, if it defines __iter__ or __getitem__, it is an Iterable. If necessary, an option could be given to the programmer to explicitly override this rule (marking as Iterable an object with both __iter__ and __next__; or as not Iterable an object with __iter__ and without __next__).
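The proposed rule can be approximated at runtime as follows. This is only a sketch of the idea, not how mypy or typing work: the function name `is_iterable_under_proposed_rule` is hypothetical, the check is done with hasattr on the object's class rather than statically, and the explicit override option is left out.

```python
def is_iterable_under_proposed_rule(obj: object) -> bool:
    """Hypothetical runtime approximation of the rule proposed above."""
    cls = type(obj)
    has_iter = hasattr(cls, "__iter__")
    has_next = hasattr(cls, "__next__")
    has_getitem = hasattr(cls, "__getitem__")

    # Objects with both __iter__ and __next__ are iterators, so the
    # proposal would not treat them as Iterable.
    if has_iter and has_next:
        return False
    # Otherwise either __iter__ or __getitem__ is enough.
    return has_iter or has_getitem


print(is_iterable_under_proposed_rule([1, 2, 3]))                # True
print(is_iterable_under_proposed_rule(i for i in range(5)))      # False
print(is_iterable_under_proposed_rule(iter([1, 2, 3])))          # False
```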


ilevkivskyi · 2017-03-02T11:27:21Z

Would it be worth redefining Iterator and Iterable?

I think no, since it reflects the Python runtime semantics. Your code correctly passes mypy because it works at runtime without errors.

The problem with your code is a design/behavior error, not a type error: the generator expression is already exhausted before the for loop starts. Your code implies that you need an "immutable" iterable, one that can be traversed more than once. You could therefore wrap the argument in a tuple at the start of the function: iterable = tuple(iterable). This kind of error is quite difficult to catch statically.
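A minimal sketch of the tuple-wrapping suggestion, keeping the Iterable annotation. The local name `items` is introduced here for illustration; the trade-off is that the whole input is held in memory.

```python
from typing import Iterable


def count_max_values(iterable: Iterable) -> int:
    """Count the number of times the maximum value appears in iterable."""
    # Materialize the argument once so that it can be traversed twice,
    # even when the caller passes a generator or another iterator.
    items = tuple(iterable)
    max_value = max(items, default=None)
    counter = 0
    for item in items:
        if item == max_value:
            counter += 1
    return counter


print(count_max_values(i for i in range(5)))  # prints 1
```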

gvanrossum · 2017-03-02T15:36:13Z

We are not going to change this, but you can use the type Container for iterables that can be iterated repeatedly.

pkch · 2017-03-03T01:38:50Z

@gvanrossum You meant Collection, not Container; Container would cause mypy to reject the function as implemented because Container doesn't guarantee the presence of __iter__, which is implicitly used inside the function.

@ilevkivskyi I also thought it was difficult, but it appears this problem has recently been solved in Python 3.6 with the addition of typing.Collection, which works perfectly:

from typing import Collection
def count_max_values(iterable: Collection) -> int:
    """Count the number of times the maximum value appears in iterable"""
    max_value = max(iterable, default=None)
    counter = 0
    for item in iterable:
        if item == max_value:
            counter += 1
    return counter

Now an attempt to call count_max_values( (i for i in range(5)) ) will be statically rejected by mypy.
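The same distinction is visible at runtime through the abstract base classes in collections.abc. This is only an illustration of why a generator does not qualify as a Collection (it has no __len__ or __contains__); it is not a substitute for the static check, and the names `gen`, `items` and `checks` are made up for the example.

```python
from collections.abc import Collection, Iterable, Iterator

gen = (i for i in range(5))
items = [0, 1, 2, 3, 4]

# For each object: (is Iterable, is Iterator, is Collection)
checks = {
    "generator": (
        isinstance(gen, Iterable),
        isinstance(gen, Iterator),
        isinstance(gen, Collection),
    ),
    "list": (
        isinstance(items, Iterable),
        isinstance(items, Iterator),
        isinstance(items, Collection),
    ),
}

print(checks["generator"])  # (True, True, False)
print(checks["list"])       # (True, False, True)
```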

gvanrossum · 2017-03-03T02:02:48Z

Yeah, sorry, I meant Collection. :-)

--Guido (mobile)


gvanrossum closed this as completed Mar 2, 2017
