Re: [Python-Dev] Incorrect length of collections.Counter objects / Multiplicity function

20 May 2010

      On Tue, May 18, 2010 at 11:00 PM, Gustavo Narea  wrote:
...
I've checked the new collections.Counter class and I think I've found a bug:
...
...
...
...
from collections import Counter
c1 = Counter([1, 2, 1, 3, 2])
c2 = Counter([1, 1, 2, 2, 3])
c3 = Counter([1, 1, 2, 3])
c1 == c2 and c3 not in (c1, c2)
True
# Perfect, so far. But... There's always a "but":
...
len(c1)
3
This is the intended behaviour;  it also agrees with what you get when
you iterate
over a Counter object:
...
...
...
list(c1)
[1, 2, 3]
As I understand it, there are other uses for Counter objects besides
treating them
as multisets;  I think the choices for len() and iter() reflected
those other uses.
...
Is this the intended behavior? If so, I'd like to propose a proper multiset
implementation for the standard library (preferably called "Multiset"; should
I create a PEP?).
Feel free!  The proposal should probably go to python-list or
python-ideas rather
than here, though.

See also this recent thread on python-list, and in particular the messages
from Raymond Hettinger in that thread:

http://mail.python.org/pipermail/python-list/2010-March/thread.html

Mark

Re: [Python-Dev] Incorrect length of collections.Counter objects / Multiplicity function

Mark Dickinson