EXOR or symmetric difference for the Counter class

Paddy paddy3118 at googlemail.com
Mon Aug 16 13:21:12 EDT 2010


On 14 Aug, 18:14, Raymond Hettinger <pyt... at rcn.com> wrote:
> On Aug 12, 1:20 pm, Paddy <paddy3... at googlemail.com> wrote:
>
> > I find myself needing to calculate the difference between two Counters
> > or multisets or bags.
>
> > I want those items that are unique to each bag.
>
> Tell us about your use cases.  I'm curious how a program would ascribe
> semantic meaning to the result.  The phrase "unique to each bag"
> doesn't quite cover it, perhaps something like "number in either
> source above the minimum held in common".
>
> AFAICT, I've never needed something like this as a primitive.  Even
> the xor operation for regular sets is rarely used.
>
> > I know how to
> > calculate it:
>
> >     >>> b = Counter(a=1, b=2)
> >     >>> c = Counter(a=3, b=1)
> >     >>> diff = (b - c) + (c - b)
> >     >>> diff
> >     Counter({'a': 2, 'b': 1})
>
> That seems simple enough.
> You could also use:
>
>  diff = (b | c) - (b & c)   # max(b,c) - min(b,c)
>
> Raymond

Hi Raymond and others,

Lets say you have two *sets* of integers representing two near-copies
of some system, then a measure of their difference could be calculated
as:

len(X.symmetric_difference(Y)) / (len(X) + len(Y)) * 100 %

If the two collections of integers are allowed duplicates then you
need a Counter/bag/multi-set type and the diff calculation I gave
originally.

Thanks.



More information about the Python-list mailing list