[Python-ideas] Re: PEP 472 -- Support for indexing with keyword arguments

18 Jul 2020

      On Fri, Jul 17, 2020 at 7:21 PM Steven D'Aprano  wrote:
...
On Fri, Jul 17, 2020 at 11:11:17AM -0400, Ricky Teachey wrote:
...
For backwards-compatibility, there will only ever be a single positional
argument passed into the method. That's because comma-separated values
in a subscript are already passed as a tuple:
# this calls __getitem__ with a single tuple argument
    obj[a,b:c,d]  ==> (1, slice(2, 3), 4)
So that's not going to change (at least not without a long and painful
deprecation process). But adding support for keyword arguments requires
no changes to any existing class or a new builtin "key object" type.
This strikes me as problematic for having a consistent mental model of how
stuff works in python. I think that for many the difference in the meaning
of the syntax between item-getting/setting and function-calling would be...
glaring.

The left-hand-side syntax below looks pretty much identical in both cases
(except for the sugar of the slice object), but it feels
surprisingly different meanings for the positional arguments:
...
...
...
a, b, c, d = (1, 2, 3, 4)
args = a, slice(b, c), d
#    LEFT HAND SIDE          RIGHT HAND SIDE
obj[a, b:c, d, e=5, f=6]  == obj.__getitem__(args, e=5, f=6)
f(a, slice(b,c), d, e=5, f=6)  != f(args, d, e=5, f=6)
On the one hand, a fairly experienced person (who is familiar with the
history of the item dunders, and a preexisting mental model that they are
always being supplied a single positional argument ) should not have too
much of a problem understanding WHY these would behave differently.

But on the other hand, even as an experienced person, this really messes
with my brain, looking at it. It's hard for me to believe this isn't going
to be a painful distinction for a large number of people to hold in their
head-- especially beginners (but not only beginners).

A potentially elegant way around this glaring difference in the meaning of
the syntax might be the key-object paradigm Jonathan Fine has suggested.
However, that only works if you *disallow mixing together* positional
arguments and kwd args inside the [ ]:
...
...
...
d[a="foo", b="bar"] == d[KeyObject(a="foo", b="bar")]
If you specifically desire to mix together positional and kwd arguments,
the key-object paradigm isn't nearly as elegant... there's still magical
stuff happening to "split off" the positional arguments from the keyword
arguments:
...
...
...
# The positional arguments aren't part of the KeyObject
d[a, b:c, d, e=5, f=6] == d.__getitem__((a, b:c, d), KeyObject(e=5,
f=6))
This raises a question that needs to be answered, then: what would be the
utility of mixing together positional and kwd arguments in this way?

Even the xarray examples given so far don't seem to make use of this
mixture. From my knowledge of pandas I am not sure what the meaning of
this would be, either.
...
But! On the other hand, the huge difference here is that currently, the
...
signature of the get/set item dunders only accepts a single key argument.
To be precise, you can put any signature you like into a `__getitem__`
method, but only the first positional argument will be called from
subscripting syntax.
I appreciate you pointing that out, it's a meaningful distinction because
it could mean that rewriting the default __getitem__ and __setitem__
signatures-- as you seem to be arguing for over the key-object paradigm--
won't be as nearly a painful/breaking change as it would be otherwise.
...
Even if you pass comma-separated values, including slices, they all get
packed into a tuple and passed as the first parameter `item`.
But there is no need to emulate that for keyword args. They can, and I
think should, simply be unpacked into keyword parameters exactly the
same as function call syntax does.
Yup. Personally I really like the idea that unpacking some dict x will work
the same as unpacking in a function call:
...
...
...
# Pretty!
d[**x] == d[a=1, b=2]
f(**x] == f(a=1, b=2)
Getting to the end here, I guess I'm really just wondering whether mixing
positional and kwd args is worth doing. If it isn't, then the key-object
paradigm seems like might be a nicer solution to me for the sole reason
that the mental model gets confused otherwise.