PEP 681: Descriptor fields with dataclass_transform
We're considering an addition to PEP 681 (dataclass_transform) [1] to better support classes with fields that are descriptors. Setting a new dataclass_transform parameter named "transform_descriptor_types" to True would indicate that __init__ parameters corresponding to descriptor fields have the type of the descriptor's setter value parameter rather than the descriptor type.

Here's an example:

```
@dataclass_transform(transform_descriptor_types=True)
def decorator() -> Callable[[Type[T]], Type[T]]: ...

class Descriptor(Generic[_T]):
    def __get__(self, instance: object, owner: Any) -> Any: ...
    def __set__(self, instance: object, value: _T | None): ...

@decorator()
class InventoryItem:
    quantity_on_hand: Descriptor[int]
```

In this case, type checkers would understand that the quantity_on_hand parameter of InventoryItem's __init__ method is of type "int | None" rather than "Descriptor[int]".

This change was driven by a need in SQLAlchemy, and we'd appreciate any thoughts on broader applicability.

Feedback is welcome either via email or on GitHub! [2] A reference implementation is available in Pyright 1.1.222 and Pylance 2022.2.3.

[1] https://www.python.org/dev/peps/pep-0681/
[2] https://github.com/debonte/peps/pull/3/files
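The proposed semantics can be sketched as a runnable (hypothetical) example. The `Descriptor` and `InventoryItem` names come from the post; the `__set_name__`-based storage and the hand-written `__init__` are assumptions, standing in for what a library decorated with `dataclass_transform(transform_descriptor_types=True)` would synthesize:

```python
from typing import Any, Generic, Optional, TypeVar

_T = TypeVar("_T")

class Descriptor(Generic[_T]):
    # Hypothetical storage strategy: keep the value in a private slot
    # on the instance, named after the attribute the descriptor owns.
    def __set_name__(self, owner: type, name: str) -> None:
        self._name = "_" + name

    def __get__(self, instance: object, owner: Any) -> Any:
        if instance is None:
            return self
        return getattr(instance, self._name)

    def __set__(self, instance: object, value: Optional[_T]) -> None:
        setattr(instance, self._name, value)

class InventoryItem:
    quantity_on_hand: Descriptor[int] = Descriptor()

    # Under the proposal, a type checker would see this signature: the
    # parameter has __set__'s value type (int | None), not Descriptor[int].
    def __init__(self, quantity_on_hand: Optional[int] = None) -> None:
        self.quantity_on_hand = quantity_on_hand  # routes through __set__

item = InventoryItem(quantity_on_hand=5)
print(item.quantity_on_hand)  # 5, via Descriptor.__get__
```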
On Tue, Feb 22, 2022 at 11:34 AM, Erik De Bonte (<Erik.DeBonte@microsoft.com>) wrote:
We’re considering an addition to PEP 681 (dataclass_transform) [1] to better support classes with fields that are descriptors. Setting a new dataclass_transform parameter named “transform_descriptor_types” to True would indicate that __init__ parameters corresponding to descriptor fields have the type of the descriptor’s setter value parameter rather than the descriptor type.
I think this is a useful change and I support including it in PEP 681. I like how it integrates nicely with the descriptor protocol, so it's potentially useful for libraries other than SQLAlchemy too.
Hi Erik, On Tue, Feb 22, 2022 at 12:35 PM Erik De Bonte via Typing-sig <typing-sig@python.org> wrote:
We’re considering an addition to PEP 681 (dataclass_transform) [1] to better support classes with fields that are descriptors. Setting a new dataclass_transform parameter named “transform_descriptor_types” to True would indicate that __init__ parameters corresponding to descriptor fields have the type of the descriptor’s setter value parameter rather than the descriptor type.
You only mention the impact on the `__init__` method. Is that the only effect of `transform_descriptor_types`? Or should a typechecker also understand that `instance_of_inventory_item.quantity_on_hand` is of type `Any` (due to the annotation on the `__get__` method of the descriptor type), not of type `Descriptor[int]`? Does that understanding depend on the presence of `transform_descriptor_types=True`? If I write a dataclass without `transform_descriptor_types=True` and give an annotation of `x: SomeTypeImplementingDescriptorProtocol`, can I assign and get instances of that type to the `x` attribute without the typechecker trying to invoke the descriptor protocol?

In general, the annotation `x: Descriptor[int]` should only be understood as invoking the descriptor protocol if it is describing the type of the class attribute `x`, not instance attribute `x`. For a non-dataclass-transform type, clearly that annotation on a class attribute should mean the descriptor protocol is invoked, and typecheckers do this: https://mypy-play.net/?mypy=latest&python=3.10&gist=a79442b99b8f79535748912e128d0d8d

But dataclass-transform generally means the annotation should be understood as annotating the instance attribute type, not a class attribute. E.g. `x: Final[int] = 3` on a dataclass (at least in the current typechecker support for stdlib dataclasses) does not mean "final int class attribute with value 3," it means "instance attribute of type int with default value 3 that cannot be modified after instantiation."

It seems the proposal is for `transform_descriptor_types=True` to modify this such that annotations of descriptor types (only) are understood as class attribute type annotations. I think this is probably pragmatic and useful, but I hope that it does not only impact `__init__`, but also means the descriptor protocol is invoked for attribute gets/sets as well.

And I hope the latter does not happen without `transform_descriptor_types=True`, so we can still have descriptor-protocol-implementing types stored as normal instance attributes. (If one wants both behaviors in a single dataclass, I guess one is just out of luck. Maybe this should be something you specify in a field descriptor rather than globally for the entire dataclass?)

Carl
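The class-attribute versus instance-attribute distinction Carl describes can be checked at runtime with plain (non-dataclass) classes; `Ten` here is a hypothetical minimal descriptor used only for illustration:

```python
class Ten:
    """A minimal data descriptor that always reads as 10."""
    def __get__(self, instance, owner):
        return 10
    def __set__(self, instance, value):
        raise AttributeError("read-only")

class WithClassAttr:
    x = Ten()  # class attribute: the descriptor protocol IS invoked

class WithInstanceAttr:
    def __init__(self):
        self.x = Ten()  # instance attribute: the protocol is NOT invoked

print(WithClassAttr().x)           # 10
print(type(WithInstanceAttr().x))  # the Ten object itself
```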
Thanks Carl. What you're saying makes sense, and I agree that we should make this change.

Eric Traut suggested to me that in addition to your suggestion, we should change transform_descriptor_types to mean that *all* fields are treated as class variables, rather than just having special treatment for descriptor fields. We'd rename transform_descriptor_types to something else if we made this change, of course. Maybe class_variable_fields or class_variables.
Maybe this should be something you specify in a field descriptor rather than globally for the entire dataclass?
That's a reasonable suggestion. However, we're not aware of any libraries requiring per-field control of this behavior. And if we only support this behavior via field descriptors (not class-wide), I believe it will make life more awkward for libraries -- they wouldn't want users to need to set this parameter explicitly, so it would need to be inferred via overloads.

Thoughts?

-Erik
Hi Erik, On Fri, Mar 4, 2022 at 4:04 PM Erik De Bonte <Erik.DeBonte@microsoft.com> wrote:
Eric Traut suggested to me that in addition to your suggestion, we should change transform_descriptor_types to mean that *all* fields are treated as class variables, rather than just having special treatment for descriptor fields. We'd rename transform_descriptor_types to something else if we made this change of course. Maybe class_variable_fields or class_variables.
That's interesting. It seems more consistent/principled, which I like, but I don't have an immediate intuition about the practical usefulness. In the common case of `x: int` this would make no difference; that always describes an instance variable. Similarly for `x: int = 3`, which always describes an instance variable with a default, and `x: ClassVar[int] = 3`, which always describes a class variable (and thus should be excluded from the dataclass transform). It seems like the cases where it would make a difference are descriptor types and the interpretation of `Final` (i.e. whether or not `x: Final[int] = 3` implies `ClassVar`) -- are there other cases where it would matter?

As long as the descriptor behavior is consistent as described in my last mail, I guess I don't have strong feelings for or against this extended proposal. I think it will be hard to name well. A name including `class_variable(s)` is potentially misleading, as it seems to imply all attributes are implicitly ClassVar or something, which is not the case. The real distinction is a bit subtle and we don't have a good term for it. Something like `assume_instance_level_annotations=False` maybe.
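The "no difference" cases Carl lists can be confirmed against stdlib dataclasses (a small check, not from the thread): a plain annotated field appears in `__init__`, while a `ClassVar` annotation is excluded from the transform entirely.

```python
from dataclasses import dataclass, fields
from typing import ClassVar

@dataclass
class D:
    x: int = 3            # instance field with default; appears in __init__
    y: ClassVar[int] = 4  # class variable; not a dataclass field

print([f.name for f in fields(D)])  # only 'x' is a field
print(D(x=7).x, D.y)
```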
Maybe this should be something you specify in a field descriptor rather than globally for the entire dataclass?
That's a reasonable suggestion. However, we're not aware of any libraries requiring per-field control of this behavior. And if we only support this behavior via field descriptors (not class-wide), I believe it will make life more awkward for libraries -- they wouldn't want users to need to set this parameter explicitly, so it would need to be inferred via overloads.
That makes sense. I retract that suggestion; this is something that should be defined by the library providing dataclass-like behavior (since that library will be providing the actual runtime behavior), not by the end user of the library. Carl
...change transform_descriptor_types to mean that *all* fields are treated as class variables...
As long as the descriptor behavior is consistent as described in my last mail, I guess I don't have strong feelings for or against this extended proposal. I think it will be hard to name well.
Thanks Carl. I'm proposing "delete_class_attributes" as the new name. It would default to True (dataclass behavior), but when set to False all class attributes would be retained. There's a pending PR for this change at https://github.com/python/peps/pull/2455. It also clarifies that descriptor fields will support the "standard behavior expected of descriptors". I think that's a sufficient level of detail, but if you disagree I can add some more specifics about getter/setter types.

-Erik
Hi Erik,

Thanks for the reply, and for all your work on the PEP (which I'm strongly in favor of.) Unfortunately I don't think `delete_class_attributes` is a suitable name for this feature, and after exploring a bit more deeply I'm also no longer clear what use case this feature is intended to serve that isn't already served by the current behavior of dataclasses. Observe the current runtime behavior (in Python 3.10) of a dataclass with a descriptor field:

```
>>> class Descriptor:
...     def __get__(self, instance: Any, owner: object) -> int:
...         return instance._x
...     def __set__(self, instance: Any, value: float) -> None:
...         instance._x = int(value)
...
>>> @dataclass
... class F:
...     x: Descriptor = Descriptor()
...
>>> F.x
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "<stdin>", line 3, in __get__
AttributeError: 'NoneType' object has no attribute '_x'
>>> F().x
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: F.__init__() missing 1 required positional argument: 'x'
>>> F(x=3.1).x
3
>>> F(x=5.2).x
5
```
So at runtime, a descriptor-typed field in a dataclass already a) keeps the descriptor around on the class, b) invokes the descriptor protocol on both class-level and instance-level attribute access, and c) takes the value passed to `__init__` for the field and passes it to the descriptor's __set__ method. So it seems like dataclasses already implements precisely the behavior desired by this "new" feature, by default! It's just the type checkers that treat this in a way inconsistent with the runtime behavior. Both mypy and pyright expect the `__init__` argument `x` here to be an instance of the descriptor type itself, which is just wrong, given the actual runtime behavior.
The runtime behavior totally changes if we have `x: Descriptor` (no "default value") instead of `x: Descriptor = Descriptor()`. In the former case, dataclasses doesn't know or care about the annotated type or the fact that it's a descriptor type, and there is no runtime descriptor to attach to the class at all, so we get normal descriptor-less runtime behavior.
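The annotation-only case can be confirmed directly (a small check along the lines of Carl's example): with no descriptor instance on the class, the field behaves like any plain instance attribute and the descriptor protocol never runs.

```python
from dataclasses import dataclass
from typing import Any

class Descriptor:
    def __get__(self, instance: Any, owner: object) -> int:
        return instance._x
    def __set__(self, instance: Any, value: float) -> None:
        instance._x = int(value)

@dataclass
class G:
    x: Descriptor  # annotation only -- no descriptor object on the class

g = G(x=3.7)           # type: ignore[arg-type]
print(g.x)             # 3.7 -- stored and read as a plain attribute,
                       # __set__/__get__ were never invoked
print("x" in vars(g))  # True -- lives in the instance __dict__
```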
My conclusion from this is that most likely nobody ever thought very hard about how dataclasses should work with descriptor-typed fields, and the runtime behavior we get is simply what falls out naturally from the way dataclasses handles field default values (i.e. they are preserved as class attributes, if present.)
From a type-checking perspective, it feels natural that `x: Descriptor` and `x: Descriptor = Descriptor()` ought to specify the same typing for the `x` __init__ arg and attribute, since in typing we are used to the idea that `x: Foo` specifies the same type for `x` regardless of presence or absence of an assigned value. But practically it's hard to see the use case for `x: Descriptor` -- if no actual descriptor object is attached to the class, then where does one expect the runtime descriptor behavior to come from? Did the SQLAlchemy use case that motivated this change require `x: Descriptor` to behave as a descriptor, or only `x: Descriptor = Descriptor()`? If the former, how does that even work at runtime?

If we somehow want to make `x: Descriptor` and `x: Descriptor = Descriptor()` equivalent, then I don't think we can avoid the need to propose and make changes to the runtime dataclasses behavior, and it's not clear to me what kind of change would even be workable: there is no feasible way in the case of `x: Descriptor` for dataclasses at runtime to reliably know that `Descriptor` is a descriptor type, and there's no reasonable way for it to conjure a descriptor object into existence where none was given.
If, on the other hand, the PEP is not intending to propose changes to the runtime behavior of dataclasses, this suggests that a) no new feature or dataclass_transform argument is needed in the PEP to serve the descriptor use case, b) the current runtime behavior of dataclasses (including the difference in behavior between `x: Descriptor` and `x: Descriptor = Descriptor()`) should be specified as the default behavior of dataclass_transform with descriptor field types, and c) type-checkers should fix their dataclass handling to make their typecheck for descriptor fields match the already-existing runtime behavior.
I think my objection to the name `delete_class_attributes` is moot given the above, but just for completeness I'll offer it anyway. The _only_ case in which dataclasses ever deletes any class attribute is in the case where the "default value" for a field is a Field object. In this case the Field class attribute is replaced with the default value specified in the Field object, if any, or is deleted if none. In all other cases, dataclasses doesn't mess with class attributes at all. In the case of `x: Foo = Foo()` it leaves that `Foo()` instance as the class attribute; in the case of `x: Foo` there is no class attribute to begin with, so nothing is deleted. So it doesn't make sense for `delete_class_attributes=False` to imply different treatment of either `x: Foo` or `x: Foo = Foo()`, since in neither of those cases would dataclasses ever do anything that could be described as deleting a class attribute.
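That single deletion case is easy to observe (a minimal check): a `field(...)` with no default leaves nothing on the class, because the Field object is removed after processing.

```python
from dataclasses import dataclass, field

@dataclass
class E:
    x: int = field(compare=False)  # a Field object with no default

# The Field class attribute was deleted since there is no default to
# put in its place...
print(hasattr(E, "x"))  # False
# ...and x is still a required __init__ argument.
print(E(x=1).x)  # 1
```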
Carl
Thanks Carl. After reading your post and writing some sample code to confirm your findings, I think your analysis is spot on. It would appear that "delete_class_attributes" is not needed, which is great news. I've filed the following bug to remind myself to fix pyright's behavior for descriptors in dataclass fields so the type checking behavior matches the runtime behavior: https://github.com/microsoft/pyright/issues/3245. A similar fix will be needed in mypy (and perhaps pyre and pytype too). -- Eric Traut Contributor to Pyright & Pylance Microsoft
Thanks Carl! I'll modify the PEP to move the descriptor-typed field support to Rejected Ideas -- https://github.com/python/peps/pull/2477

FYI, while experimenting with the runtime behavior of dataclass, I noticed something weird about descriptor-typed fields and default values. It's more an issue for dataclass than dataclass_transform, but I thought it was worth mentioning. Here's the sample code you provided:

```
class Descriptor:
    def __get__(self, instance: Any, owner: object) -> int:
        return instance._x

    def __set__(self, instance: Any, value: float) -> None:
        instance._x = int(value)

@dataclass
class F:
    x: Descriptor = Descriptor()
```

Given that code, I believe that a type checker's view of F's __init__ would be:

```
def __init__(self, x: int = Descriptor())
```

But this is incorrect since, as you pointed out, x doesn't have a default value at runtime. F() fails at runtime with "F.__init__() missing 1 required positional argument: 'x'".

I also noticed that when executing the code for F, __get__ is called with a None instance, and surprisingly __get__'s return value in that case is used as the field's default value. So if I change __get__ to the following, F() succeeds and initializes x to 100.

```
def __get__(self, instance: Any, owner: object) -> int:
    if instance is None:
        return 100
    return instance._x
```

I couldn't find this behavior documented anywhere though, and I wouldn't expect a type checker to analyze the code in __get__ to find this default value. So I'm not sure how type checkers should handle this situation.

With dataclass_transform, the library author can make this scenario work by adding a "default" parameter on Descriptor's constructor and adding Descriptor to field_descriptors. But I'm not sure if there's a good solution for dataclass or if it's even worth fixing.

-Erik
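Erik's observation reproduces directly with the modified `__get__` (the classes below combine his two snippets): dataclasses reads the class attribute via `getattr`, which invokes `__get__(None, F)`, and uses the result as the field's default.

```python
from dataclasses import dataclass
from typing import Any

class Descriptor:
    def __get__(self, instance: Any, owner: object) -> int:
        if instance is None:
            return 100  # value dataclasses picks up via getattr(F, "x")
        return instance._x
    def __set__(self, instance: Any, value: float) -> None:
        instance._x = int(value)

@dataclass
class F:
    x: Descriptor = Descriptor()

print(F().x)       # 100 -- the default came from __get__(None, F)
print(F(x=5.2).x)  # 5 -- __set__ coerced the value with int()
```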
On Mon, Mar 28, 2022 at 4:37 PM Erik De Bonte <Erik.DeBonte@microsoft.com> wrote:
I'll modify the PEP to move the descriptor-typed field support to Rejected Ideas -- https://github.com/python/peps/pull/2477
Looks good, thank you!
FYI, while experimenting with the runtime behavior of dataclass, I noticed something weird about descriptor-typed fields and default values. It's more an issue for dataclass than dataclass_transform, but I thought it was worth mentioning.
Here's the sample code you provided:
```
class Descriptor:
    def __get__(self, instance: Any, owner: object) -> int:
        return instance._x

    def __set__(self, instance: Any, value: float) -> None:
        instance._x = int(value)

@dataclass
class F:
    x: Descriptor = Descriptor()
```
Given that code, I believe that a type checker's view of F's __init__ would be:
```
def __init__(self, x: int = Descriptor())
```
But this is incorrect since, as you pointed out, x doesn't have a default value at runtime. F() fails at runtime with "F.__init__() missing 1 required positional argument: 'x'"
I also noticed that when executing the code for F, __get__ is called with a None instance, and surprisingly __get__'s return value in that case is used as the field's default value. So if I change __get__ to the following, F() succeeds and initializes x to 100.
```
def __get__(self, instance: Any, owner: object) -> int:
    if instance is None:
        return 100
    return instance._x
```
I couldn't find this behavior documented anywhere though, and I wouldn't expect a type checker to analyze the code in __get__ to find this default value. So I'm not sure how type checkers should handle this situation.
Yeah, good catch. My example code works in a strange and rather accidental way: my __get__ method raises AttributeError when `instance` is None, and dataclasses uses `getattr(cls, attrname, MISSING)` to get the default value from the class attribute, which silences the AttributeError into `MISSING` -- thus no default. I think we should probably discard this "__get__ raises AttributeError" case as an unusual edge case that type-checkers can't be expected to detect and handle.
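The `getattr` behavior Carl describes is easy to confirm in isolation (hypothetical minimal classes; `MISSING` stands in for the dataclasses sentinel):

```python
class Broken:
    def __get__(self, instance, owner):
        raise AttributeError("no value available")

class C:
    x = Broken()

MISSING = object()  # stand-in for dataclasses' MISSING sentinel

# getattr with a default silences the AttributeError raised inside
# __get__, so dataclasses sees "no default" rather than an error.
print(getattr(C, "x", MISSING) is MISSING)  # True
```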
With dataclass_transform, the library author can make this scenario work by adding a "default" parameter on Descriptor's constructor and adding Descriptor to field_descriptors. But I'm not sure if there's a good solution for dataclass or if it's even worth fixing.
I think the dataclass runtime behavior (despite not having been intentionally designed :) ) is roughly pretty reasonable here: if you put a descriptor object (that is not a "field descriptor") in the "default value" position, that descriptor is attached to the class, and attribute sets should expect the type taken by __set__, and attribute gets should expect the type returned from __get__. I think it's also reasonable that the default value for instances is whatever is returned by __get__ when `instance` is None. This implies typecheckers should assume there is a default value and it is of the type returned by __get__. (That only requires looking at the return annotation of __get__, not at its implementation.) It also implies that it should be a type error if the return type of __get__ is not assignable to the type taken by __set__; I think this is also reasonable. (The example above, for instance, doesn't violate this, since int can be assigned to float.)

I think it would be reasonable for PEP 681 to either specify all of the above, or else explicitly disallow descriptor "default values". (Obviously the latter would not make SQLAlchemy happy.)

This discussion has also raised two other questions/issues for me about PEP 681.

1) Since "descriptor" already has a strong meaning in Python, which is unrelated to what PEP 681 currently calls "field descriptors," I think it might be advisable to rename the PEP 681 "field descriptor" concept to a different name. I suggest "field definition type" or "field specifier."

2) Should PEP 681 explicitly specify that dataclass_transform also has the effect of keeping field default values as class attributes on the class (as dataclass does) and placing them there if the default was specified explicitly via a "field descriptor"? I think the PEP should specify this; typecheckers already support this for dataclasses (see e.g. https://mypy-play.net/?mypy=latest&python=3.10&gist=278cc8cba9bf1473af6c1168cfb2f1bem and https://mypy-play.net/?mypy=latest&python=3.10&gist=827f5de7594243817912709013d3e658) and I'm sure there is code relying on it, so if dataclass_transform does not specify this, it would be a regression for typecheckers to replace their special-cased dataclass support with dataclass_transform. And the behavior specified above for descriptor "default values" really only makes sense if this is the case. This is probably just another bullet point to clarify under "assume dataclass semantics" here: https://peps.python.org/pep-0681/#dataclass-semantics

Carl
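The class-attribute behavior referenced in point (2) matches current stdlib runtime behavior (a quick check): a plain default stays on the class as-is, and a `field(...)` object is replaced by its default.

```python
from dataclasses import dataclass, field

@dataclass
class C:
    a: int = 3                 # plain default stays as the class attribute
    b: int = field(default=4)  # the Field object is replaced by its default

print(C.a, C.b)      # class attributes: 3 4
print(C().a, C().b)  # instance defaults: 3 4
```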
Thanks Carl. FYI, the PEP was submitted to the Steering Council on Friday for consideration in Python 3.11. https://github.com/python/steering-council/issues/117
This implies typecheckers should assume there is a default value and it is of the type returned by __get__. (That only requires looking at the return annotation of __get__, not at its implementation.)
Regarding looking at the implementation of __get__, I misspoke. Rather than type checkers, I was thinking about language servers which would likely want to know not just the default value's type, but the actual value.
I think it would be reasonable for PEP 681 to either specify all of the above... ... Should PEP 681 explicitly specify that dataclass_transform also has the effect of keeping field default values as class attributes on the class (as dataclass does) and placing them there if the default was specified explicitly via a "field descriptor"?
I recently added the following language to the Dataclass Semantics section of the PEP which I think makes explicit mention of these dataclass behaviors unnecessary: "Except where stated otherwise in this PEP, classes impacted by dataclass_transform, either by inheriting from a class that is decorated with dataclass_transform or by being decorated with a function decorated with dataclass_transform, are assumed to behave like stdlib dataclass." Also, FYI, I filed a bugs.python.org issue last week suggesting that we document the behavior of descriptor-typed fields on dataclasses. https://bugs.python.org/issue47174 I think it would be better to document these specifics in dataclass docs and/or unit tests rather than PEP 681.
advisable to rename the PEP 681 "field descriptor" concept...I suggest "field definition type" or "field specifier."
Good point. I emailed Jelle to see if he thinks I should change this while the Steering Council is considering the PEP.
You can't have `Descriptor(default=3)` and list `Descriptor` as a `field_descriptor` and expect to have its __get__ and __set__ respected, because in this case the class attribute is `3` and there is no runtime descriptor object used.
Requiring that field descriptors behave like dataclasses.Field is not something I had considered before, and I don't think the PEP is explicit about it at the moment. It's not even clear to me that the "assumed to behave like stdlib dataclass" sentence in the Dataclass Semantics section implies this.

If we want this to be true, I think we'll need to add a dataclass_transform parameter to allow libraries to disable this "default value" behavior. Otherwise, field descriptor types would need to choose between supporting default values and having their __get__/__set__ methods called reliably.

Btw, I believe you sent your follow-up message to me alone, instead of to typing-sig. I've included it below for posterity.

-Erik

------------------------------------------------------------------

I realized one more thing after sending:

On Mon, Mar 28, 2022 at 4:37 PM Erik De Bonte <Erik.DeBonte@microsoft.com> wrote:
With dataclass_transform, the library author can make this scenario work by adding a "default" parameter on Descriptor's constructor and adding Descriptor to field_descriptors. But I'm not sure if there's a good solution for dataclass or if it's even worth fixing.
Given my point (2) above about field default values being placed onto the class as the class attribute of that name, I think it's important to clarify that what you describe here should _not_ work.

If Descriptor is a "field descriptor" type, then the value passed to its "default" argument should be the thing that ends up as the class attribute. And if this is the case, the Descriptor instance is not attached to the class, and its __get__ and __set__ should be ignored -- the fact that it's a Python descriptor becomes irrelevant.

You should only get real descriptor behavior when using a "field descriptor" if the object passed to the field descriptor's "default" argument is _itself_ a descriptor object, e.g. `field(default=Descriptor())`. You can't have `Descriptor(default=3)` and list `Descriptor` as a `field_descriptor` and expect to have its __get__ and __set__ respected, because in this case the class attribute is `3` and there is no runtime descriptor object used.

Carl
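To make Carl's contrast concrete without depending on any particular library, here is a sketch of the end state each scenario would leave on the class (`Descriptor`, `ItemA`, and `ItemB` are hypothetical names; the class attributes are written out by hand to show what a dataclass-like transform would produce):

```python
# Scenario 1: Descriptor is registered as a field specifier and used as
# `x: int = Descriptor(default=3)`. Under dataclass semantics the
# specifier is consumed and the *default value* becomes the class
# attribute, so no descriptor remains at runtime:
class ItemA:
    x = 3  # Descriptor(default=3) was replaced by its default

# Scenario 2: a descriptor object is passed *as the default*, e.g.
# `x: int = field(default=Descriptor())`. The descriptor itself ends up
# as the class attribute, so its __get__/__set__ are respected:
class Descriptor:
    def __set_name__(self, owner, name):
        self._name = "_" + name

    def __get__(self, obj, objtype=None):
        if obj is None:
            return self
        return getattr(obj, self._name)

    def __set__(self, obj, value):
        setattr(obj, self._name, value)

class ItemB:
    x = Descriptor()  # end state: descriptor attached to the class

b = ItemB()
b.x = 5
print(b.x)      # 5 -- mediated by Descriptor.__set__/__get__
print(ItemA.x)  # 3 -- plain class attribute, no descriptor involved
```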
Hi Erik, On Mon, Apr 4, 2022 at 6:38 PM Erik De Bonte <Erik.DeBonte@microsoft.com> wrote:
FYI, the PEP was submitted to the Steering Council on Friday for consideration in Python 3.11. https://github.com/python/steering-council/issues/117
Great!
I recently added the following language to the Dataclass Semantics section of the PEP which I think makes explicit mention of these dataclass behaviors unnecessary:
"Except where stated otherwise in this PEP, classes impacted by dataclass_transform, either by inheriting from a class that is decorated with dataclass_transform or by being decorated with a function decorated with dataclass_transform, are assumed to behave like stdlib dataclass."
Yep, I think that covers it.
advisable to rename the PEP 681 "field descriptor" concept...I suggest "field definition type" or "field specifier."
Good point. I emailed Jelle to see if he thinks I should change this while the Steering Council is considering the PEP.
I saw that you went ahead and made this change; looks good!
You can't have `Descriptor(default=3)` and list `Descriptor` as a `field_descriptor` and expect to have its __get__ and __set__ respected, because in this case the class attribute is `3` and there is no runtime descriptor object used.
Requiring that field descriptors behave like dataclass.Field is not something I had considered before
By "behave like dataclasses.field" here you mean specifically the behavior that the field default value is set as the class attribute for the field name? I.e. that when you have

```python
@dataclasses.dataclass
class F:
    x: int = 3
    y: str = dataclasses.field(default="foo")
```

that `F.x` is `3` and `F.y` is `"foo"`? And the alternative behavior for some hypothetical dataclasses alternative would be that `F.y` is instead the `field()` instance?
and I don't think the PEP is explicit about it at the moment. It's not even clear to me that the "assumed to behave like stdlib dataclass" sentence in the Dataclass Semantics section implies this.
I do think this behavior (that field defaults are accessible as class attributes) is important, and dataclasses-using code likely relies on it (the existing type checkers already implement it in their dataclass handling). I don't see any reason why a reader would not assume this behavior to be covered by the "assumed to behave like stdlib dataclass" clause in the PEP, barring explicit treatment of the topic.
If we want this to be true, I think we'll need to add a dataclass_transform parameter to allow libraries to disable this "default value" behavior. Otherwise field descriptor types would need to choose between supporting default values and having their __get__/__set__ methods called reliably.
I guess you're right, it might be attractive for some dataclass-like library to use a field specifier type that is also a descriptor, and in this case they would need the field specifier left as the class attribute, even if a default value were given.

I think the best way to handle this for PEP 681 wouldn't require an additional dataclass_transform parameter. It would be to state the requirement only in terms of the results of class attribute access, not the exact runtime behavior of what goes in the class dict. That is, accessing the class attribute should return the default value, but the details of how this is done are unspecified.

In my example above, that means PEP 681 would require that `F.y` return the default value `"foo"`, but at runtime the library could satisfy this requirement either by having `F.__dict__["y"]` actually be set to `"foo"` (as dataclasses does) OR by having the field specifier be a descriptor whose `__get__` returns `"foo"` when its `instance` argument is `None`, meaning that in either case `F.y == "foo"`.

I think this allows either natural runtime implementation approach, while forbidding the thing we probably don't want to allow from a typing perspective: having `F.y` actually return a field specifier instance instead of the default value. This might be subtle enough to deserve specific treatment in the PEP, but I'll leave that up to you and Jelle and the SC :)
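The second implementation option Carl describes can be sketched as follows: a field specifier that stays attached to the class as a descriptor, yet still makes class attribute access return the default value. All names here (`SpecField`, `F`) are hypothetical, chosen to mirror the example in the message above.

```python
# Hypothetical field specifier that remains on the class as a descriptor
# but satisfies the proposed "F.y == default" requirement.
class SpecField:
    def __init__(self, *, default):
        self.default = default

    def __set_name__(self, owner, name):
        self._name = "_" + name

    def __get__(self, obj, objtype=None):
        if obj is None:
            # Class access returns the default value, mimicking what
            # stdlib dataclasses achieves by storing the default in the
            # class dict directly.
            return self.default
        return getattr(obj, self._name, self.default)

    def __set__(self, obj, value):
        setattr(obj, self._name, value)

class F:
    y = SpecField(default="foo")

print(F.y)    # 'foo' -- even though F.__dict__['y'] is a SpecField
f = F()
print(f.y)    # 'foo' -- default until assigned
f.y = "bar"
print(f.y)    # 'bar'
```

Either this approach or the stdlib one produces the same observable `F.y`, which is exactly why specifying only the attribute-access result leaves both implementations open.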
Btw, I believe you sent your follow-up message to me alone, instead of to typing-sig. I've included it below for future posterity.
Oops, thanks! Carl
And the alternative behavior for some hypothetical dataclasses alternative would be that `F."y"` is instead the `field()` instance?
Right. That's what SQLAlchemy is expecting.
I don't see any reason why a reader would not assume this behavior to be covered by the "assumed to behave like stdlib dataclass" clause in the PEP
I was thinking that readers might believe that the class attribute behavior is specific to Field objects and therefore custom field specifiers would not be required to behave the same way.
That is, accessing the class attribute should return the default value, but the details of how this is done are unspecified
That's clever and seems reasonable. However, I believe that SQLAlchemy relies on accesses to the class attribute returning the actual descriptor. I've reached out to them to confirm.

-Erik
On 3/23/2022 1:13 AM, Carl Meyer via Typing-sig wrote:
My conclusion from this is that most likely nobody ever thought very hard about how dataclasses should work with descriptor-typed fields, and the runtime behavior we get is simply what falls out naturally from the way dataclasses handles field default values (i.e. they are preserved as class attributes, if present.)
I can assure you that this is a true statement! Eric
participants (5)
- Carl Meyer
- Eric Traut
- Eric V. Smith
- Erik De Bonte
- Jelle Zijlstra