The case against static type checking, in detail (long)
(This is a fragment of an email that I sent to Guido earlier, I mention this here so that Guido can skip reading it. Of course, I recognize that most people here already know all this - but since it relates to the recent discussion about the value of type checking, I'd like to post it here as a kind of "manifesto" of why Python is the way it is.) Strongly-typed languages such as C++ and Java require up-front declaration of everything. It is the nature of such languages that there is a lot of cross-checking in the compiler between the declaration of a thing and its use. The idea is to prevent programmer errors by insuring internal consistency. However, we find in practice that much of the programmer's effort is spent in maintaining this cross-checking structure. To use a building analogy, a statically-typed program is like a truss structure, where there's a complex web of cross-braces, and a force applied at any given point is spread out over the whole structure. Each time such a program is modified, the programmer must partially dismantle and then re-assemble the existing structure. This takes time. It also clutters the code. Reading the source code of a program written in a statically typed language reveals that a substantial part of the text serves only to support the compile-time checking of definitions, or provides visual redundancy to aid the programmer in connecting two aspects of the program which are defined far apart. An example of what I mean is the use of variable type declarations - even in a statically typed language, it would be fairly easy for the compiler to automatically infer most variable types if the language were designed that way; The fact that the programmer is required to manually specify these types serves as an additional consistency check on the code. However, time spend serving the needs of these consistency checks is time away from actually serving the functional purpose of the code. Programmers in Python, on the other hand, not only need not worry about type declarations, they also spend much less time worrying about converting from one type to another in order to meet the constraints of a particular API. This is one of the reasons why I can generally write Python code about 4 times as fast as the C++ equivalent. (Understand that this is coming from someone who loves working in C++ and Java and has used them daily for the last 15 years. At the same time, however, I also enjoy programming in Python and I recognize that each language has their strengths.) There is also the question of how much static typing helps improve program reliability. In statically typed languages, there are two kinds of ways that types are used. In languages such as C and Pascal, the type declarations serve primarily as a consistency check. However, in C++ template metaprogramming, and in languages like Haskell, there is a second use for types, which is to provide a kind of type calculus or type inferencing, which gives additional expressive power to the language. C++ templates can act as powerful code generators, allowing the programmer to program in ever higher levels of abstraction and express the basic ideas even more succinctly and clearly than before. In a rapid-prototyping environment, the second use of types can be a major productivity win; However I would argue that the first use of types, consistency checking, is less beneficial, and is often more of a distraction to the programmer than a help. Yes, static type checking does detect some errors; But it also causes errors by making the code larger and more wordy, because that the programmer cannot hold large portions of the program in their mind all at once, which can lead to errors in overall design. It means the programmer spends more time thinking about the behavior of individual variables and less about the algorithm as a whole. At this point, I want to talk about a related matter, another fundamental design aspect of Python which I call "decriminalization of minor errors". An example of this is illustrated by the recent discussion over string slicing. As you know, when you attempt to index a string with a slice object that extends outside of the bounds of the string, the range is silently truncated. Some argued that Python should be more strict, and report an error when this occurs - but instead, it was reaffirmed that the current behavior is correct. I would agree that this current behavior is the more Pythonic, and is part of a general pattern, which I shall attempt to describe: To "decriminalize" an error means to find some alternative interpretation of the programmer's intent that produces a non-error result. That is, given a syntactical construct, and a choice of several interpretations of what that construct should mean, attempt to pick an interpretation that, when executed, does not produce an error. In the design of the Python language, it is a regular practice to decriminalize minor errors, provided that the alternative interpretation can meet some fairly strict criteria: That it is useful, intuitive, reasonable, and pedagogically sound. Note that this is a much more conservative rule than that used by languages such as Rexx, Javascript, and Perl, languages which make "heroic efforts" to bend the interpretation of an operation to a non-error result. Python does not do this. Nor is decriminalizing errors isn't the same as ignoring errors. Errors are still, and should be, enforced vigorously and reported. The distinction is that decriminalizing an error results in code that produces a useful, logical result, whereas ignoring errors results in code that produces garbage or nothing. Decriminalization comes about when we broaden our definitions of what is the correct result of a given operation. A couple of other examples of decriminalization: 1) there are languages in which the only valid argument for an 'if' statement is a boolean. Attempts to say "if 0" are errors. In Python we relax that rule, allowing any type to be used as the argument to an if-statement. We do this by having a broader interpretation of what it means to test an object for 'trueness', and allow 'trueness' to be implied by 'non-emptiness'. 2) Duck-typing is a decriminalization of the error that polymorphic types are required to inherit from a common interface. It also decriminalizes "missing methods", as long as those methods are never called. Again, this is due to having a broader interpretation of 'polymorphism'. (In fact, this aspect of Python is so fundamental, that I think that it deserves its own acronym alongside TOOWTDI and others, but I can't think of a short, pithy description of it. Maybe IOANEIR - "Interpret operations as non-errors if reasonable.") Both static typing and decriminalization serve the same end - telling the programmer "don't sweat the small stuff". Both are very helpful and powerful, because they allow programmers to spend much less time worrying about minor error cases, things that would have to be checked for in C++. Python code is simply more *concise* than the C++ equivalent, yet it achieves this without being terse and cryptic, because the text of a Python program more closely embodies the "essence" of an algorithm, uncluttered by other agendas. The price we pay for this, of course, is that sometimes errors show up much later (like, after ship) than they would have otherwise. But unit testing can catch a lot of the same errors. And in many cases, the seriousness of such errors depends on what we mean by "ship". It's one thing to discover a fatal error after you've pressed thousands of CDs and shipped them all over the world; It's a much different matter if the program has the ability to automatically update itself, or is downloaded from some kind of subscription model such as a package manager. In many environments, it is far more important to get something done quickly and validate the general concept, than it is to insure that the code is 100% correct. In other words, if it would take you 6 months to write it in a statically typed language, but only 2 months to write it in a dynamic language - well, that's 4 extra months you have to write unit tests and make sure it's right! And in the mean time, you can have real users banging on the code and making sure of something that is far more important, which is whether what you wrote is the right thing at all. -- Talin
Wow that's a good posting. Can you put it on a website so I can show it to friends when they argue about dynamic typing sucks? :] Christian
On Thu, Apr 26, 2007 at 05:34:18PM +0200, Christian Heimes wrote:
Wow that's a good posting. Can you put it on a website so I can show it to friends when they argue about dynamic typing sucks? :]
At least it in the mailing list archive: http://mail.python.org/pipermail/python-ideas/2007-April/000552.html Oleg. -- Oleg Broytmann http://phd.pp.ru/ phd@phd.pp.ru Programmers don't die, they just GOSUB without RETURN.
On Apr 25, 2007, at 10:59 PM, Talin wrote:
However, we find in practice that much of the programmer's effort is spent in maintaining this cross-checking structure.
How does this effort compare to writing all of the equivalent type tests? Static type checking subsumes the need to write tests to ensure that for every operation, the inputs are valid types, and the result type will be a valid type.
To use a building analogy, a statically-typed program is like a truss structure, where there's a complex web of cross-braces, and a force applied at any given point is spread out over the whole structure. Each time such a program is modified, the programmer must partially dismantle and then re-assemble the existing structure.
However the work to ensure the re-assembled structure is completely valid is shifted from human inspection and possibly incomplete tests, to static analysis. Its like having a computer check all of the cross brace connections. When the modifications are small dismantle/reassmbly costs can be dominated by the checking costs.
Yes, static type checking does detect some errors; But it also causes errors by making the code larger and more wordy, because that the programmer cannot hold large portions of the program in their mind all at once, which can lead to errors in overall design. It means the programmer spends more time thinking about the behavior of individual variables and less about the algorithm as a whole.
Thats like saying stairs should not have rails because thinking about where to put your hand gets in the way of thinking about where to put your feet! Proposals for static type checking in Python have long included the concept of optional type checking where programs without declarations continue to run. So clearly the desire not to clutter or force work on a type-declaration averse programmer is already taken as a requirement. -Tony
Tony Lownds wrote:
Thats like saying stairs should not have rails because thinking about where to put your hand gets in the way of thinking about where to put your feet!
Instead of rails, Python stairs have bouncy cushions along the sides and at the bottom to catch you gently if you happen to fall, rather than burden you with having to hold on every time you use the stairs, even though on most occasions you don't fall. Also it provides a lot more escalators. -- Greg
On 4/26/07, Greg Ewing <greg.ewing@canterbury.ac.nz> wrote:
Tony Lownds wrote:
Thats like saying stairs should not have rails because thinking about where to put your hand gets in the way of thinking about where to put your feet!
Instead of rails, Python stairs have bouncy cushions along the sides and at the bottom to catch you gently if you happen to fall, rather than burden you with having to hold on every time you use the stairs, even though on most occasions you don't fall.
Also it provides a lot more escalators.
But beware of the rotating knives! -- --Guido van Rossum (home page: http://www.python.org/~guido/)
participants (6)
-
Christian Heimes
-
Greg Ewing
-
Guido van Rossum
-
Oleg Broytmann
-
Talin
-
Tony Lownds