[Python-ideas] Pre-conditions and post-conditions

Sat Sep 15 02:51:59 EDT 2018

Hi,
Let me make a couple of practical examples from the work-in-progress (
https://github.com/Parquery/pypackagery, branch mristin/initial-version) to
illustrate again the usefulness of the contracts and why they are, in my
opinion, superior to assertions and unit tests.

What follows is a list of function signatures decorated with contracts from
pypackagery library preceded by a human-readable description of the
contracts.

The invariants tell us what format to expect from the related string
properties.

@icontract.inv(lambda self: self.name.strip() == self.name)
@icontract.inv(lambda self: self.line.endswith("\n"))
class Requirement:
    """Represent a requirement in requirements.txt."""

    def __init__(self, name: str, line: str) -> None:
        """
        Initialize.

        :param name: package name
        :param line: line in the requirements.txt file
        """
        ...

The postcondition tells us that the resulting map keys the values on their
name property.

@icontract.post(lambda result: all(val.name == key for key, val in
result.items()))
def parse_requirements(text: str, filename: str = '<unknown>') ->
Mapping[str, Requirement]:
    """
    Parse requirements file and return package name -> package
requirement as in requirements.txt

    :param text: content of the ``requirements.txt``
    :param filename: where we got the ``requirements.txt`` from (URL or path)
    :return: name of the requirement (*i.e.* pip package) -> parsed requirement
    """
    ...

The postcondition ensures that the resulting list contains only unique
elements. Mind that if you returned a set, the order would have been lost.

@icontract.post(lambda result: len(result) == len(set(result)),
enabled=icontract.SLOW)
def missing_requirements(module_to_requirement: Mapping[str, str],
                         requirements: Mapping[str, Requirement]) -> List[str]:
    """
    List requirements from module_to_requirement missing in the
``requirements``.

    :param module_to_requirement: parsed ``module_to_requiremnt.tsv``
    :param requirements: parsed ``requirements.txt``
    :return: list of requirement names
    """
    ...

Here is a bit more complex example.
- The precondition A requires that all the supplied relative paths
(rel_paths) are indeed relative (as opposed to absolute).
- The postcondition B ensures that the initial set of paths (given in
rel_paths) is included in the results.
- The postcondition C ensures that the requirements in the results are the
subset of the given requirements.
- The precondition D requires that there are no missing requirements (*i.e.
*that each requirement in the given module_to_requirement is also defined
in the given requirements).

@icontract.pre(lambda rel_paths: all(rel_pth.root == "" for rel_pth in
rel_paths))  # A
@icontract.post(
    lambda rel_paths, result: all(pth in result.rel_paths for pth in rel_paths),
    enabled=icontract.SLOW,
    description="Initial relative paths included")  # B
@icontract.post(
    lambda requirements, result: all(req.name in requirements for req
in result.requirements),
    enabled=icontract.SLOW)  # C
@icontract.pre(
    lambda requirements, module_to_requirement:
missing_requirements(module_to_requirement, requirements) == [],
    enabled=icontract.SLOW)  # D
def collect_dependency_graph(root_dir: pathlib.Path, rel_paths:
List[pathlib.Path],
                             requirements: Mapping[str, Requirement],
                             module_to_requirement: Mapping[str, str])
-> Package:

    """
    Collect the dependency graph of the initial set of python files
from the code base.

    :param root_dir: root directory of the codebase such as
"/home/marko/workspace/pqry/production/src/py"
    :param rel_paths: initial set of python files that we want to
package. These paths are relative to root_dir.
    :param requirements: requirements of the whole code base, mapped
by package name
    :param module_to_requirement: module to requirement correspondence
of the whole code base
    :return: resolved depedendency graph including the given initial
relative paths,
    """

I hope these examples convince you (at least a little bit :-)) that
contracts are easier and clearer to write than asserts. As noted before in
this thread, you can have the same *behavior* with asserts as long as you
don't need to inherit the contracts. But the contract decorators make it
very explicit what conditions should hold *without* having to look into the
implementation. Moreover, it is very hard to ensure the postconditions with
asserts as soon as you have a complex control flow since you would need to
duplicate the assert at every return statement. (You could implement a
context manager that ensures the postconditions, but a context manager is
not more readable than decorators and you have to duplicate them as
documentation in the docstring).

In my view, contracts are also superior to many kinds of tests. As the
contracts are *always* enforced, they also enforce the correctness
throughout the program execution whereas the unit tests and doctests only
cover a list of selected cases. Furthermore, writing the contracts in these
examples as doctests or unit tests would escape the attention of most less
experienced programmers which are not  used to read unit tests as
documentation. Finally, these unit tests would be much harder to read than
the decorators (*e.g.*, the unit test would supply invalid arguments and
then check for ValueError which is already a much more convoluted piece of
code than the preconditions and postconditions as decorators. Such testing
code also lives in a file separate from the original implementation making
it much harder to locate and maintain).

Mind that the contracts *do not* *replace* the unit tests or the doctests.
The contracts make merely tests obsolete that test that the function or
class actually observes the contracts. Design-by-contract helps you skip
those tests and focus on the more complex ones that test the behavior.
Another positive effect of the contracts is that they make your tests
deeper: if you specified the contracts throughout the code base, a test of
a function that calls other functions in its implementation will also make
sure that all the contracts of that other functions hold. This can be
difficult to implement  with standard unit test frameworks.

Another aspect of the design-by-contract, which is IMO ignored quite often,
is the educational one. Contracts force the programmer to actually sit down
and think *formally* about the inputs and the outputs (hopefully?) *before*
she starts to implement a function. Since many schools use Python to teach
programming (especially at high school level), I imagine writing contracts
of a function to be a very good exercise in formal thinking for the
students.

Please let me know what points *do not *convince you that Python needs
contracts (in whatever form -- be it as a standard library, be it as a
language construct, be it as a widely adopted and collectively maintained
third-party library). I would be very glad to address these points in my
next message(s).

Cheers,
Marko
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/python-ideas/attachments/20180915/a0ae3b65/attachment-0001.html>