python 3 and Unicode line breaking

Antoine Pitrou solipsis at pitrou.net
Fri Jan 14 18:23:55 EST 2011


On Fri, 14 Jan 2011 06:29:27 -0800 (PST)
leoboiko <leoboiko at gmail.com> wrote:
> 
> And it generally doesn’t try to pick good places to break lines
> at all, just making the assumption that 1 character = 1 column
> and that breaking on ASCII whitespaces/hyphens is enough.  We
> can’t really blame textwrap for that, it is a very simple module
> and Unicode line breaking gets complex fast (that’s why the
> consortium provides a ready-made algorithm).  It’s just that,
> with python3’s emphasis on Unicode support, I was surprised not
> to be able to find an UAX #14 implementation.  I thought someone
> would surely have written one and I simply couldn’t find, so I
> asked precisely that.

If you're willing to help on that matter (or some aspects of them,
textwrap-specific or not), you can open an issue on
http://bugs.python.org and propose a patch.

See also http://docs.python.org/devguide/#contributing if you need more
info on how to contribute.

Regards

Antoine.





More information about the Python-list mailing list