raise UnicodeError, "label too long"

Flavio fccoelho at gmail.com
Thu Jan 25 01:25:19 CET 2007

Something like this, for instance:

But even a URL with any non-ASCII characters, such as this one,


also fails when passed to urlopen:
File "/usr/lib/python2.4/encodings/idna.py", line 72, in ToASCII
    raise UnicodeError, "label too long"
UnicodeError: label too long
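
For context, the traceback comes from Python's built-in "idna" codec, which rejects any dot-separated hostname label longer than 63 characters. The following is a minimal sketch of that limit, not a diagnosis of the failing URL. It assumes Python 3 rather than the Python 2.4 shown above, and `idna_encodes` and the `.example` hostnames are hypothetical names chosen for illustration.

```python
def idna_encodes(hostname):
    """Return True if the built-in 'idna' codec accepts the hostname."""
    try:
        hostname.encode("idna")
        return True
    except UnicodeError:
        # Raised (possibly as a subclass) when a label is empty or too long.
        return False


# Hypothetical hostnames: one label at the 63-character limit, one over it.
short_host = "a" * 63 + ".example"
long_host = "a" * 64 + ".example"

print(idna_encodes(short_host))
print(idna_encodes(long_host))
```

If a whole URL, or a long path fragment without dots, ends up being treated as a hostname, it can exceed this limit even when the real hostname is short.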

Very strange, because I tried other Unicode URLs from the Python
console, like this one,


and it works normally:

Martin v. Löwis wrote:
> Flavio wrote:
> > What I am doing is very simple:
> >
> > I fetch a URL (an HTML page), parse it using BeautifulSoup, extract the
> > links, and try to open each of the links, repeating the cycle.
> >
> > BeautifulSoup converts the HTML to Unicode. That's why when I try to
> > open the links extracted from the page I get this error.
> >
> > This is bad, since some links do contain strings with non-ascii
> > characters.
> Please try answering the exact question that Marc asked:
> what is an example of a Unicode string that triggers the
> exception?
> Regards,
> Martin
