Parsing XML with ElementTree (unicode problem?)
R.Brodie at rl.ac.uk
Mon Jul 23 16:46:15 CEST 2007
<oren.tsur at gmail.com> wrote in message
news:1185200976.082516.105420 at 57g2000hsv.googlegroups.com...
> so what's the difference? how comes parsing is fine
> in the first case but erroneous in the second case?
You may have guessed the encoding wrong. It probably
wasn't utf-8 to start with but iso8859-1 or similar.
What actual byte value is in the file?
> 2. there is another problem that might be similar I get a similar
> error if the content of the (locally saved) xml have special
> characters such as '&'
Either the originator of the XML has messed up, or whatever
you have done to save a local copy has mangled it.
More information about the Python-list