Character encodings and codecs
vincent wehren
v.wehren at home.nl
Sat Feb 1 02:43:17 EST 2003
"Grumfish" <nobody at nowhere.com> wrote in message
news:agcm3v8ep1thj579baihnokcr3o6b7c6eb at 4ax.com...
> I'm trying to read and write files with different character encodings.
> I would like to convert a file from one encoding to another. I assume
> I should use the codecs module but I'm not sure how.
Use "unicode()" to decode your byte-oriented input into Unicode, passing the
source encoding as the second argument. Then call "encode()" on the resulting
Unicode strings to produce your target encoding.
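A minimal sketch of those two steps, assuming Latin-1 input and UTF-8 output
(both encodings and the sample bytes are only illustrative choices). It uses
raw.decode(...), which does the same thing as unicode(raw, ...) and also runs
on later Python versions where unicode() no longer exists:

```python
# Bytes in the source encoding (Latin-1 is an assumption made for this example).
raw = b"caf\xe9"

# Step 1: bytes -> Unicode. Equivalent to unicode(raw, "latin-1") in Python 2.
text = raw.decode("latin-1")

# Step 2: Unicode -> bytes in the target encoding.
converted = text.encode("utf-8")
```

The single Latin-1 byte for the accented "e" becomes a two-byte sequence in
UTF-8, so the converted string is one byte longer than the input.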
> How can I read in
> a part at a time of a multibyte encoded file without destroying the
> text?
Do you mean reading in chunks without accidentally splitting a multibyte
character across two chunks?
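If so, here is a rough sketch of the problem and one way around it. It assumes
UTF-8 data and a 4-byte chunk size, both picked only to force a split
character, and it relies on codecs.getincrementaldecoder(), which is available
in Python 2.5 and later:

```python
import codecs

# Three characters, three bytes each in UTF-8 (nine bytes in total).
data = u"\u65e5\u672c\u8a9e".encode("utf-8")

# Naive approach: decoding a 4-byte chunk on its own fails, because the
# chunk ends in the middle of the second character.
try:
    data[:4].decode("utf-8")
    broke = False
except UnicodeDecodeError:
    broke = True

# An incremental decoder keeps the incomplete trailing bytes in an internal
# buffer and finishes the character when the next chunk arrives.
decoder = codecs.getincrementaldecoder("utf-8")()
pieces = []
for start in range(0, len(data), 4):
    pieces.append(decoder.decode(data[start:start + 4]))
# final=True makes the decoder raise if bytes are still left over at the end.
pieces.append(decoder.decode(b"", final=True))

result = u"".join(pieces)
```

The chunk boundaries never line up with the character boundaries here, yet
the joined result is the original text.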
>
> Could some one please post a short example of converting a file from
> one multibyte encoding to another without reading the whole file into
> memory?
>
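One possible sketch, certainly not the only way to do it: wrap binary file
objects with codecs.getreader() and codecs.getwriter() and copy one chunk at a
time. The function name convert_file, its parameters, and the default chunk
size are my own hypothetical choices:

```python
import codecs


def convert_file(src_path, src_encoding, dst_path, dst_encoding, chunk_size=4096):
    """Re-encode src_path into dst_path without loading the whole file."""
    with open(src_path, "rb") as raw_in, open(dst_path, "wb") as raw_out:
        # The stream reader decodes bytes to Unicode and holds back any
        # incomplete multibyte sequence until more bytes have been read.
        reader = codecs.getreader(src_encoding)(raw_in)
        # The stream writer encodes Unicode to bytes in the target encoding.
        writer = codecs.getwriter(dst_encoding)(raw_out)
        while True:
            # chunk_size is an approximate number of bytes to read per call.
            chunk = reader.read(chunk_size)
            if not chunk:
                break
            writer.write(chunk)
```

A call would look like convert_file("in.txt", "shift_jis", "out.txt", "utf-8"),
where the file names and encodings are placeholders for your own.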
> Any online tutorials would also be very helpful.