Proper use of the codecs module.
Chris Angelico
rosuav at gmail.com
Fri Aug 16 18:14:20 EDT 2013
On Fri, Aug 16, 2013 at 3:02 PM, Andrew <andrew at invalid.invalid> wrote:
> I have a mixed binary/text file[0], and the text portions use a radically
> nonstandard character set. I want to read them easily given information
> about the character encoding and an offset for the beginning of a string.
To add to all the information already given: Is the file small enough
to comfortably fit into memory? If so, you'll find it a LOT easier to
play with strings in RAM than files on disk. Even if not, you may find
a lot of tasks simplified by just reading a kilobyte or a megabyte in and then
working within that. That spares you the fiddliness of read(1) all the
time, at the expense of potentially reading more than you need.
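
As a rough sketch of that approach (not a definitive implementation): the
character table, the zero-byte terminator, and the helper names read_string
and read_chunk below are all assumptions made up for illustration, since the
real file format and character set aren't described here. The idea is just
that once the bytes are in memory, pulling a string out at an offset is
plain slicing and lookup, with no read(1) loop.

```python
import io

# Hypothetical stand-in for the nonstandard character set:
# maps a byte value to the character it represents.
CHAR_TABLE = {0x01: "A", 0x02: "B", 0x03: "C", 0x04: " "}
TERMINATOR = 0x00  # assumed end-of-string marker; the real format may differ


def read_string(data, offset):
    """Decode the string that starts at `offset` in an in-memory bytes object."""
    end = data.find(bytes([TERMINATOR]), offset)
    if end == -1:
        end = len(data)  # no terminator found: run to the end of the buffer
    # Iterating over bytes yields ints; unknown bytes become U+FFFD.
    return "".join(CHAR_TABLE.get(b, "\ufffd") for b in data[offset:end])


def read_chunk(f, offset, size=1024 * 1024):
    """Read up to `size` bytes starting at `offset`, for files too big to slurp."""
    f.seek(offset)
    return f.read(size)


# Demo on an in-memory "file": two bytes of binary data, the text "AB C",
# a terminator, then more binary data.
sample = bytes([0xFF, 0xFE, 0x01, 0x02, 0x04, 0x03, 0x00, 0x99])

whole = io.BytesIO(sample).read()          # small file: read all of it
text_whole = read_string(whole, 2)

chunk = read_chunk(io.BytesIO(sample), 2)  # large file: read a block at the offset
text_chunk = read_string(chunk, 0)
```

With a real file, io.BytesIO(sample) would be replaced by open(path, "rb").
The chunked version can cut a string short if it straddles the end of the
block, so the block size needs to be comfortably larger than the longest
expected string.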
ChrisA