Re: [Python-Dev] just say no...

--- Guido van Rossum <guido@CNRI.Reston.VA.US> wrote: thing
This would be a desirable option in almost every case. Default is an exception (I want to know my data is not clean), but an option to specify an error character. It is usually a question mark but Mike tells me that some encodings specify the error character to use. Example - I query a Sybase Unicode database containing European accents or Japanese. By default it will give me question marks. If I issue the command 'set char_convert utf8', then I see the lot (as garbage, but never mind). If it always errored whenever a query result contained unexpected data, it would be almost impossible to maintain the database. If I wrote my own codec class for a family of encodings, I'd give it an even wider variety of error-logging options - maybe a mode where it told me where in the file the dodgy characters were. We've already taken the key step by allowing codecs to be separate objects registered at run-time, implemented in either C or Python. This means that once again Python will have the most flexible solution around. - Andy ===== Andy Robinson Robinson Analytics Ltd. ------------------ My opinions are the official policy of Robinson Analytics Ltd. They just vary from day to day. __________________________________________________ Do You Yahoo!? Bid and sell for free at http://auctions.yahoo.com
participants (1)
-
Andy Robinson