[Tutor] extract meaningful data from garbage
anand.shashwat at gmail.com
Sun Jan 3 16:33:33 CET 2010
I am almost doing same thing i.e. to give the values left unparsed a certain
name - 'NIL', and currently I'm redirecting output to a text file. Searching
for 'NIL' tells me where my match failed, although writing it seperately to
a different file dint occurred to me. And yes the job is to reduce as much
manual work as possible, I got it now. Thanks for the help :)
On Sun, Jan 3, 2010 at 8:24 PM, Alan Gauld <alan.gauld at btinternet.com>wrote:
> "Shashwat Anand" <anand.shashwat at gmail.com> wrote
> as to match all of them. The task is time-consuming but with every new
>> test-sets exceptions are becoming less and less. (There are .2 million
> One final thing to try is to identify records where you *failed* to find
> a match and re write them into an error file. The error file can then
> be manually processed if need be.
> You might also be able to clean up the error file by not writing lines
> that you know to be non-useful. The resultant error file might then
> show up some further patterns that you can exploit.
> Its all about eliminating as much manual effort as possible and
> making the manual work that is left over as easy as possible.
> ie Accept that you won't ever get 100% success and aim to
> minimise the pain as much as possible.
> Alan Gauld
> Author of the Learn to Program web site
> Tutor maillist - Tutor at python.org
> To unsubscribe or change subscription options:
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the Tutor