[SciPy-Dev] possible speed-up for arffread

Ralf Gommers ralf.gommers at googlemail.com
Thu Jun 17 06:51:29 EDT 2010


On Wed, Jun 16, 2010 at 10:46 AM, Benjamin Root <ben.root at ou.edu> wrote:

> Hello,
>
> I was looking at the scipy.io.arff module to see if I could easily shave
> some processing time for loading an ARFF file.  Doing some profiling on a
> file with 40,000 floating point numbers pointed me to the safe_float()
> function in the arffread.py file.  In it, it was stripping the string token
> of any whitespace and then comparing it to '?' (which is ARFF's missing data
> indicator).  I found that if one just does a check for the '?' character,
> you can shave almost 30% of the processing time off of the safe_float()
> function.
>
> In addition, I found a very slight improvement by calculating the range(ni)
> once and reusing that variable in the generator function.  Attached is my
> patch file.
>
> It isn't much, but it is noticeable.
>
> That looks good, thanks. I included it in
http://github.com/rgommers/scipy/tree/io, will commit that soon.

Cheers,
Ralf
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/scipy-dev/attachments/20100617/b78e4432/attachment.html>


More information about the SciPy-Dev mailing list