[SciPy-user] Questions on scipy.io.read_array()

charlie charlie.xia.fdu at gmail.com
Wed Feb 20 22:42:36 EST 2008


Hi,

I am new to scipy.
I am currently using it for some statistical problems where the data may
contain missing values.
These values are labeled 'na' in my data file.
However, when I tried to read my data into an array and substitute 'na'
with -1 (for example) by:
read_array( datafile, ..., missing=-1)
the array I got did not convert the 'na' values to -1 but to 0, the default
value of the parameter "missing".
When I checked the mailing list, I found that the issue had already been
raised by Joris De Ridder:
http://article.gmane.org/gmane.comp.python.scientific.user/3700/match=read%5farray+missing
So I guess there is something wrong in the scipy.io library.
Has anybody else come across the same problem?
Should I file a ticket for this apparent bug?
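
For anyone hitting the same problem, one possible workaround is sketched
below. It assumes NumPy's genfromtxt instead of scipy.io.read_array, so it
neither reproduces nor fixes the behaviour described above. The sample data
is made up for illustration.

```python
import io

import numpy as np

# Hypothetical sample data (not from the original post); 'na' marks a
# missing entry, as in the data file described above.
sample = io.StringIO("1.0 2.0 na\n4.0 na 6.0\n")

# missing_values names the marker string to treat as missing;
# filling_values is the number substituted for it.
data = np.genfromtxt(sample, missing_values="na", filling_values=-1)

# The two 'na' entries are now -1.0 in a 2x3 float array.
print(data)
```

Passing usemask=True to genfromtxt returns a masked array instead, which
keeps the missing entries flagged rather than overwriting them.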

Also, I'd like to ask two general questions.
First, how efficient is python+numpy+scipy for programs dominated by calls
to statistical distribution functions, compared to Matlab, C++ with CEPHES
or GSL, etc.?
I compared it with my old R program, and python+numpy+scipy seems to be a
little bit faster.
Can anybody point me to some references on this?
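
A rough way to collect a data point locally is sketched below. It is only
an illustration, assuming scipy.stats.norm.cdf as the distribution call and
an arbitrary input size; it says nothing about Matlab, R, or GSL, and the
timings depend on the machine and library versions.

```python
import timeit

import numpy as np
from scipy import stats

# Arbitrary test input: 10,000 points on [-3, 3].
x = np.linspace(-3.0, 3.0, 10_000)


def vectorized():
    # One call on the whole array; the per-call overhead is paid once.
    return stats.norm.cdf(x)


def looped():
    # One call per element (first 100 points only); the per-call
    # overhead is paid 100 times.
    return np.array([stats.norm.cdf(v) for v in x[:100]])


t_vec = timeit.timeit(vectorized, number=10)
t_loop = timeit.timeit(looped, number=10)
print("vectorized, 10,000 points x 10 runs: %.4f s" % t_vec)
print("looped,        100 points x 10 runs: %.4f s" % t_loop)
```

Whatever the absolute numbers, keeping distribution calls vectorized tends
to matter more than the choice of language for the outer loop.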

Second, is there a good package within scipy that handles missing values
well?
For example, one that can store a value as missing and fill it in with
different inference methods when desired.
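
One candidate is NumPy's masked-array module, numpy.ma. The sketch below
assumes that -1 is used as the missing-value sentinel and uses made-up
data. It shows a value stored as missing and then filled in two different
ways; it is not a full imputation package.

```python
import numpy as np

# Hypothetical data; -1.0 stands in for a missing entry.
raw = np.array([1.0, 2.0, -1.0, 4.0])

# Mask every entry equal to the sentinel; the value stays stored but is
# ignored by reductions such as mean().
masked = np.ma.masked_values(raw, -1.0)

# Fill strategy 1: replace missing entries with the mean of the rest.
mean_filled = masked.filled(masked.mean())

# Fill strategy 2: replace missing entries with a fixed constant.
zero_filled = masked.filled(0.0)

print(masked)
print(mean_filled)
print(zero_filled)
```

The mask is kept separately from the data, so the same masked array can be
filled again later with a different rule.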

Thanks!

Charlie