[Numpy-discussion] Updates on NA / NaN values for Ints?

Nick Eubank nickeubank at gmail.com
Fri Nov 28 21:02:15 EST 2014

Thanks Nathaniel and Ben.

Nathaniel, I completely understand! Believe me, I appreciate all you folks
do already, and am hoping to some day improve my skills enough to
contribute myself.

Ben: they only handle it for Floats. If you introduce an NA to an int64
series, it just coerces the type into Float64.

With that said, someone on their list suggested they were going to try and
integrate it with dynd.

(ha! good catch Nathaniel -- sorry for double dipping, Internet initially
suggested they were waiting to see what the numpy team would do)

On Fri Nov 28 2014 at 5:52:19 PM Benjamin Root <ben.root at ou.edu> wrote:

> That being said, doesn't Pandas support something like what you are asking
> for? While Pandas would like to push the NA support down into the NumPy
> level for seamless interaction with other SciPy libraries, it does a very
> decent job with NA on its own, and depending on the sort of code you need
> to write, you might rarely ever have to directly interface with NumPy.
> Pandas is very powerful for data analysis, particularly for those coming
> from R and S+. Those of us who use NumPy directly tend to be involved in
> more generalized numerical tasks than those developing Pandas.
> Perhaps you should raise your concerns on the Pandas mailing list? They
> might be better suited to help you with any hurdles that you foresee. The
> people on that list are very familiar with dealing with "dirty data".
> Cheers!
> Ben Root
> On Fri, Nov 28, 2014 at 8:08 PM, Nathaniel Smith <njs at pobox.com> wrote:
>> On Fri, Nov 28, 2014 at 9:12 PM, Nick Eubank <nickeubank at gmail.com>
>> wrote:
>> > Hello Numpy!
>> >
>> > Curious if there were any timelines for when NA / NaN support might
>> arrive
>> > for Ints.
>> >
>> > (I'm sure this is asked and answered many times, but what I can find
>> seems
>> > very out of date.)
>> >
>> > I'm strongly considering migrating from R/Stata/Matlab to Pandas/Numpy,
>> but
>> > the inability to handle Na in Int types seems like a big limitation for
>> > someone always working with dirty data like me (a social scientist),
>> and the
>> > Pandas docs suggest they're waiting for Numpy to add support. Just
>> curious
>> > when that might happen!
>> There's no particular timeline, no. We don't have any budget or
>> employees so any timeline would be a fantasy anyway :-). It's
>> definitely possible to do and we'd like to do it, but it'll happen
>> when someone with the right skills and interests finds the time to do
>> the work.
>> -n
>> --
>> Nathaniel J. Smith
>> Postdoctoral researcher - Informatics - University of Edinburgh
>> http://vorpus.org
>> _______________________________________________
>> NumPy-Discussion mailing list
>> NumPy-Discussion at scipy.org
>> http://mail.scipy.org/mailman/listinfo/numpy-discussion
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.python.org/pipermail/numpy-discussion/attachments/20141129/ca81e2cb/attachment.html>

More information about the NumPy-Discussion mailing list