[Numpy-discussion] Updates on NA / NaN values for Ints?
ben.root at ou.edu
Fri Nov 28 22:23:09 EST 2014
Ah, I didn't notice the coercion before. Dynnd has been an interesting
project for a couple years now, and it is in the right position of being
able to develop and experiment with new concepts. I hope they can work out
the NA/NaN/mask semantics in a way that is intuitive and can coexist. That
was the major sticking point in previous discussions that prevented its
On Nov 28, 2014 9:02 PM, "Nick Eubank" <nickeubank at gmail.com> wrote:
> Thanks Nathaniel and Ben.
> Nathaniel, I completely understand! Believe me, I appreciate all you folks
> do already, and am hoping to some day improve my skills enough to
> contribute myself.
> Ben: they only handle it for Floats. If you introduce an NA to an int64
> series, it just coerces the type into Float64.
> With that said, someone on their list suggested they were going to try and
> integrate it with dynd.
> (ha! good catch Nathaniel -- sorry for double dipping, Internet initially
> suggested they were waiting to see what the numpy team would do)
> On Fri Nov 28 2014 at 5:52:19 PM Benjamin Root <ben.root at ou.edu> wrote:
>> That being said, doesn't Pandas support something like what you are
>> asking for? While Pandas would like to push the NA support down into the
>> NumPy level for seamless interaction with other SciPy libraries, it does a
>> very decent job with NA on its own, and depending on the sort of code you
>> need to write, you might rarely ever have to directly interface with NumPy.
>> Pandas is very powerful for data analysis, particularly for those coming
>> from R and S+. Those of us who use NumPy directly tend to be involved in
>> more generalized numerical tasks than those developing Pandas.
>> Perhaps you should raise your concerns on the Pandas mailing list? They
>> might be better suited to help you with any hurdles that you foresee. The
>> people on that list are very familiar with dealing with "dirty data".
>> Ben Root
>> On Fri, Nov 28, 2014 at 8:08 PM, Nathaniel Smith <njs at pobox.com> wrote:
>>> On Fri, Nov 28, 2014 at 9:12 PM, Nick Eubank <nickeubank at gmail.com>
>>> > Hello Numpy!
>>> > Curious if there were any timelines for when NA / NaN support might
>>> > for Ints.
>>> > (I'm sure this is asked and answered many times, but what I can find
>>> > very out of date.)
>>> > I'm strongly considering migrating from R/Stata/Matlab to
>>> Pandas/Numpy, but
>>> > the inability to handle Na in Int types seems like a big limitation for
>>> > someone always working with dirty data like me (a social scientist),
>>> and the
>>> > Pandas docs suggest they're waiting for Numpy to add support. Just
>>> > when that might happen!
>>> There's no particular timeline, no. We don't have any budget or
>>> employees so any timeline would be a fantasy anyway :-). It's
>>> definitely possible to do and we'd like to do it, but it'll happen
>>> when someone with the right skills and interests finds the time to do
>>> the work.
>>> Nathaniel J. Smith
>>> Postdoctoral researcher - Informatics - University of Edinburgh
>>> NumPy-Discussion mailing list
>>> NumPy-Discussion at scipy.org
>> NumPy-Discussion mailing list
>> NumPy-Discussion at scipy.org
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the NumPy-Discussion