[Numpy-discussion] bug in genfromtxt with missing values?

Skipper Seabold jsseabold at gmail.com
Mon Jan 24 17:47:59 EST 2011


Am I misreading the docs or missing something?  Consider the following
example, adapted from the genfromtxt user guide:
http://docs.scipy.org/doc/numpy/user/basics.io.genfromtxt.html

from StringIO import StringIO
import numpy as np

data = "1, 2, 3\n4, ,5"

np.genfromtxt(StringIO(data), delimiter=",", names="a,b,c",
missing_values=" ", filling_values=0)
array([(1.0, 2.0, 3.0), (4.0, nan, 5.0)],
      dtype=[('a', '<f8'), ('b', '<f8'), ('c', '<f8')])

np.genfromtxt(StringIO(data), delimiter=",", names="a,b,c",
missing_values={'b':" "}, filling_values={'b' : 0})
array([(1.0, 2.0, 3.0), (4.0, 0.0, 5.0)],
      dtype=[('a', '<f8'), ('b', '<f8'), ('c', '<f8')])

Unless I pass a dict for missing_values, the missing entry is not
filled in: it comes back as nan instead of 0.  The same thing happens
without named columns:

np.genfromtxt(StringIO(data), delimiter=",", missing_values=" ",
filling_values=0)
array([[  1.,   2.,   3.],
       [  4.,  nan,   5.]])

np.genfromtxt(StringIO(data), delimiter=",", missing_values={1 :" "},
filling_values={1 :0})
array([[ 1.,  2.,  3.],
       [ 4.,  0.,  5.]])
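
In the meantime, a possible workaround is to skip missing_values and
filling_values altogether and replace the nan entries after parsing. This is
only a rough sketch under a few assumptions: Python 3 (so StringIO comes from
io rather than the StringIO module), plain float columns, and a helper name,
fill_missing, that is made up for illustration and is not part of NumPy. It
also cannot tell a blank field apart from a literal "nan" in the file.

```python
from io import StringIO  # Python 3 location; the session above uses the Python 2 StringIO module

import numpy as np


def fill_missing(text, fill=0.0, delimiter=","):
    """Hypothetical helper (not part of NumPy): parse text, then replace nan with `fill`."""
    # Parse without relying on missing_values/filling_values at all.
    arr = np.genfromtxt(StringIO(text), delimiter=delimiter)
    # For float data, fields that were left blank show up as nan; swap them out here.
    return np.where(np.isnan(arr), fill, arr)


data = "1, 2, 3\n4, ,5"
filled = fill_missing(data)
print(filled)
```

Because the replacement happens after parsing, the result should not depend
on whether missing_values is given as a string or as a dict.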

Skipper


