From irving at naml.us Sun Feb 1 02:20:25 2009 From: irving at naml.us (Geoffrey Irving) Date: Sat, 31 Jan 2009 23:20:25 -0800 Subject: [Numpy-discussion] minor improvment to ones In-Reply-To: References: Message-ID: <7f9d599f0901312320k39d6be4bx4038c259ec75ce29@mail.gmail.com> On Fri, Jan 30, 2009 at 5:18 AM, Neal Becker wrote: > A nit, but it would be nice if 'ones' could fill with a value other than 1. > > Maybe an optional val= keyword? You can use the "tile" function for this. "tile(3,3)" creates an array of 3 3's. Geoffrey From raik.gruenberg at crg.es Sun Feb 1 06:02:04 2009 From: raik.gruenberg at crg.es (Raik Gruenberg) Date: Sun, 01 Feb 2009 12:02:04 +0100 Subject: [Numpy-discussion] puzzle: generate index with many ranges In-Reply-To: <5A3AA485-6FDB-4F02-96C7-D77DCFBAEFF6@stsci.edu> References: <5A3AA485-6FDB-4F02-96C7-D77DCFBAEFF6@stsci.edu> Message-ID: <4985812C.4050008@crg.es> Beautiful! That should do the trick. Now let's see how this performs against the list comprehension... Thanks a lot! Raik Rick White wrote: > Here's a technique that works: > > Python 2.4.2 (#5, Nov 21 2005, 23:08:11) > [GCC 4.0.0 20041026 (Apple Computer, Inc. build 4061)] on darwin > Type "help", "copyright", "credits" or "license" for more information. > >>> import numpy as np > >>> a = np.array([0,4,0,11]) > >>> b = np.array([-1,11,4,15]) > >>> rangelen = b-a+1 > >>> cumlen = rangelen.cumsum() > >>> c = np.arange(cumlen[-1],dtype=np.int32) > >>> c += np.repeat(a[1:]-c[cumlen[0:-1]], rangelen[1:]) > >>> print c > [ 4 5 6 7 8 9 10 11 0 1 2 3 4 11 12 13 14 15] > > The basic idea is that the difference of your desired output from a > simple range is an array with a bunch of constant values appended > together, and that is what repeat() does. I'm assuming that you'll > never have b < a. Notice the slight ugliness of prepending the > elements at the beginning so that the cumsum starts with zero. > (Maybe there is a cleaner way to do that.) > > This does create a second array (via the repeat) that is the same > length as the result. If that uses too much memory, you could break > up the repeat and update of c into segments using a loop. (You > wouldn't need a loop for every a,b element -- do a bunch in each > iteration.) > > -- Rick > > Raik Gruenberg wrote: > >> Hi there, >> >> perhaps someone has a bright idea for this one: >> >> I want to concatenate ranges of numbers into a single array (for >> indexing). So I >> have generated an array "a" with starting positions, for example: >> >> a = [4, 0, 11] >> >> I have an array b with stop positions: >> >> b = [11, 4, 15] >> >> and I would like to generate an index array that takes 4..11, then >> 0..4, then >> 11..15. >> >> In reality, a and b have 10000+ elements and the arrays to be >> "sliced" are very >> large so I want to avoid any for loops etc. Any idea how this could >> be done? I >> thought some combination of *repeat* and adding of *arange* should >> do the trick >> but just cannot nail it down. >> >> Thanks in advance for any hints! >> >> Greetings, >> Raik > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > -- ________________________________ Dr. 
Raik Gruenberg http://www.raiks.de/contact.html ________________________________ From sebastian.walter at gmail.com Sun Feb 1 06:40:56 2009 From: sebastian.walter at gmail.com (Sebastian Walter) Date: Sun, 1 Feb 2009 12:40:56 +0100 Subject: [Numpy-discussion] using numpy functions on an array of objects In-Reply-To: <3d375d730901311524l70a05603vd06bea11d30e5675@mail.gmail.com> References: <49835272.1060704@noaa.gov> <3d375d730901301305r6eb3cf11p5f4e8a0641bbf65c@mail.gmail.com> <3d375d730901311524l70a05603vd06bea11d30e5675@mail.gmail.com> Message-ID: On Sun, Feb 1, 2009 at 12:24 AM, Robert Kern wrote: > On Sat, Jan 31, 2009 at 10:30, Sebastian Walter > wrote: >> Wouldn't it be nice to have numpy a little more generic? >> All that would be needed was a little check of the arguments. >> >> If I do: >> numpy.trace(4) >> shouldn't numpy be smart enough to regard the 4 as a 1x1 array? > > Why? It's not a 1x1 array. It's a scalar. If you want a 1x1 array, > give it a 1x1 array. > >> numpy.sin(4) works! > > Yes, numpy.sin() operates on scalars in addition to arrays. > >> and if >> x = my_class(4) >> >> wouldn't it be nice if >> >> numpy.trace(x) >> would call >> x.trace() ? >> >> numpy.sin(my_class(4)) works! >> >> Wouldn't it be nice if numpy worked a little more consistent. >> Is this worth a ticket? Or am I missing something here? > > numpy.sin() is a ufunc. Unary ufuncs will call the method of the same > name on objects in an object array (or the scalar itself if given an > object scalar). For example: > > In [8]: class MyClass(object): > ...: def __init__(self, x): > ...: self.x = x > ...: def __repr__(self): > ...: return 'MyClass(%r)' % (self.x,) > ...: def sin(self): > ...: return MyClass(self.x+1) > ...: > ...: > > In [9]: sin(MyClass(4)) > Out[9]: MyClass(5) > > In [10]: sin([MyClass(4), MyClass(5)]) > Out[10]: array([MyClass(5), MyClass(6)], dtype=object) > > > You'll notice that numpy.sin() does not try to call the list.sin() > method when given the list. It interprets it as an object array, and > calls the MyClass.sin() method on each of the elements. > > numpy.trace() is not an unary ufunc. It's just a function that > operates on (N>=2)-D arrays. You simply couldn't apply the same rules > as numpy.sin(). Otherwise, it would try to call the .trace() method on > each of the objects in your container, and obviously you can't > implement trace that way. > > Having numpy.trace(x) simply call x.trace() would not be making numpy > more consistent. > > Now, that said, the implementation of numpy.trace(x, *args) is > actually simply asarray(x).trace(*args). That should probably be > asanyarray(x) in order to allow ndarray subclasses. But this only > works because ndarray.trace() already exists. Making every function in > numpy check for a method first is just not going to happen. Ok I see. I understand your reasoning. Nonetheless, I didn't suggest that trace() and sin() are the same, because they are not. I just wanted to express that they should act the same if the object is of *unknown type*. I mean, numpy.sin(MyClass(3)) works. In the worst of all possible worlds, numpy would raise an exception because MyClass is *not an array or a scalar*. But it doesn't. And that is really cool! It's awesome that numpy works on arbitrary type for the most part. In contrast, if trace(X) encounters an unknown type it simply raises an exception. It could as well try *in the very end* to call X.trace(). I.e. *not* "numpy check for a method first" but *numpy check for a method as last resort*. 
That wouldn't do any harm, would it? And it is not a major effort to add
those simple checks to dot(), trace(), inv(). I could provide a patch.
But if that is deemed "not going to happen" for whatever reason, is there a
good workaround? I.e. if I do

import numpy
import mypackage

can I overwrite the functions of numpy? I mean, that is quite a hack, but it
is the next *best* option.

The reason for that need is that I am writing a Python module to compute
higher order derivatives of functions that are given as an algorithm on
matrix operations: e.g. we want to compute the Hessian of the function

def f(X,Y,Z):
    """ X is (N,M) array, Y is (M,K) array, Z is (K, L) array"""
    V = numpy.dot(X,Y)
    W = numpy.dot(Y,Z)
    return numpy.trace(V*W)

To do that, I generalized the real numbers to the field of truncated Taylor
polynomials of scalars, and real matrices to truncated Taylor polynomials of
matrices. The theory is explained on
http://en.wikipedia.org/wiki/Automatic_differentiation

You can have a look at the unit tests, e.g. at def test_2x2dot2x2_reverse() on
http://github.com/b45ch1/algopy/blob/bd7154e2e7a7e6e1931addc0d9ec0604d488d73f/unit_tests/matrix_reverse.py
Mtc is a class of Matrix Taylor Coefficients. The class Function is used to
build the computational graph of function nodes.

I think it would be a nice addition for everyone who is doing scientific
programming with numpy, because derivatives are often required, and divided
differences suck for anything but first order on small algorithms,
especially if one wants to differentiate solutions of ODEs or PDEs.
If trace, dot, inv, etc. don't work that way, people would have to define
two versions of the function f to make it work.

best regards,
Sebastian Walter

>
> --
> Robert Kern
>
> "I have come to believe that the whole world is an enigma, a harmless
> enigma that is made terrible by our own mad attempt to interpret it as
> though it had an underlying truth."
>   -- Umberto Eco
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>
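A minimal sketch of the "method as a last resort" dispatch described above --
a hypothetical wrapper that would live in mypackage, not a change to numpy
itself:

import numpy

def trace(x, *args, **kwargs):
    # Use numpy's own path when x coerces to an array; otherwise fall
    # back to the object's trace() method as a last resort.
    try:
        return numpy.asanyarray(x).trace(*args, **kwargs)
    except Exception:
        return x.trace(*args, **kwargs)

Callers that do "from mypackage import trace" then get a function that works
both on ndarrays and on objects such as the Mtc class, without touching numpy.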
From nmb at wartburg.edu Sun Feb 1 16:30:52 2009
From: nmb at wartburg.edu (Neil Martinsen-Burrell)
Date: Sun, 1 Feb 2009 21:30:52 +0000 (UTC)
Subject: [Numpy-discussion] example reading binary Fortran file
References:

David Froger gmail.com> writes:
> Hi, my question is about reading Fortran binary files (oh no, this question
> again...)

I've posted this before, but I finally got it cleaned up for the Cookbook.
For this purpose I use a subclass of file that has methods for reading
unformatted Fortran data. See http://www.scipy.org/Cookbook/FortranIO/FortranFile.
I'd gladly see this in numpy or scipy somewhere, but I'm not sure where it
belongs.

> program makeArray
> implicit none
> integer,parameter:: nx=10,ny=20
> real(4),dimension(nx,ny):: ux,uy,p
> integer :: i,j
> open(11,file='uxuyp.bin',form='unformatted')
> do i = 1,nx
> do j = 1,ny
> ux(i,j) = real(i*j)
> uy(i,j) = real(i)/real(j)
> p (i,j) = real(i) + real(j)
> enddo
> enddo
> write(11) ux,uy
> write(11) p
> close(11)
> end program makeArray

When I run the above program compiled with gfortran on my Intel Mac, I can
read it back with::

>>> import numpy as np
>>> from fortranfile import FortranFile
>>> f=FortranFile('uxuyp.bin', endian='<')
>>> uxuy = f.readReals(prec='f') # 'f' for default reals
>>> len(uxuy)
400
>>> ux = np.array(uxuy[:200]).reshape((20,10)).T
>>> uy = np.array(uxuy[200:]).reshape((20,10)).T
>>> p = f.readReals('f').reshape((20,10)).T
>>> ux
array([[   1.,    2.,    3.,    4.,    5.,    6.,    7.,    8.,    9.,
          10.,   11.,   12.,   13.,   14.,   15.,   16.,   17.,   18.,
          19.,   20.],
       [   2.,    4.,    6.,    8.,   10.,   12.,   14.,   16.,   18.,
          20.,   22.,   24.,   26.,   28.,   30.,   32.,   34.,   36.,
          38.,   40.],
       [   3.,    6.,    9.,   12.,   15.,   18.,   21.,   24.,   27.,
          30.,   33.,   36.,   39.,   42.,   45.,   48.,   51.,   54.,
          57.,   60.],
       [   4.,    8.,   12.,   16.,   20.,   24.,   28.,   32.,   36.,
          40.,   44.,   48.,   52.,   56.,   60.,   64.,   68.,   72.,
          76.,   80.],
       [   5.,   10.,   15.,   20.,   25.,   30.,   35.,   40.,   45.,
          50.,   55.,   60.,   65.,   70.,   75.,   80.,   85.,   90.,
          95.,  100.],
       [   6.,   12.,   18.,   24.,   30.,   36.,   42.,   48.,   54.,
          60.,   66.,   72.,   78.,   84.,   90.,   96.,  102.,  108.,
         114.,  120.],
       [   7.,   14.,   21.,   28.,   35.,   42.,   49.,   56.,   63.,
          70.,   77.,   84.,   91.,   98.,  105.,  112.,  119.,  126.,
         133.,  140.],
       [   8.,   16.,   24.,   32.,   40.,   48.,   56.,   64.,   72.,
          80.,   88.,   96.,  104.,  112.,  120.,  128.,  136.,  144.,
         152.,  160.],
       [   9.,   18.,   27.,   36.,   45.,   54.,   63.,   72.,   81.,
          90.,   99.,  108.,  117.,  126.,  135.,  144.,  153.,  162.,
         171.,  180.],
       [  10.,   20.,   30.,   40.,   50.,   60.,   70.,   80.,   90.,
         100.,  110.,  120.,  130.,  140.,  150.,  160.,  170.,  180.,
         190.,  200.]])
>>> uy
array([[ 1.        ,  0.5       ,  0.33333334,  0.25      ,  0.2       ,
         0.16666667,  0.14285715,  0.125     ,  0.11111111,  0.1       ,
         0.09090909,  0.08333334,  0.07692308,  0.07142857,  0.06666667,
         0.0625    ,  0.05882353,  0.05555556,  0.05263158,  0.05      ],
       [ 2.        ,  1.        ,  0.66666669,  0.5       ,  0.40000001,
         0.33333334,  0.2857143 ,  0.25      ,  0.22222222,  0.2       ,
         0.18181819,  0.16666667,  0.15384616,  0.14285715,  0.13333334,
         0.125     ,  0.11764706,  0.11111111,  0.10526316,  0.1       ],
       [ 3.        ,  1.5       ,  1.        ,  0.75      ,  0.60000002,
         0.5       ,  0.42857143,  0.375     ,  0.33333334,  0.30000001,
         0.27272728,  0.25      ,  0.23076923,  0.21428572,  0.2       ,
         0.1875    ,  0.17647059,  0.16666667,  0.15789473,  0.15000001],
       [ 4.        ,  2.        ,  1.33333337,  1.        ,  0.80000001,
         0.66666669,  0.5714286 ,  0.5       ,  0.44444445,  0.40000001,
         0.36363637,  0.33333334,  0.30769232,  0.2857143 ,  0.26666668,
         0.25      ,  0.23529412,  0.22222222,  0.21052632,  0.2       ],
       [ 5.        ,  2.5       ,  1.66666663,  1.25      ,  1.        ,
         0.83333331,  0.71428573,  0.625     ,  0.55555558,  0.5       ,
         0.45454547,  0.41666666,  0.38461539,  0.35714287,  0.33333334,
         0.3125    ,  0.29411766,  0.27777779,  0.2631579 ,  0.25      ],
       [ 6.        ,  3.        ,  2.        ,  1.5       ,  1.20000005,
         1.        ,  0.85714287,  0.75      ,  0.66666669,  0.60000002,
         0.54545456,  0.5       ,  0.46153846,  0.42857143,  0.40000001,
         0.375     ,  0.35294119,  0.33333334,  0.31578946,  0.30000001],
       [ 7.        ,  3.5       ,  2.33333325,  1.75      ,  1.39999998,
         1.16666663,  1.        ,  0.875     ,  0.77777779,  0.69999999,
         0.63636363,  0.58333331,  0.53846157,  0.5       ,  0.46666667,
         0.4375    ,  0.41176471,  0.3888889 ,  0.36842105,  0.34999999],
       [ 8.        ,  4.        ,  2.66666675,  2.        ,  1.60000002,
         1.33333337,  1.14285719,  1.        ,  0.8888889 ,  0.80000001,
         0.72727275,  0.66666669,  0.61538464,  0.5714286 ,  0.53333336,
         0.5       ,  0.47058824,  0.44444445,  0.42105263,  0.40000001],
       [ 9.        ,  4.5       ,  3.        ,  2.25      ,  1.79999995,
         1.5       ,  1.28571427,  1.125     ,  1.        ,  0.89999998,
         0.81818181,  0.75      ,  0.69230771,  0.64285713,  0.60000002,
         0.5625    ,  0.52941179,  0.5       ,  0.47368422,  0.44999999],
       [ 10.       ,  5.        ,  3.33333325,  2.5       ,  2.        ,
         1.66666663,  1.42857146,  1.25      ,  1.11111116,  1.        ,
         0.90909094,  0.83333331,  0.76923078,  0.71428573,  0.66666669,
         0.625     ,  0.58823532,  0.55555558,  0.52631581,  0.5       ]])
>>> p
array([[  2.,   3.,   4.,   5.,   6.,   7.,   8.,   9.,  10.,  11.,  12.,
         13.,  14.,  15.,  16.,  17.,  18.,  19.,  20.,  21.],
       [  3.,   4.,   5.,   6.,   7.,   8.,   9.,  10.,  11.,  12.,  13.,
         14.,  15.,  16.,  17.,  18.,  19.,  20.,  21.,  22.],
       [  4.,   5.,   6.,   7.,   8.,   9.,  10.,  11.,  12.,  13.,  14.,
         15.,  16.,  17.,  18.,  19.,  20.,  21.,  22.,  23.],
       [  5.,   6.,   7.,   8.,   9.,  10.,  11.,  12.,  13.,  14.,  15.,
         16.,  17.,  18.,  19.,  20.,  21.,  22.,  23.,  24.],
       [  6.,   7.,   8.,   9.,  10.,  11.,  12.,  13.,  14.,  15.,  16.,
         17.,  18.,  19.,  20.,  21.,  22.,  23.,  24.,  25.],
       [  7.,   8.,   9.,  10.,  11.,  12.,  13.,  14.,  15.,  16.,  17.,
         18.,  19.,  20.,  21.,  22.,  23.,  24.,  25.,  26.],
       [  8.,   9.,  10.,  11.,  12.,  13.,  14.,  15.,  16.,  17.,  18.,
         19.,  20.,  21.,  22.,  23.,  24.,  25.,  26.,  27.],
       [  9.,  10.,  11.,  12.,  13.,  14.,  15.,  16.,  17.,  18.,  19.,
         20.,  21.,  22.,  23.,  24.,  25.,  26.,  27.,  28.],
       [ 10.,  11.,  12.,  13.,  14.,  15.,  16.,  17.,  18.,  19.,  20.,
         21.,  22.,  23.,  24.,  25.,  26.,  27.,  28.,  29.],
       [ 11.,  12.,  13.,  14.,  15.,  16.,  17.,  18.,  19.,  20.,  21.,
         22.,  23.,  24.,  25.,  26.,  27.,  28.,  29.,  30.]])

Note that you have to provide the shape information for ux and uy because
fortran writes them together as a stream of 400 numbers.

-Neil

From dsdale24 at gmail.com Sun Feb 1 18:32:41 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Sun, 1 Feb 2009 18:32:41 -0500
Subject: [Numpy-discussion] question about ufuncs
Message-ID:

I've been playing with __array_wrap__ to make quantities with units play
well with numpy's ufuncs. For example, __array_wrap__ makes it possible to
do the following:

>>> numpy.sqrt([1.,4.,9.]*m**2)
array([1.,2.,3.])*m

Is there an analog to __array_wrap__ for preprocessing arrays on their way
*into* a ufunc? For example, it would be nice if one could do something
like:

numpy.sin([1,2,3]*arcseconds)

where we have the opportunity to inspect the context, convert the Quantity
to units of radians, and then actually call the ufunc. Is this possible, or
does one have to reimplement such functions?

Thanks,
Darren
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From pgmdevlist at gmail.com Sun Feb 1 19:33:45 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Sun, 1 Feb 2009 19:33:45 -0500
Subject: [Numpy-discussion] question about ufuncs
In-Reply-To:
References:
Message-ID: <48A25BC8-FA0A-44EE-89E2-F6C9939E8A22@gmail.com>

On Feb 1, 2009, at 6:32 PM, Darren Dale wrote:
>
> Is there an analog to __array_wrap__ for preprocessing arrays on
> their way *into* a ufunc? For example, it would be nice if one could
> do something like:
>
> numpy.sin([1,2,3]*arcseconds)
>
> where we have the opportunity to inspect the context, convert the
> Quantity to units of radians, and then actually call the ufunc. Is
> this possible, or does one have to reimplement such functions?

Just an idea: look at the code for numpy.ma ufuncs (in numpy.ma.core).
By defining a few classes for unary, binary and domained functions,
you could probably do what you want, without having to recode all the
functions by hand.
Another idea would be to define some specific __mul__ or __rmul__
rules for your units, so that the list would be transformed into a
UnitArray...
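A minimal sketch of the __rmul__ idea combined with __array_wrap__ (all
names here are hypothetical, and the unit propagation is deliberately naive --
it just copies the unit through):

import numpy as np

class UnitArray(np.ndarray):
    """ndarray subclass that carries a unit label through ufuncs."""
    def __new__(cls, data, unit=''):
        obj = np.asarray(data, dtype=float).view(cls)
        obj.unit = unit
        return obj

    def __array_finalize__(self, obj):
        self.unit = getattr(obj, 'unit', '')

    def __array_wrap__(self, out_arr, context=None):
        # context is (ufunc, inputs, output index); a real implementation
        # would inspect it to compute the proper output unit
        out_arr = out_arr.view(UnitArray)
        out_arr.unit = self.unit
        return out_arr

class Unit(object):
    def __init__(self, name):
        self.name = name
    def __rmul__(self, seq):
        # makes "[1., 4., 9.] * m" build a UnitArray from a plain list
        return UnitArray(seq, self.name)

m = Unit('m')
a = [1., 4., 9.] * m
b = np.sqrt(a)          # __array_wrap__ runs after the ufunc
print b, b.unit

Note that __array_wrap__ only fires on the way *out* of a ufunc; the
conversion-on-the-way-in that Darren asks about still has no hook here.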
From dsdale24 at gmail.com Sun Feb 1 19:39:04 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Sun, 1 Feb 2009 19:39:04 -0500
Subject: [Numpy-discussion] question about ufuncs
In-Reply-To: <48A25BC8-FA0A-44EE-89E2-F6C9939E8A22@gmail.com>
References: <48A25BC8-FA0A-44EE-89E2-F6C9939E8A22@gmail.com>
Message-ID:

On Sun, Feb 1, 2009 at 7:33 PM, Pierre GM wrote:
>
> On Feb 1, 2009, at 6:32 PM, Darren Dale wrote:
> >
> > Is there an analog to __array_wrap__ for preprocessing arrays on
> > their way *into* a ufunc? For example, it would be nice if one could
> > do something like:
> >
> > numpy.sin([1,2,3]*arcseconds)
> >
> > where we have the opportunity to inspect the context, convert the
> > Quantity to units of radians, and then actually call the ufunc. Is
> > this possible, or does one have to reimplement such functions?
>
> Just an idea: look at the code for numpy.ma ufuncs (in numpy.ma.core).
> By defining a few classes for unary, binary and domained functions,
> you could probably do what you want, without having to recode all the
> functions by hand.
> Another idea would be to define some specific __mul__ or __rmul__
> rules for your units, so that the list would be transformed into a
> UnitArray...

I have pretty good implementations of the arithmetic operators, so
([1,2,3]*m)*([4,5,6]*J) already works. numpy.multiply and numpy.sqrt needed
help with array_wrap. I'll study your stuff in ma, thanks for the pointer.

Darren
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From mattdm at mattdm.org Sun Feb 1 21:58:18 2009
From: mattdm at mattdm.org (Matthew Miller)
Date: Sun, 1 Feb 2009 21:58:18 -0500
Subject: [Numpy-discussion] numpy.load and gzip file handles
Message-ID: <20090202025818.GA21605@jadzia.bu.edu>

Hi everyone. I'd like to log the state of my program as it progresses.
Using the numpy.save / numpy.load functions on the same filehandle
repeatedly works very well for this -- but ends up making a file which very
quickly grows to gigabytes. The data compresses well, though, so I thought
I'd use Python's built-in gzip module underneath. This works great for
saving -- but when it comes time to play back, there's an issue:

>>> import numpy
>>> import gzip
>>> f=open("test.gz")
>>> g=gzip.GzipFile(None,"rb",9,f)
>>> g
>>> numpy.load(g)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/lib64/python2.5/site-packages/numpy/lib/io.py", line 133, in load
    fid.seek(-N,1) # back-up
TypeError: seek() takes exactly 2 arguments (3 given)

Turns out you can't rewind gzip file handles in Python. Oops.

The offending code is that which distinguishes between npy and npz files.
Could there maybe be something added to just trust me that it's an npy? Or
better yet, is there something I'm doing wrong / overlooking?

Thanks!

--
Matthew Miller mattdm at mattdm.org
From stefan at sun.ac.za Mon Feb 2 01:01:54 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Mon, 2 Feb 2009 08:01:54 +0200
Subject: [Numpy-discussion] numpy.load and gzip file handles
In-Reply-To: <20090202025818.GA21605@jadzia.bu.edu>
References: <20090202025818.GA21605@jadzia.bu.edu>
Message-ID: <9457e7c80902012201q4187f42fv89a69f13834d5cbe@mail.gmail.com>

2009/2/2 Matthew Miller :
> I'd like to log the state of my program as it progresses. Using the
> numpy.save / numpy.load functions on the same filehandle repeatedly works
> very well for this -- but ends up making a file which very quickly grows to
> gigabytes. The data compresses well, though, so I thought I'd use Python's
> built-in gzip module underneath. This works great for saving -- but when it
> comes time to play back, there's an issue:
>
> >>> import numpy
> >>> import gzip
> >>> f=open("test.gz")
> >>> g=gzip.GzipFile(None,"rb",9,f)
> >>> g
> >>> numpy.load(g)
> Traceback (most recent call last):
>   File "<stdin>", line 1, in <module>
>   File "/usr/lib64/python2.5/site-packages/numpy/lib/io.py", line 133, in load
>     fid.seek(-N,1) # back-up
> TypeError: seek() takes exactly 2 arguments (3 given)

The GzipFile in Python 2.5 does not support the 2nd ("whence") argument.
The solution may be to use this wrapper from the EffBot:

http://effbot.org/librarybook/gzip-example-2.py

in order to "back-port" that functionality.

Regards
Stéfan

From mattdm at mattdm.org Mon Feb 2 01:10:10 2009
From: mattdm at mattdm.org (Matthew Miller)
Date: Mon, 2 Feb 2009 01:10:10 -0500
Subject: [Numpy-discussion] numpy.load and gzip file handles
In-Reply-To: <9457e7c80902012201q4187f42fv89a69f13834d5cbe@mail.gmail.com>
References: <20090202025818.GA21605@jadzia.bu.edu> <9457e7c80902012201q4187f42fv89a69f13834d5cbe@mail.gmail.com>
Message-ID: <20090202061010.GA8806@jadzia.bu.edu>

On Mon, Feb 02, 2009 at 08:01:54AM +0200, Stéfan van der Walt wrote:
> The GzipFile in Python 2.5 does not support the 2nd ("whence")
> argument. The solution may be to use this wrapper from the EffBot:
> http://effbot.org/librarybook/gzip-example-2.py
> in order to "back-port" that functionality.

Unless I'm misunderstanding, even with the wrapper one can't actually seek
backwards, which is what the numpy code wants to do. In the meantime, I'm
just using numpy.lib.format.read_array() directly.

--
Matthew Miller mattdm at mattdm.org
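For what it's worth, GzipFile in 2.5 can in fact seek backwards while
reading (it rewinds and decompresses forward again); what it lacks is the
whence argument that numpy.load passes. An untested sketch of a subclass
filling that in, relying on GzipFile's internal self.offset and on the
underlying file object itself being seekable:

import gzip

class SeekableGzipFile(gzip.GzipFile):
    def seek(self, offset, whence=0):
        if whence == 1:
            # relative seek: convert to an absolute position
            offset = self.offset + offset
        elif whence == 2:
            raise IOError("can't seek relative to the end of a gzip stream")
        # plain GzipFile.seek() takes only an absolute offset; a backwards
        # move makes it rewind and re-read the stream (slow but correct)
        gzip.GzipFile.seek(self, offset)

With this, numpy.load's fid.seek(-N, 1) call would resolve to an absolute
backward seek instead of raising the TypeError above.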
From nwagner at iam.uni-stuttgart.de Mon Feb 2 07:33:43 2009
From: nwagner at iam.uni-stuttgart.de (Nils Wagner)
Date: Mon, 02 Feb 2009 13:33:43 +0100
Subject: [Numpy-discussion] Fortran binary files and numpy/scipy
Message-ID:

Hi all,

How can I import FORTRAN binary files using numpy ?

In FORTRAN I can do

OPEN(10,FILE='test.mat',FORM='unformatted')

100 CONTINUE
READ(10,END=999) IROW, ICOL, VALUE
GOTO 100

999 CONTINUE
END

And in Python/numpy ?

....

Any pointer would be appreciated.

Thanks in advance
Nils

From matthieu.brucher at gmail.com Mon Feb 2 07:39:32 2009
From: matthieu.brucher at gmail.com (Matthieu Brucher)
Date: Mon, 2 Feb 2009 13:39:32 +0100
Subject: [Numpy-discussion] Fortran binary files and numpy/scipy
In-Reply-To:
References:
Message-ID:

Hi,

There was a discussion about this last week. You can find it in the archives ;)

Matthieu

2009/2/2 Nils Wagner :
> Hi all,
>
> How can I import FORTRAN binary files using numpy ?
>
> In FORTRAN I can do
>
> OPEN(10,FILE='test.mat',FORM='unformatted')
>
> 100 CONTINUE
> READ(10,END=999) IROW, ICOL, VALUE
> GOTO 100
>
> 999 CONTINUE
> END
>
> And in Python/numpy ?
>
> ....
>
> Any pointer would be appreciated.
>
> Thanks in advance
> Nils
>
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>

--
Information System Engineer, Ph.D.
Website: http://matthieu-brucher.developpez.com/
Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92
LinkedIn: http://www.linkedin.com/in/matthieubrucher

From nwagner at iam.uni-stuttgart.de Mon Feb 2 09:22:42 2009
From: nwagner at iam.uni-stuttgart.de (Nils Wagner)
Date: Mon, 02 Feb 2009 15:22:42 +0100
Subject: [Numpy-discussion] Fortran binary files and numpy/scipy
In-Reply-To:
References:
Message-ID:

On Mon, 2 Feb 2009 13:39:32 +0100 Matthieu Brucher wrote:
> Hi,
>
> There was a discussion about this last week. You can find it in the
> archives ;)
>
> Matthieu

Hi Matthieu,

Sorry but I missed that.
Anyway I have some trouble with my short example.

g77 -c binary_fortran.f
g77 -o io binary_fortran.o
./io

 11 254 254.
 12 253 126.
 13 252 84.
 14 251 62.
 15 250 50.
 16 249 41.
 17 248 35.
 18 247 30.
 19 246 27.
 20 245 24.

python -i read_fortran.py

>>> a
array([(16, 1090921693195, 254.0), (16, 16, 5.3686493512014268e-312),
       (4638566878703255552, 16, 7.9050503334599447e-323),
       (1082331758605, 4635611391447793664, 7.9050503334599447e-323),
       (16, 1078036791310, 62.0), (16, 16, 5.3049894774872906e-312),
       (4632233691727265792, 16, 7.9050503334599447e-323),
       (1069446856720, 4630967054332067840, 7.9050503334599447e-323),
       (16, 1065151889425, 35.0), (16, 16, 5.2413296037731544e-312),
       (4629137466983448576, 16, 7.9050503334599447e-323),
       (1056561954835, 4628293042053316608, 7.9050503334599447e-323),
       (16, 1052266987540, 24.0)],
      dtype=[('irow', '<i8'), ('icol', '<i8'), ('value', '<f8')])

How can I fix the problem ?
-------------- next part --------------
A non-text attachment was scrubbed...
Name: read_fortran.py
Type: text/x-python
Size: 141 bytes
Desc: not available
URL:

From Mike.Colonno at spacex.com Mon Feb 2 10:28:10 2009
From: Mike.Colonno at spacex.com (Mike Colonno)
Date: Mon, 2 Feb 2009 07:28:10 -0800
Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers
In-Reply-To: <657E769D35612D4CAB1FEAACB0A92BB503094E5F3F@MAIL2.spacex.corp>
References: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com> <497FC559.4000801@ar.media.kyoto-u.ac.jp> <9cca9e840901271931s1706de0egd28382e4125644a@mail.gmail.com> <49801FF8.8040805@ar.media.kyoto-u.ac.jp> <9cca9e840901280818l2106e710q16599bf1423c3caf@mail.gmail.com> <5b8d13220901281636y2fe1b55dt3cb59dd46b43b1d0@mail.gmail.com> <9cca9e840901290757y10350efftfbdb8e4204c16795@mail.gmail.com> <789d27b10901300432q28226381x829575e8b1fa23fe@mail.gmail.com> <9cca9e840901300731y69ee6f49ra758ba4f8ccced69@mail.gmail.com>, <9cca9e840901311048q21afab1cv7351403f8c95f3af@mail.gmail.com>, <657E769D35612D4CAB1FEAACB0A92BB503094E5F3C@MAIL2.spacex.corp> <657E769D35612D4CAB1FEAACB0A92BB503094E5F3F@MAIL2.spacex.corp>
Message-ID: <657E769D35612D4CAB1FEAACB0A92BB5030984055E@MAIL2.spacex.corp>

Hi folks ~

Any thoughts on the below? In searching the web I found some other
references to "ImportError: DLL load failed: Invalid access to memory
location" but none specific to Numpy. As an aside: will there be a Windows
x64 binary distributed with the next release of Numpy / Scipy? Does anyone
have a working installer now? It may be easier to just wait for an official
release and use an existing binary in the meantime vs. banging my head
against this build.
>>> from numpy import *
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "C:\Python26\Lib\site-packages\numpy\__init__.py", line 130, in <module>
    import add_newdocs
  File "C:\Python26\Lib\site-packages\numpy\add_newdocs.py", line 9, in <module>
    from lib import add_newdoc
  File "C:\Python26\Lib\site-packages\numpy\lib\__init__.py", line 161, in <module>
    from polynomial import *
  File "C:\Python26\Lib\site-packages\numpy\lib\polynomial.py", line 18, in <module>
    from numpy.linalg import eigvals, lstsq
  File "C:\Python26\Lib\site-packages\numpy\linalg\__init__.py", line 47, in <module>
    from linalg import *
  File "C:\Python26\Lib\site-packages\numpy\linalg\linalg.py", line 22, in <module>
    from numpy.linalg import lapack_lite
ImportError: DLL load failed: Invalid access to memory location.

Thanks for the help,
~Mike C.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From cournape at gmail.com Mon Feb 2 10:55:36 2009
From: cournape at gmail.com (David Cournapeau)
Date: Tue, 3 Feb 2009 00:55:36 +0900
Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers
In-Reply-To: <657E769D35612D4CAB1FEAACB0A92BB5030984055E@MAIL2.spacex.corp>
References: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com> <9cca9e840901280818l2106e710q16599bf1423c3caf@mail.gmail.com> <5b8d13220901281636y2fe1b55dt3cb59dd46b43b1d0@mail.gmail.com> <9cca9e840901290757y10350efftfbdb8e4204c16795@mail.gmail.com> <789d27b10901300432q28226381x829575e8b1fa23fe@mail.gmail.com> <9cca9e840901300731y69ee6f49ra758ba4f8ccced69@mail.gmail.com> <9cca9e840901311048q21afab1cv7351403f8c95f3af@mail.gmail.com> <657E769D35612D4CAB1FEAACB0A92BB503094E5F3C@MAIL2.spacex.corp> <657E769D35612D4CAB1FEAACB0A92BB503094E5F3F@MAIL2.spacex.corp> <657E769D35612D4CAB1FEAACB0A92BB5030984055E@MAIL2.spacex.corp>
Message-ID: <5b8d13220902020755i10eda0f3y5b255ec2578bddea@mail.gmail.com>

On Tue, Feb 3, 2009 at 12:28 AM, Mike Colonno wrote:
> Hi folks ~
>
> Any thoughts on the below?

It is hard to say, but I suspect a problem in your BLAS/LAPACK because
it fails on importing lapack_lite. Your error message is so generic
that it does not give any useful information.

I have worked quite a bit last December on 64 bits support using the
free mingw compilers - it worked OK for the C code, but I have not
managed yet to build a numpy with a full BLAS/LAPACK, which requires a
Fortran compiler. Since I can't have access to a Fortran compiler on
that platform, I can't fix the remaining problems. And nobody stepped
in to fix the problems either.

David
From rmay31 at gmail.com Mon Feb 2 11:17:13 2009
From: rmay31 at gmail.com (Ryan May)
Date: Mon, 02 Feb 2009 10:17:13 -0600
Subject: [Numpy-discussion] Fortran binary files and numpy/scipy
In-Reply-To:
References:
Message-ID: <49871C89.4000800@gmail.com>

Nils Wagner wrote:
> On Mon, 2 Feb 2009 13:39:32 +0100 Matthieu Brucher wrote:
>> Hi,
>>
>> There was a discussion about this last week. You can find it in the
>> archives ;)
>>
>> Matthieu
>
> Hi Matthieu,
>
> Sorry but I missed that.
> Anyway I have some trouble with my short example.
>
> g77 -c binary_fortran.f
> g77 -o io binary_fortran.o
> ./io
>
> 11 254 254.
> 12 253 126.
> 13 252 84.
> 14 251 62.
> 15 250 50.
> 16 249 41.
> 17 248 35.
> 18 247 30.
> 19 246 27.
> 20 245 24.
>
> python -i read_fortran.py
>
>>>> a
> array([(16, 1090921693195, 254.0), (16, 16, 5.3686493512014268e-312),
>        (4638566878703255552, 16, 7.9050503334599447e-323),
>        (1082331758605, 4635611391447793664, 7.9050503334599447e-323),
>        (16, 1078036791310, 62.0), (16, 16, 5.3049894774872906e-312),
>        (4632233691727265792, 16, 7.9050503334599447e-323),
>        (1069446856720, 4630967054332067840, 7.9050503334599447e-323),
>        (16, 1065151889425, 35.0), (16, 16, 5.2413296037731544e-312),
>        (4629137466983448576, 16, 7.9050503334599447e-323),
>        (1056561954835, 4628293042053316608, 7.9050503334599447e-323),
>        (16, 1052266987540, 24.0)],
>       dtype=[('irow', '<i8'), ('icol', '<i8'), ('value', '<f8')])
>
> How can I fix the problem ?

Every write statement in fortran first writes out the number of bytes that will
follow, *then* the actual data. So, for instance, the first write to file in
your program will write the bytes corresponding to these values:

16 X(1) Y(1) Z(1)

The 16 comes from the size of 2 ints and 1 double. Since you're always writing
out the 3 values, and they're always the same size, try adding another integer
column as the first field in your array.

Ryan

--
Ryan May
Graduate Research Assistant
School of Meteorology
University of Oklahoma
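For reference, a sketch of a small helper that reads one unformatted
sequential record at a time with plain numpy. The marker width is
compiler-dependent ('<i4' is the common default, though the file in this
thread turns out to use 8-byte markers), and a marker is also written
*after* each record, as worked out later in the thread:

import numpy as np

def read_record(f, dtype, marker_dtype='<i4'):
    # leading record marker: byte count of the payload
    nbytes = int(np.fromfile(f, dtype=marker_dtype, count=1)[0])
    data = np.fromfile(f, dtype=dtype,
                       count=nbytes // np.dtype(dtype).itemsize)
    # trailing record marker (a second copy of the byte count)
    np.fromfile(f, dtype=marker_dtype, count=1)
    return data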
From hanni.ali at gmail.com Mon Feb 2 12:23:51 2009
From: hanni.ali at gmail.com (Hanni Ali)
Date: Mon, 2 Feb 2009 13:23:51 -0400
Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers
In-Reply-To: <5b8d13220902020755i10eda0f3y5b255ec2578bddea@mail.gmail.com>
References: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com> <5b8d13220901281636y2fe1b55dt3cb59dd46b43b1d0@mail.gmail.com> <9cca9e840901290757y10350efftfbdb8e4204c16795@mail.gmail.com> <789d27b10901300432q28226381x829575e8b1fa23fe@mail.gmail.com> <9cca9e840901300731y69ee6f49ra758ba4f8ccced69@mail.gmail.com> <9cca9e840901311048q21afab1cv7351403f8c95f3af@mail.gmail.com> <657E769D35612D4CAB1FEAACB0A92BB503094E5F3C@MAIL2.spacex.corp> <657E769D35612D4CAB1FEAACB0A92BB503094E5F3F@MAIL2.spacex.corp> <657E769D35612D4CAB1FEAACB0A92BB5030984055E@MAIL2.spacex.corp> <5b8d13220902020755i10eda0f3y5b255ec2578bddea@mail.gmail.com>
Message-ID: <789d27b10902020923h7099680awafa35905b30fb79a@mail.gmail.com>

Hi David,

I used free trials of the Intel and PGI compilers to try to compile an
external BLAS/LAPACK in conjunction with VS 2008. I also had no problems
getting the C code to compile, but couldn't get linking to work
successfully with fortran stuff. I would not be surprised if we could get a
licence from Intel or PGI to provide a pre-compiled exe.

Hanni

2009/2/2 David Cournapeau
> On Tue, Feb 3, 2009 at 12:28 AM, Mike Colonno wrote:
> > Hi folks ~
> >
> > Any thoughts on the below?
>
> It is hard to say, but I suspect a problem in your BLAS/LAPACK because
> it fails on importing lapack_lite. Your error message is so generic
> that it does not give any useful information.
>
> I have worked quite a bit last December on 64 bits support using the
> free mingw compilers - it worked OK for the C code, but I have not
> managed yet to build a numpy with a full BLAS/LAPACK, which requires a
> Fortran compiler. Since I can't have access to a Fortran compiler on
> that platform, I can't fix the remaining problems. And nobody stepped
> in to fix the problems either.
>
> David
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From cournape at gmail.com Mon Feb 2 12:53:22 2009
From: cournape at gmail.com (David Cournapeau)
Date: Tue, 3 Feb 2009 02:53:22 +0900
Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers
In-Reply-To: <789d27b10902020923h7099680awafa35905b30fb79a@mail.gmail.com>
References: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com> <9cca9e840901290757y10350efftfbdb8e4204c16795@mail.gmail.com> <789d27b10901300432q28226381x829575e8b1fa23fe@mail.gmail.com> <9cca9e840901300731y69ee6f49ra758ba4f8ccced69@mail.gmail.com> <9cca9e840901311048q21afab1cv7351403f8c95f3af@mail.gmail.com> <657E769D35612D4CAB1FEAACB0A92BB503094E5F3C@MAIL2.spacex.corp> <657E769D35612D4CAB1FEAACB0A92BB503094E5F3F@MAIL2.spacex.corp> <657E769D35612D4CAB1FEAACB0A92BB5030984055E@MAIL2.spacex.corp> <5b8d13220902020755i10eda0f3y5b255ec2578bddea@mail.gmail.com> <789d27b10902020923h7099680awafa35905b30fb79a@mail.gmail.com>
Message-ID: <5b8d13220902020953s1bd6e830n9debe5ae79dcfe50@mail.gmail.com>

On Tue, Feb 3, 2009 at 2:23 AM, Hanni Ali wrote:
> Hi David,
>
> I used free trials of the Intel and PGI compilers to try to compile an
> external BLAS/LAPACK in conjunction with VS 2008. I also had no problems
> getting the C code to compile, but couldn't get linking to work successfully
> with fortran stuff. I would not be surprised if we could get a licence from
> Intel or PGI to provide a pre-compiled exe.

I hope to be able to announce some good news on that front very soon :)

David
From nwagner at iam.uni-stuttgart.de Mon Feb 2 14:18:25 2009
From: nwagner at iam.uni-stuttgart.de (Nils Wagner)
Date: Mon, 02 Feb 2009 20:18:25 +0100
Subject: [Numpy-discussion] Fortran binary files and numpy/scipy
In-Reply-To: <49871C89.4000800@gmail.com>
References: <49871C89.4000800@gmail.com>
Message-ID:

On Mon, 02 Feb 2009 10:17:13 -0600 Ryan May wrote:
> Nils Wagner wrote:
>> On Mon, 2 Feb 2009 13:39:32 +0100 Matthieu Brucher wrote:
>>> Hi,
>>>
>>> There was a discussion about this last week. You can find it in the
>>> archives ;)
>>>
>>> Matthieu
>>
>> Hi Matthieu,
>>
>> Sorry but I missed that.
>> Anyway I have some trouble with my short example.
>>
>> g77 -c binary_fortran.f
>> g77 -o io binary_fortran.o
>> ./io
>>
>> 11 254 254.
>> 12 253 126.
>> 13 252 84.
>> 14 251 62.
>> 15 250 50.
>> 16 249 41.
>> 17 248 35.
>> 18 247 30.
>> 19 246 27.
>> 20 245 24.
>>
>> python -i read_fortran.py
>>
>>>>> a
>> array([(16, 1090921693195, 254.0), (16, 16, 5.3686493512014268e-312),
>>        (4638566878703255552, 16, 7.9050503334599447e-323),
>>        (1082331758605, 4635611391447793664, 7.9050503334599447e-323),
>>        (16, 1078036791310, 62.0), (16, 16, 5.3049894774872906e-312),
>>        (4632233691727265792, 16, 7.9050503334599447e-323),
>>        (1069446856720, 4630967054332067840, 7.9050503334599447e-323),
>>        (16, 1065151889425, 35.0), (16, 16, 5.2413296037731544e-312),
>>        (4629137466983448576, 16, 7.9050503334599447e-323),
>>        (1056561954835, 4628293042053316608, 7.9050503334599447e-323),
>>        (16, 1052266987540, 24.0)],
>>       dtype=[('irow', '<i8'), ('icol', '<i8'), ('value', '<f8')])
>>
>> How can I fix the problem ?
>
> Every write statement in fortran first writes out the number of bytes
> that will follow, *then* the actual data. So, for instance, the first
> write to file in your program will write the bytes corresponding to
> these values:
>
> 16 X(1) Y(1) Z(1)
>
> The 16 comes from the size of 2 ints and 1 double. Since you're always
> writing out the 3 values, and they're always the same size, try adding
> another integer column as the first field in your array.
>
> Ryan
>
> --
> Ryan May
> Graduate Research Assistant
> School of Meteorology
> University of Oklahoma
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

Hi Ryan,

I have modified the python script.

import numpy as np
fname = open("bin.dat",'rb')
dt = np.dtype([('isize',int),('irow',int),('icol',int),('value',float)])
a = np.fromfile(fname,dtype=dt)

>>> a
array([(16, 1090921693195, 4643140847074803712, 7.9050503334599447e-323),
       (16, 1086626725900, 4638566878703255552, 7.9050503334599447e-323),
       (16, 1082331758605, 4635611391447793664, 7.9050503334599447e-323),
       (16, 1078036791310, 4633922541587529728, 7.9050503334599447e-323),
       (16, 1073741824015, 4632233691727265792, 7.9050503334599447e-323),
       (16, 1069446856720, 4630967054332067840, 7.9050503334599447e-323),
       (16, 1065151889425, 4630122629401935872, 7.9050503334599447e-323),
       (16, 1060856922130, 4629137466983448576, 7.9050503334599447e-323),
       (16, 1056561954835, 4628293042053316608, 7.9050503334599447e-323),
       (16, 1052266987540, 4627448617123184640, 7.9050503334599447e-323)],
      dtype=[('isize', '<i8'), ('irow', '<i8'), ('icol', '<i8'), ('value', '<f8')])

Is this a 64-bit problem ?

From: rmay31 at gmail.com (Ryan May)
Date: Mon, 02 Feb 2009 13:38:14 -0600
Subject: [Numpy-discussion] Fortran binary files and numpy/scipy
References: <49871C89.4000800@gmail.com>
Message-ID: <49874BA6.3050806@gmail.com>

Nils Wagner wrote:
> Is this a 64-bit problem ?

I don't know if it's a 64-bit problem per se, so much as a disagreement
between fortran and numpy. Numpy is making the size of the integer fields
8 bytes, while in Fortran, they're only 4 bytes. When constructing your
dtype, use np.int32 or '<i4' and see if that fixes it.

Ryan
So, for instance, the >>> first write to file in >>> your program will write the bytes corresponding to these >>> values: >>> >>> 16 X(1) Y(1) Z(1) >>> >>> The 16 comes from the size of 2 ints and 1 double. >>> Since you're always writing >>> out the 3 values, and they're always the same size, try >>> adding another integer >>> column as the first field in your array. >>> >>> Ryan > >> Hi Ryan, >> >> I have modified the python script. >> >> import numpy as np >> fname = open("bin.dat",'rb') >> dt = >> np.dtype([('isize',int),('irow',int),('icol',int),('value',float)]) >> a = np.fromfile(fname,dtype=dt) >> >> >>>>> a >> array([(16, 1090921693195, 4643140847074803712, >> 7.9050503334599447e-323), >> (16, 1086626725900, 4638566878703255552, >> 7.9050503334599447e-323), >> (16, 1082331758605, 4635611391447793664, >> 7.9050503334599447e-323), >> (16, 1078036791310, 4633922541587529728, >> 7.9050503334599447e-323), >> (16, 1073741824015, 4632233691727265792, >> 7.9050503334599447e-323), >> (16, 1069446856720, 4630967054332067840, >> 7.9050503334599447e-323), >> (16, 1065151889425, 4630122629401935872, >> 7.9050503334599447e-323), >> (16, 1060856922130, 4629137466983448576, >> 7.9050503334599447e-323), >> (16, 1056561954835, 4628293042053316608, >> 7.9050503334599447e-323), >> (16, 1052266987540, 4627448617123184640, >> 7.9050503334599447e-323)], >> dtype=[('isize', '>('icol', >> '> >> Is this a 64-bit problem ? >> > > I don't know if it's a 64-bit problem per-se, so much as >a disagreement between > fortran and numpy. Numpy is making the size of the >integer fields 8 bytes, while > in Fortran, they're only 4 bytes. When constructing >your dtype, use np.int32 or > 'that fixes it. > dt = np.dtype([('isize','int32'),('irow','int32'),('icol','int32'),('value','float')]) >>> a array([(16, 0, 11, 1.2549267404367662e-321), (1081065472, 16, 0, 7.9050503334599447e-323), (12, 253, 0, 3.4485523805914514e-313), (0, 16, 0, 5.3474293932967148e-312), (0, 1079312384, 16, 3.3951932655444357e-313), (0, 14, 251, 62.0), (16, 0, 16, 3.1829936864479085e-313), (250, 0, 1078525952, 7.9050503334599447e-323), (16, 0, 16, 1.2302234581447039e-321), (1078231040, 16, 0, 7.9050503334599447e-323), (17, 248, 0, 3.4484552433329538e-313), (0, 16, 0, 5.2413296037731544e-312), (0, 1077805056, 16, 3.3951932655444357e-313), (0, 19, 246, 27.0), (16, 0, 16, 4.2439915819305446e-313), (245, 0, 1077411840, 7.9050503334599447e-323)], dtype=[('isize', ' References: <49871C89.4000800@gmail.com> <49874BA6.3050806@gmail.com> Message-ID: <49875287.6030805@gmail.com> Nils Wagner wrote: >>> Is this a 64-bit problem ? >>> >> I don't know if it's a 64-bit problem per-se, so much as >> a disagreement between >> fortran and numpy. Numpy is making the size of the >> integer fields 8 bytes, while >> in Fortran, they're only 4 bytes. When constructing >> your dtype, use np.int32 or >> '> that fixes it. 
Nils Wagner wrote:
>>> When constructing your dtype, use np.int32 or '<i4' and see if that
>>> fixes it.
>
> dt = np.dtype([('isize','int32'),('irow','int32'),('icol','int32'),('value','float')])
>
>>>> a
> array([(16, 0, 11, 1.2549267404367662e-321),
>        (1081065472, 16, 0, 7.9050503334599447e-323),
>        (12, 253, 0, 3.4485523805914514e-313),
>        (0, 16, 0, 5.3474293932967148e-312),
>        (0, 1079312384, 16, 3.3951932655444357e-313), (0, 14, 251, 62.0),
>        (16, 0, 16, 3.1829936864479085e-313),
>        (250, 0, 1078525952, 7.9050503334599447e-323),
>        (16, 0, 16, 1.2302234581447039e-321),
>        (1078231040, 16, 0, 7.9050503334599447e-323),
>        (17, 248, 0, 3.4484552433329538e-313),
>        (0, 16, 0, 5.2413296037731544e-312),
>        (0, 1077805056, 16, 3.3951932655444357e-313), (0, 19, 246, 27.0),
>        (16, 0, 16, 4.2439915819305446e-313),
>        (245, 0, 1077411840, 7.9050503334599447e-323)],
>       dtype=[('isize', '<i4'), ('irow', '<i4'), ('icol', '<i4'), ('value', '<f8')])

Maybe on 64-bit machines, the number of bytes is 64-bits instead of 32
(see the fact that the first 12 bytes of the file are 16 0 11). Try:

dt = np.dtype([('isize','int64'),('irow','int32'),('icol','int32'),('value','float')])

Ryan

--
Ryan May
Graduate Research Assistant
School of Meteorology
University of Oklahoma
From nwagner at iam.uni-stuttgart.de Mon Feb 2 15:57:36 2009
From: nwagner at iam.uni-stuttgart.de (Nils Wagner)
Date: Mon, 02 Feb 2009 21:57:36 +0100
Subject: [Numpy-discussion] Fortran binary files and numpy/scipy
In-Reply-To: <49875287.6030805@gmail.com>
References: <49871C89.4000800@gmail.com> <49874BA6.3050806@gmail.com> <49875287.6030805@gmail.com>
Message-ID:

On Mon, 02 Feb 2009 14:07:35 -0600 Ryan May wrote:
> Maybe on 64-bit machines, the number of bytes is 64-bits instead of 32
> (see the fact that the first 12 bytes of the file are 16 0 11). Try:
>
> dt = np.dtype([('isize','int64'),('irow','int32'),('icol','int32'),('value','float')])
>
> Ryan

Strange

>>> a
array([(16, 11, 254, 254.0), (16, 16, 0, 5.3686493512014268e-312),
       (4638566878703255552, 16, 0, 7.9050503334599447e-323),
       (1082331758605, 0, 1079312384, 7.9050503334599447e-323),
       (16, 14, 251, 62.0), (16, 16, 0, 5.3049894774872906e-312),
       (4632233691727265792, 16, 0, 7.9050503334599447e-323),
       (1069446856720, 0, 1078231040, 7.9050503334599447e-323),
       (16, 17, 248, 35.0), (16, 16, 0, 5.2413296037731544e-312),
       (4629137466983448576, 16, 0, 7.9050503334599447e-323),
       (1056561954835, 0, 1077608448, 7.9050503334599447e-323),
       (16, 20, 245, 24.0)],
      dtype=[('isize', '<i8'), ('irow', '<i4'), ('icol', '<i4'), ('value', '<f8')])

From: rmay31 at gmail.com (Ryan May)
Subject: [Numpy-discussion] Fortran binary files and numpy/scipy
References: <49871C89.4000800@gmail.com> <49874BA6.3050806@gmail.com> <49875287.6030805@gmail.com>
Message-ID: <498769B3.5020307@gmail.com>

Nils Wagner wrote:
> On Mon, 02 Feb 2009 14:07:35 -0600 Ryan May wrote:
>> Maybe on 64-bit machines, the number of bytes is 64-bits instead of 32
>> (see the fact that the first 12 bytes of the file are 16 0 11). Try:
>>
>> dt = np.dtype([('isize','int64'),('irow','int32'),('icol','int32'),('value','float')])
>>
>> Ryan
>
> Strange
>
>>>> a
> array([(16, 11, 254, 254.0), (16, 16, 0, 5.3686493512014268e-312),
>        (4638566878703255552, 16, 0, 7.9050503334599447e-323),
>        (1082331758605, 0, 1079312384, 7.9050503334599447e-323),
>        (16, 14, 251, 62.0), (16, 16, 0, 5.3049894774872906e-312),
>        (4632233691727265792, 16, 0, 7.9050503334599447e-323),
>        (1069446856720, 0, 1078231040, 7.9050503334599447e-323),
>        (16, 17, 248, 35.0), (16, 16, 0, 5.2413296037731544e-312),
>        (4629137466983448576, 16, 0, 7.9050503334599447e-323),
>        (1056561954835, 0, 1077608448, 7.9050503334599447e-323),
>        (16, 20, 245, 24.0)],
>       dtype=[('isize', '<i8'), ('irow', '<i4'), ('icol', '<i4'), ('value', '<f8')])

Apparently I was slightly off on the details (it's been a while since I had
to deal with this nonsense). The number of bytes written is written before
*and* after writing your actual data. So the following should work:

dtype=[('isize', '<i8'), ('irow', '<i4'), ('icol', '<i4'), ('value', '<f8'),
       ('isize2', '<i8')]

Ryan

--
Ryan May
Graduate Research Assistant
School of Meteorology
University of Oklahoma
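Putting Ryan's correction together, each 32-byte record here can be mapped
in one shot (a sketch; the field names are hypothetical, and the 8-byte
markers match this particular g77 build -- 4-byte markers are the more
common compiler default):

import numpy as np

# 8-byte leading marker, two 4-byte ints, one 8-byte real,
# then an 8-byte trailing marker
dt = np.dtype([('n1',    '<i8'),
               ('irow',  '<i4'),
               ('icol',  '<i4'),
               ('value', '<f8'),
               ('n2',    '<i8')])
a = np.fromfile(open('bin.dat', 'rb'), dtype=dt)
print a['irow'], a['icol'], a['value']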
From: Mike.Colonno at spacex.com (Mike Colonno)
Subject: [Numpy-discussion] f2py: VS version on Windows
References: <49871C89.4000800@gmail.com> <49874BA6.3050806@gmail.com> <49875287.6030805@gmail.com> <498769B3.5020307@gmail.com>
Message-ID: <657E769D35612D4CAB1FEAACB0A92BB503098E574A@MAIL2.spacex.corp>

I'm trying to test out f2py in Windows (python 2.5.4 32-bit for now + most
recent Numpy). I'd like to use the Intel compilers, but msvc is fine if
needed. I get the output below about which I have a question re: the
warning about VS version. I have VS 2008 currently which should have no
trouble making binaries compatible with older versions of VS(?) Is there
any way around this error with VS > 2003?

Thanks,
~Mike C.

C:\Python25\Lib\site-packages\numpy\f2py\docs>C:\Python25\Scripts\f2py.py -c --fcompiler=intel -m hello hello.f
Ignoring "Python was built with Visual Studio 2003;
extensions must be built with a compiler than can generate compatible binaries.
Visual Studio 2003 was not found on this system. If you have Cygwin installed,
you can try compiling with MingW32, by passing "-c mingw32" to setup.py." (one
should fix me in fcompiler/compaq.py)
running build
running config_cc
unifing config_cc, config, build_clib, build_ext, build commands --compiler options
running config_fc
unifing config_fc, config, build_clib, build_ext, build commands --fcompiler options
running build_src
building extension "hello" sources
f2py options: []
f2py:> c:\docume~1\mike\locals~1\temp\tmptd0t5g\src.win32-2.5\hellomodule.c
creating c:\docume~1\mike\locals~1\temp\tmptd0t5g
creating c:\docume~1\mike\locals~1\temp\tmptd0t5g\src.win32-2.5
Reading fortran codes...
        Reading file 'hello.f' (format:fix,strict)
Post-processing...
        Block: hello
                        Block: foo
Post-processing (stage 2)...
Building modules...
        Building module "hello"...
                Constructing wrapper function "foo"...
                  foo(a)
        Wrote C/API module "hello" to file "c:\docume~1\mike\locals~1\temp\tmptd0t5g\src.win32-2.5/hellomodule.c"
  adding 'c:\docume~1\mike\locals~1\temp\tmptd0t5g\src.win32-2.5\fortranobject.c' to sources.
  adding 'c:\docume~1\mike\locals~1\temp\tmptd0t5g\src.win32-2.5' to include_dirs.
copying C:\Python25\lib\site-packages\numpy\f2py\src\fortranobject.c -> c:\docume~1\mike\locals~1\temp\tmptd0t5g\src.win32-2.5
copying C:\Python25\lib\site-packages\numpy\f2py\src\fortranobject.h -> c:\docume~1\mike\locals~1\temp\tmptd0t5g\src.win32-2.5
running build_ext
No module named msvccompiler in numpy.distutils; trying from distutils
error: Python was built with Visual Studio 2003;
I get the output below about which I have a question re: the > warning about VS version. I have VS 2008 currently which should have no > trouble making binaries compatible with older version of VS(?) Is there any > way around this error with VS > 2003? > > > > Thanks, > > ~Mike C. > > > > > > C:\Python25\Lib\site-packages\numpy\f2py\docs>C:\Python25\Scripts\f2py.py -c > --f > > compiler=intel -m hello hello.f > > Ignoring "Python was built with Visual Studio 2003; > > extensions must be built with a compiler than can generate compatible > binaries. > > Visual Studio 2003 was not found on this system. If you have Cygwin > installed, > > you can try compiling with MingW32, by passing "-c mingw32" to setup.py." > (one s > > hould fix me in fcompiler/compaq.py) > > running build > > running config_cc > > unifing config_cc, config, build_clib, build_ext, build commands --compiler > opti > > ons > > running config_fc > > unifing config_fc, config, build_clib, build_ext, build commands --fcompiler > opt > > ions > > running build_src > > building extension "hello" sources > > f2py options: [] > > f2py:> c:\docume~1\mike\locals~1\temp\tmptd0t5g\src.win32-2.5\hellomodule.c > > creating c:\docume~1\mike\locals~1\temp\tmptd0t5g > > creating c:\docume~1\mike\locals~1\temp\tmptd0t5g\src.win32-2.5 > > Reading fortran codes... > > Reading file 'hello.f' (format:fix,strict) > > Post-processing... > > Block: hello > > Block: foo > > Post-processing (stage 2)... > > Building modules... > > Building module "hello"... > > Constructing wrapper function "foo"... > > foo(a) > > Wrote C/API module "hello" to file > "c:\docume~1\mike\locals~1\temp\tmptd > > 0t5g\src.win32-2.5/hellomodule.c" > > adding > 'c:\docume~1\mike\locals~1\temp\tmptd0t5g\src.win32-2.5\fortranobject.c > > ' to sources. > > adding 'c:\docume~1\mike\locals~1\temp\tmptd0t5g\src.win32-2.5' to > include_dir > > s. > > copying C:\Python25\lib\site-packages\numpy\f2py\src\fortranobject.c -> > c:\docum > > e~1\mike\locals~1\temp\tmptd0t5g\src.win32-2.5 > > copying C:\Python25\lib\site-packages\numpy\f2py\src\fortranobject.h -> > c:\docum > > e~1\mike\locals~1\temp\tmptd0t5g\src.win32-2.5 > > running build_ext > > No module named msvccompiler in numpy.distutils; trying from distutils > > error: Python was built with Visual Studio 2003; > > extensions must be built with a compiler than can generate compatible > binaries. > > Visual Studio 2003 was not found on this system. If you have Cygwin > installed, > > you can try compiling with MingW32, by passing "-c mingw32" to setup.py. > > > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > -- Patrick Marsh Graduate Research Assistant School of Meteorology University of Oklahoma http://www.patricktmarsh.com From mtrumpis at berkeley.edu Mon Feb 2 19:21:53 2009 From: mtrumpis at berkeley.edu (mtrumpis at berkeley.edu) Date: Mon, 2 Feb 2009 16:21:53 -0800 (PST) Subject: [Numpy-discussion] SVD errors Message-ID: <34928.128.32.52.185.1233620513.squirrel@calmail.berkeley.edu> Hello list.. I've run into two SVD errors over the last few days. Both errors are identical in numpy/scipy. I've submitted a ticket for the 1st problem (numpy ticket #990). Summary is: some builds of the lapack_lite module linking against system LAPACK (not the bundled dlapack_lite.o, etc) give a "LinAlgError: SVD did not converge" exception on my matrix. 
This error does occur using Mac's Accelerate framework LAPACK, and a coworker's Ubuntu LAPACK version. It does not seem to happen using ATLAS LAPACK (nor using Octave/Matlab on said Ubuntu) Just today I've come across a negative singular value cropping up in an SVD of a different matrix. This error does occur on my ATLAS LAPACK based numpy, as well as on the Ubuntu setup. And once again, it does not happen in Octave/Matlab. I'm using numpy 1.3.0.dev6336 -- don't know what the Ubuntu box is running. Here are some npy files for the two different cases: https://cirl.berkeley.edu/twiki/pub/User/MikeTrumpis/noconverge_operator.npy https://cirl.berkeley.edu/twiki/pub/User/MikeTrumpis/negsval_operator.npy Mike From robert.kern at gmail.com Mon Feb 2 19:27:05 2009 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 2 Feb 2009 18:27:05 -0600 Subject: [Numpy-discussion] SVD errors In-Reply-To: <34928.128.32.52.185.1233620513.squirrel@calmail.berkeley.edu> References: <34928.128.32.52.185.1233620513.squirrel@calmail.berkeley.edu> Message-ID: <3d375d730902021627x2ef54411j5bb9504927ff831@mail.gmail.com> On Mon, Feb 2, 2009 at 18:21, wrote: > Hello list.. I've run into two SVD errors over the last few days. Both > errors are identical in numpy/scipy. > > I've submitted a ticket for the 1st problem (numpy ticket #990). Summary > is: some builds of the lapack_lite module linking against system LAPACK > (not the bundled dlapack_lite.o, etc) give a "LinAlgError: SVD did not > converge" exception on my matrix. This error does occur using Mac's > Accelerate framework LAPACK, and a coworker's Ubuntu LAPACK version. It > does not seem to happen using ATLAS LAPACK (nor using Octave/Matlab on > said Ubuntu) These are almost certainly issues with the particular implementations of LAPACK that you are using. I don't think there is anything we can do from numpy or scipy to change this. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From lou_boog2000 at yahoo.com Mon Feb 2 19:40:04 2009 From: lou_boog2000 at yahoo.com (Lou Pecora) Date: Mon, 2 Feb 2009 16:40:04 -0800 (PST) Subject: [Numpy-discussion] SVD errors In-Reply-To: <34928.128.32.52.185.1233620513.squirrel@calmail.berkeley.edu> Message-ID: <459961.27309.qm@web34408.mail.mud.yahoo.com> I ran into this problem a year or so ago. I suspect my messages to the list are in the archives somewhere. It is a known problem and involves a hard-coded maximum number of iterations in the SVD code. The problem is on the LaPack side. You can go in and change it, but then you have to recompile everything and rebuild Numpy, etc. etc. Not sure how easy/hard this is. I avoided it. What I found that worked for me (depends on your numerical situation) is to take the original matrix you are trying to decompose, say A, and examine, instead, the SVD of A^T A. Then the singular values of that matrix are the square of the singular values of A. This worked for me, but my original matrix was square. Maybe that helped. Don't know. It's worth a try. -- Lou Pecora, my views are my own. --- On Mon, 2/2/09, mtrumpis at berkeley.edu wrote: > From: mtrumpis at berkeley.edu > Subject: [Numpy-discussion] SVD errors > To: numpy-discussion at scipy.org > Date: Monday, February 2, 2009, 7:21 PM > Hello list.. I've run into two SVD errors over the last > few days. Both > errors are identical in numpy/scipy. 
> > I've submitted a ticket for the 1st problem (numpy ticket #990).
> Summary is: some builds of the lapack_lite module linking against
> system LAPACK (not the bundled dlapack_lite.o, etc) give a
> "LinAlgError: SVD did not converge" exception on my matrix. This error
> does occur using Mac's Accelerate framework LAPACK, and a coworker's
> Ubuntu LAPACK version. It does not seem to happen using ATLAS LAPACK
> (nor using Octave/Matlab on said Ubuntu)
>
> Just today I've come across a negative singular value cropping up in an
> SVD of a different matrix. This error does occur on my ATLAS LAPACK
> based numpy, as well as on the Ubuntu setup. And once again, it does
> not happen in Octave/Matlab.
>
> I'm using numpy 1.3.0.dev6336 -- don't know what the Ubuntu box is
> running.
>
> Here are some npy files for the two different cases:
>
> https://cirl.berkeley.edu/twiki/pub/User/MikeTrumpis/noconverge_operator.npy
> https://cirl.berkeley.edu/twiki/pub/User/MikeTrumpis/negsval_operator.npy
>
> Mike
>
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From sturla at molden.no Mon Feb 2 20:55:58 2009
From: sturla at molden.no (Sturla Molden)
Date: Tue, 3 Feb 2009 02:55:58 +0100 (CET)
Subject: [Numpy-discussion] f2py: VS version on Windows
In-Reply-To:
References: <49871C89.4000800@gmail.com> <49874BA6.3050806@gmail.com> <49875287.6030805@gmail.com> <498769B3.5020307@gmail.com> <657E769D35612D4CAB1FEAACB0A92BB503098E574A@MAIL2.spacex.corp>
Message-ID: <263280a61eb7f65d0aee5504fa956201.squirrel@webmail.uio.no>

> Python extensions must be built using the same compiler as was used to
> build the Python binary. Python 2.5.4 was built using MSVC2003 and so
> extensions for it must be built using the same compiler. The exception
> to this rule is that extensions built using mingw32 (and msys) will
> work with most, if not all, windows Python binaries.

This is NOT correct. What you cannot do is safely share CRT objects
between different CRTs. You cannot malloc() some memory with msvcrt.dll,
and subsequently free() the memory with msvcr71.dll. Similarly, you
cannot fopen() a FILE* with msvcr71.dll and fread() with msvcrt.dll.
Applications that do this will sooner or later fail in mysterious ways.

Thus, to be on the safe side, you should link against the same CRT that
Python and other native extensions use. That is, msvcr71.dll for Python
2.5. MSVC2003 usually does this by default, but there are exceptions.
For example, the Visual C++ 2003 Toolkit links against a static version
of msvcr71.

GCC (mingw) will by default link with msvcrt.dll. It will link with the
same CRT as Python if you link with -lmsvcr71. You can create link
libraries against this CRT for most compilers, you just need a .def
file. Thus, you can use all compilers. If your extension does NOT share
its CRT objects with Python, it does not matter what compiler you use.

IANAL, but there is a licensing issue one needs to be aware of: You are
not allowed to redistribute msvcr71.dll unless you own a VS2003 license
and have used that compiler. This is important if you use py2exe to
create a Windows executable. This is one example where GCC cannot be
used. But it is a legal issue, not a technical one. On the other hand,
I don't think Microsoft has ever prosecuted anyone for infringement of
their msvcr71.dll copyright. They just want you to develop for their OS.
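[A minimal numpy sketch of the A^T A workaround Lou Pecora describes
earlier in this thread, assuming a real matrix such as the posted
noconverge_operator.npy has been downloaded locally. One caveat to keep
in mind: forming dot(A.T, A) squares the condition number, so the
smallest singular values come back with reduced accuracy:]

import numpy as np

A = np.load('noconverge_operator.npy')
# the singular values of dot(A.T, A) are the squares of those of A,
# so one sqrt recovers them without running svd on A itself
s = np.sqrt(np.linalg.svd(np.dot(A.T, A), compute_uv=False))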
But if you are going to use py2exe, you have to sort this problem out,
or simply redistribute a full Python distro instead. In that case, PSF
has built Python with a licensed VC2003 compiler, and you are just
redistributing their binary installer (which is ok).

Sturla Molden

From sturla at molden.no Mon Feb 2 21:03:31 2009
From: sturla at molden.no (Sturla Molden)
Date: Tue, 3 Feb 2009 03:03:31 +0100 (CET)
Subject: [Numpy-discussion] f2py: VS version on Windows
In-Reply-To: <657E769D35612D4CAB1FEAACB0A92BB503098E574A@MAIL2.spacex.corp>
References: <49871C89.4000800@gmail.com> <49874BA6.3050806@gmail.com> <49875287.6030805@gmail.com> <498769B3.5020307@gmail.com> <657E769D35612D4CAB1FEAACB0A92BB503098E574A@MAIL2.spacex.corp>
Message-ID:

> I'm trying to test out f2py in Windows (python 2.5.4 32-bit
> for now + most recent Numpy). I'd like to use the Intel
> compilers, but msvc is fine if needed. I get the output below
> about which I have a question re: the warning about VS
> version. I have VS 2008 currently which should have no trouble
> making binaries compatible with older version of VS(?) Is
> there any way around this error with VS > 2003?

In short: for Python 2.5 you must link with msvcr71.dll. The version of
the MSVC compiler is unimportant if you can force it to use this CRT.
Check your link libraries, and see if you find one for this DLL.
Otherwise, you need to create a .def file for this DLL and create the
link library from that. And make sure you don't link with other CRT
versions.

S.M.

From patrickmarshwx at gmail.com Mon Feb 2 21:12:10 2009
From: patrickmarshwx at gmail.com (Patrick Marsh)
Date: Mon, 2 Feb 2009 20:12:10 -0600
Subject: [Numpy-discussion] f2py: VS version on Windows
In-Reply-To:
References: <49871C89.4000800@gmail.com> <49874BA6.3050806@gmail.com> <49875287.6030805@gmail.com> <498769B3.5020307@gmail.com> <657E769D35612D4CAB1FEAACB0A92BB503098E574A@MAIL2.spacex.corp>
Message-ID:

You learn something new every day. It turns out I had previously been
given wrong information and I apologize for passing that along to the
list.

-Patrick

On Mon, Feb 2, 2009 at 8:03 PM, Sturla Molden wrote:
>
>> I'm trying to test out f2py in Windows (python 2.5.4 32-bit
>> for now + most recent Numpy). I'd like to use the Intel
>> compilers, but msvc is fine if needed. I get the output below
>> about which I have a question re: the warning about VS
>> version. I have VS 2008 currently which should have no trouble
>> making binaries compatible with older version of VS(?) Is
>> there any way around this error with VS > 2003?
>
> In short: for Python 2.5 you must link with msvcr71.dll. The
> version of the MSVC compiler is unimportant if you can force it to use
> this CRT. Check your link libraries, and see if you find one for this DLL.
> Otherwise, you need to create a .def file for this DLL and create the link
> library from that. And make sure you don't link with other CRT versions.
>
> S.M.
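[As a quick check of the CRT question Sturla covers above, Python will
report the MSVC version it was built with, and hence the CRT it
expects. A small sketch using the stock distutils API:]

import sys
from distutils.msvccompiler import get_build_version

print sys.version          # build tag ends with e.g. '[MSC v.1310 32 bit (Intel)]'
print get_build_version()  # 7.1 for python.org's 2.5, i.e. the msvcr71.dll CRT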
> > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -- Patrick Marsh Graduate Research Assistant School of Meteorology University of Oklahoma http://www.patricktmarsh.com From cournape at gmail.com Mon Feb 2 22:25:40 2009 From: cournape at gmail.com (David Cournapeau) Date: Tue, 3 Feb 2009 12:25:40 +0900 Subject: [Numpy-discussion] f2py: VS version on Windows In-Reply-To: <263280a61eb7f65d0aee5504fa956201.squirrel@webmail.uio.no> References: <49874BA6.3050806@gmail.com> <49875287.6030805@gmail.com> <498769B3.5020307@gmail.com> <657E769D35612D4CAB1FEAACB0A92BB503098E574A@MAIL2.spacex.corp> <263280a61eb7f65d0aee5504fa956201.squirrel@webmail.uio.no> Message-ID: <5b8d13220902021925w696472f2h12d0925640b3e433@mail.gmail.com> On Tue, Feb 3, 2009 at 10:55 AM, Sturla Molden wrote: > >> Python extensions must be built using the same compiler as was used to >> build the Python binary. Python 2.5.4 was built using MSVC2003 and so >> extensions for it must be built using the same compiler. The exception >> to this rule is that extensions built using mingw32 (and msys) will >> work with most, if not all, windows Python binaries. > > > This is NOT correct. Although it is technically true, that's relatively irrelevant for practical matters: it is not currently possible to build numpy with VS 2008 and 7.1 CRT, and you have to build numpy with the same compiler as the one used by python if you use MS compilers. David From strawman at astraw.com Tue Feb 3 03:18:49 2009 From: strawman at astraw.com (Andrew Straw) Date: Tue, 03 Feb 2009 00:18:49 -0800 Subject: [Numpy-discussion] N-D array interface page is out of date In-Reply-To: <49791FA1.3020803@astraw.com> References: <49791FA1.3020803@astraw.com> Message-ID: <4987FDE9.5030303@astraw.com> Regarding http://numpy.scipy.org/array_interface.shtml : I just noticed that this out of date page is now featuring in recent discussions about the future of Numpy in Ubuntu: https://bugs.launchpad.net/ubuntu/+source/python-numpy/+bug/309215 Can someone with appropriate permissions fix the page or give me the appropriate permissions so I can do it? I think even deleting the page is better than keeping it as-is. -Andrew Andrew Straw wrote: > Hi, I just noticed that the N-D array interface page is outdated and > doesn't mention the buffer interface that is standard with Python 2.6 > and Python 3.0: > > http://numpy.scipy.org/array_interface.shtml > > This page is linked to from http://numpy.scipy.org/ > > I suggest, at the minimum, modifying the page with really annoying > blinking red letters at the top (or other suitable warning) that this is > deprecated at that people should use > http://www.python.org/dev/peps/pep-3118/ instead. > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From pav at iki.fi Tue Feb 3 03:46:12 2009 From: pav at iki.fi (Pauli Virtanen) Date: Tue, 3 Feb 2009 08:46:12 +0000 (UTC) Subject: [Numpy-discussion] SVD errors References: <34928.128.32.52.185.1233620513.squirrel@calmail.berkeley.edu> <3d375d730902021627x2ef54411j5bb9504927ff831@mail.gmail.com> Message-ID: Mon, 02 Feb 2009 18:27:05 -0600, Robert Kern wrote: > On Mon, Feb 2, 2009 at 18:21, wrote: >> Hello list.. I've run into two SVD errors over the last few days. Both >> errors are identical in numpy/scipy. 
>> >> I've submitted a ticket for the 1st problem (numpy ticket #990). >> Summary is: some builds of the lapack_lite module linking against >> system LAPACK (not the bundled dlapack_lite.o, etc) give a >> "LinAlgError: SVD did not converge" exception on my matrix. This error >> does occur using Mac's Accelerate framework LAPACK, and a coworker's >> Ubuntu LAPACK version. It does not seem to happen using ATLAS LAPACK >> (nor using Octave/Matlab on said Ubuntu) > > These are almost certainly issues with the particular implementations of > LAPACK that you are using. I don't think there is anything we can do > from numpy or scipy to change this. Yes, this is almost certainly a LAPACK problem. If in doubt, you can test it with the following F90 program (be sure to link it against the same LAPACK as Numpy). Save the matrices with 'np.savetxt("foo.txt", x.ravel ())' and run './test < foo.txt'. ---- program test implicit none integer, parameter :: N = 128 double precision :: A(N*N), S(N), U(N,N), Vh(N,N) double precision, allocatable :: WORK(:) double precision :: tmp integer :: IWORK(8*N) integer :: INFO = 0, LWORK read(*,*) A A = reshape(transpose(reshape(A, (/N, N/))), (/ N*N /)) call dgesdd('A', N, N, A, N, S, U, N, Vh, N, tmp, -1, IWORK, INFO) LWORK = tmp if (info .ne. 0) stop 'lwork query failed' write(*,*) 'lwork:', lwork allocate(WORK(LWORK)) call dgesdd('A', N, N, A, N, S, U, N, Vh, N, WORK, LWORK, IWORK, INFO) write(*,*) 'info:', INFO write(*,*) 'min(S):', minval(S) if (INFO .ne. 0) then write(*,*) ' -> SVD failed to converge' end if if (minval(S) .lt. 0) then write(*,*) ' -> negative singular value' end if end program ---- From ondrej at certik.cz Tue Feb 3 04:03:15 2009 From: ondrej at certik.cz (Ondrej Certik) Date: Tue, 3 Feb 2009 01:03:15 -0800 Subject: [Numpy-discussion] porting NumPy to Python 3 In-Reply-To: References: Message-ID: <85b5c3130902030103w69180a6bm4143050b4b82bf1f@mail.gmail.com> Hi James, On Thu, Jan 29, 2009 at 2:11 AM, James Watson wrote: > Hi, > > I am interested in contributing to the port of NumPy to Python 3. Who > I should coordinate effort with? > > I have started at the Python end of the problem (as opposed to > http://www.scipy.org/Python3k), e.g. I have several patches to get > 2to3 to work on NumPy's Python source code. I am sorry that noone has replied to your email. Could you please upload your patches somewhere? Ondrej From faltet at pytables.org Tue Feb 3 04:15:16 2009 From: faltet at pytables.org (Francesc Alted) Date: Tue, 3 Feb 2009 10:15:16 +0100 Subject: [Numpy-discussion] example reading binary Fortran file In-Reply-To: References: <6df541a5c8d6ecf26b6e38f404958401.squirrel@webmail.uio.no> Message-ID: <200902031015.17490.faltet@pytables.org> A Friday 30 January 2009, David Froger escrigu?: > ok for f2py! > > > Otherwise, you will have to figure out how your Fortran program > > writes the file. I.e. what padding, metainformation, etc. that are > > used. If you switch Fortran compiler, or even compiler version from > > the same vendor, you must start over again. > > In my experience, I never had this kind of problem. I just have to > convert files between big/little endian with uswap > (http://linux.die.net/man/1/uswap), but I never see, in my > experience, a Fortran program writting data differently given the > compilator. > > >For my own work, I just makes sure NEVER to do any I/O in Fortran! > > It is asking for trouble. I leave the I/O to Python or C, where it > > belongs. That way I know what data are written and what data are > > read. 
> Unfortunately, binary files are mandatory in the context I work in. I
> use a scientific code written in Fortran to compute fluid dynamics.
> Typically the simulation is run on a supercomputer and generates
> gigabytes upon gigabytes of data, so we must use the binary format,
> which requires less storage space. Then I like to post-process the
> data using Python and Gnuplot.py.
>
> That's why I'm looking for an efficient, easy and 'standard' way to
> read binary Fortran files. (I think many people have the same need.)

If you need to compact your datafiles to a maximum, you may want to
write your data with the HDF5 library [1], which, besides using a
binary format, allows on-the-fly compression. HDF5 is a fast-growing
standard in scientific computing and it has wrappers for the most
important languages like C, Fortran, Java and, of course, Python ;-)

In particular, the available HDF5 interfaces to Python allow you to
read/write native HDF5 files very easily. Also, many computational
environments, like Matlab, Octave or IDL, support HDF5 files.

[1] http://www.hdfgroup.org/HDF5/

Cheers,

--
Francesc Alted

From ndbecker2 at gmail.com Tue Feb 3 07:24:40 2009
From: ndbecker2 at gmail.com (Neal Becker)
Date: Tue, 03 Feb 2009 07:24:40 -0500
Subject: [Numpy-discussion] from_function
Message-ID:

I've been using something I wrote:

coef_from_function (function, delta, size)

which does (c++ code):

double center = double(size-1)/2;
for (int i = 0; i < size; ++i)
  coef[i] = call (func, double(i - center) * delta);

I thought to translate this to np.fromfunction. It seems fromfunction
is not as flexible: it uses only a fixed integer grid?

From stefan at sun.ac.za Tue Feb 3 08:06:47 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Tue, 3 Feb 2009 15:06:47 +0200
Subject: [Numpy-discussion] N-D array interface page is out of date
In-Reply-To: <4987FDE9.5030303@astraw.com>
References: <49791FA1.3020803@astraw.com> <4987FDE9.5030303@astraw.com>
Message-ID: <9457e7c80902030506l7094e8d3x33996b861f61bff8@mail.gmail.com>

2009/2/3 Andrew Straw :
> Can someone with appropriate permissions fix the page or give me the
> appropriate permissions so I can do it? I think even deleting the page
> is better than keeping it as-is.

Who all has editing access to this page? Is it hosted on scipy.org?

Stéfan

From pgmdevlist at gmail.com Tue Feb 3 09:49:00 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Tue, 3 Feb 2009 09:49:00 -0500
Subject: [Numpy-discussion] Numpy 1.3 release date ?
Message-ID:

All,
When can we expect numpy 1.3 to be released ?
Sincerely,
P.

From Mike.Colonno at spacex.com Tue Feb 3 10:52:36 2009
From: Mike.Colonno at spacex.com (Mike Colonno)
Date: Tue, 3 Feb 2009 07:52:36 -0800
Subject: [Numpy-discussion] f2py: VS version on Windows
In-Reply-To: <5b8d13220902021925w696472f2h12d0925640b3e433@mail.gmail.com>
References: <49874BA6.3050806@gmail.com> <49875287.6030805@gmail.com> <498769B3.5020307@gmail.com> <657E769D35612D4CAB1FEAACB0A92BB503098E574A@MAIL2.spacex.corp> <263280a61eb7f65d0aee5504fa956201.squirrel@webmail.uio.no> <5b8d13220902021925w696472f2h12d0925640b3e433@mail.gmail.com>
Message-ID: <657E769D35612D4CAB1FEAACB0A92BB503098E5931@MAIL2.spacex.corp>

Thanks to all for clearing this up. I have been bouncing this issue off
the folks at Intel and they allege that Intel C++ should be able to do
this independent of the version of VS used originally (I am skeptical).
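[Back on Neal's from_function question above: np.fromfunction passes
the callable an index grid of dtype float by default, so the grid can
be shifted and scaled inside the callable; alternatively the C++ loop
vectorizes directly. A sketch, where coef_from_function is an
illustration rather than numpy API, and func is assumed to accept
ndarray arguments:]

import numpy as np

def coef_from_function(func, delta, size):
    # vectorized version of the C++ loop above
    center = (size - 1) / 2.0
    return func((np.arange(size) - center) * delta)

# the same thing via np.fromfunction, rescaling the float index grid;
# for size=8, delta=0.25 the two give identical results
coef = np.fromfunction(lambda i: np.sinc((i - 3.5) * 0.25), (8,))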
I am still getting some MKL-related missing symbol errors that we are clearing up and I will post anything useful that I discover. ~Mike C. -----Original Message----- From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of David Cournapeau Sent: Monday, February 02, 2009 7:26 PM To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] f2py: VS version on Windows On Tue, Feb 3, 2009 at 10:55 AM, Sturla Molden wrote: > >> Python extensions must be built using the same compiler as was used to >> build the Python binary. Python 2.5.4 was built using MSVC2003 and so >> extensions for it must be built using the same compiler. The exception >> to this rule is that extensions built using mingw32 (and msys) will >> work with most, if not all, windows Python binaries. > > > This is NOT correct. Although it is technically true, that's relatively irrelevant for practical matters: it is not currently possible to build numpy with VS 2008 and 7.1 CRT, and you have to build numpy with the same compiler as the one used by python if you use MS compilers. David _______________________________________________ Numpy-discussion mailing list Numpy-discussion at scipy.org http://projects.scipy.org/mailman/listinfo/numpy-discussion From rmay31 at gmail.com Tue Feb 3 11:24:42 2009 From: rmay31 at gmail.com (Ryan May) Date: Tue, 03 Feb 2009 10:24:42 -0600 Subject: [Numpy-discussion] genloadtxt question Message-ID: <49886FCA.50407@gmail.com> Pierre, Should the following work? import numpy as np from StringIO import StringIO converter = {'date':lambda s: datetime.strptime(s,'%Y-%m-%d %H:%M:%SZ')} data = np.ndfromtxt(StringIO('2009-02-03 12:00:00Z,72214.0'), delimiter=',', names=['date','stid'], dtype=None, converters=converter) Right now, it's giving me the following: Traceback (most recent call last): File "check_oban.py", line 15, in converters=converter) File "/home/rmay/.local/lib64/python2.5/site-packages/numpy/lib/io.py", line 993, in ndfromtxt return genfromtxt(fname, **kwargs) File "/home/rmay/.local/lib64/python2.5/site-packages/numpy/lib/io.py", line 842, in genfromtxt locked=True) File "/home/rmay/.local/lib64/python2.5/site-packages/numpy/lib/_iotools.py", line 472, in update self.type = self._getsubdtype(func('0')) File "check_oban.py", line 9, in lambda s: datetime.strptime(s,'%Y-%m-%d %H:%M:%SZ').replace(tzinfo=UTC)} File "/usr/lib64/python2.5/_strptime.py", line 330, in strptime (data_string, format)) ValueError: time data did not match format: data=0 fmt=%Y-%m-%d %H:%M:%SZ Which comes from a part of the code in updating converters where it passes the string '0' to the converter. Are the converters expected to handle what amounts to bad input even though the file itself has no such problems? Specifying the dtype doesn't appear to help either. Ryan -- Ryan May Graduate Research Assistant School of Meteorology University of Oklahoma From pgmdevlist at gmail.com Tue Feb 3 12:17:01 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Tue, 3 Feb 2009 12:17:01 -0500 Subject: [Numpy-discussion] genloadtxt question In-Reply-To: <49886FCA.50407@gmail.com> References: <49886FCA.50407@gmail.com> Message-ID: On Feb 3, 2009, at 11:24 AM, Ryan May wrote: > Pierre, > > Should the following work? 
> > import numpy as np > from StringIO import StringIO > > converter = {'date':lambda s: datetime.strptime(s,'%Y-%m-%d %H:%M: > %SZ')} > data = np.ndfromtxt(StringIO('2009-02-03 12:00:00Z,72214.0'), > delimiter=',', > names=['date','stid'], dtype=None, converters=converter) Well, yes, it should work. That's indeed a problem with the getsubdtype method of the converter. The problem is that we need to estimate the datatype of the output of the converter. In most cases, trying to convert '0' works properly, not in yours however. In r6338, I force the type to object if converting '0' does not work. That's a patch till the next corner case... From simpson at math.toronto.edu Tue Feb 3 12:19:45 2009 From: simpson at math.toronto.edu (Gideon Simpson) Date: Tue, 3 Feb 2009 12:19:45 -0500 Subject: [Numpy-discussion] array vector elementwise multiplication Message-ID: <6A3525EC-A389-4DDF-A68C-BB8455417916@math.toronto.edu> I have an M x N matrix A and two vectors, an M dimensional vector x and an N dimensional vector y. I would like to be able to do two things. 1. Multiply, elementwise, every column of A by x 2. Multiply, elementwise, every row of A by y. What's the "quick" way to do this in numpy? -gideon From rmay31 at gmail.com Tue Feb 3 12:27:19 2009 From: rmay31 at gmail.com (Ryan May) Date: Tue, 03 Feb 2009 11:27:19 -0600 Subject: [Numpy-discussion] genloadtxt question In-Reply-To: References: <49886FCA.50407@gmail.com> Message-ID: <49887E77.3050409@gmail.com> Pierre GM wrote: > On Feb 3, 2009, at 11:24 AM, Ryan May wrote: > >> Pierre, >> >> Should the following work? >> >> import numpy as np >> from StringIO import StringIO >> >> converter = {'date':lambda s: datetime.strptime(s,'%Y-%m-%d %H:%M: >> %SZ')} >> data = np.ndfromtxt(StringIO('2009-02-03 12:00:00Z,72214.0'), >> delimiter=',', >> names=['date','stid'], dtype=None, converters=converter) > > Well, yes, it should work. That's indeed a problem with the > getsubdtype method of the converter. > The problem is that we need to estimate the datatype of the output of > the converter. In most cases, trying to convert '0' works properly, > not in yours however. In r6338, I force the type to object if > converting '0' does not work. That's a patch till the next corner > case... Thanks for the quick patch! And yeah, I can't think of any better behavior. It's actually what I ended up doing in my conversion function, so, if nothing else, it removes the user from having to write that kind of boilerplate code. Thanks, Ryan -- Ryan May Graduate Research Assistant School of Meteorology University of Oklahoma From david.froger.info at gmail.com Tue Feb 3 13:30:46 2009 From: david.froger.info at gmail.com (David Froger) Date: Tue, 3 Feb 2009 19:30:46 +0100 Subject: [Numpy-discussion] example reading binary Fortran file In-Reply-To: <200902031015.17490.faltet@pytables.org> References: <6df541a5c8d6ecf26b6e38f404958401.squirrel@webmail.uio.no> <200902031015.17490.faltet@pytables.org> Message-ID: Thanks a lot Fransesc and Neil, yours messages really help me. I'll look at these solutions attentively. Here is what I write recently, but I begin to understand it's effectively not portable... def fread(fileObject,*arrayAttributs): """ Reading in a binary (=unformatted) Fortran file Let's call 'record' the list of arrays written with one write in the Fortran file. Call one fread per write. Parameter : * fileObject, eg: fileObject = open("data.bin",'rb') * arrayAttributs = ( (shape1,dtype1,readorskip1), (shape2,dtype2,readorskip2) ...) 
    * shape: eg: (100,200)
    * dtype: eg: 'f8' (big endian, double precision)
    * readorskip = [0|1]
        * 1: the array is read and returned
        * 0: the size of the array is skipped over (to read the
          array after it); the array isn't returned

    Examples (with write ux,uy,p in the Fortran code):
    * f = open("uxuyp.bin",'rb')
      nx,ny = 100,200
      p = readFortran( f, ((nx,ny),'
      [the rest of the examples and the opening of the function body
       were lost when an HTML attachment was scrubbed from the archive]
    """

    # (reconstructed) total size of the record in bytes, including the
    # two four-byte record markers -- the original lines were lost, so
    # this block is a best guess at what fed the check below
    recordBytes = 2*4
    for (shape,dtype,read) in arrayAttributs:
        count = 1
        for size in shape:
            count *= size
        recordBytes += count*numpy.dtype(dtype).itemsize

    # (reconstructed) bytes remaining in the file from this position
    here = fileObject.tell()
    fileObject.seek(0,2)
    fileSize = fileObject.tell() - here
    fileObject.seek(here)

    if recordBytes > fileSize:
        import logging
        logging.error('Too much data to be read in %r',fileObject.name)
        logging.error('File Size: %r',fileSize)
        logging.error('To be read: %r',recordBytes)
        NoneList = []
        for (shape,dtype,read) in arrayAttributs:
            if read:
                NoneList.append(None)
        return NoneList

    # skip the four bytes in the beginning of the record
    fileObject.seek(4,1)

    # read the arrays in the record
    arrays = []
    for (shape,dtype,read) in arrayAttributs:
        # number of elements to be read in this array
        count = 1
        for size in shape:
            count *= size
        if read:
            array = numpy.fromfile(fileObject, count=count,
                                   dtype=dtype).reshape(shape, order='F')
            arrays.append(array)
        else:
            dtype = numpy.dtype(dtype)
            arrayBytes = count*dtype.itemsize
            fileObject.seek(arrayBytes,1)

    # skip the four bytes at the end of the record
    fileObject.seek(4,1)

From david.froger.info at gmail.com Tue Feb 3 13:34:15 2009
From: david.froger.info at gmail.com (David Froger)
Date: Tue, 3 Feb 2009 19:34:15 +0100
Subject: [Numpy-discussion] example reading binary Fortran file
In-Reply-To:
References: <6df541a5c8d6ecf26b6e38f404958401.squirrel@webmail.uio.no> <200902031015.17490.faltet@pytables.org>
Message-ID:

The last line was missing:

    return arrays

From rmay31 at gmail.com Tue Feb 3 15:54:37 2009
From: rmay31 at gmail.com (Ryan May)
Date: Tue, 03 Feb 2009 14:54:37 -0600
Subject: [Numpy-discussion] Operations on masked items
Message-ID: <4988AF0D.60504@gmail.com>

Pierre,

I know you did some preliminary work on helping to make sure that doing
operations on masked arrays doesn't change the underlying data. I ran
into the following today.

import numpy as np
a = np.ma.array([1,2,3], mask=[False, True, False])
b = a * 10
c = 10 * a
print b.data # Prints [10 2 30] Good!
print c.data # Prints [10 10 30] Oops.

I tracked it down to __call__ on the _MaskedBinaryOperation class.
If there's a > mask on the data, you use: > > result = np.where(m, da, self.f(da, db, *args, **kwargs)) > > You can see that if a (and hence da) is a scalar, your masked values end up with > the value of the scalar. If this is getting too hairy to handle not touching > data, I understand. I just thought I should point out the inconsistency here. Well, I guess I hit send too soon. Here's one easy solution (consistent with what you did for __radd__), change the code for __rmul__ to do: return multiply(self, other) instead of: return multiply(other, self) That fixes it for me, and I don't see how it would break anything. Ryan -- Ryan May Graduate Research Assistant School of Meteorology University of Oklahoma From pgmdevlist at gmail.com Tue Feb 3 16:19:16 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Tue, 3 Feb 2009 16:19:16 -0500 Subject: [Numpy-discussion] Operations on masked items In-Reply-To: <4988B06C.3020504@gmail.com> References: <4988AF0D.60504@gmail.com> <4988B06C.3020504@gmail.com> Message-ID: On Feb 3, 2009, at 4:00 PM, Ryan May wrote: > > Well, I guess I hit send too soon. Here's one easy solution > (consistent with > what you did for __radd__), change the code for __rmul__ to do: > > return multiply(self, other) > > instead of: > > return multiply(other, self) > > That fixes it for me, and I don't see how it would break anything. Good call, but once again: "Thou shalt not put trust in ye masked values [1]". >>> a = np.ma.array([1,2,3],mask=[0,1,0]) >>> b = np.ma.array([10, 20, 30], mask=[0,1,0]) >>> (a*b).data array([10, 2, 90]) >>> (b*a).data array([10, 20, 90]) So yes, __mul__ is not commutative when you deal w/ masked arrays (at least, when you try to access the data under a mask). Nothing I can do. Remember that preventing the underlying data to be modified is NEVER guaranteed... [1] Epistle of Paul (Dubois). From rmay31 at gmail.com Tue Feb 3 18:00:42 2009 From: rmay31 at gmail.com (Ryan May) Date: Tue, 03 Feb 2009 17:00:42 -0600 Subject: [Numpy-discussion] Operations on masked items In-Reply-To: References: <4988AF0D.60504@gmail.com> <4988B06C.3020504@gmail.com> Message-ID: <4988CC9A.3060308@gmail.com> Pierre GM wrote: > On Feb 3, 2009, at 4:00 PM, Ryan May wrote: >> Well, I guess I hit send too soon. Here's one easy solution >> (consistent with >> what you did for __radd__), change the code for __rmul__ to do: >> >> return multiply(self, other) >> >> instead of: >> >> return multiply(other, self) >> >> That fixes it for me, and I don't see how it would break anything. > > Good call, but once again: "Thou shalt not put trust in ye masked > values [1]". > > >>> a = np.ma.array([1,2,3],mask=[0,1,0]) > >>> b = np.ma.array([10, 20, 30], mask=[0,1,0]) > >>> (a*b).data > array([10, 2, 90]) > >>> (b*a).data > array([10, 20, 90]) > > So yes, __mul__ is not commutative when you deal w/ masked arrays (at > least, when you try to access the data under a mask). Nothing I can > do. Remember that preventing the underlying data to be modified is > NEVER guaranteed... Fair enough. Ryan -- Ryan May Graduate Research Assistant School of Meteorology University of Oklahoma From ellisonbg.net at gmail.com Tue Feb 3 19:00:05 2009 From: ellisonbg.net at gmail.com (Brian Granger) Date: Tue, 3 Feb 2009 16:00:05 -0800 Subject: [Numpy-discussion] Few minor issues with numscons Message-ID: <6ce0ac130902031600m7a056b09rfd3069da3e8df028@mail.gmail.com> David, I am trying to use numscons to build a project and am running into some problems: Two smaller issues and one show stopper. 
First, the smaller ones:

* The web presence of numscons is currently very confusing. There are
a couple of locations with info about it, but the most prominent ones
appear to be quite outdated:

http://projects.scipy.org/scipy/numpy/wiki/NumScons

...refers to...

https://code.launchpad.net/numpy.scons.support

which is no longer being used and doesn't have any links to the most
recent versions. I had to hunt for a while to find the development
repo, which is on github, and the most recent release, which is now at
pypi. It is probably a good idea to update these locations so as not
to confuse new folks (or old folks too).

* The scons/scons-local subdir is not installed when running python
setup.py install. I had to use python setupegg.py to get this to
install in the right place.

I will send a different email in a second about the show stopper as it
is a bigger topic related to a numpy bug.

Cheers,

Brian

From ellisonbg.net at gmail.com Tue Feb 3 19:12:52 2009
From: ellisonbg.net at gmail.com (Brian Granger)
Date: Tue, 3 Feb 2009 16:12:52 -0800
Subject: [Numpy-discussion] numscons/numpy.distutils bug related to MACOSX_DEPLOYMENT_TARGET
Message-ID: <6ce0ac130902031612t376a1a59n431a08c899b3a517@mail.gmail.com>

I am trying to use numscons to build a project and have run into a
show stopper. I am using:

OS X 10.5
The builtin Python 2.5.2

Here is what I see upon running python setup.py scons:

scons: Reading SConscript files ...
> DistutilsPlatformError: $MACOSX_DEPLOYMENT_TARGET mismatch: now "10.3" > but "10.5" during configure: > File "/Users/bgranger/Library/Python/2.5/src/numscons/tests/examples/checkers/SConstruct", > line 2: > GetInitEnvironment(ARGUMENTS).DistutilsSConscript('SConscript') > File "/Users/bgranger/Library/Python/2.5/site-packages/numscons-0.9.4-py2.5.egg/numscons/core/numpyenv.py", > line 108: > [this goes on for a while] > > This bug is one that I am familiar with. Here is a sketch: > > * numpy.distutils sets MACOSX_DEPLOYMENT_TARGET=10.3 if > MACOSX_DEPLOYMENT_TARGET is not set in the environment. > > * But, the built-in Python on OS X 10.5 has > MACOSX_DEPLOYMENT_TARGET=10.5. When Python is built, it saves this > info in a file. > > * When called distutils checks to make sure that the current value of > MACOSX_DEPLOYMENT_TARGET matches the one that was used to build > Python. > > Hence the mismatch. I am pretty sure that the offending code is in: > > numpy.distutils.fcompiler.gnu.get_flags_linker_so > > I think I know how to fix this and will get started on it, but I > wanted to see if anyone else had any experience with this or knew > another way around this. Well, the workaround is to set MACOSX_DEPLOYMENT_TARGET=10.5 in your environment. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From ellisonbg.net at gmail.com Tue Feb 3 19:20:54 2009 From: ellisonbg.net at gmail.com (Brian Granger) Date: Tue, 3 Feb 2009 16:20:54 -0800 Subject: [Numpy-discussion] numscons/numpy.distutils bug related to MACOSX_DEPLOYMENT_TARGET In-Reply-To: <3d375d730902031617t43726c8as3b9e8d46583f2152@mail.gmail.com> References: <6ce0ac130902031612t376a1a59n431a08c899b3a517@mail.gmail.com> <3d375d730902031617t43726c8as3b9e8d46583f2152@mail.gmail.com> Message-ID: <6ce0ac130902031620t782b2387hbd2de5e8e0ced580@mail.gmail.com> Robert, Thanks. Yes, I just saw that this will work. When I fixed this in Cython a while back this workaround wouldn't work. Would you still consider this a bug? The logic to fix it is fairly simply. Brian On Tue, Feb 3, 2009 at 4:17 PM, Robert Kern wrote: > On Tue, Feb 3, 2009 at 18:12, Brian Granger wrote: >> I am trying to use numscons to build a project and have run into a >> show stopper. I am using: >> >> OS X 10.5 >> The builtin Python 2.5.2 >> >> Here is what I see upon running python setup.py scons: >> >> scons: Reading SConscript files ... >> DistutilsPlatformError: $MACOSX_DEPLOYMENT_TARGET mismatch: now "10.3" >> but "10.5" during configure: >> File "/Users/bgranger/Library/Python/2.5/src/numscons/tests/examples/checkers/SConstruct", >> line 2: >> GetInitEnvironment(ARGUMENTS).DistutilsSConscript('SConscript') >> File "/Users/bgranger/Library/Python/2.5/site-packages/numscons-0.9.4-py2.5.egg/numscons/core/numpyenv.py", >> line 108: >> [this goes on for a while] >> >> This bug is one that I am familiar with. Here is a sketch: >> >> * numpy.distutils sets MACOSX_DEPLOYMENT_TARGET=10.3 if >> MACOSX_DEPLOYMENT_TARGET is not set in the environment. >> >> * But, the built-in Python on OS X 10.5 has >> MACOSX_DEPLOYMENT_TARGET=10.5. When Python is built, it saves this >> info in a file. >> >> * When called distutils checks to make sure that the current value of >> MACOSX_DEPLOYMENT_TARGET matches the one that was used to build >> Python. >> >> Hence the mismatch. 
I am pretty sure that the offending code is in: >> >> numpy.distutils.fcompiler.gnu.get_flags_linker_so >> >> I think I know how to fix this and will get started on it, but I >> wanted to see if anyone else had any experience with this or knew >> another way around this. > > Well, the workaround is to set MACOSX_DEPLOYMENT_TARGET=10.5 in your > environment. > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." > -- Umberto Eco > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From robert.kern at gmail.com Tue Feb 3 19:23:42 2009 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 3 Feb 2009 18:23:42 -0600 Subject: [Numpy-discussion] numscons/numpy.distutils bug related to MACOSX_DEPLOYMENT_TARGET In-Reply-To: <6ce0ac130902031620t782b2387hbd2de5e8e0ced580@mail.gmail.com> References: <6ce0ac130902031612t376a1a59n431a08c899b3a517@mail.gmail.com> <3d375d730902031617t43726c8as3b9e8d46583f2152@mail.gmail.com> <6ce0ac130902031620t782b2387hbd2de5e8e0ced580@mail.gmail.com> Message-ID: <3d375d730902031623p6171dd09gbcc9bf75a793c8b1@mail.gmail.com> On Tue, Feb 3, 2009 at 18:20, Brian Granger wrote: > Robert, > > Thanks. > > Yes, I just saw that this will work. When I fixed this in Cython a > while back this workaround wouldn't work. Would you still consider > this a bug? The logic to fix it is fairly simply. What is the fix you are thinking of? -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From Catherine.M.Moroney at jpl.nasa.gov Tue Feb 3 19:27:12 2009 From: Catherine.M.Moroney at jpl.nasa.gov (Catherine Moroney) Date: Tue, 3 Feb 2009 16:27:12 -0800 Subject: [Numpy-discussion] reading binary Fortran Message-ID: <8674E2D1-D119-4885-9686-36EC666CE9EF@jpl.nasa.gov> I've noticed a lot of discussion on how to read binary files written out from Fortran, and nobody seems to have mentioned how to modify your Fortran code so it writes out a file that can be read with numpy.fromfile() in a single line. For example, to write out a NLINE x NSMP array of floats in Fortran in a line by line fashion, the code below works just fine. open(iunit,file=flname,form='unformatted',access='direct', & recl=nsmp*4,status='replace',action='write') do iline=1,nline write(iunit,rec=iline) (data(iline,ismp),ismp=1,nsmp) end do close(iunit) It's more code than simply "write(iunit) data", but it has the advantage of being easily imported into python, matlab and other software packages that can read flat binary data. To import a file written out in this fashion into numpy, a single call to numpy.fromfile(flname) works with no need to define datatypes or the like. Apologies if I'm missing some reason why the above doesn't work or is not preferable to the "write(iunit) data". 
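[A sketch of the matching read for a file written by the direct-access
code above, assuming nline/nsmp are known and writer and reader share
endianness. One more hedge: some compilers, e.g. ifort, measure recl
in 4-byte words rather than bytes unless told otherwise:]

import numpy as np

nline, nsmp = 100, 200   # assumed known from elsewhere
data = np.fromfile('flname', dtype=np.float32).reshape(nline, nsmp)
# for a big-endian writer, read with dtype='>f4' instead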
Catherine From ellisonbg.net at gmail.com Tue Feb 3 19:34:48 2009 From: ellisonbg.net at gmail.com (Brian Granger) Date: Tue, 3 Feb 2009 16:34:48 -0800 Subject: [Numpy-discussion] numscons/numpy.distutils bug related to MACOSX_DEPLOYMENT_TARGET In-Reply-To: <3d375d730902031623p6171dd09gbcc9bf75a793c8b1@mail.gmail.com> References: <6ce0ac130902031612t376a1a59n431a08c899b3a517@mail.gmail.com> <3d375d730902031617t43726c8as3b9e8d46583f2152@mail.gmail.com> <6ce0ac130902031620t782b2387hbd2de5e8e0ced580@mail.gmail.com> <3d375d730902031623p6171dd09gbcc9bf75a793c8b1@mail.gmail.com> Message-ID: <6ce0ac130902031634ye40a829gbac28740fa281231@mail.gmail.com> > What is the fix you are thinking of? This is how Cython currently handles this logic. This would have to be modified to include the additional case of a user setting MACOSX_DEPLOYMENT_TARGET in their environment, but that logic is already in numpy.distutils.fcompiler.gnu.get_flags_linker_so This is really just a special case for 1) OS X 10.5 and 2) built-in Python. # MACOSX_DEPLOYMENT_TARGET can be set to 10.3 in most cases. # But for the built-in Python 2.5.1 on Leopard, it needs to be set for 10.5. # This looks like a bug that will be fixed in 2.5.2. If Apple updates their # Python to 2.5.2, this fix should be OK. import distutils.sysconfig as sc python_prefix = sc.get_config_var('prefix') leopard_python_prefix = '/System/Library/Frameworks/Python.framework/Versions/2.5' full_version = "%s.%s.%s" % sys.version_info[:3] if python_prefix == leopard_python_prefix and full_version == '2.5.1': os.environ["MACOSX_DEPLOYMENT_TARGET"] = "10.5" else: os.environ["MACOSX_DEPLOYMENT_TARGET"] = "10.3" From robert.kern at gmail.com Tue Feb 3 19:42:31 2009 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 3 Feb 2009 18:42:31 -0600 Subject: [Numpy-discussion] numscons/numpy.distutils bug related to MACOSX_DEPLOYMENT_TARGET In-Reply-To: <6ce0ac130902031634ye40a829gbac28740fa281231@mail.gmail.com> References: <6ce0ac130902031612t376a1a59n431a08c899b3a517@mail.gmail.com> <3d375d730902031617t43726c8as3b9e8d46583f2152@mail.gmail.com> <6ce0ac130902031620t782b2387hbd2de5e8e0ced580@mail.gmail.com> <3d375d730902031623p6171dd09gbcc9bf75a793c8b1@mail.gmail.com> <6ce0ac130902031634ye40a829gbac28740fa281231@mail.gmail.com> Message-ID: <3d375d730902031642o65baa95ciebde9b5cc8f8b800@mail.gmail.com> On Tue, Feb 3, 2009 at 18:34, Brian Granger wrote: >> What is the fix you are thinking of? > > This is how Cython currently handles this logic. This would have to > be modified to include the additional case of a user setting > MACOSX_DEPLOYMENT_TARGET in their environment, but that logic is > already in numpy.distutils.fcompiler.gnu.get_flags_linker_so > > This is really just a special case for 1) OS X 10.5 and 2) built-in Python. > > # MACOSX_DEPLOYMENT_TARGET can be set to 10.3 in most cases. > # But for the built-in Python 2.5.1 on Leopard, it needs to be set for 10.5. > # This looks like a bug that will be fixed in 2.5.2. If Apple updates their > # Python to 2.5.2, this fix should be OK. 
> import distutils.sysconfig as sc > python_prefix = sc.get_config_var('prefix') > leopard_python_prefix = > '/System/Library/Frameworks/Python.framework/Versions/2.5' > full_version = "%s.%s.%s" % sys.version_info[:3] > if python_prefix == leopard_python_prefix and full_version == '2.5.1': > os.environ["MACOSX_DEPLOYMENT_TARGET"] = "10.5" > else: > os.environ["MACOSX_DEPLOYMENT_TARGET"] = "10.3" Hmm, that's still going to break for any custom build that decides to build Python with a specific MACOSX_DEPLOYMENT_TARGET. If you're going to fix it at all, it should default to the value in the Makefile that sysconfig is going to check against. The relevant code to copy is in sysconfig._init_posix(). -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From ellisonbg.net at gmail.com Tue Feb 3 19:53:14 2009 From: ellisonbg.net at gmail.com (Brian Granger) Date: Tue, 3 Feb 2009 16:53:14 -0800 Subject: [Numpy-discussion] numscons/numpy.distutils bug related to MACOSX_DEPLOYMENT_TARGET In-Reply-To: <3d375d730902031642o65baa95ciebde9b5cc8f8b800@mail.gmail.com> References: <6ce0ac130902031612t376a1a59n431a08c899b3a517@mail.gmail.com> <3d375d730902031617t43726c8as3b9e8d46583f2152@mail.gmail.com> <6ce0ac130902031620t782b2387hbd2de5e8e0ced580@mail.gmail.com> <3d375d730902031623p6171dd09gbcc9bf75a793c8b1@mail.gmail.com> <6ce0ac130902031634ye40a829gbac28740fa281231@mail.gmail.com> <3d375d730902031642o65baa95ciebde9b5cc8f8b800@mail.gmail.com> Message-ID: <6ce0ac130902031653x2a304265t41b3c4f70bc257ef@mail.gmail.com> > Hmm, that's still going to break for any custom build that decides to > build Python with a specific MACOSX_DEPLOYMENT_TARGET. If you're going > to fix it at all, it should default to the value in the Makefile that > sysconfig is going to check against. The relevant code to copy is in > sysconfig._init_posix(). Yes, I agree that sysconfig._init_posix() has the proper logic for this. This logic should also be applied to Cython as well probably. Would you say that the proper fix then is to inspect the Makefile and set MACOSX_DEPLOYMENT_TARGET to the valued used to build Python itself. Or should we still try to set it to 10.3 in some cases (like the current numpy.distutils does) or look at the environment as well? Cheers, Brian From robert.kern at gmail.com Tue Feb 3 19:58:51 2009 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 3 Feb 2009 18:58:51 -0600 Subject: [Numpy-discussion] numscons/numpy.distutils bug related to MACOSX_DEPLOYMENT_TARGET In-Reply-To: <6ce0ac130902031653x2a304265t41b3c4f70bc257ef@mail.gmail.com> References: <6ce0ac130902031612t376a1a59n431a08c899b3a517@mail.gmail.com> <3d375d730902031617t43726c8as3b9e8d46583f2152@mail.gmail.com> <6ce0ac130902031620t782b2387hbd2de5e8e0ced580@mail.gmail.com> <3d375d730902031623p6171dd09gbcc9bf75a793c8b1@mail.gmail.com> <6ce0ac130902031634ye40a829gbac28740fa281231@mail.gmail.com> <3d375d730902031642o65baa95ciebde9b5cc8f8b800@mail.gmail.com> <6ce0ac130902031653x2a304265t41b3c4f70bc257ef@mail.gmail.com> Message-ID: <3d375d730902031658l2b727ad2t3cfbaff32865e013@mail.gmail.com> On Tue, Feb 3, 2009 at 18:53, Brian Granger wrote: >> Hmm, that's still going to break for any custom build that decides to >> build Python with a specific MACOSX_DEPLOYMENT_TARGET. If you're going >> to fix it at all, it should default to the value in the Makefile that >> sysconfig is going to check against. 
>> The relevant code to copy is in
>> sysconfig._init_posix().
>
> Yes, I agree that sysconfig._init_posix() has the proper logic for
> this. This logic should also be applied to Cython as well probably.
>
> Would you say that the proper fix then is to inspect the Makefile and
> set MACOSX_DEPLOYMENT_TARGET to the value used to build Python
> itself? Or should we still try to set it to 10.3 in some cases (like
> the current numpy.distutils does) or look at the environment as well?

1) Trust the environment variable if given and let distutils raise its
error message (why not raise it ourselves? distutils' error message
and explanation is already out in THE GOOGLE.)

2) Otherwise, use the value in the Makefile if it's there.

3) If it's not even in the Makefile for whatever reason, go with 10.3.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
-- Umberto Eco

From charlesr.harris at gmail.com Tue Feb 3 20:09:52 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Tue, 3 Feb 2009 18:09:52 -0700
Subject: [Numpy-discussion] array vector elementwise multiplication
In-Reply-To: <6A3525EC-A389-4DDF-A68C-BB8455417916@math.toronto.edu>
References: <6A3525EC-A389-4DDF-A68C-BB8455417916@math.toronto.edu>
Message-ID:

On Tue, Feb 3, 2009 at 10:19 AM, Gideon Simpson wrote:

> I have an M x N matrix A and two vectors, an M dimensional vector x
> and an N dimensional vector y. I would like to be able to do two
> things.
>
> 1. Multiply, elementwise, every column of A by x
>
> 2. Multiply, elementwise, every row of A by y.
>
> What's the "quick" way to do this in numpy?

In [1]: M = ones((3,3))

In [2]: x = arange(3)

In [3]: M*x
Out[3]:
array([[ 0., 1., 2.],
       [ 0., 1., 2.],
       [ 0., 1., 2.]])

In [4]: x[:,newaxis]*M
Out[4]:
array([[ 0., 0., 0.],
       [ 1., 1., 1.],
       [ 2., 2., 2.]])

Chuck

From cournape at gmail.com Tue Feb 3 20:13:02 2009
From: cournape at gmail.com (David Cournapeau)
Date: Wed, 4 Feb 2009 10:13:02 +0900
Subject: [Numpy-discussion] Few minor issues with numscons
In-Reply-To: <6ce0ac130902031600m7a056b09rfd3069da3e8df028@mail.gmail.com>
References: <6ce0ac130902031600m7a056b09rfd3069da3e8df028@mail.gmail.com>
Message-ID: <5b8d13220902031713n68524e14g1608ca9156972d7d@mail.gmail.com>

Hi Brian,

On Wed, Feb 4, 2009 at 9:00 AM, Brian Granger wrote:
> David,
>
> I am trying to use numscons to build a project and am running into
> some problems:
>
> Two smaller issues and one show stopper. First, the smaller ones:
>
> * The web presence of numscons is currently very confusing. There are
> a couple of locations with info about it, but the most prominent ones
> appear to be quite outdated:
>
> http://projects.scipy.org/scipy/numpy/wiki/NumScons
>
> ...refers to...
>
> https://code.launchpad.net/numpy.scons.support
>
> which is no longer being used and doesn't have any links to the most
> recent versions. I had to hunt for a while to find the development
> repo, which is on github, and the most recent release, which is now at
> pypi.

The releases are on Pypi for quite some time. I converted the repo to
git and put it on github, but I have not really worked on numscons for
several months now for lack of time ( and because numscons it mostly
"done" and the main limitations of numscons are not fixable without
fixing some fairly major scons limitations).
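[A sketch of the three-step MACOSX_DEPLOYMENT_TARGET fallback Robert
lists above. The helper name is illustrative, not numpy.distutils API,
and the mismatch check itself stays with distutils:]

import os
from distutils import sysconfig

def default_deployment_target():
    # 1) trust the environment if the user set it
    target = os.environ.get('MACOSX_DEPLOYMENT_TARGET')
    if not target:
        # 2) otherwise use the value recorded in Python's Makefile
        target = sysconfig.get_config_var('MACOSX_DEPLOYMENT_TARGET')
    # 3) last resort: the old 10.3 default
    return target or '10.3'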
Basically, the repo on github is only the conversion from bzr, without
any new features.

> It is probably a good idea to update these locations so as not to
> confuse new folks (or old folks too).
>
> * The scons/scons-local subdir is not installed when running python
> setup.py install. I had to use python setupegg.py to get this to
> install in the right place.

That's strange, you are not the first one to mention this bug, but I
never had this problem myself - I myself never use setupegg except
when creating eggs for pypi.

cheers,

David

From cournape at gmail.com Tue Feb 3 21:04:25 2009
From: cournape at gmail.com (David Cournapeau)
Date: Wed, 4 Feb 2009 11:04:25 +0900
Subject: [Numpy-discussion] Numpy 1.3 release date ?
In-Reply-To:
References:
Message-ID: <5b8d13220902031804w6cb1bf98q68a5ae627700076a@mail.gmail.com>

On Tue, Feb 3, 2009 at 11:49 PM, Pierre GM wrote:
> All,
> When can we expect numpy 1.3 to be released ?

Looking at the log from 1.2.x, the main committers for *numpy* were
Charles Harris, you and me (that makes ~80% of the commits). Then,
there is the doc itself, which has seen major work I believe, although
I have not followed much.

Talking about the things I have worked on for 1.3, I did not have the
time to finish everything. The things which I would like to be done
before a 1.3.0 release are:
- fix formatting issues that Pauli and me worked on (for locale
independence and all): I believe it is mostly a matter of merging the
changes in the trunk, and testing. Pauli, do you agree ?
- make sure 1.3.0 builds and runs fine on 2.6, in particular on
windows.
- see if one can support windows x64.

I think official 2.6 support (with binaries for the platforms where we
support binaries), x64 support and everything which has been done
already would be enough to make a release.

David

From ellisonbg.net at gmail.com Wed Feb 4 00:21:48 2009
From: ellisonbg.net at gmail.com (Brian Granger)
Date: Tue, 3 Feb 2009 21:21:48 -0800
Subject: [Numpy-discussion] Few minor issues with numscons
In-Reply-To: <5b8d13220902031713n68524e14g1608ca9156972d7d@mail.gmail.com>
References: <6ce0ac130902031600m7a056b09rfd3069da3e8df028@mail.gmail.com> <5b8d13220902031713n68524e14g1608ca9156972d7d@mail.gmail.com>
Message-ID: <6ce0ac130902032121kf6dc4b1rba48401d5e5ba3@mail.gmail.com>

> The releases are on Pypi for quite some time. I converted the repo to
> git and put it on github, but I have not really worked on numscons for
> several months now for lack of time ( and because numscons it mostly
> "done" and the main limitations of numscons are not fixable without
> fixing some fairly major scons limitations).
>
> Basically, the repo on github is only the conversion from bzr, without
> any new features.

Do you have plans to continue to maintain it though?

One other thing I forgot to mention. I first tried the head of your
git repo and numpy complained that the version of numscons (0.10) was
too *old*. It wanted the version to be greater than something like
0.9.1, and clearly it was, so it looks like there is a bug in the
numscons version parsing in numpy.distutils.

>> * The scons/scons-local subdir is not installed when running python
>> setup.py install. I had to use python setupegg.py to get this to
>> install in the right place.

> That's strange, you are not the first one to mention this bug, but I
> never had this problem myself - I myself never use setupegg except
> when creating eggs for pypi.

OK, I will have a look at this further to see what the cause is.
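[For what it's worth, plain string comparison would explain the version
check bug Brian mentions above, since '0.10' sorts before '0.9.1'
lexically -- a sketch:]

from distutils.version import LooseVersion

print '0.10' > '0.9.1'                              # False: compares '1' < '9'
print LooseVersion('0.10') > LooseVersion('0.9.1')  # True: compares 10 > 9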
Thanks for a great package that solves a really painful set of issues!

Brian

> cheers,
>
> David
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>

From ellisonbg.net at gmail.com Wed Feb 4 00:22:25 2009
From: ellisonbg.net at gmail.com (Brian Granger)
Date: Tue, 3 Feb 2009 21:22:25 -0800
Subject: [Numpy-discussion] numscons/numpy.distutils bug related to MACOSX_DEPLOYMENT_TARGET
In-Reply-To: <3d375d730902031658l2b727ad2t3cfbaff32865e013@mail.gmail.com>
References: <6ce0ac130902031612t376a1a59n431a08c899b3a517@mail.gmail.com> <3d375d730902031617t43726c8as3b9e8d46583f2152@mail.gmail.com> <6ce0ac130902031620t782b2387hbd2de5e8e0ced580@mail.gmail.com> <3d375d730902031623p6171dd09gbcc9bf75a793c8b1@mail.gmail.com> <6ce0ac130902031634ye40a829gbac28740fa281231@mail.gmail.com> <3d375d730902031642o65baa95ciebde9b5cc8f8b800@mail.gmail.com> <6ce0ac130902031653x2a304265t41b3c4f70bc257ef@mail.gmail.com> <3d375d730902031658l2b727ad2t3cfbaff32865e013@mail.gmail.com>
Message-ID: <6ce0ac130902032122i1ba31be3y9c2d034ee0a1d2fc@mail.gmail.com>

> 1) Trust the environment variable if given and let distutils raise its
> error message (why not raise it ourselves? distutils' error message
> and explanation is already out in THE GOOGLE.)
>
> 2) Otherwise, use the value in the Makefile if it's there.
>
> 3) If it's not even in the Makefile for whatever reason, go with 10.3.

Sounds good, do you want me to work up a patch?

Brian

> --
> Robert Kern
>
> "I have come to believe that the whole world is an enigma, a harmless
> enigma that is made terrible by our own mad attempt to interpret it as
> though it had an underlying truth."
> -- Umberto Eco
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>

From robert.kern at gmail.com Wed Feb 4 00:32:01 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 3 Feb 2009 23:32:01 -0600
Subject: [Numpy-discussion] numscons/numpy.distutils bug related to MACOSX_DEPLOYMENT_TARGET
In-Reply-To: <6ce0ac130902032122i1ba31be3y9c2d034ee0a1d2fc@mail.gmail.com>
References: <6ce0ac130902031612t376a1a59n431a08c899b3a517@mail.gmail.com> <3d375d730902031617t43726c8as3b9e8d46583f2152@mail.gmail.com> <6ce0ac130902031620t782b2387hbd2de5e8e0ced580@mail.gmail.com> <3d375d730902031623p6171dd09gbcc9bf75a793c8b1@mail.gmail.com> <6ce0ac130902031634ye40a829gbac28740fa281231@mail.gmail.com> <3d375d730902031642o65baa95ciebde9b5cc8f8b800@mail.gmail.com> <6ce0ac130902031653x2a304265t41b3c4f70bc257ef@mail.gmail.com> <3d375d730902031658l2b727ad2t3cfbaff32865e013@mail.gmail.com> <6ce0ac130902032122i1ba31be3y9c2d034ee0a1d2fc@mail.gmail.com>
Message-ID: <3d375d730902032132l33417bedia7dacf08f4e7996e@mail.gmail.com>

On Tue, Feb 3, 2009 at 23:22, Brian Granger wrote:
>> 1) Trust the environment variable if given and let distutils raise its
>> error message (why not raise it ourselves? distutils' error message
>> and explanation is already out in THE GOOGLE.)
>>
>> 2) Otherwise, use the value in the Makefile if it's there.
>>
>> 3) If it's not even in the Makefile for whatever reason, go with 10.3.
>
> Sounds good, do you want me to work up a patch?

Yes, please.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
-- Umberto Eco

From scott.sinclair.za at gmail.com Wed Feb 4 01:06:31 2009
From: scott.sinclair.za at gmail.com (Scott Sinclair)
Date: Wed, 4 Feb 2009 08:06:31 +0200
Subject: [Numpy-discussion] Numpy 1.3 release date ?
In-Reply-To: <5b8d13220902031804w6cb1bf98q68a5ae627700076a@mail.gmail.com>
References: <5b8d13220902031804w6cb1bf98q68a5ae627700076a@mail.gmail.com>
Message-ID: <6a17e9ee0902032206n2c0f4d30o70dcb02e18a4aa6d@mail.gmail.com>

> 2009/2/4 David Cournapeau :
> On Tue, Feb 3, 2009 at 11:49 PM, Pierre GM wrote:
>> All,
>> When can we expect numpy 1.3 to be released ?
>
> I think official 2.6 support (with binaries for the platforms where we
> support binaries), x64 support and everything which has been done
> already would be enough to make a release.

There are a bunch of documentation patches that should be reviewed and applied to SVN before the release (especially those marked 'Needs review' or better).

http://docs.scipy.org/numpy/patch/

Cheers,
Scott

From nadavh at visionsense.com Wed Feb 4 05:55:06 2009
From: nadavh at visionsense.com (Nadav Horesh)
Date: Wed, 04 Feb 2009 12:55:06 +0200
Subject: [Numpy-discussion] Error building numpy documentation
Message-ID: <1233744906.25601.9.camel@nadav.envision.co.il>

I just downloaded the latest numpy svn version and tried to build its documentation with

$ make latex

in the doc subdirectory, and got the following error message:

writing... Sphinx error:
too many nesting section levels for LaTeX, at heading: numpy.ma.MaskedArray.__lt__
make: *** [latex] Error 1

Machine: 64 bit gentoo linux running texlive2007

Any ideas?

  Nadav

From pav at iki.fi Wed Feb 4 04:04:38 2009
From: pav at iki.fi (Pauli Virtanen)
Date: Wed, 4 Feb 2009 09:04:38 +0000 (UTC)
Subject: [Numpy-discussion] Numpy 1.3 release date ?
References: <5b8d13220902031804w6cb1bf98q68a5ae627700076a@mail.gmail.com>
Message-ID:

Wed, 04 Feb 2009 11:04:25 +0900, David Cournapeau wrote:
[clip]
> Talking about the things I have worked on for 1.3, I did not have the
> time to finish everything. The things which I would like to be done
> before a 1.3.0 release are:
> - fix the formatting issues that Pauli and I worked on (for locale
> independence and all): I believe it is mostly a matter of merging the
> changes in the trunk, and testing. Pauli, do you agree?

Agreed, I think it should be OK, and IIRC the tests passed on Windows as well (well, Wine, to be accurate). I don't recall anything more that would be required.

It might be nice also to address this wart:

    http://scipy.org/scipy/numpy/ticket/883

for Numpy 1.3. But it's a separate issue from the locale and formatting problems.

    Pauli

From david at ar.media.kyoto-u.ac.jp Wed Feb 4 03:57:53 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Wed, 04 Feb 2009 17:57:53 +0900
Subject: [Numpy-discussion] Few minor issues with numscons
In-Reply-To: <6ce0ac130902032121kf6dc4b1rba48401d5e5ba3@mail.gmail.com>
References: <6ce0ac130902031600m7a056b09rfd3069da3e8df028@mail.gmail.com>
	<5b8d13220902031713n68524e14g1608ca9156972d7d@mail.gmail.com>
	<6ce0ac130902032121kf6dc4b1rba48401d5e5ba3@mail.gmail.com>
Message-ID: <49895891.5020705@ar.media.kyoto-u.ac.jp>

Brian Granger wrote:
>> The releases have been on PyPI for quite some time. I converted the repo to
>> git and put it on github, but I have not really worked on numscons for
>> several months now for lack of time (and because numscons is mostly
>> "done" and the main limitations of numscons are not fixable without
>> fixing some fairly major scons limitations).
>>
>> Basically, the repo on github is only the conversion from bzr, without
>> any new features.
>>
>
> Do you have plans to continue to maintain it though?
>

I can't say for sure; I am still quite interested in working on build/distribution problems in numpy and more generally python. But in my mind, this work only really makes sense if it becomes integrated into numpy.distutils and can even be used by arbitrary python extensions at some point. And that's just not realistic at this point because of some fundamental scons limitations (I would like scons to be callable from a pure python script, as a library). So I spent quite some time on scons itself (and more recently waf, which started as a scons fork). There is also the fundamental time problem, but we are all in the same boat on this one :)

> One other thing I forgot to mention. I first tried the head of your
> git repo and numpy complained that the version of numscons (0.10) was
> too *old*. It wanted the version to be greater than something like
> 0.9.1, and clearly it was, so it looks like there is a bug in the
> numscons version parsing in numpy.distutils.
>

It could be a stupid bug in the version check. It should be ok with numpy trunk, though.

David

From william at resolversystems.com Wed Feb 4 05:11:21 2009
From: william at resolversystems.com (William Reade)
Date: Wed, 04 Feb 2009 10:11:21 +0000
Subject: [Numpy-discussion] Resolver One -- .NET spreadsheet with NumPy -- beta testers?
Message-ID: <498969C9.1070306@resolversystems.com>

Hi all,

We're about to release the first public beta of Resolver One, our Pythonic spreadsheet, with (rather basic) NumPy integration. It requires 32-bit Windows and .NET 2.0SP1 to run, so it may not be appropriate for everyone, but -- if anyone is interested in playing with it -- it would be really useful to have some real live NumPy users telling us what we could improve.

If anyone here would like to try it out, please let me know, and I'll arrange beta access.

Cheers
William

http://resolversystems.com/

From scott.sinclair.za at gmail.com Wed Feb 4 05:15:14 2009
From: scott.sinclair.za at gmail.com (Scott Sinclair)
Date: Wed, 4 Feb 2009 12:15:14 +0200
Subject: [Numpy-discussion] Error building numpy documentation
In-Reply-To: <1233744906.25601.9.camel@nadav.envision.co.il>
References: <1233744906.25601.9.camel@nadav.envision.co.il>
Message-ID: <6a17e9ee0902040215g619e605fkb1d6a401484c854c@mail.gmail.com>

> 2009/2/4 Nadav Horesh :
> I just downloaded the latest numpy svn version and tried to build its
> documentation with
>
> $ make latex
>
> in the doc subdirectory, and got the following error message:
>
> writing... Sphinx error:
> too many nesting section levels for LaTeX, at heading:
> numpy.ma.MaskedArray.__lt__
> make: *** [latex] Error 1
>
> Machine: 64 bit gentoo linux running texlive2007
>
> Any ideas?

No ideas, but this has been reported before:
http://projects.scipy.org/pipermail/numpy-discussion/2009-January/039917.html

I've filed a ticket:
http://scipy.org/scipy/numpy/ticket/998

Cheers,
Scott

From david at ar.media.kyoto-u.ac.jp Wed Feb 4 05:22:56 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Wed, 04 Feb 2009 19:22:56 +0900
Subject: [Numpy-discussion] Numpy 1.3 release date ?
In-Reply-To: <6a17e9ee0902032206n2c0f4d30o70dcb02e18a4aa6d@mail.gmail.com>
References: <5b8d13220902031804w6cb1bf98q68a5ae627700076a@mail.gmail.com>
	<6a17e9ee0902032206n2c0f4d30o70dcb02e18a4aa6d@mail.gmail.com>
Message-ID: <49896C80.9070904@ar.media.kyoto-u.ac.jp>

Scott Sinclair wrote:
>
> There are a bunch of documentation patches that should be reviewed
> and applied to SVN before the release (especially those marked 'Needs
> review' or better).
>
> http://docs.scipy.org/numpy/patch/

In my mind, documentation fixes are much less important than code, not because documentation matters less, but because it can be handled at the last moment much more easily - there is little chance that a doc change breaks on windows only, for example.

David

From scott.sinclair.za at gmail.com Wed Feb 4 06:04:11 2009
From: scott.sinclair.za at gmail.com (Scott Sinclair)
Date: Wed, 4 Feb 2009 13:04:11 +0200
Subject: [Numpy-discussion] Numpy 1.3 release date ?
In-Reply-To: <49896C80.9070904@ar.media.kyoto-u.ac.jp>
References: <5b8d13220902031804w6cb1bf98q68a5ae627700076a@mail.gmail.com>
	<6a17e9ee0902032206n2c0f4d30o70dcb02e18a4aa6d@mail.gmail.com>
	<49896C80.9070904@ar.media.kyoto-u.ac.jp>
Message-ID: <6a17e9ee0902040304o35b696cfgc75ba7ba3005dc5a@mail.gmail.com>

> 2009/2/4 David Cournapeau :
> Scott Sinclair wrote:
>>
>> There are a bunch of documentation patches that should be reviewed
>> and applied to SVN before the release (especially those marked 'Needs
>> review' or better).
>>
>> http://docs.scipy.org/numpy/patch/
>
> In my mind, documentation fixes are much less important than code, not
> because documentation matters less, but because it can be handled at the
> last moment much more easily - there is little chance that a doc change
> breaks on windows only, for example.

Sure. Just trying to encourage a few more reviews and documentation contributions.

Cheers,
Scott

From josef.pktd at gmail.com Wed Feb 4 11:00:36 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Wed, 4 Feb 2009 11:00:36 -0500
Subject: [Numpy-discussion] poly1d left versus right multiplication with np numbers
Message-ID: <1cd32cbb0902040800j4436b68dl690c90f7cf1e4c2@mail.gmail.com>

I just had a hard to find bug in my program. poly1d treats numpy scalars differently than python numbers when left or right multiplication is used.

Essentially, if the first term is the numpy scalar, multiplied by a polynomial, then the result is an np.array. If the order is reversed, then the result is an instance of np.poly1d. (The return types are also the same for numpy arrays, which is at least understandable, although a warning would be good.)

When using plain (python) numbers, both left and right multiplication of the number with the polynomial returns a polynomial.

Is this a bug or a feature? I didn't see it mentioned in the docs.

My problem for debugging was that in the examples I used python numbers while the program used numpy scalars, and it took me a while to figure out that this is the source of my bugs.
examples below Josef >>> polys [poly1d([1]), poly1d([-1., 0.]), poly1d([ 1., 0., -1.]), poly1d([ 1., 0., -3., 0.]), poly1d([ 1., 0., -6., 0., 3.])] np.array on left is fine >>> (polys[2]*np.array(0.5/6.0) + polys[3]*np.array(0.5/24.0)) poly1d([ 0.02083333, 0.08333333, -0.0625 , -0.08333333]) >>> (polys[2]*0.5/6.0 + polys[3]*0.5/24.0) poly1d([ 0.02083333, 0.08333333, -0.0625 , -0.08333333]) >>> (0.5/6.0*polys[2] + 0.5/24.0*polys[3]) poly1d([ 0.02083333, 0.08333333, -0.0625 , -0.08333333]) problems with np.array on left >>> (np.array(0.5/6.0)*polys[2] + np.array(0.5/24.0)*polys[3]) Traceback (most recent call last): File "", line 1, in (np.array(0.5/6.0)*polys[2] + np.array(0.5/24.0)*polys[3]) ValueError: shape mismatch: objects cannot be broadcast to a single shape >>> np.array(0.5/6.0)*polys[2] array([ 0.08333333, 0. , -0.08333333]) >>> polys[2]*np.array(0.5/6.0) poly1d([ 0.08333333, 0. , -0.08333333]) >>> 0.5/6.0*polys[2] poly1d([ 0.08333333, 0. , -0.08333333]) >>> np.array(0.5/6.0) array(0.083333333333333329) same with numpy scalar >>> np.array([0.5/6.0])[0]*polys[2] array([ 0.08333333, 0. , -0.08333333]) >>> np.array([0.5/6.0])[0] 0.083333333333333329 From pgmdevlist at gmail.com Wed Feb 4 11:19:38 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Wed, 4 Feb 2009 11:19:38 -0500 Subject: [Numpy-discussion] poly1d left versus right multiplication with np numbers In-Reply-To: <1cd32cbb0902040800j4436b68dl690c90f7cf1e4c2@mail.gmail.com> References: <1cd32cbb0902040800j4436b68dl690c90f7cf1e4c2@mail.gmail.com> Message-ID: <5164FDE2-A811-4FE4-A370-5939F85C7268@gmail.com> On Feb 4, 2009, at 11:00 AM, josef.pktd at gmail.com wrote: > I just had a hard to find bug in my program. poly1d treats numpy > scalars differently than python numbers when left or right > multiplication is used. > > Essentially, if the first term is the numpy scalar, multiplied by a > polynomial, then the result is an np.array. > If the order is reversed, then the result is an instance of np.poly1d. > The return types are also the same for numpy arrays, which is at least > understandable, although a warning would be good) > > When using plain (python) numbers, then both left and right > multiplication of the number with the polynomial returns a polynomial. > > Is this a bug or a feature? I didn't see it mentioned in the docs. Looks like yet another example of ticket #826: http://scipy.org/scipy/numpy/ticket/826 This one is getting quite a problem, and I have no idea how to fix it... From josef.pktd at gmail.com Wed Feb 4 11:44:11 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 4 Feb 2009 11:44:11 -0500 Subject: [Numpy-discussion] poly1d left versus right multiplication with np numbers In-Reply-To: <5164FDE2-A811-4FE4-A370-5939F85C7268@gmail.com> References: <1cd32cbb0902040800j4436b68dl690c90f7cf1e4c2@mail.gmail.com> <5164FDE2-A811-4FE4-A370-5939F85C7268@gmail.com> Message-ID: <1cd32cbb0902040844u6305466ej182831b573e893e2@mail.gmail.com> On Wed, Feb 4, 2009 at 11:19 AM, Pierre GM wrote: > > On Feb 4, 2009, at 11:00 AM, josef.pktd at gmail.com wrote: > >> I just had a hard to find bug in my program. poly1d treats numpy >> scalars differently than python numbers when left or right >> multiplication is used. >> >> Essentially, if the first term is the numpy scalar, multiplied by a >> polynomial, then the result is an np.array. >> If the order is reversed, then the result is an instance of np.poly1d. 
>> The return types are also the same for numpy arrays, which is at least >> understandable, although a warning would be good) >> >> When using plain (python) numbers, then both left and right >> multiplication of the number with the polynomial returns a polynomial. >> >> Is this a bug or a feature? I didn't see it mentioned in the docs. > > Looks like yet another example of ticket #826: > http://scipy.org/scipy/numpy/ticket/826 > This one is getting quite a problem, and I have no idea how to fix > it... Thanks, yes it looks exactly like this ticket. At least, once I know about it, it is not too difficult to work around, but it costs a lot of debugging time to figure this out. Josef From jh at physics.ucf.edu Wed Feb 4 12:01:45 2009 From: jh at physics.ucf.edu (jh at physics.ucf.edu) Date: Wed, 04 Feb 2009 12:01:45 -0500 Subject: [Numpy-discussion] Numpy 1.3 release date ? In-Reply-To: (numpy-discussion-request@scipy.org) References: Message-ID: Scott Sinclair wrote: >> 2009/2/4 David Cournapeau : >> On Tue, Feb 3, 2009 at 11:49 PM, Pierre GM wrote: >>> All, >>> When can we expect numpy 1.3 to be released ? >> >> I think official 2.6 support (with binaries for the platforms where we >> support binaries), x64 support and everything which has been done >> already would be enough to make a release. I'd like to see a self-consistent and reasonably recent numpy/scipy make the Ubuntu 9.04 FeatureFreeze deadline of 19 February. Is this possible? One of the reasons we had discussed doing more frequent releases was to get consistent and recent packages in the mainstream Linuxes, and Ubuntu is very popular in our community. >There are a bunch of documentation patches that should to be reviewed >and applied to SVN before the release (especially those marked 'Needs >review' or better). >http://docs.scipy.org/numpy/patch/ At this point, we're focussing on getting semi-decent drafts of as many docstrings as possible rather than on polishing the ones we have as drafts, on the basis of "something is always better than nothing as long as it's not wrong". So, please don't hold up a release for review, just get as many drafts in as possible. Also, it has become clear that we need separate technical and writing reviews, as a number of pages marked as "reviewed" are deficient in one or the other area, probably the one that wasn't the strength of the reviewer. That entails discussion and revamping of the site, which can happen once we have a larger percentage of the docstrings in draft form. (Any discussion on the merits of this should take place on scipy-dev, which is the home of doc discussions.) --jh-- From watson.jim at gmail.com Wed Feb 4 12:02:54 2009 From: watson.jim at gmail.com (James Watson) Date: Wed, 4 Feb 2009 17:02:54 +0000 Subject: [Numpy-discussion] porting NumPy to Python 3 Message-ID: Hi Ondrej, To get 2to3 to run without warnings, the following files require minor changes: - numpy/distutils/fcompiler/intel.py - numpy/distutils/misc_util.py - numpy/distutils/command/build_src.py - numpy/f2py/crackfortran.py - numpy/lib/function_base.py - numpy/linalg/lapack_lite/make_lite.py There are also other (possibly many, still working on this) files that require syntactic changes to run post 2to3. Is there anywhere specific I should upload these changes? Is this list appropriate? Is there a developer I can send these and future patches to? James. 
> Date: Tue, 3 Feb 2009 01:03:15 -0800
> From: Ondrej Certik
> Subject: Re: [Numpy-discussion] porting NumPy to Python 3
> To: Discussion of Numerical Python
> Message-ID:
> <85b5c3130902030103w69180a6bm4143050b4b82bf1f at mail.gmail.com>
> Content-Type: text/plain; charset=UTF-8
>
> Hi James,
>
> On Thu, Jan 29, 2009 at 2:11 AM, James Watson wrote:
>> Hi,
>>
>> I am interested in contributing to the port of NumPy to Python 3. Who
>> should I coordinate the effort with?
>>
>> I have started at the Python end of the problem (as opposed to
>> http://www.scipy.org/Python3k), e.g. I have several patches to get
>> 2to3 to work on NumPy's Python source code.
>
> I am sorry that no one has replied to your email. Could you please
> upload your patches somewhere?
>
> Ondrej

From bpederse at gmail.com Wed Feb 4 12:09:51 2009
From: bpederse at gmail.com (Brent Pedersen)
Date: Wed, 4 Feb 2009 09:09:51 -0800
Subject: [Numpy-discussion] genfromtxt view with object dtype
Message-ID:

hi, i am using genfromtxt, with a dtype like this:
[('seqid', '|S24'), ('source', '|S16'), ('type', '|S16'), ('start', '<i4'), ('end', '<i4'), ('score', '<f8'), ('strand', '|S1'), ('phase', '<i4'), ('attrs', '|O4')]

References:
Message-ID: <1C0B2F62-23C4-47CE-BC31-863629FAE543@gmail.com>

On Feb 4, 2009, at 12:09 PM, Brent Pedersen wrote:
> hi, i am using genfromtxt, with a dtype like this:
> [('seqid', '|S24'), ('source', '|S16'), ('type', '|S16'), ('start', '<i4'), ('end', '<i4'), ('score', '<f8'), ('strand', '|S1'), ('phase', '<i4'), ('attrs', '|O4')]

Brent,
Please post a simple, self-contained example with a few lines of the file you want to load.

References:
Message-ID:

On Wed, Feb 4, 2009 at 10:02 AM, James Watson wrote:
> Hi Ondrej,
>
> To get 2to3 to run without warnings, the following files require minor
> changes:
> - numpy/distutils/fcompiler/intel.py
> - numpy/distutils/misc_util.py
> - numpy/distutils/command/build_src.py
> - numpy/f2py/crackfortran.py
> - numpy/lib/function_base.py
> - numpy/linalg/lapack_lite/make_lite.py
>
> There are also other (possibly many, still working on this) files that
> require syntactic changes to run post 2to3.
>
> Is there anywhere specific I should upload these changes? Is this
> list appropriate? Is there a developer I can send these and future
> patches to?
>

Maybe we could add a python 3 milestone so folks could post tickets for these sorts of things. For the time being you could just post to the list here with the patches and something in the subject line that identifies it as a patch for python 3. I think we could make any small changes as long as they are compatible with python 2.4.

Chuck

From charlesr.harris at gmail.com Wed Feb 4 12:48:31 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Wed, 4 Feb 2009 10:48:31 -0700
Subject: [Numpy-discussion] Numpy 1.3 release date ?
In-Reply-To:
References:
Message-ID:

On Wed, Feb 4, 2009 at 10:01 AM, wrote:
> Scott Sinclair wrote:
> >> 2009/2/4 David Cournapeau :
> >> On Tue, Feb 3, 2009 at 11:49 PM, Pierre GM wrote:
> >>> All,
> >>> When can we expect numpy 1.3 to be released ?
> >>
> >> I think official 2.6 support (with binaries for the platforms where we
> >> support binaries), x64 support and everything which has been done
> >> already would be enough to make a release.
>
> I'd like to see a self-consistent and reasonably recent numpy/scipy
> make the Ubuntu 9.04 FeatureFreeze deadline of 19 February. Is this
> possible? One of the reasons we had discussed doing more frequent
> releases was to get consistent and recent packages in the mainstream
> Linuxes, and Ubuntu is very popular in our community.
>

That deadline is pretty close ;) There has been a dearth of developer time these last few months, what with work, holidays, and dissertations, and I don't know when that is going to get better.

Chuck

From bsouthey at gmail.com Wed Feb 4 13:13:43 2009
From: bsouthey at gmail.com (Bruce Southey)
Date: Wed, 04 Feb 2009 12:13:43 -0600
Subject: [Numpy-discussion] porting NumPy to Python 3
In-Reply-To:
References:
Message-ID: <4989DAD7.9060507@gmail.com>

Charles R Harris wrote:
>
> On Wed, Feb 4, 2009 at 10:02 AM, James Watson wrote:
>
> Hi Ondrej,
>
> To get 2to3 to run without warnings, the following files require
> minor changes:
> - numpy/distutils/fcompiler/intel.py
> - numpy/distutils/misc_util.py
> - numpy/distutils/command/build_src.py
> - numpy/f2py/crackfortran.py
> - numpy/lib/function_base.py
> - numpy/linalg/lapack_lite/make_lite.py
>
> There are also other (possibly many, still working on this) files that
> require syntactic changes to run post 2to3.
>
> Is there anywhere specific I should upload these changes? Is this
> list appropriate? Is there a developer I can send these and future
> patches to?
>
>
> Maybe we could add a python 3 milestone so folks could post tickets
> for these sorts of things. For the time being you could just post to
> the list here with the patches and something in the subject line that
> identifies it as a patch for python 3. I think we could make any small
> changes as long as they are compatible with python 2.4.
>
> Chuck
>
> ------------------------------------------------------------------------
>
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>

If you follow Guido's recommendation (at the bottom of http://docs.python.org/dev/3.0/whatsnew/3.0.html), then we should first be compatible with Python 2.6 (about the current stage). Then make the code run clean under Python 2.6 with the -3 flag (which warns about Python 3.x incompatibilities) and fix warnings like:

numpy/core/__init__.py:11: DeprecationWarning: the cPickle module has been removed in Python 3.0

(Of course you lose the speed of cPickle if you blindly change it to use the old Pickle in Python 2.x.) Finally, use the 2to3 tool to get a Python 3 version.

Bruce

From bsouthey at gmail.com Wed Feb 4 15:40:24 2009
From: bsouthey at gmail.com (Bruce Southey)
Date: Wed, 04 Feb 2009 14:40:24 -0600
Subject: [Numpy-discussion] ImportError: No module named dateutil.parser
Message-ID: <4989FD38.7080009@gmail.com>

Hi,
I just updated to the latest SVN but I get a failure when running the tests due to missing dateutil.parser. Should this module exist in Numpy, or is this just an inappropriate test?
Bruce >>> numpy.test() Running unit tests for numpy NumPy version 1.3.0.dev6338 NumPy is installed in /usr/lib64/python2.5/site-packages/numpy Python version 2.5.2 (r252:60911, Sep 30 2008, 15:42:03) [GCC 4.3.2 20080917 (Red Hat 4.3.2-4)] nose version 0.10.3 .......................................................................................................................................................................................................................................................KKKKKKKK..................................................................................................................................................................................................K.....................................................................................................................................................E........................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................ ====================================================================== ERROR: Tests updatemapper ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/lib64/python2.5/site-packages/numpy/lib/tests/test__iotools.py", line 133, in test_upgrademapper import dateutil.parser ImportError: No module named dateutil.parser ---------------------------------------------------------------------- Ran 1888 tests in 6.084s FAILED (KNOWNFAIL=9, errors=1) From robert.kern at gmail.com Wed Feb 4 15:48:31 2009 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 4 Feb 2009 14:48:31 -0600 Subject: [Numpy-discussion] ImportError: No module named dateutil.parser In-Reply-To: <4989FD38.7080009@gmail.com> References: <4989FD38.7080009@gmail.com> Message-ID: <3d375d730902041248l256d1402wc2a684aaa9d60db8@mail.gmail.com> On Wed, Feb 4, 2009 at 14:40, Bruce Southey wrote: > Hi, > I just updated to the latest SVN but I get a failure when running the > tests due to missing dateutil.parser. Should this module exist in Numpy > or just a inappropriate test? It's a bad test. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." 
-- Umberto Eco

From pgmdevlist at gmail.com Wed Feb 4 15:54:38 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Wed, 4 Feb 2009 15:54:38 -0500
Subject: [Numpy-discussion] ImportError: No module named dateutil.parser
In-Reply-To: <4989FD38.7080009@gmail.com>
References: <4989FD38.7080009@gmail.com>
Message-ID: <3433B34A-1F97-40E5-B37C-379A778604EE@gmail.com>

On Feb 4, 2009, at 3:40 PM, Bruce Southey wrote:
> Hi,
> I just updated to the latest SVN but I get a failure when running the
> tests due to missing dateutil.parser. Should this module exist in
> Numpy, or is this just an inappropriate test?

I put the corresponding tests in a try/except ImportError block (r6339), that should fix it.

From robert.kern at gmail.com Wed Feb 4 15:56:24 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 4 Feb 2009 14:56:24 -0600
Subject: [Numpy-discussion] ImportError: No module named dateutil.parser
In-Reply-To: <3433B34A-1F97-40E5-B37C-379A778604EE@gmail.com>
References: <4989FD38.7080009@gmail.com>
	<3433B34A-1F97-40E5-B37C-379A778604EE@gmail.com>
Message-ID: <3d375d730902041256i323140cre3e41ab19075553f@mail.gmail.com>

On Wed, Feb 4, 2009 at 14:54, Pierre GM wrote:
>
> On Feb 4, 2009, at 3:40 PM, Bruce Southey wrote:
>
>> Hi,
>> I just updated to the latest SVN but I get a failure when running the
>> tests due to missing dateutil.parser. Should this module exist in
>> Numpy, or is this just an inappropriate test?
>
> I put the corresponding tests in a try/except ImportError block
> (r6339), that should fix it.

No, rewrite the test to not use external libraries, please. Test the functionality without needing dateutils.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth."
-- Umberto Eco

From bpederse at gmail.com Wed Feb 4 16:03:16 2009
From: bpederse at gmail.com (Brent Pedersen)
Date: Wed, 4 Feb 2009 13:03:16 -0800
Subject: [Numpy-discussion] genfromtxt view with object dtype
In-Reply-To: <1C0B2F62-23C4-47CE-BC31-863629FAE543@gmail.com>
References: <1C0B2F62-23C4-47CE-BC31-863629FAE543@gmail.com>
Message-ID:

On Wed, Feb 4, 2009 at 9:36 AM, Pierre GM wrote:
>
> On Feb 4, 2009, at 12:09 PM, Brent Pedersen wrote:
>
>> hi, i am using genfromtxt, with a dtype like this:
>> [('seqid', '|S24'), ('source', '|S16'), ('type', '|S16'), ('start', '<i4'), ('end', '<i4'), ('score', '<f8'), ('strand', '|S1'), ('phase', '<i4'), ('attrs', '|O4')]
>
> Brent,
> Please post a simple, self-contained example with a few lines of the
> file you want to load.
>
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>

hi pierre, here is an example.
thanks, -brent ###################### import numpy as np from cStringIO import StringIO gffstr = """\ ##gff-version 3 1\tucb\tgene\t2234602\t2234702\t.\t-\t.\tID=grape_1_2234602_2234702;match=EVM_prediction_supercontig_1.248,EVM_prediction_supercontig_1.248.mRNA 1\tucb\tgene\t2300292\t2302123\t.\t+\t.\tID=grape_1_2300292_2302123;match=EVM_prediction_supercontig_244.8 1\tucb\tgene\t2303615\t2303967\t.\t+\t.\tID=grape_1_2303615_2303967;match=EVM_prediction_supercontig_244.8 1\tucb\tgene\t2303616\t2303966\t.\t+\t.\tParent=grape_1_2303615_2303967 1\tucb\tgene\t3596400\t3596503\t.\t-\t.\tID=grape_1_3596400_3596503;match=evm.TU.supercontig_167.27 1\tucb\tgene\t3600651\t3600977\t.\t-\t.\tmatch=evm.model.supercontig_1217.1,evm.model.supercontig_1217.1.mRNA """ dtype = {'names' : ('seqid', 'source', 'type', 'start', 'end', 'score', 'strand', 'phase', 'attrs') , 'formats': ['S24', 'S16', 'S16', 'i4', 'i4', 'f8', 'S1', 'i4', 'S128']} #OK with S128 for attrs print np.genfromtxt(StringIO(gffstr), dtype = dtype) def _attr(kvstr): pairs = [kv.split("=") for kv in kvstr.split(";")] return dict(pairs) # change S128 to object to have col attrs as dictionary dtype['formats'][-1] = 'O' converters = {8: _attr } #NOT OK print np.genfromtxt(StringIO(gffstr), dtype = dtype, converters=converters) From simon.palmer at gmail.com Wed Feb 4 16:18:33 2009 From: simon.palmer at gmail.com (Simon Palmer) Date: Wed, 4 Feb 2009 13:18:33 -0800 (PST) Subject: [Numpy-discussion] Multiplying a matrix by a vector Message-ID: <21839828.post@talk.nabble.com> Bit of a newb question I suspect... I have a matrix and a vector which has the same number of elements as the matrix has rows. I want to multiply each element in a row in the matrix by the corresponding element in the vector. I can obviously do this with a loop, but am guessing there is a more elegant (and faster?) solution. Just for clarity... Matrix [X1, X2, X3] [Y1, Y2, Y3] Vector [A, B] Desired result [X1*A, X2*A, X3*A] [Y1*B, Y2*B, Y3*B] -- View this message in context: http://www.nabble.com/Multiplying-a-matrix-by-a-vector-tp21839828p21839828.html Sent from the Numpy-discussion mailing list archive at Nabble.com. From h5py at alfven.org Wed Feb 4 16:22:38 2009 From: h5py at alfven.org (Andrew Collette) Date: Wed, 4 Feb 2009 13:22:38 -0800 Subject: [Numpy-discussion] Array dtype problems Message-ID: Hello, I'm having an issue with 'array' dtypes; if I do this: mydtype = numpy.dtype(('i', (4,))) arr = numpy.empty((100,), dtype=mydtype) then the array datatype shape is "absorbed" into the shape of the array. This simplifies indexing, but is really annoying as it means that the dtype doesn't "round trip"; I can't even use use astype as it complains about a shape mismatch. How can I create an array in a manner that preserves the dtype array information? Or is there a way to take an existing array of the correct shape and "re-cast" it to use an array dtype? Thanks, Andrew Collette From pgmdevlist at gmail.com Wed Feb 4 17:12:51 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Wed, 4 Feb 2009 17:12:51 -0500 Subject: [Numpy-discussion] ImportError: No module named dateutil.parser In-Reply-To: <3d375d730902041256i323140cre3e41ab19075553f@mail.gmail.com> References: <4989FD38.7080009@gmail.com> <3433B34A-1F97-40E5-B37C-379A778604EE@gmail.com> <3d375d730902041256i323140cre3e41ab19075553f@mail.gmail.com> Message-ID: <5348E9DF-067B-4D6E-ABE4-9D36F35442BF@gmail.com> On Feb 4, 2009, at 3:56 PM, Robert Kern wrote: > > No, rewrite the test to not use external libraries, please. 
Test the > functionality without needing dateutils. OK then, should be fixed in r6340. From robert.kern at gmail.com Wed Feb 4 17:15:21 2009 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 4 Feb 2009 16:15:21 -0600 Subject: [Numpy-discussion] ImportError: No module named dateutil.parser In-Reply-To: <5348E9DF-067B-4D6E-ABE4-9D36F35442BF@gmail.com> References: <4989FD38.7080009@gmail.com> <3433B34A-1F97-40E5-B37C-379A778604EE@gmail.com> <3d375d730902041256i323140cre3e41ab19075553f@mail.gmail.com> <5348E9DF-067B-4D6E-ABE4-9D36F35442BF@gmail.com> Message-ID: <3d375d730902041415p2fbaen9d1182671afce01b@mail.gmail.com> On Wed, Feb 4, 2009 at 16:12, Pierre GM wrote: > > On Feb 4, 2009, at 3:56 PM, Robert Kern wrote: > >> No, rewrite the test to not use external libraries, please. Test the >> functionality without needing dateutils. > > OK then, should be fixed in r6340. Thank you! -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From Chris.Barker at noaa.gov Wed Feb 4 17:48:49 2009 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Wed, 04 Feb 2009 14:48:49 -0800 Subject: [Numpy-discussion] Multiplying a matrix by a vector In-Reply-To: <21839828.post@talk.nabble.com> References: <21839828.post@talk.nabble.com> Message-ID: <498A1B51.8030609@noaa.gov> Simon Palmer wrote: > I have a matrix and a vector which has the same number of elements as the > matrix has rows. I want to multiply each element in a row in the matrix by > the corresponding element in the vector. >>> M = np.arange(6).reshape((2,3)) >>> M array([[0, 1, 2], [3, 4, 5]]) >>> v = np.array((4,5)).reshape((-1,1)) # make it a column vector >>> v array([[4], [5]]) >>> M * v array([[ 0, 4, 8], [15, 20, 25]]) you can also do it with np.newaxis: >>> v = np.array((4,5)) >>> M * v[:,np.newaxis] array([[ 0, 4, 8], [15, 20, 25]]) http://www.scipy.org/EricsBroadcastingDoc If you are really working with a matrix, rather than a 2-d array, you may want to look at the np.matrix object. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From pgmdevlist at gmail.com Wed Feb 4 17:50:38 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Wed, 4 Feb 2009 17:50:38 -0500 Subject: [Numpy-discussion] Renaming a field of an object array Message-ID: <864EDE43-CC49-469F-A2C8-65D947CC0B51@gmail.com> All, I'm a tad puzzled by the following behavior (I'm trying to correct a bug in genfromtxt): I'm creating an empty structured ndarray, using np.object as dtype. >>> a = np.empty(1,dtype=[('',np.object)]) array([(None,)], dtype=[('f0', '|O4')]) Now, I'd like to rename the field: >>> a.view([('NAME',np.object)]) TypeError: Cannot change data-type for object array. I understand why I can't change the *type* of the field, but not why I can't change its name that way. What would be an option that wouldn't involve creating a new array ? Thx in advance. 
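One workaround that avoids creating a new array, assuming a NumPy version where the names attribute of a dtype is writable (a hedged sketch, not tested against every release): rename the field in place by assigning to dtype.names.

>>> import numpy as np
>>> a = np.empty(1, dtype=[('', np.object)])   # the field is auto-named 'f0'
>>> a.dtype.names = ('NAME',)   # in-place rename: no copy, no view
>>> a.dtype
dtype([('NAME', '|O4')])

Note that this mutates the dtype object itself, so any other array sharing that exact dtype object will see the new field name as well.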
From bpederse at gmail.com Wed Feb 4 18:25:17 2009 From: bpederse at gmail.com (Brent Pedersen) Date: Wed, 4 Feb 2009 15:25:17 -0800 Subject: [Numpy-discussion] Renaming a field of an object array In-Reply-To: <864EDE43-CC49-469F-A2C8-65D947CC0B51@gmail.com> References: <864EDE43-CC49-469F-A2C8-65D947CC0B51@gmail.com> Message-ID: On Wed, Feb 4, 2009 at 2:50 PM, Pierre GM wrote: > All, > I'm a tad puzzled by the following behavior (I'm trying to correct a > bug in genfromtxt): > > I'm creating an empty structured ndarray, using np.object as dtype. > > >>> a = np.empty(1,dtype=[('',np.object)]) > array([(None,)], > dtype=[('f0', '|O4')]) > > Now, I'd like to rename the field: > >>> a.view([('NAME',np.object)]) > TypeError: Cannot change data-type for object array. > > I understand why I can't change the *type* of the field, but not why I > can't change its name that way. What would be an option that wouldn't > involve creating a new array ? > Thx in advance. > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > hi, i was looking at this as well. the code in arrayobject.c doesnt match the error string. i changed the code to do what the error string says and seems thing to work. i think the if-block below it should also use xor (not changed in this patch), but i'm not a c programmer so i may be missing something obvious. svn diff numpy/core/src/arrayobject.c Index: numpy/core/src/arrayobject.c =================================================================== --- numpy/core/src/arrayobject.c (revision 6338) +++ numpy/core/src/arrayobject.c (working copy) @@ -6506,9 +6506,16 @@ PyErr_SetString(PyExc_TypeError, "invalid data-type for array"); return -1; } - if (PyDataType_FLAGCHK(newtype, NPY_ITEM_HASOBJECT) || - PyDataType_FLAGCHK(newtype, NPY_ITEM_IS_POINTER) || - PyDataType_FLAGCHK(self->descr, NPY_ITEM_HASOBJECT) || + if (PyDataType_FLAGCHK(newtype, NPY_ITEM_HASOBJECT) ^ + PyDataType_FLAGCHK(self->descr, NPY_ITEM_HASOBJECT)) { + PyErr_SetString(PyExc_TypeError, \ + "Cannot change data-type for object " \ + "array."); + Py_DECREF(newtype); + return -1; + } + + if (PyDataType_FLAGCHK(newtype, NPY_ITEM_IS_POINTER) || PyDataType_FLAGCHK(self->descr, NPY_ITEM_IS_POINTER)) { PyErr_SetString(PyExc_TypeError, \ "Cannot change data-type for object " \ From cournape at gmail.com Wed Feb 4 21:40:02 2009 From: cournape at gmail.com (David Cournapeau) Date: Thu, 5 Feb 2009 11:40:02 +0900 Subject: [Numpy-discussion] Numpy 1.3 release date ? In-Reply-To: References: Message-ID: <5b8d13220902041840y796c5037ubc82aa431735037f@mail.gmail.com> On Thu, Feb 5, 2009 at 2:48 AM, Charles R Harris wrote: > > > On Wed, Feb 4, 2009 at 10:01 AM, wrote: >> >> Scott Sinclair wrote: >> >> 2009/2/4 David Cournapeau : >> >> On Tue, Feb 3, 2009 at 11:49 PM, Pierre GM >> >> wrote: >> >>> All, >> >>> When can we expect numpy 1.3 to be released ? >> >> >> >> I think official 2.6 support (with binaries for the platforms where we >> >> support binaries), x64 support and everything which has been done >> >> already would be enough to make a release. >> >> I'd like to see a self-consistent and reasonably recent numpy/scipy >> make the Ubuntu 9.04 FeatureFreeze deadline of 19 February. Is this >> possible? One of the reasons we had discussed doing more frequent >> releases was to get consistent and recent packages in the mainstream >> Linuxes, and Ubuntu is very popular in our community. 
> > That deadline is pretty close ;) I think it is safe to say it will not be possible to release numpy for that date. Concerning Ubuntu, it sounds more realistic to make sure it is updated to 1.2.1, and if possible to release 0.7.0. David From pgmdevlist at gmail.com Wed Feb 4 23:51:34 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Wed, 4 Feb 2009 23:51:34 -0500 Subject: [Numpy-discussion] genfromtxt view with object dtype In-Reply-To: References: <1C0B2F62-23C4-47CE-BC31-863629FAE543@gmail.com> Message-ID: OK, Brent, try r6341. I fixed genfromtxt for cases like yours (explicit dtype involving a np.object). Note that the fix won't work if the dtype is nested and involves np.objects (as we would hit the pb of renaming fields we observed...). Let me know how it goes. P. On Feb 4, 2009, at 4:03 PM, Brent Pedersen wrote: > On Wed, Feb 4, 2009 at 9:36 AM, Pierre GM > wrote: >> >> On Feb 4, 2009, at 12:09 PM, Brent Pedersen wrote: >> >>> hi, i am using genfromtxt, with a dtype like this: >>> [('seqid', '|S24'), ('source', '|S16'), ('type', '|S16'), ('start', >>> '>> ('phase', >>> '> >> Brent, >> Please post a simple, self-contained example with a few lines of the >> file you want to load. >> >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion >> > > hi pierre, here is an example. > thanks, > -brent > > ###################### > > import numpy as np > from cStringIO import StringIO > > gffstr = """\ > ##gff-version 3 > 1\tucb\tgene\t2234602\t2234702\t.\t-\t. > \tID > = > grape_1_2234602_2234702 > ;match > = > EVM_prediction_supercontig_1.248,EVM_prediction_supercontig_1.248.mRNA > 1\tucb\tgene\t2300292\t2302123\t.\t+\t. > \tID=grape_1_2300292_2302123;match=EVM_prediction_supercontig_244.8 > 1\tucb\tgene\t2303615\t2303967\t.\t+\t. > \tID=grape_1_2303615_2303967;match=EVM_prediction_supercontig_244.8 > 1\tucb\tgene\t2303616\t2303966\t.\t+\t. > \tParent=grape_1_2303615_2303967 > 1\tucb\tgene\t3596400\t3596503\t.\t-\t. > \tID=grape_1_3596400_3596503;match=evm.TU.supercontig_167.27 > 1\tucb\tgene\t3600651\t3600977\t.\t-\t. > \tmatch=evm.model.supercontig_1217.1,evm.model.supercontig_1217.1.mRNA > """ > > dtype = {'names' : > ('seqid', 'source', 'type', 'start', 'end', > 'score', 'strand', 'phase', 'attrs') , > 'formats': > ['S24', 'S16', 'S16', 'i4', 'i4', 'f8', > 'S1', 'i4', 'S128']} > > #OK with S128 for attrs > print np.genfromtxt(StringIO(gffstr), dtype = dtype) > > > > def _attr(kvstr): > pairs = [kv.split("=") for kv in kvstr.split(";")] > return dict(pairs) > > # change S128 to object to have col attrs as dictionary > dtype['formats'][-1] = 'O' > converters = {8: _attr } > #NOT OK > print np.genfromtxt(StringIO(gffstr), dtype = dtype, > converters=converters) > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From bpederse at gmail.com Thu Feb 5 00:22:31 2009 From: bpederse at gmail.com (Brent Pedersen) Date: Wed, 4 Feb 2009 21:22:31 -0800 Subject: [Numpy-discussion] genfromtxt view with object dtype In-Reply-To: References: <1C0B2F62-23C4-47CE-BC31-863629FAE543@gmail.com> Message-ID: On Wed, Feb 4, 2009 at 8:51 PM, Pierre GM wrote: > OK, Brent, try r6341. > I fixed genfromtxt for cases like yours (explicit dtype involving a > np.object). 
> Note that the fix won't work if the dtype is nested and involves > np.objects (as we would hit the pb of renaming fields we observed...). > Let me know how it goes. > P. > that fixes it. thanks again pierre! -b > On Feb 4, 2009, at 4:03 PM, Brent Pedersen wrote: > >> On Wed, Feb 4, 2009 at 9:36 AM, Pierre GM >> wrote: >>> >>> On Feb 4, 2009, at 12:09 PM, Brent Pedersen wrote: >>> >>>> hi, i am using genfromtxt, with a dtype like this: >>>> [('seqid', '|S24'), ('source', '|S16'), ('type', '|S16'), ('start', >>>> '>>> ('phase', >>>> '>> >>> Brent, >>> Please post a simple, self-contained example with a few lines of the >>> file you want to load. >>> >>> _______________________________________________ >>> Numpy-discussion mailing list >>> Numpy-discussion at scipy.org >>> http://projects.scipy.org/mailman/listinfo/numpy-discussion >>> >> >> hi pierre, here is an example. >> thanks, >> -brent >> >> ###################### >> >> import numpy as np >> from cStringIO import StringIO >> >> gffstr = """\ >> ##gff-version 3 >> 1\tucb\tgene\t2234602\t2234702\t.\t-\t. >> \tID >> = >> grape_1_2234602_2234702 >> ;match >> = >> EVM_prediction_supercontig_1.248,EVM_prediction_supercontig_1.248.mRNA >> 1\tucb\tgene\t2300292\t2302123\t.\t+\t. >> \tID=grape_1_2300292_2302123;match=EVM_prediction_supercontig_244.8 >> 1\tucb\tgene\t2303615\t2303967\t.\t+\t. >> \tID=grape_1_2303615_2303967;match=EVM_prediction_supercontig_244.8 >> 1\tucb\tgene\t2303616\t2303966\t.\t+\t. >> \tParent=grape_1_2303615_2303967 >> 1\tucb\tgene\t3596400\t3596503\t.\t-\t. >> \tID=grape_1_3596400_3596503;match=evm.TU.supercontig_167.27 >> 1\tucb\tgene\t3600651\t3600977\t.\t-\t. >> \tmatch=evm.model.supercontig_1217.1,evm.model.supercontig_1217.1.mRNA >> """ >> >> dtype = {'names' : >> ('seqid', 'source', 'type', 'start', 'end', >> 'score', 'strand', 'phase', 'attrs') , >> 'formats': >> ['S24', 'S16', 'S16', 'i4', 'i4', 'f8', >> 'S1', 'i4', 'S128']} >> >> #OK with S128 for attrs >> print np.genfromtxt(StringIO(gffstr), dtype = dtype) >> >> >> >> def _attr(kvstr): >> pairs = [kv.split("=") for kv in kvstr.split(";")] >> return dict(pairs) >> >> # change S128 to object to have col attrs as dictionary >> dtype['formats'][-1] = 'O' >> converters = {8: _attr } >> #NOT OK >> print np.genfromtxt(StringIO(gffstr), dtype = dtype, >> converters=converters) >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From opossumnano at gmail.com Thu Feb 5 03:14:44 2009 From: opossumnano at gmail.com (Tiziano Zito) Date: Thu, 5 Feb 2009 09:14:44 +0100 Subject: [Numpy-discussion] Numpy 1.3 release date ? In-Reply-To: References: Message-ID: <20090205081443.GA25806@localhost> what about fixing http://scipy.org/scipy/scipy/ticket/812 ? this is actually a scipy-numpy compatibility problem, where numpy is wrong IMO. it is a one-line fix, I think. thank you! tiziano From trevor at notcows.com Thu Feb 5 15:45:38 2009 From: trevor at notcows.com (Trevor Clarke) Date: Thu, 5 Feb 2009 15:45:38 -0500 Subject: [Numpy-discussion] help writing an array subtype Message-ID: <7bde5d400902051245x754ac906h21887d8ddda39036@mail.gmail.com> I'm embedded python (and numpy) is a C++ app and I'm trying to access my array data in numpy but I'm not sure where to start. 
Due to data sizes, I don't have access to the entire array contiguously; my app implements a virtual-memory-like paging system where variable-sized pages of data are brought into memory on demand. Each page has a contiguous block of data containing at least the requested piece of the array (if it's more, it's arranged so the appropriate data can be accessed with strides). I'd like to load pages when they are accessed in an ndarray and when a new view is requested. I was thinking of loading and holding the most recently accessed page of the array on a per-view basis (i.e. if a view is created, it can access a different part of the array from the original array).

I've read over the documentation on subclassing ndarray, but there's a lot of information there and I'm not quite sure of the best place to start. Could someone point me in the right direction, or perhaps offer a better solution?

From rmay31 at gmail.com Thu Feb 5 18:06:49 2009
From: rmay31 at gmail.com (Ryan May)
Date: Thu, 05 Feb 2009 17:06:49 -0600
Subject: [Numpy-discussion] Argsort
Message-ID: <498B7109.8060501@gmail.com>

Hi,

Ok, what am I missing here:

x = np.array([[4,2],[5,3]])
x[x.argsort(1)]

array([[[5, 3],
        [4, 2]],

       [[5, 3],
        [4, 2]]])

I was expecting:

array([[2,4],[3,5]])

Certainly not a 3D array. What am I doing wrong?

Ryan

-- 
Ryan May
Graduate Research Assistant
School of Meteorology
University of Oklahoma

From oliphant at enthought.com Thu Feb 5 18:08:49 2009
From: oliphant at enthought.com (Travis E. Oliphant)
Date: Thu, 05 Feb 2009 17:08:49 -0600
Subject: [Numpy-discussion] Selection of only a certain number of fields
Message-ID: <498B7181.5000300@enthought.com>

Hi all,

I've been fairly quiet on this list for awhile due to work and family schedule, but I think about how things can improve regularly. One feature that's been requested by a few people is the ability to select multiple fields from a structured array. Thus, suppose *arr* is a structured array with dtype:

[('name', 'S25'), ('height', float), ('age', int), ('gender', 'S8')]

Then,

newarr = arr[['name', 'age']]

should be a structured array with just the name and age fields.

It seems to me that there are two reasonable behaviors here (and possibly two different approaches to getting those behaviors):

1) Copy the data into a new array with a new dtype
2) Create a new array that is just a view of the old data with some of the fields "hidden"

I lean toward having the proposed syntax do #2, but wonder what other people think. Any opinions and/or suggestions?

-Travis

-- 
Travis Oliphant
Enthought, Inc.
(512) 536-1057
http://www.enthought.com
oliphant at enthought.com

From robert.kern at gmail.com Thu Feb 5 18:15:56 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 5 Feb 2009 17:15:56 -0600
Subject: [Numpy-discussion] Argsort
In-Reply-To: <498B7109.8060501@gmail.com>
References: <498B7109.8060501@gmail.com>
Message-ID: <3d375d730902051515sb7afa3ah469522cd536ca19@mail.gmail.com>

On Thu, Feb 5, 2009 at 17:06, Ryan May wrote:
> Hi,
>
> Ok, what am I missing here:
>
> x = np.array([[4,2],[5,3]])
> x[x.argsort(1)]
>
> array([[[5, 3],
>         [4, 2]],
>
>        [[5, 3],
>         [4, 2]]])
>
> I was expecting:
>
> array([[2,4],[3,5]])
>
> Certainly not a 3D array. What am I doing wrong?

Remember that x[i] applies the index array i to the first axis of x. In order to apply i to the second axis, you need an argument in the first position, too, to give the indices that apply to the first axis.
Remember that the arguments will be broadcast against each other. In [11]: x[ [[0],[1]], x.argsort(1)] Out[11]: array([[2, 4], [3, 5]]) -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From gael.varoquaux at normalesup.org Thu Feb 5 18:16:18 2009 From: gael.varoquaux at normalesup.org (Gael Varoquaux) Date: Fri, 6 Feb 2009 00:16:18 +0100 Subject: [Numpy-discussion] PEP: named axis (was: Selection of only a certain number of fields) In-Reply-To: <498B7181.5000300@enthought.com> References: <498B7181.5000300@enthought.com> Message-ID: <20090205231618.GA21014@phare.normalesup.org> On Thu, Feb 05, 2009 at 05:08:49PM -0600, Travis E. Oliphant wrote: > I've been fairly quiet on this list for awhile due to work and family > schedule, but I think about how things can improve regularly. One > feature that's been requested by a few people is the ability to select > multiple fields from a structured array. Hey Travis, I have no opinion on the above, as I don't have this use case. However, as you are talking about implementing something, I jump on the occasion to suggest another gadget, slightly related: I would like named axis. Suppose you have a 5D array, I would like to be able to give each axis names, eg (to chose an example you might be familiar with) ('Frontal', 'Lateral', 'Axial', 'Time', 'Subjects'). And if this could be understood be numpy operations (say ufuncs and fancy indexing) so that I could do (a is my 5D array): >>> b = a.mean(axis='Time') >>> b.axis ('Frontal', 'Lateral', 'Axial', 'Subjects') I believe this would make a big difference for people working with n-dimensional arrays, where n is large. I do realize this is probably a lot of work, this is why I had been refraining from mentioning it. I don't feel I can implement this. Cheers, Ga?l From pgmdevlist at gmail.com Thu Feb 5 19:37:19 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Thu, 5 Feb 2009 19:37:19 -0500 Subject: [Numpy-discussion] Selection of only a certain number of fields In-Reply-To: <498B7181.5000300@enthought.com> References: <498B7181.5000300@enthought.com> Message-ID: <3D14D1EA-AC5C-4283-B9FB-137580E1BAB5@gmail.com> On Feb 5, 2009, at 6:08 PM, Travis E. Oliphant wrote: > > Hi all, > > I've been fairly quiet on this list for awhile due to work and family > schedule, but I think about how things can improve regularly. One > feature that's been requested by a few people is the ability to select > multiple fields from a structured array. > > [...] +1 for #2. Note that we now have a drop_fields function in np.lib.recfunctions, a reimplementation of the equivalent function in matplotlib. 
It works along the lines of your proposition #1 (create a new array w/ a new dtype and fill it) From ellisonbg.net at gmail.com Thu Feb 5 23:00:07 2009 From: ellisonbg.net at gmail.com (Brian Granger) Date: Thu, 5 Feb 2009 20:00:07 -0800 Subject: [Numpy-discussion] numscons/numpy.distutils bug related to MACOSX_DEPLOYMENT_TARGET In-Reply-To: <3d375d730902032132l33417bedia7dacf08f4e7996e@mail.gmail.com> References: <6ce0ac130902031612t376a1a59n431a08c899b3a517@mail.gmail.com> <3d375d730902031617t43726c8as3b9e8d46583f2152@mail.gmail.com> <6ce0ac130902031620t782b2387hbd2de5e8e0ced580@mail.gmail.com> <3d375d730902031623p6171dd09gbcc9bf75a793c8b1@mail.gmail.com> <6ce0ac130902031634ye40a829gbac28740fa281231@mail.gmail.com> <3d375d730902031642o65baa95ciebde9b5cc8f8b800@mail.gmail.com> <6ce0ac130902031653x2a304265t41b3c4f70bc257ef@mail.gmail.com> <3d375d730902031658l2b727ad2t3cfbaff32865e013@mail.gmail.com> <6ce0ac130902032122i1ba31be3y9c2d034ee0a1d2fc@mail.gmail.com> <3d375d730902032132l33417bedia7dacf08f4e7996e@mail.gmail.com> Message-ID: <6ce0ac130902052000x761c47cen46a3815c2b741713@mail.gmail.com> Robert, Can you have a look at the following fix and see if it is satisfactory? http://github.com/ellisonbg/numpy/blob/81360e93968968dc9dcbafd7895da7cec5015a3c/numpy/distutils/fcompiler/gnu.py Brian On Tue, Feb 3, 2009 at 9:32 PM, Robert Kern wrote: > On Tue, Feb 3, 2009 at 23:22, Brian Granger wrote: >>> 1) Trust the environment variable if given and let distutils raise its >>> error message (why not raise it ourselves? distutils' error message >>> and explanation is already out in THE GOOGLE.) >>> >>> 2) Otherwise, use the value in the Makefile if it's there. >>> >>> 3) If it's not even in the Makefile for whatever reason, go with 10.3. >> >> Sounds good, do you want to me work up a patch? > > Yes, please. > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." > -- Umberto Eco > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From oliphant at enthought.com Thu Feb 5 23:09:26 2009 From: oliphant at enthought.com (Travis Oliphant) Date: Thu, 05 Feb 2009 22:09:26 -0600 Subject: [Numpy-discussion] Selection of only a certain number of fields In-Reply-To: <3D14D1EA-AC5C-4283-B9FB-137580E1BAB5@gmail.com> References: <498B7181.5000300@enthought.com> <3D14D1EA-AC5C-4283-B9FB-137580E1BAB5@gmail.com> Message-ID: <498BB7F6.6040603@enthought.com> Pierre GM wrote: > On Feb 5, 2009, at 6:08 PM, Travis E. Oliphant wrote: > > >> Hi all, >> >> I've been fairly quiet on this list for awhile due to work and family >> schedule, but I think about how things can improve regularly. One >> feature that's been requested by a few people is the ability to select >> multiple fields from a structured array. >> > > >> [...] >> > > +1 for #2. > > Note that we now have a drop_fields function in np.lib.recfunctions, a > reimplementation of the equivalent function in matplotlib. It works > along the lines of your proposition #1 (create a new array w/ a new > dtype and fill it) > After more thought, I think I was too eager in my suggestion of #2. It's actually not really possible to do a view the way I would want it to work. It would be possible to create a data-type with hidden-fields, but a copy would be not "get rid of the extra data". 
Thus newarr = arr[['name', 'age']].copy() would be exactly the same size as arr because elements are copied wholesale and each "row" is a single element in the NumPy array. Some infrastructure would have to be implemented at a fundamental level to handle partial-element manipulation similar at least in spirit to what is needed to handle bit-level striding on a fundamental level. Also, I don't remember if we resolved how hidden fields would be shown in the array interface. So, I think that we may be stuck with #1 which at least is consistent with the "fancy-indexing" is a copy pattern (and is just syntatic sugar for capability you've already implemented in recfunctions). -Travis From robert.kern at gmail.com Thu Feb 5 23:13:31 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 5 Feb 2009 22:13:31 -0600 Subject: [Numpy-discussion] numscons/numpy.distutils bug related to MACOSX_DEPLOYMENT_TARGET In-Reply-To: <6ce0ac130902052000x761c47cen46a3815c2b741713@mail.gmail.com> References: <6ce0ac130902031612t376a1a59n431a08c899b3a517@mail.gmail.com> <6ce0ac130902031620t782b2387hbd2de5e8e0ced580@mail.gmail.com> <3d375d730902031623p6171dd09gbcc9bf75a793c8b1@mail.gmail.com> <6ce0ac130902031634ye40a829gbac28740fa281231@mail.gmail.com> <3d375d730902031642o65baa95ciebde9b5cc8f8b800@mail.gmail.com> <6ce0ac130902031653x2a304265t41b3c4f70bc257ef@mail.gmail.com> <3d375d730902031658l2b727ad2t3cfbaff32865e013@mail.gmail.com> <6ce0ac130902032122i1ba31be3y9c2d034ee0a1d2fc@mail.gmail.com> <3d375d730902032132l33417bedia7dacf08f4e7996e@mail.gmail.com> <6ce0ac130902052000x761c47cen46a3815c2b741713@mail.gmail.com> Message-ID: <3d375d730902052013t49f5fbechb89bb8e1332bfb07@mail.gmail.com> On Thu, Feb 5, 2009 at 22:00, Brian Granger wrote: > Robert, > > Can you have a look at the following fix and see if it is satisfactory? > > http://github.com/ellisonbg/numpy/blob/81360e93968968dc9dcbafd7895da7cec5015a3c/numpy/distutils/fcompiler/gnu.py Looks good. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From oliphant at enthought.com Thu Feb 5 23:17:54 2009 From: oliphant at enthought.com (Travis Oliphant) Date: Thu, 05 Feb 2009 22:17:54 -0600 Subject: [Numpy-discussion] PEP: named axis In-Reply-To: <20090205231618.GA21014@phare.normalesup.org> References: <498B7181.5000300@enthought.com> <20090205231618.GA21014@phare.normalesup.org> Message-ID: <498BB9F2.5080501@enthought.com> Gael Varoquaux wrote: > On Thu, Feb 05, 2009 at 05:08:49PM -0600, Travis E. Oliphant wrote: > >> I've been fairly quiet on this list for awhile due to work and family >> schedule, but I think about how things can improve regularly. One >> feature that's been requested by a few people is the ability to select >> multiple fields from a structured array. >> > > Hey Travis, > > I have no opinion on the above, as I don't have this use case. However, as > you are talking about implementing something, I jump on the occasion to > suggest another gadget, slightly related: I would like named axis. > Suppose you have a 5D array, I would like to be able to give each axis > names, eg (to chose an example you might be familiar with) ('Frontal', > 'Lateral', 'Axial', 'Time', 'Subjects'). And if this could be understood > be numpy operations (say ufuncs and fancy indexing) so that I could do (a > is my 5D array): > > This could be implemented but would require adding information to the NumPy array. 
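For what it's worth, the bookkeeping Gael describes can be prototyped in pure Python with a subclass. A rough toy sketch (mean() is the only method instrumented here, and it says nothing about how this would be done at the C level):

import numpy as np

class NamedAxes(np.ndarray):
    """Toy subclass: carry a tuple of axis names, accept them in mean()."""
    def __new__(cls, data, axes):
        obj = np.asarray(data).view(cls)
        obj.axes = tuple(axes)
        return obj

    def mean(self, axis=None):
        if isinstance(axis, str):
            i = self.axes.index(axis)              # name -> position
            rest = self.axes[:i] + self.axes[i + 1:]
            return NamedAxes(np.asarray(self).mean(axis=i), rest)
        return np.asarray(self).mean(axis=axis)

>>> a = NamedAxes(np.zeros((3, 4, 5)), ('Time', 'Lateral', 'Subjects'))
>>> a.mean(axis='Time').axes
('Lateral', 'Subjects')

Doing this for every axis-taking function and method is the hard part, of course, which is what makes a general mechanism attractive.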
I've been thinking for a long time that we ought to add a "dictionary" attribute to the NumPy array (i.e. a new member to the PyArrayObject data-structure). A lot of subclasses of NumPy arrays just add meta-information that could be stored there. Then, it would be a trivial thing to check to see if the dictionary had, say, an "axis_mapping" keyword, and if so then do the conversions found there.

I think this has been brought up before, though. What do people think about adding a default dictionary to every instance of a NumPy array?

The question that always arises in this context, and which I don't have good answers for, is what to do with the dictionary on the output of ufuncs? One approach is to always return NULL for the dictionary and not try to guess. A slightly different one is to at least handle the case where all inputs have the same dictionary and return a new "shallow" copy of that.

-Travis

From ellisonbg.net at gmail.com Thu Feb 5 23:19:22 2009
From: ellisonbg.net at gmail.com (Brian Granger)
Date: Thu, 5 Feb 2009 20:19:22 -0800
Subject: [Numpy-discussion] numscons/numpy.distutils bug related to MACOSX_DEPLOYMENT_TARGET
In-Reply-To: <3d375d730902052013t49f5fbechb89bb8e1332bfb07@mail.gmail.com>
References: <6ce0ac130902031612t376a1a59n431a08c899b3a517@mail.gmail.com> <3d375d730902031623p6171dd09gbcc9bf75a793c8b1@mail.gmail.com> <6ce0ac130902031634ye40a829gbac28740fa281231@mail.gmail.com> <3d375d730902031642o65baa95ciebde9b5cc8f8b800@mail.gmail.com> <6ce0ac130902031653x2a304265t41b3c4f70bc257ef@mail.gmail.com> <3d375d730902031658l2b727ad2t3cfbaff32865e013@mail.gmail.com> <6ce0ac130902032122i1ba31be3y9c2d034ee0a1d2fc@mail.gmail.com> <3d375d730902032132l33417bedia7dacf08f4e7996e@mail.gmail.com> <6ce0ac130902052000x761c47cen46a3815c2b741713@mail.gmail.com> <3d375d730902052013t49f5fbechb89bb8e1332bfb07@mail.gmail.com>
Message-ID: <6ce0ac130902052019ne743034of817f02e31312f1f@mail.gmail.com>

Great, what is the best way of rolling this into numpy?

Brian

On Thu, Feb 5, 2009 at 8:13 PM, Robert Kern wrote:
> On Thu, Feb 5, 2009 at 22:00, Brian Granger wrote:
>> Robert,
>>
>> Can you have a look at the following fix and see if it is satisfactory?
>>
>> http://github.com/ellisonbg/numpy/blob/81360e93968968dc9dcbafd7895da7cec5015a3c/numpy/distutils/fcompiler/gnu.py
>
> Looks good.
>
> --
> Robert Kern
>
> "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From robert.kern at gmail.com Thu Feb 5 23:29:42 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 5 Feb 2009 22:29:42 -0600
Subject: [Numpy-discussion] PEP: named axis
In-Reply-To: <498BB9F2.5080501@enthought.com>
References: <498B7181.5000300@enthought.com> <20090205231618.GA21014@phare.normalesup.org> <498BB9F2.5080501@enthought.com>
Message-ID: <3d375d730902052029l21f675d1leecb0d405db5dc23@mail.gmail.com>

On Thu, Feb 5, 2009 at 22:17, Travis Oliphant wrote:
> Gael Varoquaux wrote:
>> On Thu, Feb 05, 2009 at 05:08:49PM -0600, Travis E. Oliphant wrote:
>>
>>> I've been fairly quiet on this list for awhile due to work and family schedule, but I think about how things can improve regularly.
>>> One feature that's been requested by a few people is the ability to select multiple fields from a structured array.
>>
>> Hey Travis,
>>
>> I have no opinion on the above, as I don't have this use case. However, as you are talking about implementing something, I jump on the occasion to suggest another gadget, slightly related: I would like named axes. Suppose you have a 5D array; I would like to be able to give each axis a name, eg (to choose an example you might be familiar with) ('Frontal', 'Lateral', 'Axial', 'Time', 'Subjects'). And if this could be understood by numpy operations (say ufuncs and fancy indexing) so that I could do (a is my 5D array):
>>
>> >>> b = a.mean(axis='Time')
>> >>> b.axis
>> ('Frontal', 'Lateral', 'Axial', 'Subjects')
>
> This could be implemented but would require adding information to the NumPy array.

More than that, though. Every function and method that takes an axis or reduces an axis will need to be rewritten. For that reason, I'm -1 on the proposal.

> I've been thinking for a long time that we ought to add a "dictionary" attribute to the NumPy array (i.e. a new member to the PyArrayObject data-structure). A lot of subclasses of NumPy arrays just add meta-information that could be stored there.
>
> Then, it would be a trivial thing to check to see if the dictionary had, say, an "axis_mapping" keyword, and if so then do the conversions found there.
>
> I think this has been brought up before, though. What do people think about adding a default dictionary to every instance of a NumPy array?
>
> The question that always arises in this context, and which I don't have good answers for, is what to do with the dictionary on the output of ufuncs? One approach is to always return NULL for the dictionary and not try to guess. A slightly different one is to at least handle the case where all inputs have the same dictionary and return a new "shallow" copy of that.

I'm of the opinion that it should never guess. We have no idea what semantics are being placed on the dict. Even in the case where all of the inputs have the same dict, the operation may easily invalidate the metadata. For example, a reduction on one of these axis-decorated arrays would make the axis labels incorrect.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From Chris.Barker at noaa.gov Fri Feb 6 01:13:54 2009
From: Chris.Barker at noaa.gov (Christopher Barker)
Date: Thu, 05 Feb 2009 22:13:54 -0800
Subject: [Numpy-discussion] PEP: named axis
In-Reply-To: <498BB9F2.5080501@enthought.com>
References: <498B7181.5000300@enthought.com> <20090205231618.GA21014@phare.normalesup.org> <498BB9F2.5080501@enthought.com>
Message-ID: <498BD522.8090508@noaa.gov>

Travis Oliphant wrote:
> What do people think about adding a default dictionary to every instance of a NumPy array?

It sounds kind of heavyweight to me. I tend to use lots of small arrays (to represent an x,y point, for instance). There are enough performance issues with that as it stands. Maybe an empty dict isn't much, but it is extra.

-Chris

--
Christopher Barker, Ph.D.
Oceanographer

NOAA/OR&R/HAZMAT         (206) 526-6959  voice
7600 Sand Point Way NE   (206) 526-6329  fax
Seattle, WA 98115        (206) 526-6317  main reception

From robert.kern at gmail.com Fri Feb 6 01:20:37 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 6 Feb 2009 00:20:37 -0600
Subject: [Numpy-discussion] PEP: named axis
In-Reply-To: <498BD522.8090508@noaa.gov>
References: <498B7181.5000300@enthought.com> <20090205231618.GA21014@phare.normalesup.org> <498BB9F2.5080501@enthought.com> <498BD522.8090508@noaa.gov>
Message-ID: <3d375d730902052220i34972e78r2605a4fb376d440f@mail.gmail.com>

On Fri, Feb 6, 2009 at 00:13, Christopher Barker wrote:
> Travis Oliphant wrote:
>> What do people think about adding a default dictionary to every instance of a NumPy array?
>
> It sounds kind of heavyweight to me. I tend to use lots of small arrays (to represent an x,y point, for instance). There are enough performance issues with that as it stands. Maybe an empty dict isn't much, but it is extra.

I think we can create the dict on demand, so there will be no overhead except for the space for the pointer in the struct.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From olivier.grisel at ensta.org Fri Feb 6 02:33:11 2009
From: olivier.grisel at ensta.org (Olivier Grisel)
Date: Fri, 6 Feb 2009 08:33:11 +0100
Subject: [Numpy-discussion] PEP: named axis (was: Selection of only a certain number of fields)
In-Reply-To: <20090205231618.GA21014@phare.normalesup.org>
References: <498B7181.5000300@enthought.com> <20090205231618.GA21014@phare.normalesup.org>
Message-ID:

+1

On Feb 6, 2009 12:16 AM, "Gael Varoquaux" wrote:

On Thu, Feb 05, 2009 at 05:08:49PM -0600, Travis E. Oliphant wrote:
> I've been fairly quiet on this list for awhile due to work and family schedule, but I think about how things can improve regularly. One feature that's been requested by a few people is the ability to select multiple fields from a structured array.

Hey Travis,

I have no opinion on the above, as I don't have this use case. However, as you are talking about implementing something, I jump on the occasion to suggest another gadget, slightly related: I would like named axes. Suppose you have a 5D array; I would like to be able to give each axis a name, eg (to choose an example you might be familiar with) ('Frontal', 'Lateral', 'Axial', 'Time', 'Subjects'). And if this could be understood by numpy operations (say ufuncs and fancy indexing) so that I could do (a is my 5D array):

>>> b = a.mean(axis='Time')
>>> b.axis
('Frontal', 'Lateral', 'Axial', 'Subjects')

I believe this would make a big difference for people working with n-dimensional arrays, where n is large. I do realize this is probably a lot of work, which is why I had been refraining from mentioning it. I don't feel I can implement this.

Cheers,

Gaël

_______________________________________________
Numpy-discussion mailing list
Numpy-discussion at scipy.org
http://projects.scipy.org/mailman/listinfo/numpy-discussion

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From faltet at pytables.org Fri Feb 6 02:49:48 2009
From: faltet at pytables.org (Francesc Alted)
Date: Fri, 6 Feb 2009 08:49:48 +0100
Subject: [Numpy-discussion] Selection of only a certain number of fields
In-Reply-To: <498BB7F6.6040603@enthought.com>
References: <498B7181.5000300@enthought.com> <3D14D1EA-AC5C-4283-B9FB-137580E1BAB5@gmail.com> <498BB7F6.6040603@enthought.com>
Message-ID: <200902060849.48610.faltet@pytables.org>

A Friday 06 February 2009, Travis Oliphant escrigué:
> Pierre GM wrote:
>> On Feb 5, 2009, at 6:08 PM, Travis E. Oliphant wrote:
>>> Hi all,
>>>
>>> I've been fairly quiet on this list for awhile due to work and family schedule, but I think about how things can improve regularly. One feature that's been requested by a few people is the ability to select multiple fields from a structured array.
>>>
>>> [...]
>>
>> +1 for #2.
>>
>> Note that we now have a drop_fields function in np.lib.recfunctions, a reimplementation of the equivalent function in matplotlib. It works along the lines of your proposition #1 (create a new array w/ a new dtype and fill it)
>
> After more thought, I think I was too eager in my suggestion of #2. It's actually not really possible to do a view the way I would want it to work. It would be possible to create a data-type with hidden fields, but a copy would not "get rid of the extra data".
>
> Thus newarr = arr[['name', 'age']].copy() would be exactly the same size as arr, because elements are copied wholesale and each "row" is a single element in the NumPy array. Some infrastructure would have to be implemented at a fundamental level to handle partial-element manipulation, similar at least in spirit to what is needed to handle bit-level striding on a fundamental level.
>
> Also, I don't remember if we resolved how hidden fields would be shown in the array interface.
>
> So, I think that we may be stuck with #1, which at least is consistent with the "fancy indexing is a copy" pattern (and is just syntactic sugar for capability you've already implemented in recfunctions).

Mmh, I'd also vote for #2 for performance reasons, but as the implementation seems quite involved, I suppose that #1 would be great too.

Cheers,

--
Francesc Alted

From gael.varoquaux at normalesup.org Fri Feb 6 03:09:38 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Fri, 6 Feb 2009 09:09:38 +0100
Subject: [Numpy-discussion] PEP: named axis
In-Reply-To: <3d375d730902052029l21f675d1leecb0d405db5dc23@mail.gmail.com>
References: <498B7181.5000300@enthought.com> <20090205231618.GA21014@phare.normalesup.org> <498BB9F2.5080501@enthought.com> <3d375d730902052029l21f675d1leecb0d405db5dc23@mail.gmail.com>
Message-ID: <20090206080938.GA7464@phare.normalesup.org>

On Thu, Feb 05, 2009 at 10:29:42PM -0600, Robert Kern wrote:
> >> I have no opinion on the above, as I don't have this use case. However, as you are talking about implementing something, I jump on the occasion to suggest another gadget, slightly related: I would like named axes. Suppose you have a 5D array; I would like to be able to give each axis a name, eg (to choose an example you might be familiar with) ('Frontal', 'Lateral', 'Axial', 'Time', 'Subjects'). And if this could be understood by numpy operations (say ufuncs and fancy indexing) so that I could do (a is my 5D array):

> > This could be implemented but would require adding information to the NumPy array.

> More than that, though.
> Every function and method that takes an axis or reduces an axis will need to be rewritten. For that reason, I'm -1 on the proposal.

Yes, this is the reason why this proposition is actually a lot of work. This is also what makes it interesting. Sticking information on a numpy array is useful, but it does not achieve anything that is not feasible without the help of numpy. The proposition is about much more than that.

Gaël

From gael.varoquaux at normalesup.org Fri Feb 6 03:10:28 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Fri, 6 Feb 2009 09:10:28 +0100
Subject: [Numpy-discussion] PEP: named axis
In-Reply-To: <3d375d730902052220i34972e78r2605a4fb376d440f@mail.gmail.com>
References: <498B7181.5000300@enthought.com> <20090205231618.GA21014@phare.normalesup.org> <498BB9F2.5080501@enthought.com> <498BD522.8090508@noaa.gov> <3d375d730902052220i34972e78r2605a4fb376d440f@mail.gmail.com>
Message-ID: <20090206081028.GB7464@phare.normalesup.org>

On Fri, Feb 06, 2009 at 12:20:37AM -0600, Robert Kern wrote:
> On Fri, Feb 6, 2009 at 00:13, Christopher Barker wrote:
> > Travis Oliphant wrote:
> >> What do people think about adding a default dictionary to every instance of a NumPy array?

> > It sounds kind of heavyweight to me. I tend to use lots of small arrays (to represent an x,y point, for instance). There are enough performance issues with that as it stands. Maybe an empty dict isn't much, but it is extra.

> I think we can create the dict on demand, so there will be no overhead except for the space for the pointer in the struct.

I am +1 for the dict created on demand. It seems like a great idea to help subclassing.

Gaël

From stefan at sun.ac.za Fri Feb 6 04:12:31 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Fri, 6 Feb 2009 11:12:31 +0200
Subject: [Numpy-discussion] Selection of only a certain number of fields
In-Reply-To: <498BB7F6.6040603@enthought.com>
References: <498B7181.5000300@enthought.com> <3D14D1EA-AC5C-4283-B9FB-137580E1BAB5@gmail.com> <498BB7F6.6040603@enthought.com>
Message-ID: <9457e7c80902060112t4bf30199h3d41b529c7d53b26@mail.gmail.com>

Hi Travis

2009/2/6 Travis Oliphant :
> Thus newarr = arr[['name', 'age']].copy() would be exactly the same size as arr, because elements are copied wholesale and each "row" is a single element in the NumPy array. Some infrastructure would have to be implemented at a fundamental level to handle partial-element manipulation, similar at least in spirit to what is needed to handle bit-level striding on a fundamental level.

I like your suggestion! Can you think of a way to implement #2 with the correct copy semantics? Being able to create a view without copying is such a big plus that it is worth considering, even at an implementation cost.

Regards
Stéfan

From stefan at sun.ac.za Fri Feb 6 04:22:20 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Fri, 6 Feb 2009 11:22:20 +0200
Subject: [Numpy-discussion] PEP: named axis
In-Reply-To: <3d375d730902052029l21f675d1leecb0d405db5dc23@mail.gmail.com>
References: <498B7181.5000300@enthought.com> <20090205231618.GA21014@phare.normalesup.org> <498BB9F2.5080501@enthought.com> <3d375d730902052029l21f675d1leecb0d405db5dc23@mail.gmail.com>
Message-ID: <9457e7c80902060122o365cbc28mf5a0bc22d5bc6f6d@mail.gmail.com>

Hi Robert

2009/2/6 Robert Kern :
>> This could be implemented but would require adding information to the NumPy array.
>
> More than that, though.
> Every function and method that takes an axis or reduces an axis will need to be rewritten. For that reason, I'm -1 on the proposal.

Are you -1 on the array dictionary, or on using it to do axis mapping? I would imagine that Gael would be happier even if he had to do

axis = x.meta.axis['Lateral']
some_func(x, axis)

> I'm of the opinion that it should never guess. We have no idea what semantics are being placed on the dict. Even in the case where all of the inputs have the same dict, the operation may easily invalidate the metadata. For example, a reduction on one of these axis-decorated arrays would make the axis labels incorrect.

That's a good point. So what would be a sane way of propagating meta-data? If we don't want to make any assumptions, it becomes the user's responsibility to do it manually.

Cheers
Stéfan

From dsdale24 at gmail.com Fri Feb 6 08:48:47 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Fri, 6 Feb 2009 08:48:47 -0500
Subject: [Numpy-discussion] PEP: named axis
In-Reply-To: <9457e7c80902060122o365cbc28mf5a0bc22d5bc6f6d@mail.gmail.com>
References: <498B7181.5000300@enthought.com> <20090205231618.GA21014@phare.normalesup.org> <498BB9F2.5080501@enthought.com> <3d375d730902052029l21f675d1leecb0d405db5dc23@mail.gmail.com> <9457e7c80902060122o365cbc28mf5a0bc22d5bc6f6d@mail.gmail.com>
Message-ID:

On Fri, Feb 6, 2009 at 4:22 AM, Stéfan van der Walt wrote:
> Hi Robert
>
> 2009/2/6 Robert Kern :
>>> This could be implemented but would require adding information to the NumPy array.
>>
>> More than that, though. Every function and method that takes an axis or reduces an axis will need to be rewritten. For that reason, I'm -1 on the proposal.
>
> Are you -1 on the array dictionary, or on using it to do axis mapping? I would imagine that Gael would be happier even if he had to do
>
> axis = x.meta.axis['Lateral']
> some_func(x, axis)
>
>> I'm of the opinion that it should never guess. We have no idea what semantics are being placed on the dict. Even in the case where all of the inputs have the same dict, the operation may easily invalidate the metadata. For example, a reduction on one of these axis-decorated arrays would make the axis labels incorrect.
>
> That's a good point. So what would be a sane way of propagating meta-data? If we don't want to make any assumptions, it becomes the user's responsibility to do it manually.

In which case they end up writing a subclass to propagate only that portion of the dict that they were using.

I'll add another example where a subclass would be needed to propagate the metadata: physical quantities. I have a package (nearly ready to submit to this list for request for comment) that uses a dict subclass to describe the dimensionality of a quantity, like {m:1, s:-1}. I don't think this metadata dictionary would be useful for quantities, since two arrays may have the same dimensionality yet the propagated dimensionality depends on the operation: {m:1, s:-1} for addition, {m:2, s:-2} for multiplication, {} for division.

Darren

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From bsouthey at gmail.com Fri Feb 6 10:31:47 2009
From: bsouthey at gmail.com (Bruce Southey)
Date: Fri, 06 Feb 2009 09:31:47 -0600
Subject: [Numpy-discussion] PEP: named axis
In-Reply-To:
References: <498B7181.5000300@enthought.com> <20090205231618.GA21014@phare.normalesup.org>
Message-ID: <498C57E3.6090507@gmail.com>

Hi,

+1 on the idea but how will this work with other numpy methods?
suppose *arr* is a structured array with dtype:

[('name', 'S25'),
 ('height', float),
 ('age', int),
 ('gender', 'S8')
]

Would you be able to first define a list of columns, such as

cols = ['height', 'age']
arr[cols]

This would be a handy feature.

For example, for some compatible array A, would you be able to do the following with a view?

np.linalg.lstsq(A, arr[['age']])
np.linalg.lstsq(A, arr[['height', 'age']])

Bruce

From faltet at pytables.org Fri Feb 6 10:42:01 2009
From: faltet at pytables.org (Francesc Alted)
Date: Fri, 6 Feb 2009 16:42:01 +0100
Subject: [Numpy-discussion] How to guess the compiler type from distutils?
Message-ID: <200902061642.01694.faltet@pytables.org>

Hi,

We would like to use the numpy.distutils machinery for numexpr and I'd like to add different compiler flags depending on the compiler used. Does anybody know a simple way to do this with numpy.distutils (or plain distutils)?

Thanks,

--
Francesc Alted

From faltet at pytables.org Fri Feb 6 11:27:45 2009
From: faltet at pytables.org (Francesc Alted)
Date: Fri, 6 Feb 2009 17:27:45 +0100
Subject: [Numpy-discussion] How to guess the compiler type from distutils?
In-Reply-To: <200902061642.01694.faltet@pytables.org>
References: <200902061642.01694.faltet@pytables.org>
Message-ID: <200902061727.45553.faltet@pytables.org>

A Friday 06 February 2009, Francesc Alted escrigué:
> Hi,
>
> We would like to use the numpy.distutils machinery for numexpr and I'd like to add different compiler flags depending on the compiler used. Does anybody know a simple way to do this with numpy.distutils (or plain distutils)?

I've figured out how to do it. For the record, you simply have to import this:

from numpy.distutils.command.build_ext import build_ext as numpy_build_ext

and then define the following class:

class build_ext(numpy_build_ext):
    def build_extension(self, ext):
        # at this point we know what the C compiler is.
        c = self.compiler
        old_compile_options = None
        # For MS Visual C, we use /O1 instead of the default /Ox,
        # as /Ox takes a long time (~5 mins) to compile.
        # The speed of the code isn't noticeably different.
        if c.compiler_type == 'msvc':
            if not c.initialized:
                c.initialize()
            old_compile_options = c.compile_options[:]
            if '/Ox' in c.compile_options:
                c.compile_options.remove('/Ox')
                c.compile_options.append('/O1')
            ext.extra_compile_args = []
        numpy_build_ext.build_extension(self, ext)
        if old_compile_options is not None:
            self.compiler.compile_options = old_compile_options

where you can tailor compiler options at your will.

Cheers,

--
Francesc Alted

From reakinator at gmail.com Fri Feb 6 11:29:52 2009
From: reakinator at gmail.com (Rich E)
Date: Fri, 6 Feb 2009 17:29:52 +0100
Subject: [Numpy-discussion] default float type of array not accepted by SWIG wrapped C functions
In-Reply-To: <875E04698300DB4FA52B4219ABA6039B126E9D05BD@ES02SNLNT.srn.sandia.gov>
References: <875E04698300DB4FA52B4219ABA6039B126E9D05BD@ES02SNLNT.srn.sandia.gov>
Message-ID:

I ended up solving my problem in SWIG, so I might as well post it here. I just made my own 'array' and 'zeros' functions with floating point precision as follows:

%pythoncode %{
from numpy import array as np_array
def array (n, type='float32'):
    return(np_array(n, type))

from numpy import zeros as np_zeros
def zeros (n, type='float32'):
    return(np_zeros(n, type))
%}

Pretty basic, I know. But it cuts down on a lot of unnecessary code.

- Rich

On Thu, Jan 22, 2009 at 6:09 PM, Spotz, William F wrote:
> Rich,
>
> Basic python only supports double precision floats, so that is not an option.
> NumPy does not have, as far as I know, a way to set the default precision, although it might be a reasonable request.
>
> As for the SWIG interface file, almost anything is possible. Can you give an example of a function prototype you are wrapping, the %apply directive you use, and an example of python code accessing it?
>
> -Bill
> ________________________________________
> From: numpy-discussion-bounces at scipy.org [numpy-discussion-bounces at scipy.org] On Behalf Of Rich E [reakinator at gmail.com]
> Sent: Thursday, January 22, 2009 11:45 AM
> To: Discussion of Numerical Python
> Subject: [Numpy-discussion] default float type of array not accepted by SWIG wrapped C functions
>
> Hi all,
>
> I have a SWIG wrapped C library that uses 32bit floating point arrays, using the numpy.i typemapping system for passing the arrays. For every array that I make, I have to convert it using astype('float32'), else python complains that I tried to pass a double-precision array.
>
> Is there any way to set the default floating point precision to 32bit, in python or in the SWIG interface file?
>
> regards,
> Rich
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From Chris.Barker at noaa.gov Fri Feb 6 12:09:35 2009
From: Chris.Barker at noaa.gov (Christopher Barker)
Date: Fri, 06 Feb 2009 09:09:35 -0800
Subject: [Numpy-discussion] PEP: named axis
In-Reply-To:
References: <498B7181.5000300@enthought.com> <20090205231618.GA21014@phare.normalesup.org> <498BB9F2.5080501@enthought.com> <3d375d730902052029l21f675d1leecb0d405db5dc23@mail.gmail.com> <9457e7c80902060122o365cbc28mf5a0bc22d5bc6f6d@mail.gmail.com>
Message-ID: <498C6ECF.1090408@noaa.gov>

Darren Dale wrote:
> I have a package (nearly ready to submit to this list for request for comment) that uses a dict subclass to describe the dimensionality of a quantity, like {m:1, s:-1}.

I'm looking forward to that -- you may have just saved me a bunch of coding!

-Chris

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959  voice
7600 Sand Point Way NE   (206) 526-6329  fax
Seattle, WA 98115        (206) 526-6317  main reception

Chris.Barker at noaa.gov

From dsdale24 at gmail.com Fri Feb 6 12:53:09 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Fri, 6 Feb 2009 12:53:09 -0500
Subject: [Numpy-discussion] PEP: named axis
In-Reply-To: <498C6ECF.1090408@noaa.gov>
References: <498B7181.5000300@enthought.com> <20090205231618.GA21014@phare.normalesup.org> <498BB9F2.5080501@enthought.com> <3d375d730902052029l21f675d1leecb0d405db5dc23@mail.gmail.com> <9457e7c80902060122o365cbc28mf5a0bc22d5bc6f6d@mail.gmail.com> <498C6ECF.1090408@noaa.gov>
Message-ID:

On Fri, Feb 6, 2009 at 12:09 PM, Christopher Barker wrote:
> Darren Dale wrote:
>> I have a package (nearly ready to submit to this list for request for comment) that uses a dict subclass to describe the dimensionality of a quantity, like {m:1, s:-1}.
>
> I'm looking forward to that -- you may have just saved me a bunch of coding!

There is an alpha available at packages.python.org/quantities, and a development site at launchpad.net/python-quantities. There is a link to a mailing list at the launchpad site as well.
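The dimensionality bookkeeping described above ({m: 1, s: -1} and friends) can be sketched with a plain dict; this is an illustration of the idea only, not the package's actual class:

def propagate_mul(d1, d2):
    # {m: 1, s: -1} * {m: 1, s: -1} -> {m: 2, s: -2}; cancelled units drop out
    out = dict(d1)
    for unit, power in d2.items():
        out[unit] = out.get(unit, 0) + power
        if out[unit] == 0:
            del out[unit]
    return out

print(propagate_mul({'m': 1, 's': -1}, {'m': 1, 's': -1}))   # {'m': 2, 's': -2}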
I think there is still a couple of weekends' worth of effort on documentation, unit testing, and ufuncs before it is really ready for further discussion here.

Darren

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From suchindra at gmail.com Fri Feb 6 14:24:37 2009
From: suchindra at gmail.com (Suchindra Sandhu)
Date: Fri, 6 Feb 2009 14:24:37 -0500
Subject: [Numpy-discussion] numpy.any oddity
Message-ID:

Hi,

I accidentally stumbled upon this odd behavior by numpy.any. The following code leaks memory -

for i in xrange(10000000):
    print N.any({'whatever': N.arange(10000000)})

Of course, I called "any" on a dict object by accident, but it should not really leak memory.

I am running numpy version 1.0.4 with python 2.5.2

Cheers,
Suchindra

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From kbasye1 at jhu.edu Fri Feb 6 14:00:10 2009
From: kbasye1 at jhu.edu (Ken Basye)
Date: Fri, 06 Feb 2009 14:00:10 -0500
Subject: [Numpy-discussion] Can I fill an existing array from an iterator?
Message-ID: <498C88BA.5010300@jhu.edu>

Hi Folks,

I wonder if there's a way to fill an existing array from an iterator without creating a temporary array. That is, I'm looking for something that has the effect of

>>> target = np.array(xrange(9), dtype = float)
>>> target[:] = np.fromiter(repeat(3.14159, 9), dtype=float)

without creating a second array object in the second line. The closest I got was this:

>>> target[:] = xrange(-9, 0)
>>> target[:] = tuple(repeat(5.5, 9))

Note that xrange isn't really an iterator, and appears to be handled specially. Emptying the iterator into a tuple works, but I was hoping for something even more direct.

If this isn't possible, I wonder what people would think of adding an optional 'out' argument to the 'fromXXX' functions which behaved like the 'out' argument to sum() and other functions.

Thanks,
Ken

From nwagner at iam.uni-stuttgart.de Fri Feb 6 15:16:28 2009
From: nwagner at iam.uni-stuttgart.de (Nils Wagner)
Date: Fri, 06 Feb 2009 21:16:28 +0100
Subject: [Numpy-discussion] xblas and numpy
Message-ID:

Hi all,

Just curious. Is it possible to use xblas with numpy?

http://www.netlib.org/xblas/

Nils

From robert.kern at gmail.com Fri Feb 6 15:39:45 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 6 Feb 2009 14:39:45 -0600
Subject: [Numpy-discussion] numpy.any oddity
In-Reply-To:
References:
Message-ID: <3d375d730902061239p791234e7k8d0f9d04e2a66464@mail.gmail.com>

On Fri, Feb 6, 2009 at 13:24, Suchindra Sandhu wrote:
> Hi,
>
> I accidentally stumbled upon this odd behavior by numpy.any. The following code leaks memory -
>
> for i in xrange(10000000):
>     print N.any({'whatever': N.arange(10000000)})
>
> Of course, I called "any" on a dict object by accident, but it should not really leak memory.
>
> I am running numpy version 1.0.4 with python 2.5.2

Upgrade to a more recent version of numpy. I do not see a leak. I vaguely recall there being a problem in the 1.0.x series with object-dtype scalars, which is what numpy.any() will convert a dict to.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth."
-- Umberto Eco

From robert.kern at gmail.com Fri Feb 6 16:02:32 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 6 Feb 2009 15:02:32 -0600
Subject: [Numpy-discussion] PEP: named axis
In-Reply-To: <498C57E3.6090507@gmail.com>
References: <498B7181.5000300@enthought.com> <20090205231618.GA21014@phare.normalesup.org> <498C57E3.6090507@gmail.com>
Message-ID: <3d375d730902061302n104e7b73jdf70c220f42c9bcb@mail.gmail.com>

On Fri, Feb 6, 2009 at 09:31, Bruce Southey wrote:
> Hi,
> +1 on the idea but how will this work with other numpy methods?
>
> suppose *arr* is a structured array with dtype:
>
> [('name', 'S25'),
>  ('height', float),
>  ('age', int),
>  ('gender', 'S8')
> ]
>
> Would you be able to first define a list of columns, such as
>
> cols = ['height', 'age']
> arr[cols]
>
> This would be a handy feature.

Yes. ndarray.__getitem__() doesn't know anything about where the list of strings comes from.

> For example, for some compatible array A, would you be able to do the following with a view?
>
> np.linalg.lstsq(A, arr[['age']])
> np.linalg.lstsq(A, arr[['height', 'age']])

No, you'd still get a record array, which lstsq() doesn't know what to do with.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From michael.s.gilbert at gmail.com Fri Feb 6 16:24:18 2009
From: michael.s.gilbert at gmail.com (Michael S. Gilbert)
Date: Fri, 6 Feb 2009 16:24:18 -0500
Subject: [Numpy-discussion] Purpose for bit-wise and'ing the initial mersenne twister key?
Message-ID: <20090206162418.0d8dd677.michael.s.gilbert@gmail.com>

In numpy/random/mtrand/randomkit.c on line 159, the initial mersenne twister key (populated from /dev/urandom) gets bit-wise and'ed with 0xffffffff. I'm just curious as to why this is done. A bit-wise and with all ones should just give you your original quantity back, right? I don't think there is a problem since the operation doesn't really do anything, and the same thing exists in the mersenne twister reference code, but I am curious as to why it is even there in the first place. Thanks for any thoughts.

Regards,
Mike

From robert.kern at gmail.com Fri Feb 6 16:25:35 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 6 Feb 2009 15:25:35 -0600
Subject: [Numpy-discussion] Purpose for bit-wise and'ing the initial mersenne twister key?
In-Reply-To: <20090206162418.0d8dd677.michael.s.gilbert@gmail.com>
References: <20090206162418.0d8dd677.michael.s.gilbert@gmail.com>
Message-ID: <3d375d730902061325h190b1de4o572f2ca63fd34d63@mail.gmail.com>

On Fri, Feb 6, 2009 at 15:24, Michael S. Gilbert wrote:
> In numpy/random/mtrand/randomkit.c on line 159, the initial mersenne twister key (populated from /dev/urandom) gets bit-wise and'ed with 0xffffffff. I'm just curious as to why this is done. A bit-wise and with all ones should just give you your original quantity back, right? I don't think there is a problem since the operation doesn't really do anything, and the same thing exists in the mersenne twister reference code, but I am curious as to why it is even there in the first place. Thanks for any thoughts.

On most 64-bit machines, unsigned longs are 64 bits, so 0xffffffffUL is only 32 bits of 1s.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth."
-- Umberto Eco

From dsdale24 at gmail.com Fri Feb 6 16:25:42 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Fri, 6 Feb 2009 16:25:42 -0500
Subject: [Numpy-discussion] question about ufuncs
In-Reply-To:
References: <48A25BC8-FA0A-44EE-89E2-F6C9939E8A22@gmail.com>
Message-ID:

On Sun, Feb 1, 2009 at 7:39 PM, Darren Dale wrote:
> On Sun, Feb 1, 2009 at 7:33 PM, Pierre GM wrote:
>>
>> On Feb 1, 2009, at 6:32 PM, Darren Dale wrote:
>>>
>>> Is there an analog to __array_wrap__ for preprocessing arrays on their way *into* a ufunc? For example, it would be nice if one could do something like:
>>>
>>> numpy.sin([1,2,3]*arcseconds)
>>>
>>> where we have the opportunity to inspect the context, convert the Quantity to units of radians, and then actually call the ufunc. Is this possible, or does one have to reimplement such functions?
>>
>> Just an idea: look at the code for numpy.ma ufuncs (in numpy.ma.core). By defining a few classes for unary, binary and domained functions, you could probably do what you want, without having to recode all the functions by hand.
>> Another idea would be to define some specific __mul__ or __rmul__ rules for your units, so that the list would be transformed into a UnitArray...
>
> I have pretty good implementations of the arithmetic operators, so ([1,2,3]*m)*([4,5,6]*J) already works. numpy.multiply and numpy.sqrt needed help with array_wrap. I'll study your stuff in ma, thanks for the pointer.

I've been looking at how ma implements things like multiply() and MaskedArray.__mul__. I'm surprised that MaskedArray.__mul__ actually calls ma.multiply() rather than calling super(MaskedArray,self).__mul__(). Maybe that is the way ndarray does it, but I don't think this is the right approach for my quantity subclasses. If I want to make a MaskedQuantity (someday), MaskedQuantity.__mul__ should be calling super(MaskedQuantity,self).__mul__(), not reimplementations of numpy.multiply or ma.multiply, right?

As I understand it, the point of __array_wrap__ is to provide a mechanism such that ndarray subclasses could work with numpy's built-in ufuncs. In my case, I had been planning to use the calling ufunc as a key to find the appropriate way to propagate whatever metadata is associated with the subclass, for example:

def __array_wrap__(self, obj, context):
    try:
        result = super(Quantity,self).__array_wrap__(obj,context).view(type(self))
    except:
        result = obj.view(type(self))
    ufunc, objs, huh = context
    result._dimensionality = self._propagate_dimensionality[ufunc](objs)
    return result

where self._propagate_dimensionality is a dictionary of functions operating on Dimensionality objects. There are some cases where the default numpy function expects certain units on the way in, like the trig functions, which I think would have to be reimplemented. But aside from that, is there anything wrong with taking this approach? It seems to allow quantities to integrate pretty well with the numpy builtins.

Darren

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From robert.kern at gmail.com Fri Feb 6 16:30:36 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 6 Feb 2009 15:30:36 -0600
Subject: [Numpy-discussion] PEP: named axis
In-Reply-To: <9457e7c80902060122o365cbc28mf5a0bc22d5bc6f6d@mail.gmail.com>
References: <498B7181.5000300@enthought.com> <20090205231618.GA21014@phare.normalesup.org> <498BB9F2.5080501@enthought.com> <3d375d730902052029l21f675d1leecb0d405db5dc23@mail.gmail.com> <9457e7c80902060122o365cbc28mf5a0bc22d5bc6f6d@mail.gmail.com>
Message-ID: <3d375d730902061330r29c19ce7sb44d03e8b79616fa@mail.gmail.com>

On Fri, Feb 6, 2009 at 03:22, Stéfan van der Walt wrote:
> Hi Robert
>
> 2009/2/6 Robert Kern :
>>> This could be implemented but would require adding information to the NumPy array.
>>
>> More than that, though. Every function and method that takes an axis or reduces an axis will need to be rewritten. For that reason, I'm -1 on the proposal.
>
> Are you -1 on the array dictionary, or on using it to do axis mapping?

I'm -1 on rewriting every axis= argument to accept strings. I'm +1 on a generic metadata dict that does not implicitly propagate.

> I would imagine that Gael would be happier even if he had to do
>
> axis = x.meta.axis['Lateral']
> some_func(x, axis)

That's fine with me.

>> I'm of the opinion that it should never guess. We have no idea what semantics are being placed on the dict. Even in the case where all of the inputs have the same dict, the operation may easily invalidate the metadata. For example, a reduction on one of these axis-decorated arrays would make the axis labels incorrect.
>
> That's a good point. So what would be a sane way of propagating meta-data? If we don't want to make any assumptions, it becomes the user's responsibility to do it manually.

I don't think there is *any* sane way of numpy propagating the user's metadata. The user must be the one to do it.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From rmay31 at gmail.com Fri Feb 6 16:56:32 2009
From: rmay31 at gmail.com (Ryan May)
Date: Fri, 6 Feb 2009 15:56:32 -0600
Subject: [Numpy-discussion] PEP: named axis
In-Reply-To: <3d375d730902061330r29c19ce7sb44d03e8b79616fa@mail.gmail.com>
References: <498B7181.5000300@enthought.com> <20090205231618.GA21014@phare.normalesup.org> <498BB9F2.5080501@enthought.com> <3d375d730902052029l21f675d1leecb0d405db5dc23@mail.gmail.com> <9457e7c80902060122o365cbc28mf5a0bc22d5bc6f6d@mail.gmail.com> <3d375d730902061330r29c19ce7sb44d03e8b79616fa@mail.gmail.com>
Message-ID:

On Fri, Feb 6, 2009 at 3:30 PM, Robert Kern wrote:
> On Fri, Feb 6, 2009 at 03:22, Stéfan van der Walt wrote:
> > Hi Robert
> >
> > 2009/2/6 Robert Kern :
> >>> This could be implemented but would require adding information to the NumPy array.
> >>
> >> More than that, though. Every function and method that takes an axis or reduces an axis will need to be rewritten. For that reason, I'm -1 on the proposal.
> >
> > Are you -1 on the array dictionary, or on using it to do axis mapping?
>
> I'm -1 on rewriting every axis= argument to accept strings. I'm +1 on a generic metadata dict that does not implicitly propagate.
>
> > I would imagine that Gael would be happier even if he had to do
> >
> > axis = x.meta.axis['Lateral']
> > some_func(x, axis)
>
> That's fine with me.
>
> >> I'm of the opinion that it should never guess.
> >> We have no idea what semantics are being placed on the dict. Even in the case where all of the inputs have the same dict, the operation may easily invalidate the metadata. For example, a reduction on one of these axis-decorated arrays would make the axis labels incorrect.
> >
> > That's a good point. So what would be a sane way of propagating meta-data? If we don't want to make any assumptions, it becomes the user's responsibility to do it manually.
>
> I don't think there is *any* sane way of numpy propagating the user's metadata. The user must be the one to do it.

I'm +1 on all of what Robert said. I've considered writing a subclass/wrapping just so I can make metadata available while passing around recarrays. It'd save me a bunch of work. I don't think there's anything wrong with making the user propagate the dictionary.

Ryan

--
Ryan May
Graduate Research Assistant
School of Meteorology
University of Oklahoma

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From pgmdevlist at gmail.com Fri Feb 6 17:18:39 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Fri, 6 Feb 2009 17:18:39 -0500
Subject: [Numpy-discussion] question about ufuncs
In-Reply-To:
References: <48A25BC8-FA0A-44EE-89E2-F6C9939E8A22@gmail.com>
Message-ID: <457C5E99-AD7D-4592-B589-836D8530A1FE@gmail.com>

On Feb 6, 2009, at 4:25 PM, Darren Dale wrote:
>
> I've been looking at how ma implements things like multiply() and MaskedArray.__mul__. I'm surprised that MaskedArray.__mul__ actually calls ma.multiply() rather than calling super(MaskedArray,self).__mul__().

There's some under-the-hood machinery to deal with the data, and we need to be able to manipulate it *before* the operation takes place. The super() approach calls __array_wrap__ on the result, so *after* the operation took place, and that's not what we wanted...

> Maybe that is the way ndarray does it, but I don't think this is the right approach for my quantity subclasses. If I want to make a MaskedQuantity (someday), MaskedQuantity.__mul__ should be calling super(MaskedQuantity,self).__mul__(), not reimplementations of numpy.multiply or ma.multiply, right?

You'll end up calling ma.multiply anyway (super(MaskedQuantity,self).__mul__ will call MaskedArray.__mul__, which calls ma.multiply...). So yes, I think you can stick to the super() approach in your case.

> There are some cases where the default numpy function expects certain units on the way in, like the trig functions, which I think would have to be reimplemented.

And you can probably define a generic class to deal with that instead of reimplementing the functions individually (and we're back to the initial advice).

> But aside from that, is there anything wrong with taking this approach? It seems to allow quantities to integrate pretty well with the numpy builtins.

Go and try, the problems (if any) will show up...

From michael.s.gilbert at gmail.com Fri Feb 6 17:57:23 2009
From: michael.s.gilbert at gmail.com (Michael S. Gilbert)
Date: Fri, 6 Feb 2009 17:57:23 -0500
Subject: [Numpy-discussion] Purpose for bit-wise and'ing the initial mersenne twister key?
In-Reply-To: <3d375d730902061325h190b1de4o572f2ca63fd34d63@mail.gmail.com>
References: <20090206162418.0d8dd677.michael.s.gilbert@gmail.com> <3d375d730902061325h190b1de4o572f2ca63fd34d63@mail.gmail.com>
Message-ID: <20090206175723.45e986fb.michael.s.gilbert@gmail.com>

Ok, so isn't this a slight waste of memory then (a doubling on 64-bit platforms)?
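(For scale: the twister's key array is 624 words, RK_STATE_LEN in randomkit.h, so the state costs roughly 624 * 4 = 2496 bytes with 32-bit longs versus 624 * 8 = 4992 bytes with 64-bit longs; those are the 2kB and 4kB figures mentioned below.)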
Of course the tradeoff is whether you want to maintain two codebases for 32- and 64-bit or just one. The advantages of a single codebase probably outweigh an increase in memory usage, since we're only talking about the difference between 2kB and 4kB, which is fairly insignificant.

Mike

On Fri, 6 Feb 2009 15:25:35 -0600 Robert Kern wrote:
> On Fri, Feb 6, 2009 at 15:24, Michael S. Gilbert wrote:
>> In numpy/random/mtrand/randomkit.c on line 159, the initial mersenne twister key (populated from /dev/urandom) gets bit-wise and'ed with 0xffffffff. I'm just curious as to why this is done. A bit-wise and with all ones should just give you your original quantity back, right? I don't think there is a problem since the operation doesn't really do anything, and the same thing exists in the mersenne twister reference code, but I am curious as to why it is even there in the first place. Thanks for any thoughts.
>
> On most 64-bit machines, unsigned longs are 64 bits, so 0xffffffffUL is only 32 bits of 1s.
>
> --
> Robert Kern
>
> "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From robert.kern at gmail.com Fri Feb 6 17:56:00 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 6 Feb 2009 16:56:00 -0600
Subject: [Numpy-discussion] Purpose for bit-wise and'ing the initial mersenne twister key?
In-Reply-To: <20090206175723.45e986fb.michael.s.gilbert@gmail.com>
References: <20090206162418.0d8dd677.michael.s.gilbert@gmail.com> <3d375d730902061325h190b1de4o572f2ca63fd34d63@mail.gmail.com> <20090206175723.45e986fb.michael.s.gilbert@gmail.com>
Message-ID: <3d375d730902061456m64fdac20uaa601f02c7ebb3e7@mail.gmail.com>

On Fri, Feb 6, 2009 at 16:57, Michael S. Gilbert wrote:
> Ok, so isn't this a slight waste of memory then (a doubling on 64-bit platforms)? Of course the tradeoff is whether you want to maintain two codebases for 32- and 64-bit or just one. The advantages of a single codebase probably outweigh an increase in memory usage, since we're only talking about the difference between 2kB and 4kB, which is fairly insignificant.

I'm not going to modify the upstream source and risk introducing bugs.

PS: I am on the mailing list. You do not need to Cc: me.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From michael.s.gilbert at gmail.com Fri Feb 6 18:14:31 2009
From: michael.s.gilbert at gmail.com (Michael S. Gilbert)
Date: Fri, 6 Feb 2009 18:14:31 -0500
Subject: [Numpy-discussion] Purpose for bit-wise and'ing the initial mersenne twister key?
In-Reply-To: <3d375d730902061456m64fdac20uaa601f02c7ebb3e7@mail.gmail.com>
References: <20090206162418.0d8dd677.michael.s.gilbert@gmail.com> <3d375d730902061325h190b1de4o572f2ca63fd34d63@mail.gmail.com> <20090206175723.45e986fb.michael.s.gilbert@gmail.com> <3d375d730902061456m64fdac20uaa601f02c7ebb3e7@mail.gmail.com>
Message-ID: <20090206181431.cdf5e7f0.michael.s.gilbert@gmail.com>

> I'm not going to modify the upstream source and risk introducing bugs.

I agree, it's not worth risking it to save 2k of memory.
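For reference, a small sketch of what the mask does when unsigned long is 64 bits wide (the value here is made up):

>>> key = 0xdeadbeefcafef00d        # a 64-bit word, e.g. read from /dev/urandom
>>> '%08x' % (key & 0xffffffff)     # the AND keeps only the low 32 bits
'cafef00d'

When unsigned long is 32 bits the same AND is a no-op, which is why one code path can serve both platforms.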
From dsdale24 at gmail.com Fri Feb 6 18:11:10 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Fri, 6 Feb 2009 18:11:10 -0500
Subject: [Numpy-discussion] question about ufuncs
In-Reply-To: <457C5E99-AD7D-4592-B589-836D8530A1FE@gmail.com>
References: <48A25BC8-FA0A-44EE-89E2-F6C9939E8A22@gmail.com> <457C5E99-AD7D-4592-B589-836D8530A1FE@gmail.com>
Message-ID:

On Fri, Feb 6, 2009 at 5:18 PM, Pierre GM wrote:
>
> On Feb 6, 2009, at 4:25 PM, Darren Dale wrote:
>>
>> I've been looking at how ma implements things like multiply() and MaskedArray.__mul__. I'm surprised that MaskedArray.__mul__ actually calls ma.multiply() rather than calling super(MaskedArray,self).__mul__().
>
> There's some under-the-hood machinery to deal with the data, and we need to be able to manipulate it *before* the operation takes place. The super() approach calls __array_wrap__ on the result, so *after* the operation took place, and that's not what we wanted...

It looks like there are enough cases where manipulation needs to happen on the way in that it might be useful to consider a mechanism for doing so. It could avoid the need for lots of wrappers and decorators down the road.

>> Maybe that is the way ndarray does it, but I don't think this is the right approach for my quantity subclasses. If I want to make a MaskedQuantity (someday), MaskedQuantity.__mul__ should be calling super(MaskedQuantity,self).__mul__(), not reimplementations of numpy.multiply or ma.multiply, right?
>
> You'll end up calling ma.multiply anyway (super(MaskedQuantity,self).__mul__ will call MaskedArray.__mul__, which calls ma.multiply...). So yes, I think you can stick to the super() approach in your case.
>
>> There are some cases where the default numpy function expects certain units on the way in, like the trig functions, which I think would have to be reimplemented.
>
> And you can probably define a generic class to deal with that instead of reimplementing the functions individually (and we're back to the initial advice).
>
>> But aside from that, is there anything wrong with taking this approach? It seems to allow quantities to integrate pretty well with the numpy builtins.
>
> Go and try, the problems (if any) will show up...

Oh boy...

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From michael.s.gilbert at gmail.com Fri Feb 6 18:18:44 2009
From: michael.s.gilbert at gmail.com (Michael S. Gilbert)
Date: Fri, 6 Feb 2009 18:18:44 -0500
Subject: [Numpy-discussion] Purpose for bit-wise and'ing the initial mersenne twister key?
In-Reply-To: <3d375d730902061456m64fdac20uaa601f02c7ebb3e7@mail.gmail.com>
References: <20090206162418.0d8dd677.michael.s.gilbert@gmail.com> <3d375d730902061325h190b1de4o572f2ca63fd34d63@mail.gmail.com> <20090206175723.45e986fb.michael.s.gilbert@gmail.com> <3d375d730902061456m64fdac20uaa601f02c7ebb3e7@mail.gmail.com>
Message-ID: <20090206181844.9b2070ff.michael.s.gilbert@gmail.com>

> I'm not going to modify the upstream source and risk introducing bugs.

BTW, there is a 64-bit version of the reference mersenne twister implementation available [1].
Mike

[1] http://www.math.sci.hiroshima-u.ac.jp/~m-mat/MT/VERSIONS/C-LANG/mt19937-64.c

From dsdale24 at gmail.com Fri Feb 6 18:55:39 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Fri, 6 Feb 2009 18:55:39 -0500
Subject: [Numpy-discussion] question about ufuncs
In-Reply-To:
References: <48A25BC8-FA0A-44EE-89E2-F6C9939E8A22@gmail.com> <457C5E99-AD7D-4592-B589-836D8530A1FE@gmail.com>
Message-ID:

On Fri, Feb 6, 2009 at 6:11 PM, Darren Dale wrote:
> On Fri, Feb 6, 2009 at 5:18 PM, Pierre GM wrote:
>>
>> On Feb 6, 2009, at 4:25 PM, Darren Dale wrote:
>>>
>>> I've been looking at how ma implements things like multiply() and MaskedArray.__mul__. I'm surprised that MaskedArray.__mul__ actually calls ma.multiply() rather than calling super(MaskedArray,self).__mul__().
>>
>> There's some under-the-hood machinery to deal with the data, and we need to be able to manipulate it *before* the operation takes place. The super() approach calls __array_wrap__ on the result, so *after* the operation took place, and that's not what we wanted...
>
> It looks like there are enough cases where manipulation needs to happen on the way in that it might be useful to consider a mechanism for doing so. It could avoid the need for lots of wrappers and decorators down the road.
>
>>> Maybe that is the way ndarray does it, but I don't think this is the right approach for my quantity subclasses. If I want to make a MaskedQuantity (someday), MaskedQuantity.__mul__ should be calling super(MaskedQuantity,self).__mul__(), not reimplementations of numpy.multiply or ma.multiply, right?
>>
>> You'll end up calling ma.multiply anyway (super(MaskedQuantity,self).__mul__ will call MaskedArray.__mul__, which calls ma.multiply...). So yes, I think you can stick to the super() approach in your case.
>>
>>> There are some cases where the default numpy function expects certain units on the way in, like the trig functions, which I think would have to be reimplemented.
>>
>> And you can probably define a generic class to deal with that instead of reimplementing the functions individually (and we're back to the initial advice).
>>
>>> But aside from that, is there anything wrong with taking this approach? It seems to allow quantities to integrate pretty well with the numpy builtins.
>>
>> Go and try, the problems (if any) will show up...
>
> Oh boy...

Well, without an analog to __array_wrap__ on the way in, the out parameter for ufuncs poses a serious downside:

q1=1*m
q2=1*ft
numpy.add(q1,q2,q1)

I can raise an error complaining that you can't add meters and feet, but if I catch it in __array_wrap__ it is too late to prevent overwriting q1 with meaningless data. Also:

q1=[[1,1],[1,1]]*m
q2=[[2,2],[2,2]]*m
numpy.multiply(q1[0],q2[0],q1[0])

This will again raise an error, since quantities will not allow you to attempt to change the units of a view. But numpy.multiply would have already modified the data.

It seems like integrating quantities with existing numpy and scipy routines would be greatly simplified if I could take advantage of __array_wrap__, but I don't think I can use it if I can't raise errors in time to avoid corrupting data in place. I guess for the purposes of demonstrating the package's usefulness, for now I should wrap or reimplement numpy's ufuncs and defer using __array_wrap__ unless it becomes possible for subclasses to manipulate data on the way in to ufuncs.
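For illustration, one possible shape of such a wrapper, with all names hypothetical (it assumes a Quantity class exposing a units attribute and a rescale()-style conversion; re-attaching units to the result is omitted for brevity):

import numpy as np

def add(q1, q2, out=None):
    # convert q2 into q1's units *before* the ufunc runs, so that
    # incompatible units raise an error before any output is written
    q2 = q2.rescale(q1.units)
    return np.add(np.asarray(q1), np.asarray(q2), out)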
Darren

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From alan at ajackson.org Fri Feb 6 22:39:08 2009
From: alan at ajackson.org (Alan Jackson)
Date: Fri, 6 Feb 2009 21:39:08 -0600
Subject: [Numpy-discussion] PEP: named axis
In-Reply-To: <498BB9F2.5080501@enthought.com>
References: <498B7181.5000300@enthought.com> <20090205231618.GA21014@phare.normalesup.org> <498BB9F2.5080501@enthought.com>
Message-ID: <20090206213908.105ddcb9@ajackson.org>

On Thu, 05 Feb 2009 22:17:54 -0600 Travis Oliphant wrote:
> Gael Varoquaux wrote:
>> On Thu, Feb 05, 2009 at 05:08:49PM -0600, Travis E. Oliphant wrote:
>>
>>> I've been fairly quiet on this list for awhile due to work and family schedule, but I think about how things can improve regularly. One feature that's been requested by a few people is the ability to select multiple fields from a structured array.
>
> -Travis

Seems to me that would be getting numpy arrays very close to the R data frame object - which could be a good thing.

--
-----------------------------------------------------------------------
| Alan K. Jackson          | To see a World in a Grain of Sand       |
| alan at ajackson.org        | And a Heaven in a Wild Flower,          |
| www.ajackson.org         | Hold Infinity in the palm of your hand  |
| Houston, Texas           | And Eternity in an hour. - Blake        |
-----------------------------------------------------------------------

From ondrej at certik.cz Sat Feb 7 01:04:00 2009
From: ondrej at certik.cz (Ondrej Certik)
Date: Fri, 6 Feb 2009 22:04:00 -0800
Subject: [Numpy-discussion] preferred numpy build system
Message-ID: <85b5c3130902062204p6f397fa0yb5b90178bb7d9b83@mail.gmail.com>

Hi,

I have a couple of beginner's questions about numscons. What is the preferred build system for numpy now, is it numscons? The README doesn't mention numscons, so I am a bit confused about what the future plan is.

Also by doing:

$ python setupscons.py install
Running from numpy source directory.
Traceback (most recent call last):
  File "setupscons.py", line 56, in
    raise DistutilsError('\n'.join(msg))
distutils.errors.DistutilsError: You cannot build numpy with scons without the numscons package
(Failure was: No module named numscons)

so the numscons package needs to be installed -- but it is not in Debian. So is it supposed to be in Debian? Is numscons supposed to be a build system for other projects as well? Why not just send the needed patches to scons and use scons directly?

Thanks,
Ondrej

From nwagner at iam.uni-stuttgart.de Sat Feb 7 08:03:15 2009
From: nwagner at iam.uni-stuttgart.de (Nils Wagner)
Date: Sat, 07 Feb 2009 14:03:15 +0100
Subject: [Numpy-discussion] ERROR: Test flat on masked_matrices
Message-ID:

======================================================================
ERROR: Test flat on masked_matrices
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/usr/local/lib64/python2.5/site-packages/numpy/ma/tests/test_core.py", line 1127, in test_flat
    test = ma.array(np.matrix([[1, 2, 3]]), mask=[0, 0, 1])
NameError: global name 'ma' is not defined
----------------------------------------------------------------------
Ran 1897 tests in 14.713s

FAILED (KNOWNFAIL=9, errors=1)

From neilcrighton at gmail.com Sat Feb 7 09:27:43 2009
From: neilcrighton at gmail.com (Neil)
Date: Sat, 7 Feb 2009 14:27:43 +0000 (UTC)
Subject: [Numpy-discussion] Selection of only a certain number of fields
References: <498B7181.5000300@enthought.com>
Message-ID:

Travis E.
Oliphant enthought.com> writes: > I've been fairly quiet on this list for awhile due to work and family > schedule, but I think about how things can improve regularly. One > feature that's been requested by a few people is the ability to select > multiple fields from a structured array. > > Thus, suppose *arr* is a structured array with dtype: > > [('name', 'S25'), > ('height', float), > ('age', int), > ('gender', 'S8') > ] > > Then, newarr = arr[['name', 'age']] should be a structured array with > just the name and age fields. > What are some common use cases for this feature? I use structured arrays quite a lot, but I haven't found myself wanting something like this. If I do need a subset of a structured array generally I use something like [rec[n] for n in 'name age gender'.split()] For me that use case doesn't come up very often though. From cournape at gmail.com Sat Feb 7 09:42:52 2009 From: cournape at gmail.com (David Cournapeau) Date: Sat, 7 Feb 2009 23:42:52 +0900 Subject: [Numpy-discussion] preferred numpy build system In-Reply-To: <85b5c3130902062204p6f397fa0yb5b90178bb7d9b83@mail.gmail.com> References: <85b5c3130902062204p6f397fa0yb5b90178bb7d9b83@mail.gmail.com> Message-ID: <5b8d13220902070642h281ce30bn2e6b6fcf1583345f@mail.gmail.com> Hi Ondrej, On Sat, Feb 7, 2009 at 3:04 PM, Ondrej Certik wrote: > Hi, > > I have couple beginners questions about numscons. What is the > preferred build system for numpy now, is it numscons? The README > doesn't mention numscons, so I am a bit confused what the future plan > is. > Sorry for the confusion: numscons is NOT the preferred build system. The current numpy.distutils extensions, as shipped by numpy, is the preferred one. Numscons is more an experiment, if you want. > Also by doing: > > $ python setupscons.py install > Running from numpy source directory. > Traceback (most recent call last): > File "setupscons.py", line 56, in > raise DistutilsError('\n'.join(msg)) > distutils.errors.DistutilsError: You cannot build numpy with scons > without the numscons package > (Failure was: No module named numscons) > > > so the numscons package needs to be installed -- but it is not in > Debian. No, it not. > So is it supposed to be in Debian? No, I don't think it should be. It is not yet stabilized code wise, so it does not make much sense to package it. > Is numscons supposed to be > a build system for other projects as well? Why not to just send the > needed patches to scons and just use scons? Because you cannot just use scons. Numscons is a library build on top of scons, for the needs of numpy. There also needs to be some hook from numpy.distutils to use scons (numscons adds a new distutils command, which is used instead of build to build any compiled code-based extensions). Most of the changes needed for scons have been integrated upstream, though, except one or two things. David From suchindra at gmail.com Sat Feb 7 10:08:34 2009 From: suchindra at gmail.com (Suchindra Sandhu) Date: Sat, 7 Feb 2009 10:08:34 -0500 Subject: [Numpy-discussion] numpy.any oddity In-Reply-To: <3d375d730902061239p791234e7k8d0f9d04e2a66464@mail.gmail.com> References: <3d375d730902061239p791234e7k8d0f9d04e2a66464@mail.gmail.com> Message-ID: Thanks. I tried the latest version and indeed there is no leak. Cheers, Suchindra On Fri, Feb 6, 2009 at 3:39 PM, Robert Kern wrote: > On Fri, Feb 6, 2009 at 13:24, Suchindra Sandhu > wrote: > > Hi, > > > > I accidently stumbled upon this odd behavior by numpy.any. 
The following > > code leaks memory - > > > > for i in xrange(10000000): > > print N.any({'whatever': N.arange(10000000)}) > > > > Ofcourse, I called "any" on a dict object by accident, but it should not > > really leak memory. > > > > I am running numpy version 1.0.4 with python 2.5.2 > > Upgrade to a more recent version of numpy. I do not see a leak. I > vaguely recall there being a problem in the 1.0.x series with > object-dtype scalars, which is what numpy.any() will convert a dict > to. > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." > -- Umberto Eco > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From faltet at pytables.org Sat Feb 7 13:17:57 2009 From: faltet at pytables.org (Francesc Alted) Date: Sat, 7 Feb 2009 19:17:57 +0100 Subject: [Numpy-discussion] Selection of only a certain number of fields In-Reply-To: References: <498B7181.5000300@enthought.com> Message-ID: <200902071917.57537.faltet@pytables.org> A Saturday 07 February 2009, Neil escrigu?: > Travis E. Oliphant enthought.com> writes: > > I've been fairly quiet on this list for awhile due to work and > > family schedule, but I think about how things can improve > > regularly. One feature that's been requested by a few people is > > the ability to select multiple fields from a structured array. > > > > Thus, suppose *arr* is a structured array with dtype: > > > > [('name', 'S25'), > > ('height', float), > > ('age', int), > > ('gender', 'S8') > > ] > > > > Then, newarr = arr[['name', 'age']] should be a structured array > > with just the name and age fields. > > What are some common use cases for this feature? > > I use structured arrays quite a lot, but I haven't found myself > wanting something like this. If I do need a subset of a structured > array generally I use something like > > [rec[n] for n in 'name age gender'.split()] Good point. However, there are still some very valid reasons for having an idiom like: newarr = arr[['name', 'age']] returning a record array. The first one (and most important IMO), is that newarr continues to be an structured array (BTW, when changed this name from the original record array?), and you can use all the features of these beasts with it. Other reason (albeit a bit secondary) is that its data buffer can be shared through the array interface with other applications, or plain C code, in a relatively straightforward way. However, if newarr becomes a list (or dictionary), this is simply not possible. Cheers, -- Francesc Alted From ondrej at certik.cz Sat Feb 7 13:21:55 2009 From: ondrej at certik.cz (Ondrej Certik) Date: Sat, 7 Feb 2009 10:21:55 -0800 Subject: [Numpy-discussion] preferred numpy build system In-Reply-To: <5b8d13220902070642h281ce30bn2e6b6fcf1583345f@mail.gmail.com> References: <85b5c3130902062204p6f397fa0yb5b90178bb7d9b83@mail.gmail.com> <5b8d13220902070642h281ce30bn2e6b6fcf1583345f@mail.gmail.com> Message-ID: <85b5c3130902071021q3f524771ra4b1f175d06f0f68@mail.gmail.com> Hi David, > Sorry for the confusion: numscons is NOT the preferred build system. > The current numpy.distutils extensions, as shipped by numpy, is the > preferred one. Numscons is more an experiment, if you want. 
Ah, I see, thanks for the clarification. >> So is it supposed to be in Debian? > > No, I don't think it should be. It is not yet stabilized code wise, so > it does not make much sense to package it. Ok. > >> Is numscons supposed to be >> a build system for other projects as well? Why not to just send the >> needed patches to scons and just use scons? > > Because you cannot just use scons. Numscons is a library build on top > of scons, for the needs of numpy. There also needs to be some hook > from numpy.distutils to use scons (numscons adds a new distutils > command, which is used instead of build to build any compiled > code-based extensions). Most of the changes needed for scons have been > integrated upstream, though, except one or two things. I see. I think it's a bit confusing that one needs to build a new build system just to build numpy, e.g. that both distutils and scons are not good enough. Ondrej From pgmdevlist at gmail.com Sat Feb 7 13:52:18 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Sat, 7 Feb 2009 13:52:18 -0500 Subject: [Numpy-discussion] ERROR: Test flat on masked_matrices In-Reply-To: References: Message-ID: On Feb 7, 2009, at 8:03 AM, Nils Wagner wrote: > > ====================================================================== > ERROR: Test flat on masked_matrices > ---------------------------------------------------------------------- > Traceback (most recent call last): > File > "/usr/local/lib64/python2.5/site-packages/numpy/ma/tests/ > test_core.py", > line 1127, in test_flat > test = ma.array(np.matrix([[1, 2, 3]]), mask=[0, 0, > 1]) > NameError: global name 'ma' is not defined Oops, sorry about that... From ellisonbg.net at gmail.com Sat Feb 7 13:55:19 2009 From: ellisonbg.net at gmail.com (Brian Granger) Date: Sat, 7 Feb 2009 10:55:19 -0800 Subject: [Numpy-discussion] preferred numpy build system In-Reply-To: <85b5c3130902071021q3f524771ra4b1f175d06f0f68@mail.gmail.com> References: <85b5c3130902062204p6f397fa0yb5b90178bb7d9b83@mail.gmail.com> <5b8d13220902070642h281ce30bn2e6b6fcf1583345f@mail.gmail.com> <85b5c3130902071021q3f524771ra4b1f175d06f0f68@mail.gmail.com> Message-ID: <6ce0ac130902071055k2bed3c2ape8c9b633429e38cf@mail.gmail.com> > I see. I think it's a bit confusing that one needs to build a new > build system just to build numpy, e.g. that both distutils and scons > are not good enough. I would not say that numscons is a *new* build system. Rather, I look at numscons as a glue layer that allows scons to be used within distutils for building extensions and libraries. distutils/numpy/distutils does things that scons doesn't and scons does things that distutils doesn't. But you definitely need the glue layer to use both approaches together. Cheers, Brian > Ondrej > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From oliphant at enthought.com Sat Feb 7 15:45:28 2009 From: oliphant at enthought.com (Travis E. Oliphant) Date: Sat, 07 Feb 2009 14:45:28 -0600 Subject: [Numpy-discussion] Selection of only a certain number of fields In-Reply-To: <200902071917.57537.faltet@pytables.org> References: <498B7181.5000300@enthought.com> <200902071917.57537.faltet@pytables.org> Message-ID: <498DF2E8.8060500@enthought.com> Francesc Alted wrote: > A Saturday 07 February 2009, Neil escrigu?: > >> Travis E. 
Oliphant enthought.com> writes: >> >>> I've been fairly quiet on this list for awhile due to work and >>> family schedule, but I think about how things can improve >>> regularly. One feature that's been requested by a few people is >>> the ability to select multiple fields from a structured array. >>> >>> Thus, suppose *arr* is a structured array with dtype: >>> >>> [('name', 'S25'), >>> ('height', float), >>> ('age', int), >>> ('gender', 'S8') >>> ] >>> >>> Then, newarr = arr[['name', 'age']] should be a structured array >>> with just the name and age fields. >>> >> What are some common use cases for this feature? >> >> I use structured arrays quite a lot, but I haven't found myself >> wanting something like this. If I do need a subset of a structured >> array generally I use something like >> >> [rec[n] for n in 'name age gender'.split()] >> > > Good point. However, there are still some very valid reasons for having > an idiom like: > > newarr = arr[['name', 'age']] > > returning a record array. > > The first one (and most important IMO), is that newarr continues to be > an structured array (BTW, when changed this name from the original > record array?), To avoid confusion with the "record array" subclass which maps attributes to fields, Eric Jones and I have been using this terminology for about a year. -Travis -- Travis Oliphant Enthought, Inc. (512) 536-1057 (office) (512) 536-1059 (fax) http://www.enthought.com oliphant at enthought.com From stefan at sun.ac.za Sun Feb 8 04:54:04 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Sun, 8 Feb 2009 11:54:04 +0200 Subject: [Numpy-discussion] Behaviour of integer powers Message-ID: <9457e7c80902080154h7d65e8cbl9aaf451ef2ff389@mail.gmail.com> Hi all, Ticket #955 (http://scipy.org/scipy/numpy/ticket/955) touches on the following issue: >>> 0.0 ** np.array([-1, 0, 1], dtype=np.int32) array([ Inf, 1., 0.]) >>> 0.0 ** np.array([-1, 0, 1], dtype=np.int32)[0] ------------------------------------------------------------ Traceback (most recent call last): File "", line 1, in ZeroDivisionError: 0.0 cannot be raised to a negative power This is on a 32-bit platform. As I understand this happens because, in the second case, Python sees that "-1" is an int and does the power operation. In other words, when we raise to the power of an array, the NumPy machinery is involved, whereas if we raise to np.int32(-1), it is not. This is due to the int32 type deriving from the Python int. I can't think of an easy way to address the problem, but I was hoping to get some advice from the list. Thanks St?fan From robert.kern at gmail.com Sun Feb 8 05:09:23 2009 From: robert.kern at gmail.com (Robert Kern) Date: Sun, 8 Feb 2009 04:09:23 -0600 Subject: [Numpy-discussion] Behaviour of integer powers In-Reply-To: <9457e7c80902080154h7d65e8cbl9aaf451ef2ff389@mail.gmail.com> References: <9457e7c80902080154h7d65e8cbl9aaf451ef2ff389@mail.gmail.com> Message-ID: <3d375d730902080209i22224febp218d62345d76474b@mail.gmail.com> On Sun, Feb 8, 2009 at 03:54, St?fan van der Walt wrote: > Hi all, > > Ticket #955 (http://scipy.org/scipy/numpy/ticket/955) touches on the > following issue: > >>>> 0.0 ** np.array([-1, 0, 1], dtype=np.int32) > array([ Inf, 1., 0.]) >>>> 0.0 ** np.array([-1, 0, 1], dtype=np.int32)[0] > ------------------------------------------------------------ > Traceback (most recent call last): > File "", line 1, in > ZeroDivisionError: 0.0 cannot be raised to a negative power > > This is on a 32-bit platform. 
> > As I understand this happens because, in the second case, Python sees > that "-1" is an int and does the power operation. In other words, > when we raise to the power of an array, the NumPy machinery is > involved, whereas if we raise to np.int32(-1), it is not. This is due > to the int32 type deriving from the Python int. > > I can't think of an easy way to address the problem, but I was hoping > to get some advice from the list. I don't think there is anything we can do to fix this except not to subclass from int. I think float.__pow__(self, other) checks that isinstance(other, int) and does its own thing. numpy.int_.__rpow__ will never get called, and that's the only place we can implement our logic. We can document the wart and recommend casting the base to a float64 scalar first. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From cournape at gmail.com Sun Feb 8 06:10:08 2009 From: cournape at gmail.com (David Cournapeau) Date: Sun, 8 Feb 2009 20:10:08 +0900 Subject: [Numpy-discussion] preferred numpy build system In-Reply-To: <85b5c3130902071021q3f524771ra4b1f175d06f0f68@mail.gmail.com> References: <85b5c3130902062204p6f397fa0yb5b90178bb7d9b83@mail.gmail.com> <5b8d13220902070642h281ce30bn2e6b6fcf1583345f@mail.gmail.com> <85b5c3130902071021q3f524771ra4b1f175d06f0f68@mail.gmail.com> Message-ID: <5b8d13220902080310g82f0eacyc9dfa176d8581dc5@mail.gmail.com> On Sun, Feb 8, 2009 at 3:21 AM, Ondrej Certik wrote: > Hi David, > >> Sorry for the confusion: numscons is NOT the preferred build system. >> The current numpy.distutils extensions, as shipped by numpy, is the >> preferred one. Numscons is more an experiment, if you want. > > Ah, I see, thanks for the clarification. > >>> So is it supposed to be in Debian? >> >> No, I don't think it should be. It is not yet stabilized code wise, so >> it does not make much sense to package it. > > Ok. > >> >>> Is numscons supposed to be >>> a build system for other projects as well? Why not to just send the >>> needed patches to scons and just use scons? >> >> Because you cannot just use scons. Numscons is a library build on top >> of scons, for the needs of numpy. There also needs to be some hook >> from numpy.distutils to use scons (numscons adds a new distutils >> command, which is used instead of build to build any compiled >> code-based extensions). Most of the changes needed for scons have been >> integrated upstream, though, except one or two things. > > I see. I think it's a bit confusing that one needs to build a new > build system just to build numpy, e.g. that both distutils and scons > are not good enough. I don't find it that surprising - numpy and scipy require some relatively advanced features (mixed language and cross-platform with support for many toolchains). Within the open source tools, I know only two which can handle those requirements: scons and cmake. For example, it would be almost impossible to build numpy/scipy with autoconf.
David From josef.pktd at gmail.com Sun Feb 8 08:02:10 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sun, 8 Feb 2009 08:02:10 -0500 Subject: [Numpy-discussion] Behaviour of integer powers In-Reply-To: <3d375d730902080209i22224febp218d62345d76474b@mail.gmail.com> References: <9457e7c80902080154h7d65e8cbl9aaf451ef2ff389@mail.gmail.com> <3d375d730902080209i22224febp218d62345d76474b@mail.gmail.com> Message-ID: <1cd32cbb0902080502q492e105dl6115e2ae87b94740@mail.gmail.com> On Sun, Feb 8, 2009 at 5:09 AM, Robert Kern wrote: > On Sun, Feb 8, 2009 at 03:54, St?fan van der Walt wrote: >> Hi all, >> >> Ticket #955 (http://scipy.org/scipy/numpy/ticket/955) touches on the >> following issue: >> >>>>> 0.0 ** np.array([-1, 0, 1], dtype=np.int32) >> array([ Inf, 1., 0.]) >>>>> 0.0 ** np.array([-1, 0, 1], dtype=np.int32)[0] >> ------------------------------------------------------------ >> Traceback (most recent call last): >> File "", line 1, in >> ZeroDivisionError: 0.0 cannot be raised to a negative power >> >> This is on a 32-bit platform. >> >> As I understand this happens because, in the second case, Python sees >> that "-1" is an int and does the power operation. In other words, >> when we raise to the power of an array, the NumPy machinery is >> involved, whereas if we raise to np.int32(-1), it is not. This is due >> to the int32 type deriving from the Python int. >> >> I can't think of an easy way to address the problem, but I was hoping >> to get some advice from the list. > > I don't think there is anything we can do to fix this except not to > subclass from int. I think float.__pow__(self, other) checks that > isinstance(other, int) and does its own thing. numpy.int_.__rpow__ > will never get called, and that's the only place we can implement our > logic. > > We can document the wart and recommend casting the base to a float64 > scalar first. > > -- > Robert Kern > I thought when in doubt about the domain, the useful way around this, is to call np.power directly. This is how it is used in stats.distributions. 
>>> 0.0 ** np.array([-1, 0, 1], dtype=np.int32)[0] Traceback (most recent call last): File "", line 1, in ZeroDivisionError: 0.0 cannot be raised to a negative power >>> np.power(0.0, np.array([-1, 0, 1], dtype=np.int32)[0]) 1.#INF >>> 0.01 ** np.array([-1, 0, 1], dtype=np.int32)[0] 100.0 >>> np.power(np.nan, np.array([-1, 0, 1], dtype=np.int32)[0]) -1.#IND >>> np.nan ** np.array([-1, 0, 1], dtype=np.int32)[0] Traceback (most recent call last): File "", line 1, in ValueError: (33, 'Domain error') But I just found that a nan in the exponent of an array is not propagated: >>> 0.0 ** np.array([-np.nan, 0, 1], dtype=np.int32)[0] 1.0 >>> np.power(0.0, np.array([-np.nan, 0, 1], dtype=np.int32)[0]) 1.0 >>> np.power(0.0, -np.nan) 1.#QNAN >>> np.power(0.0, np.nan) -1.#IND >>> 0.0**np.nan 0.0 From josef.pktd at gmail.com Sun Feb 8 08:19:08 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sun, 8 Feb 2009 08:19:08 -0500 Subject: [Numpy-discussion] Behaviour of integer powers In-Reply-To: <1cd32cbb0902080502q492e105dl6115e2ae87b94740@mail.gmail.com> References: <9457e7c80902080154h7d65e8cbl9aaf451ef2ff389@mail.gmail.com> <3d375d730902080209i22224febp218d62345d76474b@mail.gmail.com> <1cd32cbb0902080502q492e105dl6115e2ae87b94740@mail.gmail.com> Message-ID: <1cd32cbb0902080519m283a64cdu4c1504905f85ac3c@mail.gmail.com> On Sun, Feb 8, 2009 at 8:02 AM, wrote: > But I just found that a nan in the exponent of an array is not propagated: > >>>> 0.0 ** np.array([-np.nan, 0, 1], dtype=np.int32)[0] > 1.0 >>>> np.power(0.0, np.array([-np.nan, 0, 1], dtype=np.int32)[0]) > 1.0 correction: np.power propagates nans, it's again the casting of nan to zero int that tripped me up. I shouldn't cut and paste examples too fast. >>> np.power(0.0, np.array([-np.nan, 0, 1])) array([ NaN, 1., 0.]) >>> np.power(0.0, np.array([-np.nan, 0, 1]))[0] 1.#QNAN Josef From neilcrighton at gmail.com Sun Feb 8 11:09:02 2009 From: neilcrighton at gmail.com (Neil) Date: Sun, 8 Feb 2009 16:09:02 +0000 (UTC) Subject: [Numpy-discussion] Selection of only a certain number of fields References: <498B7181.5000300@enthought.com> <200902071917.57537.faltet@pytables.org> Message-ID: Francesc Alted pytables.org> writes: > > What are some common use cases for this feature? > > > > I use structured arrays quite a lot, but I haven't found myself > > wanting something like this. If I do need a subset of a structured > > array generally I use something like > > > > [rec[n] for n in 'name age gender'.split()] > > Good point. However, there are still some very valid reasons for having > an idiom like: > > newarr = arr[['name', 'age']] > > returning a record array. > > The first one (and most important IMO), is that newarr continues to be > an structured array (BTW, when changed this name from the original > record array?), and you can use all the features of these beasts with > it. Other reason (albeit a bit secondary) is that its data buffer can > be shared through the array interface with other applications, or plain > C code, in a relatively straightforward way. However, if newarr > becomes a list (or dictionary), this is simply not possible. > > Cheers, > That's not a sample use case ;) One of the things I love about Python is that it has a small core set of features and tries to avoid having many ways to do the same thing. This makes it extremely easy to learn. With every new feature, numpy gets a little bit harder to learn, there's more to document and the code base gets larger and so harder to maintain.
In those senses, whenever you add a new function/feature to numpy, it gets a little bit worse. So I think it would be nice to have some concrete examples of what the new feature will be useful for, just to show how it outweighs those negatives. As a bonus, they'd provide nice examples to put in the documentation :). Neil PS. Thanks for your work on pytables! I've used it quite a bit, mostly for reading hdf5 files. From dsdale24 at gmail.com Sun Feb 8 11:49:12 2009 From: dsdale24 at gmail.com (Darren Dale) Date: Sun, 8 Feb 2009 11:49:12 -0500 Subject: [Numpy-discussion] segfaults when passing ndarray subclass to ufunc with out=None Message-ID: I am seeing some really strange behavior when I try to pass an ndarray subclass and out=None to numpy's ufuncs. This example will reproduce the problem with svn numpy, the first print statement yields 1 as expected, the second yields "" and the third yields a segmentation fault:

import numpy as np

class MyArray(np.ndarray):
    __array_priority__ = 20
    def __new__(cls):
        return np.asarray(1).view(cls).copy()
    def __repr__(self):
        return 'my_array'
    __str__ = __repr__
    def __mul__(self, other):
        return super(MyArray, self).__mul__(other)
    def __rmul__(self, other):
        return super(MyArray, self).__rmul__(other)

mine = MyArray()
print np.multiply(1, 1, None)
x = np.multiply(mine, mine, None)
print type(x)
print x

Darren -------------- next part -------------- An HTML attachment was scrubbed... URL: From ondrej at certik.cz Sun Feb 8 14:39:48 2009 From: ondrej at certik.cz (Ondrej Certik) Date: Sun, 8 Feb 2009 11:39:48 -0800 Subject: [Numpy-discussion] preferred numpy build system In-Reply-To: <5b8d13220902080310g82f0eacyc9dfa176d8581dc5@mail.gmail.com> References: <85b5c3130902062204p6f397fa0yb5b90178bb7d9b83@mail.gmail.com> <5b8d13220902070642h281ce30bn2e6b6fcf1583345f@mail.gmail.com> <85b5c3130902071021q3f524771ra4b1f175d06f0f68@mail.gmail.com> <5b8d13220902080310g82f0eacyc9dfa176d8581dc5@mail.gmail.com> Message-ID: <85b5c3130902081139n45e6b72dl770862baa87065c4@mail.gmail.com> On Sun, Feb 8, 2009 at 3:10 AM, David Cournapeau wrote: > On Sun, Feb 8, 2009 at 3:21 AM, Ondrej Certik wrote: >> Hi David, >> >>> Sorry for the confusion: numscons is NOT the preferred build system. >>> The current numpy.distutils extensions, as shipped by numpy, is the >>> preferred one. Numscons is more an experiment, if you want. >> >> Ah, I see, thanks for the clarification. >> >>>> So is it supposed to be in Debian? >>> >>> No, I don't think it should be. It is not yet stabilized code wise, so >>> it does not make much sense to package it. >> >> Ok. >> >>> >>>> Is numscons supposed to be >>>> a build system for other projects as well? Why not to just send the >>>> needed patches to scons and just use scons? >>> >>> Because you cannot just use scons. Numscons is a library build on top >>> of scons, for the needs of numpy. There also needs to be some hook >>> from numpy.distutils to use scons (numscons adds a new distutils >>> command, which is used instead of build to build any compiled >>> code-based extensions). Most of the changes needed for scons have been >>> integrated upstream, though, except one or two things. >> >> I see. I think it's a bit confusing that one needs to build a new >> build system just to build numpy, e.g. that both distutils and scons >> are not good enough. > > I don't find it that surprising - numpy and scipy require some > relatively advanced features (mixed language and cross-platform with > support for many toolchains).
Within the open source tools, I know > only two which can handle those requirements: scons and cmake. For > example, it would be almost impossible to build numpy/scipy with > autoconf. Yes, I am investigating cmake, it's pretty cool. I wrote some macros for cython etc. What I like about cmake is that it is cross platform and it just produces makefiles on linux, or visual studio files (or whatever) on windows. When I get more experience with it, I'll post here. What I don't like about cmake is that it uses its own language, instead of python; on the other hand, so far everything seems to just work. Contrary to numscons, where it looks almost like a new python program just to build numpy. Ondrej From ellisonbg.net at gmail.com Sun Feb 8 15:17:54 2009 From: ellisonbg.net at gmail.com (Brian Granger) Date: Sun, 8 Feb 2009 12:17:54 -0800 Subject: [Numpy-discussion] preferred numpy build system In-Reply-To: <5b8d13220902080310g82f0eacyc9dfa176d8581dc5@mail.gmail.com> References: <85b5c3130902062204p6f397fa0yb5b90178bb7d9b83@mail.gmail.com> <5b8d13220902070642h281ce30bn2e6b6fcf1583345f@mail.gmail.com> <85b5c3130902071021q3f524771ra4b1f175d06f0f68@mail.gmail.com> <5b8d13220902080310g82f0eacyc9dfa176d8581dc5@mail.gmail.com> Message-ID: <6ce0ac130902081217w2f5a51eag33c704373010a0a9@mail.gmail.com> > I don't find it that surprising - numpy and scipy require some > relatively advanced features (mixed language and cross-platform with > support for many toolchains). Within the open source tools, I know > only two which can handle those requirements: scons and cmake. For > example, it would be almost impossible to build numpy/scipy with > autoconf. I know some autotools/autoconf hackers that could probably get it to build numpy/scipy, ..., oh what, you want Windows support? ;-) Cheers, Brian > David > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From ellisonbg.net at gmail.com Sun Feb 8 15:26:07 2009 From: ellisonbg.net at gmail.com (Brian Granger) Date: Sun, 8 Feb 2009 12:26:07 -0800 Subject: [Numpy-discussion] preferred numpy build system In-Reply-To: <85b5c3130902081139n45e6b72dl770862baa87065c4@mail.gmail.com> References: <85b5c3130902062204p6f397fa0yb5b90178bb7d9b83@mail.gmail.com> <5b8d13220902070642h281ce30bn2e6b6fcf1583345f@mail.gmail.com> <85b5c3130902071021q3f524771ra4b1f175d06f0f68@mail.gmail.com> <5b8d13220902080310g82f0eacyc9dfa176d8581dc5@mail.gmail.com> <85b5c3130902081139n45e6b72dl770862baa87065c4@mail.gmail.com> Message-ID: <6ce0ac130902081226q256722f3s67caeccaccd1309c@mail.gmail.com> > Yes, I am investigating cmake, it's pretty cool. I wrote some macros > for cython etc. What I like about cmake is that it is cross platform > and it just produces makefiles on linux, or visual studio files (or > whatever) on windows. When I get more experience with it, I'll post > here. Yes, while I haven't played with cmake much, it does look very nice. > What I don't like about cmake is that it uses its own language, instead > of python; on the other hand, so far everything seems to just work. I too don't like the idea of cmake having its own language. While I love to learn new languages, I like to have good reasons and I'm not sure that building software is a good reason. I have been playing with Scons and I really do like the fact that it is Python. While I haven't tried it yet, I am pretty sure that we could use IPython to interactively debug a Scons build.
That would be really nice! > Contrary to numscons, where it looks almost like a new python program > just to build numpy. But hold on, it is not fair to compare cmake with numscons (this is an apples and oranges thing). You should compare cmake with *Scons* itself. The problem with comparing cmake with numscons is that cmake can't do what numscons does - i.e., plug into distutils. There are a lot of extra things that distutils does other than build extensions and cmake won't do these things out of the box. Obviously, someone could get cmake to do these things, but then you are developing a complete distutils replacement. And I think that any distutils replacement should be done in Python. Cheers, Brian > Ondrej > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From ondrej at certik.cz Sun Feb 8 18:02:36 2009 From: ondrej at certik.cz (Ondrej Certik) Date: Sun, 8 Feb 2009 15:02:36 -0800 Subject: [Numpy-discussion] preferred numpy build system In-Reply-To: <6ce0ac130902081226q256722f3s67caeccaccd1309c@mail.gmail.com> References: <85b5c3130902062204p6f397fa0yb5b90178bb7d9b83@mail.gmail.com> <5b8d13220902070642h281ce30bn2e6b6fcf1583345f@mail.gmail.com> <85b5c3130902071021q3f524771ra4b1f175d06f0f68@mail.gmail.com> <5b8d13220902080310g82f0eacyc9dfa176d8581dc5@mail.gmail.com> <85b5c3130902081139n45e6b72dl770862baa87065c4@mail.gmail.com> <6ce0ac130902081226q256722f3s67caeccaccd1309c@mail.gmail.com> Message-ID: <85b5c3130902081502g76156d15v7a26adfcc0335f01@mail.gmail.com> On Sun, Feb 8, 2009 at 12:26 PM, Brian Granger wrote: >> Yes, I am investigating cmake, it's pretty cool. I wrote some macros >> for cython etc. What I like about cmake is that it is cross platform >> and it just produces makefiles on linux, or visual studio files (or >> whatever) on windows. When I get more experience with it, I'll post >> here. > > Yes, while I haven't played with cmake much, it does look very nice. > >> What I don't like about cmake is that it uses its own language, instead >> of python; on the other hand, so far everything seems to just work. > > I too don't like the idea of cmake having its own language. While I > love to learn new languages, I like to have good reasons and I'm not > sure that building software is a good reason. > > I have been playing with Scons and I really do like the fact that it > is Python. While I haven't tried it yet, I am pretty sure that we > could use IPython to interactively debug a Scons build. That would be > really nice! > >> Contrary to numscons, where it looks almost like a new python program >> just to build numpy. > > But hold on, it is not fair to compare cmake with numscons (this is an > apples and oranges thing). You should compare cmake with *Scons* > itself. The problem with comparing cmake with numscons is that cmake > can't do what numscons does - i.e., plug into distutils. There are a > lot of extra things that distutils does other than build extensions > and cmake won't do these things out of the box. Obviously, someone > could get cmake to do these things, but then you are developing a > complete distutils replacement. And I think that any distutils > replacement should be done in Python. Yes, I agree with what you said. I still want to try cmake though, if nothing else, at least to know where we want to go. Yes, I should compare cmake with scons. We can either develop a python front end to cmake, or just help improve scons/numscons etc.
I really like that cmake generates native makefiles and windows visual studio files etc. In any case, whatever we choose to use, I want to have experience with both scons and cmake. Ondrej From cournape at gmail.com Sun Feb 8 19:58:55 2009 From: cournape at gmail.com (David Cournapeau) Date: Mon, 9 Feb 2009 09:58:55 +0900 Subject: [Numpy-discussion] preferred numpy build system In-Reply-To: <85b5c3130902081139n45e6b72dl770862baa87065c4@mail.gmail.com> References: <85b5c3130902062204p6f397fa0yb5b90178bb7d9b83@mail.gmail.com> <5b8d13220902070642h281ce30bn2e6b6fcf1583345f@mail.gmail.com> <85b5c3130902071021q3f524771ra4b1f175d06f0f68@mail.gmail.com> <5b8d13220902080310g82f0eacyc9dfa176d8581dc5@mail.gmail.com> <85b5c3130902081139n45e6b72dl770862baa87065c4@mail.gmail.com> Message-ID: <5b8d13220902081658l576bc36dodfedd75a5389b23@mail.gmail.com> On Mon, Feb 9, 2009 at 4:39 AM, Ondrej Certik wrote: > > Yes, I am investigating cmake, it's pretty cool. I wrote some macros > for cython etc. What I like about cmake is that it is cross platform > and it just produces makefiles on linux, or visual studio files (or > whatever) on windows. When I get more experience with it, I'll post > here. That's exactly what I don't like about cmake - it means you can't produce accurate builds (you need to rerun cmake every time you change the configuration or dependencies, whereas this is automatic with scons/waf). It also has (used to have) very poor documentation unless you buy the book - but it looks like this is changing. > What I don't like about cmake is that it uses its own language, instead > of python; on the other hand, so far everything seems to just work. > Contrary to numscons, where it looks almost like a new python program > just to build numpy. Again, numscons is just a library on top of scons to support things we need in numpy, it is not really a new program - it is a separate package to avoid adding experimental code to numpy itself. Numscons is ~ 3000 LOC, of which 1000 is for the core, 1000 for blas/lapack/fortran support, and 1000 for tools which are not properly supported in scons (recent MSVC, python extensions). I think you would have almost as much work with cmake if not more - when I started numscons, cmake did not have fortran support (it now has, although I don't know how far - it does not seem to handle mixed fortran/C, for example). If you don't mind fast-changing software, you should look at waf: it is extremely fast, based on python. It also handles a lot of distribution issues already (tarball generation, compiler selection, etc...) which scons does not. David From cournape at gmail.com Sun Feb 8 20:03:34 2009 From: cournape at gmail.com (David Cournapeau) Date: Mon, 9 Feb 2009 10:03:34 +0900 Subject: [Numpy-discussion] preferred numpy build system In-Reply-To: <6ce0ac130902081217w2f5a51eag33c704373010a0a9@mail.gmail.com> References: <85b5c3130902062204p6f397fa0yb5b90178bb7d9b83@mail.gmail.com> <5b8d13220902070642h281ce30bn2e6b6fcf1583345f@mail.gmail.com> <85b5c3130902071021q3f524771ra4b1f175d06f0f68@mail.gmail.com> <5b8d13220902080310g82f0eacyc9dfa176d8581dc5@mail.gmail.com> <6ce0ac130902081217w2f5a51eag33c704373010a0a9@mail.gmail.com> Message-ID: <5b8d13220902081703p10c15036xb8b6c02089e77145@mail.gmail.com> On Mon, Feb 9, 2009 at 5:17 AM, Brian Granger wrote: >> I don't find it that surprising - numpy and scipy require some >> relatively advanced features (mixed language and cross-platform with >> support for many toolchains).
Within the open source tools, I know >> only two which can handle those requirements: scons and cmake. For >> example, it would be almost impossible to build numpy/scipy with >> autoconf. > I know some autotools/autoconf hackers that could probably get it to > build numpy/scipy, ..., oh what, you want Windows support? ;-) Yes, autotools support for MS compilers is very poor - I think it is fair to say it is unusable as of today. And one of the goals of numscons is to make the process easier to use/hack (it is a mixed success on this front - the build knowledge is much easier to read in numscons than in distutils IMHO, but it is still too arcane on some fronts, partly because distutils is way too flexible in areas it should not be). David From ondrej at certik.cz Sun Feb 8 22:33:34 2009 From: ondrej at certik.cz (Ondrej Certik) Date: Sun, 8 Feb 2009 19:33:34 -0800 Subject: [Numpy-discussion] preferred numpy build system In-Reply-To: <5b8d13220902081658l576bc36dodfedd75a5389b23@mail.gmail.com> References: <85b5c3130902062204p6f397fa0yb5b90178bb7d9b83@mail.gmail.com> <5b8d13220902070642h281ce30bn2e6b6fcf1583345f@mail.gmail.com> <85b5c3130902071021q3f524771ra4b1f175d06f0f68@mail.gmail.com> <5b8d13220902080310g82f0eacyc9dfa176d8581dc5@mail.gmail.com> <85b5c3130902081139n45e6b72dl770862baa87065c4@mail.gmail.com> <5b8d13220902081658l576bc36dodfedd75a5389b23@mail.gmail.com> Message-ID: <85b5c3130902081933k13eda392yf892f632436b8eda@mail.gmail.com> > That's exactly what I don't like about cmake - it means you can't > produce accurate builds (you need to rerun cmake every time you change > the configuration or dependencies, whereas this is automatic with > scons/waf). It also has (used to have) very poor documentation unless > you buy the book - but it looks like this is changing. You can always rerun cmake if you want (and make it automatic). Imho that's not a problem. But maybe it is done in a better way with scons. > >> What I don't like about cmake is that it uses its own language, instead >> of python; on the other hand, so far everything seems to just work. >> Contrary to numscons, where it looks almost like a new python program >> just to build numpy. > > Again, numscons is just a library on top of scons to support things we > need in numpy, it is not really a new program - it is a separate > package to avoid adding experimental code to numpy itself. Numscons is > ~ 3000 LOC, of which 1000 is for the core, 1000 for > blas/lapack/fortran support, and 1000 for tools which are not properly > supported in scons (recent MSVC, python extensions). > > I think you would have almost as much work with cmake if not more - > when I started numscons, cmake did not have fortran support (it now > has, although I don't know how far - it does not seem to handle mixed > fortran/C, for example). > > If you don't mind fast-changing software, you should look at waf: it > is extremely fast, based on python. It also handles a lot of > distribution issues already (tarball generation, compiler selection, > etc...) which scons does not. Yes, waf is pretty cool, even though the last time I looked at it, it wasn't able to compile my project, which was larger than just a couple of files. But it seems its development is progressing fast. As to the other things, one nice thing about cmake is that it is production-ready right now, it is well tested (kde4) and it is in distributions (e.g. Debian). Everything else needs nontrivial adjustments, or is not as well tested (yet).
Ondrej From david at ar.media.kyoto-u.ac.jp Sun Feb 8 22:56:41 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Mon, 09 Feb 2009 12:56:41 +0900 Subject: [Numpy-discussion] preferred numpy build system In-Reply-To: <85b5c3130902081933k13eda392yf892f632436b8eda@mail.gmail.com> References: <85b5c3130902062204p6f397fa0yb5b90178bb7d9b83@mail.gmail.com> <5b8d13220902070642h281ce30bn2e6b6fcf1583345f@mail.gmail.com> <85b5c3130902071021q3f524771ra4b1f175d06f0f68@mail.gmail.com> <5b8d13220902080310g82f0eacyc9dfa176d8581dc5@mail.gmail.com> <85b5c3130902081139n45e6b72dl770862baa87065c4@mail.gmail.com> <5b8d13220902081658l576bc36dodfedd75a5389b23@mail.gmail.com> <85b5c3130902081933k13eda392yf892f632436b8eda@mail.gmail.com> Message-ID: <498FA979.4060201@ar.media.kyoto-u.ac.jp> Ondrej Certik wrote: >> That's exactly what I don't like about cmake - it means you can't >> produce accurate builds (you need to rerun cmake every time you change >> the configuration or dependencies, whereas this is automatic with >> scons/waf). It also has (used to have) very poor documentation unless >> you buy the book - but it looks like this is changing. >> > > You can always rerun cmake if you want (and make it automatic). Imho > that's not a problem. But maybe it is done in a better way with scons. > I think it is a problem - it means you have to update it explicitly when the configuration changes. In scons, the signature concept is quite powerful: not only are file dependencies handled, but also command lines, etc... For example, in numscons, if you change the blas/lapack from atlas to say MKL, only linking and possibly configuration changes are rebuilt. If you change the fortran compiler but not the C compiler, only fortran code is rebuilt. All of this is 100 % automatic. When testing software on many toolchains/platforms, this is very valuable - it is one of the main reasons why Intel, Vmware use it, at least from the information I understood from the Mailing list. In terms of accuracy, scons is way beyond anything cmake has to offer. In the case of numpy, I think cmake would be difficult to integrate into distutils. > Yes, waf is pretty cool, even though the last time I looked at it, it > wasn't able to compile my project, which was larger than just a couple > of files. > I doubt this has anything to do with the project size. Waf seems more scalable than scons, and scons can handle big projects (almost all vmware projects are built using scons, for example), albeit slowly. > As to the other things, one nice thing about cmake is that it is > production-ready right now, it is well tested (kde4) and it is in > distributions (e.g. Debian). For portability reasons, I think the build system should always be distributed with the project you are building (like autoconf does, actually); this is another drawback of cmake. Waf's main author is against any waf packaging - waf can be distributed as one < 100 kb python file, which is very nice. I don't want to sound overly critical of cmake: obviously, since it is used by KDE4, it is definitely good software. It supports some features which I wish scons had (rpath and install-relinking, distribution related features). David From beryl.cxy at gmail.com Mon Feb 9 02:32:53 2009 From: beryl.cxy at gmail.com (Xiaoyu Chu) Date: Mon, 9 Feb 2009 02:32:53 -0500 Subject: [Numpy-discussion] how to get the corresponding eigenvector for a specific eigen value?
Message-ID: <42099e890902082332r734667d2udf43e6f93582c15a@mail.gmail.com> Hey all, I am currently working on a large matrix, and I already have a specific eigen value that I want to use in order to find out its corresponding eigen vector. Is there an easy way to do so? I have tried with linalg.solve(a, b), where I put a as the Matrix A - eigen value* unit matrix, and b as the zero matrix. But the solution returned is a zero matrix, which I really find disappointing. I have also tried with eig(A), which finds out all the eigen vectors of matrix A, but it takes too long to run, especially since the order of my matrix is like 10,000. So right now, I really find myself stuck. Is there anyone who can help me? My great gratitude. Best, Beryl From hoytak at cs.ubc.ca Mon Feb 9 02:40:25 2009 From: hoytak at cs.ubc.ca (Hoyt Koepke) Date: Sun, 8 Feb 2009 23:40:25 -0800 Subject: [Numpy-discussion] how to get the corresponding eigenvector for a specific eigen value? In-Reply-To: <42099e890902082332r734667d2udf43e6f93582c15a@mail.gmail.com> References: <42099e890902082332r734667d2udf43e6f93582c15a@mail.gmail.com> Message-ID: <4db580fd0902082340g651c33dn26b78144bc7811d3@mail.gmail.com> > I have tried with linalg.solve(a, b), where I put a as the > Matrix A - eigen value* unit matrix, and b as the zero matrix. But the > solution returned is a zero matrix, which I really find disappointing. So if you're trying to solve (A - \lambda I) x = b, try appending an extra row to your (A - \lambda I) matrix with all ones; the output of this will be the sum of the elements of your eigenvector. Append a 1 to the end of your b. This will exclude the case where x = 0 as a valid solution, as you'll then require that \sum_i x_i = 1. You'll probably want to renormalize properly afterwards. Quick and dirty, but should work. --Hoyt -- ++++++++++++++++++++++++++++++++++++++++++++++++ + Hoyt Koepke + University of Washington Department of Statistics + http://www.stat.washington.edu/~hoytak/ + hoytak at gmail.com ++++++++++++++++++++++++++++++++++++++++++ From stefan at sun.ac.za Mon Feb 9 02:48:59 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Mon, 9 Feb 2009 09:48:59 +0200 Subject: [Numpy-discussion] NumPy buildbot Message-ID: <9457e7c80902082348j134fba05sa0f1b677f5a35b76@mail.gmail.com> Hi all, Due to changes in our firewall configuration, the buildbot had been offline since 14 January. Everything is sorted out now, so you can access the buildbot again at http://buildbot.scipy.org Some of the build slaves are still offline, and I have contacted their maintainers. Regards Stéfan From faltet at pytables.org Mon Feb 9 03:34:00 2009 From: faltet at pytables.org (Francesc Alted) Date: Mon, 9 Feb 2009 09:34:00 +0100 Subject: [Numpy-discussion] Selection of only a certain number of fields In-Reply-To: References: <498B7181.5000300@enthought.com> <200902071917.57537.faltet@pytables.org> Message-ID: <200902090934.00885.faltet@pytables.org> A Sunday 08 February 2009, Neil escrigué: > > The first one (and most important IMO), is that newarr continues to > > be an structured array (BTW, when changed this name from the > > original record array?), and you can use all the features of these > > beasts with it. Other reason (albeit a bit secondary) is that its > > data buffer can be shared through the array interface with other > > applications, or plain C code, in a relatively straightforward way. > > However, if newarr becomes a list (or dictionary), this is simply > > not possible.
> > > > Cheers, > > That's not a sample use case ;) > > One of the things I love about Python is that it has a small core set > of features and tries to avoid having many ways to do the same thing. > This makes it extremely easy to learn. With every new feature, > numpy gets a little bit harder to learn, there's more to document and > the code base gets larger and so harder to maintain. In those > senses, whenever you add a new function/feature to numpy, it gets a > little bit worse. Mmm, you have made another good point. Actually, it is not very clear to me that adding too much functionality to NumPy is going to be a good idea for every case. For example, lately I was thinking that it would be a good idea to support column-wise structured arrays (the current ones are row-wise), but provided that they can be trivially reproduced with a combination of dictionaries and plain arrays I think now that implementing that in NumPy does not make much sense. Similarly, and as you said, having: l = [rec[n] for n in ['name', 'age']] or, if a dictionary is wanted instead: d = dict((n,rec[n]) for n in ['name', 'age']) would admittedly cover many of the needs of users. In addition, one can get a record array easily from the above dictionary: newrec = np.rec.fromarrays(d.values(), names=d.keys()) Having said that, I still see some value in implementing arr[['name', 'age']], but frankly, I'm not so sure now whether this idiom would be much better than: d = dict((n,rec[n]) for n in ['name', 'age']) newrec = np.rec.fromarrays(d.values(), names=d.keys()) or than the already implemented drop_fields() function in np.lib.recfunctions. So, I'm +0 on the proposal now. > So I think it would be nice to have some concrete examples of what > the new feature will be useful for, just to show how it outweighs > those negatives. As a bonus, they'd provide nice examples to put in > the documentation :). Yeah, I completely agree that this would be a nice exercise to do: for every newly asked feature, first look if it can be done easily with a combination of the current weaponry of Python and NumPy together. That would lead to a simple and powerful NumPy. > PS. Thanks for your work on pytables! I've used it quite a bit, > mostly for reading hdf5 files. My pleasure. -- Francesc Alted From nwagner at iam.uni-stuttgart.de Mon Feb 9 03:34:11 2009 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Mon, 09 Feb 2009 09:34:11 +0100 Subject: [Numpy-discussion] Comparison of arrays Message-ID: Hi all, I have two integer arrays of different shape, e.g. >>> a array([ 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]) >>> b array([ 3, 4, 5, 6, 7, 8, 9, 10]) How can I extract the values that belong to the array a exclusively i.e. array([1,2]) ? Nils From faltet at pytables.org Mon Feb 9 03:45:02 2009 From: faltet at pytables.org (Francesc Alted) Date: Mon, 9 Feb 2009 09:45:02 +0100 Subject: [Numpy-discussion] Comparison of arrays In-Reply-To: References: Message-ID: <200902090945.02962.faltet@pytables.org> A Monday 09 February 2009, Nils Wagner escrigué: > Hi all, > > I have two integer arrays of different shape, e.g. > > >>> a > > array([ 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]) > > >>> b > > array([ 3, 4, 5, 6, 7, 8, 9, 10]) > > How can I extract the values that belong to the array a > exclusively i.e. array([1,2]) ? One possible, fast solution is using Python sets: In [45]: np.array(list(set(a) ^ set(b))) Out[45]: array([1, 2]) Although this is suboptimal for very large arrays as it needs temporary space.
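A further caveat, as a quick sketch: the iteration order of a Python set is arbitrary, so sorting before building the array makes the result deterministic.

>>> np.array(sorted(set(a) ^ set(b)))
array([1, 2])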
Cheers, -- Francesc Alted From nwagner at iam.uni-stuttgart.de Mon Feb 9 04:00:05 2009 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Mon, 09 Feb 2009 10:00:05 +0100 Subject: [Numpy-discussion] Comparison of arrays In-Reply-To: <200902090945.02962.faltet@pytables.org> References: <200902090945.02962.faltet@pytables.org> Message-ID: On Mon, 9 Feb 2009 09:45:02 +0100 Francesc Alted wrote: > A Monday 09 February 2009, Nils Wagner escrigu?: >> Hi all, >> >> I have two integer arrays of different shape, e.g. >> >> >>> a >> >> array([ 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]) >> >> >>> b >> >> array([ 3, 4, 5, 6, 7, 8, 9, 10]) >> >> How can I extract the values that belong to the array a >> exclusively i.e. array([1,2]) ? > > One possible, fast solution is using Python sets: > > In [45]: np.array(list(set(a) ^ set(b))) > Out[45]: array([1, 2]) > > Although this is suboptimal for very large arrays as it >needs temporary > space. > > Cheers, > > -- >Francesc Alted Thank you very much for your prompt response. Nils From meine at informatik.uni-hamburg.de Mon Feb 9 07:23:39 2009 From: meine at informatik.uni-hamburg.de (Hans Meine) Date: Mon, 9 Feb 2009 13:23:39 +0100 Subject: [Numpy-discussion] linalg.norm missing an 'axis' kwarg?! In-Reply-To: <200811201154.53007.meine@informatik.uni-hamburg.de> References: <200811201111.15930.meine@informatik.uni-hamburg.de> <200811201154.53007.meine@informatik.uni-hamburg.de> Message-ID: <200902091323.40049.meine@informatik.uni-hamburg.de> On Thursday 20 November 2008 11:54:52 Hans Meine wrote: > On Thursday 20 November 2008 11:11:14 Hans Meine wrote: > > I have a 2D matrix comprising a sequence of vectors, and I want to > > compute the norm of each vector. np.linalg.norm seems to be the best > > bet, but it does not support axis. Wouldn't this be a nice feature? > > Here's a basic implementation. docstring + tests not updated yet, also I > wonder whether axis should be the first argument, but that could create > compatibility problems. AFAICS, I never received an answer, but IMHO this should be integrated into NumPy. Any objections? Greetings, Hans From stefan at sun.ac.za Mon Feb 9 08:08:05 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Mon, 9 Feb 2009 15:08:05 +0200 Subject: [Numpy-discussion] linalg.norm missing an 'axis' kwarg?! In-Reply-To: <200902091323.40049.meine@informatik.uni-hamburg.de> References: <200811201111.15930.meine@informatik.uni-hamburg.de> <200811201154.53007.meine@informatik.uni-hamburg.de> <200902091323.40049.meine@informatik.uni-hamburg.de> Message-ID: <9457e7c80902090508r5cf6e29fqb0c797eb21524b40@mail.gmail.com> 2009/2/9 Hans Meine : >> Here's a basic implementation. docstring + tests not updated yet, also I >> wonder whether axis should be the first argument, but that could create >> compatibility problems. > > AFAICS, I never received an answer, but IMHO this should be integrated into > NumPy. Any objections? I often work with vectors inside an array, so I would find it useful (even if it is just a two-line wrapper). I'd also add an "out" argument, so that the signature is similar to that of max, min, etc. norm(a, ord=None, axis=None, out=None) Cheers St?fan From neilcrighton at gmail.com Mon Feb 9 09:02:12 2009 From: neilcrighton at gmail.com (Neil) Date: Mon, 9 Feb 2009 14:02:12 +0000 (UTC) Subject: [Numpy-discussion] Comparison of arrays References: <200902090945.02962.faltet@pytables.org> Message-ID: > > I have two integer arrays of different shape, e.g. 
> > > > >>> a > > > > array([ 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]) > > > > >>> b > > > > array([ 3, 4, 5, 6, 7, 8, 9, 10]) > > > > How can I extract the values that belong to the array a > > exclusively i.e. array([1,2]) ? > You could also use numpy.setmember1d to get a boolean mask that selects the values: In [21]: a = np.array([ 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]) In [22]: b = np.array([ 3, 4, 5, 6, 7, 8, 9, 10]) In [23]: ismember = np.setmember1d(a,b) In [24]: a[~ismember] Out[24]: array([1, 2]) Neil From faltet at pytables.org Mon Feb 9 09:19:15 2009 From: faltet at pytables.org (Francesc Alted) Date: Mon, 9 Feb 2009 15:19:15 +0100 Subject: [Numpy-discussion] Comparison of arrays In-Reply-To: References: <200902090945.02962.faltet@pytables.org> Message-ID: <200902091519.16140.faltet@pytables.org> A Monday 09 February 2009, Neil escrigu?: > > > I have two integer arrays of different shape, e.g. > > > > > > >>> a > > > > > > array([ 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]) > > > > > > >>> b > > > > > > array([ 3, 4, 5, 6, 7, 8, 9, 10]) > > > > > > How can I extract the values that belong to the array a > > > exclusively i.e. array([1,2]) ? > > You could also use numpy.setmember1d to get a boolean mask that > selects the values: > > In [21]: a = np.array([ 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]) > In [22]: b = np.array([ 3, 4, 5, 6, 7, 8, 9, 10]) > In [23]: ismember = np.setmember1d(a,b) > In [24]: a[~ismember] > Out[24]: array([1, 2]) Yes, that's a nice solution. In fact, my solution is wrong for the case that b is not included in a: In [52]: a = np.array([ 1, 2, 3, 4, 5, 6, 7, 8]) In [53]: b = np.array([ 3, 4, 5, 6, 7, 8, 9, 10]) In [54]: a[~np.setmember1d(a,b)] Out[54]: array([1, 2]) In [55]: np.array(list(set(a) ^ set(b))) Out[55]: array([ 1, 2, 9, 10]) Cheers, -- Francesc Alted From bpederse at gmail.com Mon Feb 9 11:22:35 2009 From: bpederse at gmail.com (Brent Pedersen) Date: Mon, 9 Feb 2009 08:22:35 -0800 Subject: [Numpy-discussion] Comparison of arrays In-Reply-To: References: <200902090945.02962.faltet@pytables.org> Message-ID: On Mon, Feb 9, 2009 at 6:02 AM, Neil wrote: > >> > I have two integer arrays of different shape, e.g. >> > >> > >>> a >> > >> > array([ 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]) >> > >> > >>> b >> > >> > array([ 3, 4, 5, 6, 7, 8, 9, 10]) >> > >> > How can I extract the values that belong to the array a >> > exclusively i.e. array([1,2]) ? >> > > You could also use numpy.setmember1d to get a boolean mask that selects > the values: > > In [21]: a = np.array([ 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]) > In [22]: b = np.array([ 3, 4, 5, 6, 7, 8, 9, 10]) > In [23]: ismember = np.setmember1d(a,b) > In [24]: a[~ismember] > Out[24]: array([1, 2]) > > > Neil > > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > there's also np.setdiff1d() which does the above in a single line. -brent From charlesr.harris at gmail.com Mon Feb 9 14:05:18 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 9 Feb 2009 12:05:18 -0700 Subject: [Numpy-discussion] how to get the corresponding eigenvector for a specific eigen value? 
In-Reply-To: <42099e890902082332r734667d2udf43e6f93582c15a@mail.gmail.com>
References: <42099e890902082332r734667d2udf43e6f93582c15a@mail.gmail.com>
Message-ID:

On Mon, Feb 9, 2009 at 12:32 AM, Xiaoyu Chu wrote:
> Hey all,
> I am currently working on a large matrix, and I already have a
> specific eigenvalue that I want to use in order to find out its
> corresponding eigenvector. Is there an easy way to do so?
>
> I have tried with linalg.solve(a, b), where I put a as the
> matrix A - eigenvalue * unit matrix, and b as the zero matrix. But the
> solution returned is a zero matrix, which I really find disappointing.
>
> I have also tried with eig(A), which finds out all the eigenvectors
> of matrix A, but it takes too long to run, especially since the order
> of my matrix is around 10,000.
>
> So right now, I really find myself stuck. Is there anyone who can help
> me?
>

The usual trick for this is inverse iteration, http://en.wikipedia.org/wiki/Inverse_iteration, which involves solving the almost singular system (A - eig*I) x = rhs for an appropriately chosen rhs. The required eigenvector will dominate the solution and will grow rapidly in amplitude over a few iterations.

Chuck

From h5py at alfven.org Mon Feb 9 15:15:52 2009
From: h5py at alfven.org (Andrew Collette)
Date: Mon, 9 Feb 2009 12:15:52 -0800
Subject: [Numpy-discussion] ANN: HDF5 for Python 1.1
Message-ID:

=====================================
Announcing HDF5 for Python (h5py) 1.1
=====================================

What is h5py?
-------------

HDF5 for Python (h5py) is a general-purpose Python interface to the Hierarchical Data Format library, version 5. HDF5 is a versatile, mature scientific software library designed for the fast, flexible storage of enormous amounts of data.

From a Python programmer's perspective, HDF5 provides a robust way to store data, organized by name in a tree-like fashion. You can create datasets (arrays on disk) hundreds of gigabytes in size, and perform random-access I/O on desired sections. Datasets are organized in a filesystem-like hierarchy using containers called "groups", and accessed using the traditional POSIX /path/to/resource syntax.

In addition to providing interoperability with existing HDF5 datasets and platforms, h5py is a convenient way to store and retrieve arbitrary NumPy data and metadata.

New features in 1.1
-------------------

- A new compression filter based on the LZF library, which provides
  transparent compression many times faster than the standard HDF5
  GZIP filter.

- Efficient broadcasting using HDF5 hyperslab selections; for example,
  you can write to a (2000 x 100 x 50) selection from a (100 x 50)
  source array.
- Now supports the NumPy boolean type

- Auto-completion for IPython 0.9.X (contributed by Darren Dale)

- Installable via easy_install

Standard features
-----------------

- Supports storage of NumPy data of the following types:

  * Integer/Unsigned Integer
  * Float/Double
  * Complex/Double Complex
  * Compound ("recarray")
  * Strings
  * Boolean
  * Array (as members of a compound type only)
  * Void

- Random access to datasets using the standard NumPy slicing syntax,
  including fancy indexing and point-based selection

- Transparent compression of datasets using GZIP, LZF or SZIP,
  and error-detection using Fletcher32

- "Pythonic" interface supporting dictionary and NumPy-array metaphors
  for the high-level HDF5 abstractions like groups and datasets

- A comprehensive, object-oriented wrapping of the HDF5 low-level C API
  via Cython, in addition to the NumPy-like high-level interface.

- Supports many new features of HDF5 1.8, including recursive iteration
  over entire files and in-library copy operations on the file tree

- Thread-safe

Where to get it
---------------

* Main website, documentation: http://h5py.alfven.org

* Downloads, bug tracker: http://h5py.googlecode.com

Requires
--------

* Linux, Mac OS-X or Windows

* Python 2.5 (Windows), Python 2.5 or 2.6 (Linux/Mac OS-X)

* NumPy 1.0.3 or later

* HDF5 1.6.5 or later (including 1.8); HDF5 is included with
  the Windows version.

Thanks
------

Thanks to D. Dale, E. Lawrence and others for their continued support and comments. Also thanks to Francesc Alted and the PyTables project, for inspiration and generously providing their code to the community. Thanks to everyone at the HDF Group for creating such a useful piece of software.

From ondrej at certik.cz Mon Feb 9 16:06:01 2009
From: ondrej at certik.cz (Ondrej Certik)
Date: Mon, 9 Feb 2009 13:06:01 -0800
Subject: [Numpy-discussion] ANN: HDF5 for Python 1.1
In-Reply-To:
References:
Message-ID: <85b5c3130902091306m528835a9k5cff7eded4bed94d@mail.gmail.com>

On Mon, Feb 9, 2009 at 12:15 PM, Andrew Collette wrote:
> =====================================
> Announcing HDF5 for Python (h5py) 1.1
> =====================================
>
> What is h5py?
> -------------
>
> HDF5 for Python (h5py) is a general-purpose Python interface to the
> Hierarchical Data Format library, version 5. HDF5 is a versatile,
> mature scientific software library designed for the fast, flexible
> storage of enormous amounts of data.

Interesting project. My first question was how it is related to PyTables. The answer is here:

http://code.google.com/p/h5py/wiki/FAQ#What%27s_the_difference_between_h5py_and_Pytables?

Ondrej

From simon.palmer at gmail.com Mon Feb 9 16:35:41 2009
From: simon.palmer at gmail.com (Simon Palmer)
Date: Mon, 9 Feb 2009 13:35:41 -0800 (PST)
Subject: [Numpy-discussion] Profiling with cProfile
Message-ID: <21922525.post@talk.nabble.com>

Hi,
I am trying to profile a bit of code I have written using cProfile. When I run it I get the message:

TypeError: unhashable type: 'numpy.ndarray'

I am using runctx with some local variables which are ndarrays.

I am guessing that this is a limitation of either cProfile or numpy or the mix of the two. What is the recommended approach for profiling my code containing ndarrays?
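A minimal sketch of the runcall() route that Robert Kern suggests further down in this digest; the profiled function here is a made-up placeholder, not Simon's actual code:

import cProfile
import numpy as np

def crunch(a):
    # stand-in for the real ndarray-heavy code being profiled
    return np.sort(a, axis=0).cumsum(axis=0)

a = np.random.rand(1000, 50)
p = cProfile.Profile()
p.runcall(crunch, a)   # ndarrays go in as ordinary arguments
p.print_stats(sort='cumulative')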
From ondrej at certik.cz Mon Feb 9 17:06:30 2009 From: ondrej at certik.cz (Ondrej Certik) Date: Mon, 9 Feb 2009 14:06:30 -0800 Subject: [Numpy-discussion] preferred numpy build system In-Reply-To: <498FA979.4060201@ar.media.kyoto-u.ac.jp> References: <85b5c3130902062204p6f397fa0yb5b90178bb7d9b83@mail.gmail.com> <5b8d13220902070642h281ce30bn2e6b6fcf1583345f@mail.gmail.com> <85b5c3130902071021q3f524771ra4b1f175d06f0f68@mail.gmail.com> <5b8d13220902080310g82f0eacyc9dfa176d8581dc5@mail.gmail.com> <85b5c3130902081139n45e6b72dl770862baa87065c4@mail.gmail.com> <5b8d13220902081658l576bc36dodfedd75a5389b23@mail.gmail.com> <85b5c3130902081933k13eda392yf892f632436b8eda@mail.gmail.com> <498FA979.4060201@ar.media.kyoto-u.ac.jp> Message-ID: <85b5c3130902091406h60dd3cd0q4579da91020c3b2d@mail.gmail.com> On Sun, Feb 8, 2009 at 7:56 PM, David Cournapeau wrote: > Ondrej Certik wrote: >>> That's exactly what I don't like about cmake - it means you can't >>> produce accurate builds (you need to rerun cmake everytime you change >>> the configuration or dependencies, whereas this is automatic with >>> scons/waf). It also have (used to have) very poor documentation unless >>> you buy the book - but it looks like this is changing. >>> >> >> You can always rerun cmake if you want (and make it automatic). Imho >> that's not a problem. But maybe it is done in a better way with scons. >> > > I think it is a problem - it means you have to update it explicitly when > the configuration changes. In scons, the signature concept is quite > powerful: not only are file dependencies handled, but also command > lines, etc... For example, in numscons, if you change the blas/lapack > from atlas to say MKL, only linking and eventually configuration changes > are rebuilt. If you change the fortran compiler but not the C compiler, > only fortran code is rebuilt. All of this is 100 % automatic. I got an email from Alexander Neundorf from kde, and with his permission, I am forwarding it here, because he is not subscribed. As he writes below, cmake is in fact cabable of the above. ---------- Forwarded message ---------- From: Alexander Neundorf Date: Mon, Feb 9, 2009 at 1:28 PM Subject: cmake information To: Ondrej Certik Hi Ondrej, I hope you are the Ondrej from this thread: http://groups.google.com/group/Numpy-discussion/browse_thread/thread/c12f96b9ac367f57 If not, please let me know and I'll try to find the right email address. I saw this in the thread: ----------8<-------------8<----------------8<---------------- Ondrej Certik wrote: >> That's exactly what I don't like about cmake - it means you can't >> produce accurate builds (you need to rerun cmake everytime you change >> the configuration or dependencies, whereas this is automatic with >> scons/waf). It also have (used to have) very poor documentation unless >> you buy the book - but it looks like this is changing. > You can always rerun cmake if you want (and make it automatic). Imho > that's not a problem. But maybe it is done in a better way with scons. I think it is a problem - it means you have to update it explicitly when the configuration changes. In scons, the signature concept is quite powerful: not only are file dependencies handled, but also command lines, etc... For example, in numscons, if you change the blas/lapack from atlas to say MKL, only linking and eventually configuration changes are rebuilt. If you change the fortran compiler but not the C compiler, only fortran code is rebuilt. All of this is 100 % automatic. 
----------8<-------------8<----------------8<---------------- CMake does handle this automatically. E.g. if include directories are changed (which you do by editing a CMakeLists.txt or the cmake cache), all files which are affected by the are rebuilt. If some library changes, everything linking to this library is linked again. If any of the files the build depends on (e.g. a CMakeLists.txt, or an included .cmake file, or the cmake cache) is changed, cmake is automatically rerun, and the makefiles/project files are regenerated. I don't know what happens if you are using both C and Fortran in one project and change one of them. I think this is actually not possible, you can (or should) not change the compiler of an existing build tree. Basically everything which uses this compiler is invalid then, the object files, the results of tests etc. I'm not sure handling this separately for different languages within one project is supported by cmake. Alex From mtrumpis at berkeley.edu Mon Feb 9 17:25:54 2009 From: mtrumpis at berkeley.edu (M Trumpis) Date: Mon, 9 Feb 2009 14:25:54 -0800 Subject: [Numpy-discussion] SVD errors In-Reply-To: References: <34928.128.32.52.185.1233620513.squirrel@calmail.berkeley.edu> <3d375d730902021627x2ef54411j5bb9504927ff831@mail.gmail.com> Message-ID: I played around with a C translation of that test program, and found that dgesvd (but not dgesdd) happens to converge and return all non-negative singular values for both operators I was having trouble with. I'm also looking at the Octave code just now, and I think they're using dgesvd also. Any one know why numpy uses dgesdd? speed? Mike On Tue, Feb 3, 2009 at 12:46 AM, Pauli Virtanen wrote: > Mon, 02 Feb 2009 18:27:05 -0600, Robert Kern wrote: >> On Mon, Feb 2, 2009 at 18:21, wrote: >>> Hello list.. I've run into two SVD errors over the last few days. Both >>> errors are identical in numpy/scipy. >>> >>> I've submitted a ticket for the 1st problem (numpy ticket #990). >>> Summary is: some builds of the lapack_lite module linking against >>> system LAPACK (not the bundled dlapack_lite.o, etc) give a >>> "LinAlgError: SVD did not converge" exception on my matrix. This error >>> does occur using Mac's Accelerate framework LAPACK, and a coworker's >>> Ubuntu LAPACK version. It does not seem to happen using ATLAS LAPACK >>> (nor using Octave/Matlab on said Ubuntu) >> >> These are almost certainly issues with the particular implementations of >> LAPACK that you are using. I don't think there is anything we can do >> from numpy or scipy to change this. > > Yes, this is almost certainly a LAPACK problem. If in doubt, you can test > it with the following F90 program (be sure to link it against the same > LAPACK as Numpy). Save the matrices with 'np.savetxt("foo.txt", x.ravel > ())' and run './test < foo.txt'. > > ---- > program test > implicit none > integer, parameter :: N = 128 > double precision :: A(N*N), S(N), U(N,N), Vh(N,N) > double precision, allocatable :: WORK(:) > double precision :: tmp > integer :: IWORK(8*N) > integer :: INFO = 0, LWORK > > read(*,*) A > A = reshape(transpose(reshape(A, (/N, N/))), (/ N*N /)) > call dgesdd('A', N, N, A, N, S, U, N, Vh, N, tmp, -1, IWORK, INFO) > LWORK = tmp > if (info .ne. 0) stop 'lwork query failed' > write(*,*) 'lwork:', lwork > allocate(WORK(LWORK)) > call dgesdd('A', N, N, A, N, S, U, N, Vh, N, WORK, LWORK, IWORK, INFO) > write(*,*) 'info:', INFO > write(*,*) 'min(S):', minval(S) > if (INFO .ne. 0) then > write(*,*) ' -> SVD failed to converge' > end if > if (minval(S) .lt. 
0) then > write(*,*) ' -> negative singular value' > end if > end program > ---- > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From ellisonbg.net at gmail.com Mon Feb 9 17:35:09 2009 From: ellisonbg.net at gmail.com (Brian Granger) Date: Mon, 9 Feb 2009 14:35:09 -0800 Subject: [Numpy-discussion] preferred numpy build system In-Reply-To: <85b5c3130902091406h60dd3cd0q4579da91020c3b2d@mail.gmail.com> References: <85b5c3130902062204p6f397fa0yb5b90178bb7d9b83@mail.gmail.com> <5b8d13220902070642h281ce30bn2e6b6fcf1583345f@mail.gmail.com> <85b5c3130902071021q3f524771ra4b1f175d06f0f68@mail.gmail.com> <5b8d13220902080310g82f0eacyc9dfa176d8581dc5@mail.gmail.com> <85b5c3130902081139n45e6b72dl770862baa87065c4@mail.gmail.com> <5b8d13220902081658l576bc36dodfedd75a5389b23@mail.gmail.com> <85b5c3130902081933k13eda392yf892f632436b8eda@mail.gmail.com> <498FA979.4060201@ar.media.kyoto-u.ac.jp> <85b5c3130902091406h60dd3cd0q4579da91020c3b2d@mail.gmail.com> Message-ID: <6ce0ac130902091435o479568e7sf869ddcf69ef91d5@mail.gmail.com> > CMake does handle this automatically. > E.g. if include directories are changed (which you do by editing a > CMakeLists.txt or the cmake cache), all files which are affected by the are > rebuilt. If some library changes, everything linking to this library is > linked again. > If any of the files the build depends on (e.g. a CMakeLists.txt, or an > included .cmake file, or the cmake cache) is changed, cmake is automatically > rerun, and the makefiles/project files are regenerated. > > I don't know what happens if you are using both C and Fortran in one project > and change one of them. I think this is actually not possible, you can (or > should) not change the compiler of an existing build tree. Basically > everything which uses this compiler is invalid then, the object files, the > results of tests etc. I'm not sure handling this separately for different > languages within one project is supported by cmake. Is all of this handled just by calling make (after the initial cmake call), or do you have to first recall cmake and then make? Brian > Alex > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From robert.kern at gmail.com Mon Feb 9 17:40:02 2009 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 9 Feb 2009 16:40:02 -0600 Subject: [Numpy-discussion] SVD errors In-Reply-To: References: <34928.128.32.52.185.1233620513.squirrel@calmail.berkeley.edu> <3d375d730902021627x2ef54411j5bb9504927ff831@mail.gmail.com> Message-ID: <3d375d730902091440u23d72956le54b4894751a8dc7@mail.gmail.com> On Mon, Feb 9, 2009 at 16:25, M Trumpis wrote: > I played around with a C translation of that test program, and found > that dgesvd (but not dgesdd) happens to converge and return all > non-negative singular values for both operators I was having trouble > with. I'm also looking at the Octave code just now, and I think > they're using dgesvd also. Any one know why numpy uses dgesdd? speed? Yes. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." 
-- Umberto Eco From robert.kern at gmail.com Mon Feb 9 17:44:14 2009 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 9 Feb 2009 16:44:14 -0600 Subject: [Numpy-discussion] Profiling with cProfile In-Reply-To: <21922525.post@talk.nabble.com> References: <21922525.post@talk.nabble.com> Message-ID: <3d375d730902091444o4f914aa0hf12c71b82ae9624e@mail.gmail.com> On Mon, Feb 9, 2009 at 15:35, Simon Palmer wrote: > > Hi, > I am trying to profile a bit of code I have written using cProfile. When I > run it I get the message: > > TypeError: unhashable type: 'numpy.ndarray' > > I am using runctx with some local variables which are ndarrays. > > I am guessing that this is a lmitation of either cProfile or numpy or the > mix of the two. What is the recommended approach for profiling my code > containing ndarrays? You might try using runcall() instead. I'm not really familiar with what cProfile may be doing in runctx() that will make it try to use objects in the namespace as keys in a dict. You may want to take a look at my kernprof.py script which is a useful driver script for profiling any Python script without your having to bother with making your own driver script. http://packages.python.org/line_profiler/ http://packages.python.org/line_profiler/kernprof.py -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From h5py at alfven.org Mon Feb 9 17:51:57 2009 From: h5py at alfven.org (Andrew Collette) Date: Mon, 9 Feb 2009 14:51:57 -0800 Subject: [Numpy-discussion] ANN: HDF5 for Python 1.1 In-Reply-To: <85b5c3130902091306m528835a9k5cff7eded4bed94d@mail.gmail.com> References: <85b5c3130902091306m528835a9k5cff7eded4bed94d@mail.gmail.com> Message-ID: Thanks, Ondrej. For the record, h5py is designed to provide a "NumPy-like" interface to HDF5, along with a near-complete wrapping of the low-level HDF5 C API. It has none of the database-like features of PyTables. The FAQ entry has more info. Andrew Collette On Mon, Feb 9, 2009 at 1:06 PM, Ondrej Certik wrote: > On Mon, Feb 9, 2009 at 12:15 PM, Andrew Collette wrote: >> ===================================== >> Announcing HDF5 for Python (h5py) 1.1 >> ===================================== >> >> What is h5py? >> ------------- >> >> HDF5 for Python (h5py) is a general-purpose Python interface to the >> Hierarchical Data Format library, version 5. HDF5 is a versatile, >> mature scientific software library designed for the fast, flexible >> storage of enormous amounts of data. > > Interesting project. My first question was how it is related to > pytables. The answer is here: > > http://code.google.com/p/h5py/wiki/FAQ#What%27s_the_difference_between_h5py_and_Pytables? 
> > > Ondrej > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > From ondrej at certik.cz Mon Feb 9 18:00:11 2009 From: ondrej at certik.cz (Ondrej Certik) Date: Mon, 9 Feb 2009 15:00:11 -0800 Subject: [Numpy-discussion] preferred numpy build system In-Reply-To: <6ce0ac130902091435o479568e7sf869ddcf69ef91d5@mail.gmail.com> References: <85b5c3130902062204p6f397fa0yb5b90178bb7d9b83@mail.gmail.com> <5b8d13220902070642h281ce30bn2e6b6fcf1583345f@mail.gmail.com> <85b5c3130902071021q3f524771ra4b1f175d06f0f68@mail.gmail.com> <5b8d13220902080310g82f0eacyc9dfa176d8581dc5@mail.gmail.com> <85b5c3130902081139n45e6b72dl770862baa87065c4@mail.gmail.com> <5b8d13220902081658l576bc36dodfedd75a5389b23@mail.gmail.com> <85b5c3130902081933k13eda392yf892f632436b8eda@mail.gmail.com> <498FA979.4060201@ar.media.kyoto-u.ac.jp> <85b5c3130902091406h60dd3cd0q4579da91020c3b2d@mail.gmail.com> <6ce0ac130902091435o479568e7sf869ddcf69ef91d5@mail.gmail.com> Message-ID: <85b5c3130902091500m5f72a9dftdf752c2d973880d@mail.gmail.com> On Mon, Feb 9, 2009 at 2:35 PM, Brian Granger wrote: >> CMake does handle this automatically. >> E.g. if include directories are changed (which you do by editing a >> CMakeLists.txt or the cmake cache), all files which are affected by the are >> rebuilt. If some library changes, everything linking to this library is >> linked again. >> If any of the files the build depends on (e.g. a CMakeLists.txt, or an >> included .cmake file, or the cmake cache) is changed, cmake is automatically >> rerun, and the makefiles/project files are regenerated. >> >> I don't know what happens if you are using both C and Fortran in one project >> and change one of them. I think this is actually not possible, you can (or >> should) not change the compiler of an existing build tree. Basically >> everything which uses this compiler is invalid then, the object files, the >> results of tests etc. I'm not sure handling this separately for different >> languages within one project is supported by cmake. > > Is all of this handled just by calling make (after the initial cmake > call), or do you have to first recall cmake and then make? I just tried that and it is handled automatically by calling "make". It's really cool! Ondrej From mail at stevesimmons.com Tue Feb 10 01:30:41 2009 From: mail at stevesimmons.com (Stephen Simmons) Date: Tue, 10 Feb 2009 07:30:41 +0100 Subject: [Numpy-discussion] ANN: HDF5 for Python 1.1 In-Reply-To: References: Message-ID: <49911F11.9000109@stevesimmons.com> Hi Andrew, Do you have any plans to support LZO compression in h5py? I have lots of LZO-compressed datasets created with PyTables. There's a real barrier to using both h5py and PyTables if the fast decompressor options are just LZF on h5py and LZO on PyTables. Many thanks Stephen Andrew Collette wrote: > ===================================== > Announcing HDF5 for Python (h5py) 1.1 > ===================================== > > What is h5py? > ------------- > > HDF5 for Python (h5py) is a general-purpose Python interface to the > Hierarchical Data Format library, version 5. HDF5 is a versatile, > mature scientific software library designed for the fast, flexible > storage of enormous amounts of data. > > >From a Python programmer's perspective, HDF5 provides a robust way to > store data, organized by name in a tree-like fashion. 
You can create > datasets (arrays on disk) hundreds of gigabytes in size, and perform > random-access I/O on desired sections. Datasets are organized in a > filesystem-like hierarchy using containers called "groups", and > accesed using the tradional POSIX /path/to/resource syntax. > > In addition to providing interoperability with existing HDF5 datasets > and platforms, h5py is a convienient way to store and retrieve > arbitrary NumPy data and metadata. > > > New features in 1.1 > ------------------- > > - A new compression filter based on the LZF library, which provides > transparent compression many times faster than the standard HDF5 > GZIP filter. > > - Efficient broadcasting using HDF5 hyperslab selections; for example, > you can write to a (2000 x 100 x 50) selection from a (100 x 50) > source array. > > - Now supports the NumPy boolean type > > - Auto-completion for IPython 0.9.X (contributed by Darren Dale) > > - Installable via easy_install > > > Standard features > ----------------- > > - Supports storage of NumPy data of the following types: > > * Integer/Unsigned Integer > * Float/Double > * Complex/Double Complex > * Compound ("recarray") > * Strings > * Boolean > * Array (as members of a compound type only) > * Void > > - Random access to datasets using the standard NumPy slicing syntax, > including fancy indexing and point-based selection > > - Transparent compression of datasets using GZIP, LZF or SZIP, > and error-detection using Fletcher32 > > - "Pythonic" interface supporting dictionary and NumPy-array metaphors > for the high-level HDF5 abstrations like groups and datasets > > - A comprehensive, object-oriented wrapping of the HDF5 low-level C API > via Cython, in addition to the NumPy-like high-level interface. > > - Supports many new features of HDF5 1.8, including recursive iteration > over entire files and in-library copy operations on the file tree > > - Thread-safe > > > Where to get it > --------------- > > * Main website, documentation: http://h5py.alfven.org > > * Downloads, bug tracker: http://h5py.googlecode.com > > > Requires > -------- > > * Linux, Mac OS-X or Windows > > * Python 2.5 (Windows), Python 2.5 or 2.6 (Linux/Mac OS-X) > > * NumPy 1.0.3 or later > > * HDF5 1.6.5 or later (including 1.8); HDF5 is included with > the Windows version. > > > Thanks > ------ > > Thanks to D. Dale, E. Lawrence and other for their continued support > and comments. Also thanks to the Francesc Alted and the PyTables project, > for inspiration and generously providing their code to the community. Thanks > to everyone at the HDF Group for creating such a useful piece of software. > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > > From h5py at alfven.org Tue Feb 10 04:06:10 2009 From: h5py at alfven.org (Andrew Collette) Date: Tue, 10 Feb 2009 01:06:10 -0800 Subject: [Numpy-discussion] ANN: HDF5 for Python 1.1 In-Reply-To: <49911F11.9000109@stevesimmons.com> References: <49911F11.9000109@stevesimmons.com> Message-ID: Hi Stephen, There are no immediate plans to support LZO in h5py, and in fact I'm starting to regret including any fast compressor at all as I'm now responsible for maintaining it. :) The reason for the dichotomy is that LZO is released under the GPL, which is incompatible with h5py's license. 
The LZF license lets me embed it in the distribution, which (1) frees me from an external dependency, and (2) means you can always open h5py-created files with h5py, regardless of the platform or build options. Besides which, I figured people who used PyTables LZO were happy using PyTables. :) Andrew Collette On Mon, Feb 9, 2009 at 10:30 PM, Stephen Simmons wrote: > Hi Andrew, > > Do you have any plans to support LZO compression in h5py? > > I have lots of LZO-compressed datasets created with PyTables. > There's a real barrier to using both h5py and PyTables if the fast > decompressor options are just LZF on h5py and LZO on PyTables. > > Many thanks > Stephen > > > Andrew Collette wrote: >> ===================================== >> Announcing HDF5 for Python (h5py) 1.1 >> ===================================== >> >> What is h5py? >> ------------- >> >> HDF5 for Python (h5py) is a general-purpose Python interface to the >> Hierarchical Data Format library, version 5. HDF5 is a versatile, >> mature scientific software library designed for the fast, flexible >> storage of enormous amounts of data. >> >> >From a Python programmer's perspective, HDF5 provides a robust way to >> store data, organized by name in a tree-like fashion. You can create >> datasets (arrays on disk) hundreds of gigabytes in size, and perform >> random-access I/O on desired sections. Datasets are organized in a >> filesystem-like hierarchy using containers called "groups", and >> accesed using the tradional POSIX /path/to/resource syntax. >> >> In addition to providing interoperability with existing HDF5 datasets >> and platforms, h5py is a convienient way to store and retrieve >> arbitrary NumPy data and metadata. >> >> >> New features in 1.1 >> ------------------- >> >> - A new compression filter based on the LZF library, which provides >> transparent compression many times faster than the standard HDF5 >> GZIP filter. >> >> - Efficient broadcasting using HDF5 hyperslab selections; for example, >> you can write to a (2000 x 100 x 50) selection from a (100 x 50) >> source array. >> >> - Now supports the NumPy boolean type >> >> - Auto-completion for IPython 0.9.X (contributed by Darren Dale) >> >> - Installable via easy_install >> >> >> Standard features >> ----------------- >> >> - Supports storage of NumPy data of the following types: >> >> * Integer/Unsigned Integer >> * Float/Double >> * Complex/Double Complex >> * Compound ("recarray") >> * Strings >> * Boolean >> * Array (as members of a compound type only) >> * Void >> >> - Random access to datasets using the standard NumPy slicing syntax, >> including fancy indexing and point-based selection >> >> - Transparent compression of datasets using GZIP, LZF or SZIP, >> and error-detection using Fletcher32 >> >> - "Pythonic" interface supporting dictionary and NumPy-array metaphors >> for the high-level HDF5 abstrations like groups and datasets >> >> - A comprehensive, object-oriented wrapping of the HDF5 low-level C API >> via Cython, in addition to the NumPy-like high-level interface. 
>> >> - Supports many new features of HDF5 1.8, including recursive iteration >> over entire files and in-library copy operations on the file tree >> >> - Thread-safe >> >> >> Where to get it >> --------------- >> >> * Main website, documentation: http://h5py.alfven.org >> >> * Downloads, bug tracker: http://h5py.googlecode.com >> >> >> Requires >> -------- >> >> * Linux, Mac OS-X or Windows >> >> * Python 2.5 (Windows), Python 2.5 or 2.6 (Linux/Mac OS-X) >> >> * NumPy 1.0.3 or later >> >> * HDF5 1.6.5 or later (including 1.8); HDF5 is included with >> the Windows version. >> >> >> Thanks >> ------ >> >> Thanks to D. Dale, E. Lawrence and other for their continued support >> and comments. Also thanks to the Francesc Alted and the PyTables project, >> for inspiration and generously providing their code to the community. Thanks >> to everyone at the HDF Group for creating such a useful piece of software. >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion >> >> >> > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > From faltet at pytables.org Tue Feb 10 04:36:13 2009 From: faltet at pytables.org (Francesc Alted) Date: Tue, 10 Feb 2009 10:36:13 +0100 Subject: [Numpy-discussion] ANN: HDF5 for Python 1.1 In-Reply-To: <85b5c3130902091306m528835a9k5cff7eded4bed94d@mail.gmail.com> References: <85b5c3130902091306m528835a9k5cff7eded4bed94d@mail.gmail.com> Message-ID: <200902101036.13306.faltet@pytables.org> A Monday 09 February 2009, Ondrej Certik escrigu?: > On Mon, Feb 9, 2009 at 12:15 PM, Andrew Collette wrote: > > ===================================== > > Announcing HDF5 for Python (h5py) 1.1 > > ===================================== > > > > What is h5py? > > ------------- > > > > HDF5 for Python (h5py) is a general-purpose Python interface to the > > Hierarchical Data Format library, version 5. HDF5 is a versatile, > > mature scientific software library designed for the fast, flexible > > storage of enormous amounts of data. > > Interesting project. My first question was how it is related to > pytables. The answer is here: > > http://code.google.com/p/h5py/wiki/FAQ#What%27s_the_difference_betwee >n_h5py_and_Pytables? And here too: http://www.pytables.org/moin/FAQ#HowdoesPyTablescomparewiththeh5pyproject.3F Cheers, -- Francesc Alted From markus.rosenstihl at physik.tu-darmstadt.de Tue Feb 10 05:11:38 2009 From: markus.rosenstihl at physik.tu-darmstadt.de (Markus Rosenstihl) Date: Tue, 10 Feb 2009 11:11:38 +0100 Subject: [Numpy-discussion] linalg.norm missing an 'axis' kwarg?! In-Reply-To: <200811201111.15930.meine@informatik.uni-hamburg.de> References: <200811201111.15930.meine@informatik.uni-hamburg.de> Message-ID: Am 20.11.2008 um 11:11 schrieb Hans Meine: > Hi, > > I have a 2D matrix comprising a sequence of vectors, and I want to > compute the > norm of each vector. np.linalg.norm seems to be the best bet, but > it does not > support axis. Wouldn't this be a nice feature? 
Hi,

I usually do something like this:

a = random.rand(3000)
a.resize((1000,3))
vec_norms = sqrt(sum(a**2,axis=1))

It is much faster than apply_along_axis:

%timeit apply_along_axis(linalg.norm,1,a)
10 loops, best of 3: 45.3 ms per loop

%timeit sqrt(sum(a**2,axis=1))
10000 loops, best of 3: 108 µs per loop

The results are the same:

sum(apply_along_axis(linalg.norm,1,a) - sqrt(sum(a**2,axis=1)))
0.0

Regards,
Markus

From faltet at pytables.org Tue Feb 10 06:29:48 2009
From: faltet at pytables.org (Francesc Alted)
Date: Tue, 10 Feb 2009 12:29:48 +0100
Subject: [Numpy-discussion] ANN: HDF5 for Python 1.1
In-Reply-To: <49911F11.9000109@stevesimmons.com>
References: <49911F11.9000109@stevesimmons.com>
Message-ID: <200902101229.49107.faltet@pytables.org>

Hi Stephen,

A Tuesday 10 February 2009, Stephen Simmons escrigué:
> I have lots of LZO-compressed datasets created with PyTables.
> There's a real barrier to using both h5py and PyTables if the fast
> decompressor options are just LZF on h5py and LZO on PyTables.

You can always use the ptrepack utility in PyTables to convert your existing PyTables files compressed with LZO to zlib, which is the standard compressor used by HDF5. From that, you can reconvert them to LZF if this is your wish.

HTH,

-- Francesc Alted

From watson.jim at gmail.com Tue Feb 10 07:08:16 2009
From: watson.jim at gmail.com (James Watson)
Date: Tue, 10 Feb 2009 12:08:16 +0000
Subject: [Numpy-discussion] porting NumPy to Python 3
In-Reply-To: <4989DAD7.9060507@gmail.com>
References: <4989DAD7.9060507@gmail.com>
Message-ID:

I'm taking Chuck and Bruce's suggestions and making sure that the changes are compatible with Python 2.4 and 2.6.

I want to make sure diffs are against the latest code, but keep getting this svn error:

svn update
svn: OPTIONS of 'http://scipy.org/svn/numpy/trunk': Could not read
status line: Connection reset by peer (http://scipy.org)

Am I doing something wrong, or is the server temporarily down?

James.

>> To get 2to3 to run without warnings, the following files require
>> minor changes:
>> - numpy/distutils/fcompiler/intel.py
>> - numpy/distutils/misc_util.py
>> - numpy/distutils/command/build_src.py
>> - numpy/f2py/crackfortran.py
>> - numpy/lib/function_base.py
>> - numpy/linalg/lapack_lite/make_lite.py
>>
> If you follow Guido's recommendation (at the bottom of
> http://docs.python.org/dev/3.0/whatsnew/3.0.html), then we should first
> be compatible with Python 2.6 (about the current stage). Then
> compatible using Python 2.6 with the -3 flag (warn about Python 3.x
> incompatibilities) and fix warnings like:
> numpy/core/__init__.py:11: DeprecationWarning: the cPickle module has
> been removed in Python 3.0
> (Of course you lose speed of cPickle if you blindly change it to use the
> old Pickle in Python 2.x)
>
> Finally use the 2to3 tool to get a Python 3 version.
> > Bruce > > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From scott.sinclair.za at gmail.com Tue Feb 10 07:34:25 2009 From: scott.sinclair.za at gmail.com (Scott Sinclair) Date: Tue, 10 Feb 2009 14:34:25 +0200 Subject: [Numpy-discussion] porting NumPy to Python 3 In-Reply-To: References: <4989DAD7.9060507@gmail.com> Message-ID: <6a17e9ee0902100434w344e8c87p963e5c029ff2b5d3@mail.gmail.com> > 2009/2/10 James Watson : > I want to make sure diffs are against latest code, but keep getting > this svn error: > svn update > svn: OPTIONS of 'http://scipy.org/svn/numpy/trunk': Could not read > status line: Connection reset by peer (http://scipy.org) There is some problem at the moment. This seems to be quite common recently, during the very early morning (USA time zones). I guess this is because Murphy's law states that all server problems occur when the admin is trying to get some sleep ;-) Cheers, Scott From paul at rudin.co.uk Tue Feb 10 08:18:59 2009 From: paul at rudin.co.uk (Paul Rudin) Date: Tue, 10 Feb 2009 13:18:59 +0000 Subject: [Numpy-discussion] numpy-izing a loop Message-ID: <87y6we1lbw.fsf@rudin.co.uk> I've just written this snippet of code: result = numpy.empty((dim, dim, dim), numpy.bool) for x in xrange(dim): for y in xrange(dim): for z in xrange(dim): result[x, y, z] = ((xnear[y, z] < xfar[y, z]) and (ynear[x, z] < yfar[x, z]) and (znear[x, y] < zfar[x, y])) Is there a way to get numpy to do the looping? From meine at informatik.uni-hamburg.de Tue Feb 10 09:02:39 2009 From: meine at informatik.uni-hamburg.de (Hans Meine) Date: Tue, 10 Feb 2009 15:02:39 +0100 Subject: [Numpy-discussion] linalg.norm missing an 'axis' kwarg?! In-Reply-To: References: <200811201111.15930.meine@informatik.uni-hamburg.de> Message-ID: <200902101502.40982.meine@informatik.uni-hamburg.de> On Tuesday 10 February 2009 11:11:38 Markus Rosenstihl wrote: > i usually do something like this: > > a = random.rand(3000) > a.resize((1000,3)) > vec_norms = sqrt(sum(a**2,axis=1)) If you look at the patch I posted (OK, that was some weeks ago, so I'll attach it again for your convenience), that's (more or less) exactly what I proposed. Have a nice day, Hans -------------- next part -------------- A non-text attachment was scrubbed... Name: numpy_norm_axis.diff Type: text/x-patch Size: 1597 bytes Desc: not available URL: From chanley at stsci.edu Tue Feb 10 10:01:22 2009 From: chanley at stsci.edu (Christopher Hanley) Date: Tue, 10 Feb 2009 10:01:22 -0500 Subject: [Numpy-discussion] Solaris 8, 32 bit python issue Message-ID: <499196C2.3060206@stsci.edu> This problem is on a 32bit Solaris 8 system. 
====================================================================== FAIL: Test find_duplicates ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/ra/pyssg/2.5.1/numpy/lib/tests/test_recfunctions.py", line 150, in test_find_duplicates assert_equal(test[-1], control) File "/usr/stsci/pyssgdev/2.5.1/numpy/ma/testutils.py", line 121, in assert_equal return assert_array_equal(actual, desired, err_msg) File "/usr/stsci/pyssgdev/2.5.1/numpy/ma/testutils.py", line 193, in assert_array_equal header='Arrays are not equal') File "/usr/stsci/pyssgdev/2.5.1/numpy/ma/testutils.py", line 186, in assert_array_compare verbose=verbose, header=header) File "/usr/stsci/pyssgdev/2.5.1/numpy/testing/utils.py", line 295, in assert_array_compare raise AssertionError(msg) AssertionError: Arrays are not equal (mismatch 100.0%) x: array([2, 0]) y: array([0, 2]) ---------------------------------------------------------------------- Ran 1899 tests in 99.598s -- Christopher Hanley Senior Systems Software Engineer Space Telescope Science Institute 3700 San Martin Drive Baltimore MD, 21218 (410) 338-4338 From stefan at sun.ac.za Tue Feb 10 10:40:10 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Tue, 10 Feb 2009 17:40:10 +0200 Subject: [Numpy-discussion] linalg.norm missing an 'axis' kwarg?! In-Reply-To: <200902101502.40982.meine@informatik.uni-hamburg.de> References: <200811201111.15930.meine@informatik.uni-hamburg.de> <200902101502.40982.meine@informatik.uni-hamburg.de> Message-ID: <9457e7c80902100740r1ec14650r1c2f7883aa7326ec@mail.gmail.com> Hi Hans 2009/2/10 Hans Meine : > If you look at the patch I posted (OK, that was some weeks ago, so I'll attach > it again for your convenience), that's (more or less) exactly what I proposed. Would you mind adding some tests to the patch? Cheers St?fan From lists_ravi at lavabit.com Tue Feb 10 11:13:36 2009 From: lists_ravi at lavabit.com (Ravi) Date: Tue, 10 Feb 2009 11:13:36 -0500 Subject: [Numpy-discussion] Unnecessary axes in recarrays Message-ID: <200902101113.36678.lists_ravi@lavabit.com> Hi, Recarrays seem to sprout extra axes when they are nested: ---- In [98]: np.__version__ Out[98]: '1.2.0' In [99]: d1 = dtype( [ ('a',uint8,2), ('b',uint8,1) ] ) In [100]: d2 = dtype( [ ('c',uint8,1), ('d',d1,1) ] ) In [101]: d3 = dtype( [ ('c',uint8,1), ('d',d1,2) ] ) In [102]: a1 = zeros( (4,), dtype=uint8 ).view( d2 ) In [103]: a1['d'].shape Out[103]: (1,) In [104]: a2 = zeros( (7,), dtype=uint8 ).view( d3 ) In [105]: a2['d'].shape Out[105]: (1, 2) In [106]: d4 = dtype( [ ('c',uint8,1), ('d',d1,(1,1)) ] ) In [107]: zeros( (4,), dtype=uint8 ).view( d2 )[ 'd' ].shape Out[107]: (1,) ---- Why does d3 field 'd' have an extra axis? And why does d4 field 'd' have only one axis? 
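For what it is worth, a self-contained version of the surprising case, plus a squeeze() workaround; squeeze() only hides the extra axis, it does not explain where it comes from (a sketch, continuing the session above):

import numpy as np

d1 = np.dtype([('a', np.uint8, 2), ('b', np.uint8)])
d3 = np.dtype([('c', np.uint8), ('d', d1, 2)])
a2 = np.zeros(7, dtype=np.uint8).view(d3)

print a2['d'].shape            # (1, 2) -- the unexpected leading axis
print a2['d'].squeeze().shape  # (2,)  -- squeeze drops the length-1 axis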
Regards, Ravi From stefan at sun.ac.za Tue Feb 10 11:19:54 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Tue, 10 Feb 2009 18:19:54 +0200 Subject: [Numpy-discussion] numpy-izing a loop In-Reply-To: <87y6we1lbw.fsf@rudin.co.uk> References: <87y6we1lbw.fsf@rudin.co.uk> Message-ID: <9457e7c80902100819t139d6342y8885f417a4ec4443@mail.gmail.com> Hi Paul 2009/2/10 Paul Rudin : > > I've just written this snippet of code: > > result = numpy.empty((dim, dim, dim), numpy.bool) > for x in xrange(dim): > for y in xrange(dim): > for z in xrange(dim): > result[x, y, z] = ((xnear[y, z] < xfar[y, z]) and > (ynear[x, z] < yfar[x, z]) and > (znear[x, y] < zfar[x, y])) > > > Is there a way to get numpy to do the looping? You can try the following: x = np.arange(dim) y = np.arange(dim) z = np.arange(dim) result = (xnear[y,z] < xfar[y, z]) & (ynear[x, z] < yfar[x, z]) & (znear[x,y] < zfar[x,y]) It broadcasts correctly, but you'll have to verify the results to be sure. Cheers St?fan From mjanikas at esri.com Tue Feb 10 14:29:07 2009 From: mjanikas at esri.com (Mark Janikas) Date: Tue, 10 Feb 2009 11:29:07 -0800 Subject: [Numpy-discussion] Permutations in Simulations` Message-ID: <6DF3F8F869B22C4393D67CA19A35AA0E0195B0BD6A@redmx1.esri.com> Hello All, I want to create an array that contains a column of permutations for each simulation: import numpy as NUM import numpy.random as RAND x = NUM.arange(4.) res = NUM.zeros((4,100)) for sim in range(100): res[:,sim] = RAND.permutation(x) Is there a way to do this without a loop? Thanks so much ahead of time... MJ Mark Janikas Product Engineer ESRI, Geoprocessing 380 New York St. Redlands, CA 92373 909-793-2853 (2563) mjanikas at esri.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Tue Feb 10 14:48:59 2009 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 10 Feb 2009 13:48:59 -0600 Subject: [Numpy-discussion] numpy-izing a loop In-Reply-To: <9457e7c80902100819t139d6342y8885f417a4ec4443@mail.gmail.com> References: <87y6we1lbw.fsf@rudin.co.uk> <9457e7c80902100819t139d6342y8885f417a4ec4443@mail.gmail.com> Message-ID: <3d375d730902101148t10d50084k1a775e681a5e3d05@mail.gmail.com> On Tue, Feb 10, 2009 at 10:19, St?fan van der Walt wrote: > Hi Paul > > 2009/2/10 Paul Rudin : >> >> I've just written this snippet of code: >> >> result = numpy.empty((dim, dim, dim), numpy.bool) >> for x in xrange(dim): >> for y in xrange(dim): >> for z in xrange(dim): >> result[x, y, z] = ((xnear[y, z] < xfar[y, z]) and >> (ynear[x, z] < yfar[x, z]) and >> (znear[x, y] < zfar[x, y])) >> >> >> Is there a way to get numpy to do the looping? > > You can try the following: > > x = np.arange(dim) > y = np.arange(dim) > z = np.arange(dim) > > result = (xnear[y,z] < xfar[y, z]) & (ynear[x, z] < yfar[x, z]) & > (znear[x,y] < zfar[x,y]) > > It broadcasts correctly, but you'll have to verify the results to be sure. It doesn't broadcast correctly. You will get result.shape == (dim,) but he wants (dim,dim,dim). You can get this with the following: x = np.arange(dim)[:,np.newaxis,np.newaxis] y = np.arange(dim)[np.newaxis,:,np.newaxis] z = np.arange(dim)[np.newaxis,np.newaxis,:] -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." 
-- Umberto Eco From kwgoodman at gmail.com Tue Feb 10 15:18:55 2009 From: kwgoodman at gmail.com (Keith Goodman) Date: Tue, 10 Feb 2009 12:18:55 -0800 Subject: [Numpy-discussion] Permutations in Simulations` In-Reply-To: <6DF3F8F869B22C4393D67CA19A35AA0E0195B0BD6A@redmx1.esri.com> References: <6DF3F8F869B22C4393D67CA19A35AA0E0195B0BD6A@redmx1.esri.com> Message-ID: On Tue, Feb 10, 2009 at 11:29 AM, Mark Janikas wrote: > I want to create an array that contains a column of permutations for each > simulation: > > import numpy as NUM > > import numpy.random as RAND > > x = NUM.arange(4.) > > res = NUM.zeros((4,100)) > > > for sim in range(100): > > res[:,sim] = RAND.permutation(x) > > > Is there a way to do this without a loop? Thanks so much ahead of time? Does this work? Might not be faster but it does avoid the loop. import numpy as np def weirdshuffle(nx, ny): x = np.ones((nx,ny)).cumsum(0, dtype=np.int) - 1 yidx = np.ones((nx,ny)).cumsum(1, dtype=np.int) - 1 xidx = np.random.rand(nx,ny).argsort(0).argsort(0) return x[xidx, yidx] From kwgoodman at gmail.com Tue Feb 10 15:28:24 2009 From: kwgoodman at gmail.com (Keith Goodman) Date: Tue, 10 Feb 2009 12:28:24 -0800 Subject: [Numpy-discussion] Permutations in Simulations` In-Reply-To: References: <6DF3F8F869B22C4393D67CA19A35AA0E0195B0BD6A@redmx1.esri.com> Message-ID: On Tue, Feb 10, 2009 at 12:18 PM, Keith Goodman wrote: > On Tue, Feb 10, 2009 at 11:29 AM, Mark Janikas wrote: >> I want to create an array that contains a column of permutations for each >> simulation: >> >> import numpy as NUM >> >> import numpy.random as RAND >> >> x = NUM.arange(4.) >> >> res = NUM.zeros((4,100)) >> >> >> for sim in range(100): >> >> res[:,sim] = RAND.permutation(x) >> >> >> Is there a way to do this without a loop? Thanks so much ahead of time? > > Does this work? Might not be faster but it does avoid the loop. > > import numpy as np > > def weirdshuffle(nx, ny): > x = np.ones((nx,ny)).cumsum(0, dtype=np.int) - 1 > yidx = np.ones((nx,ny)).cumsum(1, dtype=np.int) - 1 > xidx = np.random.rand(nx,ny).argsort(0).argsort(0) > return x[xidx, yidx] Hey, it is faster for nx=4, ny=100 def baseshuffle(nx, ny): x = np.arange(nx) res = np.zeros((nx,ny)) for sim in range(ny): res[:,sim] = np.random.permutation(x) return res >> timeit baseshuffle(4,100) 1000 loops, best of 3: 1.11 ms per loop >> timeit weirdshuffle(4,100) 10000 loops, best of 3: 127 ?s per loop OK, who can cut that time in half? My first try looks clunky. From kwgoodman at gmail.com Tue Feb 10 15:41:45 2009 From: kwgoodman at gmail.com (Keith Goodman) Date: Tue, 10 Feb 2009 12:41:45 -0800 Subject: [Numpy-discussion] Permutations in Simulations` In-Reply-To: References: <6DF3F8F869B22C4393D67CA19A35AA0E0195B0BD6A@redmx1.esri.com> Message-ID: On Tue, Feb 10, 2009 at 12:28 PM, Keith Goodman wrote: > On Tue, Feb 10, 2009 at 12:18 PM, Keith Goodman wrote: >> On Tue, Feb 10, 2009 at 11:29 AM, Mark Janikas wrote: >>> I want to create an array that contains a column of permutations for each >>> simulation: >>> >>> import numpy as NUM >>> >>> import numpy.random as RAND >>> >>> x = NUM.arange(4.) >>> >>> res = NUM.zeros((4,100)) >>> >>> >>> for sim in range(100): >>> >>> res[:,sim] = RAND.permutation(x) >>> >>> >>> Is there a way to do this without a loop? Thanks so much ahead of time? >> >> Does this work? Might not be faster but it does avoid the loop. 
>> >> import numpy as np >> >> def weirdshuffle(nx, ny): >> x = np.ones((nx,ny)).cumsum(0, dtype=np.int) - 1 >> yidx = np.ones((nx,ny)).cumsum(1, dtype=np.int) - 1 >> xidx = np.random.rand(nx,ny).argsort(0).argsort(0) >> return x[xidx, yidx] > > Hey, it is faster for nx=4, ny=100 > > def baseshuffle(nx, ny): > x = np.arange(nx) > res = np.zeros((nx,ny)) > for sim in range(ny): > res[:,sim] = np.random.permutation(x) > return res > >>> timeit baseshuffle(4,100) > 1000 loops, best of 3: 1.11 ms per loop >>> timeit weirdshuffle(4,100) > 10000 loops, best of 3: 127 ?s per loop > > OK, who can cut that time in half? My first try looks clunky. This is a little faster: def weirdshuffle2(nx, ny): one = np.ones((nx,ny), dtype=np.int) x = one.cumsum(0) x -= 1 yidx = one.cumsum(1) yidx -= 1 xidx = np.random.random_sample((nx,ny)).argsort(0).argsort(0) return x[xidx, yidx] >> timeit weirdshuffle(4,100) 10000 loops, best of 3: 129 ?s per loop >> timeit weirdshuffle2(4,100) 10000 loops, best of 3: 106 ?s per loop From kwgoodman at gmail.com Tue Feb 10 15:58:37 2009 From: kwgoodman at gmail.com (Keith Goodman) Date: Tue, 10 Feb 2009 12:58:37 -0800 Subject: [Numpy-discussion] Permutations in Simulations` In-Reply-To: References: <6DF3F8F869B22C4393D67CA19A35AA0E0195B0BD6A@redmx1.esri.com> Message-ID: On Tue, Feb 10, 2009 at 12:41 PM, Keith Goodman wrote: > On Tue, Feb 10, 2009 at 12:28 PM, Keith Goodman wrote: >> On Tue, Feb 10, 2009 at 12:18 PM, Keith Goodman wrote: >>> On Tue, Feb 10, 2009 at 11:29 AM, Mark Janikas wrote: >>>> I want to create an array that contains a column of permutations for each >>>> simulation: >>>> >>>> import numpy as NUM >>>> >>>> import numpy.random as RAND >>>> >>>> x = NUM.arange(4.) >>>> >>>> res = NUM.zeros((4,100)) >>>> >>>> >>>> for sim in range(100): >>>> >>>> res[:,sim] = RAND.permutation(x) >>>> >>>> >>>> Is there a way to do this without a loop? Thanks so much ahead of time? >>> >>> Does this work? Might not be faster but it does avoid the loop. >>> >>> import numpy as np >>> >>> def weirdshuffle(nx, ny): >>> x = np.ones((nx,ny)).cumsum(0, dtype=np.int) - 1 >>> yidx = np.ones((nx,ny)).cumsum(1, dtype=np.int) - 1 >>> xidx = np.random.rand(nx,ny).argsort(0).argsort(0) >>> return x[xidx, yidx] >> >> Hey, it is faster for nx=4, ny=100 >> >> def baseshuffle(nx, ny): >> x = np.arange(nx) >> res = np.zeros((nx,ny)) >> for sim in range(ny): >> res[:,sim] = np.random.permutation(x) >> return res >> >>>> timeit baseshuffle(4,100) >> 1000 loops, best of 3: 1.11 ms per loop >>>> timeit weirdshuffle(4,100) >> 10000 loops, best of 3: 127 ?s per loop >> >> OK, who can cut that time in half? My first try looks clunky. > > This is a little faster: > > def weirdshuffle2(nx, ny): > one = np.ones((nx,ny), dtype=np.int) > x = one.cumsum(0) > x -= 1 > yidx = one.cumsum(1) > yidx -= 1 > xidx = np.random.random_sample((nx,ny)).argsort(0).argsort(0) > return x[xidx, yidx] > >>> timeit weirdshuffle(4,100) > 10000 loops, best of 3: 129 ?s per loop >>> timeit weirdshuffle2(4,100) > 10000 loops, best of 3: 106 ?s per loop Sorry for all the mail. 
def weirdshuffle3(nx, ny):
    return np.random.random_sample((nx,ny)).argsort(0).argsort(0)

>> timeit weirdshuffle(4,100)
10000 loops, best of 3: 128 µs per loop
>> timeit weirdshuffle3(4,100)
10000 loops, best of 3: 37.5 µs per loop

From yakov.keselman at gmail.com Tue Feb 10 16:11:36 2009
From: yakov.keselman at gmail.com (Yakov Keselman)
Date: Tue, 10 Feb 2009 13:11:36 -0800
Subject: [Numpy-discussion] from_function
In-Reply-To:
References:
Message-ID:

Perhaps you can do something along the following lines to get around this limitation:

#################
# parameterizes the original function by delta, size.
def parameterized_function(delta, size, function):
    center = (size-1)/2.0
    return lambda i: function( (i-center)*delta )

# the function to which we want to apply "fromfunction".
def square(x):
    return x*x

# test script
from numpy import fromfunction
Delta = 0.1
Size = 9
print fromfunction( parameterized_function(Delta, Size, square), (Size,) )
###########

[ 0.16 0.09 0.04 0.01 0. 0.01 0.04 0.09 0.16]

On 2/3/09, Neal Becker wrote:
> I've been using something I wrote:
>
> coef_from_function (function, delta, size)
> which does (c++ code):
>
> double center = double(size-1)/2;
> for (int i = 0; i < size; ++i)
> coef[i] = call (func, double(i - center) * delta);
>
> I thought to translate this to np.fromfunction. It seems fromfunction is
> not as flexible, it uses only a fixed integer grid?
>
>
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>

-- Not to laugh, not to lament, not to curse, but to understand. -- Spinoza

From stefan at sun.ac.za Tue Feb 10 16:27:31 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Tue, 10 Feb 2009 23:27:31 +0200
Subject: [Numpy-discussion] numpy-izing a loop
In-Reply-To: <3d375d730902101148t10d50084k1a775e681a5e3d05@mail.gmail.com>
References: <87y6we1lbw.fsf@rudin.co.uk> <9457e7c80902100819t139d6342y8885f417a4ec4443@mail.gmail.com> <3d375d730902101148t10d50084k1a775e681a5e3d05@mail.gmail.com>
Message-ID: <9457e7c80902101327q5cd7cd4ax7e2c1713fcee2cd4@mail.gmail.com>

2009/2/10 Robert Kern :
> x = np.arange(dim)[:,np.newaxis,np.newaxis]
> y = np.arange(dim)[np.newaxis,:,np.newaxis]
> z = np.arange(dim)[np.newaxis,np.newaxis,:]

Yes, sorry, I should have copied from my terminal. I think I had

x = np.arange(dim)
y = np.arange(dim)[:, None]
z = np.arange(dim)[:, None, None]

which broadcasts the same way.

Cheers
Stéfan

From stefan at sun.ac.za Tue Feb 10 16:33:35 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Tue, 10 Feb 2009 23:33:35 +0200
Subject: [Numpy-discussion] numpy-izing a loop
In-Reply-To: <9457e7c80902101327q5cd7cd4ax7e2c1713fcee2cd4@mail.gmail.com>
References: <87y6we1lbw.fsf@rudin.co.uk> <9457e7c80902100819t139d6342y8885f417a4ec4443@mail.gmail.com> <3d375d730902101148t10d50084k1a775e681a5e3d05@mail.gmail.com> <9457e7c80902101327q5cd7cd4ax7e2c1713fcee2cd4@mail.gmail.com>

Message-ID: <9457e7c80902101333l5883971i2b15c6290315bf8d@mail.gmail.com>

2009/2/10 Stéfan van der Walt :
> x = np.arange(dim)
> y = np.arange(dim)[:, None]
> z = np.arange(dim)[:, None, None]

Do not operate heavy machinery or attempt broadcasting while tired or under the influence.
That order was incorrect: > z = np.arange(dim) > y = np.arange(dim)[:, None] > x = np.arange(dim)[:, None, None] Cheers St?fan From markperrymiller at gmail.com Tue Feb 10 16:41:07 2009 From: markperrymiller at gmail.com (Mark Miller) Date: Tue, 10 Feb 2009 13:41:07 -0800 Subject: [Numpy-discussion] Permutations in Simulations` In-Reply-To: References: <6DF3F8F869B22C4393D67CA19A35AA0E0195B0BD6A@redmx1.esri.com> Message-ID: Out of curiosity, why wouldn't numpy.apply_along_axis be a reasonable approach here. Even more curious: why is it slower than the original explicit loop? Learning, -Mark import numpy as np import timeit def baseshuffle(nx, ny): x = np.arange(nx) res = np.zeros((nx,ny),int) for sim in range(ny): res[:,sim] = np.random.permutation(x) return res def axisshuffle(nx,ny): x=np.arange(nx).reshape(nx,1) res=np.zeros((nx,ny),int) res[:] = x x1 = np.apply_along_axis(np.random.permutation,0,res) return x1 nx = 10 ny = 100 t=timeit.Timer("baseshuffle(nx, ny)", "from __main__ import *") print t.timeit(100) t=timeit.Timer("axisshuffle(nx,ny)", "from __main__ import *") print t.timeit(100) >>> 0.111320049056 0.297792159759 >>> On Tue, Feb 10, 2009 at 12:58 PM, Keith Goodman wrote: > On Tue, Feb 10, 2009 at 12:41 PM, Keith Goodman wrote: >> On Tue, Feb 10, 2009 at 12:28 PM, Keith Goodman wrote: >>> On Tue, Feb 10, 2009 at 12:18 PM, Keith Goodman wrote: >>>> On Tue, Feb 10, 2009 at 11:29 AM, Mark Janikas wrote: >>>>> I want to create an array that contains a column of permutations for each >>>>> simulation: >>>>> >>>>> import numpy as NUM >>>>> >>>>> import numpy.random as RAND >>>>> >>>>> x = NUM.arange(4.) >>>>> >>>>> res = NUM.zeros((4,100)) >>>>> >>>>> >>>>> for sim in range(100): >>>>> >>>>> res[:,sim] = RAND.permutation(x) >>>>> >>>>> >>>>> Is there a way to do this without a loop? Thanks so much ahead of time? >>>> >>>> Does this work? Might not be faster but it does avoid the loop. >>>> >>>> import numpy as np >>>> >>>> def weirdshuffle(nx, ny): >>>> x = np.ones((nx,ny)).cumsum(0, dtype=np.int) - 1 >>>> yidx = np.ones((nx,ny)).cumsum(1, dtype=np.int) - 1 >>>> xidx = np.random.rand(nx,ny).argsort(0).argsort(0) >>>> return x[xidx, yidx] >>> >>> Hey, it is faster for nx=4, ny=100 >>> >>> def baseshuffle(nx, ny): >>> x = np.arange(nx) >>> res = np.zeros((nx,ny)) >>> for sim in range(ny): >>> res[:,sim] = np.random.permutation(x) >>> return res >>> >>>>> timeit baseshuffle(4,100) >>> 1000 loops, best of 3: 1.11 ms per loop >>>>> timeit weirdshuffle(4,100) >>> 10000 loops, best of 3: 127 ?s per loop >>> >>> OK, who can cut that time in half? My first try looks clunky. >> >> This is a little faster: >> >> def weirdshuffle2(nx, ny): >> one = np.ones((nx,ny), dtype=np.int) >> x = one.cumsum(0) >> x -= 1 >> yidx = one.cumsum(1) >> yidx -= 1 >> xidx = np.random.random_sample((nx,ny)).argsort(0).argsort(0) >> return x[xidx, yidx] >> >>>> timeit weirdshuffle(4,100) >> 10000 loops, best of 3: 129 ?s per loop >>>> timeit weirdshuffle2(4,100) >> 10000 loops, best of 3: 106 ?s per loop > > Sorry for all the mail. 
> > def weirdshuffle3(nx, ny): > return np.random.random_sample((nx,ny)).argsort(0).argsort(0) > >>> timeit weirdshuffle(4,100) > 10000 loops, best of 3: 128 ?s per loop >>> timeit weirdshuffle3(4,100) > 10000 loops, best of 3: 37.5 ?s per loop > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From kwgoodman at gmail.com Tue Feb 10 16:50:15 2009 From: kwgoodman at gmail.com (Keith Goodman) Date: Tue, 10 Feb 2009 13:50:15 -0800 Subject: [Numpy-discussion] Permutations in Simulations` In-Reply-To: References: <6DF3F8F869B22C4393D67CA19A35AA0E0195B0BD6A@redmx1.esri.com> Message-ID: On Tue, Feb 10, 2009 at 1:41 PM, Mark Miller wrote: > Out of curiosity, why wouldn't numpy.apply_along_axis be a reasonable > approach here. Even more curious: why is it slower than the original > explicit loop? I took a quick look at the apply_along_axis code. It is numpy code (not c) and it uses a while loop to loop over the axis. In ipython just type np.apply_along_axis?? From markperrymiller at gmail.com Tue Feb 10 17:23:19 2009 From: markperrymiller at gmail.com (Mark Miller) Date: Tue, 10 Feb 2009 14:23:19 -0800 Subject: [Numpy-discussion] Permutations in Simulations` In-Reply-To: References: <6DF3F8F869B22C4393D67CA19A35AA0E0195B0BD6A@redmx1.esri.com> Message-ID: Got it. Thanks! On Tue, Feb 10, 2009 at 1:50 PM, Keith Goodman wrote: > On Tue, Feb 10, 2009 at 1:41 PM, Mark Miller wrote: >> Out of curiosity, why wouldn't numpy.apply_along_axis be a reasonable >> approach here. Even more curious: why is it slower than the original >> explicit loop? > > I took a quick look at the apply_along_axis code. It is numpy code > (not c) and it uses a while loop to loop over the axis. > > In ipython just type np.apply_along_axis?? > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From mjanikas at esri.com Tue Feb 10 19:21:12 2009 From: mjanikas at esri.com (Mark Janikas) Date: Tue, 10 Feb 2009 16:21:12 -0800 Subject: [Numpy-discussion] Permutations in Simulations` In-Reply-To: References: <6DF3F8F869B22C4393D67CA19A35AA0E0195B0BD6A@redmx1.esri.com> Message-ID: <6DF3F8F869B22C4393D67CA19A35AA0E0195B0C2AE@redmx1.esri.com> Thanks to all for your replies. I want this to work on any vector so I was thinking this...? import numpy as np import timeit x = np.array([4.,5.,10.,3.,5.,6.,7.,2.,9.,1.]) nx = 10 ny = 100 def weirdshuffle4(x, ny): nx = len(x) indices = np.random.random_sample((nx,ny)).argsort(0).argsort(0) return x[indices] t=timeit.Timer("weirdshuffle4(x,ny)", "from __main__ import *") print t.timeit(100) 0.0148663153873 -----Original Message----- From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Keith Goodman Sent: Tuesday, February 10, 2009 12:59 PM To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] Permutations in Simulations` On Tue, Feb 10, 2009 at 12:41 PM, Keith Goodman wrote: > On Tue, Feb 10, 2009 at 12:28 PM, Keith Goodman wrote: >> On Tue, Feb 10, 2009 at 12:18 PM, Keith Goodman wrote: >>> On Tue, Feb 10, 2009 at 11:29 AM, Mark Janikas wrote: >>>> I want to create an array that contains a column of permutations for each >>>> simulation: >>>> >>>> import numpy as NUM >>>> >>>> import numpy.random as RAND >>>> >>>> x = NUM.arange(4.) 
>>>> >>>> res = NUM.zeros((4,100)) >>>> >>>> >>>> for sim in range(100): >>>> >>>> res[:,sim] = RAND.permutation(x) >>>> >>>> >>>> Is there a way to do this without a loop? Thanks so much ahead of time. >>> >>> Does this work? Might not be faster but it does avoid the loop. >>> >>> import numpy as np >>> >>> def weirdshuffle(nx, ny): >>> x = np.ones((nx,ny)).cumsum(0, dtype=np.int) - 1 >>> yidx = np.ones((nx,ny)).cumsum(1, dtype=np.int) - 1 >>> xidx = np.random.rand(nx,ny).argsort(0).argsort(0) >>> return x[xidx, yidx] >> >> Hey, it is faster for nx=4, ny=100 >> >> def baseshuffle(nx, ny): >> x = np.arange(nx) >> res = np.zeros((nx,ny)) >> for sim in range(ny): >> res[:,sim] = np.random.permutation(x) >> return res >> >>>> timeit baseshuffle(4,100) >> 1000 loops, best of 3: 1.11 ms per loop >>>> timeit weirdshuffle(4,100) >> 10000 loops, best of 3: 127 ?s per loop >> >> OK, who can cut that time in half? My first try looks clunky. > > This is a little faster: > > def weirdshuffle2(nx, ny): > one = np.ones((nx,ny), dtype=np.int) > x = one.cumsum(0) > x -= 1 > yidx = one.cumsum(1) > yidx -= 1 > xidx = np.random.random_sample((nx,ny)).argsort(0).argsort(0) > return x[xidx, yidx] > >>> timeit weirdshuffle(4,100) > 10000 loops, best of 3: 129 ?s per loop >>> timeit weirdshuffle2(4,100) > 10000 loops, best of 3: 106 ?s per loop Sorry for all the mail. def weirdshuffle3(nx, ny): return np.random.random_sample((nx,ny)).argsort(0).argsort(0) >> timeit weirdshuffle(4,100) 10000 loops, best of 3: 128 ?s per loop >> timeit weirdshuffle3(4,100) 10000 loops, best of 3: 37.5 ?s per loop _______________________________________________ Numpy-discussion mailing list Numpy-discussion at scipy.org http://projects.scipy.org/mailman/listinfo/numpy-discussion From josef.pktd at gmail.com Tue Feb 10 20:31:52 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Tue, 10 Feb 2009 20:31:52 -0500 Subject: [Numpy-discussion] Permutations in Simulations` In-Reply-To: <6DF3F8F869B22C4393D67CA19A35AA0E0195B0C2AE@redmx1.esri.com> References: <6DF3F8F869B22C4393D67CA19A35AA0E0195B0BD6A@redmx1.esri.com> <6DF3F8F869B22C4393D67CA19A35AA0E0195B0C2AE@redmx1.esri.com> Message-ID: <1cd32cbb0902101731h21652b91o987728d521113a2d@mail.gmail.com> very nice. What's the purpose of the second `.argsort(0)` ? Doesn't it also work without it, or am I missing something in how this works?> Josef On 2/10/09, Mark Janikas wrote: > Thanks to all for your replies. I want this to work on any vector so I was > thinking this...? 
> > import numpy as np > import timeit > x = np.array([4.,5.,10.,3.,5.,6.,7.,2.,9.,1.]) > nx = 10 > ny = 100 > > def weirdshuffle4(x, ny): > nx = len(x) > indices = np.random.random_sample((nx,ny)).argsort(0).argsort(0) > return x[indices] > > t=timeit.Timer("weirdshuffle4(x,ny)", "from __main__ import *") > print t.timeit(100) > > 0.0148663153873 > > > -----Original Message----- > From: numpy-discussion-bounces at scipy.org > [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Keith Goodman > Sent: Tuesday, February 10, 2009 12:59 PM > To: Discussion of Numerical Python > Subject: Re: [Numpy-discussion] Permutations in Simulations` > > On Tue, Feb 10, 2009 at 12:41 PM, Keith Goodman wrote: >> On Tue, Feb 10, 2009 at 12:28 PM, Keith Goodman >> wrote: >>> On Tue, Feb 10, 2009 at 12:18 PM, Keith Goodman >>> wrote: >>>> On Tue, Feb 10, 2009 at 11:29 AM, Mark Janikas >>>> wrote: >>>>> I want to create an array that contains a column of permutations for >>>>> each >>>>> simulation: >>>>> >>>>> import numpy as NUM >>>>> >>>>> import numpy.random as RAND >>>>> >>>>> x = NUM.arange(4.) >>>>> >>>>> res = NUM.zeros((4,100)) >>>>> >>>>> >>>>> for sim in range(100): >>>>> >>>>> res[:,sim] = RAND.permutation(x) >>>>> >>>>> >>>>> Is there a way to do this without a loop? Thanks so much ahead of >>>>> time. >>>> >>>> Does this work? Might not be faster but it does avoid the loop. >>>> >>>> import numpy as np >>>> >>>> def weirdshuffle(nx, ny): >>>> x = np.ones((nx,ny)).cumsum(0, dtype=np.int) - 1 >>>> yidx = np.ones((nx,ny)).cumsum(1, dtype=np.int) - 1 >>>> xidx = np.random.rand(nx,ny).argsort(0).argsort(0) >>>> return x[xidx, yidx] >>> >>> Hey, it is faster for nx=4, ny=100 >>> >>> def baseshuffle(nx, ny): >>> x = np.arange(nx) >>> res = np.zeros((nx,ny)) >>> for sim in range(ny): >>> res[:,sim] = np.random.permutation(x) >>> return res >>> >>>>> timeit baseshuffle(4,100) >>> 1000 loops, best of 3: 1.11 ms per loop >>>>> timeit weirdshuffle(4,100) >>> 10000 loops, best of 3: 127 ?s per loop >>> >>> OK, who can cut that time in half? My first try looks clunky. >> >> This is a little faster: >> >> def weirdshuffle2(nx, ny): >> one = np.ones((nx,ny), dtype=np.int) >> x = one.cumsum(0) >> x -= 1 >> yidx = one.cumsum(1) >> yidx -= 1 >> xidx = np.random.random_sample((nx,ny)).argsort(0).argsort(0) >> return x[xidx, yidx] >> >>>> timeit weirdshuffle(4,100) >> 10000 loops, best of 3: 129 ?s per loop >>>> timeit weirdshuffle2(4,100) >> 10000 loops, best of 3: 106 ?s per loop > > Sorry for all the mail. 
> > def weirdshuffle3(nx, ny): > return np.random.random_sample((nx,ny)).argsort(0).argsort(0) > >>> timeit weirdshuffle(4,100) > 10000 loops, best of 3: 128 ?s per loop >>> timeit weirdshuffle3(4,100) > 10000 loops, best of 3: 37.5 ?s per loop > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From kwgoodman at gmail.com Tue Feb 10 21:06:58 2009 From: kwgoodman at gmail.com (Keith Goodman) Date: Tue, 10 Feb 2009 18:06:58 -0800 Subject: [Numpy-discussion] Permutations in Simulations` In-Reply-To: <1cd32cbb0902101731h21652b91o987728d521113a2d@mail.gmail.com> References: <6DF3F8F869B22C4393D67CA19A35AA0E0195B0BD6A@redmx1.esri.com> <6DF3F8F869B22C4393D67CA19A35AA0E0195B0C2AE@redmx1.esri.com> <1cd32cbb0902101731h21652b91o987728d521113a2d@mail.gmail.com> Message-ID: Yeah, good point. The second argsort isn't needed. That should speed things up. The double argsort ranks the values in the array. But we don't need that here. On Tue, Feb 10, 2009 at 5:31 PM, wrote: > very nice. What's the purpose of the second `.argsort(0)` ? Doesn't > it also work without it, or am I missing something in how this works?> > > Josef > > On 2/10/09, Mark Janikas wrote: >> Thanks to all for your replies. I want this to work on any vector so I was >> thinking this...? >> >> import numpy as np >> import timeit >> x = np.array([4.,5.,10.,3.,5.,6.,7.,2.,9.,1.]) >> nx = 10 >> ny = 100 >> >> def weirdshuffle4(x, ny): >> nx = len(x) >> indices = np.random.random_sample((nx,ny)).argsort(0).argsort(0) >> return x[indices] >> >> t=timeit.Timer("weirdshuffle4(x,ny)", "from __main__ import *") >> print t.timeit(100) >> >> 0.0148663153873 >> >> >> -----Original Message----- >> From: numpy-discussion-bounces at scipy.org >> [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Keith Goodman >> Sent: Tuesday, February 10, 2009 12:59 PM >> To: Discussion of Numerical Python >> Subject: Re: [Numpy-discussion] Permutations in Simulations` >> >> On Tue, Feb 10, 2009 at 12:41 PM, Keith Goodman wrote: >>> On Tue, Feb 10, 2009 at 12:28 PM, Keith Goodman >>> wrote: >>>> On Tue, Feb 10, 2009 at 12:18 PM, Keith Goodman >>>> wrote: >>>>> On Tue, Feb 10, 2009 at 11:29 AM, Mark Janikas >>>>> wrote: >>>>>> I want to create an array that contains a column of permutations for >>>>>> each >>>>>> simulation: >>>>>> >>>>>> import numpy as NUM >>>>>> >>>>>> import numpy.random as RAND >>>>>> >>>>>> x = NUM.arange(4.) >>>>>> >>>>>> res = NUM.zeros((4,100)) >>>>>> >>>>>> >>>>>> for sim in range(100): >>>>>> >>>>>> res[:,sim] = RAND.permutation(x) >>>>>> >>>>>> >>>>>> Is there a way to do this without a loop? Thanks so much ahead of >>>>>> time. >>>>> >>>>> Does this work? Might not be faster but it does avoid the loop. 
>>>>> >>>>> import numpy as np >>>>> >>>>> def weirdshuffle(nx, ny): >>>>> x = np.ones((nx,ny)).cumsum(0, dtype=np.int) - 1 >>>>> yidx = np.ones((nx,ny)).cumsum(1, dtype=np.int) - 1 >>>>> xidx = np.random.rand(nx,ny).argsort(0).argsort(0) >>>>> return x[xidx, yidx] >>>> >>>> Hey, it is faster for nx=4, ny=100 >>>> >>>> def baseshuffle(nx, ny): >>>> x = np.arange(nx) >>>> res = np.zeros((nx,ny)) >>>> for sim in range(ny): >>>> res[:,sim] = np.random.permutation(x) >>>> return res >>>> >>>>>> timeit baseshuffle(4,100) >>>> 1000 loops, best of 3: 1.11 ms per loop >>>>>> timeit weirdshuffle(4,100) >>>> 10000 loops, best of 3: 127 ?s per loop >>>> >>>> OK, who can cut that time in half? My first try looks clunky. >>> >>> This is a little faster: >>> >>> def weirdshuffle2(nx, ny): >>> one = np.ones((nx,ny), dtype=np.int) >>> x = one.cumsum(0) >>> x -= 1 >>> yidx = one.cumsum(1) >>> yidx -= 1 >>> xidx = np.random.random_sample((nx,ny)).argsort(0).argsort(0) >>> return x[xidx, yidx] >>> >>>>> timeit weirdshuffle(4,100) >>> 10000 loops, best of 3: 129 ?s per loop >>>>> timeit weirdshuffle2(4,100) >>> 10000 loops, best of 3: 106 ?s per loop >> >> Sorry for all the mail. >> >> def weirdshuffle3(nx, ny): >> return np.random.random_sample((nx,ny)).argsort(0).argsort(0) >> >>>> timeit weirdshuffle(4,100) >> 10000 loops, best of 3: 128 ?s per loop >>>> timeit weirdshuffle3(4,100) >> 10000 loops, best of 3: 37.5 ?s per loop >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion >> >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From mjanikas at esri.com Tue Feb 10 21:21:49 2009 From: mjanikas at esri.com (Mark Janikas) Date: Tue, 10 Feb 2009 18:21:49 -0800 Subject: [Numpy-discussion] Permutations in Simulations` In-Reply-To: References: <6DF3F8F869B22C4393D67CA19A35AA0E0195B0BD6A@redmx1.esri.com> <6DF3F8F869B22C4393D67CA19A35AA0E0195B0C2AE@redmx1.esri.com> <1cd32cbb0902101731h21652b91o987728d521113a2d@mail.gmail.com> Message-ID: <6DF3F8F869B22C4393D67CA19A35AA0E0195B0C3BD@redmx1.esri.com> You are correct! Thanks to all! MJ -----Original Message----- From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Keith Goodman Sent: Tuesday, February 10, 2009 6:07 PM To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] Permutations in Simulations` Yeah, good point. The second argsort isn't needed. That should speed things up. The double argsort ranks the values in the array. But we don't need that here. On Tue, Feb 10, 2009 at 5:31 PM, wrote: > very nice. What's the purpose of the second `.argsort(0)` ? Doesn't > it also work without it, or am I missing something in how this works?> > > Josef > > On 2/10/09, Mark Janikas wrote: >> Thanks to all for your replies. I want this to work on any vector so I was >> thinking this...? 
>> >> import numpy as np >> import timeit >> x = np.array([4.,5.,10.,3.,5.,6.,7.,2.,9.,1.]) >> nx = 10 >> ny = 100 >> >> def weirdshuffle4(x, ny): >> nx = len(x) >> indices = np.random.random_sample((nx,ny)).argsort(0).argsort(0) >> return x[indices] >> >> t=timeit.Timer("weirdshuffle4(x,ny)", "from __main__ import *") >> print t.timeit(100) >> >> 0.0148663153873 >> >> >> -----Original Message----- >> From: numpy-discussion-bounces at scipy.org >> [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Keith Goodman >> Sent: Tuesday, February 10, 2009 12:59 PM >> To: Discussion of Numerical Python >> Subject: Re: [Numpy-discussion] Permutations in Simulations` >> >> On Tue, Feb 10, 2009 at 12:41 PM, Keith Goodman wrote: >>> On Tue, Feb 10, 2009 at 12:28 PM, Keith Goodman >>> wrote: >>>> On Tue, Feb 10, 2009 at 12:18 PM, Keith Goodman >>>> wrote: >>>>> On Tue, Feb 10, 2009 at 11:29 AM, Mark Janikas >>>>> wrote: >>>>>> I want to create an array that contains a column of permutations for >>>>>> each >>>>>> simulation: >>>>>> >>>>>> import numpy as NUM >>>>>> >>>>>> import numpy.random as RAND >>>>>> >>>>>> x = NUM.arange(4.) >>>>>> >>>>>> res = NUM.zeros((4,100)) >>>>>> >>>>>> >>>>>> for sim in range(100): >>>>>> >>>>>> res[:,sim] = RAND.permutation(x) >>>>>> >>>>>> >>>>>> Is there a way to do this without a loop? Thanks so much ahead of >>>>>> time. >>>>> >>>>> Does this work? Might not be faster but it does avoid the loop. >>>>> >>>>> import numpy as np >>>>> >>>>> def weirdshuffle(nx, ny): >>>>> x = np.ones((nx,ny)).cumsum(0, dtype=np.int) - 1 >>>>> yidx = np.ones((nx,ny)).cumsum(1, dtype=np.int) - 1 >>>>> xidx = np.random.rand(nx,ny).argsort(0).argsort(0) >>>>> return x[xidx, yidx] >>>> >>>> Hey, it is faster for nx=4, ny=100 >>>> >>>> def baseshuffle(nx, ny): >>>> x = np.arange(nx) >>>> res = np.zeros((nx,ny)) >>>> for sim in range(ny): >>>> res[:,sim] = np.random.permutation(x) >>>> return res >>>> >>>>>> timeit baseshuffle(4,100) >>>> 1000 loops, best of 3: 1.11 ms per loop >>>>>> timeit weirdshuffle(4,100) >>>> 10000 loops, best of 3: 127 ?s per loop >>>> >>>> OK, who can cut that time in half? My first try looks clunky. >>> >>> This is a little faster: >>> >>> def weirdshuffle2(nx, ny): >>> one = np.ones((nx,ny), dtype=np.int) >>> x = one.cumsum(0) >>> x -= 1 >>> yidx = one.cumsum(1) >>> yidx -= 1 >>> xidx = np.random.random_sample((nx,ny)).argsort(0).argsort(0) >>> return x[xidx, yidx] >>> >>>>> timeit weirdshuffle(4,100) >>> 10000 loops, best of 3: 129 ?s per loop >>>>> timeit weirdshuffle2(4,100) >>> 10000 loops, best of 3: 106 ?s per loop >> >> Sorry for all the mail. 
>> >> def weirdshuffle3(nx, ny): >> return np.random.random_sample((nx,ny)).argsort(0).argsort(0) >> >>>> timeit weirdshuffle(4,100) >> 10000 loops, best of 3: 128 ?s per loop >>>> timeit weirdshuffle3(4,100) >> 10000 loops, best of 3: 37.5 ?s per loop >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion >> >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > _______________________________________________ Numpy-discussion mailing list Numpy-discussion at scipy.org http://projects.scipy.org/mailman/listinfo/numpy-discussion From python6009 at gmail.com Wed Feb 11 00:40:35 2009 From: python6009 at gmail.com (A B) Date: Tue, 10 Feb 2009 21:40:35 -0800 Subject: [Numpy-discussion] loadtxt issues Message-ID: <7cc4bc500902102140h6cc85bb3wa28c89a147ddce73@mail.gmail.com> Hi, How do I write a loadtxt command to read in the following file and store each data point as the appropriate data type: 12|h|34.5|44.5 14552|bbb|34.5|42.5 Do the strings have to be read in separately from the numbers? Why would anyone use 'S10' instead of 'string'? dt = {'names': ('gender','age','weight','bal'), 'formats': ('i4', 'S4','f4', 'f4')} a = loadtxt("sample_data.txt", dtype=dt) gives ValueError: need more than 1 value to unpack I can do a = loadtxt("sample_data.txt", dtype="string") but can't use 'string' instead of S4 and all my data is read into strings. Seems like all the examples on-line use either numeric or textual input, but not both. Thanks. From bpederse at gmail.com Wed Feb 11 00:52:41 2009 From: bpederse at gmail.com (Brent Pedersen) Date: Tue, 10 Feb 2009 21:52:41 -0800 Subject: [Numpy-discussion] loadtxt issues In-Reply-To: <7cc4bc500902102140h6cc85bb3wa28c89a147ddce73@mail.gmail.com> References: <7cc4bc500902102140h6cc85bb3wa28c89a147ddce73@mail.gmail.com> Message-ID: On Tue, Feb 10, 2009 at 9:40 PM, A B wrote: > Hi, > > How do I write a loadtxt command to read in the following file and > store each data point as the appropriate data type: > > 12|h|34.5|44.5 > 14552|bbb|34.5|42.5 > > Do the strings have to be read in separately from the numbers? > > Why would anyone use 'S10' instead of 'string'? > > dt = {'names': ('gender','age','weight','bal'), 'formats': ('i4', > 'S4','f4', 'f4')} > > a = loadtxt("sample_data.txt", dtype=dt) > > gives > > ValueError: need more than 1 value to unpack > > I can do a = loadtxt("sample_data.txt", dtype="string") but can't use > 'string' instead of S4 and all my data is read into strings. > > Seems like all the examples on-line use either numeric or textual > input, but not both. > > Thanks. > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > works for me but not sure i understand the problem, did you try setting the delimiter? 
import numpy as np from cStringIO import StringIO txt = StringIO("""\ 12|h|34.5|44.5 14552|bbb|34.5|42.5""") dt = {'names': ('gender','age','weight','bal'), 'formats': ('i4', 'S4','f4', 'f4')} a = np.loadtxt(txt, dtype=dt, delimiter="|") print a.dtype From millman at berkeley.edu Wed Feb 11 03:25:37 2009 From: millman at berkeley.edu (Jarrod Millman) Date: Wed, 11 Feb 2009 00:25:37 -0800 Subject: [Numpy-discussion] ANN: SciPy 0.7.0 Message-ID: I'm pleased to announce SciPy 0.7.0. SciPy is a package of tools for science and engineering for Python. It includes modules for statistics, optimization, integration, linear algebra, Fourier transforms, signal and image processing, ODE solvers, and more. This release comes sixteen months after the 0.6.0 release and contains many new features, numerous bug-fixes, improved test coverage, and better documentation. Please note that SciPy 0.7.0 requires Python 2.4 or greater (but not Python 3) and NumPy 1.2.0 or greater. For information, please see the release notes: https://sourceforge.net/project/shownotes.php?release_id=660191&group_id=27747 You can download the release from here: https://sourceforge.net/project/showfiles.php?group_id=27747&package_id=19531&release_id=660191 Thank you to everybody who contributed to this release. Enjoy, Jarrod Millman From paul at rudin.co.uk Wed Feb 11 03:25:40 2009 From: paul at rudin.co.uk (Paul Rudin) Date: Wed, 11 Feb 2009 08:25:40 +0000 Subject: [Numpy-discussion] numpy-izing a loop References: <87y6we1lbw.fsf@rudin.co.uk> <9457e7c80902100819t139d6342y8885f417a4ec4443@mail.gmail.com> <3d375d730902101148t10d50084k1a775e681a5e3d05@mail.gmail.com> <9457e7c80902101327q5cd7cd4ax7e2c1713fcee2cd4@mail.gmail.com> <9457e7c80902101333l5883971i2b15c6290315bf8d@mail.gmail.com> Message-ID: <87skmltm63.fsf@rudin.co.uk> St?fan van der Walt writes: > 2009/2/10 St?fan van der Walt : >> x = np.arange(dim) >> y = np.arange(dim)[:, None] >> z = np.arange(dim)[:, None, None] > > Do not operate heavy machinery or attempt broadcasting while tired or > under the influence. That order was incorrect: > >> z = np.arange(dim) >> y = np.arange(dim)[:, None] >> x = np.arange(dim)[:, None, None] Thanks to you both. I can confirm that the two functions below give the same result - as far as my comprehensive testing of one example shows :) def compute_voxels(depth_buffers): assert len(depth_buffers) == 6 dim = depth_buffers[0].shape[0] znear, zfar, ynear, yfar, xnear, xfar = depth_buffers result = numpy.empty((dim, dim, dim), numpy.bool) for x in xrange(dim): for y in xrange(dim): for z in xrange(dim): result[x, y, z] = ((xnear[y, z] < xfar[y, z]) and (ynear[x, z] < yfar[x, z]) and (znear[x, y] < zfar[x, y])) return result def compute_voxels2(depth_buffers): dim = depth_buffers[0].shape[0] znear, zfar, ynear, yfar, xnear, xfar = depth_buffers z = numpy.arange(dim) y = numpy.arange(dim)[:, None] x = numpy.arange(dim)[:, None, None] return ((xnear[y,z] < xfar[y, z]) & (ynear[x, z] < yfar[x, z]) & (znear[x,y] < zfar[x,y])) All that remains is for me to verify that it's actually the result I want :) (Well - I also need to learn to spot these things for myself, but I guess the intuition comes with practice.) 
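(A quick check of the shape bookkeeping behind that broadcasting trick, for anyone following the thread -- this snippet is illustrative only, and dim = 3 is an arbitrary choice, not a value from the code above:)

import numpy as np

dim = 3
z = np.arange(dim)                   # shape (3,)
y = np.arange(dim)[:, None]          # shape (3, 1)
x = np.arange(dim)[:, None, None]    # shape (3, 1, 1)

# Broadcasting aligns trailing axes, so combining the three index arrays
# spans the full (dim, dim, dim) grid -- the same points the triple
# for-loop in compute_voxels visits one at a time.
print (x + y + z).shape              # -> (3, 3, 3)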
From lfriedri at imtek.de Wed Feb 11 03:31:57 2009 From: lfriedri at imtek.de (Lars Friedrich) Date: Wed, 11 Feb 2009 09:31:57 +0100 Subject: [Numpy-discussion] PEP: named axis Message-ID: <49928CFD.9020000@imtek.de> Hello list, I am not sure if I understood everything of the discussion on the named-axis-idea of numpy-arrays, since I am only a *user* of numpy. I never subclassed the numpy-array-class ;-) However, I have the need to store meta-information for my arrays. I do this with a stand-alone class with the name 'Wave' that stores its data in an n-dimensional numpy-array as a member. The meta-information I store (using dicts and lists) is * coordinateLabel per axis * x0 per axis * dx per axis This concept is taken from the data structures in the commercial software IGOR, which are also called 'Waves'. An example would be an image I took with a microscope. The data would be 2d, say shape = (640, 480), holding the intensity information per pixel. x0 could then be [-1e-6, -2e-6] and dx [100e-9, 100e-9], meaning that the image's pixel index [0,0] corresponds to a position of -1 micrometer/-2 micrometer and the pixels have a spacing of 100 nanometers. coordinateLabels would be ['x(m)', 'y(m)']. If I have a movie, the data would be 3d with x0 = [-1e-6, -2e-6, 0], dx = [100e-9, 100e-9, 100e-3] and coordinateLabels = ['x(m)', 'y(m)', 't(s)'] for a frame rate of 10 fps. What I would like to say with this is the following (as a user...): * Meta-information is often necessary * A string-label per axis is often not enough. Scaling is also important * I like the idea of a most-basic-as-possible numpy-array. In my opinion, the meta-data-management should be done by another (sub-?) class. This way, numpy-arrays are simple enough for new users (as I was roughly two years ago...). I would be very interested in a class that *uses* numpy-arrays to provide a data structure for physical data with coordinate labels and scaling. Regards, Lars Friedrich -- Dipl.-Ing. Lars Friedrich Bio- and Nano-Photonics Department of Microsystems Engineering -- IMTEK University of Freiburg Georges-Köhler-Allee 102 D-79110 Freiburg Germany phone: +49-761-203-7531 fax: +49-761-203-7537 room: 01 088 email: lfriedri at imtek.de From cournape at gmail.com Wed Feb 11 07:46:05 2009 From: cournape at gmail.com (David Cournapeau) Date: Wed, 11 Feb 2009 21:46:05 +0900 Subject: [Numpy-discussion] Numpy 1.2.1 and Scipy 0.7.0; Ubuntu packages Message-ID: <5b8d13220902110446x82e25a9ifb11bb563e468313@mail.gmail.com> Hi, I started to set up a PPA for scipy on launchpad, which enables to build ubuntu packages for various distributions/architectures. The link is there:
The > link is there: > > https://edge.launchpad.net/~scipy/+archive/ppa > > So you just need to add one line to your /etc/apt/sources.list, and > you will get uptodate numpy and scipy packages, I forgot to mention that those packages closely follow the official packages: once a distribution update their package, it will automatically supercede the one above. IOW, it can be seen as a preview of the packages to come in Ubuntu/Debian, David From faltet at pytables.org Wed Feb 11 08:20:49 2009 From: faltet at pytables.org (Francesc Alted) Date: Wed, 11 Feb 2009 14:20:49 +0100 Subject: [Numpy-discussion] ANN: Numexpr 1.2 released Message-ID: <200902111420.49579.faltet@pytables.org> ======================== Announcing Numexpr 1.2 ======================== Numexpr is a fast numerical expression evaluator for NumPy. With it, expressions that operate on arrays (like "3*a+4*b") are accelerated and use less memory than doing the same calculation in Python. The main feature added in this version is the support of the Intel VML library (many thanks to Gregor Thalhammer for his nice work on this!). In addition, when the VML support is on, several processors can be used in parallel (see the new `set_vml_num_threads()` function). When the VML support is on, the computation of transcendental functions (like trigonometrical, exponential, logarithmic, hyperbolic, power...) can be accelerated quite a few. Typical speed-ups when using one single core for contiguous arrays are around 3x, with peaks of 7.5x (for the pow() function). When using 2 cores the speed-ups are around 4x and 14x respectively. In case you want to know more in detail what has changed in this version, have a look at the release notes: http://code.google.com/p/numexpr/wiki/ReleaseNotes Where I can find Numexpr? ========================= The project is hosted at Google code in: http://code.google.com/p/numexpr/ And you can get the packages from PyPI as well: http://pypi.python.org/pypi How it works? ============= See: http://code.google.com/p/numexpr/wiki/Overview for a detailed description of the package. Share your experience ===================== Let us know of any bugs, suggestions, gripes, kudos, etc. you may have. Enjoy! -- Francesc Alted From wesmckinn at gmail.com Wed Feb 11 10:10:44 2009 From: wesmckinn at gmail.com (Wes McKinney) Date: Wed, 11 Feb 2009 10:10:44 -0500 Subject: [Numpy-discussion] PyArray_SETITEM with object arrays in Cython Message-ID: <6c476c8a0902110710q4c9b16e4vcc3e315f64a84caf@mail.gmail.com> Hello, I am writing some Cython code and have noted that the buffer interface offers very little speedup for PyObject arrays. In trying to rewrite the same code using the C API in Cython, I find I can't get PyArray_SETITEM to work, in a call like: PyArray_SETITEM(result, iterresult.dataptr, obj) where result is an ndarray of dtype object, and obj is a PyObject*. Anyone have some experience with this can offer pointers (no pun intended!)? Thanks, Wes -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From hanni.ali at gmail.com Wed Feb 11 12:24:01 2009 From: hanni.ali at gmail.com (Hanni Ali) Date: Wed, 11 Feb 2009 17:24:01 +0000 Subject: [Numpy-discussion] PyArray_SETITEM with object arrays in Cython In-Reply-To: <6c476c8a0902110710q4c9b16e4vcc3e315f64a84caf@mail.gmail.com> References: <6c476c8a0902110710q4c9b16e4vcc3e315f64a84caf@mail.gmail.com> Message-ID: <789d27b10902110924j52d47552p2fb38c83e05ebc4d@mail.gmail.com> Hi Wes, I do not profess to be an expert, but I have been offloading a fair number of loops to C from Python code and achieved significant improvements; most have been of the following form (which I have found to be the fastest): size = *incomingArrayObj->dimensions; r_dptr = PyArray_DATA(resultArray); while(size--) { *r_dptr = result; r_dptr++; } Where for multidimensional arrays r_dptr could be incremented by the number of dims rather than just ++: dims = PyArray_DIM(incomingArrayObj,1); I have not, however, actually used PyArray_SETITEM, so I cannot comment on the issue you are having. Hanni 2009/2/11 Wes McKinney > Hello, > > I am writing some Cython code and have noted that the buffer interface > offers very little speedup for PyObject arrays. In trying to rewrite the > same code using the C API in Cython, I find I can't get PyArray_SETITEM to > work, in a call like: > > PyArray_SETITEM(result, iterresult.dataptr, obj) > > where result is an ndarray of dtype object, and obj is a PyObject*. > > Anyone have some experience with this can offer pointers (no pun > intended!)? > > Thanks, > Wes > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From dagss at student.matnat.uio.no Wed Feb 11 15:25:09 2009 From: dagss at student.matnat.uio.no (Dag Sverre Seljebotn) Date: Wed, 11 Feb 2009 21:25:09 +0100 (CET) Subject: [Numpy-discussion] PyArray_SETITEM with object arrays in Cython In-Reply-To: <6c476c8a0902110710q4c9b16e4vcc3e315f64a84caf@mail.gmail.com> References: <6c476c8a0902110710q4c9b16e4vcc3e315f64a84caf@mail.gmail.com> Message-ID: <76550d7a4d21a5fbd6764825cc547ae1.squirrel@webmail.uio.no> Wes McKinney wrote: > I am writing some Cython code and have noted that the buffer interface > offers very little speedup for PyObject arrays. In trying to rewrite the > same code using the C API in Cython, I find I can't get PyArray_SETITEM > to > work, in a call like: > > PyArray_SETITEM(result, iterresult.dataptr, obj) > > where result is an ndarray of dtype object, and obj is a PyObject*. Interesting. Whatever you end up doing, I'll make sure to integrate whatever works faster into Cython. I do doubt your results a bit though -- the buffer interface in Cython increfs/decrefs the objects, but otherwise it should be completely raw access, so using SETITEM shouldn't be faster except one INCREF/DECREF per object (i.e. still way faster than using Python). Could you perhaps post your Cython code?
Dag Sverre From wesmckinn at gmail.com Wed Feb 11 17:12:01 2009 From: wesmckinn at gmail.com (Wes McKinney) Date: Wed, 11 Feb 2009 17:12:01 -0500 Subject: [Numpy-discussion] PyArray_SETITEM with object arrays in Cython In-Reply-To: <76550d7a4d21a5fbd6764825cc547ae1.squirrel@webmail.uio.no> References: <6c476c8a0902110710q4c9b16e4vcc3e315f64a84caf@mail.gmail.com> <76550d7a4d21a5fbd6764825cc547ae1.squirrel@webmail.uio.no> Message-ID: <6c476c8a0902111412y2cb6985x973ff830d1a1412@mail.gmail.com> I actually got it to work-- the function prototype in the pxi file was wrong, needed to be: int PyArray_SETITEM(object obj, void* itemptr, object item) This still doesn't explain why the buffer interface was slow. The general problem here is an indexed array (by dates or strings, for example), that you want to conform to a new index. The arrays most of the time contain floats but occasionally PyObjects. For some reason the access and assignment is slow (this function can be faster by a factor of 50 with C API macros, so clearly something is awry)-- let me know if you see anything obviously wrong with this def reindexObject(ndarray[object, ndim=1] index, ndarray[object, ndim=1] arr, dict idxMap): ''' Using the provided new index, a given array, and a mapping of index-value correpondences in the value array, return a new ndarray conforming to the new index. ''' cdef object idx, value cdef int length = index.shape[0] cdef ndarray[object, ndim = 1] result = np.empty(length, dtype=object) cdef int i = 0 for i from 0 <= i < length: idx = index[i] if not PyDict_Contains(idxMap, idx): result[i] = None continue value = arr[idxMap[idx]] result[i] = value return result On Wed, Feb 11, 2009 at 3:25 PM, Dag Sverre Seljebotn < dagss at student.matnat.uio.no> wrote: > Wes McKinney wrote: > > I am writing some Cython code and have noted that the buffer interface > > offers very little speedup for PyObject arrays. In trying to rewrite the > > same code using the C API in Cython, I find I can't get PyArray_SETITEM > to > > work, in a call like: > > > > PyArray_SETITEM(result, iterresult.dataptr, obj) > > > > where result is an ndarray of dtype object, and obj is a PyObject*. > > Interesting. Whatever you end up doing, I'll make sure to integrate > whatever works faster into Cython. > > I do doubt your results a bit though -- the buffer interface in Cython > increfs/decrefs the objects, but otherwise it should be completely raw > access, so using SETITEM shouldn't be faster except one INCREF/DECREF per > object (i.e. still way faster than using Python). > > Could you perhaps post your Cython code? > > Dag Sverre > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From a.h.jaffe at gmail.com Wed Feb 11 17:21:30 2009 From: a.h.jaffe at gmail.com (Andrew Jaffe) Date: Wed, 11 Feb 2009 22:21:30 +0000 Subject: [Numpy-discussion] PEP: named axis In-Reply-To: <3d375d730902061330r29c19ce7sb44d03e8b79616fa@mail.gmail.com> References: <498B7181.5000300@enthought.com> <20090205231618.GA21014@phare.normalesup.org> <498BB9F2.5080501@enthought.com> <3d375d730902052029l21f675d1leecb0d405db5dc23@mail.gmail.com> <9457e7c80902060122o365cbc28mf5a0bc22d5bc6f6d@mail.gmail.com> <3d375d730902061330r29c19ce7sb44d03e8b79616fa@mail.gmail.com> Message-ID: <49934F6A.6070309@gmail.com> Robert Kern wrote: > On Fri, Feb 6, 2009 at 03:22, St?fan van der Walt wrote: >> Hi Robert >> >> 2009/2/6 Robert Kern : >>>> This could be implemented but would require adding information to the >>>> NumPy array. >>> More than that, though. Every function and method that takes an axis >>> or reduces an axis will need to be rewritten. For that reason, I'm -1 >>> on the proposal. >> Are you -1 on the array dictionary, or on using it to do axis mapping? > > I'm -1 on rewriting every axis= argument to accept strings. Maybe I misunderstand the proposal, but, actually, I think this is completely the wrong semantics for "axis=" anyway. "axis=" in numpy refers to what is also a dimension, not a column. More generally, then, would we restrict this to labeling only the column dimension, or could it be used for any dimension? From pav at iki.fi Wed Feb 11 17:26:27 2009 From: pav at iki.fi (Pauli Virtanen) Date: Wed, 11 Feb 2009 22:26:27 +0000 (UTC) Subject: [Numpy-discussion] PEP: named axis References: <498B7181.5000300@enthought.com> <20090205231618.GA21014@phare.normalesup.org> <498BB9F2.5080501@enthought.com> <3d375d730902052029l21f675d1leecb0d405db5dc23@mail.gmail.com> <9457e7c80902060122o365cbc28mf5a0bc22d5bc6f6d@mail.gmail.com> <3d375d730902061330r29c19ce7sb44d03e8b79616fa@mail.gmail.com> <49934F6A.6070309@gmail.com> Message-ID: Wed, 11 Feb 2009 22:21:30 +0000, Andrew Jaffe wrote: [clip] > Maybe I misunderstand the proposal, but, actually, I think this is > completely the wrong semantics for "axis=" anyway. "axis=" in numpy > refers to what is also a dimension, not a column. I think the proposal was to add the ability to refer to dimensions with names instead of numbers. This is separate from referring to entries in a dimension. (Addressing 'columns' by name is already provided by structured arrays.) -- Pauli Virtanen From a.h.jaffe at gmail.com Wed Feb 11 17:37:42 2009 From: a.h.jaffe at gmail.com (Andrew Jaffe) Date: Wed, 11 Feb 2009 22:37:42 +0000 Subject: [Numpy-discussion] PEP: named axis In-Reply-To: References: <498B7181.5000300@enthought.com> <20090205231618.GA21014@phare.normalesup.org> <498BB9F2.5080501@enthought.com> <3d375d730902052029l21f675d1leecb0d405db5dc23@mail.gmail.com> <9457e7c80902060122o365cbc28mf5a0bc22d5bc6f6d@mail.gmail.com> <3d375d730902061330r29c19ce7sb44d03e8b79616fa@mail.gmail.com> <49934F6A.6070309@gmail.com> Message-ID: <49935336.3080308@gmail.com> Pauli Virtanen wrote: > Wed, 11 Feb 2009 22:21:30 +0000, Andrew Jaffe wrote: > [clip] >> Maybe I misunderstand the proposal, but, actually, I think this is >> completely the wrong semantics for "axis=" anyway. "axis=" in numpy >> refers to what is also a dimension, not a column. > > I think the proposal was to add the ability to refer to dimensions with > names instead of numbers. This is separate from referring to entries in a > dimension. (Addressing 'columns' by name is already provided by > structured arrays.) 
> My bad -- I completely misread the proposal! Nevermind... Andrew From fperez.net at gmail.com Wed Feb 11 18:11:10 2009 From: fperez.net at gmail.com (Fernando Perez) Date: Wed, 11 Feb 2009 15:11:10 -0800 Subject: [Numpy-discussion] [SciPy-user] Numpy 1.2.1 and Scipy 0.7.0; Ubuntu packages In-Reply-To: <5b8d13220902110446x82e25a9ifb11bb563e468313@mail.gmail.com> References: <5b8d13220902110446x82e25a9ifb11bb563e468313@mail.gmail.com> Message-ID: On Wed, Feb 11, 2009 at 4:46 AM, David Cournapeau wrote: > Hi, > > I started to set up a PPA for scipy on launchpad, which enables to > build ubuntu packages for various distributions/architectures. The > link is there: > > https://edge.launchpad.net/~scipy/+archive/ppa Cool, thanks. Is it easy to provide also hardy packages, or does it require a lot of work on your part? Cheers, f From cournape at gmail.com Wed Feb 11 21:17:22 2009 From: cournape at gmail.com (David Cournapeau) Date: Thu, 12 Feb 2009 11:17:22 +0900 Subject: [Numpy-discussion] [SciPy-user] Numpy 1.2.1 and Scipy 0.7.0; Ubuntu packages In-Reply-To: References: <5b8d13220902110446x82e25a9ifb11bb563e468313@mail.gmail.com> Message-ID: <5b8d13220902111817q9bd42c0r8c42f6307ca1b318@mail.gmail.com> On Thu, Feb 12, 2009 at 8:11 AM, Fernando Perez wrote: > On Wed, Feb 11, 2009 at 4:46 AM, David Cournapeau wrote: >> Hi, >> >> I started to set up a PPA for scipy on launchpad, which enables to >> build ubuntu packages for various distributions/architectures. The >> link is there: >> >> https://edge.launchpad.net/~scipy/+archive/ppa > > Cool, thanks. Is it easy to provide also hardy packages, or does it > require a lot of work on your part? Unfortunately, it does require some work, because hardy uses g77 instead of gfortran, so the source package has to be different (once hardy is done, all the one below would be easy, though). I am not sure how to do that with PPA (the doc is not great). cheers, David From python6009 at gmail.com Wed Feb 11 21:27:13 2009 From: python6009 at gmail.com (A B) Date: Wed, 11 Feb 2009 18:27:13 -0800 Subject: [Numpy-discussion] loadtxt issues In-Reply-To: References: <7cc4bc500902102140h6cc85bb3wa28c89a147ddce73@mail.gmail.com> Message-ID: <7cc4bc500902111827p3942dc40ie1216c13000fe2e0@mail.gmail.com> On Tue, Feb 10, 2009 at 9:52 PM, Brent Pedersen wrote: > On Tue, Feb 10, 2009 at 9:40 PM, A B wrote: >> Hi, >> >> How do I write a loadtxt command to read in the following file and >> store each data point as the appropriate data type: >> >> 12|h|34.5|44.5 >> 14552|bbb|34.5|42.5 >> >> Do the strings have to be read in separately from the numbers? >> >> Why would anyone use 'S10' instead of 'string'? >> >> dt = {'names': ('gender','age','weight','bal'), 'formats': ('i4', >> 'S4','f4', 'f4')} >> >> a = loadtxt("sample_data.txt", dtype=dt) >> >> gives >> >> ValueError: need more than 1 value to unpack >> >> I can do a = loadtxt("sample_data.txt", dtype="string") but can't use >> 'string' instead of S4 and all my data is read into strings. >> >> Seems like all the examples on-line use either numeric or textual >> input, but not both. >> >> Thanks. >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion >> > > works for me but not sure i understand the problem, did you try > setting the delimiter? 
> > > import numpy as np > from cStringIO import StringIO > > txt = StringIO("""\ > 12|h|34.5|44.5 > 14552|bbb|34.5|42.5""") > > dt = {'names': ('gender','age','weight','bal'), 'formats': ('i4', > 'S4','f4', 'f4')} > a = np.loadtxt(txt, dtype=dt, delimiter="|") > print a.dtype I had tried both with and without the delimiter. In any event, it just worked for me as well. Not sure what I was missing before. Anyway, thank you. From rmay31 at gmail.com Wed Feb 11 23:38:20 2009 From: rmay31 at gmail.com (Ryan May) Date: Wed, 11 Feb 2009 22:38:20 -0600 Subject: [Numpy-discussion] genloadtxt: dtype=None and unpack=True Message-ID: Pierre, I noticed that using dtype=None with a heterogeneous set of data, trying to use unpack=True to get the columns into separate arrays (instead of a structured array) doesn't work. I've attached a patch that, in the case of dtype=None, unpacks the fields in the final array into a list of separate arrays. Does this seem like a good idea to you? Here's a test case: from cStringIO import StringIO s = '2,1950-02-27,35.55\n2,1951-02-19,35.27\n' a,b,c = np.genfromtxt(StringIO(s), delimiter=',', unpack=True, missing=' ', dtype=None) Ryan -- Ryan May Graduate Research Assistant School of Meteorology University of Oklahoma -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: genloadtxt_unpack_fields.diff Type: application/octet-stream Size: 481 bytes Desc: not available URL: From pgmdevlist at gmail.com Wed Feb 11 23:47:17 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Wed, 11 Feb 2009 23:47:17 -0500 Subject: [Numpy-discussion] genloadtxt: dtype=None and unpack=True In-Reply-To: References: Message-ID: <97251D70-38B3-4A41-B2EA-26A700C50452@gmail.com> On Feb 11, 2009, at 11:38 PM, Ryan May wrote: > Pierre, > > I noticed that using dtype=None with a heterogeneous set of data, > trying to use unpack=True to get the columns into separate arrays > (instead of a structured array) doesn't work. I've attached a patch > that, in the case of dtype=None, unpacks the fields in the final > array into a list of separate arrays. Does this seem like a good > idea to you? Nope, as it breaks consistency: depending on some input parameters, you either get an array or a list. I think it's better to leave it as it is, maybe adding an extra line in the doc precising that unpack=True doesn't do anything for structured arrays. From oliphant at enthought.com Wed Feb 11 23:53:54 2009 From: oliphant at enthought.com (Travis E. Oliphant) Date: Wed, 11 Feb 2009 23:53:54 -0500 Subject: [Numpy-discussion] FYI: New "select-multiple-fields" behavior Message-ID: <4993AB62.50505@enthought.com> Hi all, As of r6358, I checked in the functionality to allow selection by multiple fields along with a couple of tests. ary['field1', 'field3'] raises an error ary[['field1', 'field3']] is the correct spelling and returns a copy of the data in those fields in a new array. 
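For instance (an illustrative session -- the field names and dtype here are arbitrary, just to show the two spellings):

>>> import numpy as np
>>> a = np.zeros(3, dtype=[('field1', 'i4'), ('field2', 'f8'), ('field3', 'f8')])
>>> b = a[['field1', 'field3']]   # list of names: copy containing just those fields
>>> b.dtype.names
('field1', 'field3')
>>> b.shape
(3,)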
-Travis From python6009 at gmail.com Thu Feb 12 00:04:13 2009 From: python6009 at gmail.com (A B) Date: Wed, 11 Feb 2009 21:04:13 -0800 Subject: [Numpy-discussion] loadtxt issues In-Reply-To: <7cc4bc500902111827p3942dc40ie1216c13000fe2e0@mail.gmail.com> References: <7cc4bc500902102140h6cc85bb3wa28c89a147ddce73@mail.gmail.com> <7cc4bc500902111827p3942dc40ie1216c13000fe2e0@mail.gmail.com> Message-ID: <7cc4bc500902112104w22fb9978v8b6b3e8bf25166f3@mail.gmail.com> On Wed, Feb 11, 2009 at 6:27 PM, A B wrote: > On Tue, Feb 10, 2009 at 9:52 PM, Brent Pedersen wrote: >> On Tue, Feb 10, 2009 at 9:40 PM, A B wrote: >>> Hi, >>> >>> How do I write a loadtxt command to read in the following file and >>> store each data point as the appropriate data type: >>> >>> 12|h|34.5|44.5 >>> 14552|bbb|34.5|42.5 >>> >>> Do the strings have to be read in separately from the numbers? >>> >>> Why would anyone use 'S10' instead of 'string'? >>> >>> dt = {'names': ('gender','age','weight','bal'), 'formats': ('i4', >>> 'S4','f4', 'f4')} >>> >>> a = loadtxt("sample_data.txt", dtype=dt) >>> >>> gives >>> >>> ValueError: need more than 1 value to unpack >>> >>> I can do a = loadtxt("sample_data.txt", dtype="string") but can't use >>> 'string' instead of S4 and all my data is read into strings. >>> >>> Seems like all the examples on-line use either numeric or textual >>> input, but not both. >>> >>> Thanks. >>> _______________________________________________ >>> Numpy-discussion mailing list >>> Numpy-discussion at scipy.org >>> http://projects.scipy.org/mailman/listinfo/numpy-discussion >>> >> >> works for me but not sure i understand the problem, did you try >> setting the delimiter? >> >> >> import numpy as np >> from cStringIO import StringIO >> >> txt = StringIO("""\ >> 12|h|34.5|44.5 >> 14552|bbb|34.5|42.5""") >> >> dt = {'names': ('gender','age','weight','bal'), 'formats': ('i4', >> 'S4','f4', 'f4')} >> a = np.loadtxt(txt, dtype=dt, delimiter="|") >> print a.dtype > > I had tried both with and without the delimiter. In any event, it just > worked for me as well. Not sure what I was missing before. Anyway, > thank you. > Actually, I was using two different machines and it appears that the version of numpy available on Ubuntu is seriously out of date (1.0.4). Wonder why ... Version 1.2.1 on a RedHat box worked fine. From stefan at sun.ac.za Thu Feb 12 00:14:38 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Thu, 12 Feb 2009 07:14:38 +0200 Subject: [Numpy-discussion] FYI: New "select-multiple-fields" behavior In-Reply-To: <4993AB62.50505@enthought.com> References: <4993AB62.50505@enthought.com> Message-ID: <9457e7c80902112114m14c44f79ne9cdf2db062d5fae@mail.gmail.com> Hi Travis 2009/2/12 Travis E. Oliphant : > ary['field1', 'field3'] raises an error > ary[['field1', 'field3']] is the correct spelling and returns a copy of > the data in those fields in a new array. Is there absolutely no way of returning the result as a view? Regards St?fan From python6009 at gmail.com Thu Feb 12 00:24:39 2009 From: python6009 at gmail.com (A B) Date: Wed, 11 Feb 2009 21:24:39 -0800 Subject: [Numpy-discussion] Outer join ? 
Message-ID: <7cc4bc500902112124s6a00d59fw78ea94f71ae4bb82@mail.gmail.com> Hi, I have the following data structure: col1 | col2 | col3 20080101|key1|4 20080201|key1|6 20080301|key1|5 20080301|key2|3.4 20080601|key2|5.6 For each key in the second column, I would like to create an array where for all unique values in the first column, there will be either a value or zero if there is no data available. Like so: # 20080101, 20080201, 20080301, 20080601 key1 - 4, 6, 5, 0 key2 - 0, 0, 3.4, 5.6 Ideally, the results would end up in a 2d array. What's the most efficient way to accomplish this? Currently, I am getting a list of uniq col1 and col2 values into separate variables, then looping through each unique value in col2 a = loadtxt(...) dates = unique(a[:]['col1']) keys = unique(a[:]['col2']) for key in keys: b = a[where(a[:]['col2'] == key)] ??? Thanks in advance. From oliphant at enthought.com Thu Feb 12 00:28:28 2009 From: oliphant at enthought.com (Travis E. Oliphant) Date: Thu, 12 Feb 2009 00:28:28 -0500 Subject: [Numpy-discussion] FYI: New "select-multiple-fields" behavior In-Reply-To: <9457e7c80902112114m14c44f79ne9cdf2db062d5fae@mail.gmail.com> References: <4993AB62.50505@enthought.com> <9457e7c80902112114m14c44f79ne9cdf2db062d5fae@mail.gmail.com> Message-ID: <4993B37C.7080701@enthought.com> St?fan van der Walt wrote: > Hi Travis > > 2009/2/12 Travis E. Oliphant : > >> ary['field1', 'field3'] raises an error >> ary[['field1', 'field3']] is the correct spelling and returns a copy of >> the data in those fields in a new array. >> > > Is there absolutely no way of returning the result as a view? > Not that I can think of --- it does match advanced indexing semantics to have it be a copy. -Travis -- Travis Oliphant Enthought, Inc. (512) 536-1057 (office) (512) 536-1059 (fax) http://www.enthought.com oliphant at enthought.com From robert.kern at gmail.com Thu Feb 12 00:40:59 2009 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 11 Feb 2009 23:40:59 -0600 Subject: [Numpy-discussion] Outer join ? In-Reply-To: <7cc4bc500902112124s6a00d59fw78ea94f71ae4bb82@mail.gmail.com> References: <7cc4bc500902112124s6a00d59fw78ea94f71ae4bb82@mail.gmail.com> Message-ID: <3d375d730902112140j6878d379x6078ddc0ef7e4d70@mail.gmail.com> On Wed, Feb 11, 2009 at 23:24, A B wrote: > Hi, > > I have the following data structure: > > col1 | col2 | col3 > > 20080101|key1|4 > 20080201|key1|6 > 20080301|key1|5 > 20080301|key2|3.4 > 20080601|key2|5.6 > > For each key in the second column, I would like to create an array > where for all unique values in the first column, there will be either > a value or zero if there is no data available. Like so: > > # 20080101, 20080201, 20080301, 20080601 > > key1 - 4, 6, 5, 0 > key2 - 0, 0, 3.4, 5.6 > > Ideally, the results would end up in a 2d array. > > What's the most efficient way to accomplish this? Currently, I am > getting a list of uniq col1 and col2 values into separate variables, > then looping through each unique value in col2 > > a = loadtxt(...) > > dates = unique(a[:]['col1']) > keys = unique(a[:]['col2']) > > for key in keys: > b = a[where(a[:]['col2'] == key)] > ??? Take a look at setmember1d(). -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." 
-- Umberto Eco From stefan at sun.ac.za Thu Feb 12 00:44:36 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Thu, 12 Feb 2009 07:44:36 +0200 Subject: [Numpy-discussion] numscons/numpy.distutils bug related to MACOSX_DEPLOYMENT_TARGET In-Reply-To: <6ce0ac130902052019ne743034of817f02e31312f1f@mail.gmail.com> References: <6ce0ac130902031612t376a1a59n431a08c899b3a517@mail.gmail.com> <6ce0ac130902031634ye40a829gbac28740fa281231@mail.gmail.com> <3d375d730902031642o65baa95ciebde9b5cc8f8b800@mail.gmail.com> <6ce0ac130902031653x2a304265t41b3c4f70bc257ef@mail.gmail.com> <3d375d730902031658l2b727ad2t3cfbaff32865e013@mail.gmail.com> <6ce0ac130902032122i1ba31be3y9c2d034ee0a1d2fc@mail.gmail.com> <3d375d730902032132l33417bedia7dacf08f4e7996e@mail.gmail.com> <6ce0ac130902052000x761c47cen46a3815c2b741713@mail.gmail.com> <3d375d730902052013t49f5fbechb89bb8e1332bfb07@mail.gmail.com> <6ce0ac130902052019ne743034of817f02e31312f1f@mail.gmail.com> Message-ID: <9457e7c80902112144l32946315h8500896b6fc3b96c@mail.gmail.com> 2009/2/6 Brian Granger : > Great, what is the best way of rolling this into numpy? I've committed your patch. Cheers Stéfan From ellisonbg.net at gmail.com Thu Feb 12 00:46:09 2009 From: ellisonbg.net at gmail.com (Brian Granger) Date: Wed, 11 Feb 2009 21:46:09 -0800 Subject: [Numpy-discussion] Fast threading solution thoughts Message-ID: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> Hi, This is relevant for anyone who would like to speed up array-based codes using threads. I have a simple loop that I have implemented using Cython: def backstep(np.ndarray opti, np.ndarray optf, int istart, int iend, double p, double q): cdef int j cdef double *pi cdef double *pf pi = <double *>opti.data pf = <double *>optf.data with nogil: for j in range(istart, iend): pf[j] = (p*pi[j+1] + q*pi[j]) I need to call this function *many* times and each call cannot be performed until the previous one is complete, as there are data dependencies. But, I still want to parallelize a single call to this function across multiple cores (notice that I am releasing the GIL before I do the heavy lifting). I want to break my loop range(istart,iend) into pieces and have a thread do each piece. The arrays have sizes 10^3 to 10^5. Things I have tried: * Use pthreads and create new threads for each call to my function. This performed horribly due to the thread creation overhead. * Use a simple threadpool implementation in Python. This performed horribly as well even though I was not recreating threads each call. The reason in this case was the time spent waiting on locks in the thread pool implementation (which is based on Queue and threading). This is either a problem with threading itself or a fundamental limitation of the pthreads library itself. * My next step is to implement a thread pool using pthreads/Cython. Assuming I do this right, this should be as fast as I can get using pthreads. The only tools that I know of that are *really* designed to handle these types of fine-grained parallel problems are: * Intel's TBB * OpenMP * Cilk++ * ??? These seem like pretty heavy solutions though. So, do people have thoughts about ways of effectively using threads (not processes) for thin parallel loops over arrays? This is relevant to Numpy itself as it would be very nice if all ufuncs could release the GIL and be run on multiple threads.
Cheers, Brian From ellisonbg.net at gmail.com Thu Feb 12 00:46:23 2009 From: ellisonbg.net at gmail.com (Brian Granger) Date: Wed, 11 Feb 2009 21:46:23 -0800 Subject: [Numpy-discussion] numscons/numpy.distutils bug related to MACOSX_DEPLOYMENT_TARGET In-Reply-To: <9457e7c80902112144l32946315h8500896b6fc3b96c@mail.gmail.com> References: <6ce0ac130902031612t376a1a59n431a08c899b3a517@mail.gmail.com> <3d375d730902031642o65baa95ciebde9b5cc8f8b800@mail.gmail.com> <6ce0ac130902031653x2a304265t41b3c4f70bc257ef@mail.gmail.com> <3d375d730902031658l2b727ad2t3cfbaff32865e013@mail.gmail.com> <6ce0ac130902032122i1ba31be3y9c2d034ee0a1d2fc@mail.gmail.com> <3d375d730902032132l33417bedia7dacf08f4e7996e@mail.gmail.com> <6ce0ac130902052000x761c47cen46a3815c2b741713@mail.gmail.com> <3d375d730902052013t49f5fbechb89bb8e1332bfb07@mail.gmail.com> <6ce0ac130902052019ne743034of817f02e31312f1f@mail.gmail.com> <9457e7c80902112144l32946315h8500896b6fc3b96c@mail.gmail.com> Message-ID: <6ce0ac130902112146y12a6ba55kbadf3abaf17a4c10@mail.gmail.com> Thanks much! Brian On Wed, Feb 11, 2009 at 9:44 PM, St?fan van der Walt wrote: > 2009/2/6 Brian Granger : >> Great, what is the best way of rolling this into numpy? > > I've committed your patch. > > Cheers > St?fan > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From robert.kern at gmail.com Thu Feb 12 00:52:40 2009 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 11 Feb 2009 23:52:40 -0600 Subject: [Numpy-discussion] Fast threading solution thoughts In-Reply-To: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> Message-ID: <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> On Wed, Feb 11, 2009 at 23:46, Brian Granger wrote: > Hi, > > This is relevant for anyone who would like to speed up array based > codes using threads. > > I have a simple loop that I have implemented using Cython: > > def backstep(np.ndarray opti, np.ndarray optf, > int istart, int iend, double p, double q): > cdef int j > cdef double *pi > cdef double *pf > pi = opti.data > pf = optf.data > > with nogil: > for j in range(istart, iend): > pf[j] = (p*pi[j+1] + q*pi[j]) > > I need to call this function *many* times and each time cannot be > performed until the previous time is completely as there are data > dependencies. But, I still want to parallelize a single call to this > function across multiple cores (notice that I am releasing the GIL > before I do the heavy lifting). > > I want to break my loop range(istart,iend) into pieces and have a > thread do each piece. The arrays have sizes 10^3 to 10^5. > > Things I have tried: > > * Use pthreads and create new threads for each call to my function. > This performed horribly due to the thread creation overhead. > * Use a simple threadpool implementation in Python. This performed > horribly as well even though I was not recreating threads each call. > The reason in this case was the time spent waiting on locks in the > thread pool implementation (which is based on Queue and threading). > This is either a problem with threading itself or a fundamental > limitation of the pthreads library itself. > * My next step is to implement a thread pool using pthreads/Cython. > Assuming I do this right, this should be as fast as I can get using > pthreads. 
>
> The only tools that I know of that are *really* designed to handle
> these types of fine-grained parallel problems are:
>
> * Intel's TBB
> * OpenMP
> * Cilk++
> * ???
>
> These seem like pretty heavy solutions, though.

From a programmer's perspective, it seems to me like OpenMP is a much
lighter weight solution than pthreads.

> So, do people have thoughts about ways of effectively using threads
> (not processes) for thin parallel loops over arrays?  This is relevant
> to Numpy itself, as it would be very nice if all ufuncs could release
> the GIL and be run on multiple threads.

Eric Jones tried to do this with pthreads in C some time ago. His work
is here:

http://svn.scipy.org/svn/numpy/branches/multicore/

The lock overhead makes it usually not worthwhile. And there's still an
open problem for determining whether two strided arrays overlap in
memory.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From scott.sinclair.za at gmail.com  Thu Feb 12 01:03:42 2009
From: scott.sinclair.za at gmail.com (Scott Sinclair)
Date: Thu, 12 Feb 2009 08:03:42 +0200
Subject: [Numpy-discussion] loadtxt issues
In-Reply-To: <7cc4bc500902112104w22fb9978v8b6b3e8bf25166f3@mail.gmail.com>
References: <7cc4bc500902102140h6cc85bb3wa28c89a147ddce73@mail.gmail.com> <7cc4bc500902111827p3942dc40ie1216c13000fe2e0@mail.gmail.com> <7cc4bc500902112104w22fb9978v8b6b3e8bf25166f3@mail.gmail.com>
Message-ID: <6a17e9ee0902112203t2f43efe1qa6a498b0e7750452@mail.gmail.com>

> 2009/2/12 A B :
> Actually, I was using two different machines and it appears that the
> version of numpy available on Ubuntu is seriously out of date (1.0.4).
> Wonder why ...

See the recent post here:
http://projects.scipy.org/pipermail/numpy-discussion/2009-February/040252.html

Cheers,
Scott

From ellisonbg.net at gmail.com  Thu Feb 12 01:03:46 2009
From: ellisonbg.net at gmail.com (Brian Granger)
Date: Wed, 11 Feb 2009 22:03:46 -0800
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com>
Message-ID: <6ce0ac130902112203y3274ddd0g8196e05711361d5@mail.gmail.com>

> Eric Jones tried to do this with pthreads in C some time ago. His work
> is here:
>
> http://svn.scipy.org/svn/numpy/branches/multicore/
>
> The lock overhead makes it usually not worthwhile.

I was under the impression that Eric's implementation didn't use a
thread pool.  Thus I thought the bottleneck was the thread creation
time for his implementation.  Eric, can you comment?

Cheers,

Brian

From fperez.net at gmail.com  Thu Feb 12 01:13:46 2009
From: fperez.net at gmail.com (Fernando Perez)
Date: Wed, 11 Feb 2009 22:13:46 -0800
Subject: [Numpy-discussion] [SciPy-user] Numpy 1.2.1 and Scipy 0.7.0; Ubuntu packages
In-Reply-To: <5b8d13220902111817q9bd42c0r8c42f6307ca1b318@mail.gmail.com>
References: <5b8d13220902110446x82e25a9ifb11bb563e468313@mail.gmail.com> <5b8d13220902111817q9bd42c0r8c42f6307ca1b318@mail.gmail.com>
Message-ID: 

On Wed, Feb 11, 2009 at 6:17 PM, David Cournapeau wrote:
> Unfortunately, it does require some work, because hardy uses g77
> instead of gfortran, so the source package has to be different (once
> hardy is done, all the ones below would be easy, though). I am not
> sure how to do that with PPA (the doc is not great).
OK, thanks for the info.  This is already very useful.

Cheers,

f

From robert.kern at gmail.com  Thu Feb 12 01:18:46 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 12 Feb 2009 00:18:46 -0600
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <6ce0ac130902112203y3274ddd0g8196e05711361d5@mail.gmail.com>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <6ce0ac130902112203y3274ddd0g8196e05711361d5@mail.gmail.com>
Message-ID: <3d375d730902112218oa83d95fo4b240e40510cf06c@mail.gmail.com>

On Thu, Feb 12, 2009 at 00:03, Brian Granger wrote:
>> Eric Jones tried to do this with pthreads in C some time ago. His
>> work is here:
>>
>> http://svn.scipy.org/svn/numpy/branches/multicore/
>>
>> The lock overhead makes it usually not worthwhile.
>
> I was under the impression that Eric's implementation didn't use a
> thread pool.  Thus I thought the bottleneck was the thread creation
> time for his implementation.  Eric, can you comment?

It does use a thread pool. See PyUFunc_GenericFunction:

http://svn.scipy.org/svn/numpy/branches/multicore/numpy/core/src/ufuncobject.c

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From gael.varoquaux at normalesup.org  Thu Feb 12 01:26:36 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Thu, 12 Feb 2009 07:26:36 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com>
Message-ID: <20090212062636.GD30330@phare.normalesup.org>

On Wed, Feb 11, 2009 at 11:52:40PM -0600, Robert Kern wrote:
> > These seem like pretty heavy solutions, though.

> From a programmer's perspective, it seems to me like OpenMP is a much
> lighter weight solution than pthreads.

From a programmer's perspective, because, IMHO, openmp is implemented
using pthreads.  I do have difficulties verifying this on the web, but
the documents I find hint at that.

Gaël

From david at ar.media.kyoto-u.ac.jp  Thu Feb 12 01:12:45 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Thu, 12 Feb 2009 15:12:45 +0900
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com>
Message-ID: <4993BDDD.9020404@ar.media.kyoto-u.ac.jp>

Robert Kern wrote:
>
> Eric Jones tried to do this with pthreads in C some time ago. His work
> is here:
>
> http://svn.scipy.org/svn/numpy/branches/multicore/
>
> The lock overhead makes it usually not worthwhile.

I am curious: would you know what would be different in numpy's case
compared to matlab's array model concerning locks?  Matlab, up to
recently, only spread BLAS/LAPACK across multiple cores, but since
matlab 7.3 (or 7.4), it also uses multiple cores for mathematical
functions (cos, etc...).  So at least in matlab's model, it looks like
it can be useful.

I understand that numpy's model is more flexible (I don't think strided
arrays can overlap in matlab, for example, at least not from what you
can see from the public API).
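For instance, here is a tiny sketch of the aliasing numpy permits, and
that a parallel ufunc dispatcher would have to detect (plain numpy,
nothing assumed beyond two views of one buffer):

import numpy as np

a = np.arange(10)
b = a[2:9]   # contiguous view of a
c = a[::2]   # strided view of a; b and c overlap at a[2], a[4], a[6], a[8]
b[0] = 99    # writes through the shared buffer
print a[2], c[1]   # prints: 99 99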
cheers,

David

From david at ar.media.kyoto-u.ac.jp  Thu Feb 12 01:15:55 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Thu, 12 Feb 2009 15:15:55 +0900
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <20090212062636.GD30330@phare.normalesup.org>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <20090212062636.GD30330@phare.normalesup.org>
Message-ID: <4993BE9B.9060709@ar.media.kyoto-u.ac.jp>

Gael Varoquaux wrote:
> From a programmer's perspective, because, IMHO, openmp is implemented
> using pthreads.

Since openmp also exists on windows, I doubt that it is required that
openmp uses pthread :) On linux, with gcc, using -fopenmp implies
-pthread, so I guess it uses pthread (can you be more low level than
pthread on a pthread-enabled Linux in userland?)

cheers,

David

From robert.kern at gmail.com  Thu Feb 12 01:42:37 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 12 Feb 2009 00:42:37 -0600
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <4993BE9B.9060709@ar.media.kyoto-u.ac.jp>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <20090212062636.GD30330@phare.normalesup.org> <4993BE9B.9060709@ar.media.kyoto-u.ac.jp>
Message-ID: <3d375d730902112242o27f4144dj83f835a48c48df90@mail.gmail.com>

On Thu, Feb 12, 2009 at 00:15, David Cournapeau wrote:
> Gael Varoquaux wrote:
>> From a programmer's perspective, because, IMHO, openmp is implemented
>> using pthreads.
>
> Since openmp also exists on windows, I doubt that it is required that
> openmp uses pthread :)

It is implemented using threads, with Windows native threads on
Windows. I think Gaël really just meant "threads" there.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From ellisonbg.net at gmail.com  Thu Feb 12 01:52:57 2009
From: ellisonbg.net at gmail.com (Brian Granger)
Date: Wed, 11 Feb 2009 22:52:57 -0800
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <4993BDDD.9020404@ar.media.kyoto-u.ac.jp>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp>
Message-ID: <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com>

> I am curious: would you know what would be different in numpy's case
> compared to matlab's array model concerning locks?  Matlab, up to
> recently, only spread BLAS/LAPACK across multiple cores, but since
> matlab 7.3 (or 7.4), it also uses multiple cores for mathematical
> functions (cos, etc...).  So at least in matlab's model, it looks
> like it can be useful.

Good point.  Is it possible to tell what array size it switches over
to using multiple threads?  Also, do you happen to know how Matlab is
doing this?

> I understand that numpy's model is more flexible (I don't think
> strided arrays can overlap in matlab, for example, at least not from
> what you can see from the public API).

True, but I would be happy to just have a fast C based threadpool
implementation I could use in low level Cython based loops.  When
performance *really* matters, ufuncs have a lot of other overhead that
is typically unacceptable.
But I could imagine that all that extra logic kills the parallel
scaling through Amdahl's law (the extra logic is serial logic).

Cheers,

Brian

> cheers,
>
> David
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From robert.kern at gmail.com  Thu Feb 12 01:56:31 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 12 Feb 2009 00:56:31 -0600
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp> <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com>
Message-ID: <3d375d730902112256p5157010am51d5517d8c1ca060@mail.gmail.com>

On Thu, Feb 12, 2009 at 00:52, Brian Granger wrote:
>> I am curious: would you know what would be different in numpy's case
>> compared to matlab's array model concerning locks?  Matlab, up to
>> recently, only spread BLAS/LAPACK across multiple cores, but since
>> matlab 7.3 (or 7.4), it also uses multiple cores for mathematical
>> functions (cos, etc...).  So at least in matlab's model, it looks
>> like it can be useful.
>
> Good point.  Is it possible to tell what array size it switches over
> to using multiple threads?

Yes.

http://svn.scipy.org/svn/numpy/branches/multicore/numpy/core/threadapi.py

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From ellisonbg.net at gmail.com  Thu Feb 12 01:58:46 2009
From: ellisonbg.net at gmail.com (Brian Granger)
Date: Wed, 11 Feb 2009 22:58:46 -0800
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <3d375d730902112256p5157010am51d5517d8c1ca060@mail.gmail.com>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp> <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com> <3d375d730902112256p5157010am51d5517d8c1ca060@mail.gmail.com>
Message-ID: <6ce0ac130902112258l50c64577uf433243587c16ea4@mail.gmail.com>

>> Good point.  Is it possible to tell what array size it switches over
>> to using multiple threads?
>
> Yes.
>
> http://svn.scipy.org/svn/numpy/branches/multicore/numpy/core/threadapi.py

Sorry, I was curious about what Matlab does in this respect.  But this
is very useful and I will look at it.

Cheers,

Brian

> --
> Robert Kern
>
> "I have come to believe that the whole world is an enigma, a harmless
> enigma that is made terrible by our own mad attempt to interpret it as
> though it had an underlying truth."
>   -- Umberto Eco
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From david at ar.media.kyoto-u.ac.jp  Thu Feb 12 01:50:16 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Thu, 12 Feb 2009 15:50:16 +0900
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp> <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com>
Message-ID: <4993C6A8.6030808@ar.media.kyoto-u.ac.jp>

Brian Granger wrote:
>> I am curious: would you know what would be different in numpy's case
>> compared to matlab's array model concerning locks?  Matlab, up to
>> recently, only spread BLAS/LAPACK across multiple cores, but since
>> matlab 7.3 (or 7.4), it also uses multiple cores for mathematical
>> functions (cos, etc...).  So at least in matlab's model, it looks
>> like it can be useful.
>
> Good point.  Is it possible to tell what array size it switches over
> to using multiple threads?  Also, do you happen to know how Matlab is
> doing this?

No - I have never seen a deep explanation of the matlab model.  The C
api is so small that it is hard to deduce anything from it (except that
the memory handling is not ref-counting-based; I don't know if that
matters for our discussion of speeding up ufuncs).  I would guess that
since two arrays cannot share data (COW-based), lock handling may be
easier to deal with?  I am not really familiar with multi-thread
programming (my only limited experience is soft real-time programming
for audio processing, where the issues are totally different, since
latency matters as much if not more than throughput).

> True, but I would be happy to just have a fast C based threadpool
> implementation I could use in low level Cython based loops.

Matlab has a parallel toolbox to do this kind of thing in matlab (I
don't know about in C).  I don't know anything about it, nor do I know
if it can be applied in any way to python/numpy's case:

http://www.mathworks.com/products/parallel-computing/

cheers,

David

From strawman at astraw.com  Thu Feb 12 03:07:02 2009
From: strawman at astraw.com (Andrew Straw)
Date: Thu, 12 Feb 2009 00:07:02 -0800
Subject: [Numpy-discussion] [SciPy-user] Numpy 1.2.1 and Scipy 0.7.0; Ubuntu packages
In-Reply-To: 
References: <5b8d13220902110446x82e25a9ifb11bb563e468313@mail.gmail.com> <5b8d13220902111817q9bd42c0r8c42f6307ca1b318@mail.gmail.com>
Message-ID: <4993D8A6.5080107@astraw.com>

Fernando Perez wrote:
> On Wed, Feb 11, 2009 at 6:17 PM, David Cournapeau wrote:
>
>> Unfortunately, it does require some work, because hardy uses g77
>> instead of gfortran, so the source package has to be different (once
>> hardy is done, all the ones below would be easy, though). I am not
>> sure how to do that with PPA (the doc is not great).
>
> OK, thanks for the info.  This is already very useful.

What exactly is the expected problem, and how would I verify that I'm
not getting hit by it?  The context is that I have built your packages
on Hardy and got no FTBFS errors in a clean sbuild machine, and was
about to upload this to my lab's repo but saw your email.
(Also, I have been using a scipy 0.7.0rc2-ish apparently descended from
the same Debian source package as yours for a couple of weeks and have
noted no problems of late.)

"nosetests scipy" on an amd64 box with lots of installed packages shows:

Ran 3211 tests in 356.572s
FAILED (SKIP=13, errors=9)

where the errors are all known failures.  The same test on the i386
sbuild shows:

Ran 3211 tests in 312.464s
FAILED (SKIP=16, errors=204)

Most of the failures appear to be from weave/scxx and I suspect have to
do with some package not being installed -- if that's true, it's
probably a packaging bug and not a scipy bug.

From gael.varoquaux at normalesup.org  Thu Feb 12 03:12:38 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Thu, 12 Feb 2009 09:12:38 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <3d375d730902112242o27f4144dj83f835a48c48df90@mail.gmail.com>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <20090212062636.GD30330@phare.normalesup.org> <4993BE9B.9060709@ar.media.kyoto-u.ac.jp> <3d375d730902112242o27f4144dj83f835a48c48df90@mail.gmail.com>
Message-ID: <20090212081238.GB20814@phare.normalesup.org>

On Thu, Feb 12, 2009 at 12:42:37AM -0600, Robert Kern wrote:
> It is implemented using threads, with Windows native threads on
> Windows. I think Gaël really just meant "threads" there.

I guess so :). Once you reformulate my remark in proper terms, this is
indeed what comes out.  I guess all it means is that openMP is
implemented using native OS threads, and is nothing more than an
abstraction on top of them.

Gaël

From david at ar.media.kyoto-u.ac.jp  Thu Feb 12 02:59:29 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Thu, 12 Feb 2009 16:59:29 +0900
Subject: [Numpy-discussion] [SciPy-user] Numpy 1.2.1 and Scipy 0.7.0; Ubuntu packages
In-Reply-To: <4993D8A6.5080107@astraw.com>
References: <5b8d13220902110446x82e25a9ifb11bb563e468313@mail.gmail.com> <5b8d13220902111817q9bd42c0r8c42f6307ca1b318@mail.gmail.com> <4993D8A6.5080107@astraw.com>
Message-ID: <4993D6E1.5080803@ar.media.kyoto-u.ac.jp>

Andrew Straw wrote:
> Fernando Perez wrote:
>
>> On Wed, Feb 11, 2009 at 6:17 PM, David Cournapeau wrote:
>>
>>> Unfortunately, it does require some work, because hardy uses g77
>>> instead of gfortran, so the source package has to be different (once
>>> hardy is done, all the ones below would be easy, though). I am not
>>> sure how to do that with PPA (the doc is not great).
>>
>> OK, thanks for the info.  This is already very useful.
>
> What exactly is the expected problem, and how would I verify that I'm
> not getting hit by it?

I want to follow the official debian/ubuntu packages as closely as
possible.  Ideally, any package produced on the PPA that is superseded
by an official package (from 1.2.1-1~ppaN to 1.2.1-1) should be
identical to the superseding package.  Hardy's default fortran ABI is
g77, not gfortran, so I have to use g77 for hardy - and the ppa
package, limited to intrepid, uses gfortran (since intrepid's ABI is
gfortran's).
In the rpm world, you can use conditionals on distribution version/type
in the spec file (which is the control + changelog + rules in one
file), but AFAIK you can't do that with debian, or at least I have not
found the relevant doc.

cheers,

David

From paul at rudin.co.uk  Thu Feb 12 03:55:57 2009
From: paul at rudin.co.uk (Paul Rudin)
Date: Thu, 12 Feb 2009 08:55:57 +0000
Subject: [Numpy-discussion] numpy-izing a loop
References: <87y6we1lbw.fsf@rudin.co.uk> <9457e7c80902100819t139d6342y8885f417a4ec4443@mail.gmail.com> <3d375d730902101148t10d50084k1a775e681a5e3d05@mail.gmail.com> <9457e7c80902101327q5cd7cd4ax7e2c1713fcee2cd4@mail.gmail.com> <9457e7c80902101333l5883971i2b15c6290315bf8d@mail.gmail.com> <87skmltm63.fsf@rudin.co.uk>
Message-ID: <87d4doc9uq.fsf@rudin.co.uk>

Paul Rudin writes:

> def compute_voxels2(depth_buffers):
>     dim = depth_buffers[0].shape[0]
>     znear, zfar, ynear, yfar, xnear, xfar = depth_buffers
>     z = numpy.arange(dim)
>     y = numpy.arange(dim)[:, None]
>     x = numpy.arange(dim)[:, None, None]
>
>     return ((xnear[y, z] < xfar[y, z]) &
>             (ynear[x, z] < yfar[x, z]) &
>             (znear[x, y] < zfar[x, y]))

... or, more succinctly:

def compute_voxels2(znear, zfar, ynear, yfar, xnear, xfar):
    return ( (xnear

> All that remains is for me to verify that it's actually the result I
> want :)

Which it is, although getting the 6 planes passed in oriented correctly
in 3d-space makes my head hurt :/

From dagss at student.matnat.uio.no  Thu Feb 12 04:39:56 2009
From: dagss at student.matnat.uio.no (Dag Sverre Seljebotn)
Date: Thu, 12 Feb 2009 10:39:56 +0100
Subject: [Numpy-discussion] PyArray_SETITEM with object arrays in Cython
In-Reply-To: <6c476c8a0902111412y2cb6985x973ff830d1a1412@mail.gmail.com>
References: <6c476c8a0902110710q4c9b16e4vcc3e315f64a84caf@mail.gmail.com> <76550d7a4d21a5fbd6764825cc547ae1.squirrel@webmail.uio.no> <6c476c8a0902111412y2cb6985x973ff830d1a1412@mail.gmail.com>
Message-ID: <4993EE6C.4070507@student.matnat.uio.no>

Wes McKinney wrote:
> The general problem here is an indexed array (by dates or strings, for
> example) that you want to conform to a new index.  Most of the time
> the arrays contain floats, but occasionally PyObjects.  For some
> reason the access and assignment is slow (this function can be faster
> by a factor of 50 with C API macros, so clearly something is awry) --
> let me know if you see anything obviously wrong with this.

Thanks for the example; I don't have time this week but I recorded it
on the Cython trac and will hopefully get back to it soon.

http://trac.cython.org/cython_trac/ticket/209

Dag Sverre

From faltet at pytables.org  Thu Feb 12 05:05:54 2009
From: faltet at pytables.org (Francesc Alted)
Date: Thu, 12 Feb 2009 11:05:54 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com>
Message-ID: <200902121105.55893.faltet@pytables.org>

Hi Brian,

A Thursday 12 February 2009, Brian Granger escrigué:
> Hi,
>
> This is relevant for anyone who would like to speed up array based
> codes using threads.
>
> I have a simple loop that I have implemented using Cython:
>
> def backstep(np.ndarray opti, np.ndarray optf,
>              int istart, int iend, double p, double q):
>     cdef int j
>     cdef double *pi
>     cdef double *pf
>     pi = <double *>opti.data
>     pf = <double *>optf.data
>
>     with nogil:
>         for j in range(istart, iend):
>             pf[j] = (p*pi[j+1] + q*pi[j])
>
> I need to call this function *many* times, and each call cannot be
> performed until the previous one has completed, as there are data
> dependencies.  But I still want to parallelize a single call to this
> function across multiple cores (notice that I am releasing the GIL
> before I do the heavy lifting).
>
> I want to break my loop range(istart, iend) into pieces and have a
> thread do each piece.  The arrays have sizes 10^3 to 10^5.
>
> Things I have tried:
[clip]

If your problem is evaluating vector expressions just like the above
(i.e. without using transcendental functions like sin, exp, etc...),
usually the bottleneck is memory access, so using several threads is
simply not going to help you achieve better performance, but rather the
contrary (you have to deal with the additional thread overhead).  So,
frankly, I would not waste more time trying to parallelize that.

As an example, in the recent support of VML in numexpr we have disabled
the use of VML (as well as the OpenMP threading support that comes with
it) in cases like yours, where only additions and multiplications are
performed (these operations are very fast in modern processors, and the
sole bottleneck in this case is the memory bandwidth, as I've said).
However, for expressions containing operations like division or
transcendental functions, VML activates automatically, and you can make
use of several cores if you want.  So, if you are in this case, and you
have access to Intel MKL (the library that contains VML), you may want
to give numexpr a try.

HTH,

-- 
Francesc Alted

From dagss at student.matnat.uio.no  Thu Feb 12 05:30:57 2009
From: dagss at student.matnat.uio.no (Dag Sverre Seljebotn)
Date: Thu, 12 Feb 2009 11:30:57 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com>
Message-ID: <4993FA61.7010000@student.matnat.uio.no>

Brian Granger wrote:
> Hi,
>
> This is relevant for anyone who would like to speed up array based
> codes using threads.
>
> I have a simple loop that I have implemented using Cython:
> [...]
> I need to call this function *many* times, and each call cannot be
> performed until the previous one has completed, as there are data
> dependencies.  But I still want to parallelize a single call to this
> function across multiple cores (notice that I am releasing the GIL
> before I do the heavy lifting).

A quick digression:

It would be interesting to see how a spec would look for integrating
OpenMP natively into Cython for these kinds of purposes.  Cython is
still flexible as a language, after all.  Avoiding language bloat is
also important, but it is difficult to know what kind of balance can be
struck before some kind of spec is worked out.  Has anyone managed to
use OpenMP with Cython code in a nice way already?
I couldn't do any work on it right now, but it could sit in the
pipeline for the year to come.  Also, I have a strong feeling that
working out the spec and the language design issues would take more
time than the actual implementation anyway.

Dag Sverre

From strawman at astraw.com  Thu Feb 12 05:53:17 2009
From: strawman at astraw.com (Andrew Straw)
Date: Thu, 12 Feb 2009 02:53:17 -0800
Subject: [Numpy-discussion] [SciPy-user] Numpy 1.2.1 and Scipy 0.7.0; Ubuntu packages
In-Reply-To: <4993D6E1.5080803@ar.media.kyoto-u.ac.jp>
References: <5b8d13220902110446x82e25a9ifb11bb563e468313@mail.gmail.com> <5b8d13220902111817q9bd42c0r8c42f6307ca1b318@mail.gmail.com> <4993D8A6.5080107@astraw.com> <4993D6E1.5080803@ar.media.kyoto-u.ac.jp>
Message-ID: <4993FF9D.2010805@astraw.com>

David Cournapeau wrote:
> Andrew Straw wrote:
>> What exactly is the expected problem, and how would I verify that I'm
>> not getting hit by it?
>
> I want to follow the official debian/ubuntu packages as closely as
> possible.  Ideally, any package produced on the PPA that is superseded
> by an official package (from 1.2.1-1~ppaN to 1.2.1-1) should be
> identical to the superseding package.  Hardy's default fortran ABI is
> g77, not gfortran, so I have to use g77 for hardy - and the ppa
> package, limited to intrepid, uses gfortran (since intrepid's ABI is
> gfortran's).

(Warning: this email is a little over-detailed on the packaging details
front.  Believe it or not, I'm not discussing the details of Debian
packaging for fun, but rather my questions have practical importance to
me -- I don't want to break all my lab's scipy installations. :)

This doesn't make sense to me.  I built the .deb in a clean, minimal
sbuild (a chroot with only a very few basic packages installed,
somewhat mimicking Ubuntu's PPA builder).  It built from your
unmodified .dsc, which auto-downloads the declared dependencies (and
nothing else).  It passes the tests.  To be very explicit -- I didn't
specify to use g77 at any point.  (As implied by my previous statement
of using your unmodified .dsc, I used only the debian/rules and
debian/control in your package.)

To understand your statement about identical, I will operationally
define "identical" for .debs to mean that they were built from the same
.dsc.  Of course, in the case you describe above, it can't be _exactly_
the same .dsc because the version numbers in debian/changelog must
change, and consequently so must the checksums and GPG signature in the
.dsc file, and presumably a different person will sign it.  Also, there
will be timestamp differences and such for two .debs built from the
exact same .dsc, but we can ignore those.  In this case, I don't see
why an "official" package, which meets this operational definition of
identical, wouldn't work on Hardy, as it would be built from a nearly
identical .dsc in a nearly identical clean build environment.  (Of
course, there will never be an official package of this for Hardy, but
that's not the point.)
> In your case, you built a package which uses the gfortran ABI on
> Hardy - it works, but is not really acceptable for an official
> package - and thus, when an upgrade from ppa to official happens, you
> won't get the same package.

Why is it "not really acceptable"?  As long as it builds and works and
doesn't break anything, why would Ubuntu maintainers care if it uses
the gfortran ABI?

Also, in practical terms, what upgrade?  A) Hardy will not upgrade
python-scipy.  It's against policy for a released distribution to
upgrade software without a security reason.  B) Imagining for a moment
that there would be an upgrade, why do you think it would break?

> In the rpm world, you can use conditionals on distribution
> version/type in the spec file (which is the control + changelog +
> rules in one file), but AFAIK you can't do that with debian, or at
> least I have not found the relevant doc.

I don't understand what you're saying.  My understanding is that at the
beginning of each distribution (Hardy, Intrepid, Lenny), the
maintainers decide on a C++ (and I guess Fortran, but I'm not sure) ABI
and fix the toolchain to build this ABI.  From then on, everything is
built with this ABI.  And the point of a signed .dsc file and a clean
sbuild/pbuilder is that any .deb that gets built will be contingent on
the files in debian/*, because that's cryptographically signed in the
.dsc file.  So, if you trust the archive master and his computer (by
trusting his keys in your apt keyring), you trust that the .deb was
built from the .dsc.  And the .dsc is signed by the maintainer.  So
there's a cryptographic chain of trust to those control + changelog +
rules files.

I'm still not sure I'm getting your worry, though...

Thanks,
Andrew

From gregor.thalhammer at gmail.com  Thu Feb 12 06:05:53 2009
From: gregor.thalhammer at gmail.com (Gregor Thalhammer)
Date: Thu, 12 Feb 2009 12:05:53 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp> <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com>
Message-ID: <49940291.4050906@googlemail.com>

Brian Granger schrieb:
>> I am curious: would you know what would be different in numpy's case
>> compared to matlab's array model concerning locks?  Matlab, up to
>> recently, only spread BLAS/LAPACK across multiple cores, but since
>> matlab 7.3 (or 7.4), it also uses multiple cores for mathematical
>> functions (cos, etc...).  So at least in matlab's model, it looks
>> like it can be useful.
>
> Good point.  Is it possible to tell what array size it switches over
> to using multiple threads?  Also, do you happen to know how Matlab is
> doing this?

Recent Matlab versions use Intel's Math Kernel Library, which performs
automatic multi-threading - also for mathematical functions like sin
etc., but not for addition, multiplication etc.  It seems to me Matlab
itself does not take care of multi-threading.  On

http://www.intel.com/software/products/mkl/data/vml/functions/_listfunc.html

you can have a look at the performance data of the MKL vectorized math
functions.  At a vector length somewhere between 100 and 1000,
depending on the function, precision, and CPU architecture, they switch
to multi-threading.
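As a rough sketch, this is the kind of expression where the VML path
(and hence its multi-threading) would kick in, assuming a numexpr build
linked against MKL (the array sizes here are arbitrary):

import numpy as np
import numexpr as ne

a = np.random.rand(100000)
b = np.random.rand(100000)

# plain additions/multiplications stay serial (memory-bound), but the
# transcendental functions below are handed to VML, which multi-threads
# above its size threshold
result = ne.evaluate("exp(a) / (1 + sin(b)**2)")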
Gregor

From faltet at pytables.org  Thu Feb 12 06:16:19 2009
From: faltet at pytables.org (Francesc Alted)
Date: Thu, 12 Feb 2009 12:16:19 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <4993FA61.7010000@student.matnat.uio.no>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <4993FA61.7010000@student.matnat.uio.no>
Message-ID: <200902121216.20222.faltet@pytables.org>

A Thursday 12 February 2009, Dag Sverre Seljebotn escrigué:
> A quick digression:
>
> It would be interesting to see how a spec would look for integrating
> OpenMP natively into Cython for these kinds of purposes.  Cython is
> still flexible as a language, after all.

That would be really nice indeed.

> Avoiding language bloat is
> also important, but it is difficult to know what kind of balance can
> be struck before some kind of spec is worked out.  Has anyone managed
> to use OpenMP with Cython code in a nice way already?
>
> I couldn't do any work on it right now, but it could sit in the
> pipeline for the year to come.  Also, I have a strong feeling that
> working out the spec and the language design issues would take more
> time than the actual implementation anyway.

I tend to agree with you.  As a matter of fact, doing (efficient)
parallelism is a very hairy thing, and understanding all (or just most)
of its issues may be the real problem for doing the port (unless one
can find some direct way to translate OpenMP directives from Cython
code to the C code, which would be wonderful).

-- 
Francesc Alted

From david at ar.media.kyoto-u.ac.jp  Thu Feb 12 06:06:39 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Thu, 12 Feb 2009 20:06:39 +0900
Subject: [Numpy-discussion] [SciPy-user] Numpy 1.2.1 and Scipy 0.7.0; Ubuntu packages
In-Reply-To: <4993FF9D.2010805@astraw.com>
References: <5b8d13220902110446x82e25a9ifb11bb563e468313@mail.gmail.com> <5b8d13220902111817q9bd42c0r8c42f6307ca1b318@mail.gmail.com> <4993D8A6.5080107@astraw.com> <4993D6E1.5080803@ar.media.kyoto-u.ac.jp> <4993FF9D.2010805@astraw.com>
Message-ID: <499402BF.60500@ar.media.kyoto-u.ac.jp>

Andrew Straw wrote:
> (Warning: this email is a little over-detailed on the packaging
> details front.  Believe it or not, I'm not discussing the details of
> Debian packaging for fun, but rather my questions have practical
> importance to me -- I don't want to break all my lab's scipy
> installations. :)

Well, the devil is in the details, especially for packaging :)

> This doesn't make sense to me.  I built the .deb in a clean, minimal
> sbuild (a chroot with only a very few basic packages installed,
> somewhat mimicking Ubuntu's PPA builder).  It built from your
> unmodified .dsc, which auto-downloads the declared dependencies (and
> nothing else).  It passes the tests.  To be very explicit -- I didn't
> specify to use g77 at any point.  (As implied by my previous statement
> of using your unmodified .dsc, I used only the debian/rules and
> debian/control in your package.)

Yes: your package will be almost the same as mine.  It will use
gfortran, and not g77.  There is nothing wrong with that per se, but I
don't think it is a good practice in general.  Here is an example which
can break things, to show that this is not just a red herring:

- install numpy from the PPA (gfortran ABI), pulling in gfortran as well
- later, g77 is installed (as a dependency, whatever)
- now, the user builds his own python extension containing fortran
  code, or for example scipy -> it uses g77 and no longer works.
My main rationale for providing a PPA is to avoid the never-ending
queue of emails about missing symbols, etc... because I am tired of it,
because it leaves a bad taste in users' mouths, and because I would
like to deal with more interesting issues.

> To understand your statement about identical, I will operationally
> define "identical" for .debs to mean that they were built from the
> same .dsc.

I have an even broader definition of identical: the same
control/rules/patches is identical for me.

> In this case, I don't see why an "official" package, which meets this
> operational definition of identical, wouldn't work on Hardy, as it
> would be built from a nearly identical .dsc in a nearly identical
> clean build environment.  (Of course, there will never be an official
> package of this for Hardy, but that's not the point.)

Because a package has to be well integrated with the rest of the
system - I think my example above shows the problem of using a fortran
compiler which has a different ABI than the default one.

> Why is it "not really acceptable"?  As long as it builds and works and
> doesn't break anything, why would Ubuntu maintainers care if it uses
> the gfortran ABI?

The problem is that it is impossible to avoid breaking anything without
using the g77 ABI on hardy.  I don't understand "it is ok not to follow
the ABI as long as it does not break anything": if there were no need
to follow an ABI to avoid breaking anything, then there would be no
point in having an ABI in the first place :)

> Also, in practical terms, what upgrade?  A) Hardy will not upgrade
> python-scipy.  It's against policy for a released distribution to
> upgrade software without a security reason.

First, concerning the upgrade, I am just following the Ubuntu guide.
To quote it:

"Ubuntu package names are suffixed by the version number of the
package.  This allows Ubuntu to distinguish newer packages from older
ones and so remain up to date.  If you're creating an alternative
version of a package already available in Ubuntu's repositories, you
should ensure that:

* your package supersedes the official Ubuntu version
* future Ubuntu versions will supersede your package."

As an example, I can think of backports.

> B) Imagining for a moment that there would be an upgrade, why do you
> think it would break?

Well, if the upgraded package uses the g77 ABI and the old one the
gfortran ABI, this is quite likely to break some extensions built
against numpy/scipy, I believe.

>> In the rpm world, you can use conditionals on distribution
>> version/type in the spec file (which is the control + changelog +
>> rules in one file), but AFAIK you can't do that with debian, or at
>> least I have not found the relevant doc.
>
> I don't understand what you're saying.

One solution would be to be able to say in the control file: for hardy,
do this; for intrepid, do that.  For example, with rpm packaging, I can
do the following:

%if 0%{?fedora_version} || 0%{?rhel_version} || 0%{?centos_version}
BuildRequires: gcc-gfortran python python-devel lapack3-devel < 3.1 refblas3-devel
Requires: gcc-gfortran lapack3 < 3.1 refblas3
%endif

%if 0%{?suse_version}
BuildRequires: gcc-fortran python python-devel lapack3-devel < 3.1 refblas3-devel
Requires: gcc-fortran lapack3 < 3.1 refblas3
%endif

Notice the different name for the dependency (not that this is a
problem with Ubuntu/Debian, but I would still need different packages).
> And the point of a signed .dsc file and a clean sbuild/pbuilder is
> that any .deb that gets built will be contingent on the files in
> debian/*, because that's cryptographically signed in the .dsc file.
> So, if you trust the archive master and his computer (by trusting his
> keys in your apt keyring), you trust that the .deb was built from the
> .dsc.  And the .dsc is signed by the maintainer.  So there's a
> cryptographic chain of trust to those control + changelog + rules
> files.

The problem has nothing to do with security - I am not sure I follow
why you mention signing issues.

cheers,

David

From dagss at student.matnat.uio.no  Thu Feb 12 06:34:42 2009
From: dagss at student.matnat.uio.no (Dag Sverre Seljebotn)
Date: Thu, 12 Feb 2009 12:34:42 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <200902121216.20222.faltet@pytables.org>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <4993FA61.7010000@student.matnat.uio.no> <200902121216.20222.faltet@pytables.org>
Message-ID: <49940952.6060208@student.matnat.uio.no>

Francesc Alted wrote:
> A Thursday 12 February 2009, Dag Sverre Seljebotn escrigué:
>> A quick digression:
>>
>> It would be interesting to see how a spec would look for integrating
>> OpenMP natively into Cython for these kinds of purposes.  Cython is
>> still flexible as a language, after all.
>
> That would be really nice indeed.
>
> I tend to agree with you.  As a matter of fact, doing (efficient)
> parallelism is a very hairy thing, and understanding all (or just
> most) of its issues may be the real problem for doing the port (unless
> one can find some direct way to translate OpenMP directives from
> Cython code to the C code, which would be wonderful).

This is what I'm thinking.  Example: this in Cython:

with cython.openmp.parallel(locals="a,b"):  # implies nogil?
    for i in cython.openmp.range(0, 10):
        ...

could simply translate into emitting the right #pragmas in the
generated C at the right locations.

FYI, I am one of the core Cython developers and can make such
modifications in Cython itself as long as there's consensus on how it
should look on the Cython mailing list.  My problem is that I don't
really know OpenMP and have little experience with it, so I'm not the
best person for creating a draft of how such high-level OpenMP
constructs should look in Cython.
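Applied to the backstep loop from this thread, the construct might read
like this (entirely hypothetical syntax: the cython.openmp names are
just the placeholders from the sketch above, and the comments show the
pragmas the code generator would emit):

with nogil:
    with cython.openmp.parallel():                   # -> #pragma omp parallel
        for j in cython.openmp.range(istart, iend):  # -> #pragma omp for
            pf[j] = p * pi[j + 1] + q * pi[j]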
Dag Sverre

From david at ar.media.kyoto-u.ac.jp  Thu Feb 12 06:20:29 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Thu, 12 Feb 2009 20:20:29 +0900
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <49940291.4050906@googlemail.com>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp> <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com> <49940291.4050906@googlemail.com>
Message-ID: <499405FD.60602@ar.media.kyoto-u.ac.jp>

Gregor Thalhammer wrote:
> Recent Matlab versions use Intel's Math Kernel Library, which performs
> automatic multi-threading - also for mathematical functions like sin
> etc., but not for addition, multiplication etc.

It does if you have access to the parallel toolbox I mentioned earlier
in this thread (again, no experience with it, but I think it is
especially popular on clusters; in that case, though, it is not limited
to thread-based implementations).

David

From strawman at astraw.com  Thu Feb 12 06:39:59 2009
From: strawman at astraw.com (Andrew Straw)
Date: Thu, 12 Feb 2009 03:39:59 -0800
Subject: [Numpy-discussion] [SciPy-user] Numpy 1.2.1 and Scipy 0.7.0; Ubuntu packages
In-Reply-To: <499402BF.60500@ar.media.kyoto-u.ac.jp>
References: <5b8d13220902110446x82e25a9ifb11bb563e468313@mail.gmail.com> <5b8d13220902111817q9bd42c0r8c42f6307ca1b318@mail.gmail.com> <4993D8A6.5080107@astraw.com> <4993D6E1.5080803@ar.media.kyoto-u.ac.jp> <4993FF9D.2010805@astraw.com> <499402BF.60500@ar.media.kyoto-u.ac.jp>
Message-ID: <49940A8F.4080008@astraw.com>

OK, I think you're concerned about compatibility of Python extensions
using fortran.  We don't use any (that I know of), so I'm going to stop
worrying about this and upload .debs from your .dsc (or very close) to
my repository...

...except for one last question: if Hardy uses the g77 ABI but I'm
building scipy with gfortran, shouldn't there be an ABI issue with my
ATLAS?  Shouldn't I get lots of test failures with scipy?  I don't.

David Cournapeau wrote:
> My main rationale for providing a PPA is to avoid the never-ending
> queue of emails about missing symbols, etc... because I am tired of
> it, because it leaves a bad taste in users' mouths, and because I
> would like to deal with more interesting issues.

Well, I appreciate you doing that.  Packaging is a thankless job... and
when everything works on your own computer, it's hard to work up the
motivation to make it work on others'.

>> To understand your statement about identical, I will operationally
>> define "identical" for .debs to mean that they were built from the
>> same .dsc.
>
> I have an even broader definition of identical: the same
> control/rules/patches is identical for me.

This is the only other point I want to make: if you're building from
the same .dsc, it means you're using the same control/rules/patches.
(That's why I brought up the checksums and the signatures.)
From david at ar.media.kyoto-u.ac.jp  Thu Feb 12 06:29:55 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Thu, 12 Feb 2009 20:29:55 +0900
Subject: [Numpy-discussion] [SciPy-user] Numpy 1.2.1 and Scipy 0.7.0; Ubuntu packages
In-Reply-To: <49940A8F.4080008@astraw.com>
References: <5b8d13220902110446x82e25a9ifb11bb563e468313@mail.gmail.com> <5b8d13220902111817q9bd42c0r8c42f6307ca1b318@mail.gmail.com> <4993D8A6.5080107@astraw.com> <4993D6E1.5080803@ar.media.kyoto-u.ac.jp> <4993FF9D.2010805@astraw.com> <499402BF.60500@ar.media.kyoto-u.ac.jp> <49940A8F.4080008@astraw.com>
Message-ID: <49940833.2060501@ar.media.kyoto-u.ac.jp>

Andrew Straw wrote:
> OK, I think you're concerned about compatibility of Python extensions
> using fortran.  We don't use any (that I know of), so I'm going to
> stop worrying about this and upload .debs from your .dsc (or very
> close) to my repository...
>
> ...except for one last question: if Hardy uses the g77 ABI but I'm
> building scipy with gfortran, shouldn't there be an ABI issue with my
> ATLAS?  Shouldn't I get lots of test failures with scipy?  I don't.

Not for hardy, because Hardy has the transitional packages (e.g. it has
both gfortran and g77 ABI packages: libatlas3g* vs atlas3*).  But this
wouldn't work for earlier distributions.

> Well, I appreciate you doing that.  Packaging is a thankless job...

Reducing the number of support emails is enough of an incentive for
me :)

> This is the only other point I want to make: if you're building from
> the same .dsc, it means you're using the same control/rules/patches.
> (That's why I brought up the checksums and the signatures.)

Yes, the problem is not how to avoid different packages from the .dsc,
but how to conditionally define some sections in the control and rules
files,

David

From pav at iki.fi  Thu Feb 12 06:50:26 2009
From: pav at iki.fi (Pauli Virtanen)
Date: Thu, 12 Feb 2009 11:50:26 +0000 (UTC)
Subject: [Numpy-discussion] [SciPy-user] Numpy 1.2.1 and Scipy 0.7.0; Ubuntu packages
References: <5b8d13220902110446x82e25a9ifb11bb563e468313@mail.gmail.com> <5b8d13220902111817q9bd42c0r8c42f6307ca1b318@mail.gmail.com> <4993D8A6.5080107@astraw.com> <4993D6E1.5080803@ar.media.kyoto-u.ac.jp> <4993FF9D.2010805@astraw.com> <499402BF.60500@ar.media.kyoto-u.ac.jp> <49940A8F.4080008@astraw.com>
Message-ID: 

Thu, 12 Feb 2009 03:39:59 -0800, Andrew Straw wrote:
[clip]
> ...except for one last question: if Hardy uses the g77 ABI but I'm
> building scipy with gfortran, shouldn't there be an ABI issue with my
> ATLAS?  Shouldn't I get lots of test failures with scipy?  I don't.

On Debian Etch you get mysterious failures if you do this, see for
example:

http://scipy.org/scipy/scipy/ticket/815

So in general, avoiding mixing g77 and gfortran seems to be necessary
for evading problems.  Perhaps this depends on the specific g77 and
gfortran versions, but it's easier just not to go there.

-- 
Pauli Virtanen

From gregor.thalhammer at gmail.com  Thu Feb 12 06:58:57 2009
From: gregor.thalhammer at gmail.com (Gregor Thalhammer)
Date: Thu, 12 Feb 2009 12:58:57 +0100
Subject: [Numpy-discussion] numexpr and numpy windows binaries built with MKL
Message-ID: <49940F01.4050503@googlemail.com>

Hi all,

as Francesc announced, the latest release of Numexpr 1.2 can be built
with Intel's Math Kernel Library, which gives a big increase in
performance.  Now the questions: could somebody provide Windows
binaries of Numexpr, linked with Intel's MKL?  I know, there is the
license problem.
The same question has been discussed for numpy.  Is it still true that
Enthought is considering providing binaries based on MKL?  See
https://svn.enthought.com/epd/ticket/216

Gregor

From sturla at molden.no  Thu Feb 12 07:03:53 2009
From: sturla at molden.no (Sturla Molden)
Date: Thu, 12 Feb 2009 13:03:53 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <4993BE9B.9060709@ar.media.kyoto-u.ac.jp>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <20090212062636.GD30330@phare.normalesup.org> <4993BE9B.9060709@ar.media.kyoto-u.ac.jp>
Message-ID: <49941029.1030901@molden.no>

On 2/12/2009 7:15 AM, David Cournapeau wrote:
> Since openmp also exists on windows, I doubt that it is required that
> openmp uses pthread :)

On Windows, MSVC uses Win32 threads and GCC (Cygwin and MinGW) uses
pthreads.  If you use OpenMP with MinGW, the executable becomes
dependent on pthreadGC2.dll (an LGPL library from Redhat).

S.M.

From sturla at molden.no  Thu Feb 12 07:44:25 2009
From: sturla at molden.no (Sturla Molden)
Date: Thu, 12 Feb 2009 13:44:25 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <4993FA61.7010000@student.matnat.uio.no>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <4993FA61.7010000@student.matnat.uio.no>
Message-ID: <499419A9.4020105@molden.no>

On 2/12/2009 11:30 AM, Dag Sverre Seljebotn wrote:
> It would be interesting to see how a spec would look for integrating
> OpenMP natively into Cython for these kinds of purposes.  Cython is
> still flexible as a language, after all.  Avoiding language bloat is
> also important, but it is difficult to know what kind of balance can
> be struck before some kind of spec is worked out.  Has anyone managed
> to use OpenMP with Cython code in a nice way already?

Here is an example of SciPy's ckdtree.pyx modified to use OpenMP.  It
is basically twice as fast on my dual-core laptop.  But as Cython
mangles the names in the C code, I had to put the OpenMP part in a
separate C file.

OpenMP does not need to be a part of the Cython language.  It can be
special comments in the code, as in Fortran.  After all, "#pragma omp
parallel" is a comment in Cython.

Sturla Molden
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: parallel_queries.c
URL: 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: ckdtree.pyx
Type: /
Size: 22235 bytes
Desc: not available
URL: 

From faltet at pytables.org  Thu Feb 12 07:47:34 2009
From: faltet at pytables.org (Francesc Alted)
Date: Thu, 12 Feb 2009 13:47:34 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <49940952.6060208@student.matnat.uio.no>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <200902121216.20222.faltet@pytables.org> <49940952.6060208@student.matnat.uio.no>
Message-ID: <200902121347.35130.faltet@pytables.org>

A Thursday 12 February 2009, Dag Sverre Seljebotn escrigué:
> FYI, I am one of the core Cython developers and can make such
> modifications in Cython itself as long as there's consensus on how it
> should look on the Cython mailing list.  My problem is that I don't
> really know OpenMP and have little experience with it, so I'm not the
> best person for creating a draft of how such high-level OpenMP
> constructs should look in Cython.
I don't know OpenMP well enough either, but I'd say that on this list
there could be some people who could help.

At any rate, I really like the OpenMP approach and would much prefer to
have support for it in Cython over threading, MPI or whatever.  But the
thing is: is OpenMP stable and mature enough to allow using it on most
common platforms?  I think that recent GCC compilers support the latest
incarnation (3.0) of the standard, but IIRC, MSVC 2008 only supports
the OpenMP 2.5 specification.  I'm not sure how this would affect a
possible implementation in Cython, though.

Cheers,

-- 
Francesc Alted

From faltet at pytables.org  Thu Feb 12 07:50:44 2009
From: faltet at pytables.org (Francesc Alted)
Date: Thu, 12 Feb 2009 13:50:44 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <499419A9.4020105@molden.no>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <4993FA61.7010000@student.matnat.uio.no> <499419A9.4020105@molden.no>
Message-ID: <200902121350.45045.faltet@pytables.org>

A Thursday 12 February 2009, Sturla Molden escrigué:
> OpenMP does not need to be a part of the Cython language.  It can be
> special comments in the code, as in Fortran.  After all, "#pragma omp
> parallel" is a comment in Cython.

Hey!  That's very nice to know.  We already have OpenMP support in
Cython for free (or apparently it seems so :-)

-- 
Francesc Alted

From sturla at molden.no  Thu Feb 12 07:57:10 2009
From: sturla at molden.no (Sturla Molden)
Date: Thu, 12 Feb 2009 13:57:10 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <499405FD.60602@ar.media.kyoto-u.ac.jp>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp> <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com> <49940291.4050906@googlemail.com> <499405FD.60602@ar.media.kyoto-u.ac.jp>
Message-ID: <49941CA6.3090506@molden.no>

On 2/12/2009 12:20 PM, David Cournapeau wrote:
> It does if you have access to the parallel toolbox I mentioned earlier
> in this thread (again, no experience with it, but I think it is
> especially popular on clusters; in that case, though, it is not
> limited to thread-based implementations).

As has been mentioned, Matlab is a safer language for parallel
computing as arrays are immutable.  There is almost no need for
synchronization of any sort, except barriers.

Maybe it is time to implement an immutable ndarray subclass?

With immutable arrays we can also avoid making temporary arrays in
expressions like y = a*b + c.  y just gets an expression and three
immutable buffers.  And then numexpr (or something like it) can take
care of the rest.

As for Matlab, I have noticed that they are experimenting with CUDA
now, to use nvidia's processors for hardware acceleration.  As even
modest GPUs can yield hundreds of gigaflops, that is going to be hard
to match (unless we make an ndarray that uses the GPU).  But again, as
the performance of GPUs comes from massive multithreading, immutability
may be the key here as well.
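A read-only view already gives a crude approximation of the idea (a
minimal sketch only; a real immutable subclass would also have to hide
the writable base array):

import numpy as np

def frozen(a):
    # a read-only view that worker threads could share without locks,
    # though the base array itself can still be mutated
    v = a.view()
    v.flags.writeable = False
    return v

x = frozen(np.arange(5))
x[0] = 1    # raises an exception: the view is read-only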
Sturla Molden

From david at ar.media.kyoto-u.ac.jp  Thu Feb 12 07:48:58 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Thu, 12 Feb 2009 21:48:58 +0900
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <49941CA6.3090506@molden.no>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp> <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com> <49940291.4050906@googlemail.com> <499405FD.60602@ar.media.kyoto-u.ac.jp> <49941CA6.3090506@molden.no>
Message-ID: <49941ABA.1040509@ar.media.kyoto-u.ac.jp>

Sturla Molden wrote:
> As has been mentioned, Matlab is a safer language for parallel
> computing as arrays are immutable.  There is almost no need for
> synchronization of any sort, except barriers.
>
> Maybe it is time to implement an immutable ndarray subclass?

Is it possible at all?  I mean an immutable array as a subclass of a
mutable array?  I too would like some immutable arrays, but this sounds
like an enormous task (as much as I like numpy, I find numpy sometimes
harder to use than matlab for some problems because of the immutability
issue - may be my own limitation, though),

> As for Matlab, I have noticed that they are experimenting with CUDA
> now, to use nvidia's processors for hardware acceleration.

Labview too; there was a small booth from NI at the last ICASSP, and
they were demoing some GPU based operations (mostly blas/lapack/fft,
from what I understood - the easy stuff).

cheers,

David

From sturla at molden.no  Thu Feb 12 08:11:43 2009
From: sturla at molden.no (Sturla Molden)
Date: Thu, 12 Feb 2009 14:11:43 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <200902121350.45045.faltet@pytables.org>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <4993FA61.7010000@student.matnat.uio.no> <499419A9.4020105@molden.no> <200902121350.45045.faltet@pytables.org>
Message-ID: <4994200F.4010200@molden.no>

On 2/12/2009 1:50 PM, Francesc Alted wrote:
> Hey!  That's very nice to know.  We already have OpenMP support in
> Cython for free (or apparently it seems so :-)

No we don't, as variable names are different in C and Cython.  But
adding support for OpenMP would not bloat the Cython language.  Cython
must exchange the variable names and leave the comment in the C code as
a pragma for the C compiler.

IMHO, OpenMP is the easiest way to use parallel computing in scientific
projects.  It is much easier than manually fiddling with threads,
processes, MPI, etc.  Just write code as you normally would, debug and
verify.  Then you spend five minutes inserting pragmas for the
compiler, and et voilà, you have a parallel program.  The same code
will then compile and run correctly for parallel or sequential
execution, depending on a compiler switch (-fopenmp).  You get load
balancing for free, as that is built into OpenMP.  OpenMP's API is so
small that it just takes 10 minutes to learn.

OpenMP currently works on SMPs (e.g. multicore CPUs), but there is work
going on to port it to clusters as well.

S.M.
From david at ar.media.kyoto-u.ac.jp Thu Feb 12 08:01:43 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Thu, 12 Feb 2009 22:01:43 +0900
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <200902121347.35130.faltet@pytables.org>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <200902121216.20222.faltet@pytables.org> <49940952.6060208@student.matnat.uio.no> <200902121347.35130.faltet@pytables.org>
Message-ID: <49941DB7.9020402@ar.media.kyoto-u.ac.jp>

Francesc Alted wrote:
> I don't know OpenMP well enough either, but I'd say that on this list
> there could be some people who could help.
>
> At any rate, I really like the OpenMP approach and would much prefer to
> have support for it in Cython rather than threading, MPI or whatever.
> But the thing is: is OpenMP stable and mature enough to allow using it
> on most common platforms? I think that recent GCC compilers support
> the latest incarnation (3.0) of the standard

The yet unreleased gcc 4.4 will support 3.0, and gcc 4.3 supports a subset of 3.0, I believe:

http://openmp.org/wp/2008/11/openmp-30-status/

cheers,

David

From michael.abshoff at googlemail.com Thu Feb 12 08:19:42 2009
From: michael.abshoff at googlemail.com (Michael Abshoff)
Date: Thu, 12 Feb 2009 05:19:42 -0800
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <49941CA6.3090506@molden.no>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp> <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com> <49940291.4050906@googlemail.com> <499405FD.60602@ar.media.kyoto-u.ac.jp> <49941CA6.3090506@molden.no>
Message-ID: <499421EE.60208@gmail.com>

Sturla Molden wrote:
> On 2/12/2009 12:20 PM, David Cournapeau wrote:

Hi,

>> It does if you have access to the parallel toolbox I mentioned earlier
>> in this thread (again, no experience with it, but I think it is
>> specially popular on clusters; in that case, though, it is not limited
>> to thread-based implementation).
>
> As has been mentioned, Matlab is a safer language for parallel computing
> as arrays are immutable. There is almost no need for synchronization
> of any sort, except barriers.
>
> Maybe it is time to implement an immutable ndarray subclass?
>
> With immutable arrays we can also avoid making temporary arrays in
> expressions like y = a*b + c. y just gets an expression and three
> immutable buffers. And then numexpr (or something like it) can take care
> of the rest.
>
> As for Matlab, I have noticed that they are experimenting with CUDA now,
> to use nvidia's processors for hardware-acceleration. As even modest
> GPUs can yield hundreds of gigaflops,

Not even close. The current generation peaks at around 1.2 TFlops single precision, 280 GFlops double precision for ATI's hardware. The main problem with those numbers is that the memory on the graphics card cannot feed the data fast enough into the GPU to achieve theoretical peak. So those hundreds of GFlops are pure marketing :)

So in reality you might get anywhere from 20% to 60% (if you are lucky) locally before accounting for transfers from main memory to GPU memory and so on. Given that recent Intel CPUs give you about 7 to 11 GFlops double precision per core, and libraries like ATLAS give you that performance today without the need to jump through hoops, these numbers start to look a lot less impressive. And Nvidia's numbers are lower than ATI's.
NVidia's programming solution is much more advanced and rounded out compared to ATI's, which is largely in closed beta. OpenCL is mostly vaporware at this point.

> that is going to be hard to match
> (unless we make an ndarray that uses the GPU). But again, as the
> performance of GPUs comes from massive multithreading, immutability may
> be the key here as well.

I have a K10 system with two Tesla C1060 GPUs to play with and have thought about adding CUDABlas support to Numpy/Scipy, but it hasn't been a priority for me. My main interest here is finite field arithmetic by making FFPack via LinBox use CUDABlas. If anyone wants an account to make numpy/scipy optionally use CUDABlas, feel free to ping me off list and I can set you up.

> Sturla Molden

Cheers,

Michael

> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From dagss at student.matnat.uio.no Thu Feb 12 08:43:12 2009
From: dagss at student.matnat.uio.no (Dag Sverre Seljebotn)
Date: Thu, 12 Feb 2009 14:43:12 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <4994200F.4010200@molden.no>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <4993FA61.7010000@student.matnat.uio.no> <499419A9.4020105@molden.no> <200902121350.45045.faltet@pytables.org> <4994200F.4010200@molden.no>
Message-ID: <49942770.5080103@student.matnat.uio.no>

Sturla Molden wrote:
> On 2/12/2009 1:50 PM, Francesc Alted wrote:
>> Hey! That's very nice to know. We already have OpenMP support in
>> Cython for free (or apparently it seems so :-)
>
> No we don't, as variable names are different in C and Cython. But
> adding support for OpenMP would not bloat the Cython language.

What I meant was "bloat the Cython compiler", i.e. Cython has to know about OpenMP and there must be OpenMP-specific code in the compiler. The grammar etc. wouldn't change. But I think I'll be able to lobby it in, given a good spec and some time to implement it (about a day of work, so I think this is likely to happen within a year).

As Cython must understand what is going on anyway, having things like "with cython.openmp.parallel" is actually a lot easier to implement in Cython than using comments/pragmas, and looks nicer too IMO. (It means that only code generation and not the parser must be modified.)

Dag Sverre

From matthieu.brucher at gmail.com Thu Feb 12 08:50:52 2009
From: matthieu.brucher at gmail.com (Matthieu Brucher)
Date: Thu, 12 Feb 2009 14:50:52 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <4993BDDD.9020404@ar.media.kyoto-u.ac.jp>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp>
Message-ID:

> I am curious: would you know what would be different in numpy's case
> compared to matlab array model concerning locks ? Matlab, up to
> recently, only spreads BLAS/LAPACK on multi-cores, but since matlab 7.3
> (or 7.4), it also uses multicore for mathematical functions (cos,
> etc...). So at least in matlab's model, it looks like it can be useful.
> I understand that numpy's model is more flexible (I don't think strided
> arrays can overlap in matlab for example, at least not from what you can
> see from the public API).
>
> cheers,

And the 'more flexible' is one of the biggest drawbacks.
It's one of the reasons Fortran is so fast compared to C: Fortran forbids pointer aliasing, so the compiler can optimize much more aggressively, while C has to assume pointers may alias and generate slower code. With Numpy, it is the same.

Matthieu

--
Information System Engineer, Ph.D.
Website: http://matthieu-brucher.developpez.com/
Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92
LinkedIn: http://www.linkedin.com/in/matthieubrucher

From faltet at pytables.org Thu Feb 12 08:51:42 2009
From: faltet at pytables.org (Francesc Alted)
Date: Thu, 12 Feb 2009 14:51:42 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <4994200F.4010200@molden.no>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <200902121350.45045.faltet@pytables.org> <4994200F.4010200@molden.no>
Message-ID: <200902121451.43160.faltet@pytables.org>

On Thursday 12 February 2009, Sturla Molden wrote:
> On 2/12/2009 1:50 PM, Francesc Alted wrote:
> > Hey! That's very nice to know. We already have OpenMP support in
> > Cython for free (or apparently it seems so :-)
>
> No we don't, as variable names are different in C and Cython. But
> adding support for OpenMP would not bloat the Cython language.
>
> Cython must exchange the variable names and leave the comment in C as
> a pragma to the C compiler.

Ah, I see. I've just had a look at the code and seen this:

#pragma omp parallel for private(c) schedule(guided) num_threads(nthreads)

and I thought that the support was there. But obviously name mangling is the problem.

> IMHO, OpenMP is the easiest way to use parallel computing in
> scientific projects. It is much easier than manually fiddling with
> threads, processes, MPI, etc. Just write code as you normally would,
> debug and verify. Then you spend five minutes inserting pragmas to
> the compiler, and et voilà, you have a parallel program. The same code
> will then compile and run correctly for parallel or sequential
> execution, depending on a compiler switch (-fopenmp). You get load
> balancing for free, as that is built into OpenMP. OpenMP's API is so
> small that it just takes 10 minutes to learn.

Well, ten minutes if you are *already* used to parallelism ;-) But I agree, it is, by far, the easiest path to it.

> OpenMP currently works on SMPs (e.g. multicore CPUs), but there is
> work going on to port it to clusters as well.

That's good to know.

--
Francesc Alted

From matthieu.brucher at gmail.com Thu Feb 12 08:48:44 2009
From: matthieu.brucher at gmail.com (Matthieu Brucher)
Date: Thu, 12 Feb 2009 14:48:44 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <20090212062636.GD30330@phare.normalesup.org>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <20090212062636.GD30330@phare.normalesup.org>
Message-ID:

Yes, it is. You have to link against pthread (at least on Linux ;)). You have to write a single parallel region if you don't want this overhead (which is not possible with Python).

Matthieu

2009/2/12 Gael Varoquaux :
> On Wed, Feb 11, 2009 at 11:52:40PM -0600, Robert Kern wrote:
>> > These seem like pretty heavy solutions though.
>
>> From a programmer's perspective, it seems to me like OpenMP is a much
>> lighter weight solution than pthreads.
>
> From a programmer's perspective, because, IMHO, openmp is implemented
> using pthreads. I do have difficulties verifying this on the web, but
> documents I find hint to that.
>
> Gaël
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

--
Information System Engineer, Ph.D.
Website: http://matthieu-brucher.developpez.com/
Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92
LinkedIn: http://www.linkedin.com/in/matthieubrucher

From matthieu.brucher at gmail.com Thu Feb 12 08:52:11 2009
From: matthieu.brucher at gmail.com (Matthieu Brucher)
Date: Thu, 12 Feb 2009 14:52:11 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <4993C6A8.6030808@ar.media.kyoto-u.ac.jp>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp> <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com> <4993C6A8.6030808@ar.media.kyoto-u.ac.jp>
Message-ID:

> No - I have never seen deep explanation of the matlab model. The C api
> is so small that it is hard to deduce anything from it (except that the
> memory handling is not ref-counting-based, I don't know if it matters
> for our discussion of speeding up ufunc). I would guess that since two
> arrays cannot share data (COW-based), lock handling may be easier to
> deal with ? I am not really familiar with multi-thread programming (my
> only limited experience is for soft real-time programming for audio
> processing, where the issues are totally different, since latency
> matters as much if not more than throughput).

It's not even a matter of multithreaded programming; the same issue can arise in mono-core programming.

>> True, but I would be happy to just have a fast C based threadpool
>> implementation I could use in low level Cython based loops.
>
> Matlab has a parallel toolbox to do this kind of things in matlab (I
> don't know in C). I don't know anything about it, nor do I know if that
> can be applied in any way to python/numpy's case:
>
> http://www.mathworks.com/products/parallel-computing/
>
> cheers,
>
> David
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

--
Information System Engineer, Ph.D.
Website: http://matthieu-brucher.developpez.com/
Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92
LinkedIn: http://www.linkedin.com/in/matthieubrucher

From matthieu.brucher at gmail.com Thu Feb 12 08:58:40 2009
From: matthieu.brucher at gmail.com (Matthieu Brucher)
Date: Thu, 12 Feb 2009 14:58:40 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <4994200F.4010200@molden.no>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <4993FA61.7010000@student.matnat.uio.no> <499419A9.4020105@molden.no> <200902121350.45045.faltet@pytables.org> <4994200F.4010200@molden.no>
Message-ID:

2009/2/12 Sturla Molden :
> On 2/12/2009 1:50 PM, Francesc Alted wrote:
>
>> Hey! That's very nice to know. We already have OpenMP support in
>> Cython for free (or apparently it seems so :-)
>
> No we don't, as variable names are different in C and Cython. But
> adding support for OpenMP would not bloat the Cython language.
>
> Cython must exchange the variable names and leave the comment in C as a
> pragma to the C compiler.
>
> IMHO, OpenMP is the easiest way to use parallel computing in scientific
> projects.
> It is much easier than manually fiddling with threads,
> processes, MPI, etc. Just write code as you normally would, debug and
> verify. Then you spend five minutes inserting pragmas to the compiler,
> and et voilà, you have a parallel program. The same code will then
> compile and run correctly for parallel or sequential execution,
> depending on a compiler switch (-fopenmp). You get load balancing for
> free, as that is built into OpenMP. OpenMP's API is so small that it
> just takes 10 minutes to learn.
>
> OpenMP currently works on SMPs (e.g. multicore CPUs), but there is work
> going on to port it to clusters as well.

For clusters, I think OpenMP will never be used. Besides, OpenMP doesn't even support an unsigned int as the parallel loop variable.

Matthieu

--
Information System Engineer, Ph.D.
Website: http://matthieu-brucher.developpez.com/
Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92
LinkedIn: http://www.linkedin.com/in/matthieubrucher

From matthieu.brucher at gmail.com Thu Feb 12 08:55:20 2009
From: matthieu.brucher at gmail.com (Matthieu Brucher)
Date: Thu, 12 Feb 2009 14:55:20 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <49940291.4050906@googlemail.com>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp> <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com> <49940291.4050906@googlemail.com>
Message-ID:

2009/2/12 Gregor Thalhammer :
> Brian Granger wrote:
>>> I am curious: would you know what would be different in numpy's case
>>> compared to matlab array model concerning locks ? Matlab, up to
>>> recently, only spreads BLAS/LAPACK on multi-cores, but since matlab 7.3
>>> (or 7.4), it also uses multicore for mathematical functions (cos,
>>> etc...). So at least in matlab's model, it looks like it can be useful.
>>
>> Good point. Is it possible to tell what array size it switches over
>> to using multiple threads? Also, do you happen to know how Matlab is
>> doing this?
>
> Recent Matlab versions use Intel's Math Kernel Library, which performs
> automatic multi-threading - also for mathematical functions like sin
> etc, but not for addition, multiplication etc. It seems to me Matlab
> itself does not take care of multi-threading. On
> http://www.intel.com/software/products/mkl/data/vml/functions/_listfunc.html
> you can have a look at the performance data of the MKL vectorized math
> functions. Around a vector length between 100-1000, depending on which
> function, precision, cpu architecture, they switch to multi-threading.

For BLAS level 3, the MKL is parallelized (so matrix multiplication is).

Matthieu

--
Information System Engineer, Ph.D.
Website: http://matthieu-brucher.developpez.com/
Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92
LinkedIn: http://www.linkedin.com/in/matthieubrucher

From david at ar.media.kyoto-u.ac.jp Thu Feb 12 09:02:12 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Thu, 12 Feb 2009 23:02:12 +0900
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To:
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp> <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com> <4993C6A8.6030808@ar.media.kyoto-u.ac.jp>
Message-ID: <49942BE4.3080706@ar.media.kyoto-u.ac.jp>

Matthieu Brucher wrote:
>> No - I have never seen deep explanation of the matlab model. The C api
>> is so small that it is hard to deduce anything from it (except that the
>> memory handling is not ref-counting-based, I don't know if it matters
>> for our discussion of speeding up ufunc). I would guess that since two
>> arrays cannot share data (COW-based), lock handling may be easier to
>> deal with ? I am not really familiar with multi-thread programming (my
>> only limited experience is for soft real-time programming for audio
>> processing, where the issues are totally different, since latency
>> matters as much if not more than throughput).
>
> It's not even a matter of multithreaded programming; the same issue can
> arise in mono-core programming.

Which issue?

David

From bernhard.voigt at gmail.com Thu Feb 12 09:20:03 2009
From: bernhard.voigt at gmail.com (bernhard.voigt at gmail.com)
Date: Thu, 12 Feb 2009 06:20:03 -0800 (PST)
Subject: [Numpy-discussion] Outer join ?
In-Reply-To: <7cc4bc500902112124s6a00d59fw78ea94f71ae4bb82@mail.gmail.com>
References: <7cc4bc500902112124s6a00d59fw78ea94f71ae4bb82@mail.gmail.com>
Message-ID:

You might consider the groupby from the itertools module.

Do you have two keys only? I would prefer grouping on the first column. For groupby you need to sort the array by the first column then.

import numpy
from itertools import groupby

a.sort(order='col1')

# target array: first col holds the unique dates, second col the values
# for key1, third col the values for key2
data = numpy.zeros(len(numpy.unique(a['col1'])),
                   dtype=dict(names=['dates', 'key1', 'key2'],
                              formats=[long, float, float]))

for i, (date, items) in enumerate(groupby(a, lambda item: item['col1'])):
    data[i]['dates'] = date
    for col1, col2, col3 in items:
        data[i][col2] = col3

Hope this works! Bernhard

On Feb 12, 6:24 am, A B wrote:
> Hi,
>
> I have the following data structure:
>
> col1 | col2 | col3
>
> 20080101|key1|4
> 20080201|key1|6
> 20080301|key1|5
> 20080301|key2|3.4
> 20080601|key2|5.6
>
> For each key in the second column, I would like to create an array
> where for all unique values in the first column, there will be either
> a value or zero if there is no data available. Like so:
>
> # 20080101, 20080201, 20080301, 20080601
>
> key1 - 4, 6, 5, 0
> key2 - 0, 0, 3.4, 5.6
>
> Ideally, the results would end up in a 2d array.
>
> What's the most efficient way to accomplish this? Currently, I am
> getting a list of uniq col1 and col2 values into separate variables,
> then looping through each unique value in col2
>
> a = loadtxt(...)
>
> dates = unique(a[:]['col1'])
> keys = unique(a[:]['col2'])
>
> for key in keys:
>     b = a[where(a[:]['col2'] == key)]
>     ???
>
> Thanks in advance.
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From david at ar.media.kyoto-u.ac.jp Thu Feb 12 09:04:04 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Thu, 12 Feb 2009 23:04:04 +0900
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To:
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp> <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com> <49940291.4050906@googlemail.com>
Message-ID: <49942C54.7000605@ar.media.kyoto-u.ac.jp>

Matthieu Brucher wrote:
>
> For BLAS level 3, the MKL is parallelized (so matrix multiplication is).

Same for ATLAS: thread support is one focus in the 3.9 series, currently in development. I have never used it, I don't know how it compares to the MKL,

David

From matthieu.brucher at gmail.com Thu Feb 12 09:24:46 2009
From: matthieu.brucher at gmail.com (Matthieu Brucher)
Date: Thu, 12 Feb 2009 15:24:46 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <49942BE4.3080706@ar.media.kyoto-u.ac.jp>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp> <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com> <4993C6A8.6030808@ar.media.kyoto-u.ac.jp> <49942BE4.3080706@ar.media.kyoto-u.ac.jp>
Message-ID:

2009/2/12 David Cournapeau :
> Matthieu Brucher wrote:
>>> No - I have never seen deep explanation of the matlab model. The C api
>>> is so small that it is hard to deduce anything from it (except that the
>>> memory handling is not ref-counting-based, I don't know if it matters
>>> for our discussion of speeding up ufunc). I would guess that since two
>>> arrays cannot share data (COW-based), lock handling may be easier to
>>> deal with ? I am not really familiar with multi-thread programming (my
>>> only limited experience is for soft real-time programming for audio
>>> processing, where the issues are totally different, since latency
>>> matters as much if not more than throughput).
>>
>> It's not even a matter of multithreaded programming; the same issue can
>> arise in mono-core programming.
>
> Which issue?

Sorry, I was referring to my last mail, but I sent so many in 5 minutes ;)

In C, if you have two arrays (two pointers), the compiler can't make aggressive optimizations because they may intersect. With Fortran, this is not possible. In this matter, Numpy behaves like C (everyone heard about the different a[indices] += a[other_intersecting_indices] issues), Matlab is more like Fortran.

Matthieu

--
Information System Engineer, Ph.D.
Website: http://matthieu-brucher.developpez.com/
Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92
LinkedIn: http://www.linkedin.com/in/matthieubrucher

From sturla at molden.no Thu Feb 12 09:27:51 2009
From: sturla at molden.no (Sturla Molden)
Date: Thu, 12 Feb 2009 15:27:51 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <49940952.6060208@student.matnat.uio.no>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <4993FA61.7010000@student.matnat.uio.no> <200902121216.20222.faltet@pytables.org> <49940952.6060208@student.matnat.uio.no>
Message-ID: <499431E7.7000501@molden.no>

On 2/12/2009 12:34 PM, Dag Sverre Seljebotn wrote:
> FYI, I am one of the core Cython developers and can make such
> modifications in Cython itself as long as there's consensus on how it
> should look on the Cython mailing list. My problem is that I don't
> really know OpenMP and have little experience with it, so I'm not the
> best person for creating a draft for how such high-level OpenMP
> constructs should look like in Cython.

I don't know the Cython internals, but I do know OpenMP. I mostly use it with Fortran.

The question is: Should OpenMP be comments in the Cython code (as they are in C and Fortran), or should OpenMP be special objects?

As for the GIL: No, I don't think nogil should be implied. But Python objects should only be allowed as shared variables. Synchronization will then be as usual for shared variables in OpenMP (#pragma omp critical).

Here is my suggestion for syntax. If you just follow a consistent translation scheme, you don't need to know OpenMP in detail. Here is a suggestion:

with openmp('parallel for', argument=iterable, ...):
    --> insert pragma directly above for

with openmp(directive, argument=iterable, ...):
    --> insert pragma and brackets

with openmp('atomic'):
    --> insert pragma directly

openmp('barrier')
    --> insert pragma directly

This by the way covers all of OpenMP. This is how it should translate:

with openmp('parallel for', private=(i,), shared=(n,), schedule='dynamic'):
    for i in range(n):
        pass

Compiles to:

#pragma omp parallel for \
        private(i) \
        shared(n) \
        schedule(dynamic)
for (i=0; i<n; i++) {
    /* whatever */
}

With Python objects, the programmer must synchronize access:

with openmp('parallel for', shared=(pyobj,n), private=(i,)):
    for i in range(n):
        with openmp('critical'):
            pyobj += i

From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To:
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp> <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com> <4993C6A8.6030808@ar.media.kyoto-u.ac.jp> <49942BE4.3080706@ar.media.kyoto-u.ac.jp>
Message-ID: <49942F10.9010501@ar.media.kyoto-u.ac.jp>

Matthieu Brucher wrote:
>
> Sorry, I was referring to my last mail, but I sent so many in 5 minutes ;)
> In C, if you have two arrays (two pointers), the compiler can't make
> aggressive optimizations because they may intersect. With Fortran,
> this is not possible. In this matter, Numpy behaves like C (everyone
> heard about the different a[indices] += a[other_intersecting_indices]
> issues), Matlab is more like Fortran.

I think it is hard to know exactly, because we don't know how matlab is implemented. It is possible to handle special cases for non overlapping arrays in numpy, once you are in C; I believe there are many codepaths which we could optimize aggressively, using openMP, SSE, etc... It is just a lot of work to do so manually, so the real problem is how this can be handled as generally as possible using a small core.
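For reference, one facet of the fancy-indexing issue Matthieu mentions is easy to demonstrate at the prompt (a small sketch; note the repeated index):

import numpy as np

a = np.zeros(3)
a[[0, 0, 1]] += 1
print a  # [ 1.  1.  0.], not [ 2.  1.  0.]: the buffered += sees index 0 only once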
David

From sturla at molden.no Thu Feb 12 09:34:53 2009
From: sturla at molden.no (Sturla Molden)
Date: Thu, 12 Feb 2009 15:34:53 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <499419A9.4020105@molden.no>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <4993FA61.7010000@student.matnat.uio.no> <499419A9.4020105@molden.no>
Message-ID: <4994338D.7050003@molden.no>

On 2/12/2009 1:44 PM, Sturla Molden wrote:
> Here is an example of SciPy's ckdtree.pyx modified to use OpenMP.

It seems I managed to post an erroneous C file. :(

S.M.

-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: parallel_queries.c
URL:

From michael.abshoff at googlemail.com Thu Feb 12 09:34:58 2009
From: michael.abshoff at googlemail.com (Michael Abshoff)
Date: Thu, 12 Feb 2009 06:34:58 -0800
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <49942C54.7000605@ar.media.kyoto-u.ac.jp>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp> <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com> <49940291.4050906@googlemail.com> <49942C54.7000605@ar.media.kyoto-u.ac.jp>
Message-ID: <49943392.3070505@gmail.com>

David Cournapeau wrote:
> Matthieu Brucher wrote:
>> For BLAS level 3, the MKL is parallelized (so matrix multiplication is).

Hi David,

> Same for ATLAS: thread support is one focus in the 3.9 series, currently
> in development.

ATLAS has had thread support for a long, long time. The 3.9 series just improves it substantially by using affinity when available and removes some long standing issues with allocation performance that one had to work around before by setting some defines at compile time.

> I have never used it, I don't know how it compares to the
> MKL,

It does compare quite well and is more or less on par with the latest MKL releases in the 3.9 series. 3.8.2 is maybe 10% to 15% slower on i7 as well as Core2 cores than the MKL. One big advantage of ATLAS is that it tends to work when using it with numpy/scipy, unlike the Intel MKL, where one has to work around a bunch of oddities and jump through hoops to get it to work. It seems that Intel must rename at least one library in each release of the MKL to keep build system maintainers occupied :)

The big disadvantage of ATLAS is that Windows support is currently limited to 32 bits, but 3.9.5 and higher have SFU/SUA support, so 64 bit support is possible. Clint told me the main issue here was lack of access and it isn't too high on his priority list.
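For anyone who wants to check what their own numpy build links against and what it sustains, a quick sketch (the MFlops figure is only a rough estimate):

import time
import numpy as np

np.show_config()  # prints the BLAS/LAPACK (ATLAS, MKL, ...) numpy was built with

n = 1000
a = np.random.rand(n, n)
b = np.random.rand(n, n)
t0 = time.time()
c = np.dot(a, b)  # goes through the BLAS dgemm, threaded if the BLAS is
print "%.0f MFlops" % (2.0 * n ** 3 / (time.time() - t0) / 1e6)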
> David

Cheers,

Michael

> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From matthieu.brucher at gmail.com Thu Feb 12 10:03:49 2009
From: matthieu.brucher at gmail.com (Matthieu Brucher)
Date: Thu, 12 Feb 2009 16:03:49 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <49942F10.9010501@ar.media.kyoto-u.ac.jp>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp> <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com> <4993C6A8.6030808@ar.media.kyoto-u.ac.jp> <49942BE4.3080706@ar.media.kyoto-u.ac.jp> <49942F10.9010501@ar.media.kyoto-u.ac.jp>
Message-ID:

2009/2/12 David Cournapeau :
> Matthieu Brucher wrote:
>>
>> Sorry, I was referring to my last mail, but I sent so many in 5 minutes ;)
>> In C, if you have two arrays (two pointers), the compiler can't make
>> aggressive optimizations because they may intersect. With Fortran,
>> this is not possible. In this matter, Numpy behaves like C (everyone
>> heard about the different a[indices] += a[other_intersecting_indices]
>> issues), Matlab is more like Fortran.
>
> I think it is hard to know exactly, because we don't know how matlab is
> implemented. It is possible to handle special cases for non overlapping
> arrays in numpy, once you are in C; I believe there are many codepaths
> which we could optimize aggressively, using openMP, SSE, etc... It is
> just a lot of work to do so manually, so the real problem is how this
> can be handled as generally as possible using a small core.
>
> David

Indeed, Matlab may not benefit from this. At least, COW makes it possible to optimize aggressively. Then, it's a matter of implementing the loops in C or Fortran (or of using C99 extensions and relying on the compiler). In C89, you will have absolutely no benefit (because there is no way you can tell the compiler that there is no aliasing); in Fortran, it will be optimized correctly.

The problem with optimized codepaths is that you have to detect them. On the other hand, if you implement a numpy compliant array that explicitly forbids aliasing (you could add some additional tests to ensure that the parents are different, or stuff like that), you can make the compiler optimize the code better.

Matthieu

--
Information System Engineer, Ph.D.
Website: http://matthieu-brucher.developpez.com/
Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92
LinkedIn: http://www.linkedin.com/in/matthieubrucher

From wnbell at gmail.com Thu Feb 12 10:05:36 2009
From: wnbell at gmail.com (Nathan Bell)
Date: Thu, 12 Feb 2009 10:05:36 -0500
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <499421EE.60208@gmail.com>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp> <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com> <49940291.4050906@googlemail.com> <499405FD.60602@ar.media.kyoto-u.ac.jp> <49941CA6.3090506@molden.no> <499421EE.60208@gmail.com>
Message-ID:

On Thu, Feb 12, 2009 at 8:19 AM, Michael Abshoff wrote:
>
> Not even close. The current generation peaks at around 1.2 TFlops single
> precision, 280 GFlops double precision for ATI's hardware.
> The main problem with those numbers is that the memory on the graphics
> card cannot feed the data fast enough into the GPU to achieve
> theoretical peak. So those hundreds of GFlops are pure marketing :)

If your application is memory bandwidth limited, then yes, you're not likely to see 100s of GFlops anytime soon. However, compute limited applications can and do achieve 100s of GFlops on GPUs. Basic operations like FFTs and (level 3) BLAS are compute limited, as are the following applications:

http://www.ks.uiuc.edu/Research/gpu/
http://www.dam.brown.edu/scicomp/scg-media/report_files/BrownSC-2008-27.pdf

> So in reality you might get anywhere from 20% to 60% (if you are lucky)
> locally before accounting for transfers from main memory to GPU memory
> and so on. Given that recent Intel CPUs give you about 7 to 11 GFlops
> double precision per core, and libraries like ATLAS give you that
> performance today without the need to jump through hoops, these numbers
> start to look a lot less impressive.

You neglect to mention that CPUs, which have roughly 1/10th the memory bandwidth of high-end GPUs, are memory bound on the very same problems. You will not see 7 to 11 GFlops on a memory bound CPU code for the same reason you argue that GPUs don't achieve 100s of GFlops on memory bound GPU codes.

In severely memory bound applications like sparse matrix-vector multiplication (i.e. A*x for sparse A) the best GPU performance you can expect is ~10 GFlops on the GPU and ~1 GFlop on the CPU (in double precision). We discuss this problem in the following tech report:

http://forums.nvidia.com/index.php?showtopic=83825

It's true that host<->device transfers can be a bottleneck. In many cases, the solution is to simply leave the data resident on the GPU. For instance, you could imagine a variant of ndarray that held a pointer to a device array. Of course this requires that the other expensive parts of your algorithm also execute on the GPU so you're not shuttling data over the PCIe bus all the time.

Full Disclosure: I'm a researcher at NVIDIA

--
Nathan Bell wnbell at gmail.com
http://graphics.cs.uiuc.edu/~wnbell/
From python6009 at gmail.com Thu Feb 12 10:52:36 2009
From: python6009 at gmail.com (A B)
Date: Thu, 12 Feb 2009 07:52:36 -0800
Subject: [Numpy-discussion] Outer join ?
In-Reply-To:
References: <7cc4bc500902112124s6a00d59fw78ea94f71ae4bb82@mail.gmail.com>
Message-ID: <7cc4bc500902120752w16b84d52h429c338c4652672d@mail.gmail.com>

This is probably more than I need but I will definitely keep it as a reference. Thank you.

On 2/12/09, bernhard.voigt at gmail.com wrote:
> You might consider the groupby from the itertools module.
>
> Do you have two keys only? I would prefer grouping on the first
> column. For groupby you need to sort the array by the first column
> then.
>
> import numpy
> from itertools import groupby
>
> a.sort(order='col1')
>
> # target array: first col holds the unique dates, second col the values
> # for key1, third col the values for key2
> data = numpy.zeros(len(numpy.unique(a['col1'])),
>                    dtype=dict(names=['dates', 'key1', 'key2'],
>                               formats=[long, float, float]))
>
> for i, (date, items) in enumerate(groupby(a, lambda item: item['col1'])):
>     data[i]['dates'] = date
>     for col1, col2, col3 in items:
>         data[i][col2] = col3
>
> Hope this works! Bernhard
>
> On Feb 12, 6:24 am, A B wrote:
>> Hi,
>>
>> I have the following data structure:
>>
>> col1 | col2 | col3
>>
>> 20080101|key1|4
>> 20080201|key1|6
>> 20080301|key1|5
>> 20080301|key2|3.4
>> 20080601|key2|5.6
>>
>> For each key in the second column, I would like to create an array
>> where for all unique values in the first column, there will be either
>> a value or zero if there is no data available. Like so:
>>
>> # 20080101, 20080201, 20080301, 20080601
>>
>> key1 - 4, 6, 5, 0
>> key2 - 0, 0, 3.4, 5.6
>>
>> Ideally, the results would end up in a 2d array.
>>
>> What's the most efficient way to accomplish this? Currently, I am
>> getting a list of uniq col1 and col2 values into separate variables,
>> then looping through each unique value in col2
>>
>> a = loadtxt(...)
>>
>> dates = unique(a[:]['col1'])
>> keys = unique(a[:]['col2'])
>>
>> for key in keys:
>>     b = a[where(a[:]['col2'] == key)]
>>     ???
>>
>> Thanks in advance.
>> ______________________

From michael.abshoff at googlemail.com Thu Feb 12 10:54:07 2009
From: michael.abshoff at googlemail.com (Michael Abshoff)
Date: Thu, 12 Feb 2009 07:54:07 -0800
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To:
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp> <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com> <49940291.4050906@googlemail.com> <499405FD.60602@ar.media.kyoto-u.ac.jp> <49941CA6.3090506@molden.no> <499421EE.60208@gmail.com>
Message-ID: <4994461F.5030302@gmail.com>

Nathan Bell wrote:
> On Thu, Feb 12, 2009 at 8:19 AM, Michael Abshoff wrote:

Hi,

>> Not even close. The current generation peaks at around 1.2 TFlops single
>> precision, 280 GFlops double precision for ATI's hardware. The main
>> problem with those numbers is that the memory on the graphics card
>> cannot feed the data fast enough into the GPU to achieve theoretical
>> peak. So those hundreds of GFlops are pure marketing :)
>
> If your application is memory bandwidth limited, then yes, you're not
> likely to see 100s of GFlops anytime soon. However, compute limited
> applications can and do achieve 100s of GFlops on GPUs.
> Basic operations like FFTs and (level 3) BLAS are compute limited, as
> are the following applications:
> http://www.ks.uiuc.edu/Research/gpu/
> http://www.dam.brown.edu/scicomp/scg-media/report_files/BrownSC-2008-27.pdf

Yes, certainly. But Sturla implied that some "random consumer GPU" (to put a negative spin on it :) could do the above. There also seems to be a huge expectation that "porting your code to the GPU" will make it 10 to 100 times faster. There are cases like that as mentioned above, but this only applies to a subset of problems. Another problem is RAM: for many datasets I work with, 512 to 1024 MB just isn't cutting it. This means Tesla cards at $1k upward, and all of a sudden we are playing a different game.

Nine months ago, when we started playing with CUDA, we took a MacBook Pro with a decent NVidia card and laughed hard after it became clear that its Core2 with either ATLAS or the Accelerate framework (which is more or less ATLAS for its BLAS bits) was faster than the built-in NVidia card with either single or double precision. Surely, this is a consumer level laptop GPU, but I did expect more.

>> So in reality you might get anywhere from 20% to 60% (if you are lucky)
>> locally before accounting for transfers from main memory to GPU memory
>> and so on. Given that recent Intel CPUs give you about 7 to 11 GFlops
>> double precision per core, and libraries like ATLAS give you that
>> performance today without the need to jump through hoops, these numbers
>> start to look a lot less impressive.
>
> You neglect to mention that CPUs, which have roughly 1/10th the memory
> bandwidth of high-end GPUs, are memory bound on the very same
> problems. You will not see 7 to 11 GFlops on a memory bound CPU code
> for the same reason you argue that GPUs don't achieve 100s of GFlops
> on memory bound GPU codes.

I am seeing 7 to 11 GFlops per core for matrix matrix multiplies on Intel CPUs using Strassen. And we did scale out linearly on 16 core Opterons as well as a 64 core Itanium box using ATLAS for BLAS level 3 matrix matrix multiply. When you have multiple GPUs you do not have shared memory architectures (AFAIK the 4 GPU boxen sold by NVidia have fast buses between the cards, but aren't ccNUMA or anything like that -- please correct me if I am wrong).

> In severely memory bound applications like sparse matrix-vector
> multiplication (i.e. A*x for sparse A) the best GPU performance you
> can expect is ~10 GFlops on the GPU and ~1 GFlop on the CPU (in double
> precision). We discuss this problem in the following tech report:
> http://forums.nvidia.com/index.php?showtopic=83825

Ok, I care about dense operations primarily, but it is interesting to see that the GPU fares well on sparse LA.

> It's true that host<->device transfers can be a bottleneck. In many
> cases, the solution is to simply leave the data resident on the GPU.

Well, that assumes you have enough memory locally for your working set. And if not, you need to be clever about caching, and I did not see any code in CUDA that takes care of that job for you. I have seen libraries like libflame that claim to do that for you, but I have not played with them yet.

> For instance, you could imagine a variant of ndarray that held a
> pointer to a device array. Of course this requires that the other
> expensive parts of your algorithm also execute on the GPU so you're
> not shuttling data over the PCIe bus all the time.

Absolutely.
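For what it's worth, PyCUDA's GPUArray is an existing experiment in exactly this direction: a numpy-like array whose data lives on the device. A minimal sketch, assuming PyCUDA and a CUDA-capable card are available:

import numpy as np
import pycuda.autoinit            # sets up a CUDA context
import pycuda.gpuarray as gpuarray

a = np.random.randn(4, 4).astype(np.float32)
a_gpu = gpuarray.to_gpu(a)        # host -> device copy
b_gpu = 2 * a_gpu                 # elementwise work happens on the GPU
print b_gpu.get()                 # data comes back over the bus only on request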
I think that GPUs can fill a large niche for scientific computations, but the GPU is not (yet?) the general purpose CPU it is sometimes made out to be.

> Full Disclosure: I'm a researcher at NVIDIA

Cool. Thanks for the links by the way. As I mentioned, we have bought Tesla hardware and are working on getting our code to use GPUs for numerical linear algebra, exact linear algebra and shortly also things like monte carlo simulation. I do think that the GPU is extremely useful for much of the above, but there are plenty of programming issues to resolve and a lot of infrastructure code to be written before GPU computing becomes ubiquitous. After the last new thing I had put my hope in (the Cell CPU) basically turned out to be a dud, I am hesitant about anything until the code I am running actually sees the benefit.

The thing with NVidia I am unhappy about is that CUDA is not free as in freedom. I am not an FSF zealot, so I will not try to convince anyone to make their software free. Given a choice between OpenCL and CUDA, you have the lead at the moment because you actually have been shipping a working product for more than a year, but I am not so sure that in the long term OpenCL won't get people's mindshare. If you look at the history of 3D acceleration, we started with numerous APIs that were all supplanted by OpenGL, which then got pushed aside by DirectX. Anyway, no point in ranting here any more ;)

Cheers,

Michael

From python6009 at gmail.com Thu Feb 12 11:19:05 2009
From: python6009 at gmail.com (A B)
Date: Thu, 12 Feb 2009 08:19:05 -0800
Subject: [Numpy-discussion] Outer join ?
In-Reply-To: <3d375d730902112140j6878d379x6078ddc0ef7e4d70@mail.gmail.com>
References: <7cc4bc500902112124s6a00d59fw78ea94f71ae4bb82@mail.gmail.com> <3d375d730902112140j6878d379x6078ddc0ef7e4d70@mail.gmail.com>
Message-ID: <7cc4bc500902120819r59e1747g3aaa94ab445ca751@mail.gmail.com>

On 2/11/09, Robert Kern wrote:
> On Wed, Feb 11, 2009 at 23:24, A B wrote:
>> Hi,
>>
>> I have the following data structure:
>>
>> col1 | col2 | col3
>>
>> 20080101|key1|4
>> 20080201|key1|6
>> 20080301|key1|5
>> 20080301|key2|3.4
>> 20080601|key2|5.6
>>
>> For each key in the second column, I would like to create an array
>> where for all unique values in the first column, there will be either
>> a value or zero if there is no data available. Like so:
>>
>> # 20080101, 20080201, 20080301, 20080601
>>
>> key1 - 4, 6, 5, 0
>> key2 - 0, 0, 3.4, 5.6
>>
>> Ideally, the results would end up in a 2d array.
>>
>> What's the most efficient way to accomplish this? Currently, I am
>> getting a list of uniq col1 and col2 values into separate variables,
>> then looping through each unique value in col2
>>
>> a = loadtxt(...)
>>
>> dates = unique(a[:]['col1'])
>> keys = unique(a[:]['col2'])
>>
>> for key in keys:
>>     b = a[where(a[:]['col2'] == key)]
>>     ???
>
> Take a look at setmember1d().
>
> --
> Robert Kern

Thanks. That's exactly what I need, but I'm not sure about the next step after I do setmember1d(dates, b['date']) and have the bool arr/mask ... How can I grow b to have 0 values for the missing keys?
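For reference, one way to finish the job with that mask -- a sketch assuming dates holds the sorted unique dates and b is sorted by date:

import numpy as np

dates = np.array([20080101, 20080201, 20080301, 20080601])
b_dates = np.array([20080301, 20080601])   # b['date'] for one key
b_vals = np.array([3.4, 5.6])              # the values for the same rows

row = np.zeros(len(dates))
mask = np.setmember1d(dates, b_dates)  # True where a date has data; in1d() in later numpy
row[mask] = b_vals
print row  # [ 0.   0.   3.4  5.6]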
From gael.varoquaux at normalesup.org Thu Feb 12 11:24:27 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Thu, 12 Feb 2009 17:24:27 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <499431E7.7000501@molden.no>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <4993FA61.7010000@student.matnat.uio.no> <200902121216.20222.faltet@pytables.org> <49940952.6060208@student.matnat.uio.no> <499431E7.7000501@molden.no>
Message-ID: <20090212162427.GC8194@phare.normalesup.org>

On Thu, Feb 12, 2009 at 03:27:51PM +0100, Sturla Molden wrote:
> The question is: Should OpenMP be comments in the Cython code (as they
> are in C and Fortran), or should OpenMP be special objects?

My two cents: go for cython objects/statements. Not only does code in comments look weird and hackish, it also means you have to hack the parser. Finally, cython can be used with a valid Python syntax (at least part of it) thanks to decorators giving the types. Ondrej Certik implemented it for sympy. Having pragmas in comments would make it hard to use the vanilla Python parser to read this code, or to do metaprogramming on the pragmas.

The logic of Python is pretty much that everything is accessible to the programmer, to be modified, e.g. for testing. Putting logic in comments breaks this.

I like the use of the with statement for this purpose.

Gaël

From rmay31 at gmail.com Thu Feb 12 11:36:04 2009
From: rmay31 at gmail.com (Ryan May)
Date: Thu, 12 Feb 2009 10:36:04 -0600
Subject: [Numpy-discussion] genloadtxt: dtype=None and unpack=True
In-Reply-To: <97251D70-38B3-4A41-B2EA-26A700C50452@gmail.com>
References: <97251D70-38B3-4A41-B2EA-26A700C50452@gmail.com>
Message-ID:

On Wed, Feb 11, 2009 at 10:47 PM, Pierre GM wrote:
>
> On Feb 11, 2009, at 11:38 PM, Ryan May wrote:
>
> > Pierre,
> >
> > I noticed that using dtype=None with a heterogeneous set of data,
> > trying to use unpack=True to get the columns into separate arrays
> > (instead of a structured array) doesn't work. I've attached a patch
> > that, in the case of dtype=None, unpacks the fields in the final
> > array into a list of separate arrays. Does this seem like a good
> > idea to you?
>
> Nope, as it breaks consistency: depending on some input parameters,
> you either get an array or a list. I think it's better to leave it as
> it is, maybe adding an extra line in the doc specifying that
> unpack=True doesn't do anything for structured arrays.

Ah, I hadn't thought of that. I was only thinking in terms of the behavior of unpacking on return, not in the actual returned object. You're right, it's a bad idea.

Ryan

--
Ryan May
Graduate Research Assistant
School of Meteorology
University of Oklahoma

From michael.s.gilbert at gmail.com Thu Feb 12 11:58:00 2009
From: michael.s.gilbert at gmail.com (Michael S. Gilbert)
Date: Thu, 12 Feb 2009 11:58:00 -0500
Subject: [Numpy-discussion] Purpose for bit-wise and'ing the initial mersenne twister key?
In-Reply-To: <20090206181844.9b2070ff.michael.s.gilbert@gmail.com>
References: <20090206162418.0d8dd677.michael.s.gilbert@gmail.com> <3d375d730902061325h190b1de4o572f2ca63fd34d63@mail.gmail.com> <20090206175723.45e986fb.michael.s.gilbert@gmail.com> <3d375d730902061456m64fdac20uaa601f02c7ebb3e7@mail.gmail.com> <20090206181844.9b2070ff.michael.s.gilbert@gmail.com>
Message-ID: <20090212115800.ad3049fa.michael.s.gilbert@gmail.com>
Gilbert" wrote: > BTW, there is a 64-bit version of the reference mersenne twister > implementation available [1]. I did some testing with this 64-bit implementation (mt19937-64). I've found that it is actually slower than the 32-bit reference (mt19937ar) on 64-bit systems (2.15s vs 2.25s to generate 100000000 ints). This is likely because it generates 64-bit long long ints instead of 32-bit long ints. However, it should be possible to break up each 64-bit int into two 32-bit ints, then the runtime would appear to be almost twice as fast. One other consideration to keep in mind is that the 64-bit version is not stream-compatible with the 32-bit implementation (you will get different sequences for the same input seed). Would it be worth it to implement this in numpy in order to get an almost 2x speedup on 64-bit machines? From sturla at molden.no Thu Feb 12 11:53:04 2009 From: sturla at molden.no (Sturla Molden) Date: Thu, 12 Feb 2009 17:53:04 +0100 Subject: [Numpy-discussion] Fast threading solution thoughts In-Reply-To: <20090212162427.GC8194@phare.normalesup.org> References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <4993FA61.7010000@student.matnat.uio.no> <200902121216.20222.faltet@pytables.org> <49940952.6060208@student.matnat.uio.no> <499431E7.7000501@molden.no> <20090212162427.GC8194@phare.normalesup.org> Message-ID: <499453F0.7050908@molden.no> On 2/12/2009 5:24 PM, Gael Varoquaux wrote: > My two cents: go for cython objects/statements. Not only does code in > comments looks weird and a hack, but also it means to you have to hack > the parser. I agree with this. Particularly because Cython uses intendation as syntax. With comments you would have to use 'end' tags like in Fortran: !$omp parallel do private(i) shared(n) do i = 1,n !$omp parallel do private(j) shared(i,n) do j = i, n ! whatever end do !$omp end parallel do end do !$omp end parallel do But this is Fortran. It is meant to look bad. :) > Finally cython can be used in a valid Python syntax (at least > part of it) thanks to decorators giving the types. I don't think OpenMP will ever be used from pure Python. But please, add a -openmp compiler swich in Cython. If it is not there, all openmp statements should be ignored and translate to nothing. S.M. From ralphkube at googlemail.com Thu Feb 12 12:21:03 2009 From: ralphkube at googlemail.com (Ralph Kube) Date: Thu, 12 Feb 2009 18:21:03 +0100 Subject: [Numpy-discussion] Integer cast problems Message-ID: <49945A7F.8020706@googlemail.com> Hi there, I have a little problem here with array indexing, hope you see the problem. I use the following loop to calculate some integrals import numpy as N from scipy.integrate import quad T = 1 dt = 0.005 L = 3 n = 2 ints = N.zeros([T/dt]) for t in N.arange(0, T, dt): a = quad(lambda x:-1*(1-4*(t**4))*N.exp(-t**4)*N.exp(-x**2)*N.cos(n*N.pi*(x-L)/(2*L)), -L, L)[0] ints[int(t/dt)] = a print t, N.int32(t/dt), t/dt, a, ints[int(t/dt)] The output from the print statement looks like: 0.14 28 28.0 2.52124867251e-16 2.52124867251e-16 0.145 28 29.0 2.03015199575e-16 2.03015199575e-16 0.15 30 30.0 2.40857836418e-16 2.40857836418e-16 0.155 31 31.0 2.52191011339e-16 2.52191011339e-16 The same happens on the ipython prompt: 0.145 * 0.005 = 28.999999999999996 N.int32(0.145 * 0.005) = 28 Any ideas how to deal with this? 
Cheers, Ralph

From rmay31 at gmail.com Thu Feb 12 12:35:05 2009
From: rmay31 at gmail.com (Ryan May)
Date: Thu, 12 Feb 2009 11:35:05 -0600
Subject: [Numpy-discussion] Integer cast problems
In-Reply-To: <49945A7F.8020706@googlemail.com>
References: <49945A7F.8020706@googlemail.com>
Message-ID:

On Thu, Feb 12, 2009 at 11:21 AM, Ralph Kube wrote:
> Hi there,
> I have a little problem here with array indexing, hope you see the problem.
> I use the following loop to calculate some integrals:
>
> import numpy as N
> from scipy.integrate import quad
> T = 1
> dt = 0.005
> L = 3
> n = 2
> ints = N.zeros([T/dt])
>
> for t in N.arange(0, T, dt):
>     a = quad(lambda x:
>         -1*(1-4*(t**4))*N.exp(-t**4)*N.exp(-x**2)*N.cos(n*N.pi*(x-L)/(2*L)),
>         -L, L)[0]
>     ints[int(t/dt)] = a
>     print t, N.int32(t/dt), t/dt, a, ints[int(t/dt)]
>
> The output from the print statement looks like:
>
> 0.14 28 28.0 2.52124867251e-16 2.52124867251e-16
> 0.145 28 29.0 2.03015199575e-16 2.03015199575e-16
> 0.15 30 30.0 2.40857836418e-16 2.40857836418e-16
> 0.155 31 31.0 2.52191011339e-16 2.52191011339e-16
>
> The same happens on the ipython prompt:
>
> 0.145 * 0.005 = 28.999999999999996
> N.int32(0.145 * 0.005) = 28
>
> Any ideas how to deal with this?

I'm assuming you mean 0.145 / 0.005 = 28.999999999999996

When you cast to an integer, it *truncates* the fractional part, and life with floating point says that what should be an exact result won't necessarily be exact. Try using N.around.

Ryan

--
Ryan May
Graduate Research Assistant
School of Meteorology
University of Oklahoma

From gregor.thalhammer at gmail.com Thu Feb 12 12:36:04 2009
From: gregor.thalhammer at gmail.com (Gregor Thalhammer)
Date: Thu, 12 Feb 2009 18:36:04 +0100
Subject: [Numpy-discussion] Integer cast problems
In-Reply-To: <49945A7F.8020706@googlemail.com>
References: <49945A7F.8020706@googlemail.com>
Message-ID: <49945E04.1020505@googlemail.com>

Ralph Kube wrote:
> Hi there,
> I have a little problem here with array indexing, hope you see the problem.
> I use the following loop to calculate some integrals
> ...
> 0.145 * 0.005 = 28.999999999999996
> N.int32(0.145 * 0.005) = 28

Conversion to int truncates, it doesn't round. Try

N.int32(0.145 / 0.005 + 0.5)

Gregor

From kwgoodman at gmail.com Thu Feb 12 12:36:14 2009
From: kwgoodman at gmail.com (Keith Goodman)
Date: Thu, 12 Feb 2009 09:36:14 -0800
Subject: [Numpy-discussion] Integer cast problems
In-Reply-To: <49945A7F.8020706@googlemail.com>
References: <49945A7F.8020706@googlemail.com>
Message-ID:

On Thu, Feb 12, 2009 at 9:21 AM, Ralph Kube wrote:
> The same happens on the ipython prompt:
>
> 0.145 * 0.005 = 28.999999999999996
> N.int32(0.145 * 0.005) = 28
>
> Any ideas how to deal with this?

Do you want the answer to be 29? N.int32 truncates. If you want to round instead, you could use the standard trick of adding 0.5:

>> np.int32(0.5 + 0.145 / 0.005)
29

or

>> np.round(0.145 / 0.005)
29.0

From robert.kern at gmail.com Thu Feb 12 14:18:26 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 12 Feb 2009 13:18:26 -0600
Subject: [Numpy-discussion] Purpose for bit-wise and'ing the initial mersenne twister key?
In-Reply-To: <20090212115800.ad3049fa.michael.s.gilbert@gmail.com>
References: <20090206162418.0d8dd677.michael.s.gilbert@gmail.com> <3d375d730902061325h190b1de4o572f2ca63fd34d63@mail.gmail.com> <20090206175723.45e986fb.michael.s.gilbert@gmail.com> <3d375d730902061456m64fdac20uaa601f02c7ebb3e7@mail.gmail.com> <20090206181844.9b2070ff.michael.s.gilbert@gmail.com> <20090212115800.ad3049fa.michael.s.gilbert@gmail.com>
Message-ID: <3d375d730902121118h529817f2y4444beb061e12e58@mail.gmail.com>

On Thu, Feb 12, 2009 at 10:58, Michael S. Gilbert wrote:
> On Fri, 6 Feb 2009 18:18:44 -0500 "Michael S. Gilbert" wrote:
>> BTW, there is a 64-bit version of the reference mersenne twister
>> implementation available [1].
>
> I did some testing with this 64-bit implementation (mt19937-64). I've
> found that it is actually slower than the 32-bit reference (mt19937ar)
> on 64-bit systems (2.15s vs 2.25s to generate 100000000 ints). This is
> likely because it generates 64-bit long long ints instead of 32-bit
> long ints. However, it should be possible to break up each 64-bit int
> into two 32-bit ints; then the runtime would appear to be almost twice
> as fast.

Why do you think that?

> One other consideration to keep in mind is that the 64-bit
> version is not stream-compatible with the 32-bit implementation (you
> will get different sequences for the same input seed).
>
> Would it be worth it to implement this in numpy in order to get an
> almost 2x speedup on 64-bit machines?

The incompatibility is a showstopper to replacing the PRNG on any platform.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From dagss at student.matnat.uio.no Thu Feb 12 14:32:02 2009
From: dagss at student.matnat.uio.no (Dag Sverre Seljebotn)
Date: Thu, 12 Feb 2009 20:32:02 +0100
Subject: [Numpy-discussion] Fast threading solution thoughts
In-Reply-To: <499431E7.7000501@molden.no>
References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <4993FA61.7010000@student.matnat.uio.no> <200902121216.20222.faltet@pytables.org> <49940952.6060208@student.matnat.uio.no> <499431E7.7000501@molden.no>
Message-ID: <49947932.3050608@student.matnat.uio.no>

Sturla Molden wrote:
> On 2/12/2009 12:34 PM, Dag Sverre Seljebotn wrote:
>> FYI, I am one of the core Cython developers and can make such
>> modifications in Cython itself as long as there's consensus on how it
>> should look on the Cython mailing list. My problem is that I don't
>> really know OpenMP and have little experience with it, so I'm not the
>> best person for creating a draft for how such high-level OpenMP
>> constructs should look like in Cython.
>
> I don't know the Cython internals, but I do know OpenMP. I mostly use it
> with Fortran.
>
> The question is: Should OpenMP be comments in the Cython code (as they
> are in C and Fortran), or should OpenMP be special objects?

Statements more or less like C and Fortran, but "disguised" as Python syntax rather than pragmas/comments.

> As for the GIL: No, I don't think nogil should be implied. But Python
> objects should only be allowed as shared variables. Synchronization will
> then be as usual for shared variables in OpenMP (#pragma omp critical).

Good point.

> Here is my suggestion for syntax. If you just follow a consistent
> translation scheme, you don't need to know OpenMP in detail.
Here is a suggestion:

> with openmp('parallel for', argument=iterable, ...):
>     --> insert pragma directly above for
>
> with openmp(directive, argument=iterable, ...):
>     --> insert pragma and brackets
>
> with openmp('atomic'): --> insert pragma directly
>
> openmp('barrier') --> insert pragma directly
>
> This by the way covers all of OpenMP. This is how it should translate:
>
> with openmp('parallel for', private=(i,), shared=(n,),

IMO there's a problem with using literal variable names here, because Python syntax implies that the value is passed. One shouldn't make syntax where private=(i,) is legal but private=(f(),) isn't. So I'd go for strings.

> schedule='dynamic'):
>
>     for i in range(n):
>         pass
>
> Compiles to:
>
> #pragma omp parallel for \
>     private(i) \
>     shared(n) \
>     schedule(dynamic)
> for(i=0; i<n; i++) {
>     /* whatever */
> }

Hmm... yes. Care would need to be taken though because Cython might in the future very well generate a "while" loop instead for such a statement under some circumstances, and that won't work with OpenMP. One should be careful with assuming what the C result of Cython code will be. That's why I proposed using an alternative construct which both does the OpenMP stuff and contains the for loop.

> With Python objects, the programmer must synchronize access:
>
> with openmp('parallel for', shared=(pyobj,n), private=(i,)):
>     for i in range(n):
>         with openmp('critical'):
>             pyobj += i

We already have the nogil stuff, so with e.g. "with openmp.critical:" instead we would leave the way open for checking correctness in this area. Anyway, thanks for the input. I've made it a ticket and might get back to it when I've got more time through a discussion on the Cython list. http://trac.cython.org/cython_trac/ticket/211 -- Dag Sverre

From dagss at student.matnat.uio.no Thu Feb 12 14:36:27 2009 From: dagss at student.matnat.uio.no (Dag Sverre Seljebotn) Date: Thu, 12 Feb 2009 20:36:27 +0100 Subject: [Numpy-discussion] Fast threading solution thoughts In-Reply-To: <49947932.3050608@student.matnat.uio.no> References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <4993FA61.7010000@student.matnat.uio.no> <200902121216.20222.faltet@pytables.org> <49940952.6060208@student.matnat.uio.no> <499431E7.7000501@molden.no> <49947932.3050608@student.matnat.uio.no> Message-ID: <49947A3B.6080404@student.matnat.uio.no> Dag Sverre Seljebotn wrote: > Hmm... yes. Care would need to be taken though because Cython might in > the future very well generate a "while" loop instead for such a > statement under some circumstances, and that won't work with OpenMP. One > should be careful with assuming what the C result of Cython code will be. > That's why I proposed using an alternative construct which both > does the OpenMP stuff and contains the for loop. As a matter of fact, the next Cython release might prepend all for-in-range-loops with an if-test under some circumstances, in order to better match Python looping semantics (if the loop isn't executed, the loop counter should never be written to -- in contrast with C). So this is already happening. OpenMP might need different loop semantics and so calls for a different construct.
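[To make the looping-semantics point concrete, a small pure-Python sketch (not from the thread, just an illustration): when a range is empty, Python never touches the loop variable, whereas a literal C translation assigns the start value before testing the condition. That mismatch is what the planned if-test guard avoids.]

i = 123
for i in range(0):
    pass
print i  # prints 123: Python leaves i alone when the body never runs
# A direct C translation, "for (i = 0; i < 0; i++) { ... }", would have
# written 0 into i before the first condition test.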
-- Dag Sverre From ellisonbg.net at gmail.com Thu Feb 12 14:40:53 2009 From: ellisonbg.net at gmail.com (Brian Granger) Date: Thu, 12 Feb 2009 11:40:53 -0800 Subject: [Numpy-discussion] Fast threading solution thoughts In-Reply-To: <200902121105.55893.faltet@pytables.org> References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <200902121105.55893.faltet@pytables.org> Message-ID: <6ce0ac130902121140v146e1eafme111ca59c3cadf7f@mail.gmail.com> > If your problem is evaluating vector expressions just like the above > (i.e. without using transcendental functions like sin, exp, etc...), > usually the bottleneck is on memory access, so using several threads is > simply not going to help you achieving better performance, but rather > the contrary (you have to deal with the additional thread overhead). > So, frankly, I would not waste more time trying to paralelize that. I had a feeling this would be the case, I just haven't been sure about what point this comes into play. I really need to do some tests to understand exactly how CPU load and memory bandwidth interplay in these situations. I have worked with GPUs before and often the reason the GPU is faster than the CPU is simply the higher memory bandwidth. > As an example, in the recent support of VML in numexpr we have disabled > the use of VML (as well as the OpenMP threading support that comes with > it) in cases like yours, where only additions and multiplications are > performed (these operations are very fast in modern processors, and the > sole bottleneck for this case is the memory bandwidth, as I've said). > However, in case of expressions containing operations like division or > transcendental functions, then VML activates automatically, and you can > make use of several cores if you want. So, if you are in this case, > and you have access to Intel MKL (the library that contains VML), you > may want to give numexpr a try. OK, this is very interesting indeed. I didn't know that numexpr has support for VML, which has openmp support. I will definitely have look at this. Thanks! Brian > HTH, > > -- > Francesc Alted > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From ellisonbg.net at gmail.com Thu Feb 12 14:43:31 2009 From: ellisonbg.net at gmail.com (Brian Granger) Date: Thu, 12 Feb 2009 11:43:31 -0800 Subject: [Numpy-discussion] Fast threading solution thoughts In-Reply-To: <49940291.4050906@googlemail.com> References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <3d375d730902112152le9c8533qf54d9c17f407f48b@mail.gmail.com> <4993BDDD.9020404@ar.media.kyoto-u.ac.jp> <6ce0ac130902112252k2e37021dledecf6d2603aec4b@mail.gmail.com> <49940291.4050906@googlemail.com> Message-ID: <6ce0ac130902121143t312964a7g87f9a9bbcba64670@mail.gmail.com> > Recent Matlab versions use Intels Math Kernel Library, which performs > automatic multi-threading - also for mathematical functions like sin > etc, but not for addition, multiplication etc. It seems to me Matlab > itself does not take care of multi-threading. On > http://www.intel.com/software/products/mkl/data/vml/functions/_listfunc.html > you can have a look at the performance data of the MKL vectorized math > functions. Around a vector length between 100-1000, depending on which > function, precision, cpu architecture, they switch to multi-threading. Fantastic, thanks for this pointer! 
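[A rough illustration of the numexpr route discussed above; this is a sketch assuming numexpr is installed, and whether VML and its threading actually kick in depends on how numexpr was built:]

import numpy as np
import numexpr as ne

a = np.random.rand(1000000)
b = np.random.rand(1000000)

# Evaluated chunk-wise by numexpr's virtual machine; in a VML-enabled
# build, the sin() call can be dispatched to MKL's vectorized math
# library, which may use several threads for large arrays.
c = ne.evaluate("sin(a) + 2.0*b")

# Plain numpy equivalent, which allocates temporaries for comparison:
c_ref = np.sin(a) + 2.0*b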
Brian

From ellisonbg.net at gmail.com Thu Feb 12 15:01:00 2009 From: ellisonbg.net at gmail.com (Brian Granger) Date: Thu, 12 Feb 2009 12:01:00 -0800 Subject: [Numpy-discussion] Fast threading solution thoughts In-Reply-To: <200902121347.35130.faltet@pytables.org> References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <200902121216.20222.faltet@pytables.org> <49940952.6060208@student.matnat.uio.no> <200902121347.35130.faltet@pytables.org> Message-ID: <6ce0ac130902121201y5608b47cx740ddbf4674c4533@mail.gmail.com> > At any rate, I really like the OpenMP approach and prefer to have > support for it in Cython much better than threading, MPI or whatever. > But the thing is: is OpenMP stable, mature enough for allow using it in > most of common platforms? I think that recent GCC compilers support > the latest incarnation (3.0) of the standard, but IIRC, MSVC 2008 only > supports OpenMP 2.5 specification. I'm not sure how this would affect > a possible implementation in Cython, tough. OpenMP has been around for a long time. But in other ways, I don't consider it to be completely stable and mature - mainly because there haven't been free compilers (like gcc) that support OpenMP for very long. For example, I am running OS X Leopard, which is back at gcc 4.0.1 - well before OpenMP support. But, this will hopefully change in the next few years as everyone moves to more recent versions of gcc. But, Cython could simply do a test of the compiler to see if it supports OpenMP and what version it supports. Brian

From michael.s.gilbert at gmail.com Thu Feb 12 15:17:17 2009 From: michael.s.gilbert at gmail.com (Michael S. Gilbert) Date: Thu, 12 Feb 2009 15:17:17 -0500 Subject: [Numpy-discussion] Purpose for bit-wise and'ing the initial mersenne twister key? In-Reply-To: <3d375d730902121118h529817f2y4444beb061e12e58@mail.gmail.com> References: <20090206162418.0d8dd677.michael.s.gilbert@gmail.com> <3d375d730902061325h190b1de4o572f2ca63fd34d63@mail.gmail.com> <20090206175723.45e986fb.michael.s.gilbert@gmail.com> <3d375d730902061456m64fdac20uaa601f02c7ebb3e7@mail.gmail.com> <20090206181844.9b2070ff.michael.s.gilbert@gmail.com> <20090212115800.ad3049fa.michael.s.gilbert@gmail.com> <3d375d730902121118h529817f2y4444beb061e12e58@mail.gmail.com> Message-ID: <20090212151717.cdacaa60.michael.s.gilbert@gmail.com> On Thu, 12 Feb 2009 13:18:26 -0600 Robert Kern wrote: > > I did some testing with this 64-bit implementation (mt19937-64). I've > > found that it is actually slower than the 32-bit reference (mt19937ar) > > on 64-bit systems (2.15s vs 2.25s to generate 100000000 ints). This is > > likely because it generates 64-bit long long ints instead of 32-bit > > long ints. However, it should be possible to break up each 64-bit int > > into two 32-bit ints, then the runtime would appear to be almost twice > > as fast. > > Why do you think that? You could also think of it the other way (in terms of generating 64-bit ints). Instead of generating two 32-bit ints and concatenating them for a 64-bit int, you can just directly generate the 64-bit int. Since the 64-bit int requires only slightly more time to generate than either of the 32-bit ints individually, an almost 2x speedup is achieved. > > One other consideration to keep in mind is that the 64-bit > > version is not stream-compatible with the 32-bit implementation (you > > will get different sequences for the same input seed).
> > > > Would it be worth it to implement this in numpy in order to get an > > almost 2x speedup on 64-bit machines? > > The incompatibility is a showstopper to replacing the PRNG on any platform. Why is stream-compatibility such a stringent requirement? It seems like this constraint majorly limits your ability to adopt new/better technologies and reduces your flexibility to make changes to your code as needed. What end-use applications require stream-compatibility? The only thing I can think of is verification/regression testing for numpy, but those test cases could be updated to account for the break in compatibility (and specifically using the reference implementation's expected output). Wouldn't documenting the change in behavior be sufficient? Regards, Mike

From ellisonbg.net at gmail.com Thu Feb 12 15:15:17 2009 From: ellisonbg.net at gmail.com (Brian Granger) Date: Thu, 12 Feb 2009 12:15:17 -0800 Subject: [Numpy-discussion] Fast threading solution thoughts In-Reply-To: <49947932.3050608@student.matnat.uio.no> References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <4993FA61.7010000@student.matnat.uio.no> <200902121216.20222.faltet@pytables.org> <49940952.6060208@student.matnat.uio.no> <499431E7.7000501@molden.no> <49947932.3050608@student.matnat.uio.no> Message-ID: <6ce0ac130902121215h7b699631g3e2bb27d8851fc7@mail.gmail.com> Wow, interesting thread. Thanks everyone for the ideas. A few more comments: GPUs/CUDA: * Even though there is a bottleneck between main memory and GPU memory, as Nathan mentioned, the much larger memory bandwidth on a GPU often makes GPUs great for memory bound computations...as long as you can leave your data on the GPU for most of the computation. In my case I can do this and this is something I am pursuing as well. OpenMP: I don't really like OpenMP (pragma?!?), but it would be very nice if Cython had optional support for OpenMP that didn't use comments. Other ideas: What I would really like is a nice, super fast *library* built on top of pthreads that made it possible to do OpenMP-like things in Cython, but without depending on having an OpenMP compiler. Basically a fancy, fast thread pool implementation in Cython. And a question: With the new Numpy support in Cython, does Cython release the GIL if it can when running through loops over numpy arrays? Does Cython call into the C API during these sections? Cheers, Brian

From dagss at student.matnat.uio.no Thu Feb 12 15:26:17 2009 From: dagss at student.matnat.uio.no (Dag Sverre Seljebotn) Date: Thu, 12 Feb 2009 21:26:17 +0100 (CET) Subject: [Numpy-discussion] Fast threading solution thoughts In-Reply-To: <6ce0ac130902121215h7b699631g3e2bb27d8851fc7@mail.gmail.com> References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <4993FA61.7010000@student.matnat.uio.no> <200902121216.20222.faltet@pytables.org> <49940952.6060208@student.matnat.uio.no> <499431E7.7000501@molden.no> <49947932.3050608@student.matnat.uio.no> <6ce0ac130902121215h7b699631g3e2bb27d8851fc7@mail.gmail.com> Message-ID: <2499d355612c71a44dbb74d3ef5e8dc3.squirrel@webmail.uio.no> Brian Granger wrote: > And a question: > > With the new Numpy support in Cython, does Cython release the GIL if > it can when running through loops over numpy arrays? Does > Cython call into the C API during these sections? You know, I thought of the exact same thing when reading your post. No, you need the GIL currently, but that's something I'd like to fix.
Ideally, it would be something like this:

cdef int i, s = 0, n = ...
cdef np.ndarray[int] arr = ...  # will require the GIL
with nogil:
    for i in range(n):
        s += arr[i]  # does not require GIL

The only Python operation needed on the last line is throwing an exception if one is out of bounds; but I can always insert code to reacquire the GIL in that exceptional case. (And in most cases like this the user will turn off bounds-checking anyway.) http://trac.cython.org/cython_trac/ticket/210 will contain progress on this. Dag Sverre
From robert.kern at gmail.com Thu Feb 12 15:32:02 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 12 Feb 2009 14:32:02 -0600 Subject: [Numpy-discussion] Purpose for bit-wise and'ing the initial mersenne twister key? In-Reply-To: <20090212151717.cdacaa60.michael.s.gilbert@gmail.com> References: <20090206162418.0d8dd677.michael.s.gilbert@gmail.com> <3d375d730902061325h190b1de4o572f2ca63fd34d63@mail.gmail.com> <20090206175723.45e986fb.michael.s.gilbert@gmail.com> <3d375d730902061456m64fdac20uaa601f02c7ebb3e7@mail.gmail.com> <20090206181844.9b2070ff.michael.s.gilbert@gmail.com> <20090212115800.ad3049fa.michael.s.gilbert@gmail.com> <3d375d730902121118h529817f2y4444beb061e12e58@mail.gmail.com> <20090212151717.cdacaa60.michael.s.gilbert@gmail.com> Message-ID: <3d375d730902121232r22dcb98ckab834a7771dc07fe@mail.gmail.com> On Thu, Feb 12, 2009 at 14:17, Michael S. Gilbert wrote: > On Thu, 12 Feb 2009 13:18:26 -0600 Robert Kern wrote: >> > I did some testing with this 64-bit implementation (mt19937-64). I've >> > found that it is actually slower than the 32-bit reference (mt19937ar) >> > on 64-bit systems (2.15s vs 2.25s to generate 100000000 ints). This is >> > likely because it generates 64-bit long long ints instead of 32-bit >> > long ints. However, it should be possible to break up each 64-bit int >> > into two 32-bit ints, then the runtime would appear to be almost twice >> > as fast. >> >> Why do you think that? > > You could also think of it the other way (in terms of generating 64-bit > ints). Instead of generating two 32-bit ints and concatenating them > for a 64-bit int, you can just directly generate the 64-bit int. Since > the 64-bit int requires only slightly more time to generate than either > of the 32-bit ints individually, an almost 2x speedup is achieved. I'll believe it when I see it. >> > One other consideration to keep in mind is that the 64-bit >> > version is not stream-compatible with the 32-bit implementation (you >> > will get different sequences for the same input seed). >> > >> > Would it be worth it to implement this in numpy in order to get an >> > almost 2x speedup on 64-bit machines? >> >> The incompatibility is a showstopper to replacing the PRNG on any platform. > > Why is stream-compatibility such a stringent requirement? It seems > like this constraint majorly limits your ability to adopt new/better > technologies and reduces your flexibility to make changes to your code > as needed. What end-use applications require stream-compatibility? The > only thing I can think of is verification/regression testing for numpy, > but those test cases could be updated to account for the break in > compatibility (and specifically using the reference implementation's > expected output). Wouldn't documenting the change in behavior be sufficient? Some people don't think so. People have asked for more stringent compatibility than we can already provide (i.e. replicability even in the face of bug fixes).
People use these as inputs to their scientific simulations. I'm not going to intentionally make their lives harder than that. Bruce Southey was working on exposing the innards a bit so that you could make use of a different core PRNG while reusing the numpy-specific stuff in RandomState. That would be the approach to apply different technologies. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From michael.s.gilbert at gmail.com Thu Feb 12 15:46:21 2009 From: michael.s.gilbert at gmail.com (Michael S. Gilbert) Date: Thu, 12 Feb 2009 15:46:21 -0500 Subject: [Numpy-discussion] Purpose for bit-wise and'ing the initial mersenne twister key? In-Reply-To: <3d375d730902121232r22dcb98ckab834a7771dc07fe@mail.gmail.com> References: <20090206162418.0d8dd677.michael.s.gilbert@gmail.com> <3d375d730902061325h190b1de4o572f2ca63fd34d63@mail.gmail.com> <20090206175723.45e986fb.michael.s.gilbert@gmail.com> <3d375d730902061456m64fdac20uaa601f02c7ebb3e7@mail.gmail.com> <20090206181844.9b2070ff.michael.s.gilbert@gmail.com> <20090212115800.ad3049fa.michael.s.gilbert@gmail.com> <3d375d730902121118h529817f2y4444beb061e12e58@mail.gmail.com> <20090212151717.cdacaa60.michael.s.gilbert@gmail.com> <3d375d730902121232r22dcb98ckab834a7771dc07fe@mail.gmail.com> Message-ID: <20090212154621.1f03d22b.michael.s.gilbert@gmail.com> On Thu, 12 Feb 2009 14:32:02 -0600 Robert Kern wrote: >> You could also think of it the other way (in terms of generating 64-bit >> ints). Instead of generating two 32-bit ints and concatenating them >> for a 64-bit int, you can just directly generate the 64-bit int. Since >> the 64-bit int requires only slightly more time to generate than either >> of the 32-bit ints individually, an almost 2x speedup is achieved. > > I'll believe it when I see it. I'll put together a proof of concept when I have the time. > Some people don't think so. People have asked for more stringent > compatibility than we can already provide (i.e. replicability even in > the face of bug fixes). People use these as inputs to their scientific > simulations. I'm not going to intentionally make their lives harder > than that. > > Bruce Southey was working on exposing the innards a bit so that you > could make use of a different core PRNG while reusing the > numpy-specific stuff in RandomState. That would be the approach to > apply different technologies. This would be very useful. A "backward-compatibility=" flag could be offered to fall back to a particular implementation so that you can push forward and still offer compatibility as needed. Regards, Mike

From ellisonbg.net at gmail.com Thu Feb 12 15:58:31 2009 From: ellisonbg.net at gmail.com (Brian Granger) Date: Thu, 12 Feb 2009 12:58:31 -0800 Subject: [Numpy-discussion] Fast threading solution thoughts In-Reply-To: <2499d355612c71a44dbb74d3ef5e8dc3.squirrel@webmail.uio.no> References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <4993FA61.7010000@student.matnat.uio.no> <200902121216.20222.faltet@pytables.org> <49940952.6060208@student.matnat.uio.no> <499431E7.7000501@molden.no> <49947932.3050608@student.matnat.uio.no> <6ce0ac130902121215h7b699631g3e2bb27d8851fc7@mail.gmail.com> <2499d355612c71a44dbb74d3ef5e8dc3.squirrel@webmail.uio.no> Message-ID: <6ce0ac130902121258r26ff1d00n4ac5b77bc042c67e@mail.gmail.com> > You know, I thought of the exact same thing when reading your post.
> No, you need the GIL currently, but that's something I'd like to fix.
>
> Ideally, it would be something like this:
>
> cdef int i, s = 0, n = ...
> cdef np.ndarray[int] arr = ...  # will require the GIL
> with nogil:
>     for i in range(n):
>         s += arr[i]  # does not require GIL

Yep, that would be fantastic!!! While it doesn't make the code multithreaded, having this sure would make it easy to get code ready for multithreading.

> The only Python operation needed on the last line is throwing an exception > if one is out of bounds; but I can always insert code to reacquire the GIL > in that exceptional case. (And in most cases like this the user will turn > off bounds-checking anyway.) > > http://trac.cython.org/cython_trac/ticket/210 will contain progress on this.

Great, I will watch this. I am more than willing to test this as well as it is something I would use often. Cheers, Brian

From pav at iki.fi Thu Feb 12 16:23:56 2009 From: pav at iki.fi (Pauli Virtanen) Date: Thu, 12 Feb 2009 21:23:56 +0000 (UTC) Subject: [Numpy-discussion] umath build failures @ trunk on some platforms? Message-ID: Hi, The Buildbot (up once again) is showing build failures on some platforms: http://buildbot.scipy.org/builders/Windows_XP_x86_64_MSVC/builds/875/steps/shell/logs/stdio http://buildbot.scipy.org/builders/Linux_SPARC_64_Debian/builds/423/steps/shell/logs/stdio Are these a sign of some bug (in the new math config system?), or a glitch in the buildslaves only? -- Pauli Virtanen

From charlesr.harris at gmail.com Thu Feb 12 16:39:05 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 12 Feb 2009 14:39:05 -0700 Subject: [Numpy-discussion] umath build failures @ trunk on some platforms? In-Reply-To: References: Message-ID: On Thu, Feb 12, 2009 at 2:23 PM, Pauli Virtanen wrote: > Hi, > > The Buildbot (up once again) is showing build failures on some platforms: > > > http://buildbot.scipy.org/builders/Windows_XP_x86_64_MSVC/builds/875/steps/shell/logs/stdio > > http://buildbot.scipy.org/builders/Linux_SPARC_64_Debian/builds/423/steps/shell/logs/stdio > > Are these a sign of some bug (in the new math config system?), or > a glitch in the buildslaves only? > They are old problems. The Debian problem comes from missing prototypes, the Windows problems from a missing function. I had workarounds for these but David removed them and wants to solve them otherwise. The Debian problem probably won't be solved at all because it is a distro problem, you might try an upgrade. Chuck

From sturla at molden.no Thu Feb 12 18:38:03 2009 From: sturla at molden.no (Sturla Molden) Date: Fri, 13 Feb 2009 00:38:03 +0100 (CET) Subject: [Numpy-discussion] Fast threading solution thoughts In-Reply-To: <49947932.3050608@student.matnat.uio.no> References: <6ce0ac130902112146v7e858968y8a1cff3d28e5f84@mail.gmail.com> <4993FA61.7010000@student.matnat.uio.no> <200902121216.20222.faltet@pytables.org> <49940952.6060208@student.matnat.uio.no> <499431E7.7000501@molden.no> <49947932.3050608@student.matnat.uio.no> Message-ID: > Sturla Molden wrote: > IMO there's a problem with using literal variable names here, because > Python syntax implies that the value is passed. One shouldn't make > syntax where private=(i,) is legal but private=(f(),) isn't. The latter would be illegal in OpenMP as well. OpenMP pragmas only take variable names, not function calls. S.M.
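[A small pure-Python sketch of what the nogil discussion above is after, for readers following along; this is an assumption-laden illustration, not anything from Cython itself. Once native code releases the GIL, ordinary Python threads can overlap on multiple cores; whether a given numpy build actually releases the GIL inside a particular call varies, so treat this only as the pattern being aimed for:]

import threading
import numpy as np

a = np.random.rand(1500, 1500)
b = np.random.rand(1500, 1500)

def work(x):
    # a large BLAS-backed operation; many numpy builds drop the GIL
    # while the underlying C/Fortran routine runs
    np.dot(x, x)

threads = [threading.Thread(target=work, args=(m,)) for m in (a, b)]
for t in threads:
    t.start()
for t in threads:
    t.join()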
From cournape at gmail.com Thu Feb 12 19:19:47 2009 From: cournape at gmail.com (David Cournapeau) Date: Fri, 13 Feb 2009 09:19:47 +0900 Subject: [Numpy-discussion] umath build failures @ trunk on some platforms? In-Reply-To: References: Message-ID: <5b8d13220902121619t3e9a7b99r3904f5ba274a7f7b@mail.gmail.com> On Fri, Feb 13, 2009 at 6:39 AM, Charles R Harris wrote: > > > On Thu, Feb 12, 2009 at 2:23 PM, Pauli Virtanen wrote: >> >> Hi, >> >> The Buildbot (up once again) is showing build failures on some platforms: >> >> >> http://buildbot.scipy.org/builders/Windows_XP_x86_64_MSVC/builds/875/steps/shell/logs/stdio >> >> http://buildbot.scipy.org/builders/Linux_SPARC_64_Debian/builds/423/steps/shell/logs/stdio >> >> Are these a sign of some bug (in the new math config system?), or >> a glitch in the buildslaves only? > > They are old problems. The Debian problem comes from missing prototypes, the > Windows problems from a missing function. I had workarounds for these but > David removed them and wants to solve them otherwise. > The Debian problem > probably won't be solved at all because it is a distro problem, you might > try an upgrade. The Debian problem is more or less irrelevant IMHO. It is a toolchain bug in old debian on sparc; that is, a bug in an old version of a barely used platform. The windows problem is more serious, but I would like to avoid a workaround which impacts other platforms. David

From python6009 at gmail.com Thu Feb 12 20:22:58 2009 From: python6009 at gmail.com (A B) Date: Thu, 12 Feb 2009 17:22:58 -0800 Subject: [Numpy-discussion] Filling gaps Message-ID: <7cc4bc500902121722g6846f8d9q2b0defac524f8be3@mail.gmail.com> Hi, Are there any routines to fill in the gaps in an array? The simplest would be by carrying the last known observation forward.

0,0,10,8,0,0,7,0
0,0,10,8,8,8,7,7

Or by somehow interpolating the missing values based on the previous and next known observations (mean). Thanks.

From pgmdevlist at gmail.com Thu Feb 12 20:33:26 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Thu, 12 Feb 2009 20:33:26 -0500 Subject: [Numpy-discussion] Filling gaps In-Reply-To: <7cc4bc500902121722g6846f8d9q2b0defac524f8be3@mail.gmail.com> References: <7cc4bc500902121722g6846f8d9q2b0defac524f8be3@mail.gmail.com> Message-ID: <7741E188-4BA5-454C-AEDE-27E685FEDB40@gmail.com> On Feb 12, 2009, at 8:22 PM, A B wrote: > Hi, > Are there any routines to fill in the gaps in an array? The simplest > would be by carrying the last known observation forward. > 0,0,10,8,0,0,7,0 > 0,0,10,8,8,8,7,7 > Or by somehow interpolating the missing values based on the previous > and next known observations (mean). > Thanks. The functions `forward_fill` and `backward_fill` in scikits.timeseries should do what you want. They work also on MaskedArray objects, meaning that you don't need to have actual series. The catch is that you need to install scikits.timeseries, of course. More info here: http://pytseries.sourceforge.net/

From kwgoodman at gmail.com Thu Feb 12 20:52:56 2009 From: kwgoodman at gmail.com (Keith Goodman) Date: Thu, 12 Feb 2009 17:52:56 -0800 Subject: [Numpy-discussion] Filling gaps In-Reply-To: <7cc4bc500902121722g6846f8d9q2b0defac524f8be3@mail.gmail.com> References: <7cc4bc500902121722g6846f8d9q2b0defac524f8be3@mail.gmail.com> Message-ID: On Thu, Feb 12, 2009 at 5:22 PM, A B wrote: > Are there any routines to fill in the gaps in an array? The simplest > would be by carrying the last known observation forward.
> 0,0,10,8,0,0,7,0
> 0,0,10,8,8,8,7,7

Here's an obvious hack for 1d arrays:

def fill_forward(x, miss=0):
    y = x.copy()
    for i in range(x.shape[0]):
        if y[i] == miss:
            y[i] = y[i-1]
    return y

Seems to work:

>> x
array([ 0, 0, 10, 8, 0, 0, 7, 0])
>> fill_forward(x)
array([ 0, 0, 10, 8, 8, 8, 7, 7])

From kwgoodman at gmail.com Thu Feb 12 21:04:32 2009 From: kwgoodman at gmail.com (Keith Goodman) Date: Thu, 12 Feb 2009 18:04:32 -0800 Subject: [Numpy-discussion] Filling gaps In-Reply-To: References: <7cc4bc500902121722g6846f8d9q2b0defac524f8be3@mail.gmail.com> Message-ID: On Thu, Feb 12, 2009 at 5:52 PM, Keith Goodman wrote: > On Thu, Feb 12, 2009 at 5:22 PM, A B wrote: >> Are there any routines to fill in the gaps in an array? The simplest >> would be by carrying the last known observation forward.
>> 0,0,10,8,0,0,7,0
>> 0,0,10,8,8,8,7,7
>
> Here's an obvious hack for 1d arrays:
>
> def fill_forward(x, miss=0):
>     y = x.copy()
>     for i in range(x.shape[0]):
>         if y[i] == miss:
>             y[i] = y[i-1]
>     return y
>
> Seems to work:
>
>>> x
> array([ 0, 0, 10, 8, 0, 0, 7, 0])
>>> fill_forward(x)
> array([ 0, 0, 10, 8, 8, 8, 7, 7])

I guess that should be

for i in range(1, x.shape[0]):

instead of

for i in range(x.shape[0]):

to avoid replacing the first element of the array, if it is missing, with the last.

From kwgoodman at gmail.com Thu Feb 12 21:19:32 2009 From: kwgoodman at gmail.com (Keith Goodman) Date: Thu, 12 Feb 2009 18:19:32 -0800 Subject: [Numpy-discussion] Filling gaps In-Reply-To: References: <7cc4bc500902121722g6846f8d9q2b0defac524f8be3@mail.gmail.com> Message-ID: On Thu, Feb 12, 2009 at 6:04 PM, Keith Goodman wrote: > On Thu, Feb 12, 2009 at 5:52 PM, Keith Goodman wrote: >> On Thu, Feb 12, 2009 at 5:22 PM, A B wrote: >>> Are there any routines to fill in the gaps in an array? The simplest >>> would be by carrying the last known observation forward.
>>> 0,0,10,8,0,0,7,0
>>> 0,0,10,8,8,8,7,7
>>
>> Here's an obvious hack for 1d arrays:
>>
>> def fill_forward(x, miss=0):
>>     y = x.copy()
>>     for i in range(x.shape[0]):
>>         if y[i] == miss:
>>             y[i] = y[i-1]
>>     return y
>>
>> Seems to work:
>>
>>>> x
>> array([ 0, 0, 10, 8, 0, 0, 7, 0])
>>>> fill_forward(x)
>> array([ 0, 0, 10, 8, 8, 8, 7, 7])
>
> I guess that should be
>
> for i in range(1, x.shape[0]):
>
> instead of
>
> for i in range(x.shape[0]):
>
> to avoid replacing the first element of the array, if it is missing,
> with the last.

For large 1d x arrays, this might be faster:

def fill_forward2(x, miss=0):
    y = x.copy()
    while np.any(y == miss):
        idx = np.where(y == miss)[0]
        y[idx] = y[idx-1]
    return y

But it does replace the first element of the array, if it is missing, with the last. We could speed it up by doing (y == miss) only once per loop. (But I bet the np.where is the bottleneck.)

From nadavh at visionsense.com Fri Feb 13 02:19:27 2009 From: nadavh at visionsense.com (Nadav Horesh) Date: Fri, 13 Feb 2009 09:19:27 +0200 Subject: [Numpy-discussion] Bilateral filter: bug corrected Message-ID: <710F2847B0018641891D9A216027636029C431@ex3.envision.co.il> Attached here is code for a bilateral filter:

1. The code is composed of a cython back end (bilateral_base.pyx) and a python front end (bilateral.py).
2. I do not have the tools to make windows binaries (I run it on gentoo linux).
3. It is not hard to strip the cython code to get a pure (and slow) python implementation; a rough sketch of that idea follows below.
4. If someone finds the licensing inadequate, I have no problem to re-license.
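[For item 3, a minimal pure-numpy illustration of the same idea. This is a hypothetical sketch, not the attached code; the sigma defaults, window radius, and the untouched-border handling are all assumptions:]

import numpy as np

def bilateral_slow(img, sigma_s=2.0, sigma_r=0.1, radius=4):
    # expects a 2-D float image; spatial Gaussian weights are fixed
    # for the whole image
    ys, xs = np.mgrid[-radius:radius+1, -radius:radius+1]
    spatial = np.exp(-(xs**2 + ys**2) / (2.0 * sigma_s**2))
    out = img.astype(np.float64)
    for i in range(radius, img.shape[0] - radius):
        for j in range(radius, img.shape[1] - radius):
            win = img[i-radius:i+radius+1, j-radius:j+radius+1]
            # range weights penalize intensity differences from the center
            rng = np.exp(-(win - img[i, j])**2 / (2.0 * sigma_r**2))
            w = spatial * rng
            out[i, j] = (w * win).sum() / w.sum()
    return out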
Any comments are (more than) welcome

Nadav

-------------- next part -------------- A non-text attachment was scrubbed... Name: bilateral_base.pyx Type: application/octet-stream Size: 4089 bytes Desc: bilateral_base.pyx URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: bilateral.py Type: text/x-python Size: 1896 bytes Desc: bilateral.py URL:

From stefan at sun.ac.za Fri Feb 13 04:50:07 2009 From: stefan at sun.ac.za (Stéfan van der Walt) Date: Fri, 13 Feb 2009 11:50:07 +0200 Subject: [Numpy-discussion] Bilateral filter: bug corrected In-Reply-To: <710F2847B0018641891D9A216027636029C431@ex3.envision.co.il> References: <710F2847B0018641891D9A216027636029C431@ex3.envision.co.il> Message-ID: <9457e7c80902130150h4dc8755eqf6097fc0d20b6522@mail.gmail.com> Hi Nadav 2009/2/13 Nadav Horesh : > > Attached here is code for a bilateral filter: > > 1. The code is composed of a cython back end (bilateral_base.pyx) and a python front end (bilateral.py). > 2. I do not have the tools to make windows binaries (I run it on gentoo linux). > 3. It is not hard to strip the cython code to get a pure (and slow) python implementation. > 4. If someone finds the licensing inadequate, I have no problem to re-license. > > Any comments are (more than) welcome Thanks for the update! I added a setup.py and fixed a small problem with the includes. Everything is here: http://github.com/stefanv/bilateral/tree/master Cheers Stéfan

From njs at pobox.com Fri Feb 13 06:22:00 2009 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 13 Feb 2009 03:22:00 -0800 Subject: [Numpy-discussion] problem with WRITEABLE flag and array interface Message-ID: <961fa2b40902130322p5da39205xf7887614570dc7d7@mail.gmail.com> I'm using rpy2 to access the R statistical programming runtime from Python, and one of rpy2's nice features is that wrappers for R arrays support the Python array interface (via __array_struct__), so that one can conveniently cast them to numpy arrays and edit them in place. But this seems to expose a bug in numpy's WRITEABLE flag handling. Such arrays start out writeable:

>>> a = np.asarray(r)
>>> a.flags["WRITEABLE"]
True

And you can toggle them unwriteable:

>>> a.flags["WRITEABLE"] = False

But then you are stuck! You cannot make the array writeable again:

>>> a.flags["WRITEABLE"] = True
ValueError: cannot set WRITEABLE flag to True of this array

This may seem rather trivial, but it's completely broken my API design... the problem is that R has somewhat complex rules for when you can write to an array and when you cannot (because of copy-on-write stuff going on behind the scenes), and I wanted to reflect that in my high-level API (for people who don't want to know about R's COW minutiae) by toggling the WRITEABLE flag in a controlled way. But I can't. Help?

Self-contained test case:

class Fraud(object):
    pass

f = Fraud()
a = np.array([1, 2, 3])
f.__array_interface__ = a.__array_interface__
f_asarray = np.asarray(f)
assert f_asarray.flags["WRITEABLE"] == True
f_asarray.flags["WRITEABLE"] = False
f_asarray.flags["WRITEABLE"] = True  # Fails

-- Nathaniel

From sturla at molden.no Fri Feb 13 06:54:10 2009 From: sturla at molden.no (Sturla Molden) Date: Fri, 13 Feb 2009 12:54:10 +0100 Subject: [Numpy-discussion] Purpose for bit-wise and'ing the initial mersenne twister key?
In-Reply-To: <20090212154621.1f03d22b.michael.s.gilbert@gmail.com> References: <20090206162418.0d8dd677.michael.s.gilbert@gmail.com> <3d375d730902061325h190b1de4o572f2ca63fd34d63@mail.gmail.com> <20090206175723.45e986fb.michael.s.gilbert@gmail.com> <3d375d730902061456m64fdac20uaa601f02c7ebb3e7@mail.gmail.com> <20090206181844.9b2070ff.michael.s.gilbert@gmail.com> <20090212115800.ad3049fa.michael.s.gilbert@gmail.com> <3d375d730902121118h529817f2y4444beb061e12e58@mail.gmail.com> <20090212151717.cdacaa60.michael.s.gilbert@gmail.com> <3d375d730902121232r22dcb98ckab834a7771dc07fe@mail.gmail.com> <20090212154621.1f03d22b.michael.s.gilbert@gmail.com> Message-ID: <49955F62.7000203@molden.no> On 2/12/2009 9:46 PM, Michael S. Gilbert wrote: > I'll put together a proof of concept when I have the time. If you are that obsessed with speed, consider to use the new SIMD version of the Mersenne Twister instead of Jean-Sebastien Roy's randomkit.c (used by NumPy). But I think randomkit.c is fast enough for most purposes. http://www.math.sci.hiroshima-u.ac.jp/~m-mat/MT/SFMT/index.html Sturla Molden From michael.s.gilbert at gmail.com Fri Feb 13 10:07:56 2009 From: michael.s.gilbert at gmail.com (Michael S. Gilbert) Date: Fri, 13 Feb 2009 10:07:56 -0500 Subject: [Numpy-discussion] Purpose for bit-wise and'ing the initial mersenne twister key? In-Reply-To: <49955F62.7000203@molden.no> References: <20090206162418.0d8dd677.michael.s.gilbert@gmail.com> <3d375d730902061325h190b1de4o572f2ca63fd34d63@mail.gmail.com> <20090206175723.45e986fb.michael.s.gilbert@gmail.com> <3d375d730902061456m64fdac20uaa601f02c7ebb3e7@mail.gmail.com> <20090206181844.9b2070ff.michael.s.gilbert@gmail.com> <20090212115800.ad3049fa.michael.s.gilbert@gmail.com> <3d375d730902121118h529817f2y4444beb061e12e58@mail.gmail.com> <20090212151717.cdacaa60.michael.s.gilbert@gmail.com> <3d375d730902121232r22dcb98ckab834a7771dc07fe@mail.gmail.com> <20090212154621.1f03d22b.michael.s.gilbert@gmail.com> <49955F62.7000203@molden.no> Message-ID: <20090213100756.7d2f3e5d.michael.s.gilbert@gmail.com> On Fri, 13 Feb 2009 12:54:10 +0100 Sturla Molden wrote: > If you are that obsessed with speed, consider to use the new SIMD > version of the Mersenne Twister instead of Jean-Sebastien Roy's > randomkit.c (used by NumPy). But I think randomkit.c is fast enough for > most purposes. > > http://www.math.sci.hiroshima-u.ac.jp/~m-mat/MT/SFMT/index.html I had already come across this, and was going to suggest using it as well at some point. However, as per Robert's last comment, it sounds like quite a bit of work needs to be done first to make drop-in replacements for the PRNG possible. I'm not "obsessed" with speed, but a 2x improvement is quite significant. From sturla at molden.no Fri Feb 13 10:25:37 2009 From: sturla at molden.no (Sturla Molden) Date: Fri, 13 Feb 2009 16:25:37 +0100 Subject: [Numpy-discussion] Purpose for bit-wise and'ing the initial mersenne twister key? 
In-Reply-To: <20090213100756.7d2f3e5d.michael.s.gilbert@gmail.com> References: <20090206162418.0d8dd677.michael.s.gilbert@gmail.com> <3d375d730902061325h190b1de4o572f2ca63fd34d63@mail.gmail.com> <20090206175723.45e986fb.michael.s.gilbert@gmail.com> <3d375d730902061456m64fdac20uaa601f02c7ebb3e7@mail.gmail.com> <20090206181844.9b2070ff.michael.s.gilbert@gmail.com> <20090212115800.ad3049fa.michael.s.gilbert@gmail.com> <3d375d730902121118h529817f2y4444beb061e12e58@mail.gmail.com> <20090212151717.cdacaa60.michael.s.gilbert@gmail.com> <3d375d730902121232r22dcb98ckab834a7771dc07fe@mail.gmail.com> <20090212154621.1f03d22b.michael.s.gilbert@gmail.com> <49955F62.7000203@molden.no> <20090213100756.7d2f3e5d.michael.s.gilbert@gmail.com> Message-ID: <499590F1.8060801@molden.no> On 2/13/2009 4:07 PM, Michael S. Gilbert wrote: > I'm not "obsessed" with speed, but a 2x improvement is quite > significant. Honestly, I don't care about 2x differences here. How many milliseconds do you save? The PRNG in SciPy is fast enough for 99% of any use I can conceive. I have yet to see a program speed-limited by the PRNG. Projects that are speed limited by the PRNG could take the hassle to compile their own. And if speed is that important, perhaps one should not use the Mersenne Twister at all? http://groups.google.com/group/comp.lang.c/msg/e3c4ea1169e463ae?dmode=source http://groups.google.com/group/sci.crypt/msg/12152f657a3bb219?dmode=source S.M. From michael.s.gilbert at gmail.com Fri Feb 13 10:51:53 2009 From: michael.s.gilbert at gmail.com (Michael S. Gilbert) Date: Fri, 13 Feb 2009 10:51:53 -0500 Subject: [Numpy-discussion] Purpose for bit-wise and'ing the initial mersenne twister key? In-Reply-To: <499590F1.8060801@molden.no> References: <20090206162418.0d8dd677.michael.s.gilbert@gmail.com> <3d375d730902061325h190b1de4o572f2ca63fd34d63@mail.gmail.com> <20090206175723.45e986fb.michael.s.gilbert@gmail.com> <3d375d730902061456m64fdac20uaa601f02c7ebb3e7@mail.gmail.com> <20090206181844.9b2070ff.michael.s.gilbert@gmail.com> <20090212115800.ad3049fa.michael.s.gilbert@gmail.com> <3d375d730902121118h529817f2y4444beb061e12e58@mail.gmail.com> <20090212151717.cdacaa60.michael.s.gilbert@gmail.com> <3d375d730902121232r22dcb98ckab834a7771dc07fe@mail.gmail.com> <20090212154621.1f03d22b.michael.s.gilbert@gmail.com> <49955F62.7000203@molden.no> <20090213100756.7d2f3e5d.michael.s.gilbert@gmail.com> <499590F1.8060801@molden.no> Message-ID: <20090213105153.0fb3fe9f.michael.s.gilbert@gmail.com> On Fri, 13 Feb 2009 16:25:37 +0100 Sturla Molden wrote: > Honestly, I don't care about 2x differences here. How many milliseconds > do you save? It's not about saving milliseconds, it's about taking half the time to run the same simulation. So if my runs currently take 2 hours, they will take 1 hour instead; and if they take 2 days, they will take 1 day instead. It may not impact your application's runtime, but it does mine. > Projects that are speed limited by the PRNG could take the hassle to > compile their own. I think it would be useful for numpy to provide options and let the user decide based on their needs (most don't want to and should not be made to implement their own algorithms, especially since the work has already been done). > And if speed is that important, perhaps one should not use the Mersenne > Twister at all? Just yesterday, I started looking at using /dev/urandom instead of MT, and I had previously looked at using Marsaglia's MWC. It's a tradeoff between speed and random number quality. 
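[A quick way to eyeball that speed/quality tradeoff on your own machine; a rough sketch only, since absolute numbers vary by platform and the semantics of os.urandom depend on the OS:]

import os
import time
import numpy as np

n = 16 * 1024 * 1024  # 16 MB of random bytes per draw
prng = np.random.RandomState(12345)

t0 = time.time()
x = prng.bytes(n)       # Mersenne Twister, via numpy
t1 = time.time()
y = os.urandom(n)       # kernel CSPRNG (/dev/urandom on Linux)
t2 = time.time()

print "MT: %.3fs, os.urandom: %.3fs" % (t1 - t0, t2 - t1)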
Best Regards, Mike

From sturla at molden.no Fri Feb 13 11:04:48 2009 From: sturla at molden.no (Sturla Molden) Date: Fri, 13 Feb 2009 17:04:48 +0100 Subject: [Numpy-discussion] Purpose for bit-wise and'ing the initial mersenne twister key? In-Reply-To: <20090213105153.0fb3fe9f.michael.s.gilbert@gmail.com> References: <20090206162418.0d8dd677.michael.s.gilbert@gmail.com> <3d375d730902061325h190b1de4o572f2ca63fd34d63@mail.gmail.com> <20090206175723.45e986fb.michael.s.gilbert@gmail.com> <3d375d730902061456m64fdac20uaa601f02c7ebb3e7@mail.gmail.com> <20090206181844.9b2070ff.michael.s.gilbert@gmail.com> <20090212115800.ad3049fa.michael.s.gilbert@gmail.com> <3d375d730902121118h529817f2y4444beb061e12e58@mail.gmail.com> <20090212151717.cdacaa60.michael.s.gilbert@gmail.com> <3d375d730902121232r22dcb98ckab834a7771dc07fe@mail.gmail.com> <20090212154621.1f03d22b.michael.s.gilbert@gmail.com> <49955F62.7000203@molden.no> <20090213100756.7d2f3e5d.michael.s.gilbert@gmail.com> <499590F1.8060801@molden.no> <20090213105153.0fb3fe9f.michael.s.gilbert@gmail.com> Message-ID: <49959A20.7070701@molden.no> On 2/13/2009 4:51 PM, Michael S. Gilbert wrote: > It's not about saving milliseconds, it's about taking half the time to > run the same simulation. So if my runs currently take 2 hours, they > will take 1 hour instead; and if they take 2 days, they will take 1 > day instead. It may not impact your application's runtime, but it does > mine. So you have a simulation written in *Python*, and the major bottleneck is the MT prng? Forgive me for not believing it.

cProfile.run('simulation()')

S.M.

From michael.s.gilbert at gmail.com Fri Feb 13 11:17:55 2009 From: michael.s.gilbert at gmail.com (Michael S. Gilbert) Date: Fri, 13 Feb 2009 11:17:55 -0500 Subject: [Numpy-discussion] Purpose for bit-wise and'ing the initial mersenne twister key? In-Reply-To: <49959A20.7070701@molden.no> References: <20090206162418.0d8dd677.michael.s.gilbert@gmail.com> <3d375d730902061325h190b1de4o572f2ca63fd34d63@mail.gmail.com> <20090206175723.45e986fb.michael.s.gilbert@gmail.com> <3d375d730902061456m64fdac20uaa601f02c7ebb3e7@mail.gmail.com> <20090206181844.9b2070ff.michael.s.gilbert@gmail.com> <20090212115800.ad3049fa.michael.s.gilbert@gmail.com> <3d375d730902121118h529817f2y4444beb061e12e58@mail.gmail.com> <20090212151717.cdacaa60.michael.s.gilbert@gmail.com> <3d375d730902121232r22dcb98ckab834a7771dc07fe@mail.gmail.com> <20090212154621.1f03d22b.michael.s.gilbert@gmail.com> <49955F62.7000203@molden.no> <20090213100756.7d2f3e5d.michael.s.gilbert@gmail.com> <499590F1.8060801@molden.no> <20090213105153.0fb3fe9f.michael.s.gilbert@gmail.com> <49959A20.7070701@molden.no> Message-ID: <20090213111755.000c2c71.michael.s.gilbert@gmail.com> On Fri, 13 Feb 2009 17:04:48 +0100 Sturla Molden wrote: > So you have a simulation written in *Python*, and the major bottleneck > is the MT prng? Forgive me for not believing it. Yes, running a lot of monte carlo simulations back-to-back. If the PRNG were twice as fast, my code would be twice as fast. It isn't that unbelievable... Honestly, I don't feel like arguing about this anymore. It's a matter of "show me the code," and when I have the time, I will "show you the code."

From ralphkube at googlemail.com Fri Feb 13 12:22:21 2009 From: ralphkube at googlemail.com (Ralph Kube) Date: Fri, 13 Feb 2009 18:22:21 +0100 Subject: [Numpy-discussion] Integer cast problems In-Reply-To: References: <49945A7F.8020706@googlemail.com> Message-ID: <4995AC4D.8090300@googlemail.com> Thanks a lot, people, my problem is gone now.
Keith Goodman skrev: > On Thu, Feb 12, 2009 at 9:21 AM, Ralph Kube wrote: >> The same happens on the ipython prompt: >> >> 0.145 * 0.005 = 28.999999999999996 >> N.int32(0.145 * 0.005) = 28 >> >> Any ideas how to deal with this? > > Do you want the answer to be 29? N.int32 truncates. If you want to > round instead, you could use the standard trick of adding 0.5:
>
>>> np.int32(0.5 + 0.145 / 0.005)
> 29
>
> or
>
>>> np.round(0.145 / 0.005)
> 29.0
> _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion

From nadavh at visionsense.com Fri Feb 13 13:50:29 2009 From: nadavh at visionsense.com (Nadav Horesh) Date: Fri, 13 Feb 2009 20:50:29 +0200 Subject: [Numpy-discussion] Bilateral filter: bug corrected References: <710F2847B0018641891D9A216027636029C431@ex3.envision.co.il> <9457e7c80902130150h4dc8755eqf6097fc0d20b6522@mail.gmail.com> Message-ID: <710F2847B0018641891D9A216027636029C432@ex3.envision.co.il> Hi Stefan, I tried the installer and it did not copy bilateral.py. I tried to improve it, and the result is in the attached file. I hope it will pass the mail filter; if not, contact me directly at the email address below.

Nadav.

-----Original Message----- From: numpy-discussion-bounces at scipy.org on behalf of Stéfan van der Walt Sent: Friday, 13 February 2009 11:50 To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] Bilateral filter: bug corrected

Hi Nadav 2009/2/13 Nadav Horesh : > > Attached here is code for a bilateral filter: > > 1. The code is composed of a cython back end (bilateral_base.pyx) and a python front end (bilateral.py). > 2. I do not have the tools to make windows binaries (I run it on gentoo linux). > 3. It is not hard to strip the cython code to get a pure (and slow) python implementation. > 4. If someone finds the licensing inadequate, I have no problem to re-license. > > Any comments are (more than) welcome Thanks for the update! I added a setup.py and fixed a small problem with the includes. Everything is here: http://github.com/stefanv/bilateral/tree/master Cheers Stéfan _______________________________________________ Numpy-discussion mailing list Numpy-discussion at scipy.org http://projects.scipy.org/mailman/listinfo/numpy-discussion

-------------- next part -------------- A non-text attachment was scrubbed... Name: bilateral.tar.gz Type: application/x-gzip Size: 2435 bytes Desc: bilateral.tar.gz URL:

From davidh at ipac.caltech.edu Fri Feb 13 15:04:12 2009 From: davidh at ipac.caltech.edu (David Henderson) Date: Fri, 13 Feb 2009 12:04:12 -0800 Subject: [Numpy-discussion] improvement request to np.dot(a, b) - extended precision summation Message-ID: Hello all, I'd like to accumulate the summation in extended precision, "double" sum for float inputs, "long double" sum for "double" inputs. A way of doing this is to add an optional third argument to dot - to specify the summation type. I've looked into the code, and the source file multiarraymodule.c has a routine: new_array_for_sum(PyArrayObject *ap1, PyArrayObject *ap2, If the implementation is to pass a third argument to dot that specifies the resultant type, this routine would be modified to add that third argument or type in its priority selection logic. I've not written any code yet, and I'm not fussy as to how it gets done. Is there another way to do this that I'm missing?
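[One existing workaround along these lines, sketched with numpy's current API rather than the requested change to dot itself: numpy.sum accepts a dtype argument for the accumulator, so a float32 product can be summed in double precision:]

import numpy as np

a = np.random.rand(1000000).astype(np.float32)
b = np.random.rand(1000000).astype(np.float32)

# the elementwise product stays float32, but the reduction
# accumulates in float64:
s = np.sum(a * b, dtype=np.float64)

# for comparison, dot does everything, including the sum, in float32:
s32 = np.dot(a, b)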
Thanks in advance, David

From pav at iki.fi Fri Feb 13 16:21:14 2009 From: pav at iki.fi (Pauli Virtanen) Date: Fri, 13 Feb 2009 21:21:14 +0000 (UTC) Subject: [Numpy-discussion] improvement request to np.dot(a, b) - extended precision summation References: Message-ID: Fri, 13 Feb 2009 12:04:12 -0800, David Henderson wrote: > I'd like to accumulate the summation in extended precision, "double" sum > for float inputs, "long double" sum for "double" inputs. > > A way of doing this is to add an optional third argument to dot - to > specify the summation type. `dot` does matrix-matrix, matrix-vector, or vector-vector products. These are usually implemented via calling the BLAS linear algebra library, and AFAIK BLAS only has routines for type-homogeneous arguments. So I doubt this can be implemented for `dot` in any way. On the other hand, `numpy.sum` already has a `dtype` argument that specifies the accumulator data type. Maybe you can use that? -- Pauli Virtanen

From robert.kern at gmail.com Fri Feb 13 18:14:08 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 13 Feb 2009 17:14:08 -0600 Subject: [Numpy-discussion] Purpose for bit-wise and'ing the initial mersenne twister key? In-Reply-To: <20090213111755.000c2c71.michael.s.gilbert@gmail.com> References: <20090206162418.0d8dd677.michael.s.gilbert@gmail.com> <20090212151717.cdacaa60.michael.s.gilbert@gmail.com> <3d375d730902121232r22dcb98ckab834a7771dc07fe@mail.gmail.com> <20090212154621.1f03d22b.michael.s.gilbert@gmail.com> <49955F62.7000203@molden.no> <20090213100756.7d2f3e5d.michael.s.gilbert@gmail.com> <499590F1.8060801@molden.no> <20090213105153.0fb3fe9f.michael.s.gilbert@gmail.com> <49959A20.7070701@molden.no> <20090213111755.000c2c71.michael.s.gilbert@gmail.com> Message-ID: <3d375d730902131514k66fda4b5p30d727aa1eb4b8c1@mail.gmail.com> On Fri, Feb 13, 2009 at 10:17, Michael S. Gilbert wrote: > On Fri, 13 Feb 2009 17:04:48 +0100 Sturla Molden wrote: >> So you have a simulation written in *Python*, and the major bottleneck >> is the MT prng? Forgive me for not believing it. > > Yes, running a lot of monte carlo simulations back-to-back. If the > PRNG were twice as fast, my code would be twice as fast. It isn't that > unbelievable... It is somewhat. If you do even fairly trivial work above getting the raw bytes, the amount of time spent actually in the PRNG is only moderate. Making the PRNG twice as fast only speeds up that moderate part, not the entire program. Unless you are *only* running the PRNG and doing nothing else, then yes, speeding up the PRNG by a factor of 2 will speed up your program by that amount. For example, here is a script that will either get random bytes, standard Gaussian numbers, or random long ints. Using Instruments.app on my Mac, I can find the amount of time actually spent in the PRNG as opposed to the rest of Python or mtrand. For the bytes case, this is around 96% of the time (although it will drop down to 80% or so with 1kb blocks). For the Gaussian case, this is 75% of the time. For the long ints, this is actually 54% (I'm not sure why this is the slowest, but a lot of time is being wasted in the method; worth looking into). In the latter case, a double-speed PRNG will only speed up your program by 25%. If you are doing actual computations with those random numbers, these factors will only get worse.
import os
from numpy import random

print os.getpid()

prng = random.RandomState(1234567890)

while True:
    #x = prng.bytes(1024*1024*16)
    #x = prng.standard_normal(1024*1024*4)
    x = prng.tomaxint(1024*1024*4)

> Honestly, I don't feel like arguing about this anymore. It's a matter > of "show me the code," and when I have the time, I will "show you the > code." Before spending too much time on this, I highly recommend profiling your code to see what is actually consuming the most time. Start with the Python profiling tools: http://docs.python.org/library/profile If it appears that the methods in numpy.random are actually a significant bottleneck, you may need to break out a C profiler, too, to determine how much time is actually being spent in the PRNG itself as opposed to the non-uniform distributions and the Pyrex wrappers. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From sturla at molden.no Fri Feb 13 19:29:01 2009 From: sturla at molden.no (Sturla Molden) Date: Sat, 14 Feb 2009 01:29:01 +0100 (CET) Subject: [Numpy-discussion] Purpose for bit-wise and'ing the initial mersenne twister key? In-Reply-To: <20090213111755.000c2c71.michael.s.gilbert@gmail.com> References: <20090206162418.0d8dd677.michael.s.gilbert@gmail.com> <3d375d730902061325h190b1de4o572f2ca63fd34d63@mail.gmail.com> <20090206175723.45e986fb.michael.s.gilbert@gmail.com> <3d375d730902061456m64fdac20uaa601f02c7ebb3e7@mail.gmail.com> <20090206181844.9b2070ff.michael.s.gilbert@gmail.com> <20090212115800.ad3049fa.michael.s.gilbert@gmail.com> <3d375d730902121118h529817f2y4444beb061e12e58@mail.gmail.com> <20090212151717.cdacaa60.michael.s.gilbert@gmail.com> <3d375d730902121232r22dcb98ckab834a7771dc07fe@mail.gmail.com> <20090212154621.1f03d22b.michael.s.gilbert@gmail.com> <49955F62.7000203@molden.no> <20090213100756.7d2f3e5d.michael.s.gilbert@gmail.com> <499590F1.8060801@molden.no> <20090213105153.0fb3fe9f.michael.s.gilbert@gmail.com> <49959A20.7070701@molden.no> <20090213111755.000c2c71.michael.s.gilbert@gmail.com> Message-ID: > On Fri, 13 Feb 2009 17:04:48 +0100 Sturla Molden wrote: > Yes, running a lot of monte carlo simulations back-to-back. If the > PRNG were twice as fast, my code would be twice as fast. It isn't that > unbelievable... Profile before you make such bold statements. You are implying that your simulation does nothing but generate random integers for days on end. Surely you must also do something useful with them? If you make the PRNG twice as fast, you have made the PRNG twice as fast. Everything else stays the same. "Premature optimization is the root of all evil." (C.A.R. Hoare) S.M.

From david at ar.media.kyoto-u.ac.jp Sat Feb 14 09:38:11 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Sat, 14 Feb 2009 23:38:11 +0900 Subject: [Numpy-discussion] Float and locale formatting fixes merged in trunk Message-ID: <4996D753.1000600@ar.media.kyoto-u.ac.jp> Hi, I've just merged the branch Pauli and me have been working on recently to fix several local and float formatting bugs. All tests pass on Linux, but there may still be rough edges on Windows.
Let me know if this causes trouble.

cheers,

David

From pav at iki.fi  Sat Feb 14 10:41:30 2009
From: pav at iki.fi (Pauli Virtanen)
Date: Sat, 14 Feb 2009 15:41:30 +0000 (UTC)
Subject: [Numpy-discussion] Float and locale formatting fixes merged in trunk
References: <4996D753.1000600@ar.media.kyoto-u.ac.jp>
Message-ID:

Hi,

Sat, 14 Feb 2009 23:38:11 +0900, David Cournapeau wrote:
> I've just merged the branch Pauli and I have been working on recently to fix several locale and float formatting bugs. All tests pass on Linux, but there may still be rough edges on Windows. Let me know if this causes trouble.

Excellent.

I note that we can't test this now on Windows, since the trunk does not build (because of the atanhf umath build error).

-- 
Pauli Virtanen

From cournape at gmail.com  Sat Feb 14 10:57:12 2009
From: cournape at gmail.com (David Cournapeau)
Date: Sun, 15 Feb 2009 00:57:12 +0900
Subject: [Numpy-discussion] Float and locale formatting fixes merged in trunk
In-Reply-To:
References: <4996D753.1000600@ar.media.kyoto-u.ac.jp>
Message-ID: <5b8d13220902140757o54c629bfh5dd4a9b0fb8ffe29@mail.gmail.com>

On Sun, Feb 15, 2009 at 12:41 AM, Pauli Virtanen wrote:
> Hi,
>
> Sat, 14 Feb 2009 23:38:11 +0900, David Cournapeau wrote:
>> I've just merged the branch Pauli and I have been working on recently to fix several locale and float formatting bugs. All tests pass on Linux, but there may still be rough edges on Windows. Let me know if this causes trouble.
>
> Excellent.
>
> I note that we can't test this now on Windows, since the trunk does not build (because of the atanhf umath build error).

Which build problem? I have built recent numpy fine.

David

> --
> Pauli Virtanen
>
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From pav at iki.fi  Sat Feb 14 11:04:12 2009
From: pav at iki.fi (Pauli Virtanen)
Date: Sat, 14 Feb 2009 16:04:12 +0000 (UTC)
Subject: [Numpy-discussion] Float and locale formatting fixes merged in trunk
References: <4996D753.1000600@ar.media.kyoto-u.ac.jp> <5b8d13220902140757o54c629bfh5dd4a9b0fb8ffe29@mail.gmail.com>
Message-ID:

Sun, 15 Feb 2009 00:57:12 +0900, David Cournapeau wrote:
> On Sun, Feb 15, 2009 at 12:41 AM, Pauli Virtanen wrote:
>> Hi,
>>
>> I note that we can't test this now on Windows, since the trunk does not build (because of the atanhf umath build error).
>
> Which build problem? I have built recent numpy fine.

This one:

http://buildbot.scipy.org/builders/Windows_XP_x86_64_MSVC/builds/875/steps/shell/logs/stdio

-- 
Pauli Virtanen

From josef.pktd at gmail.com  Sat Feb 14 11:09:01 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Sat, 14 Feb 2009 11:09:01 -0500
Subject: [Numpy-discussion] Float and locale formatting fixes merged in trunk
In-Reply-To:
References: <4996D753.1000600@ar.media.kyoto-u.ac.jp> <5b8d13220902140757o54c629bfh5dd4a9b0fb8ffe29@mail.gmail.com>
Message-ID: <1cd32cbb0902140809p1f54c510o2a8dc2a24fa99849@mail.gmail.com>

On Sat, Feb 14, 2009 at 11:04 AM, Pauli Virtanen wrote:
> Sun, 15 Feb 2009 00:57:12 +0900, David Cournapeau wrote:
>> On Sun, Feb 15, 2009 at 12:41 AM, Pauli Virtanen wrote:
>>> I note that we can't test this now on Windows, since the trunk does not build (because of the atanhf umath build error).
>>
>> Which build problem? I have built recent numpy fine.
>
> This one:
>
> http://buildbot.scipy.org/builders/Windows_XP_x86_64_MSVC/builds/875/steps/shell/logs/stdio

I just built the current trunk: win32, Windows XP, built with MinGW. No build problems.

Ran 1781 tests in 12.016s
FAILED (KNOWNFAIL=1, SKIP=1, errors=2, failures=3)

Josef

-------------------------------------------------------------
Python 2.5.2 (r252:60911, Feb 21 2008, 13:11:45) [MSC v.1310 32 bit (Intel)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> import numpy
>>> numpy.__file__
'C:\\Programs\\Python25\\lib\\site-packages\\numpy\\__init__.pyc'
>>> numpy.version.version
'1.3.0.dev6362'
>>> numpy.test()
Running unit tests for numpy
NumPy version 1.3.0.dev6362
NumPy is installed in C:\Programs\Python25\lib\site-packages\numpy
Python version 2.5.2 (r252:60911, Feb 21 2008, 13:11:45) [MSC v.1310 32 bit (Intel)]
nose version 0.10.4
................................................................................
................................................................................
................................................................................
................................................................................
................................................................................
...............................................................................K
................................................................................
...................Ignoring "Python was built with Visual Studio 2003; extensions must be built with a compiler than can generate compatible binaries. Visual Studio 2003 was not found on this system. If you have Cygwin installed, you can try compiling with MingW32, by passing "-c mingw32" to setup.py." (one should fix me in fcompiler/compaq.py)
................................................................................
................................................................................
................................................................................
................................................................................
...............................................E..F.....E.......
..................FF............................................................
................................................................................
................................................................................
................................................................................
................................................................................
.S..............................................................................
................................................................................
................................................................................
................................................................................
................................................................................
..
======================================================================
ERROR: test_mmap (test_io.TestSaveLoad)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "C:\Programs\Python25\lib\site-packages\numpy\lib\tests\test_io.py", line 81, in test_mmap
    self.roundtrip(a, file_on_disk=True, load_kwds={'mmap_mode': 'r'})
  File "C:\Programs\Python25\lib\site-packages\numpy\lib\tests\test_io.py", line 89, in roundtrip
    RoundtripTest.roundtrip(self, np.save, *args, **kwargs)
  File "C:\Programs\Python25\lib\site-packages\numpy\lib\tests\test_io.py", line 57, in roundtrip
    arr_reloaded = np.load(load_file, **load_kwds)
  File "\Programs\Python25\Lib\site-packages\numpy\lib\io.py", line 142, in load
    fid = _file(file,"rb")
IOError: [Errno 2] No such file or directory: 'c:\\docume~1\\carrasco\\locals~1\\temp\\tmpts5iyy'

======================================================================
ERROR: test_mmap (test_io.TestSavezLoad)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "C:\Programs\Python25\lib\site-packages\numpy\lib\tests\test_io.py", line 81, in test_mmap
    self.roundtrip(a, file_on_disk=True, load_kwds={'mmap_mode': 'r'})
  File "C:\Programs\Python25\lib\site-packages\numpy\lib\tests\test_io.py", line 94, in roundtrip
    RoundtripTest.roundtrip(self, np.savez, *args, **kwargs)
  File "C:\Programs\Python25\lib\site-packages\numpy\lib\tests\test_io.py", line 57, in roundtrip
    arr_reloaded = np.load(load_file, **load_kwds)
  File "\Programs\Python25\Lib\site-packages\numpy\lib\io.py", line 142, in load
    fid = _file(file,"rb")
IOError: [Errno 2] No such file or directory: 'c:\\docume~1\\carrasco\\locals~1\\temp\\tmphmdwar'

======================================================================
FAIL: test_array (test_io.TestSaveTxt)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "C:\Programs\Python25\lib\site-packages\numpy\lib\tests\test_io.py", line 122, in test_array
    '3.000000000000000000e+00 4.000000000000000000e+00\n'])
AssertionError

======================================================================
FAIL: Test find_duplicates
----------------------------------------------------------------------
Traceback (most recent call last):
  File "C:\Programs\Python25\lib\site-packages\numpy\lib\tests\test_recfunctions.py", line 150, in test_find_duplicates
    assert_equal(test[-1], control)
  File "numpy\ma\testutils.py", line 121, in assert_equal
  File "numpy\ma\testutils.py", line 193, in assert_array_equal
  File "numpy\ma\testutils.py", line 186, in assert_array_compare
  File "\Programs\Python25\Lib\site-packages\numpy\testing\utils.py", line 295, in assert_array_compare
    raise AssertionError(msg)
AssertionError:
Arrays are not equal
(mismatch 100.0%)
 x: array([2, 0])
 y: array([0, 2])

======================================================================
FAIL: Test the ignoremask option of find_duplicates
----------------------------------------------------------------------
Traceback (most recent call last):
  File "C:\Programs\Python25\lib\site-packages\numpy\lib\tests\test_recfunctions.py", line 181, in test_find_duplicates_ignoremask
    assert_equal(test[-1], control)
  File "numpy\ma\testutils.py", line 121, in assert_equal
  File "numpy\ma\testutils.py", line 193, in assert_array_equal
  File "numpy\ma\testutils.py", line 186, in assert_array_compare
  File "\Programs\Python25\Lib\site-packages\numpy\testing\utils.py", line 295, in assert_array_compare
    raise AssertionError(msg)
AssertionError:
Arrays are not equal
(mismatch 50.0%)
 x: array([1, 0, 3, 4])
 y: array([0, 1, 3, 4])

----------------------------------------------------------------------
Ran 1781 tests in 12.016s

FAILED (KNOWNFAIL=1, SKIP=1, errors=2, failures=3)
>>>

From nwagner at iam.uni-stuttgart.de  Sat Feb 14 11:22:43 2009
From: nwagner at iam.uni-stuttgart.de (Nils Wagner)
Date: Sat, 14 Feb 2009 17:22:43 +0100
Subject: [Numpy-discussion] astype
Message-ID:

Hi all,

How can I convert an array with string elements to an array with float entries?

>>> coord_info[:,1]
array(['0,0', '100,0', '200,0', '300,0', '400,0', '500,0', '600,0', '700,0', '800,0', '0.0', '100.0', '200.0', '300.0', '400.0', '500.0', '600.0', '700.0', '800.0', '0.0', '100.0', '200.0', '300.0', '400.0', '500.0', '600.0', '700.0', '800.0', '0.0', '100.0', '200.0', '300.0', '400.0', '500.0', '600.0', '700.0', '800.0', '0.0', '100.0', '200.0', '300.0', '400.0', '500.0', '600.0', '700.0', '800.0'], dtype='|S50')
>>> coord_info[:,1].astype(float)
Traceback (most recent call last):
  File "", line 1, in
ValueError: invalid literal for float(): 0,0

Nils

From nwagner at iam.uni-stuttgart.de  Sat Feb 14 11:27:28 2009
From: nwagner at iam.uni-stuttgart.de (Nils Wagner)
Date: Sat, 14 Feb 2009 17:27:28 +0100
Subject: [Numpy-discussion] astype
In-Reply-To:
References:
Message-ID:

On Sat, 14 Feb 2009 17:22:43 +0100 "Nils Wagner" wrote:
> Hi all,
>
> How can I convert an array with string elements to an array with float entries?
>
>>>> coord_info[:,1]
> array(['0,0', '100,0', '200,0', '300,0', '400,0', '500,0', '600,0', '700,0', '800,0', '0.0', '100.0', '200.0', '300.0', '400.0', '500.0', '600.0', '700.0', '800.0', '0.0', '100.0', '200.0', '300.0', '400.0', '500.0', '600.0', '700.0', '800.0', '0.0', '100.0', '200.0', '300.0', '400.0', '500.0', '600.0', '700.0', '800.0', '0.0', '100.0', '200.0', '300.0', '400.0', '500.0', '600.0', '700.0', '800.0'], dtype='|S50')
>
>>>> coord_info[:,1].astype(float)
> Traceback (most recent call last):
>   File "", line 1, in
> ValueError: invalid literal for float(): 0,0
>
> Nils
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

Sorry for the noise - dots and commas were mixed up in the input file.
From cournape at gmail.com  Sat Feb 14 11:28:15 2009
From: cournape at gmail.com (David Cournapeau)
Date: Sun, 15 Feb 2009 01:28:15 +0900
Subject: [Numpy-discussion] Float and locale formatting fixes merged in trunk
In-Reply-To:
References: <4996D753.1000600@ar.media.kyoto-u.ac.jp> <5b8d13220902140757o54c629bfh5dd4a9b0fb8ffe29@mail.gmail.com>
Message-ID: <5b8d13220902140828w15521565i8c33763b8f74b4ef@mail.gmail.com>

On Sun, Feb 15, 2009 at 1:04 AM, Pauli Virtanen wrote:
> Sun, 15 Feb 2009 00:57:12 +0900, David Cournapeau wrote:
>> On Sun, Feb 15, 2009 at 12:41 AM, Pauli Virtanen wrote:
>>> I note that we can't test this now on Windows, since the trunk does not build (because of the atanhf umath build error).
>>
>> Which build problem? I have built recent numpy fine.
>
> This one:
>
> http://buildbot.scipy.org/builders/Windows_XP_x86_64_MSVC/builds/875/steps/shell/logs/stdio

Note that this is Windows x64, whose support has never been on par with the 32-bit one. I will look at those. I could build both python 2.5 and 2.6 versions with mingw, and only two tests fail (unrelated to the formatting issues - those are masked array failures).

David

From pav at iki.fi  Sat Feb 14 12:21:23 2009
From: pav at iki.fi (Pauli Virtanen)
Date: Sat, 14 Feb 2009 17:21:23 +0000 (UTC)
Subject: [Numpy-discussion] Buildbot not building?
Message-ID:

Hi,

It seems that buildbot.scipy.org is not picking up the changes in the Numpy trunk. I'd guess this could be some issue with SVNPoller. At least it doesn't preserve state across buildmaster restarts, so replacing it with the following might help:

{{{
import os
from buildbot.changes.svnpoller import SVNPoller

class PersistentSVNPoller(SVNPoller):
    persist_file = "svnpoll.rev"

    def __init__(self, *a, **kw):
        self.persist_file = kw.pop('persist_file', self.persist_file)
        SVNPoller.__init__(self, *a, **kw)
        if os.path.isfile(self.persist_file):
            f = open(self.persist_file, 'r')
            try:
                self.last_change = int(f.read())
            except ValueError:
                pass
            finally:
                f.close()

    def get_new_logentries(self, *a, **kw):
        r = SVNPoller.get_new_logentries(self, *a, **kw)
        f = open(self.persist_file, 'w')
        try:
            f.write("%d" % self.last_change)
        finally:
            f.close()
        return r
}}}

-- 
Pauli Virtanen

From cournape at gmail.com  Sat Feb 14 12:34:27 2009
From: cournape at gmail.com (David Cournapeau)
Date: Sun, 15 Feb 2009 02:34:27 +0900
Subject: [Numpy-discussion] Float and locale formatting fixes merged in trunk
In-Reply-To: <5b8d13220902140828w15521565i8c33763b8f74b4ef@mail.gmail.com>
References: <4996D753.1000600@ar.media.kyoto-u.ac.jp> <5b8d13220902140757o54c629bfh5dd4a9b0fb8ffe29@mail.gmail.com> <5b8d13220902140828w15521565i8c33763b8f74b4ef@mail.gmail.com>
Message-ID: <5b8d13220902140934n478c72cfg818a63b7e4bae8df@mail.gmail.com>

On Sun, Feb 15, 2009 at 1:28 AM, David Cournapeau wrote:
>
> Note that this is Windows x64, whose support has never been on par with the 32-bit one. I will look at those.

I looked at the problem, and I have no clue about what's going on. To make things interesting, the free 64-bit compilers are horribly buggy (both V 14 and V 15), and actually segfault or give bogus parsing errors when they encounter a problem. Of course, it builds fine with the mingw-w64 compiler...

David

From stefan at sun.ac.za  Sat Feb 14 13:29:33 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Sat, 14 Feb 2009 20:29:33 +0200
Subject: [Numpy-discussion] Bilateral filter: bug corrected
In-Reply-To: <710F2847B0018641891D9A216027636029C432@ex3.envision.co.il>
References: <710F2847B0018641891D9A216027636029C431@ex3.envision.co.il> <9457e7c80902130150h4dc8755eqf6097fc0d20b6522@mail.gmail.com> <710F2847B0018641891D9A216027636029C432@ex3.envision.co.il>
Message-ID: <9457e7c80902141029w451160a2te6f7e6eb57db2d1e@mail.gmail.com>

Hi Nadav

2009/2/13 Nadav Horesh :
> I tried the installer and it did not copy bilateral.py. I tried to improve it and the result is in the attached file. I hope it would pass the mail filter, if not contact me directly to the email address below.

Thanks! I applied your changes and modified the setup.py to support in-place building. Again available here:

http://github.com/stefanv/bilateral/tree/master

Cheers
Stéfan

From charlesr.harris at gmail.com  Sat Feb 14 13:45:37 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Sat, 14 Feb 2009 11:45:37 -0700
Subject: [Numpy-discussion] Float and locale formatting fixes merged in trunk
In-Reply-To: <5b8d13220902140934n478c72cfg818a63b7e4bae8df@mail.gmail.com>
References: <4996D753.1000600@ar.media.kyoto-u.ac.jp> <5b8d13220902140757o54c629bfh5dd4a9b0fb8ffe29@mail.gmail.com> <5b8d13220902140828w15521565i8c33763b8f74b4ef@mail.gmail.com> <5b8d13220902140934n478c72cfg818a63b7e4bae8df@mail.gmail.com>
Message-ID:

On Sat, Feb 14, 2009 at 10:34 AM, David Cournapeau wrote:
> On Sun, Feb 15, 2009 at 1:28 AM, David Cournapeau wrote:
>>
>> Note that this is Windows x64, whose support has never been on par with the 32-bit one. I will look at those.
>
> I looked at the problem, and I have no clue about what's going on. To make things interesting, the free 64-bit compilers are horribly buggy (both V 14 and V 15), and actually segfault or give bogus parsing errors when they encounter a problem. Of course, it builds fine with the mingw-w64 compiler...

My workaround fixed this problem. I think David's configuration test fails for this function but it is really there, so when we define the function there is an error. The workaround was giving the numpy function a different name, npy_tanhf, then #define tanhf npy_tanhf. Since all the tanhf references are replaced before the compilation pass, the problem goes away. Ideally, the presence of tanhf should be detected, so I guess that's where the official fix should be. All the numpy tests were passing on the Windows_64 platform.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From stefan at sun.ac.za  Sat Feb 14 16:39:32 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Sat, 14 Feb 2009 23:39:32 +0200
Subject: [Numpy-discussion] Buildbot not building?
In-Reply-To:
References:
Message-ID: <9457e7c80902141339o1475e696jf8a38967871f5cae@mail.gmail.com>

Hi Pauli

2009/2/14 Pauli Virtanen :
> It seems that buildbot.scipy.org is not picking up the changes in the Numpy trunk.

Thanks for the report. I've let the system administrator know.

Once the server is able to poll from SVN, would you still recommend using the persistent poller?

Regards
Stéfan

From pav at iki.fi  Sat Feb 14 17:02:50 2009
From: pav at iki.fi (Pauli Virtanen)
Date: Sat, 14 Feb 2009 22:02:50 +0000 (UTC)
Subject: [Numpy-discussion] Buildbot not building?
References: <9457e7c80902141339o1475e696jf8a38967871f5cae@mail.gmail.com>
Message-ID:

Sat, 14 Feb 2009 23:39:32 +0200, Stéfan van der Walt wrote:
> 2009/2/14 Pauli Virtanen :
>> It seems that buildbot.scipy.org is not picking up the changes in the Numpy trunk.
>
> Thanks for the report. I've let the system administrator know.
>
> Once the server is able to poll from SVN, would you still recommend using the persistent poller?

I don't think keeping track of the last changed revision can ever be harmful. You may need to restart the service eventually, and if you use the ordinary SVNPoller, you'll AFAIK always miss the first commit after a restart.

Hmm, this actually looks like a bug in buildbot...

-- 
Pauli Virtanen

From esyepez at sandia.gov  Sat Feb 14 17:29:08 2009
From: esyepez at sandia.gov (Yepez, Esteban)
Date: Sat, 14 Feb 2009 15:29:08 -0700
Subject: [Numpy-discussion] CROSS-COMPILING NUMPY
Message-ID:

Hello,

I am having a hard time finding more information to help me, so I thought I'd ping you both.

I need to x-compile numpy for a new processor. I have a toolchain, and have x-compiled python2.5.4, boost1.37, swig1.38. I just can't figure out how to x-compile numpy. I had to play many tricks to get python configured, so I may not have everything I need. For python:
- I modified Modules/Setup to build math and other modules that my target doesn't have
- I modified configure so that it wouldn't crash when trying to run test programs
- I modified the makefile so it would not try to build sharedmods

Since numpy is built with python (python setup.py install), I figured that I could just use the host python to perform the build:

> CC=my-gcc-compiler /usr/local/bin/python setup.py install

Any help would be useful. Thanks

Steve
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From brennan.williams at visualreservoir.com  Sat Feb 14 17:30:24 2009
From: brennan.williams at visualreservoir.com (Brennan Williams)
Date: Sun, 15 Feb 2009 11:30:24 +1300
Subject: [Numpy-discussion] astype
In-Reply-To:
References:
Message-ID: <49974600.3060205@visualreservoir.com>

Nils Wagner wrote:
> On Sat, 14 Feb 2009 17:22:43 +0100 "Nils Wagner" wrote:
>> Hi all,
>>
>> How can I convert an array with string elements to an array with float entries?
>>
>>>>> coord_info[:,1]
>> array(['0,0', '100,0', '200,0', '300,0', '400,0', '500,0', '600,0', '700,0', '800,0', '0.0', '100.0', '200.0', '300.0', '400.0', '500.0', '600.0', '700.0', '800.0', '0.0', '100.0', '200.0', '300.0', '400.0', '500.0', '600.0', '700.0', '800.0', '0.0', '100.0', '200.0', '300.0', '400.0', '500.0', '600.0', '700.0', '800.0', '0.0', '100.0', '200.0', '300.0', '400.0', '500.0', '600.0', '700.0', '800.0'], dtype='|S50')
>>
>>>>> coord_info[:,1].astype(float)
>> Traceback (most recent call last):
>>   File "", line 1, in
>> ValueError: invalid literal for float(): 0,0
>>
>> Nils
>
> Sorry for the noise - dots and commas were mixed up in the input file.

Mind you, it raises an interesting point. What if the string was an amount in currency, e.g. '$1,000.00' or perhaps just '1,000.00' etc.? How would one convert a string array in that format to a float array?
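A possible sketch for the currency case, again using numpy's np.char string routines to strip the symbols before the conversion (the sample values are illustrative, and a robust parser would have to respect the locale's thousands and decimal separators):

>>> import numpy as np
>>> s = np.array(['$1,000.00', '1,000.00', '$2,500.50'])
>>> np.char.replace(np.char.lstrip(s, '$'), ',', '').astype(float)
array([ 1000. ,  1000. ,  2500.5])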
From david at ar.media.kyoto-u.ac.jp  Sun Feb 15 02:30:41 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Sun, 15 Feb 2009 16:30:41 +0900
Subject: [Numpy-discussion] CROSS-COMPILING NUMPY
In-Reply-To:
References:
Message-ID: <4997C4A1.8090307@ar.media.kyoto-u.ac.jp>

Yepez, Esteban wrote:
> Hello,
>
> I am having a hard time finding more information to help me, so I thought I'd ping you both.
>
> I need to x-compile numpy for a new processor. I have a toolchain, and have x-compiled python2.5.4, boost1.37, swig1.38. I just can't figure out how to x-compile numpy. I had to play many tricks to get python configured, so I may not have everything I need. For python:
> - I modified Modules/Setup to build math and other modules that my target doesn't have
> - I modified configure so that it wouldn't crash when trying to run test programs
> - I modified the makefile so it would not try to build sharedmods
>
> Since numpy is built with python (python setup.py install), I figured that I could just use the host python to perform the build:
>
> > CC=my-gcc-compiler /usr/local/bin/python setup.py install

To put it simply, the build process in numpy does not support cross-compilation at all, so you are mostly on your own, unfortunately.

David

From david at ar.media.kyoto-u.ac.jp  Sun Feb 15 02:48:53 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Sun, 15 Feb 2009 16:48:53 +0900
Subject: [Numpy-discussion] Core math library in numpy
Message-ID: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp>

Hi,

While trying to fix some issues on Windows 64 bits related to the float format fixes, I encountered some problems (a lack of some basic math functions, in particular isnan and co for the multiarray module). We already have portable replacement code for those functions (umath_funcs_c99.inc.src), which is used for the umath module. For now, we can only reuse this implementation by direct inclusion. I think it would be better to have a separate library with those core math routines:
- sharing by copying means more code and more compilation time.
- it cannot be shared outside numpy core.

For some code, we don't have a choice, but for math functions, I don't really see an argument against sharing: having public, portable math functions sounds useful to me. The only requirement is that we would need to mangle the names, I think, to avoid clashes with the C math library on the platforms that have them.

Would there be any other drawback? Would people be against this?

cheers,

David

From robert.kern at gmail.com  Sun Feb 15 03:09:48 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Sun, 15 Feb 2009 02:09:48 -0600
Subject: [Numpy-discussion] Core math library in numpy
In-Reply-To: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp>
References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp>
Message-ID: <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com>

On Sun, Feb 15, 2009 at 01:48, David Cournapeau wrote:
> Would people be against this?

Not I.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth."
-- Umberto Eco

From gael.varoquaux at normalesup.org  Sun Feb 15 09:04:00 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Sun, 15 Feb 2009 15:04:00 +0100
Subject: [Numpy-discussion] Sphinx custom extension mess, and patches
Message-ID: <20090215140400.GH20403@phare.normalesup.org>

Hi all,

Sorry for the multiple posting; this concerns various groups, and I'd rather the information not be lost.

While working on getting our in-lab library ready to be merged with NiPy, I ran into some sort of 'sphinx extension mess' where various sphinx extensions would have side effects on each other and, most important, the extensions did not work with sphinx trunk.

I got the side effects to be limited by cleaning up the generated code from autosummary before each run: I added the following code in my sphinx conf.py:

################################################################################
# Hack: run the autosummary generation script
import shutil
if os.path.exists('generated'):
    shutil.rmtree('generated')
os.system('%s sphinxext/autosummary_generate.py -o generated *.rst'
          % sys.executable)
################################################################################

I am attaching a diff of all the modifications I made to get the various extensions to work. I hope you can use it to get your various extensions working on sphinx trunk quicker. For the NiPy guys, I will be committing these changes in the fff2 tree soon, and we can go over this at the sprint.

This does raise a problem: this extension code is all over the place, in various repositories. Some of the code cannot live in the sphinx repo, as it introduces dependencies. However, as the extensions are not importable from Python (I can't do 'from matplotlib.sphinxext import mathmpl'), the different projects using them end up copying them into their repos, and thus there are several versions floating around, not updated. Some of the extensions do not add external dependencies to sphinx. These should be pushed into sphinx, with tests. That way, as sphinx evolves, they do not break.
Gaël
-------------- next part --------------
=== modified file 'doc/sphinxext/autosummary.py'
--- doc/sphinxext/autosummary.py	2009-02-14 18:36:49 +0000
+++ doc/sphinxext/autosummary.py	2009-02-15 13:41:08 +0000
@@ -56,7 +56,7 @@
 from docutils.statemachine import ViewList
 from docutils import nodes
-import sphinx.addnodes, sphinx.roles, sphinx.builder
+import sphinx.addnodes, sphinx.roles
 from sphinx.util import patfilter
 from docscrape_sphinx import get_doc_object
@@ -160,6 +160,7 @@
     tocnode['includefiles'] = docnames
     tocnode['maxdepth'] = -1
     tocnode['glob'] = None
+    tocnode['entries'] = []
     tocnode = autosummary_toc('', '', tocnode)
     return warnings + [node] + [tocnode]

=== modified file 'doc/sphinxext/inheritance_diagram.py'
--- doc/sphinxext/inheritance_diagram.py	2009-02-14 18:36:49 +0000
+++ doc/sphinxext/inheritance_diagram.py	2009-02-15 12:53:28 +0000
@@ -40,7 +40,10 @@
 from docutils.nodes import Body, Element
 from docutils.writers.html4css1 import HTMLTranslator
-from sphinx.latexwriter import LaTeXTranslator
+try:
+    from sphinx.latexwriter import LaTeXTranslator
+except ImportError:
+    from sphinx.writers.latex import LaTeXTranslator
 from docutils.parsers.rst import directives
 from sphinx.roles import xfileref_role

=== modified file 'doc/sphinxext/mathmpl.py'
--- doc/sphinxext/mathmpl.py	2009-02-14 18:36:49 +0000
+++ doc/sphinxext/mathmpl.py	2009-02-15 12:45:33 +0000
@@ -7,7 +7,10 @@
 from docutils import nodes
 from docutils.parsers.rst import directives
 from docutils.writers.html4css1 import HTMLTranslator
-from sphinx.latexwriter import LaTeXTranslator
+try:
+    from sphinx.latexwriter import LaTeXTranslator
+except ImportError:
+    from sphinx.writers.latex import LaTeXTranslator
 import warnings

 # Define LaTeX math node:

=== modified file 'doc/sphinxext/only_directives.py'
--- doc/sphinxext/only_directives.py	2009-02-14 18:36:49 +0000
+++ doc/sphinxext/only_directives.py	2009-02-15 12:54:47 +0000
@@ -5,7 +5,10 @@
 from docutils.nodes import Body, Element
 from docutils.writers.html4css1 import HTMLTranslator
-from sphinx.latexwriter import LaTeXTranslator
+try:
+    from sphinx.latexwriter import LaTeXTranslator
+except ImportError:
+    from sphinx.writers.latex import LaTeXTranslator
 from docutils.parsers.rst import directives

 class html_only(Body, Element):

From josef.pktd at gmail.com  Sun Feb 15 11:09:20 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Sun, 15 Feb 2009 11:09:20 -0500
Subject: [Numpy-discussion] Sphinx custom extension mess, and patches
In-Reply-To: <20090215140400.GH20403@phare.normalesup.org>
References: <20090215140400.GH20403@phare.normalesup.org>
Message-ID: <1cd32cbb0902150809q35d28a0ej9a01d74c582ef940@mail.gmail.com>

On Sun, Feb 15, 2009 at 9:04 AM, Gael Varoquaux wrote:
> Hi all,
>
> Sorry for the multiple posting; this concerns various groups, and I'd rather the information not be lost.
>
> While working on getting our in-lab library ready to be merged with NiPy, I ran into some sort of 'sphinx extension mess' where various sphinx extensions would have side effects on each other and, most important, the extensions did not work with sphinx trunk.
> I got the side effects to be limited by cleaning up the generated code from autosummary before each run: I added the following code in my sphinx conf.py:
>
> ################################################################################
> # Hack: run the autosummary generation script
> import shutil
> if os.path.exists('generated'):
>     shutil.rmtree('generated')
> os.system('%s sphinxext/autosummary_generate.py -o generated *.rst'
>           % sys.executable)
> ################################################################################
>
> I am attaching a diff of all the modifications I made to get the various extensions to work. I hope you can use it to get your various extensions working on sphinx trunk quicker. For the NiPy guys, I will be committing these changes in the fff2 tree soon, and we can go over this at the sprint.
>
> This does raise a problem: this extension code is all over the place, in various repositories. Some of the code cannot live in the sphinx repo, as it introduces dependencies. However, as the extensions are not importable from Python (I can't do 'from matplotlib.sphinxext import mathmpl'), the different projects using them end up copying them into their repos, and thus there are several versions floating around, not updated. Some of the extensions do not add external dependencies to sphinx. These should be pushed into sphinx, with tests. That way, as sphinx evolves, they do not break.
>
> Gaël

In my setup the plot directive doesn't create any graphs; it always skips plots with the following warning:

WARNING: C:\Josef\_progs\building\scipy\scipy-trunk-new-r5551\doc\source\tutorial\stats.rst:300: (ERROR/3) Error in "plot" directive: no content permitted.

When debugging this, I discovered that plot_directive.py creates the `class plot_directive(Directive)`, which doesn't seem to work. If I use the same function as in the other path of the try/except, then the graphs are created correctly. I don't know if this is specific to my version combination or if this is a bug.
>>> import docutils
>>> docutils.__version__
'0.5'
>>> import sphinx
>>> sphinx.__version__
'0.5'
>>>

Josef

-------------------------------- plot_directive.py starting line 417 ----------

try:
    from docutils.parsers.rst import Directive
except ImportError:
    from docutils.parsers.rst.directives import _directives

    def plot_directive(name, arguments, options, content, lineno,
                       content_offset, block_text, state, state_machine):
        return run(arguments, content, options, state_machine, state, lineno)
    plot_directive.__doc__ = __doc__
else:
    print 'running plot_directive class'
    #this class doesn't do anything
##    class plot_directive(Directive):
##        def run(self):
##            return run(self.arguments, self.content, self.options,
##                       self.state_machine, self.state, self.lineno)

    #copied from above
    def plot_directive(name, arguments, options, content, lineno,
                       content_offset, block_text, state, state_machine):
        return run(arguments, content, options, state_machine, state, lineno)
    plot_directive.__doc__ = __doc__

From pav at iki.fi  Sun Feb 15 12:09:24 2009
From: pav at iki.fi (Pauli Virtanen)
Date: Sun, 15 Feb 2009 17:09:24 +0000 (UTC)
Subject: [Numpy-discussion] Sphinx custom extension mess, and patches
References: <20090215140400.GH20403@phare.normalesup.org> <1cd32cbb0902150809q35d28a0ej9a01d74c582ef940@mail.gmail.com>
Message-ID:

Sun, 15 Feb 2009 11:09:20 -0500, josef.pktd wrote:
[clip]
> In my setup the plot directive doesn't create any graphs; it always skips plots with the following warning:
>
> WARNING: C:\Josef\_progs\building\scipy\scipy-trunk-new-r5551\doc\source\tutorial\stats.rst:300: (ERROR/3) Error in "plot" directive: no content permitted.

Docutils 0.5 incompatibility, fixed in trunk.

Pauli

From josef.pktd at gmail.com  Sun Feb 15 13:52:07 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Sun, 15 Feb 2009 13:52:07 -0500
Subject: [Numpy-discussion] Sphinx custom extension mess, and patches
In-Reply-To:
References: <20090215140400.GH20403@phare.normalesup.org> <1cd32cbb0902150809q35d28a0ej9a01d74c582ef940@mail.gmail.com>
Message-ID: <1cd32cbb0902151052o7574ee69p7795a2e15896d9b0@mail.gmail.com>

On Sun, Feb 15, 2009 at 12:09 PM, Pauli Virtanen wrote:
> Sun, 15 Feb 2009 11:09:20 -0500, josef.pktd wrote:
> [clip]
>> In my setup the plot directive doesn't create any graphs; it always skips plots with the following warning:
>>
>> WARNING: C:\Josef\_progs\building\scipy\scipy-trunk-new-r5551\doc\source\tutorial\stats.rst:300: (ERROR/3) Error in "plot" directive: no content permitted.
>
> Docutils 0.5 incompatibility, fixed in trunk.

Thanks, this works.

Is there a way to prevent the following text from flowing around the included graph (produced with the plot directive)? I have two empty lines after the plot directive to start a new paragraph, but the text flows to the right of the graph.

Josef

From josef.pktd at gmail.com  Sun Feb 15 19:53:01 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Sun, 15 Feb 2009 19:53:01 -0500
Subject: [Numpy-discussion] Sphinx custom extension mess, and patches
In-Reply-To: <1cd32cbb0902151052o7574ee69p7795a2e15896d9b0@mail.gmail.com>
References: <20090215140400.GH20403@phare.normalesup.org> <1cd32cbb0902150809q35d28a0ej9a01d74c582ef940@mail.gmail.com> <1cd32cbb0902151052o7574ee69p7795a2e15896d9b0@mail.gmail.com>
Message-ID: <1cd32cbb0902151653r46918bbdn6508f8040ca34e94@mail.gmail.com>

I tracked down one more Windows-specific bug in the plot directive.
Initially the link to the source file for a graph generated by the plot_directive didn't work (literal text instead of a link). The cause was that the source link contains `\\` as separators and not `/`.

The fix is in the call to jinja: replace os.path.sep for the source_link in the same way as is done for the link_dir. After this change, the source links render correctly in the html output.

result = jinja.from_string(TEMPLATE).render(
    link_dir=link_dir.replace(os.path.sep, '/'),
    # change os.path.sep on windows
    source_link=source_link.replace(os.path.sep, '/'),
    options=opts,
    image_names=image_names,
    source_code=source_code)

The generated or copied source files for the plots do not contain any file extension; in my changes I also added the '.py' extension back to the filename. Is there a reason for dropping the file extension?

Josef

From mdroe at stsci.edu  Mon Feb 16 08:54:29 2009
From: mdroe at stsci.edu (Michael Droettboom)
Date: Mon, 16 Feb 2009 08:54:29 -0500
Subject: [Numpy-discussion] [matplotlib-devel] Sphinx custom extension mess, and patches
In-Reply-To: <20090215140400.GH20403@phare.normalesup.org>
References: <20090215140400.GH20403@phare.normalesup.org>
Message-ID: <49997015.9070108@stsci.edu>

Gael,

You raise a very good point about the duplicated code floating around. As a case in point, the patches you provided no longer apply to the "canonical" (or at least original) versions of the plugins that began life in matplotlib. Recent versions of Sphinx have a proper extension API, so that "try ... except" importing is no longer necessary.

For mathmpl.py, I will move that into the matplotlib installed code tree, so, as you suggest, "from matplotlib.sphinxext import mathmpl" will work. Obviously, that won't really provide traction until the next release of matplotlib. And it relies on only_directive.py, so I will probably have to put that in matplotlib as well for now, though that is a reasonable candidate for inclusion in Sphinx.

I have submitted inheritance_diagram.py to Sphinx already. There didn't seem to be much interest, there were some details (particularly how it deals with files) that weren't "the Sphinx way", and there were few documents and/or examples etc. at the time about how to do it right. Someone else ended up re-engineering it to use pygraphviz, which felt pretty heavyweight to me. So the thing kind of stalled. But it's probably worth another push.

Mike

Gael Varoquaux wrote:
> Hi all,
>
> Sorry for the multiple posting; this concerns various groups, and I'd rather the information not be lost.
>
> While working on getting our in-lab library ready to be merged with NiPy, I ran into some sort of 'sphinx extension mess' where various sphinx extensions would have side effects on each other and, most important, the extensions did not work with sphinx trunk.
>
> I got the side effects to be limited by cleaning up the generated code from autosummary before each run: I added the following code in my sphinx conf.py:
>
> ################################################################################
> # Hack: run the autosummary generation script
> import shutil
> if os.path.exists('generated'):
>     shutil.rmtree('generated')
> os.system('%s sphinxext/autosummary_generate.py -o generated *.rst'
>           % sys.executable)
> ################################################################################
>
> I am attaching a diff of all the modifications I made to get the various extensions to work. I hope you can use it to get your various extensions working on sphinx trunk quicker. For the NiPy guys, I will be committing these changes in the fff2 tree soon, and we can go over this at the sprint.
>
> This does raise a problem: this extension code is all over the place, in various repositories. Some of the code cannot live in the sphinx repo, as it introduces dependencies. However, as the extensions are not importable from Python (I can't do 'from matplotlib.sphinxext import mathmpl'), the different projects using them end up copying them into their repos, and thus there are several versions floating around, not updated. Some of the extensions do not add external dependencies to sphinx. These should be pushed into sphinx, with tests. That way, as sphinx evolves, they do not break.
>
> Gaël

-- 
Michael Droettboom
Science Software Branch
Operations and Engineering Division
Space Telescope Science Institute
Operated by AURA for NASA

From axel.breuer at bnpparibas.com  Mon Feb 16 09:04:04 2009
From: axel.breuer at bnpparibas.com (axel.breuer at bnpparibas.com)
Date: Mon, 16 Feb 2009 15:04:04 +0100
Subject: [Numpy-discussion] numpy.linalg or scipy.linalg ?
Message-ID:

Hi,

Which one is better, numpy.linalg or scipy.linalg?

These modules have some common functions but with different help lines. Furthermore, scipy.linalg.svd seems to be more stable than numpy.linalg.svd (are they using different LAPACK methods?)
From david at ar.media.kyoto-u.ac.jp Mon Feb 16 08:57:41 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Mon, 16 Feb 2009 22:57:41 +0900 Subject: [Numpy-discussion] numpy.linalg or scipy.linalg ? In-Reply-To: References: Message-ID: <499970D5.1050502@ar.media.kyoto-u.ac.jp> Hi Axel, axel.breuer at bnpparibas.com wrote: > Hi, > > Which one is better numpy.linalg or scipy.linalg ? > scipy.linalg is more complete, but both numpy and scipy may use the underlying LAPACK available on your platform. > These modules have some common functions but with different > help lines. > Yes, there are some redundancy, unfortunately, mostly for historical reasons I believe. > Furthermore, numpy.scipy.linalg.svd seems to be more stable > than numpy.linalg.svd (are they using different LAPACK methods ?) > If both numpy and scipy were built on a machine with BLAS/LAPACK, they should use the same implementation. One difference is that numpy can be built without any blas/lapack (and then uses its own implementation). Which platform are you on ? Did you build numpy/scipy by yourself, or did you use a pre-built package ? There are many possible different configurations, unfortunately. You can also check the underlying detected implementation with show_config: import numpy as np import scipy as sc print np.show_config() print sc.show_config() cheers, David From esyepez at sandia.gov Mon Feb 16 10:23:19 2009 From: esyepez at sandia.gov (Yepez, Esteban) Date: Mon, 16 Feb 2009 08:23:19 -0700 Subject: [Numpy-discussion] CROSS-COMPILING NUMPY In-Reply-To: <4997C4A1.8090307@ar.media.kyoto-u.ac.jp> Message-ID: Are there any patches from previous versions? Can provide any other suggestions? On 2/15/09 12:30 AM, "David Cournapeau" wrote: Yepez, Esteban wrote: > Hello, > > I have having a hard time finding more information to help me, so I > thought I'd ping you both. > > I need to x-compile numpy for a new processor. I have a toolchain, and > have x-compiled python2.5.4, boost1.37,swig1.38. I just can't figure > out how to x-compile numpy. Now I had to play many tricks to get > python configured, so I may not have everything I need: > For python, > - I modified the Modules/Setup to build math and other modules that my > target doesn't have > - I modified configure so that it wouldn't crash when trying to run > test programs > - I modified the makefile so it would not try to build sharedmods > > Since Numpy uses python: python setup.py install, I figured that I > could just use the host python to perform the build: > >CC=my-gcc-compiler /usr/local/bin/python setup.py install To put it simply, the build process in numpy does not support cross-compilation at all, so you are mostly on your own, unfortunately, David _______________________________________________ Numpy-discussion mailing list Numpy-discussion at scipy.org http://projects.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From josef.pktd at gmail.com Mon Feb 16 14:27:41 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Mon, 16 Feb 2009 14:27:41 -0500 Subject: [Numpy-discussion] [matplotlib-devel] Sphinx custom extension mess, and patches In-Reply-To: <49997015.9070108@stsci.edu> References: <20090215140400.GH20403@phare.normalesup.org> <49997015.9070108@stsci.edu> Message-ID: <1cd32cbb0902161127y359ee483scc4e2a4967b1f134@mail.gmail.com> another docutils 0.5 bug in the plot_directive (align doesn't work): from docutils.parsers.rst.directives.images import Image align = Image.align Image.align is not callable, see http://svn.berlios.de/viewcvs/docutils/trunk/docutils/docutils/parsers/rst/directives/images.py?view=markup I replaced it with: def _option_align(arg): return directives.choice(arg, ("top", "middle", "bottom", "left", "center", "right")) from docutils.parsers.rst import directives try: # docutils 0.4 from docutils.parsers.rst.directives.images import align except ImportError: # docutils 0.5 #from docutils.parsers.rst.directives.images import Image align = _option_align # instead of Image.align This seems to work correctly, e.g. align="center" is added to html. The problem with the following text floating around the graph, is a problem with Internet Explorer 6 and the created htmlhelp but not with Firefox. As a remedy, I removed the float option in div.plot-output .figure in scipy.css and now the htmlhelp doesn't have a floating text anymore. Josef From pav at iki.fi Mon Feb 16 14:35:45 2009 From: pav at iki.fi (Pauli Virtanen) Date: Mon, 16 Feb 2009 19:35:45 +0000 (UTC) Subject: [Numpy-discussion] [matplotlib-devel] Sphinx custom extension mess, and patches References: <20090215140400.GH20403@phare.normalesup.org> <49997015.9070108@stsci.edu> <1cd32cbb0902161127y359ee483scc4e2a4967b1f134@mail.gmail.com> Message-ID: Mon, 16 Feb 2009 14:27:41 -0500, josef.pktd wrote: > another docutils 0.5 bug in the plot_directive (align doesn't work): > > from docutils.parsers.rst.directives.images import Image > align = Image.align > > Image.align is not callable, [clip] Strange, it didn't do that to me with docutils 0.5. Anyway, probably better fix it anyway. [clip] > The problem with the following text floating around the graph, is a > problem with Internet Explorer 6 and the created htmlhelp but not with > Firefox. As a remedy, I removed the float option in div.plot-output > .figure in scipy.css and now the htmlhelp doesn't have a floating text > anymore. Yep, IE6 doesn't seem to understand the :after CSS attribute. But it seems to be added in IE8, if that's any consolation :) Ah, the joys of HTML/CSS design. I wonder if there's an easy workaround to this... -- Pauli Virtanen From robert.kern at gmail.com Mon Feb 16 14:36:07 2009 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 16 Feb 2009 13:36:07 -0600 Subject: [Numpy-discussion] Sphinx custom extension mess, and patches In-Reply-To: <20090215140400.GH20403@phare.normalesup.org> References: <20090215140400.GH20403@phare.normalesup.org> Message-ID: <3d375d730902161136w7ffdb5dcld8af63afe841a66c@mail.gmail.com> On Sun, Feb 15, 2009 at 08:04, Gael Varoquaux wrote: > Hi all, > > Sorry for the multiple posting, this concerns various groups, and I'd > rather the information not be lost. 
> While working on getting our in-lab library ready to be merged with NiPy, I ran into some sort of 'sphinx extension mess' where various sphinx extensions would have side effects on each other and, most important, the extensions did not work with sphinx trunk.

At one point, some of us had a plan to keep all of these "scientific" extensions collected here:

http://sphinx.googlecode.com/svn/contrib/trunk/numpyext/

SVN-using projects could use svn:externals to include these in their projects without diverging the code. I really don't know why this plan changed. Pauli?

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From kbasye1 at jhu.edu  Mon Feb 16 18:06:46 2009
From: kbasye1 at jhu.edu (Ken Basye)
Date: Mon, 16 Feb 2009 18:06:46 -0500
Subject: [Numpy-discussion] Compute multiple outer products without a loop?
Message-ID: <4999F186.2010805@jhu.edu>

Hi List,

I need to compute multiple outer products from 2-d data in the following way: given a and b with shape, e.g., (10, 4), compute the 10 outer products of shape (4, 4) and get them into an array of shape (10, 4, 4). Currently I do this with a loop, but I'd really like some way to do it without looping. I read that outer(a, b) is just syntactic sugar for

a.ravel()[:, newaxis] * b.ravel()[newaxis, :]

but unfortunately this didn't give me any bright ideas :->.

Thanks,
Ken

From charlesr.harris at gmail.com  Mon Feb 16 18:16:03 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Mon, 16 Feb 2009 16:16:03 -0700
Subject: [Numpy-discussion] Compute multiple outer products without a loop?
In-Reply-To: <4999F186.2010805@jhu.edu>
References: <4999F186.2010805@jhu.edu>
Message-ID:

On Mon, Feb 16, 2009 at 4:06 PM, Ken Basye wrote:
> Hi List,
>
> I need to compute multiple outer products from 2-d data in the following way: given a and b with shape, e.g., (10, 4), compute the 10 outer products of shape (4, 4) and get them into an array of shape (10, 4, 4). Currently I do this with a loop, but I'd really like some way to do it without looping.

I think you can do what you want with newaxis and broadcasting, but I'm not sure what you want ;) Could you post your current code and a small example?

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From gael.varoquaux at normalesup.org  Mon Feb 16 18:21:06 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Tue, 17 Feb 2009 00:21:06 +0100
Subject: [Numpy-discussion] Sphinx custom extension mess, and patches
In-Reply-To: <4999F3FD.4020806@python.org>
References: <20090215140400.GH20403@phare.normalesup.org> <4999F3FD.4020806@python.org>
Message-ID: <20090216232106.GC24472@phare.normalesup.org>

On Tue, Feb 17, 2009 at 12:17:17AM +0100, Georg Brandl wrote:
> I'm all for it. In the case of autosummary, I'm guilty of not getting it in sooner. This will change soon. In other cases, I don't even know of the extension, probably because those who write it deem it too project-specific to contribute.
> I don't ask for too much if an extension is contributed, so by all means do at least post about your extensions!

I am not blaming anyone, just pointing out a non-ideal situation.
It has already improved a lot, with the matplotlib guys and the scipy guys merging some changes in extensions and publishing the extensions in an importable part of their source trees. It is true that I'll be happier when I can import the only_directive and the auto_summary from sphinx :).

Thanks for your great work,

Gaël

From cournape at gmail.com  Mon Feb 16 22:37:02 2009
From: cournape at gmail.com (David Cournapeau)
Date: Tue, 17 Feb 2009 12:37:02 +0900
Subject: [Numpy-discussion] CROSS-COMPILING NUMPY
In-Reply-To:
References: <4997C4A1.8090307@ar.media.kyoto-u.ac.jp>
Message-ID: <5b8d13220902161937x5f2b1932i3412c0e90c8c6108@mail.gmail.com>

On Tue, Feb 17, 2009 at 12:23 AM, Yepez, Esteban wrote:
> Are there any patches from previous versions?

Do you mean for python itself or numpy? For numpy, not that I know of. For old python, there are some patches in the tracker, but none of them have been integrated AFAIK.

> Can you provide any other suggestions?

It is difficult to provide general guidelines: distutils, the tool we are using for compiling numpy, has never been conceived with cross compilation in mind. One immediate problem is that distutils uses python to get compilation flags - you would like to use the options of the target python, but that's hard to do since you cannot execute it. Also, some configuration tests need to be executed on the target machine: this needs to be fixed.

You may be able to do it using many hacks, but doing it in a maintainable and clean way (which would end up in integrated patches) would be a lot of work, I think.

David

From jordan at math.ucsb.edu  Mon Feb 16 22:49:01 2009
From: jordan at math.ucsb.edu (jordan at math.ucsb.edu)
Date: Mon, 16 Feb 2009 19:49:01 -0800 (PST)
Subject: [Numpy-discussion] GMRES internal variables
Message-ID: <60880.189.62.54.78.1234842541.squirrel@mail.math.ucsb.edu>

Hi all,

I'm trying to run multiple instances of GMRES at the same time (one inside another actually, used inside the preconditioner routine) but am running into some problems. My guess is there is a single set of internal variables associated with GMRES, and when I initiate a new GMRES inside a currently running GMRES, the old GMRES data gets the boot.

Is this correct, and if so, any suggestions on how to fix this?

Cheers,
Jordan

From wnbell at gmail.com  Tue Feb 17 02:15:57 2009
From: wnbell at gmail.com (Nathan Bell)
Date: Tue, 17 Feb 2009 02:15:57 -0500
Subject: [Numpy-discussion] GMRES internal variables
In-Reply-To: <60880.189.62.54.78.1234842541.squirrel@mail.math.ucsb.edu>
References: <60880.189.62.54.78.1234842541.squirrel@mail.math.ucsb.edu>
Message-ID:

On Mon, Feb 16, 2009 at 10:49 PM, wrote:
>
> I'm trying to run multiple instances of GMRES at the same time (one inside another actually, used inside the preconditioner routine) but am running into some problems. My guess is there is a single set of internal variables associated with GMRES, and when I initiate a new GMRES inside a currently running GMRES, the old GMRES data gets the boot.
>
> Is this correct, and if so, any suggestions on how to fix this?
From schut at sarvision.nl Tue Feb 17 05:04:08 2009
From: schut at sarvision.nl (Vincent Schut)
Date: Tue, 17 Feb 2009 11:04:08 +0100
Subject: [Numpy-discussion] fancy view question
Message-ID:

Hi list,

would it be possible to create a view on an array, such that this view is twice as large (in some dimensions) and in fact does a nearest neighbour 'zoom' on the original array? E.g. using some fancy slicing/striding tricks?

an example:

a = [[1, 2],
     [3, 4]]

then I'd like a view on a such that this view is:

[[1, 1, 2, 2],
 [1, 1, 2, 2],
 [3, 3, 4, 4],
 [3, 3, 4, 4]]

I know several ways to create this as a copy, but as memory does play a role in this case (yes, I know it's cheap :-))... If not, fancy ways to create this as a copy are appreciated. I currently either create an empty array and then loop-fill it (e.g. this would be 2x2 loops), or if I'm lazy I use ndimage.zoom, which actually is a bit overkill.

Regards,
Vincent.

From washakie at gmail.com Tue Feb 17 05:09:12 2009
From: washakie at gmail.com (John [H2O])
Date: Tue, 17 Feb 2009 02:09:12 -0800 (PST)
Subject: [Numpy-discussion] efficient function calculation between two matrices
Message-ID: <22054148.post@talk.nabble.com>

hello,

I am trying to calculate the results of a function between two matrices:

>>> F.shape
(170, 2)
>>> T.shape
(170, 481, 2)

Where F contains lat/lon pairs and T contains 481 lat/lon pairs for 170 trajectories of length 481. I want a new array of shape (170, 481) containing the results of my distance function:

def gcd(x1, y1, x2, y2):
    """ returns great circle distance """
    ...
    return dist

Is there a way to do this without the use of loops?

I'm thinking something along the lines of:

distances = [[gcd(f[0], f[1], t[0], t[1]) for f in F] for t in T]

But obviously this doesn't work...

Thank you,
john

--
View this message in context: http://www.nabble.com/efficient-function-calculation-between-two-matrices-tp22054148p22054148.html
Sent from the Numpy-discussion mailing list archive at Nabble.com.

From josef.pktd at gmail.com Tue Feb 17 09:13:42 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Tue, 17 Feb 2009 09:13:42 -0500
Subject: [Numpy-discussion] fancy view question
In-Reply-To:
References:
Message-ID: <1cd32cbb0902170613x61952244k6b0653646b6a476d@mail.gmail.com>

On Tue, Feb 17, 2009 at 5:04 AM, Vincent Schut wrote:
> Hi list,
>
> would it be possible to create a view on an array, such that this view
> is twice as large (in some dimensions) and in fact does a nearest
> neighbour 'zoom' on the original array? E.g. using some fancy
> slicing/striding tricks?
>
> an example:
>
> a = [[1, 2],
>      [3, 4]]
>
> then I'd like a view on a such that this view is:
>
> [[1, 1, 2, 2],
>  [1, 1, 2, 2],
>  [3, 3, 4, 4],
>  [3, 3, 4, 4]]
>
> I know several ways to create this as a copy, but as memory does play a
> role in this case (yes, I know it's cheap :-))... If not, fancy ways to
> create this as a copy are appreciated. I currently either create an
> empty array and then loop-fill it (e.g. this would be 2x2 loops), or if
> I'm lazy I use ndimage.zoom, which actually is a bit overkill.
>
> Regards,
> Vincent.

I don't know about the view, but your array just looks like a kronecker product to me.
>>> import numpy as np
>>> np.kron([[1, 2], [3, 4]], [[1, 1], [1, 1]])
array([[1, 1, 2, 2],
       [1, 1, 2, 2],
       [3, 3, 4, 4],
       [3, 3, 4, 4]])

Josef

From stefan at sun.ac.za Tue Feb 17 09:31:41 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Tue, 17 Feb 2009 16:31:41 +0200
Subject: [Numpy-discussion] fancy view question
In-Reply-To:
References:
Message-ID: <9457e7c80902170631n1c0af5d4gea5560ec0315e1ca@mail.gmail.com>

Hi Vincent

2009/2/17 Vincent Schut :
> Hi list,
>
> would it be possible to create a view on an array, such that this view
> is twice as large (in some dimensions) and in fact does a nearest
> neighbour 'zoom' on the original array? E.g. using some fancy
> slicing/striding tricks?
>
> an example:
>
> a = [[1, 2],
>      [3, 4]]
>
> then I'd like a view on a such that this view is:
>
> [[1, 1, 2, 2],
>  [1, 1, 2, 2],
>  [3, 3, 4, 4],
>  [3, 3, 4, 4]]

np.lib.stride_tricks.as_strided(x, (2, 2, 2, 2), (8, 0, 4, 0)).reshape((4, 4))

Cheers
Stéfan

From stefan at sun.ac.za Tue Feb 17 09:42:21 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Tue, 17 Feb 2009 16:42:21 +0200
Subject: [Numpy-discussion] fancy view question
In-Reply-To: <9457e7c80902170631n1c0af5d4gea5560ec0315e1ca@mail.gmail.com>
References: <9457e7c80902170631n1c0af5d4gea5560ec0315e1ca@mail.gmail.com>
Message-ID: <9457e7c80902170642p6b5a6bd8u893e59283d5d066c@mail.gmail.com>

2009/2/17 Stéfan van der Walt :
> 2009/2/17 Vincent Schut :
>> Hi list,
>>
>> would it be possible to create a view on an array, such that this view
>> is twice as large (in some dimensions) and in fact does a nearest
>> neighbour 'zoom' on the original array? E.g. using some fancy
>> slicing/striding tricks?
>>
>> an example:
>>
>> a = [[1, 2],
>>      [3, 4]]
>>
>> then I'd like a view on a such that this view is:
>>
>> [[1, 1, 2, 2],
>>  [1, 1, 2, 2],
>>  [3, 3, 4, 4],
>>  [3, 3, 4, 4]]
>
> np.lib.stride_tricks.as_strided(x, (2, 2, 2, 2), (8, 0, 4, 0)).reshape((4, 4))

Or, more generally:

import numpy as np

def zoom(x, factor=2):
    rows, cols = x.shape
    row_stride, col_stride = x.strides
    view = np.lib.stride_tricks.as_strided(x,
               (rows, factor, cols, factor),
               (row_stride, 0, col_stride, 0))
    return view.reshape((rows*factor, cols*factor))

a = np.array([[1, 2, 3],
              [4, 5, 6]])

print zoom(a, 2)

Cheers
Stéfan

From gael.varoquaux at normalesup.org Tue Feb 17 09:44:49 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Tue, 17 Feb 2009 15:44:49 +0100
Subject: [Numpy-discussion] fancy view question
In-Reply-To: <9457e7c80902170642p6b5a6bd8u893e59283d5d066c@mail.gmail.com>
References: <9457e7c80902170631n1c0af5d4gea5560ec0315e1ca@mail.gmail.com> <9457e7c80902170642p6b5a6bd8u893e59283d5d066c@mail.gmail.com>
Message-ID: <20090217144449.GM17638@phare.normalesup.org>

On Tue, Feb 17, 2009 at 04:42:21PM +0200, Stéfan van der Walt wrote:
> Or, more generally:

> import numpy as np

> def zoom(x, factor=2):
>     rows, cols = x.shape
>     row_stride, col_stride = x.strides
>     view = np.lib.stride_tricks.as_strided(x,
>                (rows, factor, cols, factor),
>                (row_stride, 0, col_stride, 0))
>     return view.reshape((rows*factor, cols*factor))

That's handy, you should commit this somewhere. Actually, it would be even cooler if you could have different zoom factors in different directions :).

Gaël
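For readers puzzling over the magic numbers in the one-liner above: (8, 0, 4, 0) assumes a C-contiguous 2x2 array of 4-byte elements, whose natural strides are (8, 4); the zero strides are what repeat each element without copying. A small sketch that derives the strides instead of hard-coding them (keeping in mind that as_strided does no bounds checking, so a wrong shape/strides pair will happily read outside the buffer):

import numpy as np

x = np.array([[1, 2], [3, 4]], dtype=np.int32)
print x.strides    # (8, 4) for this C-contiguous int32 array

# zero strides along the two inserted axes repeat each element in place
rs, cs = x.strides
v = np.lib.stride_tricks.as_strided(x, (2, 2, 2, 2), (rs, 0, cs, 0))
print v.reshape((4, 4))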
From stefan at sun.ac.za Tue Feb 17 09:58:57 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Tue, 17 Feb 2009 16:58:57 +0200
Subject: [Numpy-discussion] fancy view question
In-Reply-To: <20090217144449.GM17638@phare.normalesup.org>
References: <9457e7c80902170631n1c0af5d4gea5560ec0315e1ca@mail.gmail.com> <9457e7c80902170642p6b5a6bd8u893e59283d5d066c@mail.gmail.com> <20090217144449.GM17638@phare.normalesup.org>
Message-ID: <9457e7c80902170658n3b9834f4i63ba74474dbb90da@mail.gmail.com>

2009/2/17 Gael Varoquaux :
> That's handy, you should commit this somewhere. Actually, it would be
> even cooler if you could have different zoom factors in different
> directions :).

Something like this:

a = np.array([[1, 2, 3],
              [4, 5, 6]])
print a
print zoom(a, x=2, y=3)

[[1 2 3]
 [4 5 6]]
[[1 1 2 2 3 3]
 [1 1 2 2 3 3]
 [1 1 2 2 3 3]
 [4 4 5 5 6 6]
 [4 4 5 5 6 6]
 [4 4 5 5 6 6]]

(Code attached)

Cheers
Stéfan

-------------- next part --------------
A non-text attachment was scrubbed...
Name: zoom.py
Type: text/x-python-script
Size: 744 bytes
Desc: not available
URL:

From gael.varoquaux at normalesup.org Tue Feb 17 10:01:13 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Tue, 17 Feb 2009 16:01:13 +0100
Subject: [Numpy-discussion] fancy view question
In-Reply-To: <9457e7c80902170658n3b9834f4i63ba74474dbb90da@mail.gmail.com>
References: <9457e7c80902170631n1c0af5d4gea5560ec0315e1ca@mail.gmail.com> <9457e7c80902170642p6b5a6bd8u893e59283d5d066c@mail.gmail.com> <20090217144449.GM17638@phare.normalesup.org> <9457e7c80902170658n3b9834f4i63ba74474dbb90da@mail.gmail.com>
Message-ID: <20090217150113.GO17638@phare.normalesup.org>

On Tue, Feb 17, 2009 at 04:58:57PM +0200, Stéfan van der Walt wrote:
> 2009/2/17 Gael Varoquaux :
> > That's handy, you should commit this somewhere. Actually, it would be
> > even cooler if you could have different zoom factors in different
> > directions :).

> Something like this:

> a = np.array([[1, 2, 3],
>               [4, 5, 6]])
> print a
> print zoom(a, x=2, y=3)

Exactly, but with a signature that accepts an integer, or a tuple of integers. And committing this in e.g. stride_tricks.

Gaël

From josef.pktd at gmail.com Tue Feb 17 10:01:21 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Tue, 17 Feb 2009 10:01:21 -0500
Subject: [Numpy-discussion] fancy view question
In-Reply-To: <20090217144449.GM17638@phare.normalesup.org>
References: <9457e7c80902170631n1c0af5d4gea5560ec0315e1ca@mail.gmail.com> <9457e7c80902170642p6b5a6bd8u893e59283d5d066c@mail.gmail.com> <20090217144449.GM17638@phare.normalesup.org>
Message-ID: <1cd32cbb0902170701jea399d3l8968d1b8d5970740@mail.gmail.com>

On Tue, Feb 17, 2009 at 9:44 AM, Gael Varoquaux wrote:
> On Tue, Feb 17, 2009 at 04:42:21PM +0200, Stéfan van der Walt wrote:
>> Or, more generally:
>
>> import numpy as np
>
>> def zoom(x, factor=2):
>>     rows, cols = x.shape
>>     row_stride, col_stride = x.strides
>>     view = np.lib.stride_tricks.as_strided(x,
>>                (rows, factor, cols, factor),
>>                (row_stride, 0, col_stride, 0))
>>     return view.reshape((rows*factor, cols*factor))
>
> That's handy, you should commit this somewhere. Actually, it would be
> even cooler if you could have different zoom factors in different
> directions :).

I completely agree. I was looking at the help file and docs, and I didn't find it. np.lib.stride_tricks.as_strided is not in np.lib.stride_tricks.__all__ and not in the docs. So it's still a hidden treasure.

Josef
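The zoom.py attachment above was scrubbed by the archiver. Pieced together from the single-factor version earlier in the thread and the outputs shown in the surrounding messages, it presumably looked roughly like the sketch below -- a reconstruction, not Stéfan's actual file. Note that x scales axis 1 (columns) and y scales axis 0 (rows), which is what Josef's next message is about:

import numpy as np

def zoom(a, x=2, y=2):
    # y repeats along axis 0 (rows), x along axis 1 (columns)
    rows, cols = a.shape
    row_stride, col_stride = a.strides
    view = np.lib.stride_tricks.as_strided(
        a, (rows, y, cols, x), (row_stride, 0, col_stride, 0))
    # the final reshape cannot be expressed with strides, so it copies
    return view.reshape((rows * y, cols * x))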
From josef.pktd at gmail.com Tue Feb 17 10:08:56 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Tue, 17 Feb 2009 10:08:56 -0500
Subject: [Numpy-discussion] fancy view question
In-Reply-To: <20090217150113.GO17638@phare.normalesup.org>
References: <9457e7c80902170631n1c0af5d4gea5560ec0315e1ca@mail.gmail.com> <9457e7c80902170642p6b5a6bd8u893e59283d5d066c@mail.gmail.com> <20090217144449.GM17638@phare.normalesup.org> <9457e7c80902170658n3b9834f4i63ba74474dbb90da@mail.gmail.com> <20090217150113.GO17638@phare.normalesup.org>
Message-ID: <1cd32cbb0902170708k6a6a43d9o433c5aa09d3f76b@mail.gmail.com>

On Tue, Feb 17, 2009 at 10:01 AM, Gael Varoquaux wrote:
> On Tue, Feb 17, 2009 at 04:58:57PM +0200, Stéfan van der Walt wrote:
>> 2009/2/17 Gael Varoquaux :
>> > That's handy, you should commit this somewhere. Actually, it would be
>> > even cooler if you could have different zoom factors in different
>> > directions :).
>
>> Something like this:
>
>> a = np.array([[1, 2, 3],
>>               [4, 5, 6]])
>> print a
>> print zoom(a, x=2, y=3)
>
> Exactly, but with a signature that accepts an integer, or a tuple of
> integers. And committing this in e.g. stride_tricks.
>
> Gaël

I think rows and columns are in the wrong sequence (axis=0, axis=1)?

>>> zoom(a, 2,3)
array([[1, 1, 2, 2, 3, 3],
       [1, 1, 2, 2, 3, 3],
       [1, 1, 2, 2, 3, 3],
       [4, 4, 5, 5, 6, 6],
       [4, 4, 5, 5, 6, 6],
       [4, 4, 5, 5, 6, 6]])
>>> zoom(a, 3,2)
array([[1, 1, 1, 2, 2, 2, 3, 3, 3],
       [1, 1, 1, 2, 2, 2, 3, 3, 3],
       [4, 4, 4, 5, 5, 5, 6, 6, 6],
       [4, 4, 4, 5, 5, 5, 6, 6, 6]])

Josef

From robert.kern at gmail.com Tue Feb 17 10:09:38 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 17 Feb 2009 09:09:38 -0600
Subject: [Numpy-discussion] fancy view question
In-Reply-To: <20090217144449.GM17638@phare.normalesup.org>
References: <9457e7c80902170631n1c0af5d4gea5560ec0315e1ca@mail.gmail.com> <9457e7c80902170642p6b5a6bd8u893e59283d5d066c@mail.gmail.com> <20090217144449.GM17638@phare.normalesup.org>
Message-ID: <3d375d730902170709t15acfa51qbfedc5be403d6e95@mail.gmail.com>

On Tue, Feb 17, 2009 at 08:44, Gael Varoquaux wrote:
> On Tue, Feb 17, 2009 at 04:42:21PM +0200, Stéfan van der Walt wrote:
>> Or, more generally:
>
>> import numpy as np
>
>> def zoom(x, factor=2):
>>     rows, cols = x.shape
>>     row_stride, col_stride = x.strides
>>     view = np.lib.stride_tricks.as_strided(x,
>>                (rows, factor, cols, factor),
>>                (row_stride, 0, col_stride, 0))
>>     return view.reshape((rows*factor, cols*factor))
>
> That's handy, you should commit this somewhere. Actually, it would be
> even cooler if you could have different zoom factors in different
> directions :).

np.repeat(np.repeat(x, 2, axis=0), 2, axis=1)

stride_tricks are fun, but this is already a solved problem in numpy.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From david.huard at gmail.com Tue Feb 17 10:10:32 2009
From: david.huard at gmail.com (David Huard)
Date: Tue, 17 Feb 2009 10:10:32 -0500
Subject: [Numpy-discussion] GMRES internal variables
In-Reply-To:
References: <60880.189.62.54.78.1234842541.squirrel@mail.math.ucsb.edu>
Message-ID: <91cf711d0902170710l1155f7bbvddbaf052e5504b79@mail.gmail.com>

Nathan,

First of all, thanks for all your work on the sparse linear algebra package; I am starting to use it and it's much appreciated. Just a thought: wouldn't it be more natural to write gmres as a class rather than a function?
That way, accessing the internal work arrays for reuse would be much easier. These are useful when using gmres in an inexact Newton loop.

Cheers,

David

On Tue, Feb 17, 2009 at 2:15 AM, Nathan Bell wrote:
> On Mon, Feb 16, 2009 at 10:49 PM, wrote:
> >
> > I'm trying to run multiple instances of GMRES at the same time (one inside
> > another actually, used inside of the preconditioner routine) but am
> > running into some problems. My guess is there is a single set of internal
> > variables associated with GMRES and when I initiate a new GMRES inside a
> > currently running GMRES, the old GMRES data gets the boot.
> >
> > Is this correct, and if so any suggestions on how to fix this?
> >
>
> This recently came up on SciPy-User:
> http://thread.gmane.org/gmane.comp.python.scientific.user/19197/focus=19206
>
> One solution: PyAMG's GMRES implementation (pyamg.krylov.gmres)
> should be a drop-in replacement for SciPy's gmres(). Unlike the CG
> code mentioned above, you can't use gmres.py without some other
> compiled components (i.e. you'll need the whole pyamg package).
>
> We should have this resolved in the next scipy release (either 0.7.x or
> 0.8).
>
> --
> Nathan Bell wnbell at gmail.com
> http://graphics.cs.uiuc.edu/~wnbell/
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From josef.pktd at gmail.com Tue Feb 17 10:15:19 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Tue, 17 Feb 2009 10:15:19 -0500
Subject: [Numpy-discussion] fancy view question
In-Reply-To: <3d375d730902170709t15acfa51qbfedc5be403d6e95@mail.gmail.com>
References: <9457e7c80902170631n1c0af5d4gea5560ec0315e1ca@mail.gmail.com> <9457e7c80902170642p6b5a6bd8u893e59283d5d066c@mail.gmail.com> <20090217144449.GM17638@phare.normalesup.org> <3d375d730902170709t15acfa51qbfedc5be403d6e95@mail.gmail.com>
Message-ID: <1cd32cbb0902170715m2077ea9pc381273fb8a89cf4@mail.gmail.com>

On Tue, Feb 17, 2009 at 10:09 AM, Robert Kern wrote:
> On Tue, Feb 17, 2009 at 08:44, Gael Varoquaux
> wrote:
>> On Tue, Feb 17, 2009 at 04:42:21PM +0200, Stéfan van der Walt wrote:
>>> Or, more generally:
>>
>>> import numpy as np
>>
>>> def zoom(x, factor=2):
>>>     rows, cols = x.shape
>>>     row_stride, col_stride = x.strides
>>>     view = np.lib.stride_tricks.as_strided(x,
>>>                (rows, factor, cols, factor),
>>>                (row_stride, 0, col_stride, 0))
>>>     return view.reshape((rows*factor, cols*factor))
>>
>> That's handy, you should commit this somewhere. Actually, it would be
>> even cooler if you could have different zoom factors in different
>> directions :).
>
> np.repeat(np.repeat(x, 2, axis=0), 2, axis=1)
>
> stride_tricks are fun, but this is already a solved problem in numpy.
>
> --
> Robert Kern
>

I'm still learning about views:

>>> b = np.repeat(np.repeat(a, 2, axis=0), 2, axis=1)
>>> b.flags
  C_CONTIGUOUS : True
  F_CONTIGUOUS : False
  OWNDATA : True
  WRITEABLE : True
  ALIGNED : True
  UPDATEIFCOPY : False

Does OWNDATA : True mean it made a copy? Or is there another way to check whether it's a view or a copy?
zoom has OWNDATA : False

>>> c = zoom(a, 3,2)
>>> c.flags
  C_CONTIGUOUS : True
  F_CONTIGUOUS : False
  OWNDATA : False
  WRITEABLE : True
  ALIGNED : True
  UPDATEIFCOPY : False

Josef

From pav at iki.fi Tue Feb 17 10:19:05 2009
From: pav at iki.fi (Pauli Virtanen)
Date: Tue, 17 Feb 2009 15:19:05 +0000 (UTC)
Subject: [Numpy-discussion] fancy view question
References: <9457e7c80902170631n1c0af5d4gea5560ec0315e1ca@mail.gmail.com> <9457e7c80902170642p6b5a6bd8u893e59283d5d066c@mail.gmail.com> <20090217144449.GM17638@phare.normalesup.org> <3d375d730902170709t15acfa51qbfedc5be403d6e95@mail.gmail.com> <1cd32cbb0902170715m2077ea9pc381273fb8a89cf4@mail.gmail.com>
Message-ID:

Tue, 17 Feb 2009 10:15:19 -0500, josef.pktd wrote:
[clip]
> I'm still learning about views:
[clip: c = foo(a); c.flags.owndata]
> Does OWNDATA : True mean it made a copy?

Yes. But owndata==False does not mean no copy was made (since the result could be a view to a temporary array).

> Or is there another way to check whether it's a view or a copy?

c.base is a

-- Pauli Virtanen

From robert.kern at gmail.com Tue Feb 17 10:21:49 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 17 Feb 2009 09:21:49 -0600
Subject: [Numpy-discussion] fancy view question
In-Reply-To: <1cd32cbb0902170715m2077ea9pc381273fb8a89cf4@mail.gmail.com>
References: <9457e7c80902170631n1c0af5d4gea5560ec0315e1ca@mail.gmail.com> <9457e7c80902170642p6b5a6bd8u893e59283d5d066c@mail.gmail.com> <20090217144449.GM17638@phare.normalesup.org> <3d375d730902170709t15acfa51qbfedc5be403d6e95@mail.gmail.com> <1cd32cbb0902170715m2077ea9pc381273fb8a89cf4@mail.gmail.com>
Message-ID: <3d375d730902170721l1e0f2dbqba20becee8020a79@mail.gmail.com>

On Tue, Feb 17, 2009 at 09:15, wrote:
> I'm still learning about views:
>
>>>> b = np.repeat(np.repeat(a, 2, axis=0), 2, axis=1)
>>>> b.flags
>  C_CONTIGUOUS : True
>  F_CONTIGUOUS : False
>  OWNDATA : True
>  WRITEABLE : True
>  ALIGNED : True
>  UPDATEIFCOPY : False
>
> Does OWNDATA : True mean it made a copy? Or is there another way to
> check whether it's a view or a copy?
> zoom has OWNDATA : False
>
>>>> c = zoom(a, 3,2)
>>>> c.flags
>  C_CONTIGUOUS : True
>  F_CONTIGUOUS : False
>  OWNDATA : False
>  WRITEABLE : True
>  ALIGNED : True
>  UPDATEIFCOPY : False

zoom() does not return a view of the input. It can't. That kind of view *cannot* be created in the shape/strides memory model. The .reshape() has to make a copy.

In [10]: x = array([[1,2],[3,4]])

In [11]: np.lib.stride_tricks.as_strided(x, (2, 2, 2, 2), (8, 0, 4, 0)).reshape((4, 4))
Out[11]:
array([[1, 1, 2, 2],
       [1, 1, 2, 2],
       [3, 3, 4, 4],
       [3, 3, 4, 4]])

In [12]: y = _

In [13]: x[0,0] = 10

In [14]: y
Out[14]:
array([[1, 1, 2, 2],
       [1, 1, 2, 2],
       [3, 3, 4, 4],
       [3, 3, 4, 4]])

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco
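Alongside c.base, another check worth knowing that the thread does not mention: np.may_share_memory compares the memory bounds of two arrays. It can return True for arrays that do not actually overlap (hence "may"), but a False answer is a definite no. A small sketch:

import numpy as np

a = np.array([[1, 2], [3, 4]])
v = a[:, ::-1]                                     # a view
c = np.repeat(np.repeat(a, 2, axis=0), 2, axis=1)  # a fresh copy

print np.may_share_memory(v, a)   # True
print np.may_share_memory(c, a)   # False
print v.base is a, c.base is a    # True False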
From stefan at sun.ac.za Tue Feb 17 10:21:51 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Tue, 17 Feb 2009 17:21:51 +0200
Subject: [Numpy-discussion] fancy view question
In-Reply-To: <3d375d730902170709t15acfa51qbfedc5be403d6e95@mail.gmail.com>
References: <9457e7c80902170631n1c0af5d4gea5560ec0315e1ca@mail.gmail.com> <9457e7c80902170642p6b5a6bd8u893e59283d5d066c@mail.gmail.com> <20090217144449.GM17638@phare.normalesup.org> <3d375d730902170709t15acfa51qbfedc5be403d6e95@mail.gmail.com>
Message-ID: <9457e7c80902170721k78481415yca8d4f704a24ef8d@mail.gmail.com>

2009/2/17 Robert Kern :
> stride_tricks are fun, but this is already a solved problem in numpy.

To get back to the fun part: I see now the zoomed view is not a view but a new array. How do we get around that?

Cheers
Stéfan

From robert.kern at gmail.com Tue Feb 17 10:25:00 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 17 Feb 2009 09:25:00 -0600
Subject: [Numpy-discussion] fancy view question
In-Reply-To: <9457e7c80902170721k78481415yca8d4f704a24ef8d@mail.gmail.com>
References: <9457e7c80902170631n1c0af5d4gea5560ec0315e1ca@mail.gmail.com> <9457e7c80902170642p6b5a6bd8u893e59283d5d066c@mail.gmail.com> <20090217144449.GM17638@phare.normalesup.org> <3d375d730902170709t15acfa51qbfedc5be403d6e95@mail.gmail.com> <9457e7c80902170721k78481415yca8d4f704a24ef8d@mail.gmail.com>
Message-ID: <3d375d730902170725g72fe2c83r2a8e2eeb7f592b20@mail.gmail.com>

On Tue, Feb 17, 2009 at 09:21, Stéfan van der Walt wrote:
> 2009/2/17 Robert Kern :
>> stride_tricks are fun, but this is already a solved problem in numpy.
>
> To get back to the fun part: I see now the zoomed view is not a view
> but a new array. How do we get around that?

You can't. numpy's memory model simply cannot represent that permutation.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From stefan at sun.ac.za Tue Feb 17 10:30:14 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Tue, 17 Feb 2009 17:30:14 +0200
Subject: [Numpy-discussion] fancy view question
In-Reply-To: <3d375d730902170725g72fe2c83r2a8e2eeb7f592b20@mail.gmail.com>
References: <9457e7c80902170631n1c0af5d4gea5560ec0315e1ca@mail.gmail.com> <9457e7c80902170642p6b5a6bd8u893e59283d5d066c@mail.gmail.com> <20090217144449.GM17638@phare.normalesup.org> <3d375d730902170709t15acfa51qbfedc5be403d6e95@mail.gmail.com> <9457e7c80902170721k78481415yca8d4f704a24ef8d@mail.gmail.com> <3d375d730902170725g72fe2c83r2a8e2eeb7f592b20@mail.gmail.com>
Message-ID: <9457e7c80902170730j458bdfc5k790a258cb0f47b85@mail.gmail.com>

2009/2/17 Robert Kern :
>> To get back to the fun part: I see now the zoomed view is not a view
>> but a new array. How do we get around that?
>
> You can't. numpy's memory model simply cannot represent that permutation.

Right, something like the PIL's pointer-to-pointers format. I should have realised when I had to introduce dummy dimensions to do a reshape that this was the case.

Cheers
Stéfan

From cournape at gmail.com Tue Feb 17 11:08:17 2009
From: cournape at gmail.com (David Cournapeau)
Date: Wed, 18 Feb 2009 01:08:17 +0900
Subject: [Numpy-discussion] denormal test: what is it for ?
Message-ID: <5b8d13220902170808p7848b42eo2e8a47257478b79@mail.gmail.com>

Hi,

I would like to continue cleaning the setup.py from numpy/core, and there is one test that I can't make sense of: the denormal thing (testcode_mathlib function). The svn log has no information on it (its presence goes back to the first revision of the file). What is it useful for? Denormal support? Whether exp works?

cheers,

David

From gael.varoquaux at normalesup.org Tue Feb 17 11:16:10 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Tue, 17 Feb 2009 17:16:10 +0100
Subject: [Numpy-discussion] fancy view question
In-Reply-To: <3d375d730902170709t15acfa51qbfedc5be403d6e95@mail.gmail.com>
References: <9457e7c80902170631n1c0af5d4gea5560ec0315e1ca@mail.gmail.com> <9457e7c80902170642p6b5a6bd8u893e59283d5d066c@mail.gmail.com> <20090217144449.GM17638@phare.normalesup.org> <3d375d730902170709t15acfa51qbfedc5be403d6e95@mail.gmail.com>
Message-ID: <20090217161610.GQ17638@phare.normalesup.org>

On Tue, Feb 17, 2009 at 09:09:38AM -0600, Robert Kern wrote:
> np.repeat(np.repeat(x, 2, axis=0), 2, axis=1)

> stride_tricks are fun, but this is already a solved problem in numpy.

Wow. I still have a lot to learn! How about adding a see-also in as_strided.

Gaël

From robert.kern at gmail.com Tue Feb 17 11:18:11 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 17 Feb 2009 10:18:11 -0600
Subject: [Numpy-discussion] fancy view question
In-Reply-To: <20090217161610.GQ17638@phare.normalesup.org>
References: <9457e7c80902170631n1c0af5d4gea5560ec0315e1ca@mail.gmail.com> <9457e7c80902170642p6b5a6bd8u893e59283d5d066c@mail.gmail.com> <20090217144449.GM17638@phare.normalesup.org> <3d375d730902170709t15acfa51qbfedc5be403d6e95@mail.gmail.com> <20090217161610.GQ17638@phare.normalesup.org>
Message-ID: <3d375d730902170818u7cd5bebaj1a9c51657ce9ca51@mail.gmail.com>

On Tue, Feb 17, 2009 at 10:16, Gael Varoquaux wrote:
> On Tue, Feb 17, 2009 at 09:09:38AM -0600, Robert Kern wrote:
>> np.repeat(np.repeat(x, 2, axis=0), 2, axis=1)
>
>> stride_tricks are fun, but this is already a solved problem in numpy.
>
> Wow. I still have a lot to learn! How about adding a see-also in
> as_strided.

They're not really related.
Yes, I now understand that there were no views involved, contrary to what I first thought, so indeed they are not related.

Gaël

From kbasye1 at jhu.edu Tue Feb 17 10:30:56 2009
From: kbasye1 at jhu.edu (Ken Basye)
Date: Tue, 17 Feb 2009 10:30:56 -0500
Subject: [Numpy-discussion] Compute multiple outer products without a loop?
In-Reply-To:
References:
Message-ID: <499AD830.80600@jhu.edu>

Hi,
My current code looks like this:

(k,d) = m.shape
sq = np.zeros((k, d, d), dtype=float)
for i in xrange(k):
    sq[i] = np.outer(m[i], m[i])

That is, m is treated as a sequence of k vectors of length d; the k d x d outer products are found and stored in sq. Note that the expression np.outer(m, m) will always produce an array with shape (p, p) where p is the product over the shape of m. E.g. if m.shape == (10, 4) then outer(m, m).shape == (40, 40), whereas I want something with shape (10, 4, 4). I suppose all the values I want are indeed somewhere in this 2d array, since it contains every pairwise product, but I'm not sure I want to do that much extra computation just to avoid the loop, plus I'd still have to figure out how to get a view that covered only the results I care about.

Thanks,
Ken

From robert.kern at gmail.com Tue Feb 17 11:36:40 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 17 Feb 2009 10:36:40 -0600
Subject: [Numpy-discussion] efficient function calculation between two matrices
In-Reply-To: <22054148.post@talk.nabble.com>
References: <22054148.post@talk.nabble.com>
Message-ID: <3d375d730902170836tdc9ec90rd301d1f12f1619d@mail.gmail.com>

On Tue, Feb 17, 2009 at 04:09, John [H2O] wrote:
>
> hello,
>
> I am trying to calculate the results of a function between two matrices:
>
>>>> F.shape
> (170, 2)
>>>> T.shape
> (170, 481, 2)
>
> Where F contains lat/lon pairs and T contains 481 lat/lon pairs for 170
> trajectories of length 481. I want a new array of shape (170, 481)
> containing the results of my distance function:
>
> def gcd(x1, y1, x2, y2):
>     """ returns great circle distance """
>     ...
>     return dist
>
> Is there a way to do this without the use of loops?
>
> I'm thinking something along the lines of:
>
> distances = [[gcd(f[0], f[1], t[0], t[1]) for f in F] for t in T]
>
> But obviously this doesn't work...

Change the shape of T to (481, 170, 2). Then it will be broadcast-compatible with the (170, 2) array as long as your gcd() function is written to take advantage of broadcasting.

distances = gcd(F[:,0], F[:,1], T[:,:,0], T[:,:,1])

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From charlesr.harris at gmail.com Tue Feb 17 13:04:56 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Tue, 17 Feb 2009 11:04:56 -0700
Subject: [Numpy-discussion] Compute multiple outer products without a loop?
In-Reply-To: <499AD830.80600@jhu.edu>
References: <499AD830.80600@jhu.edu>
Message-ID:

On Tue, Feb 17, 2009 at 8:30 AM, Ken Basye wrote:
> Hi,
> My current code looks like this:
>
> (k,d) = m.shape
> sq = np.zeros((k, d, d), dtype=float)
> for i in xrange(k):
>     sq[i] = np.outer(m[i], m[i])
>
> That is, m is treated as a sequence of k vectors of length d; the k d x d
> outer products are found and stored in sq.

Try A[:,:,newaxis]*B[:,newaxis,:] .
Example

In [6]: A = array([[1,2],[3,4]])

In [7]: B = array([[1,1],[1,1]])

In [8]: A[:,:,newaxis]*B[:,newaxis,:]
Out[8]:
array([[[1, 1],
        [2, 2]],

       [[3, 3],
        [4, 4]]])

In [9]: B[:,:,newaxis]*A[:,newaxis,:]
Out[9]:
array([[[1, 2],
        [1, 2]],

       [[3, 4],
        [3, 4]]])

You can use this sort of trick along with a sum to multiply stacks of matrices by stacks of vectors or matrices.

Chuck

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From charlesr.harris at gmail.com Tue Feb 17 16:16:50 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Tue, 17 Feb 2009 14:16:50 -0700
Subject: [Numpy-discussion] Warnings in current trunk
Message-ID:

I see a lot of warnings like:

In file included from numpy/core/src/multiarraymodule.c:87:
numpy/core/src/umath_funcs_c99.inc.src:199:1: warning: "isnan" redefined
In file included from /usr/include/python2.5/pyport.h:204,
                 from /usr/include/python2.5/Python.h:57,
                 from numpy/core/src/multiarraymodule.c:18:

These are probably from r6363.

Also

In file included from numpy/core/src/scalartypes.inc.src:10,
                 from numpy/core/src/arrayobject.c:537,
                 from numpy/core/src/multiarraymodule.c:102:
numpy/core/src/numpyos.c: In function 'NumPyOS_ascii_strtod':
numpy/core/src/numpyos.c:431: warning: assignment discards qualifiers from pointer target type
numpy/core/src/numpyos.c:480: warning: assignment discards qualifiers from pointer target type

Chuck

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From beryl.cxy at gmail.com Tue Feb 17 22:59:42 2009
From: beryl.cxy at gmail.com (Xiaoyu Chu)
Date: Tue, 17 Feb 2009 22:59:42 -0500
Subject: [Numpy-discussion] lack of memory?
Message-ID: <42099e890902171959s5e16b00j2f0dbfa059b48be1@mail.gmail.com>

Hello Everyone,

I am trying to compute the eigenvalues of a matrix of size 200*200. What happened is that whenever I run the program, python dies, without showing an error message. The program works fine when the matrix size is small, like in 10s. Does anyone know if it is because of a memory issue or something else? By the way, I am using the method linalg.eigvals(M)

Many thanks.

--
Best Regards,
Xiaoyu

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From mattdm at mattdm.org Tue Feb 17 23:03:18 2009
From: mattdm at mattdm.org (Matthew Miller)
Date: Tue, 17 Feb 2009 23:03:18 -0500
Subject: [Numpy-discussion] lack of memory?
In-Reply-To: <42099e890902171959s5e16b00j2f0dbfa059b48be1@mail.gmail.com>
References: <42099e890902171959s5e16b00j2f0dbfa059b48be1@mail.gmail.com>
Message-ID: <20090218040318.GA17602@jadzia.bu.edu>

On Tue, Feb 17, 2009 at 10:59:42PM -0500, Xiaoyu Chu wrote:
> I am trying to compute the eigenvalues of a matrix of size 200*200.
> What happened is that whenever I run the program, python dies, without
> showing an error message. The program works fine when the matrix size is
> small, like in 10s. Does anyone know if it is because of a memory issue or
> something else? By the way, I am using the method linalg.eigvals(M)

Is this on Linux? If so, try this at a terminal prompt:

dmesg | grep oom

--
Matthew Miller mattdm at mattdm.org

From david at ar.media.kyoto-u.ac.jp Tue Feb 17 22:48:35 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Wed, 18 Feb 2009 12:48:35 +0900
Subject: [Numpy-discussion] lack of memory?
In-Reply-To: <42099e890902171959s5e16b00j2f0dbfa059b48be1@mail.gmail.com> References: <42099e890902171959s5e16b00j2f0dbfa059b48be1@mail.gmail.com> Message-ID: <499B8513.7000404@ar.media.kyoto-u.ac.jp> Xiaoyu Chu wrote: > Hello Everyone, > > I am trying to compute the eigenvalues of a matrix of size > 200*200. What happened is that whenever I run the program, python > dies, without showing an error message. The program works fine when > the matrix size is small, like in 10s. Does anyone know if it is > because of the memory issue or something else? By the way, I am using > the method linalg.eigvals(M) Which platform are you on ? Which versions of numpy and scipy are you using ? I doubt it is a memory issue, cheers, David From david at ar.media.kyoto-u.ac.jp Tue Feb 17 22:56:59 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Wed, 18 Feb 2009 12:56:59 +0900 Subject: [Numpy-discussion] Warnings in current trunk In-Reply-To: References: Message-ID: <499B870B.6040002@ar.media.kyoto-u.ac.jp> Charles R Harris wrote: > I see a lot of warnings like: > > In file included from numpy/core/src/multiarraymodule.c:87: > numpy/core/src/umath_funcs_c99.inc.src:199:1: warning: "isnan" redefined > In file included from /usr/include/python2.5/pyport.h:204, > from /usr/include/python2.5/Python.h:57, > from numpy/core/src/multiarraymodule.c:18: > > These are probably from r6363. > > Also > > In file included from numpy/core/src/scalartypes.inc.src:10, > from numpy/core/src/arrayobject.c:537, > from numpy/core/src/multiarraymodule.c:102: > numpy/core/src/numpyos.c: In function 'NumPyOS_ascii_strtod': > numpy/core/src/numpyos.c:431: warning: assignment discards qualifiers > from pointer target type > numpy/core/src/numpyos.c:480: warning: assignment discards qualifiers > from pointer target type The second one should be easy to fix, the first one not so much. The problem is simple: for the float format fixes recently merged in the trunk, we need some math functions - those format functions are used in multiarray extension. Up to now, we only needed math function in ufunc, and did so by including .c files. I first did the obvious thing, including the same file as well in multiarray, but it does not work so well, as you can see. I think this "including .c" business has got to a point where it is becoming insane :) Not only does it make the code difficult to follow (at least to me), but it also causes various issues on MS platforms, which are difficult to track (VS definitely does not like big files, and often segfaults). Of course, for python API, we don't have a choice because of C limitations (if we care about cross platform that is), but we can make things better. My recent question about a core math library comes exactly from this. Of course, I then discovered that some distutils limitations makes compiling this core math library more difficult than expected (because the config.h generation cannot be done as it is for a library, only for an extension). 
cheers, David From david at ar.media.kyoto-u.ac.jp Tue Feb 17 23:51:19 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Wed, 18 Feb 2009 13:51:19 +0900 Subject: [Numpy-discussion] Warnings in current trunk In-Reply-To: <499B870B.6040002@ar.media.kyoto-u.ac.jp> References: <499B870B.6040002@ar.media.kyoto-u.ac.jp> Message-ID: <499B93C7.9050000@ar.media.kyoto-u.ac.jp> David Cournapeau wrote: > Charles R Harris wrote: > >> I see a lot of warnings like: >> >> In file included from numpy/core/src/multiarraymodule.c:87: >> numpy/core/src/umath_funcs_c99.inc.src:199:1: warning: "isnan" redefined >> In file included from /usr/include/python2.5/pyport.h:204, >> from /usr/include/python2.5/Python.h:57, >> from numpy/core/src/multiarraymodule.c:18: >> >> These are probably from r6363. >> >> Also >> >> In file included from numpy/core/src/scalartypes.inc.src:10, >> from numpy/core/src/arrayobject.c:537, >> from numpy/core/src/multiarraymodule.c:102: >> numpy/core/src/numpyos.c: In function 'NumPyOS_ascii_strtod': >> numpy/core/src/numpyos.c:431: warning: assignment discards qualifiers >> from pointer target type >> numpy/core/src/numpyos.c:480: warning: assignment discards qualifiers >> from pointer target type >> Pauli, can you take a quick look whether the fixes for the above in r6372 make sense ? I only tested on Linux, cheers, David From charlesr.harris at gmail.com Wed Feb 18 00:21:46 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 17 Feb 2009 22:21:46 -0700 Subject: [Numpy-discussion] Warnings in current trunk In-Reply-To: <499B870B.6040002@ar.media.kyoto-u.ac.jp> References: <499B870B.6040002@ar.media.kyoto-u.ac.jp> Message-ID: On Tue, Feb 17, 2009 at 8:56 PM, David Cournapeau < david at ar.media.kyoto-u.ac.jp> wrote: > Charles R Harris wrote: > > I see a lot of warnings like: > > > > In file included from numpy/core/src/multiarraymodule.c:87: > > numpy/core/src/umath_funcs_c99.inc.src:199:1: warning: "isnan" redefined > > In file included from /usr/include/python2.5/pyport.h:204, > > from /usr/include/python2.5/Python.h:57, > > from numpy/core/src/multiarraymodule.c:18: > > > > These are probably from r6363. > > > > Also > > > > In file included from numpy/core/src/scalartypes.inc.src:10, > > from numpy/core/src/arrayobject.c:537, > > from numpy/core/src/multiarraymodule.c:102: > > numpy/core/src/numpyos.c: In function 'NumPyOS_ascii_strtod': > > numpy/core/src/numpyos.c:431: warning: assignment discards qualifiers > > from pointer target type > > numpy/core/src/numpyos.c:480: warning: assignment discards qualifiers > > from pointer target type > > The second one should be easy to fix, the first one not so much. The > problem is simple: for the float format fixes recently merged in the > trunk, we need some math functions - those format functions are used in > multiarray extension. Up to now, we only needed math function in ufunc, > and did so by including .c files. I first did the obvious thing, > including the same file as well in multiarray, but it does not work so > well, as you can see. > > I think this "including .c" business has got to a point where it is > becoming insane :) Not only does it make the code difficult to follow > (at least to me), but it also causes various issues on MS platforms, > which are difficult to track (VS definitely does not like big files, and > often segfaults). Of course, for python API, we don't have a choice > because of C limitations (if we care about cross platform that is), but > we can make things better. 
> The easy fix is to compile separately and link but that will expose a lot of extraneous symbols in the extension module. Not that most would notice ;) I believe there are linker specific commands that can be used to control that, but we have to support a lot of platforms and no doubt life becomes messy. It's not something I know much about in any case. I think we should also have some honest header files, the automatically generated ones bug me. I think your quick fix for the second problem is a bit bogus, it essentially discards the const qualifier on purpose instead of by accident. I tried pretty much the same quick fix and the problem was just pushed elsewhere. I think the function needs a bit of redesign... Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Wed Feb 18 00:23:48 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 17 Feb 2009 22:23:48 -0700 Subject: [Numpy-discussion] lack of memory? In-Reply-To: <42099e890902171959s5e16b00j2f0dbfa059b48be1@mail.gmail.com> References: <42099e890902171959s5e16b00j2f0dbfa059b48be1@mail.gmail.com> Message-ID: On Tue, Feb 17, 2009 at 8:59 PM, Xiaoyu Chu wrote: > Hello Everyone, > > I am trying to compute the eigenvalues of a matrix of size 200*200. > What happened is that whenever I run the program, python dies, without > showing an error message. The program works fine when the matrix size is > small, like in 10s. Does anyone know if it is because of the memory issue or > something else? By the way, I am using the method linalg.eigvals(M) > Smells like Atlas/Lapack to me, 200*200 isn't that big. What platform are you on? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From david at ar.media.kyoto-u.ac.jp Wed Feb 18 00:22:17 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Wed, 18 Feb 2009 14:22:17 +0900 Subject: [Numpy-discussion] Warnings in current trunk In-Reply-To: References: <499B870B.6040002@ar.media.kyoto-u.ac.jp> Message-ID: <499B9B09.6080800@ar.media.kyoto-u.ac.jp> Charles R Harris wrote: > > > On Tue, Feb 17, 2009 at 8:56 PM, David Cournapeau > > > wrote: > > Charles R Harris wrote: > > I see a lot of warnings like: > > > > In file included from numpy/core/src/multiarraymodule.c:87: > > numpy/core/src/umath_funcs_c99.inc.src:199:1: warning: "isnan" > redefined > > In file included from /usr/include/python2.5/pyport.h:204, > > from /usr/include/python2.5/Python.h:57, > > from numpy/core/src/multiarraymodule.c:18: > > > > These are probably from r6363. > > > > Also > > > > In file included from numpy/core/src/scalartypes.inc.src:10, > > from numpy/core/src/arrayobject.c:537, > > from numpy/core/src/multiarraymodule.c:102: > > numpy/core/src/numpyos.c: In function 'NumPyOS_ascii_strtod': > > numpy/core/src/numpyos.c:431: warning: assignment discards > qualifiers > > from pointer target type > > numpy/core/src/numpyos.c:480: warning: assignment discards > qualifiers > > from pointer target type > > The second one should be easy to fix, the first one not so much. The > problem is simple: for the float format fixes recently merged in the > trunk, we need some math functions - those format functions are > used in > multiarray extension. Up to now, we only needed math function in > ufunc, > and did so by including .c files. I first did the obvious thing, > including the same file as well in multiarray, but it does not work so > well, as you can see. 
> > I think this "including .c" business has got to a point where it is > becoming insane :) Not only does it make the code difficult to follow > (at least to me), but it also causes various issues on MS platforms, > which are difficult to track (VS definitely does not like big > files, and > often segfaults). Of course, for python API, we don't have a choice > because of C limitations (if we care about cross platform that > is), but > we can make things better. > > > The easy fix is to compile separately and link but that will expose a > lot of extraneous symbols in the extension module. Is may be easy in principle, but it is not so practically, because of various build issues (the separate library has to be built first, but the current code to generate config.h assumes to be built through an extension). But I agree, that's the only portable solution, and it has other advantages as well (being able to reuse our portable math functions in other extensions, for example). The extraneous symbols are OK if we decorate them (npy_log, etc...) and I think it is actually helpful to have name to make the difference between the libc and our function. In particular, if later we can provide several differently optimized functions. > Not that most would notice ;) I believe there are linker specific > commands that can be used to control that, but we have to support a > lot of platforms and no doubt life becomes messy. Yes, it would be messy: I don't know any platform which does not have this capability (Gnu linker, MS linker and Solaris linker all have this), but to implement it would be a lot of effort for relatively little gain. > I think your quick fix for the second problem is a bit bogus, it > essentially discards the const qualifier on purpose instead of by > accident. Yes, but that's the only solution. FWIW, I loooked a bit at how the glibc does things, and it is pretty similar. You can't avoid casts everywhere - and the cast I added are correct I believe. > I tried pretty much the same quick fix and the problem was just pushed > elsewhere. Where ? I compiled with -W -Wall -Wextra, and there was no other warning (in that part of the code I mean), at least on 32 bits. But before checking more thoroughly, I want to finish cleaning numpy/core/setup.py David From charlesr.harris at gmail.com Wed Feb 18 01:14:45 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 17 Feb 2009 23:14:45 -0700 Subject: [Numpy-discussion] Warnings in current trunk In-Reply-To: <499B9B09.6080800@ar.media.kyoto-u.ac.jp> References: <499B870B.6040002@ar.media.kyoto-u.ac.jp> <499B9B09.6080800@ar.media.kyoto-u.ac.jp> Message-ID: On Tue, Feb 17, 2009 at 10:22 PM, David Cournapeau < david at ar.media.kyoto-u.ac.jp> wrote: > Charles R Harris wrote: > > > > > > On Tue, Feb 17, 2009 at 8:56 PM, David Cournapeau > > > > > wrote: > > > > Charles R Harris wrote: > > > I see a lot of warnings like: > > > > > > In file included from numpy/core/src/multiarraymodule.c:87: > > > numpy/core/src/umath_funcs_c99.inc.src:199:1: warning: "isnan" > > redefined > > > In file included from /usr/include/python2.5/pyport.h:204, > > > from /usr/include/python2.5/Python.h:57, > > > from numpy/core/src/multiarraymodule.c:18: > > > > > > These are probably from r6363. 
> > > > > > Also > > > > > > In file included from numpy/core/src/scalartypes.inc.src:10, > > > from numpy/core/src/arrayobject.c:537, > > > from numpy/core/src/multiarraymodule.c:102: > > > numpy/core/src/numpyos.c: In function 'NumPyOS_ascii_strtod': > > > numpy/core/src/numpyos.c:431: warning: assignment discards > > qualifiers > > > from pointer target type > > > numpy/core/src/numpyos.c:480: warning: assignment discards > > qualifiers > > > from pointer target type > > > > The second one should be easy to fix, the first one not so much. The > > problem is simple: for the float format fixes recently merged in the > > trunk, we need some math functions - those format functions are > > used in > > multiarray extension. Up to now, we only needed math function in > > ufunc, > > and did so by including .c files. I first did the obvious thing, > > including the same file as well in multiarray, but it does not work > so > > well, as you can see. > > > > I think this "including .c" business has got to a point where it is > > becoming insane :) Not only does it make the code difficult to follow > > (at least to me), but it also causes various issues on MS platforms, > > which are difficult to track (VS definitely does not like big > > files, and > > often segfaults). Of course, for python API, we don't have a choice > > because of C limitations (if we care about cross platform that > > is), but > > we can make things better. > > > > > > The easy fix is to compile separately and link but that will expose a > > lot of extraneous symbols in the extension module. > > Is may be easy in principle, but it is not so practically, because of > various build issues (the separate library has to be built first, but > the current code to generate config.h assumes to be built through an > extension). But I agree, that's the only portable solution, and it has > other advantages as well (being able to reuse our portable math > functions in other extensions, for example). The extraneous symbols are > OK if we decorate them (npy_log, etc...) and I think it is actually > helpful to have name to make the difference between the libc and our > function. In particular, if later we can provide several differently > optimized functions. > I was thinking of just linking together the separately compiled object modules. But we would normal headers to include going that route. I never liked the automatically generated headers, it kind of obviates the whole idea of defining the interface separately from the function code. > > > Not that most would notice ;) I believe there are linker specific > > commands that can be used to control that, but we have to support a > > lot of platforms and no doubt life becomes messy. > > Yes, it would be messy: I don't know any platform which does not have > this capability (Gnu linker, MS linker and Solaris linker all have > this), but to implement it would be a lot of effort for relatively > little gain. > > > I think your quick fix for the second problem is a bit bogus, it > > essentially discards the const qualifier on purpose instead of by > > accident. > > Yes, but that's the only solution. FWIW, I loooked a bit at how the > glibc does things, and it is pretty similar. You can't avoid casts > everywhere - and the cast I added are correct I believe. > > > I tried pretty much the same quick fix and the problem was just pushed > > elsewhere. > > Where ? I compiled with -W -Wall -Wextra, and there was no other warning > (in that part of the code I mean), at least on 32 bits. 
But before > checking more thoroughly, I want to finish cleaning numpy/core/setup.py > I just used changed the declaration to const char *p and there was a problem with the call to PyOS_ascii_strtod. That actually seems to be a problem with function signature (also in strtod), so you are probably right that an explicit cast is the only way to do it. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Wed Feb 18 01:21:35 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 17 Feb 2009 23:21:35 -0700 Subject: [Numpy-discussion] Warnings in current trunk In-Reply-To: References: <499B870B.6040002@ar.media.kyoto-u.ac.jp> <499B9B09.6080800@ar.media.kyoto-u.ac.jp> Message-ID: On Tue, Feb 17, 2009 at 11:14 PM, Charles R Harris < charlesr.harris at gmail.com> wrote: > > > On Tue, Feb 17, 2009 at 10:22 PM, David Cournapeau < > david at ar.media.kyoto-u.ac.jp> wrote: > >> Charles R Harris wrote: >> > >> > >> > On Tue, Feb 17, 2009 at 8:56 PM, David Cournapeau >> > > >> > wrote: >> > >> > Charles R Harris wrote: >> > > I see a lot of warnings like: >> > > >> > > In file included from numpy/core/src/multiarraymodule.c:87: >> > > numpy/core/src/umath_funcs_c99.inc.src:199:1: warning: "isnan" >> > redefined >> > > In file included from /usr/include/python2.5/pyport.h:204, >> > > from /usr/include/python2.5/Python.h:57, >> > > from numpy/core/src/multiarraymodule.c:18: >> > > >> > > These are probably from r6363. >> > > >> > > Also >> > > >> > > In file included from numpy/core/src/scalartypes.inc.src:10, >> > > from numpy/core/src/arrayobject.c:537, >> > > from numpy/core/src/multiarraymodule.c:102: >> > > numpy/core/src/numpyos.c: In function 'NumPyOS_ascii_strtod': >> > > numpy/core/src/numpyos.c:431: warning: assignment discards >> > qualifiers >> > > from pointer target type >> > > numpy/core/src/numpyos.c:480: warning: assignment discards >> > qualifiers >> > > from pointer target type >> > >> > The second one should be easy to fix, the first one not so much. The >> > problem is simple: for the float format fixes recently merged in the >> > trunk, we need some math functions - those format functions are >> > used in >> > multiarray extension. Up to now, we only needed math function in >> > ufunc, >> > and did so by including .c files. I first did the obvious thing, >> > including the same file as well in multiarray, but it does not work >> so >> > well, as you can see. >> > >> > I think this "including .c" business has got to a point where it is >> > becoming insane :) Not only does it make the code difficult to >> follow >> > (at least to me), but it also causes various issues on MS platforms, >> > which are difficult to track (VS definitely does not like big >> > files, and >> > often segfaults). Of course, for python API, we don't have a choice >> > because of C limitations (if we care about cross platform that >> > is), but >> > we can make things better. >> > >> > >> > The easy fix is to compile separately and link but that will expose a >> > lot of extraneous symbols in the extension module. >> >> Is may be easy in principle, but it is not so practically, because of >> various build issues (the separate library has to be built first, but >> the current code to generate config.h assumes to be built through an >> extension). But I agree, that's the only portable solution, and it has >> other advantages as well (being able to reuse our portable math >> functions in other extensions, for example). 
The extraneous symbols are >> OK if we decorate them (npy_log, etc...) and I think it is actually >> helpful to have name to make the difference between the libc and our >> function. In particular, if later we can provide several differently >> optimized functions. >> > > I was thinking of just linking together the separately compiled object > modules. But we would normal headers to include going that route. I never > liked the automatically generated headers, it kind of obviates the whole > idea of defining the interface separately from the function code. > > >> >> > Not that most would notice ;) I believe there are linker specific >> > commands that can be used to control that, but we have to support a >> > lot of platforms and no doubt life becomes messy. >> >> Yes, it would be messy: I don't know any platform which does not have >> this capability (Gnu linker, MS linker and Solaris linker all have >> this), but to implement it would be a lot of effort for relatively >> little gain. >> >> > I think your quick fix for the second problem is a bit bogus, it >> > essentially discards the const qualifier on purpose instead of by >> > accident. >> >> Yes, but that's the only solution. FWIW, I loooked a bit at how the >> glibc does things, and it is pretty similar. You can't avoid casts >> everywhere - and the cast I added are correct I believe. >> >> > I tried pretty much the same quick fix and the problem was just pushed >> > elsewhere. >> >> Where ? I compiled with -W -Wall -Wextra, and there was no other warning >> (in that part of the code I mean), at least on 32 bits. But before >> checking more thoroughly, I want to finish cleaning numpy/core/setup.py >> > > I just used changed the declaration to const char *p and there was a > problem with the call to PyOS_ascii_strtod. That actually seems to be a > problem with function signature (also in strtod), so you are probably right > that an explicit cast is the only way to do it. > Oh, and this should be avoided: if (endptr != NULL) *endptr = (char*)p; Folks have different views about whether the single statement should be in brackets but no one recommends putting it on the same line as the if. It's too easy to overlook and habit draws the eye to the following line. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From david at ar.media.kyoto-u.ac.jp Wed Feb 18 01:37:17 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Wed, 18 Feb 2009 15:37:17 +0900 Subject: [Numpy-discussion] Warnings in current trunk In-Reply-To: References: <499B870B.6040002@ar.media.kyoto-u.ac.jp> <499B9B09.6080800@ar.media.kyoto-u.ac.jp> Message-ID: <499BAC9D.3010502@ar.media.kyoto-u.ac.jp> Charles R Harris wrote: > > Oh, and this should be avoided: > > if (endptr != NULL) *endptr = (char*)p; > > Folks have different views about whether the single statement should > be in brackets but no one recommends putting it on the same line as > the if. It's too easy to overlook and habit draws the eye to the > following line. That's not my code, so I did not want to change those, but I agree with you here (I prefer explicit brackets personally, but that's not mandated in the python code-style IIRC). There are a lot of those in numpy code, so if one want to change them, one should change them for good. 
cheers,

David

From Nicolas.Rougier at loria.fr Wed Feb 18 02:23:44 2009
From: Nicolas.Rougier at loria.fr (Nicolas Rougier)
Date: Wed, 18 Feb 2009 08:23:44 +0100
Subject: [Numpy-discussion] Tensordot and memory consumption
Message-ID: <1234941824.6759.3.camel@sulfur.loria.fr>

Hello,

I'm using tensordot in some computation and while I've been amazed by the speed, I'm now trying to reduce memory consumption in some very particular cases:

Let S be a 2 dims array of size (s1,s2)
Let D be a 2 dims array of size (d1,d2)
Let W be a 4 dims array of size (d1,d2,s1,s2)

Currently, I'm computing D as tensordot(W,S,2) and it works really fine and fast.

However, in some cases, W is based solely on a single 2 dims array K and each of the W[i,j] is a slice of K. I would like to know if there is a way to build W such that the memory footprint is reduced? Or maybe there is another way to perform the same computation? For example, I know that when S and D are of the same shape, I can use 2d convolution, but I would need the general case.

Since I imagine my explanations are not so clear, I include a small example.

Nicolas

---

import numpy

n = 3

# Source
S = numpy.random.random((n,n))

# Destination (at this stage, only D shape matters)
D = numpy.zeros((2*n,2*n))

# Kernel
K = numpy.zeros((S.shape[0]*n+1,S.shape[1]*n+1))
K[S.shape[0],S.shape[1]] = 1

# Kernel decomposition for computing tensordot
W = numpy.zeros(D.shape + S.shape)
for i in range(W.shape[0]):
    for j in range(W.shape[1]):
        x = int(i/float(W.shape[0])*float(S.shape[0]))
        y = int(j/float(W.shape[1])*float(S.shape[1]))
        W[i,j] = K[n-x:n-x+S.shape[0],n-y:n-y+S.shape[1]]

D = numpy.tensordot(W,S,2)

print S
print
print D

From schut at sarvision.nl Wed Feb 18 03:51:47 2009
From: schut at sarvision.nl (Vincent Schut)
Date: Wed, 18 Feb 2009 09:51:47 +0100
Subject: [Numpy-discussion] fancy view question
In-Reply-To: <20090217162006.GR17638@phare.normalesup.org>
References: <9457e7c80902170631n1c0af5d4gea5560ec0315e1ca@mail.gmail.com> <9457e7c80902170642p6b5a6bd8u893e59283d5d066c@mail.gmail.com> <20090217144449.GM17638@phare.normalesup.org> <3d375d730902170709t15acfa51qbfedc5be403d6e95@mail.gmail.com> <20090217161610.GQ17638@phare.normalesup.org> <3d375d730902170818u7cd5bebaj1a9c51657ce9ca51@mail.gmail.com> <20090217162006.GR17638@phare.normalesup.org>
Message-ID:

Gael Varoquaux wrote:
> On Tue, Feb 17, 2009 at 10:18:11AM -0600, Robert Kern wrote:
>> On Tue, Feb 17, 2009 at 10:16, Gael Varoquaux wrote:
>>> On Tue, Feb 17, 2009 at 09:09:38AM -0600, Robert Kern wrote:
>>>> np.repeat(np.repeat(x, 2, axis=0), 2, axis=1)
>
>>>> stride_tricks are fun, but this is already a solved problem in numpy.
>
>>> Wow. I still have a lot to learn! How about adding a see-also in as_strided.
>
>> They're not really related.
>
> Yes, I now understand that there were no views involved, contrary to what I first thought, so indeed they are not related.
>
> Gaël

Wow, did I start something! :-) Thanks to all that shared their knowledge and ideas. I learned a lot besides getting answers to my questions.

Vincent.
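One memory-light alternative to materializing the 4-D W in Nicolas's example - a sketch only, and it assumes scipy's signal module, which the thread itself does not discuss: since every W[i,j] is a shifted slice of a single kernel K, each D[i,j] is one sample of the cross-correlation of K with S, so the full "valid" correlation can be computed once and then indexed on a grid.

    import numpy as np
    from scipy.signal import correlate2d  # assumption: scipy is available

    n = 3
    S = np.random.random((n, n))
    K = np.zeros((S.shape[0]*n + 1, S.shape[1]*n + 1))
    K[S.shape[0], S.shape[1]] = 1
    d0, d1 = 2*n, 2*n  # destination shape

    # corr[a, b] == (K[a:a+n, b:b+n] * S).sum() for every valid offset (a, b),
    # which is exactly the contribution each slice W[i, j] makes in tensordot.
    corr = correlate2d(K, S, mode='valid')

    # The offsets used when W was built explicitly: x = int(i/float(d0)*n), etc.
    x = np.arange(d0) * S.shape[0] // d0
    y = np.arange(d1) * S.shape[1] // d1
    D = corr[np.ix_(n - x, n - y)]  # same result as numpy.tensordot(W, S, 2) here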
From olivier.grisel at ensta.org Wed Feb 18 04:06:26 2009
From: olivier.grisel at ensta.org (Olivier Grisel)
Date: Wed, 18 Feb 2009 10:06:26 +0100
Subject: [Numpy-discussion] inplace dot products
Message-ID:

Hi numpist people,

I discovered ufuncs and their ability to compute the results on preallocated arrays:

>>> a = arange(10, dtype=float32)
>>> b = arange(10, dtype=float32) + 1
>>> c = add(a, b, a)
>>> c is a
True
>>> a
array([  1.,   3.,   5.,   7.,   9.,  11.,  13.,  15.,  17.,  19.], dtype=float32)

My question is: is there a way to have an equivalent for the dot product operation? I want atlas to build my dot products without allocating a temporary array, reusing a preallocated array of results. Suppose I have:

>>> results = empty((10, 3), dtype=float32)
>>> W = arange(6, dtype=float32).reshape((2, 3))
>>> x = arange(20, dtype=float32).reshape((10, 2))

What I want is the equivalent of the following without the intermediate call to malloc:

>>> results[:] = dot(x, W)

Any idea? I tried to introspect the various docstrings of the numpy core modules but I could not get any lead.

--
Olivier
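As a hedged side note: numpy had no such hook at the time, but later releases added an out argument to numpy.dot, which covers exactly this use (the availability of out= is an assumption about newer numpy, not part of the original discussion):

    import numpy as np

    W = np.arange(6, dtype=np.float32).reshape((2, 3))
    x = np.arange(20, dtype=np.float32).reshape((10, 2))
    results = np.empty((10, 3), dtype=np.float32)

    # Writes the product directly into the preallocated array; out must
    # have the matching shape and dtype.
    np.dot(x, W, out=results)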
From ndbecker2 at gmail.com Wed Feb 18 07:27:38 2009
From: ndbecker2 at gmail.com (Neal Becker)
Date: Wed, 18 Feb 2009 12:27:38 +0000 (UTC)
Subject: [Numpy-discussion] views and object lifetime
Message-ID:

How is it ensured, at the C api level, that when I have an array A, and a view of it B, that the data is not destroyed until both A and B are?

From matthieu.brucher at gmail.com Wed Feb 18 07:51:03 2009
From: matthieu.brucher at gmail.com (Matthieu Brucher)
Date: Wed, 18 Feb 2009 13:51:03 +0100
Subject: [Numpy-discussion] views and object lifetime
In-Reply-To:
References:
Message-ID:

B has a reference to A.

Matthieu

2009/2/18 Neal Becker <ndbecker2 at gmail.com>:
> How is it ensured, at the C api level, that when I have an array A, and a view of it B, that the data is not destroyed until both A and B are?

--
Information System Engineer, Ph.D.
Website: http://matthieu-brucher.developpez.com/
Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92
LinkedIn: http://www.linkedin.com/in/matthieubrucher

From ndbecker2 at gmail.com Wed Feb 18 08:02:54 2009
From: ndbecker2 at gmail.com (Neal Becker)
Date: Wed, 18 Feb 2009 13:02:54 +0000 (UTC)
Subject: [Numpy-discussion] views and object lifetime
References:
Message-ID:

Matthieu Brucher <matthieu.brucher at gmail.com> writes:
> B has a reference to A.

Could you be more specific? Where is this reference stored? What C api functions are used?

From matthieu.brucher at gmail.com Wed Feb 18 08:09:26 2009
From: matthieu.brucher at gmail.com (Matthieu Brucher)
Date: Wed, 18 Feb 2009 14:09:26 +0100
Subject: [Numpy-discussion] views and object lifetime
In-Reply-To:
References:
Message-ID:

2009/2/18 Neal Becker <ndbecker2 at gmail.com>:
> Could you be more specific? Where is this reference stored? What C api functions are used?

I don't remember, and I don't have the Numpy book here. But if B is a view on A, a flag indicates that B does not own the data, and another field is a pointer to A.

Matthieu

--
Information System Engineer, Ph.D.
Website: http://matthieu-brucher.developpez.com/
Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92
LinkedIn: http://www.linkedin.com/in/matthieubrucher

From gael.varoquaux at normalesup.org Wed Feb 18 08:15:38 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Wed, 18 Feb 2009 14:15:38 +0100
Subject: [Numpy-discussion] views and object lifetime
In-Reply-To:
References:
Message-ID: <20090218131538.GB5600@phare.normalesup.org>

On Wed, Feb 18, 2009 at 01:02:54PM +0000, Neal Becker wrote:
>> B has a reference to A.
> Could you be more specific? Where is this reference stored?

In [1]: import numpy as np
In [2]: a = np.empty(10)
In [3]: b = a[::2]
In [4]: b.base is a
Out[4]: True

Gaël

From scott.sinclair.za at gmail.com Wed Feb 18 09:54:10 2009
From: scott.sinclair.za at gmail.com (Scott Sinclair)
Date: Wed, 18 Feb 2009 16:54:10 +0200
Subject: [Numpy-discussion] views and object lifetime
In-Reply-To:
References:
Message-ID: <6a17e9ee0902180654v4c0d985bsc65b6679c2572e47@mail.gmail.com>

> 2009/2/18 Neal Becker:
> Could you be more specific? Where is this reference stored? What C api functions are used?

I'm probably not qualified to be much more specific; these links should provide the necessary detail:

http://docs.scipy.org/doc/numpy/reference/c-api.html#numpy-c-api
http://docs.python.org/c-api/intro.html#objects-types-and-reference-counts
http://docs.python.org/extending/newtypes.html

The Python interpreter takes care of when to free the memory associated with an object once its reference count reaches zero. The object reference counts are increased and decreased directly in C code using the Py_INCREF and Py_DECREF macros whenever a new object is created or a new pointer is assigned to an existing object.

Cheers,
Scott

From oliphant at enthought.com Wed Feb 18 10:40:50 2009
From: oliphant at enthought.com (Travis E. Oliphant)
Date: Wed, 18 Feb 2009 09:40:50 -0600
Subject: [Numpy-discussion] views and object lifetime
In-Reply-To:
References:
Message-ID: <499C2C02.3020306@enthought.com>

Neal Becker wrote:
> How is it ensured, at the C api level, that when I have an array A, and a view of it B, that the data is not destroyed until both A and B are?

One array, A, owns the data and will deallocate it only when its reference count goes to 0. The view, B, has a reference to A (stored in the base attribute) and has OWNDATA set to false, so that its deallocator simply decreases the reference count on the array, A, that actually owns the data.

In the code, look at the `array_dealloc` function in arrayobject.c and the base and OWNDATA flag-bit in the array structure for details.

-Travis

From kbasye1 at jhu.edu Wed Feb 18 10:40:44 2009
From: kbasye1 at jhu.edu (Ken Basye)
Date: Wed, 18 Feb 2009 10:40:44 -0500
Subject: [Numpy-discussion] Compute multiple outer products, without a loop?
In-Reply-To:
References:
Message-ID: <499C2BFC.2030409@jhu.edu>

Thanks Chuck; that's perfect.
Ken

> ----------------------------------------------------------------------
>
> Message: 1
> Date: Tue, 17 Feb 2009 11:04:56 -0700
> From: Charles R Harris
> Subject: Re: [Numpy-discussion] Compute multiple outer products without a loop?
> To: Discussion of Numerical Python
> Message-ID:
> Content-Type: text/plain; charset="iso-8859-1"
>
> On Tue, Feb 17, 2009 at 8:30 AM, Ken Basye wrote:
>
>> Hi,
>> My current code looks like this:
>>
>>     (k,d) = m.shape
>>     sq = np.zeros((k, d, d), dtype=float)
>>     for i in xrange(k):
>>         sq[i] = np.outer(m[i], m[i])
>>
>> That is, m is treated as a sequence of k vectors of length d; the k dXd outer products are found and stored in sq.
>
> Try A[:,:,newaxis]*B[:,newaxis,:] . Example
>
> In [6]: A = array([[1,2],[3,4]])
>
> In [7]: B = array([[1,1],[1,1]])
>
> In [8]: A[:,:,newaxis]*B[:,newaxis,:]
> Out[8]:
> array([[[1, 1],
>         [2, 2]],
>
>        [[3, 3],
>         [4, 4]]])
>
> In [9]: B[:,:,newaxis]*A[:,newaxis,:]
> Out[9]:
> array([[[1, 2],
>         [1, 2]],
>
>        [[3, 4],
>         [3, 4]]])
>
> You can use this sort of trick along with a sum to multiply stacks of matrices by stacks of vectors or matrices.
>
> Chuck
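A related aside, assuming a numpy newer than this thread (einsum appeared later, in numpy 1.6): the same batch of outer products can be written as a single einsum call, which also avoids the Python loop.

    import numpy as np

    k, d = 4, 3
    m = np.random.rand(k, d)

    # sq[i] == np.outer(m[i], m[i]) for every i
    sq = np.einsum('ki,kj->kij', m, m)

    # agrees with the broadcasting trick from the reply above
    assert np.allclose(sq, m[:, :, np.newaxis] * m[:, np.newaxis, :])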
From charlesr.harris at gmail.com Wed Feb 18 11:45:57 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Wed, 18 Feb 2009 09:45:57 -0700
Subject: [Numpy-discussion] Warnings in current trunk
In-Reply-To: <499BAC9D.3010502@ar.media.kyoto-u.ac.jp>
References: <499B870B.6040002@ar.media.kyoto-u.ac.jp> <499B9B09.6080800@ar.media.kyoto-u.ac.jp> <499BAC9D.3010502@ar.media.kyoto-u.ac.jp>
Message-ID:

On Tue, Feb 17, 2009 at 11:37 PM, David Cournapeau <david at ar.media.kyoto-u.ac.jp> wrote:
> Charles R Harris wrote:
>> Oh, and this should be avoided:
>>
>>     if (endptr != NULL) *endptr = (char*)p;
>>
>> Folks have different views about whether the single statement should be in brackets, but no one recommends putting it on the same line as the if. It's too easy to overlook, and habit draws the eye to the following line.
>
> That's not my code, so I did not want to change those, but I agree with you here (I prefer explicit brackets personally, but that's not mandated in the python code-style IIRC). There are a lot of those in numpy code, so if one wants to change them, one should change them for good.

Yep. I'm trying to fix all of them this week, which is why I complained. Think of mopping the floor and the kid walks in with muddy shoes...

Chuck

From william at resolversystems.com Wed Feb 18 12:03:05 2009
From: william at resolversystems.com (William Reade)
Date: Wed, 18 Feb 2009 17:03:05 +0000
Subject: [Numpy-discussion] Ironclad v0.8.1 released
Message-ID: <499C3F49.2060403@resolversystems.com>

Hi all

I'm fairly pleased to announce the release of Ironclad v0.8.1; it's not an enormous technical leap above v0.8, but it does now enable you to import and use SciPy and Matplotlib with IronPython on Win32 (with some restrictions; see project page). Downloads, and more details, are available at http://code.google.com/p/ironclad -- please do play with it, and let me know if you have any problems.

Cheers
William

From davidh at ipac.caltech.edu Wed Feb 18 12:37:54 2009
From: davidh at ipac.caltech.edu (David Henderson)
Date: Wed, 18 Feb 2009 09:37:54 -0800
Subject: [Numpy-discussion] additional dtype argument to numpy.dot() (Re: Numpy-discussion Digest, Vol 29, Issue 48)
In-Reply-To:
References:
Message-ID:

Hi Pauli, list:

Thanks for the reply.

numpy.sum() does indeed have a dtype argument for the accumulator of the sum. numpy.sum() does not implement an inner (dot) product, just a straight summation. The feature I'm requesting is to add a similar accumulator type argument for numpy.dot().

After tracing some of the code, there is a generic dot product algorithm for the various types. If a BLAS library is available, the BLAS routines supplant the generic ones, but only for the routines that BLAS provides. BLAS doesn't support long double, so only a subset of the different calls for different types go to BLAS.

The place where this feature comes into play is iterative refinement of a linear algebra solution. To implement iterative refinement, an error term is calculated as a matrix dot product with a higher precision than the elements of the matrix. The floating point hardware on the i386/PPC/Sparc is ready to go for this application; it's just that BLAS doesn't implement it. The numpy.dot() function is missing the feature to supply the type of the accumulator for the sum.

I'm looking for a way to prototype some routines to perform iterative refinement, and python/numpy looks like an ideal platform. numpy has _almost_ the feature set needed to implement this. Adding the accumulator type argument to numpy.dot() is a needed feature.

code snippet exhibiting float128 functionality that already exists:
--------------------------
Python 2.6.1 (r261:67515, Feb 12 2009, 16:56:09)
[GCC 4.0.1 (Apple Computer, Inc. build 5367)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> import numpy
>>> z=numpy.longdouble((1,2,3))
>>> z
array([1.0, 2.0, 3.0], dtype=float128)
>>> y=numpy.double((4,5,6))
>>> y
array([ 4.,  5.,  6.])
>>> numpy.dot(y,y)
77.0
>>> numpy.dot(y,z)
32.0
>>> numpy.dot(z,z)
14.0
---------------------------

I'm prepared to make the changes to numpy. It looks like they would entail modifying the source file "multiarraymodule.c". The existing code is:

/*NUMPY_API
    Numeric.innerproduct(a,v)
*/
static PyObject *
PyArray_InnerProduct(PyObject *op1, PyObject *op2)
{

If I make changes, can I get them inserted into the numpy source base (after appropriate testing)?

Thanks in advance,
David Henderson

On Feb 14, 2009, at 8:04 AM, numpy-discussion-request at scipy.org wrote:
> Fri, 13 Feb 2009 12:04:12 -0800, David Henderson wrote:
>> I'd like to accumulate the summation in extended precision, "double" sum for float inputs, "long double" sum for "double" inputs.
>>
>> A way of doing this is to add an optional third argument to dot - to specify the summation type.
>
> `dot` does matrix-matrix, matrix-vector, or vector-vector products. These are usually implemented via calling the BLAS linear algebra library, and AFAIK BLAS only has routines for type-homogeneous arguments. So I doubt this can be implemented for `dot` in any way.
>
> On the other hand, `numpy.sum` already has a `dtype` argument that specifies the accumulator data type. Maybe you can use that?
>
> --
> Pauli Virtanen

From cournape at gmail.com Wed Feb 18 12:49:37 2009
From: cournape at gmail.com (David Cournapeau)
Date: Thu, 19 Feb 2009 02:49:37 +0900
Subject: [Numpy-discussion] additional dtype argument to numpy.dot() (Re: Numpy-discussion Digest, Vol 29, Issue 48)
In-Reply-To:
References:
Message-ID: <5b8d13220902180949t7ff88890q3cb0eb1f1ee811ba@mail.gmail.com>

On Thu, Feb 19, 2009 at 2:37 AM, David Henderson wrote:
> Hi Pauli, list:
>
> Thanks for the reply.
>
> numpy.sum() does indeed have a dtype argument for the accumulator of the sum. numpy.sum() does not implement an inner (dot) product, just a straight summation.

You may be interested in xblas, which is a new BLAS specification which explicitly handles mixed precision and increased accumulator precision:

http://www.netlib.org/xblas/

For now, it is completely unsupported in numpy, but this may be a good thing to add if you are interested in extra precision,

David
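Lacking a dtype argument on dot, the iterative-refinement use case David Henderson describes can be approximated with plain numpy by up-casting, at the cost of temporary copies - a rough sketch of one refinement step, under the assumption that numpy.longdouble is wider than double on the platform:

    import numpy as np

    A = np.random.rand(5, 5)
    b = np.random.rand(5)
    x = np.linalg.solve(A, b)

    # Compute the residual r = b - A x with the dot product accumulated in
    # long double (numpy falls back to its generic loop here, since BLAS
    # has no long double routines), then solve for the correction in double.
    r = b.astype(np.longdouble) - np.dot(A.astype(np.longdouble),
                                         x.astype(np.longdouble))
    x = x + np.linalg.solve(A, r.astype(np.float64))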
From pinto at mit.edu Wed Feb 18 13:36:34 2009
From: pinto at mit.edu (Nicolas Pinto)
Date: Wed, 18 Feb 2009 13:36:34 -0500
Subject: [Numpy-discussion] rgb_to_hsv in scipy.misc ? (was: Optimizing speed for large-array inter-element algorithms (specifically, color space conversion))
Message-ID: <954ae5aa0902181036i192ad020kd152e0629974ed6c@mail.gmail.com>

Hello,

Would it be possible to include the following rgb to hsv conversion code in scipy (probably in misc along with misc.imread, etc.)? What do you think?

Thanks in advance.

Best regards,

--
Nicolas Pinto
Ph.D. Candidate, Brain & Computer Sciences
Massachusetts Institute of Technology, USA
http://web.mit.edu/pinto

# ------------------------------------------------------------------------------
import numpy as np

def rgb_to_hsv_arr(arr):
    """ fast rgb_to_hsv using numpy array """

    # adapted from Arnar Flatberg
    # http://www.mail-archive.com/numpy-discussion at scipy.org/msg06147.html
    # it now handles NaN properly and mimics colorsys.rgb_to_hsv output

    arr = arr/255.
    out = np.empty_like(arr)

    arr_max = arr.max(-1)
    delta = arr.ptp(-1)
    s = delta / arr_max
    s[delta==0] = 0

    # red is max
    idx = (arr[:,:,0] == arr_max)
    out[idx, 0] = (arr[idx, 1] - arr[idx, 2]) / delta[idx]

    # green is max
    idx = (arr[:,:,1] == arr_max)
    out[idx, 0] = 2. + (arr[idx, 2] - arr[idx, 0]) / delta[idx]

    # blue is max
    idx = (arr[:,:,2] == arr_max)
    out[idx, 0] = 4. + (arr[idx, 0] - arr[idx, 1]) / delta[idx]

    out[:,:,0] = (out[:,:,0]/6.0) % 1.0
    out[:,:,1] = s
    out[:,:,2] = arr_max

    # rescale back to [0, 255]
    out *= 255.

    # remove NaN
    out[np.isnan(out)] = 0

    return out

From ondrej at certik.cz Wed Feb 18 19:14:27 2009
From: ondrej at certik.cz (Ondrej Certik)
Date: Wed, 18 Feb 2009 16:14:27 -0800
Subject: [Numpy-discussion] parallel compilation of numpy
Message-ID: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com>

Hi,

I have a shiny new computer with 8 cores and numpy still takes forever to compile --- is there a way to compile it in parallel (make -j9)?

Do distutils allow that? If not, let's move to some build system that allows that? Just wanted to check if there is some reason for that, apart from patches welcome. :)

Ondrej

From robert.kern at gmail.com Wed Feb 18 19:33:13 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 18 Feb 2009 18:33:13 -0600
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com>
Message-ID: <3d375d730902181633h6e0ecbf5l1afa5918c59525ac@mail.gmail.com>

On Wed, Feb 18, 2009 at 18:14, Ondrej Certik wrote:
> Hi,
>
> I have a shiny new computer with 8 cores and numpy still takes forever to compile --- is there a way to compile it in parallel (make -j9)?
>
> Do distutils allow that?

No. numscons will, though.

> If not, let's move to some build system that allows that? Just wanted to check if there is some reason for that, apart from patches welcome. :)

I'm not opposed in principle to numscons becoming the default build system for numpy.
But I'm also entirely uninterested in spending my personal time on it, for example. I'm not sure if David thinks it's ready to fully replace the default build system yet; he could probably use your help. So yes, "patches welcome." :-)

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From sturla at molden.no Wed Feb 18 20:50:01 2009
From: sturla at molden.no (Sturla Molden)
Date: Thu, 19 Feb 2009 02:50:01 +0100 (CET)
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com>
Message-ID: <7dba8391205f88f0f9de32f7f3eb902d.squirrel@webmail.uio.no>

> I have a shiny new computer with 8 cores and numpy still takes forever to compile

Yes, forever/8 = forever.

Sturla Molden

From mattdm at mattdm.org Wed Feb 18 20:55:56 2009
From: mattdm at mattdm.org (Matthew Miller)
Date: Wed, 18 Feb 2009 20:55:56 -0500
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <7dba8391205f88f0f9de32f7f3eb902d.squirrel@webmail.uio.no>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com> <7dba8391205f88f0f9de32f7f3eb902d.squirrel@webmail.uio.no>
Message-ID: <20090219015556.GA32331@jadzia.bu.edu>

On Thu, Feb 19, 2009 at 02:50:01AM +0100, Sturla Molden wrote:
>> I have a shiny new computer with 8 cores and numpy still takes forever to compile
> Yes, forever/8 = forever.

Good point. nan_to_num() could be helpful here.

--
Matthew Miller           mattdm at mattdm.org

From cournape at gmail.com Wed Feb 18 20:57:27 2009
From: cournape at gmail.com (David Cournapeau)
Date: Thu, 19 Feb 2009 10:57:27 +0900
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com>
Message-ID: <5b8d13220902181757x4a58ce79h7d597e432d5bf22a@mail.gmail.com>

On Thu, Feb 19, 2009 at 9:14 AM, Ondrej Certik wrote:
> Hi,
>
> I have a shiny new computer with 8 cores and numpy still takes forever to compile

Forever? It takes one minute to build :) scipy takes forever, but that is because of C++ more than anything else.

> --- is there a way to compile it in parallel (make -j9)?
>
> Do distutils allow that?

No, and it never will. Parallel builds require building with dependency handling. Even make does not handle it well: it works most of the time by accident, but there are numerous problems (try for example building lapack with make -j8 on your 8 core machine - it will give a bogus library 90% of the time, because it starts building the static library with ar while some object files are still being built). scons does handle it well.

Now, I don't think it will make building numpy that much faster. Here are the numbers:

- numscons build, no parallel job: 1m08
- distutils build: 1m17
- numscons build, 4 jobs (on a two core machine): 54s

Now, those numbers are on Mac OS X, and this is the worst platform to try it on, because gcc already uses two cores even when using one command line (I cannot confirm this, but looking at the activity monitor, gcc always uses both cores when building universal apps, which makes sense).
There are more fundamental reasons, though:

- for numpy, a lot of time is spent in configuration: configuration cannot be done with multiple jobs
- numscons has one fundamental limitation: it launches a new scons subprocess for every subpackage, so you can't parallelize different subpackages at the same time (say numpy.core and numpy.random). This limitation is very hard to circumvent IMHO, and is the main limitation of numscons ATM:
    - it means no-op builds are very slow (20s for scipy, for example), because I have to relaunch scons for every subpackage, and scons is relatively slow to start (and I have already spent quite some time optimizing this: 30 subpackages in scipy, with scons taking only 0.2s to start, means already 6 seconds to do nothing)
    - it means parallel builds are not as good as they could be
    - it means python setup.py sdist is very hard to support (I have not yet found a way - it is broken ATM if you remove the distutils scripts)
    - it means numscons wastes its time doing the same checks (lapack, etc...) many times

If I (or you :) ) solve this, I would be in favor of using numscons. Yesterday, I wasted half a day with distutils to do something which takes 2 minutes in numscons. Going away from this would be a big relief, at least as far as I am concerned. But it is very hard - I think it would be at least one week of full work (because it is a scons limitation).

cheers,

David

From cournape at gmail.com Wed Feb 18 21:00:49 2009
From: cournape at gmail.com (David Cournapeau)
Date: Thu, 19 Feb 2009 11:00:49 +0900
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <7dba8391205f88f0f9de32f7f3eb902d.squirrel@webmail.uio.no>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com> <7dba8391205f88f0f9de32f7f3eb902d.squirrel@webmail.uio.no>
Message-ID: <5b8d13220902181800s44edb0b7w645df34172dc16ed@mail.gmail.com>

On Thu, Feb 19, 2009 at 10:50 AM, Sturla Molden wrote:
>> I have a shiny new computer with 8 cores and numpy still takes forever to compile
>
> Yes, forever/8 = forever.

Not if you are a physician: my impression in undergrad was that infinity / 8 could be anything from 0 to infinity in physics :)

More seriously, there is no chance that make -j8 will be 8 times faster than make. 2 to 3 times, yes (IO, etc...).

David

From rmay31 at gmail.com Wed Feb 18 21:07:45 2009
From: rmay31 at gmail.com (Ryan May)
Date: Wed, 18 Feb 2009 20:07:45 -0600
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <5b8d13220902181800s44edb0b7w645df34172dc16ed@mail.gmail.com>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com> <7dba8391205f88f0f9de32f7f3eb902d.squirrel@webmail.uio.no> <5b8d13220902181800s44edb0b7w645df34172dc16ed@mail.gmail.com>
Message-ID:

On Wed, Feb 18, 2009 at 8:00 PM, David Cournapeau wrote:
> Not if you are a physician: my impression in undergrad was that infinity / 8 could be anything from 0 to infinity in physics :)

Not to nitpick, but this is the second time I've seen this lately:

physician == medical doctor != physicist :)

Ryan

--
Ryan May
Graduate Research Assistant
School of Meteorology
University of Oklahoma
From cournape at gmail.com Wed Feb 18 21:19:50 2009
From: cournape at gmail.com (David Cournapeau)
Date: Thu, 19 Feb 2009 11:19:50 +0900
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To:
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com> <7dba8391205f88f0f9de32f7f3eb902d.squirrel@webmail.uio.no> <5b8d13220902181800s44edb0b7w645df34172dc16ed@mail.gmail.com>
Message-ID: <5b8d13220902181819n3cf19a33h97f3d21a5dd4503e@mail.gmail.com>

On Thu, Feb 19, 2009 at 11:07 AM, Ryan May wrote:
> Not to nitpick, but this is the second time I've seen this lately:
>
> physician == medical doctor != physicist :)

You're right of course - the French word for physicist being "physicien", it may be one more mistake perpetuated by the French :)

David

From lists at cheimes.de Wed Feb 18 21:47:09 2009
From: lists at cheimes.de (Christian Heimes)
Date: Thu, 19 Feb 2009 03:47:09 +0100
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <5b8d13220902181757x4a58ce79h7d597e432d5bf22a@mail.gmail.com>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com> <5b8d13220902181757x4a58ce79h7d597e432d5bf22a@mail.gmail.com>
Message-ID:

David Cournapeau wrote:
> No, and it never will. Parallel builds require building with dependency handling. Even make does not handle it well: it works most of the time by accident, but there are numerous problems (try for example building lapack with make -j8 on your 8 core machine - it will give a bogus library 90% of the time, because it starts building the static library with ar while some object files are still being built).

You may call me naive and ignorant. Is it really that hard to achieve some kind of poor man's concurrency? You don't have to parallelize everything to get a speed-up on multi-core machines. Usually the compile step from a C/C++ file to an object file takes up most of the time.

How about

* assemble a list of all C/C++ source files of all extensions.
* compile all source files in parallel
* do the rest (linking etc.) in serial

This should give a nice speed-up without much work and without complex dependency analysis. Do you see a possible pitfall? I don't.

Christian

From rmay31 at gmail.com Wed Feb 18 21:48:21 2009
From: rmay31 at gmail.com (Ryan May)
Date: Wed, 18 Feb 2009 20:48:21 -0600
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <5b8d13220902181819n3cf19a33h97f3d21a5dd4503e@mail.gmail.com>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com> <7dba8391205f88f0f9de32f7f3eb902d.squirrel@webmail.uio.no> <5b8d13220902181800s44edb0b7w645df34172dc16ed@mail.gmail.com> <5b8d13220902181819n3cf19a33h97f3d21a5dd4503e@mail.gmail.com>
Message-ID:

On Wed, Feb 18, 2009 at 8:19 PM, David Cournapeau wrote:
> You're right of course - the French word for physicist being "physicien", it may be one more mistake perpetuated by the French :)

:) Well, not nearly as bad as Jerry Lewis. :)

Ryan

--
Ryan May
Graduate Research Assistant
School of Meteorology
University of Oklahoma
Sent from: Norman Oklahoma United States.
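Christian's scheme from a couple of messages up can be sketched outside distutils in a few lines; everything below (file names, compiler flags, the use of the multiprocessing module rather than 2009-era pyprocessing) is invented for illustration, and it deliberately ignores the dependency-handling problems David raises.

    import subprocess
    from multiprocessing import Pool

    sources = ["foo.c", "bar.c", "baz.c"]  # hypothetical extension sources

    def compile_one(src):
        obj = src[:-2] + ".o"
        subprocess.check_call(["gcc", "-fPIC", "-c", src, "-o", obj])
        return obj

    if __name__ == "__main__":
        pool = Pool()  # parallel compile step
        objects = pool.map(compile_one, sources)
        pool.close()
        pool.join()
        # serial link step
        subprocess.check_call(["gcc", "-shared", "-o", "ext.so"] + objects)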
From chaos.proton at gmail.com Wed Feb 18 22:23:58 2009
From: chaos.proton at gmail.com (Grissiom)
Date: Thu, 19 Feb 2009 11:23:58 +0800
Subject: [Numpy-discussion] linalg.norm along axis?
Message-ID:

Hi all,

Is there any possibility to calculate the norm along an axis? For example:

a = np.array((
    (3,4),
    (6,8)))

And I want to get:

array([5.0, 10.0])

I currently use a for loop to achieve this. Is there any more elegant way to do this?

--
Cheers,
Grissiom

From david at ar.media.kyoto-u.ac.jp Wed Feb 18 22:08:11 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Thu, 19 Feb 2009 12:08:11 +0900
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To:
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com> <5b8d13220902181757x4a58ce79h7d597e432d5bf22a@mail.gmail.com>
Message-ID: <499CCD1B.60701@ar.media.kyoto-u.ac.jp>

Christian Heimes wrote:
> David Cournapeau wrote:
>> No, and it never will. Parallel builds require building with dependency handling. Even make does not handle it well: it works most of the time by accident, but there are numerous problems (try for example building lapack with make -j8 on your 8 core machine - it will give a bogus library 90% of the time, because it starts building the static library with ar while some object files are still being built).
>
> You may call me naive and ignorant. Is it really that hard to achieve some kind of poor man's concurrency? You don't have to parallelize everything to get a speed-up on multi-core machines. Usually the compile step from a C/C++ file to an object file takes up most of the time.
>
> How about
>
> * assemble a list of all C/C++ source files of all extensions.
> * compile all source files in parallel
> * do the rest (linking etc.) in serial
>
> This should give a nice speed-up without much work and without complex dependency analysis. Do you see a possible pitfall? I don't.

That's more or less how make works - it does not work very well IMHO. And doing the above correctly in distutils may be harder than it seems: both scons and waf had numerous problems with calling subtasks because of race conditions in subprocess, for example (both have their own module for that).

More fundamentally though, I have no interest in working on distutils. Not working on a DAG is fundamentally and hopelessly broken for a build tool, and this is unfixable in distutils. Everything is wrong, from the concepts to the UI through the implementation, to paraphrase a famous saying. There is nothing to save IMHO. Of course, someone else can work on it. I prefer working on a sane solution myself,

cheers,

David

From pinto at mit.edu Wed Feb 18 23:12:09 2009
From: pinto at mit.edu (Nicolas Pinto)
Date: Wed, 18 Feb 2009 23:12:09 -0500
Subject: [Numpy-discussion] linalg.norm along axis?
In-Reply-To:
References:
Message-ID: <954ae5aa0902182012s17014ab7w7d07c65551ca5ef3@mail.gmail.com>

Grissiom,

Using the following doesn't require any loop:

In [9]: sqrt((a**2.).sum(1))
Out[9]: array([  5.,  10.])

Best,

On Wed, Feb 18, 2009 at 10:23 PM, Grissiom wrote:
> Hi all,
>
> Is there any possibility to calculate the norm along an axis? For example:
>
> a = np.array((
>     (3,4),
>     (6,8)))
>
> And I want to get:
> array([5.0, 10.0])
>
> I currently use a for loop to achieve this. Is there any more elegant way to do this?
>
> --
> Cheers,
> Grissiom

--
Nicolas Pinto
Ph.D. Candidate, Brain & Computer Sciences
Massachusetts Institute of Technology, USA
http://web.mit.edu/pinto
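A forward-looking note, assuming a numpy much newer than this thread: numpy.linalg.norm eventually grew an axis keyword (in numpy 1.8), which builds the loop-free spelling in.

    import numpy as np

    a = np.array([[3., 4.],
                  [6., 8.]])

    # Equivalent to np.sqrt((a**2).sum(1)) for the default 2-norm.
    print(np.linalg.norm(a, axis=1))  # -> [  5.  10.]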
From cournape at gmail.com Thu Feb 19 00:25:21 2009
From: cournape at gmail.com (David Cournapeau)
Date: Thu, 19 Feb 2009 14:25:21 +0900
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com>
Message-ID: <5b8d13220902182125u51f98727t40ff4fc32593a1a0@mail.gmail.com>

On Thu, Feb 19, 2009 at 9:14 AM, Ondrej Certik wrote:
> Hi,
>
> I have a shiny new computer with 8 cores and numpy still takes forever to compile --- is there a way to compile it in parallel (make -j9)?
>
> Do distutils allow that? If not, let's move to some build system that allows that?

Since numscons was mentioned several times recently, and I have not answered every request, I wrote a more complete post about it - even with some advice for people motivated to use cmake :)

http://cournape.wordpress.com/2009/02/19/numscons-current-state-future-alternative-build-tools-for-numpy/

cheers,

David

From michael.abshoff at googlemail.com Thu Feb 19 00:26:59 2009
From: michael.abshoff at googlemail.com (Michael Abshoff)
Date: Wed, 18 Feb 2009 21:26:59 -0800
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <499CCD1B.60701@ar.media.kyoto-u.ac.jp>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com> <5b8d13220902181757x4a58ce79h7d597e432d5bf22a@mail.gmail.com> <499CCD1B.60701@ar.media.kyoto-u.ac.jp>
Message-ID: <499CEDA3.9020005@gmail.com>

David Cournapeau wrote:
> Christian Heimes wrote:

Hi,

>> You may call me naive and ignorant. Is it really that hard to achieve some kind of poor man's concurrency? You don't have to parallelize everything to get a speed-up on multi-core machines. Usually the compile step from a C/C++ file to an object file takes up most of the time.
>>
>> How about
>>
>> * assemble a list of all C/C++ source files of all extensions.
>> * compile all source files in parallel
>> * do the rest (linking etc.) in serial

With Sage we do the cythonization in parallel and for now build extensions serially, but we have code to do that in parallel, too. Given that we are building 180 extensions or so, the speedup is linear. I often do this using 24 cores, so it seems robust, since I work on Sage daily and often test builds from scratch, and I never had any problems with that code.

We use pyprocessing to launch the jobs and the changes to distutils are surprisingly small, but the original version of the patch broke the build of numpy/scipy; I do believe the author already has a fix for that, too - he is just busy finishing his PhD thesis next month and will then be back to work on Sage.
Everything is wrong, from the > concepts to the UI through the implementation, to paraphrase a famous > saying. There is nothing to save IMHO. Of course, someone else can work > on it. I prefer working on a sane solution myself, > > cheers, > > David To taunt Ondrej: A one minute build isn't forever - numpy is tiny and I understand why it might seem long compared to SymPy, but just wait until you add Cython extensions per default and those build times will go up substantially ;). Cheers, Michael > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From chaos.proton at gmail.com Thu Feb 19 00:31:09 2009 From: chaos.proton at gmail.com (Grissiom) Date: Thu, 19 Feb 2009 13:31:09 +0800 Subject: [Numpy-discussion] linalg.norm along axis? In-Reply-To: <954ae5aa0902182012s17014ab7w7d07c65551ca5ef3@mail.gmail.com> References: <954ae5aa0902182012s17014ab7w7d07c65551ca5ef3@mail.gmail.com> Message-ID: On Thu, Feb 19, 2009 at 12:12, Nicolas Pinto wrote: > Grissiom, > > Using the following doesn't require any loop: > > In [9]: sqrt((a**2.).sum(1)) > Out[9]: array([ 5., 10.]) > > Best, > > Got it~ Thanks really ;) -- Cheers, Grissiom -------------- next part -------------- An HTML attachment was scrubbed... URL: From ellisonbg.net at gmail.com Thu Feb 19 00:45:14 2009 From: ellisonbg.net at gmail.com (Brian Granger) Date: Wed, 18 Feb 2009 21:45:14 -0800 Subject: [Numpy-discussion] parallel compilation of numpy In-Reply-To: <5b8d13220902182125u51f98727t40ff4fc32593a1a0@mail.gmail.com> References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com> <5b8d13220902182125u51f98727t40ff4fc32593a1a0@mail.gmail.com> Message-ID: <6ce0ac130902182145k6d5697f4u912c423ce3dc4f87@mail.gmail.com> David, Thank you very much for writing this summary up. It collects a lot of useful information about the subtle and difficult issues related to building multi-language python packages. The reason that Ondrej and I are asking all of these questions is that we are currently exploring build systems for a couple of projects we are working on (together). In general, my sense is that distutils won't be sufficient for our build needs. Thus, we are looking at Scons, cmake, numscons, waf, etc. Honestly, compared to plain distutils, *any* of these other solutions seem very nice, even that each of them has its own downsides. The big challenge as you mention is that distutils does do certain things that plain Scons, cmake or waf doesn't: * Install Python package in a way that is consistent with common Python practices (site-packages, etc.) * Integration with setuptools and eggs, which enables things like namespace packages. * Windows installers, source tarballs. * Uploads to pypi. I wonder if it wouldn't be too difficult to simply implement these capabilities in raw cmake/Scons/waf and do away with distutils all together. But, I don't really know enough about distutils or cmake/scons/waf to know if this is reasonable to dream about. Cheers, Brian On Wed, Feb 18, 2009 at 9:25 PM, David Cournapeau wrote: > On Thu, Feb 19, 2009 at 9:14 AM, Ondrej Certik wrote: >> Hi, >> >> I have a shiny new computer with 8 cores and numpy still takes forever >> to compile --- is there a way to compile it in parallel (make -j9)? >> >> Do distutils allow that? If not, let's move to some build system that >> allows that? 
> Since numscons was mentioned several times recently, and I have not answered every request, I wrote a more complete post about it - even with some advice for people motivated to use cmake :)
>
> http://cournape.wordpress.com/2009/02/19/numscons-current-state-future-alternative-build-tools-for-numpy/
>
> cheers,
>
> David

From david at ar.media.kyoto-u.ac.jp Thu Feb 19 00:28:35 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Thu, 19 Feb 2009 14:28:35 +0900
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <499CEDA3.9020005@gmail.com>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com> <5b8d13220902181757x4a58ce79h7d597e432d5bf22a@mail.gmail.com> <499CCD1B.60701@ar.media.kyoto-u.ac.jp> <499CEDA3.9020005@gmail.com>
Message-ID: <499CEE03.7080507@ar.media.kyoto-u.ac.jp>

Michael Abshoff wrote:
> David Cournapeau wrote:
>> Christian Heimes wrote:

Hi,

>>> You may call me naive and ignorant. Is it really that hard to achieve some kind of poor man's concurrency? You don't have to parallelize everything to get a speed-up on multi-core machines. Usually the compile step from a C/C++ file to an object file takes up most of the time.
>>>
>>> How about
>>>
>>> * assemble a list of all C/C++ source files of all extensions.
>>> * compile all source files in parallel
>>> * do the rest (linking etc.) in serial
>
> With Sage we do the cythonization in parallel and for now build extensions serially, but we have code to do that in parallel, too. Given that we are building 180 extensions or so, the speedup is linear. I often do this using 24 cores, so it seems robust, since I work on Sage daily and often test builds from scratch, and I never had any problems with that code.

Note that building from scratch is the easy case, especially in the case of parallel builds. Also, I would guess "cythonizing" is easy, at least if it is done entirely in python. Race conditions in subprocess are a real problem; they caused numerous issues in scons and waf, so I would be really surprised if they did not cause any trouble in distutils. Particularly, on windows, subprocess up to python 2.4 was problematic, I believe (I should really check, because I was not involved in the related discussions nor with the fixes in scons).

> To taunt Ondrej: A one minute build isn't forever - numpy is tiny and I understand why it might seem long compared to SymPy, but just wait until you add Cython extensions by default and those build times will go up substantially

Building the scipy installer on windows takes 1 hour, which is already relatively significant. But really, parallel builds are just a nice consequence of using a sane build tool. I simply cannot stand distutils anymore; it now feels even more painful than developing on windows. Every time you touch something, something else, totally unrelated breaks.
cheers,

David

From michael.abshoff at googlemail.com Thu Feb 19 01:05:45 2009
From: michael.abshoff at googlemail.com (Michael Abshoff)
Date: Wed, 18 Feb 2009 22:05:45 -0800
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <499CEE03.7080507@ar.media.kyoto-u.ac.jp>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com> <5b8d13220902181757x4a58ce79h7d597e432d5bf22a@mail.gmail.com> <499CCD1B.60701@ar.media.kyoto-u.ac.jp> <499CEDA3.9020005@gmail.com> <499CEE03.7080507@ar.media.kyoto-u.ac.jp>
Message-ID: <499CF6B9.8060300@gmail.com>

David Cournapeau wrote:
> Michael Abshoff wrote:

Hi David,

>> With Sage we do the cythonization in parallel and for now build extensions serially, but we have code to do that in parallel, too. Given that we are building 180 extensions or so, the speedup is linear. I often do this using 24 cores, so it seems robust, since I work on Sage daily and often test builds from scratch, and I never had any problems with that code.
>
> Note that building from scratch is the easy case, especially in the case of parallel builds.

Sure, it also works for incremental builds and I do that many, many times a day, i.e. for each patch I merge into the Sage library. What gets recompiled is decided by our own dependency tracking code, which we want to push into Cython itself. Figuring out dependencies on the fly without caching takes about 1s for the whole Sage library, which includes parsing every Cython file.

Note that we build each extension in parallel, so if you depend on a lot of fortran or c code to be linked into one extension this obviously doesn't help much. The situation with Sage's extensions is that we build external libraries ahead of time, and 99% of extensions do not have additional C/C++ files; those that do usually have one to three extra files, so for our purposes this scales linearly.

> Also, I would guess "cythonizing" is easy, at least if it is done entirely in python. Race conditions in subprocess are a real problem; they caused numerous issues in scons and waf, so I would be really surprised if they did not cause any trouble in distutils. Particularly, on windows, subprocess up to python 2.4 was problematic, I believe (I should really check, because I was not involved in the related discussions nor with the fixes in scons).

We used to use threads for the "parallel stuff" and it is indeed racy, but that was mostly observed when running doctests, since we only had one current directory. All those problems went away once we started to use Pyprocessing, and while there is some overhead for the forks it is drowned out by the build time when using 2 cores.

>> To taunt Ondrej: A one minute build isn't forever - numpy is tiny and I understand why it might seem long compared to SymPy, but just wait until you add Cython extensions by default and those build times will go up substantially
>
> Building the scipy installer on windows takes 1 hour, which is already relatively significant.

Ouch. Is that without the dependencies, i.e. ATLAS?

I was curious how you build the various versions of ATLAS, i.e. no SSE, SSE, SSE2, etc. Do you just set the arch via -A and build them all on the same box? [sorry for getting slightly OT here :)]

> But really, parallel builds are just a nice consequence of using a sane build tool. I simply cannot stand distutils anymore; it now feels even more painful than developing on windows.
> Every time you touch something, something else, totally unrelated breaks.

Yeah, distutils are a pain, but numpy extending/modifying them doesn't make it any cleaner :(. I am looking forward to the day NumScons is part of Numpy though.

> cheers,
>
> David

Cheers,

Michael

From david at ar.media.kyoto-u.ac.jp Thu Feb 19 00:47:48 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Thu, 19 Feb 2009 14:47:48 +0900
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <6ce0ac130902182145k6d5697f4u912c423ce3dc4f87@mail.gmail.com>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com> <5b8d13220902182125u51f98727t40ff4fc32593a1a0@mail.gmail.com> <6ce0ac130902182145k6d5697f4u912c423ce3dc4f87@mail.gmail.com>
Message-ID: <499CF284.3060508@ar.media.kyoto-u.ac.jp>

Brian Granger wrote:
> The reason that Ondrej and I are asking all of these questions is that we are currently exploring build systems for a couple of projects we are working on (together). In general, my sense is that distutils won't be sufficient for our build needs. Thus, we are looking at Scons, cmake, numscons, waf, etc. Honestly, compared to plain distutils, *any* of these other solutions seems very nice, even if each of them has its own downsides.

I agree that for purely build issues, anything is better than distutils. Heck, I would take autotools + m4 over distutils any day. To be fair to distutils, distutils was never conceived nor intended to do complex C handling.

> * Install Python packages in a way that is consistent with common Python practices (site-packages, etc.)

This is doable

> * Integration with setuptools and eggs, which enables things like namespace packages.

This is not. eggs are not specified, and totally implementation defined. I tried some time ago to add an egg builder to scons, but I gave up. And I don't think you can reuse the setuptools code, as everything is coupled.

> * Uploads to pypi.

Pypi has an xmlrpc-based API, so this should be doable relatively easily.

> I wonder if it wouldn't be too difficult to simply implement these capabilities in raw cmake/Scons/waf and do away with distutils altogether.

waf already has a relatively advanced python extension builder, and can build tarballs. The problem with scons is the tight coupling of the code: any significant change cannot be done ATM without deep knowledge of the whole codebase, be it for options/tarball handling or speed issues. I don't know enough about cmake.

cheers,

David

From david at ar.media.kyoto-u.ac.jp Thu Feb 19 01:04:13 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Thu, 19 Feb 2009 15:04:13 +0900
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <499CF6B9.8060300@gmail.com>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com> <5b8d13220902181757x4a58ce79h7d597e432d5bf22a@mail.gmail.com> <499CCD1B.60701@ar.media.kyoto-u.ac.jp> <499CEDA3.9020005@gmail.com> <499CEE03.7080507@ar.media.kyoto-u.ac.jp> <499CF6B9.8060300@gmail.com>
Message-ID: <499CF65D.3000309@ar.media.kyoto-u.ac.jp>

Michael Abshoff wrote:
> Sure, it also works for incremental builds and I do that many, many times a day, i.e. for each patch I merge into the Sage library.
> What gets recompiled is decided by our own dependency tracking code, which we want to push into Cython itself. Figuring out dependencies on the fly without caching takes about 1s for the whole Sage library, which includes parsing every Cython file.

Hm, I think I would have to look at what sage is internally to really understand the implications. But surely, if you can figure out in one second the whole dependency graph for scipy, I would be more than impressed: you would beat waf and make at their own game.

> We used to use threads for the "parallel stuff" and it is indeed racy, but that was mostly observed when running doctests, since we only had one current directory. All those problems went away once we started to use Pyprocessing, and while there is some overhead for the forks it is drowned out by the build time when using 2 cores.

Does pyprocessing work well on windows as well? I have 0 experience with it.

> Ouch. Is that without the dependencies, i.e. ATLAS?

Yes - but I need to build scipy three times, once for each ATLAS (if I could use numscons, it would be much better, since a library change is handled as a dependency in scons; with distutils, the only safe way is to rebuild from scratch for every configuration).

> I was curious how you build the various versions of ATLAS, i.e. no SSE, SSE, SSE2, etc. Do you just set the arch via -A and build them all on the same box? [sorry for getting slightly OT here :)]

This does not work: ATLAS will still use SSE if your CPU supports it, even if you force an arch without SSE. I tried two different things: first, using a patched qemu with options to emulate a P4 without SSE, with SSE2 and with SSE3, but this does not work so well (the generated versions are too slow, and handling virtual machines in qemu is a bit of a pain). Now, I just build on different machines, and hope I won't need to rebuild them too often.

cheers,

David

From strawman at astraw.com Thu Feb 19 01:24:04 2009
From: strawman at astraw.com (Andrew Straw)
Date: Wed, 18 Feb 2009 22:24:04 -0800
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <499CF284.3060508@ar.media.kyoto-u.ac.jp>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com> <5b8d13220902182125u51f98727t40ff4fc32593a1a0@mail.gmail.com> <6ce0ac130902182145k6d5697f4u912c423ce3dc4f87@mail.gmail.com> <499CF284.3060508@ar.media.kyoto-u.ac.jp>
Message-ID: <499CFB04.7020305@astraw.com>

David Cournapeau wrote:
>> * Integration with setuptools and eggs, which enables things like namespace packages.
>
> This is not. eggs are not specified, and totally implementation defined. I tried some time ago to add an egg builder to scons, but I gave up. And I don't think you can reuse the setuptools code, as everything is coupled.

It's an interesting idea to build Python package distributions without distutils. For pure Python installables, if all you seek to better is distutils, the bar seems fairly low. For compiled stuff, it still doesn't seem too bad. Of course, it is easy to say this without having tried...

So, what do you mean by an "egg", in the context of it being hard to produce? An .egg zip file, an .egg directory, and/or a normal distutils package with an .egg-info/ sibling? Since .egg-info/ is now part of distutils, this should now be specified... or?

[This, though, does point out a conceptual problem with setuptools for me -- it does a zillion things, some of which I like a lot (e.g.
installing console scripts and gui scripts, a simple plugin architecture) and others I don't care about as long as they don't break things for me (e.g. installing multiple versions of packages side-by-side, a problem which is much more sanely solved by setting PYTHONPATH, or its sophisticated cousin, virtualenv). And all of these things go by one name "setuptools" and sometimes even "eggs", even though people often use those words to discuss totally different features. Hence my question above.]

-Andrew

From cournape at gmail.com Thu Feb 19 01:34:24 2009
From: cournape at gmail.com (David Cournapeau)
Date: Thu, 19 Feb 2009 15:34:24 +0900
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <499CFB04.7020305@astraw.com>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com> <5b8d13220902182125u51f98727t40ff4fc32593a1a0@mail.gmail.com> <6ce0ac130902182145k6d5697f4u912c423ce3dc4f87@mail.gmail.com> <499CF284.3060508@ar.media.kyoto-u.ac.jp> <499CFB04.7020305@astraw.com>
Message-ID: <5b8d13220902182234v319ebba2r22c078948a9cc800@mail.gmail.com>

On Thu, Feb 19, 2009 at 3:24 PM, Andrew Straw wrote:
> David Cournapeau wrote:
>>> * Integration with setuptools and eggs, which enables things like namespace packages.
>>
>> This is not. eggs are not specified, and totally implementation defined. I tried some time ago to add an egg builder to scons, but I gave up. And I don't think you can reuse the setuptools code, as everything is coupled.
>
> It's an interesting idea to build Python package distributions without distutils. For pure Python installables, if all you seek to better is distutils, the bar seems fairly low.

:) Being better than distutils is not difficult, indeed - that is, if you don't care about backward compatibility with distutils (which I don't personally - since distutils is implementation defined, I don't see any way to be backward compatible and be a significant improvement at the same time).

> For compiled stuff, it still doesn't seem too bad. Of course, it is easy to say this without having tried...

There are some python build tools which do not have advanced dependency handling (paver, etc...). But for compiled code, I don't think it makes any sense not to be based on a dependency graph. Especially for something like scipy. And this is a complex task - the core waf code is really nice, and small (< 4000 LOC for the core).

> So, what do you mean by an "egg", in the context of it being hard to produce? An .egg zip file, an .egg directory, and/or a normal distutils package with an .egg-info/ sibling? Since .egg-info/ is now part of distutils, this should now be specified... or?

Producing something such that easy_install blabla.egg "works" (that is, as it is intended to work in the setuptools world).

> [This, though, does point out a conceptual problem with setuptools for me -- it does a zillion things, some of which I like a lot (e.g. installing console scripts and gui scripts, a simple plugin architecture) and others I don't care about as long as they don't break things for me (e.g. installing multiple versions of packages side-by-side, a problem which is much more sanely solved by setting PYTHONPATH, or its sophisticated cousin, virtualenv). And all of these things go by one name "setuptools" and sometimes even "eggs", even though people often use those words to discuss totally different features. Hence my question above.]
I will refrain from speaking about setuptools :) But the above problem is the same for distutils and setuptools, and is exactly the fundamental issue. If distutils had been conceived as a set of loosely coupled modules for tools, build, and distribution, it would have been fixable. In the case of setuptools, it was a conscious decision to force setuptools on the Python world.

David

From michael.abshoff at googlemail.com Thu Feb 19 01:43:40 2009
From: michael.abshoff at googlemail.com (Michael Abshoff)
Date: Wed, 18 Feb 2009 22:43:40 -0800
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <499CF65D.3000309@ar.media.kyoto-u.ac.jp>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com> <5b8d13220902181757x4a58ce79h7d597e432d5bf22a@mail.gmail.com> <499CCD1B.60701@ar.media.kyoto-u.ac.jp> <499CEDA3.9020005@gmail.com> <499CEE03.7080507@ar.media.kyoto-u.ac.jp> <499CF6B9.8060300@gmail.com> <499CF65D.3000309@ar.media.kyoto-u.ac.jp>
Message-ID: <499CFF9C.1000901@gmail.com>

David Cournapeau wrote:
> Michael Abshoff wrote:

Hi David,

>> Sure, it also works for incremental builds and I do that many, many
>> times a day, i.e. for each patch I merge into the Sage library. What
>> gets recompiled is decided by our own dependency tracking code which we
>> want to push into Cython itself. Figuring out dependencies on the fly
>> without caching takes about 1s for the whole Sage library which includes
>> parsing every Cython file.
>
> Hm, I think I would have to look at what Sage does internally to really
> understand the implications. But surely, if you can figure out in one
> second the whole dependency graph for scipy, I would be more than
> impressed: you would beat waf and make at their own game.

I didn't write the code, but thanks :)

We used to cache the dependency tree by pickling it, and if no time stamps changed we would reuse it, so that phase would be instant. Alas, there was one unfixed bug in it that hit you if you removed an extension or file. But the main author of that code (Craig Citro - to give credit where credit is due) has an idea how to fix it, and once his PhD thesis is handed in he will stomp that bug out.

>> We used to use threads for the "parallel stuff" and it is indeed racy,
>> but that was mostly observed when running doctests since we only had one
>> current directory. All those problems went away once we started to use
>> Pyprocessing and while there is some overhead for the forks it is
>> drowned out by the build time when using 2 cores.
>
> Does pyprocessing work well on Windows as well? I have 0 experience
> with it.

It should, but I haven't tested it. The last official, stand-alone pyprocessing we have in Sage causes trouble on FreeBSD 7 by segfaulting, so we will likely update to the backport from Python 2.6 soon since for now we are stuck at Python 2.5 until numpy/scipy and a bunch of other Python projects like NetworkX support it officially ;).

>> Ouch. Is that without the dependencies, i.e. ATLAS?
>
> Yes - but I need to build scipy three times, once for each ATLAS (if I
> could use numscons, it would be much better, since a library change is
> handled as a dependency in scons; with distutils, the only safe way is to
> rebuild from scratch for every configuration).

In Sage, if we link static libs into an extension we add a dependency to the header. But you could do the same for libatlas.a, so dropping in a new version and touching it should just rebuild the extensions depending on ATLAS. I have tested this extensively and not found any problem with that approach.
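In plain distutils terms, the same trick looks roughly like this - just a sketch with made-up paths, using the stock 'depends' argument of distutils' Extension class, which makes build_ext rebuild the extension when any listed file is newer than the target:

    from distutils.core import setup
    from distutils.extension import Extension

    # Hypothetical path; 'depends' files are checked like extra sources,
    # so touching libatlas.a triggers a rebuild of just this extension.
    atlas_lib = '/usr/local/lib/libatlas.a'

    ext = Extension('mymodule',
                    sources=['mymodule.c'],
                    extra_objects=[atlas_lib],  # link the static lib in
                    depends=[atlas_lib])        # rebuild when it changes

    setup(name='mymodule', ext_modules=[ext])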
>> I was curious how you build the various versions of ATLAS, i.e. no SSE,
>> SSE, SSE2, etc. Do you just set the arch via -A and build them all on
>> the same box? [sorry for getting slightly OT here :)]
>
> This does not work: ATLAS will still use SSE if your CPU supports it,
> even if you force an arch without SSE. I tried two different things:
> first, using a patched qemu with options to emulate a P4 w/o SSE, with
> SSE2 and with SSE3, but this does not work so well (the generated
> versions are too slow, and handling virtual machines in qemu is a bit of
> a pain). Now, I just build on different machines, and hope I won't need
> to rebuild them too often.

Ok, I now remember what I did about this problem two days ago since I want to build SSE2-only binary releases of Sage. Apparently there are people out there who aren't using Intel/AMD CPUs with SSE3 :)

In order to make ATLAS build without, say, SSE3, go into the config system and have the SSE3 probe return "FAILURE" unconditionally. That way ATLAS will only pick SSE2 even if the CPU handles more. I verified by using objdump that the resulting lib does not contain any PNI (==SSE3) instructions any more, see

http://trac.sagemath.org/sage_trac/ticket/5219

A problem here is that you get odd arches without tuning info, i.e. P464SSE2, so one has to build tuning info once and drop it into subsequent builds of ATLAS. You will also have the problem of Hammer vs. P4 ATLAS kernels, so one day I will measure performance.

I meant to ask Clint about adding a configure switch for a maximum SSE level to ATLAS itself, but since I only got the problem solved two days ago I haven't gotten around to it. Given that everything else is configurable it seems like he would welcome it. If you want more details on where to poke around in the config system let me know.

> cheers,
>
> David

Cheers,

Michael

> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From strawman at astraw.com Thu Feb 19 02:26:52 2009
From: strawman at astraw.com (Andrew Straw)
Date: Wed, 18 Feb 2009 23:26:52 -0800
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <5b8d13220902182234v319ebba2r22c078948a9cc800@mail.gmail.com>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com> <5b8d13220902182125u51f98727t40ff4fc32593a1a0@mail.gmail.com> <6ce0ac130902182145k6d5697f4u912c423ce3dc4f87@mail.gmail.com> <499CF284.3060508@ar.media.kyoto-u.ac.jp> <499CFB04.7020305@astraw.com> <5b8d13220902182234v319ebba2r22c078948a9cc800@mail.gmail.com>
Message-ID: <499D09BC.7050806@astraw.com>

David Cournapeau wrote:
> On Thu, Feb 19, 2009 at 3:24 PM, Andrew Straw wrote:
>
>> It's an interesting idea to build Python package distributions without
>> distutils. For pure Python installables, if all you seek is to be better
>> than distutils, the bar seems fairly low.
>
> :) Being better than distutils is not difficult, indeed - that is if
> you don't care about backward compatibility with distutils (which I
> don't personally - since distutils is implementation-defined, I don't
> see any way to be backward compatible and be a significant improvement
> at the same time).

Maybe if you need a level of backward compatibility (and really, to gain a decent audience for this idea, I think you do need some level of backward compatibility), the new tool could emit setup.py files for consumption by distutils as a fallback plan.
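For instance - a purely hypothetical sketch, nothing to do with any existing tool - it could keep its metadata in a declarative form and dump a minimal setup.py from it:

    # Hypothetical sketch: emit a minimal setup.py from declarative metadata.
    META = {'name': 'mypkg', 'version': '0.1', 'packages': ['mypkg']}

    TEMPLATE = '''\
    from distutils.core import setup
    setup(name=%(name)r, version=%(version)r, packages=%(packages)r)
    '''

    def emit_setup_py(meta, path='setup.py'):
        f = open(path, 'w')
        try:
            f.write(TEMPLATE % meta)
        finally:
            f.close()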
I could imagine that auto-generating a setup.py is easier than emulating distutils, particularly if your concept starts simple. Furthermore, if you're not opposed to dropping in your own distutils monkeypatches, like lots of other packages, you probably could do anything you wanted. For example, bypassing the build_ext command and injecting the built products into the distutils install command. This monkeypatching idea is not even so unpalatable if one remembers that monkeypatching, which is quite common, makes it literally impossible to emulate distutils without being distutils.

> I will refrain from speaking about setuptools :) But the above problem
> is the same for distutils and setuptools, and is exactly the fundamental
> issue. If distutils had been conceived as a set of loosely coupled modules
> for tools, build, and distribution, it would have been fixable. In the
> case of setuptools, it was a conscious decision to force setuptools on
> the Python world.

This reminds me of Linus' criticism of svn: its goal is to be a better cvs. Said dripping with incredulity due to his perception of the fatal flaws of CVS. Well, I think (parts of) setuptools are better than distutils, but by being distutils+, it will always share the same flawed genetic material...

From cournape at gmail.com Thu Feb 19 02:47:00 2009
From: cournape at gmail.com (David Cournapeau)
Date: Thu, 19 Feb 2009 16:47:00 +0900
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <499D09BC.7050806@astraw.com>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com> <5b8d13220902182125u51f98727t40ff4fc32593a1a0@mail.gmail.com> <6ce0ac130902182145k6d5697f4u912c423ce3dc4f87@mail.gmail.com> <499CF284.3060508@ar.media.kyoto-u.ac.jp> <499CFB04.7020305@astraw.com> <5b8d13220902182234v319ebba2r22c078948a9cc800@mail.gmail.com> <499D09BC.7050806@astraw.com>
Message-ID: <5b8d13220902182347p3d88af5bpafa17251168464aa@mail.gmail.com>

On Thu, Feb 19, 2009 at 4:26 PM, Andrew Straw wrote:

> Maybe if you need a level of backward compatibility (and really, to
> gain a decent audience for this idea, I think you do need some level of
> backward compatibility), the new tool could emit setup.py files for
> consumption by distutils as a fallback plan.

When I say I don't care about backward compatibility, it was in the context of numpy. If I wanted to do a better tool for python, I would have done things differently.

> Furthermore, if you're not
> opposed to dropping in your own distutils monkeypatches, like lots of
> other packages, you probably could do anything you wanted. For example,
> bypassing the build_ext command and injecting the built products into
> the distutils install command.

you've just described the scons command in numpy.distutils :)

> This reminds me of Linus' criticism of svn: its goal is to be a better cvs.
> Said dripping with incredulity due to his perception of the fatal flaws
> of CVS. Well, I think (parts of) setuptools are better than distutils,
> but by being distutils+, it will always share the same flawed genetic
> material...

We should have a Linus Torvalds: very few people are capable of building a new tool which blows away the whole field, in two weeks :) Also, svn, for all its flaws, was at least based on concepts which were widely shared among the people working on the problem, whereas distutils isn't (building things from a DAG has been known for decades).
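(To be concrete about what a DAG buys you - a toy sketch, nothing like a real build tool: nodes are files, edges are dependencies, and a target is rebuilt iff it is missing or older than one of its dependencies.)

    import os

    # Toy dependency graph: target -> list of dependencies.
    deps = {
        'foo.o':  ['foo.c', 'foo.h'],
        'main.o': ['main.c', 'foo.h'],
        'app':    ['foo.o', 'main.o'],
    }

    def needs_rebuild(target):
        if not os.path.exists(target):
            return True
        mtime = os.path.getmtime(target)
        return any((not os.path.exists(d)) or os.path.getmtime(d) > mtime
                   for d in deps.get(target, []))

    def build(target, rule):
        # Post-order walk of the DAG: bring dependencies up to date first.
        for d in deps.get(target, []):
            build(d, rule)
        if target in deps and needs_rebuild(target):
            rule(target)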
David

From sturla at molden.no Thu Feb 19 05:23:44 2009
From: sturla at molden.no (Sturla Molden)
Date: Thu, 19 Feb 2009 11:23:44 +0100
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <499CF65D.3000309@ar.media.kyoto-u.ac.jp>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com> <5b8d13220902181757x4a58ce79h7d597e432d5bf22a@mail.gmail.com> <499CCD1B.60701@ar.media.kyoto-u.ac.jp> <499CEDA3.9020005@gmail.com> <499CEE03.7080507@ar.media.kyoto-u.ac.jp> <499CF6B9.8060300@gmail.com> <499CF65D.3000309@ar.media.kyoto-u.ac.jp>
Message-ID: <499D3330.3030306@molden.no>

On 2/19/2009 7:04 AM, David Cournapeau wrote:
> Does pyprocessing work well on Windows as well? I have 0 experience
> with it.

Yes, it works well on Windows, albeit process creation is a bit slower than on Unix (there is no os.fork on Windows, so more Python objects have to be pickled). From Python 2.6, pyprocessing is even in the standard library, renamed to 'multiprocessing'.

S.M.

From cournape at gmail.com Thu Feb 19 10:16:42 2009
From: cournape at gmail.com (David Cournapeau)
Date: Fri, 20 Feb 2009 00:16:42 +0900
Subject: [Numpy-discussion] Core math library in numpy
In-Reply-To: <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com>
References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp> <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com>
Message-ID: <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com>

On Sun, Feb 15, 2009 at 5:09 PM, Robert Kern wrote:
> On Sun, Feb 15, 2009 at 01:48, David Cournapeau
> wrote:
>
>> Would people be against this ?
>
> Not I.

Ok, I have started this in the coremath branch - it solves the warning issues we got since the merge of the formatting stuff. I tested it on Linux and Windows (both mingw and VS - still need to test on Win64), so I think it is good to go in the trunk. The functions exported are in a separate header:

http://projects.scipy.org/scipy/numpy/browser/branches/coremath/numpy/core/include/numpy/npy_math.h

I did not put the complex functions there (nc_), but maybe they should be, I am not sure.

cheers,

David

From pav at iki.fi Thu Feb 19 10:28:42 2009
From: pav at iki.fi (Pauli Virtanen)
Date: Thu, 19 Feb 2009 15:28:42 +0000 (UTC)
Subject: [Numpy-discussion] Core math library in numpy
References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp> <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com> <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com>
Message-ID:

Fri, 20 Feb 2009 00:16:42 +0900, David Cournapeau wrote:
[clip]
> Ok, I have started this in the coremath branch - it solves the warning
> issues we got since the merge of the formatting stuff. I tested it on
> Linux and Windows (both mingw and VS - still need to test on Win64), so
> I think it is good to go in the trunk. The functions exported are in a
> separate header:
>
> http://projects.scipy.org/scipy/numpy/browser/branches/coremath/numpy/core/include/numpy/npy_math.h

One question: doesn't this add one extra function call to all umath functions? Could we do '#define npy_XXX XXX' in the npy_math.h header when the appropriate platform-specific functions are available?

> I did not put the complex functions there (nc_), but maybe they should
> be, I am not sure.

I think they should be. Then we could easily use C99 complex math functions on platforms on which they are available (and so get the "correct" corner-case semantics for free on these platforms).
Pauli

From cournape at gmail.com Thu Feb 19 11:05:03 2009
From: cournape at gmail.com (David Cournapeau)
Date: Fri, 20 Feb 2009 01:05:03 +0900
Subject: [Numpy-discussion] Core math library in numpy
In-Reply-To:
References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp> <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com> <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com>
Message-ID: <5b8d13220902190805s2d12ac96va40b6062959fc53@mail.gmail.com>

On Fri, Feb 20, 2009 at 12:28 AM, Pauli Virtanen wrote:
> Fri, 20 Feb 2009 00:16:42 +0900, David Cournapeau wrote:
> [clip]
>> Ok, I have started this in the coremath branch - it solves the warning
>> issues we got since the merge of the formatting stuff. I tested it on
>> Linux and Windows (both mingw and VS - still need to test on Win64), so
>> I think it is good to go in the trunk. The functions exported are in a
>> separate header:
>>
>> http://projects.scipy.org/scipy/numpy/browser/branches/coremath/numpy/core/include/numpy/npy_math.h
>
> One question: doesn't this add one extra function call to all umath
> functions? Could we do '#define npy_XXX XXX' in the npy_math.h header
> when the appropriate platform-specific functions are available?

I think it does add a cost. I thought about it, but I don't know whether this is significant or not. The problem with '#define npy_XXX XXX' is that the header needs to know whether the corresponding function is available - and we can't import HAVE_XXX symbols in npy_math.h.

Another argument against the define is that in the future, we could load the actual implementation at runtime (SSE, etc...). If we use a define, that's not possible.

I don't know how to evaluate the cost of a function call for those functions: I tried some naive benchmarks, but with cache effects and all, I am not convinced I was measuring anything meaningful.

> I think they should be. Then we could easily use C99 complex math
> functions on platforms on which they are available (and so get the
> "correct" corner-case semantics for free on these platforms).

Maybe we could change the names, then? nc_ is not very clear IMHO (and since they were static up to now, we are free to change them, I believe).

cheers,

David

From cournape at gmail.com Thu Feb 19 11:12:08 2009
From: cournape at gmail.com (David Cournapeau)
Date: Fri, 20 Feb 2009 01:12:08 +0900
Subject: [Numpy-discussion] Core math library in numpy
In-Reply-To: <5b8d13220902190805s2d12ac96va40b6062959fc53@mail.gmail.com>
References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp> <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com> <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com> <5b8d13220902190805s2d12ac96va40b6062959fc53@mail.gmail.com>
Message-ID: <5b8d13220902190812g35107b5cncb2e2152661948ad@mail.gmail.com>

On Fri, Feb 20, 2009 at 1:05 AM, David Cournapeau wrote:
>
> Another argument against the define is that in the future, we could
> load the actual implementation at runtime (SSE, etc...). If we use
> a define, that's not possible.

Hm, that's actually a pretty stupid statement - we could certainly have an ifdef to conditionally use a define or not.
David

From cournape at gmail.com Thu Feb 19 11:32:07 2009
From: cournape at gmail.com (David Cournapeau)
Date: Fri, 20 Feb 2009 01:32:07 +0900
Subject: [Numpy-discussion] Core math library in numpy
In-Reply-To: <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com>
References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp> <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com> <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com>
Message-ID: <5b8d13220902190832q23f09867te59c5f46261642a5@mail.gmail.com>

On Fri, Feb 20, 2009 at 12:16 AM, David Cournapeau wrote:
> On Sun, Feb 15, 2009 at 5:09 PM, Robert Kern wrote:
>> On Sun, Feb 15, 2009 at 01:48, David Cournapeau
>> wrote:
>>
>>> Would people be against this ?
>>
>> Not I.
>
> Ok, I have started this in the coremath branch - it solves the warning
> issues we got since the merge of the formatting stuff. I tested it on
> Linux and Windows (both mingw and VS - still need to test on Win64)

Surprisingly, it worked out of the box on Win64 (with Python 2.6). There are some errors, but totally unrelated to the changes. That alone is enough of an argument for inclusion IMO - those compiler crashes were driving me crazy,

cheers,

David

From charlesr.harris at gmail.com Thu Feb 19 12:23:08 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Thu, 19 Feb 2009 10:23:08 -0700
Subject: [Numpy-discussion] parallel compilation of numpy
In-Reply-To: <5b8d13220902182347p3d88af5bpafa17251168464aa@mail.gmail.com>
References: <85b5c3130902181614i2e7129eau262162b343398ec4@mail.gmail.com> <5b8d13220902182125u51f98727t40ff4fc32593a1a0@mail.gmail.com> <6ce0ac130902182145k6d5697f4u912c423ce3dc4f87@mail.gmail.com> <499CF284.3060508@ar.media.kyoto-u.ac.jp> <499CFB04.7020305@astraw.com> <5b8d13220902182234v319ebba2r22c078948a9cc800@mail.gmail.com> <499D09BC.7050806@astraw.com> <5b8d13220902182347p3d88af5bpafa17251168464aa@mail.gmail.com>
Message-ID:

On Thu, Feb 19, 2009 at 12:47 AM, David Cournapeau wrote:

> On Thu, Feb 19, 2009 at 4:26 PM, Andrew Straw wrote:
>
>> Maybe if you need a level of backward compatibility (and really, to
>> gain a decent audience for this idea, I think you do need some level of
>> backward compatibility), the new tool could emit setup.py files for
>> consumption by distutils as a fallback plan.
>
> When I say I don't care about backward compatibility, it was in the
> context of numpy. If I wanted to do a better tool for python, I would
> have done things differently.
>
>> Furthermore, if you're not
>> opposed to dropping in your own distutils monkeypatches, like lots of
>> other packages, you probably could do anything you wanted. For example,
>> bypassing the build_ext command and injecting the built products into
>> the distutils install command.
>
> you've just described the scons command in numpy.distutils :)
>
>> This reminds me of Linus' criticism of svn: its goal is to be a better cvs.
>> Said dripping with incredulity due to his perception of the fatal flaws
>> of CVS. Well, I think (parts of) setuptools are better than distutils,
>> but by being distutils+, it will always share the same flawed genetic
>> material...
>
> We should have a Linus Torvalds: very few people are capable of
> building a new tool which blows away the whole field, in two weeks :)
> Also, svn, for all its flaws, was at least based on concepts which
> were widely shared among the people working on the problem, whereas
> distutils isn't (building things from a DAG has been known for decades).
> Linus Torvalds and git brought Larry McVoy to mind and this little piece he has on his site. For some reason this seems like a good time to post it ;) The Bug Count Also Rises by John Browne (Imitation Hemingway Contest Winner) In the fall of that year the rains fell as usual and washed the leaves of the dust and dripped from the leaves onto the ground. The shuttles drove through the rainy streets and took the people to meetings, then later brought them back, their tires spraying the mist into the air. Many days he stood for a long time and watched the rain and the shuttles and drank his double-tall mochas. With the mochas he was strong. Hernando who worked down the hall and who was large with microbrews came to him and told him that the ship day was upon them but the bugs were not yet out. The bugs which were always there even when you were in Cafes late at night sipping a Redhook or a double-tall mocha and you thought you were safe but they were there and although Enrico kept the floor swept clean and the mochas were hot the bugs were there and they ate at you. When Hernando told him this he asked how many bugs. "The RAID is huge with bugs," Hernando said. "The bugs are infinite." "Why do you ask me? You know I cannot do this thing anymore with the bugs." "Once you were great with the bugs," Hernando said. "No one was greater," he said again. "Even Prado." "Prado? What of Prado? Let Prado fix the bugs." Hernando shrugged. "Prado is finished. He was gored by three Sev 2's in Chicago. All he does now is drink herb tea and play with his screensavers." "Herb tea?" "It is true, my friend." Hernando shrugged again. Later he went to his office and sat in the dark for a long time. Then he sent e-mail to Michaels. Michaels came to him while he was sipping a mocha. They sat silently for awhile, then he asked Michaels, "I need you to triage for me." Michaels looked down. "I don't do that anymore," he said. "This is different. The bugs are enormous. There are an infinity of bugs." "I'm finished with that," Michaels said again. "I just want to live quietly." "Have you heard Prado is finished? He was badly gored. Now he can only drink herb tea." "Herb tea?" Michaels said. "It is true," he said sorrowfully. Michaels stood up. "Then I will do it, my friend," he said formally. "I will do it for Prado, who was once great with the bugs. I will do it for the time we filled Prado's office with bouncy balls, and for the time Prado wore his nerf weapons in the marketing hall and slew all of them with no fear and only a great joy at the combat. I will do it for all the pizza we ate and the bottles of Coke we drank." Together they walked slowly back, knowing it would be good. As they walked the rain dripped softly from the leaves, and the shuttles carried the bodies back from the meetings. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From ondrej at certik.cz Thu Feb 19 12:36:42 2009 From: ondrej at certik.cz (Ondrej Certik) Date: Thu, 19 Feb 2009 09:36:42 -0800 Subject: [Numpy-discussion] numpy 1.2.1 uploaded to unstable Message-ID: <85b5c3130902190936keb8a821gc424c1ea72bc3948@mail.gmail.com> Hi, I finally found time to upload the numpy 1.2.1 to Debian unstable (currently it's in incoming: http://incoming.debian.org/). The package is lintian clean, but there is one test that failed for me in chroot. 
I'll wait until it gets to mirrors and then try it on my laptop and report a bug (I uploaded from an Ubuntu machine, but of course I compiled it in pbuilder with sid and tested in chroot).

Ondrej

From charlesr.harris at gmail.com Thu Feb 19 13:41:58 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Thu, 19 Feb 2009 11:41:58 -0700
Subject: [Numpy-discussion] Core math library in numpy
In-Reply-To: <5b8d13220902190832q23f09867te59c5f46261642a5@mail.gmail.com>
References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp> <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com> <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com> <5b8d13220902190832q23f09867te59c5f46261642a5@mail.gmail.com>
Message-ID:

On Thu, Feb 19, 2009 at 9:32 AM, David Cournapeau wrote:
> On Fri, Feb 20, 2009 at 12:16 AM, David Cournapeau
> wrote:
>> On Sun, Feb 15, 2009 at 5:09 PM, Robert Kern wrote:
>>> On Sun, Feb 15, 2009 at 01:48, David Cournapeau
>>> wrote:
>>>
>>>> Would people be against this ?
>>>
>>> Not I.
>>
>> Ok, I have started this in the coremath branch - it solves the warning
>> issues we got since the merge of the formatting stuff. I tested it on
>> Linux and Windows (both mingw and VS - still need to test on Win64)
>
> Surprisingly, it worked out of the box on Win64 (with Python 2.6).
> There are some errors, but totally unrelated to the changes. That
> alone is enough of an argument for inclusion IMO - those compiler
> crashes were driving me crazy,

I think this is a bit obfuscated:

--- branches/coremath/numpy/core/code_generators/generate_umath.py 2009-02-19 11:45:30 UTC (rev 6417)
+++ branches/coremath/numpy/core/code_generators/generate_umath.py 2009-02-19 11:46:09 UTC (rev 6418)
@@ -37,7 +37,7 @@
         self.out = self.type * nout
         assert len(self.out) == nout

-_fdata_map = dict(f='%sf', d='%s', g='%sl',
+_fdata_map = dict(f='npy_%sf', d='npy_%s', g='npy_%sl',
                   F='nc_%sf', D='nc_%s', G='nc_%sl')

 def build_func_data(types, f):
     func_data = []

I think it would be better to make the npy_* prefix explicit in the TD constructor calls. That way folks don't have to dig through the code to discover where the magic npy came from.

Chuck

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From ndbecker2 at gmail.com Thu Feb 19 14:19:18 2009
From: ndbecker2 at gmail.com (Neal Becker)
Date: Thu, 19 Feb 2009 19:19:18 +0000 (UTC)
Subject: [Numpy-discussion] Build with MKL on linux
Message-ID:

Trying to build numpy-1.2.1 with Intel MKL 10.1.1.019 on Linux F10 x86_64.

echo $LD_LIBRARY_PATH
/opt/intel/mkl/10.1.1.019/lib/em64t

strace -e trace=file python -c 'import numpy; numpy.test()' 2>stuff
Running unit tests for numpy
NumPy version 1.2.1
NumPy is installed in /usr/lib64/python2.5/site-packages/numpy
Python version 2.5.2 (r252:60911, Sep 30 2008, 15:42:03) [GCC 4.3.2 20080917 (Red Hat 4.3.2-4)]
nose version 0.10.3

MKL FATAL ERROR: Cannot load neither libmkl_mc.so nor libmkl_def.so

strace shows:
...
open("/usr/lib64/python2.5/site-packages/numpy/core/tests/test_blasdot.py", O_RDONLY) = 3 open("/usr/lib64/python2.5/site-packages/numpy/core/tests/test_blasdot.pyc", O_RDONLY) = 5 open("/opt/intel/mkl/10.1.1.019/lib/em64t/libmkl_mc.so", O_RDONLY) = 3 open("/opt/intel/mkl/10.1.1.019/lib/em64t/libmkl_mc.so", O_RDONLY) = 3 open("/opt/intel/mkl/10.1.1.019/lib/em64t/libmkl_def.so", O_RDONLY) = 3 open("/opt/intel/mkl/10.1.1.019/lib/em64t/libmkl_def.so", O_RDONLY) = 3 The files are there and found, what could be wrong? From nwagner at iam.uni-stuttgart.de Thu Feb 19 15:51:35 2009 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Thu, 19 Feb 2009 21:51:35 +0100 Subject: [Numpy-discussion] Summary of ticket 937 Message-ID: Hi all, The summary of ticket 937 is incomplete. It should be "Complex matrices and lstsq". http://projects.scipy.org/scipy/numpy/ticket/937 Nils From robert.kern at gmail.com Thu Feb 19 16:21:26 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 19 Feb 2009 15:21:26 -0600 Subject: [Numpy-discussion] ANN: line_profiler 1.0b2 released Message-ID: <3d375d730902191321m396f7ddeu8b84d9fdc562c952@mail.gmail.com> http://pypi.python.org/pypi/line_profiler/ http://packages.python.org/line_profiler/ This release fixes the "negative timings" issue on Windows. Future announcements will occur on python-announce. I just wanted to make sure my earliest users here who ran into this bug are aware of the fix. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From bsouthey at gmail.com Thu Feb 19 16:24:24 2009 From: bsouthey at gmail.com (Bruce Southey) Date: Thu, 19 Feb 2009 15:24:24 -0600 Subject: [Numpy-discussion] Update webpage for python requirements for Numpy/SciPy Message-ID: <499DCE08.903@gmail.com> Hi, Could someone please update the website to clearly state that numpy 1.2 requires Python 2.4 or later? I know it is in the release notes but that assumes people read them :-) It would be great to state this on the download and installation pages: http://www.scipy.org/Download http://www.scipy.org/Installing_SciPy Also there should be a mention that nose is required at for testing. A couple of dated material that I noticed are: http://www.scipy.org/Installing_SciPy/BuildingGeneral 'To build SciPy, Python version 2.3 or newer is required.' The FAQ page (http://www.scipy.org/FAQ) NumPy/SciPy installation 'Prerequisities NumPy requires the following software installed: 1. Python 2.3.x or 2.4.x or 2.5.x ' Thanks Bruce From mattdm at mattdm.org Thu Feb 19 16:26:19 2009 From: mattdm at mattdm.org (Matthew Miller) Date: Thu, 19 Feb 2009 16:26:19 -0500 Subject: [Numpy-discussion] ANN: line_profiler 1.0b2 released In-Reply-To: <3d375d730902191321m396f7ddeu8b84d9fdc562c952@mail.gmail.com> References: <3d375d730902191321m396f7ddeu8b84d9fdc562c952@mail.gmail.com> Message-ID: <20090219212619.GA10229@jadzia.bu.edu> On Thu, Feb 19, 2009 at 03:21:26PM -0600, Robert Kern wrote: > http://pypi.python.org/pypi/line_profiler/ > http://packages.python.org/line_profiler/ > This release fixes the "negative timings" issue on Windows. Cool. I'm still interested in making Fedora packages for this once I get a few spare cycles. 
--
Matthew Miller mattdm at mattdm.org

From millman at berkeley.edu Thu Feb 19 16:52:54 2009
From: millman at berkeley.edu (Jarrod Millman)
Date: Thu, 19 Feb 2009 13:52:54 -0800
Subject: [Numpy-discussion] Update webpage for python requirements for Numpy/SciPy
In-Reply-To: <499DCE08.903@gmail.com>
References: <499DCE08.903@gmail.com>
Message-ID:

On Thu, Feb 19, 2009 at 1:24 PM, Bruce Southey wrote:
> Hi,
> Could someone please update the website to clearly state that numpy 1.2
> requires Python 2.4 or later?
> I know it is in the release notes, but that assumes people read them :-)
>
> It would be great to state this on the download and installation pages:
> http://www.scipy.org/Download
> http://www.scipy.org/Installing_SciPy
>
> Also, there should be a mention that nose is required for testing.
>
> A couple of pieces of dated material that I noticed:
>
> http://www.scipy.org/Installing_SciPy/BuildingGeneral
> 'To build SciPy, Python version 2.3 or newer is required.'
>
> The FAQ page (http://www.scipy.org/FAQ)
> NumPy/SciPy installation
> 'Prerequisities
> NumPy requires the following software installed:
> 1. Python 2.3.x or 2.4.x or 2.5.x '

It is extremely difficult to keep track of the numerous pages that explain what the requirements are and how to build and install everything. I would love it if someone would volunteer to add this information to the user documentation for numpy:
http://docs.scipy.org/doc/numpy/user/
and scipy:
http://docs.scipy.org/doc/scipy/reference/ (maybe in the tutorial)?

Then we can just link to the relevant authoritative site on as many pages as we want. Since the docs are checked into the source code, it will be much easier--and more likely--that developers will update this information while they are working on the code. Unfortunately I don't have the time to do this myself, but I would be extremely appreciative if someone else was able to take the time to do this. It would be very useful. You could start here:

http://projects.scipy.org/scipy/scipy/browser/trunk/INSTALL.txt
http://projects.scipy.org/scipy/numpy/browser/trunk/INSTALL.txt

And then integrate the various information from the other sites listed above. As soon as the information is moved from the wiki to the official docs, you could replace the source pages with links to the official, authoritative site. I am thinking the final version would look something like this:

http://neuroimaging.scipy.org/site/doc/manual/html/users/installation.html
http://neuroimaging.scipy.org/site/doc/manual/html/devel/install/index.html

Of course, the information might not always be correct; but if there is one authoritative site where everyone can point out things that are broken or don't work, it hopefully won't take too long to get everything in good shape. So if you are up to the challenge, I would recommend just merging everything to start and not worrying too much about testing and verifying that everything works correctly before starting.

Thanks,
Jarrod

From ellisonbg.net at gmail.com Thu Feb 19 17:21:40 2009
From: ellisonbg.net at gmail.com (Brian Granger)
Date: Thu, 19 Feb 2009 14:21:40 -0800
Subject: [Numpy-discussion] ANN: line_profiler 1.0b2 released
In-Reply-To: <3d375d730902191321m396f7ddeu8b84d9fdc562c952@mail.gmail.com>
References: <3d375d730902191321m396f7ddeu8b84d9fdc562c952@mail.gmail.com>
Message-ID: <6ce0ac130902191421r4d93a762o35863ae254e62b9d@mail.gmail.com>

Robert,

Thanks for the announcement. I have recently started to use line_profiler to profile Twisted-using servers and clients.
I quickly found that line_profiler needed some modifications to properly handle timing functions that return Deferreds. I have written some small extensions to line_profiler (basically a subclass of LineProfiler called DeferredLineProfiler) to handle these things. I need to do some further testing before I contribute this, but how would you like to handle contributions (assuming you are open to contributions :)?

For me, it is probably easiest to simply throw a branch up on github.

Cheers and thanks for a fantastic package!

Brian

On Thu, Feb 19, 2009 at 1:21 PM, Robert Kern wrote:
> http://pypi.python.org/pypi/line_profiler/
> http://packages.python.org/line_profiler/
>
> This release fixes the "negative timings" issue on Windows.
>
> Future announcements will occur on python-announce. I just wanted to
> make sure my earliest users here who ran into this bug are aware of
> the fix.
>
> --
> Robert Kern
>
> "I have come to believe that the whole world is an enigma, a harmless
> enigma that is made terrible by our own mad attempt to interpret it as
> though it had an underlying truth."
> -- Umberto Eco
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From robert.kern at gmail.com Thu Feb 19 17:33:21 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 19 Feb 2009 16:33:21 -0600
Subject: [Numpy-discussion] ANN: line_profiler 1.0b2 released
In-Reply-To: <6ce0ac130902191421r4d93a762o35863ae254e62b9d@mail.gmail.com>
References: <3d375d730902191321m396f7ddeu8b84d9fdc562c952@mail.gmail.com> <6ce0ac130902191421r4d93a762o35863ae254e62b9d@mail.gmail.com>
Message-ID: <3d375d730902191433t4eeb6effg5d949ee7eadab3a9@mail.gmail.com>

On Thu, Feb 19, 2009 at 16:21, Brian Granger wrote:
> Robert,
>
> Thanks for the announcement. I have recently started to use
> line_profiler to profile Twisted-using servers and clients. I quickly
> found that line_profiler needed some modifications to properly handle
> timing functions that return Deferreds. I have written some small
> extensions to line_profiler (basically a subclass of LineProfiler
> called DeferredLineProfiler) to handle these things. I need to do
> some further testing before I contribute this, but how would you like
> to handle contributions (assuming you are open to contributions :)?
>
> For me, it is probably easiest to simply throw a branch up on github.

github is fine, but bitbucket would be a better impedance match. Either way, I'm looking forward to it. Thanks!

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth."
-- Umberto Eco

From ellisonbg.net at gmail.com Thu Feb 19 17:35:24 2009
From: ellisonbg.net at gmail.com (Brian Granger)
Date: Thu, 19 Feb 2009 14:35:24 -0800
Subject: [Numpy-discussion] ANN: line_profiler 1.0b2 released
In-Reply-To: <3d375d730902191433t4eeb6effg5d949ee7eadab3a9@mail.gmail.com>
References: <3d375d730902191321m396f7ddeu8b84d9fdc562c952@mail.gmail.com> <6ce0ac130902191421r4d93a762o35863ae254e62b9d@mail.gmail.com> <3d375d730902191433t4eeb6effg5d949ee7eadab3a9@mail.gmail.com>
Message-ID: <6ce0ac130902191435h1b8572cbw87fcbf39040a4f27@mail.gmail.com>

> github is fine, but bitbucket would be a better impedance match.
> Either way, I'm looking forward to it. Thanks!

I will give bitbucket a shot and let you know when I have something for you to look at.
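In the meantime, here is roughly the shape of the idea - an untested sketch, not the code I will push, and I am assuming LineProfiler's wrap_function / enable_by_count / disable_by_count methods here:

    from line_profiler import LineProfiler
    from twisted.internet import defer

    class DeferredLineProfiler(LineProfiler):
        # Keep the profiler enabled until a returned Deferred actually
        # fires, instead of stopping as soon as the function returns it.
        def wrap_function(self, func):
            def wrapper(*args, **kwds):
                self.enable_by_count()
                try:
                    result = func(*args, **kwds)
                except:
                    self.disable_by_count()
                    raise
                if isinstance(result, defer.Deferred):
                    def finished(passthrough):
                        self.disable_by_count()
                        return passthrough
                    result.addBoth(finished)  # fires on success or failure
                else:
                    self.disable_by_count()
                return result
            return wrapper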
Cheers,

Brian

> --
> Robert Kern
>
> "I have come to believe that the whole world is an enigma, a harmless
> enigma that is made terrible by our own mad attempt to interpret it as
> though it had an underlying truth."
> -- Umberto Eco
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From frank at gis4weather.com Thu Feb 19 18:03:58 2009
From: frank at gis4weather.com (Frank Peacock)
Date: Thu, 19 Feb 2009 23:03:58 -0000
Subject: [Numpy-discussion] Initialize numpy array with other numpy arrays
Message-ID: <00a901c992e6$59103ea0$0b30bbe0$@com>

Hello

I would like to know whether I can do the following in some other way, as this fails with "setting an array element with a sequence" on each of the colour arrays:

h,w=720,410
img = ones((h,w,3), uint8)*255
img[ngridn,ngride]=(ncolour[0],ncolour[1],ncolour[2])
pilImage = Image.fromarray(img, 'RGB')

where ngridn, ngride and ncolour[m] are all 1-D with the same dimension (effectively the ngridn and ngride values map within the bounds of the image).

The following works fine:

h,w=720,410
img = ones((h,w,3), uint8)*255
img[ngridn,ngride]=(255,0,0)
pilImage = Image.fromarray(img, 'RGB')

I would prefer not to use indices to solve the problem like the following (as it is a lot slower):

h,w=720,410
img = ones((h,w,3), uint8)*255
for n in range(len(ngride)):
    img[ngridn[n],ngride[n]]=(ncolour[0][n],ncolour[1][n],ncolour[2][n])
pilImage = Image.fromarray(img, 'RGB')

Is it possible to avoid the indices and use numpy functions instead?

Thanks

Frank

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From robert.kern at gmail.com Thu Feb 19 19:24:54 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 19 Feb 2009 18:24:54 -0600
Subject: [Numpy-discussion] Initialize numpy array with other numpy arrays
In-Reply-To: <00a901c992e6$59103ea0$0b30bbe0$@com>
References: <00a901c992e6$59103ea0$0b30bbe0$@com>
Message-ID: <3d375d730902191624x3f7f5d41k3256b1e1f1d8e015@mail.gmail.com>

On Thu, Feb 19, 2009 at 17:03, Frank Peacock wrote:
> Hello
>
> I would like to know whether I can do the following in some other way, as
> this fails with "setting an array element with a sequence" on each of the
> colour arrays:
>
> h,w=720,410
> img = ones((h,w,3), uint8)*255
> img[ngridn,ngride]=(ncolour[0],ncolour[1],ncolour[2])
> pilImage = Image.fromarray(img, 'RGB')

You probably want to do

for i in range(3):
    img[ngridn,ngride,i] = ncolour[i]

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth."
-- Umberto Eco

From cournape at gmail.com Thu Feb 19 23:37:21 2009
From: cournape at gmail.com (David Cournapeau)
Date: Fri, 20 Feb 2009 13:37:21 +0900
Subject: [Numpy-discussion] Trac is overloaded
Message-ID: <5b8d13220902192037h6107cebcv8986fd4422464721@mail.gmail.com>

Hi,

Just to mention that it looks like the numpy Trac is overloaded (error 500 and locked database).

cheers,

David

From scott.sinclair.za at gmail.com Fri Feb 20 01:34:06 2009
From: scott.sinclair.za at gmail.com (Scott Sinclair)
Date: Fri, 20 Feb 2009 08:34:06 +0200
Subject: [Numpy-discussion] Update webpage for python requirements for Numpy/SciPy
In-Reply-To:
References: <499DCE08.903@gmail.com>
Message-ID: <6a17e9ee0902192234j4fbfdec1r8887fa321fd5d782@mail.gmail.com>

> 2009/2/19 Jarrod Millman :
> On Thu, Feb 19, 2009 at 1:24 PM, Bruce Southey wrote:
>> Hi,
>> Could someone please update the website to clearly state that numpy 1.2
>> requires Python 2.4 or later?
>> I know it is in the release notes, but that assumes people read them :-)
>
> It is extremely difficult to keep track of the numerous pages that
> explain what the requirements are and how to build and install
> everything. I would love it if someone would volunteer to add this
> information to the user documentation for numpy:
> http://docs.scipy.org/doc/numpy/user/
> and scipy:
> http://docs.scipy.org/doc/scipy/reference/ (maybe in the tutorial)?

If someone can add a stub to the docs in SVN (patch attached for Numpy), I'm prepared to work on this. I can't see how to add pages in the doc-wiki...

Cheers,
Scott

-------------- next part --------------
A non-text attachment was scrubbed...
Name: numpy_install.patch
Type: text/x-diff
Size: 685 bytes
Desc: not available
URL:

From charlesr.harris at gmail.com Fri Feb 20 02:09:08 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 20 Feb 2009 00:09:08 -0700
Subject: [Numpy-discussion] Core math library in numpy
In-Reply-To: <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com>
References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp> <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com> <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com>
Message-ID:

On Thu, Feb 19, 2009 at 8:16 AM, David Cournapeau wrote:
> On Sun, Feb 15, 2009 at 5:09 PM, Robert Kern wrote:
>> On Sun, Feb 15, 2009 at 01:48, David Cournapeau
>> wrote:
>>
>>> Would people be against this ?
>>
>> Not I.
>
> Ok, I have started this in the coremath branch - it solves the warning
> issues we got since the merge of the formatting stuff. I tested it on
> Linux and Windows (both mingw and VS - still need to test on Win64), so I
> think it is good to go in the trunk. The functions exported are in a
> separate header:
>
> http://projects.scipy.org/scipy/numpy/browser/branches/coremath/numpy/core/include/numpy/npy_math.h
>
> I did not put the complex functions there (nc_), but maybe they should
> be, I am not sure.

I'm thinking yes, we might want them in scalarmathmodule.

Chuck

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From frank at gis4weather.com Fri Feb 20 02:33:52 2009
From: frank at gis4weather.com (Frank Peacock)
Date: Fri, 20 Feb 2009 07:33:52 -0000
Subject: [Numpy-discussion] Initialize numpy array with other numpy arrays
Message-ID: <00d901c9932d$9460c0f0$bd2242d0$@com>

Many thanks, Robert.

Frank

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From faltet at pytables.org Fri Feb 20 02:44:11 2009
From: faltet at pytables.org (Francesc Alted)
Date: Fri, 20 Feb 2009 08:44:11 +0100
Subject: [Numpy-discussion] Trac is overloaded
In-Reply-To: <5b8d13220902192037h6107cebcv8986fd4422464721@mail.gmail.com>
References: <5b8d13220902192037h6107cebcv8986fd4422464721@mail.gmail.com>
Message-ID: <200902200844.12451.faltet@pytables.org>

On Friday 20 February 2009, David Cournapeau wrote:
> Hi,
>
> Just to mention that it looks like the numpy Trac is overloaded (error
> 500 and locked database)

FWIW, in my experience Trac hogs (possibly leaks) vast amounts of memory, and that makes it extremely slow and prone to fail. My fix has been to restart the Trac process every day, and fortunately everything has gone smoothly since then.

Cheers,

--
Francesc Alted

From pav at iki.fi Fri Feb 20 03:26:19 2009
From: pav at iki.fi (Pauli Virtanen)
Date: Fri, 20 Feb 2009 08:26:19 +0000 (UTC)
Subject: [Numpy-discussion] Core math library in numpy
References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp> <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com> <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com> <5b8d13220902190805s2d12ac96va40b6062959fc53@mail.gmail.com>
Message-ID:

Fri, 20 Feb 2009 01:05:03 +0900, David Cournapeau wrote:
[clip]
>> I think they should be. Then we could easily use C99 complex math
>> functions on platforms on which they are available (and so get the
>> "correct" corner-case semantics for free on these platforms).
>
> Maybe we could change the names, then? nc_ is not very clear IMHO (and
> since they were static up to now, we are free to change them, I believe).

I think it would make sense to change them to follow the C99 function names, with an npy_ prefix.

--
Pauli Virtanen

From pav at iki.fi Fri Feb 20 03:27:50 2009
From: pav at iki.fi (Pauli Virtanen)
Date: Fri, 20 Feb 2009 08:27:50 +0000 (UTC)
Subject: [Numpy-discussion] Update webpage for python requirements for Numpy/SciPy
References: <499DCE08.903@gmail.com> <6a17e9ee0902192234j4fbfdec1r8887fa321fd5d782@mail.gmail.com>
Message-ID:

Fri, 20 Feb 2009 08:34:06 +0200, Scott Sinclair wrote:
[clip]
> If someone can add a stub to the docs in SVN (patch attached for Numpy),
> I'm prepared to work on this. I can't see how to add pages in the
> doc-wiki...

You can also do this in the doc-wiki, just use the "New item" form:

http://docs.scipy.org/numpy/docs/numpy-docs/user/

No need to add stubs to SVN.

--
Pauli Virtanen

From silva at lma.cnrs-mrs.fr Fri Feb 20 03:22:08 2009
From: silva at lma.cnrs-mrs.fr (Fabrice Silva)
Date: Fri, 20 Feb 2009 09:22:08 +0100
Subject: [Numpy-discussion] Initialize numpy array with other numpy arrays
In-Reply-To: <3d375d730902191624x3f7f5d41k3256b1e1f1d8e015@mail.gmail.com>
References: <00a901c992e6$59103ea0$0b30bbe0$@com> <3d375d730902191624x3f7f5d41k3256b1e1f1d8e015@mail.gmail.com>
Message-ID: <1235118128.2889.1.camel@localhost>

> On Thu, Feb 19, 2009 at 17:03, Frank Peacock wrote:
> > img[ngridn,ngride]=(ncolour[0],ncolour[1],ncolour[2])

On Thursday 19 February 2009 at
18:24 -0600, Robert Kern wrote:
> for i in range(3):
>     img[ngridn,ngride,i] = ncolour[i]

Is it not possible to simply use

img[ngridn, ngride, :] = ncolour[:]

--
Fabricio

From scott.sinclair.za at gmail.com Fri Feb 20 04:24:36 2009
From: scott.sinclair.za at gmail.com (Scott Sinclair)
Date: Fri, 20 Feb 2009 11:24:36 +0200
Subject: [Numpy-discussion] Update webpage for python requirements for Numpy/SciPy
In-Reply-To:
References: <499DCE08.903@gmail.com> <6a17e9ee0902192234j4fbfdec1r8887fa321fd5d782@mail.gmail.com>
Message-ID: <6a17e9ee0902200124p297e63a5u2eda3d9c853be7f2@mail.gmail.com>

> 2009/2/20 Pauli Virtanen :
> Fri, 20 Feb 2009 08:34:06 +0200, Scott Sinclair wrote:
> [clip]
>> If someone can add a stub to the docs in SVN (patch attached for Numpy),
>> I'm prepared to work on this. I can't see how to add pages in the
>> doc-wiki...
>
> You can also do this in the doc-wiki, just use the "New item" form:
>
> http://docs.scipy.org/numpy/docs/numpy-docs/user/
>
> No need to add stubs to SVN.

That's useful! It's a bit hard to navigate to unless you're already aware of the feature.

Cheers,
Scott

From dwf at cs.toronto.edu Fri Feb 20 06:18:23 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Fri, 20 Feb 2009 06:18:23 -0500
Subject: [Numpy-discussion] inplace dot products
In-Reply-To:
References:
Message-ID:

Hi Olivier,

There was this idea posted on the Scipy-user list a while back:

http://projects.scipy.org/pipermail/scipy-user/2008-August/017954.html

but it doesn't look like he got anywhere with it, or even got a response.

I just tried it and I observe the same behaviour. A quick look at the SciPy sources tells me there is something fishy.

subroutine <tchar=s,d,c,z>gemm(m,n,k,alpha,a,b,beta,c,trans_a,trans_b,lda,ka,ldb,kb)
  ! c = gemm(alpha,a,b,beta=0,c=0,trans_a=0,trans_b=0,overwrite_c=0)
  ! Calculate C <- alpha * op(A) * op(B) + beta * C

I don't read Fortran very well, but it seems to me as though the Fortran prototype doesn't match the Python prototype.

I'll poke around a little more, but in summary: there's no numpy-sanctioned way to specify an output array for a dot(), AFAIK. This is a bit of an annoyance, I agree, though I seem to remember Robert Kern offering a fairly compelling argument why it's hard. I just don't know what that argument is :)

David

On 18-Feb-09, at 4:06 AM, Olivier Grisel wrote:

> Hi numpy people,
>
> I discovered the ufuncs and their ability to compute the results on
> preallocated arrays:
>
>>>> a = arange(10, dtype=float32)
>>>> b = arange(10, dtype=float32) + 1
>>>> c = add(a, b, a)
>>>> c is a
> True
>>>> a
> array([ 1., 3., 5., 7., 9., 11., 13., 15., 17., 19.], dtype=float32)
>
> My question is: is there a way to have an equivalent for the dot
> product operation? I want ATLAS to compute my dot products without
> allocating a temporary array and to reuse a preallocated array of
> results. Suppose I have:
>
>>>> results = empty((10, 3), dtype=float32)
>>>> W = arange(6, dtype=float32).reshape((2, 3))
>>>> x = arange(20, dtype=float32).reshape((10, 2))
>
> What I want is the equivalent of the following without the
> intermediate call to malloc:
>
>>>> results[:] = dot(x, W)
>
> Any idea? I tried to introspect the various docstrings of the numpy
> core modules but I could not get any lead.
>
> --
> Olivier
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From olivier.grisel at ensta.org Fri Feb 20 06:41:12 2009
From: olivier.grisel at ensta.org (Olivier Grisel)
Date: Fri, 20 Feb 2009 12:41:12 +0100
Subject: [Numpy-discussion] inplace dot products
In-Reply-To:
References:
Message-ID:

2009/2/20 David Warde-Farley :
> Hi Olivier,
>
> There was this idea posted on the Scipy-user list a while back:
>
> http://projects.scipy.org/pipermail/scipy-user/2008-August/017954.html
>
> but it doesn't look like he got anywhere with it, or even got a
> response.
>
> I just tried it and I observe the same behaviour. A quick look at the
> SciPy sources tells me there is something fishy.
>
> subroutine <tchar=s,d,c,z>gemm(m,n,k,alpha,a,b,beta,c,trans_a,trans_b,lda,ka,ldb,kb)
>   ! c = gemm(alpha,a,b,beta=0,c=0,trans_a=0,trans_b=0,overwrite_c=0)
>   ! Calculate C <- alpha * op(A) * op(B) + beta * C
>
> I don't read Fortran very well, but it seems to me as though the
> Fortran prototype doesn't match the Python prototype.
>
> I'll poke around a little more, but in summary: there's no numpy-
> sanctioned way to specify an output array for a dot(), AFAIK. This is
> a bit of an annoyance, I agree, though I seem to remember Robert Kern
> offering a fairly compelling argument why it's hard. I just don't know
> what that argument is :)

Alright, thanks for the reply.

Is there a canonical way / sample code to gain low-level access to the BLAS/LAPACK ATLAS routines using ctypes from numpy/scipy code? I don't mind fixing the dimensions and the dtype of my array if it can decrease the memory overhead.

BTW, Robert, your insight on this topic would be very much appreciated.

--
Olivier

From david at ar.media.kyoto-u.ac.jp Fri Feb 20 06:39:49 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Fri, 20 Feb 2009 20:39:49 +0900
Subject: [Numpy-discussion] inplace dot products
In-Reply-To:
References:
Message-ID: <499E9685.7050107@ar.media.kyoto-u.ac.jp>

Olivier Grisel wrote:
> 2009/2/20 David Warde-Farley :
>> Hi Olivier,
>>
>> There was this idea posted on the Scipy-user list a while back:
>>
>> http://projects.scipy.org/pipermail/scipy-user/2008-August/017954.html
>>
>> but it doesn't look like he got anywhere with it, or even got a
>> response.
>>
>> I just tried it and I observe the same behaviour. A quick look at the
>> SciPy sources tells me there is something fishy.
>>
>> subroutine <tchar=s,d,c,z>gemm(m,n,k,alpha,a,b,beta,c,trans_a,trans_b,lda,ka,ldb,kb)
>>   ! c = gemm(alpha,a,b,beta=0,c=0,trans_a=0,trans_b=0,overwrite_c=0)
>>   ! Calculate C <- alpha * op(A) * op(B) + beta * C
>>
>> I don't read Fortran very well, but it seems to me as though the
>> Fortran prototype doesn't match the Python prototype.
>>
>> I'll poke around a little more, but in summary: there's no numpy-
>> sanctioned way to specify an output array for a dot(), AFAIK. This is
>> a bit of an annoyance, I agree, though I seem to remember Robert Kern
>> offering a fairly compelling argument why it's hard. I just don't know
>> what that argument is :)
>
> Alright, thanks for the reply.
>
> Is there a canonical way / sample code to gain low-level access to the
> BLAS/LAPACK ATLAS routines using ctypes from numpy/scipy code?

You can just use ctypes to access ATLAS, as you would do for any library. Or do you mean something else?
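For instance, calling ATLAS' cblas_sgemm through ctypes with a preallocated output array looks roughly like this - an untested sketch, and the library name 'libcblas.so' is only a guess (it depends on how ATLAS was built and installed):

    import numpy as np
    from ctypes import CDLL, POINTER, c_int, c_float

    cblas = CDLL('libcblas.so')  # hypothetical name; adjust for your install
    c_float_p = POINTER(c_float)
    cblas.cblas_sgemm.restype = None
    cblas.cblas_sgemm.argtypes = [
        c_int, c_int, c_int,        # order, transA, transB
        c_int, c_int, c_int,        # M, N, K
        c_float, c_float_p, c_int,  # alpha, A, lda
        c_float_p, c_int,           # B, ldb
        c_float, c_float_p, c_int,  # beta, C, ldc
    ]

    CblasRowMajor, CblasNoTrans = 101, 111

    def sgemm_into(out, a, b):
        # out <- a * b; all arrays float32 and C-contiguous, out preallocated
        m, k = a.shape
        n = b.shape[1]
        cblas.cblas_sgemm(CblasRowMajor, CblasNoTrans, CblasNoTrans,
                          m, n, k, 1.0,
                          a.ctypes.data_as(c_float_p), k,
                          b.ctypes.data_as(c_float_p), n,
                          0.0, out.ctypes.data_as(c_float_p), n)
        return out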
cheers,

David

From gregor.thalhammer at gmail.com Fri Feb 20 07:15:18 2009
From: gregor.thalhammer at gmail.com (Gregor Thalhammer)
Date: Fri, 20 Feb 2009 13:15:18 +0100
Subject: [Numpy-discussion] Build with MKL on linux
In-Reply-To:
References:
Message-ID: <499E9ED6.3090504@googlemail.com>

Neal Becker schrieb:
> Trying to build numpy-1.2.1 with Intel MKL 10.1.1.019 on Linux F10 x86_64.
>
> echo $LD_LIBRARY_PATH
> /opt/intel/mkl/10.1.1.019/lib/em64t
>
> strace -e trace=file python -c 'import numpy; numpy.test()' 2>stuff
> Running unit tests for numpy
> NumPy version 1.2.1
> NumPy is installed in /usr/lib64/python2.5/site-packages/numpy
> Python version 2.5.2 (r252:60911, Sep 30 2008, 15:42:03) [GCC 4.3.2 20080917 (Red Hat 4.3.2-4)]
> nose version 0.10.3
>
> MKL FATAL ERROR: Cannot load neither libmkl_mc.so nor libmkl_def.so
>
> strace shows:
> ...
> open("/usr/lib64/python2.5/site-packages/numpy/core/tests/test_blasdot.py", O_RDONLY) = 3
> open("/usr/lib64/python2.5/site-packages/numpy/core/tests/test_blasdot.pyc", O_RDONLY) = 5
> open("/opt/intel/mkl/10.1.1.019/lib/em64t/libmkl_mc.so", O_RDONLY) = 3
> open("/opt/intel/mkl/10.1.1.019/lib/em64t/libmkl_mc.so", O_RDONLY) = 3
> open("/opt/intel/mkl/10.1.1.019/lib/em64t/libmkl_def.so", O_RDONLY) = 3
> open("/opt/intel/mkl/10.1.1.019/lib/em64t/libmkl_def.so", O_RDONLY) = 3
>
> The files are there and found; what could be wrong?

Try MKL version 10.0.2.018. You can find some more information about this problem at http://software.intel.com/en-us/forums/intel-math-kernel-library/topic/60460/

Gregor

From dwf at cs.toronto.edu Fri Feb 20 07:25:39 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Fri, 20 Feb 2009 07:25:39 -0500
Subject: [Numpy-discussion] inplace dot products
In-Reply-To: <499E9685.7050107@ar.media.kyoto-u.ac.jp>
References: <499E9685.7050107@ar.media.kyoto-u.ac.jp>
Message-ID: <7BF4A860-B3BB-4473-8219-47DCBEED5ABE@cs.toronto.edu>

On 20-Feb-09, at 6:39 AM, David Cournapeau wrote:

> You can just use ctypes to access ATLAS, as you would do for any
> library. Or do you mean something else?

Say, David... :)

Do you have any idea why the pyf wrapper for fblas3 completely ignores the overwrite_c argument? Fiddling around, I've found other BLAS/LAPACK functions where the same argument is offered and the choice actually does something.

Regards,

(Other) David

From robert.kern at gmail.com Fri Feb 20 10:39:53 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 20 Feb 2009 09:39:53 -0600
Subject: [Numpy-discussion] inplace dot products
In-Reply-To: <7BF4A860-B3BB-4473-8219-47DCBEED5ABE@cs.toronto.edu>
References: <499E9685.7050107@ar.media.kyoto-u.ac.jp> <7BF4A860-B3BB-4473-8219-47DCBEED5ABE@cs.toronto.edu>
Message-ID: <3d375d730902200739h418d7bd2h7b8dcc216fa9e30a@mail.gmail.com>

On Fri, Feb 20, 2009 at 06:25, David Warde-Farley wrote:
>
> On 20-Feb-09, at 6:39 AM, David Cournapeau wrote:
>
>> You can just use ctypes to access ATLAS, as you would do for any
>> library. Or do you mean something else?
>
> Say, David... :)
>
> Do you have any idea why the pyf wrapper for fblas3 completely ignores
> the overwrite_c argument?

I believe the "copy" is the culprit.

double precision dimension(m,n),intent(in,out,copy),depend(m,n),optional :: c

> Fiddling around, I've found other BLAS/LAPACK
> functions where the same argument is offered and the choice actually
> does something.

Examples?
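(A quick way to check what any given wrapper does, by the way - a sketch; the import location of the fblas wrappers varies between scipy versions, so scipy.lib.blas here is a guess:)

    import numpy as np
    from scipy.lib.blas import fblas

    a = np.ones((2, 2), order='F')
    b = np.ones((2, 2), order='F')
    c = np.zeros((2, 2), order='F')

    # Per the .pyf prototype: c = gemm(alpha,a,b,beta=0,c=0,...,overwrite_c=0)
    d = fblas.dgemm(1.0, a, b, beta=1.0, c=c, overwrite_c=1)
    print d is c  # False whenever intent(copy) copied c unconditionally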
-- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From robert.kern at gmail.com Fri Feb 20 10:45:38 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 20 Feb 2009 09:45:38 -0600 Subject: [Numpy-discussion] Initialize numpy array with other numpy arrays In-Reply-To: <1235118128.2889.1.camel@localhost> References: <00a901c992e6$59103ea0$0b30bbe0$@com> <3d375d730902191624x3f7f5d41k3256b1e1f1d8e015@mail.gmail.com> <1235118128.2889.1.camel@localhost> Message-ID: <3d375d730902200745r1b080b00p684e573ca3522b0a@mail.gmail.com> On Fri, Feb 20, 2009 at 02:22, Fabrice Silva wrote: >> On Thu, Feb 19, 2009 at 17:03, Frank Peacock wrote: >> > img[ngridn,ngride]=(ncolour[0],ncolour[1],ncolour[2]) > > Le jeudi 19 f?vrier 2009 ? 18:24 -0600, Robert Kern a ?crit : >> for i in range(3): >> img[ngridn,ngride,i] = ncolour[i] > > Is it not possible to simply use > img[ngridn, ngride, :] = ncolour[:] No. This would, though: img[ngridn, ngride, :] = ncolour.T -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From robert.kern at gmail.com Fri Feb 20 10:52:03 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 20 Feb 2009 09:52:03 -0600 Subject: [Numpy-discussion] inplace dot products In-Reply-To: References: Message-ID: <3d375d730902200752k6b71efc9gb1e0ac1d260d65d@mail.gmail.com> On Fri, Feb 20, 2009 at 05:18, David Warde-Farley wrote: > Hi Olivier, > > There was this idea posted on the Scipy-user list a while back: > > http://projects.scipy.org/pipermail/scipy-user/2008-August/017954.html > > but it doesn't look like he got anywhere with it, or even got a > response. > > I just tried it and I observe the same behaviour. A quick look at the > SciPy sources tells me there is something fishy. > > subroutine > < > tchar=s,d,c,z>gemm(m,n,k,alpha,a,b,beta,c,trans_a,trans_b,lda,ka,ldb,kb) > ! c = gemm(alpha,a,b,beta=0,c=0,trans_a=0,trans_b=0,overwrite_c=0) > ! Calculate C <- alpha * op(A) * op(B) + beta * C > > I don't read Fortran very well, but it seems to me as though the > Fortran prototype doesn't match the python prototype. What do you mean? Based on the rest of the argument information in that block, f2py creates the Python prototype. For example, all of the m,n,k,lda,ka,ldb,kb dimensions are found from the input arrays themselves, optional arguments are given defaults, etc. > I'll poke around a little more, but in summary: there's no numpy- > sanctioned way to specify an output array for a dot(), AFAIK. This is > a bit of an annoyance, I agree, though I seem to remember Robert Kern > offering a fairly compelling argument why it's hard. I just don't know I believe I was only talking about why it would be hard to use a higher-precision accumulator. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." 
-- Umberto Eco From davidh at ipac.caltech.edu Fri Feb 20 14:20:03 2009 From: davidh at ipac.caltech.edu (David Henderson) Date: Fri, 20 Feb 2009 11:20:03 -0800 Subject: [Numpy-discussion] inplace dot products (Robert Kern) (Re: Numpy-discussion Digest, Vol 29, Issue 69) In-Reply-To: References: Message-ID: <0ADDF139-18FA-4B76-836D-2DC82CDBD90D@ipac.caltech.edu> Hello all, I've been toying with the idea of an extended precision accumulator for the dot product written in numpy/core/src/multiarraymodule.c Once the modification is being performed, there is no reason not to allow the specification of an output array. The functions that exist now: The routine cumsum already exists with an accumulator of a specific type. The output location is an optional argument. numpy.cumsum(a, axis=None, dtype=None, out=None) Various forms of inner product (dot product) also exist: numpy.tensordot(a, b, axes=2) Returns the tensor dot product for (ndim >= 1) arrays along an axes. The first element of the sequence determines the axis or axes in `a` to sum over, and the second element in `axes` argument sequence determines the axis or axes in `b` to sum over. numpy.vdot(a,b) Returns the dot product of a and b for scalars and vectors of floating point and complex types. The first argument, a, is conjugated. numpy.innerproduct(a,b) Returns the inner product of a and b for arrays of floating point types. Like the generic NumPy equivalent the product sum is over the last dimension of a and b. NB: The first argument is not conjugated. The proposed extensions: numpy.tensordot(a, b, axes=2, dtype=None, out=None) numpy.vdot(a,b, dtype=None, out=None) numpy.innerproduct(a,b, dtype=None, out=None) (found in numpy/core/ src/multiarraymodule.c) Is this so difficult an extension to implement? Is there a better functional specification to be made? Granted, these routines do not exist in blas. Therefore, they wouldn't be speedy, at least without blas extensions. But... the key routines to make the generic extension already exist in numpy without blas. from numeric.py: When Numpy is built with an accelerated BLAS like ATLAS, these functions are replaced to make use of the faster implementations. The faster implementations only affect float32, float64, complex64, and complex128 arrays. Furthermore, the BLAS API only includes matrix-matrix, matrix-vector, and vector-vector products. Products of arrays with larger dimensionalities use the built in functions and are not accelerated. Looking at numpy/core/src/multiarraymodule.c: The existing routine that allows an accumulator specification and output array: static PyObject * PyArray_Sum(PyArrayObject *self, int axis, int rtype, PyArrayObject *out) The routine that would be modified and its new function prototypes: static PyObject * PyArray_InnerProduct(PyObject *op1, PyObject *op2, int rtype, PyArrayObject *out) Other C code that might be modified to support the new functionality is in arraytypes.inc.src. Routines here perform a dot product along one dimension for various argument dtypes and accumulator dtypes. It looks like the accumulator type is encoded in the #out signature. As I read it, there is no change needed to this source as all the different accumulator types are present in the output signature. 
/**begin repeat #name=BYTE, UBYTE, SHORT, USHORT, INT, UINT, LONG, ULONG, LONGLONG, ULONGLONG, FLOAT, DOUBLE, LONGDOUBLE# #type= byte, ubyte, short, ushort, int, uint, long, ulong, longlong, ulonglong, float, double, longdouble# #out= long, ulong, long, ulong, long, ulong, long, ulong, longlong, ulonglong, float, double, longdouble# */ static void @name at _dot(char *ip1, intp is1, char *ip2, intp is2, char *op, intp n, So far, the changes dont seem to be too messy. Am I missing anything here? David On Feb 20, 2009, at 10:00 AM, numpy-discussion-request at scipy.org wrote: > > Message: 2 > Date: Fri, 20 Feb 2009 09:52:03 -0600 > From: Robert Kern > Subject: Re: [Numpy-discussion] inplace dot products > To: Discussion of Numerical Python > Message-ID: > <3d375d730902200752k6b71efc9gb1e0ac1d260d65d at mail.gmail.com> > Content-Type: text/plain; charset=UTF-8 > > On Fri, Feb 20, 2009 at 05:18, David Warde-Farley > wrote: >> Hi Olivier, >> >> There was this idea posted on the Scipy-user list a while back: >> >> http://projects.scipy.org/pipermail/scipy-user/2008-August/ >> 017954.html >> >> but it doesn't look like he got anywhere with it, or even got a >> response. >> >> I just tried it and I observe the same behaviour. A quick look at the >> SciPy sources tells me there is something fishy. >> >> subroutine >> < >> tchar=s,d,c,z>gemm >> (m,n,k,alpha,a,b,beta,c,trans_a,trans_b,lda,ka,ldb,kb) >> ! c = gemm(alpha,a,b,beta=0,c=0,trans_a=0,trans_b=0,overwrite_c=0) >> ! Calculate C <- alpha * op(A) * op(B) + beta * C >> >> I don't read Fortran very well, but it seems to me as though the >> Fortran prototype doesn't match the python prototype. > > What do you mean? Based on the rest of the argument information in > that block, f2py creates the Python prototype. For example, all of the > m,n,k,lda,ka,ldb,kb dimensions are found from the input arrays > themselves, optional arguments are given defaults, etc. > >> I'll poke around a little more, but in summary: there's no numpy- >> sanctioned way to specify an output array for a dot(), AFAIK. This is >> a bit of an annoyance, I agree, though I seem to remember Robert Kern >> offering a fairly compelling argument why it's hard. I just don't >> know > > I believe I was only talking about why it would be hard to use a > higher-precision accumulator. > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." > -- Umberto Eco > > > ------------------------------ > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > > End of Numpy-discussion Digest, Vol 29, Issue 69 > ************************************************ -------------- next part -------------- An HTML attachment was scrubbed... URL: From soemraws at xs4all.nl Fri Feb 20 15:19:24 2009 From: soemraws at xs4all.nl (Sumant S.R. Oemrawsingh) Date: Fri, 20 Feb 2009 12:19:24 -0800 (PST) Subject: [Numpy-discussion] Problem looping over numpy array in C Message-ID: <35905.128.111.8.119.1235161164.squirrel@webmail.xs4all.nl> Hi guys, I have a problem with looping over numpy arrays in C. I modify the array in-place (and return None), but after modification, the array doesn't seem to play nice any more. 
Below, I have the C code for a function do_something (a stripped version
of my original function), which has as arguments a 1D numpy array (float
or complex) and either an int or a sequence of ints. In python, I do the
following using only the integer:

>>> a = array([0., 1., 2., 3., 2., 1., 0.])
>>> do_something(a,3)
>>> savetxt('bla',a)
>>>

Which works fine. However, when I do the same, but with a list of any
length larger than 0:

>>> a = array([0., 1., 2., 3., 2., 1., 0.])
>>> do_something(a,[3])
>>> savetxt('bla',a)
Traceback (most recent call last):
  File "", line 1, in
  File "/usr/lib/python2.5/site-packages/numpy/core/numeric.py", line 767, in savetxt
    X = asarray(X)
  File "/usr/lib/python2.5/site-packages/numpy/core/numeric.py", line 132, in asarray
    return array(a, dtype, copy=False, order=order)
TypeError: an integer is required
>>> savetxt('bla',a)
>>>

For some reason, the first time it doesn't work; the asarray fails but
then succeeds on a second try. The resulting file is identical to what I
would have gotten by looping over the list in python and calling the
do_something function with each integer separately. For instance:

for i in [3,1,0]:
    do_something(a, i)

works fine as well. So apparently looping in C temporarily leaves the
array in a weird state, but it manages to be restored automatically
after one exception.

I've checked that type(a), a.dtype, a.shape and a.ndim remain the same
before and after calling do_something with a sequence or with an integer
as second argument. That doesn't seem to be the problem.

The reason that I want to do the loop in C is that I need some
precalculations before being able to do the actual loop. But they might
not be time-consuming enough to warrant writing it in C, so I could just
do the loop in python and not have this problem any more. However, if
the loop in C manages to somehow (temporarily) corrupt the array, how
can I be sure that the single-shot call doesn't succeed by accident?

If anyone can help, suggest something to try or spot my mistake, I'd
appreciate it.
Thanks,
Sumant

static PyObject*
do_something(PyObject *self, PyObject *args, PyObject *kwargs)
{
    PyObject *input, *s, *item;
    PyArrayObject *array = NULL;
    npy_intp i, n;
    int index;  /* the "i" format unit writes a C int, not an npy_intp */
    static char *kwlist[] = {"a", "index", NULL};

    if (!PyArg_ParseTupleAndKeywords(args, kwargs, "Oi", kwlist,
                                     &input, &index))
        goto sequence;

    if ((array = (PyArrayObject *)
            PyArray_FromAny(input, NULL, 1, 1,
                            NPY_CARRAY | NPY_UPDATEIFCOPY, NULL)) == NULL)
        goto error;
    n = PyArray_DIM(array, 0);
    if (PyArray_ISCOMPLEX(array))
        do_something_complex((complex*)(PyArray_DATA(array)), n, index);
    else if (PyArray_ISFLOAT(array))
        do_something_double((double*)(PyArray_DATA(array)), n, index);
    else
        goto error;
    Py_DECREF(array);
    Py_RETURN_NONE;

sequence:
    /* The failed "Oi" parse above left a TypeError ("an integer is
       required") set. It must be cleared here; otherwise the function
       returns None with a stale exception pending, which is exactly
       what surfaces at the next API call (the asarray inside savetxt)
       and then disappears on the second try. */
    PyErr_Clear();
    if (!PyArg_ParseTupleAndKeywords(args, kwargs, "OO", kwlist,
                                     &input, &s))
        goto error;
    if (PySequence_Check(s) == 0)
        goto error;
    if ((array = (PyArrayObject *)
            PyArray_FromAny(input, NULL, 1, 1,
                            NPY_CARRAY | NPY_UPDATEIFCOPY, NULL)) == NULL)
        goto error;
    n = PyArray_DIM(array, 0);
    if (PyArray_ISCOMPLEX(array)) {
        complex* data = (complex*)PyArray_DATA(array);
        for (i = 0; i < PySequence_Size(s); i++) {
            item = PySequence_GetItem(s, i);
            do_something_complex(data, n, PyInt_AsSsize_t(item));
            Py_DECREF(item);
        }
    }
    else if (PyArray_ISFLOAT(array)) {
        double* data = (double*)PyArray_DATA(array);
        for (i = 0; i < PySequence_Size(s); i++) {
            item = PySequence_GetItem(s, i);
            do_something_double(data, n, PyInt_AsSsize_t(item));
            Py_DECREF(item);
        }
    }
    else
        goto error;

    Py_DECREF(array);
    Py_RETURN_NONE;

error:
    /* Returning NULL must be accompanied by a set exception; raise one
       if none of the calls above already did. */
    if (!PyErr_Occurred())
        PyErr_SetString(PyExc_TypeError,
                        "expected a 1-D float/complex array and an int "
                        "or a sequence of ints");
    Py_XDECREF(array);
    return NULL;
}

From dwf at cs.toronto.edu Fri Feb 20 16:06:48 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Fri, 20 Feb 2009 16:06:48 -0500
Subject: [Numpy-discussion] inplace dot products
In-Reply-To: <3d375d730902200739h418d7bd2h7b8dcc216fa9e30a@mail.gmail.com>
References: <499E9685.7050107@ar.media.kyoto-u.ac.jp>
	<7BF4A860-B3BB-4473-8219-47DCBEED5ABE@cs.toronto.edu>
	<3d375d730902200739h418d7bd2h7b8dcc216fa9e30a@mail.gmail.com>
Message-ID: <2B07D98B-9CD6-4FA5-B5ED-9C1430E6F1AF@cs.toronto.edu>

On 20-Feb-09, at 10:39 AM, Robert Kern wrote:
>> Fiddling around I've found other blas/lapack
>> functions where the same arg is offered but the choice actually does
>> something.
>
> Examples?

scipy.lib.lapack.flapack.dpotri, for example. I'm not sure of the
proper usage, but when I pass it an identity matrix, depending on
whether overwrite_c is True or not, the memory pointed to by the
variable gets overwritten.

David

From frank at gis4weather.com Fri Feb 20 16:21:17 2009
From: frank at gis4weather.com (Frank Peacock)
Date: Fri, 20 Feb 2009 21:21:17 -0000
Subject: [Numpy-discussion] Multiple draw text or paste image using numpy
Message-ID: <014d01c993a1$2ae18410$80a48c30$@com>

Hello

I am familiar with the fromarray function in the pil library Image
module to plot pixels. Is it possible to do the same for drawtext for
text or paste for images without looping with lists?

Thanks

Frank

From robert.kern at gmail.com Fri Feb 20 16:47:36 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 20 Feb 2009 15:47:36 -0600
Subject: [Numpy-discussion] Multiple draw text or paste image using numpy
In-Reply-To: <014d01c993a1$2ae18410$80a48c30$@com>
References: <014d01c993a1$2ae18410$80a48c30$@com>
Message-ID: <3d375d730902201347o59778b86r18134cd8cd949e7c@mail.gmail.com>

On Fri, Feb 20, 2009 at 15:21, Frank Peacock wrote:
> Hello
>
> I am familiar with the fromarray function in the pil library Image module to
> plot pixels. Is it possible to do the same for drawtext for text or paste
> for images without looping with lists?

No.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From dwf at cs.toronto.edu Fri Feb 20 19:41:25 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Fri, 20 Feb 2009 19:41:25 -0500
Subject: [Numpy-discussion] inplace dot products
In-Reply-To: <3d375d730902200739h418d7bd2h7b8dcc216fa9e30a@mail.gmail.com>
References: <499E9685.7050107@ar.media.kyoto-u.ac.jp>
	<7BF4A860-B3BB-4473-8219-47DCBEED5ABE@cs.toronto.edu>
	<3d375d730902200739h418d7bd2h7b8dcc216fa9e30a@mail.gmail.com>
Message-ID: <84370AAA-0E28-4498-94D7-2546406BCC57@cs.toronto.edu>

On 20-Feb-09, at 10:39 AM, Robert Kern wrote:
>> Fiddling around I've found other blas/lapack
>> functions where the same arg is offered but the choice actually does
>> something.
>
> Examples?

An even better example is scipy.linalg.fblas.dgemv, the matrix-vector
equivalent of dgemm. overwrite_y behaves correctly there.

David

From fperez.net at gmail.com Fri Feb 20 20:02:45 2009
From: fperez.net at gmail.com (Fernando Perez)
Date: Fri, 20 Feb 2009 17:02:45 -0800
Subject: [Numpy-discussion] [Nipy-devel] Sphinx custom extension mess, and patches
In-Reply-To: <20090216232106.GC24472@phare.normalesup.org>
References: <20090215140400.GH20403@phare.normalesup.org>
	<4999F3FD.4020806@python.org>
	<20090216232106.GC24472@phare.normalesup.org>
Message-ID: 

On Mon, Feb 16, 2009 at 3:21 PM, Gael Varoquaux wrote:
> I am not blaming anyone, just pointing out a non ideal situation. It has
> already improved a lot with the matplotlib guys and the scipy guys
> merging some changes in extensions and publishing the extensions in an
> importable part of their source tree.

In keeping with the spirit of trying to get all of these extension
changes upstream so that we can all eventually stop carrying our own
copies, below is a tiny change I just made to the inheritance diagram
one. This is needed to ensure that the figure is separated from any
surrounding text, since otherwise you get hideous off-screen diagrams
in the rendered PDF. This has been committed to the nipy trunk already.

Similarly (for the pymvpa crowd), the api autogen code is now a module,
and it also contains a few small fixes, in particular regarding chapter
titles. Feel free to grab and update your copy:

http://bazaar.launchpad.net/~nipy-developers/nipy/trunk/annotate/head%3A/tools/apigen.py

I've been told the gods of numpy/sphinx don't like auto-generated docs,
but I think there's a valid use case for these tools, so hopefully in
the future it will be possible to include them upstream for us lesser
mortals to use.
If not, I guess we'll just continue to carry our copies around :) Cheers, f # diff, inline because it's so trivial: === modified file 'doc/sphinxext/inheritance_diagram.py' --- doc/sphinxext/inheritance_diagram.py 2009-01-30 02:00:57 +0000 +++ doc/sphinxext/inheritance_diagram.py 2009-02-20 21:11:38 +0000 @@ -370,7 +370,7 @@ graph.run_dot(['-Tpdf', '-o%s' % pdf_path], name, parts, graph_options={'size': '"6.0,6.0"'}) - return '\\includegraphics{%s}' % pdf_path + return '\n\\includegraphics{%s}\n\n' % pdf_path def visit_inheritance_diagram(inner_func): """ From pinto at mit.edu Fri Feb 20 23:46:09 2009 From: pinto at mit.edu (Nicolas Pinto) Date: Fri, 20 Feb 2009 23:46:09 -0500 Subject: [Numpy-discussion] Efficient numpy slicing for a "sliding window approach". Message-ID: <954ae5aa0902202046k5d89a67cn5d260c2844866dab@mail.gmail.com> Dear all, I'm trying to optimize the code below and I was wondering if there is an efficient method that could reduce the numpy slicing overheard without going with cython. Is there anyway I could use mgrid to get a matrix with all my "windows" and then do a large matrix multiply instead? Any idea? Thanks in advance. Best regards, -- Nicolas Pinto Ph.D. Candidate, Brain & Computer Sciences Massachusetts Institute of Technology, USA http://web.mit.edu/pinto ======================================== import numpy as np from numpy import dot arrh, arrw, arrd = 480,640,96 arr = np.random.randn(arrh, arrw, arrd).astype("float32") stride = 16 winh, winw, wind = 128,64,96 limit = 100 clas_w = np.random.randn(8,4,96).astype("float32").ravel() @profile def func(): nh, nw = arrh-winh+1, arrw-winw+1 rngh = np.arange(nh) rngw = np.arange(nw) np.random.shuffle(rngh) np.random.shuffle(rngw) rngh = rngh[:limit] rngw = rngw[:limit] rngh = np.sort(rngh) rngw = np.sort(rngw) resps = np.empty((nh,nw), dtype="float32") for j in rngh: for i in rngw: win = arr[j:j+winh:stride, i:i+winw:stride].ravel() resp = dot(win, clas_w) resps[j,i] = resp resps = resps.ravel() # ... func() ======================================== % python kernprof.py -l -v sliding_win.py Wrote profile results to sliding_win.py.lprof Timer unit: 1e-06 s File: sliding_win.py Function: func at line 14 Total time: 0.224315 s Line # Hits Time Per Hit % Time Line Contents ============================================================== 14 @profile 15 def func(): 16 1 7 7.0 0.0 nh, nw = arrh-winh+1, arrw-winw+1 17 1 14 14.0 0.0 rngh = np.arange(nh) 18 1 4 4.0 0.0 rngw = np.arange(nw) 19 20 1 89 89.0 0.0 np.random.shuffle(rngh) 21 1 124 124.0 0.1 np.random.shuffle(rngw) 22 23 1 3 3.0 0.0 rngh = rngh[:limit] 24 1 2 2.0 0.0 rngw = rngw[:limit] 25 26 1 39 39.0 0.0 rngh = np.sort(rngh) 27 1 19 19.0 0.0 rngw = np.sort(rngw) 28 1 24 24.0 0.0 resps = np.empty((nh,nw), dtype="float32") 29 101 127 1.3 0.1 for j in rngh: 30 10100 12636 1.3 5.6 for i in rngw: 31 10000 118190 11.8 52.7 win = arr[j:j+winh:stride, i:i+winw:stride].ravel() 32 10000 73854 7.4 32.9 resp = dot(win, clas_w) 33 10000 19180 1.9 8.6 resps[j,i] = resp 34 35 1 3 3.0 0.0 resps = resps.ravel() -------------- next part -------------- An HTML attachment was scrubbed... URL: From cournape at gmail.com Fri Feb 20 23:55:43 2009 From: cournape at gmail.com (David Cournapeau) Date: Sat, 21 Feb 2009 13:55:43 +0900 Subject: [Numpy-discussion] Efficient numpy slicing for a "sliding window approach". 
In-Reply-To: <954ae5aa0902202046k5d89a67cn5d260c2844866dab@mail.gmail.com> References: <954ae5aa0902202046k5d89a67cn5d260c2844866dab@mail.gmail.com> Message-ID: <5b8d13220902202055k59e06fd9hd2f553c5411a3efd@mail.gmail.com> On Sat, Feb 21, 2009 at 1:46 PM, Nicolas Pinto wrote: > Dear all, > > I'm trying to optimize the code below and I was wondering if there is an > efficient method that could reduce the numpy slicing overheard without going > with cython. Is there anyway I could use mgrid to get a matrix with all my > "windows" and then do a large matrix multiply instead? If you only care about removing the two loops for the per-window processing, Anne Archibald and Robert Kern wrote a very useful function, segment_axis, which is like the matlab buffer function on steroids, using numpy stride tricks (to avoid copies in many cases). I think that would do everything you want, right ? I use it a lot in my own code, it may be worth being included in numpy or scipy proper. http://projects.scipy.org/scipy/scikits/browser/trunk/talkbox/scikits/talkbox/tools/segmentaxis.py David From pinto at mit.edu Sat Feb 21 00:36:15 2009 From: pinto at mit.edu (Nicolas Pinto) Date: Sat, 21 Feb 2009 00:36:15 -0500 Subject: [Numpy-discussion] Efficient numpy slicing for a "sliding window approach". In-Reply-To: <5b8d13220902202055k59e06fd9hd2f553c5411a3efd@mail.gmail.com> References: <954ae5aa0902202046k5d89a67cn5d260c2844866dab@mail.gmail.com> <5b8d13220902202055k59e06fd9hd2f553c5411a3efd@mail.gmail.com> Message-ID: <954ae5aa0902202136x513770a7o12409270436b87bd@mail.gmail.com> Thanks a lot for the pointer to segmentaxis. I'm trying to use it "as is" and it seems that I need to a big reshape before the matrix multiplication. Am I missing something ? ======================================== import numpy as np from numpy import dot, transpose arrh, arrw, arrd = 480,640,96 arr = np.random.randn(arrh, arrw, arrd).astype("float32") stride = 16 winh, winw, wind = 128,64,96 limit = 100 clas_w = np.random.randn(8,4,96).astype("float32").ravel() from segmentaxis import segment_axis nh, nw = arrh-winh+1, arrw-winw+1 @profile def func_loop(arr): resps = np.empty((nh,nw), dtype="float32") for j in xrange(nh): for i in xrange(nw): win = arr[j:j+winh:stride, i:i+winw:stride] win = win.ravel() resp = dot(win, clas_w) resps[j,i] = resp resps = resps.ravel() print resps.mean() @profile def func_segment(arr): arr = segment_axis(arr, winh, winh-1, axis=0) arr = arr[:, ::stride] arr = segment_axis(arr, winw, winw-1, axis=2) arr = arr[:, :, :, ::stride] arr = transpose(arr, [0, 2, 1, 3, 4]) arr = arr.reshape(-1, clas_w.size) resps2 = dot(arr, clas_w) resps2 = resps2.ravel() print resps2.mean() # ... func_loop(arr) func_segment(arr) ======================================== % python kernprof.py -l -v sliding_win_all.py w-0.0500485824034 segmentaxis.py:94: UserWarning: Problem with ndarray creation forces copy. 
warnings.warn("Problem with ndarray creation forces copy.") -0.0500485824034 Wrote profile results to sliding_win_all.py.lprof Timer unit: 1e-06 s File: sliding_win_all.py Function: func_loop at line 18 Total time: 3.9785 s Line # Hits Time Per Hit % Time Line Contents ============================================================== 18 @profile 19 def func_loop(arr): 20 21 1 28 28.0 0.0 resps = np.empty((nh,nw), dtype="float32") 22 354 308 0.9 0.0 for j in xrange(nh): 23 204034 199753 1.0 5.0 for i in xrange(nw): 24 203681 691915 3.4 17.4 win = arr[j:j+winh:stride, i:i+winw:stride] 25 203681 1341174 6.6 33.7 win = win.ravel() 26 203681 1417998 7.0 35.6 resp = dot(win, clas_w) 27 203681 326520 1.6 8.2 resps[j,i] = resp 28 29 1 2 2.0 0.0 resps = resps.ravel() 30 1 805 805.0 0.0 print resps.mean() File: sliding_win_all.py Function: func_segment at line 33 Total time: 3.82026 s Line # Hits Time Per Hit % Time Line Contents ============================================================== 33 @profile 34 def func_segment(arr): 35 36 1 43 43.0 0.0 arr = segment_axis(arr, winh, winh-1, axis=0) 37 1 4 4.0 0.0 arr = arr[:, ::stride] 38 39 1 546505 546505.0 14.3 arr = segment_axis(arr, winw, winw-1, axis=2) 40 1 12 12.0 0.0 arr = arr[:, :, :, ::stride] 41 42 1 12 12.0 0.0 arr = transpose(arr, [0, 2, 1, 3, 4]) 43 1 2435047 2435047.0 63.7 arr = arr.reshape(-1, clas_w.size) 44 45 1 837700 837700.0 21.9 resps2 = dot(arr, clas_w) 46 47 1 41 41.0 0.0 resps2 = resps2.ravel() 48 1 892 892.0 0.0 print resps2.mean() On Fri, Feb 20, 2009 at 11:55 PM, David Cournapeau wrote: > On Sat, Feb 21, 2009 at 1:46 PM, Nicolas Pinto wrote: > > Dear all, > > > > I'm trying to optimize the code below and I was wondering if there is an > > efficient method that could reduce the numpy slicing overheard without > going > > with cython. Is there anyway I could use mgrid to get a matrix with all > my > > "windows" and then do a large matrix multiply instead? > > If you only care about removing the two loops for the per-window > processing, Anne Archibald and Robert Kern wrote a very useful > function, segment_axis, which is like the matlab buffer function on > steroids, using numpy stride tricks (to avoid copies in many cases). I > think that would do everything you want, right ? I use it a lot in my > own code, it may be worth being included in numpy or scipy proper. > > > http://projects.scipy.org/scipy/scikits/browser/trunk/talkbox/scikits/talkbox/tools/segmentaxis.py > > David > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > Thanks again! -- Nicolas Pinto Ph.D. Candidate, Brain & Computer Sciences Massachusetts Institute of Technology, USA http://web.mit.edu/pinto -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Sat Feb 21 00:56:48 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sat, 21 Feb 2009 00:56:48 -0500 Subject: [Numpy-discussion] Efficient numpy slicing for a "sliding window approach". 
In-Reply-To: <954ae5aa0902202136x513770a7o12409270436b87bd@mail.gmail.com>
References: <954ae5aa0902202046k5d89a67cn5d260c2844866dab@mail.gmail.com>
	<5b8d13220902202055k59e06fd9hd2f553c5411a3efd@mail.gmail.com>
	<954ae5aa0902202136x513770a7o12409270436b87bd@mail.gmail.com>
Message-ID: <1cd32cbb0902202156g4380b9d6je4ced629bc290ecd@mail.gmail.com>

On Sat, Feb 21, 2009 at 12:36 AM, Nicolas Pinto wrote:
> Thanks a lot for the pointer to segmentaxis. I'm trying to use it "as is"
> and it seems that I need to a big reshape before the matrix multiplication.
> Am I missing something ?
> [...]

Hi,

I don't really understand your array dimension, but to me it seems
that you could use ndimage.correlate2d for this.

Josef

From pinto at mit.edu Sat Feb 21 01:08:15 2009
From: pinto at mit.edu (Nicolas Pinto)
Date: Sat, 21 Feb 2009 01:08:15 -0500
Subject: [Numpy-discussion] Efficient numpy slicing for a "sliding window approach".
In-Reply-To: <1cd32cbb0902202156g4380b9d6je4ced629bc290ecd@mail.gmail.com>
References: <954ae5aa0902202046k5d89a67cn5d260c2844866dab@mail.gmail.com>
	<5b8d13220902202055k59e06fd9hd2f553c5411a3efd@mail.gmail.com>
	<954ae5aa0902202136x513770a7o12409270436b87bd@mail.gmail.com>
	<1cd32cbb0902202156g4380b9d6je4ced629bc290ecd@mail.gmail.com>
Message-ID: <954ae5aa0902202208x54f5a93dmdb3af46d2a280856@mail.gmail.com>

Thanks Josef. I'm not sure how I could use correlate2d because of the
'stride' parameter on the y and x axes, but I may be able to do
something on the z axis.

On Sat, Feb 21, 2009 at 12:56 AM, wrote:
> [...]
> Hi, I don't really understand your array dimension, but to me it seems
> that you could use ndimage.correlate2d for this.
> > Josef > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -- Nicolas Pinto Ph.D. Candidate, Brain & Computer Sciences Massachusetts Institute of Technology, USA http://web.mit.edu/pinto -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan at sun.ac.za Sat Feb 21 05:58:14 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Sat, 21 Feb 2009 12:58:14 +0200 Subject: [Numpy-discussion] RFR: 995 - numpy.load can't handle gzip file handles Message-ID: <9457e7c80902210258h3622f473h54e7776046cb195b@mail.gmail.com> Hi, Based on an example on Effbot, I implemented a workaround for reverse seeking in gzip files. I need someone with Python 2.4 to review: http://www.scipy.org/scipy/numpy/ticket/995 Thanks! St?fan From nwagner at iam.uni-stuttgart.de Sat Feb 21 14:34:53 2009 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Sat, 21 Feb 2009 20:34:53 +0100 Subject: [Numpy-discussion] RFR: 995 - numpy.load can't handle gzip file handles In-Reply-To: <9457e7c80902210258h3622f473h54e7776046cb195b@mail.gmail.com> References: <9457e7c80902210258h3622f473h54e7776046cb195b@mail.gmail.com> Message-ID: On Sat, 21 Feb 2009 12:58:14 +0200 St?fan van der Walt wrote: > Hi, > > Based on an example on Effbot, I implemented a >workaround for reverse > seeking in gzip files. I need someone with Python 2.4 >to review: > > http://www.scipy.org/scipy/numpy/ticket/995 > > Thanks! > St?fan > _________________ Hi Stefan, I would like to help but I failed to install numpy (python2.4 Suse Linux 9.3) In file included from numpy/core/src/multiarraymodule.c:96: numpy/core/src/umath_funcs_c99.inc.src:269: warning: conflicting types for built-in function `sinl' numpy/core/src/umath_funcs_c99.inc.src:269: warning: conflicting types for built-in function `cosl' numpy/core/src/umath_funcs_c99.inc.src:269: error: conflicting types for `tanl' /usr/include/bits/mathcalls.h:68: error: previous declaration of `tanl' numpy/core/src/umath_funcs_c99.inc.src:269: error: conflicting types for `sinhl' /usr/include/bits/mathcalls.h:75: error: previous declaration of `sinhl' numpy/core/src/umath_funcs_c99.inc.src:269: error: conflicting types for `coshl' /usr/include/bits/mathcalls.h:73: error: previous declaration of `coshl' numpy/core/src/umath_funcs_c99.inc.src:269: error: conflicting types for `tanhl' /usr/include/bits/mathcalls.h:77: error: previous declaration of `tanhl' numpy/core/src/umath_funcs_c99.inc.src:269: error: conflicting types for `fabsl' /usr/include/bits/mathinline.h:476: error: previous declaration of `fabsl' numpy/core/src/umath_funcs_c99.inc.src:269: error: conflicting types for `floorl' /usr/include/bits/mathinline.h:530: error: previous declaration of `floorl' numpy/core/src/umath_funcs_c99.inc.src:269: error: conflicting types for `ceill' /usr/include/bits/mathinline.h:541: error: previous declaration of `ceill' numpy/core/src/umath_funcs_c99.inc.src:269: error: conflicting types for `rintl' /usr/include/bits/mathcalls.h:280: error: previous declaration of `rintl' numpy/core/src/umath_funcs_c99.inc.src:269: warning: conflicting types for built-in function `truncl' numpy/core/src/umath_funcs_c99.inc.src:269: warning: conflicting types for built-in function `sqrtl' numpy/core/src/umath_funcs_c99.inc.src:269: error: conflicting types for `log10l' /usr/include/bits/mathcalls.h:113: error: previous declaration of `log10l' 
numpy/core/src/umath_funcs_c99.inc.src:269: warning: conflicting types for built-in function `logl' numpy/core/src/umath_funcs_c99.inc.src:269: warning: conflicting types for built-in function `expl' numpy/core/src/umath_funcs_c99.inc.src:269: error: conflicting types for `expm1l' /usr/include/bits/mathcalls.h:129: error: previous declaration of `expm1l' numpy/core/src/umath_funcs_c99.inc.src:269: error: conflicting types for `asinl' /usr/include/bits/mathcalls.h:57: error: previous declaration of `asinl' numpy/core/src/umath_funcs_c99.inc.src:269: error: conflicting types for `acosl' /usr/include/bits/mathcalls.h:55: error: previous declaration of `acosl' numpy/core/src/umath_funcs_c99.inc.src:269: error: conflicting types for `atanl' /usr/include/bits/mathcalls.h:59: error: previous declaration of `atanl' numpy/core/src/umath_funcs_c99.inc.src:269: error: conflicting types for `asinhl' /usr/include/bits/mathcalls.h:91: error: previous declaration of `asinhl' numpy/core/src/umath_funcs_c99.inc.src:269: error: conflicting types for `acoshl' /usr/include/bits/mathcalls.h:89: error: previous declaration of `acoshl' numpy/core/src/umath_funcs_c99.inc.src:269: error: conflicting types for `atanhl' /usr/include/bits/mathcalls.h:93: error: previous declaration of `atanhl' numpy/core/src/umath_funcs_c99.inc.src:269: error: conflicting types for `log1pl' /usr/include/bits/mathcalls.h:132: error: previous declaration of `log1pl' numpy/core/src/umath_funcs_c99.inc.src:269: error: conflicting types for `exp2l' /usr/include/bits/mathcalls.h:142: error: previous declaration of `exp2l' numpy/core/src/umath_funcs_c99.inc.src:269: error: conflicting types for `log2l' /usr/include/bits/mathcalls.h:145: error: previous declaration of `log2l' numpy/core/src/umath_funcs_c99.inc.src:285: error: conflicting types for `atan2l' /usr/include/bits/mathcalls.h:61: error: previous declaration of `atan2l' numpy/core/src/umath_funcs_c99.inc.src:285: error: conflicting types for `hypotl' /usr/include/bits/mathcalls.h:163: error: previous declaration of `hypotl' numpy/core/src/umath_funcs_c99.inc.src:285: error: conflicting types for `powl' /usr/include/bits/mathcalls.h:154: error: previous declaration of `powl' numpy/core/src/umath_funcs_c99.inc.src:285: error: conflicting types for `fmodl' /usr/include/bits/mathcalls.h:188: error: previous declaration of `fmodl' numpy/core/src/umath_funcs_c99.inc.src:296: error: conflicting types for `modfl' /usr/include/bits/mathcalls.h:116: error: previous declaration of `modfl' In file included from numpy/core/src/scalartypes.inc.src:8, from numpy/core/src/arrayobject.c:545, from numpy/core/src/multiarraymodule.c:111: build/src.linux-i686-2.4/numpy/core/include/numpy/config.h:10:1: warning: "SIZEOF_LONG_DOUBLE" redefined build/src.linux-i686-2.4/numpy/core/include/numpy/config.h:6:1: warning: this is the location of the previous definition In file included from numpy/core/src/arraytypes.inc.src:2, from numpy/core/src/arrayobject.c:546, from numpy/core/src/multiarraymodule.c:111: build/src.linux-i686-2.4/numpy/core/include/numpy/config.h:6:1: warning: "SIZEOF_LONG_DOUBLE" redefined In file included from numpy/core/src/scalartypes.inc.src:8, from numpy/core/src/arrayobject.c:545, from numpy/core/src/multiarraymodule.c:111: build/src.linux-i686-2.4/numpy/core/include/numpy/config.h:10:1: warning: this is the location of the previous definition In file included from numpy/core/src/arraytypes.inc.src:2, from numpy/core/src/arrayobject.c:546, from numpy/core/src/multiarraymodule.c:111: 
build/src.linux-i686-2.4/numpy/core/include/numpy/config.h:10:1: warning: "SIZEOF_LONG_DOUBLE" redefined build/src.linux-i686-2.4/numpy/core/include/numpy/config.h:6:1: warning: this is the location of the previous definition error: Command "gcc -pthread -fno-strict-aliasing -DNDEBUG -O2 -march=i586 -mcpu=i686 -fmessage-length=0 -Wall -g -fPIC -Ibuild/src.linux-i686-2.4/numpy/core/src -Inumpy/core/include -Ibuild/src.linux-i686-2.4/numpy/core/include/numpy -Inumpy/core/src -Inumpy/core/include -I/usr/include/python2.4 -c numpy/core/src/multiarraymodule.c -o build/temp.linux-i686-2.4/numpy/core/src/multiarraymodule.o" failed with exit status 1 Cheers, Nils From stefan at sun.ac.za Sat Feb 21 18:46:28 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Sun, 22 Feb 2009 01:46:28 +0200 Subject: [Numpy-discussion] RFR: 995 - numpy.load can't handle gzip file handles In-Reply-To: References: <9457e7c80902210258h3622f473h54e7776046cb195b@mail.gmail.com> Message-ID: <9457e7c80902211546s67f74ef0gcdf8d802ee8f71ad@mail.gmail.com> 2009/2/21 Nils Wagner : > Hi Stefan, > > I would like to help but I failed to install numpy > (python2.4 Suse Linux 9.3) > > In file included from > numpy/core/src/multiarraymodule.c:96: > numpy/core/src/umath_funcs_c99.inc.src:269: warning: > conflicting types for built-in function `sinl' [...] Thanks, Nils. I've seen these errors on the buildbot too. David should have this sorted out pretty soon. Regards St?fan From cournape at gmail.com Sat Feb 21 22:55:23 2009 From: cournape at gmail.com (David Cournapeau) Date: Sun, 22 Feb 2009 12:55:23 +0900 Subject: [Numpy-discussion] RFR: 995 - numpy.load can't handle gzip file handles In-Reply-To: <9457e7c80902211546s67f74ef0gcdf8d802ee8f71ad@mail.gmail.com> References: <9457e7c80902210258h3622f473h54e7776046cb195b@mail.gmail.com> <9457e7c80902211546s67f74ef0gcdf8d802ee8f71ad@mail.gmail.com> Message-ID: <5b8d13220902211955j4383ffe5x47decf87ce9fab9b@mail.gmail.com> On Sun, Feb 22, 2009 at 8:46 AM, St?fan van der Walt wrote: > 2009/2/21 Nils Wagner : >> Hi Stefan, >> >> I would like to help but I failed to install numpy >> (python2.4 Suse Linux 9.3) >> >> In file included from >> numpy/core/src/multiarraymodule.c:96: >> numpy/core/src/umath_funcs_c99.inc.src:269: warning: >> conflicting types for built-in function `sinl' > > [...] > > Thanks, Nils. I've seen these errors on the buildbot too. David > should have this sorted out pretty soon. Sorry about that, I screwed up while merging back into the trunk the branch about windows x64 support - this should be fixed now (it fixes the problem on mac os X at least), cheers, David From nwagner at iam.uni-stuttgart.de Sun Feb 22 05:01:03 2009 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Sun, 22 Feb 2009 11:01:03 +0100 Subject: [Numpy-discussion] RFR: 995 - numpy.load can't handle gzip file handles In-Reply-To: <9457e7c80902210258h3622f473h54e7776046cb195b@mail.gmail.com> References: <9457e7c80902210258h3622f473h54e7776046cb195b@mail.gmail.com> Message-ID: On Sat, 21 Feb 2009 12:58:14 +0200 St?fan van der Walt wrote: > Hi, > > Based on an example on Effbot, I implemented a >workaround for reverse > seeking in gzip files. I need someone with Python 2.4 >to review: > > http://www.scipy.org/scipy/numpy/ticket/995 > > Thanks! > St?fan Done. See http://www.scipy.org/scipy/numpy/ticket/995 for details. 
Cheers, Nils From stefan at sun.ac.za Sun Feb 22 06:03:03 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Sun, 22 Feb 2009 13:03:03 +0200 Subject: [Numpy-discussion] RFR: 995 - numpy.load can't handle gzip file handles In-Reply-To: References: <9457e7c80902210258h3622f473h54e7776046cb195b@mail.gmail.com> Message-ID: <9457e7c80902220303w39730e4et38fcf9937ffed6b3@mail.gmail.com> Hi Nils, 2009/2/22 Nils Wagner : > Done. See http://www.scipy.org/scipy/numpy/ticket/995 > for details. Thanks. Did you have a NumPy array stored with numpy.save in test.gz? I finally got access to a 2.4 machine and the patch works there. Cheers St?fan From nwagner at iam.uni-stuttgart.de Sun Feb 22 06:17:04 2009 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Sun, 22 Feb 2009 12:17:04 +0100 Subject: [Numpy-discussion] RFR: 995 - numpy.load can't handle gzip file handles In-Reply-To: <9457e7c80902220303w39730e4et38fcf9937ffed6b3@mail.gmail.com> References: <9457e7c80902210258h3622f473h54e7776046cb195b@mail.gmail.com> <9457e7c80902220303w39730e4et38fcf9937ffed6b3@mail.gmail.com> Message-ID: On Sun, 22 Feb 2009 13:03:03 +0200 St?fan van der Walt wrote: > Hi Nils, > > 2009/2/22 Nils Wagner : >> Done. See http://www.scipy.org/scipy/numpy/ticket/995 >> for details. > > Thanks. Did you have a NumPy array stored with >numpy.save in test.gz? > I finally got access to a 2.4 machine and the patch >works there. > > Cheers > St?fan > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion The attachment is missing. Nils From stefan at sun.ac.za Sun Feb 22 11:37:05 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Sun, 22 Feb 2009 18:37:05 +0200 Subject: [Numpy-discussion] RFR: 991 - Make savez able to write ZIP64 files Message-ID: <9457e7c80902220837m7144bb53qbe9ee14caca8dd4f@mail.gmail.com> Hi all, Please review the patch attached to http://scipy.org/scipy/numpy/ticket/991 which enables ZIP64 extensions when saving and loading zipped data under Python >= 2.5 Thanks, St?fan From nwagner at iam.uni-stuttgart.de Sun Feb 22 11:57:37 2009 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Sun, 22 Feb 2009 17:57:37 +0100 Subject: [Numpy-discussion] RFR: 991 - Make savez able to write ZIP64 files In-Reply-To: <9457e7c80902220837m7144bb53qbe9ee14caca8dd4f@mail.gmail.com> References: <9457e7c80902220837m7144bb53qbe9ee14caca8dd4f@mail.gmail.com> Message-ID: On Sun, 22 Feb 2009 18:37:05 +0200 St?fan van der Walt wrote: > Hi all, > > Please review the patch attached to > > http://scipy.org/scipy/numpy/ticket/991 > > which enables ZIP64 extensions when saving and loading >zipped data > under Python >= 2.5 > > Thanks, > St?fan Hi Stefan, Please can you provide a short test ? Thanks in advance Nils From dsdale24 at gmail.com Sun Feb 22 15:17:36 2009 From: dsdale24 at gmail.com (Darren Dale) Date: Sun, 22 Feb 2009 15:17:36 -0500 Subject: [Numpy-discussion] small suggestion for numpy.testing utils Message-ID: Hello, I am using numpy's assert_array_equal and assert_array_almost_equal to unit test my physical quantities package. I made a single minor change to assert_array_compare that I think might make these functions more useful to ndarray subclasses, and thought maybe they could be useful to numpy itself. 
I tried applying this diff to numpy and running the test suite, and
instead of 9 known failures I got 1 known failure, 11 skips, 2 errors
and 2 failures. Perhaps it is possible that by not forcing the input
arrays to be ndarray instances, some additional numpy features are
exposed.

Thanks,
Darren

$ svn diff
Index: numpy/testing/utils.py
===================================================================
--- numpy/testing/utils.py      (revision 6370)
+++ numpy/testing/utils.py      (working copy)
@@ -240,9 +240,9 @@

 def assert_array_compare(comparison, x, y, err_msg='', verbose=True,
                          header=''):
-    from numpy.core import asarray, isnan, any
-    x = asarray(x)
-    y = asarray(y)
+    from numpy.core import array, isnan, any
+    x = array(x, copy=False, subok=True)
+    y = array(y, copy=False, subok=True)

     def isnumber(x):
         return x.dtype.char in '?bhilqpBHILQPfdgFDG'

From stefan at sun.ac.za Sun Feb 22 15:18:32 2009
From: stefan at sun.ac.za (Stéfan van der Walt)
Date: Sun, 22 Feb 2009 22:18:32 +0200
Subject: [Numpy-discussion] RFR: 991 - Make savez able to write ZIP64 files
In-Reply-To: 
References: <9457e7c80902220837m7144bb53qbe9ee14caca8dd4f@mail.gmail.com>
Message-ID: <9457e7c80902221218r3137c708v6d3eed1999a4a9a9@mail.gmail.com>

Hi Nils

2009/2/22 Nils Wagner :
>> http://scipy.org/scipy/numpy/ticket/991
>>
>> which enables ZIP64 extensions when saving and loading zipped data
>> under Python >= 2.5

You can just run "nosetests numpy.lib" on both a 2.4 and a 2.5
installation to see if it works.

Thanks!
Stéfan

From dsdale24 at gmail.com Sun Feb 22 15:22:15 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Sun, 22 Feb 2009 15:22:15 -0500
Subject: [Numpy-discussion] small suggestion for numpy.testing utils
In-Reply-To: 
References: 
Message-ID: 

On Sun, Feb 22, 2009 at 3:17 PM, Darren Dale wrote:
> I am using numpy's assert_array_equal and assert_array_almost_equal to unit
> test my physical quantities package. [...]

Actually, my svn checkout was not up to date. With this patch applied,
I get 1 known failure and 11 skips.

From dsdale24 at gmail.com Sun Feb 22 15:38:21 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Sun, 22 Feb 2009 15:38:21 -0500
Subject: [Numpy-discussion] small suggestion for numpy.testing utils
In-Reply-To: 
References: 
Message-ID: 

On Sun, Feb 22, 2009 at 3:22 PM, Darren Dale wrote:
> [...]
> Actually, my svn checkout was not up to date. With this patch applied, I
> get 1 known failure and 11 skips.

I just double checked and I think I get the same results running the
svn 6456 test suite with and without this patch applied. I tried
posting an enhancement request at the trac website, but I can't file
the ticket because I get "500 Internal Server Error", so I'm posting
it here.

-------------- next part --------------
A non-text attachment was scrubbed...
Name: test_subclasses.diff
Type: text/x-diff
Size: 609 bytes
Desc: not available

From nwagner at iam.uni-stuttgart.de Sun Feb 22 15:50:22 2009
From: nwagner at iam.uni-stuttgart.de (Nils Wagner)
Date: Sun, 22 Feb 2009 21:50:22 +0100
Subject: [Numpy-discussion] RFR: 991 - Make savez able to write ZIP64 files
In-Reply-To: <9457e7c80902221218r3137c708v6d3eed1999a4a9a9@mail.gmail.com>
References: <9457e7c80902220837m7144bb53qbe9ee14caca8dd4f@mail.gmail.com>
	<9457e7c80902221218r3137c708v6d3eed1999a4a9a9@mail.gmail.com>
Message-ID: 

On Sun, 22 Feb 2009 22:18:32 +0200
 Stéfan van der Walt wrote:
> [...]
> You can just run "nosetests numpy.lib" on both a 2.4 and a 2.5
> installation to see if it works.

Done. I am using python2.6 now.
patch io.py < 0001-Add-ZIP64-support.patch nosetests numpy.lib .......................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................... ---------------------------------------------------------------------- Ran 938 tests in 8.616s OK Cheers, Nils From stefan at sun.ac.za Sun Feb 22 16:07:35 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Sun, 22 Feb 2009 23:07:35 +0200 Subject: [Numpy-discussion] RFR: 991 - Make savez able to write ZIP64 files In-Reply-To: References: <9457e7c80902220837m7144bb53qbe9ee14caca8dd4f@mail.gmail.com> <9457e7c80902221218r3137c708v6d3eed1999a4a9a9@mail.gmail.com> Message-ID: <9457e7c80902221307m7af667e4icda518fa4848f54f@mail.gmail.com> 2009/2/22 Nils Wagner : >> You can just run "nosetests numpy.lib" on both a 2.4 and >>a 2.5 >> installation to see if it works. > > Done. I am using python2.6 now. > > patch io.py < 0001-Add-ZIP64-support.patch Thanks for testing! Cheers St?fan From stefan at sun.ac.za Sun Feb 22 17:12:21 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Mon, 23 Feb 2009 00:12:21 +0200 Subject: [Numpy-discussion] small suggestion for numpy.testing utils In-Reply-To: References: Message-ID: <9457e7c80902221412t10e9a3f3pb5a20f1a80023e81@mail.gmail.com> Hi Darren 2009/2/22 Darren Dale : > I am using numpy's assert_array_equal and assert_array_almost_equal to unit > test my physical quantities package. I made a single minor change to > assert_array_compare that I think might make these functions more useful to > ndarray subclasses, and thought maybe they could be useful to numpy itself. Your patch makes good sense. I applied it in r6457. I'll keep an eye on the thread to see if anyone else has further comments. Thanks! St?fan From dsdale24 at gmail.com Sun Feb 22 17:33:52 2009 From: dsdale24 at gmail.com (Darren Dale) Date: Sun, 22 Feb 2009 17:33:52 -0500 Subject: [Numpy-discussion] small suggestion for numpy.testing utils In-Reply-To: <9457e7c80902221412t10e9a3f3pb5a20f1a80023e81@mail.gmail.com> References: <9457e7c80902221412t10e9a3f3pb5a20f1a80023e81@mail.gmail.com> Message-ID: On Sun, Feb 22, 2009 at 5:12 PM, St?fan van der Walt wrote: > Hi Darren > > 2009/2/22 Darren Dale : > > I am using numpy's assert_array_equal and assert_array_almost_equal to > unit > > test my physical quantities package. I made a single minor change to > > assert_array_compare that I think might make these functions more useful > to > > ndarray subclasses, and thought maybe they could be useful to numpy > itself. > > Your patch makes good sense. I applied it in r6457. I'll keep an eye > on the thread to see if anyone else has further comments. 
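(As an aside on what the applied change does: asarray always returns a plain
ndarray, while array(..., copy=False, subok=True) lets instances of ndarray
subclasses pass through unchanged. A minimal sketch -- the Quantity class
here is only a made-up stand-in for a subclass like the one in the
quantities package:

import numpy as np

class Quantity(np.ndarray):
    # toy ndarray subclass standing in for one that carries metadata
    pass

q = np.arange(3).view(Quantity)

print type(np.asarray(q))                        # subclass stripped -> ndarray
print type(np.array(q, copy=False, subok=True))  # subclass preserved -> Quantity
)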
> That's great, thank you. I finally got the trac website to take the ticket, by the way. Darren -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan at sun.ac.za Sun Feb 22 17:39:03 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Mon, 23 Feb 2009 00:39:03 +0200 Subject: [Numpy-discussion] small suggestion for numpy.testing utils In-Reply-To: References: <9457e7c80902221412t10e9a3f3pb5a20f1a80023e81@mail.gmail.com> Message-ID: <9457e7c80902221439m67c2e980l8f3c02b744d7a09a@mail.gmail.com> 2009/2/23 Darren Dale : > That's great, thank you. I finally got the trac website to take the ticket, > by the way. Actually, it's been taking all your tickets... from 1016 through 1021 :) Cheers St?fan From dsdale24 at gmail.com Sun Feb 22 17:51:33 2009 From: dsdale24 at gmail.com (Darren Dale) Date: Sun, 22 Feb 2009 17:51:33 -0500 Subject: [Numpy-discussion] possible bug: __array_wrap__ is not called during arithmetic operations in some cases Message-ID: Does anyone know why __array_wrap__ is not called for subclasses during arithmetic operations where an iterable like a list or tuple appears to the right of the subclass? When I do "mine*[1,2,3]", array_wrap is not called and I get an ndarray instead of a MyArray. "[1,2,3]*mine" is fine, as is "mine*array([1,2,3])". I see the same issue with division, addition, etc. Here is a demonstration, observed with svn 6456: import numpy as np class MyArray(np.ndarray): __array_priority__ = 20 def __new__(cls): return np.asarray(1).view(cls).copy() def __array_wrap__(self, obj, context=None): print 'array wrap:', self, obj, context return obj.view(type(self)) def __str__(self): return 'MyArray(%s)'%super(MyArray,self).__str__() mine = MyArray() print 3*mine print mine*3 print [1,2,3]*mine print mine*[1,2,3] print print 3/mine print mine/3 print [1,2,3]*mine print mine*[1,2,3] print print 3+mine print mine+3 print [1,2,3]+mine print mine+[1,2,3] print print 3/mine print mine/3 print [1,2,3]/mine print mine/[1,2,3] print print 3**mine print mine**3 print [1,2,3]**mine print mine**[1,2,3] -------------- next part -------------- An HTML attachment was scrubbed... URL: From dsdale24 at gmail.com Sun Feb 22 17:54:16 2009 From: dsdale24 at gmail.com (Darren Dale) Date: Sun, 22 Feb 2009 17:54:16 -0500 Subject: [Numpy-discussion] small suggestion for numpy.testing utils In-Reply-To: <9457e7c80902221439m67c2e980l8f3c02b744d7a09a@mail.gmail.com> References: <9457e7c80902221412t10e9a3f3pb5a20f1a80023e81@mail.gmail.com> <9457e7c80902221439m67c2e980l8f3c02b744d7a09a@mail.gmail.com> Message-ID: On Sun, Feb 22, 2009 at 5:39 PM, St?fan van der Walt wrote: > 2009/2/23 Darren Dale : > > That's great, thank you. I finally got the trac website to take the > ticket, > > by the way. > > Actually, it's been taking all your tickets... from 1016 through 1021 :) > Oops, sorry about that. (I was taking a drink of water when I read this and it nearly came out my nose) -------------- next part -------------- An HTML attachment was scrubbed... 
URL: 

From efiring at hawaii.edu  Sun Feb 22 18:21:59 2009
From: efiring at hawaii.edu (Eric Firing)
Date: Sun, 22 Feb 2009 13:21:59 -1000
Subject: [Numpy-discussion] possible bug: __array_wrap__ is not called
 during arithmetic operations in some cases
In-Reply-To: 
References: 
Message-ID: <49A1DE17.4070001@hawaii.edu>

Darren Dale wrote:
> Does anyone know why __array_wrap__ is not called for subclasses during
> arithmetic operations where an iterable like a list or tuple appears to
> the right of the subclass? When I do "mine*[1,2,3]", array_wrap is not
> called and I get an ndarray instead of a MyArray. "[1,2,3]*mine" is
> fine, as is "mine*array([1,2,3])". I see the same issue with division,

The masked array subclass does not show this behavior:

In [3]: np.ma.arange(3) * [1,2,3]
Out[3]:
masked_array(data = [0 2 6],
             mask = False,
             fill_value = 999999)

In [4]: [1,2,3] * np.ma.arange(3)
Out[4]:
masked_array(data = [0 2 6],
             mask = False,
             fill_value = 999999)

Eric

> addition, etc. Here is a demonstration, observed with svn 6456:
>
> import numpy as np
>
> class MyArray(np.ndarray):
>
>     __array_priority__ = 20
>
>     def __new__(cls):
>         return np.asarray(1).view(cls).copy()
>
>     def __array_wrap__(self, obj, context=None):
>         print 'array wrap:', self, obj, context
>         return obj.view(type(self))
>
>     def __str__(self):
>         return 'MyArray(%s)' % super(MyArray, self).__str__()
>
> mine = MyArray()
>
> print 3*mine
> print mine*3
> print [1,2,3]*mine
> print mine*[1,2,3]
> print
> print 3/mine
> print mine/3
> print [1,2,3]*mine
> print mine*[1,2,3]
> print
> print 3+mine
> print mine+3
> print [1,2,3]+mine
> print mine+[1,2,3]
> print
> print 3/mine
> print mine/3
> print [1,2,3]/mine
> print mine/[1,2,3]
> print
> print 3**mine
> print mine**3
> print [1,2,3]**mine
> print mine**[1,2,3]
>
> ------------------------------------------------------------------------
>
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From timlee126 at yahoo.com  Sun Feb 22 18:19:10 2009
From: timlee126 at yahoo.com (Tim)
Date: Sun, 22 Feb 2009 15:19:10 -0800 (PST)
Subject: [Numpy-discussion] Import NumPy in Self-defined function script
Message-ID: <69639.66983.qm@web63101.mail.re1.yahoo.com>

Hi,
I am defining a function in file "trainer.py". It requires a module
called numpy, as follows:

[code]
from numpy import *
import string

def trainer(train_seqs):
    nul_dict = {"A":0,"C":1,"G":2,"T":3}  # nucleotide to index of array
    init_prob = zeros(4,double)  # initial prob at 1st col
    tran_prob = zeros([5,4,4],double)  # transition prob indexed by (col, nul in previous col, nul in current col)
    for line in train_seqs:  # counts
        if line != '\n':
            z = line.split()
            z = z[0]
            init_prob[nul_dict[z[0]]] += 1
            for i in range(0,5):
                tran_prob[i][nul_dict[z[i]]][nul_dict[z[i+1]]] += 1
    init_prob /= init_prob.sum()  # normalize transition initial prob
[/code]

Another script "controller.py" calls this function as:

[code]
#! /usr/bin/env python
# from numpy import *
import string
import trainer
import pdb; pdb.set_trace()

ESE_file = file("./ESE.txt",'r')
ESE_seqs = ESE_file.readlines()
ESE_file.close()
ESE_model = trainer.trainer(ESE_seqs)
[/code]

When I try to run "controller.py", I get the error:

[quote]
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "controller.py", line 12, in <module>
    ESE_model = trainer.trainer(ESE_seqs)
  File "trainer.py", line 4, in trainer
NameError: global name 'zeros' is not defined
[/quote]

I think zeros belongs to module numpy, and I import it in my function file
"trainer.py". So I was wondering what's the mistake I am making? Thanks
for help!

-Tim
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From stefan at sun.ac.za  Sun Feb 22 18:28:20 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Mon, 23 Feb 2009 01:28:20 +0200
Subject: [Numpy-discussion] possible bug: __array_wrap__ is not called
 during arithmetic operations in some cases
In-Reply-To: <49A1DE17.4070001@hawaii.edu>
References: <49A1DE17.4070001@hawaii.edu>
Message-ID: <9457e7c80902221528q5fe34442jbbc7f25238943ec6@mail.gmail.com>

2009/2/23 Eric Firing :
> Darren Dale wrote:
>> Does anyone know why __array_wrap__ is not called for subclasses during
>> arithmetic operations where an iterable like a list or tuple appears to
>> the right of the subclass? When I do "mine*[1,2,3]", array_wrap is not
>> called and I get an ndarray instead of a MyArray. "[1,2,3]*mine" is
>> fine, as is "mine*array([1,2,3])". I see the same issue with division,
>
> The masked array subclass does not show this behavior:

Maybe because it defines __mul__. If I change Darren's class to include

def __mul__(self, y):
    return y * self

It works as expected.

Stéfan

From pgmdevlist at gmail.com  Sun Feb 22 18:28:56 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Sun, 22 Feb 2009 18:28:56 -0500
Subject: [Numpy-discussion] possible bug: __array_wrap__ is not called
 during arithmetic operations in some cases
In-Reply-To: <49A1DE17.4070001@hawaii.edu>
References: <49A1DE17.4070001@hawaii.edu>
Message-ID: <444F6AEF-9255-4350-AE1B-B9494A039D43@gmail.com>

On Feb 22, 2009, at 6:21 PM, Eric Firing wrote:
> Darren Dale wrote:
>> Does anyone know why __array_wrap__ is not called for subclasses
>> during
>> arithmetic operations where an iterable like a list or tuple
>> appears to
>> the right of the subclass? When I do "mine*[1,2,3]", array_wrap is
>> not
>> called and I get an ndarray instead of a MyArray. "[1,2,3]*mine" is
>> fine, as is "mine*array([1,2,3])". I see the same issue with
>> division,
>
> The masked array subclass does not show this behavior:

Because MaskedArray.__mul__ and others are redefined.

Darren, you can fix your problem by redefining MyArray.__mul__ as:

def __mul__(self, other):
    return np.ndarray.__mul__(self, np.asanyarray(other))

forcing the second term to be a ndarray (or a subclass of). You can do
the same thing for the other functions (__add__, __radd__, ...)

From dsdale24 at gmail.com  Sun Feb 22 18:35:47 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Sun, 22 Feb 2009 18:35:47 -0500
Subject: [Numpy-discussion] possible bug: __array_wrap__ is not called
 during arithmetic operations in some cases
In-Reply-To: <444F6AEF-9255-4350-AE1B-B9494A039D43@gmail.com>
References: <49A1DE17.4070001@hawaii.edu>
 <444F6AEF-9255-4350-AE1B-B9494A039D43@gmail.com>
Message-ID: 

On Sun, Feb 22, 2009 at 6:28 PM, Pierre GM wrote:
>
> On Feb 22, 2009, at 6:21 PM, Eric Firing wrote:
>
> > Darren Dale wrote:
> >> Does anyone know why __array_wrap__ is not called for subclasses
> >> during
> >> arithmetic operations where an iterable like a list or tuple
> >> appears to
> >> the right of the subclass? When I do "mine*[1,2,3]", array_wrap is
> >> not
> >> called and I get an ndarray instead of a MyArray. "[1,2,3]*mine" is
> >> fine, as is "mine*array([1,2,3])". I see the same issue with
> >> division,
> >
> > The masked array subclass does not show this behavior:
>
> Because MaskedArray.__mul__ and others are redefined.
>
> Darren, you can fix your problem by redefining MyArray.__mul__ as:
>
> def __mul__(self, other):
>     return np.ndarray.__mul__(self, np.asanyarray(other))
>
> forcing the second term to be a ndarray (or a subclass of). You can do
> the same thing for the other functions (__add__, __radd__, ...)

Thanks for the suggestion. I know this can be done, but ufuncs like
np.multiply(mine,[1,2,3]) will still not work. Plus, if I reimplement
these methods, I take some small performance hit. I've been putting a lot
of work in lately to get quantities to work with numpy's stock ufuncs.

Darren
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From stefan at sun.ac.za  Sun Feb 22 18:45:11 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Mon, 23 Feb 2009 01:45:11 +0200
Subject: [Numpy-discussion] Import NumPy in Self-defined function script
In-Reply-To: <69639.66983.qm@web63101.mail.re1.yahoo.com>
References: <69639.66983.qm@web63101.mail.re1.yahoo.com>
Message-ID: <9457e7c80902221545w72ae315dp3c575f7cc0319ba6@mail.gmail.com>

Hi Tim

2009/2/23 Tim :
> I think zeros belongs to module numpy, and I import it in my function file
> "trainer.py". So I was wondering what's the mistake I am making? Thanks for
> help!

What is the output of the following (please execute this inside Python
inside trainer.py's directory):

import numpy
print numpy
print numpy.__version__

Cheers
Stéfan

From markus.rosenstihl at physik.tu-darmstadt.de  Sun Feb 22 18:47:41 2009
From: markus.rosenstihl at physik.tu-darmstadt.de (Markus Rosenstihl)
Date: Mon, 23 Feb 2009 00:47:41 +0100
Subject: [Numpy-discussion] Import NumPy in Self-defined function script
In-Reply-To: <69639.66983.qm@web63101.mail.re1.yahoo.com>
References: <69639.66983.qm@web63101.mail.re1.yahoo.com>
Message-ID: <30376402-EB69-4DF5-82C2-F2E2572E55C0@physik.tu-darmstadt.de>

On 23.02.2009, at 00:19, Tim wrote:
> Hi,
> I am defining a function in file "trainer.py". It requires a module
> called numpy, as follows:
>
>
> Another script "controller.py" calls this function as:
>
> [code]#! /usr/bin/env python
> # from numpy import *
^ This is a comment, you are not importing numpy!

I think it is generally better not to do

from ... import *

because it messes up the namespace; it can happen that a second module
overwrites a function from the first module. This happens for example
with:

from numpy import *
from pylab import *

pylab has a "load" function which is not the same as the numpy "load"
function.

I always use

import numpy as N

Regards
Markus

From gael.varoquaux at normalesup.org  Sun Feb 22 18:51:18 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Mon, 23 Feb 2009 00:51:18 +0100
Subject: [Numpy-discussion] Import NumPy in Self-defined function script
In-Reply-To: <30376402-EB69-4DF5-82C2-F2E2572E55C0@physik.tu-darmstadt.de>
References: <69639.66983.qm@web63101.mail.re1.yahoo.com>
 <30376402-EB69-4DF5-82C2-F2E2572E55C0@physik.tu-darmstadt.de>
Message-ID: <20090222235118.GY6701@phare.normalesup.org>

On Mon, Feb 23, 2009 at 12:47:41AM +0100, Markus Rosenstihl wrote:

You are definitely right with your suggestion not to use 'from foo
import *'. It took me a while to realise that the gain of explicit
namespaces was more than the cost of a few characters, but I definitely
stand by this.
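A small sketch of the shadowing Markus describes -- illustrative only, not
from the thread: a star import silently rebinds Python builtins such as
sum, any and all, which can change the meaning of existing code:

from numpy import *

# The builtin sum treats the second argument as a start value, so
# sum(range(5), -1) == 0+1+2+3+4 + (-1) == 9. After the star import the
# name refers to numpy.sum, where -1 is the axis argument:
print sum(range(5), -1)   # -> 10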
> I always use > import numpy as N We try to encourage using 'import numpy as np'. I wasn't so exited to push this convention on every body, but itis true that it is really convenient to be able to edit a file, or copy-paste code, without worrying how numpy is imported. Cheers, Ga?l From dsdale24 at gmail.com Sun Feb 22 18:51:20 2009 From: dsdale24 at gmail.com (Darren Dale) Date: Sun, 22 Feb 2009 18:51:20 -0500 Subject: [Numpy-discussion] small suggestion for numpy.testing utils In-Reply-To: <9457e7c80902221412t10e9a3f3pb5a20f1a80023e81@mail.gmail.com> References: <9457e7c80902221412t10e9a3f3pb5a20f1a80023e81@mail.gmail.com> Message-ID: On Sun, Feb 22, 2009 at 5:12 PM, St?fan van der Walt wrote: > Hi Darren > > 2009/2/22 Darren Dale : > > I am using numpy's assert_array_equal and assert_array_almost_equal to > unit > > test my physical quantities package. I made a single minor change to > > assert_array_compare that I think might make these functions more useful > to > > ndarray subclasses, and thought maybe they could be useful to numpy > itself. > > Your patch makes good sense. I applied it in r6457. I'll keep an eye > on the thread to see if anyone else has further comments. > Pierre just drew my attention to asanyarray, I think that would have been a better choice for my patch. -------------- next part -------------- An HTML attachment was scrubbed... URL: From strawman at astraw.com Sun Feb 22 18:52:48 2009 From: strawman at astraw.com (Andrew Straw) Date: Sun, 22 Feb 2009 15:52:48 -0800 Subject: [Numpy-discussion] small suggestion for numpy.testing utils In-Reply-To: References: Message-ID: <49A1E550.6070002@astraw.com> Darren, What's the difference between asanyarray(y) and array(y, copy=False, subok=True)? I thought asanyarray would also do what you want. -Andrew Darren Dale wrote: > On Sun, Feb 22, 2009 at 3:22 PM, Darren Dale > wrote: > > On Sun, Feb 22, 2009 at 3:17 PM, Darren Dale > wrote: > > Hello, > > I am using numpy's assert_array_equal and > assert_array_almost_equal to unit test my physical quantities > package. I made a single minor change to assert_array_compare > that I think might make these functions more useful to ndarray > subclasses, and thought maybe they could be useful to numpy > itself. I tried applying this diff to numpy and running the > test suite, and instead of 9 known failures I got 1 known > failure, 11 skips, 2 errors and 2 failures. Perhaps it is > possible that by not forcing the input arrays to be ndarray > instances, some additional numpy features are exposed. > > Thanks, > Darren > > $ svn diff > Index: numpy/testing/utils.py > =================================================================== > --- numpy/testing/utils.py (revision 6370) > +++ numpy/testing/utils.py (working copy) > @@ -240,9 +240,9 @@ > > def assert_array_compare(comparison, x, y, err_msg='', > verbose=True, > header=''): > - from numpy.core import asarray, isnan, any > - x = asarray(x) > - y = asarray(y) > + from numpy.core import array, isnan, any > + x = array(x, copy=False, subok=True) > + y = array(y, copy=False, subok=True) > > def isnumber(x): > return x.dtype.char in '?bhilqpBHILQPfdgFDG' > > > Actually, my svn checkout was not up to date. With this patch > applied, I get 1 known failure and 11 skips. > > > I just double checked and I think I get the same results running the > svn 6456 test suite with and without this patch applied. 
I tried > posting an enhancement request at the trac website, but I cant file > the ticket because I get "500 Internal Server Error", so I'm posting > it here. > ------------------------------------------------------------------------ > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From dsdale24 at gmail.com Sun Feb 22 18:55:49 2009 From: dsdale24 at gmail.com (Darren Dale) Date: Sun, 22 Feb 2009 18:55:49 -0500 Subject: [Numpy-discussion] small suggestion for numpy.testing utils In-Reply-To: <49A1E550.6070002@astraw.com> References: <49A1E550.6070002@astraw.com> Message-ID: I think they are identical, its just that asanyarray appears to be targeted for exactly this use-case, so perhaps it is a little faster. I just posted that asanyarray would probably have been a better choice, the posts must have crossed. On Sun, Feb 22, 2009 at 6:52 PM, Andrew Straw wrote: > Darren, > > What's the difference between asanyarray(y) and array(y, copy=False, > subok=True)? I thought asanyarray would also do what you want. > > -Andrew > > Darren Dale wrote: > > On Sun, Feb 22, 2009 at 3:22 PM, Darren Dale > > wrote: > > > > On Sun, Feb 22, 2009 at 3:17 PM, Darren Dale > > wrote: > > > > Hello, > > > > I am using numpy's assert_array_equal and > > assert_array_almost_equal to unit test my physical quantities > > package. I made a single minor change to assert_array_compare > > that I think might make these functions more useful to ndarray > > subclasses, and thought maybe they could be useful to numpy > > itself. I tried applying this diff to numpy and running the > > test suite, and instead of 9 known failures I got 1 known > > failure, 11 skips, 2 errors and 2 failures. Perhaps it is > > possible that by not forcing the input arrays to be ndarray > > instances, some additional numpy features are exposed. > > > > Thanks, > > Darren > > > > $ svn diff > > Index: numpy/testing/utils.py > > > =================================================================== > > --- numpy/testing/utils.py (revision 6370) > > +++ numpy/testing/utils.py (working copy) > > @@ -240,9 +240,9 @@ > > > > def assert_array_compare(comparison, x, y, err_msg='', > > verbose=True, > > header=''): > > - from numpy.core import asarray, isnan, any > > - x = asarray(x) > > - y = asarray(y) > > + from numpy.core import array, isnan, any > > + x = array(x, copy=False, subok=True) > > + y = array(y, copy=False, subok=True) > > > > def isnumber(x): > > return x.dtype.char in '?bhilqpBHILQPfdgFDG' > > > > > > Actually, my svn checkout was not up to date. With this patch > > applied, I get 1 known failure and 11 skips. > > > > > > I just double checked and I think I get the same results running the > > svn 6456 test suite with and without this patch applied. I tried > > posting an enhancement request at the trac website, but I cant file > > the ticket because I get "500 Internal Server Error", so I'm posting > > it here. 
> > ------------------------------------------------------------------------ > > > > _______________________________________________ > > Numpy-discussion mailing list > > Numpy-discussion at scipy.org > > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From stefan at sun.ac.za Sun Feb 22 18:59:25 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Mon, 23 Feb 2009 01:59:25 +0200 Subject: [Numpy-discussion] Import NumPy in Self-defined function script In-Reply-To: <30376402-EB69-4DF5-82C2-F2E2572E55C0@physik.tu-darmstadt.de> References: <69639.66983.qm@web63101.mail.re1.yahoo.com> <30376402-EB69-4DF5-82C2-F2E2572E55C0@physik.tu-darmstadt.de> Message-ID: <9457e7c80902221559s7ab5c70t5c4bab243f91e026@mail.gmail.com> 2009/2/23 Markus Rosenstihl : >> Another script "controller.py" calls this function as: >> >> [code]#! /usr/bin/env python >> # from numpy import * > ^ This is a comment, you are not importing numpy! But he is also not using numpy in controller. Cheers St?fan From dsdale24 at gmail.com Sun Feb 22 19:01:44 2009 From: dsdale24 at gmail.com (Darren Dale) Date: Sun, 22 Feb 2009 19:01:44 -0500 Subject: [Numpy-discussion] possible bug: __array_wrap__ is not called during arithmetic operations in some cases In-Reply-To: References: <49A1DE17.4070001@hawaii.edu> <444F6AEF-9255-4350-AE1B-B9494A039D43@gmail.com> Message-ID: On Sun, Feb 22, 2009 at 6:35 PM, Darren Dale wrote: > On Sun, Feb 22, 2009 at 6:28 PM, Pierre GM wrote: > >> >> On Feb 22, 2009, at 6:21 PM, Eric Firing wrote: >> >> > Darren Dale wrote: >> >> Does anyone know why __array_wrap__ is not called for subclasses >> >> during >> >> arithmetic operations where an iterable like a list or tuple >> >> appears to >> >> the right of the subclass? When I do "mine*[1,2,3]", array_wrap is >> >> not >> >> called and I get an ndarray instead of a MyArray. "[1,2,3]*mine" is >> >> fine, as is "mine*array([1,2,3])". I see the same issue with >> >> division, >> > >> > The masked array subclass does not show this behavior: >> >> Because MaskedArray.__mul__ and others are redefined. >> >> Darren, you can fix your problem by redefining MyArray.__mul__ as: >> >> def __mul__(self, other): >> return np.ndarray.__mul__(self, np.asanyarray(other)) >> >> forcing the second term to be a ndarray (or a subclass of). You can do >> the same thing for the other functions (__add__, __radd__, ...) > > > Thanks for the suggestion. I know this can be done, but ufuncs like > np.multiply(mine,[1,2,3]) will still not work. Plus, if I reimplement these > methods, I take some small performance hit. I've been putting a lot of work > in lately to get quantities to work with numpy's stock ufuncs. > I should point out: import numpy as np a=np.array([1,2,3,4]) b=np.ma.masked_where(a>2,a) np.multiply([1,2,3,4],b) # yields a masked array np.multiply(b,[1,2,3,4]) # yields an ndarray -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From stefan at sun.ac.za Sun Feb 22 19:02:25 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Mon, 23 Feb 2009 02:02:25 +0200 Subject: [Numpy-discussion] small suggestion for numpy.testing utils In-Reply-To: References: <9457e7c80902221412t10e9a3f3pb5a20f1a80023e81@mail.gmail.com> Message-ID: <9457e7c80902221602g792da010jfd389bebf75474d7@mail.gmail.com> 2009/2/23 Darren Dale : > On Sun, Feb 22, 2009 at 5:12 PM, St?fan van der Walt > wrote: >> >> Hi Darren >> >> 2009/2/22 Darren Dale : >> > I am using numpy's assert_array_equal and assert_array_almost_equal to >> > unit >> > test my physical quantities package. I made a single minor change to >> > assert_array_compare that I think might make these functions more useful >> > to >> > ndarray subclasses, and thought maybe they could be useful to numpy >> > itself. >> >> Your patch makes good sense. I applied it in r6457. I'll keep an eye >> on the thread to see if anyone else has further comments. > > Pierre just drew my attention to asanyarray, I think that would have been a > better choice for my patch. Asanyarray is just return array(a, dtype, copy=False, order=order, subok=True) with default values for dtype=None and order=None. I think your original suggestion more clearly shows what the function is meant to achieve. Cheers St?fan From markus.rosenstihl at physik.tu-darmstadt.de Sun Feb 22 19:05:47 2009 From: markus.rosenstihl at physik.tu-darmstadt.de (Markus Rosenstihl) Date: Mon, 23 Feb 2009 01:05:47 +0100 Subject: [Numpy-discussion] Import NumPy in Self-defined function script In-Reply-To: <9457e7c80902221559s7ab5c70t5c4bab243f91e026@mail.gmail.com> References: <69639.66983.qm@web63101.mail.re1.yahoo.com> <30376402-EB69-4DF5-82C2-F2E2572E55C0@physik.tu-darmstadt.de> <9457e7c80902221559s7ab5c70t5c4bab243f91e026@mail.gmail.com> Message-ID: <9F54075E-3868-41CE-BCB6-2091DB9C8113@physik.tu-darmstadt.de> Am 23.02.2009 um 00:59 schrieb St?fan van der Walt: > 2009/2/23 Markus Rosenstihl darmstadt.de>: >>> Another script "controller.py" calls this function as: >>> >>> [code]#! /usr/bin/env python >>> # from numpy import * >> ^ This is a comment, you are not importing numpy! > > But he is also not using numpy in controller. Yes, the moment I sent the mail I realised that it was not the culprit. In fact, on my Mac, the program runs just fine (considering I use an arbitrary file): it does not complain about "zeros" From xavier.gnata at gmail.com Sun Feb 22 20:39:34 2009 From: xavier.gnata at gmail.com (Xavier Gnata) Date: Mon, 23 Feb 2009 02:39:34 +0100 Subject: [Numpy-discussion] block matrix and sums of blocks Message-ID: <49A1FE56.3070505@gmail.com> Hi, Let us consider one kN x kM array. What is the fastest way to sum each k x k square block of A and to put all these results into a NxM array B? For instance: If A = [112233 112233 223311 223311] then B = [4 8 12 4 12 4] No sanity checks on the arrays shapes are requiered. Only speed matters ;) Xavier From charlesr.harris at gmail.com Sun Feb 22 20:51:48 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 22 Feb 2009 18:51:48 -0700 Subject: [Numpy-discussion] block matrix and sums of blocks In-Reply-To: <49A1FE56.3070505@gmail.com> References: <49A1FE56.3070505@gmail.com> Message-ID: On Sun, Feb 22, 2009 at 6:39 PM, Xavier Gnata wrote: > Hi, > > Let us consider one kN x kM array. > What is the fastest way to sum each k x k square block of A and to put > all these results into a NxM array B? 
> > For instance: > If A = > [112233 > 112233 > 223311 > 223311] > then B = > [4 8 12 > 4 12 4] > > No sanity checks on the arrays shapes are requiered. Only speed matters ;) > > Xavier > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Sun Feb 22 20:55:39 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 22 Feb 2009 18:55:39 -0700 Subject: [Numpy-discussion] block matrix and sums of blocks In-Reply-To: <49A1FE56.3070505@gmail.com> References: <49A1FE56.3070505@gmail.com> Message-ID: On Sun, Feb 22, 2009 at 6:39 PM, Xavier Gnata wrote: > Hi, > > Let us consider one kN x kM array. > What is the fastest way to sum each k x k square block of A and to put > all these results into a NxM array B? > > For instance: > If A = > [112233 > 112233 > 223311 > 223311] > then B = > [4 8 12 > 4 12 4] > > No sanity checks on the arrays shapes are requiered. Only speed matters ;) > An example with real numpy arrays would help ;) But basically you will need to reshape the matrix and sum along the last axis. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From dsdale24 at gmail.com Sun Feb 22 22:35:41 2009 From: dsdale24 at gmail.com (Darren Dale) Date: Sun, 22 Feb 2009 22:35:41 -0500 Subject: [Numpy-discussion] numpy.fix and subclasses Message-ID: I've been finding some numpy functions that could maybe be improved to work better with ndarray subclasses. For example: def fix(x, y=None): x = nx.asanyarray(x) if y is None: y = nx.zeros_like(x) y1 = nx.floor(x) y2 = nx.ceil(x) y[...] = nx.where(x >= 0, y1, y2) return y This implementation is a problematic for subclasses, since it does not allow metadata to propagate using the usual ufunc machinery of __array_wrap__, like ceil and floor do. nx.zeros_like does yield another instance of type(x), but y does not get x's metadata (such as units or a mask). Would it be possible to do something like: if y is None: y = x*0 "where" is another function that could maybe be improved to work with the rules established by array_priority, but I'm a lousy C programmer and I haven't actually looked into how this would work. If "where" respected array_priority, fix could be implemented as: def fix(x, y=None): x = nx.asanyarray(x) y1 = nx.floor(x) y2 = nx.ceil(x) if y is None: return nx.where(x >= 0, y1, y2) y[...] = nx.where(x >= 0, y1, y2) return y Darren -------------- next part -------------- An HTML attachment was scrubbed... URL: From dsdale24 at gmail.com Sun Feb 22 22:49:33 2009 From: dsdale24 at gmail.com (Darren Dale) Date: Sun, 22 Feb 2009 22:49:33 -0500 Subject: [Numpy-discussion] numpy.fix and subclasses In-Reply-To: References: Message-ID: On Sun, Feb 22, 2009 at 10:35 PM, Darren Dale wrote: > I've been finding some numpy functions that could maybe be improved to work > better with ndarray subclasses. For example: > > def fix(x, y=None): > x = nx.asanyarray(x) > if y is None: > y = nx.zeros_like(x) > y1 = nx.floor(x) > y2 = nx.ceil(x) > y[...] = nx.where(x >= 0, y1, y2) > return y > > This implementation is a problematic for subclasses, since it does not > allow metadata to propagate using the usual ufunc machinery of > __array_wrap__, like ceil and floor do. nx.zeros_like does yield another > instance of type(x), but y does not get x's metadata (such as units or a > mask). 
Would it be possible to do something like: > > if y is None: > y = x*0 > > "where" is another function that could maybe be improved to work with the > rules established by array_priority, but I'm a lousy C programmer and I > haven't actually looked into how this would work. If "where" respected > array_priority, fix could be implemented as: > > def fix(x, y=None): > x = nx.asanyarray(x) > y1 = nx.floor(x) > y2 = nx.ceil(x) > if y is None: > return nx.where(x >= 0, y1, y2) > y[...] = nx.where(x >= 0, y1, y2) > return y Actually, I just remembered that quantities tries to prevent things like ([1,2,3,4]*m)[:2] = [0,1], since the units dont match, so setting y=x*0 and then setting data to a slice of y would be problematic. It would be most desirable for "where" to respect __array_priority__, if possible. Any comments? Darren -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Sun Feb 22 23:26:35 2009 From: robert.kern at gmail.com (Robert Kern) Date: Sun, 22 Feb 2009 22:26:35 -0600 Subject: [Numpy-discussion] block matrix and sums of blocks In-Reply-To: <49A1FE56.3070505@gmail.com> References: <49A1FE56.3070505@gmail.com> Message-ID: <3d375d730902222026r1ea7ef31y2ecba426b11564e1@mail.gmail.com> On Sun, Feb 22, 2009 at 19:39, Xavier Gnata wrote: > Hi, > > Let us consider one kN x kM array. > What is the fastest way to sum each k x k square block of A and to put > all these results into a NxM array B? > > For instance: > If A = > [112233 > 112233 > 223311 > 223311] > then B = > [4 8 12 > 4 12 4] > > No sanity checks on the arrays shapes are requiered. Only speed matters ;) In [6]: A Out[6]: array([[1, 1, 2, 2, 3, 3], [1, 1, 2, 2, 3, 3], [2, 2, 3, 3, 1, 1], [2, 2, 3, 3, 1, 1]]) In [7]: k = 2 In [8]: add.reduceat(add.reduceat(A, arange(0, A.shape[0], k), axis=0), arange(0, A.shape[1], k), axis=1) Out[8]: array([[ 4, 8, 12], [ 8, 12, 4]]) -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From pinto at mit.edu Mon Feb 23 07:11:49 2009 From: pinto at mit.edu (Nicolas Pinto) Date: Mon, 23 Feb 2009 07:11:49 -0500 Subject: [Numpy-discussion] How to generate equivalent "random" numbers in matlab and numpy? Message-ID: <954ae5aa0902230411q60cd26b8qb717ba00b35ab32@mail.gmail.com> Dear all, I'd like to generate equivalent sequences of 'random' numbers in matlab and numpy, is there any way I can do that? I tried to fix the seed (see below) but it doesn't work. # numpy In [29]: np.random.seed(1); np.random.permutation(5)+1 Out[29]: array([3, 2, 5, 1, 4]) % matlab >> rand('seed', 1); randperm(5) ans = 4 3 5 2 1 Thanks for your time. Best regards, -- Nicolas Pinto Ph.D. Candidate, Brain & Computer Sciences Massachusetts Institute of Technology, USA http://web.mit.edu/pinto -------------- next part -------------- An HTML attachment was scrubbed... URL: From sturla at molden.no Mon Feb 23 07:51:25 2009 From: sturla at molden.no (Sturla Molden) Date: Mon, 23 Feb 2009 13:51:25 +0100 Subject: [Numpy-discussion] How to generate equivalent "random" numbers in matlab and numpy? 
In-Reply-To: <954ae5aa0902230411q60cd26b8qb717ba00b35ab32@mail.gmail.com> References: <954ae5aa0902230411q60cd26b8qb717ba00b35ab32@mail.gmail.com> Message-ID: <49A29BCD.2090307@molden.no> On 2/23/2009 1:11 PM, Nicolas Pinto wrote: > Dear all, > > I'd like to generate equivalent sequences of 'random' numbers in matlab > and numpy, is there any way I can do that? ... Asked and answered on scipy-user. S.M. From timlee126 at yahoo.com Mon Feb 23 09:35:11 2009 From: timlee126 at yahoo.com (Tim) Date: Mon, 23 Feb 2009 06:35:11 -0800 (PST) Subject: [Numpy-discussion] Import NumPy in Self-defined function script In-Reply-To: <9457e7c80902221545w72ae315dp3c575f7cc0319ba6@mail.gmail.com> Message-ID: <554610.57784.qm@web63101.mail.re1.yahoo.com> Hi, My Python is Python 2.5.2 (r252:60911, Oct? 5 2008, 19:24:49) Here is the output: >>> print numpy >>> print numpy.__version__ 1.1.1 Are these compatible with each other? I just found the error goes away, after I restart my ubuntu 8.10. Thanks, -Tim --- On Sun, 2/22/09, St?fan van der Walt wrote: From: St?fan van der Walt Subject: Re: [Numpy-discussion] Import NumPy in Self-defined function script To: "Discussion of Numerical Python" Date: Sunday, February 22, 2009, 6:45 PM Hi Tim 2009/2/23 Tim : > I think zeros belongs to Module numpy, and I import it in my function file > "trainer.py". So I was wonering what's the mistake I am making? Thanks for > help! What is the output of the following (please execute this inside Python inside trainer.py's directory): import numpy print numpy print numpy.__version__ Cheers St?fan _______________________________________________ Numpy-discussion mailing list Numpy-discussion at scipy.org http://projects.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From mdroe at stsci.edu Mon Feb 23 10:00:03 2009 From: mdroe at stsci.edu (Michael Droettboom) Date: Mon, 23 Feb 2009 10:00:03 -0500 Subject: [Numpy-discussion] [matplotlib-devel] [Nipy-devel] Sphinx custom extension mess, and patches In-Reply-To: References: <20090215140400.GH20403@phare.normalesup.org> <4999F3FD.4020806@python.org> <20090216232106.GC24472@phare.normalesup.org> Message-ID: <49A2B9F3.4070703@stsci.edu> Thanks, Fernando. I've applied your patch to matplotlib (branch and trunk). Mike Fernando Perez wrote: > On Mon, Feb 16, 2009 at 3:21 PM, Gael Varoquaux > wrote: > > >> I am not blaming anyone, just pointing out a non ideal situation. It has >> already improved a lot with the matplotlib guys and the scipy guys >> merging some changes in extensions and publishing the extensions in an >> importable part of their source tree. >> > > In keeping with the spirit of trying to get all of these extension > changes upstream so that we can all eventually stop carrying our own > copies, below is a tiny change I just made to the inheritance diagram > one. This is needed to ensure that the figure is separated from any > surrounding text, since otherwise you get hideous off-screen diagrams > in the rendered PDF. > > This has been committed to the nipy trunk already. > > Similarly (for the pymvpa crowd), the api autogen code is now a > module, and it also contains a few small fixes, in particular > regarding chapter titles. 
Feel free to grab and update your copy: > > http://bazaar.launchpad.net/~nipy-developers/nipy/trunk/annotate/head%3A/tools/apigen.py > > I've been told the gods of numpy/sphinx don't like auto-generated > docs, but I think there's a valid use case for these tools, so > hopefully in the future it will be possible to include them upstream > for us lesser mortals to use. If not, I guess we'll just continue to > carry our copies around :) > > Cheers, > > f > > # diff, inline because it's so trivial: > > === modified file 'doc/sphinxext/inheritance_diagram.py' > --- doc/sphinxext/inheritance_diagram.py 2009-01-30 02:00:57 +0000 > +++ doc/sphinxext/inheritance_diagram.py 2009-02-20 21:11:38 +0000 > @@ -370,7 +370,7 @@ > > graph.run_dot(['-Tpdf', '-o%s' % pdf_path], > name, parts, graph_options={'size': '"6.0,6.0"'}) > - return '\\includegraphics{%s}' % pdf_path > + return '\n\\includegraphics{%s}\n\n' % pdf_path > > def visit_inheritance_diagram(inner_func): > """ > > ------------------------------------------------------------------------------ > Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA > -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise > -Strategies to boost innovation and cut costs with open source participation > -Receive a $600 discount off the registration fee with the source code: SFAD > http://p.sf.net/sfu/XcvMzF8H > _______________________________________________ > Matplotlib-devel mailing list > Matplotlib-devel at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/matplotlib-devel > -- Michael Droettboom Science Software Branch Operations and Engineering Division Space Telescope Science Institute Operated by AURA for NASA From xavier.gnata at gmail.com Mon Feb 23 15:06:06 2009 From: xavier.gnata at gmail.com (Xavier Gnata) Date: Mon, 23 Feb 2009 21:06:06 +0100 Subject: [Numpy-discussion] block matrix and sums of blocks In-Reply-To: <3d375d730902222026r1ea7ef31y2ecba426b11564e1@mail.gmail.com> References: <49A1FE56.3070505@gmail.com> <3d375d730902222026r1ea7ef31y2ecba426b11564e1@mail.gmail.com> Message-ID: <49A301AE.9010203@gmail.com> Robert Kern wrote: > On Sun, Feb 22, 2009 at 19:39, Xavier Gnata wrote: > >> Hi, >> >> Let us consider one kN x kM array. >> What is the fastest way to sum each k x k square block of A and to put >> all these results into a NxM array B? >> >> For instance: >> If A = >> [112233 >> 112233 >> 223311 >> 223311] >> then B = >> [4 8 12 >> 4 12 4] >> >> No sanity checks on the arrays shapes are requiered. 
Only speed matters ;) >> > > In [6]: A > Out[6]: > array([[1, 1, 2, 2, 3, 3], > [1, 1, 2, 2, 3, 3], > [2, 2, 3, 3, 1, 1], > [2, 2, 3, 3, 1, 1]]) > > In [7]: k = 2 > > In [8]: add.reduceat(add.reduceat(A, arange(0, A.shape[0], k), > axis=0), arange(0, A.shape[1], k), axis=1) > Out[8]: > array([[ 4, 8, 12], > [ 8, 12, 4]]) > > It does the job and I have learned .reduceat :) Thanks, Xavier From cournape at gmail.com Mon Feb 23 16:02:23 2009 From: cournape at gmail.com (David Cournapeau) Date: Tue, 24 Feb 2009 06:02:23 +0900 Subject: [Numpy-discussion] Core math library in numpy In-Reply-To: References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp> <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com> <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com> <5b8d13220902190805s2d12ac96va40b6062959fc53@mail.gmail.com> Message-ID: <5b8d13220902231302x42ede056i63a72ce968da9c8b@mail.gmail.com> On Fri, Feb 20, 2009 at 5:26 PM, Pauli Virtanen wrote: > Fri, 20 Feb 2009 01:05:03 +0900, David Cournapeau wrote: > [clip] >>> I think they should be. Then we could easily use C99 complex math >>> functions on plaforms on which they are available (and so get the >>> "correct" corner-case semantics for free on these platforms). >> >> maybe we could change the names, then ? nc is not very clear IMHO (and >> since they were static up to now, we are free to change them I believe). > > I think it would make sense to change them to follow C99 function names, > with a npy_ prefix. The problem of complex functions is that they don't follow the C99 conventions at all. IN particular, they accept pointers instead of values. I don't know whether the rational for using pointers is still valid (it is mentioned that how to pass structure is compiler dependent - I would guess this is true mainly for fortran, I would expect the C ABI of every platform to fix those kind of issues ?) Using C99 names for functions with different prototypes may be confusing, cheers, David From charlesr.harris at gmail.com Mon Feb 23 17:01:41 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 23 Feb 2009 15:01:41 -0700 Subject: [Numpy-discussion] Core math library in numpy In-Reply-To: <5b8d13220902231302x42ede056i63a72ce968da9c8b@mail.gmail.com> References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp> <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com> <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com> <5b8d13220902190805s2d12ac96va40b6062959fc53@mail.gmail.com> <5b8d13220902231302x42ede056i63a72ce968da9c8b@mail.gmail.com> Message-ID: On Mon, Feb 23, 2009 at 2:02 PM, David Cournapeau wrote: > On Fri, Feb 20, 2009 at 5:26 PM, Pauli Virtanen wrote: > > Fri, 20 Feb 2009 01:05:03 +0900, David Cournapeau wrote: > > [clip] > >>> I think they should be. Then we could easily use C99 complex math > >>> functions on plaforms on which they are available (and so get the > >>> "correct" corner-case semantics for free on these platforms). > >> > >> maybe we could change the names, then ? nc is not very clear IMHO (and > >> since they were static up to now, we are free to change them I believe). > > > > I think it would make sense to change them to follow C99 function names, > > with a npy_ prefix. > > The problem of complex functions is that they don't follow the C99 > conventions at all. IN particular, they accept pointers instead of > values. 
I don't know whether the rational for using pointers is still > valid (it is mentioned that how to pass structure is compiler The usual rational is that it is more efficient to pass a pointer than to push two floats on the stack. It is also an easier way to return values, although recent versions of gcc do a good job of copying the return values where they need to go. I would stick with the pointers, although we could probably dispense with the structure and just use a pointer to the underlying type with the assumption that the real & imaginary parts are contiguous in memory. The ufuncs make that assumption in any case. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From washakie at gmail.com Mon Feb 23 17:31:45 2009 From: washakie at gmail.com (John [H2O]) Date: Mon, 23 Feb 2009 14:31:45 -0800 (PST) Subject: [Numpy-discussion] probably simple, reverse a reshape... Message-ID: <22171584.post@talk.nabble.com> Hello, this is probably one of those questions that is going to seem simple after reading responses... I'm trying to get the original array indices out of a row number from a reshaped array... shouldn't this be possible somehow? import numpy as np a = np.ones((3,5,10)) #for testing, I've done the following: a[0,:,:5] = np.eye(5,5)*1 a[1,:,:5] = np.eye(5,5)*10 a[2,:,:5] = np.eye(5,5)*20 #Ok, so I reshape the array: b = a.reshape(15,10) #now I want to be able to 'back out' the original shape indices from the row number of the new array. #I thought I could do something like this: d0,d1,d2 = a.shape new_d0 = row/d1 new_d1 = row/d0 #But it doesn't quite work... any suggestions as to what I am missing? -- View this message in context: http://www.nabble.com/probably-simple%2C-reverse-a-reshape...-tp22171584p22171584.html Sent from the Numpy-discussion mailing list archive at Nabble.com. From washakie at gmail.com Mon Feb 23 17:41:42 2009 From: washakie at gmail.com (John [H2O]) Date: Mon, 23 Feb 2009 14:41:42 -0800 (PST) Subject: [Numpy-discussion] probably simple, reverse a reshape... In-Reply-To: <22171584.post@talk.nabble.com> References: <22171584.post@talk.nabble.com> Message-ID: <22171790.post@talk.nabble.com> John [H2O] wrote: > > Hello, this is probably one of those questions that is going to seem > simple after reading responses... > and a few more minutes of thinking: def row2shape(row,a): """ to get indices a 2d array row# to the 3d array from which it was reshaped. "" d0,d1,d2 = a.shape; nd0 = row/d1; nd1 = (row-(nd0*d1)); return nd0,nd1 -- View this message in context: http://www.nabble.com/probably-simple%2C-reverse-a-reshape...-tp22171584p22171790.html Sent from the Numpy-discussion mailing list archive at Nabble.com. From ndbecker2 at gmail.com Tue Feb 24 09:04:42 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Tue, 24 Feb 2009 14:04:42 +0000 (UTC) Subject: [Numpy-discussion] Core math library in numpy References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp> <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com> <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com> Message-ID: Pauli Virtanen iki.fi> writes: ... > One question: doesn't this add one extra function call to all umath > functions? Could we do '#define npy_XXX XXX' in the npy_math.h header > when the appropriate platform-specified functions are available? There shouldn't be overhead on modern compilers for simple functions. On gcc use __inline__ should eliminate overhead (also, -O3 should automatically inline). 
From ralph at dont-mind.de Tue Feb 24 09:45:39 2009 From: ralph at dont-mind.de (Ralph Heinkel) Date: Tue, 24 Feb 2009 15:45:39 +0100 Subject: [Numpy-discussion] Creating arrays with 'titles' in dtype causes TypeError on data access Message-ID: <49A40813.4090206@dont-mind.de> Hi, I'm trying to use the additional 'title' feature of dtypes when creating arrays. The reason for using titles is that I want to store meta data about the columns in them (so do-not-use-titles is not the right solution for me...) Everything works fine without titles: >>> arr = array([('john', 4),('mark', 3)], dtype=[('name','O'),('id',int)]) >>> arr[0] ('john', 4) >>> arr[0][0] 'john' However here something is strange: >>> arr = array([('john', 4),('mark', 3)], dtype=[(('source:yy', 'name'),'O'),(('source:xx','id'),int)]) >>> arr[0] ('john', 4) >>> arr[0][0] Traceback (most recent call last): File "", line 1, in TypeError: function takes at most 2 arguments (3 given) Any ideas what I'm doing wrong? Any help would be appreciated. Thanks, Ralph Using: - Python V 2.6 (but same error with python 2.5) - Numpy 1.2.1 (but same error on numpy 1.0.3.1) - opensuse 10.3 From pav at iki.fi Tue Feb 24 12:12:37 2009 From: pav at iki.fi (Pauli Virtanen) Date: Tue, 24 Feb 2009 17:12:37 +0000 (UTC) Subject: [Numpy-discussion] Core math library in numpy References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp> <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com> <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com> Message-ID: Tue, 24 Feb 2009 14:04:42 +0000, Neal Becker wrote: > Pauli Virtanen iki.fi> writes: > > ... >> One question: doesn't this add one extra function call to all umath >> functions? Could we do '#define npy_XXX XXX' in the npy_math.h header >> when the appropriate platform-specified functions are available? > > There shouldn't be overhead on modern compilers for simple functions. > On gcc use __inline__ should eliminate overhead (also, -O3 should > automatically inline). Doesn't this require that the whole function body (not just the prototype) be included to the file using it? Or do current *linkers* do inlining? The core math library is compiled to a separate .o file, as the point is exactly to get rid of including c-files. -- Pauli Virtanen From charlesr.harris at gmail.com Tue Feb 24 12:21:42 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 24 Feb 2009 10:21:42 -0700 Subject: [Numpy-discussion] Core math library in numpy In-Reply-To: References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp> <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com> <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com> Message-ID: On Tue, Feb 24, 2009 at 7:04 AM, Neal Becker wrote: > Pauli Virtanen iki.fi> writes: > > ... > > One question: doesn't this add one extra function call to all umath > > functions? Could we do '#define npy_XXX XXX' in the npy_math.h header > > when the appropriate platform-specified functions are available? > > There shouldn't be overhead on modern compilers for simple functions. On > gcc > use __inline__ should eliminate overhead (also, -O3 should automatically > inline). > In order to inline, the function definition would need to be in the header and a library version would still be required for passing functions by address or in case the compiler decided *not* to inline. Chuck -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From cournape at gmail.com Tue Feb 24 12:37:31 2009 From: cournape at gmail.com (David Cournapeau) Date: Wed, 25 Feb 2009 02:37:31 +0900 Subject: [Numpy-discussion] Core math library in numpy In-Reply-To: References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp> <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com> <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com> Message-ID: <5b8d13220902240937s70617f9dw996b456eecd8d134@mail.gmail.com> On Wed, Feb 25, 2009 at 2:21 AM, Charles R Harris wrote: > > In order to inline, the function definition would need to be in the header > and a library version would still be required for passing functions by > address or in case the compiler decided *not* to inline. I looked more into this while solving some nasty mingw FPU handling bugs on windows 64. I learnt one thing I did not know about C: you can define inline static functions in headers, and that's the approach follows for very short functions. For example, FPU status flag code in BSD is defined as follows: fenv.h: static __inline int feraiseexcept(int __excepts); ... fenv.c: int fereaiseexcept(int excepts); we could follow this, maybe - once we solve the inline problem (many compilers do not support it for C). Having optional, non inline is important IMHO, for future SSE or other dynamically loaded implementations (where inline is not an option). David From matthieu.brucher at gmail.com Tue Feb 24 13:09:14 2009 From: matthieu.brucher at gmail.com (Matthieu Brucher) Date: Tue, 24 Feb 2009 19:09:14 +0100 Subject: [Numpy-discussion] Core math library in numpy In-Reply-To: <5b8d13220902240937s70617f9dw996b456eecd8d134@mail.gmail.com> References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp> <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com> <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com> <5b8d13220902240937s70617f9dw996b456eecd8d134@mail.gmail.com> Message-ID: In fact, the __inline is not helpful. It's the static keyword that enables the compiler to inline the function if the function is small enough. As the static indicates that the function will not be seen from the outside, it can do this. Matthieu 2009/2/24 David Cournapeau : > On Wed, Feb 25, 2009 at 2:21 AM, Charles R Harris > wrote: >> >> In order to inline, the function definition would need to be in the header >> and a library version would still be required for passing functions by >> address or in case the compiler decided *not* to inline. > > I looked more into this while solving some nasty mingw FPU handling > bugs on windows 64. I learnt one thing I did not know about C: you can > define inline static functions in headers, and that's the approach > follows for very short functions. For example, FPU status flag code in > BSD is defined as follows: > > fenv.h: > > static __inline int feraiseexcept(int __excepts); > ... > > fenv.c: > int fereaiseexcept(int excepts); > > we could follow this, maybe - once we solve the inline problem (many > compilers do not support it for C). Having optional, non inline is > important IMHO, for future SSE or other dynamically loaded > implementations (where inline is not an option). > > David > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -- Information System Engineer, Ph.D. 
Website: http://matthieu-brucher.developpez.com/ Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92 LinkedIn: http://www.linkedin.com/in/matthieubrucher From mtrumpis at berkeley.edu Tue Feb 24 13:11:17 2009 From: mtrumpis at berkeley.edu (M Trumpis) Date: Tue, 24 Feb 2009 10:11:17 -0800 Subject: [Numpy-discussion] Summary of ticket 937 In-Reply-To: References: Message-ID: I ran into this problem as well a few months back. The reason for the empty residual array when M==N is that the LAPACK routine for Ax = b puts the solution for x in b. When M>N, the norm-squared is parceled out into the unused (M-N) points in the b array. When M==N, there's no room for the resids. Numpy could always return the optimal norm-squared, but at the expense of another matrix-matrix multiply. The other error (complex norms) can be fixed pretty easily this way. --- linalg.py (revision 6436) +++ linalg.py (working copy) @@ -1316,11 +1316,11 @@ if is_1d: x = array(ravel(bstar)[:n], dtype=result_t, copy=True) if results['rank'] == n and m > n: - resids = array([sum((ravel(bstar)[n:])**2)], dtype=result_t) + resids = array([norm(ravel(bstar)[n:], 2)**2]) else: x = array(transpose(bstar)[:n,:], dtype=result_t, copy=True) if results['rank'] == n and m > n: - resids = sum((transpose(bstar)[n:,:])**2, axis=0).astype(result_t) + resids = array([norm(v[n:], 2)**2 for v in bstar]) st = s[:min(n, m)].copy().astype(_realType(result_t)) return wrap(x), wrap(resids), results['rank'], st Mike On Thu, Feb 19, 2009 at 12:51 PM, Nils Wagner wrote: > Hi all, > > The summary of ticket 937 is incomplete. > It should be "Complex matrices and lstsq". > > http://projects.scipy.org/scipy/numpy/ticket/937 > > Nils > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From ndbecker2 at gmail.com Tue Feb 24 13:20:10 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Tue, 24 Feb 2009 13:20:10 -0500 Subject: [Numpy-discussion] Core math library in numpy References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp> <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com> <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com> <5b8d13220902240937s70617f9dw996b456eecd8d134@mail.gmail.com> Message-ID: Matthieu Brucher wrote: > In fact, the __inline is not helpful. It's the static keyword that > enables the compiler to inline the function if the function is small > enough. As the static indicates that the function will not be seen > from the outside, it can do this. > Depends what is meant by 'helpful'. My understanding is that gcc will both inline the function within the current module and supply an external copy if __inline__ is used. > Matthieu > > 2009/2/24 David Cournapeau : >> On Wed, Feb 25, 2009 at 2:21 AM, Charles R Harris >> wrote: >>> >>> In order to inline, the function definition would need to be in the >>> header and a library version would still be required for passing >>> functions by address or in case the compiler decided *not* to inline. >> >> I looked more into this while solving some nasty mingw FPU handling >> bugs on windows 64. I learnt one thing I did not know about C: you can >> define inline static functions in headers, and that's the approach >> follows for very short functions. For example, FPU status flag code in >> BSD is defined as follows: >> >> fenv.h: >> >> static __inline int feraiseexcept(int __excepts); >> ... 
>> >> fenv.c: >> int fereaiseexcept(int excepts); >> >> we could follow this, maybe - once we solve the inline problem (many >> compilers do not support it for C). Having optional, non inline is >> important IMHO, for future SSE or other dynamically loaded >> implementations (where inline is not an option). >> >> David >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion >> > > > From matthieu.brucher at gmail.com Tue Feb 24 13:25:09 2009 From: matthieu.brucher at gmail.com (Matthieu Brucher) Date: Tue, 24 Feb 2009 19:25:09 +0100 Subject: [Numpy-discussion] Core math library in numpy In-Reply-To: References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp> <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com> <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com> <5b8d13220902240937s70617f9dw996b456eecd8d134@mail.gmail.com> Message-ID: The inline keyword is never an obligation to inline, it's only a proposal. And the compiler in fact doesn't care about it. When using an optimization mode, the compiler will inline the function if it is simple enough. It's its call. It will even be easier to do so if the static keyword is used, as it knows that it must not supply an external copy. Matthieu 2009/2/24 Neal Becker : > Matthieu Brucher wrote: > >> In fact, the __inline is not helpful. It's the static keyword that >> enables the compiler to inline the function if the function is small >> enough. As the static indicates that the function will not be seen >> from the outside, it can do this. >> > > Depends what is meant by 'helpful'. ?My understanding is that gcc will both > inline the function within the current module and supply an external copy if > __inline__ is used. > >> Matthieu >> >> 2009/2/24 David Cournapeau : >>> On Wed, Feb 25, 2009 at 2:21 AM, Charles R Harris >>> wrote: >>>> >>>> In order to inline, the function definition would need to be in the >>>> header and a library version would still be required for passing >>>> functions by address or in case the compiler decided *not* to inline. >>> >>> I looked more into this while solving some nasty mingw FPU handling >>> bugs on windows 64. I learnt one thing I did not know about C: you can >>> define inline static functions in headers, and that's the approach >>> follows for very short functions. For example, FPU status flag code in >>> BSD is defined as follows: >>> >>> fenv.h: >>> >>> static __inline int feraiseexcept(int __excepts); >>> ... >>> >>> fenv.c: >>> int fereaiseexcept(int excepts); >>> >>> we could follow this, maybe - once we solve the inline problem (many >>> compilers do not support it for C). Having optional, non inline is >>> important IMHO, for future SSE or other dynamically loaded >>> implementations (where inline is not an option). >>> >>> David >>> _______________________________________________ >>> Numpy-discussion mailing list >>> Numpy-discussion at scipy.org >>> http://projects.scipy.org/mailman/listinfo/numpy-discussion >>> >> >> >> > > > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -- Information System Engineer, Ph.D. 
Website: http://matthieu-brucher.developpez.com/
Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92
LinkedIn: http://www.linkedin.com/in/matthieubrucher

From charlesr.harris at gmail.com  Tue Feb 24 13:26:47 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Tue, 24 Feb 2009 11:26:47 -0700
Subject: [Numpy-discussion] Core math library in numpy
In-Reply-To: 
References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp> <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com> <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com> <5b8d13220902240937s70617f9dw996b456eecd8d134@mail.gmail.com>
Message-ID: 

On Tue, Feb 24, 2009 at 11:09 AM, Matthieu Brucher < matthieu.brucher at gmail.com> wrote:

> In fact, the __inline is not helpful. It's the static keyword that
> enables the compiler to inline the function if the function is small
> enough. As the static indicates that the function will not be seen
> from the outside, it can do this.

Good point. However, most of the ufuncs involving standard functions like sin, cos, etc. are implemented as generic loops that are passed a function pointer, and for such functions the call overhead is probably not significant in the absence of intrinsic hardware implementations. The complex functions could probably use some inlining as they call other functions. That could be implemented by using some local static functions in the library code that could be inlined when the library is compiled.

I think that the first priority should be correctness and portability. Speed optimizations can come later.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From Chris.Barker at noaa.gov  Tue Feb 24 14:39:39 2009
From: Chris.Barker at noaa.gov (Christopher Barker)
Date: Tue, 24 Feb 2009 11:39:39 -0800
Subject: [Numpy-discussion] Fancy indexing question:
Message-ID: <49A44CFB.5090907@noaa.gov>

Hi all,

I'm having a bit of trouble getting fancy indexing to do what I want.

Say I have a 2-d array:

>>> a
array([[ 0,  1,  2,  3],
       [ 4,  5,  6,  7],
       [ 8,  9, 10, 11],
       [12, 13, 14, 15],
       [16, 17, 18, 19],
       [20, 21, 22, 23]])

I want to extract a sub-array:

The 1st, 3rd, and 4th rows:
>>> i
[1, 3, 4]

and the 1st and 3rd columns:
>>> j
[1, 3]

so I should get a 3x2 array:

[[ 5,  7],
 [13, 15],
 [17, 19]]

The obvious (to me!) way to do this:

>>> a[i,j]
Traceback (most recent call last):
  File "", line 1, in 
ValueError: shape mismatch: objects cannot be broadcast to a single shape

fails.

I can do: extract the rows:

>>> rows = a[i]
>>> rows
array([[ 4,  5,  6,  7],
       [12, 13, 14, 15],
       [16, 17, 18, 19]])

then the columns:

>>> sub_array = rows[:,j]
>>> sub_array
array([[ 5,  7],
       [13, 15],
       [17, 19]])

or, of course:

>>> sub_array = a[i][:,j]
>>> sub_array
array([[ 5,  7],
       [13, 15],
       [17, 19]])

however, it seems I am doing one more data copy and array creation than I want -- can this be done as a single indexing operation?

thanks, Chris

-- 
Christopher Barker, Ph.D.
Oceanographer
Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov

From robert.kern at gmail.com  Tue Feb 24 14:43:49 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 24 Feb 2009 13:43:49 -0600
Subject: [Numpy-discussion] Fancy indexing question:
In-Reply-To: <49A44CFB.5090907@noaa.gov>
References: <49A44CFB.5090907@noaa.gov>
Message-ID: <3d375d730902241143u45723c40y2e926cba5fa18daf@mail.gmail.com>

On Tue, Feb 24, 2009 at 13:39, Christopher Barker wrote:
> Hi all,
>
> I'm having a bit of trouble getting fancy indexing to do what I want.
>
> Say I have a 2-d array:
>
> >>> a
> array([[ 0,  1,  2,  3],
>        [ 4,  5,  6,  7],
>        [ 8,  9, 10, 11],
>        [12, 13, 14, 15],
>        [16, 17, 18, 19],
>        [20, 21, 22, 23]])
>
> I want to extract a sub-array:
>
> The 1st, 3rd, and 4th rows:
> >>> i
> [1, 3, 4]
>
> and the 1st and 3rd columns:
> >>> j
> [1, 3]
>
> so I should get a 3x2 array:
>
> [[ 5,  7],
>  [13, 15],
>  [17, 19]]
>
> The obvious (to me!) way to do this:
>
> >>> a[i,j]
> Traceback (most recent call last):
>   File "", line 1, in
> ValueError: shape mismatch: objects cannot be broadcast to a single shape
>
> fails.

Please read the documentation:

http://docs.scipy.org/doc/numpy/reference/arrays.indexing.html

The short answer is that in multidimensional fancy indexing, the index arrays are broadcasted against each other first, then the result array is found by iterating over the broadcasted arrays in parallel. The result array is the same shape as the broadcasted shape of the index arrays.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From borreguero at gmail.com  Tue Feb 24 14:55:47 2009
From: borreguero at gmail.com (Jose Borreguero)
Date: Tue, 24 Feb 2009 14:55:47 -0500
Subject: [Numpy-discussion] tensordot bug when summing over same axis indexes?
Message-ID: <7cced4ed0902241155x262af52eo64a8ceb757ce3e7a@mail.gmail.com>

The following example:

from numpy import *
a=arange(12).reshape(2,3,2)
b=arange(24).reshape(2,3,2,2)
c=tensordot( a,b,axes=([0,0],[1,1]) )

fails with:

  c=tensordot( a,b,axes=([0,0],[1,1]) )
  File "/usr/lib/python2.4/site-packages/numpy/core/numeric.py", line 359, in tensordot
    raise ValueError, "shape-mismatch for sum"
ValueError: shape-mismatch for sum

Am I doing something really stupid, or is this a bug?

-Jose
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From Chris.Barker at noaa.gov  Tue Feb 24 15:06:05 2009
From: Chris.Barker at noaa.gov (Christopher Barker)
Date: Tue, 24 Feb 2009 12:06:05 -0800
Subject: [Numpy-discussion] Fancy indexing question:
In-Reply-To: <3d375d730902241143u45723c40y2e926cba5fa18daf@mail.gmail.com>
References: <49A44CFB.5090907@noaa.gov> <3d375d730902241143u45723c40y2e926cba5fa18daf@mail.gmail.com>
Message-ID: <49A4532D.9040203@noaa.gov>

Robert Kern wrote:
> On Tue, Feb 24, 2009 at 13:39, Christopher Barker wrote:
>> >>> a
>> array([[ 0,  1,  2,  3],
>>        [ 4,  5,  6,  7],
>>        [ 8,  9, 10, 11],
>>        [12, 13, 14, 15],
>>        [16, 17, 18, 19],
>>        [20, 21, 22, 23]])
>>
>> I want to extract a sub-array:
>> [[ 5,  7],
>>  [13, 15],
>>  [17, 19]]

> Please read the documentation:
>
> http://docs.scipy.org/doc/numpy/reference/arrays.indexing.html

well, I did google a fair bit, though, oddly, I didn't find that page.
> The short answer is that in multidimensional fancy indexing, the index > arrays are broadcasted against each other first, then the result array > is found by iterating over the broadcasted arrays in parallel. I did read this description before, and really didn't get it, or how to apply it to this problem. However, with your prodding, I thought about it some more, and I think I got it. I suppose I should figure out how to contribute to the docs, but here is how I came to understand it, in the context of my example: I want to extract a 3x2 subarray, with the 3 rows specified by i, and the two columns specified by j. """ All the integer indexing arrays must be broadcastable to the same shape. The shape of the output (or the needed shape of the object to be used for setting) is the broadcasted shape. """ OK -- so I need my index arrays to broadcast to a 3x2 array -- this was my "light bulb" moment. I need to make the i index array a column: >>> i = np.array((1,3,4)).reshape((-1,1)) so now they will broadcast to a rectangle: >>> i * j array([[ 1, 3], [ 3, 9], [ 4, 12]]) and the indexing works like I want: >>> a[i,j] array([[ 5, 7], [13, 15], [17, 19]]) This makes so much sense when I think of it this way: to extract a particular set of rows, I need a column of indexes: a: [[ 0, 1, 2, 3], [ 4, 5, 6, 7], [ 8, 9, 10, 11], [12, 13, 14, 15], [16, 17, 18, 19], [20, 21, 22, 23]]) I want to extract a sub-array: >> [[ 5, 7], >> [13, 15], >> [17, 19]] which are these columns: 1 3 [[ 0, 1, 2, 3], [ 4, 5, 6, 7], [ 8, 9, 10, 11], [12, 13, 14, 15], [16, 17, 18, 19], [20, 21, 22, 23]]) and these rows: [[ 0, 1, 2, 3], 1 [ 4, 5, 6, 7], [ 8, 9, 10, 11], 3 [12, 13, 14, 15], 4 [16, 17, 18, 19], [20, 21, 22, 23]]) so clearly, I need a (n,1) array to index the rows. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From rmay31 at gmail.com Tue Feb 24 15:13:00 2009 From: rmay31 at gmail.com (Ryan May) Date: Tue, 24 Feb 2009 14:13:00 -0600 Subject: [Numpy-discussion] Fancy indexing question: In-Reply-To: <49A44CFB.5090907@noaa.gov> References: <49A44CFB.5090907@noaa.gov> Message-ID: On Tue, Feb 24, 2009 at 1:39 PM, Christopher Barker wrote: > HI all, > > I'm having a bit of trouble getting fancy indexing to do what I want. > > Say I have a 2-d array: > > >>> a > array([[ 0, 1, 2, 3], > [ 4, 5, 6, 7], > [ 8, 9, 10, 11], > [12, 13, 14, 15], > [16, 17, 18, 19], > [20, 21, 22, 23]]) > > I want to extract a sub-array: > > The 1st, 3rd, and 4th rows: > >>> i > [1, 3, 4] > > and the 1st and 3rd columns: > >>> j > [1, 3] > > so I should get a 3x2 array: > > [[ 5, 7], > [13, 15], > [17, 19]] > > The obvious (to me!) way to do this: > > >>> a[i,j] > Traceback (most recent call last): > File "", line 1, in > ValueError: shape mismatch: objects cannot be broadcast to a single shape > You need to listen more closely to the error message you get back. :) It's a broadcasting issue, so why not try this: a = np.array([[ 0, 1, 2, 3], [ 4, 5, 6, 7], [ 8, 9, 10, 11], [12, 13, 14, 15], [16, 17, 18, 19], [20, 21, 22, 23]]) i = np.array([1, 3, 4]).reshape(-1,1) j = np.array([1, 3]) a[i,j] You need to make i,j conformable to the numpy broadcasting rules by manually appending size 1 dimension. Ryan -- Ryan May Graduate Research Assistant School of Meteorology University of Oklahoma Sent from: Norman Oklahoma United States. 
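For future reference, the replies above condense into the following runnable session (the column-vector reshape and the None/np.newaxis spelling are equivalent ways of getting a (3,1) index array):

    >>> import numpy as np
    >>> a = np.arange(24).reshape(6, 4)
    >>> i = np.array([1, 3, 4])
    >>> j = np.array([1, 3])
    >>> a[i[:, np.newaxis], j]     # (3,1) rows broadcast against (2,) columns
    array([[ 5,  7],
           [13, 15],
           [17, 19]])
    >>> a[np.ix_(i, j)]            # np.ix_ builds the same open mesh, as suggested just below
    array([[ 5,  7],
           [13, 15],
           [17, 19]])

Both calls do the extraction in a single indexing operation, with no intermediate row copy.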
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From josef.pktd at gmail.com  Tue Feb 24 15:20:36 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Tue, 24 Feb 2009 15:20:36 -0500
Subject: [Numpy-discussion] tensordot bug when summing over same axis indexes?
In-Reply-To: <7cced4ed0902241155x262af52eo64a8ceb757ce3e7a@mail.gmail.com>
References: <7cced4ed0902241155x262af52eo64a8ceb757ce3e7a@mail.gmail.com>
Message-ID: <1cd32cbb0902241220u4042a577tde7d224f263c1781@mail.gmail.com>

On Tue, Feb 24, 2009 at 2:55 PM, Jose Borreguero wrote:
> The following example:
>
> from numpy import *
> a=arange(12).reshape(2,3,2)
> b=arange(24).reshape(2,3,2,2)
> c=tensordot( a,b,axes=([0,0],[1,1]) )
>
> fails with:
>
>   c=tensordot( a,b,axes=([0,0],[1,1]) )
>   File "/usr/lib/python2.4/site-packages/numpy/core/numeric.py", line 359, in tensordot
>     raise ValueError, "shape-mismatch for sum"
> ValueError: shape-mismatch for sum
>
> Am I doing something really stupid, or is this a bug?
>
> -Jose

did you want to do this?

>>> c=np.tensordot( a,b,axes=([0,1],[0,1]) )
>>> c
array([[[440, 470],
        [500, 530]],

       [[500, 536],
        [572, 608]]])

Josef
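The axes tuples pair up positionally: axes=([0,1],[0,1]) contracts a's axis 0 with b's axis 0 and a's axis 1 with b's axis 1, so each pair must match in length. The failing call axes=([0,0],[1,1]) instead asks for a's axis 0 (length 2) against b's axis 1 (length 3), hence the shape mismatch. A quick check of the working call:

    >>> import numpy as np
    >>> a = np.arange(12).reshape(2, 3, 2)
    >>> b = np.arange(24).reshape(2, 3, 2, 2)
    >>> c = np.tensordot(a, b, axes=([0, 1], [0, 1]))
    >>> c.shape      # the leftover axes of a, then the leftover axes of b
    (2, 2, 2)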
From lists_ravi at lavabit.com  Tue Feb 24 15:26:39 2009
From: lists_ravi at lavabit.com (Ravi)
Date: Tue, 24 Feb 2009 15:26:39 -0500
Subject: [Numpy-discussion] Fancy indexing question:
In-Reply-To: <49A44CFB.5090907@noaa.gov>
References: <49A44CFB.5090907@noaa.gov>
Message-ID: <200902241526.42724.lists_ravi@lavabit.com>

On Tuesday 24 February 2009 14:39:39 Christopher Barker wrote:
> I'm having a bit of trouble getting fancy indexing to do what I want.

Use ix_:

In [2]: a
Out[2]:
array([[ 0,  1,  2,  3],
       [ 4,  5,  6,  7],
       [ 8,  9, 10, 11],
       [12, 13, 14, 15],
       [16, 17, 18, 19],
       [20, 21, 22, 23]])

In [3]: i = array( [1,3,4] )

In [4]: j = array( [1,3] )

In [5]: a[ ix_(i,j) ]
Out[5]:
array([[ 5,  7],
       [13, 15],
       [17, 19]])

Regards,
Ravi

From borreguero at gmail.com  Tue Feb 24 15:44:04 2009
From: borreguero at gmail.com (Jose Borreguero)
Date: Tue, 24 Feb 2009 15:44:04 -0500
Subject: [Numpy-discussion] tensordot bug when summing over same axis indexes?
In-Reply-To: <1cd32cbb0902241220u4042a577tde7d224f263c1781@mail.gmail.com>
References: <7cced4ed0902241155x262af52eo64a8ceb757ce3e7a@mail.gmail.com> <1cd32cbb0902241220u4042a577tde7d224f263c1781@mail.gmail.com>
Message-ID: <7cced4ed0902241244w3c304892i2b314ea1c6ea0a77@mail.gmail.com>

haha, so it was a stupid error... my stupid error. I incorrectly understood ([0,0],[1,1]) as index 0 of *a* summed with index 0 of *b*, and analogously for [1,1].

Thanks, Josef.

On Tue, Feb 24, 2009 at 3:20 PM, wrote:
> On Tue, Feb 24, 2009 at 2:55 PM, Jose Borreguero
> wrote:
> > The following example:
> >
> > from numpy import *
> > a=arange(12).reshape(2,3,2)
> > b=arange(24).reshape(2,3,2,2)
> > c=tensordot( a,b,axes=([0,0],[1,1]) )
> >
> > fails with:
> >
> >   c=tensordot( a,b,axes=([0,0],[1,1]) )
> >   File "/usr/lib/python2.4/site-packages/numpy/core/numeric.py", line 359, in tensordot
> >     raise ValueError, "shape-mismatch for sum"
> > ValueError: shape-mismatch for sum
> >
> > Am I doing something really stupid, or is this a bug?
> >
> > -Jose
>
> did you want to do this?
>
> >>> c=np.tensordot( a,b,axes=([0,1],[0,1]) )
> >>> c
> array([[[440, 470],
>         [500, 530]],
>
>        [[500, 536],
>         [572, 608]]])
>
> Josef
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From matthew.brett at gmail.com  Tue Feb 24 16:04:54 2009
From: matthew.brett at gmail.com (Matthew Brett)
Date: Tue, 24 Feb 2009 13:04:54 -0800
Subject: [Numpy-discussion] isbuiltin - failure of understanding
Message-ID: <1e2af89e0902241304p707239c6m17b138cd5f261f72@mail.gmail.com>

Hi,

I was just trying to write a docstring for np.dtype.isbuiltin, when I realized I didn't understand it.

As far as I can see, isbuiltin should return:

0 for structured array dtypes
1 for types compiled into numpy
2 for extension types using the numpy C-API type extension machinery.

Here's the C code:

static PyObject *
arraydescr_isbuiltin_get(PyArray_Descr *self)
{
    long val;
    val = 0;
    if (self->fields == Py_None) val = 1;
    if (PyTypeNum_ISUSERDEF(self->type_num)) val = 2;
    return PyInt_FromLong(val);
}

But, why is this?

In [2]: dt = np.dtype('S1')

In [3]: dt
Out[3]: dtype('|S1')

In [4]: dt.isbuiltin
Out[4]: 0

In [5]: print dt.fields
None

In [6]: print dt.fields == None
True

Same for np.dtype('U1')

Best,
Matthew

From robert.kern at gmail.com  Tue Feb 24 16:10:11 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 24 Feb 2009 15:10:11 -0600
Subject: [Numpy-discussion] isbuiltin - failure of understanding
In-Reply-To: <1e2af89e0902241304p707239c6m17b138cd5f261f72@mail.gmail.com>
References: <1e2af89e0902241304p707239c6m17b138cd5f261f72@mail.gmail.com>
Message-ID: <3d375d730902241310u3ca6a1fdibc0cc47b2e9e308f@mail.gmail.com>

On Tue, Feb 24, 2009 at 15:04, Matthew Brett wrote:
> Hi,
>
> I was just trying to write a docstring for np.dtype.isbuiltin, when I
> realized I didn't understand it.
>
> As far as I can see, isbuiltin should return:
>
> 0 for structured array dtypes
> 1 for types compiled into numpy
> 2 for extension types using the numpy C-API type extension machinery.
>
> Here's the C code:
>
> static PyObject *
> arraydescr_isbuiltin_get(PyArray_Descr *self)
> {
>     long val;
>     val = 0;
>     if (self->fields == Py_None) val = 1;
>     if (PyTypeNum_ISUSERDEF(self->type_num)) val = 2;
>     return PyInt_FromLong(val);
> }
>
> But, why is this?
>
> In [2]: dt = np.dtype('S1')
>
> In [3]: dt
> Out[3]: dtype('|S1')
>
> In [4]: dt.isbuiltin
> Out[4]: 0
>
> In [5]: print dt.fields
> None
>
> In [6]: print dt.fields == None
> True
>
> Same for np.dtype('U1')

The variable-length string dtypes are not builtin. The user-defined lengths make them user-defined dtypes.

If you want a real kick in the pants, try playing with dtype(str). And then try eval'ing its repr.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth."
-- Umberto Eco

From matthew.brett at gmail.com  Tue Feb 24 18:08:31 2009
From: matthew.brett at gmail.com (Matthew Brett)
Date: Tue, 24 Feb 2009 15:08:31 -0800
Subject: [Numpy-discussion] isbuiltin - failure of understanding
In-Reply-To: <3d375d730902241310u3ca6a1fdibc0cc47b2e9e308f@mail.gmail.com>
References: <1e2af89e0902241304p707239c6m17b138cd5f261f72@mail.gmail.com> <3d375d730902241310u3ca6a1fdibc0cc47b2e9e308f@mail.gmail.com>
Message-ID: <1e2af89e0902241508n1625a87cqaa1ba20e68bc6344@mail.gmail.com>

Hi,

> The variable-length string dtypes are not builtin. The user-defined
> lengths make them user-defined dtypes.

Right - but I was failing to understand, from the code, how '0' rather than '2' could result. Have I missed something?

> If you want a real kick in the pants, try playing with dtype(str). And
> then try eval'ing its repr.

I am trying to decide how to answer the first part of that sentence!

Matthew

From sccolbert at gmail.com  Tue Feb 24 19:15:20 2009
From: sccolbert at gmail.com (Chris Colbert)
Date: Tue, 24 Feb 2009 19:15:20 -0500
Subject: [Numpy-discussion] is there a faster way to get a buffer interface than ndarray.tostring()?
Message-ID: <7f014ea60902241615l18ddd46am541af69b12ac381e@mail.gmail.com>

Hi all,

I'm new to the mailing list and relatively new (~1 year) to python/numpy. I would appreciate any insight any of you may have here. The last 8 hours of digging through the docs has left me, finally, stuck.

I am making a wxPython program that includes webcam functionality. The script below just brings up a simple display window that shows the webcam feed. The problem I am running into is in the WebCamWorker.run() method. This method reads the raw pixel data (in string form) from the webcam buffer. This data (thank you microsoft!) comes in BGR and bottom-to-top. I feed this buffer to the array constructor and then swap the pixel order to RGB and top-to-bottom. This all happens very fast, ~1ms on a Quad Core, and the majority of that time is spent constructing the array (the numpy pixel swapping is on the order of E-5s !). The tostring() method, however, takes a whopping 10ms to execute. Unfortunately, the wx.BitmapFromBuffer needs a single segment buffer interface as an argument. Is there a way to expose my numpy array as a buffer to cut down on this conversion time?

I have tried sending the array to the PIL Image.fromarray(), but that eventually requires me to do a Image.tostring() anyway, which negates any benefit.

Thanks in advance for the help!

S. Chris Colbert
Rehabilitation Robotics Laboratory
University of South Florida


####  Code #######

import VideoCapture
import wx
import time
import threading
import numpy as np
import Image


class WebCamWorker(threading.Thread):
    _abort = 0

    def __init__(self, parent):
        super(WebCamWorker, self).__init__()

        self._parent = parent
        self.panel = wx.Panel(parent)

    def run(self):

        while not self._abort:

            # numpy arrays reverse pixel data that comes in from CCD as BGR and bottom to top

            pixeldata = np.ndarray((480,640,3), buffer=webcam.getBuffer()[0], dtype='u1')
            revpixels = pixeldata[::-1,:,::-1].tostring()    # tostring is an order of magnitude slower than the entire array manipulation. need a faster method.
            rgbBMP = wx.BitmapFromBuffer(640, 480, revpixels)
            dc = wx.ClientDC(self.panel)
            dc.DrawBitmap(rgbBMP, 0, 0)
            #b = time.clock()
            #print b-a
            time.sleep(0.015)

        return


class TestWxWebCam(wx.Frame):
    def __init__(self, parent):
        wx.Frame.__init__(self, parent, -1, size=(640,480))

        self.worker = WebCamWorker(self)
        self.worker.start()


class TestApp(wx.App):
    def OnInit(self):
        self.frame = TestWxWebCam(None)
        self.frame.Show()

        return True

    def OnExit(self):
        WebCamWorker._abort = 1   # this is a hack


if __name__ == '__main__':
    webcam = VideoCapture.Device()
    webcam.setResolution(640, 480)
    webcam.displayCapturePinProperties()
    app = TestApp()
    app.MainLoop()
    del webcam

### /Code ######

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From robert.kern at gmail.com  Tue Feb 24 19:47:53 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 24 Feb 2009 18:47:53 -0600
Subject: [Numpy-discussion] is there a faster way to get a buffer interface than ndarray.tostring()?
In-Reply-To: <7f014ea60902241615l18ddd46am541af69b12ac381e@mail.gmail.com>
References: <7f014ea60902241615l18ddd46am541af69b12ac381e@mail.gmail.com>
Message-ID: <3d375d730902241647sd1132f7va61b6ef96185e129@mail.gmail.com>

On Tue, Feb 24, 2009 at 18:15, Chris Colbert wrote:
> Hi all,
>
> I'm new to the mailing list and relatively new (~1 year) to python/numpy. I
> would appreciate any insight any of you may have here. The last 8 hours of
> digging through the docs has left me, finally, stuck.
>
> I am making a wxPython program that includes webcam functionality. The
> script below just brings up a simple display window that shows the webcam
> feed. The problem I am running into is in the WebCamWorker.run() method.
> This method reads the raw pixel data (in string form) from the webcam
> buffer. This data (thank you microsoft!) comes in BGR and bottom-to-top. I
> feed this buffer to the array constructor and then swap the pixel order to
> RGB and top-to-bottom. This all happens very fast, ~1ms on a Quad Core, and
> the majority of that time is spent constructing the array (the numpy pixel
> swapping is on the order of E-5s !). The tostring() method, however, takes a
> whopping 10ms to execute. Unfortunately, the wx.BitmapFromBuffer needs a
> single segment buffer interface as an argument. Is there a way to expose my
> numpy array as a buffer to cut down on this conversion time?

buffer(myarray)

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From dalcinl at gmail.com  Tue Feb 24 20:06:21 2009
From: dalcinl at gmail.com (Lisandro Dalcin)
Date: Tue, 24 Feb 2009 22:06:21 -0300
Subject: [Numpy-discussion] is there a faster way to get a buffer interface than ndarray.tostring()?
In-Reply-To: <7f014ea60902241615l18ddd46am541af69b12ac381e@mail.gmail.com>
References: <7f014ea60902241615l18ddd46am541af69b12ac381e@mail.gmail.com>
Message-ID: 

When you do pixeldata[::-1,:,::-1], you just got a new array with different strides, but now non-contiguous... So I believe you really need a fresh copy of the data... tostring() copies, but could be slow... try to use

revpixels = pixeldata[::-1,:,::-1].copy()

...

rgbBMP = wx.BitmapFromBuffer(640, 480, buffer(revpixels))

Perhaps you could optimize this by declaring revpixels outside the loop, and then inside the loop doing

revpixels[...] = pixeldata[::-1,:,::-1]

This way you will save the mem allocation of revpixels at each step of the loop.
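A sketch of what these two suggestions combine to, assuming a fixed 640x480 frame; the function name is illustrative, and np.frombuffer stands in here for the np.ndarray(..., buffer=...) constructor used in the original script:

    import numpy as np

    H, W = 480, 640
    revpixels = np.empty((H, W, 3), dtype=np.uint8)   # allocated once, outside the loop

    def frame_to_rgb(raw):
        # raw: BGR, bottom-to-top bytes from the camera
        pixeldata = np.frombuffer(raw, dtype=np.uint8).reshape(H, W, 3)
        # the reversed slices are a non-contiguous view; assigning into
        # revpixels performs the single required copy
        revpixels[...] = pixeldata[::-1, :, ::-1]
        return revpixels   # C-contiguous, so it exposes a single-segment buffer

Either buffer(revpixels) or revpixels itself can then be handed to wx.BitmapFromBuffer, since a contiguous array supports the buffer interface directly (as Chris confirms further down).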
On Tue, Feb 24, 2009 at 9:15 PM, Chris Colbert wrote: > Hi all, > > ?I'm new to mailing list and relatively new (~1 year) to python/numpy. I > would appreciate any insight any of you may have here. The last 8 hours of > digging through the docs has left me, finally, stuck. > > I am making a wxPython program that includes webcam functionality. The > script below just brings up a simple display window that shows the webcam > feed. The problem I am running into is in the WebCamWorker.run() method. > This method reads the raw pixel data (in string form) from the webcam > buffer. This data (thank you microsoft!) comes in BGR and bottom-to-top. I > feed this buffer to the array constructor and then swap the pixel order to > RGB and top-to-bottom. This all happens very fast, ~1ms on a Quad Core, and > the majority of that time is spent constructing the array (the numpy pixel > swapping is on the order of E-5s !). The tostring() method, however, takes a > whopping 10ms to execute. Unfortunately, the wx.BitmapFromBuffer needs a > single segment buffer interface as an argument. Is there a way to expose my > numpy array as a buffer to cut down on this conversion time? > > I have tried sending the array to the PIL Image.fromarray(), but that > eventually requires me to do a Image.tostring() anyway, which negates any > benefit. > > Thanks in advance for the help! > > S. Chris Colbert > Rehabilitation Robotics Laboratory > University of South Florida > > > ####? Code ####### > > import VideoCapture > import wx > import time > import threading > import numpy as np > import Image > > > class WebCamWorker(threading.Thread): > ??? _abort = 0 > > ??? def __init__(self, parent): > ??????? super(WebCamWorker, self).__init__() > > ??????? self._parent = parent > ??????? self.panel = wx.Panel(parent) > > ??? def run(self): > > ??????? while not self._abort: > > ??????????? #numpy arrays reverse pixel data that comes in from CCD as BGR > and bottom to top > > ??????????? pixeldata = np.ndarray((480,640,3), > buffer=webcam.getBuffer()[0], dtype='u1') > ??????????? revpixels = pixeldata[::-1,:,::-1].tostring()????? #tostring is > an order of magnitude slower than the entire array manipulation. need a > faster method. > > ??????????? rgbBMP = wx.BitmapFromBuffer(640, 480, revpixels) > ??????????? dc = wx.ClientDC(self.panel) > ??????????? dc.DrawBitmap(rgbBMP, 0, 0) > ??????????? #b = time.clock() > ??????????? #print b-a > ??????????? time.sleep(0.015) > > > ??????? return > > > > class TestWxWebCam(wx.Frame): > ??? def __init__(self, parent): > ??????? wx.Frame.__init__(self, parent, -1, size=(640,480)) > > ??????? self.worker = WebCamWorker(self) > ??????? self.worker.start() > > > > class TestApp(wx.App): > ??? def OnInit(self): > ??????? self.frame = TestWxWebCam(None) > ??????? self.frame.Show() > > ??????? return True > > ??? def OnExit(self): > ??????? WebCamWorker._abort = 1 #this is a hack > > > if __name__ == '__main__': > ??? webcam = VideoCapture.Device() > ??? webcam.setResolution(640, 480) > ??? webcam.displayCapturePinProperties() > ??? app = TestApp() > ??? app.MainLoop() > ??? 
del webcam > > > ### /Code ###### > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > -- Lisandro Dalc?n --------------- Centro Internacional de M?todos Computacionales en Ingenier?a (CIMEC) Instituto de Desarrollo Tecnol?gico para la Industria Qu?mica (INTEC) Consejo Nacional de Investigaciones Cient?ficas y T?cnicas (CONICET) PTLC - G?emes 3450, (3000) Santa Fe, Argentina Tel/Fax: +54-(0)342-451.1594 From sccolbert at gmail.com Tue Feb 24 21:27:31 2009 From: sccolbert at gmail.com (Chris Colbert) Date: Tue, 24 Feb 2009 21:27:31 -0500 Subject: [Numpy-discussion] is there a faster way to get a buffer interface than ndarray.tostring()? In-Reply-To: References: <7f014ea60902241615l18ddd46am541af69b12ac381e@mail.gmail.com> Message-ID: <7f014ea60902241827n4a453c8fn48e9836e71ad05bf@mail.gmail.com> thanks for both answers! Lisandro, you're right, I should have declared the array outside the loop. Thanks for catching that! Robert, as always, thanks for the answer. Quick and to the point! You've helped me more than once on the enthought list :) On Tue, Feb 24, 2009 at 8:06 PM, Lisandro Dalcin wrote: > When you do pixeldata[::-1,:,::-1], you just got a new array with > different strides, but now non-contiguous... So I believe you really > need a fresh copy of the data... tostring() copies, but could be > slow... try to use > > revpixels = pixeldata[::-1,:,::-1].copy() > > ... > > rgbBMP = wx.BitmapFromBuffer(640, 480, buffer(revpixels)) > > > Perhaps you could optimize this by declaring revpixels outside de > loop, and then inside de loop doing > > revpixels[...] = pixeldata[::-1,:,::-1] > > > This way you will save the mem allocation of revpixels at each step of the > loop. > > > > > > On Tue, Feb 24, 2009 at 9:15 PM, Chris Colbert > wrote: > > Hi all, > > > > I'm new to mailing list and relatively new (~1 year) to python/numpy. I > > would appreciate any insight any of you may have here. The last 8 hours > of > > digging through the docs has left me, finally, stuck. > > > > I am making a wxPython program that includes webcam functionality. The > > script below just brings up a simple display window that shows the webcam > > feed. The problem I am running into is in the WebCamWorker.run() method. > > This method reads the raw pixel data (in string form) from the webcam > > buffer. This data (thank you microsoft!) comes in BGR and bottom-to-top. > I > > feed this buffer to the array constructor and then swap the pixel order > to > > RGB and top-to-bottom. This all happens very fast, ~1ms on a Quad Core, > and > > the majority of that time is spent constructing the array (the numpy > pixel > > swapping is on the order of E-5s !). The tostring() method, however, > takes a > > whopping 10ms to execute. Unfortunately, the wx.BitmapFromBuffer needs a > > single segment buffer interface as an argument. Is there a way to expose > my > > numpy array as a buffer to cut down on this conversion time? > > > > I have tried sending the array to the PIL Image.fromarray(), but that > > eventually requires me to do a Image.tostring() anyway, which negates any > > benefit. > > > > Thanks in advance for the help! > > > > S. 
Chris Colbert
> > Rehabilitation Robotics Laboratory
> > University of South Florida
> >
> >
> > ####  Code #######
> >
> > import VideoCapture
> > import wx
> > import time
> > import threading
> > import numpy as np
> > import Image
> >
> >
> > class WebCamWorker(threading.Thread):
> >     _abort = 0
> >
> >     def __init__(self, parent):
> >         super(WebCamWorker, self).__init__()
> >
> >         self._parent = parent
> >         self.panel = wx.Panel(parent)
> >
> >     def run(self):
> >
> >         while not self._abort:
> >
> >             # numpy arrays reverse pixel data that comes in from CCD as BGR and bottom to top
> >
> >             pixeldata = np.ndarray((480,640,3), buffer=webcam.getBuffer()[0], dtype='u1')
> >             revpixels = pixeldata[::-1,:,::-1].tostring()    # tostring is an order of magnitude slower than the entire array manipulation. need a faster method.
> >
> >             rgbBMP = wx.BitmapFromBuffer(640, 480, revpixels)
> >             dc = wx.ClientDC(self.panel)
> >             dc.DrawBitmap(rgbBMP, 0, 0)
> >             #b = time.clock()
> >             #print b-a
> >             time.sleep(0.015)
> >
> >         return
> >
> >
> > class TestWxWebCam(wx.Frame):
> >     def __init__(self, parent):
> >         wx.Frame.__init__(self, parent, -1, size=(640,480))
> >
> >         self.worker = WebCamWorker(self)
> >         self.worker.start()
> >
> >
> > class TestApp(wx.App):
> >     def OnInit(self):
> >         self.frame = TestWxWebCam(None)
> >         self.frame.Show()
> >
> >         return True
> >
> >     def OnExit(self):
> >         WebCamWorker._abort = 1   # this is a hack
> >
> >
> > if __name__ == '__main__':
> >     webcam = VideoCapture.Device()
> >     webcam.setResolution(640, 480)
> >     webcam.displayCapturePinProperties()
> >     app = TestApp()
> >     app.MainLoop()
> >     del webcam
> >
> > ### /Code ######
> >
> > _______________________________________________
> > Numpy-discussion mailing list
> > Numpy-discussion at scipy.org
> > http://projects.scipy.org/mailman/listinfo/numpy-discussion
> >
>
> --
> Lisandro Dalcín
> ---------------
> Centro Internacional de Métodos Computacionales en Ingeniería (CIMEC)
> Instituto de Desarrollo Tecnológico para la Industria Química (INTEC)
> Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET)
> PTLC - Güemes 3450, (3000) Santa Fe, Argentina
> Tel/Fax: +54-(0)342-451.1594
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
From sccolbert at gmail.com  Tue Feb 24 21:49:37 2009
From: sccolbert at gmail.com (Chris Colbert)
Date: Tue, 24 Feb 2009 21:49:37 -0500
Subject: [Numpy-discussion] is there a faster way to get a buffer interface than ndarray.tostring()?
In-Reply-To: <7f014ea60902241827n4a453c8fn48e9836e71ad05bf@mail.gmail.com>
References: <7f014ea60902241615l18ddd46am541af69b12ac381e@mail.gmail.com> <7f014ea60902241827n4a453c8fn48e9836e71ad05bf@mail.gmail.com>
Message-ID: <7f014ea60902241849s49e929dfsf8fc8201290a1feb@mail.gmail.com>

As an update for any future googlers:

the problem was with revpixels = pixeldata[::-1,:,::-1], which apparently returns an array that is non-contiguous in memory. What Lisandro suggested worked. pixeldata[::-1,:,::-1].copy() returns a contiguous array object which natively implements a single-segment buffer interface, i.e. no buffer(revpixels) is needed; just simply revpixels.

array.copy() is also 50% faster than array.tostring() on my machine.

Chris

On Tue, Feb 24, 2009 at 9:27 PM, Chris Colbert wrote:
> thanks for both answers!
>
> Lisandro, you're right, I should have declared the array outside the loop.
> Thanks for catching that!
>
> Robert, as always, thanks for the answer. Quick and to the point! You've
> helped me more than once on the enthought list :)
>
>
> On Tue, Feb 24, 2009 at 8:06 PM, Lisandro Dalcin wrote:
>
>> When you do pixeldata[::-1,:,::-1], you just got a new array with
>> different strides, but now non-contiguous... So I believe you really
>> need a fresh copy of the data... tostring() copies, but could be
>> slow... try to use
>>
>> revpixels = pixeldata[::-1,:,::-1].copy()
>>
>> ...
= pixeldata[::-1,:,::-1] >> >> >> This way you will save the mem allocation of revpixels at each step of the >> loop. >> >> >> >> >> >> On Tue, Feb 24, 2009 at 9:15 PM, Chris Colbert >> wrote: >> > Hi all, >> > >> > I'm new to mailing list and relatively new (~1 year) to python/numpy. I >> > would appreciate any insight any of you may have here. The last 8 hours >> of >> > digging through the docs has left me, finally, stuck. >> > >> > I am making a wxPython program that includes webcam functionality. The >> > script below just brings up a simple display window that shows the >> webcam >> > feed. The problem I am running into is in the WebCamWorker.run() method. >> > This method reads the raw pixel data (in string form) from the webcam >> > buffer. This data (thank you microsoft!) comes in BGR and bottom-to-top. >> I >> > feed this buffer to the array constructor and then swap the pixel order >> to >> > RGB and top-to-bottom. This all happens very fast, ~1ms on a Quad Core, >> and >> > the majority of that time is spent constructing the array (the numpy >> pixel >> > swapping is on the order of E-5s !). The tostring() method, however, >> takes a >> > whopping 10ms to execute. Unfortunately, the wx.BitmapFromBuffer needs a >> > single segment buffer interface as an argument. Is there a way to expose >> my >> > numpy array as a buffer to cut down on this conversion time? >> > >> > I have tried sending the array to the PIL Image.fromarray(), but that >> > eventually requires me to do a Image.tostring() anyway, which negates >> any >> > benefit. >> > >> > Thanks in advance for the help! >> > >> > S. Chris Colbert >> > Rehabilitation Robotics Laboratory >> > University of South Florida >> > >> > >> > #### Code ####### >> > >> > import VideoCapture >> > import wx >> > import time >> > import threading >> > import numpy as np >> > import Image >> > >> > >> > class WebCamWorker(threading.Thread): >> > _abort = 0 >> > >> > def __init__(self, parent): >> > super(WebCamWorker, self).__init__() >> > >> > self._parent = parent >> > self.panel = wx.Panel(parent) >> > >> > def run(self): >> > >> > while not self._abort: >> > >> > #numpy arrays reverse pixel data that comes in from CCD as >> BGR >> > and bottom to top >> > >> > pixeldata = np.ndarray((480,640,3), >> > buffer=webcam.getBuffer()[0], dtype='u1') >> > revpixels = pixeldata[::-1,:,::-1].tostring() #tostring >> is >> > an order of magnitude slower than the entire array manipulation. need a >> > faster method. 
>> > >> > rgbBMP = wx.BitmapFromBuffer(640, 480, revpixels) >> > dc = wx.ClientDC(self.panel) >> > dc.DrawBitmap(rgbBMP, 0, 0) >> > #b = time.clock() >> > #print b-a >> > time.sleep(0.015) >> > >> > >> > return >> > >> > >> > >> > class TestWxWebCam(wx.Frame): >> > def __init__(self, parent): >> > wx.Frame.__init__(self, parent, -1, size=(640,480)) >> > >> > self.worker = WebCamWorker(self) >> > self.worker.start() >> > >> > >> > >> > class TestApp(wx.App): >> > def OnInit(self): >> > self.frame = TestWxWebCam(None) >> > self.frame.Show() >> > >> > return True >> > >> > def OnExit(self): >> > WebCamWorker._abort = 1 #this is a hack >> > >> > >> > if __name__ == '__main__': >> > webcam = VideoCapture.Device() >> > webcam.setResolution(640, 480) >> > webcam.displayCapturePinProperties() >> > app = TestApp() >> > app.MainLoop() >> > del webcam >> > >> > >> > ### /Code ###### >> > >> > _______________________________________________ >> > Numpy-discussion mailing list >> > Numpy-discussion at scipy.org >> > http://projects.scipy.org/mailman/listinfo/numpy-discussion >> > >> > >> >> >> >> -- >> Lisandro Dalc?n >> --------------- >> Centro Internacional de M?todos Computacionales en Ingenier?a (CIMEC) >> Instituto de Desarrollo Tecnol?gico para la Industria Qu?mica (INTEC) >> Consejo Nacional de Investigaciones Cient?ficas y T?cnicas (CONICET) >> PTLC - G?emes 3450, (3000) Santa Fe, Argentina >> Tel/Fax: +54-(0)342-451.1594 >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion >> > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cournape at gmail.com Tue Feb 24 22:39:33 2009 From: cournape at gmail.com (David Cournapeau) Date: Wed, 25 Feb 2009 12:39:33 +0900 Subject: [Numpy-discussion] Core math library in numpy In-Reply-To: References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp> <3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com> <5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com> <5b8d13220902240937s70617f9dw996b456eecd8d134@mail.gmail.com> Message-ID: <5b8d13220902241939n51ecc412v476daaf056441d14@mail.gmail.com> On Wed, Feb 25, 2009 at 3:26 AM, Charles R Harris wrote: > > Good point. However, most of the ufuncs involving standard functions like > sin, cos, etc. are implemented as generic loops that are passed a function > pointer and for such functions the call overhead is probably not significant > in the absence of intrinsic hardware implementations. The complex functions > could probably use some inlining as they call other functions. That could > implemented by using some local static functions in the library code that > could be inlined when the library is compiled. > > I think that the first priority should be correctness and portability. Speed > optimizations can come later. I agree. I can see plenty of advantages to force a function call - and I have not seen any clear estimation of the function call cost. If it is a burden, this can be improved later (AFAIK, inline is not a modification of the signature). David From oliphant at enthought.com Tue Feb 24 23:58:51 2009 From: oliphant at enthought.com (Travis E. 
Oliphant) Date: Tue, 24 Feb 2009 22:58:51 -0600 Subject: [Numpy-discussion] Creating arrays with 'titles' in dtype causes TypeError on data access In-Reply-To: <49A40813.4090206@dont-mind.de> References: <49A40813.4090206@dont-mind.de> Message-ID: <49A4D00B.1010109@enthought.com> Ralph Heinkel wrote: > However here something is strange: > > >>>> arr = array([('john', 4),('mark', 3)], >>>> > dtype=[(('source:yy', 'name'),'O'),(('source:xx','id'),int)]) > >>>> arr[0] >>>> > ('john', 4) > >>>> arr[0][0] >>>> > Traceback (most recent call last): > File "", line 1, in > TypeError: function takes at most 2 arguments (3 given) > > > Any ideas what I'm doing wrong? Any help would be appreciated. > > This is a bug in NumPy. I will fix it in the trunk tonight. -Travis -- Travis Oliphant Enthought, Inc. (512) 536-1057 (office) (512) 536-1059 (fax) http://www.enthought.com oliphant at enthought.com From strawman at astraw.com Wed Feb 25 01:22:05 2009 From: strawman at astraw.com (Andrew Straw) Date: Tue, 24 Feb 2009 22:22:05 -0800 Subject: [Numpy-discussion] is there a faster way to get a buffer interface than ndarray.tostring()? In-Reply-To: <7f014ea60902241849s49e929dfsf8fc8201290a1feb@mail.gmail.com> References: <7f014ea60902241615l18ddd46am541af69b12ac381e@mail.gmail.com> <7f014ea60902241827n4a453c8fn48e9836e71ad05bf@mail.gmail.com> <7f014ea60902241849s49e929dfsf8fc8201290a1feb@mail.gmail.com> Message-ID: <49A4E38D.4030204@astraw.com> Given what you're doing, may I also suggest having a look at http://code.astraw.com/projects/motmot/wxglvideo.html -Andrew Chris Colbert wrote: > As an update for any future googlers: > > the problem was with revpixels = pixeldata[::-1,:,;:-1] which apparently > returns an array that is discontinuous in memory. What Lisandro > suggested worked. pixeldata[::-1,:,;:-1].copy() returns a continuous > array object which natively implements a single-segment buffer > interface. i.e. no buffer(revpixels) is needed; just simply revpixels. > > array.copy() is also 50% faster than array.tostring() on my machine. > > Chris > > On Tue, Feb 24, 2009 at 9:27 PM, Chris Colbert > wrote: > > thanks for both answers! > > > Lisandro, you're right, I should have declared the array outside the > loop. Thanks for catching that! > > Robert, as always, thanks for the answer. Quick and to the point! > You've helped me more than once on the enthought list :) > > > > On Tue, Feb 24, 2009 at 8:06 PM, Lisandro Dalcin > wrote: > > When you do pixeldata[::-1,:,::-1], you just got a new array with > different strides, but now non-contiguous... So I believe you really > need a fresh copy of the data... tostring() copies, but could be > slow... try to use > > revpixels = pixeldata[::-1,:,::-1].copy() > > ... > > rgbBMP = wx.BitmapFromBuffer(640, 480, buffer(revpixels)) > > > Perhaps you could optimize this by declaring revpixels outside de > loop, and then inside de loop doing > > revpixels[...] = pixeldata[::-1,:,::-1] > > > This way you will save the mem allocation of revpixels at each > step of the loop. > > > > > > On Tue, Feb 24, 2009 at 9:15 PM, Chris Colbert > > wrote: > > Hi all, > > > > I'm new to mailing list and relatively new (~1 year) to > python/numpy. I > > would appreciate any insight any of you may have here. The > last 8 hours of > > digging through the docs has left me, finally, stuck. > > > > I am making a wxPython program that includes webcam > functionality. The > > script below just brings up a simple display window that shows > the webcam > > feed. 
The problem I am running into is in the > WebCamWorker.run() method. > > This method reads the raw pixel data (in string form) from the > webcam > > buffer. This data (thank you microsoft!) comes in BGR and > bottom-to-top. I > > feed this buffer to the array constructor and then swap the > pixel order to > > RGB and top-to-bottom. This all happens very fast, ~1ms on a > Quad Core, and > > the majority of that time is spent constructing the array (the > numpy pixel > > swapping is on the order of E-5s !). The tostring() method, > however, takes a > > whopping 10ms to execute. Unfortunately, the > wx.BitmapFromBuffer needs a > > single segment buffer interface as an argument. Is there a way > to expose my > > numpy array as a buffer to cut down on this conversion time? > > > > I have tried sending the array to the PIL Image.fromarray(), > but that > > eventually requires me to do a Image.tostring() anyway, which > negates any > > benefit. > > > > Thanks in advance for the help! > > > > S. Chris Colbert > > Rehabilitation Robotics Laboratory > > University of South Florida > > > > > > #### Code ####### > > > > import VideoCapture > > import wx > > import time > > import threading > > import numpy as np > > import Image > > > > > > class WebCamWorker(threading.Thread): > > _abort = 0 > > > > def __init__(self, parent): > > super(WebCamWorker, self).__init__() > > > > self._parent = parent > > self.panel = wx.Panel(parent) > > > > def run(self): > > > > while not self._abort: > > > > #numpy arrays reverse pixel data that comes in > from CCD as BGR > > and bottom to top > > > > pixeldata = np.ndarray((480,640,3), > > buffer=webcam.getBuffer()[0], dtype='u1') > > revpixels = pixeldata[::-1,:,::-1].tostring() > #tostring is > > an order of magnitude slower than the entire array > manipulation. need a > > faster method. 
> > > > rgbBMP = wx.BitmapFromBuffer(640, 480, revpixels) > > dc = wx.ClientDC(self.panel) > > dc.DrawBitmap(rgbBMP, 0, 0) > > #b = time.clock() > > #print b-a > > time.sleep(0.015) > > > > > > return > > > > > > > > class TestWxWebCam(wx.Frame): > > def __init__(self, parent): > > wx.Frame.__init__(self, parent, -1, size=(640,480)) > > > > self.worker = WebCamWorker(self) > > self.worker.start() > > > > > > > > class TestApp(wx.App): > > def OnInit(self): > > self.frame = TestWxWebCam(None) > > self.frame.Show() > > > > return True > > > > def OnExit(self): > > WebCamWorker._abort = 1 #this is a hack > > > > > > if __name__ == '__main__': > > webcam = VideoCapture.Device() > > webcam.setResolution(640, 480) > > webcam.displayCapturePinProperties() > > app = TestApp() > > app.MainLoop() > > del webcam > > > > > > ### /Code ###### > > > > _______________________________________________ > > Numpy-discussion mailing list > > Numpy-discussion at scipy.org > > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > > > > > > > -- > Lisandro Dalc?n > --------------- > Centro Internacional de M?todos Computacionales en Ingenier?a > (CIMEC) > Instituto de Desarrollo Tecnol?gico para la Industria Qu?mica > (INTEC) > Consejo Nacional de Investigaciones Cient?ficas y T?cnicas (CONICET) > PTLC - G?emes 3450, (3000) Santa Fe, Argentina > Tel/Fax: +54-(0)342-451.1594 > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > > > > ------------------------------------------------------------------------ > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From sccolbert at gmail.com Wed Feb 25 01:50:18 2009 From: sccolbert at gmail.com (Chris Colbert) Date: Wed, 25 Feb 2009 01:50:18 -0500 Subject: [Numpy-discussion] is there a faster way to get a buffer interface than ndarray.tostring()? In-Reply-To: <49A4E38D.4030204@astraw.com> References: <7f014ea60902241615l18ddd46am541af69b12ac381e@mail.gmail.com> <7f014ea60902241827n4a453c8fn48e9836e71ad05bf@mail.gmail.com> <7f014ea60902241849s49e929dfsf8fc8201290a1feb@mail.gmail.com> <49A4E38D.4030204@astraw.com> Message-ID: <7f014ea60902242250x28c968d5i788a44385c5397cc@mail.gmail.com> thanks! On Wed, Feb 25, 2009 at 1:22 AM, Andrew Straw wrote: > Given what you're doing, may I also suggest having a look at > http://code.astraw.com/projects/motmot/wxglvideo.html > > -Andrew > > Chris Colbert wrote: > > As an update for any future googlers: > > > > the problem was with revpixels = pixeldata[::-1,:,;:-1] which apparently > > returns an array that is discontinuous in memory. What Lisandro > > suggested worked. pixeldata[::-1,:,;:-1].copy() returns a continuous > > array object which natively implements a single-segment buffer > > interface. i.e. no buffer(revpixels) is needed; just simply revpixels. > > > > array.copy() is also 50% faster than array.tostring() on my machine. > > > > Chris > > > > On Tue, Feb 24, 2009 at 9:27 PM, Chris Colbert > > wrote: > > > > thanks for both answers! > > > > > > Lisandro, you're right, I should have declared the array outside the > > loop. Thanks for catching that! > > > > Robert, as always, thanks for the answer. Quick and to the point! 
> >
> > On Tue, Feb 24, 2009 at 9:27 PM, Chris Colbert wrote:
> >
> > thanks for both answers!
> >
> > Lisandro, you're right, I should have declared the array outside the
> > loop. Thanks for catching that!
> >
> > Robert, as always, thanks for the answer. Quick and to the point!
> > You've helped me more than once on the enthought list :)
> >
> >
> > On Tue, Feb 24, 2009 at 8:06 PM, Lisandro Dalcin wrote:
> >
> > When you do pixeldata[::-1,:,::-1], you just got a new array with
> > different strides, but now non-contiguous... So I believe you really
> > need a fresh copy of the data... tostring() copies, but could be
> > slow... try to use
> >
> > revpixels = pixeldata[::-1,:,::-1].copy()
> >
> > ...
> >
> > rgbBMP = wx.BitmapFromBuffer(640, 480, buffer(revpixels))
> >
> >
> > Perhaps you could optimize this by declaring revpixels outside the
> > loop, and then inside the loop doing
> >
> > revpixels[...] = pixeldata[::-1,:,::-1]
> >
> >
> > This way you will save the mem allocation of revpixels at each
> > step of the loop.
> >
> >
> >
> > On Tue, Feb 24, 2009 at 9:15 PM, Chris Colbert wrote:
> > > Hi all,
> > >
> > > I'm new to the mailing list and relatively new (~1 year) to
> > > python/numpy. I would appreciate any insight any of you may have
> > > here. The last 8 hours of digging through the docs has left me,
> > > finally, stuck.
> > >
> > > I am making a wxPython program that includes webcam functionality.
> > > The script below just brings up a simple display window that shows
> > > the webcam feed. The problem I am running into is in the
> > > WebCamWorker.run() method. This method reads the raw pixel data
> > > (in string form) from the webcam buffer. This data (thank you
> > > microsoft!) comes in BGR and bottom-to-top. I feed this buffer to
> > > the array constructor and then swap the pixel order to RGB and
> > > top-to-bottom. This all happens very fast, ~1ms on a Quad Core,
> > > and the majority of that time is spent constructing the array (the
> > > numpy pixel swapping is on the order of E-5s!). The tostring()
> > > method, however, takes a whopping 10ms to execute. Unfortunately,
> > > wx.BitmapFromBuffer needs a single-segment buffer interface as an
> > > argument. Is there a way to expose my numpy array as a buffer to
> > > cut down on this conversion time?
> > >
> > > I have tried sending the array to the PIL Image.fromarray(), but
> > > that eventually requires me to do an Image.tostring() anyway,
> > > which negates any benefit.
> > >
> > > Thanks in advance for the help!
> > >
> > > S. Chris Colbert
> > > Rehabilitation Robotics Laboratory
> > > University of South Florida
> > >
> > >
> > > #### Code #######
> > >
> > > import VideoCapture
> > > import wx
> > > import time
> > > import threading
> > > import numpy as np
> > > import Image
> > >
> > >
> > > class WebCamWorker(threading.Thread):
> > >     _abort = 0
> > >
> > >     def __init__(self, parent):
> > >         super(WebCamWorker, self).__init__()
> > >
> > >         self._parent = parent
> > >         self.panel = wx.Panel(parent)
> > >
> > >     def run(self):
> > >
> > >         while not self._abort:
> > >
> > >             # numpy arrays reverse pixel data that comes in from CCD
> > >             # as BGR and bottom to top
> > >
> > >             pixeldata = np.ndarray((480,640,3),
> > >                                    buffer=webcam.getBuffer()[0],
> > >                                    dtype='u1')
> > >             revpixels = pixeldata[::-1,:,::-1].tostring()
> > >             # tostring is an order of magnitude slower than the
> > >             # entire array manipulation. need a faster method.
> > >
> > >             rgbBMP = wx.BitmapFromBuffer(640, 480, revpixels)
> > >             dc = wx.ClientDC(self.panel)
> > >             dc.DrawBitmap(rgbBMP, 0, 0)
> > >             #b = time.clock()
> > >             #print b-a
> > >             time.sleep(0.015)
> > >
> > >         return
> > >
> > >
> > > class TestWxWebCam(wx.Frame):
> > >     def __init__(self, parent):
> > >         wx.Frame.__init__(self, parent, -1, size=(640,480))
> > >
> > >         self.worker = WebCamWorker(self)
> > >         self.worker.start()
> > >
> > >
> > > class TestApp(wx.App):
> > >     def OnInit(self):
> > >         self.frame = TestWxWebCam(None)
> > >         self.frame.Show()
> > >
> > >         return True
> > >
> > >     def OnExit(self):
> > >         WebCamWorker._abort = 1  # this is a hack
> > >
> > >
> > > if __name__ == '__main__':
> > >     webcam = VideoCapture.Device()
> > >     webcam.setResolution(640, 480)
> > >     webcam.displayCapturePinProperties()
> > >     app = TestApp()
> > >     app.MainLoop()
> > >     del webcam
> > >
> > >
> > > ### /Code ######
> > >
> > > _______________________________________________
> > > Numpy-discussion mailing list
> > > Numpy-discussion at scipy.org
> > > http://projects.scipy.org/mailman/listinfo/numpy-discussion
> > >
> >
> >
> > --
> > Lisandro Dalcín
> > ---------------
> > Centro Internacional de Métodos Computacionales en Ingeniería (CIMEC)
> > Instituto de Desarrollo Tecnológico para la Industria Química (INTEC)
> > Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET)
> > PTLC - Güemes 3450, (3000) Santa Fe, Argentina
> > Tel/Fax: +54-(0)342-451.1594
> > _______________________________________________
> > Numpy-discussion mailing list
> > Numpy-discussion at scipy.org
> > http://projects.scipy.org/mailman/listinfo/numpy-discussion
>
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From matthieu.brucher at gmail.com  Wed Feb 25 03:27:32 2009
From: matthieu.brucher at gmail.com (Matthieu Brucher)
Date: Wed, 25 Feb 2009 09:27:32 +0100
Subject: [Numpy-discussion] Core math library in numpy
In-Reply-To: <5b8d13220902241939n51ecc412v476daaf056441d14@mail.gmail.com>
References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp>
	<3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com>
	<5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com>
	<5b8d13220902240937s70617f9dw996b456eecd8d134@mail.gmail.com>
	<5b8d13220902241939n51ecc412v476daaf056441d14@mail.gmail.com>
Message-ID:

> (AFAIK, inline is not a
> modification of the signature).

Indeed, even more so for C.

Matthieu
--
Information System Engineer, Ph.D.
Website: http://matthieu-brucher.developpez.com/
Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92
LinkedIn: http://www.linkedin.com/in/matthieubrucher

From ondrej at certik.cz  Wed Feb 25 04:47:41 2009
From: ondrej at certik.cz (Ondrej Certik)
Date: Wed, 25 Feb 2009 01:47:41 -0800
Subject: [Numpy-discussion] Problem looping over numpy array in C
In-Reply-To: <35905.128.111.8.119.1235161164.squirrel@webmail.xs4all.nl>
References: <35905.128.111.8.119.1235161164.squirrel@webmail.xs4all.nl>
Message-ID: <85b5c3130902250147y4f47a185hcae0f4f93fa9d96c@mail.gmail.com>

On Fri, Feb 20, 2009 at 12:19 PM, Sumant S.R. Oemrawsingh wrote:
> Hi guys,
>
> I have a problem with looping over numpy arrays in C. I modify the array
> in-place (and return None), but after modification, the array doesn't
> seem to play nice any more.
>
> Below, I have the C code for a function do_something (a stripped version
> of my original function), which has as arguments a 1D numpy array (float
> or complex) and either an int or a sequence of ints.
>
> In python, I do the following using only the integer:
>
>>>> a = array([0., 1., 2., 3., 2., 1., 0.])
>>>> do_something(a,3)
>>>> savetxt('bla',a)
>>>>
>
> Which works fine. However, when I do the same, but with a list of any
> length larger than 0:
>
>>>> a = array([0., 1., 2., 3., 2., 1., 0.])
>>>> do_something(a,[3])
>>>> savetxt('bla',a)
> Traceback (most recent call last):
>   File "", line 1, in
>   File "/usr/lib/python2.5/site-packages/numpy/core/numeric.py", line 767,
> in savetxt
>     X = asarray(X)
>   File "/usr/lib/python2.5/site-packages/numpy/core/numeric.py", line 132,
> in asarray
>     return array(a, dtype, copy=False, order=order)
> TypeError: an integer is required
>>>> savetxt('bla',a)
>>>>
>
> For some reason, the first time it doesn't work; the asarray fails but
> then succeeds on a second try. The resulting file is identical to when I
> would have looped over the list in python, and called the do_something
> function with each integer separately. For instance:
>
> for i in [3,1,0]:
>     do_something(a, i)
>
> works fine as well. So apparently looping in C temporarily leaves the
> array in a weird state but manages to automatically be restored after one
> exception. I've checked that type(a), a.dtype, a.shape and a.ndim remain
> the same before and after calling do_something with a sequence or with an
> integer as second argument. That doesn't seem to be the problem.
>
> The reason that I want to do the loop in C is that I need some
> precalculations before being able to do the actual loop. But they might
> not be time-consuming enough to warrant writing it in C, so I could just
> do the loop in python and not have this problem any more. However, if the
> loop in C manages to somehow (temporarily) corrupt the array, how can I
> be sure that the single-shot call doesn't succeed by accident?
>
> If anyone can help, suggest something to try or spot my mistake, I'd
> appreciate it.

I don't have time to go through your code, but I suggest you try
Cython, which plays very nicely with numpy.

Ondrej

From ondrej at certik.cz  Wed Feb 25 04:49:29 2009
From: ondrej at certik.cz (Ondrej Certik)
Date: Wed, 25 Feb 2009 01:49:29 -0800
Subject: [Numpy-discussion] denormal test: what is it for ?
In-Reply-To: <5b8d13220902170808p7848b42eo2e8a47257478b79@mail.gmail.com>
References: <5b8d13220902170808p7848b42eo2e8a47257478b79@mail.gmail.com>
Message-ID: <85b5c3130902250149h51e7716es84b997cd941b51a1@mail.gmail.com>

On Tue, Feb 17, 2009 at 8:08 AM, David Cournapeau wrote:
> Hi,
>
> I would like to continue cleaning the setup.py from numpy/core, and
> there is one test that I can't make sense of: the denormal thing
> (testcode_mathlib function). The svn log has no information on it (its
> presence goes back to the first revision of the file). What is it
> useful for ? Denormal support ? Whether exp works ?

I guess no one knows. :)

Ondrej

From slaunger at gmail.com  Wed Feb 25 07:28:26 2009
From: slaunger at gmail.com (Kim Hansen)
Date: Wed, 25 Feb 2009 13:28:26 +0100
Subject: [Numpy-discussion] Numpy array in iterable
Message-ID:

Hi Numpy discussions,

Quite often I find myself wanting to generate a boolean mask for fancy
slicing of some array, where the mask itself is generated by checking
whether its value has one of several relevant values (corresponding to
states). So at the element level this corresponds to checking

element in iterable

But I can't use the in operator on a numpy array:

In [1]: test = arange(5)
In [2]: states = [0, 2]
In [3]: mask = test in states
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
C:\Documents and Settings\kha\ in ()
ValueError: The truth value of an array with more than one element is ambiguous.
Use a.any() or a.all()

I can, however, make my own utility function which works effectively the
same way by iterating through the states:

In [4]: for i, state in enumerate(states):
   ...:     if i == 0:
   ...:         result = test == state
   ...:     else:
   ...:         result |= test == state
   ...:
   ...:
In [5]: result
Out[5]: array([ True, False,  True, False, False], dtype=bool)

However, I would have thought such an "array.is_in()" utility function
was already available in the numpy package? But I can't find it, and I am
curious to hear whether it is there, or whether it is just available in
another form which I have simply overlooked.

If it is not there, I think it could be a nice extra utility function
for the ndarray object.

--Slaunger

From josef.pktd at gmail.com  Wed Feb 25 08:40:47 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Wed, 25 Feb 2009 08:40:47 -0500
Subject: [Numpy-discussion] Numpy array in iterable
In-Reply-To:
References:
Message-ID: <1cd32cbb0902250540u66da1969pfe481dafbb061cea@mail.gmail.com>

On Wed, Feb 25, 2009 at 7:28 AM, Kim Hansen wrote:
> Hi Numpy discussions,
> Quite often I find myself wanting to generate a boolean mask for fancy
> slicing of some array, where the mask itself is generated by checking
> whether its value has one of several relevant values (corresponding to
> states). So at the element level this corresponds to checking
>
> element in iterable
>
> But I can't use the in operator on a numpy array:
>
> In [1]: test = arange(5)
> In [2]: states = [0, 2]
> In [3]: mask = test in states
> ---------------------------------------------------------------------------
> ValueError                                Traceback (most recent call last)
> C:\Documents and Settings\kha\ in ()
> ValueError: The truth value of an array with more than one element is ambiguous.
> Use a.any() or a.all()
>
> I can, however, make my own utility function which works effectively the
> same way by iterating through the states:
>
> In [4]: for i, state in enumerate(states):
>    ...:     if i == 0:
>    ...:         result = test == state
>    ...:     else:
>    ...:         result |= test == state
>    ...:
>    ...:
> In [5]: result
> Out[5]: array([ True, False,  True, False, False], dtype=bool)
>
> However, I would have thought such an "array.is_in()" utility function
> was already available in the numpy package? But I can't find it, and I am
> curious to hear whether it is there, or whether it is just available in
> another form which I have simply overlooked.
>
> If it is not there, I think it could be a nice extra utility function
> for the ndarray object.
>
> --Slaunger
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>

does this help:

>>> np.setmember1d(test,states)
array([ True, False,  True, False, False], dtype=bool)

Josef

From slaunger at gmail.com  Wed Feb 25 09:02:24 2009
From: slaunger at gmail.com (Kim Hansen)
Date: Wed, 25 Feb 2009 15:02:24 +0100
Subject: [Numpy-discussion] Numpy array in iterable
In-Reply-To: <1cd32cbb0902250540u66da1969pfe481dafbb061cea@mail.gmail.com>
References: <1cd32cbb0902250540u66da1969pfe481dafbb061cea@mail.gmail.com>
Message-ID:

Yes, this is exactly what I was after, only the function name did not
ring a bell (I still cannot associate it with something meaningful for
my use case). Thanks!

-- Slaunger

2009/2/25 :
> On Wed, Feb 25, 2009 at 7:28 AM, Kim Hansen wrote:
>> Hi Numpy discussions,
>> Quite often I find myself wanting to generate a boolean mask for fancy
>> slicing of some array, where the mask itself is generated by checking
>> whether its value has one of several relevant values (corresponding to
>> states). So at the element level this corresponds to checking
>>
>> element in iterable
>>
>> But I can't use the in operator on a numpy array:
>>
>> In [1]: test = arange(5)
>> In [2]: states = [0, 2]
>> In [3]: mask = test in states
>> ---------------------------------------------------------------------------
>> ValueError                                Traceback (most recent call last)
>> C:\Documents and Settings\kha\ in ()
>> ValueError: The truth value of an array with more than one element is ambiguous.
>> Use a.any() or a.all()
>>
>> I can, however, make my own utility function which works effectively the
>> same way by iterating through the states:
>>
>> In [4]: for i, state in enumerate(states):
>>    ...:     if i == 0:
>>    ...:         result = test == state
>>    ...:     else:
>>    ...:         result |= test == state
>>    ...:
>>    ...:
>> In [5]: result
>> Out[5]: array([ True, False,  True, False, False], dtype=bool)
>>
>> However, I would have thought such an "array.is_in()" utility function
>> was already available in the numpy package? But I can't find it, and I am
>> curious to hear whether it is there, or whether it is just available in
>> another form which I have simply overlooked.
>>
>> If it is not there, I think it could be a nice extra utility function
>> for the ndarray object.
>>
>> --Slaunger
>> _______________________________________________
>> Numpy-discussion mailing list
>> Numpy-discussion at scipy.org
>> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>>
>
> does this help:
>
>>>> np.setmember1d(test,states)
> array([ True, False,  True, False, False], dtype=bool)
>
> Josef
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>

From josef.pktd at gmail.com  Wed Feb 25 09:33:11 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Wed, 25 Feb 2009 09:33:11 -0500
Subject: [Numpy-discussion] Numpy array in iterable
In-Reply-To:
References: <1cd32cbb0902250540u66da1969pfe481dafbb061cea@mail.gmail.com>
Message-ID: <1cd32cbb0902250633y121e134fv2ade89ee200875df@mail.gmail.com>

On Wed, Feb 25, 2009 at 9:02 AM, Kim Hansen wrote:
> Yes, this is exactly what I was after, only the function name did not
> ring a bell (I still cannot associate it with something meaningful for
> my use case). Thanks!
>
> -- Slaunger
>

I just looked under "set routines" in the help file. I really like the
speed of the windows help file.

Josef

From slaunger at gmail.com  Wed Feb 25 13:37:16 2009
From: slaunger at gmail.com (Kim Hansen)
Date: Wed, 25 Feb 2009 19:37:16 +0100
Subject: [Numpy-discussion] Numpy array in iterable
In-Reply-To: <1cd32cbb0902250633y121e134fv2ade89ee200875df@mail.gmail.com>
References: <1cd32cbb0902250540u66da1969pfe481dafbb061cea@mail.gmail.com>
	<1cd32cbb0902250633y121e134fv2ade89ee200875df@mail.gmail.com>
Message-ID:

> I just looked under "set routines" in the help file. I really like the
> speed of the windows help file.

Is there a Numpy windows help file?

Cool!

But where is it? I can't find it in my numpy 1.2.1 installation?!?

I like the Python 2.5 Windows help file too, and I agree it is a fast
and efficient way to find what you need, fast.

--Slaunger

From josef.pktd at gmail.com  Wed Feb 25 14:21:40 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Wed, 25 Feb 2009 14:21:40 -0500
Subject: [Numpy-discussion] Numpy array in iterable
In-Reply-To:
References: <1cd32cbb0902250540u66da1969pfe481dafbb061cea@mail.gmail.com>
	<1cd32cbb0902250633y121e134fv2ade89ee200875df@mail.gmail.com>
Message-ID: <1cd32cbb0902251121o15879bc7hd1a3f8f1f718572b@mail.gmail.com>

On Wed, Feb 25, 2009 at 1:37 PM, Kim Hansen wrote:
>> I just looked under "set routines" in the help file. I really like the
>> speed of the windows help file.
>
> Is there a Numpy windows help file?
>
> Cool!
>
> But where is it? I can't find it in my numpy 1.2.1 installation?!?
>
> I like the Python 2.5 Windows help file too, and I agree it is a fast
> and efficient way to find what you need, fast.
>
> --Slaunger

You can download them from here:

http://docs.scipy.org/doc/

But since the docs are continuously improved, it is also worth checking
the online documentation if the help file is not very informative about
something.

Josef

From Anthony.Kong at macquarie.com  Wed Feb 25 18:21:48 2009
From: Anthony.Kong at macquarie.com (Anthony Kong)
Date: Thu, 26 Feb 2009 10:21:48 +1100
Subject: [Numpy-discussion] Question on lstsq and correlation coeff
Message-ID: <90CBFFFE6273484B9579400AC950800502024765@ntsydexm01.pc.internal.macquarie.com>

Hi, all,

It is probably a newbie question.

I am trying to use scipy/numpy in a financial context. I want to compute
the correlation coeff of two series (returns vs index returns).
I tried two approaches.

Firstly,

from scipy.linalg import lstsq
coeffs,a,b,c = lstsq(matrix, returns)  # matrix contains index returns

then I tried,

import numpy as np
cov = np.cov(idx1, returns)
print cov.tolist()
stddev_x = np.std(returns, ddof=1)
stddev_y = np.std(idx1, ddof=1)
print "cor = %s" % (cov.tolist()[:-1] /(stddev_x * stddev_y))

They differ from each other.

As you can see from the numpy example, I am trying to find the
correlation coeff for a sample (ddof=1).

So, my question is: is the discrepancy caused by the fact that I am
trying to use lstsq() on a 'sample population' (i.e. I am not regressing
a full return series)? Is it correct to use lstsq() this way?

Cheers, Anthony

NOTICE
This e-mail and any attachments are confidential and may contain
copyright material of Macquarie Group Limited or third parties. If you
are not the intended recipient of this email you should not read, print,
re-transmit, store or act in reliance on this e-mail or any attachments,
and should destroy all copies of them. Macquarie Group Limited does not
guarantee the integrity of any emails or any attached files. The views or
opinions expressed are the author's own and may not reflect the views or
opinions of Macquarie Group Limited.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From josef.pktd at gmail.com  Wed Feb 25 19:08:48 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Wed, 25 Feb 2009 19:08:48 -0500
Subject: [Numpy-discussion] Question on lstsq and correlation coeff
In-Reply-To: <90CBFFFE6273484B9579400AC950800502024765@ntsydexm01.pc.internal.macquarie.com>
References: <90CBFFFE6273484B9579400AC950800502024765@ntsydexm01.pc.internal.macquarie.com>
Message-ID: <1cd32cbb0902251608x5b407a4dv9ec0dccf55d7d3ca@mail.gmail.com>

On Wed, Feb 25, 2009 at 6:21 PM, Anthony Kong wrote:
> Hi, all,
>
> It is probably a newbie question.
>
> I am trying to use scipy/numpy in a financial context. I want to compute
> the correlation coeff of two series (returns vs index returns). I tried
> two approaches.
>
> Firstly,
>
> from scipy.linalg import lstsq
> coeffs,a,b,c = lstsq(matrix, returns)  # matrix contains index returns
>
> then I tried,
>
> import numpy as np
> cov = np.cov(idx1, returns)
> print cov.tolist()
> stddev_x = np.std(returns, ddof=1)
> stddev_y = np.std(idx1, ddof=1)
> print "cor = %s" % (cov.tolist()[:-1] /(stddev_x * stddev_y))
>
> They differ from each other.
>
> As you can see from the numpy example, I am trying to find the
> correlation coeff for a sample (ddof=1).
>
> So, my question is: is the discrepancy caused by the fact that I am
> trying to use lstsq() on a 'sample population' (i.e. I am not regressing
> a full return series)? Is it correct to use lstsq() this way?
>

The most direct way is to calculate the correlation matrix and use index
[0,1] to get the coefficient:

numpy.corrcoef(x, y=None, rowvar=1, bias=0)

np.cov, that you used, uses the biased estimator (denominator = N) by
default, but for std you used N-1.

Josef

From Anthony.Kong at macquarie.com  Wed Feb 25 19:31:37 2009
From: Anthony.Kong at macquarie.com (Anthony Kong)
Date: Thu, 26 Feb 2009 11:31:37 +1100
Subject: [Numpy-discussion] Question on lstsq and correlation coeff
In-Reply-To: <1cd32cbb0902251608x5b407a4dv9ec0dccf55d7d3ca@mail.gmail.com>
References: <90CBFFFE6273484B9579400AC950800502024765@ntsydexm01.pc.internal.macquarie.com>
	<1cd32cbb0902251608x5b407a4dv9ec0dccf55d7d3ca@mail.gmail.com>
Message-ID: <90CBFFFE6273484B9579400AC950800502024896@ntsydexm01.pc.internal.macquarie.com>

Hi, Josef,

Thanks very much for the quick and helpful response.

Could you also comment on the use of lstsq(): why does it lead to an
inconsistent result?

Cheers, Anthony

-----Original Message-----
From: numpy-discussion-bounces at scipy.org
[mailto:numpy-discussion-bounces at scipy.org] On Behalf Of
josef.pktd at gmail.com
Sent: Thursday, 26 February 2009 11:09 AM
To: Discussion of Numerical Python
Subject: Re: [Numpy-discussion] Question on lstsq and correlation coeff

On Wed, Feb 25, 2009 at 6:21 PM, Anthony Kong wrote:
> Hi, all,
>
> It is probably a newbie question.
>
> I am trying to use scipy/numpy in a financial context. I want to compute
> the correlation coeff of two series (returns vs index returns). I tried
> two approaches.
>
> Firstly,
>
> from scipy.linalg import lstsq
> coeffs,a,b,c = lstsq(matrix, returns)  # matrix contains index returns
>
> then I tried,
>
> import numpy as np
> cov = np.cov(idx1, returns)
> print cov.tolist()
> stddev_x = np.std(returns, ddof=1)
> stddev_y = np.std(idx1, ddof=1)
> print "cor = %s" % (cov.tolist()[:-1] /(stddev_x * stddev_y))
>
> They differ from each other.
>
> As you can see from the numpy example, I am trying to find the
> correlation coeff for a sample (ddof=1).
>
> So, my question is: is the discrepancy caused by the fact that I am
> trying to use lstsq() on a 'sample population' (i.e. I am not regressing
> a full return series)? Is it correct to use lstsq() this way?
>

The most direct way is to calculate the correlation matrix and use index
[0,1] to get the coefficient:

numpy.corrcoef(x, y=None, rowvar=1, bias=0)

np.cov, that you used, uses the biased estimator (denominator = N) by
default, but for std you used N-1.

Josef
_______________________________________________
Numpy-discussion mailing list
Numpy-discussion at scipy.org
http://projects.scipy.org/mailman/listinfo/numpy-discussion

NOTICE
This e-mail and any attachments are confidential and may contain
copyright material of Macquarie Group Limited or third parties. If you
are not the intended recipient of this email you should not read, print,
re-transmit, store or act in reliance on this e-mail or any attachments,
and should destroy all copies of them. Macquarie Group Limited does not
guarantee the integrity of any emails or any attached files. The views or
opinions expressed are the author's own and may not reflect the views or
opinions of Macquarie Group Limited.
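For readers following along, a minimal sketch of the distinction this
thread is circling: correlation via corrcoef versus the regression slope
(beta) that lstsq returns. The return series and variable names here are
made up for illustration; only corrcoef, column_stack and lstsq are real
numpy/scipy calls:

import numpy as np
from scipy.linalg import lstsq

np.random.seed(0)
idx1 = np.random.randn(100)                      # synthetic index returns
returns = 0.8*idx1 + 0.3*np.random.randn(100)    # synthetic stock returns

# Correlation coefficient: off-diagonal entry of the correlation matrix.
corr = np.corrcoef(idx1, returns)[0, 1]

# Regression slope (beta): least squares against the index plus an
# intercept column.
A = np.column_stack((idx1, np.ones_like(idx1)))
beta = lstsq(A, returns)[0][0]

# The two quantities are related but not equal:
#   beta = corr * std(returns) / std(idx1)
print corr, beta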
From kwgoodman at gmail.com  Wed Feb 25 19:48:08 2009
From: kwgoodman at gmail.com (Keith Goodman)
Date: Wed, 25 Feb 2009 16:48:08 -0800
Subject: [Numpy-discussion] Question on lstsq and correlation coeff
In-Reply-To: <90CBFFFE6273484B9579400AC950800502024765@ntsydexm01.pc.internal.macquarie.com>
References: <90CBFFFE6273484B9579400AC950800502024765@ntsydexm01.pc.internal.macquarie.com>
Message-ID:

On Wed, Feb 25, 2009 at 3:21 PM, Anthony Kong wrote:
> I am trying to use scipy/numpy in a financial context. I want to compute
> the correlation coeff of two series (returns vs index returns). I tried
> two approaches.
>
> Firstly,
>
> from scipy.linalg import lstsq
> coeffs,a,b,c = lstsq(matrix, returns)  # matrix contains index returns
>
> then I tried,
>
> import numpy as np
> cov = np.cov(idx1, returns)
> print cov.tolist()
> stddev_x = np.std(returns, ddof=1)
> stddev_y = np.std(idx1, ddof=1)
> print "cor = %s" % (cov.tolist()[:-1] /(stddev_x * stddev_y))
>
> They differ from each other.

coeffs in

coeffs,a,b,c = lstsq(matrix, returns)  # matrix contains index returns

is the beta of the stock with respect to the index, not the correlation.

From cournape at gmail.com  Thu Feb 26 01:54:33 2009
From: cournape at gmail.com (David Cournapeau)
Date: Thu, 26 Feb 2009 15:54:33 +0900
Subject: [Numpy-discussion] Core math library in numpy
In-Reply-To:
References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp>
	<3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com>
	<5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com>
	<5b8d13220902190805s2d12ac96va40b6062959fc53@mail.gmail.com>
	<5b8d13220902231302x42ede056i63a72ce968da9c8b@mail.gmail.com>
Message-ID: <5b8d13220902252254s6e12f5b0y2f032bbabbe3b843@mail.gmail.com>

On Tue, Feb 24, 2009 at 7:01 AM, Charles R Harris wrote:
>
> On Mon, Feb 23, 2009 at 2:02 PM, David Cournapeau wrote:
>>
>> On Fri, Feb 20, 2009 at 5:26 PM, Pauli Virtanen wrote:
>> > Fri, 20 Feb 2009 01:05:03 +0900, David Cournapeau wrote:
>> > [clip]
>> >>> I think they should be. Then we could easily use C99 complex math
>> >>> functions on platforms on which they are available (and so get the
>> >>> "correct" corner-case semantics for free on these platforms).
>> >>
>> >> maybe we could change the names, then ? nc is not very clear IMHO (and
>> >> since they were static up to now, we are free to change them I
>> >> believe).
>> >
>> > I think it would make sense to change them to follow C99 function
>> > names, with a npy_ prefix.
>>
>> The problem of complex functions is that they don't follow the C99
>> conventions at all. In particular, they accept pointers instead of
>> values. I don't know whether the rationale for using pointers is still
>> valid (it is mentioned that how to pass structure is compiler
>
> The usual rationale is that it is more efficient to pass a pointer than
> to push two floats on the stack. It is also an easier way to return
> values, although recent versions of gcc do a good job of copying the
> return values where they need to go. I would stick with the pointers,
> although we could probably dispense with the structure and just use a
> pointer to the underlying type with the assumption that the real &
> imaginary parts are contiguous in memory. The ufuncs make that
> assumption in any case.

Ok, what about making the decision for complex functions later, and
merging now the real functions (the one with clear API)? Having
coremath in trunk would help me track down crashes on windows x64
(the trunk does not build right now because of some configuration
problems which are not a concern anymore with the coremath branch),

cheers,

David

From charlesr.harris at gmail.com  Thu Feb 26 02:32:25 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Thu, 26 Feb 2009 00:32:25 -0700
Subject: [Numpy-discussion] Core math library in numpy
In-Reply-To: <5b8d13220902252254s6e12f5b0y2f032bbabbe3b843@mail.gmail.com>
References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp>
	<3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com>
	<5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com>
	<5b8d13220902190805s2d12ac96va40b6062959fc53@mail.gmail.com>
	<5b8d13220902231302x42ede056i63a72ce968da9c8b@mail.gmail.com>
	<5b8d13220902252254s6e12f5b0y2f032bbabbe3b843@mail.gmail.com>
Message-ID:

On Wed, Feb 25, 2009 at 11:54 PM, David Cournapeau wrote:
> On Tue, Feb 24, 2009 at 7:01 AM, Charles R Harris wrote:
> >
> > On Mon, Feb 23, 2009 at 2:02 PM, David Cournapeau wrote:
> >>
> >> On Fri, Feb 20, 2009 at 5:26 PM, Pauli Virtanen wrote:
> >> > Fri, 20 Feb 2009 01:05:03 +0900, David Cournapeau wrote:
> >> > [clip]
> >> >>> I think they should be. Then we could easily use C99 complex math
> >> >>> functions on platforms on which they are available (and so get the
> >> >>> "correct" corner-case semantics for free on these platforms).
> >> >>
> >> >> maybe we could change the names, then ? nc is not very clear IMHO
> >> >> (and since they were static up to now, we are free to change them I
> >> >> believe).
> >> >
> >> > I think it would make sense to change them to follow C99 function
> >> > names, with a npy_ prefix.
> >>
> >> The problem of complex functions is that they don't follow the C99
> >> conventions at all. In particular, they accept pointers instead of
> >> values. I don't know whether the rationale for using pointers is still
> >> valid (it is mentioned that how to pass structure is compiler
> >
> > The usual rationale is that it is more efficient to pass a pointer than
> > to push two floats on the stack. It is also an easier way to return
> > values, although recent versions of gcc do a good job of copying the
> > return values where they need to go. I would stick with the pointers,
> > although we could probably dispense with the structure and just use a
> > pointer to the underlying type with the assumption that the real &
> > imaginary parts are contiguous in memory. The ufuncs make that
> > assumption in any case.
>
> Ok, what about making the decision for complex functions later, and
> merging now the real functions (the one with clear API)? Having
> coremath in trunk would help me track down crashes on windows x64
> (the trunk does not build right now because of some configuration
> problems which are not a concern anymore with the coremath branch),

Sounds good, I don't think the complex functions are a pressing concern.
But I suspect we should start looking forward to a code freeze in a month
or so and getting the build working is a clear priority. Maybe we should
start the release process? I have a short list of things I think I should
finish up and now might be a good time to put together a list of things
to do.

I've been thinking a bit about the complex functions. It might be worth
benchmarking a few to see how much it costs to pass full complex numbers
back and forth.
If the functions were inlined the optimizer could probably do good things
with the FPU registers, but how things will work when the functions are
in a library is another question. And rewriting the functions will be a
lot of work unless we can copy a bunch of them from someone else.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From cournape at gmail.com  Thu Feb 26 03:23:46 2009
From: cournape at gmail.com (David Cournapeau)
Date: Thu, 26 Feb 2009 17:23:46 +0900
Subject: [Numpy-discussion] Core math library in numpy
In-Reply-To:
References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp>
	<3d375d730902150009geb44dc5x6bb370cb47975717@mail.gmail.com>
	<5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com>
	<5b8d13220902190805s2d12ac96va40b6062959fc53@mail.gmail.com>
	<5b8d13220902231302x42ede056i63a72ce968da9c8b@mail.gmail.com>
	<5b8d13220902252254s6e12f5b0y2f032bbabbe3b843@mail.gmail.com>
Message-ID: <5b8d13220902260023u5f173705x8f55af7c86c1df32@mail.gmail.com>

On Thu, Feb 26, 2009 at 4:32 PM, Charles R Harris wrote:

> Sounds good, I don't think the complex functions are a pressing concern.
> But I suspect we should start looking forward to a code freeze in a month
> or so and getting the build working is a clear priority.

The build is working more or less. Building with VS 2008 works (by
works I mean build and run the testsuite with a couple of failures),
but building mingw-w64 still does not (it builds, but the test suite
crashes at several points - up to now, it was mostly bugs in the
mingw-w64 tools, but I think some errors are ours now). Building with
mingw-w64 is necessary IMO, because I don't see an easy way to build
scipy without a free fortran compiler, and gfortran is the only one I
know on windows 64 (VS 2008 + gfortran breaks horribly on 64 bits ATM).
Restricting numpy without BLAS/LAPACK for windows 64 bits may be another
option: in that case, the infrastructure can be considered ready. In
any case, the windows 64 bits build would be very experimental.

Building the 32 bits version with python 2.6 should work well, too - I
am confident we can release binaries for 2.6 on windows and mac os X
for 1.3, even if it is released in a short time.

> Maybe we should start the
> release process? I have a short list of things I think I should finish up
> and now might be a good time to put together a list of things to do.

Yes, I was thinking about that too. There was a 1.3 thread a couple of
weeks ago, we should summarize it, and set a timeline for 1.3 really
soon. I can do it, unless you want to do it,

David

From charlesr.harris at gmail.com  Thu Feb 26 03:41:38 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Thu, 26 Feb 2009 01:41:38 -0700
Subject: [Numpy-discussion] Core math library in numpy
In-Reply-To: <5b8d13220902260023u5f173705x8f55af7c86c1df32@mail.gmail.com>
References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp>
	<5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com>
	<5b8d13220902190805s2d12ac96va40b6062959fc53@mail.gmail.com>
	<5b8d13220902231302x42ede056i63a72ce968da9c8b@mail.gmail.com>
	<5b8d13220902252254s6e12f5b0y2f032bbabbe3b843@mail.gmail.com>
	<5b8d13220902260023u5f173705x8f55af7c86c1df32@mail.gmail.com>
Message-ID:

On Thu, Feb 26, 2009 at 1:23 AM, David Cournapeau wrote:
> On Thu, Feb 26, 2009 at 4:32 PM, Charles R Harris wrote:
>
> > Sounds good, I don't think the complex functions are a pressing
> > concern. But I suspect we should start looking forward to a code
> > freeze in a month or so and getting the build working is a clear
> > priority.
>
> The build is working more or less. Building with VS 2008 works (by
> works I mean build and run the testsuite with a couple of failures),
> but building mingw-w64 still does not (it builds, but the test suite
> crashes at several points - up to now, it was mostly bugs in the
> mingw-w64 tools, but I think some errors are ours now). Building with
> mingw-w64 is necessary IMO, because I don't see an easy way to build
> scipy without a free fortran compiler, and gfortran is the only one I
> know on windows 64 (VS 2008 + gfortran breaks horribly on 64 bits ATM).
> Restricting numpy without BLAS/LAPACK for windows 64 bits may be another
> option: in that case, the infrastructure can be considered ready. In
> any case, the windows 64 bits build would be very experimental.
>
> Building the 32 bits version with python 2.6 should work well, too - I
> am confident we can release binaries for 2.6 on windows and mac os X
> for 1.3, even if it is released in a short time.
>
> > Maybe we should start the
> > release process? I have a short list of things I think I should finish
> > up and now might be a good time to put together a list of things to do.
>
> Yes, I was thinking about that too. There was a 1.3 thread a couple of
> weeks ago, we should summarize it, and set a timeline for 1.3 really
> soon. I can do it, unless you want to do it,

Why don't you make a start. I can help go through the tickets this
weekend to pick out the ones that need to get fixed up. Umm... come to
think of it, I'll be out of town for a few days starting Sunday, but I'll
do my best.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From david at ar.media.kyoto-u.ac.jp  Thu Feb 26 03:45:59 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Thu, 26 Feb 2009 17:45:59 +0900
Subject: [Numpy-discussion] Core math library in numpy
In-Reply-To:
References: <4997C8E5.1040105@ar.media.kyoto-u.ac.jp>
	<5b8d13220902190716h3d3083cdg30f692276ef91b3f@mail.gmail.com>
	<5b8d13220902190805s2d12ac96va40b6062959fc53@mail.gmail.com>
	<5b8d13220902231302x42ede056i63a72ce968da9c8b@mail.gmail.com>
	<5b8d13220902252254s6e12f5b0y2f032bbabbe3b843@mail.gmail.com>
	<5b8d13220902260023u5f173705x8f55af7c86c1df32@mail.gmail.com>
Message-ID: <49A656C7.2070202@ar.media.kyoto-u.ac.jp>

Charles R Harris wrote:
>
> > Yes, I was thinking about that too. There was a 1.3 thread a couple of
> > weeks ago, we should summarize it, and set a timeline for 1.3 really
> > soon. I can do it, unless you want to do it,
>
> Why don't you make a start.

Ok,

David

From david at ar.media.kyoto-u.ac.jp  Thu Feb 26 05:29:42 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Thu, 26 Feb 2009 19:29:42 +0900
Subject: [Numpy-discussion] coremath branch merged
Message-ID: <49A66F16.9030706@ar.media.kyoto-u.ac.jp>

Hi,

I've just merged the work on the coremath branch into the trunk.
Let me know if you see some problems - I did a quick sanity check on
Linux after the merge, but I have not tested across many platforms,

cheers,

David

From dsdale24 at gmail.com  Thu Feb 26 09:18:34 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Thu, 26 Feb 2009 09:18:34 -0500
Subject: [Numpy-discussion] coremath branch merged
In-Reply-To: <49A66F16.9030706@ar.media.kyoto-u.ac.jp>
References: <49A66F16.9030706@ar.media.kyoto-u.ac.jp>
Message-ID:

On Thu, Feb 26, 2009 at 5:29 AM, David Cournapeau wrote:
> Hi,
>
> I've just merged the work on the coremath branch into the trunk. Let
> me know if you see some problems - I did a quick sanity check on Linux
> after the merge, but I have not tested across many platforms,

I just ran the tests on 64-bit gentoo linux with gcc-4.3.3 and
glibc-2.9_p20081201. Looks good here, all tests pass with the exception
of one known failure.
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From cournape at gmail.com  Thu Feb 26 09:31:53 2009
From: cournape at gmail.com (David Cournapeau)
Date: Thu, 26 Feb 2009 23:31:53 +0900
Subject: [Numpy-discussion] CROSS-COMPILING NUMPY
In-Reply-To: <5b8d13220902161937x5f2b1932i3412c0e90c8c6108@mail.gmail.com>
References: <4997C4A1.8090307@ar.media.kyoto-u.ac.jp>
	<5b8d13220902161937x5f2b1932i3412c0e90c8c6108@mail.gmail.com>
Message-ID: <5b8d13220902260631j3e31105dmd14c48eca362c08d@mail.gmail.com>

On Tue, Feb 17, 2009 at 12:37 PM, David Cournapeau wrote:
> Also, some configuration tests need to be executed on the target
> machine: this needs to be fixed.

Ok, that one at least is fixed. Now, the numpy configuration stage does
not need to execute anything on the target platform. All the tests are
purely compile/link only, no execution. Of course, I cannot check that it
works in a cross-compilation environment, because that's only one step -
but the particular case of windows x86 -> windows amd64 should now work.

cheers,

David

From mudit_19a at yahoo.com  Thu Feb 26 12:48:52 2009
From: mudit_19a at yahoo.com (mudit sharma)
Date: Thu, 26 Feb 2009 23:18:52 +0530 (IST)
Subject: [Numpy-discussion] intersect1d and setmember1d
In-Reply-To:
References: <90CBFFFE6273484B9579400AC950800502024765@ntsydexm01.pc.internal.macquarie.com>
Message-ID: <243385.2089.qm@web94910.mail.in2.yahoo.com>

intersect1d and setmember1d don't give expected results in case there are
duplicate values in either array, because they work by sorting the data
and subtracting the previous value. Is there an alternative in numpy to
get indices of intersected values?

In [31]: p nonzero(setmember1d(v1.Id, v2.Id))[0]
[ 0  1  2  4  5  6  7  8  9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
 25 26 27 28 29]    <-------------- index 2 shouldn't be here; look at
                    the data below.

In [32]: p v1.Id[:10]
[ 232.  232.  233.  233.  234.  234.  235.  235.  237.  237.]

In [33]: p v2.Id[:10]
[ 232.  232.  234.  234.  235.  235.  236.  236.  237.  237.]

In [34]: p setmember1d(v1.Id, v2.Id)
[ True  True  True False  True  True  True  True  True  True  True  True
  True  True  True  True  True  True  True  True  True  True  True  True
  True  True  True  True  True  True]
                    <-------------- index 2 shouldn't be True

In [35]: p setmember1d(v1.Id[:10], v2.Id[:10])
[ True  True  True False  True  True  True  True  True  True]

From zachary.pincus at yale.edu  Thu Feb 26 13:07:21 2009
From: zachary.pincus at yale.edu (Zachary Pincus)
Date: Thu, 26 Feb 2009 13:07:21 -0500
Subject: [Numpy-discussion] intersect1d and setmember1d
In-Reply-To: <243385.2089.qm@web94910.mail.in2.yahoo.com>
References: <90CBFFFE6273484B9579400AC950800502024765@ntsydexm01.pc.internal.macquarie.com>
	<243385.2089.qm@web94910.mail.in2.yahoo.com>
Message-ID: <17AFFA4C-86B5-4C4E-87EA-E71F2E1C67A3@yale.edu>

Hi,

> intersect1d and setmember1d don't give expected results in case there
> are duplicate values in either array, because they work by sorting the
> data and subtracting the previous value. Is there an alternative in
> numpy to get indices of intersected values?

From the docstring for setmember1d (and other set operations), you are
only supposed to pass it arrays with unique values (i.e. arrays that
represent sets in the mathematical sense):

>>> print numpy.setmember1d.__doc__
Return a boolean array set True where first element is in second array.

Boolean array is the shape of `ar1` containing True where the elements
of `ar1` are in `ar2` and False otherwise.

Use unique1d() to generate arrays with only unique elements to use as
inputs to this function. [...]

As stated, use unique1d to generate set-arrays from your input.

On the other hand, intersect1d is supposed to work with repeated
elements:

>>> print numpy.intersect1d.__doc__
Intersection returning repeated or unique elements common to both arrays.

Parameters
----------
ar1,ar2 : array_like
    Input arrays.

Returns
-------
out : ndarray, shape(N,)
    Sorted 1D array of common elements with repeating elements.

See Also
--------
intersect1d_nu : Returns only unique common elements. [...]

Do you have an example of intersect1d not working right? If so, what
version of numpy are you using (print numpy.version.version)?

Zach

On Feb 26, 2009, at 12:48 PM, mudit sharma wrote:

>
> intersect1d and setmember1d don't give expected results in case there
> are duplicate values in either array, because they work by sorting the
> data and subtracting the previous value. Is there an alternative in
> numpy to get indices of intersected values?
>
> In [31]: p nonzero(setmember1d(v1.Id, v2.Id))[0]
> [ 0  1  2  4  5  6  7  8  9 10 11 12 13 14 15 16 17 18 19 20 21 22 23
>  24 25 26 27 28 29]   <-------------- index 2 shouldn't be here; look
>                       at the data below.
>
> In [32]: p v1.Id[:10]
> [ 232.  232.  233.  233.  234.  234.  235.  235.  237.  237.]
>
> In [33]: p v2.Id[:10]
> [ 232.  232.  234.  234.  235.  235.  236.  236.  237.  237.]
>
> In [34]: p setmember1d(v1.Id, v2.Id)
> [ True  True  True False  True  True  True  True  True  True  True  True
>   True  True  True  True  True  True  True  True  True  True  True  True
>   True  True  True  True  True  True]
>                    <-------------- index 2 shouldn't be True
>
> In [35]: p setmember1d(v1.Id[:10], v2.Id[:10])
> [ True  True  True False  True  True  True  True  True  True]
>
>
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From ralph at dont-mind.de  Thu Feb 26 13:42:44 2009
From: ralph at dont-mind.de (Ralph Heinkel)
Date: Thu, 26 Feb 2009 19:42:44 +0100
Subject: [Numpy-discussion] What's wrong with floatint.c example code?
Message-ID: <49A6E2A4.2090401@dont-mind.de>

Hi,

I'm trying to get into the realm of implementing my own numpy data types
in numpy, and doing so I had a look at the floatint.c example coming
from the numpy/doc/newdtype_example directory.

Obviously it is not possible to create an array with the new floatint
type by doing

array([1,2,3,4,5,6,7,8], dtype=floatint)

but instead this works:

array([1,2,3,4,5,6,7,8]).view(ff.floatint_type)

Would anybody mind helping me to understand this behavior? Is the reason
for this that the floatint.c stuff is incomplete and would need some
more elaboration in order to function as a proper type?

Thanks in advance

Ralph

From Chris.Barker at noaa.gov  Thu Feb 26 16:29:32 2009
From: Chris.Barker at noaa.gov (Christopher Barker)
Date: Thu, 26 Feb 2009 13:29:32 -0800
Subject: [Numpy-discussion] Fancy indexing question:
In-Reply-To: <200902241526.42724.lists_ravi@lavabit.com>
References: <49A44CFB.5090907@noaa.gov>
	<200902241526.42724.lists_ravi@lavabit.com>
Message-ID: <49A709BC.2070200@noaa.gov>

Ravi wrote:
> Use ix_:
>
> In [5]: a[ ix_(i,j) ]
> Out[5]:
> array([[ 5,  7],
>        [13, 15],
>        [17, 19]])

Very nice!

thanks,
-Chris

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov

From simpson at math.toronto.edu  Thu Feb 26 17:53:28 2009
From: simpson at math.toronto.edu (Gideon Simpson)
Date: Thu, 26 Feb 2009 17:53:28 -0500
Subject: [Numpy-discussion] casting integers to reals
Message-ID:

I want to do:

numpy.float(numpy.arange(0, 10))

but get the error:

Traceback (most recent call last):
  File "", line 1, in
TypeError: only length-1 arrays can be converted to Python scalars

How should I do this?

-gideon

From gael.varoquaux at normalesup.org  Thu Feb 26 17:55:10 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Thu, 26 Feb 2009 23:55:10 +0100
Subject: [Numpy-discussion] casting integers to reals
In-Reply-To:
References:
Message-ID: <20090226225510.GE23810@phare.normalesup.org>

On Thu, Feb 26, 2009 at 05:53:28PM -0500, Gideon Simpson wrote:
> I want to do:

> numpy.float(numpy.arange(0, 10))

> but get the error:

> Traceback (most recent call last):
>   File "", line 1, in
> TypeError: only length-1 arrays can be converted to Python scalars

> How should I do this?

nump.arange(0, 10.astype(numpy.float)

but in this special case you can do:

numpy.arange(0., 10.)
Gaël

From jdh2358 at gmail.com  Thu Feb 26 18:05:18 2009
From: jdh2358 at gmail.com (John Hunter)
Date: Thu, 26 Feb 2009 17:05:18 -0600
Subject: [Numpy-discussion] numpy save files from C
Message-ID: <88e473830902261505la1f1ec6n41544f34b0aa9297@mail.gmail.com>

A colleague of mine has a bunch of numpy arrays saved with np.save and
he now wants to access them directly in C; with or w/o the numpy C API
doesn't matter. Does anyone have any sample code lying around which he
can borrow from? The array is a structured array with an otherwise
plain vanilla dtype (ints and floats).

I've referred him to the npy-format NEP document, as well as the
format.py implementation, so he can roll his own if need be, but if
someone has a head-start code example that would be great.

Thanks,
JDH

From robert.kern at gmail.com  Thu Feb 26 18:14:57 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 26 Feb 2009 17:14:57 -0600
Subject: [Numpy-discussion] numpy save files from C
In-Reply-To: <88e473830902261505la1f1ec6n41544f34b0aa9297@mail.gmail.com>
References: <88e473830902261505la1f1ec6n41544f34b0aa9297@mail.gmail.com>
Message-ID: <3d375d730902261514w52d97ca4p1c213651932dc853@mail.gmail.com>

On Thu, Feb 26, 2009 at 17:05, John Hunter wrote:
> A colleague of mine has a bunch of numpy arrays saved with np.save and
> he now wants to access them directly in C; with or w/o the numpy C API
> doesn't matter. Does anyone have any sample code lying around which he
> can borrow from? The array is a structured array with an otherwise
> plain vanilla dtype (ints and floats).
>
> I've referred him to the npy-format NEP document, as well as the
> format.py implementation, so he can roll his own if need be, but if
> someone has a head-start code example that would be great.

If anyone does, it would probably be me, but I don't. If you already
know the dtype beforehand, reading the header offset and skipping to
the data is probably the easiest.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From Chris.Barker at noaa.gov  Thu Feb 26 18:23:30 2009
From: Chris.Barker at noaa.gov (Christopher Barker)
Date: Thu, 26 Feb 2009 15:23:30 -0800
Subject: [Numpy-discussion] casting integers to reals
In-Reply-To: <20090226225510.GE23810@phare.normalesup.org>
References: <20090226225510.GE23810@phare.normalesup.org>
Message-ID: <49A72472.8040808@noaa.gov>

Gael Varoquaux wrote:
> nump.arange(0, 10.astype(numpy.float)

I think you meant:

np.arange(0, 10).astype(np.float)

but:

np.arange(0, 10, dtype=np.float)

is a better bet.

> but in this special case you can do:
>
> numpy.arange(0., 10.)

yup -- however, beware, using arange() with floating point numbers
doesn't necessarily work as you would like it to, due to floating point
rounding. np.linspace may be a better bet:

>>> np.linspace(0, 9, 10)
array([ 0.,  1.,  2.,  3.,  4.,  5.,  6.,  7.,  8.,  9.])

-Chris

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov

From efiring at hawaii.edu  Thu Feb 26 18:24:22 2009
From: efiring at hawaii.edu (Eric Firing)
Date: Thu, 26 Feb 2009 13:24:22 -1000
Subject: [Numpy-discussion] casting integers to reals
In-Reply-To:
References:
Message-ID: <49A724A6.8050109@hawaii.edu>

Gideon Simpson wrote:
> I want to do:
>
> numpy.float(numpy.arange(0, 10))
>
> but get the error:
>
> Traceback (most recent call last):
>   File "", line 1, in
> TypeError: only length-1 arrays can be converted to Python scalars
>
> How should I do this?

numpy.arange(0, 10, dtype=float)

or

numpy.arange(0.0, 10.0)

The first is nice because it is very explicit, and it works when the
limits may be variables of any numeric type.

Eric

>
> -gideon
>
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From jonathan.taylor at utoronto.ca  Thu Feb 26 22:00:24 2009
From: jonathan.taylor at utoronto.ca (Jonathan Taylor)
Date: Thu, 26 Feb 2009 22:00:24 -0500
Subject: [Numpy-discussion] Slicing/selection in multiple dimensions simultaneously
In-Reply-To: <46E72116.8040408@enthought.com>
References: <268febdf0709111511n3ca15d42o85d31831178d96a@mail.gmail.com>
	<46E71591.20802@gmail.com>
	<46E72116.8040408@enthought.com>
Message-ID: <463e11f90902261900o748940b6yf8410abda82524cc@mail.gmail.com>

Am I right to assume that there is no elegant way to interact with
slices? i.e. Is there any way to get

a[ix_([2,3,6],:,[3,2])]

to work, so that the dimension is completely specified? Or perhaps the
only way to do this is via

a[ix_([2,3,6],range(a.shape[1]),[3,2])]

If anyone knows a better way?

Thanks,
Jonathan.

On Tue, Sep 11, 2007 at 6:13 PM, Travis E. Oliphant wrote:
> Timothy Hochberg wrote:
>>
>> On 9/11/07, *Robert Kern* wrote:
>>
>>     Mike Ressler wrote:
>>     > The following seems to be a wart: is it expected?
>>     >
>>     > Set up a 10x10 array and some indexing arrays:
>>     >
>>     > a=arange(100)
>>     > a.shape=(10,10)
>>     > q=array([0,2,4,6,8])
>>     > r=array([0,5])
>>     >
>>     > Suppose I want to extract only the "even" numbered rows from a -
>>     > then
>>     >
>>     > print a[q,:]
>>     >
>>     > Every fifth column:
>>     >
>>     > print a[:,r]
>>     >
>>     > Only the even rows of every fifth column:
>>     >
>>     > print a[q,r]
>>     >
>>     ---------------------------------------------------------------------------
>>     >            Traceback (most recent call last)
>>     >
>>     > /.../.../.../ in ()
>>     >
>>     > : shape mismatch: objects cannot be
>>     > broadcast to a single shape
>>     >
>>     > But, this works:
>>     >
>>     > print a[q,:][:,r]
>>     >
>>     > [[ 0  5]
>>     >  [20 25]
>>     >  [40 45]
>>     >  [60 65]
>>     >  [80 85]]
>>     >
>>     > So why does the a[q,r] form have problems? Thanks for your
>>     > insights.
>>
>>     It is intended that the form a[q,r] be the general case: q and r
>>     are broadcasted against each other to a single shape. The result
>>     of the indexing is an array of that broadcasted shape with
>>     elements found by using each pair of elements in the broadcasted
>>     q and r arrays as indices.
>>
>>     There are operations you can express with this form that you
>>     couldn't if the behavior that you expected were the case, whereas
>>     you can get the result you want relatively straightforwardly.
>>
>>     In [6]: a[q[:,newaxis], r]
>>     Out[6]:
>>     array([[ 0,  5],
>>            [20, 25],
>>            [40, 45],
>>            [60, 65],
>>            [80, 85]])
>>
>> At the risk of making Robert grumpy: while it is true the form we
>> ended up with is more general, I've come to the conclusion that it was
>> a bit of a mistake. In the spirit of making simple things simple and
>> complex things possible, I suspect that having fancy-indexing do the
>> obvious thing here[1] and delegating the more powerful but also more
>> difficult to understand case to a function or method would have been
>> overall more useful. Cases where the multidimensional features of
>> fancy-indexing get used are messy enough that they don't benefit much
>> from the conciseness of the indexing notation, at least in my
>> experience.
>
> This is a reasonable argument.  It is reasonable enough that I
> intentionally made an ix_ function to do what you want.
>
> a[ix_(q,r)]
>
> does as originally expected if a bit more line-noise.
>
> -Travis
>
>
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>

From ddf at iqdotdt.com  Fri Feb 27 00:27:57 2009
From: ddf at iqdotdt.com (Delbert Franz)
Date: Thu, 26 Feb 2009 21:27:57 -0800 (PST)
Subject: [Numpy-discussion] Speedup creation of a 3-color array from a 2-d color-index array a color lut
Message-ID: <22236421.post@talk.nabble.com>

I have geotiff files of scanned paper maps that use an indexed color
scheme with a 256-element color lookup table (color lut) and a 9252 by
7420 array of uint8 elements. The color is given by three values. I want
to create an array with shape (9252, 7420, 3) so that I can display the
image without creating internal array working space in Matplotlib that
exceeds 2^31 bytes.

The following three approaches work in that the correct image is
displayed, but all of them are waaaaay too slow:)

Let
  doq have shape (9252, 7420) and have uint8 elements
  ctab have shape (256, 3) and have uint8 elements
  doqq have shape (9252, 7420, 3) and have uint8 elements

#1 The way it would be done in a compiled language, like Fortran, where
it would run in a small fraction of a second instead of several
minutes:) But what I want to ultimately accomplish is hard to do in
Fortran!

for i in range(9252):
    for j in range(7420):
        for k in range(3):
            doqq[i,j,k] = ctab[doq[i,j],k]

#2 Try to use some special numpy features:

for i in doq.flat:
    doqq.flat = ctab[i,0:3]

#3 Variation of #1

for i in range(9252):
    for j in range(7420):
        doqq[i,j,] = ctab[doq[i,j],]

All of these use too many element accesses, which are slow. My searching
and reading of the documentation has not given me an approach that
avoids these low-level and slow (in python/numpy) accesses. I suspect
there is a clever way to do this, but as a newbie I'm having some rough
going on getting this process to be much faster.

Thanks,

Delbert Franz

--
View this message in context: http://www.nabble.com/Speedup-creation-of-a-3-color-array-from-a-2-d-color-index-array-a-color-lut-tp22236421p22236421.html
Sent from the Numpy-discussion mailing list archive at Nabble.com.
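For readers skimming the thread: a reduced, self-contained sketch of the
lookup-table fancy indexing that the one-line answer in the next message
relies on. The shapes are shrunk and the data made up so it runs
instantly; only the indexing semantics are the point:

import numpy as np

np.random.seed(0)
ctab = np.random.randint(0, 256, (256, 3)).astype('u1')  # made-up color LUT
doq = np.random.randint(0, 256, (60, 80)).astype('u1')   # made-up index image

# Indexing a (256, 3) table with a (60, 80) integer array looks up every
# pixel's table row at once: the result has doq's shape plus ctab's
# trailing axis, i.e. (60, 80, 3), and the loop runs in C inside numpy.
doqq = ctab[doq]
assert doqq.shape == (60, 80, 3)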
From robert.kern at gmail.com Fri Feb 27 00:32:35 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 26 Feb 2009 23:32:35 -0600
Subject: [Numpy-discussion] Speedup creation of a 3-color array from a 2-d color-index array a color lut
In-Reply-To: <22236421.post@talk.nabble.com>
References: <22236421.post@talk.nabble.com>
Message-ID: <3d375d730902262132k5a011d14t84317458f9b6cf87@mail.gmail.com>

On Thu, Feb 26, 2009 at 23:27, Delbert Franz wrote:
> [...]
> Let
>     doq have shape (9252, 7420) and have uint8 elements
>     ctab have shape (256, 3) and have uint8 elements
>     doqq have shape (9252, 7420, 3) and have uint8 elements

doqq = ctab[doq]

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth." -- Umberto Eco
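(A small self-check of that fancy-indexing lookup at toy scale, with
random stand-in data rather than the original geotiff arrays; it
matches the triple loop from the question:)

    import numpy as np

    doq = np.random.randint(0, 256, (4, 5)).astype(np.uint8)     # index image
    ctab = np.random.randint(0, 256, (256, 3)).astype(np.uint8)  # color lut

    doqq = ctab[doq]   # shape (4, 5, 3): one table lookup per pixel, done in C

    slow = np.empty(doq.shape + (3,), np.uint8)
    for i in range(doq.shape[0]):
        for j in range(doq.shape[1]):
            slow[i, j] = ctab[doq[i, j]]

    assert (doqq == slow).all()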
From cimrman3 at ntc.zcu.cz Fri Feb 27 04:15:59 2009
From: cimrman3 at ntc.zcu.cz (Robert Cimrman)
Date: Fri, 27 Feb 2009 10:15:59 +0100
Subject: [Numpy-discussion] intersect1d and setmember1d
In-Reply-To: <17AFFA4C-86B5-4C4E-87EA-E71F2E1C67A3@yale.edu>
References: <90CBFFFE6273484B9579400AC950800502024765@ntsydexm01.pc.internal.macquarie.com> <243385.2089.qm@web94910.mail.in2.yahoo.com> <17AFFA4C-86B5-4C4E-87EA-E71F2E1C67A3@yale.edu>
Message-ID: <49A7AF4F.1050409@ntc.zcu.cz>

Zachary Pincus wrote:
> Hi,
>
>> intersect1d and setmember1d don't give expected results in case there
>> are duplicate values in either array, because they work by sorting
>> the data and subtracting the previous value. Is there an alternative
>> in numpy to get indices of intersected values?
>
> From the docstring for setmember1d (and other set operations), you are
> only supposed to pass it arrays with unique values (i.e. arrays that
> represent sets in the mathematical sense):
>
>>>> print numpy.setmember1d.__doc__
> Return a boolean array set True where first element is in second
> array.
>
> Boolean array is the shape of `ar1` containing True where the elements
> of `ar1` are in `ar2` and False otherwise.
>
> Use unique1d() to generate arrays with only unique elements to use as
> inputs to this function. [...]
>
> As stated, use unique1d to generate set-arrays from your input.
>
> On the other hand, intersect1d is supposed to work with repeated
> elements:
>
>>>> print numpy.intersect1d.__doc__
> Intersection returning repeated or unique elements common to both
> arrays.
>
> Parameters
> ----------
> ar1, ar2 : array_like
>     Input arrays.
>
> Returns
> -------
> out : ndarray, shape(N,)
>     Sorted 1D array of common elements with repeating elements. [...]
>
> See Also
> --------
> intersect1d_nu : Returns only unique common elements. [...]
>
> Do you have an example of intersect1d not working right? If so, what
> version of numpy are you using (print numpy.version.version)?
>
> Zach

Hi,

yes, many functions in arraysetops.py ('intersect1d', 'setxor1d',
'setmember1d', 'union1d', 'setdiff1d') were originally meant to work
with arrays of unique elements as inputs. I have just noticed that the
docstring of intersect1d says that it works for non-unique arrays and
contains the following example:

>>> np.intersect1d([1,3,3], [3,1,1])
array([1, 1, 3, 3])

I am not sure if this is a useful behaviour - does anybody use this
"feature" (or better, side-effect)? I would like to change the example
to the usual use case:

In [9]: np.intersect1d([1,2,4,3], [3,1,5])
Out[9]: array([1, 3])

For arrays with non-unique elements, there is:

In [11]: np.intersect1d_nu([1,3,3], [3,1,1])
Out[11]: array([1, 3])

which just calls unique1d() for its arguments.

cheers,
r.
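(A minimal illustration of the intended calling pattern, using the
function names of this numpy era -- unique1d and setmember1d were later
superseded by unique and in1d:)

    import numpy as np

    a = np.array([1, 3, 3, 2])
    b = np.array([3, 1, 1, 4])

    # reduce to true sets first, then apply the set operations
    au = np.unique1d(a)            # [1 2 3]
    bu = np.unique1d(b)            # [1 3 4]

    print np.intersect1d(au, bu)   # [1 3]
    print np.setmember1d(au, bu)   # [ True False  True] -> which of au are in bu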
From nwagner at iam.uni-stuttgart.de Fri Feb 27 10:20:46 2009
From: nwagner at iam.uni-stuttgart.de (Nils Wagner)
Date: Fri, 27 Feb 2009 16:20:46 +0100
Subject: [Numpy-discussion] ValueError: invalid literal for float()
Message-ID: 

Hi all,

Is it possible to modify the behaviour of float wrt the following
situation?

>>> permas_M[0,2]
'1.5699999809265137D+01'
>>> float(permas_M[0,2])
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ValueError: invalid literal for float(): 1.5699999809265137D+01

The following works:

>>> permas_M[0,2] = '1.5699999809265137E+01'
>>> permas_M[0,2]
'1.5699999809265137E+01'
>>> float(permas_M[0,2])
15.699999809265137

Nils

From charlesr.harris at gmail.com Fri Feb 27 10:35:42 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 27 Feb 2009 08:35:42 -0700
Subject: [Numpy-discussion] ValueError: invalid literal for float()
In-Reply-To: References: Message-ID: 

On Fri, Feb 27, 2009 at 8:20 AM, Nils Wagner wrote:
> Is it possible to modify the behaviour of float wrt the following
> situation?
>
> >>> float(permas_M[0,2])
> Traceback (most recent call last):
>   File "<stdin>", line 1, in <module>
> ValueError: invalid literal for float(): 1.5699999809265137D+01

I think float is python, not numpy, but numpy uses the same function
internally to convert from a string.

Chuck
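(One common workaround -- a sketch, not a numpy or python feature: the
'D' exponent is a Fortran output convention, so translate it before
calling float():)

    def fort_float(s):
        # rewrite Fortran-style '1.57D+01' as '1.57E+01', then convert
        return float(s.replace('D', 'E').replace('d', 'e'))

    print fort_float('1.5699999809265137D+01')   # 15.699999809265137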
From animator333 at yahoo.com Fri Feb 27 11:57:29 2009
From: animator333 at yahoo.com (Prashant Saxena)
Date: Fri, 27 Feb 2009 22:27:29 +0530 (IST)
Subject: [Numpy-discussion] image processing using numpy-scipy?
Message-ID: <332268.94796.qm@web94913.mail.in2.yahoo.com>

Hi,

This is a slightly weird problem. I have a black and white image (black
background). The entire image is filled with noisy white patterns of
different size and shape. I need to fill the white patches if their
area is more than a given one. Logically it should be possible to use a
quickfill algorithm for every pixel: if the filled area generated by
that pixel is more than the given area (in pixels), then paint that
patch (filled area) with black.

I have read the docs of PIL but there is no function for this. Can I
use numpy-scipy for the matter? The image size is 1K.

Any help would be greatly appreciated.

Regards,
Prashant

From stefan at sun.ac.za Fri Feb 27 12:26:23 2009
From: stefan at sun.ac.za (Stéfan van der Walt)
Date: Fri, 27 Feb 2009 19:26:23 +0200
Subject: [Numpy-discussion] image processing using numpy-scipy?
In-Reply-To: <332268.94796.qm@web94913.mail.in2.yahoo.com>
References: <332268.94796.qm@web94913.mail.in2.yahoo.com>
Message-ID: <9457e7c80902270926i1904027eu9393ea7a4e478715@mail.gmail.com>

Hi Prashant

2009/2/27 Prashant Saxena :
> [...]

Sure, there are a couple of options. First, look at scipy.ndimage to
see if there is anything there you can use (i.e. maybe binary dilation
is sufficient).

Otherwise, I've got some connected component code at:

http://mentat.za.net/source/connected_components.tar.bz2

(The repository is at http://mentat.za.net/hg/ccomp if you prefer.)

Using that code, you can identify connected regions, and then fill them
up as required.

Regards
Stéfan

From david at ar.media.kyoto-u.ac.jp Fri Feb 27 12:31:50 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Sat, 28 Feb 2009 02:31:50 +0900
Subject: [Numpy-discussion] Call for testing: full blas/lapack builds of numpy on windows 64 bits
Message-ID: <49A82386.9030208@ar.media.kyoto-u.ac.jp>

Hi,

That's a call for testing for 64 bits windows users out there: please
try the following binary with the test suite:

http://www.ar.media.kyoto-u.ac.jp/members/david/archives/numpy/numpy-1.3.0.dev6517.win-amd64-py2.6.exe

python -c "import numpy; numpy.test()"

Report any crash. I am particularly interested in vista and higher
versions without visual studio installed - it looks like those cause a
lot of trouble wrt the C runtime.

*THIS IS ONLY FOR TESTING* - it is still far from being stable.

Details: this one is built with free compilers (gcc + gfortran 4.4),
and BLAS/LAPACK built with gfortran. This should enable openmp usage
too (4.4 supports openmp 3.0). It also means supporting scipy is not
too far away - there are still some issues with C++, but hopefully
those are compiler bugs, and all numpy issues will have been sorted
out.

cheers,

David

From zachary.pincus at yale.edu Fri Feb 27 13:11:43 2009
From: zachary.pincus at yale.edu (Zachary Pincus)
Date: Fri, 27 Feb 2009 13:11:43 -0500
Subject: [Numpy-discussion] image processing using numpy-scipy?
In-Reply-To: <9457e7c80902270926i1904027eu9393ea7a4e478715@mail.gmail.com>
References: <332268.94796.qm@web94913.mail.in2.yahoo.com> <9457e7c80902270926i1904027eu9393ea7a4e478715@mail.gmail.com>
Message-ID: 

Hello,

> Sure, there are a couple of options. First, look at scipy.ndimage to
> see if there is anything there you can use (i.e. maybe binary dilation
> is sufficient).
>
> Otherwise, I've got some connected component code at:
>
> http://mentat.za.net/source/connected_components.tar.bz2
>
> (The repository is at http://mentat.za.net/hg/ccomp if you prefer.)

I think that scipy.ndimage.label also provides connected-component
labeling, with arbitrary connectivity.

But this problem could also be solvable via dilation / erosion, if the
area constraint is loose -- e.g. if you want to quash all blobs smaller
than, say, 5x5 pixels, you could just erode for 5 iterations, and then
dilate the eroded image back for '-1' iterations (which causes the
ndimage algorithm to iterate until no pixels change), using a mask of
the original image (so that no pixels outside of the original blobs are
turned on). This basically means that any object that survives the
original erosion will be dilated back to its initial size. (Similar
tricks can also be used to find / kill objects touching the edges --
use a non-zero constant out-of-bounds value and dilate an all-zeros
array for -1 iterations, using the original array as the mask. Then
only objects touching the edge will get filled in...)

Alternately, if you need a very stringent area threshold -- e.g. remove
all objects with five or fewer total pixels, regardless of their
configuration -- then the connected-components approach is required.
Below is a stab at it, though note that there's a slow step that I
can't right now figure out how to avoid, short of coding just that in C
or with cython or something...

Zach

import numpy
import scipy.ndimage as ndimage

_4_connected = numpy.array([[0, 1, 0],
                            [1, 1, 1],
                            [0, 1, 0]], dtype=bool)

def kill_small(binary_image, min_size, structure=_4_connected):
    label_image, num_objects = ndimage.label(binary_image, structure)
    # Label 0 is the background...
    object_labels = numpy.arange(1, num_objects+1)
    # Get the area of each labeled object by summing up the pixel values
    # in the binary image. (Divide the binary image by its max val to
    # ensure that the image consists of 1s and 0s, so that the sum
    # equals the area in pixels.)
    areas = numpy.array(ndimage.sum(binary_image / binary_image.max(),
                                    label_image, object_labels))
    big_objects = object_labels[areas >= min_size]
    # This part will be pretty slow! But I can't think right now how to
    # speed it up. (If there are more big objects than small objects, it
    # would be faster to reverse this process and xor out the small
    # objects from the binary image, rather than the below, which or's
    # up a new image of just the large objects.)
    big_object_image = numpy.zeros(binary_image.shape, dtype=bool)
    for bo in big_objects:
        big_object_image |= label_image == bo
    return big_object_image
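(An aside on the slow step above: because ndimage.label produces small
consecutive integer labels, the per-label loop can be replaced by the
same lookup-table-style fancy indexing used in the color-lut thread
earlier. A minimal sketch of that variant, reusing the names from
kill_small -- an alternative, not the code from the message:)

    import numpy
    import scipy.ndimage as ndimage

    def kill_small_lut(binary_image, min_size, structure=_4_connected):
        label_image, num_objects = ndimage.label(binary_image, structure)
        object_labels = numpy.arange(1, num_objects + 1)
        areas = numpy.array(ndimage.sum(binary_image.astype(bool),
                                        label_image, object_labels))
        # keep[k] answers "is label k big enough?"; entry 0 is the background
        keep = numpy.concatenate(([False], areas >= min_size))
        # one vectorized lookup instead of one full-image comparison per label
        return keep[label_image]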
From wnbell at gmail.com Fri Feb 27 14:15:48 2009
From: wnbell at gmail.com (Nathan Bell)
Date: Fri, 27 Feb 2009 14:15:48 -0500
Subject: [Numpy-discussion] Call for testing: full blas/lapack builds of numpy on windows 64 bits
In-Reply-To: <49A82386.9030208@ar.media.kyoto-u.ac.jp>
References: <49A82386.9030208@ar.media.kyoto-u.ac.jp>
Message-ID: 

On Fri, Feb 27, 2009 at 12:31 PM, David Cournapeau wrote:
> That's a call for testing for 64 bits windows users out there: please
> try the following binary with the test suite:
> [...]

Running on Vista 64-bits.

C:\Python26>python -c "import numpy; numpy.test()"
Running unit tests for numpy
C:\Python26\lib\site-packages\nose\proxy.py:93: SyntaxWarning: assertion is always true, perhaps remove parentheses?
  assert (test is self.test
NumPy version 1.3.0.dev6517
NumPy is installed in C:\Python26\lib\site-packages\numpy
Python version 2.6.1 (r26:67517, Dec 4 2008, 16:59:09) [MSC v.1500 64 bit (AMD64)]
nose version 0.10.4
Forcing DISTUTILS_USE_SDK=1

Ran 1787 tests in 8.916s

OK (KNOWNFAIL=6, SKIP=1)

Mission accomplished?

--
Nathan Bell wnbell at gmail.com
http://graphics.cs.uiuc.edu/~wnbell/

From cournape at gmail.com Fri Feb 27 14:33:43 2009
From: cournape at gmail.com (David Cournapeau)
Date: Sat, 28 Feb 2009 04:33:43 +0900
Subject: [Numpy-discussion] Call for testing: full blas/lapack builds of numpy on windows 64 bits
In-Reply-To: References: <49A82386.9030208@ar.media.kyoto-u.ac.jp>
Message-ID: <5b8d13220902271133pe80530bj36200e42a1aeafd1@mail.gmail.com>

On Sat, Feb 28, 2009 at 4:15 AM, Nathan Bell wrote:
> Running on Vista 64-bits.
> [...]

Great, thanks. Do you have VS installed ? Did you install python for
all users (I would guess so, but I am not yet clear on all the details
on that matter).

cheers,

David

From animator333 at yahoo.com Fri Feb 27 14:42:47 2009
From: animator333 at yahoo.com (Prashant Saxena)
Date: Sat, 28 Feb 2009 01:12:47 +0530 (IST)
Subject: [Numpy-discussion] image processing using numpy-scipy?
In-Reply-To: References: <332268.94796.qm@web94913.mail.in2.yahoo.com> <9457e7c80902270926i1904027eu9393ea7a4e478715@mail.gmail.com>
Message-ID: <639132.82642.qm@web94910.mail.in2.yahoo.com>

Well, I came up with a slightly different approach. Get the first row
of the image. It would be something like this:

[1,1,1,0,0,0,1,1,0,0,1]

1 = white, 0 = black. Run the floodfill on the [0], [6] and [10]
pixels. If the floodfill area is smaller than the given area, then
paint that area black; if the floodfill area is larger than the given
area, then copy that area to another image of the same dimension, and
paint this area black here too.

This would save tons of per-pixel iteration. The next row would surely
have a smaller number of white pixels to iterate over. Now I need an
efficient floodfill algorithm for the above logic, and I hope this will
work.

Prashant

From: Zachary Pincus
To: Discussion of Numerical Python
Sent: Friday, 27 February, 2009 11:41:43 PM
Subject: Re: [Numpy-discussion] image processing using numpy-scipy?
[...]
From zachary.pincus at yale.edu Fri Feb 27 15:26:55 2009
From: zachary.pincus at yale.edu (Zachary Pincus)
Date: Fri, 27 Feb 2009 15:26:55 -0500
Subject: [Numpy-discussion] Bilateral filter
In-Reply-To: <9457e7c80902141029w451160a2te6f7e6eb57db2d1e@mail.gmail.com>
References: <710F2847B0018641891D9A216027636029C431@ex3.envision.co.il> <9457e7c80902130150h4dc8755eqf6097fc0d20b6522@mail.gmail.com> <710F2847B0018641891D9A216027636029C432@ex3.envision.co.il> <9457e7c80902141029w451160a2te6f7e6eb57db2d1e@mail.gmail.com>
Message-ID: <2966A073-2846-4950-8A48-F170990503D8@yale.edu>

Hi all,

I just grabbed the latest bilateral filter from Stéfan's repository,
but I can't get it to work! I'm using a recent numpy SVN and the latest
release of cython...

In [10]: bl = bilateral.bilateral(image, 2, 150)
---------------------------------------------------------------------------
NameError                                 Traceback (most recent call last)

/Users/zpincus/Documents/Research/Slack Lab/Experimental Data/Big Data/2009-2-17/<ipython console> in <module>()

/Library/Frameworks/Python.framework/Versions/2.5/lib/python2.5/site-packages/bilateral/bilateral.pyc in bilateral(mat, xy_sig, z_sig, filter_size, mode)
     36     '''
     37
---> 38     filter_fcn = _BB.Bilat_fcn(xy_sig, z_sig, filter_size)
     39     size = filter_fcn.xy_size
     40     return generic_filter(mat, filter_fcn.cfilter, size=size, mode=mode)

/Library/Frameworks/Python.framework/Versions/2.5/lib/python2.5/site-packages/bilateral/bilateral_base.so in bilateral.bilateral_base.Bilat_fcn.__init__ (bilateral/bilateral_base.c:590)()

NameError: np

I know very little about the internals of cython, so I can't figure
this error out... it seems to be a problem with the 'cimport numpy as
np' line in the pyx file, but beyond that I'm lost.

Let me know if I can provide any additional information,

Zach

On Feb 14, 2009, at 1:29 PM, Stéfan van der Walt wrote:
> Hi Nadav
>
> 2009/2/13 Nadav Horesh :
>> I tried the installer and it did not copy bilateral.py. I tried to
>> improve it and the result is in the attached file. I hope it would
>> pass the mail filter; if not, contact me directly at the email
>> address below.
>
> Thanks! I applied your changes and modified the setup.py to support
> in-place building. Again available here:
>
> http://github.com/stefanv/bilateral/tree/master
>
> Cheers
> Stéfan
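(One generic thing to check with an error like this -- a guess, not a
diagnosis of this particular package: in Cython, 'cimport numpy' only
provides compile-time declarations, so any np.* name used at Python run
time also needs the ordinary import. A minimal .pyx skeleton under that
assumption, not the actual bilateral_base.pyx:)

    # illustrative skeleton only
    import numpy as np        # run-time names: np.zeros, np.float64, ...
    cimport numpy as np       # compile-time declarations: np.ndarray, np.float64_t

    np.import_array()         # initialize numpy's C API before buffers are used

    def make_window(int size):
        # this np is the run-time module brought in by the plain import
        return np.zeros((size, size), dtype=np.float64)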
From dwf at cs.toronto.edu Fri Feb 27 15:35:03 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Fri, 27 Feb 2009 15:35:03 -0500
Subject: [Numpy-discussion] Slicing/selection in multiple dimensions simultaneously
In-Reply-To: <463e11f90902261900o748940b6yf8410abda82524cc@mail.gmail.com>
References: <268febdf0709111511n3ca15d42o85d31831178d96a@mail.gmail.com> <46E71591.20802@gmail.com> <46E72116.8040408@enthought.com> <463e11f90902261900o748940b6yf8410abda82524cc@mail.gmail.com>
Message-ID: <069E94BE-B877-47C8-A723-703A7E3620B9@cs.toronto.edu>

Hey Jon,

On 26-Feb-09, at 10:00 PM, Jonathan Taylor wrote:
> Am I right to assume that there is no elegant way to interact with
> slices, i.e. is there any way to get
>
> a[ix_([2,3,6],:,[3,2])]
>
> to work? [...]

a[[2,3,6],:,:][:,:,[3,2]] should do what you want.

In [21]: a = randn(50,50,50)
In [22]: all(a[ix_([2,3,6],range(a.shape[1]),[3,2])] == a[[2,3,6],:,:][:,:,[3,2]])
Out[22]: True

David

From robert.kern at gmail.com Fri Feb 27 15:38:52 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 27 Feb 2009 14:38:52 -0600
Subject: [Numpy-discussion] Slicing/selection in multiple dimensions simultaneously
In-Reply-To: <463e11f90902261900o748940b6yf8410abda82524cc@mail.gmail.com>
References: <268febdf0709111511n3ca15d42o85d31831178d96a@mail.gmail.com> <46E71591.20802@gmail.com> <46E72116.8040408@enthought.com> <463e11f90902261900o748940b6yf8410abda82524cc@mail.gmail.com>
Message-ID: <3d375d730902271238p7fe29192hb953df2c5f87c245@mail.gmail.com>

On Thu, Feb 26, 2009 at 21:00, Jonathan Taylor wrote:
> Am I right to assume that there is no elegant way to interact with
> slices? [...]

One could probably make ix_() take slice objects, too, to generate the
correct arange() in the appropriate place.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth." -- Umberto Eco
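(A rough sketch of that idea -- a hypothetical helper, not part of
numpy: it needs the array's shape so each slice can be expanded against
the right axis length before handing off to ix_():)

    import numpy as np

    def ix_s(shape, *indices):
        # expand any slice against its axis length, then defer to np.ix_
        expanded = [np.arange(*idx.indices(n)) if isinstance(idx, slice) else idx
                    for idx, n in zip(indices, shape)]
        return np.ix_(*expanded)

    a = np.arange(2 * 3 * 4).reshape(2, 3, 4)
    b = a[ix_s(a.shape, [0, 1], slice(None), [3, 2])]
    print b.shape   # (2, 3, 2)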
From robert.kern at gmail.com Fri Feb 27 15:44:57 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 27 Feb 2009 14:44:57 -0600
Subject: [Numpy-discussion] Speedup creation of a 3-color array from a 2-d color-index array a color lut
In-Reply-To: <200902271241.51386.ddf@iqdotdt.com>
References: <200902271241.51386.ddf@iqdotdt.com>
Message-ID: <3d375d730902271244n1402d1dk847b50a905ce7a82@mail.gmail.com>

On Fri, Feb 27, 2009 at 14:41, Delbert Franz wrote:
>> doqq = ctab[doq]
>
> Thanks, Robert! It works just fine but it "blows my mind":) Never
> would have thought of trying that. I'll have to work hard at getting
> my "mind around" how Numpy "views the universe".

The full range of indexing is discussed here:

http://docs.scipy.org/doc/numpy/reference/arrays.indexing.html

> I'm just starting to come to terms with what "object oriented" means
> in Python. In 46 years of developing software, one is always on a
> learning curve, sometimes relatively flat, but sometimes nearly
> vertical--this was a moment on the vertical:)

Good. That means you're learning quickly. :-)

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth." -- Umberto Eco

From ddf at iqdotdt.com Fri Feb 27 15:41:51 2009
From: ddf at iqdotdt.com (Delbert Franz)
Date: Fri, 27 Feb 2009 12:41:51 -0800
Subject: [Numpy-discussion] Speedup creation of a 3-color array from a 2-d color-index array a color lut
In-Reply-To: References: Message-ID: <200902271241.51386.ddf@iqdotdt.com>

> On Thu, Feb 26, 2009 at 23:27, Delbert Franz wrote:
> > [...]
> >  Let
> >    doq have shape (9252, 7420) and have uint8 elements
> >    ctab have shape (256, 3) and have uint8 elements
> >    doqq have shape (9252, 7420, 3) and have uint8 elements
>
> doqq = ctab[doq]

Thanks, Robert! It works just fine but it "blows my mind":) Never would
have thought of trying that. I'll have to work hard at getting my "mind
around" how Numpy "views the universe". I'm just starting to come to
terms with what "object oriented" means in Python. In 46 years of
developing software, one is always on a learning curve, sometimes
relatively flat, but sometimes nearly vertical--this was a moment on
the vertical:)

Delbert
From wnbell at gmail.com Fri Feb 27 16:08:24 2009
From: wnbell at gmail.com (Nathan Bell)
Date: Fri, 27 Feb 2009 16:08:24 -0500
Subject: [Numpy-discussion] Call for testing: full blas/lapack builds of numpy on windows 64 bits
In-Reply-To: <5b8d13220902271133pe80530bj36200e42a1aeafd1@mail.gmail.com>
References: <49A82386.9030208@ar.media.kyoto-u.ac.jp> <5b8d13220902271133pe80530bj36200e42a1aeafd1@mail.gmail.com>
Message-ID: 

On Fri, Feb 27, 2009 at 2:33 PM, David Cournapeau wrote:
> Great, thanks. Do you have VS installed ? Did you install python for
> all users (I would guess so, but I am not yet clear on all the details
> on that matter).

I do not have VS installed. I just downloaded the official Python 2.6.1
msi installer and blindly clicked 'next' a few times. I then installed
nose from the source .tar.gz.

Python was installed to C:\Python26\, which I assume means all users.

--
Nathan Bell wnbell at gmail.com
http://graphics.cs.uiuc.edu/~wnbell/

From dsdale24 at gmail.com Fri Feb 27 16:46:48 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Fri, 27 Feb 2009 16:46:48 -0500
Subject: [Numpy-discussion] Call for testing: full blas/lapack builds of numpy on windows 64 bits
In-Reply-To: References: <49A82386.9030208@ar.media.kyoto-u.ac.jp> <5b8d13220902271133pe80530bj36200e42a1aeafd1@mail.gmail.com>
Message-ID: 

On Fri, Feb 27, 2009 at 4:08 PM, Nathan Bell wrote:
> I do not have VS installed. I just downloaded the official Python
> 2.6.1 msi installer and blindly clicked 'next' a few times. [...]

The Python-2.6.1 msi will only install for all users; "install just for
me" is not selectable.

Your binary would not install against the 32-bit python-2.6.1. That's
good - I forgot to check which python installer it gave me by default.
I removed the 32-bit python and installed the 64-bit version. In the
past I have always had to run python module installer exe's with
administrator privileges, and that's what I did this time with your
numpy installer. I got the same results as Nathan. However, I have the
following packages installed:

VC++ 2005 redistributable (32 and 64 bit)
VC++ 2008 express edition sp1
VC++ 2008 redistributable x86
MVS 2008 remote debugger light (x64)
MS windows sdk for VS 2008 headers and libraries
MS windows sdk for VS 2008 SP1 express tools for .Net
MS windows sdk for VS 2008 SP1 express tools for win32

Have you considered distributing the windows installer as an msi? I
don't think you need to run those installers with admin privs on Vista.

Darren
From bsouthey at gmail.com Fri Feb 27 17:01:59 2009
From: bsouthey at gmail.com (Bruce Southey)
Date: Fri, 27 Feb 2009 16:01:59 -0600
Subject: [Numpy-discussion] Call for testing: full blas/lapack builds of numpy on windows 64 bits
In-Reply-To: References: <49A82386.9030208@ar.media.kyoto-u.ac.jp> <5b8d13220902271133pe80530bj36200e42a1aeafd1@mail.gmail.com>
Message-ID: <49A862D7.60009@gmail.com>

Nathan Bell wrote:
> I do not have VS installed. I just downloaded the official Python
> 2.6.1 msi installer and blindly clicked 'next' a few times. [...]

Hi,

After much frustration and disabling my antivirus software (McAfee), I
was able to get the 'import numpy' to work and the tests to pass as
expected. Of course, after turning it back on and restarting the IDLE
gui a few times, the 'import numpy' crashes. This also happens on the
command line.

This is clearly related to the antivirus software, because enabling or
disabling the 'on-access scan' is usually sufficient for import failure
or success, respectively. Perhaps something to do with MinGW on Vista?

I am running an Intel Quad system (QX6700) with Vista Business with
service pack 1 - I don't think VS is installed.

Bruce
From bgerke at slac.stanford.edu Fri Feb 27 19:26:27 2009
From: bgerke at slac.stanford.edu (Brian Gerke)
Date: Fri, 27 Feb 2009 16:26:27 -0800
Subject: [Numpy-discussion] problem with assigning to recarrays
Message-ID: 

Hi-

I'm quite new to numpy and to python in general, so I apologize if I'm
missing something obvious, but I've come across some seemingly nasty
behavior when trying to assign values to the fields of an indexed
subarray of a numpy record array. Perhaps an example would explain it
best.

First, I make a boring record array:

    In [124]: r = rec.fromarrays([zeros(5), zeros(5)], names='field1,field2')

This has five elements with two fields; all values are zero. Now I can
change the values for field1 for a few of the array elements:

    In [125]: r[1].field1 = 1
    In [126]: r[3].field1 = 1

Let's check and make sure that worked:

    In [127]: print r.field1
    [ 0.  1.  0.  1.  0.]

So far, so good. Now I want to change the value of field2 for those
same elements:

    In [128]: r[where(r.field1 == 1.)].field2 = 1

Ok, so now the values of field2 have been changed for those elements,
right?

    In [129]: r.field2
    Out[129]: array([ 0.,  0.,  0.,  0.,  0.])

Wait. What? That can't be right. Let's check again:

    In [130]: print r[where(r.field1 == 1.)].field2
    [ 0.  0.]

Ok, so it appears that I can *access* fields in this array with an
array of indices, but I can't assign new values to fields so accessed.
However, I *can* change the values if I use a scalar index. This is
different from the behavior of ordinary arrays, for which I can
reassign elements' values either way.

Moreover, when I try to reassign record array fields by indexing with
an array of indices, it would appear that nothing at all happens. This
syntax is equivalent to the pass command.

So, my question is this: is there some reason for this behavior in
record arrays, which is unexpectedly different from the behavior of
normal arrays, and rather confusing? If so, why does the attempt to
assign values to fields of an indexed subarray not raise some kind of
error, rather than doing nothing? I think it's unlikely that I've
actually found a bug in numpy, but this behavior does not make sense to
me.

Thanks for any insights,

Brian
From robert.kern at gmail.com Fri Feb 27 19:30:24 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 27 Feb 2009 18:30:24 -0600
Subject: [Numpy-discussion] problem with assigning to recarrays
In-Reply-To: References: Message-ID: <3d375d730902271630w5f924425t9a88939273c32cce@mail.gmail.com>

On Fri, Feb 27, 2009 at 18:26, Brian Gerke wrote:
> Now I want to change the value of field2 for those same elements:
>
>     In [128]: r[where(r.field1 == 1.)].field2 = 1
> [...]
> I think it's unlikely that I've actually found a bug in numpy, but
> this behavior does not make sense to me.

r[where(r.field1 == 1.)] makes a copy. There is no way for us to
construct a view onto the original memory for this circumstance given
numpy's memory model.

r[where(r.field1 == 1.)].field2 = 0.0 assigns to the copy.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth." -- Umberto Eco

From pgmdevlist at gmail.com Fri Feb 27 20:05:15 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Fri, 27 Feb 2009 20:05:15 -0500
Subject: [Numpy-discussion] problem with assigning to recarrays
In-Reply-To: References: Message-ID: 

As a follow-up to Robert's answer:

>>> r[r.field1 == 1].field2 = 1

doesn't work, but

>>> r.field2[r.field1 == 1] = 1

does.
From bgerke at slac.stanford.edu Fri Feb 27 20:06:33 2009
From: bgerke at slac.stanford.edu (Brian Gerke)
Date: Fri, 27 Feb 2009 17:06:33 -0800
Subject: [Numpy-discussion] problem with assigning to recarrays
In-Reply-To: <3d375d730902271630w5f924425t9a88939273c32cce@mail.gmail.com>
References: <3d375d730902271630w5f924425t9a88939273c32cce@mail.gmail.com>
Message-ID: <2E4F7165-0A1D-47EB-9AF5-64093D99056B@slac.stanford.edu>

On Feb 27, 2009, at 4:30 PM, Robert Kern wrote:
> r[where(r.field1 == 1.)] makes a copy. There is no way for us to
> construct a view onto the original memory for this circumstance given
> numpy's memory model.

Many thanks for the quick reply. I assume that this is true only for
record arrays, not for ordinary arrays? Certainly I can make an
assignment in this way with a normal array.

Also, if it is truly impossible to change this behavior, or to have it
raise an error--then are there any best-practice suggestions for how to
remember and avoid running into this non-obvious behavior? If one
thinks of record arrays as inheriting from numpy arrays, then this
problem is certainly unexpected.

Also, I've just found that the following syntax does do what is
expected:

(r.field2)[where(field1 == 1.)] = 1.

It is at least a little aesthetically displeasing that the syntax works
one way but not the other. Perhaps my best bet is to stick with this
syntax and forget that the other exists? A less-than-satisfying
solution, but workable.

Brian
From dwf at cs.toronto.edu Fri Feb 27 23:54:42 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Fri, 27 Feb 2009 23:54:42 -0500
Subject: [Numpy-discussion] Slicing/selection in multiple dimensions simultaneously
In-Reply-To: <069E94BE-B877-47C8-A723-703A7E3620B9@cs.toronto.edu>
References: <268febdf0709111511n3ca15d42o85d31831178d96a@mail.gmail.com> <46E71591.20802@gmail.com> <46E72116.8040408@enthought.com> <463e11f90902261900o748940b6yf8410abda82524cc@mail.gmail.com> <069E94BE-B877-47C8-A723-703A7E3620B9@cs.toronto.edu>
Message-ID: <6CF9CBA8-B21A-44F2-BE77-218DBBC05648@cs.toronto.edu>

On 27-Feb-09, at 3:35 PM, David Warde-Farley wrote:
> a[[2,3,6],:,:][:,:,[3,2]] should do what you want.

Slightly more elegantly (I always forget about this syntax):

a[[2,3,6], ...][..., [3,2]]

David

From robert.kern at gmail.com Sat Feb 28 01:58:49 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Sat, 28 Feb 2009 00:58:49 -0600
Subject: [Numpy-discussion] problem with assigning to recarrays
In-Reply-To: <2E4F7165-0A1D-47EB-9AF5-64093D99056B@slac.stanford.edu>
References: <3d375d730902271630w5f924425t9a88939273c32cce@mail.gmail.com> <2E4F7165-0A1D-47EB-9AF5-64093D99056B@slac.stanford.edu>
Message-ID: <3d375d730902272258k32be62a7j8120064ad1017de8@mail.gmail.com>

On Fri, Feb 27, 2009 at 19:06, Brian Gerke wrote:
> Many thanks for the quick reply. I assume that this is true only for
> record arrays, not for ordinary arrays? Certainly I can make an
> assignment in this way with a normal array.

Well, you are doing two very different things. Let's back up a bit.
Python gives us two hooks to modify an object in-place with an
assignment: __setitem__ and __setattr__.

x[<index>] = y  ==>  x.__setitem__(<index>, y)
x.<attr> = y    ==>  x.__setattr__('<attr>', y)

Now, we don't need to restrict ourselves to just variables for 'x'; we
can have any expression that evaluates to an object.

(<expr>)[<index>] = y  ==>  (<expr>).__setitem__(<index>, y)
(<expr>).<attr> = y    ==>  (<expr>).__setattr__('<attr>', y)

The key here is that the (<expr>) on the LHS is evaluated just like any
expression appearing anywhere else in your code. The only special
in-place behavior is restricted to the *outermost* [] or . operation.
So when you do this:

r[where(r.field1 == 1.)].field2 = 1.0

it translates to something like this:

tmp = r.__getitem__(where(r.field1 == 1.0))  # Makes a copy!
tmp.__setattr__('field2', 1.0)

Note that the first line is a __getitem__, not a __setitem__ which can
modify r in-place.

> Also, if it is truly impossible to change this behavior, or to have
> it raise an error--then are there any best-practice suggestions for
> how to remember and avoid running into this non-obvious behavior? If
> one thinks of record arrays as inheriting from numpy arrays, then
> this problem is certainly unexpected.

It's a natural consequence of the preceding rules. This is a Python
thing, not a difference between numpy arrays and record arrays. Just
keep those rules in mind.

> Also, I've just found that the following syntax does do what is
> expected:
>
> (r.field2)[where(field1 == 1.)] = 1.
>
> It is at least a little aesthetically displeasing that the syntax
> works one way but not the other. Perhaps my best bet is to stick with
> this syntax and forget that the other exists? A less-than-satisfying
> solution, but workable.

If you drop the extraneous bits, it becomes a fair bit more readable:

r.field2[r.field1 == 1] = 1

This is idiomatic; you'll see it all over the place where record arrays
are used. The reason that this form modifies r in-place is because
r.__getattr__('field2') is able to return a view rather than a copy.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth." -- Umberto Eco
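(A small way to see the view-versus-copy distinction directly, assuming
a numpy recent enough to ship np.may_share_memory, which reports
whether two arrays could overlap in memory:)

    import numpy as np

    r = np.rec.fromarrays([np.zeros(5), np.zeros(5)], names='field1,field2')

    print np.may_share_memory(r, r.field2)             # True  -> field access is a view
    print np.may_share_memory(r, r[np.array([1, 3])])  # False -> fancy indexing copies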
From stefan at sun.ac.za Sat Feb 28 04:09:42 2009
From: stefan at sun.ac.za (Stéfan van der Walt)
Date: Sat, 28 Feb 2009 11:09:42 +0200
Subject: [Numpy-discussion] Bilateral filter
In-Reply-To: <2966A073-2846-4950-8A48-F170990503D8@yale.edu>
References: <710F2847B0018641891D9A216027636029C431@ex3.envision.co.il> <9457e7c80902130150h4dc8755eqf6097fc0d20b6522@mail.gmail.com> <710F2847B0018641891D9A216027636029C432@ex3.envision.co.il> <9457e7c80902141029w451160a2te6f7e6eb57db2d1e@mail.gmail.com> <2966A073-2846-4950-8A48-F170990503D8@yale.edu>
Message-ID: <9457e7c80902280109y304d8557s6dfd517e88ee2600@mail.gmail.com>

Hi Zachary

2009/2/27 Zachary Pincus :
> I just grabbed the latest bilateral filter from Stéfan's repository,
> but I can't get it to work! I'm using a recent numpy SVN and the
> latest release of cython...

Just to do a sanity check, did you do:

python setup.py build_ext -i

and then run the example? I've built this using the development version
of Cython, but your version sounds new enough.

Regards
Stéfan

From pav at iki.fi Sat Feb 28 07:11:44 2009
From: pav at iki.fi (Pauli Virtanen)
Date: Sat, 28 Feb 2009 12:11:44 +0000 (UTC)
Subject: [Numpy-discussion] Tickets for closing
Message-ID: 

Hi,

Could someone with the proper Trac permissions close the following
tickets:

    http://scipy.org/scipy/numpy/ticket/884
    http://scipy.org/scipy/numpy/ticket/902
        Fixed in trunk, when fix_float_format was merged back.

    http://scipy.org/scipy/numpy/ticket/955
        Fixed in trunk + tests.

    http://scipy.org/scipy/numpy/changeset/6283
        Addressed in trunk?

Also, the following could be checked/reviewed:

    http://scipy.org/scipy/numpy/ticket/975
        Possibly also fixed?

    http://scipy.org/scipy/numpy/ticket/629
        Does the fix_float_format branch address printing longdouble
        nans (which I guess was the last issue)? Could the currently
        commented-out test be uncommented, cf. at the end of
        numpy/core/tests/test_scalarmath.py? Is float32 still a
        problem? (The test passes for me.)

-- Pauli Virtanen

From cournape at gmail.com Sat Feb 28 07:30:27 2009
From: cournape at gmail.com (David Cournapeau)
Date: Sat, 28 Feb 2009 21:30:27 +0900
Subject: [Numpy-discussion] Tickets for closing
In-Reply-To: References: Message-ID: <5b8d13220902280430tffec78dybc45837a719be404@mail.gmail.com>

On Sat, Feb 28, 2009 at 9:11 PM, Pauli Virtanen wrote:
> Could someone with the proper Trac permissions close the following
> tickets:
>
>     http://scipy.org/scipy/numpy/ticket/884
>     http://scipy.org/scipy/numpy/ticket/902
>         Fixed in trunk, when fix_float_format was merged back.
>
>     http://scipy.org/scipy/numpy/ticket/955
>         Fixed in trunk + tests.

Closed.

>     http://scipy.org/scipy/numpy/changeset/6283
>         Addressed in trunk?

I think there are issues with this on windows (removing files with open
handles fails); I will check and comment/close depending on the result.

>     http://scipy.org/scipy/numpy/ticket/975
>         Possibly also fixed?
>
>     http://scipy.org/scipy/numpy/ticket/629
>         Does the fix_float_format branch address printing longdouble
>         nans (which I guess was the last issue)?

Yes, it should fix this long-standing issue, but only in some cases I
believe (when printing is not done through our work in numpyos.c). I
will still check that the test works, at least.

David
From pav at iki.fi Sat Feb 28 08:41:04 2009
From: pav at iki.fi (Pauli Virtanen)
Date: Sat, 28 Feb 2009 13:41:04 +0000 (UTC)
Subject: [Numpy-discussion] Tickets for closing
References: <5b8d13220902280430tffec78dybc45837a719be404@mail.gmail.com>
Message-ID: 

Sat, 28 Feb 2009 21:30:27 +0900, David Cournapeau wrote:

[clip: #951]
> I think there are issues with this on windows (removing files with
> open handles fails); I will check and comment/close depending on the
> result.

[clip: #629]
> Yes, it should fix this long-standing issue, but only in some cases I
> believe (when printing is not done through our work in numpyos.c). I
> will still check that the test works, at least.

Thanks!

-- Pauli Virtanen

From neilcrighton at gmail.com Sat Feb 28 08:56:42 2009
From: neilcrighton at gmail.com (Neil)
Date: Sat, 28 Feb 2009 13:56:42 +0000 (UTC)
Subject: [Numpy-discussion] intersect1d and setmember1d
References: <90CBFFFE6273484B9579400AC950800502024765@ntsydexm01.pc.internal.macquarie.com> <243385.2089.qm@web94910.mail.in2.yahoo.com>
Message-ID: 

mudit sharma yahoo.com> writes:
> intersect1d and setmember1d don't give expected results in case there
> are duplicate values in either array, because they work by sorting the
> data and subtracting the previous value. Is there an alternative in
> numpy to get indices of intersected values?
>
> In [31]: p nonzero(setmember1d(v1.Id, v2.Id))[0]
> [ 0  1  2  4  5  6  7  8  9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29]
>     <-- index 2 shouldn't be here; look at the data below.
>
> In [32]: p v1.Id[:10]
> [ 232.  232.  233.  233.  234.  234.  235.  235.  237.  237.]
>
> In [33]: p v2.Id[:10]
> [ 232.  232.  234.  234.  235.  235.  236.  236.  237.  237.]

As far as I know there isn't an obvious way to get the functionality of
setmember1d working on non-unique inputs. However, I've needed this
operation quite a lot, so here's a function I wrote that does it. It's
only a few times slower than numpy's setmember1d. You're welcome to use
it.

import numpy as np

def ismember(a1, a2):
    """ Test whether items from a2 are in a1.

    This does the same thing as np.setmember1d, but works on
    non-unique arrays. Only a few (2-4) times slower than
    np.setmember1d, and a lot faster than [i in a2 for i in a1].

    An example that np.setmember1d gets wrong:

    >>> a1 = np.array([5,4,5,3,4,4,3,4,3,5,2,1,5,5])
    >>> a2 = [2,3,4]
    >>> mask = ismember(a1,a2)
    >>> a1[mask]
    array([4, 3, 4, 4, 3, 4, 3, 2])
    """
    a2 = set(a2)
    a1 = np.asarray(a1)
    ind = a1.argsort()
    a1 = a1[ind]
    mask = []
    # need this bit because prev is not defined for first item
    item = a1[0]
    if item in a2:
        mask.append(True)
        a2.remove(item)
    else:
        mask.append(False)
    prev = item
    # main loop
    for item in a1[1:]:
        if item == prev:
            mask.append(mask[-1])
        elif item in a2:
            mask.append(True)
            prev = item
            a2.remove(item)
        else:
            mask.append(False)
            prev = item
    # restore mask to original ordering of a1 and return
    mask = np.array(mask)
    return mask[ind.argsort()]
An example that np.setmember1d gets wrong: >>> a1 = np.array([5,4,5,3,4,4,3,4,3,5,2,1,5,5]) >>> a2 = [2,3,4] >>> mask = ismember(a1,a2) >>> a1[mask] array([4, 3, 4, 4, 3, 4, 3, 2]) """ a2 = set(a2) a1 = np.asarray(a1) ind = a1.argsort() a1 = a1[ind] mask = [] # need this bit because prev is not defined for first item item = a1[0] if item in a2: mask.append(True) a2.remove(item) else: mask.append(False) prev = item # main loop for item in a1[1:]: if item == prev: mask.append(mask[-1]) elif item in a2: mask.append(True) prev = item a2.remove(item) else: mask.append(False) prev = item # restore mask to original ordering of a1 and return mask = np.array(mask) return mask[ind.argsort()] From pav at iki.fi Sat Feb 28 09:08:38 2009 From: pav at iki.fi (Pauli Virtanen) Date: Sat, 28 Feb 2009 14:08:38 +0000 (UTC) Subject: [Numpy-discussion] RFR: #1008 Loss of precision in (complex) arcsinh & arctanh Message-ID: http://scipy.org/scipy/numpy/ticket/1008 http://codereview.appspot.com/22054 -- Pauli Virtanen From stefan at sun.ac.za Sat Feb 28 09:17:38 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Sat, 28 Feb 2009 16:17:38 +0200 Subject: [Numpy-discussion] RFR: #1008 Loss of precision in (complex) arcsinh & arctanh In-Reply-To: References: Message-ID: <9457e7c80902280617n43090245kfa72c410e0d4a1e9@mail.gmail.com> Hey Pauli 2009/2/28 Pauli Virtanen : > > http://scipy.org/scipy/numpy/ticket/1008 > http://codereview.appspot.com/22054 Fantastic patch! Please apply. Cheers St?fan From cournape at gmail.com Sat Feb 28 10:34:41 2009 From: cournape at gmail.com (David Cournapeau) Date: Sun, 1 Mar 2009 00:34:41 +0900 Subject: [Numpy-discussion] RFR: #1008 Loss of precision in (complex) arcsinh & arctanh In-Reply-To: References: Message-ID: <5b8d13220902280734i3baaeac3h1473535223b43@mail.gmail.com> On Sat, Feb 28, 2009 at 11:08 PM, Pauli Virtanen wrote: > > http://scipy.org/scipy/numpy/ticket/1008 > > http://codereview.appspot.com/22054 I added a few comments - the only significant one concerns types for unit tests: I think it would be nice to test for float and long double as well. David From jonathan.taylor at utoronto.ca Sat Feb 28 12:27:07 2009 From: jonathan.taylor at utoronto.ca (Jonathan Taylor) Date: Sat, 28 Feb 2009 12:27:07 -0500 Subject: [Numpy-discussion] Slicing/selection in multiple dimensions simultaneously In-Reply-To: <6CF9CBA8-B21A-44F2-BE77-218DBBC05648@cs.toronto.edu> References: <268febdf0709111511n3ca15d42o85d31831178d96a@mail.gmail.com> <46E71591.20802@gmail.com> <46E72116.8040408@enthought.com> <463e11f90902261900o748940b6yf8410abda82524cc@mail.gmail.com> <069E94BE-B877-47C8-A723-703A7E3620B9@cs.toronto.edu> <6CF9CBA8-B21A-44F2-BE77-218DBBC05648@cs.toronto.edu> Message-ID: <463e11f90902280927k42a01ae5j7c0ed87ece03dca0@mail.gmail.com> Hi Dave. This does seem like the only way to write this nicely. Unfortunately, I think this may be wasteful memory wise (in contrast to what the obvious matlab code would do) as it constructs an array with the whole first index intact at first. I think I will use it anyways though as I find the other notations pretty awkward. Jon. On Fri, Feb 27, 2009 at 11:54 PM, David Warde-Farley wrote: > > On 27-Feb-09, at 3:35 PM, David Warde-Farley wrote: >> >> a[[2,3,6],:,:][:,:,[3,2]] should do what you want. 
From cournape at gmail.com Sat Feb 28 13:51:31 2009
From: cournape at gmail.com (David Cournapeau)
Date: Sun, 1 Mar 2009 03:51:31 +0900
Subject: [Numpy-discussion] Call for testing: full blas/lapack builds of numpy on windows 64 bits
In-Reply-To: References: <49A82386.9030208@ar.media.kyoto-u.ac.jp> <5b8d13220902271133pe80530bj36200e42a1aeafd1@mail.gmail.com>
Message-ID: <5b8d13220902281051n31332145m2f23556e48f261ea@mail.gmail.com>

On Sat, Feb 28, 2009 at 6:08 AM, Nathan Bell wrote:
> I do not have VS installed. I just downloaded the official Python
> 2.6.1 msi installer and blindly clicked 'next' a few times. I then
> installed nose from the source .tar.gz.
>
> Python was installed to C:\Python26\, which I assume means all users.

Great, thank you very much for this information. It looks like we will
be able to provide a 64 bits numpy binary for 1.3.0.

David

From nadavh at visionsense.com Sat Feb 28 13:53:41 2009
From: nadavh at visionsense.com (Nadav Horesh)
Date: Sat, 28 Feb 2009 20:53:41 +0200
Subject: [Numpy-discussion] Bilateral filter
References: <710F2847B0018641891D9A216027636029C431@ex3.envision.co.il> <9457e7c80902130150h4dc8755eqf6097fc0d20b6522@mail.gmail.com> <710F2847B0018641891D9A216027636029C432@ex3.envision.co.il> <9457e7c80902141029w451160a2te6f7e6eb57db2d1e@mail.gmail.com> <2966A073-2846-4950-8A48-F170990503D8@yale.edu>
Message-ID: <710F2847B0018641891D9A216027636029C450@ex3.envision.co.il>

I could not reproduce your error. I am using the latest official
releases of numpy and cython on a linux box (do you use a Mac?). I am
attaching the package I have on my PC, for the small chance it would
help.

Nadav.

-----Original Message-----
From: numpy-discussion-bounces at scipy.org on behalf of Zachary Pincus
Sent: Friday, 27 February 2009 22:26
To: Discussion of Numerical Python
Subject: Re: [Numpy-discussion] Bilateral filter

[...]

(Attachment scrubbed: winmail.dat, application/ms-tnef, 7031 bytes)
From cournape at gmail.com Sat Feb 28 13:51:31 2009
From: cournape at gmail.com (David Cournapeau)
Date: Sun, 1 Mar 2009 03:51:31 +0900
Subject: [Numpy-discussion] Call for testing: full blas/lapack builds of numpy on windows 64 bits
In-Reply-To:
References: <49A82386.9030208@ar.media.kyoto-u.ac.jp> <5b8d13220902271133pe80530bj36200e42a1aeafd1@mail.gmail.com>
Message-ID: <5b8d13220902281051n31332145m2f23556e48f261ea@mail.gmail.com>

On Sat, Feb 28, 2009 at 6:08 AM, Nathan Bell wrote:
> On Fri, Feb 27, 2009 at 2:33 PM, David Cournapeau wrote:
>>
>> Great, thanks. Do you have VS installed? Did you install python for
>> all users (I would guess so, but I am not yet clear on all the details
>> on that matter).
>
> I do not have VS installed. I just downloaded the official Python
> 2.6.1 msi installer and blindly clicked 'next' a few times. I then
> installed nose from the source .tar.gz.
>
> Python was installed to C:\Python26\, which I assume means all users.

Great, thank you very much for this information. It looks like we will
be able to provide a 64-bit numpy binary for 1.3.0.

David

From nadavh at visionsense.com Sat Feb 28 13:53:41 2009
From: nadavh at visionsense.com (Nadav Horesh)
Date: Sat, 28 Feb 2009 20:53:41 +0200
Subject: [Numpy-discussion] Bilateral filter
References: <710F2847B0018641891D9A216027636029C431@ex3.envision.co.il> <9457e7c80902130150h4dc8755eqf6097fc0d20b6522@mail.gmail.com> <710F2847B0018641891D9A216027636029C432@ex3.envision.co.il> <9457e7c80902141029w451160a2te6f7e6eb57db2d1e@mail.gmail.com> <2966A073-2846-4950-8A48-F170990503D8@yale.edu>
Message-ID: <710F2847B0018641891D9A216027636029C450@ex3.envision.co.il>

I could not reproduce your error. I am using the latest official
releases of numpy and cython on a Linux box (do you use a Mac?). I am
attaching the package I have on my PC, on the small chance it would help.

Nadav.

-----Original Message-----
From: numpy-discussion-bounces at scipy.org on behalf of Zachary Pincus
Sent: Friday, 27 February 2009 22:26
To: Discussion of Numerical Python
Subject: Re: [Numpy-discussion] Bilateral filter

Hi all,

I just grabbed the latest bilateral filter from Stéfan's repository,
but I can't get it to work! I'm using a recent numpy SVN and the
latest release of cython...

In [10]: bl = bilateral.bilateral(image, 2, 150)
---------------------------------------------------------------------------
NameError                                 Traceback (most recent call last)

/Users/zpincus/Documents/Research/Slack Lab/Experimental Data/Big Data/2009-2-17/<ipython console> in <module>()

/Library/Frameworks/Python.framework/Versions/2.5/lib/python2.5/site-packages/bilateral/bilateral.pyc in bilateral(mat, xy_sig, z_sig, filter_size, mode)
     36     '''
     37
---> 38     filter_fcn = _BB.Bilat_fcn(xy_sig, z_sig, filter_size)
     39     size = filter_fcn.xy_size
     40     return generic_filter(mat, filter_fcn.cfilter, size=size, mode=mode)

/Library/Frameworks/Python.framework/Versions/2.5/lib/python2.5/site-packages/bilateral/bilateral_base.so in bilateral.bilateral_base.Bilat_fcn.__init__ (bilateral/bilateral_base.c:590)()

NameError: np

I know very little about the internals of cython, so I can't figure
this error out... it seems to be a problem with the 'cimport numpy as
np' line in the pyx file, but beyond that I'm lost.

Let me know if I can provide any additional information,

Zach

On Feb 14, 2009, at 1:29 PM, Stéfan van der Walt wrote:

> Hi Nadav
>
> 2009/2/13 Nadav Horesh :
>> I tried the installer and it did not copy bilateral.py. I tried to
>> improve it and the result is in the attached file. I hope it would
>> pass the mail filter; if not, contact me directly at the email
>> address below.
>
> Thanks! I applied your changes and modified the setup.py to support
> in-place building. Again available here:
>
> http://github.com/stefanv/bilateral/tree/master
>
> Cheers
> Stéfan

From cournape at gmail.com Sat Feb 28 13:57:39 2009
From: cournape at gmail.com (David Cournapeau)
Date: Sun, 1 Mar 2009 03:57:39 +0900
Subject: [Numpy-discussion] Call for testing: full blas/lapack builds of numpy on windows 64 bits
In-Reply-To: <49A862D7.60009@gmail.com>
References: <49A82386.9030208@ar.media.kyoto-u.ac.jp> <5b8d13220902271133pe80530bj36200e42a1aeafd1@mail.gmail.com> <49A862D7.60009@gmail.com>
Message-ID: <5b8d13220902281057g4a53ccb1i8fdbf37cc511116e@mail.gmail.com>

On Sat, Feb 28, 2009 at 7:01 AM, Bruce Southey wrote:
> Hi,
> After much frustration and disabling my antivirus software (McAfee), I
> was able to get the 'import numpy' to work and tests to pass as
> expected. Of course, after turning it back on and restarting the IDLE
> gui a few times, the 'import numpy' crashes. This also happens on the
> command line.

Hm, let's assume this is a problem of the antivirus software, and not
of mingw :) One way to test this would be to see whether a binary built
with VS 2008 causes the same trouble.

> This is clearly related to the antivirus software, because enabling or
> disabling the 'on-access scan' is usually sufficient to cause import
> failure or success, respectively. Perhaps something to do with MinGW
> on Vista?

Do you have this problem only with numpy? I noticed those kinds of
crashes (right at import time) on my Server 2008, but I really could
not understand what was wrong - it crashes half of the time, and when
it does not, it works perfectly (I could run the test suite 10 times
without a crash). Maybe some file-related failures and runtime
incompatibilities.
There are so many combinations on Windows which can influence this
that I am perfectly fine with saying that Windows + AV is not
supported :)

David

From cournape at gmail.com Sat Feb 28 14:00:10 2009
From: cournape at gmail.com (David Cournapeau)
Date: Sun, 1 Mar 2009 04:00:10 +0900
Subject: [Numpy-discussion] Call for testing: full blas/lapack builds of numpy on windows 64 bits
In-Reply-To:
References: <49A82386.9030208@ar.media.kyoto-u.ac.jp> <5b8d13220902271133pe80530bj36200e42a1aeafd1@mail.gmail.com>
Message-ID: <5b8d13220902281100k21f0f799yd82c4bc2b5c53471@mail.gmail.com>

On Sat, Feb 28, 2009 at 6:46 AM, Darren Dale wrote:
> VC++ 2005 redistributable (32 and 64 bit)
> VC++ 2008 express edition sp1
> VC++ 2008 redistributable x86
> MVS 2008 remote debugger light (x64)
> MS windows sdk for VS 2008 headers and libraries
> MS windows sdk for VS 2008 SP1 express tools for .Net
> MS windows sdk for VS 2008 SP1 express tools for win32
>
> Have you considered distributing the windows installer as an msi? I
> don't think you need to run those installers with admin privs on Vista.

The point was mainly to see if mingw could produce usable binaries.
MSI or not should not matter, normally (but we never know, this is
Windows after all) - hopefully we will be able to produce an MSI for
the release. Producing an MSI is a bit of a pain because of a stupid
check against development versions which I don't know how to remove
(if distutils detects that the version is a dev version, it refuses to
build the msi).

cheers,

David

> Darren
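For anyone curious, the check David mentions appears to be distutils'
strict parse of the version string inside bdist_msi -- that location is an
assumption, not confirmed here, but the parse failure itself is easy to
demonstrate, along with the obvious workaround of stripping the suffix:

from distutils.version import StrictVersion

version = "1.3.0.dev6359"            # a numpy dev version string

try:
    StrictVersion(version)           # the strict parse rejects '.dev' tags
except ValueError:
    print "rejected:", version

msi_version = version.split(".dev")[0]
print StrictVersion(msi_version)     # "1.3.0" parses cleanly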
From nadavh at visionsense.com Sat Feb 28 14:19:41 2009
From: nadavh at visionsense.com (Nadav Horesh)
Date: Sat, 28 Feb 2009 21:19:41 +0200
Subject: [Numpy-discussion] Bilateral filter
References: <710F2847B0018641891D9A216027636029C431@ex3.envision.co.il> <9457e7c80902130150h4dc8755eqf6097fc0d20b6522@mail.gmail.com> <710F2847B0018641891D9A216027636029C432@ex3.envision.co.il> <9457e7c80902141029w451160a2te6f7e6eb57db2d1e@mail.gmail.com> <2966A073-2846-4950-8A48-F170990503D8@yale.edu> <710F2847B0018641891D9A216027636029C450@ex3.envision.co.il>
Message-ID: <710F2847B0018641891D9A216027636029C453@ex3.envision.co.il>

A correction: The light distribution with two LEDs would be problematic.

Nadav

-----Original Message-----
From: numpy-discussion-bounces at scipy.org on behalf of Nadav Horesh
Sent: Saturday, 28 February 2009 20:53
To: Discussion of Numerical Python
Subject: RE: [Numpy-discussion] Bilateral filter

I could not reproduce your error. [...]

From nadavh at visionsense.com Sat Feb 28 14:47:31 2009
From: nadavh at visionsense.com (Nadav Horesh)
Date: Sat, 28 Feb 2009 21:47:31 +0200
Subject: [Numpy-discussion] Bilateral filter
References: <710F2847B0018641891D9A216027636029C431@ex3.envision.co.il> <9457e7c80902130150h4dc8755eqf6097fc0d20b6522@mail.gmail.com> <710F2847B0018641891D9A216027636029C432@ex3.envision.co.il> <9457e7c80902141029w451160a2te6f7e6eb57db2d1e@mail.gmail.com> <2966A073-2846-4950-8A48-F170990503D8@yale.edu> <710F2847B0018641891D9A216027636029C450@ex3.envision.co.il> <710F2847B0018641891D9A216027636029C453@ex3.envision.co.il>
Message-ID: <710F2847B0018641891D9A216027636029C454@ex3.envision.co.il>

Sorry, wrong addressing.

-----Original Message-----
From: numpy-discussion-bounces at scipy.org on behalf of Nadav Horesh
Sent: Saturday, 28 February 2009 21:19
To: Discussion of Numerical Python; Discussion of Numerical Python
Subject: Re: [Numpy-discussion] Bilateral filter

A correction: The light distribution with two LEDs would be problematic. [...]
From zachary.pincus at yale.edu Sat Feb 28 17:18:17 2009
From: zachary.pincus at yale.edu (Zachary Pincus)
Date: Sat, 28 Feb 2009 17:18:17 -0500
Subject: [Numpy-discussion] Bilateral filter
In-Reply-To: <710F2847B0018641891D9A216027636029C450@ex3.envision.co.il>
References: <710F2847B0018641891D9A216027636029C431@ex3.envision.co.il> <9457e7c80902130150h4dc8755eqf6097fc0d20b6522@mail.gmail.com> <710F2847B0018641891D9A216027636029C432@ex3.envision.co.il> <9457e7c80902141029w451160a2te6f7e6eb57db2d1e@mail.gmail.com> <2966A073-2846-4950-8A48-F170990503D8@yale.edu> <710F2847B0018641891D9A216027636029C450@ex3.envision.co.il>
Message-ID: <25CF9D14-91C8-4BDB-A824-7365EDBD6130@yale.edu>

Hi all,

So, I re-grabbed the latest bilateral code from the repository, and
did the following:

[axlotl:~/Developer/Python/stefanv-bilateral] zpincus% python setup.py build_ext -i
[axlotl:~/Developer/Python/stefanv-bilateral] zpincus% ipython

In [1]: import numpy
In [2]: import bilateral
In [3]: bilateral.bilateral(numpy.ones((10,10)), 4, 4)
---------------------------------------------------------------------------
NameError                                 Traceback (most recent call last)

/Users/zpincus/Developer/Python/stefanv-bilateral/<ipython console> in <module>()
/Users/zpincus/Developer/Python/stefanv-bilateral/bilateral/bilateral.pyc in bilateral(mat, xy_sig, z_sig, filter_size, mode)
     36     '''
     37
---> 38     filter_fcn = _BB.Bilat_fcn(xy_sig, z_sig, filter_size)
     39     size = filter_fcn.xy_size
     40     return generic_filter(mat, filter_fcn.cfilter, size=size, mode=mode)

/Users/zpincus/Developer/Python/stefanv-bilateral/bilateral/bilateral_base.so in bilateral.bilateral_base.Bilat_fcn.__init__ (bilateral/bilateral_base.c:590)()

NameError: np

In [4]: print numpy.version.version
1.3.0.dev6359
In [5]: import Cython.Compiler.Version
In [6]: print Cython.Compiler.Version.version
0.10.3

I'm using Python 2.5.2 on OS X 10.5.6. Oddly, when I run the cython
tests, numpy_test fails to even compile because it can't find
'numpy/arrayobject.h' -- I assume this is some problem with their test
harness. However, I noticed that the numpy_test.pyx file does both:

'cimport numpy as np'
and
'import numpy as np'

so I added the second line to the bilateral_base.pyx file...
re-running the above then gets a bit farther, but things choke again,
with an "AttributeError: 'numpy.ndarray' object has no attribute
'dimensions'", which comes from bilateral_base.c:1060, i.e.
bilateral_base.pyx line 73.

This is perplexing. I wonder if the problem is using a dev numpy
version but a few-months-old cython. (Unfortunately, I don't have
mercurial installed, so I can't easily grab the dev cython -- there's
no simple way to download a mercurial repository over HTTP with wget
or whatnot, which is annoying.)

Any thoughts? In the meantime, I guess I'll install mercurial so I can
try the latest cython.

Zach

On Feb 28, 2009, at 1:53 PM, Nadav Horesh wrote:

> I could not reproduce your error. I am using the latest official
> releases of numpy and cython on a Linux box (do you use a Mac?). I am
> attaching the package I have on my PC, on the small chance it would
> help.
>
> Nadav. [...]
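The cimport/import distinction Zach ran into is easy to get bitten by:
'cimport numpy' only makes the compile-time declarations available, while
any attribute looked up on np at run time needs an ordinary import as
well. A minimal .pyx sketch of the pattern -- the module and function
here are made up for illustration, not part of the bilateral package:

# demo.pyx -- hypothetical example
cimport numpy as np    # compile-time: C-level types and buffer declarations
import numpy as np     # run-time: binds the actual module object to `np`

def double_it(np.ndarray[np.float64_t, ndim=1] x):
    # np.empty_like is a run-time attribute lookup on the module, so it
    # needs the plain `import` above; without it, `np` raises NameError.
    cdef np.ndarray[np.float64_t, ndim=1] out = np.empty_like(x)
    cdef int i
    for i in range(x.shape[0]):
        out[i] = 2.0 * x[i]
    return out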
From zachary.pincus at yale.edu Sat Feb 28 17:37:25 2009
From: zachary.pincus at yale.edu (Zachary Pincus)
Date: Sat, 28 Feb 2009 17:37:25 -0500
Subject: [Numpy-discussion] Bilateral filter
In-Reply-To: <25CF9D14-91C8-4BDB-A824-7365EDBD6130@yale.edu>
References: <710F2847B0018641891D9A216027636029C431@ex3.envision.co.il> <9457e7c80902130150h4dc8755eqf6097fc0d20b6522@mail.gmail.com> <710F2847B0018641891D9A216027636029C432@ex3.envision.co.il> <9457e7c80902141029w451160a2te6f7e6eb57db2d1e@mail.gmail.com> <2966A073-2846-4950-8A48-F170990503D8@yale.edu> <710F2847B0018641891D9A216027636029C450@ex3.envision.co.il> <25CF9D14-91C8-4BDB-A824-7365EDBD6130@yale.edu>
Message-ID:

Well, the latest cython doesn't help -- both errors still appear as
below. (Also, the latest cython can't run the numpy tests either.) I'm
befuddled.

Zach

On Feb 28, 2009, at 5:18 PM, Zachary Pincus wrote:

> Hi all,
>
> So, I re-grabbed the latest bilateral code from the repository, and
> did the following: [...]
From stefan at sun.ac.za Sat Feb 28 18:15:04 2009
From: stefan at sun.ac.za (Stéfan van der Walt)
Date: Sun, 1 Mar 2009 01:15:04 +0200
Subject: [Numpy-discussion] Bilateral filter
In-Reply-To:
References: <710F2847B0018641891D9A216027636029C431@ex3.envision.co.il> <9457e7c80902130150h4dc8755eqf6097fc0d20b6522@mail.gmail.com> <710F2847B0018641891D9A216027636029C432@ex3.envision.co.il> <9457e7c80902141029w451160a2te6f7e6eb57db2d1e@mail.gmail.com> <2966A073-2846-4950-8A48-F170990503D8@yale.edu> <710F2847B0018641891D9A216027636029C450@ex3.envision.co.il> <25CF9D14-91C8-4BDB-A824-7365EDBD6130@yale.edu>
Message-ID: <9457e7c80902281515n223156d9m6379b5dafe4a8352@mail.gmail.com>

Hey Zach

2009/3/1 Zachary Pincus :
> Well, the latest cython doesn't help -- both errors still appear as
> below. (Also, the latest cython can't run the numpy tests either.) I'm
> befuddled.

That's pretty weird. Did you remove the .so that was built, as well as
any source files, before doing build_ext with the latest Cython? Also
good to make sure that the latest Cython is, in fact, the one being
used.

Sorry, these are sanity checks, but I don't have anything better.

Cheers
Stéfan

From nadavh at visionsense.com Sat Feb 28 18:58:48 2009
From: nadavh at visionsense.com (Nadav Horesh)
Date: Sun, 1 Mar 2009 01:58:48 +0200
Subject: [Numpy-discussion] Bilateral filter
References: <710F2847B0018641891D9A216027636029C431@ex3.envision.co.il> <9457e7c80902130150h4dc8755eqf6097fc0d20b6522@mail.gmail.com> <710F2847B0018641891D9A216027636029C432@ex3.envision.co.il> <9457e7c80902141029w451160a2te6f7e6eb57db2d1e@mail.gmail.com> <2966A073-2846-4950-8A48-F170990503D8@yale.edu> <710F2847B0018641891D9A216027636029C450@ex3.envision.co.il> <25CF9D14-91C8-4BDB-A824-7365EDBD6130@yale.edu> <9457e7c80902281515n223156d9m6379b5dafe4a8352@mail.gmail.com>
Message-ID: <710F2847B0018641891D9A216027636029C457@ex3.envision.co.il>

I tested again, and got no errors, with the following configuration:

  cython 0.10.4
  numpy 1.3.0.dev6519 (the latest)
  gcc 4.1.2
  python 2.5.4
  amd64 Gentoo Linux

Nadav.

-----Original Message-----
From: numpy-discussion-bounces at scipy.org on behalf of Stéfan van der Walt
Sent: Sunday, 1 March 2009 01:15
To: Discussion of Numerical Python
Subject: Re: [Numpy-discussion] Bilateral filter

Hey Zach [...]
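Stéfan's clean-rebuild suggestion can be scripted. A small sketch in
Python -- the paths are the ones from the session above, and the script
itself is hypothetical, so adjust to taste:

import glob, os, shutil, subprocess
import Cython.Compiler.Version

# First sanity check: which Cython actually gets imported?
print Cython.Compiler.Version.version

# Remove the generated C file, any stale extension modules, and the
# distutils build directory, so build_ext cannot reuse old output.
for path in glob.glob('bilateral/bilateral_base.c') + glob.glob('bilateral/*.so'):
    os.remove(path)
if os.path.isdir('build'):
    shutil.rmtree('build')

# Regenerate and rebuild in place.
subprocess.check_call(['python', 'setup.py', 'build_ext', '-i'])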
From bsouthey at gmail.com Sat Feb 28 22:26:17 2009
From: bsouthey at gmail.com (Bruce Southey)
Date: Sat, 28 Feb 2009 21:26:17 -0600
Subject: [Numpy-discussion] Call for testing: full blas/lapack builds of numpy on windows 64 bits
In-Reply-To: <5b8d13220902281057g4a53ccb1i8fdbf37cc511116e@mail.gmail.com>
References: <49A82386.9030208@ar.media.kyoto-u.ac.jp> <5b8d13220902271133pe80530bj36200e42a1aeafd1@mail.gmail.com> <49A862D7.60009@gmail.com> <5b8d13220902281057g4a53ccb1i8fdbf37cc511116e@mail.gmail.com>
Message-ID:

On Sat, Feb 28, 2009 at 12:57 PM, David Cournapeau wrote:
> On Sat, Feb 28, 2009 at 7:01 AM, Bruce Southey wrote:
>
>> Hi,
>> After much frustration and disabling my antivirus software (McAfee), I
>> was able to get the 'import numpy' to work and tests to pass as
>> expected. [...]
>
> Hm, let's assume this is a problem of the antivirus software, and not
> of mingw :) One way to test this would be to see whether a binary
> built with VS 2008 causes the same trouble.

Or just a Vista bug. A possible option could be using wine64
(http://wiki.winehq.org/Wine64), but that is probably more work, and
even if it did work it may not be informative.

> Do you have this problem only with numpy? [...] There are so many
> combinations on Windows which can influence this that I am perfectly
> fine with saying that Windows + AV is not supported :)

I virtually avoid Windows for work, so I cannot answer your question. I
knew about the issues from another person trying to compile code using
MinGW on an almost identical 64-bit Vista system. Even the related
thread seemed to have only solutions that worked for some people, and I
do not know whether a solution was found in that case.

Anyhow, I do agree that having Python 2.6 support is more important
than running the anti-virus software.

Bruce