From fperez.net at gmail.com Thu Feb 1 01:03:50 2007 From: fperez.net at gmail.com (Fernando Perez) Date: Wed, 31 Jan 2007 23:03:50 -0700 Subject: [Numpy-discussion] [SciPy-user] What is the best way to install on a Mac? In-Reply-To: References: <723eb6930701310617v42a4a098s5582ef6dff01e8d4@mail.gmail.com> <45C0C5F5.2000100@unc.edu> <20070131223800.GA20121@avicenna.cc.columbia.edu> Message-ID: On 1/31/07, Sanjiv Das wrote: > OK - will give that a shot as well. It's a good suggestion! > cheers Great. And it looks like the bulk of that work has already been done for you, as I found out today thanks to the good and trusty Google Alerts service: http://mohinish.blogspot.com/2007/01/python-for-matlab-users-on-mac.html Feel free to open up a page on the ipython wiki: http://ipython.scipy.org/moin Just make a login for yourself and type away! Cheers, f

From fperez.net at gmail.com Thu Feb 1 01:05:51 2007 From: fperez.net at gmail.com (Fernando Perez) Date: Wed, 31 Jan 2007 23:05:51 -0700 Subject: [Numpy-discussion] [SciPy-user] What is the best way to install on a Mac? In-Reply-To: References: <723eb6930701310617v42a4a098s5582ef6dff01e8d4@mail.gmail.com> <45C0C5F5.2000100@unc.edu> <20070131223800.GA20121@avicenna.cc.columbia.edu> Message-ID: On 1/31/07, Fernando Perez wrote: > On 1/31/07, Sanjiv Das wrote: > > OK - will give that a shot as well. It's a good suggestion! > > cheers And I should add: thanks! Recent discussions on this list indicate there's a real need for this information, so a complete, step-by-step summary of it in a purely tutorial manner may well help many people. Having it in a publicly editable format will ensure it can stay up to date as the problem evolves. Cheers, f

From zpincus at stanford.edu Thu Feb 1 03:28:17 2007 From: zpincus at stanford.edu (Zachary Pincus) Date: Thu, 1 Feb 2007 00:28:17 -0800 Subject: [Numpy-discussion] array copy-to-self and views Message-ID: <66C0943A-1019-4B59-B5B6-F0D13EFF8BA3@stanford.edu> Hello folks, I recently was trying to write code to modify an array in-place (so as not to invalidate any references to that array) via the standard python idiom for lists, e.g.:

a[:] = numpy.flipud(a)

Now, flipud returns a view on 'a', so assigning that to 'a[:]' provides pretty strange results as the buffer that is being read (the view) is simultaneously modified. Here is an example:

In [2]: a = numpy.arange(10).reshape((5,2))
In [3]: a
Out[3]:
array([[0, 1],
       [2, 3],
       [4, 5],
       [6, 7],
       [8, 9]])
In [4]: numpy.flipud(a)
Out[4]:
array([[8, 9],
       [6, 7],
       [4, 5],
       [2, 3],
       [0, 1]])
In [5]: a[:] = numpy.flipud(a)
In [6]: a
Out[6]:
array([[8, 9],
       [6, 7],
       [4, 5],
       [6, 7],
       [8, 9]])

A question, then: Does this represent a bug? Or perhaps there is a better idiom for modifying an array in-place than 'a[:] = ...'? Or is it incumbent on the user to ensure that any time an array is directly modified, that the modifying array is not a view of the original array?
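(For what it's worth, the only variant of the idiom I've found that behaves as expected is to force a copy of the right-hand side before assigning -- a minimal sketch:

import numpy
a = numpy.arange(10).reshape((5,2))
a[:] = numpy.flipud(a).copy()   # copy first, so the buffer being read is not the one being written
# a is now fully flipped, and any other references to 'a' see the change

though that gives up whatever economy the view was providing.)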
Thanks for any thoughts, Zach Pincus Program in Biomedical Informatics and Department of Biochemistry Stanford University School of Medicine From peridot.faceted at gmail.com Thu Feb 1 04:07:29 2007 From: peridot.faceted at gmail.com (Anne Archibald) Date: Thu, 1 Feb 2007 04:07:29 -0500 Subject: [Numpy-discussion] array copy-to-self and views In-Reply-To: <66C0943A-1019-4B59-B5B6-F0D13EFF8BA3@stanford.edu> References: <66C0943A-1019-4B59-B5B6-F0D13EFF8BA3@stanford.edu> Message-ID: On 01/02/07, Zachary Pincus wrote: > I recently was trying to write code to modify an array in-place (so > as not to invalidate any references to that array) via the standard > python idiom for lists, e.g.: You can do this, but if your concern is invalidating references, you might want to think about rearranging your application so you can just return "new" arrays (that may share elements), if that is possible. > a[:] = numpy.flipud(a) > > Now, flipud returns a view on 'a', so assigning that to 'a[:]' > provides pretty strange results as the buffer that is being read (the > view) is simultaneously modified. Here is an example: > A question, then: Does this represent a bug? Or perhaps there is a > better idiom for modifying an array in-place than 'a[:] = ...'? Or is > incumbent on the user to ensure that any time an array is directly > modified, that the modifying array is not a view of the original array? It's the user's job to keep them separate. Sorry. If you're worried - say if it's an array you don't have much control over (so it might share elements without you knowing), you can either return a new array, or if you must modify it in place, copy the right-hand side before using it (a[:]=flipud(a).copy()). It would in principle be possible for numpy to provide a function that tells you if two arrays might share data (simply compare the pointer to the malloc()ed storage and ignore strides and offset; a bit conservative but probably Good Enough, though a bit more cleverness should let one get the Right Answer efficiently). Anne M. Archibald From emsellem at obs.univ-lyon1.fr Thu Feb 1 07:24:56 2007 From: emsellem at obs.univ-lyon1.fr (Eric Emsellem) Date: Thu, 01 Feb 2007 13:24:56 +0100 Subject: [Numpy-discussion] problem with installation of numpy: undefined symbols Message-ID: <45C1DC18.6010200@obs.univ-lyon1.fr> Hi, after trying to solve an installation problem with scipy, I had to reinstall everything from scratch, and so I now turned back to numpy the installation of which does not work for me (which may in fact explain the pb I had with scipy). To be clear on what I do: - I install blas first, and create a libfblas.a which I copy under /usr/local/lib (using a g77 -fno-second-underscore -O2 -c *.f, ar r libfblas.a *.o, ranlib libfblas.a) - I then define BLAS environment variable accordingly (/usr/local/lib/libfblas.a) - I compile lapack-3.1.0, using "make lapacklib" - I then use a precompiled ATLAS Linux version P4SSE2 (I tried compiling it but it does have the same result) - I copy all ATLAS .a and .h in /usr/local/lib/atlas, and define the ATLAS variable accordingly - I then merge the *.o to have an extended liblapack.a by doing the usual: cd /usr/local/lib/atlas cp liblapack.a liblapack.a_ATLAS mkdir tmp cd tmp ar x ../liblapack.a_ATLAS cp /soft/python/tar/Science_Packages/lapack-3.1.0/lapack_LINUX.a ../liblapack.a ar r ../liblapack.a *.o - I finally get the svn version of numpy, and do the "python setup.py install" (no site.cfg ! 
but environment variables defined as mentioned above) - I go into ipython, and try: import numpy and I get:

exceptions.ImportError Traceback (most recent call last)
................
---> 40 import linalg
................
ImportError: /usr/local/lib/python2.4/site-packages/numpy/linalg/lapack_lite.so: undefined symbol: ATL_cGetNB

I have tried many ways to avoid this problem but did not manage to solve it. Any help is welcome there. (and sorry for those who already saw my numerous posts on the scipy forum: I am trying to get to the heart of the pb...) thanks in advance! Eric

From robert.kern at gmail.com Thu Feb 1 11:08:34 2007 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 01 Feb 2007 10:08:34 -0600 Subject: [Numpy-discussion] problem with installation of numpy: undefined symbols In-Reply-To: <45C1DC18.6010200@obs.univ-lyon1.fr> References: <45C1DC18.6010200@obs.univ-lyon1.fr> Message-ID: <45C21082.8060406@gmail.com> Eric Emsellem wrote: > - I finally get the svn version of numpy, and do the "python setup.py > install" (no site.cfg ! but environment variables defined as mentioned > above) Show us the output of

$ cd ~/src/numpy # or wherever
$ python setup.py config

Most likely, you are having the same problem you had with scipy. You will probably need to make a site.cfg with the correct information about ATLAS. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From Chris.Barker at noaa.gov Thu Feb 1 12:38:12 2007 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Thu, 01 Feb 2007 09:38:12 -0800 Subject: [Numpy-discussion] array copy-to-self and views In-Reply-To: <66C0943A-1019-4B59-B5B6-F0D13EFF8BA3@stanford.edu> References: <66C0943A-1019-4B59-B5B6-F0D13EFF8BA3@stanford.edu> Message-ID: <45C22584.4060904@noaa.gov> Zachary Pincus wrote: > Hello folks, > > I recently was trying to write code to modify an array in-place (so > as not to invalidate any references to that array) I'm not sure what this means exactly. > via the standard > python idiom for lists, e.g.: > > a[:] = numpy.flipud(a) > > Now, flipud returns a view on 'a', so assigning that to 'a[:]' > provides pretty strange results as the buffer that is being read (the > view) is simultaneously modified. yes, weird. So why not just:

a = numpy.flipud(a)

Since flipud returns a view, the new "a" will still be using the same data array. Does this satisfy your need above? You've created a new numpy array object, but that was created by flipud anyway, so there is no performance loss. It's too bad that to do this you need to know that flipud created a view, rather than a copy of the data, as if it were a copy, you would need to do the a[:] trick to make sure a kept the same data, but that's the price we pay for the flexibility and power of numpy -- the alternative is to have EVERYTHING create a copy, but there would be a substantial performance hit for that.

NOTE: the docstring doesn't make it clear that a view is created:

>>> help(numpy.flipud)
Help on function flipud in module numpy.lib.twodim_base:

flipud(m)
    returns an array with the columns preserved and rows flipped in
    the up/down direction. Works on the first dimension of m.

NOTE2: Maybe these kinds of functions should have an optional flag that specifies whether you want a view or a copy -- I'd have expected a copy in this case!
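(A quick check does confirm the view -- at least with the numpy I have here, flipud's result reports the original array as its base:

>>> a = numpy.arange(10).reshape((5,2))
>>> numpy.flipud(a).base is a
True

so the docstring could easily say so.)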
QUESTION: How do you tell if two arrays are views on the same data: is checking if they have the same .base reliable?

>>> a = numpy.array((1,2,3,4))
>>> b = a.view()
>>> a.base is b.base
False

No, I guess not. Maybe .base should return self if it's the originator of the data. Is there a reliable way? I usually just test by changing a value in one to see if it changes in the other, but that's one heck of a kludge!

>>> a.__array_interface__['data'][0] == b.__array_interface__['data'][0]
True

seems to work, but that's pretty ugly! -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov

From zpincus at stanford.edu Thu Feb 1 13:52:13 2007 From: zpincus at stanford.edu (Zachary Pincus) Date: Thu, 1 Feb 2007 10:52:13 -0800 Subject: [Numpy-discussion] array copy-to-self and views In-Reply-To: <45C22584.4060904@noaa.gov> References: <66C0943A-1019-4B59-B5B6-F0D13EFF8BA3@stanford.edu> <45C22584.4060904@noaa.gov> Message-ID: <4A126763-2364-41AB-AD1C-ACBB5CE265FF@stanford.edu> > Zachary Pincus wrote: >> Hello folks, >> >> I recently was trying to write code to modify an array in-place (so >> as not to invalidate any references to that array) > > I'm not sure what this means exactly. Say one wants to keep two different variables referencing a single in-memory list, as so:

a = [1,2,3]
b = a

Now, if 'b' and 'a' go to live in different places (different class instances or whatever) but we want 'b' and 'a' to always refer to the same in-memory object, so that 'id(a) == id(b)', we need to make sure to not assign a brand new list to either one. That is, if we do something like 'a = [i + 1 for i in a]' then 'id(a) != id(b)'. However, we can do 'a[:] = [i + 1 for i in a]' to modify a in-place. This is not super-common, but it's also not an uncommon python idiom. I was in my email simply pointing out that naïvely translating that idiom to the numpy case can cause unexpected behavior in the case of views.

I think that this is unquestionably a bug -- isn't the point of views that the user shouldn't need to care if a particular array object is a view or not? Given the lack of methods to query whether an array is a view, or what it might be a view on, this seems like a reasonable perspective... I mean, if certain operations produce completely different results when one of the operands is a view, that *seems* like a bug. It might not be worth fixing, but I can't see how that behavior would be considered a feature. However, I do think there's a legitimate question about whether it would be worth fixing -- there could be a lot of complicated checks to catch these kinds of corner cases.

>> via the standard
>> python idiom for lists, e.g.:
>>
>> a[:] = numpy.flipud(a)
>>
>> Now, flipud returns a view on 'a', so assigning that to 'a[:]'
>> provides pretty strange results as the buffer that is being read (the
>> view) is simultaneously modified.
>
> yes, weird. So why not just:
>
> a = numpy.flipud(a)
>
> Since flipud returns a view, the new "a" will still be using the same
> data array. Does this satisfy your need above?

Nope -- though 'a' and 'numpy.flipud(a)' share the same data, the actual ndarray instances are different. This means that any other references to the 'a' array (made via 'b = a' or whatever) now refer to the old 'a', not the flipped one.
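To make that concrete:

>>> a = numpy.arange(4)
>>> b = a
>>> a = numpy.flipud(a)
>>> a is b
False
>>> b
array([0, 1, 2, 3])

'b' is still the old, un-flipped object; it merely shares data with the new 'a'.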
The only other option for sharing arrays is to encapsulate them as attributes of *another* object, which itself won't change. That seems a bit clumsy.

> It's too bad that to do this you need to know that flipud created a
> view, rather than a copy of the data, as if it were a copy, you would
> need to do the a[:] trick to make sure a kept the same data, but
> that's the price we pay for the flexibility and power of numpy -- the
> alternative is to have EVERYTHING create a copy, but there would be a
> substantial performance hit for that.

Well, Anne's email suggests another alternative -- each time a view is created, keep track of the original array from whence it came, and then only make a copy when collisions like the above would take place. And actually, I suspect that views already need to keep a reference to their original array in order to keep that array from being deleted before the view is. But I don't know the guts of numpy well enough to say for sure.

> NOTE: the docstring doesn't make it clear that a view is created:
>
>>>> help(numpy.flipud)
> Help on function flipud in module numpy.lib.twodim_base:
>
> flipud(m)
>     returns an array with the columns preserved and rows flipped in
>     the up/down direction. Works on the first dimension of m.
>
> NOTE2: Maybe these kinds of functions should have an optional flag
> that specifies whether you want a view or a copy -- I'd have expected
> a copy in this case!

Well, it seems like in most cases one does not need to care whether one is looking at a view or an array. The only time that comes to mind is when you're attempting to modify the array in-place, e.g. a[<indices>] = <new values>. Even if the maybe-bug above were easily fixable (again, not sure about that), you might *still* want to be able to figure out if a were a view before such a modification. Whether this needs a runtime 'is_view' method, or just consistent documentation about what returns a view, isn't clear to me. Certainly the latter couldn't hurt.

> QUESTION:
> How do you tell if two arrays are views on the same data: is checking if
> they have the same .base reliable?
>
>>>> a = numpy.array((1,2,3,4))
>>>> b = a.view()
>>>> a.base is b.base
> False
>
> No, I guess not. Maybe .base should return self if it's the originator
> of the data.
>
> Is there a reliable way? I usually just test by changing a value in one
> to see if it changes in the other, but that's one heck of a kludge!
>
>>>> a.__array_interface__['data'][0] == b.__array_interface__['data'][0]
> True
>
> seems to work, but that's pretty ugly!

Good question. As I mentioned above, I assume that this information is tracked internally to prevent the 'original' array data from being deleted before any views have; however I really don't know how it is exposed. Zach

From tim.hochberg at ieee.org Thu Feb 1 14:24:04 2007 From: tim.hochberg at ieee.org (Timothy Hochberg) Date: Thu, 1 Feb 2007 12:24:04 -0700 Subject: [Numpy-discussion] array copy-to-self and views In-Reply-To: <4A126763-2364-41AB-AD1C-ACBB5CE265FF@stanford.edu> References: <66C0943A-1019-4B59-B5B6-F0D13EFF8BA3@stanford.edu> <45C22584.4060904@noaa.gov> <4A126763-2364-41AB-AD1C-ACBB5CE265FF@stanford.edu> Message-ID: On 2/1/07, Zachary Pincus wrote: [CHOP]

    I think that this is unquestionably a bug

It's not a bug. It's a design decision. It has certain consequences. Many good, some bad and some that just take some getting used to.

    -- isn't the point of
> views that the user shouldn't need to care if a particular array
> object is a view or not?
As you state elsewhere, the issue isn't whether a given object is a view per se, it's whether the objects that you are operating on refer to the same block of memory. They could both be views, even of the same object, and as long as they're disjoint, it's not a problem.

> Given the lack of methods to query whether
> an array is a view, or what it might be a view on, this seems like a
> reasonable perspective... I mean, if certain operations produce
> completely different results when one of the operands is a view, that
> *seems* like a bug. It might not be worth fixing, but I can't see how
> that behavior would be considered a feature.

View semantics are a feature. A powerful and sometimes dangerous feature. Sometimes the consequences of these semantics can bite people, but that doesn't make them a bug. [CHOP]

> Good question. As I mentioned above, I assume that this information
> is tracked internally to prevent the 'original' array data from being
> deleted before any views have; however I really don't know how it is
> exposed.

I believe that a reference is held to the original array, so the array itself won't be deleted even if all of the references to it go away. The details may be different, but that's the gist of it. Even if you could access this, it wouldn't really tell you anything useful since two slices could refer to pieces of the original chunk of data, yet still be disjoint. If you wanted to be able to figure this out, probably the thing to do is just to actually look at the block of data occupied by each array and see if they overlap. I think you could even do this without resorting to C by using the array interface. However, I'd like to repeat what my doctor said when, as a kid, I complained that "it hurts when I do this": "Don't do that!" -- Some Random Doctor. In other words, I think you'd be better off restructuring your code so that this isn't an issue. I've been using Numeric/numarray/numpy for over ten years now, and this has never been a significant issue for me. -- //=][=\\ tim.hochberg at ieee.org -------------- next part -------------- An HTML attachment was scrubbed... URL:

From oliphant at ee.byu.edu Thu Feb 1 14:30:33 2007 From: oliphant at ee.byu.edu (Travis Oliphant) Date: Thu, 01 Feb 2007 12:30:33 -0700 Subject: [Numpy-discussion] array copy-to-self and views In-Reply-To: <66C0943A-1019-4B59-B5B6-F0D13EFF8BA3@stanford.edu> References: <66C0943A-1019-4B59-B5B6-F0D13EFF8BA3@stanford.edu> Message-ID: <45C23FD9.6010108@ee.byu.edu> Zachary Pincus wrote: >Hello folks, > >I recently was trying to write code to modify an array in-place (so >as not to invalidate any references to that array) via the standard >python idiom for lists, e.g.: > >a[:] = numpy.flipud(a) > >Now, flipud returns a view on 'a', so assigning that to 'a[:]' >provides pretty strange results as the buffer that is being read (the >view) is simultaneously modified. Here is an example: > > This is a known feature of the "view" concept. It has been present in Numeric from the beginning. Performing operations in-place using a view always gives "hard-to-predict" results. It depends completely on how the algorithms are implemented. Knowing that numpy.flipud(a) is just a different way to write a[::-1,...], which works for any nested sequence, helps you realize that if a is already an array, then it returns a reversed view, but when copied back into itself it creates the results you obtained but might not have been expecting.
You can understand the essence of what is happening with a simpler example:

a = arange(10)
a[:] = a[::-1]

What is a? It is easy to see the answer when you realize that the code is doing the equivalent of

a[0] = a[9]
a[1] = a[8]
a[2] = a[7]
a[3] = a[6]
a[4] = a[5]
a[5] = a[4]
a[6] = a[3]
a[7] = a[2]
a[8] = a[1]
a[9] = a[0]

Notice that the final 5 lines are completely redundant, so really all that is happening is

a[:5] = a[5:][::-1]

There was an explicit warning of the oddities of this construct in the original Numeric documentation. Better documentation of the flipud function to indicate that it returns a view is definitely desirable. In fact, all functions that return views should be clear about this in the docstring. In addition, all users of "in-place" functionality of NumPy must be aware of the view concept and realize that you could be modifying the array you are using. This came up before when somebody asked how to perform a "diff" in place and I was careful to make sure and not change the input array before it was used.

>A question, then: Does this represent a bug? Or perhaps there is a
>better idiom for modifying an array in-place than 'a[:] = ...'? Or is it
>incumbent on the user to ensure that any time an array is directly
>modified, that the modifying array is not a view of the original array?

Yes, it is and has always been incumbent on the user to ensure that any time an array is directly modified in-place that the modifying array is not a "view" of the original array. Good example... -Travis

From Louis.Wicker at noaa.gov Thu Feb 1 14:33:23 2007 From: Louis.Wicker at noaa.gov (Louis Wicker) Date: Thu, 01 Feb 2007 13:33:23 -0600 Subject: [Numpy-discussion] large memory address space on Mac OS X (intel) Message-ID: <4E1D1DFE-1FDE-43CF-97AC-F72EFC3D3E3C@noaa.gov> Dear list: I cannot seem to figure how to create arrays > 2 GB on a Mac Pro (using Intel chip and Tiger, 4.8). I have hand compiled both Python 2.5 and numpy 1.0.1, and cannot make arrays bigger than 2 GB. I also run out of space if I try and 3-6 several arrays of 1000 mb or so (the mem-alloc failure does not seem consistent, depends on whether I am creating them with a "numpy.ones()" call, or creating them on the fly by doing math with the other arrays "e.g., c = 4.3*a + 3.1*b"). Is this a numpy issue, or a Python 2.5 issue for the Mac? I have tried this on the SGI Altix, and this works fine. If there is a compile flag to turn on 64 bit support in the Mac compile, I would be glad to find out about it. Or do I have to wait for Leopard? Thanks. Lou Wicker ------------------------------------------------------------------------ ---- | Dr. Louis J. Wicker | NSSL/WRDD | National Weather Center | 120 David L. Boren Boulevard, Norman, OK 73072-7323 | | E-mail: Louis.Wicker at noaa.gov | HTTP: www.nssl.noaa.gov/~lwicker | Phone: (405) 325-6340 | Fax: (405) 325-6780 | | "Programming is not just creating strings of instructions | for a computer to execute. It's also 'literary' in that you | are trying to communicate a program structure to | other humans reading the code." - Paul Rubin | |"Real efficiency comes from elegant solutions, not optimized programs. | Optimization is always just a few correctness-preserving transformations | away." - Jonathan Sobel ------------------------------------------------------------------------ ---- | | "The contents of this message are mine personally and | do not reflect any position of the Government or NOAA."
| ------------------------------------------------------------------------ ---- -------------- next part -------------- An HTML attachment was scrubbed... URL: From svetosch at gmx.net Thu Feb 1 14:40:02 2007 From: svetosch at gmx.net (Sven Schreiber) Date: Thu, 01 Feb 2007 20:40:02 +0100 Subject: [Numpy-discussion] Cutting 1.0.2 release In-Reply-To: <45BE7E57.4000105@ee.byu.edu> References: <45BE7E57.4000105@ee.byu.edu> Message-ID: <45C24212.4060203@gmx.net> Travis Oliphant schrieb: > I think it's time for the 1.0.2 release of NumPy. > > What outstanding issues need to be resolved before we do it? > Hi, I just used real_if_close for the first time, and promptly discovered that it turns matrix input into array output: >>> import numpy as n >>> n.__version__ '1.0.1' >>> n.real_if_close(n.mat(1)) array([[1]]) Maybe it's something for 1.0.2, or should I file a ticket? I could also do another round of systematic searches for other functions that still behave like this, but I'm afraid not before 1.0.2 (if that happens this weekend). Thanks, Sven From oliphant at ee.byu.edu Thu Feb 1 14:41:38 2007 From: oliphant at ee.byu.edu (Travis Oliphant) Date: Thu, 01 Feb 2007 12:41:38 -0700 Subject: [Numpy-discussion] large memory address space on Mac OS X (intel) In-Reply-To: <4E1D1DFE-1FDE-43CF-97AC-F72EFC3D3E3C@noaa.gov> References: <4E1D1DFE-1FDE-43CF-97AC-F72EFC3D3E3C@noaa.gov> Message-ID: <45C24272.4010402@ee.byu.edu> Louis Wicker wrote: > Dear list: > > I cannot seem to figure how to create arrays > 2 GB on a Mac Pro > (using Intel chip and Tiger, 4.8). I have hand compiled both Python > 2.5 and numpy 1.0.1, and cannot make arrays bigger than 2 GB. I also > run out of space if I try and 3-6 several arrays of 1000 mb or so (the > mem-alloc failure does not seem consistent, depends on whether I am > creating them with a "numpy.ones()" call, or creating them on the fly > by doing math with the other arrays "e.g., c = 4.3*a + 3.1*b"). > > Is this a numpy issue, or a Python 2.5 issue for the Mac? I have > tried this on the SGI Altix, and this works fine. It must be a malloc issue. NumPy uses the system malloc to construct arrays. It just reports errors back to you if it can't. I don't think the Mac Pro uses a 64-bit chip, does it? -Travis From robert.kern at gmail.com Thu Feb 1 14:48:08 2007 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 01 Feb 2007 13:48:08 -0600 Subject: [Numpy-discussion] large memory address space on Mac OS X (intel) In-Reply-To: <45C24272.4010402@ee.byu.edu> References: <4E1D1DFE-1FDE-43CF-97AC-F72EFC3D3E3C@noaa.gov> <45C24272.4010402@ee.byu.edu> Message-ID: <45C243F8.3080306@gmail.com> Travis Oliphant wrote: > Louis Wicker wrote: > >> Dear list: >> >> I cannot seem to figure how to create arrays > 2 GB on a Mac Pro >> (using Intel chip and Tiger, 4.8). I have hand compiled both Python >> 2.5 and numpy 1.0.1, and cannot make arrays bigger than 2 GB. I also >> run out of space if I try and 3-6 several arrays of 1000 mb or so (the >> mem-alloc failure does not seem consistent, depends on whether I am >> creating them with a "numpy.ones()" call, or creating them on the fly >> by doing math with the other arrays "e.g., c = 4.3*a + 3.1*b"). >> >> Is this a numpy issue, or a Python 2.5 issue for the Mac? I have >> tried this on the SGI Altix, and this works fine. > > It must be a malloc issue. NumPy uses the system malloc to construct > arrays. It just reports errors back to you if it can't. > > I don't think the Mac Pro uses a 64-bit chip, does it? 
Intel Core 2 Duo's are 64-bit, but the OS and runtime libraries are not. None of the Python distributions for OS X are compiled for 64-bit support, either. When Leopard comes out, there will be better 64-bit support in the OS, and Python will follow. Until then, Python on OS X is 32-bit. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From Louis.Wicker at noaa.gov Thu Feb 1 14:48:16 2007 From: Louis.Wicker at noaa.gov (Louis Wicker) Date: Thu, 01 Feb 2007 13:48:16 -0600 Subject: [Numpy-discussion] large memory address space on Mac OS X (intel) In-Reply-To: <45C24272.4010402@ee.byu.edu> References: <4E1D1DFE-1FDE-43CF-97AC-F72EFC3D3E3C@noaa.gov> <45C24272.4010402@ee.byu.edu> Message-ID: Travis: yes it does. Its the Woodcrest server chip which supports 32 and 64 bit operations. For example the new Intel Fortran compiler can grab more than 2 GB of memory (its a beta10 version). I think gcc 4.x can as well. However, Tiger (OS X 10.4.x) is not completely 64 bit compliant - Leopard is supposed to be pretty darn close. Is there a numpy flag I could try for compilation.... Lou On Feb 1, 2007, at 1:41 PM, Travis Oliphant wrote: > Louis Wicker wrote: > >> Dear list: >> >> I cannot seem to figure how to create arrays > 2 GB on a Mac Pro >> (using Intel chip and Tiger, 4.8). I have hand compiled both Python >> 2.5 and numpy 1.0.1, and cannot make arrays bigger than 2 GB. I also >> run out of space if I try and 3-6 several arrays of 1000 mb or so >> (the >> mem-alloc failure does not seem consistent, depends on whether I am >> creating them with a "numpy.ones()" call, or creating them on the fly >> by doing math with the other arrays "e.g., c = 4.3*a + 3.1*b"). >> >> Is this a numpy issue, or a Python 2.5 issue for the Mac? I have >> tried this on the SGI Altix, and this works fine. > > It must be a malloc issue. NumPy uses the system malloc to construct > arrays. It just reports errors back to you if it can't. > > I don't think the Mac Pro uses a 64-bit chip, does it? > > -Travis > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion ------------------------------------------------------------------------ ---- | Dr. Louis J. Wicker | NSSL/WRDD | National Weather Center | 120 David L. Boren Boulevard, Norman, OK 73072-7323 | | E-mail: Louis.Wicker at noaa.gov | HTTP: www.nssl.noaa.gov/~lwicker | Phone: (405) 325-6340 | Fax: (405) 325-6780 | | "Programming is not just creating strings of instructions | for a computer to execute. It's also 'literary' in that you | are trying to communicate a program structure to | other humans reading the code." - Paul Rubin | |"Real efficiency comes from elegant solutions, not optimized programs. | Optimization is always just a few correctness-preserving transformations | away." - Jonathan Sobel ------------------------------------------------------------------------ ---- | | "The contents of this message are mine personally and | do not reflect any position of the Government or NOAA." | ------------------------------------------------------------------------ ---- -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From Louis.Wicker at noaa.gov Thu Feb 1 14:50:04 2007 From: Louis.Wicker at noaa.gov (Louis Wicker) Date: Thu, 01 Feb 2007 13:50:04 -0600 Subject: [Numpy-discussion] large memory address space on Mac OS X (intel) In-Reply-To: <45C24272.4010402@ee.byu.edu> References: <4E1D1DFE-1FDE-43CF-97AC-F72EFC3D3E3C@noaa.gov> <45C24272.4010402@ee.byu.edu> Message-ID: <665E5E7D-1F12-4A52-8FB8-CAC043041EF0@noaa.gov> Travis: quick follow up: Mac Pro's currently have the dual-core 5100 Xeon (two processors, two cores each), the 5300 Xeon's (quad-core) are coming in a few weeks, we think. Lou On Feb 1, 2007, at 1:41 PM, Travis Oliphant wrote: > Louis Wicker wrote: > >> Dear list: >> >> I cannot seem to figure how to create arrays > 2 GB on a Mac Pro >> (using Intel chip and Tiger, 4.8). I have hand compiled both Python >> 2.5 and numpy 1.0.1, and cannot make arrays bigger than 2 GB. I also >> run out of space if I try and 3-6 several arrays of 1000 mb or so >> (the >> mem-alloc failure does not seem consistent, depends on whether I am >> creating them with a "numpy.ones()" call, or creating them on the fly >> by doing math with the other arrays "e.g., c = 4.3*a + 3.1*b"). >> >> Is this a numpy issue, or a Python 2.5 issue for the Mac? I have >> tried this on the SGI Altix, and this works fine. > > It must be a malloc issue. NumPy uses the system malloc to construct > arrays. It just reports errors back to you if it can't. > > I don't think the Mac Pro uses a 64-bit chip, does it? > > -Travis > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion ------------------------------------------------------------------------ ---- | Dr. Louis J. Wicker | NSSL/WRDD | National Weather Center | 120 David L. Boren Boulevard, Norman, OK 73072-7323 | | E-mail: Louis.Wicker at noaa.gov | HTTP: www.nssl.noaa.gov/~lwicker | Phone: (405) 325-6340 | Fax: (405) 325-6780 | | "Programming is not just creating strings of instructions | for a computer to execute. It's also 'literary' in that you | are trying to communicate a program structure to | other humans reading the code." - Paul Rubin | |"Real efficiency comes from elegant solutions, not optimized programs. | Optimization is always just a few correctness-preserving transformations | away." - Jonathan Sobel ------------------------------------------------------------------------ ---- | | "The contents of this message are mine personally and | do not reflect any position of the Government or NOAA." | ------------------------------------------------------------------------ ---- -------------- next part -------------- An HTML attachment was scrubbed... URL: From Louis.Wicker at noaa.gov Thu Feb 1 14:50:59 2007 From: Louis.Wicker at noaa.gov (Louis Wicker) Date: Thu, 01 Feb 2007 13:50:59 -0600 Subject: [Numpy-discussion] large memory address space on Mac OS X (intel) In-Reply-To: <45C243F8.3080306@gmail.com> References: <4E1D1DFE-1FDE-43CF-97AC-F72EFC3D3E3C@noaa.gov> <45C24272.4010402@ee.byu.edu> <45C243F8.3080306@gmail.com> Message-ID: <9A3986E1-FE7E-490A-B0A1-E2F0FAAC25C3@noaa.gov> Thanks Robert thats kinda what I thought. Since Leopard is not far off, then by summer things will be fine, I hope... L On Feb 1, 2007, at 1:48 PM, Robert Kern wrote: > Travis Oliphant wrote: >> Louis Wicker wrote: >> >>> Dear list: >>> >>> I cannot seem to figure how to create arrays > 2 GB on a Mac Pro >>> (using Intel chip and Tiger, 4.8). 
I have hand compiled both Python >>> 2.5 and numpy 1.0.1, and cannot make arrays bigger than 2 GB. I >>> also >>> run out of space if I try and 3-6 several arrays of 1000 mb or so >>> (the >>> mem-alloc failure does not seem consistent, depends on whether I am >>> creating them with a "numpy.ones()" call, or creating them on the >>> fly >>> by doing math with the other arrays "e.g., c = 4.3*a + 3.1*b"). >>> >>> Is this a numpy issue, or a Python 2.5 issue for the Mac? I have >>> tried this on the SGI Altix, and this works fine. >> >> It must be a malloc issue. NumPy uses the system malloc to construct >> arrays. It just reports errors back to you if it can't. >> >> I don't think the Mac Pro uses a 64-bit chip, does it? > > Intel Core 2 Duo's are 64-bit, but the OS and runtime libraries are > not. None of > the Python distributions for OS X are compiled for 64-bit support, > either. > > When Leopard comes out, there will be better 64-bit support in the > OS, and > Python will follow. Until then, Python on OS X is 32-bit. > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a > harmless enigma > that is made terrible by our own mad attempt to interpret it as > though it had > an underlying truth." > -- Umberto Eco > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion ------------------------------------------------------------------------ ---- | Dr. Louis J. Wicker | NSSL/WRDD | National Weather Center | 120 David L. Boren Boulevard, Norman, OK 73072-7323 | | E-mail: Louis.Wicker at noaa.gov | HTTP: www.nssl.noaa.gov/~lwicker | Phone: (405) 325-6340 | Fax: (405) 325-6780 | | "Programming is not just creating strings of instructions | for a computer to execute. It's also 'literary' in that you | are trying to communicate a program structure to | other humans reading the code." - Paul Rubin | |"Real efficiency comes from elegant solutions, not optimized programs. | Optimization is always just a few correctness-preserving transformations | away." - Jonathan Sobel ------------------------------------------------------------------------ ---- | | "The contents of this message are mine personally and | do not reflect any position of the Government or NOAA." | ------------------------------------------------------------------------ ---- -------------- next part -------------- An HTML attachment was scrubbed... URL: From oliphant at ee.byu.edu Thu Feb 1 14:55:55 2007 From: oliphant at ee.byu.edu (Travis Oliphant) Date: Thu, 01 Feb 2007 12:55:55 -0700 Subject: [Numpy-discussion] large memory address space on Mac OS X (intel) In-Reply-To: References: <4E1D1DFE-1FDE-43CF-97AC-F72EFC3D3E3C@noaa.gov> <45C24272.4010402@ee.byu.edu> Message-ID: <45C245CB.2080502@ee.byu.edu> Louis Wicker wrote: > Travis: > > yes it does. Its the Woodcrest server chip > which > supports 32 and 64 bit operations. For example the new Intel Fortran > compiler can grab more than 2 GB of memory (its a beta10 version). I > think gcc 4.x can as well. > Nice. I didn't know this. > However, Tiger (OS X 10.4.x) is not completely 64 bit compliant - > Leopard is supposed to be pretty darn close. > > Is there a numpy flag I could try for compilation.... It's entirely compiler and system dependent. NumPy just uses the system malloc. If you can compile it so that the system malloc supports 64-bit then O.K. 
(but you will probably run into trouble unless Python is also compiled as a 64-bit application). From Robert's answer, I guess it is impossible under Tiger to compile with 64-bit support. -Travis

From zpincus at stanford.edu Thu Feb 1 15:01:00 2007 From: zpincus at stanford.edu (Zachary Pincus) Date: Thu, 1 Feb 2007 12:01:00 -0800 Subject: [Numpy-discussion] array copy-to-self and views In-Reply-To: <45C23FD9.6010108@ee.byu.edu> References: <66C0943A-1019-4B59-B5B6-F0D13EFF8BA3@stanford.edu> <45C23FD9.6010108@ee.byu.edu> Message-ID: <9A5766D8-8C3F-4C40-A54C-A286F264ECE7@stanford.edu> >>A question, then: Does this represent a bug? Or perhaps there is a >>better idiom for modifying an array in-place than 'a[:] = ...'? Or is it >>incumbent on the user to ensure that any time an array is directly >>modified, that the modifying array is not a view of the original >>array? >> >> >Yes, it is and has always been incumbent on the user to ensure that >any >time an array is directly modified in-place that the modifying >array is >not a "view" of the original array. Fair enough. Now, how does a user ensure this -- say someone like me, who has been using numpy (et alia) for a couple of years, but clearly not long enough to have an 'intuitive' feel for every time something might be a view (a feeling that must seem quite natural to long-time numpy users, who may have forgotten precisely how long it takes to develop that level of intuition)? Documentation of what returns views helps, for sure. Would any other 'training' mechanisms help? Say a function that (despite Tim's pretty reasonable 'don't do that' warning) will return true when two arrays have overlapping memory? Or an 'inplace_modify' function that takes the time to make that check? Perhaps I'm the first to have views bite me in this precise way. However, if there are common failure-modes with views, I hope it's not too unreasonable to ask about ways that those common problems might be addressed. (Other than just saying "train for ten years, and you too will have numpy-fu, my son.") Giving newbies tools to deal with common problems with admittedly "dangerous" constructs might be useful. Zach

From seb.haase at gmx.net Thu Feb 1 15:01:27 2007 From: seb.haase at gmx.net (Sebastian Haase) Date: Thu, 1 Feb 2007 12:01:27 -0800 Subject: [Numpy-discussion] large memory address space on Mac OS X (intel) In-Reply-To: <45C245CB.2080502@ee.byu.edu> References: <4E1D1DFE-1FDE-43CF-97AC-F72EFC3D3E3C@noaa.gov> <45C24272.4010402@ee.byu.edu> <45C245CB.2080502@ee.byu.edu> Message-ID: Here is a small C program that we used more than a year ago to confirm that Tiger is really doing a 64-bit malloc (on G5).

#include <stdio.h>
#include <stdlib.h>

int main() {
    size_t n;
    void *p;
    double gb;
    for(gb=10; gb>.3; gb-=.5) {
        n = 1024L * 1024L * 1024L * gb;
        p = malloc( n );
        printf("%12lu %4.1lfGb %p\n", n, n/1024./1024./1024., p);
        free(p);
    }
    return 0;
}

Hope this helps anyone. Sebastian

On 2/1/07, Travis Oliphant wrote: > Louis Wicker wrote: > > > Travis: > > > > yes it does. Its the Woodcrest server chip > > which > > supports 32 and 64 bit operations. For example the new Intel Fortran > > compiler can grab more than 2 GB of memory (its a beta10 version). I > > think gcc 4.x can as well. > > > Nice. I didn't know this. > > > However, Tiger (OS X 10.4.x) is not completely 64 bit compliant - > > Leopard is supposed to be pretty darn close. > > > > Is there a numpy flag I could try for compilation.... > > It's entirely compiler and system dependent. NumPy just uses the system > malloc.
If you can compile it so that the system malloc supports 64-bit > then O.K. (but you will probably run into trouble unless Python is also > compiled as a 64-bit application). From Robert's answer, I guess it is > impossible under Tiger to compile with 64-bit support. > > -Travis > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion >

From Chris.Barker at noaa.gov Thu Feb 1 15:12:34 2007 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Thu, 01 Feb 2007 12:12:34 -0800 Subject: [Numpy-discussion] array copy-to-self and views In-Reply-To: <4A126763-2364-41AB-AD1C-ACBB5CE265FF@stanford.edu> References: <66C0943A-1019-4B59-B5B6-F0D13EFF8BA3@stanford.edu> <45C22584.4060904@noaa.gov> <4A126763-2364-41AB-AD1C-ACBB5CE265FF@stanford.edu> Message-ID: <45C249B2.6070804@noaa.gov> Zachary Pincus wrote: >>> I recently was trying to write code to modify an array in-place (so >>> as not to invalidate any references to that array) >> I'm not sure what this means exactly. > > Say one wants to keep two different variables referencing a single in- > memory list, as so: > a = [1,2,3] > b = a > Now, if 'b' and 'a' go to live in different places (different class > instances or whatever) but we want 'b' and 'a' to always refer to the > same in-memory object, so that 'id(a) == id(b)', we need to make sure > to not assign a brand new list to either one.
OK, got it, but numpy arrays are not quite the same as lists, there is the additional complication that two different array objects can share the same data:

>>> a = N.ones((5,))
>>> b = a[:]
>>> a is b
False
>>> a[2] = 5
>>> a
array([ 1.,  1.,  5.,  1.,  1.])
>>> b
array([ 1.,  1.,  5.,  1.,  1.])

This is very useful, but can be tricky. In a way, it's like a nested list:

>>> a = [[1,2,3,4]]
>>> b = [a[0]]
>>> a is b
False
>>> a[0][2] = 5
>>> a
[[1, 2, 5, 4]]
>>> b
[[1, 2, 5, 4]]

hey! changing a changed b too! So the key is that in your case, it probably doesn't matter if a and b are the same object, as long as they share the same data, and having multiple arrays sharing the same data is a common idiom in numpy.

> That is, if we do something like 'a = [i + 1 for i in a]' then 'id
> (a) != id(b)'. However, we can do 'a[:] = [i + 1 for i in a]' to
> modify a in-place.

Ah, but as Travis pointed out, the difference is not in assignment or anything like that, but in the fact that a list comprehension produces a copy, which is analogous to flipud(a).copy(). In numpy, you DO need to be aware of when you are getting copies, and when you are getting views, and what the consequences are. So really, the only "bug" here is in the docs -- they should make it clear whether a function returns a copy or a view. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov

From oliphant at ee.byu.edu Thu Feb 1 15:15:45 2007 From: oliphant at ee.byu.edu (Travis Oliphant) Date: Thu, 01 Feb 2007 13:15:45 -0700 Subject: [Numpy-discussion] array copy-to-self and views In-Reply-To: <9A5766D8-8C3F-4C40-A54C-A286F264ECE7@stanford.edu> References: <66C0943A-1019-4B59-B5B6-F0D13EFF8BA3@stanford.edu> <45C23FD9.6010108@ee.byu.edu> <9A5766D8-8C3F-4C40-A54C-A286F264ECE7@stanford.edu> Message-ID: <45C24A71.6070100@ee.byu.edu> Zachary Pincus wrote: >>>A question, then: Does this represent a bug? Or perhaps there is a >>>better idiom for modifying an array in-place than 'a[:] = ...'? Or is it >>>incumbent on the user to ensure that any time an array is directly >>>modified, that the modifying array is not a view of the original >>>array? >>> >>> >>> >>Yes, it is and has always been incumbent on the user to ensure that >>any >>time an array is directly modified in-place that the modifying >>array is >>not a "view" of the original array. >> >> > >Fair enough. Now, how does a user ensure this -- say someone like me, >who has been using numpy (et alia) for a couple of years, but clearly >not long enough to have an 'intuitive' feel for every time something >might be a view (a feeling that must seem quite natural to long-time >numpy users, who may have forgotten precisely how long it takes to >develop that level of intuition)? > > Basically, red flags go off when you do in-place modification of any kind and you make sure you don't have an inappropriate view. That pretty much describes my "intuition." Views arise from "slicing" notation. The flipud returning a view is a bit obscure and should be documented better.

>Documentation of what returns views helps, for sure. Would any other
>'training' mechanisms help? Say a function that (despite Tim's pretty
>reasonable 'don't do that' warning) will return true when two arrays
>have overlapping memory? Or an 'inplace_modify' function that takes
>the time to make that check?
> > I thought I had written a function that would see if two input arrays have overlapping memory, but maybe not. It's not hard for a contiguous chunk of memory, but for two views it's a harder function to write. It's probably a good idea to have such a thing, however. -Travis

From Chris.Barker at noaa.gov Thu Feb 1 15:39:09 2007 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Thu, 01 Feb 2007 12:39:09 -0800 Subject: [Numpy-discussion] array copy-to-self and views In-Reply-To: <9A5766D8-8C3F-4C40-A54C-A286F264ECE7@stanford.edu> References: <66C0943A-1019-4B59-B5B6-F0D13EFF8BA3@stanford.edu> <45C23FD9.6010108@ee.byu.edu> <9A5766D8-8C3F-4C40-A54C-A286F264ECE7@stanford.edu> Message-ID: <45C24FED.5050300@noaa.gov> Zachary Pincus wrote: > Say a function that (despite Tim's pretty > reasonable 'don't do that' warning) will return true when two arrays > have overlapping memory? I think it would be useful, even if it's not robust. I'd still like to know whether two given arrays COULD share data. I suppose to really be robust, what I'd really want to know is if a given array shares data with ANY other array, i.e. could changing this mess something up? -- but I'm pretty sure that is next to impossible -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov

From oliphant at ee.byu.edu Thu Feb 1 15:46:44 2007 From: oliphant at ee.byu.edu (Travis Oliphant) Date: Thu, 01 Feb 2007 13:46:44 -0700 Subject: [Numpy-discussion] array copy-to-self and views In-Reply-To: <45C24FED.5050300@noaa.gov> References: <66C0943A-1019-4B59-B5B6-F0D13EFF8BA3@stanford.edu> <45C23FD9.6010108@ee.byu.edu> <9A5766D8-8C3F-4C40-A54C-A286F264ECE7@stanford.edu> <45C24FED.5050300@noaa.gov> Message-ID: <45C251B4.1030404@ee.byu.edu> Christopher Barker wrote: >Zachary Pincus wrote: > > >>Say a function that (despite Tim's pretty >>reasonable 'don't do that' warning) will return true when two arrays >>have overlapping memory? >> >> > >I think it would be useful, even if it's not robust. I'd still like to >know whether two given arrays COULD share data. > >I suppose to really be robust, what I'd really want to know is if a >given array shares data with ANY other array, i.e. could changing this >mess something up? -- but I'm pretty sure that is next to impossible > > > Yeah, we don't keep track of who has a reference to a particular array. The only way to get that information would be to walk through all the objects defined and see if any of them share memory with me. You can sometimes get away with it by looking at the reference count of the object. But, the reference count is used in more ways than that and so it's a very conservative check. In the array interface I'm proposing for inclusion into Python, an object that shares memory could define a "call-back" function that (if defined) would be called when the view to the memory was released. That way objects could store information regarding how many "views" they have extant.
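In the meantime, something along these lines would be a start -- an untested sketch using the array interface, and deliberately conservative in that it ignores striding (two interleaved but disjoint views will still be reported as overlapping):

import numpy

def byte_bounds(a):
    # lowest and one-past-highest byte addresses touched by the array,
    # computed from the data pointer, shape, and strides
    lo = hi = a.__array_interface__['data'][0]
    for n, stride in zip(a.shape, a.strides):
        if n == 0:
            return lo, lo               # empty array touches no bytes
        if stride < 0:
            lo += (n - 1) * stride
        else:
            hi += (n - 1) * stride
    return lo, hi + a.itemsize

def might_share_data(a, b):
    # True whenever the two byte ranges intersect
    a_lo, a_hi = byte_bounds(numpy.asarray(a))
    b_lo, b_hi = byte_bounds(numpy.asarray(b))
    return a_lo < b_hi and b_lo < a_hi

The names are made up, of course; nothing like this exists in numpy today.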
-Travis

From tim.hochberg at ieee.org Thu Feb 1 15:54:23 2007 From: tim.hochberg at ieee.org (Timothy Hochberg) Date: Thu, 1 Feb 2007 13:54:23 -0700 Subject: [Numpy-discussion] array copy-to-self and views In-Reply-To: <45C24FED.5050300@noaa.gov> References: <66C0943A-1019-4B59-B5B6-F0D13EFF8BA3@stanford.edu> <45C23FD9.6010108@ee.byu.edu> <9A5766D8-8C3F-4C40-A54C-A286F264ECE7@stanford.edu> <45C24FED.5050300@noaa.gov> Message-ID: On 2/1/07, Christopher Barker wrote: > Zachary Pincus wrote: > > Say a function that (despite Tim's pretty > > reasonable 'don't do that' warning) will return true when two arrays > > have overlapping memory? > > I think it would be useful, even if it's not robust. I'd still like to > know whether two given arrays COULD share data. > > I suppose to really be robust, what I'd really want to know is if a > given array shares data with ANY other array, i.e. could changing this > mess something up? -- but I'm pretty sure that is next to impossible

It's not totally impossible in theory -- languages like Haskell and Clean (which I'm playing with now) manage to use arrays that get updated without copying, while still maintaining the illusion that everything is constant and thus you can't mess up any other arrays. While it's fun to play with and Clean is allegedly pretty fast, it takes quite a bit of work to wrap one's head around. In a language like Python I expect that it would be pretty hard to come up with something useful. Most of the checks would probably be too conservative and thus not useful. -tim -- //=][=\\ tim.hochberg at ieee.org -------------- next part -------------- An HTML attachment was scrubbed... URL:

From Louis.Wicker at noaa.gov Thu Feb 1 16:02:32 2007 From: Louis.Wicker at noaa.gov (Louis Wicker) Date: Thu, 01 Feb 2007 15:02:32 -0600 Subject: [Numpy-discussion] large memory address space on Mac OS X (intel) In-Reply-To: References: <4E1D1DFE-1FDE-43CF-97AC-F72EFC3D3E3C@noaa.gov> <45C24272.4010402@ee.byu.edu> <45C245CB.2080502@ee.byu.edu> Message-ID: Sebastian: that code helps a lot. A standard gcc compile (no flags) of that code breaks, but if you compile it with gcc -m64, you can address large memory spaces. So I will try and compile numpy with -m64.... Lou

On Feb 1, 2007, at 2:01 PM, Sebastian Haase wrote:
> #include <stdio.h>
> #include <stdlib.h>
> int main() {
>     size_t n;
>     void *p;
>     double gb;
>     for(gb=10; gb>.3; gb-=.5) {
>         n = 1024L * 1024L * 1024L * gb;
>         p = malloc( n );
>         printf("%12lu %4.1lfGb %p\n", n, n/1024./1024./1024., p);
>         free(p);
>     }
>     return 0;
> }

------------------------------------------------------------------------ ---- | Dr. Louis J. Wicker | NSSL/WRDD | National Weather Center | 120 David L. Boren Boulevard, Norman, OK 73072-7323 | | E-mail: Louis.Wicker at noaa.gov | HTTP: www.nssl.noaa.gov/~lwicker | Phone: (405) 325-6340 | Fax: (405) 325-6780 | | "Programming is not just creating strings of instructions | for a computer to execute. It's also 'literary' in that you | are trying to communicate a program structure to | other humans reading the code." - Paul Rubin | |"Real efficiency comes from elegant solutions, not optimized programs. | Optimization is always just a few correctness-preserving transformations | away." - Jonathan Sobel ------------------------------------------------------------------------ ---- | | "The contents of this message are mine personally and | do not reflect any position of the Government or NOAA."
| ------------------------------------------------------------------------ ---- -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Thu Feb 1 16:11:29 2007 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 01 Feb 2007 15:11:29 -0600 Subject: [Numpy-discussion] large memory address space on Mac OS X (intel) In-Reply-To: References: <4E1D1DFE-1FDE-43CF-97AC-F72EFC3D3E3C@noaa.gov> <45C24272.4010402@ee.byu.edu> <45C245CB.2080502@ee.byu.edu> Message-ID: <45C25781.6060507@gmail.com> Louis Wicker wrote: > Sebastian: > > that code helps a lot. A standard gcc (no flags) of that code breaks, > but if you compile it with gcc -m64, you can address large memory spaces. > > So I will try and compile numpy with -m64.... It won't work. Your Python is not compiled as a 64-bit program. The whole stack down to the runtime libraries needs to be built as a 64-bit program. That's easy enough to do with a single-file C program, but in Tiger the 64-bit runtime provides very few services, not enough to build Python. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From Louis.Wicker at noaa.gov Thu Feb 1 16:18:09 2007 From: Louis.Wicker at noaa.gov (Louis Wicker) Date: Thu, 01 Feb 2007 15:18:09 -0600 Subject: [Numpy-discussion] large memory address space on Mac OS X (intel) In-Reply-To: <45C25781.6060507@gmail.com> References: <4E1D1DFE-1FDE-43CF-97AC-F72EFC3D3E3C@noaa.gov> <45C24272.4010402@ee.byu.edu> <45C245CB.2080502@ee.byu.edu> <45C25781.6060507@gmail.com> Message-ID: <15555BFC-908D-48FA-B281-467267BD930F@noaa.gov> Robert: thanks - I appreciate the advice, and hopefully a) Leopard will get here in a few months, and b) that will fix this. cheers! Lou Wicker On Feb 1, 2007, at 3:11 PM, Robert Kern wrote: > Louis Wicker wrote: >> Sebastian: >> >> that code helps a lot. A standard gcc (no flags) of that code >> breaks, >> but if you compile it with gcc -m64, you can address large memory >> spaces. >> >> So I will try and compile numpy with -m64.... > > It won't work. Your Python is not compiled as a 64-bit program. The > whole stack > down to the runtime libraries needs to be built as a 64-bit > program. That's easy > enough to do with a single-file C program, but in Tiger the 64-bit > runtime > provides very few services, not enough to build Python. > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a > harmless enigma > that is made terrible by our own mad attempt to interpret it as > though it had > an underlying truth." > -- Umberto Eco > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion ------------------------------------------------------------------------ ---- | Dr. Louis J. Wicker | NSSL/WRDD | National Weather Center | 120 David L. Boren Boulevard, Norman, OK 73072-7323 | | E-mail: Louis.Wicker at noaa.gov | HTTP: www.nssl.noaa.gov/~lwicker | Phone: (405) 325-6340 | Fax: (405) 325-6780 | | "Programming is not just creating strings of instructions | for a computer to execute. It's also 'literary' in that you | are trying to communicate a program structure to | other humans reading the code." - Paul Rubin | |"Real efficiency comes from elegant solutions, not optimized programs. 
| Optimization is always just a few correctness-preserving transformations
| away." - Jonathan Sobel
----------------------------------------------------------------------------
|
| "The contents of this message are mine personally and
| do not reflect any position of the Government or NOAA."
|
----------------------------------------------------------------------------
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From Chris.Barker at noaa.gov  Thu Feb  1 16:58:45 2007
From: Chris.Barker at noaa.gov (Christopher Barker)
Date: Thu, 01 Feb 2007 13:58:45 -0800
Subject: [Numpy-discussion] SciPy '07 ???
Message-ID: <45C26295.4060707@noaa.gov>

Hi,

Does anyone know if there will be a SciPy '07 conference, and if so,
when? I'd really like to try to get there this year, but need to start
planning my summer schedule.

-Chris

-- 
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov

From oliphant at ee.byu.edu  Thu Feb  1 18:48:56 2007
From: oliphant at ee.byu.edu (Travis Oliphant)
Date: Thu, 01 Feb 2007 16:48:56 -0700
Subject: [Numpy-discussion] classmethods for ndarray
Message-ID: <45C27C68.7010306@ee.byu.edu>

What is the attitude of this group about the ndarray growing some class
methods?

I'm thinking that we should have several. For example all the fromXXX
functions should probably be classmethods

ndarray.frombuffer
ndarray.fromfile

etc.

-Travis

From pgmdevlist at gmail.com  Thu Feb  1 19:13:07 2007
From: pgmdevlist at gmail.com (Pierre GM)
Date: Thu, 1 Feb 2007 19:13:07 -0500
Subject: [Numpy-discussion] classmethods for ndarray
In-Reply-To: <45C27C68.7010306@ee.byu.edu>
References: <45C27C68.7010306@ee.byu.edu>
Message-ID: <200702011913.07062.pgmdevlist@gmail.com>

On Thursday 01 February 2007 18:48:56 Travis Oliphant wrote:
> What is the attitude of this group about the ndarray growing some class
> methods?
> ndarray.frombuffer
> ndarray.fromfile

Sounds great. But what would really make my semester is to have
ndarray.__new__ accept optional keywords (as **whatever) on top of the
shape/buffer/order... That'd be a big help for subclassing.

On a side note, I recently came to accept the fact that Santa Claus
doesn't really live at the North Pole, so I would understand if it
wasn't feasible...

From robert.kern at gmail.com  Thu Feb  1 19:16:23 2007
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 01 Feb 2007 18:16:23 -0600
Subject: [Numpy-discussion] classmethods for ndarray
In-Reply-To: <45C27C68.7010306@ee.byu.edu>
References: <45C27C68.7010306@ee.byu.edu>
Message-ID: <45C282D7.50901@gmail.com>

Travis Oliphant wrote:
> What is the attitude of this group about the ndarray growing some class
> methods?

Works for me.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma
that is made terrible by our own mad attempt to interpret it as though it had
an underlying truth."
  -- Umberto Eco

From Chris.Barker at noaa.gov  Thu Feb  1 19:23:23 2007
From: Chris.Barker at noaa.gov (Christopher Barker)
Date: Thu, 01 Feb 2007 16:23:23 -0800
Subject: [Numpy-discussion] classmethods for ndarray
In-Reply-To: <45C27C68.7010306@ee.byu.edu>
References: <45C27C68.7010306@ee.byu.edu>
Message-ID: <45C2847B.7080504@noaa.gov>

Travis Oliphant wrote:
> I'm thinking that we should have several.
> For example all the fromXXX functions should probably be classmethods
>
> ndarray.frombuffer
> ndarray.fromfile

would they still be accessible in their functional form in the numpy
namespace?

-- 
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov

From haase at msg.ucsf.edu  Thu Feb  1 19:24:22 2007
From: haase at msg.ucsf.edu (Sebastian Haase)
Date: Thu, 1 Feb 2007 16:24:22 -0800
Subject: [Numpy-discussion] classmethods for ndarray
In-Reply-To: <45C282D7.50901@gmail.com>
References: <45C27C68.7010306@ee.byu.edu> <45C282D7.50901@gmail.com>
Message-ID: 

Travis,
Could you explain what a possible downside of this would be !?
It seems that if you don't need to refer to a specific "self" object,
a class-method is what it should be - is this not always right !?

-Sebastian

On 2/1/07, Robert Kern wrote:
> Travis Oliphant wrote:
> > What is the attitude of this group about the ndarray growing some class
> > methods?
>
> Works for me.
>
> --
> Robert Kern

From russel at appliedminds.net  Thu Feb  1 19:46:12 2007
From: russel at appliedminds.net (Russel Howe)
Date: Thu, 1 Feb 2007 16:46:12 -0800
Subject: [Numpy-discussion] Complex arange
Message-ID: 

Should this work?

Python 2.4.3 (#1, Dec 27 2006, 21:18:13)
[GCC 4.0.1 (Apple Computer, Inc. build 5341)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> import numpy as N
>>> N.__version__
'1.0.2.dev3531'
>>> N.arange(1j, 5j)
array([], dtype=complex128)
>>> N.arange(1j, 5j, 1j)
array([], dtype=complex128)

Currently, the real direction is determined to be of zero length
(multiarraymodule.c _calc_length), and the length of the array is the
minimal length. I can understand the first one not working (the default
step is 1, and it takes 0 of those to cover this range), but the second
seems like a bug.

From oliphant at ee.byu.edu  Thu Feb  1 19:50:31 2007
From: oliphant at ee.byu.edu (Travis Oliphant)
Date: Thu, 01 Feb 2007 17:50:31 -0700
Subject: [Numpy-discussion] classmethods for ndarray
In-Reply-To: 
References: <45C27C68.7010306@ee.byu.edu> <45C282D7.50901@gmail.com>
Message-ID: <45C28AD7.4060305@ee.byu.edu>

Sebastian Haase wrote:

>Travis,
>Could you explain what a possible downside of this would be !?
>It seems that if you don't need to refer to a specific "self" object,
>a class-method is what it should be - is this not always right !?
>
I don't understand the last point. Classmethods would get inherited by
sub-classes by default, as far as I know.

I can't think of any downsides. I have to understand how class-methods
are actually implemented, though, before I could comment on speed
implications of class methods.

-Travis

From oliphant at ee.byu.edu  Thu Feb  1 19:51:22 2007
From: oliphant at ee.byu.edu (Travis Oliphant)
Date: Thu, 01 Feb 2007 17:51:22 -0700
Subject: [Numpy-discussion] classmethods for ndarray
In-Reply-To: <45C2847B.7080504@noaa.gov>
References: <45C27C68.7010306@ee.byu.edu> <45C2847B.7080504@noaa.gov>
Message-ID: <45C28B0A.6000001@ee.byu.edu>

Christopher Barker wrote:

>Travis Oliphant wrote:
>
>>I'm thinking that we should have several. For example all the fromXXX
>>functions should probably be classmethods
>>
>>ndarray.frombuffer
>>ndarray.fromfile
>>
>
>would they still be accessible in their functional form in the numpy
>namespace?
>
Yes, until a major revision, at which point they could (if deemed useful)
be removed after a deprecation warning period.

-Travis

From Chris.Barker at noaa.gov  Thu Feb  1 19:58:17 2007
From: Chris.Barker at noaa.gov (Christopher Barker)
Date: Thu, 01 Feb 2007 16:58:17 -0800
Subject: [Numpy-discussion] classmethods for ndarray
In-Reply-To: 
References: <45C27C68.7010306@ee.byu.edu> <45C282D7.50901@gmail.com>
Message-ID: <45C28CA9.7060704@noaa.gov>

Sebastian Haase wrote:
> Could you explain what a possible downside of this would be !?
> It seems that if you don't need to refer to a specific "self" object,
> a class-method is what it should be - is this not always right !?

Well, what these really are are alternate constructors. I don't think
I've seen class methods used that way, but then I haven't seen them used
much at all.

Sometimes I have wished for an overloaded constructor, i.e.:

array(SomeBuffer)

results in the same thing as

frombuffer(SomeBuffer)

but Python doesn't really "do" overloaded methods, and there are some
times when there wouldn't be only one way the input could be interpreted.

That all being the case, it seems to make some sense to put these in as
class methods, but:

a = numpy.ndarray.fromfile(MyFile)

does feel a bit awkward. Wx Python handles this by having a few
constructors:

wx.EmptyBitmap()
wx.BitmapFromImage()
wx.BitmapFromBuffer()
etc...

but that's kind of clunky too.

-Chris

-- 
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov

From oliphant at ee.byu.edu  Thu Feb  1 20:00:25 2007
From: oliphant at ee.byu.edu (Travis Oliphant)
Date: Thu, 01 Feb 2007 18:00:25 -0700
Subject: [Numpy-discussion] New may_share_memory function
Message-ID: <45C28D29.4060902@ee.byu.edu>

In SVN there is a new function

may_share_memory(a, b)

which will return True if the memory footprints of the two arrays
overlap.

>>> may_share_memory(a, flipud(a))
True

This is based on another utility function, byte_bounds, that returns the
byte-boundaries of any object exporting the Python side of the array
interface.

Perhaps these utilities will help (I know they can be used to make the
who function a bit more intelligent about how many bytes are being used).

-Travis

From robert.kern at gmail.com  Thu Feb  1 20:01:19 2007
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 01 Feb 2007 19:01:19 -0600
Subject: [Numpy-discussion] Complex arange
In-Reply-To: 
References: 
Message-ID: <45C28D5F.70109@gmail.com>

Russel Howe wrote:

(It's good to see so many Rudds seeing sense and using Python and
numpy. ;-))

> Should this work?
>
> Python 2.4.3 (#1, Dec 27 2006, 21:18:13)
> [GCC 4.0.1 (Apple Computer, Inc. build 5341)] on darwin
> Type "help", "copyright", "credits" or "license" for more information.
> >>> import numpy as N
> >>> N.__version__
> '1.0.2.dev3531'
> >>> N.arange(1j, 5j)
> array([], dtype=complex128)
> >>> N.arange(1j, 5j, 1j)
> array([], dtype=complex128)
>
> Currently, the real direction is determined to be of zero length
> (multiarraymodule.c _calc_length), and the length of the array is the
> minimal length. I can understand the first one not working (the default
> step is 1, and it takes 0 of those to cover this range), but the second
> seems like a bug.

arange() is pretty much only defined for real numbers. For general z0,
z1, and dz, there is no guarantee that (z1 - z0)/dz is a real number as
it needs to be.
dz may point in a different direction than (z1-z0). For example, what
should arange(1j, 5j, 1) do? Numeric raises an exception here, and I
think numpy should, too.

Of course, linspace() is generally preferred for floating point types,
regardless of whether complex or real. It's difficult to predetermine
whether the stop value will be included or not. And since you give it a
count instead of a step size, we can guarantee that the step does lie
along the vector (z1-z0).

In [8]: linspace(1j, 5j, 5)
Out[8]: array([ 0.+1.j,  0.+2.j,  0.+3.j,  0.+4.j,  0.+5.j])

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma
that is made terrible by our own mad attempt to interpret it as though it had
an underlying truth."
  -- Umberto Eco

From Chris.Barker at noaa.gov  Thu Feb  1 20:02:37 2007
From: Chris.Barker at noaa.gov (Christopher Barker)
Date: Thu, 01 Feb 2007 17:02:37 -0800
Subject: [Numpy-discussion] New may_share_memory function
In-Reply-To: <45C28D29.4060902@ee.byu.edu>
References: <45C28D29.4060902@ee.byu.edu>
Message-ID: <45C28DAD.7070201@noaa.gov>

thanks Travis,

Now I just need to remember it's there when I need it!

-Chris

-- 
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov

From robert.kern at gmail.com  Thu Feb  1 20:06:03 2007
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 01 Feb 2007 19:06:03 -0600
Subject: [Numpy-discussion] classmethods for ndarray
In-Reply-To: <45C28CA9.7060704@noaa.gov>
References: <45C27C68.7010306@ee.byu.edu> <45C282D7.50901@gmail.com>
	<45C28CA9.7060704@noaa.gov>
Message-ID: <45C28E7B.9050405@gmail.com>

Christopher Barker wrote:
> Sebastian Haase wrote:
>
>> Could you explain what a possible downside of this would be !?
>> It seems that if you don't need to refer to a specific "self" object,
>> a class-method is what it should be - is this not always right !?
>
> Well, what these really are are alternate constructors. I don't think
> I've seen class methods used that way, but then I haven't seen them used
> much at all.

Alternate constructors is probably the primary use case for class methods
that I've seen. It's certainly the most frequent reason I've made them.

> Sometimes I have wished for an overloaded constructor, i.e.:
>
> array(SomeBuffer)
>
> results in the same thing as
>
> frombuffer(SomeBuffer)
>
> but Python doesn't really "do" overloaded methods, and there are some
> times when there wouldn't be only one way the input could be interpreted.

Well, array() is already very, very overloaded. That's why it's difficult
to use sometimes. BTW, you might want to check out the module
simplegeneric for a good way to implement certain kinds of overloading.
Just not numpy.array(), please ;-).

http://cheeseshop.python.org/pypi/simplegeneric/

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma
that is made terrible by our own mad attempt to interpret it as though it had
an underlying truth."
  -- Umberto Eco

From tim.hochberg at ieee.org  Thu Feb  1 20:20:55 2007
From: tim.hochberg at ieee.org (Timothy Hochberg)
Date: Thu, 1 Feb 2007 18:20:55 -0700
Subject: [Numpy-discussion] Complex arange
In-Reply-To: <45C28D5F.70109@gmail.com>
References: <45C28D5F.70109@gmail.com>
Message-ID: 

On 2/1/07, Robert Kern wrote:
>
> Russel Howe wrote:
>
> (It's good to see so many Rudds seeing sense and using Python and
> numpy. ;-))

rudds! Here?
Dear me.

-- 
//=][=\\

tim.hochberg at ieee.org
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From russel at appliedminds.net  Thu Feb  1 20:34:38 2007
From: russel at appliedminds.net (Russel Howe)
Date: Thu, 1 Feb 2007 17:34:38 -0800
Subject: [Numpy-discussion] Complex arange
In-Reply-To: <45C28D5F.70109@gmail.com>
References: <45C28D5F.70109@gmail.com>
Message-ID: 

> arange(1j, 5j, 1) do? Numeric raises an exception here, and I think
> numpy should, too.

The same as arange(1, 5, 1j) - an empty array, since it takes 0 of the
step to cross the distance. But something like arange(1j, 5j, 1j) seems
fine. As does arange(1j, 3+5j, 2+1j), which should give [ 1j, 2+2j ].
The idea is to walk by step up to the edge of the box. I seem to recall
a discussion of why this was a bad idea a while ago on this list, but I
can't find it...

The exception is a good answer too, but it should probably happen for
all complex arguments, since most seem to return an empty array now.

Russel
they're all fine hovses.

From robert.kern at gmail.com  Thu Feb  1 20:47:33 2007
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 01 Feb 2007 19:47:33 -0600
Subject: [Numpy-discussion] Complex arange
In-Reply-To: 
References: <45C28D5F.70109@gmail.com>
Message-ID: <45C29835.70207@gmail.com>

Russel Howe wrote:
>> arange(1j, 5j, 1) do? Numeric raises an exception here, and I think
>> numpy should, too.
>
> The same as arange(1, 5, 1j) - an empty array, since it takes 0 of the
> step to cross the distance.

I'm not sure that's really the answer. I think it's simply not defined.
No number of steps (which is different than 0 steps) along the imaginary
axis will take 1+0j to 5+0j.

> But something like arange(1j, 5j, 1j) seems
> fine. As does arange(1j, 3+5j, 2+1j), which should give [ 1j, 2+2j ].
> The idea is to walk by step up to the edge of the box. I seem to recall
> a discussion of why this was a bad idea a while ago on this list, but I
> can't find it...

Box? *Aaah*! You're looking at z1 placing distinct upper (lower) bounds
on the real and imaginary parts rather than specifying a point target.
That's a...unique perspective. ;-)

But then, I'm of the opinion that arange() should be reserved for
integers, and the other use cases are better served by linspace()
instead.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma
that is made terrible by our own mad attempt to interpret it as though it had
an underlying truth."
  -- Umberto Eco

From aisaac at american.edu  Thu Feb  1 22:23:27 2007
From: aisaac at american.edu (Alan G Isaac)
Date: Thu, 1 Feb 2007 22:23:27 -0500
Subject: [Numpy-discussion] [SciPy-user] Release 0.6.1 of pyaudio,
	renamed pyaudiolab
In-Reply-To: <45C0A076.2090805@ar.media.kyoto-u.ac.jp>
References: <45C0A076.2090805@ar.media.kyoto-u.ac.jp>
Message-ID: 

On Wed, 31 Jan 2007, David Cournapeau apparently wrote:
> With pyaudiolab, you should be able to read and write most
> common audio files from and to numpy arrays. The
> underlying IO operations are done using libsndfile from
> Erik Castro Lopo (http://www.mega-nerd.com/libsndfile/)

I think it is worth mentioning (on this list) that
pyaudiolab uses the SciPy license and libsndfile is LGPL.
Cheers,
Alan Isaac

From david at ar.media.kyoto-u.ac.jp  Thu Feb  1 22:50:42 2007
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Fri, 02 Feb 2007 12:50:42 +0900
Subject: [Numpy-discussion] [SciPy-user] Release 0.6.1 of pyaudio,
	renamed pyaudiolab
In-Reply-To: 
References: <45C0A076.2090805@ar.media.kyoto-u.ac.jp>
Message-ID: <45C2B512.9070003@ar.media.kyoto-u.ac.jp>

Alan G Isaac wrote:
> On Wed, 31 Jan 2007, David Cournapeau apparently wrote:
>> With pyaudiolab, you should be able to read and write most
>> common audio files from and to numpy arrays. The
>> underlying IO operations are done using libsndfile from
>> Erik Castro Lopo (http://www.mega-nerd.com/libsndfile/)
>
> I think it is worth mentioning (on this list) that
> pyaudiolab uses the SciPy license and libsndfile is LGPL.

Indeed, I forgot to mention this fact in the announcement. It is
mentioned somewhere in the source, but it should be done better. It is
the only reason that pyaudiolab is not part of scipy.

Your post made me realize that I actually didn't look at how to apply
the license correctly, which is not good at all (that's the first
project I started from scratch). I will change that.

David

From charlesr.harris at gmail.com  Thu Feb  1 23:07:50 2007
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Thu, 1 Feb 2007 21:07:50 -0700
Subject: [Numpy-discussion] Different results from repeated calculation
In-Reply-To: 
References: <20070127210641.GA12685@zaphod.lagged.za.net>
	<20070127223756.GC5742@mentat.za.net>
	<45BBE3C7.3000207@gmail.com>
Message-ID: 

On 1/28/07, Keith Goodman wrote:
> On 1/27/07, Keith Goodman wrote:
> > On 1/27/07, Fernando Perez wrote:
> > > It's definitely looking like something SMP related: on my laptop, with
> > > everything other than the hardware being identical (Linux distro,
> > > kernel, numpy build, etc), I can't make it fail no matter how I muck
> > > with it. I always get '0 differences'.
> > >
> > > The desktop is a dual-core AMD Athlon as indicated before, the laptop
> > > is an oldie Pentium III. They both run the same SMP-aware Ubuntu i686
> > > kernel, since Ubuntu now ships a unified kernel, though obviously on
> > > the laptop the SMP code isn't active.
> >
> > After installing a kernel that is not smp aware, I still have the same
> > problem.
>
> The problem goes away if I remove atlas (atlas3-sse2 for me). But that
> just introduces another problem: slowness.

This problem may be related to this bug:
http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=279294

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From kwgoodman at gmail.com  Thu Feb  1 23:46:12 2007
From: kwgoodman at gmail.com (Keith Goodman)
Date: Thu, 1 Feb 2007 20:46:12 -0800
Subject: [Numpy-discussion] Different results from repeated calculation
In-Reply-To: 
References: <20070127223756.GC5742@mentat.za.net>
	<45BBE3C7.3000207@gmail.com>
Message-ID: 

On 2/1/07, Charles R Harris wrote:
> This problem may be related to this bug:
> http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=279294

It says it is fixed in libc6 2.3.5. I'm on 2.3.6. But do you think it
is something similar?

A port to Octave of the test script works fine on the same system.
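(The test script itself is not shown in this part of the thread. As a
rough sketch of the kind of check being discussed -- repeat the same
ATLAS-backed matrix product and count the runs whose result differs from
the first -- something like the following; the seed, shape, and repeat
count here are made-up illustrations, not the original script:)

    import numpy

    numpy.random.seed(0)
    x = numpy.random.rand(500, 500)
    baseline = numpy.dot(x, x)   # later runs should match this bit-for-bit
    diffs = 0
    for i in xrange(100):
        if (numpy.dot(x, x) != baseline).any():
            diffs += 1
    print '%d differences' % diffs

On a healthy setup this prints '0 differences' every time; on the
affected atlas/sse2 setups discussed above, the count comes out nonzero.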
From robert.kern at gmail.com  Thu Feb  1 23:51:14 2007
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 01 Feb 2007 22:51:14 -0600
Subject: [Numpy-discussion] Different results from repeated calculation
In-Reply-To: 
References: <20070127223756.GC5742@mentat.za.net>
	<45BBE3C7.3000207@gmail.com>
Message-ID: <45C2C342.8050906@gmail.com>

Keith Goodman wrote:
> A port to Octave of the test script works fine on the same system.

Are you sure that your Octave port uses ATLAS to do the matrix product?
Could you post your port?

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma
that is made terrible by our own mad attempt to interpret it as though it had
an underlying truth."
  -- Umberto Eco

From charlesr.harris at gmail.com  Thu Feb  1 23:56:30 2007
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Thu, 1 Feb 2007 21:56:30 -0700
Subject: [Numpy-discussion] Different results from repeated calculation
In-Reply-To: 
References: <20070127223756.GC5742@mentat.za.net>
	<45BBE3C7.3000207@gmail.com>
Message-ID: 

On 2/1/07, Keith Goodman wrote:
>
> On 2/1/07, Charles R Harris wrote:
> > This problem may be related to this bug:
> > http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=279294
>
> It says it is fixed in libc6 2.3.5. I'm on 2.3.6. But do you think it
> is something similar?

I do, I am suspicious that the roundoff mode flag is changing state. But
these sorts of bugs are notoriously hard to track down. You did good
isolating it to atlas and sse.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From kwgoodman at gmail.com  Fri Feb  2 00:02:59 2007
From: kwgoodman at gmail.com (Keith Goodman)
Date: Thu, 1 Feb 2007 21:02:59 -0800
Subject: [Numpy-discussion] Different results from repeated calculation
In-Reply-To: <45C2C342.8050906@gmail.com>
References: <45BBE3C7.3000207@gmail.com> <45C2C342.8050906@gmail.com>
Message-ID: 

On 2/1/07, Robert Kern wrote:
> Keith Goodman wrote:
> > A port to Octave of the test script works fine on the same system.
>
> Are you sure that your Octave port uses ATLAS to do the matrix product?
> Could you post your port?

Here's the port. Yes, Octave uses atlas for matrix multiplication.

Maybe the problem is a race condition, and due to timing the outcome is
always the same in Octave...
-------------- next part --------------
A non-text attachment was scrubbed...
Name: repeat.m
Type: text/x-objcsrc
Size: 835 bytes
Desc: not available
URL: 

From fperez.net at gmail.com  Fri Feb  2 03:41:57 2007
From: fperez.net at gmail.com (Fernando Perez)
Date: Fri, 2 Feb 2007 01:41:57 -0700
Subject: [Numpy-discussion] classmethods for ndarray
In-Reply-To: <45C28E7B.9050405@gmail.com>
References: <45C27C68.7010306@ee.byu.edu> <45C282D7.50901@gmail.com>
	<45C28CA9.7060704@noaa.gov> <45C28E7B.9050405@gmail.com>
Message-ID: 

On 2/1/07, Robert Kern wrote:
> Christopher Barker wrote:
> > Sebastian Haase wrote:
> >
> >> Could you explain what a possible downside of this would be !?
> >> It seems that if you don't need to refer to a specific "self" object,
> >> a class-method is what it should be - is this not always right !?
> >
> > Well, what these really are are alternate constructors. I don't think
> > I've seen class methods used that way, but then I haven't seen them used
> > much at all.
>
> Alternate constructors is probably the primary use case for class methods
> that I've seen. It's certainly the most frequent reason I've made them.
Same here (not in any publicly released code), and I happen to find them
handy in that role. Absent truly overloaded constructors à la C++, they
appear to be an acceptable compromise to me.

Cheers,

f

From david.douard at logilab.fr  Fri Feb  2 04:56:12 2007
From: david.douard at logilab.fr (David Douard)
Date: Fri, 2 Feb 2007 10:56:12 +0100
Subject: [Numpy-discussion] large memory address space on Mac OS X (intel)
In-Reply-To: <4E1D1DFE-1FDE-43CF-97AC-F72EFC3D3E3C@noaa.gov>
References: <4E1D1DFE-1FDE-43CF-97AC-F72EFC3D3E3C@noaa.gov>
Message-ID: <20070202095612.GB5353@crater.logilab.fr>

On Thu, Feb 01, 2007 at 01:33:23PM -0600, Louis Wicker wrote:
> Dear list:

Hi,
may I suggest you read this?
http://orange.blender.org/blog/stupid-memory-problems
It's worth a read.

David

>
> I cannot seem to figure out how to create arrays > 2 GB on a Mac Pro
> (using Intel chip and Tiger, 4.8). I have hand-compiled both Python
> 2.5 and numpy 1.0.1, and cannot make arrays bigger than 2 GB. I also
> run out of space if I try to create 3-6 arrays of 1000 MB or so
> (the mem-alloc failure does not seem consistent; it depends on whether
> I am creating them with a "numpy.ones()" call, or creating them on the
> fly by doing math with the other arrays, e.g., c = 4.3*a + 3.1*b).
>
> Is this a numpy issue, or a Python 2.5 issue for the Mac? I have
> tried this on the SGI Altix, and this works fine.
>
> If there is a compile flag to turn on 64 bit support in the Mac
> compile, I would be glad to find out about it. Or do I have to wait
> for Leopard?
>
> Thanks.
>
> Lou Wicker
>
> ----------------------------------------------------------------------------
> | Dr. Louis J. Wicker
> | NSSL/WRDD
> | National Weather Center
> | 120 David L. Boren Boulevard, Norman, OK 73072-7323
> |
> | E-mail: Louis.Wicker at noaa.gov
> | HTTP: www.nssl.noaa.gov/~lwicker
> | Phone: (405) 325-6340
> | Fax: (405) 325-6780
> |
> | "Programming is not just creating strings of instructions
> | for a computer to execute. It's also 'literary' in that you
> | are trying to communicate a program structure to
> | other humans reading the code." - Paul Rubin
> |
> | "Real efficiency comes from elegant solutions, not optimized programs.
> | Optimization is always just a few correctness-preserving
> | transformations away." - Jonathan Sobel
> ----------------------------------------------------------------------------
> |
> | "The contents of this message are mine personally and
> | do not reflect any position of the Government or NOAA."
> |
> ----------------------------------------------------------------------------
>
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

-- 
David Douard                      LOGILAB, Paris (France)
Python, Zope, Plone, Debian training: http://www.logilab.fr/formations
Custom software development: http://www.logilab.fr/services
Scientific computing: http://www.logilab.fr/science
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 189 bytes
Desc: Digital signature
URL: 

From bblais at bryant.edu  Fri Feb  2 05:43:48 2007
From: bblais at bryant.edu (Brian Blais)
Date: Fri, 02 Feb 2007 05:43:48 -0500
Subject: [Numpy-discussion] classmethods for ndarray
In-Reply-To: <45C28AD7.4060305@ee.byu.edu>
References: <45C27C68.7010306@ee.byu.edu> <45C282D7.50901@gmail.com>
	<45C28AD7.4060305@ee.byu.edu>
Message-ID: <45C315E4.3040407@bryant.edu>

Travis Oliphant wrote:
> Sebastian Haase wrote:
>
>> Travis,
>> Could you explain what a possible downside of this would be !?
>
> I can't think of any downsides. I have to understand how class-methods
> are actually implemented, though, before I could comment on speed
> implications of class methods.

Would this break previously saved pickles, so you couldn't load them? I
ran into that problem last year when there was a change to numpy. Is that
something that would happen here?

bb

-- 
-----------------

bblais at bryant.edu
http://web.bryant.edu/~bblais

From faltet at carabos.com  Fri Feb  2 07:57:53 2007
From: faltet at carabos.com (Francesc Altet)
Date: Fri, 2 Feb 2007 13:57:53 +0100
Subject: [Numpy-discussion] Native byteorder representation
Message-ID: <200702021357.55549.faltet@carabos.com>

Hi,

We have been bitten by a small glitch related with the representation of
native byteorders. Here is an example exposing the problem:

>>> numpy.dtype('<i4').byteorder
'='
>>> numpy.dtype('>i4').newbyteorder('little').byteorder
'<'

[the example was run on a little-endian machine]

We thought that the native byteorder was always represented by a '=',
and this is true when you create the type from scratch. But, if you
create a dtype with a different byteorder and then switch to a native
one (in this case, 'little'), the representation of the byteorder
changes to '<' instead of '='.

We can live with this, but IMO it would be better if the final
representation of native byteorders could always be made to read '='.

Thanks,

-- 
>0,0<   Francesc Altet     http://www.carabos.com/
V   V   Cárabos Coop. V.   Enjoy Data
 "-"

From bsouthey at gmail.com  Fri Feb  2 09:14:02 2007
From: bsouthey at gmail.com (Bruce Southey)
Date: Fri, 2 Feb 2007 08:14:02 -0600
Subject: [Numpy-discussion] Different results from repeated calculation
In-Reply-To: 
References: <45BBE3C7.3000207@gmail.com>
Message-ID: 

Hi,
I am curious why I do not see any mention of the compilers and versions
that were used in this thread. Having just finally managed to get SciPy
installed from scratch (but not with atlas), I could see that using
different compilers, versions, or options (especially compiling done at
different times) could be a factor.

Bruce

On 2/1/07, Charles R Harris wrote:
>
> On 2/1/07, Keith Goodman wrote:
> > On 2/1/07, Charles R Harris wrote:
> > > This problem may be related to this bug:
> > > http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=279294
> >
> > It says it is fixed in libc6 2.3.5. I'm on 2.3.6. But do you think it
> > is something similar?
>
> I do, I am suspicious that the roundoff mode flag is changing state. But
> these sorts of bugs are notoriously hard to track down. You did good
> isolating it to atlas and sse.
>
> Chuck
>
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>

From kwgoodman at gmail.com  Fri Feb  2 09:21:17 2007
From: kwgoodman at gmail.com (Keith Goodman)
Date: Fri, 2 Feb 2007 06:21:17 -0800
Subject: [Numpy-discussion] Different results from repeated calculation
In-Reply-To: 
References: <45BBE3C7.3000207@gmail.com>
Message-ID: 

On 2/2/07, Bruce Southey wrote:
> I am curious why I do not see any mention of the compilers and
> versions that were used in this thread. Having just finally managed to
> get SciPy installed from scratch (but not with atlas), I could see
> that using different compilers, versions, or options (especially
> compiling done at different times) could be a factor.

Yeah, good point. I installed 1.0.1 from binary from Debian sid.

Maybe a chart of which configurations have the problem and which don't
would help.

If the problem is ATLAS I don't understand why test1 passes. Could the
loading of the values be the problem and not the multiplication itself?

From sturla at molden.no  Fri Feb  2 11:05:09 2007
From: sturla at molden.no (Sturla Molden)
Date: Fri, 2 Feb 2007 17:05:09 +0100 (CET)
Subject: [Numpy-discussion] Complex arange
In-Reply-To: <45C29835.70207@gmail.com>
References: <45C28D5F.70109@gmail.com> <45C29835.70207@gmail.com>
Message-ID: <1365.129.240.194.142.1170432309.squirrel@webmail.uio.no>

I think arange for complex numbers should work like meshgrid, with the
real and imaginary axis replacing the x and y axis. That would mean
something like this:

def complex_arange(start, end, stride):
    def iscomplex(x):
        if ((type(x) == complex) or (type(x) == complex64)
                                  or (type(x) == complex128)):
            return True
        else:
            return False
    if iscomplex(start) or iscomplex(end) or iscomplex(stride):
        start = complex(start)
        end = complex(end)
        stride = complex(stride)
        ar = arange(start.real, end.real, stride.real)
        ai = arange(start.imag, end.imag, stride.imag)
        rr, ri = meshgrid(ar, ai)
        tmp = rr + 1j*ri
        if tmp.shape[0] == 1 or tmp.shape[1] == 1:
            tmp = tmp.flatten()
        return tmp
    else:
        return arange(start, end, stride)

I think this is a reasonable extension of arange to complex numbers.
Here complex_arange(1j, 5j, 1) throws a ZeroDivisionError, as the stride
for the imaginary part is 0. Observe that complex_arange(1j, 5j, 1j)
throws an exception as well, as the extent of the real part is
arange(0,0,0). But complex_arange(0+1j,1+5j,1+1j) does exist, and so
does complex_arange(0+1j,0+5j,1+1j). But in the case of
complex_arange(0+1j,0+5j,1+1j) the return value is an empty array, as
the extent along the real axis is 0.

Regards,
Sturla Molden

From oliphant at ee.byu.edu  Fri Feb  2 12:22:12 2007
From: oliphant at ee.byu.edu (Travis Oliphant)
Date: Fri, 02 Feb 2007 10:22:12 -0700
Subject: [Numpy-discussion] Native byteorder representation
In-Reply-To: <200702021357.55549.faltet@carabos.com>
References: <200702021357.55549.faltet@carabos.com>
Message-ID: <45C37344.3000504@ee.byu.edu>

Francesc Altet wrote:

>Hi,
>
>We have been bitten by a small glitch related with the representation of
>native byteorders. Here is an example exposing the problem:
>
>>>>numpy.dtype('<i4').byteorder
>'='
>
>>>>numpy.dtype('>i4').newbyteorder('little').byteorder
>'<'
>
This is somewhat inconsistent. But, I'm not sure it's worth changing.

In the second case, you request a "little" byteorder data-type. Keeping
this as '<' seems O.K.
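(As a concrete illustration of the behavior in question -- a small
interactive sketch, assuming a little-endian machine as in Francesc's
example:)

    >>> import numpy
    >>> numpy.dtype('<i4').byteorder   # native spelling is normalized to '='
    '='
    >>> numpy.dtype('>i4').byteorder   # non-native order stays explicit
    '>'
    >>> numpy.dtype('>i4').newbyteorder('little').byteorder  # '<', not '='
    '<'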
One could instead ask why the first example did not report a byte-order
of "<" when that's what was explicitly asked for.

-Travis

From faltet at carabos.com  Fri Feb  2 13:11:11 2007
From: faltet at carabos.com (Francesc Altet)
Date: Fri, 02 Feb 2007 19:11:11 +0100
Subject: [Numpy-discussion] Native byteorder representation
In-Reply-To: <45C37344.3000504@ee.byu.edu>
References: <200702021357.55549.faltet@carabos.com>
	<45C37344.3000504@ee.byu.edu>
Message-ID: <1170439871.2880.4.camel@localhost.localdomain>

On Fri, 02 Feb 2007 at 10:22 -0700, Travis Oliphant wrote:
> Francesc Altet wrote:
>
> >Hi,
> >
> >We have been bitten by a small glitch related with the representation of
> >native byteorders. Here is an example exposing the problem:
> >
> >>>>numpy.dtype('<i4').byteorder
> >'='
> >
> >>>>numpy.dtype('>i4').newbyteorder('little').byteorder
> >'<'
> >
> This is somewhat inconsistent. But, I'm not sure it's worth changing.
>
> In the second case, you request a "little" byteorder data-type. Keeping
> this as '<' seems O.K.
>
> One could instead ask why the first example did not report a byte-order
> of "<" when that's what was explicitly asked for.

Well, just because of the same reason that

numpy.dtype('<i4').byteorder

returns a '=' instead of a '<' (the latter being explicitly set in the
constructor).

I think that returning a '=' whenever the byteorder is the same as the
underlying machine is desirable, because the user can quickly see
whether her data is in native order or not.

From faltet at carabos.com  Fri Feb  2 13:42:24 2007
From: faltet at carabos.com (Francesc Altet)
Date: Fri, 02 Feb 2007 19:42:24 +0100
Subject: [Numpy-discussion] Native byteorder representation
In-Reply-To: <1170439871.2880.4.camel@localhost.localdomain>
References: <200702021357.55549.faltet@carabos.com>
	<45C37344.3000504@ee.byu.edu>
	<1170439871.2880.4.camel@localhost.localdomain>
Message-ID: <1170441744.2880.9.camel@localhost.localdomain>

On Fri, 02 Feb 2007 at 19:11 +0100, Francesc Altet wrote:
> On Fri, 02 Feb 2007 at 10:22 -0700, Travis Oliphant wrote:
> > Francesc Altet wrote:
> >
> > >Hi,
> > >
> > >We have been bitten by a small glitch related with the representation
> > >of native byteorders. Here is an example exposing the problem:
> > >
> > >>>>numpy.dtype('<i4').byteorder
> > >'='
> > >
> > >>>>numpy.dtype('>i4').newbyteorder('little').byteorder
> > >'<'
> > >
> > This is somewhat inconsistent. But, I'm not sure it's worth changing.
> >
> > In the second case, you request a "little" byteorder data-type. Keeping
> > this as '<' seems O.K.
> >
> > One could instead ask why the first example did not report a byte-order
> > of "<" when that's what was explicitly asked for.
>
> Well, just because of the same reason that
>
> numpy.dtype('<i4').byteorder
>
> returns a '=' instead of a '<' (the latter being explicitly set in the
> constructor).

Oops. I was confused about what you were saying here, sorry. Forget this.

> I think that returning a '=' whenever the byteorder is the same as the
> underlying machine is desirable, because the user can quickly see
> whether her data is in native order or not.

I think that this is the only reason I can argue.

-- 
Francesc Altet    |  Be careful about using the following code --
Carabos Coop. V.  |  I've only proven that it works,
www.carabos.com   |  I haven't tested it. -- Donald Knuth

From stefan at sun.ac.za  Fri Feb  2 17:15:13 2007
From: stefan at sun.ac.za (Stefan van der Walt)
Date: Sat, 3 Feb 2007 00:15:13 +0200
Subject: [Numpy-discussion] classmethods for ndarray
In-Reply-To: <45C28B0A.6000001@ee.byu.edu>
References: <45C27C68.7010306@ee.byu.edu> <45C2847B.7080504@noaa.gov>
	<45C28B0A.6000001@ee.byu.edu>
Message-ID: <20070202221513.GA6439@mentat.za.net>

On Thu, Feb 01, 2007 at 05:51:22PM -0700, Travis Oliphant wrote:
> Christopher Barker wrote:
>
> >Travis Oliphant wrote:
> >
> >>I'm thinking that we should have several.
> >>For example all the fromXXX
> >>functions should probably be classmethods
> >>
> >>ndarray.frombuffer
> >>ndarray.fromfile
> >
> >would they still be accessible in their functional form in the numpy
> >namespace?
>
> Yes, until a major revision, at which point they could (if deemed useful)
> be removed after a deprecation warning period.

That would be a happy day. I'd love to see the numpy namespace go on a
diet...

Cheers
Stéfan

From dalcinl at gmail.com  Fri Feb  2 20:28:08 2007
From: dalcinl at gmail.com (Lisandro Dalcin)
Date: Fri, 2 Feb 2007 22:28:08 -0300
Subject: [Numpy-discussion] Requests for NumPy Ports?
In-Reply-To: <45C0D5E7.4@ee.byu.edu>
References: <45BFE7FB.6080000@ee.byu.edu> <45C0CF7A.6040807@ee.byu.edu>
	<45C0D3DF.1070606@noaa.gov> <45C0D5E7.4@ee.byu.edu>
Message-ID: 

On 1/31/07, Travis Oliphant wrote:
> To me this is so obvious that I don't understand the resistance in the
> Python community to the concept.

Indeed, Travis; I have not been following this for some time. Can you
point me to your last proposal? I remember reading about extending the
builtin buffer interface slots.

Just an idea, as a previous step to convince Py-Dev people: perhaps it
is possible to develop a C extension, independent of numpy, implementing
all you proposed about dtype objects and defining a good C-API and
Py-API to deal with all this, and next perhaps market it to people like
gtk-python developers... In any case, you would be contributing to a
community standard which can be used once ready and later adopted by
core Python, as happened with ctypes... What do you think?

-- 
Lisandro Dalcín
---------------
Centro Internacional de Métodos Computacionales en Ingeniería (CIMEC)
Instituto de Desarrollo Tecnológico para la Industria Química (INTEC)
Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET)
PTLC - Güemes 3450, (3000) Santa Fe, Argentina
Tel/Fax: +54-(0)342-451.1594

From mattknox_ca at hotmail.com  Fri Feb  2 20:35:02 2007
From: mattknox_ca at hotmail.com (Matt Knox)
Date: Sat, 3 Feb 2007 01:35:02 +0000 (UTC)
Subject: [Numpy-discussion] classmethods for ndarray
References: <45C27C68.7010306@ee.byu.edu> <45C282D7.50901@gmail.com>
	<45C28AD7.4060305@ee.byu.edu> <45C315E4.3040407@bryant.edu>
Message-ID: 

> Would this break previously saved pickles, so you couldn't load them? I
> ran into that problem last year when there was a change to numpy. Is
> that something that would happen here?
>
> bb

Regardless of whether or not this change will "break pickles", it seems
highly likely to me that other future changes will prevent you from
loading arrays pickled with older versions of numpy and python. That is
one of several reasons why I generally think relying on direct pickling
for storage is not a good idea. Another reason is that loading data
becomes an all-or-nothing proposition: you can't read half of an array
off the disk, for example (not easily, anyway). If you need to store
numpy arrays directly, pytables (www.pytables.org) is the best solution
that I am aware of, and it will (mostly) save you from worrying about
these kinds of issues in the future.
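(For illustration, a minimal sketch of that alternative, written against
the openFile/createArray spelling of the PyTables API from around that
time, and assuming a PyTables version that accepts numpy arrays directly;
the file and node names are made up:)

    import numpy
    import tables

    a = numpy.arange(10)

    # write the array to an HDF5 file instead of pickling it
    h5 = tables.openFile('data.h5', mode='w')
    h5.createArray(h5.root, 'a', a, 'example array')
    h5.close()

    # read it back; partial reads also work, e.g. h5.root.a[:5]
    h5 = tables.openFile('data.h5', mode='r')
    b = h5.root.a.read()
    h5.close()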
- Matt From robert.kern at gmail.com Fri Feb 2 20:40:03 2007 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 02 Feb 2007 19:40:03 -0600 Subject: [Numpy-discussion] classmethods for ndarray In-Reply-To: <45C315E4.3040407@bryant.edu> References: <45C27C68.7010306@ee.byu.edu> <45C282D7.50901@gmail.com> <45C28AD7.4060305@ee.byu.edu> <45C315E4.3040407@bryant.edu> Message-ID: <45C3E7F3.7050709@gmail.com> Brian Blais wrote: > Would this break previously saved pickles, so you couldn't load them? I ran into > that problem last year when there was a change to numpy. Is that something that > would happen here? No. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From mail at stevesimmons.com Sat Feb 3 20:22:37 2007 From: mail at stevesimmons.com (Stephen Simmons) Date: Sun, 04 Feb 2007 12:22:37 +1100 Subject: [Numpy-discussion] array.sum() slower than expected along some array axes? In-Reply-To: <1170439871.2880.4.camel@localhost.localdomain> References: <200702021357.55549.faltet@carabos.com> <45C37344.3000504@ee.byu.edu> <1170439871.2880.4.camel@localhost.localdomain> Message-ID: <45C5355D.1020904@stevesimmons.com> An HTML attachment was scrubbed... URL: From kwgoodman at gmail.com Sat Feb 3 20:47:08 2007 From: kwgoodman at gmail.com (Keith Goodman) Date: Sat, 3 Feb 2007 17:47:08 -0800 Subject: [Numpy-discussion] array.sum() slower than expected along some array axes? In-Reply-To: <45C5355D.1020904@stevesimmons.com> References: <200702021357.55549.faltet@carabos.com> <45C37344.3000504@ee.byu.edu> <1170439871.2880.4.camel@localhost.localdomain> <45C5355D.1020904@stevesimmons.com> Message-ID: On 2/3/07, Stephen Simmons wrote: > Does anyone know why there is an order of magnitude difference > in the speed of numpy's array.sum() function depending on the axis > of the matrix summed? > > To see this, import numpy and create a big array with two rows: > >>> import numpy > >>> a = numpy.ones([2,1000000], 'f4') > > Then using ipython's timeit function: > Time (ms) > sum(a) 20 > a.sum() 9 > a.sum(axis=1) 9 > a.sum(axis=0) 159 > numpy.dot(numpy.ones(a.shape[0], a.dtype), a) 15 > > This last one using a dot product is functionally equivalent > to a.sum(axis=0), suggesting that the slowdown is due to how > indexing is implemented in array.sum(). I don't know how much time this would account for, but a.sum(0) has to create a much larger array than a.sum(1) does. From robert.kern at gmail.com Sat Feb 3 20:52:17 2007 From: robert.kern at gmail.com (Robert Kern) Date: Sat, 03 Feb 2007 19:52:17 -0600 Subject: [Numpy-discussion] array.sum() slower than expected along some array axes? In-Reply-To: References: <200702021357.55549.faltet@carabos.com> <45C37344.3000504@ee.byu.edu> <1170439871.2880.4.camel@localhost.localdomain> <45C5355D.1020904@stevesimmons.com> Message-ID: <45C53C51.3020907@gmail.com> Keith Goodman wrote: > On 2/3/07, Stephen Simmons wrote: >> Does anyone know why there is an order of magnitude difference >> in the speed of numpy's array.sum() function depending on the axis >> of the matrix summed? 
>>
>> To see this, import numpy and create a big array with two rows:
>> >>> import numpy
>> >>> a = numpy.ones([2,1000000], 'f4')
>>
>> Then using ipython's timeit function:
>>                                                Time (ms)
>> sum(a)                                             20
>> a.sum()                                             9
>> a.sum(axis=1)                                       9
>> a.sum(axis=0)                                     159
>> numpy.dot(numpy.ones(a.shape[0], a.dtype), a)      15
>>
>> This last one using a dot product is functionally equivalent
>> to a.sum(axis=0), suggesting that the slowdown is due to how
>> indexing is implemented in array.sum().
>
> I don't know how much time this would account for, but a.sum(0) has to
> create a much larger array than a.sum(1) does.

However, so do sum(a) and numpy.dot().

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma
that is made terrible by our own mad attempt to interpret it as though it had
an underlying truth."
  -- Umberto Eco

From kwgoodman at gmail.com  Sat Feb  3 21:12:39 2007
From: kwgoodman at gmail.com (Keith Goodman)
Date: Sat, 3 Feb 2007 18:12:39 -0800
Subject: [Numpy-discussion] array.sum() slower than expected along some
	array axes?
In-Reply-To: <45C53C51.3020907@gmail.com>
References: <200702021357.55549.faltet@carabos.com>
	<45C37344.3000504@ee.byu.edu>
	<1170439871.2880.4.camel@localhost.localdomain>
	<45C5355D.1020904@stevesimmons.com> <45C53C51.3020907@gmail.com>
Message-ID: 

On 2/3/07, Robert Kern wrote:
> Keith Goodman wrote:
> > On 2/3/07, Stephen Simmons wrote:
> >> Does anyone know why there is an order of magnitude difference
> >> in the speed of numpy's array.sum() function depending on the axis
> >> of the matrix summed?
> >>
> >> To see this, import numpy and create a big array with two rows:
> >> >>> import numpy
> >> >>> a = numpy.ones([2,1000000], 'f4')
> >>
> >> Then using ipython's timeit function:
> >>                                                Time (ms)
> >> sum(a)                                             20
> >> a.sum()                                             9
> >> a.sum(axis=1)                                       9
> >> a.sum(axis=0)                                     159
> >> numpy.dot(numpy.ones(a.shape[0], a.dtype), a)      15
> >>
> >> This last one using a dot product is functionally equivalent
> >> to a.sum(axis=0), suggesting that the slowdown is due to how
> >> indexing is implemented in array.sum().
> >
> > I don't know how much time this would account for, but a.sum(0) has to
> > create a much larger array than a.sum(1) does.
>
> However, so do sum(a) and numpy.dot().

The speed difference across axis 0 and 1 is also seen in Octave and
Matlab (but it is more like a factor of 5). But in those languages
axis=0 is much faster.

And numpy, if I remember, stores arrays in the opposite way from Octave
(by row or column, I forget). So a lot of the speed difference could be
in how the array is stored.

http://velveeta.che.wisc.edu/octave/lists/help-octave/2005/2195
http://velveeta.che.wisc.edu/octave/lists/help-octave/2005/1912
http://velveeta.che.wisc.edu/octave/lists/help-octave/2005/1897

From charlesr.harris at gmail.com  Sat Feb  3 21:19:55 2007
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Sat, 3 Feb 2007 19:19:55 -0700
Subject: [Numpy-discussion] array.sum() slower than expected along some
	array axes?
In-Reply-To: <45C5355D.1020904@stevesimmons.com>
References: <200702021357.55549.faltet@carabos.com>
	<45C37344.3000504@ee.byu.edu>
	<1170439871.2880.4.camel@localhost.localdomain>
	<45C5355D.1020904@stevesimmons.com>
Message-ID: 

On 2/3/07, Stephen Simmons wrote:
>
> Hi,
>
> Does anyone know why there is an order of magnitude difference
> in the speed of numpy's array.sum() function depending on the axis
> of the matrix summed?
>
> To see this, import numpy and create a big array with two rows:
> >>> import numpy
> >>> a = numpy.ones([2,1000000], 'f4')
>
> Then using ipython's timeit function:
>                                                Time (ms)
> sum(a)                                             20
> a.sum()                                             9
> a.sum(axis=1)                                       9
> a.sum(axis=0)                                     159
> numpy.dot(numpy.ones(a.shape[0], a.dtype), a)      15
>
> This last one using a dot product is functionally equivalent
> to a.sum(axis=0), suggesting that the slowdown is due to how
> indexing is implemented in array.sum().

In this case it is expected. There are inner and outer loops; in the slow
case the inner loop with its extra code is called 1000000 times, in the
fast case, twice. On the other hand, note this:

In [10]: timeit a[0,:] + a[1,:]
100 loops, best of 3: 19.7 ms per loop

Which has only one loop. Caching could also be a problem, but in this
case it is dominated by loop overhead.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From charlesr.harris at gmail.com  Sat Feb  3 21:28:19 2007
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Sat, 3 Feb 2007 19:28:19 -0700
Subject: [Numpy-discussion] array.sum() slower than expected along some
	array axes?
In-Reply-To: 
References: <200702021357.55549.faltet@carabos.com>
	<45C37344.3000504@ee.byu.edu>
	<1170439871.2880.4.camel@localhost.localdomain>
	<45C5355D.1020904@stevesimmons.com>
Message-ID: 

On 2/3/07, Charles R Harris wrote:
>
> On 2/3/07, Stephen Simmons wrote:
> >
> > Hi,
> >
> > Does anyone know why there is an order of magnitude difference
> > in the speed of numpy's array.sum() function depending on the axis
> > of the matrix summed?
> >
> > To see this, import numpy and create a big array with two rows:
> > >>> import numpy
> > >>> a = numpy.ones([2,1000000], 'f4')
> >
> > Then using ipython's timeit function:
> >                                                Time (ms)
> > sum(a)                                             20
> > a.sum()                                             9
> > a.sum(axis=1)                                       9
> > a.sum(axis=0)                                     159
> > numpy.dot(numpy.ones(a.shape[0], a.dtype), a)      15
> >
> > This last one using a dot product is functionally equivalent
> > to a.sum(axis=0), suggesting that the slowdown is due to how
> > indexing is implemented in array.sum().
>
> In this case it is expected. There are inner and outer loops; in the slow
> case the inner loop with its extra code is called 1000000 times, in the
> fast case, twice. On the other hand, note this:
>
> In [10]: timeit a[0,:] + a[1,:]
> 100 loops, best of 3: 19.7 ms per loop
>
> Which has only one loop. Caching could also be a problem, but in this
> case it is dominated by loop overhead.

PS, I think this indicates that the code would run faster in this case if
it accumulated along the last axis, one at a time for each leading index.
I suspect that the current implementation accumulates down the first
axis, then repeats for each of the last indices. This shows that
rearranging the way the accumulation is done could be a big gain,
especially if the largest axis is chosen.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From mail at stevesimmons.com  Sat Feb  3 21:29:45 2007
From: mail at stevesimmons.com (Stephen Simmons)
Date: Sun, 04 Feb 2007 13:29:45 +1100
Subject: [Numpy-discussion] array.sum() slower than expected along some
	array axes?
In-Reply-To: 
References: <200702021357.55549.faltet@carabos.com>
	<45C37344.3000504@ee.byu.edu>
	<1170439871.2880.4.camel@localhost.localdomain>
	<45C5355D.1020904@stevesimmons.com>
Message-ID: <45C54519.4070109@stevesimmons.com>

Charles R Harris wrote:
>
> On 2/3/07, *Stephen Simmons* wrote:
>
>     Hi,
>
>     Does anyone know why there is an order of magnitude difference
>     in the speed of numpy's array.sum() function depending on the axis
>     of the matrix summed?
>
>     To see this, import numpy and create a big array with two rows:
>     >>> import numpy
>     >>> a = numpy.ones([2,1000000], 'f4')
>
>     Then using ipython's timeit function:
>                                                    Time (ms)
>     sum(a)                                             20
>     a.sum()                                             9
>     a.sum(axis=1)                                       9
>     a.sum(axis=0)                                     159
>     numpy.dot(numpy.ones(a.shape[0], a.dtype), a)      15
>
>     This last one using a dot product is functionally equivalent
>     to a.sum(axis=0), suggesting that the slowdown is due to how
>     indexing is implemented in array.sum().
>
> In this case it is expected. There are inner and outer loops, in the
> slow case the inner loop with its extra code is called 1000000 times,
> in the fast case, twice. On the other hand, note this:
>
> In [10]: timeit a[0,:] + a[1,:]
> 100 loops, best of 3: 19.7 ms per loop
>
> Which has only one loop. Caching could also be a problem, but in this
> case it is dominated by loop overhead.
>
> Chuck

I agree that summing along the longer axis is most probably slower
because it makes more passes through the inner loop.

The question though is whether all of the inner loop's overhead is
necessary. My counterexample using numpy.dot() suggests there's
considerable scope for improvement, at least for certain common cases.

From robert.kern at gmail.com  Sat Feb  3 21:34:38 2007
From: robert.kern at gmail.com (Robert Kern)
Date: Sat, 03 Feb 2007 20:34:38 -0600
Subject: [Numpy-discussion] array.sum() slower than expected along some
	array axes?
In-Reply-To: <45C54519.4070109@stevesimmons.com>
References: <200702021357.55549.faltet@carabos.com>
	<45C37344.3000504@ee.byu.edu>
	<1170439871.2880.4.camel@localhost.localdomain>
	<45C5355D.1020904@stevesimmons.com>
	<45C54519.4070109@stevesimmons.com>
Message-ID: <45C5463E.9030000@gmail.com>

Stephen Simmons wrote:

> The question though is whether all of the inner loop's overhead is
> necessary.
> My counterexample using numpy.dot() suggests there's considerable scope
> for improvement, at least for certain common cases.

Well, yes. You most likely have an ATLAS-accelerated dot(). The ATLAS
developers put a lot of work into making matrix products really fast.
However, they did so at a cost: different architectures use different
code. That's not really something we can do in the core of numpy without
making numpy as difficult to build as ATLAS is.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma
that is made terrible by our own mad attempt to interpret it as though it had
an underlying truth."
  -- Umberto Eco

From provos at citi.umich.edu  Sun Feb  4 13:18:00 2007
From: provos at citi.umich.edu (Niels Provos)
Date: Sun, 4 Feb 2007 10:18:00 -0800
Subject: [Numpy-discussion] python eats memory like the cookie monster
	eats cookies
Message-ID: <850f7cbe0702041018s27f64d44n4f0f77245654eefb@mail.gmail.com>

Good morning,

not sure if I got the right list, but I hope that somebody here will
be able to shed some light on a Python-related memory problem. The
following code eats over >2GB of memory and fails with MemoryError
after just a few iterations.
def ZeroPadData(A, shape): a = Numeric.zeros(shape, 'w') a.savespace() for y in xrange(A.shape[0]): for x in xrange(A.shape[1]): a[y, x] = A[y, x] return a def EatMemoryLikeTheCookieMonster(limit=10): A = Numeric.ones([1998, 3022]) count = 0 a = A while count < limit: print count count += 1 a = ZeroPadData(a, [2048, 4096]) b = fft2(a) b = ifft2(b) a = b[:1998,:3022].real EatMemoryLikeTheCookieMonster() This is for Python 2.4.3 on Mac OS X 10.4.8 (intel) using SciPy 0.5.2. If anyone could enlighten me about what I am doing wrong, I would very much appreciate it. Thank you, Niels. From haase at msg.ucsf.edu Sun Feb 4 14:37:49 2007 From: haase at msg.ucsf.edu (Sebastian Haase) Date: Sun, 4 Feb 2007 11:37:49 -0800 Subject: [Numpy-discussion] array.sum() slower than expected along some array axes? In-Reply-To: <45C5463E.9030000@gmail.com> References: <200702021357.55549.faltet@carabos.com> <45C37344.3000504@ee.byu.edu> <1170439871.2880.4.camel@localhost.localdomain> <45C5355D.1020904@stevesimmons.com> <45C54519.4070109@stevesimmons.com> <45C5463E.9030000@gmail.com> Message-ID: On 2/3/07, Robert Kern wrote: > Stephen Simmons wrote: > > > The question though is whether all of the inner loop's overhead is > > necessary. > > My counterexample using numpy.dot() suggests there's considerable scope > > for improvement, at least for certain common cases. > > Well, yes. You most likely have an ATLAS-accelerated dot(). The ATLAS put a lot > of work into making matrix products really fast. However, they did so at a cost: > different architectures use different code. That's not really something we can > do in the core of numpy without making numpy as difficult to build as ATLAS is. > Maybe this argument could be inverted: maybe numpy could check if ATLAS is installed and automatically switch to the numpy.dot(numpy.ones(a.shape[0], a.dtype), a) variant that Stephen suggested. Of course -- as I see it -- the numpy.ones(...) part requires lots of extra memory. Maybe there are other downsides ... !? -Sebastian From kwgoodman at gmail.com Sun Feb 4 15:00:26 2007 From: kwgoodman at gmail.com (Keith Goodman) Date: Sun, 4 Feb 2007 12:00:26 -0800 Subject: [Numpy-discussion] array.sum() slower than expected along some array axes? In-Reply-To: References: <200702021357.55549.faltet@carabos.com> <45C37344.3000504@ee.byu.edu> <1170439871.2880.4.camel@localhost.localdomain> <45C5355D.1020904@stevesimmons.com> <45C54519.4070109@stevesimmons.com> <45C5463E.9030000@gmail.com> Message-ID: On 2/4/07, Sebastian Haase wrote: > On 2/3/07, Robert Kern wrote: > > Stephen Simmons wrote: > > > > > The question though is whether all of the inner loop's overhead is > > > necessary. > > > My counterexample using numpy.dot() suggests there's considerable scope > > > for improvement, at least for certain common cases. > > > > Well, yes. You most likely have an ATLAS-accelerated dot(). The ATLAS put a lot > > of work into making matrix products really fast. However, they did so at a cost: > > different architectures use different code. That's not really something we can > > do in the core of numpy without making numpy as difficult to build as ATLAS is. > > > Maybe this argument could be inverted: > maybe numpy could check if ATLAS is installed and automatically switch to the > numpy.dot(numpy.ones(a.shape[0], a.dtype), a) > variant that Stephen suggested. > > Of course -- as I see it -- the numpy.ones(...) part requires lots of > extra memory. Maybe there are other downsides ... !? 
I use multiplication instead of sum in heavily used loops. I'm often able to predefine the ones outside the loop. In Octave I made my own sum functions---separate ones for axis 0 and 1---that use multiplication. Maybe it is better to make a new function rather than complicate the existing one. From robert.kern at gmail.com Sun Feb 4 15:20:52 2007 From: robert.kern at gmail.com (Robert Kern) Date: Sun, 04 Feb 2007 14:20:52 -0600 Subject: [Numpy-discussion] python eats memory like the cookie monster eats cookies In-Reply-To: <850f7cbe0702041018s27f64d44n4f0f77245654eefb@mail.gmail.com> References: <850f7cbe0702041018s27f64d44n4f0f77245654eefb@mail.gmail.com> Message-ID: <45C64024.3000305@gmail.com> Niels Provos wrote: > Good morning, > > not sure if I got the right list, but I hope that somebody here will > be able to shed some light on a Python-related memory problem. The > following code eats over 2GB of memory and fails with MemoryError > after just a few iterations. > > def ZeroPadData(A, shape): > a = Numeric.zeros(shape, 'w') > a.savespace() > > for y in xrange(A.shape[0]): > for x in xrange(A.shape[1]): > a[y, x] = A[y, x] > > return a > > def EatMemoryLikeTheCookieMonster(limit=10): > A = Numeric.ones([1998, 3022]) > > count = 0 > a = A > while count < limit: > print count > count += 1 > > a = ZeroPadData(a, [2048, 4096]) > > b = fft2(a) > b = ifft2(b) > > a = b[:1998,:3022].real > > EatMemoryLikeTheCookieMonster() > > This is for Python 2.4.3 on Mac OS X 10.4.8 (intel) using SciPy 0.5.2. Could you also post a complete example? Why are you using Numeric? scipy 0.5.2 requires numpy, not Numeric. Where are the fft2() and ifft2() functions coming from, scipy.fftpack or numpy? -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From provos at citi.umich.edu Sun Feb 4 16:33:05 2007 From: provos at citi.umich.edu (Niels Provos) Date: Sun, 4 Feb 2007 13:33:05 -0800 Subject: [Numpy-discussion] python eats memory like the cookie monster eats cookies In-Reply-To: <45C64024.3000305@gmail.com> References: <850f7cbe0702041018s27f64d44n4f0f77245654eefb@mail.gmail.com> <45C64024.3000305@gmail.com> Message-ID: <850f7cbe0702041333j28812749j693737291b9bbc3e@mail.gmail.com> The missing imports are import Numeric # for zeros and ones from scipy.fftpack import fft2,ifft2 Curiously, replacing Numeric.zeros with scipy.zeros makes the problem go away. Why? Thank you, Niels. On 2/4/07, Robert Kern wrote: > Niels Provos wrote: > > Good morning, > > > > not sure if I got the right list, but I hope that somebody here will > > be able to shed some light on a Python-related memory problem. The > > following code eats over 2GB of memory and fails with MemoryError > > after just a few iterations. > > > > def ZeroPadData(A, shape): > > a = Numeric.zeros(shape, 'w') > > a.savespace() > > > > for y in xrange(A.shape[0]): > > for x in xrange(A.shape[1]): > > a[y, x] = A[y, x] > > > > return a > > > > def EatMemoryLikeTheCookieMonster(limit=10): > > A = Numeric.ones([1998, 3022]) > > > > count = 0 > > a = A > > while count < limit: > > print count > > count += 1 > > > > a = ZeroPadData(a, [2048, 4096]) > > > > b = fft2(a) > > b = ifft2(b) > > > > a = b[:1998,:3022].real > > > > EatMemoryLikeTheCookieMonster() > > > > This is for Python 2.4.3 on Mac OS X 10.4.8 (intel) using SciPy 0.5.2. > > Could you also post a complete example? Why are you using Numeric? 
scipy 0.5.2 > requires numpy, not Numeric. Where are the fft2() and ifft2() functions coming > from, scipy.fftpack or numpy? > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless enigma > that is made terrible by our own mad attempt to interpret it as though it had > an underlying truth." > -- Umberto Eco > From robert.kern at gmail.com Sun Feb 4 16:34:04 2007 From: robert.kern at gmail.com (Robert Kern) Date: Sun, 04 Feb 2007 15:34:04 -0600 Subject: [Numpy-discussion] python eats memory like the cookie monster eats cookies In-Reply-To: <850f7cbe0702041333j28812749j693737291b9bbc3e@mail.gmail.com> References: <850f7cbe0702041018s27f64d44n4f0f77245654eefb@mail.gmail.com> <45C64024.3000305@gmail.com> <850f7cbe0702041333j28812749j693737291b9bbc3e@mail.gmail.com> Message-ID: <45C6514C.7020800@gmail.com> Niels Provos wrote: > The missing imports are > > import Numeric # for zeros and ones > from scipy.fftpack import fft2,ifft2 > > Curiously, replacing Numeric.zeros with scipy.zeros makes the problem > go away. Why? Possibly a bug in Numeric. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From gnata at obs.univ-lyon1.fr Sun Feb 4 17:52:46 2007 From: gnata at obs.univ-lyon1.fr (Xavier Gnata) Date: Sun, 04 Feb 2007 23:52:46 +0100 Subject: [Numpy-discussion] numpy.tests() FAILED (errors=3) Message-ID: <45C663BE.1030100@obs.univ-lyon1.fr> Hi, I'm using numpy svn and running numpy.test(). 
The error log is the following: ====================================================================== ERROR: check_complex_bad (numpy.lib.tests.test_type_check.test_nan_to_num) ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/lib/python2.4/site-packages/numpy/lib/tests/test_type_check.py", line 245, in check_complex_bad vals = nan_to_num(v) File "/usr/lib/python2.4/site-packages/numpy/lib/type_check.py", line 132, in nan_to_num are_inf = isposinf(y) File "/usr/lib/python2.4/site-packages/numpy/lib/ufunclike.py", line 33, in isposinf umath.logical_and(isinf(x), ~signbit(x), y) TypeError: function not supported for these types, and can't coerce safely to supported types ====================================================================== ERROR: check_complex_bad2 (numpy.lib.tests.test_type_check.test_nan_to_num) ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/lib/python2.4/site-packages/numpy/lib/tests/test_type_check.py", line 253, in check_complex_bad2 vals = nan_to_num(v) File "/usr/lib/python2.4/site-packages/numpy/lib/type_check.py", line 132, in nan_to_num are_inf = isposinf(y) File "/usr/lib/python2.4/site-packages/numpy/lib/ufunclike.py", line 33, in isposinf umath.logical_and(isinf(x), ~signbit(x), y) TypeError: function not supported for these types, and can't coerce safely to supported types ====================================================================== ERROR: check_complex_good (numpy.lib.tests.test_type_check.test_nan_to_num) ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/lib/python2.4/site-packages/numpy/lib/tests/test_type_check.py", line 238, in check_complex_good vals = nan_to_num(1+1j) File "/usr/lib/python2.4/site-packages/numpy/lib/type_check.py", line 132, in nan_to_num are_inf = isposinf(y) File "/usr/lib/python2.4/site-packages/numpy/lib/ufunclike.py", line 33, in isposinf umath.logical_and(isinf(x), ~signbit(x), y) TypeError: function not supported for these types, and can't coerce safely to supported types ---------------------------------------------------------------------- It seems to be a side effect of a patch, because I can read things like " # !! This is actually (unexpectedly) zero" in /usr/lib/python2.4/site-packages/numpy/lib/tests/test_type_check.py Xavier. -- ############################################ Xavier Gnata CRAL - Observatoire de Lyon 9, avenue Charles André 69561 Saint Genis Laval cedex Phone: +33 4 78 86 85 28 Fax: +33 4 78 86 83 86 E-mail: gnata at obs.univ-lyon1.fr ############################################ From haase at msg.ucsf.edu Sun Feb 4 19:36:04 2007 From: haase at msg.ucsf.edu (Sebastian Haase) Date: Sun, 4 Feb 2007 16:36:04 -0800 Subject: [Numpy-discussion] python eats memory like the cookie monster eats cookies In-Reply-To: <45C6514C.7020800@gmail.com> References: <850f7cbe0702041018s27f64d44n4f0f77245654eefb@mail.gmail.com> <45C64024.3000305@gmail.com> <850f7cbe0702041333j28812749j693737291b9bbc3e@mail.gmail.com> <45C6514C.7020800@gmail.com> Message-ID: On 2/4/07, Robert Kern wrote: > Niels Provos wrote: > > The missing imports are > > > > import Numeric # for zeros and ones > > from scipy.fftpack import fft2,ifft2 > > > > Curiously, replacing Numeric.zeros with scipy.zeros makes the problem > > go away. Why? > > Possibly a bug in Numeric. > > -- > Robert Kern Is there *any* support for old Numeric on this list !? 
Maybe it should be officially stated that the one way to go is numpy and that problems with Numeric (or numarray) can only be noted but will likely not get fixed... -Sebastian From robert.kern at gmail.com Sun Feb 4 19:39:17 2007 From: robert.kern at gmail.com (Robert Kern) Date: Sun, 04 Feb 2007 18:39:17 -0600 Subject: [Numpy-discussion] python eats memory like the cookie monster eats cookies In-Reply-To: References: <850f7cbe0702041018s27f64d44n4f0f77245654eefb@mail.gmail.com> <45C64024.3000305@gmail.com> <850f7cbe0702041333j28812749j693737291b9bbc3e@mail.gmail.com> <45C6514C.7020800@gmail.com> Message-ID: <45C67CB5.1000303@gmail.com> Sebastian Haase wrote: > Is there *any* support for old Numeric on this list !? Not unless you are offering some. > Maybe it should be officially stated that the one way to go is > numpy > and that problems with Numeric ( or numarray ) can only be noticed but > will likely not get fixed.... That's pretty much what we've been officially stating for some time now. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From kwgoodman at gmail.com Sun Feb 4 20:06:34 2007 From: kwgoodman at gmail.com (Keith Goodman) Date: Sun, 4 Feb 2007 17:06:34 -0800 Subject: [Numpy-discussion] numpy.matlib.abs Message-ID: There's a numpy.abs but no numpy.matlib.abs. >> import numpy as N >> import numpy.matlib as M >> >> N.abs? Type: ufunc Base Class: String Form: Namespace: Interactive Docstring: y = absolute(x) takes |x| elementwise. >> M.abs? Object `M.abs` not found. From robert.kern at gmail.com Sun Feb 4 20:12:44 2007 From: robert.kern at gmail.com (Robert Kern) Date: Sun, 04 Feb 2007 19:12:44 -0600 Subject: [Numpy-discussion] numpy.matlib.abs In-Reply-To: References: Message-ID: <45C6848C.4060105@gmail.com> Keith Goodman wrote: > There's a numpy.abs but no numpy.matlib.abs. > >>> import numpy as N >>> import numpy.matlib as M >>> >>> N.abs? > Type: ufunc > Base Class: > String Form: > Namespace: Interactive > Docstring: > y = absolute(x) takes |x| elementwise. > >>> M.abs? > Object `M.abs` not found. numpy.abs() is not exported via "from numpy import *", which is where numpy.matlib gets all of its non-overridden functions from. It is not exported because it conflicts with the builtin abs(). Of course, absolute() is preferred for the same reason, and numpy.matlib.absolute() does exist. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From jeremit0 at gmail.com Sun Feb 4 20:22:44 2007 From: jeremit0 at gmail.com (Jeremy Conlin) Date: Sun, 4 Feb 2007 20:22:44 -0500 Subject: [Numpy-discussion] Please help with subclassing numpy.ndarray Message-ID: <3db594f70702041722m36d3de67t2119b83ceca981a6@mail.gmail.com> I have subclassed the numpy.ndarray object, but need some help setting some attributes. I have read http://scipy.org/Subclasses but it doesn't provide the answer I am looking for. I create an instance of the class in my __new__ method as: import numpy class MyClass(numpy.ndarray): def __new__(self, ...): # Some stuff here H, edges = numpy.histogramdd(...) return H This sets H as the instance of my object. I would also like to have edges be an attribute of MyClass. I can't do: self.edges = edges because the object hasn't been instantiated yet. 
Can someone show me how I can also keep the information from the variable edges? Thanks, Jeremy From kwgoodman at gmail.com Sun Feb 4 20:28:48 2007 From: kwgoodman at gmail.com (Keith Goodman) Date: Sun, 4 Feb 2007 17:28:48 -0800 Subject: [Numpy-discussion] numpy.matlib.abs In-Reply-To: <45C6848C.4060105@gmail.com> References: <45C6848C.4060105@gmail.com> Message-ID: On 2/4/07, Robert Kern wrote: > Keith Goodman wrote: > > There's a numpy.abs but no numpy.matlib.abs. > > > >>> import numpy as N > >>> import numpy.matlib as M > >>> > >>> N.abs? > > Type: ufunc > > Base Class: > > String Form: > > Namespace: Interactive > > Docstring: > > y = absolute(x) takes |x| elementwise. > > > >>> M.abs? > > Object `M.abs` not found. > > numpy.abs() is not exported via "from numpy import *", which is where > numpy.matlib gets all of its non-overridden functions from. It is not exported > because it conflicts with the builtin abs(). > > Of course, absolute() is preferred for the same reason, and > numpy.matlib.absolute() does exist. Could numpy.matlib get the same functions as numpy? Would that have to be done with a manually maintained import list? I always use "import numpy.matlib as M" and then search for function names in ipython (M.a[TAB]). I didn't realize that some functions are missing. From pgmdevlist at gmail.com Sun Feb 4 20:33:01 2007 From: pgmdevlist at gmail.com (Pierre GM) Date: Sun, 4 Feb 2007 20:33:01 -0500 Subject: [Numpy-discussion] Please help with subclassing numpy.ndarray In-Reply-To: <3db594f70702041722m36d3de67t2119b83ceca981a6@mail.gmail.com> References: <3db594f70702041722m36d3de67t2119b83ceca981a6@mail.gmail.com> Message-ID: <200702042033.01767.pgmdevlist@gmail.com> On Sunday 04 February 2007 20:22:44 Jeremy Conlin wrote: > I have subclassed the numpy.ndarray object, but need some help setting > some attributes. I have read http://scipy.org/Subclasses but it > doesn't provide the answer I am looking for. Actually, yes: In the example given in http://scipy.org/Subclasses an attribute 'info' is defined from a class-generic one '__defaultinfo'. Just do the same thing with your 'edges': def __new__(cls,...) ... (H, edges) = numpy.histogramdd(..) cls.__defaultedges = edges def __array_finalize__(self, obj): if not hasattr(self, 'edges'): self.edges = self.__defaultedges That should do the trick. From tim.leslie at gmail.com Mon Feb 5 00:32:43 2007 From: tim.leslie at gmail.com (Tim Leslie) Date: Mon, 5 Feb 2007 16:32:43 +1100 Subject: [Numpy-discussion] r3530 breaks nan_to_num for complex64 arrays In-Reply-To: References: Message-ID: On 2/5/07, Tim Leslie wrote: > Hi All, > > As of svn revision 3530 N.nan_to_num no longer works for arrays of > complex64. The actual error is raised in the signbit function, but I'm > not sure why this is failing. If someone has a quick fix for this > that'd be great, if not I'll lodge a full bug report when I get back > from lunch :-). Just noticed that this has already been reported. 
http://projects.scipy.org/scipy/numpy/ticket/443 Cheers, Tim > > Cheers, > > Tim > > In [1]: import numpy as N > > In [2]: a = N.ones((10, 10, 10), N.complex64) > > In [3]: N.nan_to_num(a) > --------------------------------------------------------------------------- > exceptions.TypeError Traceback (most > recent call last) > > /home/timl/ > > /usr/lib/python2.4/site-packages/numpy/lib/type_check.py in nan_to_num(x) > 130 else: > 131 scalar = False > --> 132 are_inf = isposinf(y) > 133 are_neg_inf = isneginf(y) > 134 are_nan = isnan(y) > > /usr/lib/python2.4/site-packages/numpy/lib/ufunclike.py in isposinf(x, y) > 31 if y is None: > 32 y = empty(x.shape, dtype=nx.bool_) > ---> 33 umath.logical_and(isinf(x), ~signbit(x), y) > 34 return y > 35 > > TypeError: function not supported for these types, and can't coerce > safely to supported types > From robert.kern at gmail.com Mon Feb 5 01:48:19 2007 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 05 Feb 2007 00:48:19 -0600 Subject: [Numpy-discussion] python eats memory like the cookie monster eats cookies In-Reply-To: <850f7cbe0702041018s27f64d44n4f0f77245654eefb@mail.gmail.com> References: <850f7cbe0702041018s27f64d44n4f0f77245654eefb@mail.gmail.com> Message-ID: <45C6D333.7050808@gmail.com> Niels Provos wrote: > Good morning, > > not sure if I got the right list, but I hope that somebody here will > be able to shed some light on a Python-related memory problem. The > following code eats over 2GB of memory and fails with MemoryError > after just a few iterations. Here is a minimal example that demonstrates the problem. It appears that assigning a numpy scalar to an element in an n-D Numeric array (where n is strictly > 1) leaks memory. Neither the data-types of the Numeric array nor the numpy scalar type seem to matter. import Numeric import numpy a = Numeric.zeros([1, 1]) while True: a[0, 0] = numpy.int32(0) -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From klemm at phys.ethz.ch Mon Feb 5 05:20:47 2007 From: klemm at phys.ethz.ch (Hanno Klemm) Date: Mon, 05 Feb 2007 11:20:47 +0100 Subject: [Numpy-discussion] FFT definition Message-ID: Hi there, I have a question regarding the definitions surrounding FFTs. The help to numpy.fft.fft says: >>> help(N.fft.fft) Help on function fft in module numpy.fft.fftpack: fft(a, n=None, axis=-1) fft(a, n=None, axis=-1) Will return the n point discrete Fourier transform of a. n defaults to the length of a. If n is larger than a, then a will be zero-padded to make up the difference. If n is smaller than a, the first n items in a will be used. The packing of the result is "standard": If A = fft(a, n), then A[0] contains the zero-frequency term, A[1:n/2+1] contains the positive-frequency terms, and A[n/2+1:] contains the negative-frequency terms, in order of decreasingly negative frequency. So for an 8-point transform, the frequencies of the result are [ 0, 1, 2, 3, 4, -3, -2, -1]. This is most efficient for n a power of two. This also stores a cache of working memory for different sizes of fft's, so you could theoretically run into memory problems if you call this too many times with too many different n's. 
>>> However, the help to numpy.fft.helper.fftfreq says: >>> help(N.fft.helper.fftfreq) Help on function fftfreq in module numpy.fft.helper: fftfreq(n, d=1.0) fftfreq(n, d=1.0) -> f DFT sample frequencies The returned float array contains the frequency bins in cycles/unit (with zero at the start) given a window length n and a sample spacing d: f = [0,1,...,n/2-1,-n/2,...,-1]/(d*n) if n is even f = [0,1,...,(n-1)/2,-(n-1)/2,...,-1]/(d*n) if n is odd >>> So one claims that the packing goes from [0,1,...,n/2,-n/2+1,..,-1] (fft) and the other one claims the frequencies go from [0,1,...,n/2-1,-n/2,...-1] Is this inconsistent or am I missing something here? Hanno -- Hanno Klemm klemm at phys.ethz.ch From focke at slac.stanford.edu Mon Feb 5 10:27:08 2007 From: focke at slac.stanford.edu (Warren Focke) Date: Mon, 5 Feb 2007 07:27:08 -0800 (PST) Subject: [Numpy-discussion] FFT definition In-Reply-To: References: Message-ID: The frequencies produced by the two recipes are not the same. But the DFT is periodic in both frequency and time. So whether you think that the number in bin n/2 should correspond to frequency n/2 or -n/2, it's the same number. w On Mon, 5 Feb 2007, Hanno Klemm wrote: > > Hi there, > > I have a question regarding the definitions surrounding FFTs. The help > to numpy.fft.fft says: > > >>> help(N.fft.fft) > Help on function fft in module numpy.fft.fftpack: > > fft(a, n=None, axis=-1) > fft(a, n=None, axis=-1) > > Will return the n point discrete Fourier transform of a. n > defaults to the > length of a. If n is larger than a, then a will be zero-padded to > make up > the difference. If n is smaller than a, the first n items in a will be > used. > > The packing of the result is "standard": If A = fft(a, n), then A[0] > contains the zero-frequency term, A[1:n/2+1] contains the > positive-frequency terms, and A[n/2+1:] contains the > negative-frequency > terms, in order of decreasingly negative frequency. So for an 8-point > transform, the frequencies of the result are [ 0, 1, 2, 3, 4, -3, > -2, -1]. > > This is most efficient for n a power of two. This also stores a > cache of > working memory for different sizes of fft's, so you could > theoretically > run into memory problems if you call this too many times with too many > different n's. > > >>> > > However, the help to numpy.fft.helper.fftfreq says: > > >>> help(N.fft.helper.fftfreq) > Help on function fftfreq in module numpy.fft.helper: > > fftfreq(n, d=1.0) > fftfreq(n, d=1.0) -> f > > DFT sample frequencies > > The returned float array contains the frequency bins in > cycles/unit (with zero at the start) given a window length n and a > sample spacing d: > > f = [0,1,...,n/2-1,-n/2,...,-1]/(d*n) if n is even > f = [0,1,...,(n-1)/2,-(n-1)/2,...,-1]/(d*n) if n is odd > > >>> > > So one claims that the packing goes from [0,1,...,n/2,-n/2+1,..,-1] > (fft) and the other one claims the frequencies go from > [0,1,...,n/2-1,-n/2,...-1] > > Is this inconsistent or am I missing something here? 
> > Hanno > > -- > Hanno Klemm > klemm at phys.ethz.ch > From jeremit0 at gmail.com Mon Feb 5 11:32:22 2007 From: jeremit0 at gmail.com (Jeremy Conlin) Date: Mon, 5 Feb 2007 11:32:22 -0500 Subject: [Numpy-discussion] Please help with subclassing numpy.ndarray In-Reply-To: <200702042033.01767.pgmdevlist@gmail.com> References: <3db594f70702041722m36d3de67t2119b83ceca981a6@mail.gmail.com> <200702042033.01767.pgmdevlist@gmail.com> Message-ID: <3db594f70702050832y586069e5rb6186984b904f62@mail.gmail.com> On 2/4/07, Pierre GM wrote: > On Sunday 04 February 2007 20:22:44 Jeremy Conlin wrote: > > I have subclassed the numpy.ndarray object, but need some help setting > > some attributes. I have read http://scipy.org/Subclasses but it > > doesn't provide the answer I am looking for. > > Actually, yes: > In the example given in http://scipy.org/Subclasses an attribute 'info' is > defined from a class-generic one '__defaultinfo'. Just do the same thing with > your 'edges' > > def __new__(cls,...) > ... > (H, edges) = numpy.histogramdd(..) > cls.__defaultedges = edges > > def __array_finalize__(self, obj): > if not hasattr(self, 'edges'): > self.edges = self.__defaultedges > > That should do the trick. Thanks for clarifying that. I didn't understand what __array_finalize__ did. Jeremy From kwgoodman at gmail.com Mon Feb 5 11:45:39 2007 From: kwgoodman at gmail.com (Keith Goodman) Date: Mon, 5 Feb 2007 08:45:39 -0800 Subject: [Numpy-discussion] Memory leak in argsort? Message-ID: This eats up memory quickly on my system. import numpy.matlib as M def memleak(): a = M.randn(500, 1) while True: a = a.argsort(0) From faltet at carabos.com Mon Feb 5 12:04:35 2007 From: faltet at carabos.com (Francesc Altet) Date: Mon, 05 Feb 2007 18:04:35 +0100 Subject: [Numpy-discussion] Memory leak in argsort? In-Reply-To: References: Message-ID: <1170695075.2543.12.camel@localhost.localdomain> On Mon, 05 Feb 2007 at 08:45 -0800, Keith Goodman wrote: > This eats up memory quickly on my system. > > import numpy.matlib as M > > def memleak(): > a = M.randn(500, 1) > while True: > a = a.argsort(0) Yeah, the culprit in this case is argsort(): http://projects.scipy.org/scipy/numpy/ticket/394 Travis fixed this in trunk (but forgot to close the ticket ;) -- Francesc Altet | Be careful about using the following code -- Carabos Coop. V. | I've only proven that it works, www.carabos.com | I haven't tested it. -- Donald Knuth From tim.hochberg at ieee.org Mon Feb 5 12:09:04 2007 From: tim.hochberg at ieee.org (Timothy Hochberg) Date: Mon, 5 Feb 2007 10:09:04 -0700 Subject: [Numpy-discussion] FFT definition In-Reply-To: References: Message-ID: On 2/5/07, Hanno Klemm wrote: [numpy.fft] The packing of the result is "standard": If A = fft(a, n), then A[0] > contains the zero-frequency term, A[1:n/2+1] contains the > positive-frequency terms, and A[n/2+1:] contains the > negative-frequency > terms, in order of decreasingly negative frequency. So for an 8-point > transform, the frequencies of the result are [ 0, 1, 2, 3, 4, -3, > -2, -1]. 
[scipy.fft] f = [0,1,...,n/2-1,-n/2,...,-1]/(d*n) if n is even > f = [0,1,...,(n-1)/2,-(n-1)/2,...,-1]/(d*n) if n is odd > > >>> > > So one claims that the packing goes from [0,1,...,n/2,-n/2+1,..,-1] > (fft) and the other one claims the frequencies go from > [0,1,...,n/2-1,-n/2,...-1] > > Is this inconsistent or am I missing something here? Both, I think. In the even case, the frequency at n/2 is shared by both the positive and negative frequencies, so for that case things are consistent if not terribly clear. For the odd case, this is not true, and the scipy docs look correct in this case, while the numpy docs appear to assign an extra frequency to the positive branch. Of course that's not the one you were complaining about ;-). To be super pedantic, the discrete Fourier transform is periodic, so all of the frequencies can be regarded as positive or negative. That's not generally useful, since the assumptions that go into the DFT that make it periodic don't usually apply to the signal that you are sampling. Then again the results of DFTs are typically either small or silly in the vicinity of N//2. //=][=\\ tim.hochberg at ieee.org From kwgoodman at gmail.com Mon Feb 5 12:20:39 2007 From: kwgoodman at gmail.com (Keith Goodman) Date: Mon, 5 Feb 2007 09:20:39 -0800 Subject: [Numpy-discussion] Memory leak in argsort? In-Reply-To: <1170695075.2543.12.camel@localhost.localdomain> References: <1170695075.2543.12.camel@localhost.localdomain> Message-ID: On 2/5/07, Francesc Altet wrote: > On Mon, 05 Feb 2007 at 08:45 -0800, Keith Goodman wrote: > > This eats up memory quickly on my system. > > > > import numpy.matlib as M > > > > def memleak(): > > a = M.randn(500, 1) > > while True: > > a = a.argsort(0) > > Yeah, the culprit in this case is argsort(): > > http://projects.scipy.org/scipy/numpy/ticket/394 The first page of trac search results for argsort shows the ticket. Now I know to check there first. Will the latest numpy from svn work with matplotlib 0.87.7? From pgmdevlist at gmail.com Mon Feb 5 12:25:00 2007 From: pgmdevlist at gmail.com (Pierre GM) Date: Mon, 5 Feb 2007 12:25:00 -0500 Subject: [Numpy-discussion] Please help with subclassing numpy.ndarray In-Reply-To: <3db594f70702050832y586069e5rb6186984b904f62@mail.gmail.com> References: <3db594f70702041722m36d3de67t2119b83ceca981a6@mail.gmail.com> <200702042033.01767.pgmdevlist@gmail.com> <3db594f70702050832y586069e5rb6186984b904f62@mail.gmail.com> Message-ID: <200702051225.00780.pgmdevlist@gmail.com> On Monday 05 February 2007 11:32:22 Jeremy Conlin wrote: > Thanks for clarifying that. I didn't understand what > __array_finalize__ did. That means I should clarify some points on the wiki, then. A good exercise is to put some temporary comments in your code in __new__ and __array_finalize__, to show when these methods are called and how (that's how I learned) Thinking about it, the example you gave can't work. Your __new__ method returns H, viz, a pure ndarray. There won't be any call to __array_finalize__ in that case, which is not what you want. Force the call by accessing a view of your array: class myhistog(N.ndarray): def __new__(self, iniarray, inibin): (H,edges) = N.histogramdd(iniarray,inibin) self._defedges = edges return H.view(self) Now, you do return a 'myhistog' class, not a pure 'ndarray', and __array_finalize__ is called. 
def __array_finalize__(self, obj): print "__array_finalize__ got %s as %s" % (obj, type(obj)) if not hasattr(self, 'edges'): self.edges = self._defedges myhistog._defedges = None Note the last line: you reset the class default to None (if this is what you want). Otherwise, new 'myhistog' objects will inherit the previous edges. From robert.kern at gmail.com Mon Feb 5 12:26:16 2007 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 05 Feb 2007 11:26:16 -0600 Subject: [Numpy-discussion] Memory leak in argsort? In-Reply-To: References: <1170695075.2543.12.camel@localhost.localdomain> Message-ID: <45C768B8.1030405@gmail.com> Keith Goodman wrote: > Will the latest numpy from svn work with matplotlib 0.87.7? It should. We are committed to backwards compatibility both at the Python level and the C binary level. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From focke at slac.stanford.edu Mon Feb 5 12:53:04 2007 From: focke at slac.stanford.edu (Warren Focke) Date: Mon, 5 Feb 2007 09:53:04 -0800 (PST) Subject: [Numpy-discussion] FFT definition In-Reply-To: References: Message-ID: On Mon, 5 Feb 2007, Timothy Hochberg wrote: > On 2/5/07, Hanno Klemm wrote: > [numpy.fft] > > The packing of the result is "standard": If A = fft(a, n), then A[0] > > contains the zero-frequency term, A[1:n/2+1] contains the > > positive-frequency terms, and A[n/2+1:] contains the > > negative-frequency > > terms, in order of decreasingly negative frequency. So for an 8-point > > transform, the frequencies of the result are [ 0, 1, 2, 3, 4, -3, > > -2, -1]. > > > [scipy.fft] > > > f = [0,1,...,n/2-1,-n/2,...,-1]/(d*n) if n is even > > f = [0,1,...,(n-1)/2,-(n-1)/2,...,-1]/(d*n) if n is odd > > > > >>> > > > > So one claims that the packing goes from [0,1,...,n/2,-n/2+1,..,-1] > > (fft) and the other one claims the frequencies go from > > [0,1,...,n/2-1,-n/2,...-1] > > > > Is this inconsistent or am I missing something here? > > > Both, I think. > > In the even case, the frequency at n/2 is shared by both the positive > and negative frequencies, so for that case things are consistent if not terribly clear. > For the odd case, this is not true, and the scipy docs look correct in this > case, while the numpy docs appear to assign an extra frequency to the > positive branch. Of course that's not the one you were complaining about > ;-). Extra frequency where? (numpy 1.0, debian sarge) >>> n=9 >>> A=arange(n) numpy docs: >>> A[1:n/2+1] array([1, 2, 3, 4]) >>> A[n/2+1:] array([5, 6, 7, 8]) scipy docs: >>> (n-1)/2 4 >>> -(n-1)/2 -4 Note that in the odd-n case, there is no Nyquist term. If F = fft(f), len(f) == 9 then F[-4] != F[4] (F[5] == F[-4] by periodicity in frequency) w > > To be super pedantic, the discrete Fourier transform is periodic, so all of > the frequencies can be regarded as positive or negative. That's not > generally useful, since the assumptions that go into the DFT that make it > periodic don't usually apply to the signal that you are sampling. Then again > the results of DFTs are typically either small or silly in the vicinity of > N//2. 
> > > //=][=\\ > > tim.hochberg at ieee.org > From tim.hochberg at ieee.org Mon Feb 5 13:13:29 2007 From: tim.hochberg at ieee.org (Timothy Hochberg) Date: Mon, 5 Feb 2007 11:13:29 -0700 Subject: [Numpy-discussion] FFT definition In-Reply-To: References: Message-ID: On 2/5/07, Warren Focke wrote: > > > > On Mon, 5 Feb 2007, Timothy Hochberg wrote: > > > On 2/5/07, Hanno Klemm wrote: > > [numpy.fft] > > > > The packing of the result is "standard": If A = fft(a, n), then A[0] > > > contains the zero-frequency term, A[1:n/2+1] contains the > > > positive-frequency terms, and A[n/2+1:] contains the > > > negative-frequency > > > terms, in order of decreasingly negative frequency. So for an > 8-point > > > transform, the frequencies of the result are [ 0, 1, 2, 3, 4, -3, > > > -2, -1]. > > > > > > [scipy.fft] > > > > > > f = [0,1,...,n/2-1,-n/2,...,-1]/(d*n) if n is even > > > f = [0,1,...,(n-1)/2,-(n-1)/2,...,-1]/(d*n) if n is odd > > > > > > >>> > > > > > > So one claims that the packing goes from [0,1,...,n/2,-n/2+1,..,-1] > > > (fft) and the other one claims the frequencies go from > > > [0,1,...,n/2-1,-n/2,...-1] > > > > > > Is this inconsistent or am I missing something here? > > > > > > Both, I think. > > > > In the even case, the frequency at n/2 is shared by both the positive > > and negative frequencies, so for that case things are consistent if not terribly > clear. > > For the odd case, this is not true, and the scipy docs look correct in > this > > case, while the numpy docs appear to assign an extra frequency to the > > positive branch. Of course that's not the one you were complaining about > > ;-). > > Extra frequency where? > > (numpy 1.0, debian sarge) > >>> n=9 > >>> A=arange(n) > > numpy docs: > >>> A[1:n/2+1] > array([1, 2, 3, 4]) > >>> A[n/2+1:] > array([5, 6, 7, 8]) > > scipy docs: > >>> (n-1)/2 > 4 > >>> -(n-1)/2 > -4 > > Note that in the odd-n case, there is no Nyquist term. If > F = fft(f), len(f) == 9 > then F[-4] != F[4] (F[5] == F[-4] by periodicity in frequency) Oops, you're right. I worked this through on paper and just blew it. I suppose I'd look less silly had I actually checked the results using the interpreter. -- //=][=\\ tim.hochberg at ieee.org From jeremit0 at gmail.com Mon Feb 5 13:13:50 2007 From: jeremit0 at gmail.com (Jeremy Conlin) Date: Mon, 5 Feb 2007 13:13:50 -0500 Subject: [Numpy-discussion] Please help with subclassing numpy.ndarray In-Reply-To: <200702051225.00780.pgmdevlist@gmail.com> References: <3db594f70702041722m36d3de67t2119b83ceca981a6@mail.gmail.com> <200702042033.01767.pgmdevlist@gmail.com> <3db594f70702050832y586069e5rb6186984b904f62@mail.gmail.com> <200702051225.00780.pgmdevlist@gmail.com> Message-ID: <3db594f70702051013x181cdc81x6a21dd471657d105@mail.gmail.com> On 2/5/07, Pierre GM wrote: > On Monday 05 February 2007 11:32:22 Jeremy Conlin wrote: > > Thanks for clarifying that. I didn't understand what > > __array_finalize__ did. > > That means I should clarify some points on the wiki, then. > A good exercise is to put some temporary comments in your code in __new__ and > __array_finalize__, to show when these methods are called and how (that's how > I learned) > > Thinking about it, the example you gave can't work. Your __new__ method > returns H, viz, a pure ndarray. There won't be any call to __array_finalize__ > in that case, which is not what you want. 
Force the call by accessing a view > of your array: > > class myhistog(N.ndarray): > def __new__(self, iniarray, inibin): > (H,edges) = N.histogramdd(iniarray,inibin) > self._defedges = edges > return H.view(self) > > Now, you do return a 'myhistog' class, not a pure 'ndarray', and > __array_finalize__ is called. > > def __array_finalize__(self, obj): > print "__array_finalize__ got %s as %s" % (obj, type(obj)) > if not hasattr(self, 'edges'): > self.edges = self._defedges > myhistog._defedges = None > > Note the last line: you reset the class default to None (if this is what you > want). Otherwise, new 'myhistog' objects will inherit the previous edges. Excellent, now it does what I want! But it raises more questions. What exactly is a "view" of H? Thanks again, Jeremy From millman at berkeley.edu Tue Feb 6 01:14:15 2007 From: millman at berkeley.edu (Jarrod Millman) Date: Mon, 5 Feb 2007 22:14:15 -0800 Subject: [Numpy-discussion] Fwd: [Rpy] status of numpy version of rpy In-Reply-To: <0D56E923-C9FE-479C-8552-370BCD0F3EAD@rochester.edu> References: <0D56E923-C9FE-479C-8552-370BCD0F3EAD@rochester.edu> Message-ID: Hey, After reading Travis' email about getting more projects ported to Numpy, I sent an email to the RPy developers asking when they would commit Travis' patch. Anyway, Greg Warnes replied that he would apply it this week (see the forwarded email below). I would encourage anyone who is using a project that hasn't converted to Numpy to send the developers an email letting them know that you want them to move the project to Numpy as soon as possible. Thanks to all the Numpy developers, and in particular to Travis, for working so hard to make Numpy so great. Jarrod ---------- Forwarded message ---------- From: Gregory. R. Warnes Date: Feb 5, 2007 7:33 AM Subject: Re: [Rpy] status of numpy version of rpy To: "RPy help, support and design discussion list" Hi Jarrod, The Numpy patch should be applied sometime this week. -Greg On Feb 3, 2007, at 3:03 AM, Jarrod Millman wrote: > Hello, > > Thanks for all the great work. I am involved with the Neuroimaging in > Python project, which uses numpy. Numpy is very stable and much > better than Numeric. The problem is that some of us are also using > RPy for our data processing. So we are using Travis' numpy patch for > RPy. It seems to work perfectly. This process is acceptable for our > developers, but creates additional installation headaches for our > regular users. > > Do you have a timeline for when you will apply the numpy patch? Is > there anything you need help with before applying the patch? > > Thanks, > > -- > Jarrod Millman > Computational Infrastructure for Research Labs > 10 Giannini Hall, UC Berkeley > phone: 510.643.4014 > http://cirl.berkeley.edu/ 
-- Jarrod Millman Computational Infrastructure for Research Labs 10 Giannini Hall, UC Berkeley phone: 510.643.4014 http://cirl.berkeley.edu/ From millman at berkeley.edu Tue Feb 6 01:41:18 2007 From: millman at berkeley.edu (Jarrod Millman) Date: Mon, 5 Feb 2007 22:41:18 -0800 Subject: [Numpy-discussion] NumPy compatible version of ScientificPython Message-ID: The current development release of ScientificPython (2.7.3) supports NumPy. The release notes are here: http://sourcesup.cru.fr/frs/shownotes.php?release_id=634 I updated the "Porting to NumPy" wiki page: http://www.scipy.org/Porting_to_NumPy -- Jarrod Millman Computational Infrastructure for Research Labs 10 Giannini Hall, UC Berkeley phone: 510.643.4014 http://cirl.berkeley.edu/ From pjssilva at ime.usp.br Tue Feb 6 05:30:26 2007 From: pjssilva at ime.usp.br (Paulo J. S. Silva) Date: Tue, 06 Feb 2007 08:30:26 -0200 Subject: [Numpy-discussion] numpy.matlib.abs In-Reply-To: References: <45C6848C.4060105@gmail.com> Message-ID: <1170757826.21666.8.camel@localhost.localdomain> On Sun, 2007-02-04 at 17:28 -0800, Keith Goodman wrote: > Could numpy.matlib get the same functions as numpy? Would that have to > be done with a manually maintained import list? > I always use "import numpy.matlib as M" and then search for function > names in ipython (M.a[TAB]). I didn't realize that some functions are > missing. As the list knows, I am trying to build a special module that can convert any other module to behave nicely with matrices. I have special interest in using it as an interface to scipy modules that may return arrays when given a matrix. This effort led me to learn some tricks about module imports in Python. I believe that if you add the following code to the end of the matlib.py file it will behave just like you want without any manual intervention: --- Start Python code --- import inspect import matlib as M # N is numpy, which matlib.py already imports at the top of the file for i in dir(N): attribute = getattr(N, i) if type(attribute) is N.ufunc or inspect.isroutine(attribute): try: getattr(M, i) except AttributeError: setattr(M, i, attribute) --- End Python code --- Here is an ipython session: --- ipython session --- In [1]:import numpy.matlib as M In [2]:M.abs Out[2]: --- End of ipython session --- By the way, there were only four functions missing without this code: abs, max, min, and round. You can see this by adding a "print i" in the except block above. If the list thinks this code is useful, I am donating it to numpy. Best, Paulo From sturla at molden.no Tue Feb 6 07:06:37 2007 From: sturla at molden.no (Sturla Molden) Date: Tue, 6 Feb 2007 13:06:37 +0100 (CET) Subject: [Numpy-discussion] Please help with subclassing numpy.ndarray In-Reply-To: <200702042033.01767.pgmdevlist@gmail.com> References: <3db594f70702041722m36d3de67t2119b83ceca981a6@mail.gmail.com> <200702042033.01767.pgmdevlist@gmail.com> Message-ID: <2558.129.240.194.142.1170763597.squirrel@webmail.uio.no> > def __new__(cls,...) > ... > (H, edges) = numpy.histogramdd(..) > cls.__defaultedges = edges > > def __array_finalize__(self, obj): > if not hasattr(self, 'edges'): > self.edges = self.__defaultedges So in order to get an instance attribute, one has to temporarily define it as a class attribute? 
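Schematically, the recipe quoted above amounts to this (my own minimal sketch, untested; the class name is made up):

import numpy

class Hist(numpy.ndarray):
    def __new__(cls, data, bins):
        H, edges = numpy.histogramdd(data, bins)
        # Per-instance data parked on the *class*, visible to all threads.
        cls._defaultedges = edges
        return H.view(cls)

    def __array_finalize__(self, obj):
        # Picked up again here, assuming nothing has changed it in between.
        if not hasattr(self, 'edges'):
            self.edges = self._defaultedges
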
What happens if there is a thread switch between __new__ and __array_finalize__? This design is not thread safe and can produce strange race conditions. IMHO, the preferred way to set an instance attribute is to use the __init__ method, which is the 'Pythonic' way to do it. Sturla Molden From stefan at sun.ac.za Tue Feb 6 07:56:01 2007 From: stefan at sun.ac.za (Stefan van der Walt) Date: Tue, 6 Feb 2007 14:56:01 +0200 Subject: [Numpy-discussion] Please help with subclassing numpy.ndarray In-Reply-To: <2558.129.240.194.142.1170763597.squirrel@webmail.uio.no> References: <3db594f70702041722m36d3de67t2119b83ceca981a6@mail.gmail.com> <200702042033.01767.pgmdevlist@gmail.com> <2558.129.240.194.142.1170763597.squirrel@webmail.uio.no> Message-ID: <20070206125600.GN6274@mentat.za.net> On Tue, Feb 06, 2007 at 01:06:37PM +0100, Sturla Molden wrote: > > > def __new__(cls,...) > > ... > > (H, edges) = numpy.histogramdd(..) > > cls.__defaultedges = edges > > > > def __array_finalize__(self, obj): > > if not hasattr(self, 'edges'): > > self.edges = self.__defaultedges > > IMHO, the preferred way to set an instance attribute is to use the __init__ > method, which is the 'Pythonic' way to do it. I don't pretend to know all the inner workings of subclassing, but I don't think that would work, given the following output: In [1]: import numpy as N In [2]: import numpy as N In [3]: In [3]: class MyArray(N.ndarray): ...: def __new__(cls,data): ...: return N.asarray(data).view(cls) ...: ...: def __init__(self,obj): ...: print "This is where __init__ is called" ...: ...: def __array_finalize__(self,obj): ...: print "This is where __array_finalize__ is called" ...: In [4]: x = MyArray(3) This is where __array_finalize__ is called This is where __init__ is called In [5]: y = N.array([1,2,3]) In [6]: x+y This is where __array_finalize__ is called Out[6]: MyArray([4, 5, 6]) Regards Stéfan From jeremit0 at gmail.com Tue Feb 6 08:14:13 2007 From: jeremit0 at gmail.com (Jeremy Conlin) Date: Tue, 6 Feb 2007 08:14:13 -0500 Subject: [Numpy-discussion] Please help with subclassing numpy.ndarray In-Reply-To: <2558.129.240.194.142.1170763597.squirrel@webmail.uio.no> References: <3db594f70702041722m36d3de67t2119b83ceca981a6@mail.gmail.com> <200702042033.01767.pgmdevlist@gmail.com> <2558.129.240.194.142.1170763597.squirrel@webmail.uio.no> Message-ID: <3db594f70702060514k6c78dcc5lbfa216ff0521450f@mail.gmail.com> On 2/6/07, Sturla Molden wrote: > > > def __new__(cls,...) > > ... > > (H, edges) = numpy.histogramdd(..) > > cls.__defaultedges = edges > > > > def __array_finalize__(self, obj): > > if not hasattr(self, 'edges'): > > self.edges = self.__defaultedges > > So in order to get an instance attribute, one has to temporarily define it > as a class attribute? What happens if there is a thread switch between > __new__ and __array_finalize__? This design is not thread safe and can > produce strange race conditions. > > IMHO, the preferred way to set an instance attribute is to use the __init__ > method, which is the 'Pythonic' way to do it. > > Sturla Molden > Yes, using __init__ to set an instance attribute is the Pythonic way to do this. However, I calculate/create the data in __new__. The data is unavailable to __init__. 
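Schematically (an untested sketch; the names are made up):

import numpy

class MyHist(numpy.ndarray):
    def __new__(cls, data, bins):
        H, edges = numpy.histogramdd(data, bins)
        # 'edges' is a local variable here and is lost when __new__ returns.
        return H.view(cls)

    def __init__(self, data, bins):
        # __init__ receives the same constructor arguments, but not the
        # 'edges' computed inside __new__; I would have to call
        # histogramdd() a second time to recover them.
        pass
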
Jeremy From sturla at molden.no Tue Feb 6 09:01:37 2007 From: sturla at molden.no (Sturla Molden) Date: Tue, 6 Feb 2007 15:01:37 +0100 (CET) Subject: [Numpy-discussion] Please help with subclassing numpy.ndarray In-Reply-To: <3db594f70702060514k6c78dcc5lbfa216ff0521450f@mail.gmail.com> References: <3db594f70702041722m36d3de67t2119b83ceca981a6@mail.gmail.com> <200702042033.01767.pgmdevlist@gmail.com> <2558.129.240.194.142.1170763597.squirrel@webmail.uio.no> <3db594f70702060514k6c78dcc5lbfa216ff0521450f@mail.gmail.com> Message-ID: <2655.129.240.194.142.1170770497.squirrel@webmail.uio.no> > Yes, using __init__ to set an instance attribute is the Pythonic way to > do this. However, I calculate/create the data in __new__. The data > is unavailable to __init__. The signatures of __new__ and __init__ are: def __new__(cls, *args, **kwds) def __init__(self, *args, **kwds) If __new__ has access to the data, __init__ has access to the data as well. But in order for __init__ to be called, __new__ must return an instance of cls. Otherwise, Python leaves the object as returned by __new__. But it remains that the subclassing example is not thread safe. The only way to make it thread safe would be if __new__ sets a global lock and __array_finalize__ releases it. I think NumPy can get away with this because it holds the GIL inside its C extension, but when you subclass ndarray in Python, the GIL is released. From sturla at molden.no Tue Feb 6 09:17:39 2007 From: sturla at molden.no (Sturla Molden) Date: Tue, 6 Feb 2007 15:17:39 +0100 (CET) Subject: [Numpy-discussion] Please help with subclassing numpy.ndarray In-Reply-To: <20070206125600.GN6274@mentat.za.net> References: <3db594f70702041722m36d3de67t2119b83ceca981a6@mail.gmail.com> <200702042033.01767.pgmdevlist@gmail.com> <2558.129.240.194.142.1170763597.squirrel@webmail.uio.no> <20070206125600.GN6274@mentat.za.net> Message-ID: <2663.129.240.194.142.1170771459.squirrel@webmail.uio.no> > I don't pretend to know all the inner workings of subclassing, but I > don't think that would work, given the following output: > In [6]: x+y > This is where __array_finalize__ is called > Out[6]: MyArray([4, 5, 6]) Why is __new__ not called for the return value of x + y? Does it call __new__ for ndarray instead of MyArray? From kwgoodman at gmail.com Tue Feb 6 09:43:48 2007 From: kwgoodman at gmail.com (Keith Goodman) Date: Tue, 6 Feb 2007 06:43:48 -0800 Subject: [Numpy-discussion] numpy.matlib.abs In-Reply-To: <1170757826.21666.8.camel@localhost.localdomain> References: <45C6848C.4060105@gmail.com> <1170757826.21666.8.camel@localhost.localdomain> Message-ID: On 2/6/07, Paulo J. S. Silva wrote: > On Sun, 2007-02-04 at 17:28 -0800, Keith Goodman wrote: > > > Could numpy.matlib get the same functions as numpy? Would that have to > > be done with a manually maintained import list? > > I always use "import numpy.matlib as M" and then search for function > > names in ipython (M.a[TAB]). I didn't realize that some functions are > > missing. > > As the list knows, I am trying to build a special module that can > convert any other module to behave nicely with matrices. I have special > interest in using it as an interface to scipy modules that may return > arrays when given a matrix. This effort led me to learn some tricks > about module imports in Python. 
> > I believe that if you add the following code to the end of the matlib.py > file it will behave just like you want without any manual intervention: > > --- Start Python code --- > > import inspect > import matlib as M > for i in dir(N): > attribute = getattr(N, i) > if type(attribute) is N.ufunc or inspect.isroutine(attribute): > try: > getattr(M, i) > except AttributeError: > setattr(M, i, attribute) > > --- End Python code --- > > Here is an ipython session: > > --- ipython session --- > > In [1]:import numpy.matlib as M > > In [2]:M.abs > Out[2]: > > --- End of ipython session --- > > By the way, there were only four functions missing without this > code: abs, max, min, and round. You can see this by adding a "print i" > in the except block above. > > If the list thinks this code is useful, I am donating it to numpy. That is great. I could think of a few uses for abs, max, min, and round. So I would like to see them imported. BTW, why can I do x.max(), x.min(), and x.round() but not x.abs()? From haase at msg.ucsf.edu Tue Feb 6 17:29:38 2007 From: haase at msg.ucsf.edu (Sebastian Haase) Date: Tue, 6 Feb 2007 14:29:38 -0800 Subject: [Numpy-discussion] memmap on 64bit Linux for > 2 GB files Message-ID: Hi, I finally tried to do the test, to memmap a large file. Filesize: 2.8G. A memmap call gives this error: {{{ >>> N.memmap('20050622-1648-Y_DEMO-1') Traceback (most recent call last): File "", line 1, in ? File "/jws30/haase/PrLinN64/numpy/core/memmap.py", line 67, in __new__ mm = mmap.mmap(fid.fileno(), bytes, access=acc) OverflowError: memory mapped size is too large (limited by C int) }}} I'm using a recent numpy on a 64bit Linux (debian etch, kernel: 2.6.16-2-em64t-p4-smp) {{{ >>> N.__version__ '1.0.2.dev3509' >>> N.int0 }}} Is this supposed to work? Thanks, Sebastian From robert.kern at gmail.com Tue Feb 6 17:35:04 2007 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 06 Feb 2007 16:35:04 -0600 Subject: [Numpy-discussion] memmap on 64bit Linux for > 2 GB files In-Reply-To: References: Message-ID: <45C90298.4060705@gmail.com> Sebastian Haase wrote: > Hi, > I finally tried to do the test, to memmap a large file. > Filesize: 2.8G > > A memmap call gives this error: > > {{{ >>>> N.memmap('20050622-1648-Y_DEMO-1') > Traceback (most recent call last): > File "", line 1, in ? > File "/jws30/haase/PrLinN64/numpy/core/memmap.py", line 67, in __new__ > mm = mmap.mmap(fid.fileno(), bytes, access=acc) > OverflowError: memory mapped size is too large (limited by C int) > }}} > > I'm using a recent numpy on a 64bit Linux (debian etch, kernel: > 2.6.16-2-em64t-p4-smp) > {{{ >>>> N.__version__ > '1.0.2.dev3509' >>>> N.int0 > > }}} > > Is this supposed to work? You need Python 2.5 for it to work. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From seb.haase at gmx.net Tue Feb 6 17:44:26 2007 From: seb.haase at gmx.net (Sebastian Haase) Date: Tue, 6 Feb 2007 14:44:26 -0800 Subject: [Numpy-discussion] memmap on 64bit Linux for > 2 GB files In-Reply-To: <45C90298.4060705@gmail.com> References: <45C90298.4060705@gmail.com> Message-ID: Of course! Now I remember why I didn't test it yet... 
Thanks, -Sebastian On 2/6/07, Robert Kern wrote: > Sebastian Haase wrote: > > Hi, > > I finally tried to do the test, to memmap a large file. > > Filesize: 2.8G > > > > A memmap call gives this error: > > > > {{{ > >>>> N.memmap('20050622-1648-Y_DEMO-1') > > Traceback (most recent call last): > > File "", line 1, in ? > > File "/jws30/haase/PrLinN64/numpy/core/memmap.py", line 67, in __new__ > > mm = mmap.mmap(fid.fileno(), bytes, access=acc) > > OverflowError: memory mapped size is too large (limited by C int) > > }}} > > > > I'm using a recent numpy on a 64bit Linux (debian etch, kernel: > > 2.6.16-2-em64t-p4-smp) > > {{{ > >>>> N.__version__ > > '1.0.2.dev3509' > >>>> N.int0 > > > > }}} > > > > Is this supposed to work? > > You need Python 2.5 for it to work. > From tom.denniston at alum.dartmouth.org Tue Feb 6 19:09:20 2007 From: tom.denniston at alum.dartmouth.org (Tom Denniston) Date: Tue, 6 Feb 2007 18:09:20 -0600 Subject: [Numpy-discussion] bug in numpy.equal? Message-ID: The behavior below seems strange to me. The string array is type S3 yet it says that comparison with 'abc' is not implemented. The == operator seems to work though. Is there a subtlety I am missing or is it simply a bug? In [1]: import numpy In [2]: numpy.equal(numpy.array(['abc', 'def']), 'abc') Out[2]: NotImplemented In [3]: numpy.array(['abc', 'def']) == 'abc' Out[3]: array([ True, False], dtype=bool) In [4]: numpy.equal? Type: ufunc Base Class: String Form: Namespace: Interactive Docstring: y = equal(x1,x2) returns elementwise x1 == x2 in a bool array From gsteele at qualcomm.com Tue Feb 6 19:28:26 2007 From: gsteele at qualcomm.com (Steele, Greg) Date: Tue, 6 Feb 2007 16:28:26 -0800 Subject: [Numpy-discussion] importing multiarraymodule.c in Python embedding Message-ID: <4D7539987C6B3340ADA6BC7AE609F73606FC8F@NAEX14.na.qualcomm.com> I have run into an interesting issue with multiarraymodule.c with regards to embedding Python in a C application on Windows XP. In the application, the following abbreviated sequence was executed 1) Py_Initialize 2) numpy is imported 3) Py_Finalize 4) Py_Initialize 5) attempt to import numpy results in an error. I have tracked this down to two things: 1) Python does not call the FreeLibrary windows API on C extension modules when shutting down 2) multiarraymodule.c has a static variable _multiarray_module_loaded which, when set, bypasses the call to Py_InitModule The error at step 5 above occurs because the multiarraymodule DLL is loaded at step 2, the _multiarray_module_loaded variable is set at step 2, and Py_InitModule is bypassed at step 5. Since Py_InitModule is not called, multiarraymodule is not placed in the module dictionary (i.e. sys.modules) and an error occurs in importdl.h which checks if the module is there. The obvious solution is to not execute the sequence above. This issue does seem to highlight a weakness in the implementation of multiarraymodule.c. Could someone comment on the need for the static variable _multiarray_module_loaded? Is there a more robust way to achieve the goal? Thanks. Greg From mattknox_ca at hotmail.com Tue Feb 6 19:54:32 2007 From: mattknox_ca at hotmail.com (Matt Knox) Date: Tue, 6 Feb 2007 19:54:32 -0500 Subject: [Numpy-discussion] thread safe subclasses of ndarray Message-ID: Sturla Molden brought up a point in an earlier thread ("Please help with subclassing numpy.ndarray") that I think is worth highlighting. 
A "common" approach to subclassing ndarray in Python (http://www.scipy.org/Subclasses) results in code that is not thread safe. This approach involves setting some class-level variables inside the __new__ method, and then retrieving those values inside the __array_finalize__ method. A simple workaround that Sturla alluded to is to explicitly set a lock inside the __new__ method, and later release it in the __array_finalize__ method. However, this would cause problems if someone invoked the __new__ method in such a way that __array_finalize__ was not called afterwards (which I think is possible). The two subclasses of ndarray in the core numpy distribution (that I am aware of), chararray and matrix, don't run into this problem because they can get all the relevant info they need by directly inspecting the ndarray that comes out of the __new__ method, and so they don't need to share any extra info between __new__ and __array_finalize__ So, I am wondering if perhaps one of the experts on this list would be able to shed some light on a good way to create thread safe subclasses of ndarray. Thanks, - Matt Knox From ckkart at hoc.net Wed Feb 7 05:35:14 2007 From: ckkart at hoc.net (Christian) Date: Wed, 7 Feb 2007 10:35:14 +0000 (UTC) Subject: [Numpy-discussion] force column vector Message-ID: Hi, when creating an ndarray from a list, how can I force the result to be 2d *and* a column vector? So in case I pass a nested list, there will be no modification of the shape, and when I pass a simple list, it will be converted to a 2d column vector. I can only think of a solution using 'if' clauses but I suppose there is a more elegant way. Thanks, Christian From bardeau at iram.fr Wed Feb 7 07:51:31 2007 From: bardeau at iram.fr (Sebastien Bardeau) Date: Wed, 07 Feb 2007 13:51:31 +0100 Subject: [Numpy-discussion] importing multiarraymodule.c in Python embedding In-Reply-To: <4D7539987C6B3340ADA6BC7AE609F73606FC8F@NAEX14.na.qualcomm.com> References: <4D7539987C6B3340ADA6BC7AE609F73606FC8F@NAEX14.na.qualcomm.com> Message-ID: <45C9CB53.2020905@iram.fr> Hi, it seems that I have the same trouble, but using Python under Linux: > I have run into an interesting issue with multiarraymodule.c with > regards to embedding Python in a C application on Windows XP. In the > application, the following abbreviated sequence was executed > > > > 1) Py_Initialize > > 2) numpy is imported > > 3) Py_Finalize > > 4) Py_Initialize > > 5) attempt to import numpy results in an error. > As an example I wrote a small C program that embeds Python. The steps are the same as above: 1) Python is initialized (Py_Initialize) 2) Numpy is "manually" imported (import numpy) under an interactive loop (PyRun_InteractiveLoop) 3) Python is finalized (Py_Finalize) 4) Python is initialized again (Py_Initialize) 5) Attempt to import numpy now fails: >>> import numpy Traceback (most recent call last): File "", line 1, in File "/usr/lib/python2.5/site-packages/numpy/__init__.py", line 36, in import core File "/usr/lib/python2.5/site-packages/numpy/core/__init__.py", line 5, in import multiarray SystemError: dynamic module not initialized properly I'm using Numpy 1.0 and Python 2.5 under a Fedora Core 6: Python 2.5 (r25:51908, Nov 15 2006, 14:24:03) [GCC 4.1.1 20061011 (Red Hat 4.1.1-30)] on linux2 I have not searched further why this happens. I embed Python in a larger project and leave the choice to the user to start a Python session if he needs it. 
Numpy is automatically imported right after Python initialization (because it is needed in my context), so if the user starts a session, ends it, and starts a new one, the initialization automatically fails the second time. The workaround I have found for the present is to prevent Python from ever being finalized (i.e. never call Py_Finalize). I hope this can help. Thanks. Sebastien From meesters at uni-mainz.de Wed Feb 7 08:00:52 2007 From: meesters at uni-mainz.de (Christian Meesters) Date: Wed, 7 Feb 2007 14:00:52 +0100 Subject: [Numpy-discussion] getting indices for array positions Message-ID: <200702071400.53315.meesters@uni-mainz.de> Hi This question might seem stupid, but I didn't come up with a clever solution myself, or find one in the archives, the cookbook, etc. If I overlooked something, please give a pointer. Well, if I have a 1D array like

[ 0. , 0.1, 0.2, 0.3, 0.4, 0.5]

and a scalar like 0.122, and want to retrieve the index position of the value closest to the scalar in the array: is there any fast method to get this? Right now I've implemented the following method:

def _get_value_index(value, a):
    mindiff = 1e20
    index = 0
    for intensity, temp_index in zip(a, xrange(a.shape[0])):
        diff = abs(intensity - value)
        # closer to given value?
        if diff <= mindiff:
            mindiff = diff
            index = temp_index
    return index

It works, but it is awkward and takes too much time (I have no benchmark) if the array is long and the method is called often from within another function. But it should help clarify the problem. TIA Christian From cimrman3 at ntc.zcu.cz Wed Feb 7 08:08:42 2007 From: cimrman3 at ntc.zcu.cz (Robert Cimrman) Date: Wed, 07 Feb 2007 14:08:42 +0100 Subject: [Numpy-discussion] getting indices for array positions In-Reply-To: <200702071400.53315.meesters@uni-mainz.de> References: <200702071400.53315.meesters@uni-mainz.de> Message-ID: <45C9CF5A.7070208@ntc.zcu.cz> Christian Meesters wrote: > Hi > > This question might seem stupid, but I didn't come up with a clever solution myself, or find one in the archives, the cookbook, etc. If I overlooked something, please give a pointer. > > Well, if I have a 1D array like > [ 0. , 0.1, 0.2, 0.3, 0.4, 0.5] > and a scalar like 0.122, and want to retrieve the index position of the value closest to the scalar in the array: is there any fast method to get this? Try searchsorted. r. From meesters at uni-mainz.de Wed Feb 7 08:20:03 2007 From: meesters at uni-mainz.de (Christian Meesters) Date: Wed, 7 Feb 2007 14:20:03 +0100 Subject: [Numpy-discussion] getting indices for array positions In-Reply-To: <45C9CF5A.7070208@ntc.zcu.cz> References: <200702071400.53315.meesters@uni-mainz.de> <45C9CF5A.7070208@ntc.zcu.cz> Message-ID: <200702071420.04008.meesters@uni-mainz.de> > Try searchsorted. Thanks, but that doesn't work. Sorry if my question wasn't clear. To illustrate the requirement, for instance:

>>> a
array([ 0. ,  0.1,  0.2,  0.3,  0.4])
>>> # should be 1
...
>>> a.searchsorted(0.11)
2
>>> # should be 2
...
>>> a.searchsorted(0.16)
2

I could correct for one index position, of course, but I still have the requirement to get the index of the item with the closest value to the key. Since searchsorted returns the index of the first item in a that is >= or > the key, it can't make the distinction between 0.1 and 0.2 as I would like to have. Hope this clarifies my question.
Christian From cimrman3 at ntc.zcu.cz Wed Feb 7 08:32:08 2007 From: cimrman3 at ntc.zcu.cz (Robert Cimrman) Date: Wed, 07 Feb 2007 14:32:08 +0100 Subject: [Numpy-discussion] getting indices for array positions In-Reply-To: <200702071420.04008.meesters@uni-mainz.de> References: <200702071400.53315.meesters@uni-mainz.de> <45C9CF5A.7070208@ntc.zcu.cz> <200702071420.04008.meesters@uni-mainz.de> Message-ID: <45C9D4D8.3050602@ntc.zcu.cz> Christian Meesters wrote: >> Try searchsorted. > Thanks, but that doesn't work. Sorry if my question wasn't clear. > > To illustrate the requirement, for instance:
>
>>>> a
> array([ 0. ,  0.1,  0.2,  0.3,  0.4])
>>>> # should be 1
> ...
>>>> a.searchsorted(0.11)
> 2
>>>> # should be 2
> ...
>>>> a.searchsorted(0.16)
> 2
>
> I could correct for one index position, of course, but I still have the requirement to get the index of the item with the closest value to the key. Since searchsorted returns the index of the first item in a that is >= or > the key, it can't make the distinction between 0.1 and 0.2 as I would like to have.

I see. But it gives you the index (say 'ii') of the first item that is greater than your scalar - then you just have to compare your scalar with a[ii] and a[ii-1] and choose whichever is closer, no? r. From stefan at sun.ac.za Wed Feb 7 09:03:50 2007 From: stefan at sun.ac.za (Stefan van der Walt) Date: Wed, 7 Feb 2007 16:03:50 +0200 Subject: [Numpy-discussion] force column vector In-Reply-To: References: Message-ID: <20070207140350.GB6274@mentat.za.net> On Wed, Feb 07, 2007 at 10:35:14AM +0000, Christian wrote: > Hi, > > when creating an ndarray from a list, how can I force the result to be 2d *and* a column vector? So in case I pass a nested list, there will be no modification of the shape and when I pass a simple list, it will be converted to a 2d column vector. I can only think of a solution using 'if' clauses but I suppose there is a more elegant way. One way is to sub-class ndarray:

import numpy as N

class ColumnVectorArray(N.ndarray):
    def __new__(cls, data):
        data = N.asarray(data).view(cls)
        if len(data.shape) == 1:
            data.shape = (-1,1)
        return data

x = ColumnVectorArray([[1,2],[3,4],[5,6]])
print 'x ='
print x
print

y = ColumnVectorArray([1,2,3])
print 'y ='
print y
print

print 'x+y ='
print x+y

which yields:

x =
[[1 2]
 [3 4]
 [5 6]]

y =
[[1]
 [2]
 [3]]

x+y =
[[2 3]
 [5 6]
 [8 9]]

Cheers Stéfan From stefan at sun.ac.za Wed Feb 7 09:08:27 2007 From: stefan at sun.ac.za (Stefan van der Walt) Date: Wed, 7 Feb 2007 16:08:27 +0200 Subject: [Numpy-discussion] getting indices for array positions In-Reply-To: <200702071400.53315.meesters@uni-mainz.de> References: <200702071400.53315.meesters@uni-mainz.de> Message-ID: <20070207140827.GC6274@mentat.za.net> On Wed, Feb 07, 2007 at 02:00:52PM +0100, Christian Meesters wrote: > This question might seem stupid, but I didn't come up with a clever solution myself, or find one in the archives, the cookbook, etc. If I overlooked something, please give a pointer. > > Well, if I have a 1D array like > [ 0. , 0.1, 0.2, 0.3, 0.4, 0.5] > and a scalar like 0.122, and want to retrieve the index position of the value closest to the scalar in the array: is there any fast method to get this? If I understand correctly:

import numpy as N

val = 0.122
data = N.array([0., 0.1, 0.2, 0.3, 0.4, 0.5])
diff = N.abs(data - val)
print N.argmin(diff)    # -> 1

Regards Stéfan From joris at ster.kuleuven.be Wed Feb 7 09:11:53 2007 From: joris at ster.kuleuven.be (Joris De Ridder) Date: Wed, 7 Feb 2007 15:11:53 +0100 Subject: [Numpy-discussion] confused with apply_along_axis() Message-ID: <200702071511.53362.joris@ster.kuleuven.be> Hi, I'm confused by the output of apply_along_axis() in the following very simple example:

In [93]: a = arange(12.).reshape(2,2,3)

In [95]: a
Out[95]:
array([[[  0.,   1.,   2.],
        [  3.,   4.,   5.]],

       [[  6.,   7.,   8.],
        [  9.,  10.,  11.]]])

In [96]: def myfunc(b):
   ....:     print "slice:", b, " middle value:", b[1]
   ....:     return b[1]
   ....:

In [97]: apply_along_axis(myfunc,2,a)
slice: [ 0.  1.  2.]  middle value: 1.0
slice: [ 3.  4.  5.]  middle value: 4.0
slice: [ 6.  7.  8.]  middle value: 7.0
slice: [ 9.  10.  11.]  middle value: 10.0
Out[97]:
array([[  7.,   7.],
       [ 10.,  10.]])

I expected as output

array([[  1.,   4.],
       [  7.,  10.]])

Why is this not the case? How exactly does apply_along_axis() use the output of myfunc() to produce its result? Cheers, J. P.S. I'm working with Python 2.5 & Numpy 1.0 Disclaimer: http://www.kuleuven.be/cwis/email_disclaimer.htm From stefan at sun.ac.za Wed Feb 7 09:22:53 2007 From: stefan at sun.ac.za (Stefan van der Walt) Date: Wed, 7 Feb 2007 16:22:53 +0200 Subject: [Numpy-discussion] confused with apply_along_axis() In-Reply-To: <200702071511.53362.joris@ster.kuleuven.be> References: <200702071511.53362.joris@ster.kuleuven.be> Message-ID: <20070207142253.GD6274@mentat.za.net> Hi Joris On Wed, Feb 07, 2007 at 03:11:53PM +0100, Joris De Ridder wrote: > I expected as output > array([[ 1., 4.], > [ 7., 10.]]) That is the answer I get with numpy 1.0.2.dev3537 under Python 2.4. Cheers Stéfan From svetosch at gmx.net Wed Feb 7 10:11:01 2007 From: svetosch at gmx.net (Sven Schreiber) Date: Wed, 07 Feb 2007 16:11:01 +0100 Subject: [Numpy-discussion] force column vector In-Reply-To: References: Message-ID: <45C9EC05.8040607@gmx.net> Christian wrote: > Hi, > > when creating an ndarray from a list, how can I force the result to be 2d *and* a column vector? So in case I pass a nested list, there will be no modification of the shape and when I pass a simple list, it will be converted to a 2d column vector. I can only think of a solution using 'if' clauses but I suppose there is a more elegant way. > I'd be interested if you find a way without a single 'if'; I can't think of any, sorry. (which doesn't mean much, however... :-) Maybe your best bet is to make sure the lists are always nested at the source, if you have any control over that particular source. good luck, sven From joris at ster.kuleuven.be Wed Feb 7 10:19:41 2007 From: joris at ster.kuleuven.be (Joris De Ridder) Date: Wed, 7 Feb 2007 16:19:41 +0100 Subject: [Numpy-discussion] confused with apply_along_axis() In-Reply-To: <20070207142253.GD6274@mentat.za.net> References: <200702071511.53362.joris@ster.kuleuven.be> <20070207142253.GD6274@mentat.za.net> Message-ID: <200702071619.41532.joris@ster.kuleuven.be> On Wednesday 07 February 2007 15:22, Stefan van der Walt wrote: >On Wed, Feb 07, 2007 at 03:11:53PM +0100, Joris De Ridder wrote: >> I expected as output >> array([[ 1., 4.], >> [ 7., 10.]]) > >That is the answer I get with numpy 1.0.2.dev3537 under Python 2.4. Python 2.5 + numpy 1.0.2.dev3540 also solved the problem for me.
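For the record, the minimal form of the example now gives what I expected (session reconstructed from memory; the prompt numbers are arbitrary):

In [1]: import numpy

In [2]: a = numpy.arange(12.).reshape(2,2,3)

In [3]: numpy.apply_along_axis(lambda b: b[1], 2, a)
Out[3]:
array([[  1.,   4.],
       [  7.,  10.]])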
Thanks, Joris Disclaimer: http://www.kuleuven.be/cwis/email_disclaimer.htm From Chris.Barker at noaa.gov Wed Feb 7 11:32:36 2007 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Wed, 07 Feb 2007 08:32:36 -0800 Subject: [Numpy-discussion] force column vector In-Reply-To: References: Message-ID: <45C9FF24.7000005@noaa.gov> Christian wrote: > when creating an ndarray from a list, how can I force the result to be > 2d *and* a column vector? So in case I pass a nested list, there will be no > modification of the shape and when I pass a simple list, it will be > converted to a 2d column vector. I'm not sure I understand the specification of the problem. I would think that the definition of a column vector is that it's shape is: (-1,1) which makes it easy: def MakeColumn(input): a = asarray(input) a.shape = (-1,1) return a however, if you want: MakeColumn([[1,2],[3,4],[5,6]]) to return: array([[1, 2], [3, 4], [5, 6]]) that's not what I would call a column vector, and if that's what you want, then what would you want: MakeColumn([[1,2,3,4],[5,6,7,8]]) to return? -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From oliphant at ee.byu.edu Wed Feb 7 12:04:14 2007 From: oliphant at ee.byu.edu (Travis Oliphant) Date: Wed, 07 Feb 2007 10:04:14 -0700 Subject: [Numpy-discussion] bug in numpy.equal? In-Reply-To: References: Message-ID: <45CA068E.9040909@ee.byu.edu> Tom Denniston wrote: >The behavior below seems strange to me. The string array is type S3 >yet it says that comparison with 'abc' is not implemented. The == >operator seems to work though. Is there a subtlty I am missing or is >it simply a bug? > > > No bug. Ufuncs do not work with variable-sized arrays. However, we have implemented the equal operator using a different approach. -Travis From kwgoodman at gmail.com Wed Feb 7 12:15:38 2007 From: kwgoodman at gmail.com (Keith Goodman) Date: Wed, 7 Feb 2007 09:15:38 -0800 Subject: [Numpy-discussion] NaN, min, and max Message-ID: I keep running into this problem: >> import numpy.matlib as M >> x = M.rand(3,3) >> x[1,1] = M.nan >> x matrix([[ 0.94425407, 0.02216611, 0.999475 ], [ 0.40444129, nan, 0.23264341], [ 0.24202372, 0.05344269, 0.37967564]]) >> x.max() 0.379675636032 <---- Wrong (for me) >> x[1,1] = 0 >> x.max() 0.999474999444 <----- Beautiful! Look at all the tripple digits! How do I add nanmax as a method of the matrix class? How would I replace max with nanmax? Would I have to do that in every module of my package if I want to access numpy.matlib as M instead of packagename.M? From fperez.net at gmail.com Wed Feb 7 14:16:41 2007 From: fperez.net at gmail.com (Fernando Perez) Date: Wed, 7 Feb 2007 12:16:41 -0700 Subject: [Numpy-discussion] numpy.linalg.qr bug on 64-bit platforms Message-ID: Hi all, I recently got a report of a bug triggered only on 64-bit hardware, and on a machine (in case it's relevant) that runs python 2.5. 
This is with current numpy SVN which I just rebuilt a moment ago to triple-check: In [3]: a = numpy.array([[1.0,2],[3,4]]) In [4]: numpy.linalg.qr(a) ** On entry to DGEQRF parameter number 2 had an illegal value sage[~]> # dumped back at system prompt In case anyone has ideas, I added some more initial detail and guesses as to where the problem may be coming from here: http://projects.scipy.org/scipy/numpy/ticket/446 Unfortunately I don't know this part of the code well enough to come up with a quick fix myself. Cheers, f From tom.denniston at alum.dartmouth.org Wed Feb 7 14:16:53 2007 From: tom.denniston at alum.dartmouth.org (Tom Denniston) Date: Wed, 7 Feb 2007 13:16:53 -0600 Subject: [Numpy-discussion] bug in numpy.equal? In-Reply-To: <45CA068E.9040909@ee.byu.edu> References: <45CA068E.9040909@ee.byu.edu> Message-ID: got it. thanks. On 2/7/07, Travis Oliphant wrote: > Tom Denniston wrote: > > >The behavior below seems strange to me. The string array is type S3 > >yet it says that comparison with 'abc' is not implemented. The == > >operator seems to work though. Is there a subtlty I am missing or is > >it simply a bug? > > > > > > > > No bug. Ufuncs do not work with variable-sized arrays. However, we > have implemented the equal operator using a different approach. > > -Travis > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From oliphant at ee.byu.edu Wed Feb 7 16:36:39 2007 From: oliphant at ee.byu.edu (Travis Oliphant) Date: Wed, 07 Feb 2007 14:36:39 -0700 Subject: [Numpy-discussion] Please help with subclassing numpy.ndarray In-Reply-To: <2558.129.240.194.142.1170763597.squirrel@webmail.uio.no> References: <3db594f70702041722m36d3de67t2119b83ceca981a6@mail.gmail.com> <200702042033.01767.pgmdevlist@gmail.com> <2558.129.240.194.142.1170763597.squirrel@webmail.uio.no> Message-ID: <45CA4667.1000402@ee.byu.edu> Sturla Molden wrote: >>def __new__(cls,...) >> ... >> (H, edges) = numpy.histogramdd(..) >> cls.__defaultedges = edges >> >>def __array_finalize__(self, obj): >> if not hasattr(self, 'edges'): >> self.edges = self.__defaultedges >> >> > >So in order to get an instance attribute, one has to temporarily define it >as a class attribute? > No, you don't *have* to do it this way for all instance attributes. In this example, the user was trying to keep the edges computed during the __new__ method as an attribute. What are the possibilities? 1) Use the __new__ method to create the object in full and then store the edges in some kind of global (or class global) variable. This solution because it uses global variables has all of the thread problems global variables bring. 2) Create a "dummy" arrayobject in the __new__ method and fill it in (i.e. using setstate or resize) during the __init__ method where the instance attribute is actually set. The __array_finalize__ method is intended for "passing-on" attributes to sub-classes from parent classes during operations where __new__ and __init__ are not called (but a new instance is still created). It was not intended to be used in all circumstances. 
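For example, here is a minimal sketch of that "passing-on" behavior (the class and attribute names are made up):

import numpy

class InfoArray(numpy.ndarray):
    def __array_finalize__(self, obj):
        # runs for views, slices, and ufunc results too, where
        # __new__ and __init__ on the subclass are never called
        self.info = getattr(obj, 'info', None)

x = InfoArray((3,))
x.info = 'edges'
print (2 * x).info    # prints 'edges' -- carried through the multiplication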
-Travis From svetosch at gmx.net Wed Feb 7 17:07:11 2007 From: svetosch at gmx.net (Sven Schreiber) Date: Wed, 07 Feb 2007 23:07:11 +0100 Subject: [Numpy-discussion] force column vector In-Reply-To: <45C9FF24.7000005@noaa.gov> References: <45C9FF24.7000005@noaa.gov> Message-ID: <45CA4D8F.5040807@gmx.net> Christopher Barker schrieb: > Christian wrote: >> when creating an ndarray from a list, how can I force the result to be >> 2d *and* a column vector? So in case I pass a nested list, there will be no >> modification of the shape and when I pass a simple list, it will be >> converted to a 2d column vector. > > I'm not sure I understand the specification of the problem. I would > think that the definition of a column vector is that it's shape is: > > (-1,1) > So I think what's needed is: b = array(yourlist) b.reshape(b.shape[0], -1) Now it seems I finally understood this business with the -1 in the shapes... (well it's trivial if you have the book :-) -sven From kwgoodman at gmail.com Wed Feb 7 17:41:21 2007 From: kwgoodman at gmail.com (Keith Goodman) Date: Wed, 7 Feb 2007 14:41:21 -0800 Subject: [Numpy-discussion] force column vector In-Reply-To: <45CA4D8F.5040807@gmx.net> References: <45C9FF24.7000005@noaa.gov> <45CA4D8F.5040807@gmx.net> Message-ID: On 2/7/07, Sven Schreiber wrote: > Christopher Barker schrieb: > > Christian wrote: > >> when creating an ndarray from a list, how can I force the result to be > >> 2d *and* a column vector? So in case I pass a nested list, there will be no > >> modification of the shape and when I pass a simple list, it will be > >> converted to a 2d column vector. > > > > I'm not sure I understand the specification of the problem. I would > > think that the definition of a column vector is that it's shape is: > > > > (-1,1) > > > > So I think what's needed is: > > b = array(yourlist) > b.reshape(b.shape[0], -1) > > Now it seems I finally understood this business with the -1 in the > shapes... (well it's trivial if you have the book :-) I'd like to know what the -1 means. But first I'm trying to figure out why there are two reshapes? Do they behave identically? The doc strings make it look like they might not. >> x = M.rand(3,3) >> x.reshape? Type: builtin_function_or_method Base Class: String Form: Namespace: Interactive Docstring: a.reshape(d1, d2, ..., dn, order='c') Return a new array from this one. The new array must have the same number of elements as self. Also always returns a view or raises a ValueError if that is impossible.; >> M.reshape? Type: function Base Class: String Form: Namespace: Interactive File: /usr/local/lib/python2.4/site-packages/numpy/core/fromnumeric.py Definition: M.reshape(a, newshape, order='C') Docstring: Return an array that uses the data of the given array, but with a new shape. :Parameters: - `a` : array - `newshape` : shape tuple or int The new shape should be compatible with the original shape. If an integer, then the result will be a 1D array of that length. - `order` : 'C' or 'FORTRAN', optional (default='C') Whether the array data should be viewed as in C (row-major) order or FORTRAN (column-major) order. :Returns: - `reshaped_array` : array This will be a new view object if possible; otherwise, it will return a copy. :See also: numpy.ndarray.reshape() is the equivalent method. 
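To make my question concrete, this is the kind of experiment I'm poking at (a made-up session):

>> y = M.rand(4,3)
>> y.reshape(6, -1).shape    # what exactly is the -1 asking for?
(6, 2)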
From robert.kern at gmail.com Wed Feb 7 17:52:16 2007 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 07 Feb 2007 16:52:16 -0600 Subject: [Numpy-discussion] force column vector In-Reply-To: References: <45C9FF24.7000005@noaa.gov> <45CA4D8F.5040807@gmx.net> Message-ID: <45CA5820.5060100@gmail.com> Keith Goodman wrote: > I'd like to know what the -1 means. It means "fill in with whatever is necessary to make the size correct given the other specified dimensions". > But first I'm trying to figure out > why there are two reshapes? reshape() used to be simply a function, not a method. > Do they behave identically? The doc > strings make it look like they might not. The docstrings are different only because I updated the one and not, yet, the other. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From tom.denniston at alum.dartmouth.org Wed Feb 7 18:16:07 2007 From: tom.denniston at alum.dartmouth.org (Tom Denniston) Date: Wed, 7 Feb 2007 17:16:07 -0600 Subject: [Numpy-discussion] Bug in numpy user-define types mechanism causes segfault on multiple calls to a ufunc Message-ID: I am trying to register a custom type to numpy. When I do so it works and the ufuncs work but then when I invoke any ufunc twice the second time my python interpretter segfaults. I think i know what the problem is. In the select_types method in ufuncobject.c in numpy/core/src/ numpy gets a reference to the key for the loop via a call to PyInt_FromLong: key = PyInt_FromLong((long) userdef); Then it gets the actual loop via a call to PyDict_GetItem: obj = PyDict_GetItem(self->userloops, key); It later proceeds to do a decref on key: Py_DECREF(key); and later a decref on obj Py_DECREF(obj); None of this code actually runs unless you are doing an operation on a user defined type because it is all in the block with an if statement if (userdef > 0). The Py_DECREF on key is correct because it returns a new reference per python c api doc: PyObject* PyInt_FromLong( long ival) Return value: New reference. Create a new integer object with a value of ival. However the Py_DECREF on the obj looks incorrect to me because the PyDict_Getitem returns a borrowed reference and the numpy code doesn't increment the reference: PyObject* PyDict_GetItem( PyObject *p, PyObject *key) Return value: Borrowed reference. Return the object from dictionary p which has a key key. Return NULL if the key key is not present, but without setting an exception. So what seems to happen is the last reference gets decremented and the garbage collector frees up obj (the ufunc loop) on the first time through. The second time the through the ufunc loop is garbage memory and it segfaults. If I comment out the obj DECREF it works. I think one needs to either do that or add an INCREF right after retrieving key. I think either will work but the multithreading implications are different. I don't think it matters give that (I believe) numpy doesn't release the GIL but I thought someone on this list would be a better judge than I (maybe Travis) of what the correct fix should be. Am I correct in my analysis? In either case it should be a one or two line fix. --Tom From robert.kern at gmail.com Wed Feb 7 18:25:40 2007 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 07 Feb 2007 17:25:40 -0600 Subject: [Numpy-discussion] SciPy '07 ??? 
In-Reply-To: <45C26295.4060707@noaa.gov> References: <45C26295.4060707@noaa.gov> Message-ID: <45CA5FF4.3070304@gmail.com> Christopher Barker wrote: > Hi, > > Does anyone know if there will be a SciPy '07 conference, and if so, when? Yes, there will be one. We are currently speaking with the venue (Caltech in Pasadena, CA) to set the dates. Expect the conference to be either late August or perhaps earlyish in September. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From oliphant at ee.byu.edu Wed Feb 7 18:29:31 2007 From: oliphant at ee.byu.edu (Travis Oliphant) Date: Wed, 07 Feb 2007 16:29:31 -0700 Subject: [Numpy-discussion] Bug in numpy user-define types mechanism causes segfault on multiple calls to a ufunc In-Reply-To: References: Message-ID: <45CA60DB.1070703@ee.byu.edu> Tom Denniston wrote: >I am trying to register a custom type to numpy. When I do so it works >and the ufuncs work but then when I invoke any ufunc twice the second >time my python interpretter segfaults. I think i know what the >problem is. In the select_types method in ufuncobject.c in >numpy/core/src/ numpy gets a reference to the key for the loop via a > > Thank you very much for your review of this less-used code. >If I comment out the obj DECREF it works. I think one needs to either >do that or add an INCREF right after retrieving key. I think either >will work but the multithreading implications are different. I don't >think it matters give that (I believe) numpy doesn't release the GIL >but I thought someone on this list would be a better judge than I >(maybe Travis) of what the correct fix should be. > >Am I correct in my analysis? > > Your analysis seems spot on. I think removing the DECREF on obj is the right course of action. I've done it in SVN. -Travis From reggie at merfinllc.com Wed Feb 7 18:32:16 2007 From: reggie at merfinllc.com (Reggie Dugard) Date: Wed, 07 Feb 2007 15:32:16 -0800 Subject: [Numpy-discussion] Please help with subclassing numpy.ndarray In-Reply-To: <45CA4667.1000402@ee.byu.edu> References: <3db594f70702041722m36d3de67t2119b83ceca981a6@mail.gmail.com> <200702042033.01767.pgmdevlist@gmail.com> <2558.129.240.194.142.1170763597.squirrel@webmail.uio.no> <45CA4667.1000402@ee.byu.edu> Message-ID: <1170891136.28353.75.camel@fox> On Wed, 2007-02-07 at 14:36 -0700, Travis Oliphant wrote: > Sturla Molden wrote:
>
> >>def __new__(cls,...)
> >>    ...
> >>    (H, edges) = numpy.histogramdd(..)
> >>    cls.__defaultedges = edges
> >>
> >>def __array_finalize__(self, obj):
> >>    if not hasattr(self, 'edges'):
> >>        self.edges = self.__defaultedges
> >>
> >>
> >
> >So in order to get an instance attribute, one has to temporarily define it as a class attribute?
> >
> No, you don't *have* to do it this way for all instance attributes.
>
> In this example, the user was trying to keep the edges computed during the __new__ method as an attribute. What are the possibilities?
>
> 1) Use the __new__ method to create the object in full and then store the edges in some kind of global (or class global) variable.
>
> This solution because it uses global variables has all of the thread problems global variables bring.
>
> 2) Create a "dummy" arrayobject in the __new__ method and fill it in (i.e. using setstate or resize) during the __init__ method where the instance attribute is actually set.
> I'm probably missing something obvious here, but why can't you just attach the attribute to the actual object in the __new__ method before returning it. For example: class MyClass(numpy.ndarray): def __new__(self, ...): # Some stuff here H, edges = numpy.histogramdd(...) result = H.view(MyClass) result.edges = edges return result def __array_finalize__(self, obj): self.edges = getattr(obj, 'edges', []) If you could show me the error of my ways, it would help me in *my* attempt to subclass ndarray. > The __array_finalize__ method is intended for "passing-on" attributes to > sub-classes from parent classes during operations where __new__ and > __init__ are not called (but a new instance is still created). It was > not intended to be used in all circumstances. Thanks, -Reggie From tom.denniston at alum.dartmouth.org Wed Feb 7 18:38:41 2007 From: tom.denniston at alum.dartmouth.org (Tom Denniston) Date: Wed, 7 Feb 2007 17:38:41 -0600 Subject: [Numpy-discussion] Bug in numpy user-define types mechanism causes segfault on multiple calls to a ufunc In-Reply-To: <45CA60DB.1070703@ee.byu.edu> References: <45CA60DB.1070703@ee.byu.edu> Message-ID: Many thanks, Travis. I'll test the new version tonight. --Tom On 2/7/07, Travis Oliphant wrote: > Tom Denniston wrote: > > >I am trying to register a custom type to numpy. When I do so it works > >and the ufuncs work but then when I invoke any ufunc twice the second > >time my python interpretter segfaults. I think i know what the > >problem is. In the select_types method in ufuncobject.c in > >numpy/core/src/ numpy gets a reference to the key for the loop via a > > > > > Thank you very much for your review of this less-used code. > > >If I comment out the obj DECREF it works. I think one needs to either > >do that or add an INCREF right after retrieving key. I think either > >will work but the multithreading implications are different. I don't > >think it matters give that (I believe) numpy doesn't release the GIL > >but I thought someone on this list would be a better judge than I > >(maybe Travis) of what the correct fix should be. > > > >Am I correct in my analysis? > > > > > > Your analysis seems spot on. I think removing the is the right course > of action. I've done it in SVN. > > > > -Travis > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From oliphant at ee.byu.edu Wed Feb 7 18:41:47 2007 From: oliphant at ee.byu.edu (Travis Oliphant) Date: Wed, 07 Feb 2007 16:41:47 -0700 Subject: [Numpy-discussion] Please help with subclassing numpy.ndarray In-Reply-To: <1170891136.28353.75.camel@fox> References: <3db594f70702041722m36d3de67t2119b83ceca981a6@mail.gmail.com> <200702042033.01767.pgmdevlist@gmail.com> <2558.129.240.194.142.1170763597.squirrel@webmail.uio.no> <45CA4667.1000402@ee.byu.edu> <1170891136.28353.75.camel@fox> Message-ID: <45CA63BB.9050707@ee.byu.edu> Reggie Dugard wrote: >On Wed, 2007-02-07 at 14:36 -0700, Travis Oliphant wrote: > > >>Sturla Molden wrote: >> >> >> >>>>def __new__(cls,...) >>>> ... >>>> (H, edges) = numpy.histogramdd(..) >>>> cls.__defaultedges = edges >>>> >>>>def __array_finalize__(self, obj): >>>> if not hasattr(self, 'edges'): >>>> self.edges = self.__defaultedges >>>> >>>> >>>> >>>> >>>So in order to get an instance attribute, one has to temporarily define it >>>as a class attribute? >>> >>> >>> >>No, you don't *have* to do it this way for all instance attributes. 
>> >>In this example, the user was trying to keep the edges computed during >>the __new__ method as an attribute. What are the possibilities? >> >>1) Use the __new__ method to create the object in full and then store >>the edges in some kind of global (or class global) variable. >> >>This solution because it uses global variables has all of the thread >>problems global variables bring. >> >>2) Create a "dummy" arrayobject in the __new__ method and fill it in >>(i.e. using setstate or resize) during the __init__ method where the >>instance attribute is actually set. >> >> >> >I'm probably missing something obvious here, but why can't you just >attach the attribute to the actual object in the __new__ method before >returning it. For example: > > Good point. I guess I thought the OP had tried that already. It turns out it works fine, too. The __array_finalize__ is useful if you want the attribute to be carried around when arrays are created automatically internally (after math operations for example). -Travis From sturla at molden.no Wed Feb 7 18:50:11 2007 From: sturla at molden.no (Sturla Molden) Date: Thu, 8 Feb 2007 00:50:11 +0100 (CET) Subject: [Numpy-discussion] Please help with subclassing numpy.ndarray In-Reply-To: <45CA63BB.9050707@ee.byu.edu> References: <3db594f70702041722m36d3de67t2119b83ceca981a6@mail.gmail.com> <200702042033.01767.pgmdevlist@gmail.com> <2558.129.240.194.142.1170763597.squirrel@webmail.uio.no> <45CA4667.1000402@ee.byu.edu> <1170891136.28353.75.camel@fox> <45CA63BB.9050707@ee.byu.edu> Message-ID: <4790.89.8.42.137.1170892211.squirrel@webmail.uio.no> > Good point. I guess I thought the OP had tried that already. It turns > out it works fine, too. > > The __array_finalize__ is useful if you want the attribute to be carried > around when arrays are created automatically internally (after math > operations for example). I too may be missing something here. Will using __array_finalize__ this way be thread safe or not? Sturla Molden From Chris.Barker at noaa.gov Wed Feb 7 18:59:48 2007 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Wed, 07 Feb 2007 15:59:48 -0800 Subject: [Numpy-discussion] SciPy '07 ??? In-Reply-To: <45CA5FF4.3070304@gmail.com> References: <45C26295.4060707@noaa.gov> <45CA5FF4.3070304@gmail.com> Message-ID: <45CA67F4.9050603@noaa.gov> Robert Kern wrote: >> Does anyone know if there will be a SciPy '07 conference, and if so, when? > > Yes, there will be one. We are currently speaking with the venue (Caltech in > Pasadena, CA) to set the dates. Expect the conference to be either late August > or perhaps earlyish in September. Nice to know. The sooner we get dates, the better, my wife needs to schedule out summer vacation now! I've missed it for various reasons for the last 3 years, I hope I can do this one! -Chris -- Christopher Barker, Ph.D. 
Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From oliphant at ee.byu.edu Wed Feb 7 19:07:55 2007 From: oliphant at ee.byu.edu (Travis Oliphant) Date: Wed, 07 Feb 2007 17:07:55 -0700 Subject: [Numpy-discussion] Please help with subclassing numpy.ndarray In-Reply-To: <4790.89.8.42.137.1170892211.squirrel@webmail.uio.no> References: <3db594f70702041722m36d3de67t2119b83ceca981a6@mail.gmail.com> <200702042033.01767.pgmdevlist@gmail.com> <2558.129.240.194.142.1170763597.squirrel@webmail.uio.no> <45CA4667.1000402@ee.byu.edu> <1170891136.28353.75.camel@fox> <45CA63BB.9050707@ee.byu.edu> <4790.89.8.42.137.1170892211.squirrel@webmail.uio.no> Message-ID: <45CA69DB.2020304@ee.byu.edu> Sturla Molden wrote: >>Good point. I guess I thought the OP had tried that already. It turns out it works fine, too. >> >>The __array_finalize__ is useful if you want the attribute to be carried around when arrays are created automatically internally (after math operations for example). >> > >I too may be missing something here. > >Will using __array_finalize__ this way be thread safe or not? > Yes, because __array_finalize__ is called while NumPy owns the GIL. It is called in one place during array creation (in the C routine that all array-creation routines call). -Travis From ckkart at hoc.net Wed Feb 7 20:01:31 2007 From: ckkart at hoc.net (Christian) Date: Thu, 8 Feb 2007 01:01:31 +0000 (UTC) Subject: [Numpy-discussion] force column vector References: <45C9FF24.7000005@noaa.gov> Message-ID: Christopher Barker noaa.gov> writes: > I'm not sure I understand the specification of the problem. I would think that the definition of a column vector is that it's shape is: > > (-1,1) I was not aware of that possibility, although I own the book - shame on me. Thank you (and all others) for pointing that out. What I want is a 2d array regardless of whether the input is a simple or nested list. However, if it is a simple list I want the result to be a column vector rather than a row vector, which happens by default. Like Sven I'm wondering if there is a solution without using any 'if'. However, the subclassed array class which was proposed by Stefan is pretty elegant. Thanks to everybody, Christian From ckkart at hoc.net Wed Feb 7 20:03:02 2007 From: ckkart at hoc.net (Christian) Date: Thu, 8 Feb 2007 01:03:02 +0000 (UTC) Subject: [Numpy-discussion] getting indices for array positions References: <200702071400.53315.meesters@uni-mainz.de> <45C9CF5A.7070208@ntc.zcu.cz> <200702071420.04008.meesters@uni-mainz.de> Message-ID: Christian Meesters uni-mainz.de> writes: > Since searchsorted returns the index of the first item in a that is >= or > > the key, it can't make the distinction between 0.1 and 0.2 as I would like to Then how about a.searchsorted(val+0.5) Christian From ckkart at hoc.net Wed Feb 7 20:06:01 2007 From: ckkart at hoc.net (Christian) Date: Thu, 8 Feb 2007 01:06:01 +0000 (UTC) Subject: [Numpy-discussion] force column vector References: <45C9FF24.7000005@noaa.gov> <45CA4D8F.5040807@gmx.net> Message-ID: Sven Schreiber gmx.net> writes: > So I think what's needed is: > > b = array(yourlist) > b.reshape(b.shape[0], -1) Yes! That is it.
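So the whole thing boils down to a two-liner (a small sketch; 'as_column' is just my name for it, and note that reshape returns a new array rather than modifying b in place):

import numpy

def as_column(data):
    b = numpy.asarray(data)
    return b.reshape(b.shape[0], -1)

print as_column([1,2,3]).shape          # (3, 1) -- simple list becomes a column
print as_column([[1,2],[3,4]]).shape    # (2, 2) -- nested list is left alone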
Thanks, Christian From pgmdevlist at gmail.com Wed Feb 7 22:00:28 2007 From: pgmdevlist at gmail.com (Pierre GM) Date: Wed, 7 Feb 2007 22:00:28 -0500 Subject: [Numpy-discussion] Comparing x and x.view Message-ID: <200702072200.30228.pgmdevlist@gmail.com> All, I want to compare whether two arrays point to the same data. I've been using 'is' so far, but I'm wondering whether it's the right approach. If x is a plain ndarray, `x is x`, and `x is not x.view()`. I understand the second one (I think so...), `x` and `x.view `are two different Python objects. However, `x.__array_interface__ == x.view().__array_interface__`, which means that the underlying memory hasn't been modified at all, right ? In other terms, the data hasn't been copied, it's just being accessed slightly differently. So, when I'm using `x is y` to test whether some data has been copied, I should in fact compare the __array_interface__s, shouldn't I ? Sorry for the poor phrasing, and thanks a lot for your forthcoming inputs. P. From robert.kern at gmail.com Wed Feb 7 22:38:30 2007 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 07 Feb 2007 21:38:30 -0600 Subject: [Numpy-discussion] Comparing x and x.view In-Reply-To: <200702072200.30228.pgmdevlist@gmail.com> References: <200702072200.30228.pgmdevlist@gmail.com> Message-ID: <45CA9B36.3020408@gmail.com> Pierre GM wrote: > All, > > I want to compare whether two arrays point to the same data. > I've been using 'is' so far, but I'm wondering whether it's the right > approach. It isn't. Your analysis is correct. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From pgmdevlist at gmail.com Wed Feb 7 22:52:14 2007 From: pgmdevlist at gmail.com (Pierre GM) Date: Wed, 7 Feb 2007 22:52:14 -0500 Subject: [Numpy-discussion] Comparing x and x.view In-Reply-To: <45CA9B36.3020408@gmail.com> References: <200702072200.30228.pgmdevlist@gmail.com> <45CA9B36.3020408@gmail.com> Message-ID: <200702072252.15238.pgmdevlist@gmail.com> On Wednesday 07 February 2007 22:38:30 Robert Kern wrote: > Pierre GM wrote: > > All, > > > > I want to compare whether two arrays point to the same data. > > I've been using 'is' so far, but I'm wondering whether it's the right > > approach. > > It isn't. Your analysis is correct. So, there's no real point in using the Python 'id' function ? Do we need a shortcut to __array_interface__['data'] as id number ? From oliphant at ee.byu.edu Wed Feb 7 23:09:16 2007 From: oliphant at ee.byu.edu (Travis Oliphant) Date: Wed, 07 Feb 2007 21:09:16 -0700 Subject: [Numpy-discussion] Comparing x and x.view In-Reply-To: <200702072252.15238.pgmdevlist@gmail.com> References: <200702072200.30228.pgmdevlist@gmail.com> <45CA9B36.3020408@gmail.com> <200702072252.15238.pgmdevlist@gmail.com> Message-ID: <45CAA26C.5020509@ee.byu.edu> Pierre GM wrote: > On Wednesday 07 February 2007 22:38:30 Robert Kern wrote: > >> Pierre GM wrote: >> >>> All, >>> >>> I want to compare whether two arrays point to the same data. >>> I've been using 'is' so far, but I'm wondering whether it's the right >>> approach. >>> >> It isn't. Your analysis is correct. >> > > So, there's no real point in using the Python 'id' function ? Do we need a > shortcut to __array_interface__['data'] as id number ? > You have it (sort of a short-cut). .ctypes.data This works even if ctypes is not installed. 
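For example (a quick check):

>>> import numpy
>>> x = numpy.arange(10)
>>> x.ctypes.data == x.view().ctypes.data
True
>>> x.ctypes.data == x.copy().ctypes.data
False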
-Travis From robert.kern at gmail.com Wed Feb 7 23:05:46 2007 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 07 Feb 2007 22:05:46 -0600 Subject: [Numpy-discussion] Comparing x and x.view In-Reply-To: <200702072252.15238.pgmdevlist@gmail.com> References: <200702072200.30228.pgmdevlist@gmail.com> <45CA9B36.3020408@gmail.com> <200702072252.15238.pgmdevlist@gmail.com> Message-ID: <45CAA19A.1010205@gmail.com> Pierre GM wrote: > On Wednesday 07 February 2007 22:38:30 Robert Kern wrote: >> Pierre GM wrote: >>> All, >>> >>> I want to compare whether two arrays point to the same data. >>> I've been using 'is' so far, but I'm wondering whether it's the right >>> approach. >> It isn't. Your analysis is correct. > > So, there's no real point in using the Python 'id' function ? Not for this purpose, no. > Do we need a > shortcut to __array_interface__['data'] as id number ? I'm not sure that would be useful. Different arrays may share the same starting pointer but have different strides, and arrays may also have different starting pointers but share some overlapping data. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From charlesr.harris at gmail.com Wed Feb 7 23:46:59 2007 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 7 Feb 2007 21:46:59 -0700 Subject: [Numpy-discussion] force column vector In-Reply-To: References: <45C9FF24.7000005@noaa.gov> <45CA4D8F.5040807@gmx.net> Message-ID: On 2/7/07, Christian wrote: > > Sven Schreiber gmx.net> writes: > > So I think what's needed is: > > > > b = array(yourlist) > > b.reshape(b.shape[0], -1) Row vectors are easy to get. In [1]: asmatrix([1,2,3,4]) Out[1]: matrix([[1, 2, 3, 4]]) And nested lists work, but you will be stuck with using matrices. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From ckkart at hoc.net Thu Feb 8 00:21:08 2007 From: ckkart at hoc.net (Christian) Date: Thu, 8 Feb 2007 05:21:08 +0000 (UTC) Subject: [Numpy-discussion] NaN, min, and max References: Message-ID: Keith Goodman gmail.com> writes: > matrix([[ 0.94425407, 0.02216611, 0.999475 ], > [ 0.40444129, nan, 0.23264341], > [ 0.24202372, 0.05344269, 0.37967564]]) > >> x.max() > 0.379675636032 <---- Wrong (for me) > >> x[1,1] = 0 > >> x.max() > 0.999474999444 <----- Beautiful! Look at all the tripple digits! Works as expected with python2.4/numpy1.0 Christian From meesters at uni-mainz.de Thu Feb 8 07:54:57 2007 From: meesters at uni-mainz.de (Christian Meesters) Date: Thu, 8 Feb 2007 13:54:57 +0100 Subject: [Numpy-discussion] getting indices for array positions In-Reply-To: References: <200702071400.53315.meesters@uni-mainz.de> <200702071420.04008.meesters@uni-mainz.de> Message-ID: <200702081354.57887.meesters@uni-mainz.de> Hi Thanks for all your suggestions. Christian From Chris.Barker at noaa.gov Thu Feb 8 13:43:55 2007 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Thu, 08 Feb 2007 10:43:55 -0800 Subject: [Numpy-discussion] Comparing x and x.view In-Reply-To: <200702072200.30228.pgmdevlist@gmail.com> References: <200702072200.30228.pgmdevlist@gmail.com> Message-ID: <45CB6F6B.7040204@noaa.gov> Pierre GM wrote: > I want to compare whether two arrays point to the same data. 
Travis just posted a note about a couple utility functions that may help: Travis Oliphant wrote: > In SVN there is a new function may_share_memory(a,b) which will return > True if the memory foot-print of the two arrays over-lap. > > >>> may_share_memory(a, flipud(a)) > True > > This is based on another utility function byte_bounds that returns the > byte-boundaries of any object exporting the Python side of the array > interface. > > Perhaps these utilities will help (I know they can be used to make the > who function a bit more intelligent about how many bytes are being used). > > -Travis -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From mclay at l2sg.com Thu Feb 8 13:42:42 2007 From: mclay at l2sg.com (Michael McLay) Date: Thu, 8 Feb 2007 13:42:42 -0500 Subject: [Numpy-discussion] Latest Array-Interface PEP In-Reply-To: <459D9D9C.30501@ee.byu.edu> References: <459D6964.30701@ee.byu.edu> <459D9590.80202@noaa.gov> <459D9D9C.30501@ee.byu.edu> Message-ID: <200702081342.42565.mclay@l2sg.com> On Thursday 04 January 2007 19:36, Travis Oliphant wrote: > Christopher Barker wrote: > > eople like: > > > > wxPython -- Robin Dunn > > PIL -- Fredrik Lundh > > PyOpenGL -- Who? > > PyObjC -- would it be useful there? (Ronald Oussoren) > > MatplotLib (but maybe it's already married to numpy...) > > PyGtk ? > > It's a good start, but their is also > > PyMedia, PyVoxel, any video-library interface writers, any audo-library > interface writers. > > Anybody who wants to wrap/write code that does some kind of manipulation > on a chunk of data of a specific data-format. > > There are so many people who would use it that I don't feel qualified to > speak for them all. Two more places to look for projects that may be interested: SQL wrappers, such as Psycopg2, and the Python DB API 2.0 community QuantLib (see the message below from the enthought-dev mailing list.) On Saturday 03 February 2007 00:23, Prabhu Ramachandran wrote: > >>>>> "Joseph" == Joseph Wang writes: > Joseph> 3) I'd like to make the impedance mismatch between things > Joseph> like QuantLib arrays and numpy arrays as seemless as > Joseph> possible. Any words of wisdom on how to do this? > > I don't know anything about QuantLib arrays so will shoot in the dark > here. If your arrays can take or provide a block of contiguous memory > that is interpreted according to a particular data type then I think > it is easily possible to get these two array types talking to each > other. However doing this right and optimally will take a bit of > effort (I've done this for tvtk which lets VTK and numpy arrays talk > to each other seamlessly). I think there is a fair bit of > documentation on the scipy wiki on the various ways to do this right. > The scipy-user list is a good place to ask for pointers since the > folks who actually develop numpy and do all sorts of things with numpy > arrays are on that list and will be happy to answer questions. 
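A quick illustration of the may_share_memory function quoted above, assuming the SVN version Travis describes:

>>> import numpy as N
>>> a = N.arange(10)
>>> N.may_share_memory(a, a[::2])
True
>>> N.may_share_memory(a, a.copy())
False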
From pgmdevlist at gmail.com Thu Feb 8 14:51:12 2007 From: pgmdevlist at gmail.com (Pierre GM) Date: Thu, 8 Feb 2007 14:51:12 -0500 Subject: [Numpy-discussion] Comparing x and x.view In-Reply-To: <45CAA26C.5020509@ee.byu.edu> References: <200702072200.30228.pgmdevlist@gmail.com> <200702072252.15238.pgmdevlist@gmail.com> <45CAA26C.5020509@ee.byu.edu> Message-ID: <200702081451.14506.pgmdevlist@gmail.com> On Wednesday 07 February 2007 23:09:16 Travis Oliphant wrote: > > So, there's no real point in using the Python 'id' function ? Do we need > > a shortcut to __array_interface__['data'] as id number ? > > You have it (sort of a short-cut). > > .ctypes.data OK, great, thanks a lot On Wednesday 07 February 2007 23:05:46 Robert Kern wrote: > > Do we need a > > shortcut to __array_interface__['data'] as id number ? > > I'm not sure that would be useful. Different arrays may share the same > starting pointer but have different strides, and arrays may also have > different starting pointers but share some overlapping data. So, in the first case, I can still compare the __array_interface__s, right? If they're equal, then the two arrays are strictly equivalent. In the second case, even if the arrays share some overlapping data, they're intrinsically different, so a test on whether the data has been copied/moved around would fail anyway. Well, thank y'all again. From humufr at yahoo.fr Thu Feb 8 16:21:09 2007 From: humufr at yahoo.fr (humufr at yahoo.fr) Date: Thu, 8 Feb 2007 16:21:09 -0500 Subject: [Numpy-discussion] bug or feature? Message-ID: <200702081621.09495.humufr@yahoo.fr> I have a big problem with numpy, numarray and Numeric (all versions). If I run the script at the bottom, I obtain these results:

var1 before function is  [3 4 5]
var2 before function is  1
var1 after function must be [3 4 5] is  [ 9 12 15]   <------ problem
var2 after function must be 1 is  1
var3 must be the [9 12 15] is  [ 9 12 15]
var4 must be the 'toto' is  toto

I'm very surprised by the line noted. I always thought that passing a variable into a function didn't change the variable itself outside the function. That is the behaviour for var2, but var1 is changed, and it's a big problem (at least for me). The only objects in Python with this behavior are the numeric objects (Numeric, numarray or numpy); with a list or another kind of object I get the expected result (var1 keeps its value from before going into the function). I can't keep the input variable unless I make a copy before calling the function... Is this normal, and do I have to make a copy of the input data each time I call a function? Thanks Nicolas

#!/usr/bin/env python
import numpy
print "numpy version ", numpy.__version__

def test(var1, var2):
    #print "var1 input function is", var1
    #print "var2 input function is", var2
    var1 *= 3
    var2 = 'toto'
    return var1, var2

var1 = numpy.array([3,4,5])
var2 = 1
print "var1 before function is ", var1
print "var2 before function is ", var2
var3, var4 = test(var1, var2)
print "var1 after function must be [3 4 5] is ", var1
print "var2 after function must be 1 is ", var2
print "var3 must be the [9 12 15] is ", var3
print "var4 must be the 'toto' is ", var4

From pgmdevlist at gmail.com Thu Feb 8 16:42:34 2007 From: pgmdevlist at gmail.com (Pierre GM) Date: Thu, 8 Feb 2007 16:42:34 -0500 Subject: [Numpy-discussion] bug or feature?
In-Reply-To: <200702081621.09495.humufr@yahoo.fr> References: <200702081621.09495.humufr@yahoo.fr> Message-ID: <200702081642.35801.pgmdevlist@gmail.com> On Thursday 08 February 2007 16:21:09 humufr at yahoo.fr wrote: > > I'm very surprised by the line noted. I always thought that passing a > variable into a function didn't change the variable itself outside the function. Except that in this particular case, you explicitly change the input array itself by using an in-place operator. > I can't keep the input variable unless I make a copy before calling the > function... What about just using var1 = var1*3 ? That way, the initial array is left intact, and var1 inside your function is multiplied by 3. From oliphant at ee.byu.edu Thu Feb 8 17:01:36 2007 From: oliphant at ee.byu.edu (Travis Oliphant) Date: Thu, 08 Feb 2007 15:01:36 -0700 Subject: [Numpy-discussion] bug or feature? In-Reply-To: <200702081621.09495.humufr@yahoo.fr> References: <200702081621.09495.humufr@yahoo.fr> Message-ID: <45CB9DC0.3090206@ee.byu.edu> humufr at yahoo.fr wrote: >I have a big problem with numpy, numarray and Numeric (all versions). >If I run the script at the bottom, I obtain these results: >
>var1 before function is  [3 4 5]
>var2 before function is  1
>var1 after function must be [3 4 5] is  [ 9 12 15]   <------ problem
>var2 after function must be 1 is  1
>var3 must be the [9 12 15] is  [ 9 12 15]
>var4 must be the 'toto' is  toto
>
>I'm very surprised by the line noted. I always thought that passing a >variable into a function didn't change the variable itself outside the function. > To save yourself confusion, you need to understand the difference between mutable and immutable types. Mutable types can be changed inside of a function call. You also need to understand that = is a "name-binding operation" only; it does not change objects. >Is this normal, and do I have to make a copy of the input data each time I >call a function? > > Yes, it's very normal, if your function does an "in-place" operation on a mutable type. Consider the following code:

def test(var1, var2):
    var1[0] *= 3   # this accesses the 0'th element of var1 and alters it
    var2 = 'toto'  # this makes a new object and names it with var2;
                   # whatever was passed in is gone
    return var1, var2

test([1,2,3], [1,2])

will return

[3, 2, 3], 'toto'

-Travis From humufr at yahoo.fr Thu Feb 8 17:15:08 2007 From: humufr at yahoo.fr (humufr at yahoo.fr) Date: Thu, 8 Feb 2007 17:15:08 -0500 Subject: [Numpy-discussion] bug or feature? In-Reply-To: <45CB9DC0.3090206@ee.byu.edu> References: <200702081621.09495.humufr@yahoo.fr> <45CB9DC0.3090206@ee.byu.edu> Message-ID: <200702081715.08431.humufr@yahoo.fr> Thank you to both of you for this explanation. I'm coming from the Fortran world and so I never had to deal with this before... Sorry to have polluted the list with a stupid question. Thanks again, that clarifies the problem. Nicolas On Thursday 08 February 2007 17:01:36, Travis Oliphant wrote: > humufr at yahoo.fr wrote: > >I have a big problem with numpy, numarray and Numeric (all versions). > >If I run the script at the bottom, I obtain these results: > >
> >var1 before function is  [3 4 5]
> >var2 before function is  1
> >var1 after function must be [3 4 5] is  [ 9 12 15]   <------ problem
> >var2 after function must be 1 is  1
> >var3 must be the [9 12 15] is  [ 9 12 15]
> >var4 must be the 'toto' is  toto
>
> To save yourself confusion, you need to understand the difference between mutable and immutable types. Mutable types can be changed inside of a function call.
>
> You also need to understand that = is a "name-binding operation" only; it does not change objects.
>
> >Is this normal, and do I have to make a copy of the input data each time I >call a function?
>
> Yes, it's very normal, if your function does an "in-place" operation on a mutable type.
>
> Consider the following code:
>
> def test(var1, var2):
>     var1[0] *= 3   # this accesses the 0'th element of var1 and alters it
>     var2 = 'toto'  # this makes a new object and names it with var2;
>                    # whatever was passed in is gone
>     return var1, var2
>
> test([1,2,3], [1,2])
>
> will return
>
> [3, 2, 3], 'toto'
>
> -Travis

From robert.kern at gmail.com Thu Feb 8 17:28:39 2007 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 08 Feb 2007 14:28:39 -0800 Subject: [Numpy-discussion] bug or feature? In-Reply-To: <200702081715.08431.humufr@yahoo.fr> References: <200702081621.09495.humufr@yahoo.fr> <45CB9DC0.3090206@ee.byu.edu> <200702081715.08431.humufr@yahoo.fr> Message-ID: <45CBA417.1020301@gmail.com> humufr at yahoo.fr wrote: > Thank you to both of you for this explanation. I'm coming from the Fortran > world and so I never had to deal with this before... > > Sorry to have polluted the list with a stupid question. No need to apologize. All programming systems take some getting-used-to at first before they become natural. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From drife at ucar.edu Fri Feb 9 09:31:47 2007 From: drife at ucar.edu (Daran L. Rife) Date: Fri, 9 Feb 2007 07:31:47 -0700 (MST) Subject: [Numpy-discussion] Request for porting pycdf to NumPy Message-ID: <36897.24.56.171.189.1171031507.squirrel@imap.rap.ucar.edu> Hi Travis, If you're still offering NumPy "patches" to third party packages that rely upon Numeric, I would really like for pycdf to be ported to NumPy. This would allow me to completely transition to NumPy. Thanks very much for considering my request. Daran Rife From oliphant at ee.byu.edu Fri Feb 9 12:15:20 2007 From: oliphant at ee.byu.edu (Travis Oliphant) Date: Fri, 09 Feb 2007 10:15:20 -0700 Subject: [Numpy-discussion] Request for porting pycdf to NumPy In-Reply-To: <36897.24.56.171.189.1171031507.squirrel@imap.rap.ucar.edu> References: <36897.24.56.171.189.1171031507.squirrel@imap.rap.ucar.edu> Message-ID: <45CCAC28.3060006@ee.byu.edu> Daran L. Rife wrote: >Hi Travis, > >If you're still offering NumPy "patches" to third party >packages that rely upon Numeric, I would really like for >pycdf to be ported to NumPy. This would allow me to >completely transition to NumPy. > >Thanks very much for considering my request. > > Is pycdf the same as pynetcdf? Where is it located? There is a netcdf in the sandbox of SciPy that works with NumPy. We should get it moved over into the main SciPy tree.
Thanks, -Travis From otto at tronarp.se Sat Feb 3 11:39:11 2007 From: otto at tronarp.se (Otto Tronarp) Date: Sat, 03 Feb 2007 17:39:11 +0100 Subject: [Numpy-discussion] numarray argmax problem Message-ID: <20070203173911.ax2xhqvu8sk84g44@mathcore.kicks-ass.org> Hi, I have a problem with numarray.argmax, the following code import numarray as N import sys c = N.zeros((10, ), N.Float64) while 1: print 'B: ', sys.getrefcount(None) l = N.argmax(c) print 'A: ', sys.getrefcount(None) print Dies with: Fatal Python error: deallocating None Aborted I'm using numarray 1.5.2 Any chance that there is an easy fix for this? I know I should consider switching to numpy, but I don't have the time right now to do that. Otto From a.schmolck at gmx.net Wed Feb 7 08:08:01 2007 From: a.schmolck at gmx.net (Alexander Schmolck) Date: 07 Feb 2007 13:08:01 +0000 Subject: [Numpy-discussion] force column vector In-Reply-To: References: Message-ID: Christian writes: > Hi, > > when creating an ndarray from a list, how can I force the result to be > 2d *and* a column vector? So in case I pass a nested list, there will be no > modification of the shape and when I pass a simple list, it will be > converted to a 2d column vector. I can only think of a solution using 'if' > clauses but I suppose there is a more elegant way. This will always return a column vector, but I'm not sure from you description if that's what you want (do you want [[1,2,3]] etc. to come out as a row-vector?) In [8]: ravel(array([1,2,3]))[:,newaxis] Out[8]: array([[1], [2], [3]]) 'as From robert.kern at gmail.com Fri Feb 9 12:17:41 2007 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 09 Feb 2007 09:17:41 -0800 Subject: [Numpy-discussion] Request for porting pycdf to NumPy In-Reply-To: <45CCAC28.3060006@ee.byu.edu> References: <36897.24.56.171.189.1171031507.squirrel@imap.rap.ucar.edu> <45CCAC28.3060006@ee.byu.edu> Message-ID: <45CCACB5.2030206@gmail.com> Travis Oliphant wrote: > Is pycdf the same as pynetcdf? Where is it located? I presume it's this one: http://pysclint.sourceforge.net/pycdf/ -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From Chris.Barker at noaa.gov Fri Feb 9 12:24:03 2007 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Fri, 09 Feb 2007 09:24:03 -0800 Subject: [Numpy-discussion] bug or feature? In-Reply-To: <200702081621.09495.humufr@yahoo.fr> References: <200702081621.09495.humufr@yahoo.fr> Message-ID: <45CCAE33.5040701@noaa.gov> Your question has been answered, but I think a few comments are in order: 1) Read this, everyone new to Python should: http://python.net/crew/mwh/hacks/objectthink.html 2) > The only object in python with this behavior are the numeric object (Numeric, numarray or numpy) Not the case (see the above reference). Num* arrays are different than the standard objects in that slicing returns views on data of the array, but all mutable types will behave the same way in your code: >>> def test(l): ... l *= 3 ... >>> print l [3, 3, 3] >>> test(l) >>> print l [3, 3, 3, 3, 3, 3, 3, 3, 3] As Travis said, in some languages, the question is "copy or reference?" (and Fortran is always reference, IIRC), in Python, the question is "mutable or immutable?". Immutable objects can not be effected when passed into function, mutable objects can. 
Using the same test function above:

>>> i = 5
>>> test(i)
>>> i
5

Integers are immutable, so i was not changed (more accurately, the object bound to i was not changed).

Compounding the confusion is this: the *= etc. operators mean "mutate the object in place if possible, otherwise return a new object". This is confusing, as it means that some objects (ndarrays, lists) will get changed by a function like that, and others (ints, floats, tuples) will not.

Personally, I would be happier if +=, etc. meant "mutate the object in place if possible, otherwise raise an exception", but then you couldn't do

i = 0
i += 1

which is probably the most common use. I think the mistake arose from trying to solve two totally different problems at once: numpy users and the like wanted a clean and compact way to mutate in place. Lots of others (particularly those coming from the C family of languages) wanted a quick and easy notation for "increment this value". Since Python numbers are not mutable, these CAN'T be the same thing, so using the same notation for both causes confusion.

Of course, you can now use numpy rank-zero arrays almost like mutable numbers:

>>> s = N.array(5)
>>> test(s)
>>> s
array(15)

By the way, there are times when I think mutable scalars would be handy. Rank-zero arrays almost fit the bill, but I can't see a way to directly re-set their value, and they don't seem to take precedence in operations with other numbers:

>>> s = N.array(5)
>>> type(s)
<type 'numpy.ndarray'>
>>> s2 = 4 * s
>>> type(s2)   # so I got a numpy scalar back instead of a rank-zero array
<type 'numpy.int32'>

However:

>>> s = N.array((5,6))
>>> type(s)
<type 'numpy.ndarray'>
>>> s2 = 4 * s
>>> type(s2)
<type 'numpy.ndarray'>

This stayed an array, as, of course, it would have to!

I know the whole "how the heck should a rank-zero array behave?" question has been discussed a lot, but I still wonder if this is how it should be. Maybe having a whole new object that is explicitly defined as a mutable scalar would be the way to do it. It would probably have less overhead than a rank-zero array as well.

-Chris

-- 
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R (206) 526-6959 voice
7600 Sand Point Way NE (206) 526-6329 fax
Seattle, WA 98115 (206) 526-6317 main reception

Chris.Barker at noaa.gov

From Chris.Barker at noaa.gov Fri Feb 9 12:33:24 2007
From: Chris.Barker at noaa.gov (Christopher Barker)
Date: Fri, 09 Feb 2007 09:33:24 -0800
Subject: [Numpy-discussion] Request for porting pycdf to NumPy
In-Reply-To: <45CCACB5.2030206@gmail.com>
References: <36897.24.56.171.189.1171031507.squirrel@imap.rap.ucar.edu> <45CCAC28.3060006@ee.byu.edu> <45CCACB5.2030206@gmail.com>
Message-ID: <45CCB064.4010706@noaa.gov>

hmmm, this page from unidata:

http://www.unidata.ucar.edu/software/netcdf/software.html#Python

indicates 4-6 different Python netcdf interfaces!

Has anyone done a review of these to see where effort would best be put into upgrading? I've used Konrad Hinsen's package a little bit with success, and I think he's upgraded all of Scientific Python to support numpy now, but I know nothing of the others.

I think it would be ideal if Unidata could be persuaded to adopt and maintain one as the official version, to save this extra work and confusion. Maybe there is some hope of this for netcdf4.

-Chris

-- 
Christopher Barker, Ph.D.
Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From jmiller at stsci.edu Fri Feb 9 13:02:08 2007 From: jmiller at stsci.edu (Todd Miller) Date: Fri, 09 Feb 2007 13:02:08 -0500 Subject: [Numpy-discussion] numarray argmax problem In-Reply-To: <20070203173911.ax2xhqvu8sk84g44@mathcore.kicks-ass.org> References: <20070203173911.ax2xhqvu8sk84g44@mathcore.kicks-ass.org> Message-ID: <45CCB720.70705@stsci.edu> Otto Tronarp wrote: > Hi, > > I have a problem with numarray.argmax, the following code > > import numarray as N > import sys > c = N.zeros((10, ), N.Float64) > while 1: > print 'B: ', sys.getrefcount(None) > l = N.argmax(c) > print 'A: ', sys.getrefcount(None) > print > > Dies with: > Fatal Python error: deallocating None > Aborted > > I'm using numarray 1.5.2 > > Any chance that there is an easy fix for this? I know I should > If you're willing to check out numarray and build it from source yourself, this problem is fixed in numarray CVS here: http://sourceforge.net/cvs/?group_id=1369 You can check it out like this: cvs -d:pserver:anonymous at numpy.cvs.sourceforge.net:/cvsroot/numpy login cvs -z3 -d:pserver:anonymous at numpy.cvs.sourceforge.net:/cvsroot/numpy co -P /numarray / Regards, Todd > consider switching to numpy, but I don't have the time right now to do > that. > > Otto > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From wfspotz at sandia.gov Fri Feb 9 13:12:42 2007 From: wfspotz at sandia.gov (Bill Spotz) Date: Fri, 9 Feb 2007 11:12:42 -0700 Subject: [Numpy-discussion] Request for porting pycdf to NumPy In-Reply-To: <45CCB064.4010706@noaa.gov> References: <36897.24.56.171.189.1171031507.squirrel@imap.rap.ucar.edu> <45CCAC28.3060006@ee.byu.edu> <45CCACB5.2030206@gmail.com> <45CCB064.4010706@noaa.gov> Message-ID: <981E18ED-8CE6-49F7-A583-EF57B8F97163@sandia.gov> I believe the person most qualified to answer that question would be Mary Haley, (haley at ucar.edu). On Feb 9, 2007, at 10:33 AM, Christopher Barker wrote: > Has anyone done a review of these to see where effort would best be > put > into upgrading? ** Bill Spotz ** ** Sandia National Laboratories Voice: (505)845-0170 ** ** P.O. Box 5800 Fax: (505)284-5451 ** ** Albuquerque, NM 87185-0370 Email: wfspotz at sandia.gov ** From drife at ucar.edu Fri Feb 9 13:30:45 2007 From: drife at ucar.edu (Daran Rife) Date: Fri, 9 Feb 2007 11:30:45 -0700 Subject: [Numpy-discussion] Request for porting pycdf to NumPy Message-ID: Yep, I was referring to: http://pysclint.sourceforge.net/pycdf/ Regarding the issue of deciding which netCDF interface to adopt as the "standard", although it is unlikely we'll ever get consensus on this, I have tried several of the netCDF interfaces, including the one in Scientific, and have found that pycdf is the "cleanest"; it is very easy to use and understand. Daran -- Travis Oliphant wrote: > Is pycdf the same as pynetcdf? Where is it located? > I presume it's this one: http://pysclint.sourceforge.net/pycdf/ From efiring at hawaii.edu Fri Feb 9 14:19:37 2007 From: efiring at hawaii.edu (Eric Firing) Date: Fri, 09 Feb 2007 09:19:37 -1000 Subject: [Numpy-discussion] Request for porting pycdf to NumPy In-Reply-To: References: Message-ID: <45CCC949.9050408@hawaii.edu> I have been using Jeff Whitaker's netcdf4 interface with good results. 
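For anyone curious, reading a file with it looks roughly like this (a from-memory sketch, not tested here; the file and variable names are made up):

>>> from netCDF4 import Dataset
>>> nc = Dataset('ocean.nc')       # open an existing file for reading
>>> print nc.variables.keys()      # names of the variables in the file
>>> sst = nc.variables['sst'][:]   # slicing a variable returns a numpy array
>>> nc.close()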
I could not find the web page for it on a NOAA site--I think NOAA is reorganizing--but a search turned it up here. Maybe Jeff can provide a better link. http://netcdf4-python.googlecode.com/svn/trunk/docs/netCDF4-module.html Eric Daran Rife wrote: > Yep, I was referring to: http://pysclint.sourceforge.net/pycdf/ > > Regarding the issue of deciding which netCDF interface to adopt > as the "standard", although it is unlikely we'll ever get consensus > on this, I have tried several of the netCDF interfaces, including > the one in Scientific, and have found that pycdf is the "cleanest"; > it is very easy to use and understand. > > > Daran > > -- > > Travis Oliphant wrote: > > >> Is pycdf the same as pynetcdf? Where is it located? >> > > I presume it's this one: > > http://pysclint.sourceforge.net/pycdf/ > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From mithrandir42 at web.de Fri Feb 9 14:22:49 2007 From: mithrandir42 at web.de (N. Volbers) Date: Fri, 09 Feb 2007 20:22:49 +0100 Subject: [Numpy-discussion] Request for porting pycdf to NumPy In-Reply-To: References: Message-ID: <45CCCA09.80603@web.de> Daran Rife schrieb: > Yep, I was referring to: http://pysclint.sourceforge.net/pycdf/ > > Regarding the issue of deciding which netCDF interface to adopt > as the "standard", although it is unlikely we'll ever get consensus > on this, I have tried several of the netCDF interfaces, including > the one in Scientific, and have found that pycdf is the "cleanest"; > it is very easy to use and understand. > > +1 for this one for the same reason. I have tried two or three different ones and pycdf was by far the easiest to use. Niklas Volbers. From jswhit at fastmail.fm Fri Feb 9 14:31:31 2007 From: jswhit at fastmail.fm (Jeff Whitaker) Date: Fri, 09 Feb 2007 12:31:31 -0700 Subject: [Numpy-discussion] Request for porting pycdf to NumPy In-Reply-To: <45CCC949.9050408@hawaii.edu> References: <45CCC949.9050408@hawaii.edu> Message-ID: <45CCCC13.8000708@fastmail.fm> Eric Firing wrote: > I have been using Jeff Whitaker's netcdf4 interface with good results. > > I could not find the web page for it on a NOAA site--I think NOAA is > reorganizing--but a search turned it up here. Maybe Jeff can provide > a better link. > > http://netcdf4-python.googlecode.com/svn/trunk/docs/netCDF4-module.html > > Eric > Eric: Yep, that's a link to the docs from the google code homepage http://code.google.com/p/netcdf4-python/ AFAIK, this is the only one of the python interfaces that supports both the netcdf version 3 API and the new (still in alpha) version 4 API, which is built on top of HDF5. As far as getting unidata to support or bless an 'official' python interface, that's not going to happen. They barely have enough staff to support the C, fortran and Java interfaces. -Jeff -- Jeffrey S. Whitaker Phone : (303)497-6313 Meteorologist FAX : (303)497-6449 NOAA/OAR/PSD R/PSD1 Email : Jeffrey.S.Whitaker at noaa.gov 325 Broadway Office : Skaggs Research Cntr 1D-124 Boulder, CO, USA 80303-3328 Web : http://tinyurl.com/5telg From hetland at tamu.edu Fri Feb 9 14:35:27 2007 From: hetland at tamu.edu (Rob Hetland) Date: Fri, 9 Feb 2007 13:35:27 -0600 Subject: [Numpy-discussion] Request for porting pycdf to NumPy In-Reply-To: <45CCC949.9050408@hawaii.edu> References: <45CCC949.9050408@hawaii.edu> Message-ID: <1E07F547-E441-44C2-B43F-C5596DD301A4@tamu.edu> +1 for Whitaker's package. 
The new location of the package is:

http://code.google.com/p/netcdf4-python/

I use netcdf for everything I do, and so I checked out all of the packages in detail. All follow a similar API, although there are very minor differences that will catch you if you are not careful (like writing a single character). Although this is (ostensibly) made for netcdf4, Whitaker's package works well both reading and writing netcdf3 files. It also has support for stitching together multiple files. It also has support for compression of numpy data (as well as all of the other cool netcdf4 features).

The only disadvantage is that netcdf4 is a bit harder to install than netcdf3 -- primarily because the user must first install hdf. However, recent versions work fine on all the platforms I use (Mac and Redhat).

Clearly this is the way to go.

-Rob

On Feb 9, 2007, at 1:19 PM, Eric Firing wrote:

> I have been using Jeff Whitaker's netcdf4 interface with good results.
>
> I could not find the web page for it on a NOAA site--I think NOAA is
> reorganizing--but a search turned it up here. Maybe Jeff can provide a
> better link.
>
> http://netcdf4-python.googlecode.com/svn/trunk/docs/netCDF4-module.html
>
> Eric
>
> Daran Rife wrote:
>> Yep, I was referring to: http://pysclint.sourceforge.net/pycdf/
>>
>> Regarding the issue of deciding which netCDF interface to adopt
>> as the "standard", although it is unlikely we'll ever get consensus
>> on this, I have tried several of the netCDF interfaces, including
>> the one in Scientific, and have found that pycdf is the "cleanest";
>> it is very easy to use and understand.
>>
>> Daran
>>
>> --
>>
>> Travis Oliphant wrote:
>>
>>> Is pycdf the same as pynetcdf? Where is it located?
>>
>> I presume it's this one:
>>
>> http://pysclint.sourceforge.net/pycdf/
>>
>> _______________________________________________
>> Numpy-discussion mailing list
>> Numpy-discussion at scipy.org
>> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

----
Rob Hetland, Associate Professor
Dept. of Oceanography, Texas A&M University
http://pong.tamu.edu/~rob
phone: 979-458-0096, fax: 979-845-6331

From christian at marquardt.sc Fri Feb 9 16:00:27 2007
From: christian at marquardt.sc (Christian Marquardt)
Date: Fri, 9 Feb 2007 22:00:27 +0100 (CET)
Subject: [Numpy-discussion] Request for porting pycdf to NumPy
In-Reply-To: <36897.24.56.171.189.1171031507.squirrel@imap.rap.ucar.edu>
References: <36897.24.56.171.189.1171031507.squirrel@imap.rap.ucar.edu>
Message-ID: <21280.84.167.122.11.1171054827.squirrel@webmail.marquardt.sc>

Dear list,

attached is a patch for the original pycdf-0.6.2-rc1 distribution as available through sourceforge - and a little shell script illustrating how to install it. After applying the patch, it should be configured for numpy. Note that the "installation" script uses an environment variable PREFIX to define where the package is installed; it also assumes that the netcdf libraries are installed in $PREFIX/lib.

The original author already supported both numeric and numarray, so I just added a new subdirectory for numpy - which is simply the numeric version slightly changed. The patch is only that large because it replicates much of already existing code...

I have been using this "port" for many weeks now without any problems or difficulties.
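For anyone who has not applied a patch before, the steps are roughly as follows (a sketch only -- the -p level and directory names are assumptions, adjust them to your layout):

$ gunzip pycdf-0.6.2-rc1-numpy.patch.gz
$ cd pycdf-0.6.2-rc1
$ patch -p1 < ../pycdf-0.6.2-rc1-numpy.patch
$ python setup.py install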
I hope it's useful for others as well;-) Christian. On Fri, February 9, 2007 15:31, Daran L. Rife wrote: > Hi Travis, > > If you're still offering NumPy "patches" to third party > packages that rely upon Numeric, I would really like for > pycdf to be ported to NumPy. This would allow me to > completely transition to NumPy. > > Thanks very much for considering my request. > > > Daran Rife > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- A non-text attachment was scrubbed... Name: pycdf-0.6.2-rc1-numpy.patch.gz Type: application/x-gzip Size: 27143 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: build_pycdf-0.6-2-rc1-linux.sh Type: application/x-shellscript Size: 585 bytes Desc: not available URL: From christian at marquardt.sc Fri Feb 9 16:14:34 2007 From: christian at marquardt.sc (Christian Marquardt) Date: Fri, 9 Feb 2007 22:14:34 +0100 (CET) Subject: [Numpy-discussion] pyhdf / was: Request for porting pycdf to NumPy In-Reply-To: <21280.84.167.122.11.1171054827.squirrel@webmail.marquardt.sc> References: <36897.24.56.171.189.1171031507.squirrel@imap.rap.ucar.edu> <21280.84.167.122.11.1171054827.squirrel@webmail.marquardt.sc> Message-ID: <4069.84.167.122.11.1171055674.squirrel@webmail.marquardt.sc> As we are at it, Andre Gosselin (the guy who wrote pycdf) also wrote an interface to HDF4 (not 5) named pyhdf. I'm using that with numpy as well (patch attached), but I haven't tested it much - little more than just running the examples, really (which appear to be ok). Maybe it's useful... In pyhdf, the author didn't support different array interfaces; so the attached patch just modifes the existing source code and moves it over to numpy. I've also attached a "install script" which makes use of the particular lacotion of the HDF4 libraries (in $PREFIX/lib) and header files (in $PREFIX/include/hdf), so it it almost certainly needs to be adapted to the location of the actual HDF libraries and headers. Regards, Christian. On Fri, February 9, 2007 22:00, Christian Marquardt wrote: > Dear list, > > attached is a patch for the original pycdf-0.6.2-rc1 distribution as > available through sourceforge - and a little shell script illustrating > how to install it. After applyng the patch, it should be configured for > numpy. Note that the "installation" script uses an environment variable > PREFIX to define where the pacjkage is installed; it also assumes that > the netcdf libraries are installed in $PREFIX/lib. > > The orginal author already supported both numeric and numarray, so I just > added a new subdirectory for numpy - which is simply the numeric version > slightly changed. The patch is only that large because it replicates much > of already existing code... > > I have been using this "port" for many weeks now without any problems or > difficulties. I hope it's useful for others as well;-) > > Christian. > > > > On Fri, February 9, 2007 15:31, Daran L. Rife wrote: >> Hi Travis, >> >> If you're still offering NumPy "patches" to third party >> packages that rely upon Numeric, I would really like for >> pycdf to be ported to NumPy. This would allow me to >> completely transition to NumPy. >> >> Thanks very much for considering my request. 
>> >> >> Daran Rife >> >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- A non-text attachment was scrubbed... Name: pyhdf-0.7-3-numpy.patch Type: text/x-patch Size: 12890 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: build_pyhdf-0.7-3-numpy-linux.sh Type: application/x-shellscript Size: 604 bytes Desc: not available URL: From oliphant at ee.byu.edu Fri Feb 9 16:16:57 2007 From: oliphant at ee.byu.edu (Travis Oliphant) Date: Fri, 09 Feb 2007 14:16:57 -0700 Subject: [Numpy-discussion] pyhdf / was: Request for porting pycdf to NumPy In-Reply-To: <4069.84.167.122.11.1171055674.squirrel@webmail.marquardt.sc> References: <36897.24.56.171.189.1171031507.squirrel@imap.rap.ucar.edu> <21280.84.167.122.11.1171054827.squirrel@webmail.marquardt.sc> <4069.84.167.122.11.1171055674.squirrel@webmail.marquardt.sc> Message-ID: <45CCE4C9.5020504@ee.byu.edu> Christian Marquardt wrote: >As we are at it, > >Andre Gosselin (the guy who wrote pycdf) also wrote an interface to HDF4 >(not 5) named pyhdf. I'm using that with numpy as well (patch attached), >but I haven't tested it much - little more than just running the examples, >really (which appear to be ok). Maybe it's useful... > >In pyhdf, the author didn't support different array interfaces; so the >attached patch just modifes the existing source code and moves it over >to numpy. I've also attached a "install script" which makes use of the >particular lacotion of the HDF4 libraries (in $PREFIX/lib) and header >files (in $PREFIX/include/hdf), so it it almost certainly needs to be >adapted to the location of the actual HDF libraries and headers. > >Regards, > > Great job. Thanks so much. Would you mind adding these patches to the scipy web page. Or, perhaps we should place them in the numpy svn directory somewhere. -Travis From Chris.Barker at noaa.gov Fri Feb 9 16:28:15 2007 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Fri, 09 Feb 2007 13:28:15 -0800 Subject: [Numpy-discussion] pyhdf / was: Request for porting pycdf to NumPy In-Reply-To: <4069.84.167.122.11.1171055674.squirrel@webmail.marquardt.sc> References: <36897.24.56.171.189.1171031507.squirrel@imap.rap.ucar.edu> <21280.84.167.122.11.1171054827.squirrel@webmail.marquardt.sc> <4069.84.167.122.11.1171055674.squirrel@webmail.marquardt.sc> Message-ID: <45CCE76F.70301@noaa.gov> > Andre Gosselin (the guy who wrote pycdf) also wrote an interface to HDF4 > (not 5) named pyhdf. Is he still maintaining these packages? Have you submitted the patches to him? -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From evanmason at gmail.com Fri Feb 9 17:22:41 2007 From: evanmason at gmail.com (Evan Mason) Date: Fri, 9 Feb 2007 14:22:41 -0800 Subject: [Numpy-discussion] averaging array, with fill values Message-ID: <4254ff910702091422j78fff464i1c80fe0c20555399@mail.gmail.com> Hi, I want to get the mean over an array with 3 dimensions. 
The array has, say, dimensions 2x3x4 and I want the result to be 3x4; so the averaging is to be done over the 1st dimension. Also, there are some fill values, and I want these to be excluded from the calculation. I know I can use the ma module to locate the fill values, but after that I am unsure how to exclude them, and how to do the averaging. In [150]: x = rand(2,3,4) In [151]: x[0,2,1]=999 # fill value In [152]: x[1,2,3]=999 In [153]: x[0,1,3]=999 In [154]: x Out[154]: array([[[ 3.01880915e-02, 5.77085271e-02, 7.59176038e-01, 4.15271486e-01], [ 5.48643693e-01, 3.84995126e-01 , 5.01683678e-01, 9.99000000e+02], [ 5.72779004e-01, 9.99000000e+02, 4.18143934e-01, 5.84781674e-01]], [[ 8.90443158e-01, 3.76986788e-01, 8.81270409e-01, 5.19094405e-01], [ 8.12944573e-01, 3.89858156e-01, 5.99219891e-01, 9.99000000e+02], [ 2.31215256e-01, 5.93222883e-01, 2.45004093e-01, 9.18647954e-01]]]) In [155]: Thanks in advance for any help. -Evan -------------- next part -------------- An HTML attachment was scrubbed... URL: From christian at marquardt.sc Fri Feb 9 17:24:56 2007 From: christian at marquardt.sc (Christian Marquardt) Date: Fri, 9 Feb 2007 23:24:56 +0100 (CET) Subject: [Numpy-discussion] pyhdf / was: Request for porting pycdf to NumPy In-Reply-To: <45CCE76F.70301@noaa.gov> References: <36897.24.56.171.189.1171031507.squirrel@imap.rap.ucar.edu> <21280.84.167.122.11.1171054827.squirrel@webmail.marquardt.sc> <4069.84.167.122.11.1171055674.squirrel@webmail.marquardt.sc> <45CCE76F.70301@noaa.gov> Message-ID: <27130.84.167.122.11.1171059896.squirrel@webmail.marquardt.sc> On Fri, February 9, 2007 22:28, Christopher Barker wrote: >> Andre Gosselin (the guy who wrote pycdf) also wrote an interface to HDF4 >> (not 5) named pyhdf. > > Is he still maintaining these packages? Have you submitted the patches > to him? a) Don't know; the last releases of pycdf and pyhdf are from February 2001 and July 2005, respectively. b) Yes, but just half an hour ago, after I had seen the request for pycdf here. I would actually prefer if Andre would apply the patches in some way in his distribution. The C wrappers for both pycdf and pyhdf are created with swig, but the original interface description files are not included in the distribution. So I patched the generated wrapper code instead of the original files. Or rather let Travis' alter_codeN do the job;-)) Chris. > > -Chris > > > > -- > Christopher Barker, Ph.D. > Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R (206) 526-6959 voice > 7600 Sand Point Way NE (206) 526-6329 fax > Seattle, WA 98115 (206) 526-6317 main reception > > Chris.Barker at noaa.gov > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From christian at marquardt.sc Fri Feb 9 17:27:05 2007 From: christian at marquardt.sc (Christian Marquardt) Date: Fri, 9 Feb 2007 23:27:05 +0100 (CET) Subject: [Numpy-discussion] pyhdf / was: Request for porting pycdf to NumPy In-Reply-To: <27130.84.167.122.11.1171059896.squirrel@webmail.marquardt.sc> References: <36897.24.56.171.189.1171031507.squirrel@imap.rap.ucar.edu> <21280.84.167.122.11.1171054827.squirrel@webmail.marquardt.sc> <4069.84.167.122.11.1171055674.squirrel@webmail.marquardt.sc> <45CCE76F.70301@noaa.gov> <27130.84.167.122.11.1171059896.squirrel@webmail.marquardt.sc> Message-ID: <23364.84.167.122.11.1171060025.squirrel@webmail.marquardt.sc> Oops! 
> a) Don't know; the last releases of pycdf and pyhdf are from February 2001 ^^^^ pycdf is from 2006, of course. Sorry! Chris. From nicolas.grilly at garden-paris.com Fri Feb 9 22:11:45 2007 From: nicolas.grilly at garden-paris.com (Nicolas Grilly) Date: Sat, 10 Feb 2007 04:11:45 +0100 Subject: [Numpy-discussion] Function numpy.get_include() has a side effet Message-ID: <554cfd4f0702091911w662b09e5t1abf3e32ba0685d5@mail.gmail.com> Hello, I have a package with a distutils script. This script calls numpy.get_include(), in order to get numpy's header files location. But the function seems to have a side effect by silently patching distutils. Some classes and functions of standard library distutils are replaced by numpy's ones. For example, before calling numpy.get_include(), the function new_compiler comes from standard library: >>> from distutils.ccompiler import new_compiler >>> new_compiler.__module__ 'distutils.ccompiler' But after calling numpy.get_include(), the function comes from numpy: >>> import numpy >>> numpy.get_include() >>> from distutils.ccompiler import new_compiler >>> new_compiler.__module__ 'numpy.distutils.ccompiler' This is because the code of numpy.get_include() imports numpy.distutils, and this triggers the patching mechanism. This is a problem because the behavior of distutils is different before and after the call to numpy.get_include(), and it breaks my distutils script. Is it possible to remove this side effect? Thanks, Nicolas Grilly From kwgoodman at gmail.com Sat Feb 10 11:25:44 2007 From: kwgoodman at gmail.com (Keith Goodman) Date: Sat, 10 Feb 2007 08:25:44 -0800 Subject: [Numpy-discussion] fill_value in masked arrays Message-ID: The doc strings for MaskedArray and ma.argsort do not seem consistent. The doc string for MaskedArray says "The fill_value is not used for computation within this module." But the doc string for ma.argsort says "Treating masked values as if they have the value fill_value, return sort indices for sorting along given axis." I'm trying to argsort matrices that have missing values. From matthew.brett at gmail.com Sun Feb 11 09:30:17 2007 From: matthew.brett at gmail.com (Matthew Brett) Date: Sun, 11 Feb 2007 14:30:17 +0000 Subject: [Numpy-discussion] Fwd: Numpy x Matlab: some synthetic benchmarks In-Reply-To: <1137584647.19613.17.camel@localhost.localdomain> References: <1137584647.19613.17.camel@localhost.localdomain> Message-ID: <1e2af89e0702110630q7e274603j6ce53061ae6502d0@mail.gmail.com> Hi, I don't know if people remember this thread, but my memory was that there might be some interest in including numpy / matlab benchmark code somewhere in the distribution, to check on performance and allow matlab people to do a direct comparison. Is this still of interest? If so, what should be the next step? Thanks a lot, Matthew ---------- Forwarded message ---------- From: Paulo Jose da Silva e Silva Date: Jan 18, 2006 11:44 AM Subject: [Numpy-discussion] Numpy x Matlab: some synthetic benchmarks To: numpy-discussion Hello, Travis asked me to benchmark numpy versus matlab in some basic linear algebra operations. 
Here are the results for matrices/vectors of dimensions 5, 50 and 500:

Operation   x'*y    x*y'    A*x     A*B      A'*x    Half    2in2

Dimension 5
Array       0.94    0.7     0.22    0.28     1.12    0.98    1.1
Matrix      7.06    1.57    0.66    0.79     1.6     3.11    4.56
Matlab      1.88    0.44    0.41    0.35     0.37    1.2     0.98

Dimension 50
Array       9.74    3.09    0.56    18.12    13.93   4.2     4.33
Matrix      81.99   3.81    1.04    19.13    14.58   6.3     7.88
Matlab      16.98   1.94    1.07    17.86    0.73    1.57    1.77

Dimension 500
Array       1.2     8.97    2.03    166.59   20.34   3.99    4.31
Matrix      17.95   9.09    2.07    166.62   20.67   4.11    4.45
Matlab      2.09    6.07    2.17    169.45   2.1     2.56    3.06

Obs: The operation Half is actually A*x using only the lower half of the matrix and vector. The operation 2in2 is A*x using only the even indexes. Of course there are many repetitions of the same operation: 100000 for dim 5 and 50, and 1000 for dim 500. For the inner product the number of repetitions is multiplied by the dimension (it is very fast).

The software is

numpy svn version 1926
Matlab 6.5.0.180913a Release 13 (Jun 18 2002)

Both programs are using the *same* BLAS and LAPACK (ATLAS for SSE).

As you can see, the numpy array looks very competitive. The matrix class in numpy has too much overhead for small dimensions though. This overhead is very small for medium size arrays.

Looking at the results above (especially the small-dimension ones; for higher dimensions the main computations are being performed by the same BLAS) I believe we can say:

1) The numpy array is faster on the usual operations except the outer product (I believe the reason is that the dot function uses the regular matrix multiplication to compute outer products, instead of using a special function. This can "easily" be changed). In particular numpy was faster in matrix times vector operations, which are the most usual in numerical linear algebra.

2) Any operation that involves a transpose suffers a very big penalty in numpy. Compare A'*x and A*x: it is 10 times slower. In contrast Matlab deals with transposes quite well. Travis is already aware of this and it can probably be solved.

3) When using subarrays, numpy is slower. The difference seems acceptable. Travis, can this be improved?

Best,

Paulo

Obs: Later on (in a couple of days) I may present less synthetic benchmarks (a QR factorization and a Modified Cholesky).

_______________________________________________
Numpy-discussion mailing list
Numpy-discussion at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/numpy-discussion

From pgmdevlist at gmail.com Sun Feb 11 13:14:37 2007
From: pgmdevlist at gmail.com (Pierre GM)
Date: Sun, 11 Feb 2007 13:14:37 -0500
Subject: [Numpy-discussion] fill_value in masked arrays
In-Reply-To: References: Message-ID: <200702111314.38064.pgmdevlist@gmail.com>

On Saturday 10 February 2007 11:25:44 Keith Goodman wrote:
> The doc strings for MaskedArray and ma.argsort do not seem consistent.

numpy.core.ma ? Or the other implementation in the SVN ?

> The doc string for MaskedArray says "The fill_value is not used for
> computation within this module." But the doc string for ma.argsort
> says "Treating masked values as if they have the value fill_value,
> return sort indices for sorting along given axis."

The fill_value is used to convert a MaskedArray to a regular ndarray.
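A tiny sketch of what I mean (the -999 here is arbitrary):

>>> x = array([1, 2, 3], mask=[0, 1, 0], fill_value=-999)
>>> x.filled()
array([   1, -999,    3])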
When sorting an array with missing data, the fill_value is used to find the end at which the missing data should be: for example, when sorting a 1D float array, a fill_value of -inf will put the missing data on the left (beginning, small indices), a fill_value of +inf will put the missing data on the right (end, large indices). Other than that, the fill_value is not used in computations: for sum, it is temporarily set to 0, for prod, to 1...

> I'm trying to argsort matrices that have missing values.

You have an argsort method and an argsort function that behave similarly. The output is an array of indices. Depending on the fill_value you choose, the indices corresponding to the missing data will be either at the beginning or the end. Example:

>>> x = array([0,2,4,3,1], mask=[0,0,1,1,0])
>>> x
masked_array(data = [0 2 -- -- 1],
      mask = [False False True True False],
      fill_value=999999)
>>> x.argsort(fill_value=-999999)
array([2, 3, 0, 4, 1])
>>> x.argsort(fill_value=+999999)
array([0, 4, 1, 2, 3])

Now, if you want only the indices that are not masked, you can use something like this:

>>> a = x.argsort(fill_value=+999999)
>>> a[~x._mask[a]]
array([0, 4, 1])

From david at ar.media.kyoto-u.ac.jp Mon Feb 12 00:02:08 2007
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Mon, 12 Feb 2007 14:02:08 +0900
Subject: [Numpy-discussion] Possible solution to binary distributionproblems for numpy on linux?
Message-ID: <45CFF4D0.5080401@ar.media.kyoto-u.ac.jp>

Hi there,

I came across an interesting post on Miguel De Icaza's blog this week-end:

http://tirania.org/blog/archive/2007/Jan-26.html

The build system used in openSUSE was open sourced a few weeks ago. The basic idea is that it provides a build farm to build packages for most major distributions, automatically, with automatic dependency tracking for rebuilding a package when its dependencies change, etc... [1]

My questions are:
- does it seem interesting to numpy developers? My impression is that binary distribution of numpy is a big problem for many linux users, and that it is an entry barrier for many users (I may be wrong, that's just an impression from the ML).
- the registration requires agreement from the open build system's team for now. I would be interested in trying this out, but I didn't want to "proclaim" myself as a numpy developer without consent from the numpy dev team.

cheers,

David

[1] I have not studied it thoroughly, but the idea is:
- you submit the sources of your package + a description file
- you upload it to the build system
- the build system consists of a build farm to build binary packages automatically for many distributions (including openSUSE, SUSE, Fedora, Ubuntu and Debian; the biggest distributions in terms of market share which are not there are Slackware + Gentoo, but I guess users of those distributions would know enough to compile packages themselves).

Besides the build farm, some advantages are:
- automatic rebuilding when one of the dependencies changes (let's say the Fortran compiler changed in Debian -> numpy, which depends on it, would be rebuilt automatically)
- a system for mirroring.

The system is still in beta, and requires registration for trying it as a developer.

From eric at enthought.com Mon Feb 12 00:19:52 2007
From: eric at enthought.com (eric jones)
Date: Sun, 11 Feb 2007 23:19:52 -0600
Subject: [Numpy-discussion] pickling ufuncs?
Message-ID: <45CFF8F8.40206@enthought.com>

I recently noticed that we can't pickle ufuncs (like sin, ...).
Is there any technical reason this doesn't work, or is it in the category of 'just needs to be done...' FYI, I noticed that it didn't work on the old Numeric either. thanks, eric Python 2.4.3 - Enthought Edition 1.0.0 (#69, Aug 2 2006, 12:09:59) [MSC v.1310 32 bit (Intel)] on win32 Type "help", "copyright", "credits" or "license" for more information. >>> import pickle >>> import numpy >>> import math >>> pickle.dumps(numpy.arange) 'cnumpy.core.multiarray\narange\np0\n.' >>> pickle.dumps(math.sin) 'cmath\nsin\np0\n.' >>> pickle.dumps(numpy.sin) Traceback (most recent call last): File "", line 1, in ? File "C:\Python24\lib\pickle.py", line 1386, in dumps Pickler(file, protocol, bin).dump(obj) File "C:\Python24\lib\pickle.py", line 231, in dump self.save(obj) File "C:\Python24\lib\pickle.py", line 313, in save rv = reduce(self.proto) File "C:\Python24\lib\copy_reg.py", line 69, in _reduce_ex raise TypeError, "can't pickle %s objects" % base.__name__ TypeError: can't pickle ufunc objects >>> From gleban at gmail.com Mon Feb 12 03:51:54 2007 From: gleban at gmail.com (Gregor Leban) Date: Mon, 12 Feb 2007 09:51:54 +0100 Subject: [Numpy-discussion] using a vector of values in masked array's filled() function Message-ID: Hi, I'm converting my Numeric code to numpy and I ran into a problem. In Numeric the MA.filled() function was able to accept a vector of fill values and it used the first value in the vector to fill the missing values in the first row of the matrix, the second value for the second row, etc. The filled() function in numpy now throws an exception "ValueError: array is not broadcastable to correct shape" if I give it a vector of fill values. Is there any other way or function in numpy that would perform the thing as Numeric did? I know that I could write a loop but I often have data with a huge number of rows and I would prefer some optimized way of doing this. Best regards, Gregor -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Mon Feb 12 09:01:25 2007 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 12 Feb 2007 08:01:25 -0600 Subject: [Numpy-discussion] pickling ufuncs? In-Reply-To: <45CFF8F8.40206@enthought.com> References: <45CFF8F8.40206@enthought.com> Message-ID: <45D07335.7080805@gmail.com> eric jones wrote: > I recently noticed that we can't pickle ufuncs (like sin, ...). Is > there any technical reason this doesn't work, or is it in the category > of 'just needs to be done...' We talked about this about two weeks ago: http://projects.scipy.org/pipermail/numpy-discussion/2007-January/025778.html In short, there is a technical reason that it doesn't work in general, there is a workaround that will work for all of the ufuncs that are exposed in modules (but not those created on the fly by scipy.vectorize(), for example), and that workaround is in the category of "just needs to be done." -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From eike.welk at gmx.net Mon Feb 12 12:29:40 2007 From: eike.welk at gmx.net (Eike Welk) Date: Mon, 12 Feb 2007 18:29:40 +0100 Subject: [Numpy-discussion] Possible solution to binary distributionproblems for numpy on linux? In-Reply-To: <45CFF4D0.5080401@ar.media.kyoto-u.ac.jp> References: <45CFF4D0.5080401@ar.media.kyoto-u.ac.jp> Message-ID: <200702121829.41208.eike.welk@gmx.net> Hello David! 
You may get in contact with Werner Hoch. He currently creates NumPy, SciPy, and Matplotlib packages for Opensuse. werner dot ho (at the server) gmx dot de The packages are here: http://repos.opensuse.org/science/ Regards Eike. From gosselina at dfo-mpo.gc.ca Mon Feb 12 12:45:26 2007 From: gosselina at dfo-mpo.gc.ca (Andre Gosselin) Date: Mon, 12 Feb 2007 12:45:26 -0500 Subject: [Numpy-discussion] New release of pycdf package ported to NumPy In-Reply-To: <1171247042.4560.6.camel@maboule> References: <1171247042.4560.6.camel@maboule> Message-ID: <1171302326.4803.4.camel@maboule> Hi everybody. I have just released version 0.6-3 of pycdf, which finally supports the NumPy array package (along Numeric and numarray). Here is the announcement. ================================== Project "pysclint" ('pysclint') has released the new version of package 'pycdf'. You can download it from SourceForge.net by following this link: or browse Release Notes and ChangeLog by visiting this link: ================================== I was pleased to follow a previous thread about pycdf, and see the number of people who are pleased with it. To remove any doubt, I will still actively maintain pycdf. Best regards, Andr? Gosselin gosselina at dfo-mpo.gc.ca From kwgoodman at gmail.com Mon Feb 12 12:45:42 2007 From: kwgoodman at gmail.com (Keith Goodman) Date: Mon, 12 Feb 2007 09:45:42 -0800 Subject: [Numpy-discussion] Possible solution to binary distributionproblems for numpy on linux? In-Reply-To: <45CFF4D0.5080401@ar.media.kyoto-u.ac.jp> References: <45CFF4D0.5080401@ar.media.kyoto-u.ac.jp> Message-ID: On 2/11/07, David Cournapeau wrote: > My impression is that binary distribution of numpy is a big problem for many > linux users, and that is entry barrier for many users (I may be wrong, that's just > an impression from the ML). Do all of the major GNU/Linux distributions have recent versions of NumPy? Debian Etch is at NumPy 1.0.1 From gosselina at dfo-mpo.gc.ca Mon Feb 12 16:01:54 2007 From: gosselina at dfo-mpo.gc.ca (Andre Gosselin) Date: Mon, 12 Feb 2007 16:01:54 -0500 Subject: [Numpy-discussion] New release of pycdf package ported to NumPy In-Reply-To: 1171247042.4560.6.camel@maboule Message-ID: <1171314114.6259.4.camel@maboule> A small error slipped into the pycdf-0.6-3 release. Attribute deletion through del(), and dimension length inquiry through len() were missing in that release. A new pycdf-0.6.3b fixes those problems. I have withdrawn pycdf-0.6.3 from Sourceforge.net . Those people who have already downloaded this release can safely continue to use it,if they do not mind missing the del() and len() features. Reagrds, Andre Gosselin From evanmason at gmail.com Mon Feb 12 18:55:00 2007 From: evanmason at gmail.com (Evan Mason) Date: Mon, 12 Feb 2007 15:55:00 -0800 Subject: [Numpy-discussion] concatenate and different numbers of dimensions Message-ID: <4254ff910702121555m7867752fic5d6565666188d09@mail.gmail.com> Hi, I am trying to use concatenate to join together a 40x50x45 array and a 50x45 array. The shape of the resulting array should be 41x50x45. 
In [132]: tempLevel.shape
Out[132]: (40, 50, 45)

In [133]: temp.shape
Out[133]: (50, 45)

I've tried various ways to do this using concatenate but always get the following:

In [142]: concatenate((tempLevel, temp), axis=0)
---------------------------------------------------------------------------
exceptions.ValueError        Traceback (most recent call last)

/home/emason/python/tools/

ValueError: arrays must have same number of dimensions

How do I do this with concatenate, or is there another way to do it?

Many thanks, Evan

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From torgil.svensson at gmail.com Mon Feb 12 19:02:15 2007
From: torgil.svensson at gmail.com (Torgil Svensson)
Date: Tue, 13 Feb 2007 01:02:15 +0100
Subject: [Numpy-discussion] concatenate and different numbers of dimensions
In-Reply-To: <4254ff910702121555m7867752fic5d6565666188d09@mail.gmail.com>
References: <4254ff910702121555m7867752fic5d6565666188d09@mail.gmail.com>
Message-ID: 

As the error message says, they must match in number of dimensions; try:

concatenate((tempLevel, temp[newaxis,...]), axis=0)

On 2/13/07, Evan Mason wrote:
> Hi, I am trying to use concatenate to join together a 40x50x45 array and a
> 50x45 array. The shape of the resulting array should be 41x50x45.
>
> In [132]: tempLevel.shape
> Out[132]: (40, 50, 45)
>
> In [133]: temp.shape
> Out[133]: (50, 45)
>
> I've tried various ways to do this using concatenate but always get the
> following:
>
> In [142]: concatenate((tempLevel, temp), axis=0)
> ---------------------------------------------------------------------------
> exceptions.ValueError        Traceback (most recent call last)
>
> /home/emason/python/tools/
>
> ValueError: arrays must have same number of dimensions
>
> How do I do this with concatenate, or is there another way to do it?
>
> Many thanks, Evan
>
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From david at ar.media.kyoto-u.ac.jp Mon Feb 12 20:48:44 2007
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Tue, 13 Feb 2007 10:48:44 +0900
Subject: [Numpy-discussion] Possible solution to binary distributionproblems for numpy on linux?
In-Reply-To: References: <45CFF4D0.5080401@ar.media.kyoto-u.ac.jp>
Message-ID: <45D118FC.7020403@ar.media.kyoto-u.ac.jp>

Keith Goodman wrote:
> On 2/11/07, David Cournapeau wrote:
>> My impression is that binary distribution of numpy is a big problem for many
>> linux users, and that it is an entry barrier for many users (I may be wrong,
>> that's just an impression from the ML).
>
> Do all of the major GNU/Linux distributions have recent versions of NumPy?
>
> Debian Etch is at NumPy 1.0.1

I think Debian has numpy now (I am not using Debian on my workstations anymore, so I am not really following), but what about new versions of numpy/scipy? If I want to give some of my code to people in my lab who do not use the same distribution as me, can I give them a 10-minute set of instructions to get everything? If you read the wiki for Linux, you find all kinds of distribution-specific information, which demands some knowledge about compilers and so on, etc...

If the openSUSE build system works as I think it does, this would be kind of helpful. Particularly the rebuilding of packages when one of its dependencies changes.
cheers, David From konrad.hinsen at laposte.net Tue Feb 13 08:34:20 2007 From: konrad.hinsen at laposte.net (Konrad Hinsen) Date: Tue, 13 Feb 2007 14:34:20 +0100 Subject: [Numpy-discussion] Request for porting pycdf to NumPy In-Reply-To: <45CCB064.4010706@noaa.gov> References: <36897.24.56.171.189.1171031507.squirrel@imap.rap.ucar.edu> <45CCAC28.3060006@ee.byu.edu> <45CCACB5.2030206@gmail.com> <45CCB064.4010706@noaa.gov> Message-ID: On Feb 9, 2007, at 18:33, Christopher Barker wrote: > hmmm, this page from unidata: > > http://www.unidata.ucar.edu/software/netcdf/software.html#Python > > Indicates 4-6 different Python netcfd interfaces! ... > I think it would be ideal if Unidata could be persuaded to adopt an > maintain one as the official version to save this extra work an > confusion. Maybe there is some hope of this for netcdf4. It would certainly be nice to have an "official" Python interface maintained by somebody else, but I don't expect this to happen any time soon. Moreover, the existing interfaces have different characteristics that are partially incompatible but each appeal to some users. My own netCDF interface (part of ScientificPython) was designed to satisfy two major criteria: 1) Be as Pythonic as possible by implementing common object interfaces: arrays, dictionaries, and object attributes. netCDF variables are seen as "arrays on disk". 2) Provide netCDF access from Python as well as from Python extension modules written in C (or nowadays Pyrex). However, I can understand the needs of those with different priorities, e.g. those who want an API that resembles the C API for netCDF. Fortunately, there is something for everybody. Konrad. -- --------------------------------------------------------------------- Konrad Hinsen Centre de Biophysique Mol?culaire, CNRS Orl?ans Synchrotron Soleil - Division Exp?riences Saint Aubin - BP 48 91192 Gif sur Yvette Cedex, France Tel. +33-1 69 35 97 15 E-Mail: hinsen at cnrs-orleans.fr --------------------------------------------------------------------- From mjanikas at esri.com Tue Feb 13 14:42:35 2007 From: mjanikas at esri.com (Mark Janikas) Date: Tue, 13 Feb 2007 11:42:35 -0800 Subject: [Numpy-discussion] fromstring, tostring slow? Message-ID: <627102C921CD9745B070C3B10CB8199B010EBFB0@hardwire.esri.com> Hello all, I am finding that directly packing numpy arrays into binary using the tostring and fromstring methods do not provide a speed improvement over writing the same arrays to ascii files. Obviously, the size of the resulting files is far smaller, but I was hoping to get an improvement in the speed of writing. I got that speed improvement using the struct module directly, or by using generic python arrays. Let me further describe my methodological issue as it may directly relate to any solution you might have. My output file is heterogeneous. Each line is either an array of integers or floats. Each record is made up of three entries. They serve as a sparse representation of a large matrix. 1) row, n (both integers) 2) array of integers of length n, representing columns 3) array of floats of length n, representing values Here, "n" is not constant across the records, so many of the database structures I have looked at do not apply. Any suggestions would be greatly appreciated. Mark Janikas -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From stefan at sun.ac.za Tue Feb 13 15:03:26 2007 From: stefan at sun.ac.za (Stefan van der Walt) Date: Tue, 13 Feb 2007 22:03:26 +0200 Subject: [Numpy-discussion] fromstring, tostring slow? In-Reply-To: <627102C921CD9745B070C3B10CB8199B010EBFB0@hardwire.esri.com> References: <627102C921CD9745B070C3B10CB8199B010EBFB0@hardwire.esri.com> Message-ID: <20070213200326.GN6150@mentat.za.net> On Tue, Feb 13, 2007 at 11:42:35AM -0800, Mark Janikas wrote: > I am finding that directly packing numpy arrays into binary using the tostring > and fromstring methods do not provide a speed improvement over writing the same > arrays to ascii files. Obviously, the size of the resulting files is far > smaller, but I was hoping to get an improvement in the speed of writing. I got > that speed improvement using the struct module directly, or by using generic > python arrays. Let me further describe my methodological issue as it may > directly relate to any solution you might have. Hi Mark Can you post a benchmark code snippet to demonstrate your results? Here, using 1.0.2.dev3545, I see: In [26]: x = N.random.random(100) In [27]: timeit f = file('/tmp/blah.dat','w'); f.write(str(x)) 100 loops, best of 3: 1.77 ms per loop In [28]: timeit f = file('/tmp/blah','w'); x.tofile(f) 10000 loops, best of 3: 100 ?s per loop (I see the same results for heterogeneous arrays) Cheers St?fan From mjanikas at esri.com Tue Feb 13 15:36:01 2007 From: mjanikas at esri.com (Mark Janikas) Date: Tue, 13 Feb 2007 12:36:01 -0800 Subject: [Numpy-discussion] fromstring, tostring slow? In-Reply-To: <20070213200326.GN6150@mentat.za.net> Message-ID: <627102C921CD9745B070C3B10CB8199B010EBFB1@hardwire.esri.com> Good call Stefan, I decoupled the timing from the application (duh!) and got better results: from numpy import * import numpy.random as RAND import time as TIME x = RAND.random(1000) xl = x.tolist() t1 = TIME.clock() xStringOut = [ str(i) for i in xl ] xStringOut = " ".join(xStringOut) f = file('blah.dat','w'); f.write(xStringOut) t2 = TIME.clock() total = t2 - t1 t1 = TIME.clock() f = file('blah.bwt','wb') xBinaryOut = x.tostring() f.write(xBinaryOut) t2 = TIME.clock() total1 = t2 - t1 >>> total 0.00661 >>> total1 0.00229 Printing x directly to a string took REALLY long: f.write(str(x)) = 0.0258 The problem therefore, must be in the way I am appending values to the empty arrays. I am currently using the append method: myArray = append(myArray, newValue) Or would it be faster to concat or use a list append then convert? But to be more sure, Ill have to profile it. It seems a bit odd in that there are far less loops and conversions in my current implementation for the binary, yet it is still running slower. -----Original Message----- From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Stefan van der Walt Sent: Tuesday, February 13, 2007 12:03 PM To: numpy-discussion at scipy.org Subject: Re: [Numpy-discussion] fromstring, tostring slow? On Tue, Feb 13, 2007 at 11:42:35AM -0800, Mark Janikas wrote: > I am finding that directly packing numpy arrays into binary using the tostring > and fromstring methods do not provide a speed improvement over writing the same > arrays to ascii files. Obviously, the size of the resulting files is far > smaller, but I was hoping to get an improvement in the speed of writing. I got > that speed improvement using the struct module directly, or by using generic > python arrays. 
Let me further describe my methodological issue as it may > directly relate to any solution you might have. Hi Mark Can you post a benchmark code snippet to demonstrate your results? Here, using 1.0.2.dev3545, I see: In [26]: x = N.random.random(100) In [27]: timeit f = file('/tmp/blah.dat','w'); f.write(str(x)) 100 loops, best of 3: 1.77 ms per loop In [28]: timeit f = file('/tmp/blah','w'); x.tofile(f) 10000 loops, best of 3: 100 ?s per loop (I see the same results for heterogeneous arrays) Cheers St?fan _______________________________________________ Numpy-discussion mailing list Numpy-discussion at scipy.org http://projects.scipy.org/mailman/listinfo/numpy-discussion From charlesr.harris at gmail.com Tue Feb 13 15:44:28 2007 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 13 Feb 2007 13:44:28 -0700 Subject: [Numpy-discussion] fromstring, tostring slow? In-Reply-To: <627102C921CD9745B070C3B10CB8199B010EBFB1@hardwire.esri.com> References: <20070213200326.GN6150@mentat.za.net> <627102C921CD9745B070C3B10CB8199B010EBFB1@hardwire.esri.com> Message-ID: On 2/13/07, Mark Janikas wrote: > > Good call Stefan, > > I decoupled the timing from the application (duh!) and got better results: > > from numpy import * > import numpy.random as RAND > import time as TIME > > x = RAND.random(1000) > xl = x.tolist() > > t1 = TIME.clock() > xStringOut = [ str(i) for i in xl ] > xStringOut = " ".join(xStringOut) > f = file('blah.dat','w'); f.write(xStringOut) > t2 = TIME.clock() > total = t2 - t1 > t1 = TIME.clock() > f = file('blah.bwt','wb') > xBinaryOut = x.tostring() > f.write(xBinaryOut) > t2 = TIME.clock() > total1 = t2 - t1 > > >>> total > 0.00661 > >>> total1 > 0.00229 > > Printing x directly to a string took REALLY long: f.write(str(x)) = 0.0258 > > The problem therefore, must be in the way I am appending values to the > empty arrays. I am currently using the append method: > > myArray = append(myArray, newValue) > > Or would it be faster to concat or use a list append then convert? I am going to guess that a list would be faster for appending. Concat and, I suspect, append make new arrays for each use, rather like string concatenation in Python. A list, on the other hand, is no doubt optimized for adding new values. Another option might be using PyTables with extensible arrays. In any case, a bit of timing should show the way if the performance is that crucial to your application. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From pgmdevlist at gmail.com Tue Feb 13 16:10:09 2007 From: pgmdevlist at gmail.com (Pierre GM) Date: Tue, 13 Feb 2007 16:10:09 -0500 Subject: [Numpy-discussion] Docstring formatting Message-ID: <200702131610.09779.pgmdevlist@gmail.com> Dear All, Did we come to some kind of agreement about the formatting of docstring ? In any case, where could I find the 'official' format that Travis recommended ? Thanks a lot in advance P. From mjanikas at esri.com Tue Feb 13 16:24:09 2007 From: mjanikas at esri.com (Mark Janikas) Date: Tue, 13 Feb 2007 13:24:09 -0800 Subject: [Numpy-discussion] fromstring, tostring slow? In-Reply-To: Message-ID: <627102C921CD9745B070C3B10CB8199B010EBFB2@hardwire.esri.com> Yup. It was faster to: Use lists for the append, then transform into an array, then transform into a binary string Rather than Create empty arrays and use its append method, then transform into a binary string. 
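In outline, the two patterns I compared look like this (a sketch; "records" stands in for the real data values):

import numpy as N

# slower: append() builds a brand new array on every call
a = N.array([])
for v in records:
    a = N.append(a, v)
out = a.tostring()

# faster: grow a plain Python list, convert once at the end
l = []
for v in records:
    l.append(v)
out = N.array(l).tostring()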
The last question on the output would then be to test the speed of using generic Python arrays, which have append methods as well. Then, there would still only be the binary string conversion, as opposed to list --> numpy array --> binary string.

Thanks to all for your input....

MJ

________________________________
From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Charles R Harris
Sent: Tuesday, February 13, 2007 12:44 PM
To: Discussion of Numerical Python
Subject: Re: [Numpy-discussion] fromstring, tostring slow?

I am going to guess that a list would be faster for appending. Concat and, I suspect, append make new arrays for each use, rather like string concatenation in Python. A list, on the other hand, is no doubt optimized for adding new values. Another option might be using PyTables with extensible arrays. In any case, a bit of timing should show the way if the performance is that crucial to your application.

Chuck

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From stefan at sun.ac.za Tue Feb 13 16:30:21 2007
From: stefan at sun.ac.za (Stefan van der Walt)
Date: Tue, 13 Feb 2007 23:30:21 +0200
Subject: [Numpy-discussion] Docstring formatting
In-Reply-To: <200702131610.09779.pgmdevlist@gmail.com>
References: <200702131610.09779.pgmdevlist@gmail.com>
Message-ID: <20070213213021.GQ6150@mentat.za.net>

Hi Pierre

On Tue, Feb 13, 2007 at 04:10:09PM -0500, Pierre GM wrote:
> Dear All,
> Did we come to some kind of agreement about the formatting of docstring ? In
> any case, where could I find the 'official' format that Travis recommended ?

Travis checked in numpy/doc/HOWTO_DOCUMENT.txt a couple of days ago.

Cheers
Stéfan

From Chris.Barker at noaa.gov Tue Feb 13 16:38:47 2007
From: Chris.Barker at noaa.gov (Christopher Barker)
Date: Tue, 13 Feb 2007 13:38:47 -0800
Subject: [Numpy-discussion] fromstring, tostring slow?
In-Reply-To: <627102C921CD9745B070C3B10CB8199B010EBFB0@hardwire.esri.com>
References: <627102C921CD9745B070C3B10CB8199B010EBFB0@hardwire.esri.com>
Message-ID: <45D22FE7.3010304@noaa.gov>

Mark Janikas wrote:
> I am finding that directly packing numpy arrays into binary using the
> tostring and fromstring methods

For starters, use fromfile and tofile, to save the overhead of creating an entire extra string. fromfile is a function (as it is an alternate constructor for arrays):

numpy.fromfile()

ndarray.tofile() is an array method.

Enclosed is your test, including a test for tofile(). I needed to make the arrays much larger, and use time.time() rather than time.clock(), to get enough time resolution to see anything, though if you really want to be accurate, you need to use the timeit module. My results:

Using lists     0.457561016083
Using tostring  0.00922703742981
Using tofile    0.00431108474731

Another note: where is the data coming from -- there may be ways to optimize this whole process if we saw that.

-Chris

-- 
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R (206) 526-6959 voice
7600 Sand Point Way NE (206) 526-6329 fax
Seattle, WA 98115 (206) 526-6317 main reception

Chris.Barker at noaa.gov

-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: junk.py
URL: 

From cjw at sympatico.ca Tue Feb 13 18:26:22 2007
From: cjw at sympatico.ca (Colin J.
Date: Tue, 13 Feb 2007 18:26:22 -0500 Subject: [Numpy-discussion] Docstring formatting In-Reply-To: <20070213213021.GQ6150@mentat.za.net> References: <200702131610.09779.pgmdevlist@gmail.com> <20070213213021.GQ6150@mentat.za.net> Message-ID: Stefan van der Walt wrote: > Hi Pierre > > On Tue, Feb 13, 2007 at 04:10:09PM -0500, Pierre GM wrote: >> Dear All, >> Did we come to some kind of agreement about the formatting of docstrings? In >> any case, where could I find the 'official' format that Travis recommended? > > Travis checked numpy/doc/HOWTO_DOCUMENT.txt a couple of days ago. > > Cheers > Stéfan Could you give the URL please? Colin W.

From robert.kern at gmail.com Tue Feb 13 18:30:50 2007 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 13 Feb 2007 17:30:50 -0600 Subject: [Numpy-discussion] Docstring formatting In-Reply-To: References: <200702131610.09779.pgmdevlist@gmail.com> <20070213213021.GQ6150@mentat.za.net> Message-ID: <45D24A2A.9050600@gmail.com> Colin J. Williams wrote: > Could you give the URL please? http://svn.scipy.org/svn/numpy/trunk/numpy/doc/HOWTO_DOCUMENT.txt -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From mjanikas at esri.com Tue Feb 13 18:44:37 2007 From: mjanikas at esri.com (Mark Janikas) Date: Tue, 13 Feb 2007 15:44:37 -0800 Subject: [Numpy-discussion] fromstring, tostring slow? In-Reply-To: <45D22FE7.3010304@noaa.gov> Message-ID: <627102C921CD9745B070C3B10CB8199B010EBFB4@hardwire.esri.com> I don't think I can do that because I have heterogeneous rows of data.... I.e. the columns in each row are different in length. Furthermore, when reading it back in, I want to read only part of the info at a time so I can save memory. In this case, I only want to have one record in mem at once. Another issue has arisen from taking this routine cross-platform.... namely, if I write the file on Windows I can't read it on Solaris. I assume the big-little endian issue is at hand here. I know using the struct module that I can pack using either one. Perhaps I will have to go back to the drawing board. I actually love these methods now because I get back out directly what I put in. Great kudos to the developers.... MJ

-----Original Message----- From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Christopher Barker Sent: Tuesday, February 13, 2007 1:39 PM To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] fromstring, tostring slow? Mark Janikas wrote: > I am finding that directly packing numpy arrays into binary using the > tostring and fromstring methods For starters, use fromfile and tofile, to save the overhead of creating an entire extra string. [...] -Chris -- Christopher Barker, Ph.D. [...] Chris.Barker at noaa.gov
From stefan at sun.ac.za Tue Feb 13 18:51:32 2007 From: stefan at sun.ac.za (Stefan van der Walt) Date: Wed, 14 Feb 2007 01:51:32 +0200 Subject: [Numpy-discussion] fromstring, tostring slow? In-Reply-To: <627102C921CD9745B070C3B10CB8199B010EBFB4@hardwire.esri.com> References: <45D22FE7.3010304@noaa.gov> <627102C921CD9745B070C3B10CB8199B010EBFB4@hardwire.esri.com> Message-ID: <20070213235132.GS6150@mentat.za.net> On Tue, Feb 13, 2007 at 03:44:37PM -0800, Mark Janikas wrote: > I don't think I can do that because I have heterogeneous rows of > data.... I.e. the columns in each row are different in length. > Furthermore, when reading it back in, I want to read only part of the > info at a time so I can save memory. In this case, I only want to have > one record in mem at once. > > Another issue has arisen from taking this routine cross-platform.... > namely, if I write the file on Windows I can't read it on Solaris. I > assume the big-little endian issue is at hand here. Indeed. You may want to take a look at npfile, the new IO module in scipy written by Matthew Brett (you don't have to install the whole scipy to use it, just grab the file). Cheers Stéfan

From mjanikas at esri.com Tue Feb 13 19:02:10 2007 From: mjanikas at esri.com (Mark Janikas) Date: Tue, 13 Feb 2007 16:02:10 -0800 Subject: [Numpy-discussion] fromstring, tostring slow? In-Reply-To: <20070213235132.GS6150@mentat.za.net> Message-ID: <627102C921CD9745B070C3B10CB8199B010EBFB5@hardwire.esri.com> Yes, but does the code have the same license as NumPy? As I work for a software company, where I help with the scripting interface, I must make sure everything I use is cited and has the appropriate license. MJ

-----Original Message----- From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Stefan van der Walt Sent: Tuesday, February 13, 2007 3:52 PM To: numpy-discussion at scipy.org Subject: Re: [Numpy-discussion] fromstring, tostring slow? [...] Indeed. You may want to take a look at npfile, the new IO module in scipy written by Matthew Brett (you don't have to install the whole scipy to use it, just grab the file). Cheers Stéfan

From robert.kern at gmail.com Tue Feb 13 19:04:49 2007 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 13 Feb 2007 18:04:49 -0600 Subject: [Numpy-discussion] fromstring, tostring slow? In-Reply-To: <627102C921CD9745B070C3B10CB8199B010EBFB5@hardwire.esri.com> References: <627102C921CD9745B070C3B10CB8199B010EBFB5@hardwire.esri.com> Message-ID: <45D25221.4050007@gmail.com> Mark Janikas wrote: > Yes, but does the code have the same license as NumPy? As I work for a > software company, where I help with the scripting interface, I must > make sure everything I use is cited and has the appropriate license.
The numpy and scipy licenses are the same except for details like the licensing party. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From Chris.Barker at noaa.gov Tue Feb 13 19:07:03 2007 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Tue, 13 Feb 2007 16:07:03 -0800 Subject: [Numpy-discussion] fromstring, tostring slow? In-Reply-To: <627102C921CD9745B070C3B10CB8199B010EBFB4@hardwire.esri.com> References: <627102C921CD9745B070C3B10CB8199B010EBFB4@hardwire.esri.com> Message-ID: <45D252A7.3060405@noaa.gov> Mark Janikas wrote: > I don't think I can do that because I have heterogeneous rows of > data.... I.e. the columns in each row are different in length. like I said, show us your whole problem... But you don't have to write/read all the data at once with from/tofile() anyway. Each of your "rows" has to be in a separate array anyway, as numpy arrays don't support "ragged" arrays, but each row can be written with tofile() > Furthermore, when reading it back in, I want to read only part of the > info at a time so I can save memory. In this case, I only want to have > one record in mem at once. you can make multiple calls to fromfile(), though you'll have to know how long each record is. > Another issue has arisen from taking this routine cross-platform.... > namely, if I write the file on Windows I can't read it on Solaris. I > assume the big-little endian issue is at hand here. yup. > I know using the struct > module that I can pack using either one. so can numpy. see the "byteswap" method, and you can specify a particular endianness with a datatype when you read with fromfile(): a = N.fromfile(DataFile, dtype=N.dtype("<f8"))

From stefan at sun.ac.za Tue Feb 13 19:09:47 2007 From: stefan at sun.ac.za (Stefan van der Walt) Date: Wed, 14 Feb 2007 02:09:47 +0200 Subject: [Numpy-discussion] fromstring, tostring slow? In-Reply-To: <627102C921CD9745B070C3B10CB8199B010EBFB5@hardwire.esri.com> References: <20070213235132.GS6150@mentat.za.net> <627102C921CD9745B070C3B10CB8199B010EBFB5@hardwire.esri.com> Message-ID: <20070214000947.GT6150@mentat.za.net> On Tue, Feb 13, 2007 at 04:02:10PM -0800, Mark Janikas wrote: > Yes, but does the code have the same license as NumPy? As I work > for a software company, where I help with the scripting interface, I > must make sure everything I use is cited and has the appropriate > license. Yes, Scipy and Numpy are both released under BSD licenses. Cheers Stéfan

From fred at ucar.edu Tue Feb 13 19:15:35 2007 From: fred at ucar.edu (Fred Clare) Date: Tue, 13 Feb 2007 17:15:35 -0700 Subject: [Numpy-discussion] Docstring formatting In-Reply-To: <45D24A2A.9050600@gmail.com> References: <200702131610.09779.pgmdevlist@gmail.com> <20070213213021.GQ6150@mentat.za.net> <45D24A2A.9050600@gmail.com> Message-ID: Mary, Do you think we should adopt the "official" SciPy docstring format? It is pretty simple. Did you see Konrad Hinsen's comment on NetCDF modules? Fred On Feb 13, 2007, at 4:30 PM, Robert Kern wrote: > Colin J. Williams wrote: > >> Could you give the URL please? > > http://svn.scipy.org/svn/numpy/trunk/numpy/doc/HOWTO_DOCUMENT.txt [...]
From fred at ucar.edu Tue Feb 13 19:17:43 2007 From: fred at ucar.edu (Fred Clare) Date: Tue, 13 Feb 2007 17:17:43 -0700 Subject: [Numpy-discussion] Docstring formatting In-Reply-To: <45D24A2A.9050600@gmail.com> References: <200702131610.09779.pgmdevlist@gmail.com> <20070213213021.GQ6150@mentat.za.net> <45D24A2A.9050600@gmail.com> Message-ID: Sorry for a private e-mail escaping to this list. Fred Clare On Feb 13, 2007, at 4:30 PM, Robert Kern wrote: > Colin J. Williams wrote: > >> Could you give the URL please? > > http://svn.scipy.org/svn/numpy/trunk/numpy/doc/HOWTO_DOCUMENT.txt [...]

From mjanikas at esri.com Tue Feb 13 19:31:00 2007 From: mjanikas at esri.com (Mark Janikas) Date: Tue, 13 Feb 2007 16:31:00 -0800 Subject: [Numpy-discussion] fromstring, tostring slow? In-Reply-To: <45D252A7.3060405@noaa.gov> Message-ID: <627102C921CD9745B070C3B10CB8199B010EBFB6@hardwire.esri.com> This is all very good info. Especially the byteswap. I'll be testing it momentarily. As far as a detailed explanation of the problem.... In essence, I am applying sparse matrix multiplication. The matrix I am dealing with in the manner described is nxn. Generally, this matrix is 1-20% sparse. I use it in spatial data analysis, where the matrix W represents the spatial association between n observations. The operations I perform on it are generally related to the spatial lag of a variable... or Wy, where y is an nxk matrix (usually k=1). As k is generally small, the y vector and the result vector are represented by numpy arrays. I can have nxkx2 pieces of info in mem (usually). What I can't have is n**2. So, I store each row of W in a file as a record consisting of 3 parts:

1) row, nn (# of neighbors)
2) nhs (nx1) vector of integers representing the columns in row[i] != 0
3) weights (nx1) vector of floats corresponding to the index in the previous row

The first two parts of the record are known as a GAL, or geographic algorithm library. Since a lot of my W matrices have distance metrics associated with them, I added the third. I think this might be termed by someone else as an enhanced GAL. At any rate, this allows me to perform this operation on large datasets w/o running out of mem.

-----Original Message----- From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Christopher Barker Sent: Tuesday, February 13, 2007 4:07 PM To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] fromstring, tostring slow? Mark Janikas wrote: > I don't think I can do that because I have heterogeneous rows of > data.... I.e. the columns in each row are different in length. like I said, show us your whole problem... [...]
From mjanikas at esri.com (Mark Janikas) Subject: Re: [Numpy-discussion] fromstring, tostring slow? Message-ID: <627102C921CD9745B070C3B10CB8199B010EBFB7@hardwire.esri.com> Found a typo-or-two in my description. #2 and #3 are nnx1 in shape.

-----Original Message----- From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Mark Janikas Sent: Tuesday, February 13, 2007 4:31 PM To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] fromstring, tostring slow? This is all very good info. Especially the byteswap. I'll be testing it momentarily. As far as a detailed explanation of the problem.... [...]

-----Original Message----- From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Christopher Barker Sent: Tuesday, February 13, 2007 4:07 PM To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] fromstring, tostring slow? [...] so can numpy. see the "byteswap" method, and you can specify a particular endianness with a datatype when you read with fromfile(): a = N.fromfile(DataFile, dtype=N.dtype("<f8"))
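A minimal sketch of that record-at-a-time pattern (illustrative only; the layout -- a neighbor count followed by index and weight vectors, loosely following the GAL-style rows described earlier -- and every name here are invented). Writing the count first and fixing the byte order in the dtype addresses both the "how long is each record" and the cross-platform questions:

    import numpy as N

    def write_row(f, nhs, weights):
        # one record: count, then the index and weight vectors,
        # all in explicit little-endian formats
        N.array([len(nhs)], dtype='<i4').tofile(f)
        N.asarray(nhs, dtype='<i4').tofile(f)
        N.asarray(weights, dtype='<f8').tofile(f)

    def read_row(f):
        # read the count first, so we know how long the rest of the record is
        nn = int(N.fromfile(f, dtype='<i4', count=1)[0])
        nhs = N.fromfile(f, dtype='<i4', count=nn)
        weights = N.fromfile(f, dtype='<f8', count=nn)
        return nhs, weights

    f = open('w.bin', 'wb')
    write_row(f, [0, 3, 7], [0.2, 0.5, 0.3])
    f.close()

    f = open('w.bin', 'rb')
    print read_row(f)    # only one record in memory at a time
    f.close()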
From puddleglum at o2.pl Wed Feb 14 2007 From: puddleglum at o2.pl (Wojciech Śmigaj) Date: Wed, 14 Feb 2007 Subject: [Numpy-discussion] Vectorizing a class method Message-ID: <45D33466.7060207@o2.pl> Hi, I have a question about the vectorize function. I'd like to use it to create a vectorized version of a class method. I've tried the following code:

from numpy import *

class X:
    def func(self, n):
        return 2 * n  # example
    func = vectorize(func)

Now, when I declare an instance of the class X and invoke func() as an unbound method, it works:

x = X()
print X.func(x, [1, 2])  # output: [2 4]

But an attempt to invoke it "normally", i.e. like

print x.func([1, 2])

fails with the message

Traceback (most recent call last):
  File "<stdin>", line 1, in ?
  File "/usr/lib/python2.4/site-packages/numpy/lib/function_base.py", line 823, in __call__
    raise ValueError, "mismatch between python function inputs"\
ValueError: mismatch between python function inputs and received arguments

It seems that in this case the class instance (x) isn't passed to the vectorize.__call__() method, and as a result the number of arguments does not agree with what this method expects. Does anybody have an idea of how to do it correctly? As a workaround, I can write a wrapper function on the module level, which can be vectorized without problems, and call it from inside the class---but it looks ugly and is tedious given that I have multiple functions to be handled in this way. Thanks in advance for any help, Wojciech Smigaj

From tim.hochberg at ieee.org Wed Feb 14 12:10:38 2007 From: tim.hochberg at ieee.org (Timothy Hochberg) Date: Wed, 14 Feb 2007 10:10:38 -0700 Subject: [Numpy-discussion] Vectorizing a class method In-Reply-To: <45D33466.7060207@o2.pl> References: <45D33466.7060207@o2.pl> Message-ID: On 2/14/07, Wojciech Śmigaj wrote: > Hi, > > I have a question about the vectorize function. I'd like to use it to > create a vectorized version of a class method. [...] I think you want staticmethod.
Something like:

class X:
    def f(x):
        return 2*x
    f = staticmethod(vectorize(f))

However, I don't have a Python distribution available here to check that. If that doesn't work, a search on staticmethod should get you to the correct syntax. I'll just note in passing that if your function is composed completely of things that will operate on arrays as is (as your example is), then vectorizing the function is counterproductive. However, your real code may well need vectorize to work. > Thanks in advance for any help, > Wojciech Smigaj -- //=][=\\ tim.hochberg at ieee.org -------------- next part -------------- An HTML attachment was scrubbed... URL:

From puddleglum at o2.pl Wed Feb 14 13:27:31 2007 From: puddleglum at o2.pl (Wojciech Śmigaj) Date: Wed, 14 Feb 2007 19:27:31 +0100 Subject: [Numpy-discussion] Vectorizing a class method In-Reply-To: References: <45D33466.7060207@o2.pl> Message-ID: <45D35493.5090700@o2.pl> Timothy Hochberg wrote: > On 2/14/07, *Wojciech Śmigaj* wrote: >> I have a question about the vectorize function. I'd like to use it to >> create a vectorized version of a class method. I've tried the following >> code: >> >> from numpy import * >> >> class X: >> def func(self, n): >> return 2 * n # example >> func = vectorize(func) >> >> [...]
> I think you want staticmethod. Something like: > > class X: > def f(x): > return 2*x > f = staticmethod(vectorize(f)) > > However, I don't have a Python distribution available here to check > that. [...] > > I'll just note in passing that if your function is composed completely > of things that will operate on arrays as is (as your example is), then > vectorizing the function is counterproductive. However, your real code > may well need vectorize to work. Thank you for your answer. I see now that my example was oversimplified. In reality, the method func() accesses internal data of the object, so it cannot be made a staticmethod. In addition, it does not operate on the array as a whole: basically, it does calculations according to some formula if n != 0, and to another one otherwise. Perhaps both parts could be merged in some intelligent way, but right now I want to make the program work, and optimization will be done later. And vectorize is a very nice and quick way of making functions accept arrays, even if it is not super-efficient. Best regards, Wojciech Smigaj

From tom.denniston at alum.dartmouth.org Wed Feb 14 14:10:36 2007 From: tom.denniston at alum.dartmouth.org (Tom Denniston) Date: Wed, 14 Feb 2007 13:10:36 -0600 Subject: [Numpy-discussion] Vectorizing a class method In-Reply-To: <45D35493.5090700@o2.pl> References: <45D33466.7060207@o2.pl> <45D35493.5090700@o2.pl> Message-ID: I don't know if this helps but you could use where to do the dispatch between the two different formulas. I don't know the answer to your original question however. On 2/14/07, Wojciech Śmigaj wrote: > Timothy Hochberg wrote: [...] > Thank you for your answer. I see now that my example was oversimplified. [...] > Best regards, > Wojciech Smigaj

From peridot.faceted at gmail.com Wed Feb 14 14:11:55 2007 From: peridot.faceted at gmail.com (Anne Archibald) Date: Wed, 14 Feb 2007 14:11:55 -0500 Subject: [Numpy-discussion] Vectorizing a class method In-Reply-To: <45D35493.5090700@o2.pl> References: <45D33466.7060207@o2.pl> <45D35493.5090700@o2.pl> Message-ID: On 14/02/07, Wojciech Śmigaj wrote: > Timothy Hochberg wrote: [...] The appropriate spelling for this, in modern pythons, is

class X:
    @vectorize
    def func(self, n):
        return 2*n

Not that it makes it work any better. There's no reason vectorize couldn't be made to do the Right Thing when handed (bound and unbound) methods, for example by examining X.func.im_self, X.func.im_class and X.func.im_func attributes. I can't imagine a situation where a method would want to be vectorized over its self parameter, or at least, not by default. Anne

From puddleglum at o2.pl Wed Feb 14 15:11:00 2007 From: puddleglum at o2.pl (Wojciech Śmigaj) Date: Wed, 14 Feb 2007 21:11:00 +0100 Subject: [Numpy-discussion] Vectorizing a class method In-Reply-To: References: <45D33466.7060207@o2.pl> <45D35493.5090700@o2.pl> Message-ID: <45D36CD4.6040402@o2.pl> Anne Archibald wrote: > The appropriate spelling for this, in modern pythons, is > > class X: > @vectorize > def func(self, n): > return 2*n > > Not that it makes it work any better. > > There's no reason vectorize couldn't be made to do the Right Thing > when handed (bound and unbound) methods, for example by examining > X.func.im_self, X.func.im_class and X.func.im_func attributes. This magic is well above my current level of expertise in Python, but perhaps one day I'll try to do it. Thanks for pointing me in this direction.
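A sketch of the usual workaround in the meantime (illustrative only; the class and numbers are invented): vectorize the bound method in __init__, so that self is already attached and vectorize only ever sees the array argument:

    from numpy import vectorize

    class X:
        def __init__(self, scale):
            self.scale = scale
            # bind first, then vectorize: self never reaches vectorize
            self.func = vectorize(self._func)

        def _func(self, n):
            # scalar rule that uses instance state, so staticmethod won't do
            if n != 0:
                return self.scale * n
            return -1.0

    x = X(2.0)
    print x.func([1, 0, 3])    # -> [ 2. -1.  6.]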
> I can't imagine a situation where a method would want to be vectorized > over its self parameter, or at least, not by default. Yes, I agree. I thought it would be harmless, though. Anyway, since it seems it is not possible to vectorize() a method with today's NumPy, I think the class's docstring, which says "[...] somefunction -- a Python function or method", is rather misleading. Perhaps it ought to be changed? Best regards, Wojciech Smigaj

From puddleglum at o2.pl Wed Feb 14 15:11:10 2007 From: puddleglum at o2.pl (Wojciech Śmigaj) Date: Wed, 14 Feb 2007 21:11:10 +0100 Subject: [Numpy-discussion] Vectorizing a class method In-Reply-To: References: <45D33466.7060207@o2.pl> <45D35493.5090700@o2.pl> Message-ID: <45D36CDE.10301@o2.pl> Tom Denniston wrote: > I don't know if this helps but you could use where to do the dispatch > between the two different formulas. Yes, that's a good idea. Thanks! Wojciech Smigaj

From pgmdevlist at gmail.com Tue Feb 13 18:52:51 2007 From: pgmdevlist at gmail.com (Pierre GM) Date: Tue, 13 Feb 2007 18:52:51 -0500 Subject: [Numpy-discussion] Docstring formatting In-Reply-To: <20070213213021.GQ6150@mentat.za.net> References: <200702131610.09779.pgmdevlist@gmail.com> <20070213213021.GQ6150@mentat.za.net> Message-ID: <200702131852.51401.pgmdevlist@gmail.com> Stefan, > Travis checked numpy/doc/HOWTO_DOCUMENT.txt a couple of days ago. Excellent, just what I needed. Thanks for the info

From tgrav at mac.com Wed Feb 14 21:48:44 2007 From: tgrav at mac.com (Tommy Grav) Date: Wed, 14 Feb 2007 21:48:44 -0500 Subject: [Numpy-discussion] lsq problem References: <86E3078D-C9B7-4516-B573-9C731D6FC45E@mac.com> Message-ID: <020E9168-95E9-4B5D-8E9D-3E0D925410FA@mac.com> I need to fit a Gaussian profile to a set of points and would like to use scipy (or numpy) to do the least-squares fitting if possible. I am, however, unsure if the proper routines are available, so I thought I would ask to get some hints to get going in the right direction. The input are two 1-dimensional arrays x and flux, together with a function

def Gaussian(a,b,c,x1):
    return a*exp(-(pow(x1,2)/pow(c,2))) - c

I would like to find the values of (a,b,c) such that the difference between the Gaussian and the fluxes is minimized. Would scipy.linalg.lstsq be the right function to use, or is this problem not linear? (I know I could find out this particular problem with a little research, but I am under a little time pressure and I can not for the life of me remember my old math classes). If the problem is not linear, is there another function that can be used, or do I have to code up my own lstsq function to solve the problem? Thanks in advance for any hints to the answers. Cheers Tommy

From kwgoodman at gmail.com Wed Feb 14 22:11:59 2007 From: kwgoodman at gmail.com (Keith Goodman) Date: Wed, 14 Feb 2007 19:11:59 -0800 Subject: [Numpy-discussion] lsq problem In-Reply-To: <020E9168-95E9-4B5D-8E9D-3E0D925410FA@mac.com> References: <86E3078D-C9B7-4516-B573-9C731D6FC45E@mac.com> <020E9168-95E9-4B5D-8E9D-3E0D925410FA@mac.com> Message-ID: On 2/14/07, Tommy Grav wrote: > I need to fit a Gaussian profile to a set of points and would like to > use scipy (or numpy) to do the least-squares fitting if possible. [...]
You left b out of your function: a*exp(-power((x - b),2) / c). If it is a one-off example you can always use brute force---just search a grid:

for a in a_values:
    for b in b_values:
        for c in c_values:
            ...

But if you have a lot of data or you need to do many fits I bet scipy has what you need.

From tgrav at mac.com Wed Feb 14 22:29:58 2007 From: tgrav at mac.com (Tommy Grav) Date: Wed, 14 Feb 2007 22:29:58 -0500 Subject: [Numpy-discussion] lsq problem In-Reply-To: References: <86E3078D-C9B7-4516-B573-9C731D6FC45E@mac.com> <020E9168-95E9-4B5D-8E9D-3E0D925410FA@mac.com> Message-ID: <562A0034-B385-463A-AC98-BC7D425D7014@mac.com> On Feb 14, 2007, at 10:11 PM, Keith Goodman wrote: > On 2/14/07, Tommy Grav wrote: >> I need to fit a Gaussian profile to a set of points and would like to >> use scipy (or numpy) to do the least-squares fitting if possible. [...] > > You left b out of your function: a*exp(-power((x - b),2) / c). [...] > But if you have a lot of data or you need to do many fits I bet scipy > has what you need.
I actually butchered the function, as it should read a*exp(-pow(x,2)/b) - c. (I do not need to fit the center, as that has already been done when creating the x array :). Cheers Tommy

From kwgoodman at gmail.com Wed Feb 14 22:35:08 2007 From: kwgoodman at gmail.com (Keith Goodman) Date: Wed, 14 Feb 2007 19:35:08 -0800 Subject: [Numpy-discussion] lsq problem In-Reply-To: <562A0034-B385-463A-AC98-BC7D425D7014@mac.com> References: <86E3078D-C9B7-4516-B573-9C731D6FC45E@mac.com> <020E9168-95E9-4B5D-8E9D-3E0D925410FA@mac.com> <562A0034-B385-463A-AC98-BC7D425D7014@mac.com> Message-ID: On 2/14/07, Tommy Grav wrote: > I actually butchered the function, as it should read > a*exp(-pow(x,2)/b) - c. (I do not need to fit the center, as that has > already been done when creating the x array :). [...] Not having to fit the center is great because now you only have two parameters to optimize. So

for a in a_values:
    for b in b_values:
        ...

From charlesr.harris at gmail.com Thu Feb 15 00:10:46 2007 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 14 Feb 2007 22:10:46 -0700 Subject: [Numpy-discussion] lsq problem In-Reply-To: <020E9168-95E9-4B5D-8E9D-3E0D925410FA@mac.com> References: <86E3078D-C9B7-4516-B573-9C731D6FC45E@mac.com> <020E9168-95E9-4B5D-8E9D-3E0D925410FA@mac.com> Message-ID: On 2/14/07, Tommy Grav wrote: > I need to fit a Gaussian profile to a set of points and would like to > use scipy (or numpy) to do the least-squares fitting if possible. [...] Ah, memories. This was the first problem I ever programmed. The professor gave me a Fortran II manual, a paper on Gauss' method, and access to the IBM 360 with a world-beating 2 megs of memory at the Institute for Space Studies in New York City. That would have been about 1967. Anyway, there are several approaches, the simplest being some quickly available version of optimization, say Nelder-Mead, Fletcher-Powell, or quasi-Newton. I believe at least the last two are in SciPy in the optimization package and these would be your best bet for quick and dirty if you can find the documentation. Lessee, the SciPy site says the following are available:

- fmin -- Nelder-Mead Simplex algorithm (uses only function calls)
- fmin_powell -- Powell's (modified) level set method (uses only function calls)
- fmin_cg -- Non-linear (Polak-Ribiere) conjugate gradient algorithm (can use function and gradient).
- fmin_bfgs -- Quasi-Newton method (can use function and gradient)
- fmin_ncg -- Line-search Newton Conjugate Gradient (can use function, gradient and hessian).
- leastsq -- Minimize the sum of squares of M equations in N unknowns given a starting estimate.

I think either of the first two would be easiest if you are in a hurry, and of the two, fmin probably has fewer knobs. Then either of fmin_cg or fmin_bfgs, as the gradient in this case is easy to compute. Mind, it is always helpful to get a reasonable starting point, and using the mean and standard deviation is probably a good start.
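For the quick-and-dirty route, a minimal sketch using scipy.optimize.leastsq (illustrative only; synthetic data stand in for the thread's x and flux arrays, and all names are invented):

    import numpy as N
    from scipy.optimize import leastsq

    def model(p, x):
        a, b, c = p
        return a * N.exp(-x**2 / b) - c

    def residuals(p, x, flux):
        return flux - model(p, x)

    # synthetic data in place of the real observations
    x = N.linspace(-3.0, 3.0, 50)
    flux = model([2.0, 1.5, 0.3], x) + 0.01 * N.random.randn(50)

    p0 = [1.0, 1.0, 0.0]    # rough starting guess
    p_fit, ier = leastsq(residuals, p0, args=(x, flux))
    print p_fit             # should land near [2.0, 1.5, 0.3]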
As to Gauss' method, the trick is to linearize the problem about the last best guess and iterate, essentially using repeated applications of leastsq. The linearization is the usual Taylor's series thingee. Suppose you want to minimize sum( (d(x_i) - f(a; x_i))**2 ) where the x_i are the data points, f the fitting function, and a a (vector) of parameters. Then expand f about a_0, your last best guess, and solve the usual least squares problem sum( (d(x_i) - f(a_0; x_i) - Df(a_0; x_i)*da)**2 ) where d(x_i) - f(a_0; x_i) is the new data to be fitted, Df is the derivative (Jacobian) of the function f with respect to the parameters, and the da (possibly a vector) are the corrections. The * multiplication in this case is matrix multiplication. Anyway, the next estimate of the parameters is then a_0 + da. Repeat until satisfied. You can also get the estimated covariance of the fit in the usual way from the last linearization. The method is quite good in many cases. Good luck Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL:

From mierle at gmail.com Thu Feb 15 03:09:04 2007 From: mierle at gmail.com (Keir Mierle) Date: Thu, 15 Feb 2007 03:09:04 -0500 Subject: [Numpy-discussion] Unifying numpy, scipy, and matplotlib docstring formats Message-ID: I'd like to help the docstring formats of numpy, scipy, and matplotlib converge on a high-quality standard (hence the cross-posting). However, before that can happen all maintainers from all three packages need to agree on a format. In the interest of speeding things along, I've taken the liberty of extending [1] the docstring format proposed by Travis on January 10th on the scipy-dev mailing list [2]. I also volunteer to convert a non-trivial portion of the numpy, scipy, and matplotlib docstrings. If all goes well on the consensus front, I'll try to coordinate an IRC docstring sprint or two. The notable differences from the format Travis posted are that I have capitalized headers for readability; the other changes are mostly modest, or additions to handle the more demanding requirements of matplotlib docstrings. On the DocstringStandard page I have also put a completely re-done docstring for the 'contour' function from matplotlib. I think it is far more readable than the original [3]. JDH and other matplotlibheads, what do you think? Travis, do you find my additions reasonable? Scipy maintainers, would you consider adopting this format (especially if someone helps with the gruntwork)? Thanks in advance, Keir

[1] http://scipy.org/DocstringStandard
[2] http://article.gmane.org/gmane.comp.python.scientific.devel/5572
[3] http://matplotlib.sourceforge.net/matplotlib.pylab.html#-contour

p.s. This is part of my plan to kick off http://scipy.org/PyLab

From tom.and at tiscalinet.it Thu Feb 15 03:18:48 2007 From: tom.and at tiscalinet.it (Andrea Tomadin) Date: Thu, 15 Feb 2007 09:18:48 +0100 Subject: [Numpy-discussion] Global Numpy vector with Swig Message-ID: <0A4BC30F-9080-43F7-899B-6F246F345216@tiscalinet.it> Hi, I need to pass a Numpy array to a C code wrapped by Swig. The array in the C code is a global variable declared as double *vec, and I would like to set it in the calling Python module foo using e.g. foo.cvar.vec = numpy.zeros(10) so that the array is manipulated in place. I found the examples in the numpy/doc/swig dir and adapted them to my code, so now I can pass a Numpy array to a function. But in the examples I was not able to find any suggestion (any typedef?) for translating an array into a variable. I'm just writing an intermediate function that does the job, but I feel like I am writing a wrapping function that will be wrapped again by Swig... Any practical example is highly appreciated! Regards, Andrea
From gzhu at peak6.com Thu Feb 15 10:33:13 2007 From: gzhu at peak6.com (Geoffrey Zhu) Date: Thu, 15 Feb 2007 09:33:13 -0600 Subject: [Numpy-discussion] Numpy and iterative procedures Message-ID: <99F81FFD0EA54E4DA8D4F1BFE272F3410382797B@ppi-mail1.chicago.peak6.net> Hi, I am new to numpy. I'd like to know if it is possible to code efficient iterative procedures with numpy. Specifically, I have the following problem. M is an N*N matrix. q is an N*1 vector. v is an N*1 vector I am trying to find iteratively from the initial value v_0. The procedure is simply to calculate

v_{n+1}[i] = 1/M[i,i] * ( q[i]
             - (M[i,1]*v_{n+1}[1] + M[i,2]*v_{n+1}[2] + ... + M[i,i-1]*v_{n+1}[i-1])
             - (M[i,i+1]*v_n[i+1] + M[i,i+2]*v_n[i+2] + ... + M[i,N]*v_n[N]) )

I do not see that this is something that can easily be vectorized, is it? Thanks, Geoffrey

From charlesr.harris at gmail.com Thu Feb 15 11:10:59 2007 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 15 Feb 2007 09:10:59 -0700 Subject: [Numpy-discussion] Numpy and iterative procedures In-Reply-To: <99F81FFD0EA54E4DA8D4F1BFE272F3410382797B@ppi-mail1.chicago.peak6.net> References: <99F81FFD0EA54E4DA8D4F1BFE272F3410382797B@ppi-mail1.chicago.peak6.net> Message-ID: On 2/15/07, Geoffrey Zhu wrote: > Hi, > > I am new to numpy. I'd like to know if it is possible to code efficient > iterative procedures with numpy. [...] I think it would be better if you stated what the actual problem is. Is it a differential equation, for instance. That way we can determine what the problem class is and what algorithms are available to solve it. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL:

From gzhu at peak6.com Thu Feb 15 11:32:32 2007 From: gzhu at peak6.com (Geoffrey Zhu) Date: Thu, 15 Feb 2007 10:32:32 -0600 Subject: [Numpy-discussion] Numpy and iterative procedures Message-ID: <99F81FFD0EA54E4DA8D4F1BFE272F341038279B5@ppi-mail1.chicago.peak6.net> Thanks Chuck. I am trying to use Successive Over-relaxation to solve linear equations defined by M*v = q. There are several goals: 1. Eventually (in production) I need it to be fast. 2. I am playing with the guts of the algorithm for now, to see how it works. That means I need some control for now.
3. Even in production, there is a chance I'd like to have the ability to tinker with the algorithm.

_____ From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Charles R Harris Sent: Thursday, February 15, 2007 10:11 AM To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] Numpy and iterative procedures On 2/15/07, Geoffrey Zhu wrote: Hi, I am new to numpy. I'd like to know if it is possible to code efficient iterative procedures with numpy. [...] I think it would be better if you stated what the actual problem is. Is it a differential equation, for instance. That way we can determine what the problem class is and what algorithms are available to solve it. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL:
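A minimal sketch of the update Geoffrey describes (illustrative only; a plain-Python Gauss-Seidel sweep with over-relaxation, invented names, and a fixed sweep count in place of a real convergence test):

    import numpy

    def sor(M, q, omega=1.5, sweeps=50):
        # sweep the rows, using freshly updated entries of v for columns
        # j < i and the entries from the previous sweep for j > i
        n = len(q)
        v = numpy.zeros(n)
        for _ in xrange(sweeps):
            for i in xrange(n):
                lower = numpy.dot(M[i, :i], v[:i])        # updated this sweep
                upper = numpy.dot(M[i, i+1:], v[i+1:])    # still the old values
                gs = (q[i] - lower - upper) / M[i, i]     # plain Gauss-Seidel value
                v[i] = (1.0 - omega) * v[i] + omega * gs  # over-relax
        return v

    M = numpy.array([[4.0, 1.0], [1.0, 3.0]])
    q = numpy.array([1.0, 2.0])
    print sor(M, q)    # should approach numpy.linalg.solve(M, q)

Note that the inner loop is inherently sequential -- each v[i] depends on the v[j] just computed -- which is why the update resists full vectorization.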
> > > ------------------------------------------------------------------------ > *From:* numpy-discussion-bounces at scipy.org > [mailto:numpy-discussion-bounces at scipy.org] *On Behalf Of *Charles R > Harris > *Sent:* Thursday, February 15, 2007 10:11 AM > *To:* Discussion of Numerical Python > *Subject:* Re: [Numpy-discussion] Numpy and iterative procedures > > > > On 2/15/07, *Geoffrey Zhu* > > wrote: > > Hi, > > I am new to numpy. I'd like to know if it is possible to code > efficient > iterative procedures with numpy. > > Specifically, I have the following problem. > > M is an N*N matrix. Q is a N*1 vector. V is an N*1 vector I am > trying to > find iteratively from the initial value V_0. The procedure is > simply to > calculate > > V_{n+1}[i]=3D1/M[I,i]*(q[i]- > (M[i,1]*v_{n+1}[1]+M[I,2]*v_{n+1}[2]+..+M[i,i-1]*v_{n+1}[i-1]) - > (M[I,i+1]*v_{n}[i+1]+M[I,i+2]*v_{n}[i+2]+..+M[I,N]*v_{n}[N])) > > I do not see that this is something that can esaily be vectorized, is > it? > > > I think it would be better if you stated what the actual problem is. > Is it a differential equation, for instance. That way we can determine > what the problem class is and what algorithms are available to solve it. > > Chuck > > > ------------------------------------------------------------------------ > > The information in this email or in any file attached hereto is > intended only for the personal and confidential use of the individual > or entity to which it is addressed and may contain information that is > proprietary and confidential. If you are not the intended recipient of > this message you are hereby notified that any review, dissemination, > distribution or copying of this message is strictly prohibited. This > communication is for information purposes only and should not be > regarded as an offer to sell or as a solicitation of an offer to buy > any financial product. Email transmission cannot be guaranteed to be > secure or error-free. P6070214 > ------------------------------------------------------------------------ > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From robert.kern at gmail.com Thu Feb 15 12:02:58 2007 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 15 Feb 2007 12:02:58 -0500 Subject: [Numpy-discussion] Unifying numpy, scipy, and matplotlib docstring formats In-Reply-To: References: Message-ID: <3d375d730702150902k31a1356ek4743a182902c90aa@mail.gmail.com> On 2/15/07, Keir Mierle wrote: > On the DocstringStandard page I have also put a completely re-done docstring > for the 'contour' function from matplotlib. I think it is far more readable > than the original [3]. JDH and other matplotlibheads, what do you think? > Travis, do you find my additions reasonable? Scipy maintainers, would you > consider adopting this format (especially if someone helps with the gruntwork)? It looks like you took the initial proposal rather than the result of that discussion. Please see the document that we came up with: http://svn.scipy.org/svn/numpy/trunk/numpy/doc/HOWTO_DOCUMENT.txt -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From gzhu at peak6.com Thu Feb 15 12:08:21 2007 From: gzhu at peak6.com (Geoffrey Zhu) Date: Thu, 15 Feb 2007 11:08:21 -0600 Subject: [Numpy-discussion] Confusing selector? 
Message-ID: <99F81FFD0EA54E4DA8D4F1BFE272F341038279D6@ppi-mail1.chicago.peak6.net> I am really new to numpy, but I just found that v[2:4] means selecting v[2] and v[3]. v[4] is not included! This is quite different from the conventions of Matlab and R.

From lu.yangunb at gmail.com Wed Feb 14 21:47:02 2007 From: lu.yangunb at gmail.com (lu yang) Date: Wed, 14 Feb 2007 22:47:02 -0400 Subject: [Numpy-discussion] Numpy1.0.1 installation problem - urgent :( Message-ID: <2768a9540702141847y568794f5vff110a4dbd4caf18@mail.gmail.com> Hi, This is the first time I have installed Numpy on a Linux machine. I have been working on it for several days without luck. I would sincerely appreciate it if anybody could give any comments. My OS is Red Hat 8. I have downloaded Python 2.5 and Numpy 1.0.1. Python 2.5 has been installed on my machine. However, when I type "python2.5 setup.py install" to install Numpy in my home directory /home/o6c11/numpy-1.0.1, I get:

[o6c11 at chorus numpy-1.0.1]$ python2.5 setup.py install
Traceback (most recent call last):
  File "setup.py", line 89, in <module> setup_package()
  File "setup.py", line 59, in setup_package from numpy.distutils.core import setup
  File "numpy/__init__.py", line 36, in <module> import core
  File "/mnt/storage/home/o6c11/numpy-1.0.1/numpy/core/__init__.py", line 5, in <module> import multiarray
ImportError: No module named multiarray

When I change the directory to /usr/local/lib/python2.5/site-packages/numpy, I get:

[o6c11 at chorus numpy]$ python2.5 setup.py install
Traceback (most recent call last):
  File "setup.py", line 28, in <module> from numpy.distutils.core import setup
  File "/usr/local/lib/python2.5/site-packages/numpy/__init__.py", line 40, in <module> import linalg
  File "/usr/local/lib/python2.5/site-packages/numpy/linalg/__init__.py", line 4, in <module> from linalg import *
  File "/usr/local/lib/python2.5/site-packages/numpy/linalg/linalg.py", line 25, in <module> from numpy.linalg import lapack_lite
ImportError: /usr/lib/libblas.so.3: undefined symbol: e_wsfe

I also tried to import numpy in this way:

[o6c11 at chorus numpy]$ python2.5
Python 2.5 (r25:51908, Feb 12 2007, 22:36:33)
[GCC 3.2 20020903 (Red Hat Linux 8.0 3.2-7)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> import numpy
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/local/lib/python2.5/site-packages/numpy/__init__.py", line 40, in <module> import linalg
  File "/usr/local/lib/python2.5/site-packages/numpy/linalg/__init__.py", line 4, in <module> from linalg import *
  File "/usr/local/lib/python2.5/site-packages/numpy/linalg/linalg.py", line 25, in <module> from numpy.linalg import lapack_lite
ImportError: /usr/lib/libblas.so.3: undefined symbol: e_wsfe
>>>

I have no idea how to solve these problems. Please help a newbie. Thanks a lot.
-------------- next part -------------- An HTML attachment was scrubbed... URL:

From gosselina at dfo-mpo.gc.ca Sun Feb 11 21:24:02 2007 From: gosselina at dfo-mpo.gc.ca (Andre Gosselin) Date: Sun, 11 Feb 2007 21:24:02 -0500 Subject: [Numpy-discussion] Request for porting pycdf to NumPy Message-ID: <1171247042.4560.6.camel@maboule> Hi everybody. I have just released version 0.6-3 of pycdf, which finally supports the NumPy array package (along with Numeric and numarray). Here is the announcement. ================================== Project "pysclint" ('pysclint') has released the new version of package 'pycdf'. You can download it from SourceForge.net by following this link: or browse the Release Notes and ChangeLog by visiting this link: ================================== I was pleased to follow this thread about pycdf, and to see the number of people who are pleased with it. In case anybody doubts it, I will still actively maintain pycdf. Best regards, André Gosselin gosselina at dfo-mpo.gc.ca

From satra at speech.mit.edu Sun Feb 11 22:51:25 2007 From: satra at speech.mit.edu (Satrajit Ghosh) Date: Sun, 11 Feb 2007 22:51:25 -0500 Subject: [Numpy-discussion] SVN Build, optimized libraries, site.cfg, windows Message-ID: <9135A452397C4096AD412BD0A64C8553@rleendurance> Hi, I'm also not quite clear whether the optimized FFTW and UMFPACK libraries are being used or required in numpy at all, as show_config() doesn't report them. I see that fftw and umfpack are being used for scipy. I have attached my site.cfg. Any help would be much appreciated. Cheers, Satra

---- snip site.cfg ----
[atlas]
libraries = f77blas, cblas, atlas
library_dirs = C:\MinGW\lib;C:\DOWNLOADS\BUILDS\lib
include_dirs = C:\DOWNLOADS\BUILDS\include\ATLAS

[lapack]
libraries = flapack, f77blas, cblas, atlas
library_dirs = C:\MinGW\lib;C:\DOWNLOADS\BUILDS\lib

# UMFPACK
# -------
# The UMFPACK library is used to factor large sparse matrices. It, in turn,
# depends on the AMD library for reordering the matrices for better performance.
# Note that the AMD library has nothing to do with AMD (Advanced Micro Devices),
# the CPU company.
#
# http://www.cise.ufl.edu/research/sparse/umfpack/
# http://www.cise.ufl.edu/research/sparse/amd/
#
[amd]
library_dirs = C:\MinGW\lib;C:\DOWNLOADS\BUILDS\lib
include_dirs = C:\DOWNLOADS\BUILDS\include
amd_libs = amd

[umfpack]
library_dirs = C:\MinGW\lib;C:\DOWNLOADS\BUILDS\lib
include_dirs = C:\DOWNLOADS\BUILDS\include\UMFPACK
umfpack_libs = umfpack

# FFT libraries
# -------------
# There are two FFT libraries that we can configure here: FFTW (2 and 3) and djbfft.
#
# http://fftw.org/
# http://cr.yp.to/djbfft.html
#
# Given only this section, numpy.distutils will try to figure out which version
# of FFTW you are using.
[fftw]
library_dirs = C:\MinGW\lib;C:\DOWNLOADS\BUILDS\lib
include_dirs = C:\DOWNLOADS\BUILDS\include
libraries = fftw3
---- end snip

From shuntim.luk at polyu.edu.hk Tue Feb 13 00:35:24 2007 From: shuntim.luk at polyu.edu.hk (LUK ShunTim) Date: Tue, 13 Feb 2007 13:35:24 +0800 Subject: [Numpy-discussion] Possible solution to binary distribution problems for numpy on linux?
In-Reply-To: <45D118FC.7020403@ar.media.kyoto-u.ac.jp>
References: <45CFF4D0.5080401@ar.media.kyoto-u.ac.jp> <45D118FC.7020403@ar.media.kyoto-u.ac.jp>
Message-ID: <45D14E1C.4010809@polyu.edu.hk>

David Cournapeau wrote:
> Keith Goodman wrote:
>> On 2/11/07, David Cournapeau wrote:
>>> My impression is that binary distribution of numpy is a big problem for many
>>> linux users, and that is an entry barrier for many users (I may be wrong, that's just
>>> an impression from the ML).
>> Do all of the major GNU/Linux distributions have recent versions of NumPy?
>>
>> Debian Etch is at NumPy 1.0.1
> I think debian has numpy now (I am not using debian on my workstation
> anymore, so I am not really following), but what about new versions of
> numpy/scipy? If I want to give some of my code to people in my lab who
> do not use the same distribution as me, can I give them a 10-minute
> instruction set to have everything?
>

"Bleeding-edge apt-get repository" according to their web page.
"To use it add the following line to your /etc/apt/sources.list"

deb http://deb-scipy.alioth.debian.org/apt ./

Regards,
ST
--

From charlesr.harris at gmail.com Thu Feb 15 12:28:59 2007
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Thu, 15 Feb 2007 10:28:59 -0700
Subject: [Numpy-discussion] Confusing selector?
In-Reply-To: <99F81FFD0EA54E4DA8D4F1BFE272F341038279D6@ppi-mail1.chicago.peak6.net>
References: <99F81FFD0EA54E4DA8D4F1BFE272F341038279D6@ppi-mail1.chicago.peak6.net>
Message-ID:

On 2/15/07, Geoffrey Zhu wrote:
>
> I am really new to numpy, but I just found that v[2:4] means selecting
> v[2] and v[3]; v[4] is not included! This is quite different from the
> conventions of Matlab and R.

Yes, it is the Python slicing convention, and rather similar to for loops in C, where for(i = 0; i < N; i++) is a common construction. It is a consequence of zero-based indexing. Matlab started in Fortran and uses the conventions of the Fortran do loop, which no doubt makes sense for arrays whose base index is 1. Slicing in NumPy, and now Python, also includes a step, so you can write a[0:n:2] to reference all the elements with even indices.

Chuck
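For concreteness, the half-open convention in a short session (any NumPy of this vintage behaves the same way):

>>> from numpy import arange
>>> v = arange(10)
>>> v[2:4]                  # the stop index 4 is excluded
array([2, 3])
>>> v[0:10:2]               # start:stop:step picks the even indices
array([0, 2, 4, 6, 8])
>>> len(v[2:4]) == 4 - 2    # a slice has length stop - start
True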
> "To use it add the following line to you /etc/apt/sources.list" > > deb http://deb-scipy.alioth.debian.org/apt ./ Yep. In general, if you are using a Linux distro it is quite easy to use the svn repository. It seem to be on the Mac and Windows that folks have problems, particularly in filling the dependencies on Atlas if they want efficiency. That said, Numpy and MatplotLab have settled enough that the standard packages are probably adequate. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From mierle at gmail.com Thu Feb 15 13:05:20 2007 From: mierle at gmail.com (Keir Mierle) Date: Thu, 15 Feb 2007 13:05:20 -0500 Subject: [Numpy-discussion] Possible solution to binary distributionproblems for numpy on linux? In-Reply-To: References: <45CFF4D0.5080401@ar.media.kyoto-u.ac.jp> <45D118FC.7020403@ar.media.kyoto-u.ac.jp> <45D14E1C.4010809@polyu.edu.hk> Message-ID: On 2/15/07, Charles R Harris wrote: > On 2/12/07, LUK ShunTim wrote: > > David Cournapeau wrote: > > > Keith Goodman wrote: > > >> On 2/11/07, David Cournapeau wrote: > > > > > My impression is that binary distribution of numpy is a big problem > > > > > for many linux users, and that is entry barrier for many users (I may > > > > > be wrong, that's just an impression from the ML). > > > > > Do all of the major GNU/Linux distributions have recent versions of > > > > > NumPy? > > > > Debian Etch is at NumPy 1.0.1 > > > I think debian has numpy now (I am not using debian on workstation > > > anymore, so I am not really following), but what about new versions of > > > numpy/scipy ? If I want to give some of my code to people in my lab who > > > do not use the same distribution than me, can I give a 10 minutes > > > instructions set to have everything ? > > > > > > > "Bleeding-edge apt-get repository" according to their web page. > > "To use it add the following line to you /etc/apt/sources.list" > > > > deb http://deb-scipy.alioth.debian.org/apt ./ > > Yep. In general, if you are using a Linux distro it is quite easy to use the > svn repository. It seem to be on the Mac and Windows that folks have > problems, particularly in filling the dependencies on Atlas if they want > efficiency. > > That said, Numpy and MatplotLab have settled enough that the standard > packages are probably adequate. While this is true for people admining their own machines, this is not the case for people without root access. For example, it is a major hassle to install numpy on the computers at my university for exactly this reason. It would be nice to have a self contained build of numpy/scipy/matplotlib for these cases. Keir From mierle at gmail.com Thu Feb 15 13:42:51 2007 From: mierle at gmail.com (Keir Mierle) Date: Thu, 15 Feb 2007 13:42:51 -0500 Subject: [Numpy-discussion] Unifying numpy, scipy, and matplotlib docstring formats In-Reply-To: <3d375d730702150902k31a1356ek4743a182902c90aa@mail.gmail.com> References: <3d375d730702150902k31a1356ek4743a182902c90aa@mail.gmail.com> Message-ID: On 2/15/07, Robert Kern wrote: > On 2/15/07, Keir Mierle wrote: > > On the DocstringStandard page I have also put a completely re-done docstring > > for the 'contour' function from matplotlib. I think it is far more readable > > than the original [3]. JDH and other matplotlibheads, what do you think? > > Travis, do you find my additions reasonable? Scipy maintainers, would you > > consider adopting this format (especially if someone helps with the gruntwork)? 
> > It looks like you took the initial proposal rather than the result of
> > that discussion. Please see the document that we came up with:
> >
> > http://svn.scipy.org/svn/numpy/trunk/numpy/doc/HOWTO_DOCUMENT.txt

Ah, I apologize for not checking the dates; I thought HOWTO_DOCUMENT.txt was the older proposal. Nevertheless, I think the issues raised in my proposed version are significant enough to warrant further discussion, especially for the more demanding needs of matplotlib. I would like to re-open this discussion to be sure there is consensus among the numpy, scipy, and matplotlib folk before I invest significant time in massaging the docstrings into the right form.

I am clearly biased, as I invested time and thought into the proposed docstring format I posted [1], but nevertheless I do not like the style listed in HOWTO_DOCUMENT.txt. The different sections have different styles of headings, i.e. the different styles for :Parameters: and Examples, which is not good for readability. Furthermore, it does not specify enough formatting, e.g. for keyword arguments with defaults. For specifics, here are my issues with the current HOWTO:

* Non-capitalized headers. Capitalized headers are far more visually obvious when viewed on a text terminal (i.e. via function? in IPython).

* Two different header styles. The distinction between

  :Parameters:

  and

  Examples
  --------

  seems unnecessary; if this is necessary for reST, could a preprocessing step not fix this? The inconsistency appears unprofessional when viewed in a terminal.

* No suggestions on how to handle functions which have multiple invocations, i.e. multiple function signatures. I have a proposal for this in [1].

* Parameters / Returns instead of INPUTS / OUTPUTS. This is no doubt a preference, but nevertheless I vastly prefer having INPUTS / OUTPUTS instead of Parameters / Returns. I understand that the parameter/return form is more common for Python, so I realize this is contentious. Nevertheless, inputs/outputs has the clear advantage of being informative to someone who is just starting programming and may not realize the meanings of parameters/returns; input/output is absolutely clear even to the non-programmer.

If it comes down to me writing a parser for my proposed format, I will do that.

Keir

[1] http://scipy.org/DocstringStandard
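For reference, a minimal docstring sketch in the HOWTO_DOCUMENT.txt style under discussion, showing the two heading styles side by side (the function and wording are only illustrative, not taken from the standard itself):

def zeros_like(a):
    """Return an array of zeros with the same shape and type as a.

    :Parameters:
        a : array_like
            Array whose shape and dtype are copied.

    :Returns:
        out : ndarray
            Array of zeros with the shape and dtype of a.

    Examples
    --------
    >>> zeros_like(numpy.arange(3))
    array([0, 0, 0])
    """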
From mjanikas at esri.com Thu Feb 15 13:48:39 2007
From: mjanikas at esri.com (Mark Janikas)
Date: Thu, 15 Feb 2007 10:48:39 -0800
Subject: [Numpy-discussion] Update on sparse matrix in binary
Message-ID: <627102C921CD9745B070C3B10CB8199B010EBFB9@hardwire.esri.com>

I wanted to thank all of you who helped me with making my sparse matrix representation cross-platform in binary format! I ended up writing and reading everything explicitly in little endian.

To recap, each row in the matrix is represented by three records: 1) row#, nn (number of non-zero elements); 2) col (indices of non-zero elements in row #); 3) values (doubles corresponding to indices provided in the previous array).

I created the arrays using: a = array([values], '
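A sketch of writing and reading one such row with numpy, everything explicitly little-endian. The '<i4' integer width is an assumption, since the message is truncated at the dtype; '<f8' matches the doubles described above:

import numpy

def write_row(f, rownum, cols, values):
    # record 1: row number and count nn of non-zero elements
    numpy.array([rownum, len(cols)], dtype='<i4').tofile(f)
    # record 2: indices of the non-zero elements in this row
    numpy.asarray(cols, dtype='<i4').tofile(f)
    # record 3: the values, as little-endian doubles
    numpy.asarray(values, dtype='<f8').tofile(f)

def read_row(f):
    rownum, nn = numpy.fromfile(f, dtype='<i4', count=2)
    cols = numpy.fromfile(f, dtype='<i4', count=nn)
    values = numpy.fromfile(f, dtype='<f8', count=nn)
    return rownum, cols, values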
From ellisonbg.net at gmail.com Thu Feb 15 14:11:57 2007
From: ellisonbg.net at gmail.com (Brian Granger)
Date: Thu, 15 Feb 2007 12:11:57 -0700
Subject: [Numpy-discussion] Numpy1.0.1 installation problem - urgent :(
In-Reply-To: <2768a9540702141847y568794f5vff110a4dbd4caf18@mail.gmail.com>
References: <2768a9540702141847y568794f5vff110a4dbd4caf18@mail.gmail.com>
Message-ID: <6ce0ac130702151111h55a60b6dl5228e11234b49179@mail.gmail.com>

I don't run numpy on linux often, but you shouldn't have any trouble. I would do the following:

1. Blast your current numpy install

rm -rf /usr/local/lib/python2.5/site-packages/numpy

2. Get the latest svn version

cd $HOME
svn co http://svn.scipy.org/svn/numpy/trunk numpy

3. Try doing a fresh install (starting from $HOME):

cd numpy
python setup.py build
sudo python setup.py install

Let us know if that helps.

Brian

On 2/14/07, lu yang wrote:
> Hi,
> This is the first time I have installed Numpy on a Linux machine. I have been
> working on it for several days without luck. I would sincerely appreciate any
> comments. My OS is Red Hat 8. I have downloaded Python 2.5 and Numpy 1.0.1.
> Python 2.5 has been installed on my machine. However, when I type
> "python2.5 setup.py install" to install Numpy in my home directory
> /home/o6c11/numpy-1.0.1, I get:
>
> [o6c11 at chorus numpy-1.0.1]$ python2.5 setup.py install
> Traceback (most recent call last):
> File "setup.py", line 89, in
> setup_package()
> File "setup.py", line 59, in setup_package
> from numpy.distutils.core import setup
> File "numpy/__init__.py", line 36, in
> import core
> File "/mnt/storage/home/o6c11/numpy-1.0.1/numpy/core/__init__.py", line 5, in
> import multiarray
> ImportError: No module named multiarray
>
> I changed the directory to /usr/local/lib/python2.5/site-packages/numpy, and got:
>
> [o6c11 at chorus numpy]$ python2.5 setup.py install
> Traceback (most recent call last):
> File "setup.py", line 28, in
> from numpy.distutils.core import setup
> File "/usr/local/lib/python2.5/site-packages/numpy/__init__.py", line 40, in
> import linalg
> File "/usr/local/lib/python2.5/site-packages/numpy/linalg/__init__.py", line 4, in
> from linalg import *
> File "/usr/local/lib/python2.5/site-packages/numpy/linalg/linalg.py", line 25, in
> from numpy.linalg import lapack_lite
> ImportError: /usr/lib/libblas.so.3: undefined symbol: e_wsfe
>
> I also tried to import numpy in this way:
>
> [o6c11 at chorus numpy]$ python2.5
> Python 2.5 (r25:51908, Feb 12 2007, 22:36:33)
> [GCC 3.2 20020903 (Red Hat Linux 8.0 3.2-7)] on linux2
> Type "help", "copyright", "credits" or "license" for more information.
> >>> import numpy
> Traceback (most recent call last):
> File "", line 1, in
> File "/usr/local/lib/python2.5/site-packages/numpy/__init__.py", line 40, in
> import linalg
> File "/usr/local/lib/python2.5/site-packages/numpy/linalg/__init__.py", line 4, in
> from linalg import *
> File "/usr/local/lib/python2.5/site-packages/numpy/linalg/linalg.py", line 25, in
> from numpy.linalg import lapack_lite
> ImportError: /usr/lib/libblas.so.3: undefined symbol: e_wsfe
> >>>
>
> I have no idea how to solve these problems. Please help a newbie. Thanks a lot.

From aisaac at american.edu Thu Feb 15 14:40:18 2007
From: aisaac at american.edu (Alan G Isaac)
Date: Thu, 15 Feb 2007 14:40:18 -0500
Subject: [Numpy-discussion] Unifying numpy, scipy, and matplotlib docstring formats
In-Reply-To:
References: <3d375d730702150902k31a1356ek4743a182902c90aa@mail.gmail.com>
Message-ID:

On Thu, 15 Feb 2007, Keir Mierle apparently wrote:
> * Two different header styles
> The distinction between
> :Parameters:
> and
> Examples
> --------

These are not two items of the same type. ``:Parameters:`` designates a consolidated field (for describing the documented object). This is not a "section".

Cheers,
Alan Isaac

From wfspotz at sandia.gov Thu Feb 15 16:26:50 2007
From: wfspotz at sandia.gov (Bill Spotz)
Date: Thu, 15 Feb 2007 14:26:50 -0700
Subject: [Numpy-discussion] Global Numpy vector with Swig
In-Reply-To: <0A4BC30F-9080-43F7-899B-6F246F345216@tiscalinet.it>
References: <0A4BC30F-9080-43F7-899B-6F246F345216@tiscalinet.it>
Message-ID:

It seems to me you would need to %ignore vec, so that it is not wrapped as a raw pointer to a double, and then in your interface file create a PyArrayObject whose data buffer points to vec (the most efficient way to do this is with %inline). Then use %rename to rename whatever you called your PyArrayObject to "vec" (or not, if you do not care about the name). Finally,

foo.cvar.vec = numpy.zeros(10)

is going to decrement (and probably delete) your original vec and replace it with the new array of zeros (I think; unless there are some safeties put in place for cvars, in which case you would get an exception). Use foo.cvar.vec[:] = ... to manipulate the data in-place.

On Feb 15, 2007, at 1:18 AM, Andrea Tomadin wrote:
> Hi,
> I need to pass a Numpy array to a C code wrapped by Swig.
> The array in the C code is a global variable declared as
> double *vec
> and I would like to set it in the calling Python module foo using e.g.
> foo.cvar.vec = numpy.zeros(10)
> so that the array is manipulated in place.
>
> I found the examples in the numpy/doc/swig dir and adapted them to my
> code, so now I can pass a Numpy array to a function. But in the examples
> I was not able to recognize any suggestion (any typedef?) to translate
> an array into a variable. I'm just writing an intermediate function that
> does the job, but I feel like writing a wrapping function that will be
> wrapped again by Swig...
>
> Any practical example is highly appreciated!
>
> Regards,
> Andrea

** Bill Spotz **
** Sandia National Laboratories Voice: (505)845-0170 **
** P.O. Box 5800 Fax: (505)284-5451 **
** Albuquerque, NM 87185-0370 Email: wfspotz at sandia.gov **
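A short sketch of the difference from the Python side, using the foo/vec names from this thread and assuming the wrapper exposes vec as a length-10 numpy array as Bill describes:

import numpy
import foo  # the SWIG-wrapped module from this thread (hypothetical here)

foo.cvar.vec[:] = numpy.zeros(10)  # in-place: writes through to the C array
# foo.cvar.vec = numpy.zeros(10)   # rebinding: replaces the wrapped object
#                                  # and may delete the original vec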
From tom.denniston at alum.dartmouth.org Thu Feb 15 18:10:33 2007
From: tom.denniston at alum.dartmouth.org (Tom Denniston)
Date: Thu, 15 Feb 2007 17:10:33 -0600
Subject: [Numpy-discussion] problems with auto detection and broadcasting for userdefined numpy types
Message-ID:

I have a userdefined numpy type for mx.DateTime and one for mx.DateTimeDelta. Info on these packages is available here (http://www.egenix.com/files/python/eGenix-mx-Extensions.html). In any case most things work. What I am left with is three problems:

1. I cannot figure out how to get type detection to work cleanly, or even at all.

So this works:

In [35]: numpy.array([D('2002-01-02 00:00:00.00'), D('2002-01-03 00:00:00.00')], dtype=numpy.mxDateTimeType)
Out[35]: array([2002-01-02 00:00:00.00, 2002-01-03 00:00:00.00], dtype=mxNumpyDateTime)

But this doesn't:

In [36]: numpy.array([D('2002-01-02 00:00:00.00'), D('2002-01-03 00:00:00.00')])
Out[36]: array([2002-01-02 00:00:00.00, 2002-01-03 00:00:00.00], dtype=object)

I.e. if I don't specify the type it gives me an object dtype. This is unlike the behavior for strings, ints, etc. I think I know where the C code is that controls this, but it is not obvious to me how to change it without making it dependent on mx.DateTime at compile time or having to put a bunch of #define/#ifdef magic around it. It would be preferable to figure out a way to specify autodetection from the outside, so to speak.

2. I can get printing to work, but only using a quasi-hack in the numpy arrayprint.py code. It works with or without the presence of the mx libraries, but it doesn't seem to me entirely clean.

3. Broadcasting has bugs. See the examples below.

The basic case of (2,) - (2,) arrays works:

In [37]: numpy.array([D('2002-01-02 00:00:00.00'), D('2002-01-03 00:00:00.00')], dtype=numpy.mxDateTimeType) - numpy.array([DateTime.DateTimeDeltaFromDays(1)], dtype=numpy.mxDateTimeDeltaType)
Out[37]: array([2002-01-01 00:00:00.00, 2002-01-02 00:00:00.00], dtype=mxNumpyDateTime)

The broadcasting case of (2,) - (1,) works:

In [39]: numpy.array([D('2002-01-02 00:00:00.00'), D('2002-01-03 00:00:00.00')], dtype=numpy.mxDateTimeType) - numpy.array([DateTime.DateTimeDeltaFromDays(20)], dtype=numpy.mxDateTimeDeltaType)
Out[39]: array([2001-12-13 00:00:00.00, 2001-12-14 00:00:00.00], dtype=mxNumpyDateTime)

The extremely similar (2,) - () case fails:

In [40]: numpy.array([D('2002-01-02 00:00:00.00'), D('2002-01-03 00:00:00.00')], dtype=numpy.mxDateTimeType) - numpy.array(DateTime.DateTimeDeltaFromDays(20), dtype=numpy.mxDateTimeDeltaType)
---------------------------------------------------------------------------
Traceback (most recent call last)
/home/tom/src/ in ()
: function not supported for these types, and can't coerce safely to supported types

Finally, the (2,) - scalar case doesn't work either:

In [41]: numpy.array([D('2002-01-02 00:00:00.00'), D('2002-01-03 00:00:00.00')], dtype=numpy.mxDateTimeType) - DateTime.DateTimeDeltaFromDays(20)
---------------------------------------------------------------------------
Traceback (most recent call last)
/home/tom/src/ in ()
: function not supported for these types, and can't coerce safely to supported types

The final one might not work even if broadcasting worked properly, simply because of the autodetection problem listed in 1. The (2,) - () case, however, should work, I would think, because it has the type explicitly set.

--Tom

From kwgoodman at gmail.com Thu Feb 15 23:35:14 2007
From: kwgoodman at gmail.com (Keith Goodman)
Date: Thu, 15 Feb 2007 20:35:14 -0800
Subject: [Numpy-discussion] Different results from repeated calculation
In-Reply-To:
References:
Message-ID:

On 1/27/07, Keith Goodman wrote:
> I get slightly different results when I repeat a calculation. In a
> long simulation the differences snowball and swamp the effects I am
> trying to measure.
>
> In the attached script there are three tests.
>
> In test1, I construct matrices x and y and then repeatedly calculate z
> = calc(x,y). The result z is the same every time. So this test passes.
> > In test2, I construct matrices x and y each time before calculating z > = calc(x,y). Sometimes z is slightly different---of the order of 1e-21 > to 1e-18. But the x's test to be equal and so do the y's. This test > fails. (It doesn't fail on my friend's windows box; I'm running > linux.) > > test3 is the same as test2 but I calculate z like this: z = calc(100*x > ,y) / (100 * 100). This test passes. > > What is going on? > > Here is some sample output: > > >> import repeat > >> repeat.all() > 4 z different 8.47032947254e-22 > 5 z different 8.47032947254e-22 > 7 z different 8.47032947254e-22 > 9 z different 8.47032947254e-22 > 10 z different 8.47032947254e-22 > 16 z different 8.47032947254e-22 > 24 z different 8.47032947254e-22 > 25 z different 8.47032947254e-22 > 26 z different 8.47032947254e-22 > 27 z different 8.47032947254e-22 > 30 z different 8.47032947254e-22 > 32 z different 8.47032947254e-22 > 34 z different 8.47032947254e-22 > 35 z different 8.47032947254e-22 > 36 z different 8.47032947254e-22 > 39 z different 8.47032947254e-22 > 40 z different 8.47032947254e-22 > 41 z different 8.47032947254e-22 > 45 z different 8.47032947254e-22 > 46 z different 8.47032947254e-22 > 50 z different 8.47032947254e-22 > 52 z different 8.47032947254e-22 > 53 z different 8.47032947254e-22 > 55 z different 8.47032947254e-22 > 56 z different 8.47032947254e-22 > 58 z different 8.47032947254e-22 > 66 z different 8.47032947254e-22 > 67 z different 8.47032947254e-22 > 71 z different 8.47032947254e-22 > 73 z different 8.47032947254e-22 > 74 z different 8.47032947254e-22 > 83 z different 8.47032947254e-22 > 86 z different 8.47032947254e-22 > 87 z different 8.47032947254e-22 > 88 z different 8.47032947254e-22 > 89 z different 8.47032947254e-22 > 90 z different 8.47032947254e-22 > 92 z different 8.47032947254e-22 > > test1: 0 differences > test2: 38 differences > test3: 0 differences > > Repeated runs tend to give me the same number of differences in test2 > for several runs. Then I get a new number of differences which last > for severals runs... I built a new computer: Core 2 Duo 32-bit Debian etch with numpy 1.0.2.dev3546. The repeatability test still fails. In order to make my calculations repeatable I'll have to remove ATLAS. That really slows things down. Does anyone with Debian not have this problem? From kwgoodman at gmail.com Thu Feb 15 23:47:41 2007 From: kwgoodman at gmail.com (Keith Goodman) Date: Thu, 15 Feb 2007 20:47:41 -0800 Subject: [Numpy-discussion] Different results from repeated calculation In-Reply-To: References: Message-ID: On 2/15/07, Keith Goodman wrote: > I built a new computer: Core 2 Duo 32-bit Debian etch with numpy > 1.0.2.dev3546. The repeatability test still fails. In order to make my > calculations repeatable I'll have to remove ATLAS. That really slows > things down. Hey, I have no problem with atlas-base and atlas-sse! On my old debian box all versions of atlas fail the repeatability test. From haase at msg.ucsf.edu Fri Feb 16 00:40:43 2007 From: haase at msg.ucsf.edu (Sebastian Haase) Date: Thu, 15 Feb 2007 21:40:43 -0800 Subject: [Numpy-discussion] Different results from repeated calculation In-Reply-To: References: Message-ID: On 2/15/07, Keith Goodman wrote: > On 2/15/07, Keith Goodman wrote: > > I built a new computer: Core 2 Duo 32-bit Debian etch with numpy > > 1.0.2.dev3546. The repeatability test still fails. In order to make my > > calculations repeatable I'll have to remove ATLAS. That really slows > > things down. 
> > Hey, I have no problem with atlas-base and atlas-sse! On my old debian
> > box all versions of atlas fail the repeatability test.

You mean on the Core 2 Duo 32-bit only atlas-sse2 causes troubles?
How does the speed compare, atlas-sse2 vs. atlas-sse (ignoring the repeatability problem)?

-Sebastian Haase

From cookedm at physics.mcmaster.ca Fri Feb 16 03:03:56 2007
From: cookedm at physics.mcmaster.ca (David M. Cooke)
Date: Fri, 16 Feb 2007 03:03:56 -0500
Subject: [Numpy-discussion] SVN Build, optimized libraries, site.cfg, windows
In-Reply-To: <9135A452397C4096AD412BD0A64C8553@rleendurance>
References: <9135A452397C4096AD412BD0A64C8553@rleendurance>
Message-ID: <19C77FCA-CFC0-4E54-8FDA-348918833655@physics.mcmaster.ca>

On Feb 11, 2007, at 22:51 , Satrajit Ghosh wrote:
> Hi,
>
> I'm also not quite clear whether the optimized FFTW and UMFPACK libraries
> are being used or required in numpy at all, as show_config() doesn't
> report them.
>
> I see that fftw and umfpack are being used for scipy.
>
> I have attached my site.cfg. Any help would be much appreciated.

No, they're only in there for scipy (and for other packages that would like to use them). They're not required for Numpy.

--
|>|\/|<
/------------------------------------------------------------------\
|David M. Cooke http://arbutus.physics.mcmaster.ca/dmc/
|cookedm at physics.mcmaster.ca

From joris at ster.kuleuven.ac.be Fri Feb 16 04:43:20 2007
From: joris at ster.kuleuven.ac.be (joris at ster.kuleuven.ac.be)
Date: Fri, 16 Feb 2007 10:43:20 +0100
Subject: [Numpy-discussion] problem installing numpy on OS X
Message-ID: <1171619000.45d57cb8b8419@webmail.ster.kuleuven.be>

Hi,

I'm trying to install numpy 1.0.2 (trunk) on my Intel Mac with Tiger 10.4.8, but I experience some problems, which I think may be compiler problems. From the beginning:

I installed MacPython 2.5:
Python 2.5 (r25:51918, Sep 19 2006, 08:49:13)
[GCC 4.0.1 (Apple Computer, Inc. build 5341)] on darwin
Type "help", "copyright", "credits" or "license" for more information.

I installed the complete Apple developer tools from my retail CD, which gives me gcc-3.3 and gcc-4.0. I used gcc_select to ensure that gcc-4.0 is the default compiler, although this appeared unnecessary, as it already was. However, there was no 'gcc' in /usr/bin/ so I made a link from gcc to gcc-4.0.

In /Library/Frameworks/Python.framework/Versions/2.5/lib/python2.5/site-packages/ I downloaded the latest numpy from the trunk with svn. "python setup.py install" runs (with errors though, see later), and leads to the following error when trying to import numpy:

~>python
Python 2.5 (r25:51918, Sep 19 2006, 08:49:13)
[GCC 4.0.1 (Apple Computer, Inc. build 5341)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> from numpy import *
Running from numpy source directory.
Cheers, Joris compile options: '-Inumpy/core/src -Inumpy/core/include -I/Library/Frameworks/Python.framework/Versions/2.5/include/python2.5 -c' gcc: _configtest.c _configtest.c: In function 'main': _configtest.c:4: error: 'isnan' undeclared (first use in this function) _configtest.c:4: error: (Each undeclared identifier is reported only once _configtest.c:4: error: for each function it appears in.) _configtest.c: In function 'main': _configtest.c:4: error: 'isnan' undeclared (first use in this function) _configtest.c:4: error: (Each undeclared identifier is reported only once _configtest.c:4: error: for each function it appears in.) lipo: can't figure out the architecture type of: /var/tmp//ccIuW81W.out _configtest.c: In function 'main': _configtest.c:4: error: 'isnan' undeclared (first use in this function) _configtest.c:4: error: (Each undeclared identifier is reported only once _configtest.c:4: error: for each function it appears in.) _configtest.c: In function 'main': _configtest.c:4: error: 'isnan' undeclared (first use in this function) _configtest.c:4: error: (Each undeclared identifier is reported only once _configtest.c:4: error: for each function it appears in.) lipo: can't figure out the architecture type of: /var/tmp//ccIuW81W.out failure. removing: _configtest.c _configtest.o C compiler: gcc -arch ppc -arch i386 -isysroot /Developer/SDKs/MacOSX10.4u.sdk -fno-strict-aliasing -Wno-long-double -no-cpp-precomp -mno-fused-madd -fno-common -dynamic -DNDEBUG -g -O3 compile options: '-Inumpy/core/src -Inumpy/core/include -I/Library/Frameworks/Python.framework/Versions/2.5/include/python2.5 -c' gcc: _configtest.c _configtest.c: In function 'main': _configtest.c:4: error: 'isinf' undeclared (first use in this function) _configtest.c:4: error: (Each undeclared identifier is reported only once _configtest.c:4: error: for each function it appears in.) _configtest.c: In function 'main': _configtest.c:4: error: 'isinf' undeclared (first use in this function) _configtest.c:4: error: (Each undeclared identifier is reported only once _configtest.c:4: error: for each function it appears in.) lipo: can't figure out the architecture type of: /var/tmp//ccbLrVkU.out _configtest.c: In function 'main': _configtest.c:4: error: 'isinf' undeclared (first use in this function) _configtest.c:4: error: (Each undeclared identifier is reported only once _configtest.c:4: error: for each function it appears in.) _configtest.c: In function 'main': _configtest.c:4: error: 'isinf' undeclared (first use in this function) _configtest.c:4: error: (Each undeclared identifier is reported only once _configtest.c:4: error: for each function it appears in.) lipo: can't figure out the architecture type of: /var/tmp//ccbLrVkU.out failure. removing: _configtest.c _configtest.o C compiler: gcc -arch ppc -arch i386 -isysroot /Developer/SDKs/MacOSX10.4u.sdk -fno-strict-aliasing -Wno-long-double -no-cpp-precomp -mno-fused-madd -fno-common -dynamic -DNDEBUG -g -O3 compile options: '-Inumpy/core/src -Inumpy/core/include -I/Library/Frameworks/Python.framework/Versions/2.5/include/python2.5 -c' gcc: _configtest.c _configtest.c:7:2: error: #error No _WIN32 _configtest.c:7:2: error: #error No _WIN32 lipo: can't figure out the architecture type of: /var/tmp//cc10iUsH.out _configtest.c:7:2: error: #error No _WIN32 _configtest.c:7:2: error: #error No _WIN32 lipo: can't figure out the architecture type of: /var/tmp//cc10iUsH.out failure. 
From kwgoodman at gmail.com Fri Feb 16 09:32:01 2007
From: kwgoodman at gmail.com (Keith Goodman)
Date: Fri, 16 Feb 2007 06:32:01 -0800
Subject: [Numpy-discussion] Different results from repeated calculation
In-Reply-To:
References:
Message-ID:

On 2/15/07, Sebastian Haase wrote:
> On 2/15/07, Keith Goodman wrote:
> > On 2/15/07, Keith Goodman wrote:
> > > I built a new computer: Core 2 Duo 32-bit Debian etch with numpy
> > > 1.0.2.dev3546. The repeatability test still fails. In order to make my
> > > calculations repeatable I'll have to remove ATLAS. That really slows
> > > things down.
> >
> > Hey, I have no problem with atlas-base and atlas-sse! On my old debian
> > box all versions of atlas fail the repeatability test.
>
> You mean on the Core 2 Duo 32-bit only atlas-sse2 causes troubles?
> How does the speed compare, atlas-sse2 vs. atlas-sse (ignoring the
> repeatability problem)?

Yes. On my old computer (P4) all three (atlas-base, -sse, -sse2) cause problems. On my new computer only sse2 causes a problem. I only want to know about the speed difference (sse, sse2) if the difference is small.

From nadavh at visionsense.com Fri Feb 16 09:52:28 2007
From: nadavh at visionsense.com (Nadav Horesh)
Date: Fri, 16 Feb 2007 16:52:28 +0200
Subject: [Numpy-discussion] Numpy and iterative procedures
Message-ID: <07C6A61102C94148B8104D42DE95F7E8C8F1C6@exchange2k.envision.co.il>

At first glance it doesn't look hard to, at least, avoid looping over i, by replacing [i] by [:-2], [i+1] by [1:-1] and [i+2] by [2:]. But I might be wrong. Can you submit the piece of code with at least the innermost loop?

Nadav.

-----Original Message-----
From: numpy-discussion-bounces at scipy.org on behalf of Geoffrey Zhu
Sent: Thu 15-Feb-07 18:32
To: Discussion of Numerical Python
Cc:
Subject: Re: [Numpy-discussion] Numpy and iterative procedures

Thanks Chuck.

I am trying to use Successive Over-relaxation to solve linear equations defined by M*v=q.

There are several goals:

1. Eventually (in production) I need it to be fast.
2. I am playing with the guts of the algorithm for now, to see how it works. That means I need some control for now.
3. Even in production, there is a chance I'd like to have the ability to tinker with the algorithm.

_____

From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Charles R Harris
Sent: Thursday, February 15, 2007 10:11 AM
To: Discussion of Numerical Python
Subject: Re: [Numpy-discussion] Numpy and iterative procedures

On 2/15/07, Geoffrey Zhu wrote:

Hi,

I am new to numpy. I'd like to know if it is possible to code efficient iterative procedures with numpy.

Specifically, I have the following problem.

M is an N*N matrix. Q is a N*1 vector. V is an N*1 vector I am trying to find iteratively from the initial value V_0. The procedure is simply to calculate

V_{n+1}[i] = 1/M[i,i]*(q[i] - (M[i,1]*v_{n+1}[1] + M[i,2]*v_{n+1}[2] + .. + M[i,i-1]*v_{n+1}[i-1]) - (M[i,i+1]*v_{n}[i+1] + M[i,i+2]*v_{n}[i+2] + .. + M[i,N]*v_{n}[N]))

I do not see that this is something that can easily be vectorized, is it?

I think it would be better if you stated what the actual problem is. Is it a differential equation, for instance? That way we can determine what the problem class is and what algorithms are available to solve it.
Chuck

From nwagner at iam.uni-stuttgart.de Fri Feb 16 10:18:32 2007
From: nwagner at iam.uni-stuttgart.de (Nils Wagner)
Date: Fri, 16 Feb 2007 16:18:32 +0100
Subject: [Numpy-discussion] Numpy and iterative procedures
In-Reply-To: <07C6A61102C94148B8104D42DE95F7E8C8F1C6@exchange2k.envision.co.il>
References: <07C6A61102C94148B8104D42DE95F7E8C8F1C6@exchange2k.envision.co.il>
Message-ID: <45D5CB48.3080006@iam.uni-stuttgart.de>

Nadav Horesh wrote:
> At first glance it doesn't look hard to, at least, avoid looping over i, by replacing [i] by [:-2], [i+1] by [1:-1] and [i+2] by [2:]. But I might be wrong. Can you submit the piece of code with at least the innermost loop?
>
> Nadav.
>
I guess he is looking for an implementation of
http://en.wikipedia.org/wiki/Successive_over-relaxation

Nils

> -----Original Message-----
> From: numpy-discussion-bounces at scipy.org on behalf of Geoffrey Zhu
> Sent: Thu 15-Feb-07 18:32
> To: Discussion of Numerical Python
> Cc:
> Subject: Re: [Numpy-discussion] Numpy and iterative procedures
>
> Thanks Chuck.
>
> I am trying to use Successive Over-relaxation to solve linear equations
> defined by M*v=q.
>
> There are several goals:
>
> 1. Eventually (in production) I need it to be fast.
> 2. I am playing with the guts of the algorithm for now, to see how it
> works. That means I need some control for now.
> 3. Even in production, there is a chance I'd like to have the ability to
> tinker with the algorithm.
>
> _____
>
> From: numpy-discussion-bounces at scipy.org
> [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Charles R Harris
> Sent: Thursday, February 15, 2007 10:11 AM
> To: Discussion of Numerical Python
> Subject: Re: [Numpy-discussion] Numpy and iterative procedures
>
> On 2/15/07, Geoffrey Zhu wrote:
>
> Hi,
>
> I am new to numpy. I'd like to know if it is possible to code efficient
> iterative procedures with numpy.
>
> Specifically, I have the following problem.
>
> M is an N*N matrix. Q is a N*1 vector. V is an N*1 vector I am trying to
> find iteratively from the initial value V_0. The procedure is simply to
> calculate
>
> V_{n+1}[i] = 1/M[i,i]*(q[i] - (M[i,1]*v_{n+1}[1] + M[i,2]*v_{n+1}[2] + .. + M[i,i-1]*v_{n+1}[i-1]) - (M[i,i+1]*v_{n}[i+1] + M[i,i+2]*v_{n}[i+2] + .. + M[i,N]*v_{n}[N]))
>
> I do not see that this is something that can easily be vectorized, is it?
>
> I think it would be better if you stated what the actual problem is. Is
> it a differential equation, for instance? That way we can determine what
> the problem class is and what algorithms are available to solve it.
> Chuck

From ellisonbg.net at gmail.com Fri Feb 16 10:59:24 2007
From: ellisonbg.net at gmail.com (Brian Granger)
Date: Fri, 16 Feb 2007 08:59:24 -0700
Subject: [Numpy-discussion] problem installing numpy on OS X
In-Reply-To: <1171619000.45d57cb8b8419@webmail.ster.kuleuven.be>
References: <1171619000.45d57cb8b8419@webmail.ster.kuleuven.be>
Message-ID: <6ce0ac130702160759v7c32b050h277404f74906dca1@mail.gmail.com>

Numpy should install fine on your system. If there was no gcc in /usr/bin, then something significant went wrong with your Developer Tools install. I would do a full reinstall of that. Also, gcc 4.0.1 is the default, so there is no reason to use gcc_select.

Where did you get your python? I would use the 2.5 binary on the python.org website.

Lastly, I would remove site-packages/numpy, cd to your numpy source dir, and do:

rm -rf build
python setup.py install

That should work. We build numpy on this system regularly and have no problems if gcc/python are set up properly.

Brian

On 2/16/07, joris at ster.kuleuven.ac.be wrote:
> Hi,
>
> I'm trying to install numpy 1.0.2 (trunk) on my Intel Mac with Tiger 10.4.8,
> but I experience some problems, which I think may be compiler problems. From
> the beginning:
>
> I installed MacPython 2.5:
> Python 2.5 (r25:51918, Sep 19 2006, 08:49:13)
> [GCC 4.0.1 (Apple Computer, Inc. build 5341)] on darwin
> Type "help", "copyright", "credits" or "license" for more information.
>
> I installed the complete Apple developer tools from my retail CD, which gives me
> gcc-3.3 and gcc-4.0. I used gcc_select to ensure that gcc-4.0 is the default
> compiler, although this appeared unnecessary, as it already was. However, there
> was no 'gcc' in /usr/bin/ so I made a link from gcc to gcc-4.0.
>
> In /Library/Frameworks/Python.framework/Versions/2.5/lib/python2.5/site-packages/
> I downloaded the latest numpy from the trunk with svn. "python setup.py install"
> runs (with errors though, see later), and leads to the following error when
> trying to import numpy:
> ~>python
> Python 2.5 (r25:51918, Sep 19 2006, 08:49:13)
> [GCC 4.0.1 (Apple Computer, Inc. build 5341)] on darwin
> Type "help", "copyright", "credits" or "license" for more information.
> >>> from numpy import *
> Running from numpy source directory.
> Traceback (most recent call last): > File "", line 1, in > File > "/Library/Frameworks/Python.framework/Versions/2.5/lib/python2.5/site-packages/numpy/__init__.py", > line 43, in > import ctypeslib > File > "/Library/Frameworks/Python.framework/Versions/2.5/lib/python2.5/site-packages/numpy/ctypeslib.py", > line 5, in > from numpy import integer, ndarray, dtype as _dtype, deprecate > ImportError: cannot import name integer > > > The errors I was talking about seem to relate to the compiler, and are listed > below. Does anyone have a clue how I can resolve them? > > Cheers, > Joris > > > > > > compile options: '-Inumpy/core/src -Inumpy/core/include > -I/Library/Frameworks/Python.framework/Versions/2.5/include/python2.5 -c' > gcc: _configtest.c > _configtest.c: In function 'main': > _configtest.c:4: error: 'isnan' undeclared (first use in this function) > _configtest.c:4: error: (Each undeclared identifier is reported only once > _configtest.c:4: error: for each function it appears in.) > _configtest.c: In function 'main': > _configtest.c:4: error: 'isnan' undeclared (first use in this function) > _configtest.c:4: error: (Each undeclared identifier is reported only once > _configtest.c:4: error: for each function it appears in.) > lipo: can't figure out the architecture type of: /var/tmp//ccIuW81W.out > _configtest.c: In function 'main': > _configtest.c:4: error: 'isnan' undeclared (first use in this function) > _configtest.c:4: error: (Each undeclared identifier is reported only once > _configtest.c:4: error: for each function it appears in.) > _configtest.c: In function 'main': > _configtest.c:4: error: 'isnan' undeclared (first use in this function) > _configtest.c:4: error: (Each undeclared identifier is reported only once > _configtest.c:4: error: for each function it appears in.) > lipo: can't figure out the architecture type of: /var/tmp//ccIuW81W.out > failure. > removing: _configtest.c _configtest.o > C compiler: gcc -arch ppc -arch i386 -isysroot /Developer/SDKs/MacOSX10.4u.sdk > -fno-strict-aliasing -Wno-long-double -no-cpp-precomp -mno-fused-madd > -fno-common -dynamic -DNDEBUG -g -O3 > > compile options: '-Inumpy/core/src -Inumpy/core/include > -I/Library/Frameworks/Python.framework/Versions/2.5/include/python2.5 -c' > gcc: _configtest.c > _configtest.c: In function 'main': > _configtest.c:4: error: 'isinf' undeclared (first use in this function) > _configtest.c:4: error: (Each undeclared identifier is reported only once > _configtest.c:4: error: for each function it appears in.) > _configtest.c: In function 'main': > _configtest.c:4: error: 'isinf' undeclared (first use in this function) > _configtest.c:4: error: (Each undeclared identifier is reported only once > _configtest.c:4: error: for each function it appears in.) > lipo: can't figure out the architecture type of: /var/tmp//ccbLrVkU.out > _configtest.c: In function 'main': > _configtest.c:4: error: 'isinf' undeclared (first use in this function) > _configtest.c:4: error: (Each undeclared identifier is reported only once > _configtest.c:4: error: for each function it appears in.) > _configtest.c: In function 'main': > _configtest.c:4: error: 'isinf' undeclared (first use in this function) > _configtest.c:4: error: (Each undeclared identifier is reported only once > _configtest.c:4: error: for each function it appears in.) > lipo: can't figure out the architecture type of: /var/tmp//ccbLrVkU.out > failure. 
> removing: _configtest.c _configtest.o
> C compiler: gcc -arch ppc -arch i386 -isysroot /Developer/SDKs/MacOSX10.4u.sdk
> -fno-strict-aliasing -Wno-long-double -no-cpp-precomp -mno-fused-madd
> -fno-common -dynamic -DNDEBUG -g -O3
>
> compile options: '-Inumpy/core/src -Inumpy/core/include
> -I/Library/Frameworks/Python.framework/Versions/2.5/include/python2.5 -c'
> gcc: _configtest.c
> _configtest.c:7:2: error: #error No _WIN32
> _configtest.c:7:2: error: #error No _WIN32
> lipo: can't figure out the architecture type of: /var/tmp//cc10iUsH.out
> _configtest.c:7:2: error: #error No _WIN32
> _configtest.c:7:2: error: #error No _WIN32
> lipo: can't figure out the architecture type of: /var/tmp//cc10iUsH.out
> failure.

From gzhu at peak6.com Fri Feb 16 11:34:50 2007
From: gzhu at peak6.com (gzhu at peak6.com)
Date: Fri, 16 Feb 2007 10:34:50 -0600
Subject: [Numpy-discussion] (no subject)
Message-ID: <1171643690.31219.1175011483@webmail.messagingengine.com>

Hi Nadav,

The code is attached at the end. There are probably still bugs in there, but that does not prevent me from showing the difficulty. If you look at the inner loop below, you will see that vector v is updated element by element. The new value of v[i] depends on the new value of v[i-1] and the old value of v[i+1]. Updating an element involves the new values of the already updated elements and the old values of the rest of the elements that we have yet to update. This makes vectorization difficult.

for i in range(1,N-1):
    temp[i] = (1-w)*v[i] + w/D[i]*(q[i] - L[i-1]*v[i-1] - U[i]*v[i+1])
    err += (temp[i]-v[i])**2
    v[i] = temp[i]

Thanks,
Geoffrey

Complete code here:

def sor(v, L, D, U, q, tol, w):
    '''Solve M*v = q and return v. L, D, U are the sub-diagonal,
    diagonal, and super-diagonal of the matrix M.
    '''
    err = 9999999
    N = D.shape[0]  # number of elements
    temp = empty(N)
    while err > tol:
        err = 0
        temp[0] = (1-w)*v[0] + w/D[0]*(q[0] - U[0]*v[1])
        err += (temp[0]-v[0])**2
        v[0] = temp[0]
        for i in range(1,N-1):
            temp[i] = (1-w)*v[i] + w/D[i]*(q[i] - L[i-1]*v[i-1] - U[i]*v[i+1])
            err += (temp[i]-v[i])**2
            v[i] = temp[i]
        temp[N-1] = (1-w)*v[N-1] + w/D[N-1]*(q[N-1] - L[N-2]*v[N-2])
        err += (temp[N-1]-v[N-1])**2
        v[N-1] = temp[N-1]
    return v

-----Original Message-----
From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Nadav Horesh
Sent: Friday, February 16, 2007 8:52 AM
To: Discussion of Numerical Python
Subject: RE: [Numpy-discussion] Numpy and iterative procedures

At first glance it doesn't look hard to, at least, avoid looping over i, by replacing [i] by [:-2], [i+1] by [1:-1] and [i+2] by [2:]. But I might be wrong. Can you submit the piece of code with at least the innermost loop?

Nadav.

-----Original Message-----
From: numpy-discussion-bounces at scipy.org on behalf of Geoffrey Zhu
Sent: Thu 15-Feb-07 18:32
To: Discussion of Numerical Python
Cc:
Subject: Re: [Numpy-discussion] Numpy and iterative procedures

Thanks Chuck.

I am trying to use Successive Over-relaxation to solve linear equations defined by M*v=q.

There are several goals:

1. Eventually (in production) I need it to be fast.
2. I am playing with the guts of the algorithm for now, to see how it works. That means I need some control for now.
3. Even in production, there is a chance I'd like to have the ability to tinker with the algorithm.
_____

From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Charles R Harris
Sent: Thursday, February 15, 2007 10:11 AM
To: Discussion of Numerical Python
Subject: Re: [Numpy-discussion] Numpy and iterative procedures

On 2/15/07, Geoffrey Zhu wrote:

Hi,

I am new to numpy. I'd like to know if it is possible to code efficient iterative procedures with numpy.

Specifically, I have the following problem.

M is an N*N matrix. Q is a N*1 vector. V is an N*1 vector I am trying to find iteratively from the initial value V_0. The procedure is simply to calculate

V_{n+1}[i] = 1/M[i,i]*(q[i] - (M[i,1]*v_{n+1}[1] + M[i,2]*v_{n+1}[2] + .. + M[i,i-1]*v_{n+1}[i-1]) - (M[i,i+1]*v_{n}[i+1] + M[i,i+2]*v_{n}[i+2] + .. + M[i,N]*v_{n}[N]))

I do not see that this is something that can easily be vectorized, is it?

I think it would be better if you stated what the actual problem is. Is it a differential equation, for instance? That way we can determine what the problem class is and what algorithms are available to solve it.

Chuck

From gzhu at peak6.com Fri Feb 16 11:38:06 2007
From: gzhu at peak6.com (Geoffrey Zhu)
Date: Fri, 16 Feb 2007 10:38:06 -0600
Subject: [Numpy-discussion] Numpy and iterative procedures
Message-ID: <99F81FFD0EA54E4DA8D4F1BFE272F34103827C00@ppi-mail1.chicago.peak6.net>

In fact I am not looking for an implementation, but a method to update a vector iteratively when the value of an item of the vector depends partly on the already updated items and partly on the old items.

-----Original Message-----
From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Nils Wagner
Sent: Friday, February 16, 2007 9:19 AM
To: Discussion of Numerical Python
Subject: Re: [Numpy-discussion] Numpy and iterative procedures

Nadav Horesh wrote:
> At first glance it doesn't look hard to, at least, avoid looping over i, by replacing [i] by [:-2], [i+1] by [1:-1] and [i+2] by [2:]. But I might be wrong. Can you submit the piece of code with at least the innermost loop?
>
> Nadav.
>
I guess he is looking for an implementation of
http://en.wikipedia.org/wiki/Successive_over-relaxation

Nils

> -----Original Message-----
> From: numpy-discussion-bounces at scipy.org on behalf of Geoffrey Zhu
> Sent: Thu 15-Feb-07 18:32
> To: Discussion of Numerical Python
> Cc:
> Subject: Re: [Numpy-discussion] Numpy and iterative procedures
>
> Thanks Chuck.
>
> I am trying to use Successive Over-relaxation to solve linear
> equations defined by M*v=q.
>
> There are several goals:
>
> 1. Eventually (in production) I need it to be fast.
> 2. I am playing with the guts of the algorithm for now, to see how it
> works. That means I need some control for now.
> 3. Even in production, there is a chance I'd like to have the ability
> to tinker with the algorithm.
>
> _____
>
> From: numpy-discussion-bounces at scipy.org
> [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Charles R Harris
> Sent: Thursday, February 15, 2007 10:11 AM
> To: Discussion of Numerical Python
> Subject: Re: [Numpy-discussion] Numpy and iterative procedures
>
> On 2/15/07, Geoffrey Zhu wrote:
>
> Hi,
>
> I am new to numpy. I'd like to know if it is possible to code
> efficient iterative procedures with numpy.
>
> Specifically, I have the following problem.
>
> M is an N*N matrix. Q is a N*1 vector. V is an N*1 vector I am trying
> to find iteratively from the initial value V_0.
> The procedure is simply to calculate
>
> V_{n+1}[i] = 1/M[i,i]*(q[i] - (M[i,1]*v_{n+1}[1] + M[i,2]*v_{n+1}[2] + .. + M[i,i-1]*v_{n+1}[i-1]) - (M[i,i+1]*v_{n}[i+1] + M[i,i+2]*v_{n}[i+2] + .. + M[i,N]*v_{n}[N]))
>
> I do not see that this is something that can easily be vectorized, is it?
>
> I think it would be better if you stated what the actual problem is. Is
> it a differential equation, for instance? That way we can determine what
> the problem class is and what algorithms are available to solve it.
>
> Chuck
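Along the lines of Nadav's slicing suggestion, the tridiagonal sweep can be vectorized with a red-black (odd-even) ordering. A sketch only: it updates all even-indexed unknowns, then all odd-indexed ones, so the iterates differ from the strictly sequential sweep in sor() above, although both converge to the solution of M*v = q. It uses the same L, D, U, q, tol, w conventions and returns a new array rather than updating v in place:

import numpy

def sor_redblack(v, L, D, U, q, tol, w):
    N = D.shape[0]
    Lp = numpy.concatenate(([0.0], L))         # Lp[i] multiplies v[i-1]
    Up = numpy.concatenate((U, [0.0]))         # Up[i] multiplies v[i+1]
    vp = numpy.concatenate(([0.0], v, [0.0]))  # v with ghost zeros at the ends
    err = tol + 1.0
    while err > tol:
        old = vp[1:-1].copy()
        for start in (0, 1):                   # even sites, then odd sites
            i = numpy.arange(start, N, 2)
            vp[i+1] = (1-w)*vp[i+1] + w/D[i]*(q[i] - Lp[i]*vp[i] - Up[i]*vp[i+2])
        err = ((vp[1:-1] - old)**2).sum()
    return vp[1:-1]

In each half-sweep the right-hand side only reads neighbors of the opposite color, which is why a whole half-sweep can be done with one fancy-indexed assignment.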
From shuntim.luk at polyu.edu.hk Fri Feb 16 00:13:23 2007
From: shuntim.luk at polyu.edu.hk (LUK ShunTim)
Date: Fri, 16 Feb 2007 13:13:23 +0800
Subject: [Numpy-discussion] Possible solution to binary distribution problems for numpy on linux?
In-Reply-To: 
References: <45CFF4D0.5080401@ar.media.kyoto-u.ac.jp> <45D118FC.7020403@ar.media.kyoto-u.ac.jp> <45D14E1C.4010809@polyu.edu.hk>
Message-ID: <45D53D73.5080206@polyu.edu.hk>

Keir Mierle wrote:
> On 2/15/07, Charles R Harris wrote:
>> On 2/12/07, LUK ShunTim wrote:
>>> David Cournapeau wrote:
>>>> Keith Goodman wrote:
>>>>> On 2/11/07, David Cournapeau wrote:
>>>>>> My impression is that binary distribution of numpy is a big problem
>>>>>> for many linux users, and that is an entry barrier for many users (I may
>>>>>> be wrong, that's just an impression from the ML).
>>>>>> Do all of the major GNU/Linux distributions have recent versions of
>>>>>> NumPy?
>>>>> Debian Etch is at NumPy 1.0.1
>>>> I think debian has numpy now (I am not using debian on workstation
>>>> anymore, so I am not really following), but what about new versions of
>>>> numpy/scipy ? If I want to give some of my code to people in my lab who
>>>> do not use the same distribution than me, can I give a 10 minutes
>>>> instructions set to have everything ?
>>>>
>>> "Bleeding-edge apt-get repository" according to their web page.
>>> "To use it add the following line to you /etc/apt/sources.list"
>>>
>>> deb http://deb-scipy.alioth.debian.org/apt ./
>> Yep. In general, if you are using a Linux distro it is quite easy to use the
>> svn repository. It seems to be on the Mac and Windows that folks have
>> problems, particularly in filling the dependencies on Atlas if they want
>> efficiency.
>>
>> That said, Numpy and MatplotLab have settled enough that the standard
>> packages are probably adequate.
>
> While this is true for people admining their own machines, this is not the case
> for people without root access. For example, it is a major hassle to install
> numpy on the computers at my university for exactly this reason.
>

Probably OT now, ...

Ya, I can understand your feeling very well and therefore, I administer
my own box. :-)

A possible workaround for Debian is debootstrap, which enables one to
build a chroot debian system inside an existing one. Some information
here

http://wiki.debian.org/Debootstrap

Regards,
ST
--

From tom.and at tiscalinet.it Fri Feb 16 12:59:35 2007
From: tom.and at tiscalinet.it (Andrea Tomadin)
Date: Fri, 16 Feb 2007 18:59:35 +0100
Subject: [Numpy-discussion] Global Numpy vector with Swig
In-Reply-To: 
References: <0A4BC30F-9080-43F7-899B-6F246F345216@tiscalinet.it>
Message-ID: 

On 15 Feb 2007, at 22:26, Bill Spotz wrote:

> It seems to me you would need to %ignore vec, so that it is not
> wrapped as a raw pointer to a double, and then in your interface
> file create a PyArrayObject whose data buffer points to vec (the
> most efficient way to do this is with %inline). Then use %rename
> to rename whatever you called your PyArrayObject to "vec" (or not,
> if you do not care about the name). Finally,
>
> foo.cvar.vec = numpy.zeros(10)
>
> is going to decrement (and probably delete) your original vec and
> replace it with the new array of zeros (I think,

Thank you Bill for your help, even if writing this code is not
straightforward for me. Anyway, while I was reading the docs, I
became aware of the varin, varout typemap options.
I wonder if I can define a
typemap(varin) double *
declaration, and insert in it the same code that in "numpy.i"
translates the PyArrayObject into a C array and vice versa.
But I cannot find an example of typemap(varin) for Python, so I don't
know what to use for the output, instead of $1. Where are the typemaps
(varin) for Python?
It seems strange that nobody wrote public code for such a simple
task, as I read that Swig has supported Python since the 90s! I suspect
I have just not looked in the right places?!

Thank you again for help!
Andrea

From alec at mihailovs.com Fri Feb 16 14:37:52 2007
From: alec at mihailovs.com (Alec Mihailovs)
Date: Fri, 16 Feb 2007 19:37:52 +0000 (UTC)
Subject: [Numpy-discussion] magic_square
Message-ID: 

I saw somewhere a comparison between numpy and Matlab/Octave. One of the
Matlab/Octave commands that is missing in Numpy is magic(n) for producing
n-by-n magic squares.

Yesterday I posted a magic_square module in the CheeseShop, containing this
missing magic(n) command together with 2 others, ismagic(a) and
magic_constant(a),

http://www.python.org/pypi/magic_square/0.1
http://mihailovs.com/Alec/Python/magic_square.html

I wouldn't object to adding them directly to the numpy package. It's
probably an RTFM question, but I couldn't find it on the web and I didn't
BTFM yet - what is the standard way of submitting a module for inclusion
in numpy?

Another question that I have is that I have a link to NumPy on my
magic_square page cited above. This is a kind of advertising and I've read
that NumPy has a BSD-type license including the advertising claim. Do I
have to get written permission from somebody to have a link to the NumPy
download page on my page?

Alec Mihailovs
http://mihailovs.com/Alec/

From joris at ster.kuleuven.ac.be Fri Feb 16 14:56:02 2007
From: joris at ster.kuleuven.ac.be (joris at ster.kuleuven.ac.be)
Date: Fri, 16 Feb 2007 20:56:02 +0100
Subject: [Numpy-discussion] problem installing numpy on OSX
Message-ID: <1171655762.45d60c52e876f@webmail.ster.kuleuven.be>

Brian, thanks for your answer! I removed and completely reinstalled the
Developer Tools. The issue with gcc seems to be resolved now. My python
binary is indeed the one from python.org, but nevertheless I also
reinstalled Python 2.5. To install numpy I started all over again with
svn. The results are, however, exactly the same.

Then I tried one version lower: numpy-1.0.1 (instead of 1.0.2 of the
trunk). The same error messages during the setup.py procedure, but they
don't seem to matter this time: afterwards I can import numpy without any
obvious problems.

I know that numpy 1.0.2 installs well on my Linux box, but there I use
gcc-3.4 instead of gcc-4.0. Might this be a compiler issue? What version
of numpy did you install? Did you notice any warning or error messages at
all?

Joris

Disclaimer: http://www.kuleuven.be/cwis/email_disclaimer.htm

From oliphant at ee.byu.edu Fri Feb 16 15:09:43 2007
From: oliphant at ee.byu.edu (Travis Oliphant)
Date: Fri, 16 Feb 2007 13:09:43 -0700
Subject: [Numpy-discussion] magic_square
In-Reply-To: 
References: 
Message-ID: <45D60F87.6040205@ee.byu.edu>

Alec Mihailovs wrote:

>I saw somewhere a comparison between numpy and Matlab/Octave. One of the
>Matlab/Octave commands that is missing in Numpy is magic(n) for producing
>n-by-n magic squares.
> >Yesterday I posted a magic_square module in the CheeseShop, containing this >missing magic(n) command together with 2 others, ismagic(a) and magic_constant >(a), > >http://www.python.org/pypi/magic_square/0.1 >http://mihailovs.com/Alec/Python/magic_square.html > >I wouldn't object adding them directly to the numpy package. It's, probably, >RTFM question, but I couldn't find it on the web and I didn't BTFM yet - what >is the standard way of submitting a module for including in numpy? > > You can submit a patch as a ticket on the Trac pages. The trend is to put this kind of stuff into SciPy instead of NumPy. I think we are trying to trim NumPy in the long term. >Another question that I have is that I have a link to NumPy on my magic_square >page cited above. This is a kind of advertising and I've read that NumPy has a >BSD type license including the advertising claim. Do I have to get a written >permission from somebody to have a link to NumPy download page on my page? > > > I'm not sure what the "advertising claim" is. The only "advertising" claim is that you can't use the name of the NumPy developers or contributors in "support" of your project. You can use the "NumPy" name because your package depends on it. As far as I'm concerned just placing a link to NumPy does not require written permission. -Travis From wfspotz at sandia.gov Fri Feb 16 15:10:12 2007 From: wfspotz at sandia.gov (Bill Spotz) Date: Fri, 16 Feb 2007 13:10:12 -0700 Subject: [Numpy-discussion] Global Numpy vector with Swig In-Reply-To: References: <0A4BC30F-9080-43F7-899B-6F246F345216@tiscalinet.it> Message-ID: <8D052BAE-4853-4F71-AD27-FDDB609C8392@sandia.gov> Andrea, It is my understanding that swig typemaps only apply to function arguments. Since what you are talking about is a global variable, I don't believe typemaps will help you. I would try %{ #include "header-that-contains-vec.h" npy_intp vec_dims[ ] = { (npy_intp) length_of_vec }; %} %rename(vec) numpy_vec; %ignore vec; %include "header-that-contains-vec.h" %inline %{ PyObject * numpy_vec = PyArray_SimpleNewFromData(1, vec_dims, NPY_DOUBLE, (void*)vec); %} I guess it would be nice if you could force a "Py_DECREF(numpy_vec);" before the module is destroyed, but I'll leave that as an exercise... On Feb 16, 2007, at 10:59 AM, Andrea Tomadin wrote: > Il giorno 15/feb/07, alle ore 22:26, Bill Spotz ha scritto: > >> It seems to me you would need to %ignore vec, so that it is not >> wrapped as a raw pointer to a double, and then in your interface >> file create a PyArrayObject whose data buffer points to vec (the >> most efficient way to do this is with %inline). Then use %rename >> to rename whatever you called your PyArrayObject to "vec" (or not, >> if you do not care about the name). Finally, >> >> foo.cvar.vec = numpy.zeros(10) >> >> is going to decrement (and probably delete) your original vec and >> replace it with the new array of zeros (I think, > > Thank you Bill for your help, even if writing this code is not > straightforward for me. Anyway, while I was reading the docs, I > became aware of the varin, varout typemap options. I wonder if I can > define a > typemap(varin) double * > declaration, and insert in it the same code that in "numpy.i" > translates the PyArrayObject into a C array and vice versa. > But I cannot find an example of typemp(varin) for Python, so I don't > know what to use for the output, instead of $1. Where are the typemaps > (varin) for Python? 
> It seems strange that nobody wrote a public code for such a simple > task, as I read that Swig supports Python since the 90s! I suspect I > have just not looked in the right places?! > > Thank you again for help! > Andrea ** Bill Spotz ** ** Sandia National Laboratories Voice: (505)845-0170 ** ** P.O. Box 5800 Fax: (505)284-5451 ** ** Albuquerque, NM 87185-0370 Email: wfspotz at sandia.gov ** From wfspotz at sandia.gov Fri Feb 16 15:18:07 2007 From: wfspotz at sandia.gov (Bill Spotz) Date: Fri, 16 Feb 2007 13:18:07 -0700 Subject: [Numpy-discussion] Global Numpy vector with Swig In-Reply-To: <8D052BAE-4853-4F71-AD27-FDDB609C8392@sandia.gov> References: <0A4BC30F-9080-43F7-899B-6F246F345216@tiscalinet.it> <8D052BAE-4853-4F71-AD27-FDDB609C8392@sandia.gov> Message-ID: OK, I looked at the varin, varout descriptions in the online manual, and they specifically mention global variables, but WOW is that documentation minimal. I would suggest asking Swig- user at lists.sourceforge.net for some assistance. On Feb 16, 2007, at 1:10 PM, Bill Spotz wrote: > Andrea, > > It is my understanding that swig typemaps only apply to function > arguments. Since what you are talking about is a global variable, > I don't believe typemaps will help you. I would try > > %{ > #include "header-that-contains-vec.h" > npy_intp vec_dims[ ] = { (npy_intp) length_of_vec }; > %} > %rename(vec) numpy_vec; > %ignore vec; > %include "header-that-contains-vec.h" > %inline %{ > PyObject * numpy_vec = PyArray_SimpleNewFromData(1, vec_dims, > NPY_DOUBLE, (void*)vec); > %} > > I guess it would be nice if you could force a "Py_DECREF > (numpy_vec);" before the module is destroyed, but I'll leave that > as an exercise... > > On Feb 16, 2007, at 10:59 AM, Andrea Tomadin wrote: > >> Il giorno 15/feb/07, alle ore 22:26, Bill Spotz ha scritto: >> >>> It seems to me you would need to %ignore vec, so that it is not >>> wrapped as a raw pointer to a double, and then in your interface >>> file create a PyArrayObject whose data buffer points to vec (the >>> most efficient way to do this is with %inline). Then use %rename >>> to rename whatever you called your PyArrayObject to "vec" (or not, >>> if you do not care about the name). Finally, >>> >>> foo.cvar.vec = numpy.zeros(10) >>> >>> is going to decrement (and probably delete) your original vec and >>> replace it with the new array of zeros (I think, >> >> Thank you Bill for your help, even if writing this code is not >> straightforward for me. Anyway, while I was reading the docs, I >> became aware of the varin, varout typemap options. I wonder if I can >> define a >> typemap(varin) double * >> declaration, and insert in it the same code that in "numpy.i" >> translates the PyArrayObject into a C array and vice versa. >> But I cannot find an example of typemp(varin) for Python, so I don't >> know what to use for the output, instead of $1. Where are the >> typemaps >> (varin) for Python? >> It seems strange that nobody wrote a public code for such a simple >> task, as I read that Swig supports Python since the 90s! I suspect I >> have just not looked in the right places?! >> >> Thank you again for help! >> Andrea > > ** Bill Spotz ** > ** Sandia National Laboratories Voice: (505)845-0170 ** > ** P.O. Box 5800 Fax: (505)284-5451 ** > ** Albuquerque, NM 87185-0370 Email: wfspotz at sandia.gov ** > > ** Bill Spotz ** ** Sandia National Laboratories Voice: (505)845-0170 ** ** P.O. 
Box 5800 Fax: (505)284-5451 **
** Albuquerque, NM 87185-0370 Email: wfspotz at sandia.gov **

From seb.haase at gmx.net Fri Feb 16 16:51:17 2007
From: seb.haase at gmx.net (Sebastian Haase)
Date: Fri, 16 Feb 2007 13:51:17 -0800
Subject: [Numpy-discussion] Different results from repeated calculation
In-Reply-To: 
References: 
Message-ID: 

On 2/16/07, Keith Goodman wrote:
> On 2/15/07, Sebastian Haase wrote:
> > On 2/15/07, Keith Goodman wrote:
> > > On 2/15/07, Keith Goodman wrote:
> > > > I built a new computer: Core 2 Duo 32-bit Debian etch with numpy
> > > > 1.0.2.dev3546. The repeatability test still fails. In order to make my
> > > > calculations repeatable I'll have to remove ATLAS. That really slows
> > > > things down.
> > >
> > > Hey, I have no problem with atlas-base and atlas-sse! On my old debian
> > > box all versions of atlas fail the repeatability test.
> >
> > You mean on the Core 2 Duo 32-bit only atlas-sse2 causes troubles ?
> > How does the speed compare atlas-sse2 vs. atlas-sse (ignoring the
> > repeatability problem)?
>
> Yes. On my old computer (P4) all three (atlas-base, -sse, -sse2) cause
> problems. On my new computer only sse2 causes a problem.
>
> I only want to know about the speed difference (sse, sse2) if the
> difference is small.

I was just wondering what generally the speed improvement from sse to
sse2 is? Any tentative number would be fine...

-S.

From alec at mihailovs.com Fri Feb 16 20:23:35 2007
From: alec at mihailovs.com (Alec Mihailovs)
Date: Sat, 17 Feb 2007 01:23:35 +0000 (UTC)
Subject: [Numpy-discussion] magic_square
References: <45D60F87.6040205@ee.byu.edu>
Message-ID: 

Travis Oliphant <oliphant at ee.byu.edu> writes:

> You can submit a patch as a ticket on the Trac pages. The trend is to
> put this kind of stuff into SciPy instead of NumPy. I think we are
> trying to trim NumPy in the long term.

I'll look into this.

> As far as I'm concerned just placing a link to NumPy does not require
> written permission.

Glad to hear that.

Thank you,
Alec Mihailovs
http://mihailovs.com/Alec/

From nadavh at visionsense.com Sat Feb 17 02:14:58 2007
From: nadavh at visionsense.com (Nadav Horesh)
Date: Sat, 17 Feb 2007 09:14:58 +0200
Subject: [Numpy-discussion] (no subject)
Message-ID: <07C6A61102C94148B8104D42DE95F7E8C8F1C9@exchange2k.envision.co.il>

It looks like you encountered a fundamental shortcoming of numpy (or in
fact any similar system like octave, matlab etc...): the dependence on
values calculated in the previous iteration cannot be vectorized easily.
If you have access to a C compiler, I urge you to write (at least) the
inner loop with pyrex or a similar package that can easily link C code
with numpy. It would help you a lot to realize these kinds of algorithms
with a reasonable execution time without losing much of the python's
benefits.

Nadav

-----Original Message-----
From: numpy-discussion-bounces at scipy.org on behalf of gzhu at peak6.com
Sent: Fri 16-Feb-07 18:34
To: numpy-discussion at scipy.org
Subject: [Numpy-discussion] (no subject)

Hi Nadav,

The code is attached at the end. There are probably still bugs in there,
but they do not prevent me from showing the difficulty.

If you look at the inner loop below, you will see that vector v is
updated element by element. The new value of v[i] depends on the new
value of v[i-1] and the old value of v[i+1]. Updating an element involves
the new values of the already updated elements and the old values of the
rest of the elements that we have yet to update. This makes vectorization
difficult.
for i in range(1,N-1):
    temp[i] = (1-w)*v[i] + w/D[i]*(q[i] - L[i-1]*v[i-1] - U[i]*v[i+1])
    err += (temp[i]-v[i])**2
    v[i] = temp[i]

Thanks,
Geoffrey

Complete code here:

from numpy import empty

def sor(v, L, D, U, q, tol, w):
    '''solve M*v=q. return v. L, D, U are the sub-diagonal, diagonal,
    and super-diagonal of the matrix M.
    '''
    err = 9999999
    N = D.shape[0]    # number of elements
    temp = empty(N)
    while err > tol:
        err = 0
        temp[0] = (1-w)*v[0] + w/D[0]*(q[0] - U[0]*v[1])
        err += (temp[0]-v[0])**2
        v[0] = temp[0]
        for i in range(1,N-1):
            temp[i] = (1-w)*v[i] + w/D[i]*(q[i] - L[i-1]*v[i-1] - U[i]*v[i+1])
            err += (temp[i]-v[i])**2
            v[i] = temp[i]
        temp[N-1] = (1-w)*v[N-1] + w/D[N-1]*(q[N-1] - L[N-2]*v[N-2])
        err += (temp[N-1]-v[N-1])**2
        v[N-1] = temp[N-1]
    return v

-----Original Message-----
From: numpy-discussion-bounces at scipy.org
[mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Nadav Horesh
Sent: Friday, February 16, 2007 8:52 AM
To: Discussion of Numerical Python
Subject: RE: [Numpy-discussion] Numpy and iterative procedures

At first glance it doesn't look hard to, at least, avoid looping over i,
by replacing [i] by [:-2], [i+1] by [1:-1] and [i+2] by [2:]. But I might
be wrong. Can you submit the piece of code with at least the most
internal loop?

Nadav.

-----Original Message-----
From: numpy-discussion-bounces at scipy.org on behalf of Geoffrey Zhu
Sent: Thu 15-Feb-07 18:32
To: Discussion of Numerical Python
Subject: Re: [Numpy-discussion] Numpy and iterative procedures

Thanks Chuck.

I am trying to use Successive Over-relaxation to solve linear equations
defined by M*v=q. There are several goals:

1. Eventually (in production) I need it to be fast.
2. I am playing with the guts of the algorithm for now, to see how it
works. That means I need some control for now.
3. Even in production, there is a chance I'd like to have the ability to
tinker with the algorithm.

_____

From: numpy-discussion-bounces at scipy.org
[mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Charles R Harris
Sent: Thursday, February 15, 2007 10:11 AM
To: Discussion of Numerical Python
Subject: Re: [Numpy-discussion] Numpy and iterative procedures

On 2/15/07, Geoffrey Zhu wrote:

Hi,

I am new to numpy. I'd like to know if it is possible to code efficient
iterative procedures with numpy.

Specifically, I have the following problem.

M is an N*N matrix. Q is a N*1 vector. V is an N*1 vector I am trying to
find iteratively from the initial value V_0. The procedure is simply to
calculate

v_{n+1}[i] = 1/M[i,i]*(q[i] -
(M[i,1]*v_{n+1}[1] + M[i,2]*v_{n+1}[2] + .. + M[i,i-1]*v_{n+1}[i-1]) -
(M[i,i+1]*v_{n}[i+1] + M[i,i+2]*v_{n}[i+2] + .. + M[i,N]*v_{n}[N]))

I do not see that this is something that can easily be vectorized, is it?

I think it would be better if you stated what the actual problem is. Is
it a differential equation, for instance. That way we can determine what
the problem class is and what algorithms are available to solve it.
Chuck

_______________________________________________
Numpy-discussion mailing list
Numpy-discussion at scipy.org
http://projects.scipy.org/mailman/listinfo/numpy-discussion

From barrywark at gmail.com Sun Feb 18 02:41:14 2007
From: barrywark at gmail.com (Barry Wark)
Date: Sat, 17 Feb 2007 23:41:14 -0800
Subject: [Numpy-discussion] [Matplotlib-users] [matplotlib-devel] Unifying numpy, scipy, and matplotlib docstring formats
In-Reply-To: <1e2af89e0702160148u6f3b4ce4re11423f4ca9a35ec@mail.gmail.com>
References: <45D4ED02.1050908@noaa.gov> <45D50245.1080102@noaa.gov> <1e2af89e0702160148u6f3b4ce4re11423f4ca9a35ec@mail.gmail.com>
Message-ID: 

Perhaps we should consider two use cases: interactive use a la Matlab
and larger code bases. In the first case, being able to import * saves a
lot of typing and the namespace pollution problem isn't a big deal. The
second use, obviously, benefits from avoiding import *.

Returning to the OP's questions, why couldn't both cases be helped by
creating a "meta-package" for numpy, scipy, and matplotlib? For the sake
of argument, let's call the package "plab". Existing code could be
affected by changing the individual packages, but a package that
essentially does

from pylab import *
from numpy import *
from scipy import *

would give a standard API that future code and interactive use could
use. Code could do

import plab

plab.plot() #etc.

and interactive use could do from plab import *.

Just a thought...
Barry

On 2/16/07, Matthew Brett wrote:
> Hi,
>
> > I think a consensus is building in the python community that you should
> > NEVER use import *!
>
> Well, I have only been coding python for a few years, but I would say,
> along with writing unit tests, the great importance of not using
> import * is one of the secrets that you learn slowly and painfully
> with experience. Chris' point about the movement of big projects away
> from that idiom is a very good one. It is convenient, but over time
> you realize that the value of convenience is far outweighed by the
> namespace mess and loss of clarity that results.
>
> Best,
>
> Matthew
>
> -------------------------------------------------------------------------
> Take Surveys. Earn Cash. Influence the Future of IT
> Join SourceForge.net's Techsay panel and you'll get the chance to share your
> opinions on IT & business topics through brief surveys-and earn cash
> http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
> _______________________________________________
> Matplotlib-users mailing list
> Matplotlib-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/matplotlib-users
>

From david at ar.media.kyoto-u.ac.jp Sun Feb 18 05:14:30 2007
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Sun, 18 Feb 2007 19:14:30 +0900
Subject: [Numpy-discussion] Forcing the use of unoptimized blas/lapack even when atlas is present
Message-ID: <45D82706.3000604@ar.media.kyoto-u.ac.jp>

Hi there,

I am developing a building tool to automatically build the whole
numpy/scipy/matplotlib set from sources including dependencies, and one
of the problems I have is to force which blas/lapack version to use when
building numpy and scipy.
I thought that doing a BLAS=blaslib LAPACK=lapacklib python
setup.config was enough when building numpy, but numpy still wants to use
atlas.
I would like to avoid using site.cfg if possible, as I want to build everything automatically, cheers, David From matthew.brett at gmail.com Sun Feb 18 06:46:55 2007 From: matthew.brett at gmail.com (Matthew Brett) Date: Sun, 18 Feb 2007 11:46:55 +0000 Subject: [Numpy-discussion] [Matplotlib-users] [matplotlib-devel] Unifying numpy, scipy, and matplotlib docstring formats In-Reply-To: References: <45D4ED02.1050908@noaa.gov> <45D50245.1080102@noaa.gov> <1e2af89e0702160148u6f3b4ce4re11423f4ca9a35ec@mail.gmail.com> Message-ID: <1e2af89e0702180346g24f324a7je1f535f5fd6e0b96@mail.gmail.com> Hi, > import plab > > plab.plot() #etc. > > and interactive use could do from plab import *. Yes... It's a hard call of course. I am a long term matlab user, and switched to python relatively recently. I do see the attraction of persuading people that you can get something very similar to matlab easily. The downside about making numpy / python like matlab is that you soon realize that you really have to think about your problems differently, and write code in a different way. I know that's obvious, but the variables as pointers, mutable / immutable types, zero based indexing, arrays vs matrices are all (fruitful) stumbling blocks. Then there is the very large change of thinking in an OO way, pulling in other large packages for doing other tasks, writing well-structured code with tests - all the features that python gives you for an industrial strength code base. And, the more pylab looks like matlab, the more surprised and confused people will be when they switch. So, I would argue that getting as close to matlab as possible should not be the unqualified goal here - it is a real change, with real pain, but great benefits. Best, Matthew From ndbecker2 at gmail.com Sun Feb 18 13:10:53 2007 From: ndbecker2 at gmail.com (Neal Becker) Date: Sun, 18 Feb 2007 13:10:53 -0500 Subject: [Numpy-discussion] [matplotlib-devel] Unifying numpy, scipy, and matplotlib docstring formats References: <45D4ED02.1050908@noaa.gov> <45D50245.1080102@noaa.gov> <1e2af89e0702160148u6f3b4ce4re11423f4ca9a35ec@mail.gmail.com> <1e2af89e0702180346g24f324a7je1f535f5fd6e0b96@mail.gmail.com> Message-ID: I have never used matlab, but a lot of my colleagues do. Can anyone give me some good references that I could show them to explain the advantages of python over matlab? From williams at astro.ox.ac.uk Sun Feb 18 18:46:21 2007 From: williams at astro.ox.ac.uk (Michael Williams) Date: Sun, 18 Feb 2007 23:46:21 +0000 Subject: [Numpy-discussion] [matplotlib-devel] Unifying numpy, scipy, and matplotlib docstring formats In-Reply-To: References: <45D4ED02.1050908@noaa.gov> <45D50245.1080102@noaa.gov> <1e2af89e0702160148u6f3b4ce4re11423f4ca9a35ec@mail.gmail.com> <1e2af89e0702180346g24f324a7je1f535f5fd6e0b96@mail.gmail.com> Message-ID: <20070218234621.GA398@astro.ox.ac.uk> On Sun, Feb 18, 2007 at 01:10:53PM -0500, Neal Becker wrote: > I have never used matlab, but a lot of my colleagues do. Can anyone give me > some good references that I could show them to explain the advantages of > python over matlab? 
http://www.scipy.org/NumPyProConPage http://thread.gmane.org/gmane.comp.python.scientific.user/8950 http://groups.google.com/group/comp.lang.python/browse_frm/thread/a71af37fd9372868/ -- Mike From barrywark at gmail.com Sun Feb 18 19:03:38 2007 From: barrywark at gmail.com (Barry Wark) Date: Sun, 18 Feb 2007 16:03:38 -0800 Subject: [Numpy-discussion] [Matplotlib-users] [matplotlib-devel] Unifying numpy, scipy, and matplotlib docstring formats In-Reply-To: <1e2af89e0702180346g24f324a7je1f535f5fd6e0b96@mail.gmail.com> References: <45D4ED02.1050908@noaa.gov> <45D50245.1080102@noaa.gov> <1e2af89e0702160148u6f3b4ce4re11423f4ca9a35ec@mail.gmail.com> <1e2af89e0702180346g24f324a7je1f535f5fd6e0b96@mail.gmail.com> Message-ID: Matt, Yes, I agree. I wasn't coming at so much from the goal of making Pylab a Matlab clone (as you point out, that's silly, and misses much of the advantage of Python), but rather from the goal of making interactive use as efficient as possible. When I fire up ipython -pylab to do some quick exploration, it's nice not to have to type N.blah or pylab.plot etc. If I just import pylab *, however, then the commands I use may not be what I expect from more formal coding where I use N.blah numpy, S.foo for scipy, and pylab.bar for matplotlib. Making it easy for users to have either namespace strategy, with consistent bindings, ala the start of this thread is a good idea, IMO. Well, I've said my piece. I'll get out of the way and let others have a crack... Barry On 2/18/07, Matthew Brett wrote: > Hi, > > > import plab > > > > plab.plot() #etc. > > > > and interactive use could do from plab import *. > > Yes... It's a hard call of course. I am a long term matlab user, and > switched to python relatively recently. I do see the attraction of > persuading people that you can get something very similar to matlab > easily. The downside about making numpy / python like matlab is that > you soon realize that you really have to think about your problems > differently, and write code in a different way. I know that's > obvious, but the variables as pointers, mutable / immutable types, > zero based indexing, arrays vs matrices are all (fruitful) stumbling > blocks. Then there is the very large change of thinking in an OO way, > pulling in other large packages for doing other tasks, writing > well-structured code with tests - all the features that python gives > you for an industrial strength code base. And, the more pylab looks > like matlab, the more surprised and confused people will be when they > switch. So, I would argue that getting as close to matlab as > possible should not be the unqualified goal here - it is a real > change, with real pain, but great benefits. > > Best, > > Matthew > > ------------------------------------------------------------------------- > Take Surveys. Earn Cash. 
Influence the Future of IT > Join SourceForge.net's Techsay panel and you'll get the chance to share your > opinions on IT & business topics through brief surveys-and earn cash > http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV > _______________________________________________ > Matplotlib-users mailing list > Matplotlib-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/matplotlib-users > From zpincus at stanford.edu Mon Feb 19 05:56:34 2007 From: zpincus at stanford.edu (Zachary Pincus) Date: Mon, 19 Feb 2007 02:56:34 -0800 Subject: [Numpy-discussion] linalg.eigh orders eigenvalues/eigenvectors differently than linalg.eig Message-ID: Hello all, It seems that the 'eigh' routine from numpy.linalg does not follow the same convention as numpy.linalg.eig in terms of the order of the returned eigenvalues. (And thus eigenvectors as well...) Specifically, eig returns eigenvalues in order from largest to smallest, while eigh returns them from smallest to largest. Example: >>> a = numpy.array([[21, 28, 35],[28, 38, 48],[35, 48, 61]]) >>> numpy.linalg.eigh(a) (array([ -1.02825542e-14, 7.04131679e-01, 1.19295868e+02]), array([[ 0.40824829, -0.81314396, -0.41488581], [-0.81649658, -0.12200588, -0.56431188], [ 0.40824829, 0.56913221, -0.71373795]])) >>> numpy.linalg.eig(a) (array([ 1.19295868e+02, 7.04131679e-01, 4.62814557e-15]), array([[-0.41488581, -0.81314396, 0.40824829], [-0.56431188, -0.12200588, -0.81649658], [-0.71373795, 0.56913221, 0.40824829]])) Is this a bug? If it is, though, fixing it now might break code that depends on this 'wrong' order. (This is also present in scipy.linalg.) If not a bug, or not-fixable-now, then at least some documentation as to the convention regarding ordering of eigenvalues is probably worthwhile... Any thoughts? Zach From svetosch at gmx.net Mon Feb 19 06:06:31 2007 From: svetosch at gmx.net (Sven Schreiber) Date: Mon, 19 Feb 2007 12:06:31 +0100 Subject: [Numpy-discussion] linalg.eigh orders eigenvalues/eigenvectors differently than linalg.eig In-Reply-To: References: Message-ID: <45D984B7.9060200@gmx.net> Zachary Pincus schrieb: > Hello all, > > It seems that the 'eigh' routine from numpy.linalg does not follow > the same convention as numpy.linalg.eig in terms of the order of the > returned eigenvalues. (And thus eigenvectors as well...) I was told on this list that the ordering should not be relied upon, and that it might change in the future. So it seems that user code should explicitly re-order the eigenvalues (and corresponding eigenvectors, probably using argsort and fancy indexing -- if those are the right terms). ... > scipy.linalg.) If not a bug, or not-fixable-now, then at least some > documentation as to the convention regarding ordering of eigenvalues > is probably worthwhile... > true cheers, sven From a.u.r.e.l.i.a.n at gmx.net Mon Feb 19 07:14:50 2007 From: a.u.r.e.l.i.a.n at gmx.net (Johannes Loehnert) Date: Mon, 19 Feb 2007 13:14:50 +0100 Subject: [Numpy-discussion] linalg.eigh orders eigenvalues/eigenvectors differently than linalg.eig In-Reply-To: <45D984B7.9060200@gmx.net> References: <45D984B7.9060200@gmx.net> Message-ID: <200702191314.50846.a.u.r.e.l.i.a.n@gmx.net> On Monday 19 February 2007 12:06, Sven Schreiber wrote: > Zachary Pincus schrieb: > > Hello all, > > > > It seems that the 'eigh' routine from numpy.linalg does not follow > > the same convention as numpy.linalg.eig in terms of the order of the > > returned eigenvalues. (And thus eigenvectors as well...) 
> > I was told on this list that the ordering should not be relied upon, and > that it might change in the future. So it seems that user code should > explicitly re-order the eigenvalues (and corresponding eigenvectors, > probably using argsort and fancy indexing -- if those are the right terms). Indeed. eig and eigh are wrappers for lapack functions, so the result is whatever those give back. Do not rely on a particular order of eigenvalues, sort yourself. Short example for convenience: #--------- eigvals, eigvecs = eig(some_matrix) ind = argsort(eigvals) eigvals = eigvals[ind] eigvecs = eigvecs[:, ind] # second axis !! # etc. #------------ Johannes From lyang at unb.ca Mon Feb 19 07:51:07 2007 From: lyang at unb.ca (Yang, Lu) Date: Mon, 19 Feb 2007 08:51:07 -0400 Subject: [Numpy-discussion] Numpy1.0.1 installation problem - urgent :( In-Reply-To: <6ce0ac130702151111h55a60b6dl5228e11234b49179@mail.gmail.com> References: <2768a9540702141847y568794f5vff110a4dbd4caf18@mail.gmail.com> <6ce0ac130702151111h55a60b6dl5228e11234b49179@mail.gmail.com> Message-ID: <1171889467.45d99d3b00dd7@webmail.unb.ca> Thanks so much. It works. Lu Quoting Brian Granger : > I don't run numpy no linux often, but you shouldn't have any trouble. > I would do the following: > > 1. Blast your current numpy install > > rm -rf /usr/local/lib/python2.5/site-packages/numpy > > 2. Get the lastest svn version > > cd $HOME > svn co http://svn.scipy.org/svn/numpy/trunk numpy > > 3. Try doing a fresh install (starting from $HOME): > > cd numpy > python setup.py build > sudo python setup.py install > > Let us know if that helps. > > Brian > > On 2/14/07, lu yang wrote: > > Hi, > > This is the first time I install Numpy on a linux machine. I have been > > working on it for several days without luck. I sincerely appreciate if > > anybody can give any comments. My OS is Red Hat 8. > > I have downloaded Python 2.5 and Numpy 1.0.1. Python 2.5 has been installed > > on my machine. However, when I type: > > "python2.5 setup.py install" to install Numpy in my home directory > > /home/o6c11/numpy-1.0.1 , I got: > > > > [o6c11 at chorus numpy-1.0.1]$ python2.5 setup.py install > > Traceback (most recent call last): > > File "setup.py", line 89, in > > setup_package() > > File "setup.py", line 59, in setup_package > > from numpy.distutils.core import setup > > File "numpy/__init__.py", line 36, in > > import core > > File "/mnt/storage/home/o6c11/numpy-1.0.1/numpy/core/__init__.py", line 5, > > in > > import multiarray > > ImportError: No module named multiarray > > > > I changed the directory to > > /usr/local/lib/python2.5/site-packages/numpy, I got: > > > > [o6c11 at chorus numpy]$ python2.5 setup.py install > > Traceback (most recent call last): > > File "setup.py", line 28, in > > from numpy.distutils.core import setup > > File > > "/usr/local/lib/python2.5/site-packages/numpy/__init__.py", > > line 40, in > > import linalg > > File > > "/usr/local/lib/python2.5/site-packages/numpy/linalg/__init__.py", > > line 4, in > > from linalg import * > > File > > "/usr/local/lib/python2.5/site-packages/numpy/linalg/linalg.py", > > line 25, in > > from numpy.linalg import lapack_lite > > ImportError: /usr/lib/libblas.so.3: undefined symbol: e_wsfe > > > > I also tried to import numpy in this way: > > > > [o6c11 at chorus numpy]$ python2.5 > > Python 2.5 (r25:51908, Feb 12 2007, 22:36:33) > > [GCC 3.2 20020903 (Red Hat Linux 8.0 3.2-7)] on linux2 > > Type "help", "copyright", "credits" or "license" for more information. 
> > >>> import numpy > > Traceback (most recent call last): > > File "", line 1, in > > File > > "/usr/local/lib/python2.5/site-packages/numpy/__init__.py", > > line 40, in > > import linalg > > File > > "/usr/local/lib/python2.5/site-packages/numpy/linalg/__init__.py", > > line 4, in > > from linalg import * > > File > > "/usr/local/lib/python2.5/site-packages/numpy/linalg/linalg.py", > > line 25, in > > from numpy.linalg import lapack_lite > > ImportError: /usr/lib/libblas.so.3: undefined symbol: e_wsfe > > >>> > > > > I have no idea how to solve these problems. Please help a newbie. Thanks a > > lot. > > _______________________________________________ > > Numpy-discussion mailing list > > Numpy-discussion at scipy.org > > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > > > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From nwagner at iam.uni-stuttgart.de Mon Feb 19 08:19:20 2007 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Mon, 19 Feb 2007 14:19:20 +0100 Subject: [Numpy-discussion] linalg.eigh orders eigenvalues/eigenvectors differently than linalg.eig In-Reply-To: <200702191314.50846.a.u.r.e.l.i.a.n@gmx.net> References: <45D984B7.9060200@gmx.net> <200702191314.50846.a.u.r.e.l.i.a.n@gmx.net> Message-ID: <45D9A3D8.4060506@iam.uni-stuttgart.de> Johannes Loehnert wrote: > On Monday 19 February 2007 12:06, Sven Schreiber wrote: > >> Zachary Pincus schrieb: >> >>> Hello all, >>> >>> It seems that the 'eigh' routine from numpy.linalg does not follow >>> the same convention as numpy.linalg.eig in terms of the order of the >>> returned eigenvalues. (And thus eigenvectors as well...) >>> >> I was told on this list that the ordering should not be relied upon, and >> that it might change in the future. So it seems that user code should >> explicitly re-order the eigenvalues (and corresponding eigenvectors, >> probably using argsort and fancy indexing -- if those are the right terms). >> > > Indeed. eig and eigh are wrappers for lapack functions, so the result is > whatever those give back. Do not rely on a particular order of eigenvalues, > sort yourself. > > Short example for convenience: > #--------- > eigvals, eigvecs = eig(some_matrix) > ind = argsort(eigvals) > eigvals = eigvals[ind] > eigvecs = eigvecs[:, ind] # second axis !! > # etc. > #------------ > > Johannes > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > eigh is based on *heev (* placeholder for s,d,c,z) http://www.netlib.org/lapack/complex16/zheev.f * W (output) DOUBLE PRECISION array, dimension (N) * If INFO = 0, the eigenvalues in ascending order. Apparently the eigenvalues are in ascending order for eigh. Hence, It would be nice if scipy could offer more wrappers for matrices with certain properties, e.g. http://www.netlib.org/lapack/double/dsygv.f http://www.netlib.org/lapack/double/dsyevr.f http://www.netlib.org/lapack/double/dsygvx.f http://www.netlib.org/lapack/double/dstevr.f Any comments ? Nils Also LAPACK3.1 has many improved eigensolvers. Please try suzuki.py with LAPACK3.0 >>> w_e array([-200. +0.j , -100. +0.j , 700.00307816+0.00103379j, 700.00307816-0.00103379j, 699.9993562 +0.00318262j, 699.9993562 -0.00318262j, 699.99756564+0.00214883j, 699.99756564-0.00214883j]) versus LAPACK3.1. >>> w_e array([-200. +0.j , -100. 
+0.j , 699.99891191+0.00188463j, 699.99891191-0.00188463j, 700.00217619+0.j , 700.00294523+0.j , 699.99852738+0.0025507j , 699.99852738-0.0025507j ]) -------------- next part -------------- A non-text attachment was scrubbed... Name: suzuki.py Type: text/x-python Size: 890 bytes Desc: not available URL: From robert.kern at gmail.com Mon Feb 19 11:04:58 2007 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 19 Feb 2007 10:04:58 -0600 Subject: [Numpy-discussion] Forcing the use of unoptimized blas/lapack even when atlas is present In-Reply-To: <45D82706.3000604@ar.media.kyoto-u.ac.jp> References: <45D82706.3000604@ar.media.kyoto-u.ac.jp> Message-ID: <45D9CAAA.3030409@gmail.com> David Cournapeau wrote: > Hi there, > > I am developing a building tool to automatically build the whole > numpy/scipy/matplotlib set from sources including dependencies, and one > of the problem I got is to force which blas/lapack version to use when > building numpy and scipy. > I thought that doing a BLAS=blaslib LAPACK=lapacklib python > setup.config was enough when build numpy, but numpy still wants to use > atlas. I would like to avoid using site.cfg if possible, as I want to > build everything automatically, Set ATLAS=0, I believe. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From nwagner at iam.uni-stuttgart.de Mon Feb 19 11:09:51 2007 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Mon, 19 Feb 2007 17:09:51 +0100 Subject: [Numpy-discussion] Forcing the use of unoptimized blas/lapack even when atlas is present In-Reply-To: <45D9CAAA.3030409@gmail.com> References: <45D82706.3000604@ar.media.kyoto-u.ac.jp> <45D9CAAA.3030409@gmail.com> Message-ID: <45D9CBCF.6060909@iam.uni-stuttgart.de> Robert Kern wrote: > David Cournapeau wrote: > >> Hi there, >> >> I am developing a building tool to automatically build the whole >> numpy/scipy/matplotlib set from sources including dependencies, and one >> of the problem I got is to force which blas/lapack version to use when >> building numpy and scipy. >> I thought that doing a BLAS=blaslib LAPACK=lapacklib python >> setup.config was enough when build numpy, but numpy still wants to use >> atlas. I would like to avoid using site.cfg if possible, as I want to >> build everything automatically, >> > > Set ATLAS=0, I believe. > > IIRC, export ATLAS=None should also work. Nils From cookedm at physics.mcmaster.ca Mon Feb 19 11:15:04 2007 From: cookedm at physics.mcmaster.ca (David M. Cooke) Date: Mon, 19 Feb 2007 11:15:04 -0500 Subject: [Numpy-discussion] Forcing the use of unoptimized blas/lapack even when atlas is present In-Reply-To: <45D9CAAA.3030409@gmail.com> References: <45D82706.3000604@ar.media.kyoto-u.ac.jp> <45D9CAAA.3030409@gmail.com> Message-ID: <40111626-9A15-43F6-85E6-6419F07A7CAC@physics.mcmaster.ca> On Feb 19, 2007, at 11:04 , Robert Kern wrote: > David Cournapeau wrote: >> Hi there, >> >> I am developing a building tool to automatically build the whole >> numpy/scipy/matplotlib set from sources including dependencies, >> and one >> of the problem I got is to force which blas/lapack version to use >> when >> building numpy and scipy. >> I thought that doing a BLAS=blaslib LAPACK=lapacklib python >> setup.config was enough when build numpy, but numpy still wants to >> use >> atlas. 
>> I would like to avoid using site.cfg if possible, as I want to
>> build everything automatically,
>
> Set ATLAS=0, I believe.

Not quite, you need LAPACK=None BLAS=None (ATLAS=None is only needed if
ATLAS is being looked for specifically, i.e., system_info.atlas_info is
used instead of system_info.lapack_opt_info in the setup.py. AFAIK,
that's only used when debugging ATLAS installs in scipy).

--
|>|\/|<
/------------------------------------------------------------------\
|David M. Cooke http://arbutus.physics.mcmaster.ca/dmc/
|cookedm at physics.mcmaster.ca

From yves.frederix at gmail.com Mon Feb 19 12:11:20 2007
From: yves.frederix at gmail.com (Yves Frederix)
Date: Mon, 19 Feb 2007 18:11:20 +0100
Subject: [Numpy-discussion] numpy.dot and ACML
Message-ID: <20070219171120.GA5609@kotnet.org>

Hi all,

I have managed to compile numpy using pathscale and ACML on a 64-bit AMD
system. Now I wanted to verify that numpy.dot indeed uses the ACML libs.
The example for dot()
(http://www.scipy.org/Numpy_Example_List?highlight=%28example%29#head-c7a573f030ff7cbaea62baf219599b3976136bac)
suggests a way of doing this:

u0050015 at lo-03-02 .../core $ python -c "import numpy; print id(numpy.dot)==id(numpy.core.multiarray.dot);"
True

This indicates that I am not using the acml libraries. When running a
benchmark (see attachment) and comparing to a non-ACML installation
though, the strange thing is that there is a clear speed difference,
suggesting again that the acml libraries are indeed used.

Because this is not all that clear to me, I was wondering whether there
exists an alternative way of verifying what libraries are used.

Many thanks,
YVES

-------------- next part --------------
ACML:

dim        x.T*y      x*y.T      A*x        A*B        A.T*x
-----------------------------------------------------------------
5000       0.002492   0.002417   0.002412   0.002399   0.002416
50000      0.020074   0.020024   0.020004   0.020003   0.020024
100000     0.092777   0.093690   0.100220   0.093787   0.094250
200000     0.184933   0.198623   0.196120   0.197089   0.197273
300000     0.276583   0.279177   0.280898   0.284016   0.276204
500000     0.476340   0.481987   0.471875   0.480868   0.481501
1000000.0  0.892623   0.895500   0.915173   0.894815   0.922501
5000000.0  4.450555   4.465748   4.467870   4.468188   4.469083

No ACML:

dim        x.T*y      x*y.T      A*x        A*B        A.T*x
-----------------------------------------------------------------
5000       0.002523   0.002428   0.002410   0.002430   0.002419
50000      0.024756   0.061520   0.036575   0.036399   0.036450
100000     0.338576   0.353074   0.169472   0.302087   0.334633
200000     0.670803   0.735732   0.538166   0.649335   0.744496
300000     1.004381   1.269259   0.482542   2.194308   0.611997
500000     1.110656   1.504701   1.571736   1.656021   1.491146
1000000.0  2.182746   2.234478   2.254645   2.439508   2.537558
5000000.0  10.878910  16.578266  8.265109   8.905976   17.124400
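On Yves' question of verifying which libraries numpy actually linked
against, two quick checks that don't rely on timing. This is only a
sketch: numpy.__config__.show() is standard, while the second probe
assumes the 2007-era layout in which the optional BLAS-accelerated dot
lives in numpy.core._dotblas:

import numpy

# Print the BLAS/LAPACK libraries numpy was configured with at build time.
numpy.__config__.show()

# numpy.dot is only BLAS-accelerated when the optional _dotblas module
# was built; otherwise the plain multiarray implementation is used.
try:
    import numpy.core._dotblas
    print 'numpy.dot is the BLAS-accelerated version'
except ImportError:
    print 'numpy.dot is the plain multiarray version'

On linux, running ldd on the extension modules under numpy/core/ (e.g.
lapack_lite.so, or _dotblas.so if present) also shows which shared
libraries they actually resolve to at run time.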
From zpincus at stanford.edu Tue Feb 20 14:33:16 2007
From: zpincus at stanford.edu (Zachary Pincus)
Date: Tue, 20 Feb 2007 11:33:16 -0800
Subject: [Numpy-discussion] Distributing prebuilt numpy and other extensions
Message-ID: 

Hello folks,

I've developed some command-line tools for biologists using
python/numpy and some custom C and Fortran extensions, and I'm trying to
figure out how to easily distribute them...

For people using linux, I figure a source distribution is no problem
at all. (Right?)

On the other hand, for Mac users (whose computers by default don't
have the dev tools, and even then would need to get a fortran
compiler elsewhere) I'd like to figure out something a bit easier.

I'd like to somehow provide an installer (double-clickable or python
script) that does a version check and then installs an appropriate
version of prebuilt binaries for numpy and my C and Fortran
extensions. Is this possible within the bounds of the python or numpy
distutils? Would setuptools be a better way to go?

Preferably it would be a dead easy, one-step thing... Or is this
whole idea problematic, and better to stick with source distribution
in all cases?

Thanks for any advice,

Zach Pincus

Program in Biomedical Informatics and Department of Biochemistry
Stanford University School of Medicine

From mjanikas at esri.com Tue Feb 20 18:56:20 2007
From: mjanikas at esri.com (Mark Janikas)
Date: Tue, 20 Feb 2007 15:56:20 -0800
Subject: [Numpy-discussion] Greek Letters
Message-ID: <627102C921CD9745B070C3B10CB8199B010EBFBB@hardwire.esri.com>

Hello all,

I was wondering how I could print the chi-squared symbol in python. I
have been looking at the Unicode docs, but I figured I would ask for
assistance here while I delve into it. Thanks for any help in advance.

Mark Janikas
Product Engineer
ESRI, Geoprocessing
380 New York St.
Redlands, CA 92373
909-793-2853 (2563)
mjanikas at esri.com

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From zpincus at stanford.edu Tue Feb 20 19:18:05 2007
From: zpincus at stanford.edu (Zachary Pincus)
Date: Tue, 20 Feb 2007 16:18:05 -0800
Subject: [Numpy-discussion] Greek Letters
In-Reply-To: <627102C921CD9745B070C3B10CB8199B010EBFBB@hardwire.esri.com>
References: <627102C921CD9745B070C3B10CB8199B010EBFBB@hardwire.esri.com>
Message-ID: <7E429C6F-DF05-42C1-BDE6-53CFF1090DDA@stanford.edu>

I have found that the python 'unicode name' escape sequence, combined
with the canonical list of unicode names
( http://unicode.org/Public/UNIDATA/NamesList.txt ), is a good way of
getting the symbols you want and still keeping the python code legible.

From the above list, we see that the symbol name we want is GREEK SMALL
LETTER CHI, so:

chi = u'\N{GREEK SMALL LETTER CHI}'

will do the trick. For chi^2, use:

chi2 = u'\N{GREEK SMALL LETTER CHI}\N{SUPERSCRIPT TWO}'

Note that to print these characters, we usually need to encode them
somehow. My terminal supports UTF-8, so the following works for me:

import codecs
print codecs.encode(chi2, 'utf8')

giving (if your mail reader supports utf8 and mine encodes it
properly...):

χ²

Zach Pincus

Program in Biomedical Informatics and Department of Biochemistry
Stanford University School of Medicine

On Feb 20, 2007, at 3:56 PM, Mark Janikas wrote:

> Hello all,
>
> I was wondering how I could print the chi-squared symbol in
> python. I have been looking at the Unicode docs, but I figured I
> would ask for assistance here while I delve into it. Thanks for
> any help in advance.
>
> Mark Janikas
>
> Product Engineer
>
> ESRI, Geoprocessing
>
> 380 New York St.
> Redlands, CA 92373
>
> 909-793-2853 (2563)
>
> mjanikas at esri.com
>
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From robert.kern at gmail.com Tue Feb 20 19:19:55 2007
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 20 Feb 2007 18:19:55 -0600
Subject: [Numpy-discussion] Greek Letters
In-Reply-To: <627102C921CD9745B070C3B10CB8199B010EBFBB@hardwire.esri.com>
References: <627102C921CD9745B070C3B10CB8199B010EBFBB@hardwire.esri.com>
Message-ID: <45DB902B.70009@gmail.com>

Mark Janikas wrote:
> Hello all,
>
> I was wondering how I could print the chi-squared symbol in python. I
> have been looking at the Unicode docs, but I figured I would ask for
> assistance here while I delve into it. Thanks for any help in advance.

Print it where? To the terminal (which one?)? In HTML? With some GUI?

Assuming that you have a Unicode-capable terminal, you can find out the
encoding it uses by looking at sys.stdout.encoding. Encode your Unicode
string with that encoding, and print it. E.g., I use iTerm on OS X and
set it to use UTF-8 as the encoding:

In [5]: import sys

In [6]: sys.stdout.encoding
Out[6]: 'UTF-8'

In [7]: print u'\u03a7\u00b2'.encode(sys.stdout.encoding)
Χ²

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
-- Umberto Eco

From mjanikas at esri.com Tue Feb 20 20:11:42 2007
From: mjanikas at esri.com (Mark Janikas)
Date: Tue, 20 Feb 2007 17:11:42 -0800
Subject: [Numpy-discussion] Greek Letters
In-Reply-To: <7E429C6F-DF05-42C1-BDE6-53CFF1090DDA@stanford.edu>
Message-ID: <627102C921CD9745B070C3B10CB8199B010EBFBC@hardwire.esri.com>

Thanks for all the info. That website with all the codes is great.

MJ

-----Original Message-----
From: numpy-discussion-bounces at scipy.org
[mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Zachary Pincus
Sent: Tuesday, February 20, 2007 4:18 PM
To: Discussion of Numerical Python
Subject: Re: [Numpy-discussion] Greek Letters

I have found that the python 'unicode name' escape sequence, combined
with the canonical list of unicode names
( http://unicode.org/Public/UNIDATA/NamesList.txt ), is a good way of
getting the symbols you want and still keeping the python code legible.

From the above list, we see that the symbol name we want is GREEK SMALL
LETTER CHI, so:

chi = u'\N{GREEK SMALL LETTER CHI}'

will do the trick. For chi^2, use:

chi2 = u'\N{GREEK SMALL LETTER CHI}\N{SUPERSCRIPT TWO}'

Note that to print these characters, we usually need to encode them
somehow. My terminal supports UTF-8, so the following works for me:

import codecs
print codecs.encode(chi2, 'utf8')

giving (if your mail reader supports utf8 and mine encodes it
properly...):

χ²

Zach Pincus

Program in Biomedical Informatics and Department of Biochemistry
Stanford University School of Medicine

On Feb 20, 2007, at 3:56 PM, Mark Janikas wrote:

> Hello all,
>
> I was wondering how I could print the chi-squared symbol in
> python. I have been looking at the Unicode docs, but I figured I
> would ask for assistance here while I delve into it. Thanks for
> any help in advance.
>
> Mark Janikas
>
> Product Engineer
>
> ESRI, Geoprocessing
>
> 380 New York St.
> Redlands, CA 92373
>
> 909-793-2853 (2563)
>
> mjanikas at esri.com
>
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

_______________________________________________
Numpy-discussion mailing list
Numpy-discussion at scipy.org
http://projects.scipy.org/mailman/listinfo/numpy-discussion

From mjanikas at esri.com Tue Feb 20 20:16:23 2007
From: mjanikas at esri.com (Mark Janikas)
Date: Tue, 20 Feb 2007 17:16:23 -0800
Subject: [Numpy-discussion] Greek Letters
In-Reply-To: <45DB902B.70009@gmail.com>
Message-ID: <627102C921CD9745B070C3B10CB8199B010EBFBD@hardwire.esri.com>

Thanks Robert.... but alas, I get.....

>>> import sys
>>> sys.stdout.encoding
'cp437'
>>> print u'\u03a7\u00b2'.encode(sys.stdout.encoding)
Traceback (most recent call last):
  File "<stdin>", line 1, in ?
  File "C:\Python24\lib\encodings\cp437.py", line 18, in encode
    return codecs.charmap_encode(input,errors,encoding_map)
UnicodeEncodeError: 'charmap' codec can't encode character u'\u03a7' in
position 0: character maps to <undefined>
>>>

I'll keep at it.... please let me know if you have any solutions....

Thanks again,

MJ

-----Original Message-----
From: numpy-discussion-bounces at scipy.org
[mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Robert Kern
Sent: Tuesday, February 20, 2007 4:20 PM
To: Discussion of Numerical Python
Subject: Re: [Numpy-discussion] Greek Letters

Mark Janikas wrote:
> Hello all,
>
> I was wondering how I could print the chi-squared symbol in python. I
> have been looking at the Unicode docs, but I figured I would ask for
> assistance here while I delve into it. Thanks for any help in advance.

Print it where? To the terminal (which one?)? In HTML? With some GUI?

Assuming that you have a Unicode-capable terminal, you can find out the
encoding it uses by looking at sys.stdout.encoding. Encode your Unicode
string with that encoding, and print it. E.g., I use iTerm on OS X and
set it to use UTF-8 as the encoding:

In [5]: import sys

In [6]: sys.stdout.encoding
Out[6]: 'UTF-8'

In [7]: print u'\u03a7\u00b2'.encode(sys.stdout.encoding)
Χ²

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
-- Umberto Eco

_______________________________________________
Numpy-discussion mailing list
Numpy-discussion at scipy.org
http://projects.scipy.org/mailman/listinfo/numpy-discussion

From mjanikas at esri.com Tue Feb 20 20:29:25 2007
From: mjanikas at esri.com (Mark Janikas)
Date: Tue, 20 Feb 2007 17:29:25 -0800
Subject: [Numpy-discussion] Greek Letters
In-Reply-To: <627102C921CD9745B070C3B10CB8199B010EBFBD@hardwire.esri.com>
Message-ID: <627102C921CD9745B070C3B10CB8199B010EBFBE@hardwire.esri.com>

Oh. I am using CygWin, and the website I just went to:

http://www.cygwin.com/faq/faq_3.html

stated that: "The short answer is that Cygwin is not Unicode-aware"

Not sure if this is going to apply to python in general, but I suspect
it will. Ugh, I dislike Windows a lot, but it pays the bills.

The interesting thing to note is that the printout to the gui interface
is 'UTF-8' so it works. It just won't work on my terminal where I do all
of my testing. I might just have to put a try statement in and put a
"chi-square" in the except.
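(For reference, a minimal version of the try/except fallback Mark
describes here -- a sketch, not his code, assuming Python 2 as in the
sessions above, with "chi-square" as the plain-ASCII stand-in:)

import sys

chi2 = u'\N{GREEK SMALL LETTER CHI}\N{SUPERSCRIPT TWO}'
try:
    # sys.stdout.encoding can be None (e.g. when output is piped),
    # so fall back to ascii for the encode attempt in that case
    print chi2.encode(sys.stdout.encoding or 'ascii')
except UnicodeEncodeError:
    # the terminal encoding (cp437 here) has no chi: print plain text
    print 'chi-square'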
MJ -----Original Message----- From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Mark Janikas Sent: Tuesday, February 20, 2007 5:16 PM To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] Greek Letters Thanks Robert.... but alas, I get..... >>> import sys >>> sys.stdout.encoding 'cp437' >>> print u'\u03a7\u00b2'.encode(sys.stdout.encoding) Traceback (most recent call last): File "", line 1, in ? File "C:\Python24\lib\encodings\cp437.py", line 18, in encode return codecs.charmap_encode(input,errors,encoding_map) UnicodeEncodeError: 'charmap' codec can't encode character u'\u03a7' in position 0: character maps to >>> Ill keep at it.... please let me know if you have any solutions.... Thanks again, MJ -----Original Message----- From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Robert Kern Sent: Tuesday, February 20, 2007 4:20 PM To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] Greek Letters Mark Janikas wrote: > Hello all, > > I was wondering how I could print the chi-squared symbol in python. I > have been looking at the Unicode docs, but I figured I would ask for > assistance here while I delve into it. Thanks for any help in advance. Print it where? To the terminal (which one?)? In HTML? With some GUI? Assuming that you have a Unicode-capable terminal, you can find out the encoding it uses by looking at sys.stdout.encoding. Encode your Unicode string with that encoding, and print it. E.g., I use iTerm on OS X and set it to use UTF-8 as the encoding: In [5]: import sys In [6]: sys.stdout.encoding Out[6]: 'UTF-8' In [7]: print u'\u03a7\u00b2'.encode(sys.stdout.encoding) ?? -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco _______________________________________________ Numpy-discussion mailing list Numpy-discussion at scipy.org http://projects.scipy.org/mailman/listinfo/numpy-discussion _______________________________________________ Numpy-discussion mailing list Numpy-discussion at scipy.org http://projects.scipy.org/mailman/listinfo/numpy-discussion From robert.kern at gmail.com Tue Feb 20 20:36:19 2007 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 20 Feb 2007 19:36:19 -0600 Subject: [Numpy-discussion] Greek Letters In-Reply-To: <627102C921CD9745B070C3B10CB8199B010EBFBD@hardwire.esri.com> References: <627102C921CD9745B070C3B10CB8199B010EBFBD@hardwire.esri.com> Message-ID: <45DBA213.9050805@gmail.com> Mark Janikas wrote: > Thanks Robert.... but alas, I get..... > >>>> import sys >>>> sys.stdout.encoding > 'cp437' >>>> print u'\u03a7\u00b2'.encode(sys.stdout.encoding) > Traceback (most recent call last): > File "", line 1, in ? > File "C:\Python24\lib\encodings\cp437.py", line 18, in encode > return codecs.charmap_encode(input,errors,encoding_map) > UnicodeEncodeError: 'charmap' codec can't encode character u'\u03a7' in position > 0: character maps to > > > Ill keep at it.... please let me know if you have any solutions.... Yup, CP437 doesn't support the ? character. You will have to use a terminal that can accept a full-Unicode encoding like UTF-8 and a font that has the relevant characters. Most of the modern terminal emulators for Linux et al. are capable of UTF-8. On Windows, you may be out of luck. I don't know of any fully-Unicode-capable terminal. 
Google tells me that cmd.exe takes a /U argument that should put it in Unicode mode, but that doesn't change anything for me. I also ran into a case where sys.stdout.encoding is None, so my solution is not as robust as I thought. If you use something else for your interpreter shell like IDLE, it is possible that it supports Unicode. sys.stdout.encoding may not be set appropriately, though. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From stefan at sun.ac.za Tue Feb 20 20:39:31 2007 From: stefan at sun.ac.za (Stefan van der Walt) Date: Wed, 21 Feb 2007 03:39:31 +0200 Subject: [Numpy-discussion] Greek Letters In-Reply-To: <627102C921CD9745B070C3B10CB8199B010EBFBE@hardwire.esri.com> References: <627102C921CD9745B070C3B10CB8199B010EBFBD@hardwire.esri.com> <627102C921CD9745B070C3B10CB8199B010EBFBE@hardwire.esri.com> Message-ID: <20070221013931.GO5912@mentat.za.net> On Tue, Feb 20, 2007 at 05:29:25PM -0800, Mark Janikas wrote: > Oh. I am using CygWin, and the website I just went to: > > http://www.cygwin.com/faq/faq_3.html > > > stated that: " The short answer is that Cygwin is not Unicode-aware" > > Not sure if this is going to apply to python in general, but I > suspect it will. Ugh, I dislike Windows a lot, but it pays the bills. Actually you pay Bill :) Maybe try the free vmplayer with a linux session? Cheers Stéfan From mjanikas at esri.com Tue Feb 20 20:47:19 2007 From: mjanikas at esri.com (Mark Janikas) Date: Tue, 20 Feb 2007 17:47:19 -0800 Subject: [Numpy-discussion] Greek Letters In-Reply-To: <45DBA213.9050805@gmail.com> Message-ID: <627102C921CD9745B070C3B10CB8199B010EBFBF@hardwire.esri.com> Great info. The last part about the sys.stdout not working everywhere is a bit troubling. But For now, I am just gonna add a try statement as stated in the previous message. Thanks again, MJ PS... The software company I work for has a version that runs on Unix/Linux so I am awaiting a new laptop with which to employ Linux! I cant wait to get back to my roots. Debian and Fluxbox.... could there be a better working environment? -----Original Message----- From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Robert Kern Sent: Tuesday, February 20, 2007 5:36 PM To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] Greek Letters Mark Janikas wrote: > Thanks Robert.... but alas, I get..... > >>>> import sys >>>> sys.stdout.encoding > 'cp437' >>>> print u'\u03a7\u00b2'.encode(sys.stdout.encoding) > Traceback (most recent call last): > File "", line 1, in ? > File "C:\Python24\lib\encodings\cp437.py", line 18, in encode > return codecs.charmap_encode(input,errors,encoding_map) > UnicodeEncodeError: 'charmap' codec can't encode character u'\u03a7' in position > 0: character maps to > > > Ill keep at it.... please let me know if you have any solutions.... Yup, CP437 doesn't support the Χ character. You will have to use a terminal that can accept a full-Unicode encoding like UTF-8 and a font that has the relevant characters. Most of the modern terminal emulators for Linux et al. are capable of UTF-8. On Windows, you may be out of luck. I don't know of any fully-Unicode-capable terminal.
I also ran into a case where sys.stdout.encoding is None, so my solution is not as robust as I thought. If you use something else for your interpreter shell like IDLE, it is possible that it supports Unicode. sys.stdout.encoding may not be set appropriately, though. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco _______________________________________________ Numpy-discussion mailing list Numpy-discussion at scipy.org http://projects.scipy.org/mailman/listinfo/numpy-discussion From strawman at astraw.com Tue Feb 20 20:51:54 2007 From: strawman at astraw.com (Andrew Straw) Date: Tue, 20 Feb 2007 17:51:54 -0800 Subject: [Numpy-discussion] Greek Letters In-Reply-To: <45DBA213.9050805@gmail.com> References: <627102C921CD9745B070C3B10CB8199B010EBFBD@hardwire.esri.com> <45DBA213.9050805@gmail.com> Message-ID: <45DBA5BA.3080004@astraw.com> Robert Kern wrote: > On Windows, you may be out of luck. I don't know of any > fully-Unicode-capable terminal. The lack of a decent console application is one of the most problematic issues I face whenever attempting to do serious programming in Windows. I wish I knew of a better terminal program. Here's one that seems like it might work, but I haven't tried it yet: http://software.jessies.org/terminator From david at ar.media.kyoto-u.ac.jp Tue Feb 20 21:50:11 2007 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Wed, 21 Feb 2007 11:50:11 +0900 Subject: [Numpy-discussion] Distributing prebuilt numpy and other extensions In-Reply-To: References: Message-ID: <45DBB363.2060706@ar.media.kyoto-u.ac.jp> Zachary Pincus wrote: > Hello folks, > > I've developed some command-line tools for biologists using python/ > numpy and some custom C and Fortran extensions, and I'm trying to > figure out how to easily distribute them... > > For people using linux, I figure a source distribution is no problem > at all. (Right?) If they already have dependencies, and can easily install blas/lapack, then it is indeed relatively easy. Incidentally, I am working on a system to build a complete self-contained numpy/scipy from sources, including LAPACK/BLAS/ATLAS if necessary, for reasons similar to you. On linux at least, I still think source distribution is the most reliable (compiler version mismatch, etc... makes it really difficult to distribute binaries across several distributions). > On the other hand, for Mac users (whose computers by default don't > have the dev tools, and even then would need to get a fortran > compiler elsewhere) I'd like to figure out something a bit easier. > I don't know much about Mac OS X, but PyMC distributes binaries for intel and ppc Mac in a way similar to what you are looking for: http://trichech.us/?page_id=3 Unfortunately, the build scripts are not included in the sources, but maybe the author of PyMC can give you some hints ? 
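[For the source-distribution route discussed above, the stock distutils commands already cover the basics; a minimal sketch, assuming plain distutils and nothing PyMC-specific:]

python setup.py sdist    # source tarball for Linux users to build themselves
python setup.py bdist    # simple binary distribution for the current platform

bdist_mpkg, which comes up in the reply below, plugs a Mac OS X Installer.app target into this same setup.py mechanism.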
cheers, David From robert.kern at gmail.com Tue Feb 20 22:25:23 2007 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 20 Feb 2007 21:25:23 -0600 Subject: [Numpy-discussion] Distributing prebuilt numpy and other extensions In-Reply-To: <45DBB363.2060706@ar.media.kyoto-u.ac.jp> References: <45DBB363.2060706@ar.media.kyoto-u.ac.jp> Message-ID: <45DBBBA3.3040008@gmail.com> David Cournapeau wrote: > I don't know much about Mac OS X, but PyMC distributes binaries for > intel and ppc Mac in a way similar to what you are looking for: > > http://trichech.us/?page_id=3 Or rather http://trichech.us/?page_id=4 Unfortunately, those builds are currently broken. > Unfortunately, the build scripts are not included in the sources, but > maybe the author of PyMC can give you some hints ? Primarily, you just use bdist_mpkg to generate the Installer.app packages. http://cheeseshop.python.org/pypi/bdist_mpkg/ -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From gnurser at googlemail.com Wed Feb 21 08:26:35 2007 From: gnurser at googlemail.com (George Nurser) Date: Wed, 21 Feb 2007 13:26:35 +0000 Subject: [Numpy-discussion] New release of pycdf package ported to NumPy In-Reply-To: <1171314114.6259.4.camel@maboule> References: <1171314114.6259.4.camel@maboule> Message-ID: <1d1e6ea70702210526k2130039wfb55fcc93ff6d33c@mail.gmail.com> Hi Andre, I've downloaded bpycdf and it works very nicely with numpy; thanks very much for all your effort. One small problem; I'm probably being stupid, but I cannot see how to set a _Fillvalue as Float32. regards, George Nurser. On 12/02/07, Andre Gosselin wrote: > A small error slipped into the pycdf-0.6-3 release. Attribute deletion > through del(), and dimension length inquiry through len() were missing > in that release. > > A new pycdf-0.6.3b fixes those problems. I have withdrawn pycdf-0.6.3 > from Sourceforge.net . Those people who have already downloaded this > release can safely continue to use it,if they do not mind missing the > del() and len() features. > > Reagrds, > Andre Gosselin > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From toon.knapen at fft.be Wed Feb 21 09:56:03 2007 From: toon.knapen at fft.be (Toon Knapen) Date: Wed, 21 Feb 2007 15:56:03 +0100 Subject: [Numpy-discussion] installation documentation Message-ID: <45DC5D83.70603@fft.be> Hi all, Is there detailed info on the installation process available. I'm asking because in addition to installing numpy on linux-x86, I'm also trying to install numpy on aix-power and windows-x86. So before bombarding the ml with questions, I would like to get my hands on all doc available (I already have the 'Guide to Numpy' of sept. 15 2006 but can not find all the info I'm looking for in there neither) Thanks in advance, Toon Knapen From lxander.m at gmail.com Wed Feb 21 12:39:59 2007 From: lxander.m at gmail.com (Alexander Michael) Date: Wed, 21 Feb 2007 12:39:59 -0500 Subject: [Numpy-discussion] Managing Rolling Data Message-ID: <525f23e80702210939r1ed80363q789feb8626de0461@mail.gmail.com> I'm new to numpy and looking for advice on setting up and managing array data for my particular problem. I'm collecting observations of P properties for N objects over a rolling horizon of H sample times. 
I could conceptually store the data in a three-dimensional array with shape (N,P,H) that would allow me to easily (and efficiently with strided slices) compute the statistics over both N and H that I am interested in. This is great, but the rub is that H, an interval of T, is a rolling horizon. T is too large to fit in memory, so I need to load up H, perform my calculations, pop the oldest N x P slice and push the newest N x P slice into the data cube. What's the best way to do this that will maintain fast computations along the one-dimensional slices over N and H? Is there a commonly accepted idiom? Fundamentally, I see two solutions. The first would be to essentially perform a memcpy to propagate the data. The second would be to manage the N x P slices as H discontiguous memory blocks and merely reorder the pointers with each new sample. Can I do either of these with numpy? Thanks, Alex From Chris.Barker at noaa.gov Wed Feb 21 12:52:35 2007 From: Chris.Barker at noaa.gov (Chris Barker) Date: Wed, 21 Feb 2007 09:52:35 -0800 Subject: [Numpy-discussion] Greek Letters In-Reply-To: <45DBA5BA.3080004@astraw.com> References: <627102C921CD9745B070C3B10CB8199B010EBFBD@hardwire.esri.com> <45DBA213.9050805@gmail.com> <45DBA5BA.3080004@astraw.com> Message-ID: <45DC86E3.4080501@noaa.gov> Andrew Straw wrote: > Here's one that seems like > it might work, but I haven't tried it yet: > http://software.jessies.org/terminator Now if only there was a decent terminal emulator for Windows that didn't use cygwin... -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From Chris.Barker at noaa.gov Wed Feb 21 13:14:15 2007 From: Chris.Barker at noaa.gov (Chris Barker) Date: Wed, 21 Feb 2007 10:14:15 -0800 Subject: [Numpy-discussion] [Matplotlib-users] [matplotlib-devel] Unifying numpy, scipy, and matplotlib docstring formats In-Reply-To: References: <45D4ED02.1050908@noaa.gov> <45D50245.1080102@noaa.gov> <1e2af89e0702160148u6f3b4ce4re11423f4ca9a35ec@mail.gmail.com> Message-ID: <45DC8BF7.6040609@noaa.gov> There's probably a better forum for this conversation, but... Barry Wark wrote: > Perhaps we should consider two use cases: interactive use ala Matlab > and larger code bases. A couple key points -- yes, interactive use is different than larger code bases, but I think it's a "Bad Idea" to promote totally different coding styles for these cases for a couple reasons: 1) One usually is doing both at once. I was a long-time, everyday Matlab user, and hardly did anything of consequence interactively. I learned very quickly that it made a whole lot more sense to write a five line script that I could save, edit, etc. than do stuff interactively. Once I got something working, parts of that five line script might get cut&pasted into "real" code. I do still test one or two lines interactively, but even then, I want the style to be something I can put in my code. 2) consistency in docs and examples is important, recommending different styles for interactive and programming use is just going to confuse people more. 3) even for folks that do a lot of interactive use, they are likely to write larger scale code at some point, and then they would need to learn something new. > In the first case, being able to import * saves > a lot of typing No, it saves a little typing, if you're using an OOO style anyway.
> and the namespace polution problem isn't a big deal. Yes, it can be. A good interactive environment will be able to do things like method and command completion -- namespace pollution keeps that from working well. > Returning to the OP's questions, why couldn't both cases be helped by > creating a "meta-package" for numpy, scipy, and matplotlib? For the > sake of argument, lets call the package "plab". Existing code could be > affected by changing the individual packages, but a package that > essentially does > > from pylab import * > from numpy import * > from scipy import * The issue with this is that you've now hidden where things are coming from. People seeing examples using that package will have no idea where things come from. and by the way, the current "pylab", as delivered with MPL, pretty much does this already. I think we need to move away from that, rather than putting even more into pylab. Matthew Brett wrote: > The downside about making numpy / python like matlab is that > you soon realize that you really have to think about your problems > differently, and write code in a different way. Good point. A part of good Pythonic code is namespaces and OOO style. New users might as well learn the whole pile at once. That all being said, it would be nice to establish a standard convention for how to import the key packages. I use: import numpy as N import matplotlib as MPL But I don't really care that much, if we can come to any kind of community consensus, I'll follow it. The goal would be for all docs, Wiki entries, examples on the mailing lists, etc. to use the same style. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From wollez at gmx.net Wed Feb 21 13:11:31 2007 From: wollez at gmx.net (WolfgangZillig) Date: Wed, 21 Feb 2007 19:11:31 +0100 Subject: [Numpy-discussion] what goes wrong with cos(), sin() Message-ID: Hi, I'm quite new to numpy/scipy so please excuse if my problem is too obvious. example code: import numpy as n print n.sin(n.pi) print n.cos(n.pi/2.0) results in: 1.22460635382e-016 6.12303176911e-017 I've expected something around 0. Can anybody explain what I am doing wrong here? From peridot.faceted at gmail.com Wed Feb 21 13:16:32 2007 From: peridot.faceted at gmail.com (Anne Archibald) Date: Wed, 21 Feb 2007 13:16:32 -0500 Subject: [Numpy-discussion] Managing Rolling Data In-Reply-To: <525f23e80702210939r1ed80363q789feb8626de0461@mail.gmail.com> References: <525f23e80702210939r1ed80363q789feb8626de0461@mail.gmail.com> Message-ID: On 21/02/07, Alexander Michael wrote: > I'm new to numpy and looking for advice on setting up and managing > array data for my particular problem. I'm collecting observations of P > properties for N objects over a rolling horizon of H sample times. I > could conceptually store the data in three-dimensional array with > shape (N,P,H) that would allow me to easily (and efficiently with > strided slices) compute the statistics over both N and H that I am > interested in. This is great, but the rub is that H, an interval of T, > is a rolling horizon. T is to large to fit in memory, so I need to > load up H, perform my calculations, pop the oldest N x P slice and > push the newest N x P slice into the data cube. What's the best way to > do this that will maintain fast computations along the one-dimensional > slices over N and H? 
Is there a commonly accepted idiom? > > Fundamentally, I see two solutions. The first would be to essentially > perform a memcpy to propagate the data. The second would be to manage > the N x P slices as H discontiguous memory blocks and merely reorder > the pointers with each new sample. Can I do either of these with > numpy? Yes, and several other possibilities as well. To do a memcpy, all you need is buffer[...,:-1] = buffer[...,1:] buffer[...,-1] = new_data() somecomputation(buffer[13,5,:]) Discontiguous blocks are somewhat inconvenient; one of the key assumptions of numpy is that memory is stored in contiguous, homogeneous blocks. You can use python lists (which are lists of pointers), though: listofbuffers = listofbuffers[:-1]+(new_data(),) Extracting slices is now more awkward, either somecomputation([buffer[13,5] for buffer in listofbuffers]) or convert the list to an array (which involves copying all the elements): buffer = array(listofbuffers) somecomputation(buffer[13,5,:]) Alternatively, if most of your statistics don't care about the order of the data, you could maintain a rolling buffer: buffer[...,oldest] = new_data() oldest += 1 oldest %= H somecomputationwhereorderdoesntmatter(buffer[13,6,:]) somecomputation(concatenate((buffer[13,5,oldest:],buffer[13,5,:oldest]))) (this last copies the data for that one test.) Fancy indexing can also be used here to pull out the elements in the right order, and if your statistics can be easily rewritten to work on a wrapped array, this is probably the most efficient. Incidentally, if the array wants to be inhomogeneous along one dimension, you can use recarrays (apparently; I've never investigated them). Good luck, Anne From charlesr.harris at gmail.com Wed Feb 21 13:25:50 2007 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 21 Feb 2007 11:25:50 -0700 Subject: [Numpy-discussion] what goes wrong with cos(), sin() In-Reply-To: References: Message-ID: On 2/21/07, WolfgangZillig wrote: > > Hi, > > I'm quite new to numpy/scipy so please excuse if my problem is too > obvious. > > example code: > > import numpy as n > print n.sin(n.pi) > print n.cos(n.pi/2.0) > > results in: > 1.22460635382e-016 > 6.12303176911e-017 > > I've expected something around 0. Can anybody explain what I am doing > wrong here? They *are* around zero. You are seeing rounding error, probably both in the value of pi and the approximations used to evaluate the sin and cos, most likely some rational min/max fit. I recall teaching PDEs to students who used Mathematica and they had the same sort of problem because terms involving sin(pi) didn't go exactly to zero. Made their fourier series not quite match the text answers. If you want these sort of terms to be exact you will have to use some sort of symbolic program. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Wed Feb 21 13:26:39 2007 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 21 Feb 2007 12:26:39 -0600 Subject: [Numpy-discussion] what goes wrong with cos(), sin() In-Reply-To: References: Message-ID: <45DC8EDF.7000209@gmail.com> WolfgangZillig wrote: > Hi, > > I'm quite new to numpy/scipy so please excuse if my problem is too obvious. > > example code: > > import numpy as n > print n.sin(n.pi) > print n.cos(n.pi/2.0) > > results in: > 1.22460635382e-016 > 6.12303176911e-017 > > I've expected something around 0. Can anybody explain what I am doing > wrong here? Nothing. 
You are getting something around 0 to the limit of double-precision floating point. numpy.pi cannot be exactly π, but within about 1e-16 of it. That imprecision carries through the computations. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From zpincus at stanford.edu Wed Feb 21 13:26:58 2007 From: zpincus at stanford.edu (Zachary Pincus) Date: Wed, 21 Feb 2007 10:26:58 -0800 Subject: [Numpy-discussion] what goes wrong with cos(), sin() In-Reply-To: References: Message-ID: <42A22E13-899B-45B8-A447-AC09587BF866@stanford.edu> Your results are indeed around zero. >>> numpy.allclose(0, 1.22460635382e-016) True It's not exactly zero because floating point math is in general not exact. You'll need to check out a reference about doing floating point operations numerically for more details, but in general you should not expect exact results due to the limited precision of any fixed-width digital representation of floats. A corollary: in general do not compare two floating-point values for equality -- use something like numpy.allclose. (Exception -- equality is expected if the exact sequence of operations to generate two numbers were identical.) Zach Pincus Program in Biomedical Informatics and Department of Biochemistry Stanford University School of Medicine On Feb 21, 2007, at 10:11 AM, WolfgangZillig wrote: > Hi, > > I'm quite new to numpy/scipy so please excuse if my problem is too > obvious. > > example code: > > import numpy as n > print n.sin(n.pi) > print n.cos(n.pi/2.0) > > results in: > 1.22460635382e-016 > 6.12303176911e-017 > > I've expected something around 0. Can anybody explain what I am doing > wrong here? > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From peridot.faceted at gmail.com Wed Feb 21 13:27:16 2007 From: peridot.faceted at gmail.com (Anne Archibald) Date: Wed, 21 Feb 2007 13:27:16 -0500 Subject: [Numpy-discussion] what goes wrong with cos(), sin() In-Reply-To: References: Message-ID: On 21/02/07, WolfgangZillig wrote: > Hi, > > I'm quite new to numpy/scipy so please excuse if my problem is too obvious. > > example code: > > import numpy as n > print n.sin(n.pi) > print n.cos(n.pi/2.0) > > results in: > 1.22460635382e-016 > 6.12303176911e-017 > > I've expected something around 0. Can anybody explain what I am doing > wrong here? Well, nothing. It is around zero. Try (without numpy) >>> (1.+1e-16)-1. 0.0 That is, for floating-point numbers, 1e-16 is about the fractional error you can expect from any calculation, and since pi is of order unity, you should expect its representation to have about that big an error on it; feed that through sin and you get an error of about the same size. Or, to see more clearly, try taking (on a pocket calculator, say) sin(3.14) (or even sin(pi)). Roundoff error is a basic fact of life in numerical computations.
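[A short check of the scale involved, using numpy.finfo (available in NumPy 1.0-era releases); the exact printed digits are platform-dependent:]

import numpy

# Spacing of double-precision floats near 1.0, about 2.22e-16.
print numpy.finfo(float).eps

# The residual of sin at the double nearest pi is the same order of
# magnitude: it is essentially the gap between numpy.pi and the true pi.
print numpy.sin(numpy.pi)                       # ~1.22e-16
print numpy.allclose(numpy.sin(numpy.pi), 0.0)  # True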
Anne From peridot.faceted at gmail.com Wed Feb 21 13:29:24 2007 From: peridot.faceted at gmail.com (Anne Archibald) Date: Wed, 21 Feb 2007 13:29:24 -0500 Subject: [Numpy-discussion] what goes wrong with cos(), sin() In-Reply-To: <42A22E13-899B-45B8-A447-AC09587BF866@stanford.edu> References: <42A22E13-899B-45B8-A447-AC09587BF866@stanford.edu> Message-ID: On 21/02/07, Zachary Pincus wrote: > A corrolary: in general do not two floating-point values for equality > -- use something like numpy.allclose. (Exception -- equality is > expected if the exact sequence of operations to generate two numbers > were identical.) I really can't agree that blindly using allclose() is a good idea. For example, allclose(plancks_constant,0) and the difference leads to quantum mechanics... you really have to know how much difference you expect and how big your numbers are going to be. Anne From wollez at gmx.net Wed Feb 21 13:36:16 2007 From: wollez at gmx.net (WolfgangZillig) Date: Wed, 21 Feb 2007 19:36:16 +0100 Subject: [Numpy-discussion] what goes wrong with cos(), sin() In-Reply-To: References: Message-ID: Anne Archibald schrieb: > On 21/02/07, WolfgangZillig wrote: >> Hi, >> >> I'm quite new to numpy/scipy so please excuse if my problem is too obvious. >> >> example code: >> >> import numpy as n >> print n.sin(n.pi) >> print n.cos(n.pi/2.0) >> >> results in: >> 1.22460635382e-016 >> 6.12303176911e-017 >> >> I've expected something around 0. Can anybody explain what I am doing >> wrong here? > > Well, nothing. It is around zero. Try (without numpy) >>>> (1.+1e-16)-1. > 0.0 > That is, for floating-point numbers, 1e-16 is about the fractional > error you can expect from any calculation, and since pi is of order > unity, you should expect its representation to have about that big an > error on it; feed that through sin and you get an error of about the > same size. > > Or, to see more clearly, try taking (on a pocket calculator, say) > sin(3.14) (or even sin(pi)). Roundoff error is a basic fact of life in > numerical computations. > > Anne Thanks for your answers. I was already aware that it won't be exactly 0 but it wasn't clear to me that the rounding precision is around 1e-16. Thanks for your help! Wolfgang From zpincus at stanford.edu Wed Feb 21 13:43:57 2007 From: zpincus at stanford.edu (Zachary Pincus) Date: Wed, 21 Feb 2007 10:43:57 -0800 Subject: [Numpy-discussion] what goes wrong with cos(), sin() In-Reply-To: References: <42A22E13-899B-45B8-A447-AC09587BF866@stanford.edu> Message-ID: <5D401631-E2FE-41B6-A65A-2CEFA2DE1A31@stanford.edu> It's true -- blindly using allclose isn't a lot better than blindly using equality testing. (Though given the choice between blindly using one and blindly using the other, I'd still probably vote for allclose... it won't get you quantum mechanics, but it'll do fine for a lot of other things.) On the other hand, *properly* using allclose (e.g. setting the absolute and expected relative error tolerances) is better than properly using equality testing because in many cases there is no proper use for equality testing. Zach On Feb 21, 2007, at 10:29 AM, Anne Archibald wrote: > On 21/02/07, Zachary Pincus wrote: > >> A corrolary: in general do not two floating-point values for equality >> -- use something like numpy.allclose. (Exception -- equality is >> expected if the exact sequence of operations to generate two numbers >> were identical.) > > I really can't agree that blindly using allclose() is a good idea. 
For > example, allclose(plancks_constant,0) and the difference leads to > quantum mechanics... you really have to know how much difference you > expect and how big your numbers are going to be. > > Anne > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From mike.ressler at alum.mit.edu Wed Feb 21 14:13:26 2007 From: mike.ressler at alum.mit.edu (Mike Ressler) Date: Wed, 21 Feb 2007 11:13:26 -0800 Subject: [Numpy-discussion] Managing Rolling Data In-Reply-To: <525f23e80702210939r1ed80363q789feb8626de0461@mail.gmail.com> References: <525f23e80702210939r1ed80363q789feb8626de0461@mail.gmail.com> Message-ID: <268febdf0702211113h6ba7a1a3j76d434c7429db44b@mail.gmail.com> On 2/21/07, Alexander Michael wrote: > ... T is to large to fit in memory, so I need to > load up H, perform my calculations, pop the oldest N x P slice and > push the newest N x P slice into the data cube. What's the best way to > do this that will maintain fast computations along the one-dimensional > slices over N and H? Is there a commonly accepted idiom? Would loading your data via memmap, then slicing it, do your job (using numpy.memmap)? I work on 12 GB files with 4 GB of memory, but it is transparent to me since the OS takes care of moving data in and out of memory. May not be the fastest solution possible, but for me it is a case where dev time is more significant than run time. Mike -- mike.ressler at alum.mit.edu From David.L.Goldsmith at noaa.gov Wed Feb 21 14:24:19 2007 From: David.L.Goldsmith at noaa.gov (David Goldsmith) Date: Wed, 21 Feb 2007 11:24:19 -0800 Subject: [Numpy-discussion] what goes wrong with cos(), sin() In-Reply-To: References: Message-ID: <45DC9C63.2080408@noaa.gov> As far as a computer is concerned, those numbers are "around" zero - "growing-up" w/ Matlab, e.g., one quickly learns to recognize these numbers for what they are. One way to return zero for numbers like these is if numpy.allclose(x, 0): return 0 (or 0*x to assure that 0 is the same type as x), but caveat emptor: sometimes, of course, 1e-16 really is supposed to be 1e-16, not just the best the algorithm can do to get to zero. Also, (help me out here guys) I thought there was something like zeroifclose(x) which does the above, or does numpy only have realifclose to return a real when an imaginary part is close to zero? DG WolfgangZillig wrote: > Hi, > > I'm quite new to numpy/scipy so please excuse if my problem is too obvious. > > example code: > > import numpy as n > print n.sin(n.pi) > print n.cos(n.pi/2.0) > > results in: > 1.22460635382e-016 > 6.12303176911e-017 > > I've expected something around 0. Can anybody explain what I am doing > wrong here? 
> > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From lxander.m at gmail.com Wed Feb 21 14:46:51 2007 From: lxander.m at gmail.com (Alexander Michael) Date: Wed, 21 Feb 2007 14:46:51 -0500 Subject: [Numpy-discussion] Managing Rolling Data In-Reply-To: <268febdf0702211113h6ba7a1a3j76d434c7429db44b@mail.gmail.com> References: <525f23e80702210939r1ed80363q789feb8626de0461@mail.gmail.com> <268febdf0702211113h6ba7a1a3j76d434c7429db44b@mail.gmail.com> Message-ID: <525f23e80702211146s441cdcedn4eca9c573646eba4@mail.gmail.com> On 2/21/07, Mike Ressler wrote: > Would loading your data via memmap, then slicing it, do your job > (using numpy.memmap)? ... Interesting idea. I think Anne's suggestion that sliced assignment will reduce to an efficient memcpy fits my needs a bit better than memmap because I'll be pushing new N x P samples into the array that will be arriving while the monitor is running. Actually, I'm hoping sliced self-assignment is as efficient as memmove (i.e. without creating temporaries), since the dst and src are overlapping, but I haven't tried it yet to confirm if this is relatively efficient. Thank you both for your ideas, Alex From David.L.Goldsmith at noaa.gov Wed Feb 21 14:52:39 2007 From: David.L.Goldsmith at noaa.gov (David Goldsmith) Date: Wed, 21 Feb 2007 11:52:39 -0800 Subject: [Numpy-discussion] what goes wrong with cos(), sin() In-Reply-To: References: <42A22E13-899B-45B8-A447-AC09587BF866@stanford.edu> Message-ID: <45DCA307.9050301@noaa.gov> Anne Archibald wrote: > On 21/02/07, Zachary Pincus wrote: > > >> A corrolary: in general do not two floating-point values for equality >> -- use something like numpy.allclose. (Exception -- equality is >> expected if the exact sequence of operations to generate two numbers >> were identical.) >> > > I really can't agree that blindly using allclose() is a good idea. For > example, allclose(plancks_constant,0) and the difference leads to > quantum mechanics... you really have to know how much difference you > expect and how big your numbers are going to be. > > Anne > "Precisely!" ;-) Last time we had a posting like this, didn't one of the respondents include a link to something within the numpy Web docs that talks about floating point precision? DG > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From Chris.Barker at noaa.gov Wed Feb 21 14:54:14 2007 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Wed, 21 Feb 2007 11:54:14 -0800 Subject: [Numpy-discussion] what goes wrong with cos(), sin() In-Reply-To: References: Message-ID: <45DCA366.1030104@noaa.gov> Anne Archibald wrote: > Or, to see more clearly, try taking (on a pocket calculator, say) > sin(3.14) (or even sin(pi)). This is an interesting point. I took a class from William Kahan once (pass/fail, thank god!), and one question he posed to us was: How many digits of pi is used in an HP calculator? I never figured out the answer myself, and the question involved the info that certain calculations involving pi being accurate to a certain degree. I don't have my HP calculator with me right now but I suspect that sin(pi) might well evaluate to exactly zero, or closer than 1e-16 anyway. I wonder if there are any C math libs that do a better job than you'd expect from standard FP? 
(short of unlimited precision ones) -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From oliphant at ee.byu.edu Wed Feb 21 14:55:20 2007 From: oliphant at ee.byu.edu (Travis Oliphant) Date: Wed, 21 Feb 2007 12:55:20 -0700 Subject: [Numpy-discussion] installation documentation In-Reply-To: <45DC5D83.70603@fft.be> References: <45DC5D83.70603@fft.be> Message-ID: <45DCA3A8.6050103@ee.byu.edu> Toon Knapen wrote: >Hi all, > >Is there detailed info on the installation process available. > >I'm asking because in addition to installing numpy on linux-x86, I'm >also trying to install numpy on aix-power and windows-x86. So before >bombarding the ml with questions, I would like to get my hands on all >doc available (I already have the 'Guide to Numpy' of sept. 15 2006 but >can not find all the info I'm looking for in there neither) > > > If you don't care about using "vendor-specific math libraries" then it's as easy as python setup.py install on linux and aix (it's also that easy on Windows if you have the right compiler and/or configuration file set up). If you care about vendor specific libraries, then you have to either put them in a "known" place or write a site.cfg file. (We should document where the "known" places are...) The installation process uses distutils and so information about distutils that you can glean from other places is also relevant. Ask on the mailing list for more specifics. -Travis From oliphant at ee.byu.edu Wed Feb 21 14:59:27 2007 From: oliphant at ee.byu.edu (Travis Oliphant) Date: Wed, 21 Feb 2007 12:59:27 -0700 Subject: [Numpy-discussion] Managing Rolling Data In-Reply-To: References: <525f23e80702210939r1ed80363q789feb8626de0461@mail.gmail.com> Message-ID: <45DCA49F.7050005@ee.byu.edu> Anne Archibald wrote: >Discontiguous blocks are somewhat inconvenient; one of the key >assumptions of numpy is that memory is stored in contiguous, >homogeneous blocks. > Not to add anything really useful to this discussion, but I should correct this wording before it gives incorrect conceptions. Actually the key assumption NumPy makes is that memory is accessible through uniform "striding" and not that it is necessarily contiguous. You are correct, however, that each element of the array must be of the same "data-type." >Incidentally, if the array wants to be inhomogeneous along one >dimension, you can use recarrays (apparently; I've never investigated >them). > > Your arrays are still "homogeneous" when you use record arrays (i.e. each element of the array is still the same "data-type") It's just that the data-type can be complicated. You can have arrays whose elements consist of a 4-byte float and a 3-byte string for example. -Travis From robert.kern at gmail.com Wed Feb 21 15:08:44 2007 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 21 Feb 2007 14:08:44 -0600 Subject: [Numpy-discussion] what goes wrong with cos(), sin() In-Reply-To: <45DCA366.1030104@noaa.gov> References: <45DCA366.1030104@noaa.gov> Message-ID: <45DCA6CC.8060906@gmail.com> Christopher Barker wrote: > I wonder if there are any C math libs that do a better job than you'd > expect from standard FP? (short of unlimited precision ones) With respect to π and the zeros of sin() and cos()? Not really. If numpy.sin(numpy.pi) were to give you 0.0, it would be *wrong*.
numpy.sin() is supposed to give you the most accurate result representable in double-precision for the input you gave it. numpy.pi is not π. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From David.L.Goldsmith at noaa.gov Wed Feb 21 15:27:02 2007 From: David.L.Goldsmith at noaa.gov (David Goldsmith) Date: Wed, 21 Feb 2007 12:27:02 -0800 Subject: [Numpy-discussion] what goes wrong with cos(), sin() In-Reply-To: <45DCA6CC.8060906@gmail.com> References: <45DCA366.1030104@noaa.gov> <45DCA6CC.8060906@gmail.com> Message-ID: <45DCAB16.6090304@noaa.gov> Robert Kern wrote: > Christopher Barker wrote: > >> I wonder if there are any C math libs that do a better job than you'd >> expect from standard FP? (short of unlimited precision ones) >> > > With respect to π and the zeros of sin() and cos()? Not really. If > numpy.sin(numpy.pi) were to give you 0.0, it would be *wrong*. numpy.sin() is > supposed to give you the most accurate result representable in double-precision > for the input you gave it. numpy.pi is not π. > > True, but since the string literal "numpy.pi" is not the same as the string literal "3.1415926535897931" there's no reason in principle why "numpy.pi" couldn't symbolically represent the exact value; of course, since numpy does not otherwise support symbolic computation, such an adoption would be frivolous and rather silly (but then the Pythons were never afraid of being silly). :-) DG From Chris.Barker at noaa.gov Wed Feb 21 15:27:56 2007 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Wed, 21 Feb 2007 12:27:56 -0800 Subject: [Numpy-discussion] what goes wrong with cos(), sin() In-Reply-To: <45DCA6CC.8060906@gmail.com> References: <45DCA366.1030104@noaa.gov> <45DCA6CC.8060906@gmail.com> Message-ID: <45DCAB4C.2060805@noaa.gov> Robert Kern wrote: > Christopher Barker wrote: >> I wonder if there are any C math libs that do a better job than you'd >> expect from standard FP? (short of unlimited precision ones) > > With respect to π and the zeros of sin() and cos()? Not really. If > numpy.sin(numpy.pi) were to give you 0.0, it would be *wrong*. numpy.sin() is > supposed to give you the most accurate result representable in double-precision > for the input you gave it. But does it? > numpy.pi is not π. More precisely, it's the best IEEE754 64 bit FP approximation of pi. Right. I think that was the trick that HP used -- they somehow stored and worked with pi with more digits. The things you can do if you're making dedicated hardware. I do wonder if there would be some way to use the extended precision built in to Intel FP hardware -- i.e. have a pi that you can pass in that has the full 80 bits that can be used internally. I don't know if the trig functions can be done with extended precision though. -Chris -- Christopher Barker, Ph.D.
Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From David.L.Goldsmith at noaa.gov Wed Feb 21 15:40:58 2007 From: David.L.Goldsmith at noaa.gov (David Goldsmith) Date: Wed, 21 Feb 2007 12:40:58 -0800 Subject: [Numpy-discussion] what goes wrong with cos(), sin() In-Reply-To: <45DCAB4C.2060805@noaa.gov> References: <45DCA366.1030104@noaa.gov> <45DCA6CC.8060906@gmail.com> <45DCAB4C.2060805@noaa.gov> Message-ID: <45DCAE5A.6010309@noaa.gov> I grew up a TI guy - my recollection is that they stated in the user manual that though the display could show "only" 10 decimal digits, memory saved and computations used 16; perhaps nowadays it is even more, but unless you're doing millions of sequential calculations (how often do you do that on a handheld scientific calculator?) you shouldn't be seeing cumulative error problems, right? DG Christopher Barker wrote: > Robert Kern wrote: > >> Christopher Barker wrote: >> >>> I wonder if there are any C math libs that do a better job than you'd >>> expect from standard FP? (short of unlimited precision ones) >>> >> With respect to ? and the zeros of sin() and cos()? Not really. If >> numpy.sin(numpy.pi) were to give you 0.0, it would be *wrong*. numpy.sin() is >> supposed to give you the most accurate result representable in double-precision >> for the input you gave it. >> > > But does it? > > >> numpy.pi is not ?. >> > > More precisely, it's the best IEEE754 64 bit FP approximation of pi. > > Right. I think that was the trick that HP used -- they somehow stored > and worked with pi with more digits. The things you can do if you're > making dedicated hardware. > > I do wonder if there would be some way to use the extended precision > built in to Intel FP hardware -- i.e. have a pi that you can pass in > that has the full 80 bits that can be used internally. I don't know if > the trig functions can be done with extended precision though. > > -Chris > > > From lxander.m at gmail.com Wed Feb 21 15:48:35 2007 From: lxander.m at gmail.com (Alexander Michael) Date: Wed, 21 Feb 2007 15:48:35 -0500 Subject: [Numpy-discussion] Managing Rolling Data In-Reply-To: <268febdf0702211113h6ba7a1a3j76d434c7429db44b@mail.gmail.com> References: <525f23e80702210939r1ed80363q789feb8626de0461@mail.gmail.com> <268febdf0702211113h6ba7a1a3j76d434c7429db44b@mail.gmail.com> Message-ID: <525f23e80702211248v6e6e7039xd2a70b5fb900dabd@mail.gmail.com> On 2/21/07, Mike Ressler wrote: > Would loading your data via memmap, then slicing it, do your job > (using numpy.memmap)? ... Interesting idea. I think Anne's suggestion that sliced assignment will reduce to an efficient memcpy fits my needs a bit better than memmap because I'll be pushing new N x P samples into the array that will be arriving while the monitor is running. Actually, I'm hoping sliced self-assignment is as efficient as memmove (i.e. without creating temporaries), since the dst and src are overlapping, but I haven't tried it yet to confirm if this is relatively efficient. 
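[A minimal timing sketch of the overlapping sliced assignment being discussed; the shapes, the loop count, and the use of time.clock are illustrative assumptions, not measurements from the thread:]

import time
import numpy

N, P, H = 100, 10, 1000
buf = numpy.zeros((N, P, H))
newest = numpy.zeros((N, P))

def roll_once(buf, newest):
    # Shift left along the time axis: with the oldest sample at index 0,
    # element i+1 overwrites element i, so each source element is read
    # before it is clobbered (the safe direction for this overlap).
    buf[..., :-1] = buf[..., 1:]
    # Write the newest N x P sample into the slot freed at the end.
    buf[..., -1] = newest

t0 = time.clock()
for i in range(100):
    roll_once(buf, newest)
print 'seconds per roll:', (time.clock() - t0) / 100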
Thank you both for your ideas, Alex From tim.hochberg at ieee.org Wed Feb 21 16:35:36 2007 From: tim.hochberg at ieee.org (Timothy Hochberg) Date: Wed, 21 Feb 2007 14:35:36 -0700 Subject: [Numpy-discussion] what goes wrong with cos(), sin() In-Reply-To: <45DCA366.1030104@noaa.gov> References: <45DCA366.1030104@noaa.gov> Message-ID: On 2/21/07, Christopher Barker wrote: > > Anne Archibald wrote: > > Or, to see more clearly, try taking (on a pocket calculator, say) > > sin(3.14) (or even sin(pi)). > > This is an interesting point. I took a class from William Kahan once > (pass/fail, thank god!), and one question he posed to us was: > > How many digits of pi are used in an HP calculator? > > I never figured out the answer myself, and the question involved the > info that certain calculations involving pi being accurate to a certain > degree. I don't have my HP calculator with me right now but I suspect > that sin(pi) might well evaluate to exactly zero, or closer than 1e-16 > anyway. Not on my HP calculator. However, mine is 20+ years old and the label has worn off so I'm not even sure what model it is any more (15c?). However, sin(pi) is -4.1e-10 on this calculator, FWIW. I wonder if there are any C math libs that do a better job than you'd > expect from standard FP? (short of unlimited precision ones) > > -Chris > > > > -- > Christopher Barker, Ph.D. > Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R (206) 526-6959 voice > 7600 Sand Point Way NE (206) 526-6329 fax > Seattle, WA 98115 (206) 526-6317 main reception > > Chris.Barker at noaa.gov > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -- //=][=\\ tim.hochberg at ieee.org -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Wed Feb 21 16:47:51 2007 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 21 Feb 2007 15:47:51 -0600 Subject: [Numpy-discussion] what goes wrong with cos(), sin() In-Reply-To: <45DCAB4C.2060805@noaa.gov> References: <45DCA366.1030104@noaa.gov> <45DCA6CC.8060906@gmail.com> <45DCAB4C.2060805@noaa.gov> Message-ID: <45DCBE07.7070809@gmail.com> Christopher Barker wrote: > Robert Kern wrote: >> Christopher Barker wrote: >>> I wonder if there are any C math libs that do a better job than you'd >>> expect from standard FP? (short of unlimited precision ones) >> With respect to π and the zeros of sin() and cos()? Not really. I'll back off on this a little bit. There are some approaches that will work; they're not floating point, but they're not really "symbolic" computation either. http://keithbriggs.info/xrc.html >> If >> numpy.sin(numpy.pi) were to give you 0.0, it would be *wrong*. numpy.sin() is >> supposed to give you the most accurate result representable in double-precision >> for the input you gave it. > > But does it? Not quite, it seems, but 0 is even farther from the correct answer, apparently: [sage-2.0-intelmac-i386-Darwin]$ ./sage ---------------------------------------------------------------------- | SAGE Version 2.0, Release Date: 2007-01-28 | | Type notebook() for the GUI, and license() for information. | ---------------------------------------------------------------------- sage: npi = RealNumber(3.1415926535897931, min_prec=53) sage: npi.sin() 0.00000000000000013634246772254977 sage: import numpy sage: numpy.sin(numpy.pi) 1.22460635382e-16 >> numpy.pi is not π.
> > More precisely, it's the best IEEE754 64 bit FP approximation of pi. > > Right. I think that was the trick that HP used -- they somehow stored > and worked with pi with more digits. The things you can do if you're > making dedicated hardware. > > I do wonder if there would be some way to use the extended precision > built in to Intel FP hardware -- i.e. have a pi that you can pass in > that has the full 80 bits that can be used internally. I don't know if > the trig functions can be done with extended precision though. Well, you can always use long double if it is implemented on your platform. You will have to construct a value for π yourself, though. I'm afraid that we don't really make that easy. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From peridot.faceted at gmail.com Wed Feb 21 17:03:11 2007 From: peridot.faceted at gmail.com (Anne Archibald) Date: Wed, 21 Feb 2007 17:03:11 -0500 Subject: [Numpy-discussion] Managing Rolling Data In-Reply-To: <525f23e80702211146s441cdcedn4eca9c573646eba4@mail.gmail.com> References: <525f23e80702210939r1ed80363q789feb8626de0461@mail.gmail.com> <268febdf0702211113h6ba7a1a3j76d434c7429db44b@mail.gmail.com> <525f23e80702211146s441cdcedn4eca9c573646eba4@mail.gmail.com> Message-ID: On 21/02/07, Alexander Michael wrote: > On 2/21/07, Mike Ressler wrote: > > Would loading your data via memmap, then slicing it, do your job > > (using numpy.memmap)? ... > > Interesting idea. I think Anne's suggestion that sliced assignment > will reduce to an efficient memcpy fits my needs a bit better than > memmap because I'll be pushing new N x P samples into the array that > will be arriving while the monitor is running. If you want a record of all of them on disk anyway, with careful management you can read from a file as it's being written, though you'll need some rather exotic numpy hackery to have an ever-growing array. It might actually be nice to have a little wrapper object for this sort of thing, as you can't be the only one who needs it. > Actually, I'm hoping sliced self-assignment is as efficient as memmove > (i.e. without creating temporaries), since the dst and src are > overlapping, but I haven't tried it yet to confirm if this is > relatively efficient. I think it is almost as efficient as memmove; in particular, it doesn't create any temporaries (be careful which way you do the assignment, in fact, or you'll have an array of all the same thing) but it does use a for loop in C (maybe several) instead of a highly-optimized C library bulk copy. (I'm not sure about this - it could be special-cased in the assignment code, but I don't think it is.) Anne From peridot.faceted at gmail.com Wed Feb 21 17:09:39 2007 From: peridot.faceted at gmail.com (Anne Archibald) Date: Wed, 21 Feb 2007 17:09:39 -0500 Subject: [Numpy-discussion] what goes wrong with cos(), sin() In-Reply-To: <45DCBE07.7070809@gmail.com> References: <45DCA366.1030104@noaa.gov> <45DCA6CC.8060906@gmail.com> <45DCAB4C.2060805@noaa.gov> <45DCBE07.7070809@gmail.com> Message-ID: On 21/02/07, Robert Kern wrote: > Well, you can always use long double if it is implemented on your platform. You > will have to construct a value for π yourself, though. I'm afraid that we don't > really make that easy.
If the trig functions are implemented at all, you can probably use atan2(-1,0) and get a decent approximation; alternatively, if you want to use sin(pi)=0 as a definition you could use scipy's bisection routine (which is datatype-independent, IIRC) to find pi. Or you could probably use a rational approximation to pi, then convert numerator and denominator to long doubles, or better yet compare the rational number with your long double approximation of pi. But no, not particularly easy. Anne From tim.hochberg at ieee.org Wed Feb 21 17:20:25 2007 From: tim.hochberg at ieee.org (Timothy Hochberg) Date: Wed, 21 Feb 2007 15:20:25 -0700 Subject: [Numpy-discussion] Managing Rolling Data In-Reply-To: References: <525f23e80702210939r1ed80363q789feb8626de0461@mail.gmail.com> <268febdf0702211113h6ba7a1a3j76d434c7429db44b@mail.gmail.com> <525f23e80702211146s441cdcedn4eca9c573646eba4@mail.gmail.com> Message-ID: If none of the suggested methods turn out to be efficient enough due to copying overhead, here's a way to reduce the copying overhead by trading memory (and a bit of complexity) for copying overhead. The general thrust is to allocate M extra slices of memory and then shift the data every M time slices instead of every time slice. First you would allocate a block of memory N*P*(H+M) in size. >>> buffer = zeros([H+M,N,P], float) Then you'd look at the first H time slices. >>> data = buffer[:H] Top pop one piece of data off the stack you'd simply shift data to look at a different place in the buffer. The first time, you'd have something like this: >>> data = buffer[1:1+H] Every M time steps you need to recopy the data. I expect that this should reduce your copying overhead a bunch since your not copying as frequently. It's pretty tunable too. You'd want to wrap some convenience functions around stuff to automate the copying and popping, but that should be easy enough. I haven't tried this though, so caveat emptor. -tim -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Wed Feb 21 18:24:47 2007 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 21 Feb 2007 16:24:47 -0700 Subject: [Numpy-discussion] what goes wrong with cos(), sin() In-Reply-To: <45DCBE07.7070809@gmail.com> References: <45DCA366.1030104@noaa.gov> <45DCA6CC.8060906@gmail.com> <45DCAB4C.2060805@noaa.gov> <45DCBE07.7070809@gmail.com> Message-ID: On 2/21/07, Robert Kern wrote: > > Christopher Barker wrote: > > Robert Kern wrote: > >> Christopher Barker wrote: > >>> I wonder if there are any C math libs that do a better job than you'd > >>> expect from standard FP? (short of unlimited precision ones) > >> With respect to ? and the zeros of sin() and cos()? Not really. Well, you can always use long double if it is implemented on your platform. > You > will have to construct a value for ? yourself, though. I'm afraid that we > don't > really make that easy. > > -- pi = 3. 1415926535 8979323846 2643383279 5028841971 6939937510 5820974944 5923078164 0628620899 8628034825 3421170679 8214808651 *... * I dont know what that looks like when converted to long double. Lessee, In [1]: import numpy In [2]: pi = numpy.float128(3.1415926535897932384626433832795028841971) In [3]: numpy.pi - pi Out[3]: 0.0 In [7]: '%25.20f'%numpy.pi Out[7]: ' 3.14159265358979311600' In [8]: '%25.20f'%pi Out[8]: ' 3.14159265358979311600' I think we have a bug. Or else extended arithmetic isn't supported on this machine. 
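[One way to check whether extended arithmetic is actually in play, assuming numpy.float128 exists on the platform at all (it is not available everywhere); the eps values in the comments are what an x86 80-bit long double and a 64-bit double would be expected to report:]

import numpy

# Compare the machine epsilons of the two types: on x86, float128 is
# typically an 80-bit extended double with eps near 1.08e-19, versus
# about 2.22e-16 for float64.
print numpy.finfo(numpy.float64).eps
print numpy.finfo(numpy.float128).eps

# A float literal is parsed as a Python double before conversion, so the
# extra digits are lost either way; see the explanation in the next message.
a = numpy.float128(3.1415926535897932384626433832795028841971)
b = numpy.float128(numpy.float64(numpy.pi))
print a == b    # True: both went through a 64-bit double first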
Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From tim.hochberg at ieee.org Wed Feb 21 18:33:32 2007 From: tim.hochberg at ieee.org (Timothy Hochberg) Date: Wed, 21 Feb 2007 16:33:32 -0700 Subject: [Numpy-discussion] what goes wrong with cos(), sin() In-Reply-To: References: <45DCA366.1030104@noaa.gov> <45DCA6CC.8060906@gmail.com> <45DCAB4C.2060805@noaa.gov> <45DCBE07.7070809@gmail.com> Message-ID: On 2/21/07, Charles R Harris wrote: > > > > On 2/21/07, Robert Kern wrote: > > > > Christopher Barker wrote: > > > Robert Kern wrote: > > >> Christopher Barker wrote: > > >>> I wonder if there are any C math libs that do a better job than > > you'd > > >>> expect from standard FP? (short of unlimited precision ones) > > >> With respect to ? and the zeros of sin() and cos()? Not really. > > > > > Well, you can always use long double if it is implemented on your > > platform. You > > will have to construct a value for ? yourself, though. I'm afraid that > > we don't > > really make that easy. > > > > -- > > > pi = 3. 1415926535 8979323846 2643383279 5028841971 6939937510 5820974944 > 5923078164 0628620899 8628034825 3421170679 8214808651 *... > > * > I dont know what that looks like when converted to long double. Lessee, > > In [1]: import numpy > > In [2]: pi = numpy.float128(3.1415926535897932384626433832795028841971) > I think this is where you go wrong. Your string of digits is first a python float and *then* is converted to a long double. In the intermediate stage it gets truncated and you don't get the precision back. In [3]: numpy.pi - pi > Out[3]: 0.0 > > In [7]: '%25.20f'%numpy.pi > Out[7]: ' 3.14159265358979311600 ' > > In [8]: '%25.20f'%pi > Out[8]: ' 3.14159265358979311600' > > I think we have a bug. Or else extended arithmetic isn't supported on this > machine. > > Chuck > > > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > -- //=][=\\ tim.hochberg at ieee.org -------------- next part -------------- An HTML attachment was scrubbed... URL: From rowen at cesmail.net Wed Feb 21 18:39:14 2007 From: rowen at cesmail.net (Russell E. Owen) Date: Wed, 21 Feb 2007 15:39:14 -0800 Subject: [Numpy-discussion] Distributing prebuilt numpy and other extensions References: Message-ID: In article , Zachary Pincus wrote: > Hello folks, > > I've developed some command-line tools for biologists using python/ > numpy and some custom C and Fortran extensions, and I'm trying to > figure out how to easily distribute them... > > For people using linux, I figure a source distribution is no problem > at all. (Right?) > On the other hand, for Mac users (whose computers by default don't > have the dev tools, and even then would need to get a fortran > compiler elsewhere) I'd like to figure out something a bit easier. > > I'd like to somehow provide an installer (double-clickable or python > script) that does a version check and then installs an appropriate > version of prebuilt binaries for numpy and my C and Fortran > extensions. Is this possible within the bounds of the python or numpy > distutils? Would setuptools be a better way to go? Preferably it > would be a dead easy, one-step thing... > > Or is this whole idea problematic, and better to stick with source > distribution in all cases? As Robert Kern said, using bdist_mpkg is a nice easy way to create a double-clickable Mac installer for python code. 
It builds an installer package using the normal setup.py file for your stuff. Lots of packages built this way are available at: But if you want one installer that installs everything then, you have to figure what to do if the user already has some of the your python packages installed (e.g. numpy). Overwrite the existing package? Somehow install it in parallel and have the user pick which version to use? None of this is automated in bdist_mpkg. It is set up to install one python package at a time. So... For your project I suspect you would be better off using easy_install and packaging your project as a python "egg". easy_install is cross-platform, handles dependencies automatically and can install from source or precompiled binaries. That said, I've not actually used it except to install existing eggs, though I'd like to find some time to learn it. -- Russell From charlesr.harris at gmail.com Wed Feb 21 18:43:21 2007 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 21 Feb 2007 16:43:21 -0700 Subject: [Numpy-discussion] what goes wrong with cos(), sin() In-Reply-To: References: <45DCA366.1030104@noaa.gov> <45DCA6CC.8060906@gmail.com> <45DCAB4C.2060805@noaa.gov> <45DCBE07.7070809@gmail.com> Message-ID: On 2/21/07, Timothy Hochberg wrote: > > > > On 2/21/07, Charles R Harris wrote: > > > > > > > > On 2/21/07, Robert Kern < robert.kern at gmail.com> wrote: > > > > > > Christopher Barker wrote: > > > > Robert Kern wrote: > > > >> Christopher Barker wrote: > > > >>> I wonder if there are any C math libs that do a better job than > > > you'd > > > >>> expect from standard FP? (short of unlimited precision ones) > > > >> With respect to ? and the zeros of sin() and cos()? Not really. > > > > > > > > > > Well, you can always use long double if it is implemented on your > > > platform. You > > > will have to construct a value for ? yourself, though. I'm afraid that > > > we don't > > > really make that easy. > > > > > > -- > > > > > > pi = 3. 1415926535 8979323846 2643383279 5028841971 6939937510 > > 5820974944 5923078164 0628620899 8628034825 3421170679 8214808651 *... > > > > * > > I dont know what that looks like when converted to long double. Lessee, > > > > In [1]: import numpy > > > > In [2]: pi = numpy.float128(3.1415926535897932384626433832795028841971) > > > > I think this is where you go wrong. Your string of digits is first a > python float and *then* is converted to a long double. In the intermediate > stage it gets truncated and you don't get the precision back. > True. But there is missing functionality here. In [4]: pi = numpy.float128('3.1415926535897932384626433832795028841971') --------------------------------------------------------------------------- exceptions.TypeError Traceback (most recent call last) /home/charris/workspace/microsat/daemon/ TypeError: a float is required It's somewhat pointless to have a data type that you can't properly initialize. I think the string value should work, it works for python types. Chuck -------------- next part -------------- An HTML attachment was scrubbed... 
From charlesr.harris at gmail.com  Wed Feb 21 18:54:52 2007
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Wed, 21 Feb 2007 16:54:52 -0700
Subject: [Numpy-discussion] what goes wrong with cos(), sin()
In-Reply-To: 
References: <45DCA366.1030104@noaa.gov> <45DCA6CC.8060906@gmail.com>
	<45DCAB4C.2060805@noaa.gov> <45DCBE07.7070809@gmail.com>
Message-ID: 

On 2/21/07, Charles R Harris wrote:
>
> On 2/21/07, Timothy Hochberg wrote:
> >
> > On 2/21/07, Charles R Harris <charlesr.harris at gmail.com> wrote:
> > >
> > > On 2/21/07, Robert Kern <robert.kern at gmail.com> wrote:
> > > >
> > > > Christopher Barker wrote:
> > > > > Robert Kern wrote:
> > > > >> Christopher Barker wrote:
> > > > >>> I wonder if there are any C math libs that do a better job
> > > > >>> than you'd expect from standard FP? (short of unlimited
> > > > >>> precision ones)
> > > > >> With respect to π and the zeros of sin() and cos()? Not really.
> > > >
> > > > Well, you can always use long double if it is implemented on your
> > > > platform. You will have to construct a value for π yourself,
> > > > though. I'm afraid that we don't really make that easy.
> > >
> > > pi = 3. 1415926535 8979323846 2643383279 5028841971 6939937510
> > > 5820974944 5923078164 0628620899 8628034825 3421170679 8214808651 ...
> > >
> > > I don't know what that looks like when converted to long double.
> > > Lessee,
> > >
> > > In [1]: import numpy
> > >
> > > In [2]: pi = numpy.float128(3.1415926535897932384626433832795028841971)
> >
> > I think this is where you go wrong. Your string of digits is first a
> > python float and *then* is converted to a long double. In the
> > intermediate stage it gets truncated and you don't get the precision
> > back.
>
> True. But there is missing functionality here.
>
> In [4]: pi = numpy.float128('3.1415926535897932384626433832795028841971')
> ---------------------------------------------------------------------------
> exceptions.TypeError                 Traceback (most recent call last)
>
> /home/charris/workspace/microsat/daemon/
>
> TypeError: a float is required
>
> It's somewhat pointless to have a data type that you can't properly
> initialize. I think the string value should work, it works for python
> types.

OTOH, the old way does give extended precision for pi

In [8]: numpy.arctan2(numpy.float128(1), numpy.float128(1))*4
Out[8]: 3.14159265358979323851

Chuck

From cimrman3 at ntc.zcu.cz  Thu Feb 22 04:02:46 2007
From: cimrman3 at ntc.zcu.cz (Robert Cimrman)
Date: Thu, 22 Feb 2007 10:02:46 +0100
Subject: [Numpy-discussion] Managing Rolling Data
In-Reply-To: <525f23e80702210939r1ed80363q789feb8626de0461@mail.gmail.com>
References: <525f23e80702210939r1ed80363q789feb8626de0461@mail.gmail.com>
Message-ID: <45DD5C36.2030609@ntc.zcu.cz>

Alexander Michael wrote:
> I'm new to numpy and looking for advice on setting up and managing
> array data for my particular problem. I'm collecting observations of P
> properties for N objects over a rolling horizon of H sample times. I
> could conceptually store the data in a three-dimensional array with
> shape (N,P,H) that would allow me to easily (and efficiently with
> strided slices) compute the statistics over both N and H that I am
> interested in. This is great, but the rub is that H, an interval of T,
> is a rolling horizon.
> T is too large to fit in memory, so I need to load up H, perform my
> calculations, pop the oldest N x P slice and push the newest N x P
> slice into the data cube. What's the best way to do this that will
> maintain fast computations along the one-dimensional slices over N and
> H? Is there a commonly accepted idiom?
>
> Fundamentally, I see two solutions. The first would be to essentially
> perform a memcpy to propagate the data. The second would be to manage
> the N x P slices as H discontiguous memory blocks and merely reorder
> the pointers with each new sample. Can I do either of these with
> numpy?

The 'pointers reordering' can be very nicely done via deque from the
collections module - it behaves like a list, but has some additional
methods like rotate(). I have used it successfully as a circular buffer
for numpy arrays, but I can see that you need efficient slicing over H,
so contiguous memory would be better for you IMHO.

r.

From sturla at molden.no  Thu Feb 22 07:29:46 2007
From: sturla at molden.no (Sturla Molden)
Date: Thu, 22 Feb 2007 13:29:46 +0100
Subject: [Numpy-discussion] Managing Rolling Data
In-Reply-To: 
References: <525f23e80702210939r1ed80363q789feb8626de0461@mail.gmail.com>
	<268febdf0702211113h6ba7a1a3j76d434c7429db44b@mail.gmail.com>
	<525f23e80702211146s441cdcedn4eca9c573646eba4@mail.gmail.com>
Message-ID: <45DD8CBA.5020401@molden.no>

On 2/21/2007 11:03 PM, Anne Archibald wrote:
> I think it is almost as efficient as memmove; in particular, it
> doesn't create any temporaries

A ring buffer is O(1) whereas a memmove is O(N). Unless the amount of
data to be moved is very small, this makes the ringbuffer the more
attractive solution.

Slicing becomes a little bit more complicated with a ring, but not very
much. The data are stored in two ordered and contiguous segments: after
and before the stack pointer (in that order).

Sturla Molden

From toon.knapen at fft.be  Thu Feb 22 09:59:53 2007
From: toon.knapen at fft.be (Toon Knapen)
Date: Thu, 22 Feb 2007 15:59:53 +0100
Subject: [Numpy-discussion] installation documentation
In-Reply-To: <45DCA3A8.6050103@ee.byu.edu>
References: <45DC5D83.70603@fft.be> <45DCA3A8.6050103@ee.byu.edu>
Message-ID: <45DDAFE9.2000503@fft.be>

Travis Oliphant wrote:
> If you don't care about using "vendor-specific math libraries" then
> it's as easy as
>
> python setup.py install
>
> on linux and aix (it's also that easy on Windows if you have the right
> compiler and/or configuration file set up).
>
> If you care about vendor specific libraries, then you have to either
> put them in a "known" place or write a site.cfg file. (We should
> document where the "known" places are...)
>
> The installation process uses distutils and so information about
> distutils that you can glean from other places is also relevant.
>
> Ask on the mailing list for more specifics.

Ok, here goes ;-)

First of all, when installing on windows-x86 using msvc7.1 and
python-2.5, the build of numpy fails because it looks for python25.lib
in the lib directory of the python installation. However, I compiled
python-2.5 using the solution-file that comes with it, and this
generates the python25.lib file in the pcbuild subdirectory (and does
not copy it to the lib directory). After copying it manually, the build
worked.

However, when trying to use vendor specific libraries, I run into
trouble. First of all, on linux-x86 I think I have set it up right, but
how do I verify this?
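A site.cfg of the kind I mean looks something like this (the section and
key names are the ones in numpy's sample site.cfg; the paths are of
course specific to my machine):

[atlas]
library_dirs = /usr/local/lib/atlas
include_dirs = /usr/local/lib/atlas
atlas_libs = lapack, f77blas, cblas, atlas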
I have the same problem as mentioned in:
http://projects.scipy.org/pipermail/numpy-discussion/2007-February/026130.html

I also found numpy.__config__.show() but this does not show any tuned
lapack library although I had one in my site.cfg.

Any ideas?

Thanks in advance,

Toon

-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: setup.log

From lxander.m at gmail.com  Thu Feb 22 11:23:58 2007
From: lxander.m at gmail.com (Alexander Michael)
Date: Thu, 22 Feb 2007 11:23:58 -0500
Subject: [Numpy-discussion] Managing Rolling Data
In-Reply-To: <45DD8CBA.5020401@molden.no>
References: <525f23e80702210939r1ed80363q789feb8626de0461@mail.gmail.com>
	<268febdf0702211113h6ba7a1a3j76d434c7429db44b@mail.gmail.com>
	<525f23e80702211146s441cdcedn4eca9c573646eba4@mail.gmail.com>
	<45DD8CBA.5020401@molden.no>
Message-ID: <525f23e80702220823k199edf8awa77b762c875fc515@mail.gmail.com>

On 2/22/07, Sturla Molden wrote:
> A ring buffer is O(1) whereas a memmove is O(N). Unless the amount of
> data to be moved is very small, this makes the ringbuffer the more
> attractive solution.
>
> Slicing becomes a little bit more complicated with a ring, but not very
> much. The data are stored in two ordered and contiguous segments: after
> and before the stack pointer (in that order).

I like the ring buffer idea as well. As order matters for some of my
operations, I will need to put the two sides together somehow, but want
to avoid blindly copying into a new array every time I slice on H. I
would also like to avoid writing wrappers for every possible array
operation. Does numpy provide a facility for creating a concatenated
"view" that can be handled like a "normal" array and only gets copied
into contiguous memory when push comes to shove?

This discussion has been really helpful for me as a numpy neophyte,
thanks everyone!

Alex

From David.L.Goldsmith at noaa.gov  Thu Feb 22 13:29:47 2007
From: David.L.Goldsmith at noaa.gov (David L Goldsmith)
Date: Thu, 22 Feb 2007 10:29:47 -0800
Subject: [Numpy-discussion] what goes wrong with cos(), sin()
In-Reply-To: 
References: <45DCA366.1030104@noaa.gov> <45DCA6CC.8060906@gmail.com>
	<45DCAB4C.2060805@noaa.gov> <45DCBE07.7070809@gmail.com>
Message-ID: <45DDE11B.8010308@noaa.gov>

I agree w/ Chuck - I'd consider what Tim describes to be a "bug".

DG

Charles R Harris wrote:
> On 2/21/07, Timothy Hochberg wrote:
> >
> > On 2/21/07, Charles R Harris <charlesr.harris at gmail.com> wrote:
> > >
> > > On 2/21/07, Robert Kern <robert.kern at gmail.com> wrote:
> > > >
> > > > Christopher Barker wrote:
> > > > > Robert Kern wrote:
> > > > >> Christopher Barker wrote:
> > > > >>> I wonder if there are any C math libs that do a better job
> > > > >>> than you'd expect from standard FP? (short of unlimited
> > > > >>> precision ones)
> > > > >> With respect to π and the zeros of sin() and cos()? Not really.
> > > >
> > > > Well, you can always use long double if it is implemented on your
> > > > platform. You will have to construct a value for π yourself,
> > > > though. I'm afraid that we don't really make that easy.
> > >
> > > pi = 3. 1415926535 8979323846 2643383279 5028841971 6939937510
> > > 5820974944 5923078164 0628620899 8628034825 3421170679 8214808651 ...
> > >
> > > I don't know what that looks like when converted to long double.
> > > Lessee,
> > >
> > > In [1]: import numpy
> > >
> > > In [2]: pi = numpy.float128(3.1415926535897932384626433832795028841971)
> >
> > I think this is where you go wrong. Your string of digits is first a
> > python float and *then* is converted to a long double.
> > In the intermediate stage it gets truncated and you don't get the
> > precision back.
>
> True. But there is missing functionality here.
>
> In [4]: pi = numpy.float128('3.1415926535897932384626433832795028841971')
> ---------------------------------------------------------------------------
> exceptions.TypeError                 Traceback (most recent call last)
>
> /home/charris/workspace/microsat/daemon/
>
> TypeError: a float is required
>
> It's somewhat pointless to have a data type that you can't properly
> initialize. I think the string value should work, it works for python
> types.
>
> Chuck

--
ERD/ORR/NOS/NOAA

From Chris.Barker at noaa.gov  Thu Feb 22 13:46:01 2007
From: Chris.Barker at noaa.gov (Christopher Barker)
Date: Thu, 22 Feb 2007 10:46:01 -0800
Subject: [Numpy-discussion] what goes wrong with cos(), sin()
In-Reply-To: <45DDE11B.8010308@noaa.gov>
References: <45DCA366.1030104@noaa.gov> <45DCA6CC.8060906@gmail.com>
	<45DCAB4C.2060805@noaa.gov> <45DCBE07.7070809@gmail.com>
	<45DDE11B.8010308@noaa.gov>
Message-ID: <45DDE4E9.7080900@noaa.gov>

David L Goldsmith wrote:
> I agree w/ Chuck - I'd consider what Tim describes to be a "bug".

It's not a bug, but it is a missing feature. numpy doesn't appear to
convert strings to numbers for any of its own types:

>>> N.float128("4.3")
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: a float is required
>>> N.uint8("4")
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ValueError: setting an array element with a sequence.
>>> N.uint16("4")
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ValueError: setting an array element with a sequence.
>>> N.int16("4")
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ValueError: setting an array element with a sequence.

It works for "float" and "int":

>>> N.float("4")
4.0
>>> N.int("4")
4

Because these are built-in python types, and those constructors support
initialization with strings. Given that there are some numpy types that
hold values that cannot be initialized by standard python literals, it
would be nice to be able to do this.

Oh, and here's another inconsistency:

>>> N.int32("676")
676
>>> N.uint32("676")
array([6, 7, 6], dtype=uint32)

-Chris

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov

From cookedm at physics.mcmaster.ca  Thu Feb 22 15:23:32 2007
From: cookedm at physics.mcmaster.ca (David M. Cooke)
Date: Thu, 22 Feb 2007 15:23:32 -0500
Subject: [Numpy-discussion] what goes wrong with cos(), sin()
In-Reply-To: <45DCA366.1030104@noaa.gov>
References: <45DCA366.1030104@noaa.gov>
Message-ID: 

On Feb 21, 2007, at 14:54 , Christopher Barker wrote:

> Anne Archibald wrote:
>> Or, to see more clearly, try taking (on a pocket calculator, say)
>> sin(3.14) (or even sin(pi)).
>
> This is an interesting point. I took a class from William Kahan once
> (pass/fail, thank god!), and one question he posed to us was:
>
> How many digits of pi are used in an HP calculator?
FWIW, there are two data types for reals (at least on the HP 28 and 48
series, and others in that line): a 12 decimal digit real used for
communicating with the user, and an extended 15 decimal digit real used
internally. All calculations are done in base 10. The exponent e for the
12-digit real is in the range -500 < e < 500, and for the 15-digit,
-50000 < e < 50000. AFAIK, most of HP's calculators are like this.

--
|>|\/|<
/------------------------------------------------------------------\
|David M. Cooke    http://arbutus.physics.mcmaster.ca/dmc/
|cookedm at physics.mcmaster.ca

From myeates at jpl.nasa.gov  Thu Feb 22 17:50:16 2007
From: myeates at jpl.nasa.gov (Mathew Yeates)
Date: Thu, 22 Feb 2007 14:50:16 -0800
Subject: [Numpy-discussion] dumb question about creating a complex array
Message-ID: <45DE1E28.3070907@jpl.nasa.gov>

Given an array of floats, 2N columns and M rows, where the elements
A[r,2*j] and A[r,2*j+1] form the real and imaginary parts of a complex
number... What is the simplest way to create a complex array? It's a
fairly large array, so I want to keep copying to a minimum.

(Actually, it's not a float array; its elements are byte sized, in case
that matters)

Mathew

--

From stefan at sun.ac.za  Thu Feb 22 18:21:01 2007
From: stefan at sun.ac.za (Stefan van der Walt)
Date: Fri, 23 Feb 2007 01:21:01 +0200
Subject: [Numpy-discussion] dumb question about creating a complex array
In-Reply-To: <45DE1E28.3070907@jpl.nasa.gov>
References: <45DE1E28.3070907@jpl.nasa.gov>
Message-ID: <20070222232101.GE5912@mentat.za.net>

Hi Mathew

On Thu, Feb 22, 2007 at 02:50:16PM -0800, Mathew Yeates wrote:
> given an array of floats, 2N columns and M rows, where the elements
> A[r,2*j] and A[r,2*j+1] form the real and imaginary parts of a complex
> number... What is the simplest way to create a complex array? It's
> a fairly large array so I want to keep copying to a minimum.
>
> (Actually, it's not a float array, its elements are byte sized, in case
> that matters)

That's a bit problematic, since numpy doesn't have Complex16. If you had
two 32-bit floats, you could do:

In [14]: x = N.array([10,10,20,20],dtype=N.float32)

In [15]: x.view(N.complex64)
Out[15]: array([ 10.+10.j,  20.+20.j], dtype=complex64)

As things stand, you may have to copy:

In [16]: x = N.array([10,10,20,20],dtype=N.uint8)

In [19]: x.astype(N.float32).view(N.complex64)
Out[19]: array([ 10.+10.j,  20.+20.j], dtype=complex64)

Maybe someone else will come up with a better answer.

Cheers
Stéfan

From stefan at sun.ac.za  Thu Feb 22 18:26:25 2007
From: stefan at sun.ac.za (Stefan van der Walt)
Date: Fri, 23 Feb 2007 01:26:25 +0200
Subject: [Numpy-discussion] dumb question about creating a complex array
In-Reply-To: <45DE1E28.3070907@jpl.nasa.gov>
References: <45DE1E28.3070907@jpl.nasa.gov>
Message-ID: <20070222232625.GF5912@mentat.za.net>

On Thu, Feb 22, 2007 at 02:50:16PM -0800, Mathew Yeates wrote:
> given an array of floats, 2N columns and M rows, where the elements
> A[r,2*j] and A[r,2*j+1] form the real and imaginary parts of a complex
> number... What is the simplest way to create a complex array? It's
> a fairly large array so I want to keep copying to a minimum.
> (Actually, it's not a float array, its elements are byte sized, in case
> that matters)

One other solution may be to use record arrays:

In [40]: x = N.array([[10,10,20,20],[20,20,30,30]],N.uint8)

In [41]: y = x.view([('real',N.uint8),('imag',N.uint8)]).reshape(x.shape[0],-1)

In [42]: y['real']
Out[42]:
array([[10, 20],
       [20, 30]], dtype=uint8)

In [43]: y['imag']
Out[43]:
array([[10, 20],
       [20, 30]], dtype=uint8)

Then you can at least do some arithmetic manually without any copying of
data taking place.

Regards
Stéfan

From Chris.Barker at noaa.gov  Thu Feb 22 18:28:22 2007
From: Chris.Barker at noaa.gov (Christopher Barker)
Date: Thu, 22 Feb 2007 15:28:22 -0800
Subject: [Numpy-discussion] dumb question about creating a complex array
In-Reply-To: <45DE1E28.3070907@jpl.nasa.gov>
References: <45DE1E28.3070907@jpl.nasa.gov>
Message-ID: <45DE2716.4020105@noaa.gov>

Mathew Yeates wrote:
> given an array of floats, 2N columns and M rows, where the elements
> A[r,2*j] and A[r,2*j+1] form the real and imaginary parts of a complex
> number... What is the simplest way to create a complex array? It's
> a fairly large array so I want to keep copying to a minimum.
>
> (Actually, it's not a float array, its elements are byte sized, in case
> that matters)

It does. If it was a float array, you may even be able to do it without
any copying at all. Anyway, this should work:

>>> a = N.array([[1,2,3,4],[2,3,4,5],[4,5,6,7]], dtype=N.byte)
>>> a
array([[1, 2, 3, 4],
       [2, 3, 4, 5],
       [4, 5, 6, 7]], dtype=int8)
# have I got that right?
>>> b = N.empty((a.shape[0],a.shape[1]/2), dtype=N.complex)
>>> b.real = a[:,range(0,a.shape[1],2)]
>>> b.imag = a[:,range(1,a.shape[1],2)]
>>> b
array([[ 1.+2.j,  3.+4.j],
       [ 2.+3.j,  4.+5.j],
       [ 4.+5.j,  6.+7.j]])

Is that what you wanted?

By the way, I think there is a trick for selecting every other column
without using range(), but I can't find it at the moment.

-Chris

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov

From myeates at jpl.nasa.gov  Thu Feb 22 18:52:06 2007
From: myeates at jpl.nasa.gov (Mathew Yeates)
Date: Thu, 22 Feb 2007 15:52:06 -0800
Subject: [Numpy-discussion] dumb question about creating a complex array
In-Reply-To: <45DE2716.4020105@noaa.gov>
References: <45DE1E28.3070907@jpl.nasa.gov> <45DE2716.4020105@noaa.gov>
Message-ID: <45DE2CA6.60109@jpl.nasa.gov>

Thanks for the replies. I think I have enough to work with.

Mathew

Christopher Barker wrote:
> Mathew Yeates wrote:
>> given an array of floats, 2N columns and M rows, where the elements
>> A[r,2*j] and A[r,2*j+1] form the real and imaginary parts of a complex
>> number... What is the simplest way to create a complex array? It's
>> a fairly large array so I want to keep copying to a minimum.
>>
>> (Actually, it's not a float array, its elements are byte sized, in case
>> that matters)
>
> It does. If it was a float array, you may even be able to do it without
> any copying at all. Anyway, this should work:
>
> >>> a = N.array([[1,2,3,4],[2,3,4,5],[4,5,6,7]], dtype=N.byte)
> >>> a
> array([[1, 2, 3, 4],
>        [2, 3, 4, 5],
>        [4, 5, 6, 7]], dtype=int8)
> # have I got that right?
> >>> b = N.empty((a.shape[0],a.shape[1]/2), dtype=N.complex)
> >>> b.real = a[:,range(0,a.shape[1],2)]
> >>> b.imag = a[:,range(1,a.shape[1],2)]
> >>> b
> array([[ 1.+2.j,  3.+4.j],
>        [ 2.+3.j,  4.+5.j],
>        [ 4.+5.j,  6.+7.j]])
>
> Is that what you wanted?
> By the way, I think there is a trick for selecting every other column
> without using range(), but I can't find it at the moment.
>
> -Chris

--

From HansJoachim.Ehlers at eumetsat.int  Fri Feb 23 07:06:28 2007
From: HansJoachim.Ehlers at eumetsat.int (Hans-Joachim Ehlers)
Date: Fri, 23 Feb 2007 13:06:28 +0100
Subject: [Numpy-discussion] installing numpy 1.0.1 on AIX 5.3 using
	python 2.5 and xlf v10.1
Message-ID: <45DEE6D4020000AE000045A8@gwn0201.eumetsat.int>

Given:
AIX 5.3
xlf v10.1
python 2.5
numpy 1.0.1

Task: Building/installing numpy on AIX 5.3

Problem: The install process cannot determine the IBM Fortran compiler.
Running

python ./numpy/distutils/fcompiler/ibm.py

gives pretty much garbage on the screen and the result is 'None'. So I
changed the line in ./numpy/distutils/fcompiler/ibm.py from

'version_cmd'  : ["xlf"],

to

'version_cmd'  : ["xlf -qversion"],

since xlf v10 supports the option where xlf v8 does not. Now running
python ./numpy/distutils/fcompiler/ibm.py gives me:

customize IbmFCompiler
Couldn't match compiler version for 'IBM XL Fortran Enterprise Edition
V10.1 for AIX \nVersion: 10.01.0000.0004'
None

Could anybody give me a solution/hint, since I do not know what else has
to be changed to make a match?

BTW: I am not a python programmer. I am just trying to build numpy.

Tia
Hajo

From lxander.m at gmail.com  Fri Feb 23 11:09:29 2007
From: lxander.m at gmail.com (Alexander Michael)
Date: Fri, 23 Feb 2007 11:09:29 -0500
Subject: [Numpy-discussion] Managing Rolling Data
In-Reply-To: <525f23e80702220823k199edf8awa77b762c875fc515@mail.gmail.com>
References: <525f23e80702210939r1ed80363q789feb8626de0461@mail.gmail.com>
	<268febdf0702211113h6ba7a1a3j76d434c7429db44b@mail.gmail.com>
	<525f23e80702211146s441cdcedn4eca9c573646eba4@mail.gmail.com>
	<45DD8CBA.5020401@molden.no>
	<525f23e80702220823k199edf8awa77b762c875fc515@mail.gmail.com>
Message-ID: <525f23e80702230809t4a4620a7mf0752c25e0b6b3cb@mail.gmail.com>

Timothy's refinement of Anne's idea will work for me:

>>> import timeit
>>> print '%.2fms/push' % (1000 * timeit.Timer(
...     "a[...,:-1] = a[...,1:]",
...     "from numpy import empty; a = empty((5000,20,1000))"
...     ).timeit(number=10)/10)
537.86ms/push

I still find the ring buffer solution appealing, but I did not see a way
to stack two arrays together without creating copies. Am I missing a bit
of numpy cleverness?

Thank you everyone for your help,
Alex

From joris at ster.kuleuven.ac.be  Fri Feb 23 11:38:19 2007
From: joris at ster.kuleuven.ac.be (joris at ster.kuleuven.ac.be)
Date: Fri, 23 Feb 2007 17:38:19 +0100
Subject: [Numpy-discussion] the neighbourhood of each element of an array
Message-ID: <1172248699.45df187b7f9f6@webmail.ster.kuleuven.be>

Hi,

Given a (possibly masked) 2d array x, is there a fast(er) way in Numpy
to obtain the same result as the following few lines?

d = 1   # neighbourhood 'radius'
Nrow = x.shape[0]
Ncol = x.shape[1]
y = array([[x[i-d:i+d+1,j-d:j+d+1].ravel() for j in range(d,Ncol-d)] \
           for i in range(d,Nrow-d)])

What you get is an array containing all the elements in a neighbourhood
for each element, disregarding the edges to avoid out-of-range problems.
The code above becomes quite slow for e.g. a 2000x2000 array. Does
anyone know a better approach?
Ciao,
Joris

Disclaimer: http://www.kuleuven.be/cwis/email_disclaimer.htm

From peridot.faceted at gmail.com  Fri Feb 23 13:26:45 2007
From: peridot.faceted at gmail.com (Anne Archibald)
Date: Fri, 23 Feb 2007 13:26:45 -0500
Subject: [Numpy-discussion] Managing Rolling Data
In-Reply-To: <525f23e80702230809t4a4620a7mf0752c25e0b6b3cb@mail.gmail.com>
References: <525f23e80702210939r1ed80363q789feb8626de0461@mail.gmail.com>
	<268febdf0702211113h6ba7a1a3j76d434c7429db44b@mail.gmail.com>
	<525f23e80702211146s441cdcedn4eca9c573646eba4@mail.gmail.com>
	<45DD8CBA.5020401@molden.no>
	<525f23e80702220823k199edf8awa77b762c875fc515@mail.gmail.com>
	<525f23e80702230809t4a4620a7mf0752c25e0b6b3cb@mail.gmail.com>
Message-ID: 

On 23/02/07, Alexander Michael wrote:
> I still find the ring buffer solution appealing, but I did not see a
> way to stack two arrays together without creating copies. Am I missing
> a bit of numpy cleverness?

The short answer is no; the stride in memory from one element to the
next must always be the same in an array, and so it's impossible to join
two arrays like that without copying.

There is a ring-buffer-like idea that lets you do less copying; if you
need slices of length N, you can choose M>=2N and maintain a ringbuffer
of length M instead; when you push something into the M-N+jth position,
you also push (i.e., copy) it into the jth position. Then you can always
take a contiguous slice starting anywhere from 0 to M-N-1, and you need
only copy a small fraction (N/M or so) of the values pushed on. You
never move values in the ringbuffer, and there are no big spikes in CPU
time.

Anne

From faltet at carabos.com  Fri Feb 23 14:18:48 2007
From: faltet at carabos.com (Francesc Altet)
Date: Fri, 23 Feb 2007 20:18:48 +0100
Subject: [Numpy-discussion] the neighbourhood of each element of an array
In-Reply-To: <1172248699.45df187b7f9f6@webmail.ster.kuleuven.be>
References: <1172248699.45df187b7f9f6@webmail.ster.kuleuven.be>
Message-ID: <200702232018.50199.faltet@carabos.com>

A Divendres 23 Febrer 2007 17:38, joris at ster.kuleuven.ac.be escrigué:
> Hi,
>
> Given a (possibly masked) 2d array x, is there a fast(er) way in Numpy
> to obtain the same result as the following few lines?
>
> d = 1   # neighbourhood 'radius'
> Nrow = x.shape[0]
> Ncol = x.shape[1]
> y = array([[x[i-d:i+d+1,j-d:j+d+1].ravel() for j in range(d,Ncol-d)] \
>            for i in range(d,Nrow-d)])
>
> What you get is an array containing all the elements in a neighbourhood
> for each element, disregarding the edges to avoid out-of-range
> problems. The code above becomes quite slow for e.g. a 2000x2000 array.
> Does anyone know a better approach?

Well, it seems that copying data here is taking most of the CPU. Perhaps
you may want to try getting *references* to the original slices instead.
For example, if rd = 2*d+1, you can write:

def get_neighbors_views_ravel(x):
    # The next is for an array of references to *views* of neighbors
    y = numpy.empty((Nrow-2*d, Ncol-2*d), dtype='object')
    for i in xrange(0, Nrow-2*d):
        x2 = x[i:i+rd]   # Get a view of the first dimension slice
        for j in xrange(0, Ncol-2*d):
            y[i, j] = x2[:,j:j+rd].ravel()
    return y

which is 1.34x faster (on my machine) than your current approach.

If you want more speed, you may want to not call .ravel() at array
creation time (you can always call .ravel() when you are going to use
the data). The removal of the .ravel() call makes the above function
2.56x faster.

Finally, if your machine has an x86 architecture, you can also take
advantage of Psyco so as to accelerate a bit more.
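Enabling it is a one-liner; a sketch (assuming Psyco is installed; it
only exists for 32-bit x86):

import psyco
psyco.bind(get_neighbors_views_ravel)  # or psyco.full() to compile everything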
With Psyco and not raveling, you can get up to 3x better times than your
original approach (without using Psyco). Of course, more speed-ups could
be possible by using Pyrex or any other easy method for doing
C-extensions.

I'm attaching a small benchmark, and here are the results for my machine:

ref time--> 3.021
views (ravel) time--> 2.258    speed-up--> 1.34
views (no ravel) time--> 1.179 speed-up--> 2.56

and if we use psyco:

ref time--> 2.368
views (ravel) time--> 1.636    speed-up--> 1.45
views (no ravel) time--> 0.935 speed-up--> 2.53

Cheers,

--
>0,0<   Francesc Altet     http://www.carabos.com/
V   V   Cárabos Coop. V.   Enjoy Data
 "-"

-------------- next part --------------
A non-text attachment was scrubbed...
Name: prova13.py
Type: application/x-python
Size: 1675 bytes
Desc: not available

From bryan at cole.uklinux.net  Fri Feb 23 14:39:00 2007
From: bryan at cole.uklinux.net (Bryan Cole)
Date: Fri, 23 Feb 2007 19:39:00 +0000
Subject: [Numpy-discussion] the neighbourhood of each element of an array
In-Reply-To: <1172248699.45df187b7f9f6@webmail.ster.kuleuven.be>
References: <1172248699.45df187b7f9f6@webmail.ster.kuleuven.be>
Message-ID: <1172259540.21554.14.camel@pc1.cole.uklinux.net>

On Fri, 2007-02-23 at 17:38 +0100, joris at ster.kuleuven.ac.be wrote:
> Hi,
>
> Given a (possibly masked) 2d array x, is there a fast(er) way in Numpy
> to obtain the same result as the following few lines?
>
> d = 1   # neighbourhood 'radius'
> Nrow = x.shape[0]
> Ncol = x.shape[1]
> y = array([[x[i-d:i+d+1,j-d:j+d+1].ravel() for j in range(d,Ncol-d)] \
>            for i in range(d,Nrow-d)])

how about something like

er = Nrow - d
ec = Ncol - d
y = array([x[d+i:er+i, d+j:ec+j] for j in arange(-d,d+1)
                                 for i in arange(-d,d+1)])

now you're looping over a small array and combining slices of the big
array (as opposed to looping over the big array and combining slices
from a small one). This should be faster for large Nrow, Ncol.

BC

> What you get is an array containing all the elements in a neighbourhood
> for each element, disregarding the edges to avoid out-of-range
> problems. The code above becomes quite slow for e.g. a 2000x2000 array.
> Does anyone know a better approach?
>
> Ciao,
> Joris
>
> Disclaimer: http://www.kuleuven.be/cwis/email_disclaimer.htm

From zpincus at stanford.edu  Fri Feb 23 14:53:05 2007
From: zpincus at stanford.edu (Zachary Pincus)
Date: Fri, 23 Feb 2007 11:53:05 -0800
Subject: [Numpy-discussion] the neighbourhood of each element of an array
In-Reply-To: <1172259540.21554.14.camel@pc1.cole.uklinux.net>
References: <1172248699.45df187b7f9f6@webmail.ster.kuleuven.be>
	<1172259540.21554.14.camel@pc1.cole.uklinux.net>
Message-ID: 

Scipy's ndimage module has a function that takes a generic callback and
calls it with the values of each neighborhood (of a given size, and
optionally with a particular "mask" footprint) centered on each array
element. That function handles boundary conditions, etc. nicely.

Unfortunately, I'm not sure if it works with masked arrays, and I think
it hands a ravel'd set of pixels back to the callback function. You
could probably hack masking in there by passing it the mask concatenated
to the array, and then deal with the mask explicitly.
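The function I mean is generic_filter; something along these lines
(untested, so treat it as a sketch):

import numpy
from scipy import ndimage

def local_mean(values):
    # 'values' is the raveled neighborhood around one element
    return values.mean()

x = numpy.arange(25.).reshape(5, 5)
y = ndimage.generic_filter(x, local_mean, size=3)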
>> >> d = 1 # neighbourhood 'radius' >> Nrow = x.shape[0] >> Ncol = x.shape[1] >> y = array([[x[i-d:i+d+1,j-d:j+d+1].ravel() for j in range(d,Ncol- >> d)] \ >> for i in range(d,Nrow-d)]) >> > > how about something like > > er = Nrow - d > ec = Ncol - d > y = array([x[i:er+i, j:ec+j] for j in arange(-d,d) > for i in arange(-d,d)]) > > now you're looping over a small array and combining slices of the big > array (as opposed to looping over the big array and combining slices > from a small one). This should be faster for large Nrow, Ncol. > > BC > > >> What you get is an array containing all the elements in a >> neighbourhood for each >> element, disregarding the edges to avoid out-of-range problems. >> The code above >> becomes quite slow for e.g. a 2000x2000 array. Does anyone know a >> better >> approach? >> >> Ciao, >> Joris >> >> >> >> Disclaimer: http://www.kuleuven.be/cwis/email_disclaimer.htm > > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From pgmdevlist at gmail.com Fri Feb 23 15:19:17 2007 From: pgmdevlist at gmail.com (Pierre GM) Date: Fri, 23 Feb 2007 15:19:17 -0500 Subject: [Numpy-discussion] the neighbourhood of each element of an array In-Reply-To: References: <1172248699.45df187b7f9f6@webmail.ster.kuleuven.be> <1172259540.21554.14.camel@pc1.cole.uklinux.net> Message-ID: <200702231519.18510.pgmdevlist@gmail.com> On Friday 23 February 2007 14:53:05 Zachary Pincus wrote: > Scipy's ndimage module has a function that takes a generic callback > and calls it with the values of each neighborhood (of a given size, > and optionally with a particular "mask" footprint) centered on each > array element. That function handles boundary conditions, etc nicely. > > Unfortunately, I'm not sure if it works with masked arrays, and I > think it hands a ravel'd set of pixels back to the callback function. > You could probably hack masking in there by passing it the mask > concatenated to the array, and then deal with the mask explicitly. Without really thinking about it: The easiest would be to process the masked array in steps: * process the _data part of the maskedarray (or its filled version) with the function: that will be your new _data. * if the mask is not nomask, process the _mask part of the maskedarray to get a new _mask * Set to True any element of the new mask that contains a True value: in other terms, mask the values that have a masked neighbor. From a.schmolck at gmx.net Fri Feb 23 16:19:05 2007 From: a.schmolck at gmx.net (Alexander Schmolck) Date: 23 Feb 2007 21:19:05 +0000 Subject: [Numpy-discussion] Fortran order arrays to and from numpy arrays Message-ID: Hi, I'm currently puzzling over how to best convert (column major order) matlab arrays to numpy arrays and vice versa -- I'm looking for a solution that's simple, general and reasonably fast -- being also applicable to Numeric arrays would be a plus (I'd like to retain Numeric compatibility for the current version of the code). The solution that mlabwrap (http://mlabwrap.sf.net) currently employs to transfer arrays between matlab and python is to have a low level C++ extension (mlabraw) copying the data into a new numpy array created with Pyarray_SimpleNew and using a simple or nested for-loop to copy the data around (only rank up to 2 is currently handled), e.g. 
//matlab matrix -> numpy
for (mwIndex lCol = 0; lCol < lCols; lCol++) {
    double *lDst = (double *)(lRetval->data) + lCol;
    for (mwIndex lRow = 0; lRow < lRows; lRow++, lDst += lCols) {
        *lDst = *lPR++;}}

//numpy 2d array -> matlab
npy_intp lRowDelta = pStrides[0]/sizeof(T);
npy_intp lColDelta = pStrides[1]/sizeof(T);
for (npy_intp lCol=0; lCol < pCols; lCol++) {
    T *lSrc = pSrc + lCol*lColDelta;
    for (npy_intp lRow=0; lRow < pRows; lRow++, lSrc += lRowDelta) {
        *pDst++ = *lSrc;}}

Some simple timing results suggest 2 things:

1. There is an enormous overhead associated with calling into matlab
   (e.g. just using mlabraw to evaluate ``sin(1);`` in matlab, without
   accessing the result, is about 1000 times slower than
   ``math.sin(1)``). I have no idea where this rather huge overhead
   comes from, but I suspect there isn't much one can do about it.

2. Despite this overhead, copying around large arrays (e.g. >=1e5
   elements) in the above way causes notable additional overhead.
   Whilst I don't think there's a sane way to avoid copying by sharing
   data between numpy and matlab, the copying could likely be done
   better.

I guess the moral from 1 might be that ctypes will be the way to go for
future versions. I would assume that the overhead associated with ctypes
doesn't matter that much and that it will be much less painful to
maintain than having a C or C++ extension module, especially given that
the Mathworks seem fond of changing the C API every minor release. I
haven't used ctypes yet so I'd be interested to know what people think.

The moral from 2 might be just to memcpy the data wholesale where
possible and transpose and reshape afterwards or, in numpy, to set the
fortran-order flag (and query fortran-contiguousness on copying to
matlab) -- I suppose that specifying fortran order and
transposing/reshaping are pretty much equivalent? So matlab to numpy
would look roughly like so:

a_fortran_order = memcpy_raw_matlab_data_into_numpy_array(ptr)
a = a_fortran_order.T.reshape(a_fortran_order.shape)

I would assume that this ought to be on average more efficient than
always transforming to C-contiguous (as some of the time the returned
array will anyway be transposed afterwards or accessed in a fashion
where column major is faster; the case that several operations that
profit from row-major storage are carried out on the same array seems
unlikely).

Unfortunately I don't see an easy way to use the same approach the other
way (matlab doesn't seem to offer much on the C level to manipulate
arrays), so I'd presumably need something like:

stuff_into_matlab_array(a.T.reshape(a.shape).copy())

the question is how to avoid doing two copies.

Any comments appreciated,

'as

From charlesr.harris at gmail.com  Fri Feb 23 16:39:23 2007
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 23 Feb 2007 14:39:23 -0700
Subject: [Numpy-discussion] Fortran order arrays to and from numpy arrays
In-Reply-To: 
References: 
Message-ID: 

On 23 Feb 2007 21:19:05 +0000, Alexander Schmolck wrote:
>
> Hi,
>
> I'm currently puzzling over how to best convert (column major order)
> matlab arrays to numpy arrays and vice versa -- I'm looking for a
> solution that's simple, general and reasonably fast -- being also
> applicable to Numeric arrays would be a plus (I'd like to retain
> Numeric compatibility for the current version of the code).
> The solution that mlabwrap (http://mlabwrap.sf.net) currently employs
> to transfer arrays between matlab and python is to have a low level
> C++ extension (mlabraw) copy the data into a new numpy array created
> with PyArray_SimpleNew, using a simple or nested for-loop to copy the
> data around (only rank up to 2 is currently handled), e.g.
>
> //matlab matrix -> numpy
> for (mwIndex lCol = 0; lCol < lCols; lCol++) {
>     double *lDst = (double *)(lRetval->data) + lCol;
>     for (mwIndex lRow = 0; lRow < lRows; lRow++, lDst += lCols) {
>         *lDst = *lPR++;}}
>
> //numpy 2d array -> matlab
> npy_intp lRowDelta = pStrides[0]/sizeof(T);
> npy_intp lColDelta = pStrides[1]/sizeof(T);
> for (npy_intp lCol=0; lCol < pCols; lCol++) {
>     T *lSrc = pSrc + lCol*lColDelta;
>     for (npy_intp lRow=0; lRow < pRows; lRow++, lSrc += lRowDelta) {
>         *pDst++ = *lSrc;}}
>
> Some simple timing results suggest 2 things:
>
> 1. There is an enormous overhead associated with calling into matlab
>    (e.g. just using mlabraw to evaluate ``sin(1);`` in matlab, without
>    accessing the result, is about 1000 times slower than
>    ``math.sin(1)``). I have no idea where this rather huge overhead
>    comes from, but I suspect there isn't much one can do about it.
>
> 2. Despite this overhead, copying around large arrays (e.g. >=1e5
>    elements) in the above way causes notable additional overhead.
>    Whilst I don't think there's a sane way to avoid copying by sharing
>    data between numpy and matlab, the copying could likely be done
>    better.
>
> I guess the moral from 1 might be that ctypes will be the way to go
> for future versions. I would assume that the overhead associated with
> ctypes doesn't matter that much and that it will be much less painful
> to maintain than having a C or C++ extension module, especially given
> that the Mathworks seem fond of changing the C API every minor
> release. I haven't used ctypes yet so I'd be interested to know what
> people think.
>
> The moral from 2 might be just to memcpy the data wholesale where
> possible and transpose and reshape afterwards or, in numpy, to set the
> fortran-order flag (and query fortran-contiguousness on copying to
> matlab) -- I suppose that specifying fortran order and
> transposing/reshaping are pretty much equivalent? So matlab to numpy
> would look roughly like so:
>
> a_fortran_order = memcpy_raw_matlab_data_into_numpy_array(ptr)
> a = a_fortran_order.T.reshape(a_fortran_order.shape)
>
> I would assume that this ought to be on average more efficient than
> always transforming to C-contiguous (as some of the time the returned
> array will anyway be transposed afterwards or accessed in a fashion
> where column major is faster; the case that several operations that
> profit from row-major storage are carried out on the same array seems
> unlikely).
>
> Unfortunately I don't see an easy way to use the same approach the
> other way (matlab doesn't seem to offer much on the C level to
> manipulate arrays), so I'd presumably need something like:
>
> stuff_into_matlab_array(a.T.reshape(a.shape).copy())
>
> the question is how to avoid doing two copies.
> Any comments appreciated,

The easiest way to deal with the ordering is to use the order keyword in
numpy:

In [4]: a = array([0,1,2,3]).reshape((2,2), order='F')

In [5]: a
Out[5]:
array([[0, 2],
       [1, 3]])

You would still need to get access to something to reshape, shared
memory or something, but the key is that you don't have to reorder the
elements, you just need the correct strides and offsets to address the
elements in Fortran order. I have no idea if this works in Numeric.

Chuck

From strawman at astraw.com  Sat Feb 24 03:46:32 2007
From: strawman at astraw.com (Andrew Straw)
Date: Sat, 24 Feb 2007 00:46:32 -0800
Subject: [Numpy-discussion] Fortran order arrays to and from numpy arrays
In-Reply-To: 
References: 
Message-ID: <45DFFB68.5010303@astraw.com>

Alexander Schmolck wrote:
> 2. Despite this overhead, copying around large arrays (e.g. >=1e5
>    elements) in the above way causes notable additional overhead.
>    Whilst I don't think there's a sane way to avoid copying by sharing
>    data between numpy and matlab, the copying could likely be done
>    better.

Alex, what do you think about "hybrid arrays"?

http://www.mail-archive.com/numpy-discussion at lists.sourceforge.net/msg03748.html

From david at ar.media.kyoto-u.ac.jp  Sat Feb 24 09:25:38 2007
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Sat, 24 Feb 2007 23:25:38 +0900
Subject: [Numpy-discussion] How to tell numpy to use gfortran as a compiler ?
Message-ID: <45E04AE2.6020009@ar.media.kyoto-u.ac.jp>

Hi,

I am trying to compile numpy using gfortran, with:

python setup.py config --fcompiler=gnu

But this does not work. Whatever option I try, the numpy build system
uses g77, and as a result, I have problems with my ATLAS library
compiled with gfortran. What should I do to compile numpy with a
blas/lapack compiled with gfortran?

cheers,

David

From robert.kern at gmail.com  Sat Feb 24 10:45:15 2007
From: robert.kern at gmail.com (Robert Kern)
Date: Sat, 24 Feb 2007 09:45:15 -0600
Subject: [Numpy-discussion] How to tell numpy to use gfortran as a compiler ?
In-Reply-To: <45E04AE2.6020009@ar.media.kyoto-u.ac.jp>
References: <45E04AE2.6020009@ar.media.kyoto-u.ac.jp>
Message-ID: <45E05D8B.4070500@gmail.com>

David Cournapeau wrote:
> Hi,
>
> I am trying to compile numpy using gfortran, with:
>
> python setup.py config --fcompiler=gnu
>
> But this does not work. Whatever option I try, the numpy build system
> uses g77, and as a result, I have problems with my ATLAS library
> compiled with gfortran. What should I do to compile numpy with a
> blas/lapack compiled with gfortran?

--fcompiler=gnu95

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From zpincus at stanford.edu  Sat Feb 24 15:24:25 2007
From: zpincus at stanford.edu (Zachary Pincus)
Date: Sat, 24 Feb 2007 12:24:25 -0800
Subject: [Numpy-discussion] Questions about cross-compiling extensions for
	mac-ppc and mac-intel
Message-ID: 

Hi folks,

I've been doing a lot of web-reading on the subject, but have not been
completely able to synthesize all of the disparate bits of advice about
building python extensions as Mac-PPC and Mac-Intel fat binaries, so I'm
turning to the wisdom of this list for a few questions.
My general goal is to make a double-clickable Mac installer of a set of
tools built around numpy, numpy's distutils, a very hacked-up version of
PIL, and some fortran code too. To this end, I need to figure out how to
get the numpy distutils to cross-compile, generating PPC code and Intel
code in separate builds -- and/or generating a universal binary all in
one go. (I'd like to distribute a universal version of numpy, but I
think that my own code needs to be built/distributed separately for each
architecture due to endianness issues.) Is there explicit support in
distutils for this, or is it a matter of setting the proper environment
variables to entice gcc and gfortran to generate code for a specific
architecture?

One problem is that PIL is a tricky beast, even in the neutered form in
which I'm using it. It does a compile-time check for the endianness of
the system, and a compile-time search for the zlib to use, both of which
are problematic.

To address the former, I'd like to be able to (say) include something
like 'config_endian --big' on the 'python setup.py' command-line, and
have that information trickle down to the PIL config script (a few
subpackages deep). Is this easy or possible?

To address the latter, I think I need to have the PIL extensions
dynamically link against
'/Developer/SDKs/MacOSX10.4u.sdk/usr/lib/libz.dylib', which is the
fat-binary version of the library, using the headers from
'/Developer/SDKs/MacOSX10.4u.sdk/usr/include/zlib.h'. Right now, PIL is
using system_info from numpy.distutils to find the valid library paths
on which libz and its headers might live. This is nice and more or less
platform-neutral, which I like. How best should I convince/configure
numpy.distutils.system_info to put
'/Developer/SDKs/MacOSX10.4u.sdk/usr/{lib,include}' on the output of
get_include_dirs() and get_lib_dirs()?

Thanks for any advice or counsel,

Zach Pincus
Program in Biomedical Informatics and Department of Biochemistry
Stanford University School of Medicine

From a.schmolck at gmx.net  Sat Feb 24 20:40:33 2007
From: a.schmolck at gmx.net (Alexander Schmolck)
Date: 25 Feb 2007 01:40:33 +0000
Subject: [Numpy-discussion] Fortran order arrays to and from numpy arrays
In-Reply-To: <45DFFB68.5010303@astraw.com>
References: <45DFFB68.5010303@astraw.com>
Message-ID: 

Andrew Straw writes:

> Alexander Schmolck wrote:
> > 2. Despite this overhead, copying around large arrays (e.g. >=1e5
> >    elements) in the above way causes notable additional overhead.
> >    Whilst I don't think there's a sane way to avoid copying by
> >    sharing data between numpy and matlab, the copying could likely
> >    be done better.
>
> Alex, what do you think about "hybrid arrays"?
>
> http://www.mail-archive.com/numpy-discussion at lists.sourceforge.net/msg03748.html

Oh, that's why I said no *sane* way :)

I read about hybrid arrays, but as far as I can tell the only way to use
them to avoid copying stuff around is to create your own hybrid array
memory pool (as you suggested downthread), which seems a highly
unattractive pain-for-gain trade-off, especially given that you *do*
have to reorder the data as you go from numpy to matlab -- unless of
course I'm missing something. I have the impression that just memcpy'ing
a single chunk of data isn't that much of a performance sink, but the
reordering copying that mlabwrap currently does seems rather expensive.
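To make the cheap direction concrete, here is a minimal sketch (shape
and data invented for illustration):

import numpy

nrows, ncols = 3, 4   # shape of the matlab matrix
# pretend 'flat' was filled by a single memcpy from matlab's
# column-major buffer
flat = numpy.arange(nrows * ncols, dtype=float)
# reshape to (ncols, nrows) in C order, then transpose: no data is
# moved, only the strides change, and a[r, c] == matlab's M(r+1, c+1)
a = flat.reshape((ncols, nrows)).T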
In other words, I think going from matlab to numpy is fine (just memcpy
into a newly created fortran-order numpy array, or more or less
equivalently, memcpy into a C-order array, transpose and reshape); the
question appears to be how to best go from numpy to matlab when the
numpy array isn't fortran-contiguous. I assume that it makes more sense
to rely on some numpy functionality than to use a custom reordering-copy
routine, especially if I want to move to ctypes later. Is there anything
better than

1. allocating a matlab array
2. transposing and reshaping the numpy array
3. allocating (or keeping around) a temporary numpy array with data
   pointing to the matlab array data
4. using some function (PyArray_CopyInto?) to copy from the transposed,
   reshaped numpy array into the temporary numpy array, thereby filling
   the matlab array with an appropriately reordered copy of the original
   array?

cheers,

alex

From a.schmolck at gmx.net  Sat Feb 24 20:44:01 2007
From: a.schmolck at gmx.net (Alexander Schmolck)
Date: 25 Feb 2007 01:44:01 +0000
Subject: [Numpy-discussion] Fortran order arrays to and from numpy arrays
In-Reply-To: 
References: 
Message-ID: 

"Charles R Harris" writes:

> > Unfortunately I don't see an easy way to use the same approach the
> > other way (matlab doesn't seem to offer much on the C level to
> > manipulate arrays), so I'd presumably need something like:
> >
> > stuff_into_matlab_array(a.T.reshape(a.shape).copy())
> >
> > the question is how to avoid doing two copies.
> >
> > Any comments appreciated,
>
> The easiest way to deal with the ordering is to use the order keyword
> in numpy:
>
> In [4]: a = array([0,1,2,3]).reshape((2,2), order='F')
>
> In [5]: a
> Out[5]:
> array([[0, 2],
>        [1, 3]])
>
> You would still need to get access to something to reshape, shared
> memory or something, but the key is that you don't have to reorder the
> elements, you just need the correct strides and offsets to address the
> elements in Fortran order. I have no idea if this works in Numeric.

It doesn't work in Numeric, but that isn't much of an issue because I
think one can get the equivalent effect by transposing and reshaping.
However, the problem is that I *do* need to reorder the elements for
numpy->matlab and I'm not sure how to best do this (without unnecessary
copying and temporary numpy array creation, but using numpy
functionality if possible).

'as

From charlesr.harris at gmail.com  Sun Feb 25 00:30:49 2007
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Sat, 24 Feb 2007 22:30:49 -0700
Subject: [Numpy-discussion] Fortran order arrays to and from numpy arrays
In-Reply-To: 
References: 
Message-ID: 

On 25 Feb 2007 01:44:01 +0000, Alexander Schmolck wrote:
>
> "Charles R Harris" writes:
>
> > > Unfortunately I don't see an easy way to use the same approach the
> > > other way (matlab doesn't seem to offer much on the C level to
> > > manipulate arrays), so I'd presumably need something like:
> > >
> > > stuff_into_matlab_array(a.T.reshape(a.shape).copy())
> > >
> > > the question is how to avoid doing two copies.
> > > Any comments appreciated,
> >
> > The easiest way to deal with the ordering is to use the order keyword
> > in numpy:
> >
> > In [4]: a = array([0,1,2,3]).reshape((2,2), order='F')
> >
> > In [5]: a
> > Out[5]:
> > array([[0, 2],
> >        [1, 3]])
> >
> > You would still need to get access to something to reshape, shared
> > memory or something, but the key is that you don't have to reorder
> > the elements, you just need the correct strides and offsets to
> > address the elements in Fortran order. I have no idea if this works
> > in Numeric.
>
> It doesn't work in Numeric, but that isn't much of an issue because I
> think one can get the equivalent effect by transposing and reshaping.
> However, the problem is that I *do* need to reorder the elements for
> numpy->matlab and I'm not sure how to best do this (without unnecessary
> copying and temporary numpy array creation, but using numpy
> functionality if possible).

I don't see any way to get around a copy, but you can make numpy do the
work. For example:

In [12]: a = array([[0,1],[2,3]])

In [13]: b = array(a, order='f')

In [14]: a.flags
Out[14]:
  C_CONTIGUOUS : True
  F_CONTIGUOUS : False
  OWNDATA : True
  WRITEABLE : True
  ALIGNED : True
  UPDATEIFCOPY : False

In [15]: b.flags
Out[15]:
  C_CONTIGUOUS : False
  F_CONTIGUOUS : True
  OWNDATA : True
  WRITEABLE : True
  ALIGNED : True
  UPDATEIFCOPY : False

F_CONTIGUOUS is what you want. The trick is to somehow use memory in the
construction of the reordered array that is already designated for
matlab. I don't know how to do this, but I think it might be doable.
Travis is your best bet to answer that question.

Chuck

From david at ar.media.kyoto-u.ac.jp  Sun Feb 25 01:41:01 2007
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Sun, 25 Feb 2007 15:41:01 +0900
Subject: [Numpy-discussion] How to tell numpy to use gfortran as a compiler ?
In-Reply-To: <45E05D8B.4070500@gmail.com>
References: <45E04AE2.6020009@ar.media.kyoto-u.ac.jp>
	<45E05D8B.4070500@gmail.com>
Message-ID: <45E12F7D.30702@ar.media.kyoto-u.ac.jp>

Robert Kern wrote:
> David Cournapeau wrote:
>> Hi,
>>
>> I am trying to compile numpy using gfortran, with:
>>
>> python setup.py config --fcompiler=gnu
>>
>> But this does not work. Whatever option I try, the numpy build system
>> uses g77, and as a result, I have problems with my ATLAS library
>> compiled with gfortran. What should I do to compile numpy with a
>> blas/lapack compiled with gfortran?
>
> --fcompiler=gnu95

I also tried this option, without any luck. numpy still uses g77 (which
is gcc 3.4 for fortran on my ubuntu system) instead of gfortran.

But now, I think I misunderstand some things: I thought that g77 was the
3.* version of the fortran compiler, and gfortran the 4.* one. But it
looks like they are also different in the fortran dialects they support
(I know nothing about fortran). So should I use gfortran at all to
compile the fortran wrapper for BLAS/LAPACK by ATLAS? Or should I use
g77? How can I be sure then that I won't have problems using the gcc 3*
series for Fortran and the gcc 4* series for everything else (C, C++ and
most if not all libraries compiled on my system)?

cheers,

David

From sturla at molden.no  Sun Feb 25 09:43:03 2007
From: sturla at molden.no (Sturla Molden)
Date: Sun, 25 Feb 2007 15:43:03 +0100 (CET)
Subject: [Numpy-discussion] How to tell numpy to use gfortran as a compiler ?
In-Reply-To: <45E12F7D.30702@ar.media.kyoto-u.ac.jp>
References: <45E04AE2.6020009@ar.media.kyoto-u.ac.jp>
	<45E05D8B.4070500@gmail.com>
	<45E12F7D.30702@ar.media.kyoto-u.ac.jp>
Message-ID: <4688.89.8.47.113.1172414583.squirrel@webmail.uio.no>

> Robert Kern wrote:
> But now, I think I misunderstand some things: I thought that g77 was
> the 3.* version of the fortran compiler, and gfortran the 4.* one. But
> it looks like they are also different in the fortran dialects they
> support (I know nothing about fortran). So should I use gfortran at
> all to compile the fortran wrapper for BLAS/LAPACK by ATLAS? Or should
> I use g77? How can I be sure then that I won't have problems using the
> gcc 3* series for Fortran and the gcc 4* series for everything else
> (C, C++ and most if not all libraries compiled on my system)?

g77 is a Fortran 77 compiler. The development of g77 is halted.

gfortran is a Fortran 77, 90, and 95 compiler. It is the current Fortran
compiler in the GNU Compiler Collection (GCC).

You can compile the reference implementation of BLAS and LAPACK with
both g77 and gfortran, as these libraries are written in Fortran 77.
ATLAS is written in C and some Fortran 77. gfortran is able to do some
optimizations that g77 cannot, e.g. autovectorization using SSE and MMX
extensions and profile-guided optimizations. Also be aware that if you
use gfortran and GCC 4, the C compiler is better as well.

There is a third Fortran compiler based on the GCC backend called "g95".
It is not a part of GCC and uses copyrighted source code illegally
(numerous GPL violations). The head developer is rumored to have serious
bad karma. Apart from that, g95 is an ok Fortran 95 compiler. Always
ensure that an ambiguous switch like "gnu95" means gfortran and not g95.
I don't know what NumPy does.

From jks at iki.fi  Sun Feb 25 11:44:37 2007
From: jks at iki.fi (Jouni K. Seppänen)
Date: Sun, 25 Feb 2007 18:44:37 +0200
Subject: [Numpy-discussion] Unifying numpy, scipy, and matplotlib
	docstring formats
References: <45D4ED02.1050908@noaa.gov> <45D50245.1080102@noaa.gov>
	<1e2af89e0702160148u6f3b4ce4re11423f4ca9a35ec@mail.gmail.com>
	<1e2af89e0702180346g24f324a7je1f535f5fd6e0b96@mail.gmail.com>
Message-ID: 

"Barry Wark" writes:

> Yes, I agree. I wasn't coming at it so much from the goal of making
> Pylab a Matlab clone (as you point out, that's silly, and misses much
> of the advantage of Python), but rather from the goal of making
> interactive use as efficient as possible. When I fire up ipython
> -pylab to do some quick exploration, it's nice not to have to type
> N.blah or pylab.plot

IMHO the greatest strength of Matlab in interactive use is the matrix
input format. For one thing, it is easier to type something like

  [0 1 0; 1 0 0; 0 0 1]

than

  array([[0,1,0],[1,0,0],[0,0,1]])

Granted, you can often leave out the array() wrapper, but typing all the
commas and brackets and getting the nesting right slows me down enough
that using Python feels like tedious work where Matlab is more like an
Emacs-like extension of the mind.

Another neat feature is auto-flattening: to e.g. add row- and
column-wise sums and a grand total to a matrix M, you can type

  [M sum(M,2); sum(M,1) sum(M(:))]

compared to which the r_[] and c_[] syntax feels like an ugly hack.

(Of course, the auto-flattening feature is a disaster for serious
programming (as opposed to quick interactive work), so Matlab has added
cell arrays which don't auto-flatten, leading to no end of confusion
between [] and {} indexing and the need to add {:} in seemingly random
spots in Matlab code.)
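(Numpy does already get part of the way there, by the way: unless I am
mistaken, numpy.matrix accepts a Matlab-like string,

import numpy
a = numpy.matrix('0 1 0; 1 0 0; 0 0 1')

although that gives you a matrix rather than a plain array.)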
(Of course, the auto-flattening feature is a disaster for serious
programming (as opposed to quick interactive work), so Matlab has added
cell arrays which don't auto-flatten, leading to no end of confusion
between [] and {} indexing and the need to add {:} in seemingly random
spots in Matlab code.)

I suppose these things could be addressed quite neatly by IPython. It
could even modify your history similarly to what it currently does with
the %magic commands, so that when you type

  a = [0 1 0; 1 0 0; 0 0 1]

and then examine your history, you see

  a = array([[0,1,0],[1,0,0],[0,0,1]])

which you could copy-paste into the program you are developing.

Perhaps the namespace issue could also be addressed at the IPython level.
The pylab profile could import the various packages, perhaps with some
kind of abbreviated names, and rewrite commands like

  a = array(...)
  plot(sin(a))

to

  a = numpy.array(...)
  pylab.plot(numpy.sin(a))

so again you could copy/paste from the history to an editor and get
correctly Pythonic code without any "from ... import *". Probably a 100%
solution is quite difficult because of the dynamic features in Python, but
it seems to me that an 80% solution should be feasible. (Parse the input
to an AST using the parser facility in the Python library, use a tree
walker to find all references to functions or variables, and if they don't
exist in locals() or globals() and are not the target of an assignment
anywhere in the AST, replace them by references to the appropriate
package.)

-- 
Jouni K. Seppänen
http://www.iki.fi/jks

From joris at ster.kuleuven.ac.be  Sun Feb 25 17:15:33 2007
From: joris at ster.kuleuven.ac.be (joris at ster.kuleuven.ac.be)
Date: Sun, 25 Feb 2007 23:15:33 +0100
Subject: [Numpy-discussion] the neighbourhood of each element of an array
In-Reply-To: <200702232018.50199.faltet@carabos.com>
References: <1172248699.45df187b7f9f6@webmail.ster.kuleuven.be>
	<200702232018.50199.faltet@carabos.com>
Message-ID: <1172441733.45e20a857395c@webmail.ster.kuleuven.be>

Thanks for all the useful comments. Some feedback about the improved
version of my code snippet: for a 40x40 matrix and d=1 the new version is
44 times faster, and for d=2 it's 27 times faster. For my astronomical
images (typically 2000x2000) the new version saves my day.

# improved version
d = 1
Nrow = x.shape[0]
Ncol = x.shape[1]
s = 2*d+1
y = empty((s*s,Nrow-2*d,Ncol-2*d), dtype=x.dtype)
for i in xrange(-d,d+1):
    for j in xrange(-d,d+1):
        y[(j+d)+s*(i+d)] = x[d+i:Nrow-d+i,d+j:Ncol-d+j]
y = y.swapaxes(0,2).swapaxes(0,1)

The suggestion of Francesc is a really cool one. But combining it with the
suggestion of Bryan does not seem possible in this particular case, as the
swapaxes operations would no longer be possible: the resulting array would
be one with shape (s*s,) containing objects with shape
(Nrow-2*d,Ncol-2*d).

Cheers,
Joris

Quoting Francesc Altet :

> A Divendres 23 Febrer 2007 17:38, joris at ster.kuleuven.ac.be escrigué:
> > Hi,
> >
> > Given a (possibly masked) 2d array x, is there a fast(er) way in Numpy
> > to obtain the same result as the following few lines?
> >
> > d = 1   # neighbourhood 'radius'
> > Nrow = x.shape[0]
> > Ncol = x.shape[1]
> > y = array([[x[i-d:i+d+1,j-d:j+d+1].ravel() for j in range(d,Ncol-d)] \
> >            for i in range(d,Nrow-d)])
> >
> > What you get is an array containing all the elements in a neighbourhood
> > for each element, disregarding the edges to avoid out-of-range problems.
> > The code above becomes quite slow for e.g. a 2000x2000 array. Does
> > anyone know a better approach?
> Well, it seems that copying data here is taking most of the CPU. Perhaps
> you may want to try getting *references* to the original slices instead.
> For example, if rd = 2+d, you can write:
>
> def get_neighbors_views_ravel(x):
>     # The next is for an array of references to *views* of neighbors
>     y = numpy.empty((Nrow-2*d, Ncol-2*d), dtype='object')
>
>     for i in xrange(0, Nrow-2*d):
>         x2 = x[i:i+rd]   # Get a view of the first dimension slice
>         for j in xrange(0, Ncol-2*d):
>             y[i, j] = x2[:,j:j+rd].ravel()
>     return y
>
> which is 1.34x faster (on my machine) than your current approach.
>
> If you want more speed, you may want to not .ravel() at new-array
> creation time (you can always use .ravel() when you are going to use
> the data). The removal of the .ravel() call makes the above function
> 2.56x faster.
>
> Finally, if your machine has an x86 architecture, you can also take
> advantage of Psyco so as to accelerate a bit more. With Psyco and not
> raveling, you can get up to 3x better times than your original approach
> (without using Psyco). Of course, more speed-ups could be possible by
> using Pyrex or any other easy method for doing C-extensions.
>
> I'm attaching a small benchmark, and here are the results for my machine:
>
> ref                time--> 3.021
> views (ravel)      time--> 2.258   speed-up--> 1.34
> views (no ravel)   time--> 1.179   speed-up--> 2.56
>
> and if we use psyco:
>
> ref                time--> 2.368
> views (ravel)      time--> 1.636   speed-up--> 1.45
> views (no ravel)   time--> 0.935   speed-up--> 2.53
>
> Cheers,
>
> --
> >0,0<   Francesc Altet     http://www.carabos.com/
> V V     Cárabos Coop. V.   Enjoy Data
>  "-"

Disclaimer: http://www.kuleuven.be/cwis/email_disclaimer.htm

From fperez.net at gmail.com  Sun Feb 25 17:16:17 2007
From: fperez.net at gmail.com (Fernando Perez)
Date: Sun, 25 Feb 2007 15:16:17 -0700
Subject: [Numpy-discussion] Unifying numpy, scipy, and matplotlib docstring
 formats
In-Reply-To: 
References: <45D4ED02.1050908@noaa.gov> <45D50245.1080102@noaa.gov>
	<1e2af89e0702160148u6f3b4ce4re11423f4ca9a35ec@mail.gmail.com>
	<1e2af89e0702180346g24f324a7je1f535f5fd6e0b96@mail.gmail.com>
Message-ID: 

Hi,

On 2/25/07, Jouni K. Seppänen wrote:
> I suppose these things could be addressed quite neatly by IPython.
> It could even modify your history similarly to what it currently
> does with the %magic commands, so that when you type

Feel free to play with implementing this; it's easy to do so on your
personal setup, since input prefilters can be trivially added by any user.
Once you find a set of tools that you're happy with, just send them my way
and we'll include them officially. Here's some links you may find useful:

http://ipython.scipy.org/doc/manual/node7.html#SECTION00073000000000000000
http://ipython.scipy.org/doc/manual/node11.html

the code for these extensions ships already with ipython, under
IPython/Extensions. Look at the one for quantities with units, it's a good
starting point for what you want to do.

> Perhaps the namespace issue could also be addressed at the IPython
> level. The pylab profile could import the various packages, perhaps
> with some kind of abbreviated names, and rewrite commands like

Ditto.

Regards,

f

From david at ar.media.kyoto-u.ac.jp  Sun Feb 25 21:47:58 2007
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Mon, 26 Feb 2007 11:47:58 +0900
Subject: [Numpy-discussion] How to tell numpy to use gfortran as a
 compiler ?
In-Reply-To: <4688.89.8.47.113.1172414583.squirrel@webmail.uio.no>
References: <45E04AE2.6020009@ar.media.kyoto-u.ac.jp> <45E05D8B.4070500@gmail.com>
	<45E12F7D.30702@ar.media.kyoto-u.ac.jp>
	<4688.89.8.47.113.1172414583.squirrel@webmail.uio.no>
Message-ID: <45E24A5E.9070707@ar.media.kyoto-u.ac.jp>

Sturla Molden wrote:
>
> g77 is a Fortran 77 compiler. The development of g77 is halted.
>
> gfortran is a Fortran 77, 90, and 95 compiler. It is the current Fortran
> compiler in the GNU Compiler Collection (GCC).
>
> You can compile the reference implementation of BLAS and LAPACK with both
> g77 and gfortran, as these libraries are written in Fortran 77. ATLAS is
> written in C and some Fortran 77.
>
> gfortran is able to do some optimizations that g77 cannot, e.g.
> autovectorization using SSE and MMX extensions and profile-guided
> optimizations. Also be aware that if you use gfortran and GCC 4, the C
> compiler is better as well.
>
Ok, that clears things up, thank you. Now, I have to understand why
--fcompiler=gnu95 still calls gfortran....

cheers,

David

From robert.kern at gmail.com  Sun Feb 25 22:00:42 2007
From: robert.kern at gmail.com (Robert Kern)
Date: Sun, 25 Feb 2007 21:00:42 -0600
Subject: [Numpy-discussion] How to tell numpy to use gfortran as a
 compiler ?
In-Reply-To: <45E24A5E.9070707@ar.media.kyoto-u.ac.jp>
References: <45E04AE2.6020009@ar.media.kyoto-u.ac.jp> <45E05D8B.4070500@gmail.com>
	<45E12F7D.30702@ar.media.kyoto-u.ac.jp>
	<4688.89.8.47.113.1172414583.squirrel@webmail.uio.no>
	<45E24A5E.9070707@ar.media.kyoto-u.ac.jp>
Message-ID: <45E24D5A.5080803@gmail.com>

David Cournapeau wrote:
> Sturla Molden wrote:
>> g77 is a Fortran 77 compiler. The development of g77 is halted.
>>
>> gfortran is a Fortran 77, 90, and 95 compiler. It is the current Fortran
>> compiler in the GNU Compiler Collection (GCC).
>>
>> You can compile the reference implementation of BLAS and LAPACK with
>> both g77 and gfortran, as these libraries are written in Fortran 77.
>> ATLAS is written in C and some Fortran 77.
>>
>> gfortran is able to do some optimizations that g77 cannot, e.g.
>> autovectorization using SSE and MMX extensions and profile-guided
>> optimizations. Also be aware that if you use gfortran and GCC 4, the C
>> compiler is better as well.
>>
> Ok, that clears things up, thank you. Now, I have to understand why
> --fcompiler=gnu95 still calls gfortran....

You mean g77? Anyways, I think I know why you are having problems. Passing
--fcompiler to the config command only affects the Fortran compiler that is
used during the configuration phase (where we compile small C programs to
determine what your platform supports, like isnan() and the like). It does
not propagate to the rest of the build_ext phase where you want it. Use
config_fc to set up your Fortran compiler for all of the phases:

  $ python setup.py config_fc --fcompiler=gnu95 build

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From andrew.mcmurry at astro.uio.no  Mon Feb 26 06:13:14 2007
From: andrew.mcmurry at astro.uio.no (Andrew McMurry)
Date: Mon, 26 Feb 2007 12:13:14 +0100
Subject: [Numpy-discussion] NumPy array from ctypes data
Message-ID: <997DB91B-FA63-4F65-84C7-FA57F151B7DE@astro.uio.no>

How can I create a NumPy array which is backed by ctypes data? I.e. create
a NumPy array from a ctypes one without any data copying?
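The behaviour I'm after is something like this sketch (untested -- I
gather numpy.frombuffer accepts anything exposing the buffer protocol,
which ctypes arrays do, and shares the memory instead of copying it):

  import ctypes
  import numpy

  cbuf = (ctypes.c_double * 4)(0.0, 1.0, 2.0, 3.0)  # a ctypes array
  a = numpy.frombuffer(cbuf, dtype=numpy.float64)   # should not copy
  a[0] = 99.0
  print cbuf[0]   # prints 99.0 if the memory is really shared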
That is, changing an element of the NumPy array should alter the
underlying ctypes array.

I have C structures that contain numerical data and would like to access
that data through the NumPy array interface. I have tried the following:

PyObject *make_dbl_array(double *data, int n)
{
    npy_intp dim = n;
    return PyArray_SimpleNewFromData(1, &dim, NPY_DOUBLE, (void *)data);
}

But when called via ctypes (with restype set to ctypes.py_object) it gives
a bus error.

Andrew

From aisaac at american.edu  Mon Feb 26 12:39:12 2007
From: aisaac at american.edu (Alan G Isaac)
Date: Mon, 26 Feb 2007 12:39:12 -0500
Subject: [Numpy-discussion] [Matplotlib-users] Unifying numpy, scipy, and
 matplotlib docstring formats
In-Reply-To: 
References: <45D4ED02.1050908@noaa.gov> <45D50245.1080102@noaa.gov>
	<1e2af89e0702160148u6f3b4ce4re11423f4ca9a35ec@mail.gmail.com>
	<1e2af89e0702180346g24f324a7je1f535f5fd6e0b96@mail.gmail.com>
Message-ID: 

On Sun, 25 Feb 2007, Jouni K. Seppänen apparently wrote:
> it is easier to type something like
> [0 1 0; 1 0 0; 0 0 1]
> than
> array([[0,1,0],[1,0,0],[0,0,1]])

x = numpy.mat('0 1 0; 1 0 0; 0 0 1').A

hth,
Alan Isaac

From Chris.Barker at noaa.gov  Mon Feb 26 14:05:11 2007
From: Chris.Barker at noaa.gov (Christopher Barker)
Date: Mon, 26 Feb 2007 11:05:11 -0800
Subject: [Numpy-discussion] Questions about cross-compiling extensions for
 mac-ppc and mac-intel
In-Reply-To: 
References: 
Message-ID: <45E32F67.70809@noaa.gov>

Zachary Pincus wrote:
> building python extensions as Mac-PPC and Mac-Intel fat
> binaries, so I'm turning to the wisdom of this list for a few questions.

I'd try the pythonmac list too -- there are folks there that actually
understand all this!

> My general goal is to make a double-clickable Mac installer of a set
> of tools built around numpy, numpy's distutils, a very hacked-up
> version of PIL, and some fortran code too. To this end, I need to
> figure out how to get the numpy distutils to cross-compile,
> generating PPC code and Intel code in separate builds -- and/or
> generating a universal binary all in one go. (I'd like to distribute
> a universal version of numpy, but I think that my own code needs to
> be built/distributed separately for each architecture due to endian-
> ness issues.)

hmm -- maybe you'd be better off dealing with the endian issues in your
code -- i.e. dealing with it at runtime, rather than compile time.

> Is there explicit support in distutils for this, or is it a matter of
> setting the proper environment variables to entice gcc and gfortran
> to generate code for a specific architecture?

I'm no expert, but the glory of distutils is that it will, by default,
build extensions the same way as python itself was built. So if you use a
PPC python, you'll get PPC extensions, same with Intel, and if you use a
Universal Python, you'll get a Universal extension.

The trick is that while you can build Universal on either platform, you
can't use this trick to build Intel extensions on a PPC mac, as the Python
would have to be Intel, and a PPC mac won't run an Intel Python. It may be
possible to run a PPC Python on an Intel Mac with Rosetta, though.

In any case, Universal is probably the best bet except for your Fortran
code -- no one has made a Fortran compiler that can do Universal. You may
be able to build the two parts independently and use lipo to put them
together, however. Googling this list and the pythonmac one should get you
some discussion of this, but AFAIK, no one has done it yet.
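The lipo step itself would be something like this (untested sketch; the
file names are made up, and the hard part is producing the two
single-architecture builds in the first place):

  $ lipo -create build-ppc/foo.so build-i386/foo.so -output foo.so
  $ lipo -info foo.so   # check that both architectures are present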
If you do need to have your Fortran stuff separate, you can still make the rest of it Universal > One problem is that PIL is a tricky beast, even in the neutered form > that I'm using it. It does a compile-time check for the endian-ness > of the system, and a compile-time search for the zlib to use, both of > which are problematic. Well, I know it's possible to build Universal. There are binaries on pythonmac.org/packages. The folks on the pythonmac list should be able to tell you how. ( zlib is included with OS-X, so that shouldn't be an issue) > To address the former, I'd like to be able to (say) include something > like 'config_endian --big' on the 'python setup.py' command-line, and > have that information trickle down to the PIL config script (a few > subpackages deep). Is this easy or possible? I doubt it, but there has got to be a way to tie endianess to platform. You'd want the Intel code built one way, and the PPC code another. I think distutils may take care of this for you. Good luck! And if you find a way to build a universal Fortran extension -- be sure to let us know! -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From robert.kern at gmail.com Mon Feb 26 14:38:49 2007 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 26 Feb 2007 13:38:49 -0600 Subject: [Numpy-discussion] Questions about cross-compiling extensions for mac-ppc and mac-intel In-Reply-To: <45E32F67.70809@noaa.gov> References: <45E32F67.70809@noaa.gov> Message-ID: <45E33749.1080604@gmail.com> Christopher Barker wrote: > I'm no expert, but the glory of distutils is that it will, by default > build extensions the same way as python itself was built. So if you use > a PPC python, you'll get PPC extensions, same with Intel, and if you use > a Universal Python, you'll get a Universal extension. There is a little wrinkle in that numpy configures itself by compiling and running small C programs to determine what is supported on its platform. When building on an Intel machine even with a Universal Python, the results of that configuration will only be for the system it was compiled on. Thus, even though Universal binaries built on 10.4 systems would usually work on 10.3.9, numpy doesn't. > The trick is that while you can build Universal on either platform, you > can't use this trick to build Intel extensions on a PPC mac, as the > Python would have to be intel, and a PPC mac won't run an Intel Python. > It may be possible to run a PPC Python on an Intel Mac with Rosettta, > though. > > In any case, Universal is probably the best bet except for your Fortran > code - non one has made a Fortan compiler that can do Universal. The R folks have a package containing gcc 4.0.3 with gfortran that looks like it might be Universal. I haven't tried to build scipy with it, yet. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." 
-- Umberto Eco From robert.kern at gmail.com Mon Feb 26 14:43:09 2007 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 26 Feb 2007 13:43:09 -0600 Subject: [Numpy-discussion] Questions about cross-compiling extensions for mac-ppc and mac-intel In-Reply-To: References: Message-ID: <45E3384D.5080005@gmail.com> Zachary Pincus wrote: > To address the former, I'd like to be able to (say) include something > like 'config_endian --big' on the 'python setup.py' command-line, and > have that information trickle down to the PIL config script (a few > subpackages deep). Is this easy or possible? I'd just do separate builds on PPC and Intel machines. > To address the latter, I think I need to have the PIL extensions > dynamically link against '/Developer/SDKs/MacOSX10.4u.sdk/usr/lib/ > libz.dylib' which is the fat-binary version of the library, using the > headers from '/Developer/SDKs/MacOSX10.4u.sdk/usr/include/zlib.h > '. Right now, PIL is using system_info from numpy.distutils to find > the valid library paths on which libz and its headers might live. > This is nice and more or less platform-neutral, which I like. How > best should I convince/configure numpy.distutils.system_info to put '/ > Developer/SDKs/MacOSX10.4u.sdk/usr/{lib,include}' on the output to > get_include_dirs() and get_lib_dirs()? distutils ought to be including an argument like this: -isysroot /Developer/SDKs/MacOSX10.4u.sdk That ought to be sufficient for building PIL. Don't hack up PIL's build process if at all possible. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From Chris.Barker at noaa.gov Mon Feb 26 14:49:53 2007 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Mon, 26 Feb 2007 11:49:53 -0800 Subject: [Numpy-discussion] Questions about cross-compiling extensions for mac-ppc and mac-intel In-Reply-To: <45E33749.1080604@gmail.com> References: <45E32F67.70809@noaa.gov> <45E33749.1080604@gmail.com> Message-ID: <45E339E1.2040803@noaa.gov> Robert Kern wrote: > even though > Universal binaries built on 10.4 systems would usually work on 10.3.9, numpy > doesn't. Darn, but I for one, can live without 10.3.9 support -- it does build Universal properly for 10.4 doesn't it? > The R folks have a package containing gcc 4.0.3 with gfortran that looks like it > might be Universal. I haven't tried to build scipy with it, yet. cool! -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From fred at ucar.edu Mon Feb 26 14:51:42 2007 From: fred at ucar.edu (Fred Clare) Date: Mon, 26 Feb 2007 12:51:42 -0700 Subject: [Numpy-discussion] Advice on masked array implementation Message-ID: <3805FA8C-E28F-4C2B-BCBB-627396418F77@ucar.edu> We would like some advice on how to proceed with implementing masked array capabilities in a large package of climate-related analysis functions. We are in the initial stages of trying to duplicate functionality from an existing package written in a locally-developed scripting language. The existing functionality depends heavily on masked arrays (actually on variables with attributes, with "fill_value" being one such). We have polled our user base and, while all responders plan to convert to NumPy, many have not started the conversion or are in transition. 
It is our experience that converting users to new ways is usually a multi-year undertaking, so it seems that we may need to support both Numeric and NumPy installations for some time to come. Even if people have converted to the NumPy version of our package, they may still be importing packages that have not been converted. Our initial design attempts to deal with the possibility of users potentially mixing (or using exclusively) Numeric arrays, Numeric masked arrays, NumPy arrays, and NumPy masked arrays. For example, suppose you have a function: result = func(arg0, arg1) where the two arguments and return variable can be any one of the four types of arrays mentioned. Currently we are testing to see if either argument is a NumPy or Numeric masked array. If just Numeric masked arrays, then we return a Numeric masked array. If just NumPy masked arrays, then we return a NumPy masked array. If one is a Numeric masked array and the other is a NumPy masked array, then we return a NumPy masked array. Similar checking is done for using just Numeric and/or NumPy non-masked arrays. Does this seem like a reasonable approach? It is tempting just to go with NumPy, but then we will have a large class of users who cannot access the new functionality. We have followed the discussion on the development of the new maskedarray module, but have not used it. I went to http://projects.scipy.org/scipy/numpy/attachment/wiki/MaskedArray/ maskedarray.py as referenced in a posting from Pierre GM, but I got "Internal Error." Has there been any decision as to whether maskedarray will be in NumPy version 1.1? Any estimate as to when 1.1 would be out? If we commit to numpy.core.ma now, how much trouble will it be to convert to the new maskedarray? Is there any user documentation on maskedarray and details on the differences between it and numpy.core.ma? Thanks, Fred Clare From robert.kern at gmail.com Mon Feb 26 14:52:59 2007 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 26 Feb 2007 13:52:59 -0600 Subject: [Numpy-discussion] Questions about cross-compiling extensions for mac-ppc and mac-intel In-Reply-To: <45E339E1.2040803@noaa.gov> References: <45E32F67.70809@noaa.gov> <45E33749.1080604@gmail.com> <45E339E1.2040803@noaa.gov> Message-ID: <45E33A9B.8070202@gmail.com> Christopher Barker wrote: > Robert Kern wrote: >> even though >> Universal binaries built on 10.4 systems would usually work on 10.3.9, numpy >> doesn't. > > Darn, but I for one, can live without 10.3.9 support -- it does build > Universal properly for 10.4 doesn't it? I've never tested it. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From pgmdevlist at gmail.com Mon Feb 26 15:42:58 2007 From: pgmdevlist at gmail.com (Pierre GM) Date: Mon, 26 Feb 2007 15:42:58 -0500 Subject: [Numpy-discussion] Advice on masked array implementation In-Reply-To: <3805FA8C-E28F-4C2B-BCBB-627396418F77@ucar.edu> References: <3805FA8C-E28F-4C2B-BCBB-627396418F77@ucar.edu> Message-ID: <200702261542.58719.pgmdevlist@gmail.com> On Monday 26 February 2007 14:51:42 Fred Clare wrote: > We would like some advice on how to proceed with implementing > masked array capabilities in a large package of climate-related > analysis functions. Sounds great ! I'm working on the same field basically, and I needed MaskedArrays to deal with missing values in environmental series. But we can chat about that off-list. 
> If one is a Numeric masked array and the other > is a NumPy masked array, then we return a NumPy masked array. > Similar checking is done for using just Numeric and/or NumPy > non-masked arrays. Does this seem like a reasonable approach? > We have followed the discussion on the development of the new > maskedarray module, but have not used it. I went to > > http://projects.scipy.org/scipy/numpy/attachment/wiki/MaskedArray/ > maskedarray.py > > as referenced in a posting from Pierre GM, but I got "Internal Error." Yes, I had to take the package off the projects.scipy.org site when I got write access to the svn server, as that particular version was really outdated. You can find the latest version on the scipy svn server, in the sandbox: http://svn.scipy.org/svn/scipy/trunk/Lib/sandbox/maskedarray/ Note that I made some major updates a couple of weeks ago, without advertising them. A description is available at: http://svn.scipy.org/svn/scipy/trunk/Lib/sandbox/maskedarray/CHANGELOG > Has there been any decision as to whether maskedarray will be > in NumPy version 1.1? The decision is out of my hands. My understanding is that before the new implementation can be taken seriously, more feedback is needed from actual users (and I fully agree with that). Moreover, there are some vague plans about porting it to C. My naive initial attempts with Pyrex having failed dramatically, I will have to learn C, so it probably won't happen in the very next weeks... But porting to C should solve some minor issues I'm unhappy with for now, and can't implement in python without significantly degrading the performances. > Any estimate as to when 1.1 would be out? > If we commit to numpy.core.ma now, how much trouble will it be > to convert to the new maskedarray? It shouldn't be that a problem. Normally, the following should work (even if some warnings are raised) >>> import numpy.core.ma as ma >>> import maskedarray as MA >>> x = ma.array([1,2,3,4,5], mask=[1,0,1,0,0]) >>> x array(data = [999999 2 3 999999 5], mask = [ True False False True False], fill_value=999999) >>> X = MA.array(x) >>> X masked_array(data = [-- 2 3 -- 5], mask = [ True False False True False], fill_value=999999) That is, maskedarray.MaskedArrays recognize numpy.core.ma.maskedarray I tried to keep as much backward compatibility as I could, but without really testing it, so no guarantee. > Is there any user documentation > on maskedarray and details on the differences between it and > numpy.core.ma? Not at this point, unfortunately. Note that the "new" implementation follows very closely Paul Dubois' initial code. (In fact, a bit too closely for its own good. Reggie Dugard suggested some modifications I tried to take into account in the latest version that seem to solve that). Therefore, switching from numpy.core.ma to maskedarray should be relatively painless. I'd be more than happy to help you on that. Basically the main differences between the two implementations are: - MaskedArray are regular subclasses of ndarray, so you can use asanyarray without losing your mask. - Subclassing MaskedArray is far easier with the new implementation than it was with numpy.core.ma - the fill_value attribute is now a property - the _data attribute is now a view of the MaskedArray, instead of an independent object. - the underlying _data can be any subclass of ndarray (such as matrix) - some of the MaskedArray methods (ravel, transpose...) are implemented through wrappers that must have a __get__ method. 
That works well w/ Python2.4, I'm not sure it would work w/ 2.3. - some methods that were not available in numpy.core.ma are now in maskedarray, either .core or .extras - there's a prototype of MaskedRecords objects, that gives the possibility to mask specific fields in a recarray. All in all, I think that the new implementation gets rid of some of the limitations of numpy.core.ma, without affecting too badly performances. The latest test showed that yes, maskedarray is slightly slower than numpy.core.ma (10%), but it provides more functionality: for example, you can prevent a mask to be overwritten, it is very easy to subclass, it interacts nicely with ndarray... I'm using the new implementation systematically for my own projects (which explains why there are regularly some tweakings to the implementation), and Matt Knox and I have been using it for our common TimeSeries project without any difficulty so far. Once again, please do not hesitate to contact me on or off-list if you have any questions/comments/requests. From david at ar.media.kyoto-u.ac.jp Tue Feb 27 02:00:52 2007 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Tue, 27 Feb 2007 16:00:52 +0900 Subject: [Numpy-discussion] How to tell numpy to use gfortran as a compiler ? In-Reply-To: <45E24D5A.5080803@gmail.com> References: <45E04AE2.6020009@ar.media.kyoto-u.ac.jp> <45E05D8B.4070500@gmail.com> <45E12F7D.30702@ar.media.kyoto-u.ac.jp> <4688.89.8.47.113.1172414583.squirrel@webmail.uio.no> <45E24A5E.9070707@ar.media.kyoto-u.ac.jp> <45E24D5A.5080803@gmail.com> Message-ID: <45E3D724.4060301@ar.media.kyoto-u.ac.jp> Robert Kern wrote: > David Cournapeau wrote: > >> Sturla Molden wrote: >> >>> g77 is a Fortran 77 compiler. The development of g77 is halted. >>> >>> gfortran is a Fortran 77, 90, and 95 compiler. It is the current Fortran >>> compiler in the GNU Compiler Collection (GCC). >>> >>> >>> You can compile the reference implementation of BLAS and LAPACK with both >>> g77 and gfortran, as these libraries are written in Fortran 77. ATLAS is >>> written in C and some Fortran 77. >>> >>> gfortran are able to do some optimizations that g77 cannot, e.g. >>> autovectorization using SSE and MMX extensions and profile-guided >>> optimizations. Also be aware that if you use gfortran and GCC 4, the C >>> compiler is better as well. >>> >>> >> Ok, that clears things you, thank you. Now, I have to understand why >> --fcompiler=gnu95 still calls gfortran.... >> > > You mean g77? Anyways, I think I know why you are having problems. Passing > --fcompiler to the config command only affects the Fortran compiler that is used > during configuration phase (where we compile small C programs to determine what > your platform supports, like isnan() and the like). It does not propagate to the > rest of the build_ext phase where you want it. Use config_fc to set up your > Fortran compiler for all of the phases: > > $ python setup.py config_fc --fcompiler=gnu95 build > Thanks, that indeed was the cause of my problem. cheers, David From david at ar.media.kyoto-u.ac.jp Tue Feb 27 02:04:23 2007 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Tue, 27 Feb 2007 16:04:23 +0900 Subject: [Numpy-discussion] nan, warning and error modes Message-ID: <45E3D7F7.6040500@ar.media.kyoto-u.ac.jp> Hi, I am developing some numpy code, which sometimes fail because of nan. This is likely to be due to some bad coding on my side, and as such any NaN is a bug for this particular piece of code. 
Is there a way to get a warning when the first Nan is detected in the code (or even a faulty assertion) ? It looks like there are some variables/functions related to that in numpy, but I didn't find any useful document on the matter, either in the numpy book nor on the scipy website. cheers, David From robert.kern at gmail.com Tue Feb 27 02:16:46 2007 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 27 Feb 2007 01:16:46 -0600 Subject: [Numpy-discussion] nan, warning and error modes In-Reply-To: <45E3D7F7.6040500@ar.media.kyoto-u.ac.jp> References: <45E3D7F7.6040500@ar.media.kyoto-u.ac.jp> Message-ID: <45E3DADE.8000504@gmail.com> David Cournapeau wrote: > Hi, > > I am developing some numpy code, which sometimes fail because of > nan. This is likely to be due to some bad coding on my side, and as such > any NaN is a bug for this particular piece of code. > Is there a way to get a warning when the first Nan is detected in > the code (or even a faulty assertion) ? It looks like there are some > variables/functions related to that in numpy, but I didn't find any > useful document on the matter, either in the numpy book nor on the scipy > website. seterr(invalid='warning') Described in section "9.1.4 Error Handling" of the _Guide to NumPy_ (dated 2006-12-07). -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From robert.kern at gmail.com Tue Feb 27 02:18:12 2007 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 27 Feb 2007 01:18:12 -0600 Subject: [Numpy-discussion] nan, warning and error modes In-Reply-To: <45E3DADE.8000504@gmail.com> References: <45E3D7F7.6040500@ar.media.kyoto-u.ac.jp> <45E3DADE.8000504@gmail.com> Message-ID: <3d375d730702262318l23810b83k6e32d795dac6f4f4@mail.gmail.com> On 2/27/07, Robert Kern wrote: > David Cournapeau wrote: > > Hi, > > > > I am developing some numpy code, which sometimes fail because of > > nan. This is likely to be due to some bad coding on my side, and as such > > any NaN is a bug for this particular piece of code. > > Is there a way to get a warning when the first Nan is detected in > > the code (or even a faulty assertion) ? It looks like there are some > > variables/functions related to that in numpy, but I didn't find any > > useful document on the matter, either in the numpy book nor on the scipy > > website. > > seterr(invalid='warning') Ahem. seterr(invalid='warn') -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From david at ar.media.kyoto-u.ac.jp Tue Feb 27 02:32:36 2007 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Tue, 27 Feb 2007 16:32:36 +0900 Subject: [Numpy-discussion] nan, warning and error modes In-Reply-To: <45E3DADE.8000504@gmail.com> References: <45E3D7F7.6040500@ar.media.kyoto-u.ac.jp> <45E3DADE.8000504@gmail.com> Message-ID: <45E3DE94.3010700@ar.media.kyoto-u.ac.jp> Robert Kern wrote: > David Cournapeau wrote: > >> Hi, >> >> I am developing some numpy code, which sometimes fail because of >> nan. This is likely to be due to some bad coding on my side, and as such >> any NaN is a bug for this particular piece of code. >> Is there a way to get a warning when the first Nan is detected in >> the code (or even a faulty assertion) ? 
>> It looks like there are some variables/functions related to that in
>> numpy, but I didn't find any useful document on the matter, either in
>> the numpy book nor on the scipy website.
>
> seterr(invalid='warning')
>
> Described in section "9.1.4 Error Handling" of the _Guide to NumPy_
> (dated 2006-12-07).

Well, my bad: even if I don't have this exact version, the section is
present in my own version.... I looked at the wrong keywords, thank you,

David

From alexandre.fayolle at logilab.fr  Tue Feb 27 04:50:43 2007
From: alexandre.fayolle at logilab.fr (Alexandre Fayolle)
Date: Tue, 27 Feb 2007 10:50:43 +0100
Subject: [Numpy-discussion] Unifying numpy, scipy, and matplotlib docstring
 formats
In-Reply-To: 
References: <45D4ED02.1050908@noaa.gov> <45D50245.1080102@noaa.gov>
	<1e2af89e0702160148u6f3b4ce4re11423f4ca9a35ec@mail.gmail.com>
	<1e2af89e0702180346g24f324a7je1f535f5fd6e0b96@mail.gmail.com>
Message-ID: <20070227095043.GB26292@crater.logilab.fr>

On Sun, Feb 25, 2007 at 06:44:37PM +0200, Jouni K. Seppänen wrote:
> "Barry Wark" writes:
>
> > Yes, I agree. I wasn't coming at it so much from the goal of making
> > Pylab a Matlab clone (as you point out, that's silly, and misses much
> > of the advantage of Python), but rather from the goal of making
> > interactive use as efficient as possible. When I fire up ipython
> > -pylab to do some quick exploration, it's nice not to have to type
> > N.blah or pylab.plot
>
> IMHO the greatest strength of Matlab in interactive use is the matrix
> input format. For one thing, it is easier to type something like
>
> [0 1 0; 1 0 0; 0 0 1]
>
> than
>
> array([[0,1,0],[1,0,0],[0,0,1]])

A very nice shortcut in my opinion is the one supported by the HP
scientific calculators (HP28, HP48 in my days), which would look like:

array([[0,1,0], 1, 0, 0, 0, 0, 1])

I'm not quite sure how this could be generalized nicely for arbitrary
shapes, but for arrays of rank 2 (which are a very common case where
people are actually typing the values) it feels nice (especially on a
French keyboard where the square brackets are awkward to type in)

-- 
Alexandre Fayolle                             LOGILAB, Paris (France)
Formations Python, Zope, Plone, Debian:  http://www.logilab.fr/formations
Développement logiciel sur mesure:       http://www.logilab.fr/services
Informatique scientifique:               http://www.logilab.fr/science
Reprise et maintenance de sites CPS:     http://www.migration-cms.com/

From acorriga at gmu.edu  Tue Feb 27 13:26:30 2007
From: acorriga at gmu.edu (Andrew Corrigan)
Date: Tue, 27 Feb 2007 13:26:30 -0500
Subject: [Numpy-discussion] inconsistent mgrid results
Message-ID: <45E477D6.6000206@gmu.edu>

I'm confused about the following:

>>> print mgrid[2.45:2.6:0.05, 0:5:1]
[[[ 2.45  2.45  2.45  2.45  2.45]
  [ 2.5   2.5   2.5   2.5   2.5 ]]

 [[ 0.    1.    2.    3.    4.  ]
  [ 0.    1.    2.    3.    4.  ]]]
>>> print mgrid[2.45:2.6:0.05]
[ 2.45  2.5   2.55]

In the first case in the first dimension I get 2.45, 2.5. In the second
case in the first dimension I get 2.45, 2.5, 2.55. In both cases I'm using
2.45:2.6:0.05 to specify the grid in the first dimension. Why is there a
difference? Am I using mgrid incorrectly? I'm using the latest numpy
release from sourceforge.
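For now I can work around it with mgrid's complex-step form, which takes a
point count instead of a floating-point step (the endpoints then become
inclusive), e.g.:

>>> print mgrid[2.45:2.55:3j]
[ 2.45  2.5   2.55]

so I'm guessing the 1-d and n-d code paths round (stop - start)/step
differently when the step doesn't divide the range exactly.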
Thanks,
Andrew

From rudolf.sykora at gmail.com  Tue Feb 27 14:53:57 2007
From: rudolf.sykora at gmail.com (Rudolf Sykora)
Date: Tue, 27 Feb 2007 20:53:57 +0100
Subject: [Numpy-discussion] building an array using smaller ones
Message-ID: 

Hello everybody,

I wonder how I could most easily accomplish the following: Say I have
something like:

a = array( [1, 2] )

and I want to use this array to build another array in the following
sense:

b = array( [[1, 2, 3, a], [5, a, 6, 7], [0, 2-a, 3, 4]] )  # this doesn't work

I would like to obtain

b = array( [[1, 2, 3, 1, 2], [5, 1, 2, 6, 7], [0, 1, 0, 3, 4]] )

I know a rather complicated way but believe there must be an easy one.
Thank you very much.

Ruda

From a.schmolck at gmx.net  Tue Feb 27 16:35:53 2007
From: a.schmolck at gmx.net (Alexander Schmolck)
Date: 27 Feb 2007 21:35:53 +0000
Subject: [Numpy-discussion] [ANN] mlabwrap-1.0b: a high level python to
 matlab bridge
Message-ID: 

URL
---

Description
-----------

Mlabwrap-1.0 is a high-level python to matlab(tm) bridge that makes
calling matlab functions from python almost as convenient as using a
normal python library. It is available under a very liberal license
(BSD/MIT) and should work on all major platforms and (non-ancient) python
and matlab versions and either numpy or Numeric (Numeric support will be
dropped in the future).

News
----

version 1.0b brings python 2.5 compatibility and various small fixes
(improved error handling for 7.3, improvements to setup.py etc.). Provided
I don't get any bug reports within the next two weeks or so, the only
difference between this version and 1.0 final will be cheeseshop support.

Since mlabwrap 1.0 will be the last version that offers Numeric support,
anyone who wants to put off the switch to numpy a bit longer and is
interested in using mlabwrap is strongly encouraged to download, test and
possibly submit a bug report now.

Examples
--------

Creating a simple line plot:

>>> from mlabwrap import mlab; mlab.plot([1,2,3],'-o')

Creating a surface plot:

>>> from mlabwrap import mlab; from numpy import *
>>> xx = arange(-2*pi, 2*pi, 0.2)
>>> mlab.surf(subtract.outer(sin(xx),cos(xx)))

Creating a neural network and training it on the xor problem (requires
netlab):

>>> net = mlab.mlp(2,3,1,'logistic')
>>> net = mlab.mlptrain(net, [[1,1], [0,0], [1,0], [0,1]], [0,0,1,1], 1000)

Thanks go to Taylor Berg and many others who tested and provided feedback
for the upcoming 1.0 release; more acknowledgements can be found on the
website.

cheers,

'as

From oliphant at ee.byu.edu  Tue Feb 27 18:29:51 2007
From: oliphant at ee.byu.edu (Travis Oliphant)
Date: Tue, 27 Feb 2007 16:29:51 -0700
Subject: [Numpy-discussion] Buffer Interface for Python 3.0
Message-ID: <45E4BEEF.4030609@ee.byu.edu>

An update for those of you who did not get the chance to come to PyCon.
PyCon was very well attended this year and there were some excellent
discussions and presentations.

From PyCon I learned that Python 3000 is closer than I had previously
thought. What this means for me is that I am now focusing on getting a
re-vamped buffer interface into Python 3.0. This will help us interact
with the Python developers more effectively. Once we have the buffer
interface hammered out for Python 3.0 we can back-port the result to
Python 2.6. Thus my buffer PEP is being re-vamped. Anybody who would like
to comment or contribute to the design of the new buffer interface is
welcome to voice their opinion.
Basically, what we are going to do now is:

1) Return the data-format specification in an extended struct-style string
2) Return the shape information in a tuple of lists: (shape, strides)

There are three questions I'm grappling with right now:

1) Do we propose the inclusion of offsets in the shape information? NumPy
does not use offsets internally but simply has a pointer to the start of
the array.

2) The buffer interface needs to understand the idea of discontiguous
arrays. If the shape/stride information is separate from the
pointer-to-data call, then the user needs to know if that pointer-to-data
is a "contiguous chunk" or just the beginning of a strided memory area
(and so should not be treated as a single segment).

3) If we support strided memory areas, then we should probably also allow
some way for PIL-like objects to report their buffer sequence (I'm sure
this was the origin of the multi-segment buffer protocol to begin with).
Or we could just ignore that possibility. The PIL would have to copy
memory in order to share its images.

Anybody with ideas is welcome to participate. What I have so far is at

http://wiki.python.org/moin/ArrayInterface

Thanks,

-Travis

From oliphant at ee.byu.edu  Tue Feb 27 20:06:17 2007
From: oliphant at ee.byu.edu (Travis Oliphant)
Date: Tue, 27 Feb 2007 18:06:17 -0700
Subject: [Numpy-discussion] Draft PEP for the new buffer interface to be in
 Python 3000
Message-ID: <45E4D589.8080804@ee.byu.edu>

PEP: 
Title: Revising the buffer protocol
Version: $Revision: $
Last-Modified: $Date: $
Author: Travis Oliphant 
Status: Draft
Type: Standards Track
Created: 28-Aug-2006
Python-Version: 3000

Abstract

    This PEP proposes re-designing the buffer API (PyBufferProcs
    function pointers) to improve the way Python allows memory sharing
    in Python 3.0. In particular, it is proposed that the
    multiple-segment and character buffer portions of the buffer API be
    eliminated and additional function pointers be provided to allow
    sharing any multi-dimensional nature of the memory and what
    data-format the memory contains.

Rationale

    The buffer protocol allows different Python types to exchange a
    pointer to a sequence of internal buffers. This functionality is
    *extremely* useful for sharing large segments of memory between
    different high-level objects, but it's too limited and has issues:

    1. There is the little (never?) used "sequence-of-segments" option
       (bf_getsegcount)

    2. There is the apparently redundant character-buffer option
       (bf_getcharbuffer)

    3. There is no way for a consumer to tell the buffer-API-exporting
       object it is "finished" with its view of the memory and
       therefore no way for the exporting object to be sure that it is
       safe to reallocate the pointer to the memory that it owns (the
       array object reallocating its memory after sharing it with the
       buffer object which held the original pointer led to the
       infamous buffer-object problem).

    4. Memory is just a pointer with a length. There is no way to
       describe what's "in" the memory (float, int, C-structure, etc.)

    5. There is no shape information provided for the memory. But,
       several array-like Python types could make use of a standard way
       to describe the shape-interpretation of the memory (wxPython,
       GTK, pyQT, CVXOPT, PyVox, Audio and Video Libraries, ctypes,
       NumPy, data-base interfaces, etc.)

    There are two widely used libraries that use the concept of
    discontiguous memory: PIL and NumPy. Their view of discontiguous
    arrays is a bit different, though.
    NumPy uses the notion of constant striding in each dimension as its
    basic concept of an array. In this way a simple sub-region of a
    larger array can be described without copying the data. Strided
    memory is a common way to describe data to many computing libraries
    (such as the BLAS and LAPACK).

    The PIL uses a more opaque memory representation. Sometimes an
    image is contained in a contiguous segment of memory, but sometimes
    it is contained in an array of pointers to the contiguous segments
    (usually lines) of the image. This allows the image to not be
    loaded entirely into memory. The PIL is where the idea of multiple
    buffer segments in the original buffer interface came from, I
    believe.

    The buffer interface should allow discontiguous memory areas to
    share standard striding information. However, consumers that do not
    want to deal with strided memory should also be able to request a
    contiguous segment easily.

Proposal Overview

    * Eliminate the char-buffer and multiple-segment sections of the
      buffer-protocol.

    * Unify the read/write versions of getting the buffer.

    * Add a new function to the protocol that should be called when the
      consumer object is "done" with the view.

    * Add a new function to allow the protocol to describe what is in
      memory (unifying what is currently done in struct and array)

    * Add a new function to allow the protocol to share shape
      information

    * Fix all objects in the core and standard library to conform to
      the new interface

    * Extend the struct module to handle more format specifiers

Specification

    Change the PyBufferProcs structure to

    typedef struct {
        getbufferproc bf_getbuffer;
        releasebufferproc bf_releasebuffer;
        formatbufferproc bf_getbufferformat;
        shapebufferproc bf_getbuffershape;
    } PyBufferProcs;

    typedef PyObject *(*getbufferproc)(PyObject *obj, void **buf,
                                       Py_ssize_t *len, int requires)

    Return a pointer to memory in buf and the length of that memory
    buffer in len. Requirements for the memory are provided in requires
    (PYBUFFER_WRITE, PYBUFFER_ONESEGMENT). NULL is returned and an
    error raised if the object cannot return a view with those
    requirements. Otherwise, an object-specific "view" object is
    returned (which can just be a borrowed reference to obj). This view
    object should be used in the other API calls and does not need to
    be decref'd. It should be "released" if the interface exporter
    provides the bf_releasebuffer function.

    typedef int (*releasebufferproc)(PyObject *view)

    This function is called when a view of memory previously acquired
    from the object is no longer needed. It is up to the exporter of
    the API to make sure all views have been released before
    eliminating a reference to a previously returned pointer. It is up
    to consumers of the API to call this function on the object whose
    view is obtained when it is no longer needed. A -1 is returned on
    error and 0 on success.

    typedef char *(*formatbufferproc)(PyObject *view, int *itemsize)

    Get the format-string of the memory using the struct-module string
    syntax (see below for proposed additions to that syntax). Also,
    there is never an alignment assumption in this string---the full
    byte-layout is always required. If the implied size of this string
    is smaller than the length of the buffer then it is assumed that
    the string is repeated.

    If itemsize is not NULL, then return the size implied by the format
    string. This could be the entire length of the buffer or just the
    length of each element. It is equivalent to

        *itemsize = PyObject_SizeFromFormat(ret)

    if ret is the returned string.
    However, very often objects already know the itemsize without
    having to compute it separately.

    typedef PyObject *(*shapebufferproc)(PyObject *view)

    Return a 2-tuple of lists containing shape information: (shape,
    strides). The strides object can be None if the memory is C-style
    contiguous; otherwise it provides the striding in each dimension.

    All of these routines are optional for a type object (but the last
    three make no sense unless the first one is implemented).

    New C-API calls are proposed:

    int PyObject_CheckBuffer(PyObject *obj)

    Return 1 if the getbuffer function is available, otherwise 0.

    PyObject *PyObject_GetBuffer(PyObject *obj, void **buf,
                                 Py_ssize_t *len, int requires)

    Return a borrowed reference to a "view" object of memory for the
    object. Requirements for the memory should be given in requires
    (PYBUFFER_WRITE, PYBUFFER_ONESEGMENT). The memory pointer is in
    *buf and its length in *len. Note, the memory is not considered a
    single segment of memory unless PYBUFFER_ONESEGMENT is used in
    requires. Get possible striding using PyObject_GetBufferShape on
    the view object.

    int PyObject_ReleaseBuffer(PyObject *view)

    Call this function to tell obj that you are done with your "view".
    This is a no-op if the object doesn't implement a release function.
    Only call this after a previous PyObject_GetBuffer has succeeded.
    Return -1 on error.

    char *PyObject_GetBufferFormat(PyObject *view, int *itemsize)

    Return a NULL-terminated string indicating the data-format of the
    memory buffer. The string is in struct-module syntax with the
    exception that there is never an alignment assumption (all bytes
    must be accounted for). If the length of the buffer indicated by
    this string is smaller than the total length of the buffer, then a
    repeat of the string is implied to fill the length of the buffer.
    If itemsize is not NULL, then return the implied size of each item
    (this could be calculated from the format string but it is often
    known by the view object anyway).

    PyObject *PyObject_GetBufferShape(PyObject *view)

    Return a 2-tuple of lists (shape, strides) providing the
    multi-dimensional shape of the memory area. The stride shows how
    many bytes to skip in each dimension to move in that dimension from
    the start of the array. Memory that is not a single contiguous
    buffer can be represented with the pointer returned from GetBuffer
    and the shape and strides returned from GetBufferShape.

    int PyObject_SizeFromFormat(char *)

    Return the implied size of the data-format area from a struct-style
    description.

Additions to the struct string-syntax

    The struct string-syntax is missing some characters to fully
    implement data-format descriptions already available elsewhere (in
    ctypes and NumPy for example). Here are the proposed additions:

    Character        Description
    ==================================
    '1'              bit (number before states how many bits)
    '?'              platform _Bool type
    'g'              long double
    'F'              complex float
    'D'              complex double
    'G'              complex long double
    'c'              ucs-1 (latin-1) encoding
    'u'              ucs-2
    'w'              ucs-4
    'O'              pointer to Python Object
    'T{}'            structure (detailed layout inside {})
    '(k1,k2,...,kn)' multi-dimensional array of whatever follows
    ':name:'         optional name of the preceding element
    '&'              specific pointer (prefix before another character)
    'X{}'            pointer to a function (optional function
                     signature inside {})

    The struct module will be changed to understand these as well and
    return appropriate Python objects on unpacking. Unpacking a long
    double will return a ctypes long_double. Unpacking 'u' or 'w' will
    return Python unicode.
    Unpacking a multi-dimensional array will return a list of lists.
    Unpacking a pointer will return a ctypes pointer object. Unpacking
    a bit will return a Python Bool.

    Endian-specification ('=','>','<') is also allowed inside the
    string so that it can change if needed. The previously-specified
    endian string is enforced at all times. The default endian is '='.

    According to the struct-module, a number can precede a character
    code to specify how many of that type there are. The
    (k1,k2,...,kn) extension also allows specifying if the data is
    supposed to be viewed as a (C-style contiguous, last-dimension
    varies the fastest) multi-dimensional array of a particular format.

    Functions should be added to ctypes to create a ctypes object from
    a struct description, and to add long double and ucs-2 to ctypes.

Code to be affected

    All objects and modules in Python that export or consume the old
    buffer interface will be modified. Here is a partial list:

    * buffer object
    * bytes object
    * string object
    * array module
    * struct module
    * mmap module
    * ctypes module
    * anything else using the buffer API

Issues and Details

    The proposed locking mechanism relies entirely on the objects
    implementing the buffer interface to do their own thing. Ideally an
    object that implements the buffer interface should keep at least a
    number indicating how many releases are extant.

    The handling of discontiguous memory is new and can be seen as a
    modification of the multiple-segment interface. It is motivated by
    NumPy (used to be Numeric). NumPy objects should be able to share
    their strided memory with code that understands how to manage
    strided memory. Code should also be able to request contiguous
    memory if needed, and objects exporting the buffer interface should
    be able to handle that either by raising an error or by
    constructing a read-only contiguous object and returning that as
    the view.

    Currently the struct module does not allow specification of nested
    structures. It seems that nested structures should be specifiable,
    as several ways of viewing memory areas (ctypes and NumPy) already
    allow this.

Copyright

    This PEP is placed in the public domain.

From peridot.faceted at gmail.com  Tue Feb 27 20:42:53 2007
From: peridot.faceted at gmail.com (Anne Archibald)
Date: Tue, 27 Feb 2007 20:42:53 -0500
Subject: [Numpy-discussion] Buffer Interface for Python 3.0
In-Reply-To: <45E4BEEF.4030609@ee.byu.edu>
References: <45E4BEEF.4030609@ee.byu.edu>
Message-ID: 

On 27/02/07, Travis Oliphant wrote:
> Basically, what we are going to do now is
>
> 1) Return the data-format specification in an extended struct-style string
> 2) Return the shape information in a tuple of lists: (shape, strides)
>
> There are three questions I'm grappling with right now:
>
> 1) Do we propose the inclusion of offsets in the shape information?
> NumPy does not use offsets internally but simply has a pointer to the
> start of the array.

I'm not quite sure I understand what this means. Correct me if I'm wrong,
but within numpy, an array typically lives inside a hunk of memory
allocated with malloc(); the first data element is somewhere inside that,
and the data elements are distributed according to strides. Is that about
right?

The array object needs to know the location of the first element, the
strides and sizes, the data type of each element, and it seems to me it
also needs the address of the data area, so that that can be free()d when
the last array using that hunk of memory is deallocated. In fact it would
need a refcounted link to the array...
Or, if this isn't how it works, how does numpy arrange for the array's memory to be deleted at the right time? Do numpy arrays keep a refcounted link to the array that "owns" the memory? How is memory deallocation managed for the buffer protocol? It seems like what one needs to access the memory is a buffer object plus an offset (plus the usual strides and whatnot). > 2) The buffer interface needs to understand the idea of discontiguous > arrays. If the shape/stride information is separate from the > pointer-to-data call, then the user needs to know if that > pointer-to-data is a "contiguous chunk" or just the beginning of a > strided memory area (and so should not be treated as a single-segment). > > 3) If we support strided memory areas, then we should probably also > allow some way for PIL-like objects to report their buffer sequence (I'm > sure this was the origin of the multi-segment buffer protocol to begin > with). Or we could just ignore that possibility. The PIL would have to > copy memory in order to share it's images. I'm not quite sure I understand what you mean by "contiguous" here. One interpretation would be that any array that uses every byte between the first and last is contiguous, and any other is discontiguous. Another would be that any array that can be described by strides and an offest is contiguous, as it must live in a contiguous block of malloc()ed (or mmap()ed or whatever) memory; discontiguous arrays would then be things like C's array-of-pointers-to-arrays arrangement, for which each row would be a single malloc()ed chunk but the chunks might be arranged arbitrarily in memory. If the former, I can't see why we would not support them, since they naturally occur in numpy and are tidily handled by the (shape,strides,offset) information. If the latter, supporting them is going to be a real challenge, involving a great deal of indirection... would the goal be to make them accessible through an interface resembling numpy's indexing? Anne From charlesr.harris at gmail.com Tue Feb 27 21:55:16 2007 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 27 Feb 2007 19:55:16 -0700 Subject: [Numpy-discussion] Draft PEP for the new buffer interface to be in Python 3000 In-Reply-To: <45E4D589.8080804@ee.byu.edu> References: <45E4D589.8080804@ee.byu.edu> Message-ID: On 2/27/07, Travis Oliphant wrote: > > PEP: > Title: Revising the buffer protocol > Version: $Revision: $ > Last-Modified: $Date: $ > Author: Travis Oliphant > Status: Draft > Type: Standards Track > Created: 28-Aug-2006 > Python-Version: 3000 > > Additions to the struct string-syntax > > The struct string-syntax is missing some characters to fully > implement data-format descriptions already available elsewhere (in > ctypes and NumPy for example). Here are the proposed additions: > > Character Description > ================================== > '1' bit (number before states how many bits) > '?' platform _Bool type > 'g' long double > 'F' complex float > 'D' complex double > 'G' complex long double > 'c' ucs-1 (latin-1) encoding > 'u' ucs-2 > 'w' ucs-4 > 'O' pointer to Python Object > 'T{}' structure (detailed layout inside {}) > '(k1,k2,...,kn)' multi-dimensional array of whatever follows > ':name:' optional name of the preceeding element > '&' specific pointer (prefix before another charater) > 'X{}' pointer to a function (optional function > signature inside {}) I think it might be good to have something for the quad and half precision floats that will be coming along in the next IEEE754 specification. 
Anne

From charlesr.harris at gmail.com Tue Feb 27 21:55:16 2007
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Tue, 27 Feb 2007 19:55:16 -0700
Subject: [Numpy-discussion] Draft PEP for the new buffer interface to be in Python 3000
In-Reply-To: <45E4D589.8080804@ee.byu.edu>
References: <45E4D589.8080804@ee.byu.edu>
Message-ID:

On 2/27/07, Travis Oliphant wrote:
>
> PEP:
> Title: Revising the buffer protocol
> Version: $Revision: $
> Last-Modified: $Date: $
> Author: Travis Oliphant
> Status: Draft
> Type: Standards Track
> Created: 28-Aug-2006
> Python-Version: 3000
>
> Additions to the struct string-syntax
>
> The struct string-syntax is missing some characters to fully
> implement data-format descriptions already available elsewhere (in
> ctypes and NumPy for example). Here are the proposed additions:
>
> Character          Description
> ==================================
> '1'                bit (number before states how many bits)
> '?'                platform _Bool type
> 'g'                long double
> 'F'                complex float
> 'D'                complex double
> 'G'                complex long double
> 'c'                ucs-1 (latin-1) encoding
> 'u'                ucs-2
> 'w'                ucs-4
> 'O'                pointer to Python Object
> 'T{}'              structure (detailed layout inside {})
> '(k1,k2,...,kn)'   multi-dimensional array of whatever follows
> ':name:'           optional name of the preceding element
> '&'                specific pointer (prefix before another character)
> 'X{}'              pointer to a function (optional function
>                    signature inside {})

I think it might be good to have something for the quad and half
precision floats that will be coming along in the next IEEE754
specification. Quad precision isn't used that much, but when you need
it, it is useful. Half precision (16 bits) is used in some GPUs and I
have seen it used for such things as recording side-looking radar
returns.

Chuck

From oliphant at ee.byu.edu Tue Feb 27 22:13:31 2007
From: oliphant at ee.byu.edu (Travis Oliphant)
Date: Tue, 27 Feb 2007 20:13:31 -0700
Subject: [Numpy-discussion] Draft PEP for the new buffer interface to be in Python 3000
In-Reply-To:
References: <45E4D589.8080804@ee.byu.edu>
Message-ID: <45E4F35B.9010301@ee.byu.edu>

Charles R Harris wrote:
> On 2/27/07, *Travis Oliphant* wrote:
>
> [PEP header and proposed struct-syntax additions quoted above]
>
> I think it might be good to have something for the quad and half
> precision floats that will be coming along in the next IEEE754
> specification. Quad precision isn't used that much, but when you need
> it, it is useful. Half precision (16 bits) is used in some GPUs and I
> have seen it used for such things as recording side-looking radar
> returns.

The problem is that we aren't really specifying floating-point
standards, we are specifying float, double and long double as whatever
the compiler understands.

There are some platforms which don't follow the IEEE 754 standard.
This format specification will not be able to describe
platform-independent floating-point descriptions.

It would be nice to have such a description, but that is not what
struct-style syntax does. Perhaps we could add it in the specification,
but I'm not sure if the added complexity is worth holding it up over.

-Travis

From steve at shrogers.com Tue Feb 27 23:05:58 2007
From: steve at shrogers.com (Steven H. Rogers)
Date: Tue, 27 Feb 2007 21:05:58 -0700
Subject: [Numpy-discussion] NumPy in Teaching
Message-ID: <45E4FFA6.9010408@shrogers.com>

I'm doing an informal survey on the use of Array Programming Languages
for teaching. If you're using NumPy in this manner I'd like to hear
from you. What subject was/is taught, academic level, results, lessons
learned, etc.
Regards, Steve From peridot.faceted at gmail.com Tue Feb 27 23:08:41 2007 From: peridot.faceted at gmail.com (Anne Archibald) Date: Tue, 27 Feb 2007 23:08:41 -0500 Subject: [Numpy-discussion] Draft PEP for the new buffer interface to be in Python 3000 In-Reply-To: <45E4F35B.9010301@ee.byu.edu> References: <45E4D589.8080804@ee.byu.edu> <45E4F35B.9010301@ee.byu.edu> Message-ID: On 27/02/07, Travis Oliphant wrote: > The problem is that we aren't really specifying floating-point > standards, we are specifying float, double and long double as whatever > the compiler understands. > > There are some platforms which don't follow the IEEE 754 standard. > This format specification will not be able to describe > platform-independent floating-point descriptions. > > It would be nice to have such a description, but that is not what > struct-style syntax does. Perhaps we could add it in the specification, > but I'm not sure if the added complexity is worth holding it up over. Hmm. If this is to be used to describe, say, binary data in files on disk, having machine-independent formats would be very handy. The endianness specifiers are there to provide this for integers, because it's so useful. I realize that if a machine doesn't implement IEEE floats it will be pretty much impossible to implement python functions to work with them, or even just decode them, but it would be nice to be able to at least *specify* them. How much more complicated would it be to allow their specification? One letter for each IEEE type, in addition to the existing letters for platform-specific floats? Anne M. Archibald From charlesr.harris at gmail.com Tue Feb 27 23:10:22 2007 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 27 Feb 2007 21:10:22 -0700 Subject: [Numpy-discussion] Draft PEP for the new buffer interface to be in Python 3000 In-Reply-To: <45E4F35B.9010301@ee.byu.edu> References: <45E4D589.8080804@ee.byu.edu> <45E4F35B.9010301@ee.byu.edu> Message-ID: On 2/27/07, Travis Oliphant wrote: > > Charles R Harris wrote: > > > > > > On 2/27/07, *Travis Oliphant* > > wrote: > > > > PEP: > > Title: Revising the buffer protocol > > Version: $Revision: $ > > Last-Modified: $Date: $ > > Author: Travis Oliphant > > > > Status: Draft > > Type: Standards Track > > Created: 28-Aug-2006 > > Python-Version: 3000 > > > > > > > > > > > > Additions to the struct string-syntax > > > > The struct string-syntax is missing some characters to fully > > implement data-format descriptions already available elsewhere > (in > > ctypes and NumPy for example). Here are the proposed additions: > > > > Character Description > > ================================== > > '1' bit (number before states how many bits) > > '?' platform _Bool type > > 'g' long double > > 'F' complex float > > 'D' complex double > > 'G' complex long double > > 'c' ucs-1 (latin-1) encoding > > 'u' ucs-2 > > 'w' ucs-4 > > 'O' pointer to Python Object > > 'T{}' structure (detailed layout inside {}) > > '(k1,k2,...,kn)' multi-dimensional array of whatever follows > > ':name:' optional name of the preceeding element > > '&' specific pointer (prefix before another > charater) > > 'X{}' pointer to a function (optional function > > signature inside {}) > > > > > > I think it might be good to have something for the quad and half > > precision floats that will be coming along in the next IEEE754 > > specification. Quad precision isn't used that much, but when you need > > it, it is useful. 
Half precision (16 bits) is used in some GPU's and I > > have seen it used for such things as recording side looking radar > > returns. > > The problem is that we aren't really specifying floating-point > standards, we are specifying float, double and long double as whatever > the compiler understands. > > There are some platforms which don't follow the IEEE 754 standard. > This format specification will not be able to describe > platform-independent floating-point descriptions. > > It would be nice to have such a description, but that is not what > struct-style syntax does. Perhaps we could add it in the specification, > but I'm not sure if the added complexity is worth holding it up over. True enough, and it may not make that much sense until it is in the c standard. But it might be nice to reserve something for the future and maybe give some thought of how to deal with new data types as they come along. I can't think of any really flexible methods that don't require some sort of verbose table that goes along with the data, and the single letter codes are starting to get out of hand. Hmmm. It would actually be nice to redo things so that there was a prefix, say z for complex, f for float, then something for precision. The designation wouldn't be much use without some arithmetic to go with it and it doesn't make sense to write code for things that don't exist. I wonder how much of the arithmetic can be abstracted from the data type? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From oliphant at ee.byu.edu Tue Feb 27 23:38:55 2007 From: oliphant at ee.byu.edu (Travis Oliphant) Date: Tue, 27 Feb 2007 21:38:55 -0700 Subject: [Numpy-discussion] Draft PEP for the new buffer interface to be in Python 3000 In-Reply-To: References: <45E4D589.8080804@ee.byu.edu> <45E4F35B.9010301@ee.byu.edu> Message-ID: <45E5075F.6090608@ee.byu.edu> Charles R Harris wrote: > > > The problem is that we aren't really specifying floating-point > standards, we are specifying float, double and long double as whatever > the compiler understands. > > There are some platforms which don't follow the IEEE 754 standard. > This format specification will not be able to describe > platform-independent floating-point descriptions. > > It would be nice to have such a description, but that is not what > struct-style syntax does. Perhaps we could add it in the > specification, > but I'm not sure if the added complexity is worth holding it up over. > > > True enough, and it may not make that much sense until it is in the c > standard. But it might be nice to reserve something for the future and > maybe give some thought of how to deal with new data types as they > come along. I can't think of any really flexible methods that don't > require some sort of verbose table that goes along with the data, and > the single letter codes are starting to get out of hand. Hmmm. It > would actually be nice to redo things so that there was a prefix, say > z for complex, f for float, then something for precision. The > designation wouldn't be much use without some arithmetic to go with it > and it doesn't make sense to write code for things that don't exist. I > wonder how much of the arithmetic can be abstracted from the data type? I suspect we may have to do this separately in the NumPy world. Perhaps we could get such a specification into Python itself, but I'm not hopeful. Notice, though that we could use the struct syntax to specify a floating-point structure using the bit-field and naming. 
In other words an IEEE 754 32-bit float would be represented in struct-style syntax as '>1t:sign: 8t:exp: 23t:mantissa:' -Travis From charlesr.harris at gmail.com Tue Feb 27 23:54:42 2007 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 27 Feb 2007 21:54:42 -0700 Subject: [Numpy-discussion] Draft PEP for the new buffer interface to be in Python 3000 In-Reply-To: <45E5075F.6090608@ee.byu.edu> References: <45E4D589.8080804@ee.byu.edu> <45E4F35B.9010301@ee.byu.edu> <45E5075F.6090608@ee.byu.edu> Message-ID: On 2/27/07, Travis Oliphant wrote: > > Charles R Harris wrote: > > > > > > The problem is that we aren't really specifying floating-point > > standards, we are specifying float, double and long double as > whatever > > the compiler understands. > > > > There are some platforms which don't follow the IEEE 754 standard. > > This format specification will not be able to describe > > platform-independent floating-point descriptions. > > > > It would be nice to have such a description, but that is not what > > struct-style syntax does. Perhaps we could add it in the > > specification, > > but I'm not sure if the added complexity is worth holding it up > over. > > > > > > True enough, and it may not make that much sense until it is in the c > > standard. But it might be nice to reserve something for the future and > > maybe give some thought of how to deal with new data types as they > > come along. I can't think of any really flexible methods that don't > > require some sort of verbose table that goes along with the data, and > > the single letter codes are starting to get out of hand. Hmmm. It > > would actually be nice to redo things so that there was a prefix, say > > z for complex, f for float, then something for precision. The > > designation wouldn't be much use without some arithmetic to go with it > > and it doesn't make sense to write code for things that don't exist. I > > wonder how much of the arithmetic can be abstracted from the data type? > > I suspect we may have to do this separately in the NumPy world. > Perhaps we could get such a specification into Python itself, but I'm > not hopeful. Notice, though that we could use the struct syntax to > specify a floating-point structure using the bit-field and naming. > > In other words an IEEE 754 32-bit float would be represented in > struct-style syntax as > > '>1t:sign: 8t:exp: 23t:mantissa:' That would probably do nicely. There are potential ambiguities but nothing worth worrying about. Is there a way to assign names to such a type? I suppose that it is just another string constant so one could write something like float32 = '>1t:sign: 8t:exp: 23t:mantissa:' and use that. Can those bit fields be of arbitrary length? Now for something completely different ;) In some things, like the socket module, it is possible to ask for a filelike interface which buffers the input and has the usual read, readline, etc function interface, but fromfile doesn't work with it. This isn't a biggie and I suppose fromfile is looking for a 'real' file, but I wonder if this would be a difficult thing to implement? I could look at the code but I thought I would ask you first. Chuck -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From oliphant at ee.byu.edu Wed Feb 28 00:11:26 2007 From: oliphant at ee.byu.edu (Travis Oliphant) Date: Tue, 27 Feb 2007 22:11:26 -0700 Subject: [Numpy-discussion] Draft PEP for the new buffer interface to be in Python 3000 In-Reply-To: References: <45E4D589.8080804@ee.byu.edu> <45E4F35B.9010301@ee.byu.edu> <45E5075F.6090608@ee.byu.edu> Message-ID: <45E50EFE.3080802@ee.byu.edu> Charles R Harris wrote: > > > On 2/27/07, *Travis Oliphant* > wrote: > > Charles R Harris wrote: > > > > > > The problem is that we aren't really specifying floating-point > > standards, we are specifying float, double and long double > as whatever > > the compiler understands. > > > > There are some platforms which don't follow the IEEE 754 > standard. > > This format specification will not be able to describe > > platform-independent floating-point descriptions. > > > > It would be nice to have such a description, but that is not > what > > struct-style syntax does. Perhaps we could add it in the > > specification, > > but I'm not sure if the added complexity is worth holding it > up over. > > > > > > True enough, and it may not make that much sense until it is in > the c > > standard. But it might be nice to reserve something for the > future and > > maybe give some thought of how to deal with new data types as they > > come along. I can't think of any really flexible methods that don't > > require some sort of verbose table that goes along with the > data, and > > the single letter codes are starting to get out of hand. Hmmm. It > > would actually be nice to redo things so that there was a > prefix, say > > z for complex, f for float, then something for precision. The > > designation wouldn't be much use without some arithmetic to go > with it > > and it doesn't make sense to write code for things that don't > exist. I > > wonder how much of the arithmetic can be abstracted from the > data type? > > I suspect we may have to do this separately in the NumPy world. > Perhaps we could get such a specification into Python itself, but I'm > not hopeful. Notice, though that we could use the struct syntax to > specify a floating-point structure using the bit-field and naming. > > In other words an IEEE 754 32-bit float would be represented in > struct-style syntax as > > '>1t:sign: 8t:exp: 23t:mantissa:' > > > That would probably do nicely. There are potential ambiguities but > nothing worth worrying about. Is there a way to assign names to such a > type? I suppose that it is just another string constant so one could > write something like > > float32 = '>1t:sign: 8t:exp: 23t:mantissa:' > > and use that. Can those bit fields be of arbitrary length? > > Now for something completely different ;) In some things, like the > socket module, it is possible to ask for a filelike interface which > buffers the input and has the usual read, readline, etc function > interface, but fromfile doesn't work with it. This isn't a biggie and > I suppose fromfile is looking for a 'real' file, but I wonder if this > would be a difficult thing to implement? I could look at the code but > I thought I would ask you first. The problem here is that fromfile is using the raw stdio fscanf commands which require an actual file id. It is not using the Python-level fread. It's pretty low-level. On the other-hand there is the fromstring approach which works with any stream. I suspect a function that uses one or the other could be implemented. 
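In the meantime, the fromstring route works with anything that has a
read() method (a socket's file object, a StringIO, etc.). A rough,
untested sketch, where sock and count are stand-ins for your own
objects:

import numpy

fileobj = sock.makefile()       # any file-like object will do
data = fileobj.read(count * 8)  # raw bytes for `count` float64 items
a = numpy.fromstring(data, dtype=numpy.float64, count=count)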
The relevant functions are XXXX_scan and XXX_fromstr in
arraytypes.c.src. These are used for each data-type. Notice that
PyArray_FromFile actually requires a FILE *fp pointer. You might be
able to use PyArray_FromString, which allows reading data from a
char *.

-Travis

> > Chuck
> >
> > ------------------------------------------------------------------------
> >
> > _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>

From wright at esrf.fr Wed Feb 28 04:58:52 2007
From: wright at esrf.fr (Jon Wright)
Date: Wed, 28 Feb 2007 10:58:52 +0100
Subject: [Numpy-discussion] How to tell numpy to use gfortran as a compiler
In-Reply-To:
References:
Message-ID: <45E5525C.10005@esrf.fr>

>Date: Sun, 25 Feb 2007 15:43:03 +0100 (CET)
>From: "Sturla Molden"
>Subject: Re: [Numpy-discussion] How to tell numpy to use gfortran as a
> compiler ?
>Content-Type: text/plain;charset=iso-8859-1
>
> [...]
>There is a third Fortran compiler based on the GCC backend called "g95".
>It is not a part of GCC and uses copyrighted source code illegally
>(numerous GPL violations). The head developer is rumored to have serious
>bad karma. Apart from that, g95 is an ok Fortran 95 compiler. Always
>ensure that ambiguous switches like "gnu95" means gfortran and not g95. I
>don't know what NumPy does.
>

These comments are out of order - even for the internet. I remember
gfortran was a fork of g95, which occurred when the developers wanted
to go in different directions. It does not violate the GPL; the source
is available for download.

Jon

From hayes.tyler at gmail.com Wed Feb 28 09:16:27 2007
From: hayes.tyler at gmail.com (Tyler Hayes)
Date: Wed, 28 Feb 2007 09:16:27 -0500
Subject: [Numpy-discussion] f2py and Fortran90 gfortran_filename error
Message-ID: <92bcafc50702280616n4fc74500ic8750b96ca9a2ad3@mail.gmail.com>

Hello All:

I was suggested to post this question here with the f2py experts from
comp.lang.python.

I have been able to use the example F77 files suggested in the f2py
User Manual, but the problems happened when I tried my own F90 file
(attached below).

I am able to create a .so file using the following method:

(1) Created a Fortran90 program matsolve.f90

Note: The program compiles fine and prints the proper output for the
simple matrix specified.

(2) f2py matsolve.f90 -m matsolve2 -h matsolve2.pyf

This created the matsolve2.pyf fine

(3) f2py -c matsolve2.pyf --f90exec=/usr/bin/gfortran matsolve.f90

Note: I had to specify the f90exec path as f2py did not automatically
find it. This was the only way I could generate a *.so file.

The rub: When I import it into Python, I receive the following message:

>>> import matsolve2
Traceback (most recent call last):
  File "", line 1, in ?
ImportError: ./matsolve2.so: undefined symbol: _gfortran_filename

Any suggestions are greatly appreciated. (I almost started smoking
again last night! ARGH!)

Cheers,

t.

PS - Below I have attached the F90 program for LU decomposition with a
simple test case.

! MATSOLVE.f90
!
! Start main program
PROGRAM MATSOLVE
  IMPLICIT NONE
  INTEGER,PARAMETER :: n=3
  INTEGER :: i,j
  REAL,DIMENSION(n) :: x,b
  REAL,DIMENSION(n,n) :: A,L,U

  ! Initialize the vectors and matrices with a test case from text
  ! Using the one given in Appendix A from Thompson.

  ! Known vector "b"
  b(1) = 12.
  b(2) = 11.
  b(3) = 2.

  ! Known coefficient matrix "A", and initialize L and U
  DO i=1,n
     DO j=1,n
        L(i,j) = 0.
        U(i,j) = 0.
     END DO
  END DO

  ! Create matrix A
  A(1,1) = 3.
  A(1,2) = -1.
  A(1,3) = 2.
  A(2,1) = 1.
  A(2,2) = 2.
  A(2,3) = 3.
  A(3,1) = 2.
  A(3,2) = -2.
  A(3,3) = -1.

  ! Call subroutine to create L and U matrices from A
  CALL lumake(L,U,A,n)

  ! Print results
  PRINT *, '-----------------------'
  DO i=1,n
     DO j=1,n
        PRINT *, i, j, A(i,j), L(i,j), U(i,j)
     END DO
  END DO
  PRINT *, '-----------------------'

  ! Call subroutine to solve for "x" using L and U
  CALL lusolve(x,L,U,b,n)

  ! Print results
  PRINT *, '-----------------------'
  DO i=1,n
     PRINT *, i, x(i)
  END DO
  PRINT *, '-----------------------'

END PROGRAM MATSOLVE

! Create subroutine to make L and U matrices
SUBROUTINE lumake(LL,UU,AA,n1)
  IMPLICIT NONE
  INTEGER,PARAMETER :: n=3
  INTEGER :: i,j,k
  REAL :: LUSUM
  INTEGER,INTENT(IN) :: n1
  REAL,DIMENSION(n,n),INTENT(IN) :: AA
  REAL,DIMENSION(n,n),INTENT(OUT) :: LL,UU

  ! We first note that the diagonal in our UPPER matrix is
  ! going to be UU(j,j) = 1.0, this allows us to initialize
  ! the first set of expressions
  UU(1,1) = 1.

  ! Find first column of LL
  DO i = 1,n1
     LL(i,1) = AA(i,1)/UU(1,1)
  END DO

  ! Now find first row of UU
  DO j = 2,n1
     UU(1,j) = AA(1,j)/LL(1,1)
  END DO

  ! Now find middle LL elements
  DO j = 2,n1
     DO i = j,n1
        LUSUM = 0.
        DO k = 1,j-1
           LUSUM = LUSUM + LL(i,k)*UU(k,j)
        END DO
        LL(i,j) = AA(i,j) - LUSUM
     END DO

     ! Set Diagonal UU
     UU(j,j) = 1.

     ! Now find middle UU elements
     DO i = j+1,n1
        LUSUM = 0.
        DO k = 1,j-1
           LUSUM = LUSUM + LL(j,k)*UU(k,i)
        END DO
        UU(j,i) = (AA(j,i) - LUSUM)/LL(j,j)
     END DO
  END DO

END SUBROUTINE lumake

! Make subroutine to solve for x
SUBROUTINE lusolve(xx,L2,U2,bb,n2)
  IMPLICIT NONE
  INTEGER,PARAMETER :: n=3
  INTEGER :: i,j,k
  REAL :: LYSUM,UXSUM
  REAL,DIMENSION(n):: y
  INTEGER,INTENT(IN) :: n2
  REAL,DIMENSION(n),INTENT(IN) :: bb
  REAL,DIMENSION(n,n),INTENT(IN) :: L2,U2
  REAL,DIMENSION(n),INTENT(OUT) :: xx

  ! Initialize
  DO i=1,n2
     y(i) = 0.
     xx(i) = 0.
  END DO

  ! Solve L.y = b
  y(1) = bb(1)/L2(1,1)
  DO i = 2,n2
     LYSUM = 0.
     DO k = 1,i-1
        LYSUM = LYSUM + L2(i,k)*y(k)
     END DO
     y(i) = (bb(i) - LYSUM)/L2(i,i)
  END DO

  ! Now do back substitution for U.x = y
  xx(n2) = y(n2)/U2(n2,n2)
  DO j = n2-1,1,-1
     UXSUM = 0.
     DO k = j+1,n2
        UXSUM = UXSUM + U2(j,k)*xx(k)
     END DO
     xx(j) = y(j) - UXSUM
  END DO

END SUBROUTINE lusolve
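Once the build succeeds, f2py's usual convention is that INTENT(OUT)
arguments come back as return values, so the module should be callable
from Python roughly like this (an untested sketch of the intended
usage):

import numpy
import matsolve2

a = numpy.array([[3., -1., 2.],
                 [1., 2., 3.],
                 [2., -2., -1.]])
b = numpy.array([12., 11., 2.])

l, u = matsolve2.lumake(a, 3)      # returns LL, UU
x = matsolve2.lusolve(l, u, b, 3)  # returns xx
print x                            # should print something like [ 3. 1. 2.]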
--
Tyler Joseph Hayes
600 Talbot St. -- Apt. 812
London, Ontario
N6A 5L9

Tel  : 519.435.0967
Fax  : 519.661.3198
Cell : 416.655.7897
email: thayes at uwo.ca

GPG Key ID# 0x3AE05130

From pearu at cens.ioc.ee Wed Feb 28 09:25:28 2007
From: pearu at cens.ioc.ee (pearu at cens.ioc.ee)
Date: Wed, 28 Feb 2007 16:25:28 +0200 (EET)
Subject: [Numpy-discussion] f2py and Fortran90 gfortran_filename error
In-Reply-To: <92bcafc50702280616n4fc74500ic8750b96ca9a2ad3@mail.gmail.com>
References: <92bcafc50702280616n4fc74500ic8750b96ca9a2ad3@mail.gmail.com>
Message-ID: <57756.82.131.71.232.1172672728.squirrel@cens.ioc.ee>

> Hello All:
>
> I was suggested to post this question here with the f2py experts from
> comp.lang.python.
>
> I have been able to use the example F77 files suggested in the f2py
> User Manual, but the problems happened when I tried my own F90 file
> (attached below).
>
> I am able to create a .so file using the following method:
>
> (1) Created a Fortran90 program matsolve.f90
>
> Note: The program compiles fine and prints the proper output for the
> simple matrix specified.
> > (2) f2py matsolve.f90 -m matsolve2 -h matsolve2.pyf
> >
> > This created the matsolve2.pyf fine
> >
> > (3) f2py -c matsolve2.pyf --f90exec=/usr/bin/gfortran matsolve.f90

Use

  f2py -c --fcompiler=gnu95 matsolve2.pyf matsolve.f90

HTH,
Pearu

From hayes.tyler at gmail.com Wed Feb 28 10:00:13 2007
From: hayes.tyler at gmail.com (Tyler Hayes)
Date: Wed, 28 Feb 2007 10:00:13 -0500
Subject: [Numpy-discussion] f2py and Fortran90 gfortran_filename error
In-Reply-To: <57756.82.131.71.232.1172672728.squirrel@cens.ioc.ee>
References: <92bcafc50702280616n4fc74500ic8750b96ca9a2ad3@mail.gmail.com>
	<57756.82.131.71.232.1172672728.squirrel@cens.ioc.ee>
Message-ID: <92bcafc50702280700y248180b2v6e6a692bc73546cb@mail.gmail.com>

Thanks for helping out Pearu,

But it won't even create the .so file using that command. Using what
you suggested, it seems to work as it goes through many of the
processes, but then f2py stops at:

error: don't know how to compile Fortran code on platform 'posix' with
'gnu95' compiler. Supported compilers are:
compaq,absoft,intel,gnu,sun,f,vast,ibm,lahey,intelv,intele,pg,compaqv,mips,hpux,intelev,nag)

Even more weird, on my laptop the situation is even worse. Check out
this error, which I seem to get regardless of whether I use the gnu95
line like you suggested or the --f90exec option.
> > thayes at seneca:~/fortran90/am562b/misc_finite$ f2py -c
> > --fcompiler=gnu95 matsolve2.pyf matsolve.f90
> > Unknown vendor: "gnu95"
> > numpy_info:
> >   FOUND:
> >     define_macros = [('NUMERIC_VERSION', '"\\"24.2\\""')]
> >     include_dirs = ['/usr/include/python2.4']

The problem is that you are using a Numeric and f2py2e combination that
does not have gfortran support (in f2py2e, to be specific).
You can either switch to Numpy (that includes f2py with gfortran
support) or use your original command line with -lgfortran specified:

  f2py -c matsolve2.pyf --f90exec=/usr/bin/gfortran matsolve.f90 -lgfortran

If you get more undefined symbol errors then find out which libraries
define these symbols and include the corresponding -l switches.

Note that it is recommended to switch to using numpy, as f2py has
better F90 support there.

Pearu

From hayes.tyler at gmail.com Wed Feb 28 10:29:10 2007
From: hayes.tyler at gmail.com (Tyler Hayes)
Date: Wed, 28 Feb 2007 10:29:10 -0500
Subject: [Numpy-discussion] f2py and Fortran90 gfortran_filename error
In-Reply-To: <20163.82.131.71.232.1172675351.squirrel@cens.ioc.ee>
References: <92bcafc50702280616n4fc74500ic8750b96ca9a2ad3@mail.gmail.com>
	<57756.82.131.71.232.1172672728.squirrel@cens.ioc.ee>
	<92bcafc50702280700y248180b2v6e6a692bc73546cb@mail.gmail.com>
	<20163.82.131.71.232.1172675351.squirrel@cens.ioc.ee>
Message-ID: <92bcafc50702280729k5ce72643k1c8009eaeffa6df6@mail.gmail.com>

Hi Pearu:

Well, the -l option worked!!! (May you live to be one thousand.)

To be honest though, I had thought I was using NumPy (I even purchased
the NumPy manual). How do I switch f2py to always use NumPy?

Either way, it now works....

Cheers,

t.

On 28/02/07, pearu at cens.ioc.ee wrote:
> > Thanks for helping out Pearu,
> >
> > But it won't even create the .so file using that command. Using what
> > you suggested, it seems to work as it goes through many of the
> > processes, but then f2py stops at:
> >
> > error: don't know how to compile Fortran code on platform 'posix' with
> > 'gnu95' compiler. Supported compilers are:
> > compaq,absoft,intel,gnu,sun,f,vast,ibm,lahey,intelv,intele,pg,compaqv,mips,hpux,intelev,nag)
> >
> > Even more weird, on my laptop the situation is even worse. Check out
> > this error, which I seem to get regardless of whether I use the gnu95
> > line like you suggested or the --f90exec option.
> > Pearu
> >
> > _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>

--
Tyler Joseph Hayes
GPG Key ID# 0x3AE05130

From pearu at cens.ioc.ee Wed Feb 28 11:36:34 2007
From: pearu at cens.ioc.ee (pearu at cens.ioc.ee)
Date: Wed, 28 Feb 2007 18:36:34 +0200 (EET)
Subject: [Numpy-discussion] f2py and Fortran90 gfortran_filename error
In-Reply-To: <92bcafc50702280729k5ce72643k1c8009eaeffa6df6@mail.gmail.com>
References: <92bcafc50702280616n4fc74500ic8750b96ca9a2ad3@mail.gmail.com>
	<57756.82.131.71.232.1172672728.squirrel@cens.ioc.ee>
	<92bcafc50702280700y248180b2v6e6a692bc73546cb@mail.gmail.com>
	<20163.82.131.71.232.1172675351.squirrel@cens.ioc.ee>
	<92bcafc50702280729k5ce72643k1c8009eaeffa6df6@mail.gmail.com>
Message-ID: <20600.82.131.71.232.1172680594.squirrel@cens.ioc.ee>

> Hi Pearu:
>
> Well, the -l option worked!!! (May you live to be one thousand.)
>
> To be honest though, I had thought I was using NumPy (I even purchased
> the NumPy manual). How do I switch f2py to always use NumPy?

Well, it depends on whether you installed numpy before or after
installing f2py2e. Both install f2py scripts. So, just make sure that
the f2py script is from the numpy installation; for example,
reinstalling numpy should be sufficient (you don't need to rebuild
numpy for that). Also, you can safely remove f2py2e from your system if
you are going to use numpy.

HTH,
Pearu

From lists.steve at arachnedesign.net Wed Feb 28 11:49:47 2007
From: lists.steve at arachnedesign.net (Steve Lianoglou)
Date: Wed, 28 Feb 2007 11:49:47 -0500
Subject: [Numpy-discussion] [ANN] mlabrap-1.0b: a high level python to matlab
In-Reply-To:
References:
Message-ID:

Hi,

> Mlabwrap-1.0 is a high-level python to matlab(tm) bridge that makes
> calling matlab functions from python almost as convenient as using a
> normal python library. It is available under a very liberal license
> (BSD/MIT) and should work on all major platforms and (non-ancient)
> python and matlab versions and either numpy or Numeric (Numeric
> support will be dropped in the future).

Although I haven't had time to play with this just yet, I just want to
thank you for working on this and sharing it. It looks to be pretty
cool and it's much appreciated.

Cheers,
-steve

From hayes.tyler at gmail.com Wed Feb 28 12:27:39 2007
From: hayes.tyler at gmail.com (Tyler Hayes)
Date: Wed, 28 Feb 2007 12:27:39 -0500
Subject: [Numpy-discussion] f2py and Fortran90 gfortran_filename error
In-Reply-To: <20600.82.131.71.232.1172680594.squirrel@cens.ioc.ee>
References: <92bcafc50702280616n4fc74500ic8750b96ca9a2ad3@mail.gmail.com>
	<57756.82.131.71.232.1172672728.squirrel@cens.ioc.ee>
	<92bcafc50702280700y248180b2v6e6a692bc73546cb@mail.gmail.com>
	<20163.82.131.71.232.1172675351.squirrel@cens.ioc.ee>
	<92bcafc50702280729k5ce72643k1c8009eaeffa6df6@mail.gmail.com>
	<20600.82.131.71.232.1172680594.squirrel@cens.ioc.ee>
Message-ID: <92bcafc50702280927j6f6db2delf482b97644f736b@mail.gmail.com>

Well, I removed the f2py2e directory and reinstalled numpy as you
suggested. Now, I still use the -lgfortran option and the --f90exec
option, but am no longer getting the "NUMERIC" version.

Thank you for the help. You have been more than patient with me :-)

Now all I have to do is get my laptop to also behave the same way.
f2py creates the .pyf file fine, but balks at the following command.
thayes at seneca$ f2py -c --f90exec=/usr/bin/gfortran matsolve2.pyf
-lgfortran matsolve.f90
/usr/lib/python2.4/site-packages/numpy/ctypeslib.py:12: UserWarning:
All features of ctypes interface may not work with ctypes < 1.0.1
  warnings.warn("All features of ctypes interface may not work with " \
Traceback (most recent call last):
  File "/usr/bin/f2py", line 26, in ?
    main()
  File "/usr/lib/python2.4/site-packages/numpy/f2py/f2py2e.py", line 552, in main
    run_compile()
  File "/usr/lib/python2.4/site-packages/numpy/f2py/f2py2e.py", line 528, in run_compile
    ext = Extension(**ext_args)
  File "/usr/lib/python2.4/site-packages/numpy/distutils/extension.py", line 45, in __init__
    export_symbols)
  File "/usr/lib/python2.4/distutils/extension.py", line 106, in __init__
    assert type(name) is StringType, "'name' must be a string"
AssertionError: 'name' must be a string

However, I can save this for another day.....

Thanks again,

Tyler

--
Tyler Joseph Hayes
GPG Key ID# 0x3AE05130

From pearu at cens.ioc.ee Wed Feb 28 12:37:42 2007
From: pearu at cens.ioc.ee (pearu at cens.ioc.ee)
Date: Wed, 28 Feb 2007 19:37:42 +0200 (EET)
Subject: [Numpy-discussion] f2py and Fortran90 gfortran_filename error
In-Reply-To: <92bcafc50702280927j6f6db2delf482b97644f736b@mail.gmail.com>
References: <92bcafc50702280616n4fc74500ic8750b96ca9a2ad3@mail.gmail.com>
	<57756.82.131.71.232.1172672728.squirrel@cens.ioc.ee>
	<92bcafc50702280700y248180b2v6e6a692bc73546cb@mail.gmail.com>
	<20163.82.131.71.232.1172675351.squirrel@cens.ioc.ee>
	<92bcafc50702280729k5ce72643k1c8009eaeffa6df6@mail.gmail.com>
	<20600.82.131.71.232.1172680594.squirrel@cens.ioc.ee>
	<92bcafc50702280927j6f6db2delf482b97644f736b@mail.gmail.com>
Message-ID: <20877.82.131.71.232.1172684262.squirrel@cens.ioc.ee>

> Well, I removed the f2py2e directory and reinstalled numpy as you
> suggested. Now, I still use the -lgfortran option and the --f90exec
> option, but am no longer getting the "NUMERIC" version.
>
> Thank you for the help. You have been more than patient with me :-)
>
> Now all I have to do is get my laptop to also behave the same way.
> f2py creates the .pyf file fine, but balks at the following command.
>
> thayes at seneca$ f2py -c --f90exec=/usr/bin/gfortran matsolve2.pyf
> -lgfortran matsolve.f90

You should use

  f2py -c --fcompiler=gnu95 matsolve2.pyf matsolve.f90

no need to specify --f90exec or -lgfortran if you specify the gnu95
compiler.

Pearu

From hayes.tyler at gmail.com Wed Feb 28 13:42:32 2007
From: hayes.tyler at gmail.com (Tyler Hayes)
Date: Wed, 28 Feb 2007 13:42:32 -0500
Subject: [Numpy-discussion] f2py and Fortran90 gfortran_filename error
In-Reply-To: <92bcafc50702280927j6f6db2delf482b97644f736b@mail.gmail.com>
References: <92bcafc50702280616n4fc74500ic8750b96ca9a2ad3@mail.gmail.com>
	<57756.82.131.71.232.1172672728.squirrel@cens.ioc.ee>
	<92bcafc50702280700y248180b2v6e6a692bc73546cb@mail.gmail.com>
	<20163.82.131.71.232.1172675351.squirrel@cens.ioc.ee>
	<92bcafc50702280729k5ce72643k1c8009eaeffa6df6@mail.gmail.com>
	<20600.82.131.71.232.1172680594.squirrel@cens.ioc.ee>
	<92bcafc50702280927j6f6db2delf482b97644f736b@mail.gmail.com>
Message-ID: <92bcafc50702281042q40e226adk8dd46eb5183bb0c1@mail.gmail.com>

Hi again Pearu:

As you suggested, the gnu95 option works fine on my desktop without
calling the other options, but I think I may have to reinstall
everything from scratch on my laptop as I keep getting that "'name'
must be a string" error message (which doesn't show up on my desktop).
Like I said, I think I'll save that for another day.
I really should get back to coding my assignment now :-)

Anyways, you have been more than helpful, and if you are ever in
London, Ontario, I owe you a drink.

Thanks again. Cheers,

t,

--
Tyler Joseph Hayes
GPG Key ID# 0x3AE05130

From mjanikas at esri.com Wed Feb 28 14:19:33 2007
From: mjanikas at esri.com (Mark Janikas)
Date: Wed, 28 Feb 2007 11:19:33 -0800
Subject: [Numpy-discussion] Source install
Message-ID: <627102C921CD9745B070C3B10CB8199B010EBFD4@hardwire.esri.com>

Hello all,

I have used numpy on both Mac and Windows. The latter is easily
installed with the exe file. The former required the gcc program from
XCode... but once installed, "python setup.py install" worked. I can't
seem to get numpy to work on my linux machine. Can someone point me to
a platform-independent doc on how to install from the source tar file?
Thanks ahead of time,

MJ

Mark Janikas
Product Engineer
ESRI, Geoprocessing
380 New York St.
Redlands, CA 92373
909-793-2853 (2563)
mjanikas at esri.com

From robert.kern at gmail.com Wed Feb 28 14:26:27 2007
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 28 Feb 2007 13:26:27 -0600
Subject: [Numpy-discussion] Source install
In-Reply-To: <627102C921CD9745B070C3B10CB8199B010EBFD4@hardwire.esri.com>
References: <627102C921CD9745B070C3B10CB8199B010EBFD4@hardwire.esri.com>
Message-ID: <45E5D763.9050507@gmail.com>

Mark Janikas wrote:
> Hello all,
>
> I have used numpy on both Mac and Windows. The latter is easily
> installed with the exe file. The former required the gcc program from
> XCode... but once installed, "python setup.py install" worked. I can't
> seem to get numpy to work on my linux machine. Can someone point me to
> a platform-independent doc on how to install from the source tar file?
> Thanks ahead of time,

We need more information from you. There is no way one can make a
platform-independent doc that covers all of the cases. We need to know
what you tried and exactly how it failed (i.e., we need you to copy the
exact error messages and paste them into an email).

If I had to guess, though, since you succeeded doing an install from
source on OS X, the problem on Linux is likely that you do not have the
appropriate Python development package for your system. On RPM-based
systems like Fedora Core, it is usually named something like
python-devel.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From oliphant at ee.byu.edu Wed Feb 28 15:47:09 2007
From: oliphant at ee.byu.edu (Travis Oliphant)
Date: Wed, 28 Feb 2007 13:47:09 -0700
Subject: [Numpy-discussion] Buffer PEP in NumPy SVN
Message-ID: <45E5EA4D.1010303@ee.byu.edu>

I just wanted to point people to the online version of the PEP. I'm
still looking for comments and suggestions. The current version is
here:

http://projects.scipy.org/scipy/numpy/browser/trunk/numpy/doc/pep_buffer.txt

-Travis

From oliphant at ee.byu.edu Wed Feb 28 16:03:53 2007
From: oliphant at ee.byu.edu (Travis Oliphant)
Date: Wed, 28 Feb 2007 14:03:53 -0700
Subject: [Numpy-discussion] PyCon 2007
Message-ID: <45E5EE39.2040905@ee.byu.edu>

I took the opportunity to go to PyCon this year and met several people
there. I had a really good time although I would have liked to stay
longer. If you want to see the slides for my talk they are here:

http://us.pycon.org/common/talkdata/PyCon2007/045/PythonTalk.pdf

It was a good opportunity to integrate better with the Python
developers and keep our voices heard. I think in the long term this is
going to be important. We are working actively to get something like
the array interface into a re-vamped buffer interface for Python 3.0.
There are some issues that arise in this process that need addressing,
so if you have any understanding of the array interface (or want to)
please dive in and contribute. Hopefully in this process we will get
tighter integration with the struct module, the array module, and
ctypes as well. Most of the changes are written for back-port to
Python 2.6 as well.

Besides how to integrate the buffer interface with discontiguous
arrays, the only other issue is how to manage the memory needed to
return the shape information and the format information through the
buffer interface. I think using Python CObjects (like we do for
__array_struct__) is a good idea and allows flexible and simple memory
management as well as fast access to direct C-like information about
the array. Other solutions, however, may be better. If you have any
ideas, it would be good to hear them.

I would also like to get feedback on the proposed additions to the
string-syntax that the struct module uses for describing data formats.
In particular, I'm interested in ideas about what kind of C-API
functions for processing such strings might be useful. These are some
ideas that I have:

1) A function that produces a ctypes type from a string-struct
description.

2) A function that produces a string-struct description from a ctypes
type.

3) A function that extracts the string corresponding to a named
sub-element.
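To make (1) concrete, even a bare-bones sketch that only handles the
flat subset of today's struct codes shows the shape of such an API (the
helper name and generated field names here are made up for
illustration, and the proposed extensions are ignored):

import ctypes

_SIMPLE = {'b': ctypes.c_byte,     'B': ctypes.c_ubyte,
           'h': ctypes.c_short,    'H': ctypes.c_ushort,
           'i': ctypes.c_int,      'I': ctypes.c_uint,
           'q': ctypes.c_longlong, 'Q': ctypes.c_ulonglong,
           'f': ctypes.c_float,    'd': ctypes.c_double}

def ctype_from_struct(fmt):
    # Build a ctypes Structure from a flat format string like 'ifd'.
    fields = [('f%d' % i, _SIMPLE[c])
              for i, c in enumerate(fmt.replace(' ', ''))]
    return type('PackedStruct', (ctypes.Structure,), {'_fields_': fields})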
-Travis

From lou_boog2000 at yahoo.com Wed Feb 28 16:40:48 2007
From: lou_boog2000 at yahoo.com (Lou Pecora)
Date: Wed, 28 Feb 2007 13:40:48 -0800 (PST)
Subject: [Numpy-discussion] PyCon 2007
In-Reply-To: <45E5EE39.2040905@ee.byu.edu>
Message-ID: <2085.20042.qm@web34402.mail.mud.yahoo.com>

--- Travis Oliphant wrote:
>
> I took the opportunity to go to PyCon this year and
> met several people
> there. I had a really good time although I would
> have liked to stay
> longer. If you want to see the slides for my talk
> they are here:
>
> http://us.pycon.org/common/talkdata/PyCon2007/045/PythonTalk.pdf

This is a good talk, Travis. It would go well at the SIAM Dynamical
Systems conference held at Snowbird, near SLC, in May this year.
Unfortunately, it is too late to get a talk there. But people in that
area are interested in using Python as their numeric language.
NumPy/SciPy would get attention. Thanks.

-- Lou Pecora, my views are my own.

---------------
Three laws of thermodynamics:
First law: "You can't win."
Second law: "You can't break even."
Third law: "You can't quit."
  -- Allen Ginsberg, beat poet
From haase at msg.ucsf.edu Wed Feb 28 17:14:55 2007
From: haase at msg.ucsf.edu (Sebastian Haase)
Date: Wed, 28 Feb 2007 14:14:55 -0800
Subject: [Numpy-discussion] PyCon 2007
In-Reply-To: <2085.20042.qm@web34402.mail.mud.yahoo.com>
References: <45E5EE39.2040905@ee.byu.edu>
	<2085.20042.qm@web34402.mail.mud.yahoo.com>
Message-ID:

On 2/28/07, Lou Pecora wrote:
> --- Travis Oliphant wrote:
> >
> > I took the opportunity to go to PyCon this year and
> > met several people
> > there. I had a really good time although I would
> > have liked to stay
> > longer. If you want to see the slides for my talk
> > they are here:
> >
> > http://us.pycon.org/common/talkdata/PyCon2007/045/PythonTalk.pdf

Travis, very nice overview! Could the file be renamed to NumpyTalk.pdf?
Just a thought...

-Sebastian Haase

From mjanikas at esri.com Wed Feb 28 17:46:19 2007
From: mjanikas at esri.com (Mark Janikas)
Date: Wed, 28 Feb 2007 14:46:19 -0800
Subject: [Numpy-discussion] Source install
In-Reply-To: <45E5D763.9050507@gmail.com>
Message-ID: <627102C921CD9745B070C3B10CB8199B010EBFD5@hardwire.esri.com>

Thanks Robert,

Sorry for the incomplete request for help. The install of numpy seems
to go fine, but when I import numpy it reports that it is running from
the source directory. I assume this has to do with the BLAS/ATLAS
stuff I have been reading about. What I am actually trying to do is
get NumPy wrapped in the install of our software program. We currently
wrap Python2.4 as our scripting language and I need a way to get numpy
in our compiler. The gui portions of our software run on Windows but
the engine works on Unix flavors. I am afraid I am not too
knowledgeable about what goes on under the hood of the NumPy install.
I assume I need an appropriate C compiler (where gcc fit in for Mac
OSX), but I was wondering if there was an appropriate Doc I should
closely examine that would point me in the right direction. I hope
this clears my question up a bit. Again, thanks in advance....

MJ

-----Original Message-----
From: numpy-discussion-bounces at scipy.org
[mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Robert Kern
Sent: Wednesday, February 28, 2007 11:26 AM
To: Discussion of Numerical Python
Subject: Re: [Numpy-discussion] Source install

Mark Janikas wrote:
> Hello all,
>
> I have used numpy on both Mac and Windows. The latter is easily
> installed with the exe file. The former required the gcc program from
> XCode... but once installed, "python setup.py install" worked. I can't
> seem to get numpy to work on my linux machine. Can someone point me to
> a platform-independent doc on how to install from the source tar file?
> Thanks ahead of time,

We need more information from you. There is no way one can make a
platform-independent doc that covers all of the cases. We need to know
what you tried and exactly how it failed (i.e., we need you to copy the
exact error messages and paste them into an email).

If I had to guess, though, since you succeeded doing an install from
source on OS X, the problem on Linux is likely that you do not have the
appropriate Python development package for your system. On RPM-based
systems like Fedora Core, it is usually named something like
python-devel.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
-- Umberto Eco _______________________________________________ Numpy-discussion mailing list Numpy-discussion at scipy.org http://projects.scipy.org/mailman/listinfo/numpy-discussion From robert.kern at gmail.com Wed Feb 28 18:32:23 2007 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 28 Feb 2007 17:32:23 -0600 Subject: [Numpy-discussion] Source install In-Reply-To: <627102C921CD9745B070C3B10CB8199B010EBFD5@hardwire.esri.com> References: <627102C921CD9745B070C3B10CB8199B010EBFD5@hardwire.esri.com> Message-ID: <45E61107.2090707@gmail.com> Mark Janikas wrote: > Thanks Robert, > > Sorry for the incomplete request for help. The install of numpy seems > to go fine, but when I import numpy it reports that it is running from > the source directory. Oh, just cd to somewhere else. Otherwise, you will pick up the partial numpy package in the source directory that we use to bootstrap the build. > I assume this has to do with the BLAS/ATLAS stuff > I have been reading about. What I am actually trying to do is get NumPy > wrapped in the install of our software program. We currently wrap > Python2.4 as our scripting language and I need a way to get numpy in our > compiler. The gui portions of our software runs on Windows but the > engine works on Unix flavors. I am afraid I am not too knowledgeable > about what goes on under the hood of the NumPy install. I assume I need > an appropriate C compiler (where gcc fit in for Mac OSX), but I was > wondering if there was an appropriate Doc I should closely examine that > would point me in the right direction. I hope this clears my question > up a bit. Again, thanks in advance.... The standard Python documentation "Install Python Modules" is useful to read. We have extended the standard distutils package that is described there, but the fundamentals are applicable. http://docs.python.org/inst/inst.html Reading the site.cfg.example file in the numpy source is useful information for configuring numpy. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From jh at physics.ucf.edu Wed Feb 28 19:32:41 2007 From: jh at physics.ucf.edu (Joe Harrington) Date: Wed, 28 Feb 2007 19:32:41 -0500 Subject: [Numpy-discussion] NumPy in Teaching Message-ID: <200703010032.l210WfhI005995@glup.physics.ucf.edu> Hi Steve, I have taught Astronomical Data Analysis twice at Cornell using IDL, and I will be teaching it next Fall at UCF using NumPy. Though I've been active here in the recent past, I'm actually not a regular NumPy user myself yet (I used Numeric experimentally for about 6 months in 1997), so I'm a bit nervous. There isn't the kind of documentation and how-to support for Numpy that there is for IDL, though our web site is a start in that direction. One thought I've had in making the transition easier is to put up a syntax and function concordance, similar to that available for MATLAB. I thought this existed. Maybe Perry can point me to it. Just adding a column to the MATLAB one would be fine. 
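The sort of rows I have in mind (the NumPy column assumes "from numpy
import *"):

  MATLAB        NumPy
  ------        -----
  size(a)       a.shape
  zeros(3,3)    zeros((3,3))
  a(2,3)        a[1,2]
  a'            a.transpose()
  a * b         dot(a, b)
  a .* b        a * b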
My syllabi (there are undergrad and grad versions) are at: Cornell courses (undergrad only): http://physics.ucf.edu/~jh/ast/ast234-2003/ http://physics.ucf.edu/~jh/ast/ast234-2004/ UCF course (4xxx is undergrad, 5xxx is grad, numbers not yet assigned): http://physics.ucf.edu/~jh/ast/dacourse/ The goal of the course is for students to go out and do research with faculty as soon as they're done, and be useful enough to be included on papers. Rather than the usual (and failing) "just do what I do" model, in which physics students learn to program badly and in FORTRAN77 from their professors, I teach programming from a CS point of view, focusing on good top-down design and bottom-up construction (indentation, documentation, sensible naming, testing, etc.). I teach error analysis by first teaching probability. Then we go into the physics of detectors and finally do an end-to-end analysis of some simple spacecraft data sets (photometry and spectroscopy), the programming of which make up most of their assignments. There is a project at the end, in which many in the class seem to get an epiphany for how all this stuff fits together. They write up the result in the format of an Astrophysical Journal article, and while I don't teach writing as a topic, I do demand that it is done well (and to my shock it usually is!). The first two times I taught it, it was way too much material (good students spent 15+ hours on the class weekly), so I'm ripping out about half the programming assignments for the undergrads, and giving simpler project datasets. My main lesson learned is that the old adage of "They know less than you think they know but they can do more than you think they can do" falls completely on its face here. Many of them actually do know how to program, and that ability, rather than their academic level, is really the best predictor of course success. A computer-savvy freshman will just kill a computerphobic grad student, because the rest of the class just isn't that hard. What I wasn't prepared for the first time I taught it is just how hard it is to teach debugging. These kids will stare a simple twiddle-characters bug in the face for hours and not see it. It's been twenty-five years since I was at that stage and it's hard to remember what it was like. To teach debugging, I'm emphasizing "fingers as the creators of error" (since you KNOW your brain didn't do it!), and that they should test each small bit of code before incorporating it in their function. I'm also showing them how to use a debugger, giving them a list of common bug types and how to find them, and only having them do every other step in the photometry pipeline. I'll give them the other half of the steps and that will teach them how to design to an API. The other lesson is that it is a hell of a lot of work to grade programming assignments from even four students, if you care about grading for good practice and not just whether it runs. I probably spent 20 hours a week on the class the second year I taught it. Since I'll have 10 students next semester, I plan on doing something here with peer evaluation. Wish me luck... I'm posting here because I'm interested in your results and any advice you or your respondents have to share. I hope other respondents will post here rather than sending private email. If we get enough people, let's start a page on the wiki (a mailing list! a conference! a movement! ok, well, let's start with a wiki page). --jh-- Prof. 
Joseph Harrington Department of Physics University of Central Florida From mjanikas at esri.com Wed Feb 28 19:39:01 2007 From: mjanikas at esri.com (Mark Janikas) Date: Wed, 28 Feb 2007 16:39:01 -0800 Subject: [Numpy-discussion] Source install In-Reply-To: <45E61107.2090707@gmail.com> Message-ID: <627102C921CD9745B070C3B10CB8199B010EBFD6@hardwire.esri.com> Thanks Robert, All good info as usual. Best wishes, MJ -----Original Message----- From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Robert Kern Sent: Wednesday, February 28, 2007 3:32 PM To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] Source install Mark Janikas wrote: > Thanks Robert, > > Sorry for the incomplete request for help. The install of numpy seems > to go fine, but when I import numpy it reports that it is running from > the source directory. Oh, just cd to somewhere else. Otherwise, you will pick up the partial numpy package in the source directory that we use to bootstrap the build. > I assume this has to do with the BLAS/ATLAS stuff > I have been reading about. What I am actually trying to do is get NumPy > wrapped in the install of our software program. We currently wrap > Python2.4 as our scripting language and I need a way to get numpy in our > compiler. The gui portions of our software runs on Windows but the > engine works on Unix flavors. I am afraid I am not too knowledgeable > about what goes on under the hood of the NumPy install. I assume I need > an appropriate C compiler (where gcc fit in for Mac OSX), but I was > wondering if there was an appropriate Doc I should closely examine that > would point me in the right direction. I hope this clears my question > up a bit. Again, thanks in advance.... The standard Python documentation "Install Python Modules" is useful to read. We have extended the standard distutils package that is described there, but the fundamentals are applicable. http://docs.python.org/inst/inst.html Reading the site.cfg.example file in the numpy source is useful information for configuring numpy. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco _______________________________________________ Numpy-discussion mailing list Numpy-discussion at scipy.org http://projects.scipy.org/mailman/listinfo/numpy-discussion From hayes.tyler at gmail.com Wed Feb 28 18:03:54 2007 From: hayes.tyler at gmail.com (Tyler J Hayes) Date: Wed, 28 Feb 2007 23:03:54 +0000 (UTC) Subject: [Numpy-discussion] =?utf-8?q?f2py_and_Fortran90_gfortran=5Ffilena?= =?utf-8?q?me_error?= References: <92bcafc50702280616n4fc74500ic8750b96ca9a2ad3@mail.gmail.com> <57756.82.131.71.232.1172672728.squirrel@cens.ioc.ee> <92bcafc50702280700y248180b2v6e6a692bc73546cb@mail.gmail.com> <20163.82.131.71.232.1172675351.squirrel@cens.ioc.ee> <92bcafc50702280729k5ce72643k1c8009eaeffa6df6@mail.gmail.com> <20600.82.131.71.232.1172680594.squirrel@cens.ioc.ee> <92bcafc50702280927j6f6db2delf482b97644f736b@mail.gmail.com> <92bcafc50702281042q40e226adk8dd46eb5183bb0c1@mail.gmail.com> Message-ID: Ok, so I'm a total nutcase and couldn't let the laptop thing go. So, I reinstalled everything from source, and Voila!, it worked. You have been a huge help Pearu. Cheers, t. From steve at shrogers.com Wed Feb 28 23:14:48 2007 From: steve at shrogers.com (Steven H. 
Rogers) Date: Wed, 28 Feb 2007 21:14:48 -0700 Subject: [Numpy-discussion] NumPy in Teaching In-Reply-To: <20070228124928.8D9D2BA05AD@phaser.physics.ucf.edu> References: <20070228124928.8D9D2BA05AD@phaser.physics.ucf.edu> Message-ID: <45E65338.9060806@shrogers.com> Hi Joe: Thanks for the comprehensive response. I'll post the results to the lists when I've compiled them. # Steve From sb at csse.unimelb.edu.au Wed Feb 28 16:17:18 2007 From: sb at csse.unimelb.edu.au (Steven Bird) Date: Thu, 1 Mar 2007 08:17:18 +1100 Subject: [Numpy-discussion] mac binary distributions Message-ID: <97e4e62e0702281317r6086008dja9a5b98d6a1a5826@mail.gmail.com> Hi -- the last dmg distribution was for numpy version 0.9.6. Is there any chance of having dmg distributions posted for the current version please?