From igorsyl at gmail.com  Thu Jan  1 14:09:30 2009
From: igorsyl at gmail.com (Igor Sylvester)
Date: Thu, 1 Jan 2009 13:09:30 -0600
Subject: [Numpy-discussion] PyArray_DescrConverter segfault

Hi,

Does anyone have any insight into the following problem?

PyObject* descr(PyObject* self, PyObject* args)
{
  PyObject *d;
  PyArg_ParseTuple(args, "O&", PyArray_DescrConverter, &d);
  return d;
}

>>> import numpy
>>> import mymodule
>>> numpy.__version__
'1.2.1'
>>> mymodule.descr([('a', 'i4'), ('b', 'i8')])
segmentation fault (core dumped)  ipython

Thanks!
-Igor

From igorsyl at gmail.com  Thu Jan  1 14:21:20 2009
From: igorsyl at gmail.com (Igor Sylvester)
Date: Thu, 1 Jan 2009 13:21:20 -0600
Subject: [Numpy-discussion] Fwd: PyArray_DescrConverter segfault

I realize I need to call import_array() in the module initialization
function. Why isn't this equivalent to importing numpy before importing
my module?

---------- Forwarded message ----------
From: Igor Sylvester
Date: Thu, Jan 1, 2009 at 1:09 PM
Subject: PyArray_DescrConverter segfault
To: Numpy-discussion at scipy.org

From robert.kern at gmail.com  Thu Jan  1 17:11:10 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 1 Jan 2009 17:11:10 -0500
Subject: [Numpy-discussion] Fwd: PyArray_DescrConverter segfault
Message-ID: <3d375d730901011411w21b534e5mc5f35ca39f8e0ff7@mail.gmail.com>

On Thu, Jan 1, 2009 at 14:21, Igor Sylvester wrote:
> I realize I need to call import_array() in the module initialization
> function. Why isn't this equivalent to importing numpy before importing
> my module?

In 3rd party extension modules, all of the PyArray_* API functions are
actually #define macros pointing to an array of function pointers.
import_array() imports numpy.core.multiarray and sets up this array.
If you don't do this, then trying to call one of the PyArray_*
functions will result in a segfault because it tries to dereference a
pointer in the array that is not set up.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From igorsyl at gmail.com  Thu Jan  1 17:25:21 2009
From: igorsyl at gmail.com (Igor Sylvester)
Date: Thu, 1 Jan 2009 16:25:21 -0600
Subject: [Numpy-discussion] Fwd: PyArray_DescrConverter segfault
In-Reply-To: <3d375d730901011411w21b534e5mc5f35ca39f8e0ff7@mail.gmail.com>
References: <3d375d730901011411w21b534e5mc5f35ca39f8e0ff7@mail.gmail.com>

Robert,

If I import numpy and then import a 3rd party extension module, why does
the extension module still have to call import_array if numpy sets this
array in the first place? I assume that there's a single array of API
functions in a single python process with multiple extension modules.

Igor.
On Thu, Jan 1, 2009 at 4:11 PM, Robert Kern wrote:
> In 3rd party extension modules, all of the PyArray_* API functions are
> actually #define macros pointing to an array of function pointers.
> import_array() imports numpy.core.multiarray and sets up this array.
> If you don't do this, then trying to call one of the PyArray_*
> functions will result in a segfault because it tries to dereference a
> pointer in the array that is not set up.

From robert.kern at gmail.com  Thu Jan  1 17:27:40 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 1 Jan 2009 17:27:40 -0500
Subject: [Numpy-discussion] Fwd: PyArray_DescrConverter segfault
References: <3d375d730901011411w21b534e5mc5f35ca39f8e0ff7@mail.gmail.com>
Message-ID: <3d375d730901011427n6b09f8ect1ea44c31f65785f3@mail.gmail.com>

On Thu, Jan 1, 2009 at 17:25, Igor Sylvester wrote:
> If I import numpy and then import a 3rd party extension module, why does
> the extension module still have to call import_array if numpy sets this
> array in the first place? I assume that there's a single array of API
> functions in a single python process with multiple extension modules.

Your 3rd party extension module does not know the location of that
array until you call import_array().

--
Robert Kern
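To make the fix concrete, here is a minimal sketch of a corrected
extension module. It is an illustration only - the file layout, method
table, and docstring are assumed rather than taken from the original
post - written against the Python 2.5 / numpy 1.2 C API discussed in
this thread:

#include <Python.h>
#include <numpy/arrayobject.h>

static PyObject *
descr(PyObject *self, PyObject *args)
{
    PyArray_Descr *d = NULL;

    /* Check the result: on failure, propagate the exception instead
       of returning an uninitialized pointer. */
    if (!PyArg_ParseTuple(args, "O&", PyArray_DescrConverter, &d)) {
        return NULL;
    }
    /* PyArray_DescrConverter hands back a new reference. */
    return (PyObject *)d;
}

static PyMethodDef mymodule_methods[] = {
    {"descr", descr, METH_VARARGS, "Convert an object into a dtype."},
    {NULL, NULL, 0, NULL}
};

PyMODINIT_FUNC
initmymodule(void)
{
    Py_InitModule("mymodule", mymodule_methods);
    /* import_array() looks up numpy.core.multiarray and stores the
       address of its function-pointer table in a static variable
       private to this module. Without this call, every PyArray_*
       macro dereferences an unset pointer and segfaults - importing
       numpy from Python first does not help, because each extension
       module carries its own copy of that pointer. */
    import_array();
}

Calling import_array() anywhere before the first PyArray_* call would
do; the module initialization function is simply the conventional place.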
From mlandis001 at comcast.net  Fri Jan  2 13:29:26 2009
From: mlandis001 at comcast.net (Mike Landis)
Date: Fri, 02 Jan 2009 13:29:26 -0500
Subject: [Numpy-discussion] building numpy/scipy
Message-ID: <20090102183005.27311C7C019@scipy.org>

Some of the install instructions are kind of ambiguous.

When a library name ends in .a or .dll, it's obvious what it is, but
'library' is sometimes used generically without indicating whether
you're talking about static or dynamic, e.g. how does numpy/scipy link
to MKL? Is it statically or dynamically linked?

It's not clear where things should go or how things are found. After
downloading numpy, putting numpy-1.2.1 into a directory named
numpy-1.2.1 in python2.5/Lib/site-packages/, and running python setup.py
install from the numpy-1.2.1 directory, you get a numpy directory in
site-packages. Later, the install instructions refer to the numpy
directory generically. Which numpy directory are you talking about?
Should the numpy-1.2.1 directory not be in site-packages in the first
place? Ditto for scipy. E.g. are you referring to the site.cfg in the
distribution directory (numpy-1.2.1) or the installed directory (numpy
without version notation)? Maybe you should give names to these
directories (e.g. distribution and install) and always refer to them
with leading adjectives.

It should be possible to create a script that keeps track of all of
these extra lib configuration steps, so installs won't be so involved
and error-prone.

I built ATLAS (they need better documentation of processor types and
how to use what CPU-Z tells you) and I have MKL installed. A quarter of
my MKL evaluation days have already ticked away, and I've come to the
disappointing realization that 30 days is probably not going to be
enough to get the scipy/MKL install right.

What's compatible with what? There is mention of MKL, ATLAS, LAPACK and
CLAPACK (I also downloaded CLAPACK 3.1.1.1). It seems like CLAPACK
would be naturally compatible because python uses C-style matrix
addressing, but does Python (or numpy) flip indexes around if you have
LAPACK instead of CLAPACK, so it works anyway, if slower?

There is mention of ".numpy-site.cfg" in the user's home directory
(which I created), but it didn't seem to have any effect on installing.

I created the following site.cfg in the numpy (install) directory:

[mkl]
include_dirs = D:\Programs\Intel\MKL\10.1.0.018\include
library_dirs = D:\Programs\Intel\MKL\10.1.0.018\ia32\lib
mkl_libs = mkl_ia32, mkl_c_dll, libguide40
lapack_libs = mkl_lapack

Do numpy AND scipy each need a site.cfg in their respective install
directories? If so, does one take precedence over the other?

When I run 'python setup.py install' from the numpy install directory,
I get:

Warning: No configuration returned, assuming unavailable.
blas_opt_info:
blas_mkl_info:
  libraries mkl, vml, guide not found in d:\Programs\Python25\lib
  libraries mkl, vml, guide not found in C:\
  libraries mkl, vml, guide not found in d:\Programs\Python25\libs
  NOT AVAILABLE

atlas_blas_threads_info:
Setting PTATLAS=ATLAS
  libraries mkl, vml, guide not found in d:\Programs\Python25\lib
  libraries mkl, vml, guide not found in C:\
  libraries mkl, vml, guide not found in d:\Programs\Python25\libs
  NOT AVAILABLE

atlas_blas_info:
  libraries mkl, vml, guide not found in d:\Programs\Python25\lib
  libraries mkl, vml, guide not found in C:\
  libraries mkl, vml, guide not found in d:\Programs\Python25\libs
  NOT AVAILABLE

D:\Programs\Python25\lib\site-packages\numpy\distutils\system_info.py:1340:
UserWarning:
    Atlas (http://math-atlas.sourceforge.net/) libraries not found.
    Directories to search for the libraries can be specified in the
    numpy/distutils/site.cfg file (section [atlas]) or by setting the
    ATLAS environment variable.
  warnings.warn(AtlasNotFoundError.__doc__)
blas_info:
  libraries blas not found in d:\Programs\Python25\lib
-----------------------------------------------------------------

Where is it looking for blas_opt_info, blas_mkl_info,
atlas_blas_threads_info, and atlas_blas_info? Is it actually looking on
sourceforge for Atlas?

FYI, when I try using make in ...MKL/10.1.0.018/examples/cblas, I get a
"missing separator" error on the "!INCLUDE cblas.lst" line (first
significant line in the file). There are no spaces on this line. Could
cygwin make be complaining about something inside the cblas.lst file,
or is it complaining about the line not being terminated the way it
expects it to be (I see carriage return, line feed pairs in the file)?

HELP!!!!
From cournape at gmail.com  Fri Jan  2 13:43:55 2009
From: cournape at gmail.com (David Cournapeau)
Date: Sat, 3 Jan 2009 03:43:55 +0900
Subject: [Numpy-discussion] building numpy/scipy
In-Reply-To: <20090102183005.27311C7C019@scipy.org>
References: <20090102183005.27311C7C019@scipy.org>
Message-ID: <5b8d13220901021043x34b5f823r58aa09c9061c925@mail.gmail.com>

On Sat, Jan 3, 2009 at 3:29 AM, Mike Landis wrote:
> Some of the install instructions are kind of ambiguous.
>
> When a library name ends in .a or .dll, it's obvious what it is, but
> 'library' is sometimes used generically without indicating whether
> you're talking about static or dynamic, e.g. how does numpy/scipy
> link to MKL? Is it statically or dynamically linked?

Dynamically first, statically if only the static library is found.

> It's not clear where things should go or how things are found. After
> downloading numpy and putting numpy-1.2.1 into a directory named
> numpy-1.2.1 in python2.5/Lib/site-packages/ and running python
> setup.py install from the numpy-1.2.1 directory, you get a numpy
> directory in site-packages. Later, the install instructions refer to
> the numpy directory generically. Which numpy directory are you
> talking about? Should the numpy-1.2.1 directory not be in
> site-packages in the first place?

No, it should certainly not be in site-packages. Why would you want to
do that? site-packages is reserved for already built python modules. It
is a very bad idea to put anything else there.

> It should be possible to create a script that keeps track of all of
> these extra lib configuration steps, so installs won't be so involved
> and error-prone.

There is no ambiguity: when referring to installation, it is understood
we refer to the source directory, always.

> What's compatible with what? There is mention of MKL, ATLAS, LAPACK
> and CLAPACK (I also downloaded CLAPACK 3.1.1.1). It seems like
> CLAPACK would be naturally compatible because python uses C-style
> matrix addressing, but does Python (or numpy) flip indexes around if
> you have LAPACK instead of CLAPACK, so it works anyway, if slower?

Numpy and Scipy support the netlib F77 interface to both blas and
lapack. There is no real advantage to using CLAPACK/CBLAS instead of
those if you have the fortran interfaces.

> There is mention of ".numpy-site.cfg" in the user's home directory
> (which I created), but it didn't seem to have any effect on installing.
>
> I created the following site.cfg in the numpy (install) directory:
>
> [mkl]
> include_dirs = D:\Programs\Intel\MKL\10.1.0.018\include
> library_dirs = D:\Programs\Intel\MKL\10.1.0.018\ia32\lib
> mkl_libs = mkl_ia32, mkl_c_dll, libguide40
> lapack_libs = mkl_lapack

There are many problems with the MKL - and I bet it does not work on
windows. You are much better off using the numpy and scipy binaries.

> Do numpy AND scipy each need a site.cfg in their respective install
> directories? If so, does one take precedence over the other?

When building, the site.cfg should be put in the *source* directory.
But again, particularly on windows, you should really use the
distributed binaries. Building numpy and scipy with external
BLAS/LAPACK is not easy, especially on windows.

David
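As a side note on the index-flipping worry above, a small generic numpy
illustration (independent of which BLAS/LAPACK a build finds) may help:
each array records its own memory layout, so the indexing you write in
Python never changes, and broadly speaking the wrappers make
Fortran-ordered copies when the F77 interface needs them.

import numpy as np

a = np.array([[1., 2., 3.],
              [4., 5., 6.]])   # C (row-major) order by default
f = np.asfortranarray(a)       # same logical array, column-major buffer

print a[1, 2], f[1, 2]         # identical indexing: 6.0 6.0
print a.flags['C_CONTIGUOUS'], f.flags['F_CONTIGUOUS']   # True True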
From matthieu.brucher at gmail.com  Fri Jan  2 13:45:29 2009
From: matthieu.brucher at gmail.com (Matthieu Brucher)
Date: Fri, 2 Jan 2009 19:45:29 +0100
Subject: [Numpy-discussion] building numpy/scipy
In-Reply-To: <20090102183005.27311C7C019@scipy.org>
References: <20090102183005.27311C7C019@scipy.org>

2009/1/2 Mike Landis :
> Some of the install instructions are kind of ambiguous.
>
> When a library name ends in .a or .dll, it's obvious what it is, but
> 'library' is sometimes used generically without indicating whether
> you're talking about static or dynamic, e.g. how does numpy/scipy
> link to MKL? Is it statically or dynamically linked?

If the dynamic libraries are available, they are used.

> It's not clear where things should go or how things are found. After
> downloading numpy and putting numpy-1.2.1 into a directory named
> numpy-1.2.1 in python2.5/Lib/site-packages/ and running python
> setup.py install from the numpy-1.2.1 directory, you get a numpy
> directory in site-packages.

Wait, never put numpy-1.2.1 inside site-packages, build it somewhere
else.

> Later, the install instructions refer to
> the numpy directory generically. Which numpy directory are you
> talking about? Should the numpy-1.2.1 directory not be in
> site-packages in the first place? Ditto for scipy. E.g. are you
> referring to the site.cfg in the distribution directory (numpy-1.2.1)
> or the installed directory (numpy without version notation)?

Just get inside numpy-1.2.1, type python setup.py install and you never
have to use this folder. The only relevant folder is the numpy one
inside site-packages.

> Maybe you should give names to these directories (e.g. distribution
> and install) and always refer to them with leading adjectives.

Perhaps.

> It should be possible to create a script that keeps track of all of
> these extra lib configuration steps, so installs won't be so involved
> and error-prone.

I just use python setup.py install, nothing else, and it works with the
default Linux mathematical libraries.

> I built ATLAS (they need better documentation of processor types and
> how to use what CPU-Z tells you) and I have MKL installed. A quarter
> of my MKL evaluation days have already ticked away, and I've come to
> the disappointing realization that 30 days is probably not going to
> be enough to get the scipy/MKL install right.

I don't know if many people are using Numpy with the MKL on Windows.

> What's compatible with what? There is mention of MKL, ATLAS, LAPACK
> and CLAPACK (I also downloaded CLAPACK 3.1.1.1). It seems like
> CLAPACK would be naturally compatible because python uses C-style
> matrix addressing, but does Python (or numpy) flip indexes around if
> you have LAPACK instead of CLAPACK, so it works anyway, if slower?

CLAPACK is not a standard interface, so it is not usable with MKL or
ACML.

> There is mention of ".numpy-site.cfg" in the user's home directory
> (which I created), but it didn't seem to have any effect on installing.

Only site.cfg is relevant.

> I created the following site.cfg in the numpy (install) directory:
>
> [mkl]
> include_dirs = D:\Programs\Intel\MKL\10.1.0.018\include
> library_dirs = D:\Programs\Intel\MKL\10.1.0.018\ia32\lib
> mkl_libs = mkl_ia32, mkl_c_dll, libguide40
> lapack_libs = mkl_lapack
>
> Do numpy AND scipy each need a site.cfg in their respective install
> directories? If so, does one take precedence over the other?
Yes, and I don't know ;)

> When I run 'python setup.py install' from the numpy install directory,
> I get:
>
> Warning: No configuration returned, assuming unavailable.
> blas_opt_info:
> blas_mkl_info:
>   libraries mkl, vml, guide not found in d:\Programs\Python25\lib
>   libraries mkl, vml, guide not found in C:\
>   libraries mkl, vml, guide not found in d:\Programs\Python25\libs
>   NOT AVAILABLE
>
> atlas_blas_threads_info:
> Setting PTATLAS=ATLAS
>   libraries mkl, vml, guide not found in d:\Programs\Python25\lib
>   libraries mkl, vml, guide not found in C:\
>   libraries mkl, vml, guide not found in d:\Programs\Python25\libs
>   NOT AVAILABLE
>
> atlas_blas_info:
>   libraries mkl, vml, guide not found in d:\Programs\Python25\lib
>   libraries mkl, vml, guide not found in C:\
>   libraries mkl, vml, guide not found in d:\Programs\Python25\libs
>   NOT AVAILABLE
>
> D:\Programs\Python25\lib\site-packages\numpy\distutils\system_info.py:1340:
> UserWarning:
>     Atlas (http://math-atlas.sourceforge.net/) libraries not found.
>     Directories to search for the libraries can be specified in the
>     numpy/distutils/site.cfg file (section [atlas]) or by setting the
>     ATLAS environment variable.
>   warnings.warn(AtlasNotFoundError.__doc__)
> blas_info:
>   libraries blas not found in d:\Programs\Python25\lib
> -----------------------------------------------------------------
>
> Where is it looking for blas_opt_info, blas_mkl_info,
> atlas_blas_threads_info, and atlas_blas_info? Is it actually looking
> on sourceforge for Atlas?

It looks in site.cfg for that information.

> FYI, when I try using make in ...MKL/10.1.0.018/examples/cblas, I get
> a "missing separator" error on the "!INCLUDE cblas.lst" line (first
> significant line in the file). There are no spaces on this line. Could
> cygwin make be complaining about something inside the cblas.lst file,
> or is it complaining about the line not being terminated the way it
> expects it to be (I see carriage return, line feed pairs in the file)?

Did you try mingw or Visual Studio 2003?

Matthieu
--
Information System Engineer, Ph.D.
Website: http://matthieu-brucher.developpez.com/
Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92
LinkedIn: http://www.linkedin.com/in/matthieubrucher

From boogaloojb at yahoo.fr  Fri Jan  2 13:47:11 2009
From: boogaloojb at yahoo.fr (Jean-Baptiste Rudant)
Date: Fri, 2 Jan 2009 18:47:11 +0000 (GMT)
Subject: [Numpy-discussion] Re: Alternative to record array
References: <602202.94406.qm@web28502.mail.ukl.yahoo.com>
	<200812300000.10365.faltet@pytables.org>
	<200812301634.28218.faltet@pytables.org>
Message-ID: <395702.84148.qm@web28512.mail.ukl.yahoo.com>

Thank you for everything, it works fine and it is very helpful.

Regards,

Jean-Baptiste Rudant

________________________________
From: Francesc Alted
To: Discussion of Numerical Python
Sent: Tuesday, 30 December 2008, 16:34:27
Subject: Re: [Numpy-discussion] Alternative to record array

On Tuesday 30 December 2008, Francesc Alted wrote:
> On Monday 29 December 2008, Jean-Baptiste Rudant wrote:
[snip]
>
> The difference for both approaches is that the row-wise arrangement
> is more efficient when data is iterated by record, while the
> column-wise one is more efficient when data is iterated by column.
> This is why you are seeing the increase of 4x in performance
> --incidentally, by looking at both data arrangements, I'd expect an
> increase of just 2x (the stride count is 2 in this case), but I
> suspect that there are hidden copies during the increment operation
> for the record array case.

As I was mystified about this difference in speed, I kept investigating
and I think I have an answer for the difference in the expected
speed-up in the in-place increment operator over a recarray field.
After looking at the numpy code, it turns out that the next statement:

data.ages += 1

is more or less equivalent to:

a = data.ages
a[:] = a + 1

i.e. a temporary is created (for keeping the result of 'a + 1') and
then assigned to the 'ages' column. As it happens that, in this sort of
operations, the memory copies are the bottleneck, the creation of the
first temporary introduced a slowdown of 2x (due to the strided column)
and the assignment represents the additional 2x (4x in total).

However, the next idiom:

a = data.ages
a += 1

effectively removes the need for the temporary copy and is 2x faster
than the original "data.ages += 1". This can be seen in the next simple
benchmark:

---------------------------
import numpy, timeit

count = int(10e6)
ages = numpy.random.randint(0,100,count)
weights = numpy.random.randint(1,200,count)
data = numpy.rec.fromarrays((ages,weights),names='ages,weights')

timer = timeit.Timer('data.ages += 1','from __main__ import data')
print "v0-->", timer.timeit(number=10)
timer = timeit.Timer('a=data.ages; a[:] = a + 1','from __main__ import data')
print "v1-->", timer.timeit(number=10)
timer = timeit.Timer('a=data.ages; a += 1','from __main__ import data')
print "v2-->", timer.timeit(number=10)
timer = timeit.Timer('ages += 1','from __main__ import ages')
print "v3-->", timer.timeit(number=10)
---------------------------

which produces the next output on my laptop:

v0--> 2.98340201378
v1--> 3.22748112679
v2--> 1.5474319458
v3--> 0.809724807739

As a final comment, I suppose that in-place operators (+=, -= ...) can
be optimized in the context of recarray columns in numpy, but I don't
think it is worth the effort: when really high performance is needed
for operating with columns in the context of recarrays, a column-wise
approach is best.

Cheers,

--
Francesc Alted

From mlandis001 at comcast.net  Fri Jan  2 18:27:46 2009
From: mlandis001 at comcast.net (Mike Landis)
Date: Fri, 02 Jan 2009 18:27:46 -0500
Subject: [Numpy-discussion] building numpy/scipy
In-Reply-To: <5b8d13220901021043x34b5f823r58aa09c9061c925@mail.gmail.com>
References: <20090102183005.27311C7C019@scipy.org>
	<5b8d13220901021043x34b5f823r58aa09c9061c925@mail.gmail.com>
Message-ID: <20090102232825.F26BBC8410D@scipy.org>

An HTML attachment was scrubbed...

From mlandis001 at comcast.net  Fri Jan  2 20:45:19 2009
From: mlandis001 at comcast.net (Mike Landis)
Date: Fri, 02 Jan 2009 20:45:19 -0500
Subject: [Numpy-discussion] building numpy/scipy
In-Reply-To: <20090102232825.F26BBC8410D@scipy.org>
References: <20090102183005.27311C7C019@scipy.org>
	<5b8d13220901021043x34b5f823r58aa09c9061c925@mail.gmail.com>
	<20090102232825.F26BBC8410D@scipy.org>
Message-ID: <20090103014559.81970C7C009@scipy.org>

An HTML attachment was scrubbed...
From mlandis001 at comcast.net  Fri Jan  2 22:10:07 2009
From: mlandis001 at comcast.net (Mike Landis)
Date: Fri, 02 Jan 2009 22:10:07 -0500
Subject: [Numpy-discussion] building numpy/scipy
References: <20090102183005.27311C7C019@scipy.org>
Message-ID: <20090103031048.38829C7C009@scipy.org>

Maybe the reason I'm having trouble is that I'm trying to get it
working on Windows, when almost everyone else is running on Linux? I
have cygwin with f77, g++, make, ... installed, but it's definitely not
a Linux machine. I'm working from the windows install documentation
page. Maybe there are additional steps that you wouldn't have on Linux.

It's not clear whether BLAS and LAPACK and ATLAS are all distinct or if
building one gets you one or more of the others, e.g. the ATLAS build
produces both blas and lapack directories. Is there a third ATLAS
library?

When I set up site.cfg in site-packages/numpy and run "python setup.py
install" from there, it complains that this isn't the right place to
run setup.py, so I put the site.cfg in d:\temp\numpy-1.2.1\ and ran
setup.py install there. I now have ATLAS libraries, so my site.cfg
looks like this:

[atlas]
library_dirs = d:\Docs\ATLAS\build\lib
atlas_libs = lapack, f77blas, cblas, atlas

-----------------------------------------------------

Some good news, some bad, details below. Any suggestions would be
appreciated...

Thanks,
Mike

Running from numpy source directory.
blas_opt_info:
blas_mkl_info:
  libraries mkl, vml, guide not found in d:\Programs\Python25\lib
  libraries mkl, vml, guide not found in C:\
  libraries mkl, vml, guide not found in d:\Programs\Python25\libs
  NOT AVAILABLE

atlas_blas_threads_info:
Setting PTATLAS=ATLAS
Setting PTATLAS=ATLAS
Setting PTATLAS=ATLAS
  FOUND:
    libraries = ['lapack', 'f77blas', 'cblas', 'atlas']
    library_dirs = ['d:\\Docs\\ATLAS\\build\\lib']
    language = c

No module named msvccompiler in numpy.distutils; trying from distutils
  FOUND:
    libraries = ['lapack', 'f77blas', 'cblas', 'atlas']
    library_dirs = ['d:\\Docs\\ATLAS\\build\\lib']
    language = c
    define_macros = [('ATLAS_INFO', '"\\"?.?.?\\""')]

lapack_opt_info:
lapack_mkl_info:
mkl_info:
  libraries mkl, vml, guide not found in d:\Programs\Python25\lib
  libraries mkl, vml, guide not found in C:\
  libraries mkl, vml, guide not found in d:\Programs\Python25\libs
  NOT AVAILABLE
  NOT AVAILABLE

atlas_threads_info:
Setting PTATLAS=ATLAS
  libraries lapack_atlas not found in d:\Docs\ATLAS\build\lib
numpy.distutils.system_info.atlas_threads_info
Setting PTATLAS=ATLAS
d:\temp\numpy-1.2.1\numpy\distutils\system_info.py:955: UserWarning:
********************************************************************************
    Lapack library (from ATLAS) is probably incomplete:
    size of d:\Docs\ATLAS\build\lib\liblapack.a is 251k (expected >4000k)
    Follow the instructions in the KNOWN PROBLEMS section of the file
    numpy/INSTALL.txt. [NOTE: There is no such section in this file.]
********************************************************************************
  warnings.warn(message)
Setting PTATLAS=ATLAS
  FOUND:
    libraries = ['lapack', 'lapack', 'f77blas', 'cblas', 'atlas']
    library_dirs = ['d:\\Docs\\ATLAS\\build\\lib']
    language = c

No module named msvccompiler in numpy.distutils; trying from distutils
  FOUND:
    libraries = ['lapack', 'lapack', 'f77blas', 'cblas', 'atlas']
    library_dirs = ['d:\\Docs\\ATLAS\\build\\lib']
    language = c
    define_macros = [('ATLAS_INFO', '"\\"?.?.?\\""')]

could not resolve pattern in '': '*.txt'
non-existing path in '': 'COMPATIBILITY'
running install
running build
running config_cc
unifying config_cc, config, build_clib, build_ext, build commands --compiler options
running config_fc
unifying config_fc, config, build_clib, build_ext, build commands --fcompiler options
running build_src
building py_modules sources
building extension "numpy.core.multiarray" sources
adding 'build\src.win32-2.5\numpy\core\include/numpy\config.h' to sources.
adding 'build\src.win32-2.5\numpy\core\include/numpy\numpyconfig.h' to sources.
executing numpy\core\code_generators\generate_numpy_api.py
adding 'build\src.win32-2.5\numpy\core\include/numpy\__multiarray_api.h' to sources.
adding 'build\src.win32-2.5\numpy\core\src' to include_dirs.
numpy.core - nothing done with h_files = ['build\\src.win32-2.5\\numpy\\core\\src\\scalartypes.inc', 'build\\src.win32-2.5\\numpy\\core\\src\\arraytypes.inc', 'build\\src.win32-2.5\\numpy\\core\\include/numpy\\config.h', 'build\\src.win32-2.5\\numpy\\core\\include/numpy\\numpyconfig.h', 'build\\src.win32-2.5\\numpy\\core\\include/numpy\\__multiarray_api.h']
building extension "numpy.core.umath" sources
adding 'build\src.win32-2.5\numpy\core\include/numpy\config.h' to sources.
adding 'build\src.win32-2.5\numpy\core\include/numpy\numpyconfig.h' to sources.
executing numpy\core\code_generators\generate_ufunc_api.py
adding 'build\src.win32-2.5\numpy\core\include/numpy\__ufunc_api.h' to sources.
adding 'build\src.win32-2.5\numpy\core\src' to include_dirs.
numpy.core - nothing done with h_files = ['build\\src.win32-2.5\\numpy\\core\\include/numpy\\config.h', 'build\\src.win32-2.5\\numpy\\core\\include/numpy\\numpyconfig.h', 'build\\src.win32-2.5\\numpy\\core\\include/numpy\\__multiarray_api.h', 'build\\src.win32-2.5\\numpy\\core\\include/numpy\\__ufunc_api.h'] building extension "numpy.core._dotblas" sources adding 'numpy\core\blasdot\_dotblas.c' to sources. building extension "numpy.lib._compiled_base" sources building extension "numpy.numarray._capi" sources building extension "numpy.fft.fftpack_lite" sources building extension "numpy.linalg.lapack_lite" sources ### Warning: python_xerbla.c is disabled ### adding 'numpy\linalg\lapack_litemodule.c' to sources. building extension "numpy.random.mtrand" sources Traceback (most recent call last): File "setup.py", line 96, in setup_package() File "setup.py", line 89, in setup_package configuration=configuration ) File "D:\Programs\Python25\Lib\site-packages\numpy-1.2.1\numpy\distutils\core.py", line 184, in setup File "d:\programs\python25\lib\distutils\core.py", line 151, in setup dist.run_commands() File "d:\programs\python25\lib\distutils\dist.py", line 974, in run_commands self.run_command(cmd) File "d:\programs\python25\lib\distutils\dist.py", line 994, in run_command cmd_obj.run() File "D:\Programs\Python25\Lib\site-packages\numpy-1.2.1\numpy\distutils\command\install.py", line 49, in run File "d:\programs\python25\lib\distutils\command\install.py", line 506, in run self.run_command('build') File "d:\programs\python25\lib\distutils\cmd.py", line 333, in run_command self.distribution.run_command(command) File "d:\programs\python25\lib\distutils\dist.py", line 994, in run_command cmd_obj.run() File "D:\Programs\Python25\Lib\site-packages\numpy-1.2.1\numpy\distutils\command\build.py", line 37, in run File "d:\programs\python25\lib\distutils\command\build.py", line 112, in run self.run_command(cmd_name) File "d:\programs\python25\lib\distutils\cmd.py", line 333, in run_command self.distribution.run_command(command) File "d:\programs\python25\lib\distutils\dist.py", line 994, in run_command cmd_obj.run() File "D:\Programs\Python25\Lib\site-packages\numpy-1.2.1\numpy\distutils\command\build_src.py", line 130, in run File "D:\Programs\Python25\Lib\site-packages\numpy-1.2.1\numpy\distutils\command\build_src.py", line 147, in build_sources File "D:\Programs\Python25\Lib\site-packages\numpy-1.2.1\numpy\distutils\command\build_src.py", line 250, in build_extension_sources File "D:\Programs\Python25\Lib\site-packages\numpy-1.2.1\numpy\distutils\command\build_src.py", line 307, in generate_sources File "numpy\random\setup.py", line 11, in generate_libraries if config_cmd.try_run(tc): File "d:\programs\python25\lib\distutils\command\config.py", line 278, in try_run self._check_compiler() File "D:\Programs\Python25\Lib\site-packages\numpy-1.2.1\numpy\distutils\command\config.py", line 26, in _check_compiler File "d:\programs\python25\lib\distutils\command\config.py", line 107, in _check_compiler dry_run=self.dry_run, force=1) File "D:\Programs\Python25\Lib\site-packages\numpy-1.2.1\numpy\distutils\ccompiler.py", line 366, in new_compiler File "D:\Programs\Python25\Lib\site-packages\numpy-1.2.1\numpy\distutils\mingw32ccompiler.py", line 46, in __init__ File "d:\programs\python25\lib\distutils\cygwinccompiler.py", line 84, in __init__ get_versions() File "d:\programs\python25\lib\distutils\cygwinccompiler.py", line 424, in get_versions ld_version = StrictVersion(result.group(1)) File 
"d:\programs\python25\lib\distutils\version.py", line 40, in __init__ self.parse(vstring) File "d:\programs\python25\lib\distutils\version.py", line 107, in parse raise ValueError, "invalid version number '%s'" % vstring ValueError: invalid version number '2.18.50.20080625' From tgrav at mac.com Fri Jan 2 23:05:15 2009 From: tgrav at mac.com (Tommy Grav) Date: Fri, 02 Jan 2009 23:05:15 -0500 Subject: [Numpy-discussion] building numpy/scipy In-Reply-To: <20090103031048.38829C7C009@scipy.org> References: <20090102183005.27311C7C019@scipy.org> <20090103031048.38829C7C009@scipy.org> Message-ID: <524D03AE-2D0E-4A9A-8FE7-BBB700BC34D9@mac.com> > Is there any reason why you can not use the numpy-1.2.1-win32- superpack-python2.4.exe from the http://sourceforge.net/project/showfiles.php?group_id=1369&package_id=175103 download page? I think that is what Mr. Kern meant by using the binaries. This will install already built code into the proper places on your Windows box. Cheers Tommy From mlandis001 at comcast.net Fri Jan 2 23:26:52 2009 From: mlandis001 at comcast.net (Mike Landis) Date: Fri, 02 Jan 2009 23:26:52 -0500 Subject: [Numpy-discussion] building numpy/scipy In-Reply-To: <524D03AE-2D0E-4A9A-8FE7-BBB700BC34D9@mac.com> References: <20090102183005.27311C7C019@scipy.org> <20090103031048.38829C7C009@scipy.org> <524D03AE-2D0E-4A9A-8FE7-BBB700BC34D9@mac.com> Message-ID: <20090103042733.2F910C7C009@scipy.org> Have to use Pyton 2.5 because I'm also using web2py. Python 2.5 and a bunch of packages that depend on it are already installed. At 11:05 PM 1/2/2009, you wrote: > > > >Is there any reason why you can not use the numpy-1.2.1-win32- >superpack-python2.4.exe >from the >http://sourceforge.net/project/showfiles.php?group_id=1369&package_id=175103 >download page? I think that is what Mr. Kern meant by using the >binaries. This will install >already built code into the proper places on your Windows box. > >Cheers > Tommy >_______________________________________________ >Numpy-discussion mailing list >Numpy-discussion at scipy.org >http://projects.scipy.org/mailman/listinfo/numpy-discussion > >No virus found in this incoming message. >Checked by AVG - http://www.avg.com >Version: 8.0.176 / Virus Database: 270.10.2/1871 - Release Date: >1/1/2009 5:01 PM From tgrav at mac.com Fri Jan 2 23:45:01 2009 From: tgrav at mac.com (Tommy Grav) Date: Fri, 02 Jan 2009 23:45:01 -0500 Subject: [Numpy-discussion] building numpy/scipy In-Reply-To: <20090103042733.2F910C7C009@scipy.org> References: <20090102183005.27311C7C019@scipy.org> <20090103031048.38829C7C009@scipy.org> <524D03AE-2D0E-4A9A-8FE7-BBB700BC34D9@mac.com> <20090103042733.2F910C7C009@scipy.org> Message-ID: <81A59E45-69B7-4351-B44C-95441A5A9F83@mac.com> There is a superpack for the python2.5 at the same page. Again a binary .exe file that should make the installing a fair bit easier. Cheers Tommy On Jan 2, 2009, at 11:26 PM, Mike Landis wrote: > Have to use Pyton 2.5 because I'm also using web2py. Python 2.5 and > a bunch of packages that depend on it are already installed. > > At 11:05 PM 1/2/2009, you wrote: >>> >> >> Is there any reason why you can not use the numpy-1.2.1-win32- >> superpack-python2.4.exe >> from the >> http://sourceforge.net/project/showfiles.php?group_id=1369&package_id=175103 >> download page? I think that is what Mr. Kern meant by using the >> binaries. This will install >> already built code into the proper places on your Windows box. 
From tgrav at mac.com  Fri Jan  2 23:45:01 2009
From: tgrav at mac.com (Tommy Grav)
Date: Fri, 02 Jan 2009 23:45:01 -0500
Subject: [Numpy-discussion] building numpy/scipy
In-Reply-To: <20090103042733.2F910C7C009@scipy.org>
References: <20090102183005.27311C7C019@scipy.org>
	<20090103031048.38829C7C009@scipy.org>
	<524D03AE-2D0E-4A9A-8FE7-BBB700BC34D9@mac.com>
	<20090103042733.2F910C7C009@scipy.org>
Message-ID: <81A59E45-69B7-4351-B44C-95441A5A9F83@mac.com>

There is a superpack for python2.5 at the same page. Again a binary
.exe file that should make the installing a fair bit easier.

Cheers
Tommy

On Jan 2, 2009, at 11:26 PM, Mike Landis wrote:
> Have to use Python 2.5 because I'm also using web2py. Python 2.5 and
> a bunch of packages that depend on it are already installed.

From mlandis001 at comcast.net  Sat Jan  3 00:07:19 2009
From: mlandis001 at comcast.net (Mike Landis)
Date: Sat, 03 Jan 2009 00:07:19 -0500
Subject: [Numpy-discussion] building numpy/scipy
In-Reply-To: <81A59E45-69B7-4351-B44C-95441A5A9F83@mac.com>
References: <20090102183005.27311C7C019@scipy.org>
	<20090103031048.38829C7C009@scipy.org>
	<524D03AE-2D0E-4A9A-8FE7-BBB700BC34D9@mac.com>
	<20090103042733.2F910C7C009@scipy.org>
	<81A59E45-69B7-4351-B44C-95441A5A9F83@mac.com>
Message-ID: <20090103050805.49D18C7C009@scipy.org>

Does it sort of non-destructively overlay the 2.5 that I'm already
running?

At 11:45 PM 1/2/2009, you wrote:
> There is a superpack for python2.5 at the same page. Again a binary
> .exe file that should make the installing a fair bit easier.

From mlandis001 at comcast.net  Sat Jan  3 00:11:11 2009
From: mlandis001 at comcast.net (Mike Landis)
Date: Sat, 03 Jan 2009 00:11:11 -0500
Subject: [Numpy-discussion] installing numpy with Python2.5 already present
Message-ID: <20090103051153.50F33C7C009@scipy.org>

This time I included both ATLAS and MKL in the site.cfg file and got a
little further...
D:\temp\numpy-1.2.1\site.cfg now looks like: [atlas] library_dirs = d:\Docs\ATLAS\build\lib atlas_libs = lapack, f77blas, cblas, atlas [mkl] include_dirs = D:\Programs\Intel\MKL\10.1.0.018\include library_dirs = D:\Programs\Intel\MKL\10.1.0.018\ia32\lib mkl_libs = mkl_ia32, mkl_c_dll, libguide40 lapack_libs = mkl_lapack ----------------------------------------------------------------------------------- FOUND a few more things... though still having problems finding blas_opt_info, blas_mkl_info (even though it finds MKL later???) Install log follows... ----------------------------------------------------------------------------------- Running from numpy source directory. F2PY Version 2_5972 blas_opt_info: blas_mkl_info: libraries mkl_ia32,mkl_c_dll,libguide40 not found in D:\Programs\Intel\MKL\10.1.0.018\ia32\lib NOT AVAILABLE atlas_blas_threads_info: Setting PTATLAS=ATLAS Setting PTATLAS=ATLAS Setting PTATLAS=ATLAS FOUND: libraries = ['lapack', 'f77blas', 'cblas', 'atlas'] library_dirs = ['d:\\Docs\\ATLAS\\build\\lib'] language = c No module named msvccompiler in numpy.distutils; trying from distutils FOUND: libraries = ['lapack', 'f77blas', 'cblas', 'atlas'] library_dirs = ['d:\\Docs\\ATLAS\\build\\lib'] language = c define_macros = [('ATLAS_INFO', '"\\"?.?.?\\""')] lapack_opt_info: lapack_mkl_info: mkl_info: libraries mkl_ia32,mkl_c_dll,libguide40 not found in D:\Programs\Intel\MKL\10.1.0.018\ia32\lib NOT AVAILABLE NOT AVAILABLE atlas_threads_info: Setting PTATLAS=ATLAS libraries lapack_atlas not found in d:\Docs\ATLAS\build\lib numpy.distutils.system_info.atlas_threads_info Setting PTATLAS=ATLAS d:\temp\numpy-1.2.1\numpy\distutils\system_info.py:955: UserWarning: ********************************************************************* Lapack library (from ATLAS) is probably incomplete: size of d:\Docs\ATLAS\build\lib\liblapack.a is 251k (expected >4000k) Follow the instructions in the KNOWN PROBLEMS section of the file numpy/INSTALL.txt. ********************************************************************* warnings.warn(message) Setting PTATLAS=ATLAS FOUND: libraries = ['lapack', 'f77blas', 'cblas', 'atlas'] library_dirs = ['d:\\Docs\\ATLAS\\build\\lib'] language = c No module named msvccompiler in numpy.distutils; trying from distutils FOUND: libraries = ['lapack', 'f77blas', 'cblas', 'atlas'] library_dirs = ['d:\\Docs\\ATLAS\\build\\lib'] language = c define_macros = [('ATLAS_INFO', '"\\"?.?.?\\""')] could not resolve pattern in '': '*.txt' non-existing path in '': 'COMPATIBILITY' running install running build running config_cc unifing config_cc, config, build_clib, build_ext, build commands --compiler options running config_fc unifing config_fc, config, build_clib, build_ext, build commands --fcompiler options running build_src building py_modules sources building extension "numpy.core.multiarray" sources adding 'build\src.win32-2.5\numpy\core\include/numpy\config.h' to sources. adding 'build\src.win32-2.5\numpy\core\include/numpy\numpyconfig.h' to sources. executing numpy\core\code_generators\generate_numpy_api.py adding 'build\src.win32-2.5\numpy\core\include/numpy\__multiarray_api.h' to sources. adding 'build\src.win32-2.5\numpy\core\src' to include_dirs. 
numpy.core - nothing done with h_files = ['build\\src.win32-2.5\\numpy\\core\\src\\scalartypes.inc', 'build\\src.win32-2.5\\numpy\\core\\src\\arraytypes.inc', 'build\\src.win32-2.5\\numpy\\core\\include/numpy\\config.h', 'build\\src.win32-2.5\\numpy\\core\\include/numpy\\numpyconfig.h', 'build\\src.win32-2.5\\numpy\\core\\include/numpy\\__multiarray_api.h'] building extension "numpy.core.umath" sources adding 'build\src.win32-2.5\numpy\core\include/numpy\config.h' to sources. adding 'build\src.win32-2.5\numpy\core\include/numpy\numpyconfig.h' to sources. executing numpy\core\code_generators\generate_ufunc_api.py adding 'build\src.win32-2.5\numpy\core\include/numpy\__ufunc_api.h' to sources. adding 'build\src.win32-2.5\numpy\core\src' to include_dirs. numpy.core - nothing done with h_files = ['build\\src.win32-2.5\\numpy\\core\\src\\scalartypes.inc', 'build\\src.win32-2.5\\numpy\\core\\src\\arraytypes.inc', 'build\\src.win32-2.5\\numpy\\core\\include/numpy\\config.h', 'build\\src.win32-2.5\\numpy\\core\\include/numpy\\numpyconfig.h', 'build\\src.win32-2.5\\numpy\\core\\include/numpy\\__ufunc_api.h'] building extension "numpy.core._sort" sources adding 'build\src.win32-2.5\numpy\core\include/numpy\config.h' to sources. adding 'build\src.win32-2.5\numpy\core\include/numpy\numpyconfig.h' to sources. executing numpy\core\code_generators\generate_numpy_api.py adding 'build\src.win32-2.5\numpy\core\include/numpy\__multiarray_api.h' to sources. numpy.core - nothing done with h_files = ['build\\src.win32-2.5\\numpy\\core\\include/numpy\\config.h', 'build\\src.win32-2.5\\numpy\\core\\include/numpy\\numpyconfig.h', 'build\\src.win32-2.5\\numpy\\core\\include/numpy\\__multiarray_api.h'] building extension "numpy.core.scalarmath" sources adding 'build\src.win32-2.5\numpy\core\include/numpy\config.h' to sources. adding 'build\src.win32-2.5\numpy\core\include/numpy\numpyconfig.h' to sources. executing numpy\core\code_generators\generate_numpy_api.py adding 'build\src.win32-2.5\numpy\core\include/numpy\__multiarray_api.h' to sources. executing numpy\core\code_generators\generate_ufunc_api.py adding 'build\src.win32-2.5\numpy\core\include/numpy\__ufunc_api.h' to sources. numpy.core - nothing done with h_files = ['build\\src.win32-2.5\\numpy\\core\\include/numpy\\config.h', 'build\\src.win32-2.5\\numpy\\core\\include/numpy\\numpyconfig.h', 'build\\src.win32-2.5\\numpy\\core\\include/numpy\\__multiarray_api.h', 'build\\src.win32-2.5\\numpy\\core\\include/numpy\\__ufunc_api.h'] building extension "numpy.core._dotblas" sources adding 'numpy\core\blasdot\_dotblas.c' to sources. building extension "numpy.lib._compiled_base" sources building extension "numpy.numarray._capi" sources building extension "numpy.fft.fftpack_lite" sources building extension "numpy.linalg.lapack_lite" sources ### Warning: python_xerbla.c is disabled ### adding 'numpy\linalg\lapack_litemodule.c' to sources. 
building extension "numpy.random.mtrand" sources Traceback (most recent call last): File "setup.py", line 96, in setup_package() File "setup.py", line 89, in setup_package configuration=configuration ) File "D:\Programs\Python25\Lib\site-packages\numpy-1.2.1\numpy\distutils\core.py", line 184, in setup File "d:\programs\python25\lib\distutils\core.py", line 151, in setup dist.run_commands() File "d:\programs\python25\lib\distutils\dist.py", line 974, in run_commands self.run_command(cmd) File "d:\programs\python25\lib\distutils\dist.py", line 994, in run_command cmd_obj.run() File "D:\Programs\Python25\Lib\site-packages\numpy-1.2.1\numpy\distutils\command\install.py", line 49, in run File "d:\programs\python25\lib\distutils\command\install.py", line 506, in run self.run_command('build') File "d:\programs\python25\lib\distutils\cmd.py", line 333, in run_command self.distribution.run_command(command) File "d:\programs\python25\lib\distutils\dist.py", line 994, in run_command cmd_obj.run() File "D:\Programs\Python25\Lib\site-packages\numpy-1.2.1\numpy\distutils\command\build.py", line 37, in run File "d:\programs\python25\lib\distutils\command\build.py", line 112, in run self.run_command(cmd_name) File "d:\programs\python25\lib\distutils\cmd.py", line 333, in run_command self.distribution.run_command(command) File "d:\programs\python25\lib\distutils\dist.py", line 994, in run_command cmd_obj.run() File "D:\Programs\Python25\Lib\site-packages\numpy-1.2.1\numpy\distutils\command\build_src.py", line 130, in run File "D:\Programs\Python25\Lib\site-packages\numpy-1.2.1\numpy\distutils\command\build_src.py", line 147, in build_sources File "D:\Programs\Python25\Lib\site-packages\numpy-1.2.1\numpy\distutils\command\build_src.py", line 250, in build_extension_sources File "D:\Programs\Python25\Lib\site-packages\numpy-1.2.1\numpy\distutils\command\build_src.py", line 307, in generate_sources File "numpy\random\setup.py", line 11, in generate_libraries if config_cmd.try_run(tc): File "d:\programs\python25\lib\distutils\command\config.py", line 278, in try_run self._check_compiler() File "D:\Programs\Python25\Lib\site-packages\numpy-1.2.1\numpy\distutils\command\config.py", line 26, in _check_compiler File "d:\programs\python25\lib\distutils\command\config.py", line 107, in _check_compiler dry_run=self.dry_run, force=1) File "D:\Programs\Python25\Lib\site-packages\numpy-1.2.1\numpy\distutils\ccompiler.py", line 366, in new_compiler File "D:\Programs\Python25\Lib\site-packages\numpy-1.2.1\numpy\distutils\mingw32ccompiler.py", line 46, in __init__ File "d:\programs\python25\lib\distutils\cygwinccompiler.py", line 84, in __init__ get_versions() File "d:\programs\python25\lib\distutils\cygwinccompiler.py", line 424, in get_versions ld_version = StrictVersion(result.group(1)) File "d:\programs\python25\lib\distutils\version.py", line 40, in __init__ self.parse(vstring) File "d:\programs\python25\lib\distutils\version.py", line 107, in parse raise ValueError, "invalid version number '%s'" % vstring ValueError: invalid version number '2.18.50.20080625' From cournape at gmail.com Sat Jan 3 00:15:53 2009 From: cournape at gmail.com (David Cournapeau) Date: Sat, 3 Jan 2009 14:15:53 +0900 Subject: [Numpy-discussion] building numpy/scipy In-Reply-To: <20090103050805.49D18C7C009@scipy.org> References: <20090102183005.27311C7C019@scipy.org> <20090103031048.38829C7C009@scipy.org> <524D03AE-2D0E-4A9A-8FE7-BBB700BC34D9@mac.com> <20090103042733.2F910C7C009@scipy.org> <81A59E45-69B7-4351-B44C-95441A5A9F83@mac.com> 
	<20090103050805.49D18C7C009@scipy.org>
Message-ID: <5b8d13220901022115p1f07d3desde3ce9a152d1fa06@mail.gmail.com>

On Sat, Jan 3, 2009 at 2:07 PM, Mike Landis wrote:
> Does it sort of non-destructively overlay the 2.5 that I'm already
> running?

It only installs numpy into your existing python installation - it will
not overwrite anything else in your python installation (e.g.
everything else should work as before). You can see numpy as a "plugin"
for python, and the binary installer will just install the numpy
"plugin". You will need to have python installed first.

Note also that if you need more than numpy and scipy, there are also
other options which can make your life easier, like enthought or
pythonxy:

http://www.pythonxy.com/foreword.php
http://www.enthought.com/products/epd.php

In that case, they install many packages related to scientific tasks,
and are self contained. For newcomers on windows, this is the
recommended approach if you don't mind the size.

David

From cournape at gmail.com  Sat Jan  3 00:19:11 2009
From: cournape at gmail.com (David Cournapeau)
Date: Sat, 3 Jan 2009 14:19:11 +0900
Subject: [Numpy-discussion] building numpy/scipy
In-Reply-To: <20090103031048.38829C7C009@scipy.org>
References: <20090102183005.27311C7C019@scipy.org>
	<20090103031048.38829C7C009@scipy.org>
Message-ID: <5b8d13220901022119v3ef22a3bw3f8aee29caee600b@mail.gmail.com>

On Sat, Jan 3, 2009 at 12:10 PM, Mike Landis wrote:
> Maybe the reason I'm having trouble is that I'm trying to get it
> working on Windows, when almost everyone else is running on Linux?

It is true that most developers use some sort of unix (linux, mac os
X), but we definitely try to make sure it works as well on windows
natively (e.g. without cygwin or any other kind of emulation). But
windows being windows, the only way to make that happen in a finite
amount of time is to distribute binaries only (the so-called superpack)
- if you are new to python development, you can't expect to be able to
build from sources in ten minutes. But you can if you use the binaries.

David

From cournape at gmail.com  Sat Jan  3 00:24:08 2009
From: cournape at gmail.com (David Cournapeau)
Date: Sat, 3 Jan 2009 14:24:08 +0900
Subject: [Numpy-discussion] installing numpy with Python2.5 already present
In-Reply-To: <20090103051153.50F33C7C009@scipy.org>
References: <20090103051153.50F33C7C009@scipy.org>
Message-ID: <5b8d13220901022124p35b94c98u8d5b057877c1193b@mail.gmail.com>

On Sat, Jan 3, 2009 at 2:11 PM, Mike Landis wrote:
> This time I included both ATLAS and MKL in the site.cfg file and got
> a little further... D:\temp\numpy-1.2.1\site.cfg now looks like:
> ...
> ValueError: invalid version number '2.18.50.20080625'

This is a bug in python. Really, you should use the binaries as
recommended previously - it will take one minute, and that's how the
vast majority of windows users install numpy.

David

From mlandis001 at comcast.net  Sat Jan  3 01:07:59 2009
From: mlandis001 at comcast.net (Mike Landis)
Date: Sat, 03 Jan 2009 01:07:59 -0500
Subject: [Numpy-discussion] building numpy/scipy
In-Reply-To: <5b8d13220901022119v3ef22a3bw3f8aee29caee600b@mail.gmail.com>
References: <20090102183005.27311C7C019@scipy.org>
	<20090103031048.38829C7C009@scipy.org>
	<5b8d13220901022119v3ef22a3bw3f8aee29caee600b@mail.gmail.com>
Message-ID: <20090103060841.53FA0C8410D@scipy.org>

An HTML attachment was scrubbed...
From josef.pktd at gmail.com  Sat Jan  3 01:12:30 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Sat, 3 Jan 2009 01:12:30 -0500
Subject: [Numpy-discussion] installing numpy with Python2.5 already present
In-Reply-To: <5b8d13220901022124p35b94c98u8d5b057877c1193b@mail.gmail.com>
References: <20090103051153.50F33C7C009@scipy.org>
	<5b8d13220901022124p35b94c98u8d5b057877c1193b@mail.gmail.com>
Message-ID: <1cd32cbb0901022212l154cfcecnc6560a78cb2b7702@mail.gmail.com>

On Sat, Jan 3, 2009 at 12:24 AM, David Cournapeau wrote:
> On Sat, Jan 3, 2009 at 2:11 PM, Mike Landis wrote:
>> This time I included both ATLAS and MKL in the site.cfg file and got
>> a little further...
>> ...
>> ValueError: invalid version number '2.18.50.20080625'
>
> This is a bug in python. Really, you should use the binaries as
> recommended previously - it will take one minute, and that's how the
> vast majority of windows users install numpy.

I'm working only on Windows, and I never had any problems with the
numpy installer. And until recently, I never tried to build it myself.
It is easier to install additional packages once numpy and
numpy.distutils are properly set up.

One problem on Windows is to get the path correct. When I'm building
scipy or scikits or any other package with C extensions, I only use
MinGW, and I don't have cygwin installed. On my old computer I had
cygwin, and I read stories that you shouldn't have cygwin on the
Windows path, since some programs got confused with conflicting Windows
and cygwin versions.

Just a guess: from your error message, I would think that python
distutils finds your cygwin installation. Instead, I would hide it by
removing it from the windows path and work only with MinGW.

Josef

From josef.pktd at gmail.com  Sat Jan  3 01:30:24 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Sat, 3 Jan 2009 01:30:24 -0500
Subject: [Numpy-discussion] building numpy/scipy
In-Reply-To: <20090103060841.53FA0C8410D@scipy.org>
References: <20090102183005.27311C7C019@scipy.org>
	<20090103031048.38829C7C009@scipy.org>
	<5b8d13220901022119v3ef22a3bw3f8aee29caee600b@mail.gmail.com>
	<20090103060841.53FA0C8410D@scipy.org>
Message-ID: <1cd32cbb0901022230x2c56adc7tf6a3de4bd478881e@mail.gmail.com>

On Sat, Jan 3, 2009 at 1:07 AM, Mike Landis wrote:
> Cygwin is present, so not just the dumbed down Windows CMD available.
>
> I ran the numpy-1.2.1 superpack. Verified that it installed (cause you
> don't get near as much output as you do from a shell prompt) by
> running:
>
>   python -c 'import numpy; print numpy.__version__'
>
> and got the numpy version number back. Found the site.cfg that the
> installer left in the numpy package directory (it picked up on my
> ATLAS install, but not on MKL) and copied it into
> d:\temp\scipy-0.7.0b1 and ran "python setup.py install" there. Lots of
> positive looking output, but it ultimately crapped out.
> Here's the tail end of that transcript:
>
> copying scipy\weave\__init__.py -> build\lib.win32-2.5\scipy\weave
> running build_clib
> Traceback (most recent call last):
>   File "setup.py", line 92, in
>     setup_package()
>   File "setup.py", line 84, in setup_package
>     configuration=configuration )
>   File "d:\Programs\Python25\lib\site-packages\numpy\distutils\core.py", line 184, in setup
>     return old_setup(**new_attr)
>   File "d:\programs\python25\lib\distutils\core.py", line 151, in setup
>     dist.run_commands()
>   File "d:\programs\python25\lib\distutils\dist.py", line 974, in run_commands
>     self.run_command(cmd)
>   File "d:\programs\python25\lib\distutils\dist.py", line 994, in run_command
>     cmd_obj.run()
>   File "d:\Programs\Python25\lib\site-packages\numpy\distutils\command\install.py", line 49, in run
>     r = old_install.run(self)
>   File "d:\programs\python25\lib\distutils\command\install.py", line 506, in run
>     self.run_command('build')
>   File "d:\programs\python25\lib\distutils\cmd.py", line 333, in run_command
>     self.distribution.run_command(command)
>   File "d:\programs\python25\lib\distutils\dist.py", line 994, in run_command
>     cmd_obj.run()
>   File "d:\Programs\Python25\lib\site-packages\numpy\distutils\command\build.py", line 37, in run
>     old_build.run(self)
>   File "d:\programs\python25\lib\distutils\command\build.py", line 112, in run
>     self.run_command(cmd_name)
>   File "d:\programs\python25\lib\distutils\cmd.py", line 333, in run_command
>     self.distribution.run_command(command)
>   File "d:\programs\python25\lib\distutils\dist.py", line 994, in run_command
>     cmd_obj.run()
>   File "d:\Programs\Python25\lib\site-packages\numpy\distutils\command\build_clib.py", line 63, in run
>     force=self.force)
>   File "d:\Programs\Python25\lib\site-packages\numpy\distutils\ccompiler.py", line 366, in new_compiler
>     compiler = klass(None, dry_run, force)
>   File "d:\Programs\Python25\lib\site-packages\numpy\distutils\mingw32ccompiler.py", line 46, in __init__
>     verbose,dry_run, force)
>   File "d:\programs\python25\lib\distutils\cygwinccompiler.py", line 84, in __init__
>     get_versions()
>   File "d:\programs\python25\lib\distutils\cygwinccompiler.py", line 424, in get_versions
>     ld_version = StrictVersion(result.group(1))
>   File "d:\programs\python25\lib\distutils\version.py", line 40, in __init__
>     self.parse(vstring)
>   File "d:\programs\python25\lib\distutils\version.py", line 107, in parse
>     raise ValueError, "invalid version number '%s'" % vstring
> ValueError: invalid version number '2.18.50.20080625'

If you want to get around this bug, you could correct the version
parsing: in "d:\programs\python25\lib\distutils\cygwinccompiler.py",
line 424, in get_versions, change

ld_version = StrictVersion(result.group(1))

to

ld_version = StrictVersion(result.group(1).rsplit('.',1)[0])

See the version problem:

>>> from distutils.version import StrictVersion
>>> StrictVersion('2.18.50.20080625')
Traceback (most recent call last):
  File "", line 1, in
    StrictVersion('2.18.50.20080625')
  File "C:\Programs\Python25\lib\distutils\version.py", line 40, in __init__
    self.parse(vstring)
  File "C:\Programs\Python25\lib\distutils\version.py", line 107, in parse
    raise ValueError, "invalid version number '%s'" % vstring
ValueError: invalid version number '2.18.50.20080625'
>>> StrictVersion('2.18.50.20080625'.rsplit('.',1)[0])
StrictVersion('2.18.50.20080625'.rsplit('.',1)[0]) StrictVersion ('2.18.50') But see my other message: try without cygwin. I usually don't have any problems building scipy with mingw. Josef From david at ar.media.kyoto-u.ac.jp Sat Jan 3 01:56:27 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Sat, 03 Jan 2009 15:56:27 +0900 Subject: [Numpy-discussion] building numpy/scipy In-Reply-To: <20090103060841.53FA0C8410D@scipy.org> References: <20090102183005.27311C7C019@scipy.org> <20090103031048.38829C7C009@scipy.org> <5b8d13220901022119v3ef22a3bw3f8aee29caee600b@mail.gmail.com> <20090103060841.53FA0C8410D@scipy.org> Message-ID: <495F0C1B.5020407@ar.media.kyoto-u.ac.jp> Mike Landis wrote: > Cygwin is present, so not just the dumbed down Windows CMD available. You should not use cygwin: if you use cygwin, it will build numpy against the cygwin python, or worse, will be very confused, because you will mix cygwin compilers and mingw compilers. Unless you want to build numpy for cygwin python, you should not try anything from cygwin. Not that it is impossible, but there are many warts to avoid, and it is not worth the pain unless you are a numpy developer. > > I ran the numpy-1.2.1 superpack. Verified that it installed (cause you > don't get near as much output as you do from a shell prompt) by running: > > " > python -c 'import numpy; print numpy.__version__ > ' > " See, it is easier :) > > and got the numpy version number back. Found the site.cfg that the > installer left in the numpy package directory (it picked up on my > ATLAS install, but not on MKL) and copied it into > d:\temp\scipy-0.7.0b1 and ran "python setup.py install" there. This will not work: you need blas/lapack for scipy - the site.cfg in installed numpy refers to a blas/lapack which was used when I built the binary installer - and it is not installed (blas/lapack is statically linked, as dynamically linked libraries are too difficult on windows). Please install from the scipy binary installer instead: http://sourceforge.net/project/showfiles.php?group_id=27747 Trust me, it will take you less time. > Lots of positive looking output, but it ultimately crapped out. > Here's the tail end of that transcript: Same python bug as before: the problem is that for some reason, python checks the version of your toolchain (here binutils), and refuses to build when the version is not the one expected. The easiest fix is to follow Joseph's suggestion. But again, you need a blas/lapack first. David From mlandis001 at comcast.net Sat Jan 3 10:13:55 2009 From: mlandis001 at comcast.net (Mike Landis) Date: Sat, 03 Jan 2009 10:13:55 -0500 Subject: [Numpy-discussion] building numpy/scipy In-Reply-To: <1cd32cbb0901022230x2c56adc7tf6a3de4bd478881e@mail.gmail.com> References: <20090102183005.27311C7C019@scipy.org> <20090103031048.38829C7C009@scipy.org> <5b8d13220901022119v3ef22a3bw3f8aee29caee600b@mail.gmail.com> <20090103060841.53FA0C8410D@scipy.org> <1cd32cbb0901022230x2c56adc7tf6a3de4bd478881e@mail.gmail.com> Message-ID: <20090103151440.EBE93C8410D@scipy.org> An HTML attachment was scrubbed... URL:
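A related run-time workaround, for anyone who would rather not edit the distutils source itself, is to swap distutils' strict version parser for the permissive one before the build starts. This is only a sketch, under the assumption that Python 2.5's cygwinccompiler imports StrictVersion as a module-level name (which is what the traceback above suggests); put it at the top of scipy's setup.py:

import distutils.cygwinccompiler as cygwinccompiler
from distutils.version import LooseVersion

# LooseVersion parses '2.18.50.20080625' without complaint, and it still
# supports the comparisons against plain strings that the compiler classes
# perform later, so rebinding the module-level name is enough.
cygwinccompiler.StrictVersion = LooseVersion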
From josef.pktd at gmail.com Sat Jan 3 10:50:42 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sat, 3 Jan 2009 10:50:42 -0500 Subject: [Numpy-discussion] building numpy/scipy In-Reply-To: <20090103151440.EBE93C8410D@scipy.org> References: <20090102183005.27311C7C019@scipy.org> <20090103031048.38829C7C009@scipy.org> <5b8d13220901022119v3ef22a3bw3f8aee29caee600b@mail.gmail.com> <20090103060841.53FA0C8410D@scipy.org> <1cd32cbb0901022230x2c56adc7tf6a3de4bd478881e@mail.gmail.com> <20090103151440.EBE93C8410D@scipy.org> Message-ID: <1cd32cbb0901030750s6745d840n64cd7c916f5a77d4@mail.gmail.com> On Sat, Jan 3, 2009 at 10:13 AM, Mike Landis wrote: > I do not have cygwin in my windows path, so I guess that's already hidden. > > I patched d:\programs\python25\lib\distutils\cygwinccompiler.py, line 424 > to read > > ld_version = StrictVersion(result.group(1).rsplit('.',1)[0]) > > but I still got crash and a traceback. > > David Cournapeau suggested using the scipy superpack, so i tried > scipy-0.7.0b1-win32-superpack-python2.5.exe. If there were errors I > wouldn't know about them, but running > > 'python -c 'import scipy; print scipy.__version__ ' > > produces a version number (0.7.0.dev5180 ... not exactly the 0.7.0b1 you'd > expect, but not a stack trace either). > > Now all I have to do is find some test cases so I can verify a little deeper > than the version number. > If you have nose installed, you can run the scipy test suite with import scipy scipy.test() Josef From pgmdevlist at gmail.com Sat Jan 3 15:26:14 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Sat, 3 Jan 2009 15:26:14 -0500 Subject: [Numpy-discussion] genloadtxt : ready for inclusion Message-ID: <2FA139CE-ACC2-4632-82EB-90627196AFA6@gmail.com> All, You'll probably remember that last December, I started rewriting np.loadtxt and came up with a series of functions that support missing data. I tried to copy/paste the code in numpy.lib.io.py but ran into dependency problems and left it at that. I think that part of the reason is that the code relies on numpy.ma which can't be loaded when numpy.lib gets loaded. As I needed a way to grant access to the code to anybody, I created a small project on launchpad: you can access it at: https://code.launchpad.net/~pierregm/numpy/numpy_addons The loadtxt reimplementation functions can be found in the numpy.io.fromascii module, their unittests in the corresponding test directory. In addition, you'll find several other functions and their unittests to manipulate arrays w/ flexible data-type. They are basically rewritten versions of some functions in matplotlib.mlab. Would anybody be willing to try inserting the new functions into numpy? I was hoping genfromtxt and its consorts would make it into numpy 1.3.x (I'd need the code for the scikits.timeseries package). As usual, I'd need all the feedback you can share. Thanks a lot in advance. P.
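To make concrete what "supporting missing data" buys here, a toy sketch of the idea (illustrative only, not Pierre's implementation; the real functions handle dtypes, converters and delimiters far more generally):

import numpy.ma as ma
from StringIO import StringIO

def toy_genfromtxt(fh, delimiter=','):
    # Illustrative only: split each row and mask empty fields, which is
    # exactly what np.loadtxt cannot do (it raises on the empty field).
    rows = [line.strip().split(delimiter) for line in fh]
    data = [[float(v) if v else 0.0 for v in row] for row in rows]
    mask = [[v == '' for v in row] for row in rows]
    return ma.array(data, mask=mask)

print toy_genfromtxt(StringIO("1,2,3\n4,,6"))
# prints approximately:
# [[1.0 2.0 3.0]
#  [4.0 -- 6.0]]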
From mlandis001 at comcast.net Sat Jan 3 17:27:30 2009 From: mlandis001 at comcast.net (Mike Landis) Date: Sat, 03 Jan 2009 17:27:30 -0500 Subject: [Numpy-discussion] building numpy/scipy In-Reply-To: <1cd32cbb0901030750s6745d840n64cd7c916f5a77d4@mail.gmail.com> References: <20090102183005.27311C7C019@scipy.org> <20090103031048.38829C7C009@scipy.org> <5b8d13220901022119v3ef22a3bw3f8aee29caee600b@mail.gmail.com> <20090103060841.53FA0C8410D@scipy.org> <1cd32cbb0901022230x2c56adc7tf6a3de4bd478881e@mail.gmail.com> <20090103151440.EBE93C8410D@scipy.org> <1cd32cbb0901030750s6745d840n64cd7c916f5a77d4@mail.gmail.com> Message-ID: <20090103222816.5B76EC7C026@scipy.org> Thanks for the suggestion Joseph - the scipy test suite runs, but it produces lots of errors. Some deprecation warnings in numpy\lib\utils.py (line 110) and scipy\linalg\decomp.py (line 1173). Then it complains about a '_bad_path_' (doesn't exist or not writable). Couldn't remove \appdata\local\temp\tmpc192_pcat_test (directory not empty). Then the crap hits the fan... the following crashes are all due to: test_polyint.CheckKrogh test_derivative crashes on line 38 of scipy\interpolate\tests\test_polyint.py test_derivatives crashes on line 26 test_empty crashes in the same file on line 73 test_hermite crashes in the same file on line 57 test_high_derivative crashes in the same file on line 44 test_lagrange crashes in the same file on line 19 test_low_derivatives crashes in the same file line 32 test_scalar on line 22 test_shapes_1d_vectorvalue on line 95 test_shapes_scalarvalue on line 76 test_shapes_scalarvalue_derivative on line 82 test_shapes_vectorvalue on line 89 test_shapes_vectorvalue_derivative on line 101 test_vector on line 63 test_wrapper on line 108 The following crashes are due to test_polyint.CheckPiecewise test_construction on line 186 test_derivative on line 193 test_derivatives on line 196 test_incremental on line 217 test_scalar on line 189 test_shapes_scalarvalue on line 221 test_shapes_scalarvalue_derivative on line 227 test_shapes_vectorvalue on line 235 test_shapes_vectorvalue_1d on line 242 test_shapes_vectorvalue_derivative on line 248 test_vector on line 205 test_wrapper on line 255 The following crashes are due to test_polyint.CheckTaylor test_exponential on line 116 Failure: AttributeError ('module' object has no attribute 'byteordercodes') nose-0.10.3-py2.5.egg\nose\loader.py on line 364 in loadTestsFromName Failure: ImportError (cannot import name numpyio) nose-0.10.3-py2.5.egg\nose\loader.py on line 364 in loadTestsFromName Failure: ImportError (cannot import name fblas) nose-0.10.3-py2.5.egg\nose\loader.py on line 364 in loadTestsFromName Failure: ImportError (cannot import name flapack) nose-0.10.3-py2.5.egg\nose\loader.py on line 364 in loadTestsFromName Failure: ImportError (cannot import name fblas) AGAIN nose-0.10.3-py2.5.egg\nose\loader.py on line 364 in loadTestsFromName Failure: ImportError (cannot import name flapack) AGAIN nose-0.10.3-py2.5.egg\nose\loader.py on line 364 in loadTestsFromName Failure: NameError (name 'pilutil' is not defined) nose-0.10.3-py2.5.egg\nose\loader.py on line 364 in loadTestsFromName Failure: ImportError (cannot import name cobyla) nose-0.10.3-py2.5.egg\nose\loader.py on line 364 in loadTestsFromName Failure: ImportError (cannot import name nonlin) nose-0.10.3-py2.5.egg\nose\loader.py on line 364 in loadTestsFromName Failure: ImportError (cannot import name zeros) nose-0.10.3-py2.5.egg\nose\loader.py on line 364 in loadTestsFromName Failure: NameError (name 'pilutil' is
not defined) nose-0.10.3-py2.5.egg\nose\loader.py on line 364 in loadTestsFromName Failure: ImportError (cannot import name linsolve) nose-0.10.3-py2.5.egg\nose\loader.py on line 364 in loadTestsFromName Failure: AttributeError ('module' object has no attribute '_cephes') nose-0.10.3-py2.5.egg\nose\loader.py on line 364 in loadTestsFromName Failure: NameError (name 'pilutil' is not defined) nose-0.10.3-py2.5.egg\nose\loader.py on line 364 in loadTestsFromName Failure: AttributeError ('module' object has no attribute 'convolve') nose-0.10.3-py2.5.egg\nose\loader.py on line 364 in loadTestsFromName That was with nose-0.10.3-py2.5. I upgraded to the latest nose-0.10.4-py2.5, but it still produced an armload of (what look like the same) errors. Does this symptomology point to anything (configuration error, package out of date, ???) At 10:50 AM 1/3/2009, you wrote: >On Sat, Jan 3, 2009 at 10:13 AM, Mike Landis wrote: > > I do not have cygwin in my windows path, so I guess that's already hidden. > > > > I patched d:\programs\python25\lib\distutils\cygwinccompiler.py, line 424 > > to read > > > > ld_version = StrictVersion(result.group(1).rsplit('.',1)[0]) > > > > but I still got crash and a traceback. > > > > David Cournapeau suggested using the scipy superpack, so i tried > > scipy-0.7.0b1-win32-superpack-python2.5.exe. If there were errors I > > wouldn't know about them, but running > > > > 'python -c 'import scipy; print scipy.__version__ ' > > > > produces a version number (0.7.0.dev5180 ... not exactly the 0.7.0b1 you'd > > expect, but not a stack trace either). > > > > Now all I have to do is find some test cases so I can verify a little deeper > > than the version number. > > > >If you have nose installed, you can run the scipy test suite with > >import scipy >scipy.test() > >Josef >_______________________________________________ >Numpy-discussion mailing list >Numpy-discussion at scipy.org >http://projects.scipy.org/mailman/listinfo/numpy-discussion > >No virus found in this incoming message. >Checked by AVG - http://www.avg.com >Version: 8.0.176 / Virus Database: 270.10.2/1872 - Release Date: >1/2/2009 1:10 PM From mlandis001 at comcast.net Sat Jan 3 21:56:39 2009 From: mlandis001 at comcast.net (Mike Landis) Date: Sat, 03 Jan 2009 21:56:39 -0500 Subject: [Numpy-discussion] debugging scipy install Message-ID: <20090104025726.093E0C7C026@scipy.org> Running "python -c 'import scipy; scipy.test()'" and finding one of the earliest errors referencing a bad_path, I found that scipy's INSTALL.txt, paragraph 5 under TROUBLESHOOTING, says to "cd scipy/Lib/linalg" so you can run: 'python setup_atlas_version.py build_ext --inplace --force' [ setup_atlas_version.py is actually in scipy/linalg (at least on Windows) ] ... but that shell command bombs out with the following Traceback File "setup_atlas_version.py", line 7, in <module> from numpy.distutils.misc_util import get_path, default_config_dict ImportError: cannot import name get_path Seems like getting the path could be the source of a '_bad_path_' problem. Does anyone know where get_path is defined, and if so, where the file is downloadable from? After taking the superpack install routes with both numpy and scipy, shouldn't these issues have been taken care of already?
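A quick way to check whether a name such as get_path still exists in the installed numpy.distutils, and which installation the module is actually loaded from (an illustrative snippet, not a fix):

import numpy.distutils.misc_util as misc_util
print misc_util.__file__            # confirms which installation is in use
print 'get_path' in dir(misc_util)  # False would explain the ImportError above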
From josef.pktd at gmail.com Sat Jan 3 22:40:28 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sat, 3 Jan 2009 22:40:28 -0500 Subject: [Numpy-discussion] building numpy/scipy In-Reply-To: <20090103222816.5B76EC7C026@scipy.org> References: <20090102183005.27311C7C019@scipy.org> <20090103031048.38829C7C009@scipy.org> <5b8d13220901022119v3ef22a3bw3f8aee29caee600b@mail.gmail.com> <20090103060841.53FA0C8410D@scipy.org> <1cd32cbb0901022230x2c56adc7tf6a3de4bd478881e@mail.gmail.com> <20090103151440.EBE93C8410D@scipy.org> <1cd32cbb0901030750s6745d840n64cd7c916f5a77d4@mail.gmail.com> <20090103222816.5B76EC7C026@scipy.org> Message-ID: <1cd32cbb0901031940x565d925eo1e4037ebfe7caa5b@mail.gmail.com> Make sure that when you import scipy that you get the correct version. >>> import scipy >>> scipy.__file__ 'C:\\Programs\\Python25\\lib\\site-packages\\scipy\\__init__.pyc' From your error messages, I would think python is loading the source distribution and not the compiled and installed version. It would be helpful to see your actual error messages from nose, with copy and paste, at least the first few and last parts of the nose tests. Your summary error message is not very helpful because it doesn't show your actual error path and trace backs. When I installed the 0.7.0 b1 superpack on WindowsXP, it worked out of the box. The only thing to do, before installing a new version of numpy or scipy, is to uninstall or delete any old version in site-packages, since the directory names of scipy and numpy do not include version numbers. Installing on top of an old version can leave some old files around which sometimes cause errors. Josef From mlandis001 at comcast.net Sat Jan 3 23:20:40 2009 From: mlandis001 at comcast.net (Mike Landis) Date: Sat, 03 Jan 2009 23:20:40 -0500 Subject: [Numpy-discussion] building numpy/scipy In-Reply-To: <1cd32cbb0901031940x565d925eo1e4037ebfe7caa5b@mail.gmail.com> References: <20090102183005.27311C7C019@scipy.org> <20090103031048.38829C7C009@scipy.org> <5b8d13220901022119v3ef22a3bw3f8aee29caee600b@mail.gmail.com> <20090103060841.53FA0C8410D@scipy.org> <1cd32cbb0901022230x2c56adc7tf6a3de4bd478881e@mail.gmail.com> <20090103151440.EBE93C8410D@scipy.org> <1cd32cbb0901030750s6745d840n64cd7c916f5a77d4@mail.gmail.com> <20090103222816.5B76EC7C026@scipy.org> <1cd32cbb0901031940x565d925eo1e4037ebfe7caa5b@mail.gmail.com> Message-ID: <20090104042129.8D284C7C026@scipy.org> Josef (sorry about spelling your name wrong in a previous post), Thanks for the continued suggestions. I deleted the site-packages numpy and scipy, and reinstalled each using the current release superpacks (numpy first, then scipy). Then I ran: python -c 'import numpy; numpy.test()' and got: Running unit tests for numpy NumPy version 1.2.1 NumPy is installed in d:\Programs\Python25\lib\site-packages\numpy Python version 2.5.2 (r252:60911, Mar 27 2008, 17:57:18) [MSC v.1310 32 bit (Intel)] nose version 0.10.4 Traceback (most recent call last): File "<string>", line 1, in <module> File "d:\Programs\Python25\lib\site-packages\numpy\testing\nosetester.py", line 264, in test import doctest File "d:\Programs\Python25\lib\doctest.py", line 99, in <module> import unittest, difflib, pdb, tempfile File "d:\programs\python25\lib\tempfile.py", line 33, in <module> from random import Random as _Random ImportError: cannot import name Random scipy.test() bombs out with the exact same Traceback, except that it mentions the scipy version (0.7.0.dev5180) and scipy install directory just before it echoes the line with the python version number.
So, it's not getting as far as it was with the source mixed in. At 10:40 PM 1/3/2009, you wrote: >Make sure that when you import scipy that you get the correct version. > > >>> import scipy > >>> scipy.__file__ > 'C:\\Programs\\Python25\\lib\\site-packages\\scipy\\__init__.pyc' > > From your error messages, I would think python is loading the source >distribution and not the compiled and installed version. It would be >helpful to see your actual error messages from nose, with copy and >paste, at least the first few and last parts of the nose tests. Your >summary error message is not very helpful because it doesn't show your >actual error path and trace backs. > >When I installed the 0.7.0 b1 superpack on WindowsXP, it worked out of >the box. The only thing to do, before installing a new version of >numpy or scipy, is to uninstall or delete any old version in >site-packages, since the directory names of scipy and numpy do not >include version numbers. Installing on top of an old version can >leave some old files around which sometimes cause errors. > > >Josef >_______________________________________________ >Numpy-discussion mailing list >Numpy-discussion at scipy.org >http://projects.scipy.org/mailman/listinfo/numpy-discussion From robert.kern at gmail.com Sat Jan 3 23:29:26 2009 From: robert.kern at gmail.com (Robert Kern) Date: Sat, 3 Jan 2009 22:29:26 -0600 Subject: [Numpy-discussion] building numpy/scipy In-Reply-To: <20090104042129.8D284C7C026@scipy.org> References: <20090102183005.27311C7C019@scipy.org> <20090103031048.38829C7C009@scipy.org> <5b8d13220901022119v3ef22a3bw3f8aee29caee600b@mail.gmail.com> <20090103060841.53FA0C8410D@scipy.org> <1cd32cbb0901022230x2c56adc7tf6a3de4bd478881e@mail.gmail.com> <20090103151440.EBE93C8410D@scipy.org> <1cd32cbb0901030750s6745d840n64cd7c916f5a77d4@mail.gmail.com> <20090103222816.5B76EC7C026@scipy.org> <1cd32cbb0901031940x565d925eo1e4037ebfe7caa5b@mail.gmail.com> <20090104042129.8D284C7C026@scipy.org> Message-ID: <3d375d730901032029y7a2b2820m908ce2748384663e@mail.gmail.com> On Sat, Jan 3, 2009 at 22:20, Mike Landis wrote: > Josef (sorry about spelling your name wrong in a previous post), > > Thanks for the continued suggestions. I deleted the site-packages > numpy and scipy, and reinstalled each using the current release > superpacks (numpy first, then scipy). > > Then I ran: > > python -c 'import numpy; numpy.test()' > > and got: > > Running unit tests for numpy > NumPy version 1.2.1 > NumPy is installed in d:\Programs\Python25\lib\site-packages\numpy > Python version 2.5.2 (r252:60911, Mar 27 2008, 17:57:18) [MSC v.1310 > 32 bit (Intel)] > nose version 0.10.4 > Traceback (most recent call last): > File "<string>", line 1, in <module> > File > "d:\Programs\Python25\lib\site-packages\numpy\testing\nosetester.py", > line 264, in test > import doctest > File "d:\Programs\Python25\lib\doctest.py", line 99, in <module> > import unittest, difflib, pdb, tempfile > File "d:\programs\python25\lib\tempfile.py", line 33, in <module> > from random import Random as _Random > ImportError: cannot import name Random What directory are you in? I'm guessing that you are in a numpy/ directory either in the source tree or under site-packages/. Change out of that directory. Python looks for modules in the current directory before the standard locations, so it's picking up the numpy.random subpackage instead of the standard library module random.
-- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From mlandis001 at comcast.net Sat Jan 3 23:38:17 2009 From: mlandis001 at comcast.net (Mike Landis) Date: Sat, 03 Jan 2009 23:38:17 -0500 Subject: [Numpy-discussion] building numpy/scipy In-Reply-To: <1cd32cbb0901031940x565d925eo1e4037ebfe7caa5b@mail.gmail.com> References: <20090102183005.27311C7C019@scipy.org> <20090103031048.38829C7C009@scipy.org> <5b8d13220901022119v3ef22a3bw3f8aee29caee600b@mail.gmail.com> <20090103060841.53FA0C8410D@scipy.org> <1cd32cbb0901022230x2c56adc7tf6a3de4bd478881e@mail.gmail.com> <20090103151440.EBE93C8410D@scipy.org> <1cd32cbb0901030750s6745d840n64cd7c916f5a77d4@mail.gmail.com> <20090103222816.5B76EC7C026@scipy.org> <1cd32cbb0901031940x565d925eo1e4037ebfe7caa5b@mail.gmail.com> Message-ID: <20090104043904.7CEFCC7C029@scipy.org> Maybe the following will also be useful... Recall that I completely deleted numpy and scipy and reinstalled each from their respective superpacks, then ran: >>> import numpy; numpy.__file__ 'D:\\Programs\\Python25\\lib\\site-packages\\numpy\\__init__.pyc' >>> import scipy; scipy.__file__ 'D:\\Programs\\Python25\\lib\\site-packages\\scipy\\__init__.pyc' $ cd site-packages\numpy; python setup.py config This is the wrong setup.py file to run $ cd site-packages\scipy; python setup.py config non-existing path in 'cluster': 'src\\vq_module.c' non-existing path in 'cluster': 'src\\vq.c' non-existing path in 'cluster': 'src\\hierarchy_wrap.c' non-existing path in 'cluster': 'src\\hierarchy.c' Appending scipy.cluster configuration to scipy Ignoring attempt to set 'name' (from 'scipy' to 'scipy.cluster') Warning: No configuration returned, assuming unavailable. Appending scipy.constants configuration to scipy Ignoring attempt to set 'name' (from 'scipy' to 'scipy.constants') could not resolve pattern in 'fftpack': 'src/dfftpack\\*.f' non-existing path in 'fftpack': 'fftpack.pyf' non-existing path in 'fftpack': 'src/zfft.c' non-existing path in 'fftpack': 'src/drfft.c' non-existing path in 'fftpack': 'src/zrfft.c' non-existing path in 'fftpack': 'src/zfftnd.c' non-existing path in 'fftpack': 'src/zfft_fftpack.c' non-existing path in 'fftpack': 'src/drfft_fftpack.c' non-existing path in 'fftpack': 'src/zfftnd_fftpack.c' non-existing path in 'fftpack': 'convolve.pyf' non-existing path in 'fftpack': 'src/convolve.c' Appending scipy.fftpack configuration to scipy Ignoring attempt to set 'name' (from 'scipy' to 'scipy.fftpack') d:\Programs\Python25\lib\site-packages\numpy\distutils\system_info.py:1340: UserWarning: Atlas (http://math-atlas.sourceforge.net/) libraries not found. Directories to search for the libraries can be specified in the numpy/distutils/site.cfg file (section [atlas]) or by setting the ATLAS environment variable. warnings.warn(AtlasNotFoundError.__doc__) d:\Programs\Python25\lib\site-packages\numpy\distutils\system_info.py:1349: UserWarning: Blas (http://www.netlib.org/blas/) libraries not found. Directories to search for the libraries can be specified in the numpy/distutils/site.cfg file (section [blas]) or by setting the BLAS environment variable. warnings.warn(BlasNotFoundError.__doc__) d:\Programs\Python25\lib\site-packages\numpy\distutils\system_info.py:1352: UserWarning: Blas (http://www.netlib.org/blas/) sources not found.
Directories to search for the sources can be specified in the numpy/distutils/site.cfg file (section [blas_src]) or by setting the BLAS_SRC environment variable. warnings.warn(BlasSrcNotFoundError.__doc__) Traceback (most recent call last): File "setup.py", line 32, in <module> setup(**configuration(top_path='').todict()) File "setup.py", line 8, in configuration config.add_subpackage('integrate') File "d:\Programs\Python25\lib\site-packages\numpy\distutils\misc_util.py", line 851, in add_subpackage caller_level = 2) File "d:\Programs\Python25\lib\site-packages\numpy\distutils\misc_util.py", line 834, in get_subpackage caller_level = caller_level + 1) File "d:\Programs\Python25\lib\site-packages\numpy\distutils\misc_util.py", line 781, in _get_configuration_from_setup_py config = setup_module.configuration(*args) File "D:\Programs\Python25\Lib\site-packages\scipy\integrate\setup.py", line 10, in configuration blas_opt = get_info('blas_opt',notfound_action=2) File "d:\Programs\Python25\lib\site-packages\numpy\distutils\system_info.py", line 267, in get_info return cl().get_info(notfound_action) File "d:\Programs\Python25\lib\site-packages\numpy\distutils\system_info.py", line 416, in get_info raise self.notfounderror,self.notfounderror.__doc__ numpy.distutils.system_info.BlasNotFoundError: Blas (http://www.netlib.org/blas/) libraries not found. Directories to search for the libraries can be specified in the numpy/distutils/site.cfg file (section [blas]) or by setting the BLAS environment variable. At 10:40 PM 1/3/2009, you wrote: >Make sure that when you import scipy that you get the correct version. > > >>> import scipy > >>> scipy.__file__ > 'C:\\Programs\\Python25\\lib\\site-packages\\scipy\\__init__.pyc' > > From your error messages, I would think python is loading the source >distribution and not the compiled and installed version. It would be >helpful to see your actual error messages from nose, with copy and >paste, at least the first few and last parts of the nose tests. Your >summary error message is not very helpful because it doesn't show your >actual error path and trace backs. > >When I installed the 0.7.0 b1 superpack on WindowsXP, it worked out of >the box. The only thing to do, before installing a new version of >numpy or scipy, is to uninstall or delete any old version in >site-packages, since the directory names of scipy and numpy do not >include version numbers. Installing on top of an old version can >leave some old files around which sometimes cause errors.
> > >Josef >_______________________________________________ >Numpy-discussion mailing list >Numpy-discussion at scipy.org >http://projects.scipy.org/mailman/listinfo/numpy-discussion From robert.kern at gmail.com Sat Jan 3 23:42:52 2009 From: robert.kern at gmail.com (Robert Kern) Date: Sat, 3 Jan 2009 22:42:52 -0600 Subject: [Numpy-discussion] building numpy/scipy In-Reply-To: <20090104043904.7CEFCC7C029@scipy.org> References: <20090102183005.27311C7C019@scipy.org> <20090103031048.38829C7C009@scipy.org> <5b8d13220901022119v3ef22a3bw3f8aee29caee600b@mail.gmail.com> <20090103060841.53FA0C8410D@scipy.org> <1cd32cbb0901022230x2c56adc7tf6a3de4bd478881e@mail.gmail.com> <20090103151440.EBE93C8410D@scipy.org> <1cd32cbb0901030750s6745d840n64cd7c916f5a77d4@mail.gmail.com> <20090103222816.5B76EC7C026@scipy.org> <1cd32cbb0901031940x565d925eo1e4037ebfe7caa5b@mail.gmail.com> <20090104043904.7CEFCC7C029@scipy.org> Message-ID: <3d375d730901032042t4b123fcbu221f254c8798ad54@mail.gmail.com> On Sat, Jan 3, 2009 at 22:38, Mike Landis wrote: > Maybe the following will also be useful... Recall that I completely > deleted numpy and scipy and reinstalled each from their respective > superpacks, then ran: > > >>> import numpy; numpy.__file__ > 'D:\\Programs\\Python25\\lib\\site-packages\\numpy\\__init__.pyc' > > >>> import scipy; scipy.__file__ > 'D:\\Programs\\Python25\\lib\\site-packages\\scipy\\__init__.pyc' > > $ cd site-packages\numpy; python setup.py config > This is the wrong setup.py file to run That message is correct. Don't do that. You don't run the setup.py scripts on installed binaries. > $ cd site-packages\scipy; python setup.py config Don't do that either. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From josef.pktd at gmail.com Sat Jan 3 23:52:46 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sat, 3 Jan 2009 23:52:46 -0500 Subject: [Numpy-discussion] building numpy/scipy In-Reply-To: <3d375d730901032042t4b123fcbu221f254c8798ad54@mail.gmail.com> References: <20090102183005.27311C7C019@scipy.org> <5b8d13220901022119v3ef22a3bw3f8aee29caee600b@mail.gmail.com> <20090103060841.53FA0C8410D@scipy.org> <1cd32cbb0901022230x2c56adc7tf6a3de4bd478881e@mail.gmail.com> <20090103151440.EBE93C8410D@scipy.org> <1cd32cbb0901030750s6745d840n64cd7c916f5a77d4@mail.gmail.com> <20090103222816.5B76EC7C026@scipy.org> <1cd32cbb0901031940x565d925eo1e4037ebfe7caa5b@mail.gmail.com> <20090104043904.7CEFCC7C029@scipy.org> <3d375d730901032042t4b123fcbu221f254c8798ad54@mail.gmail.com> Message-ID: <1cd32cbb0901032052v73067cd9la64dcea8f53ed562@mail.gmail.com> On Sat, Jan 3, 2009 at 11:42 PM, Robert Kern wrote: > On Sat, Jan 3, 2009 at 22:38, Mike Landis wrote: >> Maybe the following will also be useful... Recall that I completely >> deleted numpy and scipy and reinstalled each from their respective >> superpacks, then ran: >> >> >>> import numpy; numpy.__file__ >> 'D:\\Programs\\Python25\\lib\\site-packages\\numpy\\__init__.pyc' >> >> >>> import scipy; scipy.__file__ >> 'D:\\Programs\\Python25\\lib\\site-packages\\scipy\\__init__.pyc' >> >> $ cd site-packages\numpy; python setup.py config >> This is the wrong setup.py file to run > > That message is correct. Don't do that. You don't run the setup.py > scripts on installed binaries. > >> $ cd site-packages\scipy; python setup.py config > > Don't do that either. 
> > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." > -- Umberto Eco > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > Mike, don't go into site-packages at all, except for browsing and looking at the source for information. The basic steps: run the installer; go to a working directory that is not under the python directory in Programs (and without any scipy or numpy source in it); then start python or idle there, import numpy and scipy, and test. That's it. Unless there is a reason for you to change the numpy or scipy source, there is no reason for you to touch any of the config/compile/build steps. They are for later, when you need additional packages that don't have an installer. To get up and running, I recommend just following the basic steps for a user. Josef From cournape at gmail.com Sat Jan 3 23:55:47 2009 From: cournape at gmail.com (David Cournapeau) Date: Sun, 4 Jan 2009 13:55:47 +0900 Subject: [Numpy-discussion] building numpy/scipy In-Reply-To: <20090104043904.7CEFCC7C029@scipy.org> References: <20090102183005.27311C7C019@scipy.org> <20090103031048.38829C7C009@scipy.org> <5b8d13220901022119v3ef22a3bw3f8aee29caee600b@mail.gmail.com> <20090103060841.53FA0C8410D@scipy.org> <1cd32cbb0901022230x2c56adc7tf6a3de4bd478881e@mail.gmail.com> <20090103151440.EBE93C8410D@scipy.org> <1cd32cbb0901030750s6745d840n64cd7c916f5a77d4@mail.gmail.com> <20090103222816.5B76EC7C026@scipy.org> <1cd32cbb0901031940x565d925eo1e4037ebfe7caa5b@mail.gmail.com> <20090104043904.7CEFCC7C029@scipy.org> Message-ID: <5b8d13220901032055tfe9839w3386d79659ffc852@mail.gmail.com> On Sun, Jan 4, 2009 at 1:38 PM, Mike Landis wrote: > Maybe the following will also be useful... Recall that I completely > deleted numpy and scipy and reinstalled each from their respective > superpacks, then ran: > > >>> import numpy; numpy.__file__ > 'D:\\Programs\\Python25\\lib\\site-packages\\numpy\\__init__.pyc' > At this stage, you're *done*. Everything is installed, you don't have to do anything anymore. I am sorry if the following is obvious, but that's the only explanation I can think of: there are two ways to install open source software - from source, or from binary installers. By using the superpack, you are using the latter - using setup.py implies the former. So what you end up doing is trying to build the software from the binary - which does not make sense. It may not look like it at this point, but you are making things much more complicated than they really are :) After the superpack executes, you have run the installers successfully, so everything is installed, with no further steps to follow.
cheers, David From mlandis001 at comcast.net Sun Jan 4 00:15:18 2009 From: mlandis001 at comcast.net (Mike Landis) Date: Sun, 04 Jan 2009 00:15:18 -0500 Subject: [Numpy-discussion] building numpy/scipy In-Reply-To: <3d375d730901032029y7a2b2820m908ce2748384663e@mail.gmail.com> References: <20090102183005.27311C7C019@scipy.org> <20090103031048.38829C7C009@scipy.org> <5b8d13220901022119v3ef22a3bw3f8aee29caee600b@mail.gmail.com> <20090103060841.53FA0C8410D@scipy.org> <1cd32cbb0901022230x2c56adc7tf6a3de4bd478881e@mail.gmail.com> <20090103151440.EBE93C8410D@scipy.org> <1cd32cbb0901030750s6745d840n64cd7c916f5a77d4@mail.gmail.com> <20090103222816.5B76EC7C026@scipy.org> <1cd32cbb0901031940x565d925eo1e4037ebfe7caa5b@mail.gmail.com> <20090104042129.8D284C7C026@scipy.org> <3d375d730901032029y7a2b2820m908ce2748384663e@mail.gmail.com> Message-ID: <20090104051606.758A9C84112@scipy.org> I cd'd out of numpy and site-packages and re-ran the package tests. Both numpy.test() and scipy.test() ran without serious errors. Some DeprecationWarnings and integrals that are probably divergent or slowly convergent... It's looking much more promising. Two gotchas on top of each other - not deleting the previously existing source, and running scripts from within the site-packages directory. Whew! numpy/scipy was looking scarily unstable until discovering my install and pilot errors. Thanks for all of the help ... From david at ar.media.kyoto-u.ac.jp Sun Jan 4 02:15:45 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Sun, 04 Jan 2009 16:15:45 +0900 Subject: [Numpy-discussion] building numpy/scipy In-Reply-To: <20090104051606.758A9C84112@scipy.org> References: <20090102183005.27311C7C019@scipy.org> <20090103031048.38829C7C009@scipy.org> <5b8d13220901022119v3ef22a3bw3f8aee29caee600b@mail.gmail.com> <20090103060841.53FA0C8410D@scipy.org> <1cd32cbb0901022230x2c56adc7tf6a3de4bd478881e@mail.gmail.com> <20090103151440.EBE93C8410D@scipy.org> <1cd32cbb0901030750s6745d840n64cd7c916f5a77d4@mail.gmail.com> <20090103222816.5B76EC7C026@scipy.org> <1cd32cbb0901031940x565d925eo1e4037ebfe7caa5b@mail.gmail.com> <20090104042129.8D284C7C026@scipy.org> <3d375d730901032029y7a2b2820m908ce2748384663e@mail.gmail.com> <20090104051606.758A9C84112@scipy.org> Message-ID: <49606221.8080101@ar.media.kyoto-u.ac.jp> Mike Landis wrote: > I cd'd out of numpy and site-packages and re-ran the package > tests. both numpy.test() and scipy.test() ran without serious > errors. Some DeprecationWarnings and integrals that are probably > divergent or slowly convergent... > > It's looking much more promising. Two gotchas on top of each other - > not deleting the previously existing source, and running scripts from > within the site-packages directory. > > Whew! numpy/scipy was looking scarily unstable until discovering my > install and pilot errors. > Glad it is working for you. We are sorry about the installation issues: we know things could be better, and we hope to improve things on that front - they have improved quite a bit already. cheers, David From pgmdevlist at gmail.com Sun Jan 4 16:44:33 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Sun, 4 Jan 2009 16:44:33 -0500 Subject: [Numpy-discussion] unique1d and asarray Message-ID: All, Currently, np.unique1d uses np.asarray to ensure the input is an array. The problem is that np.asarray transforms a MaskedArray into a regular ndarray: the missing information is lost and the result is not correct.
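An illustrative session showing what happens (the exact masked-array repr varies between numpy versions):

>>> import numpy as np
>>> import numpy.ma as ma
>>> x = ma.array([1, 2, 2, 3], mask=[0, 1, 0, 0])
>>> np.asarray(x)            # the mask is silently dropped
array([1, 2, 2, 3])
>>> type(np.asanyarray(x))   # the subclass (and its mask) survives
<class 'numpy.ma.core.MaskedArray'>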
If we used np.asanyarray instead, subclasses are recognized properly, the mask is recognized by argsort and the result is correct. Is there a reason why we use np.asarray instead of np.asanyarray? Thanks a lot in advance, P. From robert.kern at gmail.com Sun Jan 4 16:47:37 2009 From: robert.kern at gmail.com (Robert Kern) Date: Sun, 4 Jan 2009 15:47:37 -0600 Subject: [Numpy-discussion] unique1d and asarray In-Reply-To: References: Message-ID: <3d375d730901041347j72f4dba2p36dd3a03ccade2ae@mail.gmail.com> On Sun, Jan 4, 2009 at 15:44, Pierre GM wrote: > All, > Currently, np.unique1d uses np.asarray to ensure the input is an > array. The problem is that np.asarray transforms a MaskedArray into a > regular ndarray, the missing information is lost and the result is not > correct. > If we used np.asanyarray instead, subclasses are recognized properly, > the mask is recognized by argsort and the result is correct. > Is there a reason why we use np.asarray instead of np.asanyarray? Probably not. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From pgmdevlist at gmail.com Sun Jan 4 16:51:48 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Sun, 4 Jan 2009 16:51:48 -0500 Subject: [Numpy-discussion] unique1d and asarray In-Reply-To: <3d375d730901041347j72f4dba2p36dd3a03ccade2ae@mail.gmail.com> References: <3d375d730901041347j72f4dba2p36dd3a03ccade2ae@mail.gmail.com> Message-ID: <2983337C-89AE-4FE6-8DAB-A68DA04F0380@gmail.com> On Jan 4, 2009, at 4:47 PM, Robert Kern wrote: > On Sun, Jan 4, 2009 at 15:44, Pierre GM wrote: >> >> If we used np.asanyarray instead, subclasses are recognized properly, >> the mask is recognized by argsort and the result is correct. >> Is there a reason why we use np.asarray instead of np.asanyarray? > > Probably not. So there wouldn't be any objections to making the switch? We can wait a couple of days if anybody has a problem with that... From ghzhao6917 at yahoo.com.cn Sun Jan 4 20:08:21 2009 From: ghzhao6917 at yahoo.com.cn (gh zhao) Date: Mon, 5 Jan 2009 09:08:21 +0800 (CST) Subject: [Numpy-discussion] cobyla, f2py Message-ID: <552189.73120.qm@web15605.mail.cnb.yahoo.com> Just try f2py according to http://www.scipy.org/F2PY_Windows. The following works in a DOS cmd window: E:\Miser3p2F\opcntrl>python E:\python25\scripts\f2py.py -c --fcompiler=gnu95 --compiler=mingw32 -lmsvcr71 -m cobyla cobyla.pyf cobyla2.f trstlp.f But when I directly run run_compile() of f2py2e with sys.argv=sys.argv+['-c','--fcompiler=gnu95','--compiler=mingw32', '-lmsvcr71','-m', 'cobyla','E:\\Miser3p2F\opcntrl\\cobyla.pyf','E:\\Miser3p2F\opcntrl\\cobyla2.f','E:\\Miser3p2F\opcntrl\\trstlp.f'], it does not work. I have tried several simple examples; they all work. I do not know why. ghzhao from Curtin -------------- next part -------------- An HTML attachment was scrubbed... URL:
From nadavh at visionsense.com Mon Jan 5 02:27:10 2009 From: nadavh at visionsense.com (Nadav Horesh) Date: Mon, 5 Jan 2009 09:27:10 +0200 Subject: [Numpy-discussion] Zoom fft code Message-ID: <710F2847B0018641891D9A216027636029C39B@ex3.envision.co.il> I am looking for a zoom fft code. I found an old code by Paule Kinzle (a matlab code with a translation to numarray), but its 2D extension (czt1.py) looks buggy. Nadav. From stefan at sun.ac.za Mon Jan 5 03:25:57 2009 From: stefan at sun.ac.za (Stéfan van der Walt) Date: Mon, 5 Jan 2009 10:25:57 +0200 Subject: [Numpy-discussion] Zoom fft code In-Reply-To: <710F2847B0018641891D9A216027636029C39B@ex3.envision.co.il> References: <710F2847B0018641891D9A216027636029C39B@ex3.envision.co.il> Message-ID: <9457e7c80901050025j520ce5dds26d914d63e332e76@mail.gmail.com> Hi Nadav I recall that you posted an implementation yourself a while ago! http://www.mail-archive.com/numpy-discussion at scipy.org/msg01812.html Regards Stéfan 2009/1/5 Nadav Horesh : > > I am looking for a zoom fft code. I found an old code by Paule Kinzle (a matlab code with a translation to numarray), but its 2D extension (czt1.py) looks buggy. > > Nadav. From ezindy at gmail.com Mon Jan 5 04:42:31 2009 From: ezindy at gmail.com (Egor Zindy) Date: Mon, 5 Jan 2009 18:42:31 +0900 Subject: [Numpy-discussion] help with typemapping a C function to use numpy arrays In-Reply-To: References: Message-ID: Hello Rich, sorry it took so long to answer back, holidays and all :-) That's exactly the kind of SWIG / numpy.i problem I've been working on over the past few months: how to generate an array you don't know the size of a-priori, and then handle the memory deallocation seamlessly. In your case, you know that the output array will be half the size of the input array, but this falls under the more general case of "not knowing the output size a-priori". Have a look at the files attached. I've rewritten your function header as: void sms_spectrumMag( int sizeInMag, float *pInRect, int *sizeOutMag, float **pOutMag); Easy to see what the input and output arrays are now. Then my numpy.i handles the memory deallocation of the **pOutMag array. I've actually moved my numpy.i explanations to the scipy/numpy cookbook last week :-) http://www.scipy.org/Cookbook/SWIG_Memory_Deallocation Hope it all makes sense. If you have any questions, don't hesitate!
\brief compute magnitude spectrum of a DFT > * > * \param sizeMag size of output Magnitude (half of input real > FFT) > * \param pFReal pointer to input FFT real array > (real/imag floats) > * \param pFMAg pointer to float array of magnitude spectrum > */ > void sms_spectrumMag( int sizeMag, float *pInRect, float *pOutMag) > { > int i, it2; > float fReal, fImag; > > for (i=0; i { > it2 = i << 1; > fReal = pInRect[it2]; > fImag = pInRect[it2+1]; > pOutMag[i] = sqrtf(fReal * fReal + fImag * fImag); > } > } > > There are two arrays, one is half the size of the other. But, SWIG > doesn't know this, according to the type map it will think *pInRect is > of size sizeMag and will not know anything about *pOutMag. > > Ideally in python, I would like to call the function as > sms_spectrumMag(nArray1, nArray2), where nArray1 is twice the size of > nArray2, and nArray2 is of size sizeMag. > > I think in order to do this (although if someone has a better > suggestion, I am open to it), I will have to modify the typemap in > order to tell SWIG how to call the C function properly. I do not want > to have to edit the wrapped C file every time it is regenerated from > the interface file. > > > Here is a start I made with the existing typemap code in numpy.i (not > working): > > /* Typemap suite for (DIM_TYPE DIM1, DATA_TYPE* INPLACE_ARRAY1) > */ > %typecheck(SWIG_TYPECHECK_DOUBLE_ARRAY, > fragment="NumPy_Macros") > (DIM_TYPE DIM1, DATA_TYPE* INPLACE_ARRAY1) > { > $1 = is_array($input) && PyArray_EquivTypenums(array_type($input), > DATA_TYPECODE); > } > %typemap(in, > fragment="NumPy_Fragments") > (DIM_TYPE DIM1, DATA_TYPE* INPLACE_ARRAY1) > (PyArrayObject* array=NULL, int i=0) > { > array = obj_to_array_no_conversion($input, DATA_TYPECODE); > if (!array || !require_dimensions(array,1) || !require_contiguous(array) > || !require_native(array)) SWIG_fail; > $1 = 1; > for (i=0; i < array_numdims(array); ++i) $1 *= array_size(array,i); > $2 = (DATA_TYPE*) array_data(array); > } > > and try to alter it to allow for a conversion of type: > (DIM_TYPE DIM1, DATA_TYPE* ARRAY1, DATA_TYPE* ARRAY2) > where ARRAY1 is size DIM1 * 2 and ARRAY2 is size DIM1. Then I can > %apply this to my function that I mentioned in the last post. > > So here are my first two questions: > > 1) where is DIM1 used to declare the array size? I don't see where it > is used at all, and I need to somewhere multiply it by 2 to declare > the size of ARRAY1 > > 2) I am not understanding where $input comes from, so I do not > understand how to distinguish between ARRAY1 and ARRAY2. In the > attempt I have already tried, I think I just use the pointer to ARRAY1 > twice. > > If anyone has suggestions on how to solve this problem, thanks! > > regards, > Rich > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: dftmagnitude.zip Type: application/zip Size: 10857 bytes Desc: not available URL: From ndbecker2 at gmail.com Mon Jan 5 08:34:51 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Mon, 05 Jan 2009 08:34:51 -0500 Subject: [Numpy-discussion] Zoom fft code References: <710F2847B0018641891D9A216027636029C39B@ex3.envision.co.il> <9457e7c80901050025j520ce5dds26d914d63e332e76@mail.gmail.com> Message-ID: Stéfan van der Walt wrote: > Hi Nadav > > I recall that you posted an implementation yourself a while ago! > > http://www.mail-archive.com/numpy-discussion at scipy.org/msg01812.html > > Regards > St?fan > > 2009/1/5 Nadav Horesh : >> >> I am looking for a zoom fft code. I found an old code by Paule Kinzle (a >> matlab code with a translation to numarray), but its 2D extension >> (czt1.py) looks buggy. >> >> Nadav. I was not aware that the chirp-z transform can be used to efficiently compute the DFT over a limited part of the spectrum. I could use this. Any references on this technique? From stefan at sun.ac.za Mon Jan 5 08:58:11 2009 From: stefan at sun.ac.za (Stéfan van der Walt) Date: Mon, 5 Jan 2009 15:58:11 +0200 Subject: [Numpy-discussion] Zoom fft code In-Reply-To: References: <710F2847B0018641891D9A216027636029C39B@ex3.envision.co.il> <9457e7c80901050025j520ce5dds26d914d63e332e76@mail.gmail.com> Message-ID: <9457e7c80901050558v585437bcl5d5e17cb209ebebf@mail.gmail.com> 2009/1/5 Neal Becker : > I was not aware that the chirp-z transform can be used to efficiently compute the DFT over a limited part of the spectrum. I could use this. Any references on this technique? The only reference I have is the one mentioned in the source: Rabiner, L.R., R.W. Schafer and C.M. Rader. The Chirp z-Transform Algorithm. IEEE Transactions on Audio and Electroacoustics, AU-17(2):86--92, 1969 The discrete z-transform, X(z_k) = \sum_{n=0}^{N-1} x_n z_k^{-n}, is calculated at M points, z_k = A W^{-k}, k = 0,1,...,M-1. You can think of the z_k's as a spiral, where A controls the outside radius (starting frequency) and W the rate of inward spiralling. Regards Stéfan From cimrman3 at ntc.zcu.cz Mon Jan 5 10:13:48 2009 From: cimrman3 at ntc.zcu.cz (Robert Cimrman) Date: Mon, 05 Jan 2009 16:13:48 +0100 Subject: [Numpy-discussion] unique1d and asarray In-Reply-To: <2983337C-89AE-4FE6-8DAB-A68DA04F0380@gmail.com> References: <3d375d730901041347j72f4dba2p36dd3a03ccade2ae@mail.gmail.com> <2983337C-89AE-4FE6-8DAB-A68DA04F0380@gmail.com> Message-ID: <496223AC.1090403@ntc.zcu.cz> Pierre GM wrote: > On Jan 4, 2009, at 4:47 PM, Robert Kern wrote: > >> On Sun, Jan 4, 2009 at 15:44, Pierre GM wrote: >>> If we used np.asanyarray instead, subclasses are recognized properly, >>> the mask is recognized by argsort and the result correct. >>> Is there a reason why we use np.asarray instead of np.asanyarray ? >> Probably not. > > So there wouldn't be any objections to make the switch ? We can wait a > couple of days if anybody has a pb with that... There are probably other functions in arraysetops that could be fixed easily to work with masked arrays, feel free to do it if you like. I have never worked with the masked arrays, so the np.asarray problem had not come to my mind. Also, if you change np.asarray to np.asanyarray, add a corresponding test employing the masked arrays to test_arraysetops.py, please. cheers & thanks, r.
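A minimal numpy sketch of the chirp z-transform Stéfan describes above, via Bluestein's identity nk = (n^2 + k^2 - (k-n)^2)/2, which turns the sum into a convolution computable with FFTs. The function name czt and the zoom-FFT parameter choices below are illustrative, not an existing numpy/scipy API:

import numpy as np

def czt(x, m, w, a):
    # Evaluate X(z_k) = sum_n x[n] * z_k**(-n) at z_k = a * w**(-k),
    # for k = 0, ..., m-1, using Bluestein's algorithm.
    n = len(x)
    k = np.arange(max(m, n))
    chirp = w ** (k ** 2 / 2.0)                   # W**(k**2/2)
    nfft = int(2 ** np.ceil(np.log2(m + n - 1)))  # room for a linear convolution
    y = np.zeros(nfft, dtype=complex)
    y[:n] = x * a ** (-np.arange(n)) * chirp[:n]  # x[n] * A**(-n) * W**(n**2/2)
    v = np.zeros(nfft, dtype=complex)
    v[:m] = 1.0 / chirp[:m]                       # kernel W**(-j**2/2), j >= 0
    v[nfft - n + 1:] = 1.0 / chirp[n - 1:0:-1]    # negative lags, wrapped around
    g = np.fft.ifft(np.fft.fft(y) * np.fft.fft(v))
    return chirp[:m] * g[:m]

# Zoom FFT: m bins covering f1..f2 Hz of a signal sampled at fs Hz.
fs, m, f1, f2 = 1000.0, 128, 100.0, 150.0
t = np.arange(1024) / fs
x = np.sin(2 * np.pi * 123.4 * t)
a = np.exp(2j * np.pi * f1 / fs)                  # start of the arc on the unit circle
w = np.exp(-2j * np.pi * (f2 - f1) / (m * fs))    # step between evaluation points
X = czt(x, m, w, a)                               # fine-grained spectrum near 123.4 Hz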
From nadavh at visionsense.com Mon Jan 5 10:26:54 2009 From: nadavh at visionsense.com (Nadav Horesh) Date: Mon, 5 Jan 2009 17:26:54 +0200 Subject: [Numpy-discussion] Zoom fft code References: <710F2847B0018641891D9A216027636029C39B@ex3.envision.co.il> <9457e7c80901050025j520ce5dds26d914d63e332e76@mail.gmail.com> Message-ID: <710F2847B0018641891D9A216027636029C39C@ex3.envision.co.il> Thank you, I lost the code, so thank you for finding it. In addition, the chirp z-transform is broader than zoom fft. There was someone on this list who was especially interested in zoom fft, so I wondered if there was code for it. Anyway, I can use my old code again. Nadav -----Original Message----- From: numpy-discussion-bounces at scipy.org on behalf of Stéfan van der Walt Sent: Mon 05-January-09 10:25 To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] Zoom fft code Hi Nadav I recall that you posted an implementation yourself a while ago! http://www.mail-archive.com/numpy-discussion at scipy.org/msg01812.html Regards Stéfan 2009/1/5 Nadav Horesh : > > I am looking for a zoom fft code. I found an old code by Paule Kinzle (a matlab code with a translation to numarray), but its 2D extension (czt1.py) looks buggy. > > Nadav. _______________________________________________ Numpy-discussion mailing list Numpy-discussion at scipy.org http://projects.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 3221 bytes Desc: not available URL: From reakinator at gmail.com Mon Jan 5 11:06:06 2009 From: reakinator at gmail.com (Rich E) Date: Mon, 5 Jan 2009 17:06:06 +0100 Subject: [Numpy-discussion] help with typemapping a C function to use numpy arrays In-Reply-To: References: Message-ID: Egor, Thanks for the help. I think I want to leave the C code as-is however, as it is perfectly fine there not knowing 'sizeOutMag', because it can deduce both array sizes from one variable. There are many other similar cases in my code (many where the size of the array is known by a member of a structure passed to the function). Maybe I should look into using an 'insertion block' of code in the interface file, instead of trying to typemap the array? I am thinking I may just be able to copy the generated code (from SWIG) into my interface file to do this, but I have not tried it yet. I will experiment a little and post again. Thanks and happy holidays! regards, Rich On Mon, Jan 5, 2009 at 10:42 AM, Egor Zindy wrote: > Hello Rich, > > sorry it took so long to answer back, holidays and all :-) > > That's exactly the kind of SWIG / numpy.i problem I've been working on over > the past few months: how to generate an array you don't know the size of > a-priori, and then handle the memory deallocation seamlessly. In your case, > you know that the output array will be half the size of the input array, but > this falls under the more general case of "not knowing the output size > a-priori". > > Have a look at the files attached. I've rewritten your function header as: > void sms_spectrumMag( int sizeInMag, float *pInRect, int *sizeOutMag, float > **pOutMag); > > Easy to see what the input and output arrays are now. Then my numpy.i > handles the memory deallocation of the **pOutMag array. > > I've actually moved my numpy.i explanations to the scipy/numpy cookbook last > week :-) > http://www.scipy.org/Cookbook/SWIG_Memory_Deallocation > > Hope it all makes sense. If you have any questions, don't hesitate!
> >>python test_dftmagnitude.py > [1, 1, 2, 2] > [ 1.41421354 2.82842708] > [1, 1, 2, 2, 3, 3, 4, 4] > [ 1.41421354 2.82842708 4.2426405 5.65685415] > [1, 1, 2, 2, 3, 3, 4, 4, 5, 5] > [ 1.41421354 2.82842708 4.2426405 5.65685415 7.07106781] > > Regards, > Egor > > On Wed, Dec 24, 2008 at 1:52 AM, Rich E wrote: >> >> Hi list, >> >> My question has to do with the Numpy/SWIG typemapping system. >> >> I recently got the typemaps in numpy.i to work on most of my C >> functions that are wrapped using SWIG, if they have arguments of the >> form (int sizeArray, float *pArray). >> >> Now I am trying to figure out how to wrap functions that aren't of that >> form, such as the following function: >> >> /*! \brief compute magnitude spectrum of a DFT >> * >> * \param sizeMag size of output Magnitude (half of input >> real FFT) >> * \param pFReal pointer to input FFT real array >> (real/imag floats) >> * \param pFMAg pointer to float array of magnitude spectrum >> */ >> void sms_spectrumMag( int sizeMag, float *pInRect, float *pOutMag) >> { >> int i, it2; >> float fReal, fImag; >> >> for (i=0; i < sizeMag; i++) >> { >> it2 = i << 1; >> fReal = pInRect[it2]; >> fImag = pInRect[it2+1]; >> pOutMag[i] = sqrtf(fReal * fReal + fImag * fImag); >> } >> } >> >> There are two arrays, one is half the size of the other. But SWIG >> doesn't know this; according to the typemap it will think *pInRect is >> of size sizeMag and will not know anything about *pOutMag. >> >> Ideally in python, I would like to call the function as >> sms_spectrumMag(nArray1, nArray2), where nArray1 is twice the size of >> nArray2, and nArray2 is of size sizeMag. >> >> I think in order to do this (although if someone has a better >> suggestion, I am open to it), I will have to modify the typemap in >> order to tell SWIG how to call the C function properly. I do not want >> to have to edit the wrapped C file every time it is regenerated from >> the interface file. >> >> >> Here is a start I made with the existing typemap code in numpy.i (not >> working): >> >> /* Typemap suite for (DIM_TYPE DIM1, DATA_TYPE* INPLACE_ARRAY1) >> */ >> %typecheck(SWIG_TYPECHECK_DOUBLE_ARRAY, >> fragment="NumPy_Macros") >> (DIM_TYPE DIM1, DATA_TYPE* INPLACE_ARRAY1) >> { >> $1 = is_array($input) && PyArray_EquivTypenums(array_type($input), >> DATA_TYPECODE); >> } >> %typemap(in, >> fragment="NumPy_Fragments") >> (DIM_TYPE DIM1, DATA_TYPE* INPLACE_ARRAY1) >> (PyArrayObject* array=NULL, int i=0) >> { >> array = obj_to_array_no_conversion($input, DATA_TYPECODE); >> if (!array || !require_dimensions(array,1) || !require_contiguous(array) >> || !require_native(array)) SWIG_fail; >> $1 = 1; >> for (i=0; i < array_numdims(array); ++i) $1 *= array_size(array,i); >> $2 = (DATA_TYPE*) array_data(array); >> } >> >> and try to alter it to allow for a conversion of type: >> (DIM_TYPE DIM1, DATA_TYPE* ARRAY1, DATA_TYPE* ARRAY2) >> where ARRAY1 is size DIM1 * 2 and ARRAY2 is size DIM1. Then I can >> %apply this to my function that I mentioned in the last post. >> >> So here are my first two questions: >> >> 1) where is DIM1 used to declare the array size? I don't see where it >> is used at all, and I need to somewhere multiply it by 2 to declare >> the size of ARRAY1 >> >> 2) I am not understanding where $input comes from, so I do not >> understand how to distinguish between ARRAY1 and ARRAY2. In the >> attempt I have already tried, I think I just use the pointer to ARRAY1 >> twice. >> >> If anyone has suggestions on how to solve this problem, thanks!
>> >> regards, >> Rich >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion > > From garry.willgoose at newcastle.edu.au Mon Jan 5 19:48:57 2009 From: garry.willgoose at newcastle.edu.au (Garry Willgoose) Date: Tue, 6 Jan 2009 11:48:57 +1100 Subject: [Numpy-discussion] when will osx linker option -bundle be reflected in distutils Message-ID: <3993B772-CE2D-436E-AA88-336C8ADBBFCB@newcastle.edu.au> > > I was just wondering what plans there were to reflect the different > > linker options (i.e. -bundle instead of -shared) that are required > on > > OSX in the fcompiler files within distutils. While it's a minor thing > > it always catches the users of my software when they either install > > fresh or update numpy ... and sometimes on a bad day it even catches > > me ;-) > > I'm sorry; I don't follow. What problems are you having? -- > -- Robert Kern > ----------------------------------------------- OK for example the distribution g95.py in distutils/fcompiler has the following code executables = { 'version_cmd' : ["g95", "--version"], 'compiler_f77' : ["g95", "-ffixed-form"], 'compiler_fix' : ["g95", "-ffixed-form"], 'compiler_f90' : ["g95"], 'linker_so' : ["g95","-shared"], 'archiver' : ["ar", "-cr"], 'ranlib' : ["ranlib"] } For osx you need to modify it to executables = { 'version_cmd' : ["g95", "--version"], 'compiler_f77' : ["g95", "-ffixed-form"], 'compiler_fix' : ["g95", "-ffixed-form"], 'compiler_f90' : ["g95"], 'linker_so' : ["g95","-shared"], 'archiver' : ["ar", "-cr"], 'ranlib' : ["ranlib"] } import sys if sys.platform.lower() == 'darwin': executables['linker_so'] = ["g95","-Wall -bundle"] The 'shared' option is not implemented in the osx linker. Not sure what the underlying difference between 'shared' and 'bundle' is, but this substitution is necessary and this has been working for me for the last year or so. You also need the -Wall but for reasons that completely escape me. The same goes for gfortran and intel (both of which I use) and I assume the other compilers that are available for OSX. ==================================================================== Prof Garry Willgoose, Australian Professorial Fellow in Environmental Engineering, Director, Centre for Climate Impact Management (C2IM), School of Engineering, The University of Newcastle, Callaghan, 2308 Australia. Centre webpage: www.c2im.org.au Phone: (International) +61 2 4921 6050 (Tues-Fri AM); +61 2 6545 9574 (Fri PM-Mon) FAX: (International) +61 2 4921 6991 (Uni); +61 2 6545 9574 (personal and Telluric) Env. Engg.
Secretary: (International) +61 2 4921 6042 email: garry.willgoose at newcastle.edu.au; g.willgoose at telluricresearch.com email-for-life: garry.willgoose at alum.mit.edu personal webpage: www.telluricresearch.com/garry ==================================================================== "Do not go where the path may lead, go instead where there is no path and leave a trail" Ralph Waldo Emerson ==================================================================== From robert.kern at gmail.com Mon Jan 5 20:30:12 2009 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 5 Jan 2009 19:30:12 -0600 Subject: [Numpy-discussion] when will osx linker option -bundle be reflected in distutils In-Reply-To: <3993B772-CE2D-436E-AA88-336C8ADBBFCB@newcastle.edu.au> References: <3993B772-CE2D-436E-AA88-336C8ADBBFCB@newcastle.edu.au> Message-ID: <3d375d730901051730o28c509d5g7bcdb8d8ad712815@mail.gmail.com> On Mon, Jan 5, 2009 at 18:48, Garry Willgoose wrote: >> > I was just wondering what plans there were to reflect the different >> > linker options (i.e. -bundle instead of -shared) that are required >> on >> > OSX in the fcompiler files within distutils. While its a minor thing >> > it always catches the users of my software when they either install >> > fresh or update numpy ... and sometimes on a bad day it even catches >> > me ;-) >> >> I'm sorry; I don't follow. What problems are you having? -- >> -- Robert Kern >> > ----------------------------------------------- > > OK for example the distribution g95.py in distutils/fcompiler has the > following code > > executables = { > 'version_cmd' : ["g95", "--version"], > 'compiler_f77' : ["g95", "-ffixed-form"], > 'compiler_fix' : ["g95", "-ffixed-form"], > 'compiler_f90' : ["g95"], > 'linker_so' : ["g95","-shared"], > 'archiver' : ["ar", "-cr"], > 'ranlib' : ["ranlib"] > } > > For osx you need to modify it to > > executables = { > 'version_cmd' : ["g95", "--version"], > 'compiler_f77' : ["g95", "-ffixed-form"], > 'compiler_fix' : ["g95", "-ffixed-form"], > 'compiler_f90' : ["g95"], > 'linker_so' : ["g95","-shared"], > 'archiver' : ["ar", "-cr"], > 'ranlib' : ["ranlib"] > } > import sys > if sys.platform.lower() == 'darwin': > executables[linker_so'] = ["g95","-Wall -bundle"] > > The 'shared' option is not implemented in the osx linker. Not sure > what the underlying difference between 'shared' and 'bundle' is but > this substitution is necessary and this has been working for me for > the last year or so. You also need the -Wall but for reasons that > completely escape me. -Wall absolutely should not affect anything except adding warning messages. I suspect something else is getting modified when you do that. > The same goes for gfortran and intel (both of which I use) and I > assume the other compilers that are available for OSX. I've been building scipy for years with gfortran and an unmodified numpy on OS X. The correct switches are added in the get_flags_linker_so() method: def get_flags_linker_so(self): opt = self.linker_so[1:] if sys.platform=='darwin': # MACOSX_DEPLOYMENT_TARGET must be at least 10.3. This is # a reasonable default value even when building on 10.4 when using # the official Python distribution and those derived from it (when # not broken). 
target = os.environ.get('MACOSX_DEPLOYMENT_TARGET', None) if target is None or target == '': target = '10.3' major, minor = target.split('.') if int(minor) < 3: minor = '3' warnings.warn('Environment variable ' 'MACOSX_DEPLOYMENT_TARGET reset to %s.%s' % (major, minor)) os.environ['MACOSX_DEPLOYMENT_TARGET'] = '%s.%s' % (major, minor) opt.extend(['-undefined', 'dynamic_lookup', '-bundle']) else: opt.append("-shared") if sys.platform.startswith('sunos'): # SunOS often has dynamically loaded symbols defined in the # static library libg2c.a. The linker doesn't like this. To # ignore the problem, use the -mimpure-text flag. It isn't # the safest thing, but seems to work. 'man gcc' says: # ".. Instead of using -mimpure-text, you should compile all # source code with -fpic or -fPIC." opt.append('-mimpure-text') return opt If this is not working for you, please show me the error messages you get. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From fonnesbeck at maths.otago.ac.nz Mon Jan 5 21:41:57 2009 From: fonnesbeck at maths.otago.ac.nz (Christopher Fonnesbeck) Date: Tue, 6 Jan 2009 15:41:57 +1300 Subject: [Numpy-discussion] ANN: PyMC 2.0 Message-ID: <7DB2E7D8-A9CF-4F3F-9979-71BC5F26DFDD@maths.otago.ac.nz> Numpy list members, It gives me great pleasure to be able to announce the long-awaited release of PyMC 2.0. Platform-specific installers have been uploaded to the Google Code page (Mac OSX) and the Python Package Index (all other platforms), along with the new user's guide (http://pymc.googlecode.com/files/UserGuide2.0.pdf). PyMC is a python module that implements Bayesian statistical models and fitting algorithms, including Markov chain Monte Carlo. Its flexibility makes it applicable to a large suite of problems, and it is easily extensible. Along with core sampling functionality, PyMC includes methods for summarizing output, plotting, goodness-of-fit and convergence diagnostics. PyMC 2.0 is a quantum leap from the 1.3 release. It includes a completely revised object model and syntax, more efficient log-probability computation, a variety of specialised MCMC algorithms, and an expanded set of optimised probability distributions. As a result, models built for previous versions of PyMC will not run under version 2.0. I would like to particularly thank Anand Patil and David Huard, who have done most of the work on this version, and to all the users who have sent questions, comments and bug reports over the past year or two. Please keep the feedback coming! Please report any problems with the release to the issues page (http://code.google.com/p/pymc/issues/list). Python Package Index: http://pypi.python.org/pypi/pymc/ Google Code: http://pymc.googlecode.com Mailing List: http://groups.google.com/group/pymc Happy new year, Chris -- Christopher J.
Fonnesbeck Department of Mathematics and Statistics University of Otago, PO Box 56 Dunedin, New Zealand From dmacks at netspace.org Mon Jan 5 23:11:30 2009 From: dmacks at netspace.org (Daniel Macks) Date: Mon, 5 Jan 2009 23:11:30 -0500 Subject: [Numpy-discussion] when will osx linker option -bundle be reflected in distutils In-Reply-To: <3993B772-CE2D-436E-AA88-336C8ADBBFCB@newcastle.edu.au> References: <3993B772-CE2D-436E-AA88-336C8ADBBFCB@newcastle.edu.au> Message-ID: <20090106041130.GA21304@happy.netspace.org> On Tue, Jan 06, 2009 at 11:48:57AM +1100, Garry Willgoose wrote: > The 'shared' option is not implemented in the osx linker. Not sure > what the underlying difference between 'shared' and 'bundle' is To answer this narrow part of the question, -shared is the way to build shared libraries on linux (I think it's part of the standard GNU ld and/or ELF binary format), and is how one builds all sorts of .so. On OS X, there is a difference between a "dynamic library" (one that is linked later via "-lFOO" flags, standard extension .dylib) and a "loadable module" (one that is loaded at runtime via dlopen() or similar methods, often extension .so). Linux doesn't have as sharp a distinction. The OS X linker uses different flags to specify which one to build (-dynamiclib and -bundle, respectively). dan -- Daniel Macks dmacks at netspace.org http://www.netspace.org/~dmacks From stefan at sun.ac.za Tue Jan 6 04:15:45 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Tue, 6 Jan 2009 11:15:45 +0200 Subject: [Numpy-discussion] Handling methods of object-arrays Message-ID: <9457e7c80901060115wb142238i1fbcca5ebae02299@mail.gmail.com> Hi all, What is the exact protocol for evaluating functions like "real" and "imag" on object arrays? For example, I'm looking at x = np.array([np.array(3+1j), np.array(4+1j)], dtype=object) For which both In [4]: x.real Out[4]: array([(3+1j), (4+1j)], dtype=object) and In [6]: np.real(x) Out[6]: array([(3+1j), (4+1j)], dtype=object) do nothing, so that I have to do In [8]: [np.real(e) for e in x] Out[8]: [array(3.0), array(4.0)] or [e.real for e in x]. Would it make sense to make np.real aware of the above scenario? Regards Stéfan From robert.kern at gmail.com Tue Jan 6 04:19:10 2009 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 6 Jan 2009 03:19:10 -0600 Subject: [Numpy-discussion] Handling methods of object-arrays In-Reply-To: <9457e7c80901060115wb142238i1fbcca5ebae02299@mail.gmail.com> References: <9457e7c80901060115wb142238i1fbcca5ebae02299@mail.gmail.com> Message-ID: <3d375d730901060119k6e124d15lc2d66e36102fbabc@mail.gmail.com> On Tue, Jan 6, 2009 at 03:15, Stéfan van der Walt wrote: > Hi all, > > What is the exact protocol for evaluating functions like "real" and > "imag" on object arrays? > > For example, I'm looking at > > x = np.array([np.array(3+1j), np.array(4+1j)], dtype=object) > > For which both > > In [4]: x.real > Out[4]: array([(3+1j), (4+1j)], dtype=object) > > and > > In [6]: np.real(x) > Out[6]: array([(3+1j), (4+1j)], dtype=object) > > do nothing, so that I have to do > > In [8]: [np.real(e) for e in x] > Out[8]: [array(3.0), array(4.0)] > > or [e.real for e in x]. > > Would it make sense to make np.real aware of the above scenario? Ufuncs like np.sin() check for e.sin(), so a case could certainly be made that ndarray.real should check for e.real.
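In the meantime, a per-element fallback is easy to sketch; np.frompyfunc is one way to lift a plain Python callable over an object array (the helper name below is made up for illustration, and this is only a sketch, not numpy's actual mechanism):

import numpy as np

x = np.array([np.array(3+1j), np.array(4+1j)], dtype=object)

# Hypothetical helper: apply the .real attribute lookup to each element.
obj_real = np.frompyfunc(lambda e: e.real, 1, 1)

print obj_real(x)   # object array holding the per-element real parts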
-- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From pommereau at univ-paris12.fr Tue Jan 6 04:35:01 2009 From: pommereau at univ-paris12.fr (Franck Pommereau) Date: Tue, 06 Jan 2009 10:35:01 +0100 Subject: [Numpy-discussion] [Newbie] Fast plotting Message-ID: <496325C5.4000505@univ-paris12.fr> Hi all, and happy new year! I'm new to NumPy and searching a way to compute from a set of points (x,y) the mean value of y values associated to each distinct x value. Each point corresponds to a measure in a benchmark (x = parameter, y = computation time) and I'd like to plot the graph of mean computation time wrt parameter values. (I know how to plot, but not how to compute mean values.) My points are stored as two arrays X, Y (same size). In pure Python, I'd do as follows: s = {} # sum of y values for each distinct x (as keys) n = {} # number of summed values (same keys) for x, y in zip(X, Y) : s[x] = s.get(x, 0.0) + y n[x] = n.get(x, 0) + 1 new_x = array(list(sorted(s))) new_y = array([s[x]/n[x] for x in sorted(s)]) Unfortunately, this code is much too slow because my arrays have millions of elements. But I'm pretty sure that NumPy offers a way to handle this more elegantly and much faster. As a bonus, I'd be happy if the solution would allow me to compute also standard deviation, min, max, etc. Thanks in advance for any help! Franck From stefan at sun.ac.za Tue Jan 6 05:14:53 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Tue, 6 Jan 2009 12:14:53 +0200 Subject: [Numpy-discussion] Old-style classes in tests In-Reply-To: <492ACBCB.4060204@resolversystems.com> References: <492ACBCB.4060204@resolversystems.com> Message-ID: <9457e7c80901060214o68bf8cdeq123f234ed6d2dfce@mail.gmail.com> Hi Tom 2008/11/24 Tom Wright : > I am currently working on the Ironclad project porting numpy to Ironpython. > > It would be quite useful for me if HermitianTestCase in test_linalg.py > was a new-style class instead of an old-style class - since Ironpython > has a bug where dir operations do not work for classes inheriting from > both old- and new-style classes and I'd very much prefer not to patch > my version of numpy. These changes have been made in r6297. Let me know if you find any other occurrences. Regards Stéfan From faltet at pytables.org Tue Jan 6 06:56:44 2009 From: faltet at pytables.org (Francesc Alted) Date: Tue, 6 Jan 2009 12:56:44 +0100 Subject: [Numpy-discussion] [Newbie] Fast plotting In-Reply-To: <496325C5.4000505@univ-paris12.fr> References: <496325C5.4000505@univ-paris12.fr> Message-ID: <200901061256.44641.faltet@pytables.org> A Tuesday 06 January 2009, Franck Pommereau wrote: > Hi all, and happy new year! > > I'm new to NumPy and searching a way to compute from a set of points > (x,y) the mean value of y values associated to each distinct x value. > Each point corresponds to a measure in a benchmark (x = parameter, y > = computation time) and I'd like to plot the graph of mean > computation time wrt parameter values. (I know how to plot, but not > how to compute mean values.) > > My points are stored as two arrays X, Y (same size).
> In pure Python, I'd do as follows: > > s = {} # sum of y values for each distinct x (as keys) > n = {} # number of summed values (same keys) > for x, y in zip(X, Y) : > s[x] = s.get(x, 0.0) + y > n[x] = n.get(x, 0) + 1 > new_x = array(list(sorted(s))) > new_y = array([s[x]/n[x] for x in sorted(s)]) > > Unfortunately, this code is much too slow because my arrays have > millions of elements. But I'm pretty sure that NumPy offers a way to > handle this more elegantly and much faster. > > As a bonus, I'd be happy if the solution would allow me to compute > also standard deviation, min, max, etc. The next would do the trick: In [92]: x = np.random.randint(100,size=100) In [93]: y = np.random.rand(100) In [94]: u = np.unique(x) In [95]: means = [ y[x == i].mean() for i in u ] In [96]: stds = [ y[x == i].std() for i in u ] In [97]: maxs = [ y[x == i].max() for i in u ] In [98]: mins = [ y[x == i].min() for i in u ] and your wanted data will be in means, stds, maxs and mins lists. This approach has the drawback that you have to process the array each time that you want to extract the desired info. If what you want is to always retrieve the same set of statistics, you can do this in one single loop: In [99]: means, stds, maxs, mins = [], [], [], [] In [100]: for i in u: g = y[x == i] means.append(g.mean()) stds.append(g.std()) maxs.append(g.max()) mins.append(g.min()) .....: which has the same effect as above, but is much faster. Hope that helps, -- Francesc Alted From boogaloojb at yahoo.fr Tue Jan 6 08:38:27 2009 From: boogaloojb at yahoo.fr (Jean-Baptiste Rudant) Date: Tue, 6 Jan 2009 13:38:27 +0000 (GMT) Subject: [Numpy-discussion] Re : [Newbie] Fast plotting Message-ID: <970036.60257.qm@web28512.mail.ukl.yahoo.com> Hello, I'm not an expert. Something exists in matplotlib, but it's not very efficient. import matplotlib.mlab import numpy N = 1000 X = numpy.random.randint(0, 10, N) Y = numpy.random.random(N) recXY = numpy.rec.fromarrays((X, Y), names='x, y') summary = matplotlib.mlab.rec_groupby(recXY, ('x',), (('y', numpy.mean, 'y_avg'),)) Jean-Baptiste Rudant ________________________________ From: Franck Pommereau To: Discussion of Numerical Python Sent: Tuesday, 6 January 2009, 10:35:01 Subject: [Numpy-discussion] [Newbie] Fast plotting Hi all, and happy new year! I'm new to NumPy and searching a way to compute from a set of points (x,y) the mean value of y values associated to each distinct x value. Each point corresponds to a measure in a benchmark (x = parameter, y = computation time) and I'd like to plot the graph of mean computation time wrt parameter values. (I know how to plot, but not how to compute mean values.) My points are stored as two arrays X, Y (same size). In pure Python, I'd do as follows: s = {} # sum of y values for each distinct x (as keys) n = {} # number of summed values (same keys) for x, y in zip(X, Y) : s[x] = s.get(x, 0.0) + y n[x] = n.get(x, 0) + 1 new_x = array(list(sorted(s))) new_y = array([s[x]/n[x] for x in sorted(s)]) Unfortunately, this code is much too slow because my arrays have millions of elements. But I'm pretty sure that NumPy offers a way to handle this more elegantly and much faster. As a bonus, I'd be happy if the solution would allow me to compute also standard deviation, min, max, etc. Thanks in advance for any help!
Franck _______________________________________________ Numpy-discussion mailing list Numpy-discussion at scipy.org http://projects.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From jdh2358 at gmail.com Tue Jan 6 09:34:02 2009 From: jdh2358 at gmail.com (John Hunter) Date: Tue, 6 Jan 2009 08:34:02 -0600 Subject: [Numpy-discussion] Re : [Newbie] Fast plotting In-Reply-To: <970036.60257.qm@web28512.mail.ukl.yahoo.com> References: <970036.60257.qm@web28512.mail.ukl.yahoo.com> Message-ID: <88e473830901060634t4e60c1eap78d33b0219cfca7e@mail.gmail.com> On Tue, Jan 6, 2009 at 7:38 AM, Jean-Baptiste Rudant wrote: > Hello, > I'm not an expert. Something exists in matplotlib, but it's not very > efficient. > import matplotlib.mlab > import numpy > N = 1000 > X = numpy.random.randint(0, 10, N) > Y = numpy.random.random(N) > recXY = numpy.rec.fromarrays((X, Y), names='x, y') > summary = matplotlib.mlab.rec_groupby(recXY, ('x',), (('y', numpy.mean, > 'y_avg'),)) And you can use rec2txt for pretty printing in the shell: In [103]: print matplotlib.mlab.rec2txt(summary)
   x   y_avg
   0   0.506
   1   0.531
   2   0.491
   3   0.482
   4   0.511
   5   0.507
   6   0.543
   7   0.525
   8   0.512
   9   0.472
> Jean-Baptiste Rudant > > ________________________________ > From: Franck Pommereau > To: Discussion of Numerical Python > Sent: Tuesday, 6 January 2009, 10:35:01 > Subject: [Numpy-discussion] [Newbie] Fast plotting > > Hi all, and happy new year! > > I'm new to NumPy and searching a way to compute from a set of points > (x,y) the mean value of y values associated to each distinct x value. > Each point corresponds to a measure in a benchmark (x = parameter, y = > computation time) and I'd like to plot the graph of mean computation > time wrt parameter values. (I know how to plot, but not how to compute > mean values.) > > My points are stored as two arrays X, Y (same size). > In pure Python, I'd do as follows: > > s = {} # sum of y values for each distinct x (as keys) > n = {} # number of summed values (same keys) > for x, y in zip(X, Y) : > s[x] = s.get(x, 0.0) + y > n[x] = n.get(x, 0) + 1 > new_x = array(list(sorted(s))) > new_y = array([s[x]/n[x] for x in sorted(s)]) > > Unfortunately, this code is much too slow because my arrays have > millions of elements. But I'm pretty sure that NumPy offers a way to > handle this more elegantly and much faster. > > As a bonus, I'd be happy if the solution would allow me to compute also > standard deviation, min, max, etc. > > Thanks in advance for any help! > Franck > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > From bsouthey at gmail.com Tue Jan 6 09:44:42 2009 From: bsouthey at gmail.com (Bruce Southey) Date: Tue, 06 Jan 2009 08:44:42 -0600 Subject: [Numpy-discussion] [Newbie] Fast plotting In-Reply-To: <200901061256.44641.faltet@pytables.org> References: <496325C5.4000505@univ-paris12.fr> <200901061256.44641.faltet@pytables.org> Message-ID: <49636E5A.2070301@gmail.com> Francesc Alted wrote: > A Tuesday 06 January 2009, Franck Pommereau wrote: > >> Hi all, and happy new year!
>> >> I'm new to NumPy and searching a way to compute from a set of points >> (x,y) the mean value of y values associated to each distinct x value. >> Each point corresponds to a measure in a benchmark (x = parameter, y >> = computation time) and I'd like to plot the graph of mean >> computation time wrt parameter values. (I know how to plot, but not >> how to compute mean values.) >> >> My points are stored as two arrays X, Y (same size). >> In pure Python, I'd do as follows: >> >> s = {} # sum of y values for each distinct x (as keys) >> n = {} # number of summed values (same keys) >> for x, y in zip(X, Y) : >> s[x] = s.get(x, 0.0) + y >> n[x] = n.get(x, 0) + 1 >> new_x = array(list(sorted(s))) >> new_y = array([s[x]/n[x] for x in sorted(s)]) >> >> Unfortunately, this code is much too slow because my arrays have >> millions of elements. But I'm pretty sure that NumPy offers a way to >> handle this more elegantly and much faster. >> >> As a bonus, I'd be happy if the solution would allow me to compute >> also standard deviation, min, max, etc. >> > > The next would do the trick: > > In [92]: x = np.random.randint(100,size=100) > > In [93]: y = np.random.rand(100) > > In [94]: u = np.unique(x) > > In [95]: means = [ y[x == i].mean() for i in u ] > > In [96]: stds = [ y[x == i].std() for i in u ] > > In [97]: maxs = [ y[x == i].max() for i in u ] > > In [98]: mins = [ y[x == i].min() for i in u ] > > and your wanted data will be in means, stds, maxs and mins lists. This > approach has the drawback that you have to process the array each time > that you want to extract the desired info. If what you want is to > always retrieve the same set of statistics, you can do this in one > single loop: > > In [99]: means, std, maxs, mins = [], [], [], [] > > In [100]: for i in u: > g = y[x == i] > means.append(g.mean()) > stds.append(g.std()) > maxs.append(g.max()) > mins.append(g.min()) > .....: > > which has the same effect than above, but is much faster. > > Hope that helps, > > If you use Knuth's one pass approach (http://en.wikipedia.org/wiki/Algorithms_for_calculating_variance#III._On-line_algorithm) you can write a function to get the min, max, mean and variance/standard deviation in a single pass through the array rather than one pass for each. I do not know if this will provide any advantage as that will probably depend on the size of the arrays. Also, please use the highest precision possible (ie float128) for your arrays to minimize numerical error due to the size of your arrays. Bruce From ezindy at gmail.com Tue Jan 6 09:47:38 2009 From: ezindy at gmail.com (Egor Zindy) Date: Tue, 6 Jan 2009 23:47:38 +0900 Subject: [Numpy-discussion] help with typemapping a C function to use numpy arrays In-Reply-To: References: Message-ID: Hello again, I really don't know what came over me when I changed your function prototype, that wasn't a very thoughtful thing to do! Maybe I should look into using an 'insertion block' of code in the > interface file, instead of trying to typemap the array? > Insertion blocks... is that %inline code? In which case, yes! Have a look, I attached a new version that uses some %inline directives in the dftmagnitude.i file. Basically, you can inline a new function with an easier prototype to wrap. The function allocates memory and calls your sms_spectrumMag() function. 
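In outline, the wrapper can look something like this (a rough sketch only, not the exact contents of the attached file; the malloc error checking and the stdlib.h include that a real version needs are left out):

%inline %{
    /* Hypothetical wrapper: allocate the output array, then delegate to
       the original function. The magnitude array is half the size of the
       interleaved real/imag input. */
    void my_spectrumMag(int sizeInMag, float *pInRect,
                        int *sizeOutMag, float **pOutMag)
    {
        *sizeOutMag = sizeInMag / 2;
        *pOutMag = (float *) malloc(*sizeOutMag * sizeof(float));
        sms_spectrumMag(*sizeOutMag, pInRect, *pOutMag);
    }
%}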
my inline function: void my_spectrumMag( int sizeInMag, float *pInRect, int *sizeOutMag, float **pOutMag) there's also a %rename directive: %rename (spectrumMag) my_spectrumMag; I had a go at defining some exceptions too (no memory and odd number of indexes), but I'm not sure errno is the easiest way to go about it... Hope this helps! ... and the python test output: ~>python test_dftmagnitude.py array: [1, 1, 2, 2] result: [ 1.41421354 2.82842708] array: [1, 1, 2, 2, 3, 3, 4, 4] result: [ 1.41421354 2.82842708 4.2426405 5.65685415] array: [1, 1, 2] result: Traceback (most recent call last): File "test_dftmagnitude.py", line 15, in <module> print "result:",dftmagnitude.spectrumMag(a) IndexError: Odd number of elements in input array: 3 ~> Regards, Egor On Tue, Jan 6, 2009 at 1:06 AM, Rich E wrote: > Egor, > > Thanks for the help. I think I want to leave the C code as-is > however, as it is perfectly fine there not knowing 'sizeOutMag' because > it can deduce both array sizes from one variable. There are many > other similar cases in my code (many where the size of the array is > known by a member of a structure passed to the function). > > Maybe I should look into using an 'insertion block' of code in the > interface file, instead of trying to typemap the array? I am thinking > I may just be able to copy the generated code (from SWIG) into my > interface file to do this, but I have not tried it yet. > > I will experiment a little and post again. Thanks and happy holidays! > > regards, > Rich > > On Mon, Jan 5, 2009 at 10:42 AM, Egor Zindy wrote: > > Hello Rich, > > > > sorry it took so long to answer back, holidays and all :-) > > > > That's exactly the kind of SWIG / numpy.i problems I've been working on over > > the past few months: How to generate an array you don't know the size of > > a-priori, and then handle the memory deallocation seamlessly. In your case, > > you know that the output array will be half the size of the input array, but > > this falls under the more general case of "not knowing the output size > > a-priori". > > > > Have a look at the files attached. I've rewritten your function header as: > > void sms_spectrumMag( int sizeInMag, float *pInRect, int *sizeOutMag, float > > **pOutMag); > > > > Easy to see what the input and output arrays are now. Then my numpy.i > > handles the memory deallocation of the **pOutMag array. > > > > I've actually moved my numpy.i explanations to the scipy/numpy cookbook last > > week :-) > > http://www.scipy.org/Cookbook/SWIG_Memory_Deallocation > > > > Hope it all makes sense. If you have any questions, don't hesitate! > > > >>python test_dftmagnitude.py > > [1, 1, 2, 2] > > [ 1.41421354 2.82842708] > > [1, 1, 2, 2, 3, 3, 4, 4] > > [ 1.41421354 2.82842708 4.2426405 5.65685415] > > [1, 1, 2, 2, 3, 3, 4, 4, 5, 5] > > [ 1.41421354 2.82842708 4.2426405 5.65685415 7.07106781] > > > > Regards, > > Egor > > > > On Wed, Dec 24, 2008 at 1:52 AM, Rich E wrote: > >> > >> Hi list, > >> > >> My question has to do with the Numpy/SWIG typemapping system. > >> > >> I recently got the typemaps in numpy.i to work on most of my C > >> functions that are wrapped using SWIG, if they have arguments of the > >> form (int sizeArray, float *pArray). > >> > >> Now I am trying to figure out how to wrap functions that aren't of that > >> form, such as the following function: > >> > >> /*!
\brief compute magnitude spectrum of a DFT > >> * > >> * \param sizeMag size of output Magnitude (half of input > >> real FFT) > >> * \param pFReal pointer to input FFT real array > >> (real/imag floats) > >> * \param pFMAg pointer to float array of magnitude spectrum > >> */ > >> void sms_spectrumMag( int sizeMag, float *pInRect, float *pOutMag) > >> { > >> int i, it2; > >> float fReal, fImag; > >> > >> for (i=0; i<sizeMag; i++) > >> { > >> it2 = i << 1; > >> fReal = pInRect[it2]; > >> fImag = pInRect[it2+1]; > >> pOutMag[i] = sqrtf(fReal * fReal + fImag * fImag); > >> } > >> } > >> > >> There are two arrays, one is half the size of the other. But, SWIG > >> doesn't know this, according to the type map it will think *pInRect is > >> of size sizeMag and will not know anything about *pOutMag. > >> > >> Ideally in python, I would like to call the function as > >> sms_spectrumMag(nArray1, nArray2), where nArray1 is twice the size of > >> nArray2, and nArray2 is of size sizeMag. > >> > >> I think in order to do this (although if someone has a better > >> suggestion, I am open to it), I will have to modify the typemap in > >> order to tell SWIG how to call the C function properly. I do not want > >> to have to edit the wrapped C file every time it is regenerated from > >> the interface file. > >> > >> > >> Here is a start I made with the existing typemap code in numpy.i (not > >> working): > >> > >> /* Typemap suite for (DIM_TYPE DIM1, DATA_TYPE* INPLACE_ARRAY1) > >> */ > >> %typecheck(SWIG_TYPECHECK_DOUBLE_ARRAY, > >> fragment="NumPy_Macros") > >> (DIM_TYPE DIM1, DATA_TYPE* INPLACE_ARRAY1) > >> { > >> $1 = is_array($input) && PyArray_EquivTypenums(array_type($input), > >> DATA_TYPECODE); > >> } > >> %typemap(in, > >> fragment="NumPy_Fragments") > >> (DIM_TYPE DIM1, DATA_TYPE* INPLACE_ARRAY1) > >> (PyArrayObject* array=NULL, int i=0) > >> { > >> array = obj_to_array_no_conversion($input, DATA_TYPECODE); > >> if (!array || !require_dimensions(array,1) || > !require_contiguous(array) > >> || !require_native(array)) SWIG_fail; > >> $1 = 1; > >> for (i=0; i < array_numdims(array); ++i) $1 *= array_size(array,i); > >> $2 = (DATA_TYPE*) array_data(array); > >> } > >> > >> and try to alter it to allow for a conversion of type: > >> (DIM_TYPE DIM1, DATA_TYPE* ARRAY1, DATA_TYPE* ARRAY2) > >> where ARRAY1 is size DIM1 * 2 and ARRAY2 is size DIM1. Then I can > >> %apply this to my function that I mentioned in the last post. > >> > >> So here are my first two questions: > >> > >> 1) where is DIM1 used to declare the array size? I don't see where it > >> is used at all, and I need to somewhere multiply it by 2 to declare > >> the size of ARRAY1 > >> > >> 2) I am not understanding where $input comes from, so I do not > >> understand how to distinguish between ARRAY1 and ARRAY2. In the > >> attempt I have already tried, I think I just use the pointer to ARRAY1 > >> twice. > >> > >> If anyone has suggestions on how to solve this problem, thanks! > >> > >> regards, > >> Rich > >> _______________________________________________ > >> Numpy-discussion mailing list > >> Numpy-discussion at scipy.org > >> http://projects.scipy.org/mailman/listinfo/numpy-discussion > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed...
Name: dftmagnitude2.zip Type: application/zip Size: 10880 bytes Desc: not available URL: From sebastian at sipsolutions.net Tue Jan 6 09:45:31 2009 From: sebastian at sipsolutions.net (Sebastian Stephan Berg) Date: Tue, 06 Jan 2009 15:45:31 +0100 Subject: [Numpy-discussion] [Newbie] Fast plotting In-Reply-To: <496325C5.4000505@univ-paris12.fr> References: <496325C5.4000505@univ-paris12.fr> Message-ID: <1231253131.6741.50.camel@sebook> Hello, Just thinking. If the parameters are limited, you may be able to use the histogram feature? Doing one histogram with Y as weights, then one without weights and calculating the mean from this yourself should be pretty speedy I imagine. Other than that, maybe sorting the whole thing and then doing some searchsorted with side='right' and working on those slices. I mean something like this: def spam(x, y, work_on_copy=False): """Take the arrays x and y and return unique_x_values, means, stds, maxs, mins as lists. means, stds, maxs and mins are those of the corresponding y values. If work_on_copy is true, x and y are copied to ensure that they are not sorted in place. """ u, means, stds, maxs, mins = [], [], [], [], [] s = x.argsort() if work_on_copy: x = x[s] y = y[s] else: x[:] = x[s] y[:] = y[s] start = 0 value = x[0] while True: next = x.searchsorted(value, side='right') u.append(value) means.append(y[start:next].mean()) stds.append(y[start:next].std()) maxs.append(y[start:next].max()) mins.append(y[start:next].min()) if next == len(x): break value = x[next] start = next return u, means, stds, maxs, mins This is of course basically the same as what Francesc suggested, but a quick test shows that it seems to scale better. I didn't try the speed of histogram. Sebastian On Tue, 2009-01-06 at 10:35 +0100, Franck Pommereau wrote: > Hi all, and happy new year! > > I'm new to NumPy and searching a way to compute from a set of points > (x,y) the mean value of y values associated to each distinct x value. > Each point corresponds to a measure in a benchmark (x = parameter, y = > computation time) and I'd like to plot the graph of mean computation > time wrt parameter values. (I know how to plot, but not how to compute > mean values.) > > My points are stored as two arrays X, Y (same size). > In pure Python, I'd do as follows: > > s = {} # sum of y values for each distinct x (as keys) > n = {} # number of summed values (same keys) > for x, y in zip(X, Y) : > s[x] = s.get(x, 0.0) + y > n[x] = n.get(x, 0) + 1 > new_x = array(list(sorted(s))) > new_y = array([s[x]/n[x] for x in sorted(s)]) > > Unfortunately, this code is much too slow because my arrays have > millions of elements. But I'm pretty sure that NumPy offers a way to > handle this more elegantly and much faster. > > As a bonus, I'd be happy if the solution would allow me to compute also > standard deviation, min, max, etc. > > Thanks in advance for any help!
> Franck > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From aisaac at american.edu Tue Jan 6 10:11:51 2009 From: aisaac at american.edu (Alan G Isaac) Date: Tue, 06 Jan 2009 10:11:51 -0500 Subject: [Numpy-discussion] [Newbie] Fast plotting In-Reply-To: <49636E5A.2070301@gmail.com> References: <496325C5.4000505@univ-paris12.fr> <200901061256.44641.faltet@pytables.org> <49636E5A.2070301@gmail.com> Message-ID: <496374B7.9050900@american.edu> A Tuesday 06 January 2009, Franck Pommereau wrote: > s = {} # sum of y values for each distinct x (as keys) > n = {} # number of summed values (same keys) > for x, y in zip(X, Y) : > s[x] = s.get(x, 0.0) + y > n[x] = n.get(x, 0) + 1 Maybe this is not so bad with a couple of changes? from collections import defaultdict from itertools import izip s = defaultdict(int) # sum of y values for each distinct x (as keys) n = defaultdict(int) # number of summed values (same keys) for x, y in izip(X, Y) : s[x] += y n[x] += 1 fwiw, Alan Isaac From igorsyl at gmail.com Tue Jan 6 15:07:56 2009 From: igorsyl at gmail.com (Igor Sylvester) Date: Tue, 6 Jan 2009 14:07:56 -0600 Subject: [Numpy-discussion] record array with array elements Message-ID: Everyone, Shouldn't the itemsize below be 2? >>> import numpy as np >>> dtype = np.dtype( [ (((2,), 'top'), [('nested', 'i1')]) ] ) >>> dtype.itemsize 1 >>> np.__version__ '1.0.4' The elements of the dtype are of type array of size 2. Each element is a (nested) record array of size 2 with one field of type 'i1'. In contiguous memory, this should look identical to an 'i1' array of size 2. -Igor Sylvester -------------- next part -------------- An HTML attachment was scrubbed... URL: From igorsyl at gmail.com Tue Jan 6 15:15:07 2009 From: igorsyl at gmail.com (Igor Sylvester) Date: Tue, 6 Jan 2009 14:15:07 -0600 Subject: [Numpy-discussion] record array with array elements In-Reply-To: References: Message-ID: A simpler example returns 1 as well: np.dtype( [ (((2,), 'a'), 'i1') ] ).itemsize On Tue, Jan 6, 2009 at 2:07 PM, Igor Sylvester wrote: > Everyone, > > Shouldn't the itemsize below be 2? > > >>> import numpy as np > >>> dtype = np.dtype( [ (((2,), 'top'), [('nested', 'i1')]) ] ) > >>> dtype.itemsize > 1 > >>> np.__version__ > '1.0.4' > > The elements of the dtype are of type array of size 2. Each element is a > (nested) record array of size 2 with one field of type 'i1'. In contiguous > memory, this should look identical to an 'i1' array of size 2. > > -Igor Sylvester > -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Tue Jan 6 16:41:49 2009 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 6 Jan 2009 15:41:49 -0600 Subject: [Numpy-discussion] record array with array elements In-Reply-To: References: Message-ID: <3d375d730901061341x5af96075h16678408ec33472c@mail.gmail.com> On Tue, Jan 6, 2009 at 14:07, Igor Sylvester wrote: > Everyone, > > Shouldn't the itemsize below be 2? > >>>> import numpy as np >>>> dtype = np.dtype( [ (((2,), 'top'), [('nested', 'i1')]) ] ) >>>> dtype.itemsize > 1 >>>> np.__version__ > '1.0.4' > > The elements of the dtype are of type array of size 2. Each element is a > (nested) record array of size 2 with one field of type 'i1'. In contiguous > memory, this should look identical to an 'i1' array of size 2. That's not a valid dtype.
Array fields should be of the form (name, subdtype, shape), not ((shape, name), subdtype). I'm not sure why dtype() does not simply reject this input. In [22]: np.dtype([('top', [('nested', 'i1')], (2,))]).itemsize Out[22]: 2 -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From reakinator at gmail.com Tue Jan 6 17:51:11 2009 From: reakinator at gmail.com (Rich E) Date: Tue, 6 Jan 2009 23:51:11 +0100 Subject: [Numpy-discussion] help with typemapping a C function to use numpy arrays In-Reply-To: References: Message-ID: This helped immensely. I feel like I am getting close to being able to accomplish what I would like with SWIG: producing a python module that can be very 'python-like', while co-existing with the c library that is very 'c-like'. There is one question still remaining though, is it possible to make the wrapped function have the same name still? Using either my_spectrumMag or spectrumMag means I have to create a number of inconsistencies between the python module and the c library. It is ideal to ignore (%ignore?) the c sms_spectrumMag and instead use the wrapped one, with the same name. But my attempts at doing this so far have not compiled because of name conflicts. Thanks for the help, I think you are doing great things with this numpy interface/typemaps system. regards, Rich On Tue, Jan 6, 2009 at 3:47 PM, Egor Zindy wrote: > Hello again, > > I really don't know what came over me when I changed your function > prototype, that wasn't a very thoughtful thing to do! > >> Maybe I should look into using an 'insertion block' of code in the >> interface file, instead of trying to typemap the array? > > Insertion blocks... is that %inline code? In which case, yes! Have a look, I > attached a new version that uses some %inline directives in the > dftmagnitude.i file. > > Basically, you can inline a new function with an easier prototype to wrap. > The function allocates memory and calls your sms_spectrumMag() function. > > my inline function: void my_spectrumMag( int sizeInMag, float *pInRect, int > *sizeOutMag, float **pOutMag) > > there's also a %rename directive: %rename (spectrumMag) my_spectrumMag; > > I had a go at defining some exceptions too (no memory and odd number of > indexes), but I'm not sure errno is the easiest way to go about it... > > Hope this helps! > > ... and the python test output: > > ~>python test_dftmagnitude.py > array: [1, 1, 2, 2] > result: [ 1.41421354 2.82842708] > > array: [1, 1, 2, 2, 3, 3, 4, 4] > result: [ 1.41421354 2.82842708 4.2426405 5.65685415] > > array: [1, 1, 2] > result: > Traceback (most recent call last): > File "test_dftmagnitude.py", line 15, in <module> > print "result:",dftmagnitude.spectrumMag(a) > IndexError: Odd number of elements in input array: 3 > > ~> > > Regards, > Egor > > > On Tue, Jan 6, 2009 at 1:06 AM, Rich E wrote: >> >> Egor, >> >> Thanks for the help. I think I want to leave the C code as-is >> however, as it is perfectly fine there not knowing 'sizeOutMag' because >> it can deduce both array sizes from one variable. There are many >> other similar cases in my code (many where the size of the array is >> known by a member of a structure passed to the function). >> >> Maybe I should look into using an 'insertion block' of code in the >> interface file, instead of trying to typemap the array?
I am thinking >> I may just be able to copy the generated code (from SWIG) into my >> interface file to do this, but I have not tried it yet. >> >> I will experiment a little and post again. Thanks and happy holidays! >> >> regards, >> Rich >> >> On Mon, Jan 5, 2009 at 10:42 AM, Egor Zindy wrote: >> > Hello Rich, >> > >> > sorry it took so long to answer back, holidays and all :-) >> > >> > That's exactly the kind of SWIG / numpy.i problems I've been working on >> > over >> > the past few months: How to generate an array you don't know the size of >> > a-priori, and then handle the memory deallocation seamlessly. In your >> > case, >> > you know that the output array will be half the size of the input array, >> > but >> > this falls under the more general case of "not knowing the output size >> > a-priori". >> > >> > Have a look at the files attached. I've rewritten your function header >> > as: >> > void sms_spectrumMag( int sizeInMag, float *pInRect, int *sizeOutMag, >> > float >> > **pOutMag); >> > >> > Easy to see what the input and output arrays are now. Then my numpy.i >> > handles the memory deallocation of the **pOutMag array. >> > >> > I've actually moved my numpy.i explanations to the scipy/numpy cookbook >> > last >> > week :-) >> > http://www.scipy.org/Cookbook/SWIG_Memory_Deallocation >> > >> > Hope it all makes sense. If you have any questions, don't hesitate! >> > >> >>python test_dftmagnitude.py >> > [1, 1, 2, 2] >> > [ 1.41421354 2.82842708] >> > [1, 1, 2, 2, 3, 3, 4, 4] >> > [ 1.41421354 2.82842708 4.2426405 5.65685415] >> > [1, 1, 2, 2, 3, 3, 4, 4, 5, 5] >> > [ 1.41421354 2.82842708 4.2426405 5.65685415 7.07106781] >> > >> > Regards, >> > Egor >> > >> > On Wed, Dec 24, 2008 at 1:52 AM, Rich E wrote: >> >> >> >> Hi list, >> >> >> >> My question has to do with the Numpy/SWIG typemapping system. >> >> >> >> I recently got the typemaps in numpy.i to work on most of my C >> >> functions that are wrapped using SWIG, if they have arguments of the >> >> form (int sizeArray, float *pArray). >> >> >> >> Now I am trying to figure out how to wrap functions that aren't of that >> >> form, such as the following function: >> >> >> >> /*! \brief compute magnitude spectrum of a DFT >> >> * >> >> * \param sizeMag size of output Magnitude (half of input >> >> real FFT) >> >> * \param pFReal pointer to input FFT real array >> >> (real/imag floats) >> >> * \param pFMAg pointer to float array of magnitude spectrum >> >> */ >> >> void sms_spectrumMag( int sizeMag, float *pInRect, float *pOutMag) >> >> { >> >> int i, it2; >> >> float fReal, fImag; >> >> >> >> for (i=0; i<sizeMag; i++) >> >> { >> >> it2 = i << 1; >> >> fReal = pInRect[it2]; >> >> fImag = pInRect[it2+1]; >> >> pOutMag[i] = sqrtf(fReal * fReal + fImag * fImag); >> >> } >> >> } >> >> >> >> There are two arrays, one is half the size of the other. But, SWIG >> >> doesn't know this, according to the type map it will think *pInRect is >> >> of size sizeMag and will not know anything about *pOutMag. >> >> >> >> Ideally in python, I would like to call the function as >> >> sms_spectrumMag(nArray1, nArray2), where nArray1 is twice the size of >> >> nArray2, and nArray2 is of size sizeMag. >> >> >> >> I think in order to do this (although if someone has a better >> >> suggestion, I am open to it), I will have to modify the typemap in >> >> order to tell SWIG how to call the C function properly. I do not want >> >> to have to edit the wrapped C file every time it is regenerated from >> >> the interface file.
>> >> >> >> >> >> Here is a start I made with the existing typemap code in numpy.i (not >> >> working): >> >> >> >> /* Typemap suite for (DIM_TYPE DIM1, DATA_TYPE* INPLACE_ARRAY1) >> >> */ >> >> %typecheck(SWIG_TYPECHECK_DOUBLE_ARRAY, >> >> fragment="NumPy_Macros") >> >> (DIM_TYPE DIM1, DATA_TYPE* INPLACE_ARRAY1) >> >> { >> >> $1 = is_array($input) && PyArray_EquivTypenums(array_type($input), >> >> DATA_TYPECODE); >> >> } >> >> %typemap(in, >> >> fragment="NumPy_Fragments") >> >> (DIM_TYPE DIM1, DATA_TYPE* INPLACE_ARRAY1) >> >> (PyArrayObject* array=NULL, int i=0) >> >> { >> >> array = obj_to_array_no_conversion($input, DATA_TYPECODE); >> >> if (!array || !require_dimensions(array,1) || >> >> !require_contiguous(array) >> >> || !require_native(array)) SWIG_fail; >> >> $1 = 1; >> >> for (i=0; i < array_numdims(array); ++i) $1 *= array_size(array,i); >> >> $2 = (DATA_TYPE*) array_data(array); >> >> } >> >> >> >> and try to alter it to allow for a conversion of type: >> >> (DIM_TYPE DIM1, DATA_TYPE* ARRAY1, DATA_TYPE* ARRAY2) >> >> where ARRAY1 is size DIM1 * 2 and ARRAY2 is size DIM1. Then I can >> >> %apply this to my function that I mentioned in the last post. >> >> >> >> So here are my first two questions: >> >> >> >> 1) where is DIM1 used to declare the array size? I don't see where it >> >> is used at all, and I need to somewhere multiply it by 2 to declare >> >> the size of ARRAY1 >> >> >> >> 2) I am not understanding where $input comes from, so I do not >> >> understand how to distinguish between ARRAY1 and ARRAY2. In the >> >> attempt I have already tried, I think I just use the pointer to ARRAY1 >> >> twice. >> >> >> >> If anyone has suggestions on how to solve this problem, thanks! >> >> >> >> regards, >> >> Rich >> >> _______________________________________________ >> >> Numpy-discussion mailing list >> >> Numpy-discussion at scipy.org >> >> http://projects.scipy.org/mailman/listinfo/numpy-discussion >> > >> > > > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > From igorsyl at gmail.com Tue Jan 6 23:04:49 2009 From: igorsyl at gmail.com (Igor Sylvester) Date: Tue, 6 Jan 2009 22:04:49 -0600 Subject: [Numpy-discussion] record array with array elements In-Reply-To: <3d375d730901061341x5af96075h16678408ec33472c@mail.gmail.com> References: <3d375d730901061341x5af96075h16678408ec33472c@mail.gmail.com> Message-ID: If array fields should be of the form (name,subdtype, shape), how do I specify field offsets? My datatype is word-aligned. Thanks. On Tue, Jan 6, 2009 at 3:41 PM, Robert Kern wrote: > On Tue, Jan 6, 2009 at 14:07, Igor Sylvester wrote: > > Everyone, > > > > Shouldn't the itemsize below be 2? > > > >>>> import numpy as np > >>>> dtype = np.dtype( [ (((2,), 'top'), [('nested', 'i1')]) ] ) > >>>> dtype.itemsize > > 1 > >>>> np.__version__ > > '1.0.4' > > > > The elements of the dtype are of type array of size 2. Each element is a > > (nested) record array of size 2 with one field of type 'i1'. In > contiguous > > memory, this should look identical to an 'i1' array of size 2. > > That's not a valid dtype. Array fields should be of the form (name, > subdtype, shape), not ((shape, name), subdtype). I'm not sure why > dtype() does not simply reject this input. 
> > In [22]: np.dtype([('top', [('nested', 'i1')], (2,))]).itemsize > Out[22]: 2 > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." > -- Umberto Eco > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Tue Jan 6 23:21:15 2009 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 6 Jan 2009 22:21:15 -0600 Subject: [Numpy-discussion] record array with array elements In-Reply-To: References: <3d375d730901061341x5af96075h16678408ec33472c@mail.gmail.com> Message-ID: <3d375d730901062021v3f3cd03cj3deac5339f51672f@mail.gmail.com> On Tue, Jan 6, 2009 at 22:04, Igor Sylvester wrote: > If array fields should be of the form (name,subdtype, shape), how do I > specify field offsets? My datatype is word-aligned. With dtype(some_list), you need to explicitly include the padding. E.g. ('', '|V4') to add 4 bytes of padding. Alternately, you can use dtype(some_dict): dtype(dict( names=['x', 'y'], formats=['u1', ('u1', (2,2))], offsets=[0, 4], )) -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From matthieu.brucher at gmail.com Wed Jan 7 02:58:56 2009 From: matthieu.brucher at gmail.com (Matthieu Brucher) Date: Wed, 7 Jan 2009 08:58:56 +0100 Subject: [Numpy-discussion] help with typemapping a C function to use numpy arrays In-Reply-To: References: Message-ID: 2009/1/6 Rich E : > This helped immensely. I feel like I am getting close to being able > to accomplish what I would like with SWIG: producing a python module > that can be very 'python-like', while co-existing with the c library > that is very 'c-like'. > > There is one question still remaining though, is it possible to make > the wrapped function have the same name still? Using either > my_spectrumMag or spectrumMag means I have to create a number of > inconsistencies between the python module and the c library. It is > ideal to ignore (%ignore?) the c sms_spectrumMag and instead use the > wrapped one, with the same name. But my attempts at doing this so far > have not compiled because of name conflicts. Of course you can. The function is renamed only if you say so. Perhaps you can provide a small example of what doesn't work at the moment ? > Thanks for the help, I think you are doing great things with this > numpy interface/typemaps system. Matthieu -- Information System Engineer, Ph.D. Website: http://matthieu-brucher.developpez.com/ Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92 LinkedIn: http://www.linkedin.com/in/matthieubrucher From pommereau at univ-paris12.fr Wed Jan 7 07:37:53 2009 From: pommereau at univ-paris12.fr (Franck Pommereau) Date: Wed, 07 Jan 2009 13:37:53 +0100 Subject: [Numpy-discussion] [Newbie] Fast plotting In-Reply-To: <1231253131.6741.50.camel@sebook> References: <496325C5.4000505@univ-paris12.fr> <1231253131.6741.50.camel@sebook> Message-ID: <4964A221.9070808@univ-paris12.fr> Hi all, First, let me say that I'm impressed: this mailing list is probably the most reactive I've ever seen.
I've asked my first question and got immediately more solutions than time to test them... Many thanks to all the answerers. Using the various proposals, I ran two performance tests: - test 1: 2000000 random values - test 2: 1328724 values from my real use case Here are the various functions and how they perform: def f0 (x, y) : """Initial version test 1 CPU times: 13.37s test 2 CPU times: 5.92s """ s, n = {}, {} for a, b in zip(x, y) : s[a] = s.get(a, 0.0) + b n[a] = n.get(a, 0) + 1 return (numpy.array([a for a in sorted(s)]), numpy.array([s[a]/n[a] for a in sorted(s)])) def f1 (x, y) : """Alan G Isaac Modified in order to sort the result only once. test 1 CPU times: 10.86s test 2 CPU times: 2.78s defaultdict indeed speeds things up, probably avoiding one of two sorts is good also """ s, n = defaultdict(float), defaultdict(int) for a, b in izip(x, y) : s[a] += b n[a] += 1 new_x = numpy.array([a for a in sorted(s)]) return (new_x, numpy.array([s[a]/n[a] for a in new_x])) def f2 (x, y) : """Francesc Alted Modified with preallocation of arrays (it appeared faster) test 1: killed after more than 10 minutes test 2 CPU times: 22.01s This result is not surprising as I suspect quadratic complexity: one pass for each unique value in x, and presumably one nested pass to compute y[x==i] """ u = numpy.unique(x) m = numpy.empty(len(u)) for pos, i in enumerate(u) : g = y[x == i] m[pos] = g.mean() return u, m def f3 (x, y) : """Sebastian Stephan Berg Modified because I can always work in place. test 1 CPU times: 17.43s test 2 CPU times: 0.21s Adopted! This is definitely the fastest one when using real values. I tried to preallocate arrays by setting u=numpy.unique(x) and then looping on u, but the result is slower, probably because of unique() Compared with f1, it's slower on larger arrays of random values. It may be explained by a complexity argument: f1 has linear complexity (two passes in sequence) while f3 is probably N log N (a sequence of one sort, two passes to set x[:] and y[:] and one loop on each distinct value with a nested searchsorted that is probably logarithmic). But real values are far from random, and the sort is probably more efficient, and the while loop is shorter because there are fewer values. """ s = x.argsort() x[:] = x[s] y[:] = y[s] u, means, start, value = [], [], 0, x[0] while True: next = x.searchsorted(value, side='right') u.append(value) means.append(y[start:next].mean()) if next == len(x): break value = x[next] start = next return numpy.array(u), numpy.array(means) def f4 (x, y) : """Jean-Baptiste Rudant test 1 CPU times: 111.21s test 2 CPU times: 13.48s As Jean-Baptiste noticed, this solution is not very efficient (but works almost off-the-shelf). """ recXY = numpy.rec.fromarrays((x, x), names='x, y') return matplotlib.mlab.rec_groupby(recXY, ('x',), (('y', numpy.mean, 'y_avg'),)) A few more remarks. Sebastian Stephan Berg wrote: > Just thinking. If the parameters are limited, you may be able to use the > histogram feature? Doing one histogram with Y as weights, then one > without weights and calculating the mean from this yourself should be > pretty speedy I imagine. I'm afraid I don't know what the histogram function computes. But this may be something worth investigating because I think I'll need it later on in order to smooth my graphs (by plotting mean values on intervals).
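For the record, here is a rough sketch of how I understand the idea could work when x only takes integer values (untested on my real data, and the names are only illustrative): a weighted histogram sums the y values falling in each bin, an unweighted one counts them, and the ratio of the two is the mean of y for each distinct x.

import numpy

x = numpy.random.randint(0, 10, 1000000)   # stand-in for the parameters
y = numpy.random.random(1000000)           # stand-in for the timings

bins = numpy.arange(x.min(), x.max() + 2)  # one bin per integer value of x
sums, edges = numpy.histogram(x, bins=bins, weights=y)
counts, edges = numpy.histogram(x, bins=bins)
means = sums / numpy.maximum(counts, 1)    # guard against empty bins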
Bruce Southey wrote: > If you use Knuth's one pass approach > (http://en.wikipedia.org/wiki/Algorithms_for_calculating_variance#III._On-line_algorithm) > you can write a function to get the min, max, mean and variance/standard > deviation in a single pass through the array rather than one pass for > each. I do not know if this will provide any advantage as that will > probably depend on the size of the arrays. If I understood correctly, this algorithm computes the variance of a whole array; I can see how to adapt it to compute mean (already done by the algorithm), max, min, etc., but I did not see how it can be adapted to my case. > Also, please use the highest precision possible (ie float128) for your > arrays to minimize numerical error due to the size of your arrays. Thanks for the advice! So, thank you again everybody. Cheers, Franck From nicolas.roux at st.com Wed Jan 7 10:19:16 2009 From: nicolas.roux at st.com (Nicolas ROUX) Date: Wed, 7 Jan 2009 16:19:16 +0100 Subject: [Numpy-discussion] Numpy performance vs Matlab. Message-ID: <004401c970db$4e829820$e7ad810a@gnb.st.com> Hi, I need help ;-) I have here a testcase which works much faster in Matlab than Numpy. The following code takes less than 0.9sec in Matlab, but 21sec in Python. Numpy is 24 times slower than Matlab ! The big trouble I have is that a large team of people within my company is ready to replace Matlab by Numpy/Scipy/Matplotlib, but I have to demonstrate that this kind of Python code is executed with the same performance as Matlab, without writing a C extension. This is becoming a critical point for us. This is a testcase that people would like to see working without any code restructuring. The reasons are: - this way of writing is fairly natural. - the original code which showed me the matlab/Numpy performance differences is much more complex, and can't benefit from broadcasting or other numpy tips (I can later give this code) ...So I really need to use the code below, without restructuring. Numpy/Python code: ##################################################################### import numpy import time print "Start test \n" dim = 3000 a = numpy.zeros((dim,dim,3)) start = time.clock() for i in range(dim): for j in range(dim): a[i,j,0] = a[i,j,1] a[i,j,2] = a[i,j,0] a[i,j,1] = a[i,j,2] end = time.clock() - start print "Test done, %f sec" % end ##################################################################### Matlab code: ##################################################################### 'Start test' dim = 3000; tic; a = zeros(dim,dim,3); for i = 1:dim for j = 1:dim a(i,j,1) = a(i,j,2); a(i,j,2) = a(i,j,1); a(i,j,3) = a(i,j,3); end end toc 'Test done' ##################################################################### Any idea on it? Did I miss something? Thanks a lot in advance for your help. Cheers, Nicolas. From david at ar.media.kyoto-u.ac.jp Wed Jan 7 10:16:46 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Thu, 08 Jan 2009 00:16:46 +0900 Subject: [Numpy-discussion] Numpy performance vs Matlab. In-Reply-To: <004401c970db$4e829820$e7ad810a@gnb.st.com> References: <004401c970db$4e829820$e7ad810a@gnb.st.com> Message-ID: <4964C75E.6020103@ar.media.kyoto-u.ac.jp> Nicolas ROUX wrote: > Hi, > > I need help ;-) > I have here a testcase which works much faster in Matlab than Numpy. > > The following code takes less than 0.9sec in Matlab, but 21sec in Python. > Numpy is 24 times slower than Matlab !
> The big trouble I have is a large team of people within my company is ready to replace Matlab by Numpy/Scipy/Matplotlib, > but I have to demonstrate that this kind of Python Code is executed with the same performance than Matlab, without writing C extension. > This is becoming a critical point for us. > > This is a testcase that people would like to see working without any code restructuring. > The reasons are: > - this way of writing is fairly natural. > - the original code which showed me the matlab/Numpy performance differences is much more complex, > and can't benefit from broadcasting or other numpy tips (I can later give this code) > > ...So I really need to use the code below, without restructuring. > > Numpy/Python code: > ##################################################################### > import numpy > import time > > print "Start test \n" > > dim = 3000 > > a = numpy.zeros((dim,dim,3)) > > start = time.clock() > > for i in range(dim): > for j in range(dim): > a[i,j,0] = a[i,j,1] > a[i,j,2] = a[i,j,0] > a[i,j,1] = a[i,j,2] > > end = time.clock() - start > > print "Test done, %f sec" % end > ##################################################################### > > Matlab code: > ##################################################################### > 'Start test' > dim = 3000; > tic; > a =zeros(dim,dim,3); > for i = 1:dim > for j = 1:dim > a(i,j,1) = a(i,j,2); > a(i,j,2) = a(i,j,1); > a(i,j,3) = a(i,j,3); > end > end > toc > 'Test done' > ##################################################################### > > Any idea on it ? > Did I missed something ? > I think on recent versions of matlab, there is nothing you can do without modifying the code: matlab has some JIT compilation for loops, which is supposed to speed up those cases - at least, that's what is claimed by matlab. The above loops are typical examples where this should work reasonably well I believe: http://www.mathworks.com/access/helpdesk_r13/help/techdoc/matlab_prog/ch7_pe10.html If you really have to use loops, then matlab will be faster. But maybe you don't; can you show us a more typical example ? cheers, David From rmay31 at gmail.com Wed Jan 7 10:44:50 2009 From: rmay31 at gmail.com (Ryan May) Date: Wed, 07 Jan 2009 09:44:50 -0600 Subject: [Numpy-discussion] Numpy performance vs Matlab. In-Reply-To: <004401c970db$4e829820$e7ad810a@gnb.st.com> References: <004401c970db$4e829820$e7ad810a@gnb.st.com> Message-ID: <4964CDF2.9090308@gmail.com> Nicolas ROUX wrote: > Hi, > > I need help ;-) > I have here a testcase which works much faster in Matlab than Numpy. > > The following code takes less than 0.9sec in Matlab, but 21sec in Python. > Numpy is 24 times slower than Matlab ! > The big trouble I have is a large team of people within my company is ready to replace Matlab by Numpy/Scipy/Matplotlib, > but I have to demonstrate that this kind of Python Code is executed with the same performance than Matlab, without writing C extension. > This is becoming a critical point for us. > > This is a testcase that people would like to see working without any code restructuring. > The reasons are: > - this way of writing is fairly natural. > - the original code which showed me the matlab/Numpy performance differences is much more complex, > and can't benefit from broadcasting or other numpy tips (I can later give this code) > > ...So I really need to use the code below, without restructuring. 
> > Numpy/Python code: > ##################################################################### > import numpy > import time > > print "Start test \n" > > dim = 3000 > > a = numpy.zeros((dim,dim,3)) > > start = time.clock() > > for i in range(dim): > for j in range(dim): > a[i,j,0] = a[i,j,1] > a[i,j,2] = a[i,j,0] > a[i,j,1] = a[i,j,2] > > end = time.clock() - start > > print "Test done, %f sec" % end > ##################################################################### > Any idea on it ? > Did I missed something ? I think you may have reduced the complexity a bit too much. The python code above sets all of the elements equal to a[i,j,1]. Is there any reason you can't use slicing to avoid the loops? Ryan -- Ryan May Graduate Research Assistant School of Meteorology University of Oklahoma From matthieu.brucher at gmail.com Wed Jan 7 10:53:36 2009 From: matthieu.brucher at gmail.com (Matthieu Brucher) Date: Wed, 7 Jan 2009 16:53:36 +0100 Subject: [Numpy-discussion] Numpy performance vs Matlab. In-Reply-To: <004401c970db$4e829820$e7ad810a@gnb.st.com> References: <004401c970db$4e829820$e7ad810a@gnb.st.com> Message-ID: > for i in range(dim): > for j in range(dim): > a[i,j,0] = a[i,j,1] > a[i,j,2] = a[i,j,0] > a[i,j,1] = a[i,j,2] > for i = 1:dim > for j = 1:dim > a(i,j,1) = a(i,j,2); > a(i,j,2) = a(i,j,1); > a(i,j,3) = a(i,j,3); > end > end Hi, The two loops are not the same. As David stated, with JIT, the loops may be vectorized by Matlab on the fly. -- Information System Engineer, Ph.D. Website: http://matthieu-brucher.developpez.com/ Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92 LinkedIn: http://www.linkedin.com/in/matthieubrucher From chaos.proton at gmail.com Wed Jan 7 10:58:40 2009 From: chaos.proton at gmail.com (Grissiom) Date: Wed, 7 Jan 2009 23:58:40 +0800 Subject: [Numpy-discussion] Numpy performance vs Matlab. In-Reply-To: <4964CDF2.9090308@gmail.com> References: <004401c970db$4e829820$e7ad810a@gnb.st.com> <4964CDF2.9090308@gmail.com> Message-ID: On Wed, Jan 7, 2009 at 23:44, Ryan May wrote: > Nicolas ROUX wrote: > > Hi, > > > > I need help ;-) > > I have here a testcase which works much faster in Matlab than Numpy. > > > > The following code takes less than 0.9sec in Matlab, but 21sec in Python. > > Numpy is 24 times slower than Matlab ! > > The big trouble I have is a large team of people within my company is > ready to replace Matlab by Numpy/Scipy/Matplotlib, > > but I have to demonstrate that this kind of Python Code is executed with > the same performance than Matlab, without writing C extension. > > This is becoming a critical point for us. > > > > This is a testcase that people would like to see working without any code > restructuring. > > The reasons are: > > - this way of writing is fairly natural. > > - the original code which showed me the matlab/Numpy performance > differences is much more complex, > > and can't benefit from broadcasting or other numpy tips (I can later give > this code) > > > > ...So I really need to use the code below, without restructuring. 
> > > > Numpy/Python code: > > ##################################################################### > > import numpy > > import time > > > > print "Start test \n" > > > > dim = 3000 > > > > a = numpy.zeros((dim,dim,3)) > > > > start = time.clock() > > > > for i in range(dim): > > for j in range(dim): > > a[i,j,0] = a[i,j,1] > > a[i,j,2] = a[i,j,0] > > a[i,j,1] = a[i,j,2] > > > > end = time.clock() - start > > > > print "Test done, %f sec" % end > > ##################################################################### > > > Any idea on it ? > > Did I missed something ? > > I think you may have reduced the complexity a bit too much. The python > code > above sets all of the elements equal to a[i,j,1]. Is there any reason you > can't > use slicing to avoid the loops? > > Yes, I think so. I think the testcase is a matter of python loop vs matlab loop rather than python vs matlab. -- Cheers, Grissiom -------------- next part -------------- An HTML attachment was scrubbed... URL: From jdh2358 at gmail.com Wed Jan 7 11:14:44 2009 From: jdh2358 at gmail.com (John Hunter) Date: Wed, 7 Jan 2009 10:14:44 -0600 Subject: [Numpy-discussion] [Newbie] Fast plotting In-Reply-To: <4964A221.9070808@univ-paris12.fr> References: <496325C5.4000505@univ-paris12.fr> <1231253131.6741.50.camel@sebook> <4964A221.9070808@univ-paris12.fr> Message-ID: <88e473830901070814p7d4fe368t2b908e01ab44b9ea@mail.gmail.com> On Wed, Jan 7, 2009 at 6:37 AM, Franck Pommereau wrote: > def f4 (x, y) : > """Jean-Baptiste Rudant > > test 1 CPU times: 111.21s > test 2 CPU times: 13.48s > > As Jean-Baptiste noticed, this solution is not very efficient (but > works almost of-the-shelf). > """ > recXY = numpy.rec.fromarrays((x, x), names='x, y') > return matplotlib.mlab.rec_groupby(recXY, ('x',), > (('y', numpy.mean, 'y_avg'),)) This probably will have no impact on your tests, but this looks like a bug. You probably mean: recXY = numpy.rec.fromarrays((x, y), names='x, y') Could you post the code you use to generate you inputs (ie what is x?) I will look into trying some of the suggestions here to improve the performance on rec_groupby. One thing that slows it down is that it supports an arbitrary number of keys -- eg groupby ('year', 'month') -- whereas the examples above are using a single value lookup. JDH From pommereau at univ-paris12.fr Wed Jan 7 11:33:54 2009 From: pommereau at univ-paris12.fr (Franck Pommereau) Date: Wed, 07 Jan 2009 17:33:54 +0100 Subject: [Numpy-discussion] [Newbie] Fast plotting In-Reply-To: <88e473830901070814p7d4fe368t2b908e01ab44b9ea@mail.gmail.com> References: <496325C5.4000505@univ-paris12.fr> <1231253131.6741.50.camel@sebook> <4964A221.9070808@univ-paris12.fr> <88e473830901070814p7d4fe368t2b908e01ab44b9ea@mail.gmail.com> Message-ID: <4964D972.80502@univ-paris12.fr> > This probably will have no impact on your tests, but this looks like a > bug. You probably mean: > > recXY = numpy.rec.fromarrays((x, y), names='x, y') Sure! Thanks. > Could you post the code you use to generate you inputs (ie what is x?) My code is probably not usable by somebody else than me. I'm presently too busy to clean it and add comments. But as soon as I'll be able to do so, I'll send you the usable version. Cheers, Franck From josef.pktd at gmail.com Wed Jan 7 11:36:27 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 7 Jan 2009 11:36:27 -0500 Subject: [Numpy-discussion] Numpy performance vs Matlab. 
In-Reply-To: References: <004401c970db$4e829820$e7ad810a@gnb.st.com> <4964CDF2.9090308@gmail.com> Message-ID: <1cd32cbb0901070836n56aa3550h8bedf8aad186a581@mail.gmail.com> On Wed, Jan 7, 2009 at 10:58 AM, Grissiom wrote: > On Wed, Jan 7, 2009 at 23:44, Ryan May wrote: >> >> Nicolas ROUX wrote: >> > Hi, >> > >> > I need help ;-) >> > I have here a testcase which works much faster in Matlab than Numpy. >> > >> > The following code takes less than 0.9sec in Matlab, but 21sec in >> > Python. >> > Numpy is 24 times slower than Matlab ! >> > The big trouble I have is a large team of people within my company is >> > ready to replace Matlab by Numpy/Scipy/Matplotlib, >> > but I have to demonstrate that this kind of Python Code is executed with >> > the same performance than Matlab, without writing C extension. >> > This is becoming a critical point for us. >> > >> > This is a testcase that people would like to see working without any >> > code restructuring. >> > The reasons are: >> > - this way of writing is fairly natural. >> > - the original code which showed me the matlab/Numpy performance >> > differences is much more complex, >> > and can't benefit from broadcasting or other numpy tips (I can later >> > give this code) >> > >> > ...So I really need to use the code below, without restructuring. >> > >> > Numpy/Python code: >> > ##################################################################### >> > import numpy >> > import time >> > >> > print "Start test \n" >> > >> > dim = 3000 >> > >> > a = numpy.zeros((dim,dim,3)) >> > >> > start = time.clock() >> > >> > for i in range(dim): >> > for j in range(dim): >> > a[i,j,0] = a[i,j,1] >> > a[i,j,2] = a[i,j,0] >> > a[i,j,1] = a[i,j,2] >> > >> > end = time.clock() - start >> > >> > print "Test done, %f sec" % end >> > ##################################################################### >> >> > Any idea on it ? >> > Did I missed something ? >> >> I think you may have reduced the complexity a bit too much. The python >> code >> above sets all of the elements equal to a[i,j,1]. Is there any reason you >> can't >> use slicing to avoid the loops? >> > > Yes, I think so. I think the testcase is a matter of python loop vs matlab > loop rather than python vs matlab. > > -- > Cheers, > Grissiom > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > I tried with matlab 2006a, I don't know if there is JIT, but the main speed difference comes with the numpy array access. 
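To make that concrete, this is roughly the harness behind the numbers
below -- a sketch rather than the exact script I ran, and the function
names are mine:

import time
import numpy

dim = 3000
a = numpy.zeros((dim, dim, 3))

def loop_plain():
    # pure-Python work inside the double loop, no numpy indexing at all
    for i in range(dim):
        for j in range(dim):
            b = 1.0

def loop_indexed():
    # the same loop, but with the element-wise numpy assignments
    for i in range(dim):
        for j in range(dim):
            a[i, j, 0] = a[i, j, 1]
            a[i, j, 2] = a[i, j, 0]
            a[i, j, 1] = a[i, j, 2]

for f in (loop_plain, loop_indexed):
    start = time.clock()
    f()
    print "%-12s %f sec" % (f.__name__, time.clock() - start)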
The test is actually biased in favor of python, since in the matlab code the initialization with zeros is inside the time count, but outside in the python version If I just put b=1.0 inside the double loop (no numpy) Python 1.453644 sec matlab 0.335249 seconds, with zeros outside loop: 0.060582 seconds with original array assignment: python/numpy 32.745030 sec matlab 1.633415 seconds, with zeros outside loop: 1.251597 seconds (putting the loop in a function and using psyco reduces speed by 30%) Josef From reakinator at gmail.com Wed Jan 7 11:43:07 2009 From: reakinator at gmail.com (Rich E) Date: Wed, 7 Jan 2009 17:43:07 +0100 Subject: [Numpy-discussion] help with typemapping a C function to use numpy arrays In-Reply-To: References: Message-ID: Here is my example, trying to wrap the function sms_spectrumMag that we have been dealing with: %apply (int DIM1, float* IN_ARRAY1) {(int sizeInArray, float* pInArray)}; %apply (int DIM1, float* INPLACE_ARRAY1) {(int sizeOutArray, float* pOutArray)}; %inline %{ void my_spectrumMag( int sizeInArray, float *pInArray, int sizeOutArray, float *pOutArray) { sms_spectrumMag(sizeOutArray, pInArray, pOutArray); } %} at this point, have the new function my_spectrumMag that wraps sms_spectrumMag() and provides arguments that can be typemapped using numpy.i Now, I don't want to have to call the function my_spectrumMag() in python, I want to use the original name, I would like to call the function as: sms_spectrumMag(numpyArray1, numpyArray2) But, trying to %rename my_spectrumMag to sms_spectrumMag does not work, the original sms_spectrumMag gets called in python instead. Trying to %ignore the original function first as follows removes the sms_spectrumMag completely from the module and I am left with my_spectrumMag: %ignore sms_spectrumMag; %rename (sms_spectrumMag) my_spectrumMag; Do you see my problem? On Wed, Jan 7, 2009 at 8:58 AM, Matthieu Brucher wrote: > 2009/1/6 Rich E : >> This helped immensely. I feel like I am getting close to being able >> to accomplish what I would like with SWIG: producing a python module >> that can be very 'python-like', while co-existing with the c library >> that is very 'c-like'. >> >> There is one question still remaining though, is it possible to make >> the wrapped function have the same name still? Using either >> my_spectrumMag or spectrumMag means I have to create a number of >> inconsistencies between the python module and the c library. It is >> ideal to ignore (%ignore?) the c sms_spectrumMag and instead use the >> wrapped one, with the same name. But my attempts at doing this so far >> have not compiled because of name conflictions. > > Ok course you can. The function is renamed only if you say so. Perhaps > can you provide a small example of what doesn't work at the moment ? > >> Thanks for the help, I think you are doing great things with this >> numpy interface/typemaps system. > > Matthieu > -- > Information System Engineer, Ph.D. > Website: http://matthieu-brucher.developpez.com/ > Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92 > LinkedIn: http://www.linkedin.com/in/matthieubrucher > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From josef.pktd at gmail.com Wed Jan 7 12:27:07 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 7 Jan 2009 12:27:07 -0500 Subject: [Numpy-discussion] Numpy performance vs Matlab. 
In-Reply-To: <1cd32cbb0901070836n56aa3550h8bedf8aad186a581@mail.gmail.com> References: <004401c970db$4e829820$e7ad810a@gnb.st.com> <4964CDF2.9090308@gmail.com> <1cd32cbb0901070836n56aa3550h8bedf8aad186a581@mail.gmail.com> Message-ID: <1cd32cbb0901070927v43e56df2x44e5e764ea9a29a7@mail.gmail.com> A test case closer to my applications is calling functions in loops: Python ----------------------------------- def assgn(a,i,j): a[i,j,0] = a[i,j,1] + 1.0 a[i,j,2] = a[i,j,0] a[i,j,1] = a[i,j,2] return a print "Start test \n" dim = 300#0 a = numpy.zeros((dim,dim,3)) start = time.clock() for i in range(dim): for j in range(dim): assgn(a,i,j) end = time.clock() - start assert numpy.max(a)==1.0 #added to check inplace substitution print "Test done, %f sec" % end --------------------------- matlab: ------------------------------------------------------------------ function a = tryloopspeed() 'Start test' dim = 300; a = zeros(dim,dim,3); tic; for i = 1:dim for j = 1:dim a = assgn(a,i,j); end end toc 'Test done' end function a = assgn(a,i,j) a(i,j,1) = a(i,j,2); a(i,j,2) = a(i,j,1); a(i,j,3) = a(i,j,3); end --------------------------- Note: I had to reduce the size of the matrix because I got impatient waiting for matlab time: python: Test done, 0.486127 sec matlab: >> output = tryloopspeed(); ans = Start test Elapsed time is 511.815971 seconds. ans = Test done >> 511.815971/60.0 #minutes ans = 8.530 matlab takes 1053 times the time of python The problem is that at least in my version of matlab, it copies function arguments when they are modified. It's possible to work around this, but not very clean. So for simple loops python looses, but for other things, python wins by a huge margin. Unless somebody can spot a mistake in my timing. Josef From Chris.Barker at noaa.gov Wed Jan 7 12:51:58 2009 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Wed, 07 Jan 2009 09:51:58 -0800 Subject: [Numpy-discussion] Numpy performance vs Matlab. In-Reply-To: <4964C75E.6020103@ar.media.kyoto-u.ac.jp> References: <004401c970db$4e829820$e7ad810a@gnb.st.com> <4964C75E.6020103@ar.media.kyoto-u.ac.jp> Message-ID: <4964EBBE.7020706@noaa.gov> > Nicolas ROUX wrote: >> The big trouble I have is a large team of people within my company is ready to replace Matlab by Numpy/Scipy/Matplotlib, we like that! >> This is a testcase that people would like to see working without any code restructuring. >> The reasons are: >> - this way of writing is fairly natural. Only if you haven't wrapped your brain around array-oriented programming! (see below) >> - the original code which showed me the matlab/Numpy performance differences is much more complex, >> and can't benefit from broadcasting or other numpy tips (I can later give this code) so you're asking: "how can I make this code faster without changing it?" The only way to do that is to change python or numpy, and while it might be nice to do that to improve performance in this type of case, it's a tall order! It's really not a good goal, anyway -- python/numpy is by no means a drop-in replacement for MATLAB -- they are very different beasts. Personally, I think most of the differences favor Python, but if you try to write python the same way you'd write MATLAB, you'll lose most of the benefits -- you might as well stick with MATLAB. However, in this case, MATLAB was traditionally slow with loops and indexing and needed to be vectorized for decent performance as well. 
It looks like they now have a nice JIT compiler for this sort of thing --
to get a similar effect in numpy, you'll need to use weave or Cython or
something, notably not as easy as having the interpreter just do it for
you. I'd love to see a numpy-aware psyco some day, and maybe the new
buffer interface will facilitate that, but it's inherently harder with
numpy -- MATLAB at least used to be limited to 2-d arrays of doubles, so
far less special casing to be done.

Even with this nifty JIT, I think Python has many advantages -- if your
code is well written, there will be only a few places with these sorts of
performance bottlenecks, and weave or Cython, or SWIG, or Ctypes, or f2py
can all give you a good solution.

One other thought -- could numexpr help here?

About array-oriented programming:

A lot of folks seem to think that the only reason to "vectorize" code in
MATLAB, numpy, etc., is for better performance. If MATLAB now has a good
JIT, then there is no point -- I think that's a mistake. If you write
your code to work with arrays of data, you get more compact, less
bug-prone code than if you are working with indexed elements all the
time. I also think the code is clearer most of the time. I say most,
because sometimes you do need to do "tricks" to vectorize that can
obfuscate the code.

I understand that this may be a simplified example, and the real use-case
could be quite different. However:

>> a = numpy.zeros((dim,dim,3))

so we essentially have three square arrays stacked together -- what do
they represent? That might help guide you, but without that, I can still
see:

>> for i in range(dim):
>> for j in range(dim):

this really means -- for every element of the 2-d arrays, which can be
written as: a[:,:]

>> a[i,j,0] = a[i,j,1]
>> a[i,j,2] = a[i,j,0]
>> a[i,j,1] = a[i,j,2]

and this is simply swapping the three around. So, if you start out
thinking in terms of a set of 2-d arrays, rather than a huge pile of
elements, the code you will arrive at is more like:

a[:,:,0] = a[:,:,1]
a[:,:,2] = a[:,:,0]
a[:,:,1] = a[:,:,2]

With no loops. Or you could give them names:

a0 = a[:,:,0]
a1 = a[:,:,1]
a2 = a[:,:,2]

then:

a0[:] = a1
a2[:] = a0
a1[:] = a2

which, of course, is really:

a[:,:,:] = a1.reshape((dim,dim,1))

but I suspect that that's the result of a typo.

-Chris

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R (206) 526-6959 voice
7600 Sand Point Way NE (206) 526-6329 fax
Seattle, WA 98115 (206) 526-6317 main reception

Chris.Barker at noaa.gov

From Chris.Barker at noaa.gov  Wed Jan  7 12:56:26 2009
From: Chris.Barker at noaa.gov (Christopher Barker)
Date: Wed, 07 Jan 2009 09:56:26 -0800
Subject: [Numpy-discussion] Numpy performance vs Matlab.
In-Reply-To: <1cd32cbb0901070927v43e56df2x44e5e764ea9a29a7@mail.gmail.com>
References: <004401c970db$4e829820$e7ad810a@gnb.st.com>	<4964CDF2.9090308@gmail.com>	<1cd32cbb0901070836n56aa3550h8bedf8aad186a581@mail.gmail.com>	<1cd32cbb0901070927v43e56df2x44e5e764ea9a29a7@mail.gmail.com>
Message-ID: <4964ECCA.7060103@noaa.gov>

josef.pktd at gmail.com wrote:
> So for simple loops python looses, but for other things, python wins
> by a huge margin.

which emphasizes the point that you can't write code the same way in the
two languages, though I'd argue that that code needs refactoring in any
language!

However, numpy's reference semantics is definitely a strong advantage
over MATLAB -- more flexibility in general.

-Chris

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R (206) 526-6959 voice
7600 Sand Point Way NE (206) 526-6329 fax
Seattle, WA 98115 (206) 526-6317 main reception

Chris.Barker at noaa.gov

From sturla at molden.no  Wed Jan  7 13:16:51 2009
From: sturla at molden.no (Sturla Molden)
Date: Wed, 07 Jan 2009 19:16:51 +0100
Subject: [Numpy-discussion] Numpy performance vs Matlab.
In-Reply-To: <4964C75E.6020103@ar.media.kyoto-u.ac.jp>
References: <004401c970db$4e829820$e7ad810a@gnb.st.com>	<4964C75E.6020103@ar.media.kyoto-u.ac.jp>
Message-ID: <4964F193.9060801@molden.no>

On 1/7/2009 4:16 PM, David Cournapeau wrote:

> I think on recent versions of matlab, there is nothing you can do
> without modifying the code: matlab has some JIT compilation for loops,
> which is supposed to speed up those cases - at least, that's what is
> claimed by matlab.

Yes it does. After using both for more than 10 years, my impression is this:

- Matlab slicing creates new arrays. NumPy slicing creates views. NumPy
is faster and more memory efficient.

- Matlab JIT compiles loops. NumPy does not. Matlab is faster for stupid
programmers that don't know how to use slices. But neither Matlab nor
Python/NumPy is meant to be used like Java.

- Python has psyco. It is about as good as Matlab's JIT. But psyco has
no knowledge of NumPy ndarrays.

- Using Cython is easier than writing Matlab MEX files.

- Python has better support for data structures, better built-in
structures (tuples, lists, dicts, sets), and general purpose libraries.
Matlab has extensive numerical toolboxes that you can buy.

- Matlab passes function arguments by value (albeit COW optimized). Python
passes references. This makes NumPy more efficient if you need to pass
large arrays or array slices.

- Matlab tends to fragment the heap (hence the pack command).
Python/NumPy does not. This makes long-running processes notoriously
unstable on Matlab.

- Matlab has some numerical libraries that are better.

- I like the Matlab command prompt and IDE better. But it's not enough to
make me want to use it.

- Python is a proper programming language. Matlab is a numerical
scripting language - good for small scripts but not complex software
systems.


Sturla Molden

From xavier.gnata at gmail.com  Wed Jan  7 13:29:41 2009
From: xavier.gnata at gmail.com (Xavier Gnata)
Date: Wed, 07 Jan 2009 19:29:41 +0100
Subject: [Numpy-discussion] Numpy performance vs Matlab.
In-Reply-To: <4964F193.9060801@molden.no>
References: <004401c970db$4e829820$e7ad810a@gnb.st.com>	<4964C75E.6020103@ar.media.kyoto-u.ac.jp>	<4964F193.9060801@molden.no>
Message-ID: <4964F495.5070200@gmail.com>

Well, it is the best pitch for numpy versus matlab I have read so far :)
(and I 100% agree)

Xavier

> On 1/7/2009 4:16 PM, David Cournapeau wrote:
>
>
>> I think on recent versions of matlab, there is nothing you can do
>> without modifying the code: matlab has some JIT compilation for loops,
>> which is supposed to speed up those cases - at least, that's what is
>> claimed by matlab.
>>
>
> Yes it does. After using both for more than 10 years, my impression is this:
>
> - Matlab slicing creates new arrays. NumPy slicing creates views. NumPy
> is faster and more memory efficient.
>
> - Matlab JIT compiles loops. NumPy does not. Matlab is faster for stupid
> programmers that don't know how to use slices. But neither Matlab nor
> Python/NumPy is meant to be used like Java.
>
> - Python has psyco. It is about as good as Matlab's JIT. But psyco has
> no knowledge of NumPy ndarrays.
>
> - Using Cython is easier than writing Matlab MEX files.
>
> - Python has better support for data structures, better built-in
> structures (tuples, lists, dicts, sets), and general purpose libraries.
> Matlab has extensive numerical toolboxes that you can buy.
>
> - Matlab passes function arguments by value (albeit COW optimized). Python
> passes references. This makes NumPy more efficient if you need to pass
> large arrays or array slices.
>
> - Matlab tends to fragment the heap (hence the pack command).
> Python/NumPy does not. This makes long-running processes notoriously
> unstable on Matlab.
>
> - Matlab has some numerical libraries that are better.
>
> - I like the Matlab command prompt and IDE better. But it's not enough to
> make me want to use it.
>
> - Python is a proper programming language. Matlab is a numerical
> scripting language - good for small scripts but not complex software
> systems.
>
>
> Sturla Molden
>
>
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>

From faltet at pytables.org  Wed Jan  7 13:31:06 2009
From: faltet at pytables.org (Francesc Alted)
Date: Wed, 7 Jan 2009 19:31:06 +0100
Subject: [Numpy-discussion] Numpy performance vs Matlab.
In-Reply-To: <4964EBBE.7020706@noaa.gov>
References: <004401c970db$4e829820$e7ad810a@gnb.st.com>	<4964C75E.6020103@ar.media.kyoto-u.ac.jp>	<4964EBBE.7020706@noaa.gov>
Message-ID: <200901071931.06474.faltet@pytables.org>

On Wednesday 07 January 2009, Christopher Barker wrote:
[clip]
> Even with this nifty JIT, I think Python has many advantages -- if
> your code is well written, there will be only a few places with
> these sorts of performance bottlenecks, and weave or Cython, or SWIG,
> or Ctypes, or f2py can all give you a good solution.

Agreed. Especially Cython, with the latest improvements for supporting
optimized NumPy indexing:

http://wiki.cython.org/tutorials/numpy

would make these loops run much faster.

> One other thought -- could numexpr help here?

I don't think so. Numexpr is for computing expressions like 'a-b**3-c'
element-wise (a, b and c are arrays) quickly. The main reason for its
high performance is that it avoids temporary copies of intermediate
results. In order to use it, you would first need to vectorize your
loops, and that is not what Nicolas wants.

Cheers,

--
Francesc Alted

From sturla at molden.no  Wed Jan  7 13:32:58 2009
From: sturla at molden.no (Sturla Molden)
Date: Wed, 07 Jan 2009 19:32:58 +0100
Subject: [Numpy-discussion] Numpy performance vs Matlab.
In-Reply-To: <4964ECCA.7060103@noaa.gov>
References: <004401c970db$4e829820$e7ad810a@gnb.st.com>	<4964CDF2.9090308@gmail.com>	<1cd32cbb0901070836n56aa3550h8bedf8aad186a581@mail.gmail.com>	<1cd32cbb0901070927v43e56df2x44e5e764ea9a29a7@mail.gmail.com>	<4964ECCA.7060103@noaa.gov>
Message-ID: <4964F55A.4070108@molden.no>

On 1/7/2009 6:56 PM, Christopher Barker wrote:

>> So for simple loops python looses, but for other things, python wins
>> by a huge margin.
>
> which emphasizes the point that you can't write code the same way in the
> two languages, though I'd argue that that code needs refactoring in any
> language!

Roux's example would be bad in either language. Slices ('vectorization'
in Matlab lingo) are preferred in both cases. It's just that neither
Matlab nor Python/NumPy was designed to be used like Java. For loops
should not be abused in Python nor in Matlab (but Matlab is more
forgiving now than it used to be).
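As a small illustration of why slicing is so cheap on the NumPy side
(a sketch; the toy array and names are mine):

import numpy as np

a = np.zeros((4, 4))
v = a[1:3, 1:3]     # basic slicing returns a view: no data is copied
v[:] = 1.0          # writing through the view modifies a itself
assert a[1, 1] == 1.0 and a[0, 0] == 0.0
assert v.base is a  # the view keeps a reference to the owner of the memory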
Sturla Molden From sturla at molden.no Wed Jan 7 13:50:54 2009 From: sturla at molden.no (Sturla Molden) Date: Wed, 07 Jan 2009 19:50:54 +0100 Subject: [Numpy-discussion] Numpy performance vs Matlab. In-Reply-To: <4964EBBE.7020706@noaa.gov> References: <004401c970db$4e829820$e7ad810a@gnb.st.com> <4964C75E.6020103@ar.media.kyoto-u.ac.jp> <4964EBBE.7020706@noaa.gov> Message-ID: <4964F98E.5060906@molden.no> On 1/7/2009 6:51 PM, Christopher Barker wrote: > Even with this nifty JIT, It is not a very nifty JIT. It can transform some simple loops into vectorized expressions. And it removes the overhead from indexing with doubles. But if you are among those that do n = length(x) m = 0 for i = 1.0 : n m = m + x(i) end m = m / n instead of m = mean(x) it will be nifty enough. > All lot of folks seem to think that the only reason to "vectorize" code > in MATLAB, numpy, etc, is for better performance. If MATLAB now has a > good JIT, then there is no point -- I think that's a mistake. Fortran 90/95 has array slicing as well. Sturla Molden From josef.pktd at gmail.com Wed Jan 7 13:52:43 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Wed, 7 Jan 2009 13:52:43 -0500 Subject: [Numpy-discussion] Numpy performance vs Matlab. In-Reply-To: <4964F55A.4070108@molden.no> References: <004401c970db$4e829820$e7ad810a@gnb.st.com> <4964CDF2.9090308@gmail.com> <1cd32cbb0901070836n56aa3550h8bedf8aad186a581@mail.gmail.com> <1cd32cbb0901070927v43e56df2x44e5e764ea9a29a7@mail.gmail.com> <4964ECCA.7060103@noaa.gov> <4964F55A.4070108@molden.no> Message-ID: <1cd32cbb0901071052x1f3c129cl5c2239987c047fc1@mail.gmail.com> On Wed, Jan 7, 2009 at 1:32 PM, Sturla Molden wrote: > On 1/7/2009 6:56 PM, Christopher Barker wrote: > >>> So for simple loops python looses, but for other things, python wins >>> by a huge margin. >> >> which emphasizes the point that you can't write code the same way in the >> two languages, though I'd argue that that code needs refactoring in any >> language! > > Roux example would be bad in either language. Slices ('vectorization' in > Matlab lingo) is preferred in both cases. It's just that neither Matlab > nor Python/NumPy was designed to be used like Java. For loops should not > be abused in Python nor in Matlab (but Matlab is more forgiving now than > it used to be). > > > Sturla Molden > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > I'm missing name spaces in matlab. everything is from path import * and it's more difficult to keep are larger project organized in matlab than in python. But, I think, matlab is ahead in parallelization (which I haven't used much) and learning matlab is easier than numpy. (dtypes and broadcasting are more restrictive in matlab but, for a beginner, easier to figure out) Josef From sturla at molden.no Wed Jan 7 14:26:25 2009 From: sturla at molden.no (Sturla Molden) Date: Wed, 07 Jan 2009 20:26:25 +0100 Subject: [Numpy-discussion] Numpy performance vs Matlab. 
In-Reply-To: <1cd32cbb0901071052x1f3c129cl5c2239987c047fc1@mail.gmail.com>
References: <004401c970db$4e829820$e7ad810a@gnb.st.com>	<4964CDF2.9090308@gmail.com>	<1cd32cbb0901070836n56aa3550h8bedf8aad186a581@mail.gmail.com>	<1cd32cbb0901070927v43e56df2x44e5e764ea9a29a7@mail.gmail.com>	<4964ECCA.7060103@noaa.gov>	<4964F55A.4070108@molden.no>	<1cd32cbb0901071052x1f3c129cl5c2239987c047fc1@mail.gmail.com>
Message-ID: <496501E1.2060400@molden.no>

On 1/7/2009 7:52 PM, josef.pktd at gmail.com wrote:

> But, I think,
> matlab is ahead in parallelization (which I haven't used much)

Not really. There is e.g. nothing like Python's multiprocessing package
in Matlab. Matlab is generally single-threaded. Python is multi-threaded
but there is a GIL. And having multiple Matlab processes running
simultaneously consumes a lot of resources. Python is far better in this
respect.

Don't confuse vectorization with parallelization. It is not the same.
If you are going to do real parallelization, you are better off using
Python with multiprocessing or mpi4py.

> and learning matlab is easier than numpy. (dtypes and broadcasting are
> more restrictive in matlab but, for a beginner, easier to figure out)

The available data types are about the same, at least last time I
checked. (I am not thinking about Python built-ins here, but NumPy
dtypes.) Matlab does not have broadcasting. Array shapes must always
match.

S.M.

From robert.kern at gmail.com  Wed Jan  7 15:54:50 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 7 Jan 2009 15:54:50 -0500
Subject: [Numpy-discussion] Numpy performance vs Matlab.
In-Reply-To: <004401c970db$4e829820$e7ad810a@gnb.st.com>
References: <004401c970db$4e829820$e7ad810a@gnb.st.com>
Message-ID: <3d375d730901071254p3a3a110nccf165434a37f5a0@mail.gmail.com>

On Wed, Jan 7, 2009 at 10:19, Nicolas ROUX wrote:
> Hi,
>
> I need help ;-)
> I have here a testcase which works much faster in Matlab than Numpy.
>
> The following code takes less than 0.9sec in Matlab, but 21sec in Python.
> Numpy is 24 times slower than Matlab !
> The big trouble I have is a large team of people within my company is ready to replace Matlab by Numpy/Scipy/Matplotlib,
> but I have to demonstrate that this kind of Python Code is executed with the same performance than Matlab, without writing C extension.
> This is becoming a critical point for us.
>
> This is a testcase that people would like to see working without any code restructuring.

Basically, if you want efficient numpy code, you have to use numpy
idioms. If you want to continue to use Matlab idioms, keep using
Matlab.

> The reasons are:
> - this way of writing is fairly natural.
> - the original code which showed me the matlab/Numpy performance differences is much more complex,
> and can't benefit from broadcasting or other numpy tips (I can later give this code)

Please do. Otherwise, we can't actually address your concerns.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
-- Umberto Eco

From dwf at cs.toronto.edu  Wed Jan  7 16:41:53 2009
From: dwf at cs.toronto.edu (David Warde-Farley)
Date: Wed, 7 Jan 2009 16:41:53 -0500
Subject: [Numpy-discussion] Numpy performance vs Matlab.
In-Reply-To: <496501E1.2060400@molden.no>
References: <004401c970db$4e829820$e7ad810a@gnb.st.com>	<4964CDF2.9090308@gmail.com>	<1cd32cbb0901070836n56aa3550h8bedf8aad186a581@mail.gmail.com>	<1cd32cbb0901070927v43e56df2x44e5e764ea9a29a7@mail.gmail.com>	<4964ECCA.7060103@noaa.gov>	<4964F55A.4070108@molden.no>	<1cd32cbb0901071052x1f3c129cl5c2239987c047fc1@mail.gmail.com>	<496501E1.2060400@molden.no>
Message-ID: <877CE7F7-8268-4BDD-8503-62F7789608A9@cs.toronto.edu>

On 7-Jan-09, at 2:26 PM, Sturla Molden wrote:

> Matlab does not have broadcasting. Array shapes must always match.

Not totally true. They introduced a clunky, clunky syntax for it in
version 7, IIRC, called 'bsxfun'. See http://tinyurl.com/9e7kyt . It's
a better solution than indexing with a huge ones() or repmat()'ing all
over the place, but not nearly as nice as NumPy overloading.

David

From gael.varoquaux at normalesup.org  Wed Jan  7 17:50:38 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Wed, 7 Jan 2009 23:50:38 +0100
Subject: [Numpy-discussion] Numpy performance vs Matlab.
In-Reply-To: <4964F495.5070200@gmail.com>
References: <004401c970db$4e829820$e7ad810a@gnb.st.com>	<4964C75E.6020103@ar.media.kyoto-u.ac.jp>	<4964F193.9060801@molden.no>	<4964F495.5070200@gmail.com>
Message-ID: <20090107225038.GB5186@phare.normalesup.org>

On Wed, Jan 07, 2009 at 07:29:41PM +0100, Xavier Gnata wrote:
> Well it is the best pitch for numpy versus matlab I have read so far :)
> (and I 100% agree)

+1. This is an excellent text. IMHO it should be on the wiki somewhere.

Gaël

From jdh2358 at gmail.com  Wed Jan  7 17:52:50 2009
From: jdh2358 at gmail.com (John Hunter)
Date: Wed, 7 Jan 2009 16:52:50 -0600
Subject: [Numpy-discussion] my cython is slow
Message-ID: <88e473830901071452g12305ec4r8d3deb52547bcc71@mail.gmail.com>

Partly as an excuse to learn cython, and partly because I need to eke
some extra performance out of a neighborhood search, I tried to code up
a brute force neighborhood search in cython around an N-dimensional
point p. I need to incrementally add a point, do a search, add another
point, do another search, so some of the algorithms like those in
scipy.spatial which assume a static data structure with lots of
searches over it probably won't help me.

I wrote some cython code to grow an Npoints x Ndimensions numpy array
(doubling in size every time I exceed Npoints) and then do a brute
force request for all points within a radius r using a euclidean
distance. The code is working fine, but it is still slower than a
simple numpy implementation (damn you numpy performance!)

Here is the numpy code::

    def find_neighbors_numpy(data, point, radius):
        """
        do a plain ol numpy lookup to compare performance and output

        *data* is a numpoints x numdims array
        *point* is a numdims length vector
        radius is the max distance

        return an array of indices into data which are within radius
        """
        numpoints, n = data.shape
        distance = data - point
        r = np.sqrt((distance*distance).sum(axis=1))
        return np.nonzero(r<=radius)[0]

This requires 6 passes through the data, so I should be able to beat it
with a properly crafted cython algorithm. I have a class NNBF (Nearest
Neighbor Brute Force) with an interface like

NUMDIM = 6
nn = nnbf.NNBF(NUMDIM)
print 'loading data...
this could take a while' # this could take a while for i in range(200000): x = np.random.rand(NUMDIM) nn.add(x) x = np.random.rand(NUMDIM) radius = 0.2 ind = nn.find_neighbors(x, radius) (in my real use case I would be doing a search after every add) when I run this vs numpy, numpy is a little faster testing nnbf... 10 trials: mean=0.0420, min=0.0400 testing numpy... 10 trials: mean=0.0420, min=0.0300 You can grab the code, the python prototype, the test case and setup file at http://matplotlib.sf.net/nnbf.zip. You will need a fairly recent cython to build it: http://www.cython.org/Cython-0.10.3.tar.gz:: # build the extension and run the test code wget http://matplotlib.sf.net/nnbf.zip unzip nnbf.zip cd nnbf python setup.py build_ext --inplace python test_nnbf.py I'm pasting the cython code nnbf.pyx below. If anyone can advise me as to how to remove the bottlenecks (I've decorated the code with some XXX questions) that would be great. I've read the tutorial at http://wiki.cython.org/tutorials/numpy but I think I am still missing something important. I tried to follow Anne's code in ckdtree.pyx which makes use of a raw_data structure but I am not sure how/if to incorporate this into my code -- perhaps there is some juice there.... """ A brute force nearest neighbor routine with incremental add. The internal array data structure grows as you add points """ import numpy as np cimport numpy as np cdef extern from "math.h": float sqrt(float) cdef inline int is_neighbor(int n, double*row, double*pp, double d2max): """ return 1 if the sum-of-squares of n length array row[j]-pp[j] <= d2max """ cdef int j cdef double d, d2 d2 = 0. for j in range(n): d = row[j] - pp[j] d2 += d*d if d2>d2max: return 0 return 1 cdef class NNBF: cdef readonly object data #cdef double* raw_data cdef readonly int n, numrows, numpoints def __init__(self, n): """ create a buffer to hold n dimensional points """ #cdef np.ndarray[double, ndim=2] inner_data self.n = n self.numrows = 100 # XXX how to create empty as contiguous w/o copy? self.data = np.empty((self.numrows, self.n), dtype=np.float) #inner_data = self.data #self.raw_data = inner_data.data self.numpoints = 0 def add(NNBF self, object point): """ add a point to the buffer, grow if necessary """ #cdef np.ndarray[double, ndim=2] inner_data cdef np.ndarray[double, ndim=1] pp pp = np.asarray(point).astype(np.float) self.data[self.numpoints] = pp self.numpoints += 1 if self.numpoints==self.numrows: ## XXX do I need to do memory management here, eg free ## raw_data if I were using it? self.numrows *= 2 newdata = np.empty((self.numrows, self.n), np.float) newdata[:self.numpoints] = self.data self.data = newdata #self.raw_data = inner_data.data def get_data(NNBF self): """ return a copy of data added so far as a numpoints x n array """ return self.data[:self.numpoints] def find_neighbors(NNBF self, object point, double radius): """ return a list of indices into data which are within radius from point """ cdef int i, neighbor cdef double d2max cdef np.ndarray[double, ndim=1] pp cdef np.ndarray[double, ndim=1] row if len(point)!=self.n: raise ValueError('Expected a length %d vector'%self.n) pp = np.asarray(point).astype(np.float) d2max = radius*radius neighbors = [] for i in range(self.numpoints): # XXX : is there a more efficient way to access the row # data? Can/should we be using raw_data here? 
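            # (a hedged answer to the XXX above: self.data[i] goes through
            # Python-level indexing and allocates a fresh 1-d array object
            # on every iteration; keeping a raw C pointer into a
            # C-contiguous buffer and stepping it by n per row -- the
            # raw_data idiom in Anne's ckdtree.pyx -- avoids both costs)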
row = self.data[i] neighbor = is_neighbor( self.n, row.data, pp.data, d2max) # if the number of points in the cluster is small, the # python list performance should not kill us if neighbor: neighbors.append(i) return neighbors From efiring at hawaii.edu Wed Jan 7 18:37:56 2009 From: efiring at hawaii.edu (Eric Firing) Date: Wed, 07 Jan 2009 13:37:56 -1000 Subject: [Numpy-discussion] my cython is slow In-Reply-To: <88e473830901071452g12305ec4r8d3deb52547bcc71@mail.gmail.com> References: <88e473830901071452g12305ec4r8d3deb52547bcc71@mail.gmail.com> Message-ID: <49653CD4.90808@hawaii.edu> John Hunter wrote: > Partly as an excuse to learn cython, and partly because I need to eke > out some extra performance of a neighborhood search, I tried to code > up a brute force neighborhood search in cython around an N-dimensional > point p. I need to incrementally add a point, do a search, add > another point, do another search, so some of the algorithms like those > in scipy.stats.spatial which assume a static data structure with lots > of searches over it probably won't help me. > > I wrote some cython code to grow a Npoints x Ndimensions numpy array > (doubling in size every time I exceed Npoints) and then doing a brute > force request for all points within a radius r using a euclidean > distance. The code is working fine, but it is still slower than a > simple numpy implementation (damn you numpy performance!) A couple small changes speed it up quite a bit: efiring at manini:~/temp/nnbf$ python test_nnbf.py loading data... this could take a while testing nnbf... 10 trials: mean=0.0150, min=0.0100 testing numpy... 10 trials: mean=0.0660, min=0.0600 It is all a matter of keeping Python objects and function calls out of inner loops. I suspect there is quite a bit more that could be done in that regard, but I haven't looked. Eric -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: nnbf.pyx URL: From ezindy at gmail.com Wed Jan 7 20:13:15 2009 From: ezindy at gmail.com (Egor Zindy) Date: Thu, 8 Jan 2009 10:13:15 +0900 Subject: [Numpy-discussion] help with typemapping a C function to use numpy arrays In-Reply-To: References: Message-ID: Hello Rich, This is very strange. I got to test my example again, as long as you don't do a %include "dftmagnitude.h" somewhere in the dftmagnitude.i, it's perfectly possible to do a %rename (sms_spectrumMag) my_spectrumMag; (see dftmagnitude3.zip attached in my previous mail and this one). So things for you to check: * does the simple dftmagnitude3.zip compile on your system? * what version of SWIG are you using? (I used 1.3.36 provided with cygwin) * do you have a %include statement somewhere in your own .i file? Matthieu, if you read this, there's a complete example provided in dftmagnitude3.zip. * Wrapped function sms_spectrumMag in dftmagnitude.c and .h * SWIG wrapper dftmagnitude.i uses %inline and %rename statements * Example uses a modified numpy.i (see the previous mails in the thread). * test example provided in test_dftmagnitude.py Haven't tested it under Linux, but under winxp/cygwin/mingw32, the following works for me (in cygwin): $ python setup_dftmagnitude.py build -cmingw32 ; mv build/lib.win32-2.5/_dftmagnitude.pyd . 
$ python test_dftmagnitude.py Regards, Egor -- My Python: $ python -i Python 2.5.4 (r254:67916, Dec 23 2008, 15:10:54) [MSC v.1310 32 bit (Intel)] on win32 My SWIG: $ swig -version SWIG Version 1.3.36 Compiled with g++ [i686-pc-cygwin] Please see http://www.swig.org for reporting bugs and further information On Thu, Jan 8, 2009 at 1:43 AM, Rich E wrote: > Here is my example, trying to wrap the function sms_spectrumMag that > we have been dealing with: > > %apply (int DIM1, float* IN_ARRAY1) {(int sizeInArray, float* pInArray)}; > %apply (int DIM1, float* INPLACE_ARRAY1) {(int sizeOutArray, float* > pOutArray)}; > > %inline %{ > > void my_spectrumMag( int sizeInArray, float *pInArray, int > sizeOutArray, float *pOutArray) > { > sms_spectrumMag(sizeOutArray, pInArray, pOutArray); > } > > %} > > > at this point, have the new function my_spectrumMag that wraps > sms_spectrumMag() and provides arguments that can be typemapped using > numpy.i Now, I don't want to have to call the function > my_spectrumMag() in python, I want to use the original name, I would > like to call the function as: > > sms_spectrumMag(numpyArray1, numpyArray2) > > But, trying to %rename my_spectrumMag to sms_spectrumMag does not > work, the original sms_spectrumMag gets called in python instead. > Trying to %ignore the original function first as follows removes the > sms_spectrumMag completely from the module and I am left with > my_spectrumMag: > > %ignore sms_spectrumMag; > %rename (sms_spectrumMag) my_spectrumMag; > > > Do you see my problem? > > > On Wed, Jan 7, 2009 at 8:58 AM, Matthieu Brucher > wrote: > > 2009/1/6 Rich E : > >> This helped immensely. I feel like I am getting close to being able > >> to accomplish what I would like with SWIG: producing a python module > >> that can be very 'python-like', while co-existing with the c library > >> that is very 'c-like'. > >> > >> There is one question still remaining though, is it possible to make > >> the wrapped function have the same name still? Using either > >> my_spectrumMag or spectrumMag means I have to create a number of > >> inconsistencies between the python module and the c library. It is > >> ideal to ignore (%ignore?) the c sms_spectrumMag and instead use the > >> wrapped one, with the same name. But my attempts at doing this so far > >> have not compiled because of name conflictions. > > > > Ok course you can. The function is renamed only if you say so. Perhaps > > can you provide a small example of what doesn't work at the moment ? > > > >> Thanks for the help, I think you are doing great things with this > >> numpy interface/typemaps system. > > > > Matthieu > > -- > > Information System Engineer, Ph.D. > > Website: http://matthieu-brucher.developpez.com/ > > Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92 > > LinkedIn: http://www.linkedin.com/in/matthieubrucher > > _______________________________________________ > > Numpy-discussion mailing list > > Numpy-discussion at scipy.org > > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: dftmagnitude3.zip
Type: application/zip
Size: 10887 bytes
Desc: not available
URL:

From bevan07 at gmail.com  Wed Jan  7 22:02:50 2009
From: bevan07 at gmail.com (Bevan Jenkins)
Date: Thu, 8 Jan 2009 03:02:50 +0000 (UTC)
Subject: [Numpy-discussion] Accumulate values that are below threshold
Message-ID:

Hello,

Sometimes the hardest part of a problem is articulating it. Hopefully I can
describe what I am trying to do - at least enough to get some help.

I am trying to compare values to a threshold; when the values are lower than
the threshold they are added to the value in my set until the threshold is
reached. Every time the threshold is reached I want the index and value
(accumulated).

Hopefully the example below will help

threshold =1.0
for indx,val in enumerate(Q):
    print indx,val

0 100.0
1 20.0
2 16.0
3 7.0
4 3.0
5 1.5
6 0.8
7 0.6
8 0.5
9 0.2
10 0.2
11 0.1
12 0.1

The output I would like is (index and accumulated value)
0 100.0
1 20.0
2 16.0
3 7.0
4 3.0
5 1.5
7 1.4
11 1.0


The 1st 6 elements are easy as they are all greater than or equal to the
threshold (1.0). Once the values drop below the threshold, the next value
is added until the threshold is reached.

Any help is appreciated,
Bevan Jenkins

From stefan at sun.ac.za  Thu Jan  8 01:36:41 2009
From: stefan at sun.ac.za (Stéfan van der Walt)
Date: Thu, 8 Jan 2009 08:36:41 +0200
Subject: [Numpy-discussion] Accumulate values that are below threshold
In-Reply-To:
References:
Message-ID: <9457e7c80901072236ua4d0667j66fe0ed96f9fccdb@mail.gmail.com>

Hi Bevan

Since the number of output elements is unknown, I don't think you can
implement this efficiently using arrays. If your dataset isn't too
large, a for-loop should do the trick. Otherwise, you may have to run
your code through Cython, which optimises for-loops around Python
lists.

thresh = 1.0
carry = 0
output = []
for idx, val in enumerate(data):
    carry += val
    if (carry - thresh) >= -1e-15:
        output.append((idx, carry))
        carry = 0

The comparison line above, "(carry - thresh) >= -1e-15", may look
strange -- it basically just does "carry >= thresh". When accumulating
floats, it sometimes happens that "1.0 != 1.0": the running sum picks up
rounding error in its last bits, so a total that should be exactly 1.0
can land just below it. I use 1e-15 as protection against that.

Regards
Stéfan

2009/1/8 Bevan Jenkins :
> Hello,
>
> Sometimes the hardest part of a problem is articulating it. Hopefully I can
> describe what I am trying to do - at least enough to get some help.
>
> I am trying to compare values to a threshold; when the values are lower than
> the threshold they are added to the value in my set until the threshold is
> reached. Every time the threshold is reached I want the index and value
> (accumulated).
>
> Hopefully the example below will help
>
> threshold =1.0
> for indx,val in enumerate(Q):
>     print indx,val
>
> 0 100.0
> 1 20.0
> 2 16.0
> 3 7.0
> 4 3.0
> 5 1.5
> 6 0.8
> 7 0.6
> 8 0.5
> 9 0.2
> 10 0.2
> 11 0.1
> 12 0.1
>
> The output I would like is (index and accumulated value)
> 0 100.0
> 1 20.0
> 2 16.0
> 3 7.0
> 4 3.0
> 5 1.5
> 7 1.4
> 11 1.0
>
>
> The 1st 6 elements are easy as they are all greater than or equal to the
> threshold (1.0). Once the values drop below the threshold, the next value
> is added until the threshold is reached.
> > > Any help is appreciated, > Bevan Jenkins From ezindy at gmail.com Thu Jan 8 11:22:19 2009 From: ezindy at gmail.com (Egor Zindy) Date: Fri, 9 Jan 2009 01:22:19 +0900 Subject: [Numpy-discussion] ANN: numpy.i - added managed deallocation to ARGOUTVIEW_ARRAY1 (ARGOUTVIEWM_ARRAY1) In-Reply-To: <493D1926.4010506@gmail.com> References: <491F8F4A.30009@gmail.com> <49231156.1060209@gmail.com> <49231EB3.8060802@noaa.gov> <492538B9.10202@gmail.com> <493D1926.4010506@gmail.com> Message-ID: Hello list, I've moved my wiki to the scipy cookbook: http://www.scipy.org/Cookbook/SWIG_Memory_Deallocation For the time being, the listed example files are still stored on the google code SVN, but these could easily be moved if necessary. I've also just finished adding an ARGOUTVIEWM_ARRAY2 example. The example shows how to return a two-dimensional array from C which also benefits from the automatic memory deallocation. A naive "crop" function is wrapped using SWIG/numpy.i and returns a slice of the input array. When used as array_out = crop.crop(array_in, d1_0,d1_1, d2_0,d2_1) it is equivalent to the native numpy slicing array_out = array_in[d1_0:d1_1, d2_0:d2_1] Hope this helps! Regards, Egor -------------- next part -------------- An HTML attachment was scrubbed... URL: From chanley at stsci.edu Thu Jan 8 11:37:51 2009 From: chanley at stsci.edu (Christopher Hanley) Date: Thu, 08 Jan 2009 11:37:51 -0500 Subject: [Numpy-discussion] change made to test_print.py Message-ID: <49662BDF.7000806@stsci.edu> Hi, I've committed the following change to test_print.py to fix one of the tests. Index: test_print.py =================================================================== --- test_print.py (revision 6302) +++ test_print.py (working copy) @@ -154,7 +154,7 @@ else: locale.setlocale(locale.LC_NUMERIC, 'FRENCH') - assert_equal(str(tp(1.2)), str(float(1.2)), + assert_equal(locale.format("%f",tp(1.2)), locale.format("%f",float(1.2)), err_msg='Failed locale test for type %s' % tp) finally: locale.setlocale(locale.LC_NUMERIC, locale=curloc) Chris -- Christopher Hanley Senior Systems Software Engineer Space Telescope Science Institute 3700 San Martin Drive Baltimore MD, 21218 (410) 338-4338 From jdh2358 at gmail.com Thu Jan 8 12:32:32 2009 From: jdh2358 at gmail.com (John Hunter) Date: Thu, 8 Jan 2009 11:32:32 -0600 Subject: [Numpy-discussion] my cython is slow In-Reply-To: <49653CD4.90808@hawaii.edu> References: <88e473830901071452g12305ec4r8d3deb52547bcc71@mail.gmail.com> <49653CD4.90808@hawaii.edu> Message-ID: <88e473830901080932i72003322idbaedb9af4de1dae@mail.gmail.com> On Wed, Jan 7, 2009 at 5:37 PM, Eric Firing wrote: > A couple small changes speed it up quite a bit: > > efiring at manini:~/temp/nnbf$ python test_nnbf.py > loading data... this could take a while > testing nnbf... > 10 trials: mean=0.0150, min=0.0100 > testing numpy... > 10 trials: mean=0.0660, min=0.0600 > > It is all a matter of keeping Python objects and function calls out of inner > loops. I suspect there is quite a bit more that could be done in that > regard, but I haven't looked. Much faster, but no longer correct, as you'll see if you uncomment out the nose test test_neighbors that compare actual vs desired. Is the pointer arithmetic correct: dataptr + i I would have thought perhaps: dataptr + i*n but this is segfaulting. Do we need to use a stride? 
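For what it's worth, the invariant I would expect for a C-contiguous
numpoints x n array of doubles is the following -- a sketch, with toy
shapes and names of my own:

import numpy as np

data = np.ascontiguousarray(np.random.rand(5, 3))
n = data.shape[1]
i, j = 2, 1
# element [i, j] of a C-contiguous 2-d array sits i*n + j items past
# the start of the buffer
assert data.ravel()[i * n + j] == data[i, j]
# strides say the same thing in bytes: rows are n items apart
assert data.strides == (n * data.itemsize, data.itemsize)

so dataptr + i*n should only be the start of row i if the buffer really
is C-contiguous; for a sliced or otherwise non-contiguous array it is
the stride, not n, that matters.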
JDH From efiring at hawaii.edu Thu Jan 8 13:34:14 2009 From: efiring at hawaii.edu (Eric Firing) Date: Thu, 08 Jan 2009 08:34:14 -1000 Subject: [Numpy-discussion] my cython is slow In-Reply-To: <88e473830901080932i72003322idbaedb9af4de1dae@mail.gmail.com> References: <88e473830901071452g12305ec4r8d3deb52547bcc71@mail.gmail.com> <49653CD4.90808@hawaii.edu> <88e473830901080932i72003322idbaedb9af4de1dae@mail.gmail.com> Message-ID: <49664726.7020309@hawaii.edu> John Hunter wrote: > On Wed, Jan 7, 2009 at 5:37 PM, Eric Firing wrote: > >> A couple small changes speed it up quite a bit: >> >> efiring at manini:~/temp/nnbf$ python test_nnbf.py >> loading data... this could take a while >> testing nnbf... >> 10 trials: mean=0.0150, min=0.0100 >> testing numpy... >> 10 trials: mean=0.0660, min=0.0600 >> >> It is all a matter of keeping Python objects and function calls out of inner >> loops. I suspect there is quite a bit more that could be done in that >> regard, but I haven't looked. > > Much faster, but no longer correct, as you'll see if you uncomment > out the nose test test_neighbors that compare actual vs desired. > > Is the pointer arithmetic correct: > > dataptr + i > > I would have thought perhaps: > > dataptr + i*n > > but this is segfaulting. Do we need to use a stride? Sorry, I was too hasty. Yes, it seems like i*n should be correct, but it isn't; we are missing something simple and fundamental here. I don't see it immediately, and won't be able to look at it for a while. Eric > > JDH > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From cournape at gmail.com Thu Jan 8 14:15:09 2009 From: cournape at gmail.com (David Cournapeau) Date: Fri, 9 Jan 2009 04:15:09 +0900 Subject: [Numpy-discussion] change made to test_print.py In-Reply-To: <49662BDF.7000806@stsci.edu> References: <49662BDF.7000806@stsci.edu> Message-ID: <5b8d13220901081115w19176d93y4cb4fea55aae1a53@mail.gmail.com> On Fri, Jan 9, 2009 at 1:37 AM, Christopher Hanley wrote: > Hi, > > I've committed the following change to test_print.py to fix one of the > tests. > Hi Christopher, Please do not modify those tests - they are supposed to fail, David From chanley at stsci.edu Thu Jan 8 14:29:11 2009 From: chanley at stsci.edu (Christopher Hanley) Date: Thu, 08 Jan 2009 14:29:11 -0500 Subject: [Numpy-discussion] change made to test_print.py In-Reply-To: <5b8d13220901081115w19176d93y4cb4fea55aae1a53@mail.gmail.com> References: <49662BDF.7000806@stsci.edu> <5b8d13220901081115w19176d93y4cb4fea55aae1a53@mail.gmail.com> Message-ID: <49665407.5030505@stsci.edu> David Cournapeau wrote: > On Fri, Jan 9, 2009 at 1:37 AM, Christopher Hanley wrote: >> Hi, >> >> I've committed the following change to test_print.py to fix one of the >> tests. >> > > Hi Christopher, > > Please do not modify those tests - they are supposed to fail, > > David > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion Hi David, Sorry. Should these tests be generating a "known failures" then? 
====================================================================== FAIL: test_print.test_locale_single ---------------------------------------------------------------------- Traceback (most recent call last): File "/Users/chanley/dev/site-packages/lib/python/nose/case.py", line 182, in runTest self.test(*self.arg) File "/Users/chanley/dev/site-packages/lib/python/numpy/testing/decorators.py", line 82, in skipper return f(*args, **kwargs) File "/Users/chanley/dev/site-packages/lib/python/numpy/core/tests/test_print.py", line 165, in test_locale_single return _test_locale_independance(np.float32) File "/Users/chanley/dev/site-packages/lib/python/numpy/core/tests/test_print.py", line 158, in _test_locale_independance err_msg='Failed locale test for type %s' % tp) File "/Users/chanley/dev/site-packages/lib/python/numpy/testing/utils.py", line 183, in assert_equal raise AssertionError(msg) AssertionError: Items are not equal: Failed locale test for type ACTUAL: '1,2' DESIRED: '1.2' ====================================================================== FAIL: test_print.test_locale_double ---------------------------------------------------------------------- Traceback (most recent call last): File "/Users/chanley/dev/site-packages/lib/python/nose/case.py", line 182, in runTest self.test(*self.arg) File "/Users/chanley/dev/site-packages/lib/python/numpy/testing/decorators.py", line 82, in skipper return f(*args, **kwargs) File "/Users/chanley/dev/site-packages/lib/python/numpy/core/tests/test_print.py", line 170, in test_locale_double return _test_locale_independance(np.double) File "/Users/chanley/dev/site-packages/lib/python/numpy/core/tests/test_print.py", line 158, in _test_locale_independance err_msg='Failed locale test for type %s' % tp) File "/Users/chanley/dev/site-packages/lib/python/numpy/testing/utils.py", line 183, in assert_equal raise AssertionError(msg) AssertionError: Items are not equal: Failed locale test for type ACTUAL: '1,2' DESIRED: '1.2' ====================================================================== FAIL: test_print.test_locale_longdouble ---------------------------------------------------------------------- Traceback (most recent call last): File "/Users/chanley/dev/site-packages/lib/python/nose/case.py", line 182, in runTest self.test(*self.arg) File "/Users/chanley/dev/site-packages/lib/python/numpy/testing/decorators.py", line 82, in skipper return f(*args, **kwargs) File "/Users/chanley/dev/site-packages/lib/python/numpy/core/tests/test_print.py", line 175, in test_locale_longdouble return _test_locale_independance(np.longdouble) File "/Users/chanley/dev/site-packages/lib/python/numpy/core/tests/test_print.py", line 158, in _test_locale_independance err_msg='Failed locale test for type %s' % tp) File "/Users/chanley/dev/site-packages/lib/python/numpy/testing/utils.py", line 183, in assert_equal raise AssertionError(msg) AssertionError: Items are not equal: Failed locale test for type ACTUAL: '1,2' DESIRED: '1.2' Thanks, Chris -- Christopher Hanley Senior Systems Software Engineer Space Telescope Science Institute 3700 San Martin Drive Baltimore MD, 21218 (410) 338-4338 From cournape at gmail.com Thu Jan 8 14:48:02 2009 From: cournape at gmail.com (David Cournapeau) Date: Fri, 9 Jan 2009 04:48:02 +0900 Subject: [Numpy-discussion] change made to test_print.py In-Reply-To: <49665407.5030505@stsci.edu> References: <49662BDF.7000806@stsci.edu> <5b8d13220901081115w19176d93y4cb4fea55aae1a53@mail.gmail.com> <49665407.5030505@stsci.edu> Message-ID: 
<5b8d13220901081148ge59c6a5x8bb929f312f6b9c3@mail.gmail.com> On Fri, Jan 9, 2009 at 4:29 AM, Christopher Hanley wrote: > David Cournapeau wrote: >> On Fri, Jan 9, 2009 at 1:37 AM, Christopher Hanley wrote: >>> Hi, >>> >>> I've committed the following change to test_print.py to fix one of the >>> tests. >>> >> >> Hi Christopher, >> >> Please do not modify those tests - they are supposed to fail, >> >> David >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion > > Hi David, > > Sorry. Should these tests be generating a "known failures" then? No. The problem are known, and are being fixed (in a branch). Since the problem is only in the development trunk, I don't see any problem with having failures for some time, David From jdh2358 at gmail.com Thu Jan 8 14:56:02 2009 From: jdh2358 at gmail.com (John Hunter) Date: Thu, 8 Jan 2009 13:56:02 -0600 Subject: [Numpy-discussion] my cython is slow In-Reply-To: <49664726.7020309@hawaii.edu> References: <88e473830901071452g12305ec4r8d3deb52547bcc71@mail.gmail.com> <49653CD4.90808@hawaii.edu> <88e473830901080932i72003322idbaedb9af4de1dae@mail.gmail.com> <49664726.7020309@hawaii.edu> Message-ID: <88e473830901081156s467057d8u8dddccaae9b227dd@mail.gmail.com> On Thu, Jan 8, 2009 at 12:34 PM, Eric Firing wrote: > John Hunter wrote: >> On Wed, Jan 7, 2009 at 5:37 PM, Eric Firing wrote: >> >>> A couple small changes speed it up quite a bit: >>> >>> efiring at manini:~/temp/nnbf$ python test_nnbf.py >>> loading data... this could take a while >>> testing nnbf... >>> 10 trials: mean=0.0150, min=0.0100 >>> testing numpy... >>> 10 trials: mean=0.0660, min=0.0600 >>> >>> It is all a matter of keeping Python objects and function calls out of inner >>> loops. I suspect there is quite a bit more that could be done in that >>> regard, but I haven't looked. >> >> Much faster, but no longer correct, as you'll see if you uncomment >> out the nose test test_neighbors that compare actual vs desired. >> >> Is the pointer arithmetic correct: >> >> dataptr + i >> >> I would have thought perhaps: >> >> dataptr + i*n >> >> but this is segfaulting. Do we need to use a stride? > > Sorry, I was too hasty. Yes, it seems like i*n should be correct, but > it isn't; we are missing something simple and fundamental here. I don't > see it immediately, and won't be able to look at it for a while. OK, the code at > svn co https://matplotlib.svn.sourceforge.net/svnroot/matplotlib/trunk/py4science/examples/pyrex/nnbf now passes the correctness tests and is approx ten time faster than the numpy version. I borrowed the "raw_data" idiom from Anne's ckdtree.pyx, though I don't really understand why it is different that what we were doing with the data.data ptr. I am also not sure if I need to be doing any memory management when I resize the buffer and reset raw_data.... """ A brute force nearest neighbor routine with incremental add. The internal array data structure grows as you add points """ import numpy as np cimport numpy as np cdef extern from "math.h": float sqrt(float) cdef inline int is_neighbor(int n, double*row, double*pp, double d2max): """ return 1 if the sum-of-squares of n length array row[j]-pp[j] <= d2max """ cdef int j cdef double d, d2 d2 = 0. 
for j in range(n): d = row[j] - pp[j] d2 += d*d if d2>d2max: return 0 return 1 cdef class NNBF: cdef readonly object data cdef double* raw_data cdef readonly int n, numrows, numpoints def __init__(self, n): """ create a buffer to hold n dimensional points """ cdef np.ndarray[double, ndim=2] inner_data self.n = n self.numrows = 10000 # XXX how to create empty as contiguous w/o copy? data = np.empty((self.numrows, self.n), dtype=np.float) self.data = np.ascontiguousarray(data, dtype=np.float) inner_data = self.data self.raw_data = inner_data.data self.numpoints = 0 def add(NNBF self, object point): """ add a point to the buffer, grow if necessary """ cdef np.ndarray[double, ndim=2] inner_data cdef np.ndarray[double, ndim=1] pp pp = np.array(point).astype(np.float) self.data[self.numpoints] = pp self.numpoints += 1 if self.numpoints==self.numrows: ## XXX do I need to do memory management here, eg free ## raw_data if I were using it? self.numrows *= 2 newdata = np.empty((self.numrows, self.n), np.float) newdata[:self.numpoints] = self.data self.data = np.ascontiguousarray(newdata, dtype=np.float) inner_data = self.data self.raw_data = inner_data.data def get_data(NNBF self): """ return a copy of data added so far as a numpoints x n array """ return self.data[:self.numpoints] def find_neighbors(NNBF self, object point, double radius): """ return a list of indices into data which are within radius from point """ cdef int i, neighbor, n cdef double d2max cdef np.ndarray[double, ndim=1] pp # avoid python array indexing in the inner loop if len(point)!=self.n: raise ValueError('Expected a length %d vector'%self.n) pp = np.asarray(point).astype(np.float) d2max = radius*radius neighbors = [] # don't do a python lookup inside the loop n = self.n for i in range(self.numpoints): neighbor = is_neighbor( n, self.raw_data + i*n, pp.data, d2max) # if the number of points in the cluster is small, the # python list performance should not kill us if neighbor: neighbors.append(i) return neighbors def find_neighbors_numpy(self, point, radius): """ do a plain ol numpy lookup to compare performance and output *data* is a numpoints x numdims array *point* is a numdims length vector radius is the max distance distance return an array of indices into data which are within radius """ data = self.get_data() distance = data - point r = np.sqrt((distance*distance).sum(axis=1)) return np.nonzero(r<=radius)[0] From chanley at stsci.edu Thu Jan 8 15:11:55 2009 From: chanley at stsci.edu (Christopher Hanley) Date: Thu, 08 Jan 2009 15:11:55 -0500 Subject: [Numpy-discussion] change made to test_print.py In-Reply-To: <5b8d13220901081148ge59c6a5x8bb929f312f6b9c3@mail.gmail.com> References: <49662BDF.7000806@stsci.edu> <5b8d13220901081115w19176d93y4cb4fea55aae1a53@mail.gmail.com> <49665407.5030505@stsci.edu> <5b8d13220901081148ge59c6a5x8bb929f312f6b9c3@mail.gmail.com> Message-ID: <49665E0B.3030009@stsci.edu> David Cournapeau wrote: > On Fri, Jan 9, 2009 at 4:29 AM, Christopher Hanley wrote: >> David Cournapeau wrote: >>> On Fri, Jan 9, 2009 at 1:37 AM, Christopher Hanley wrote: >>>> Hi, >>>> >>>> I've committed the following change to test_print.py to fix one of the >>>> tests. >>>> >>> Hi Christopher, >>> >>> Please do not modify those tests - they are supposed to fail, >>> >>> David >>> _______________________________________________ >>> Numpy-discussion mailing list >>> Numpy-discussion at scipy.org >>> http://projects.scipy.org/mailman/listinfo/numpy-discussion >> Hi David, >> >> Sorry. 
Should these tests be generating a "known failures" then? > > No. The problem are known, and are being fixed (in a branch). Since > the problem is only in the development trunk, I don't see any problem > with having failures for some time, > > David > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion I would disagree. If you were to attempt the following: n = numpy.test() n.wasSuccessful() You expect the result to be 'True'. If not it is necessary to find out why. Right now the following occurs: >>> n.wasSuccessful() False I have no way of knowing that you wanted those tests to fail unless you have them marked as KNOWNFAIL. Since we use numpy in our production systems I need to determine why numpy is failing. We track the changes on the trunk because we need to know how changes will effect our code prior to our customers downloading the latest numpy release. This 'False' return value from wasSuccessful() means that our automated systems tell us that numpy is broken. The common assumption is that tests are not suppose to fail. If they are failing then there is a problem. If you have tests that you want to fail, either make the failure a condition of the test passing or move the tests to the branch where you are doing your development. Thanks, Chris -- Christopher Hanley Senior Systems Software Engineer Space Telescope Science Institute 3700 San Martin Drive Baltimore MD, 21218 (410) 338-4338 From dagss at student.matnat.uio.no Thu Jan 8 15:30:24 2009 From: dagss at student.matnat.uio.no (Dag Sverre Seljebotn) Date: Thu, 08 Jan 2009 21:30:24 +0100 Subject: [Numpy-discussion] my cython is slow In-Reply-To: <88e473830901081156s467057d8u8dddccaae9b227dd@mail.gmail.com> References: <88e473830901071452g12305ec4r8d3deb52547bcc71@mail.gmail.com> <49653CD4.90808@hawaii.edu> <88e473830901080932i72003322idbaedb9af4de1dae@mail.gmail.com> <49664726.7020309@hawaii.edu> <88e473830901081156s467057d8u8dddccaae9b227dd@mail.gmail.com> Message-ID: <49666260.3070408@student.matnat.uio.no> Some of the problems you encounter could probably be remedied by better support in Cython for some situations. I've filed two feature request tickets for myself, but I have no idea when or if I'll get around to them. http://trac.cython.org/cython_trac/ticket/177 http://trac.cython.org/cython_trac/ticket/178 Dag Sverre John Hunter wrote: > On Thu, Jan 8, 2009 at 12:34 PM, Eric Firing wrote: >> John Hunter wrote: >>> On Wed, Jan 7, 2009 at 5:37 PM, Eric Firing wrote: >>> >>>> A couple small changes speed it up quite a bit: >>>> >>>> efiring at manini:~/temp/nnbf$ python test_nnbf.py >>>> loading data... this could take a while >>>> testing nnbf... >>>> 10 trials: mean=0.0150, min=0.0100 >>>> testing numpy... >>>> 10 trials: mean=0.0660, min=0.0600 >>>> >>>> It is all a matter of keeping Python objects and function calls out of inner >>>> loops. I suspect there is quite a bit more that could be done in that >>>> regard, but I haven't looked. >>> Much faster, but no longer correct, as you'll see if you uncomment >>> out the nose test test_neighbors that compare actual vs desired. >>> >>> Is the pointer arithmetic correct: >>> >>> dataptr + i >>> >>> I would have thought perhaps: >>> >>> dataptr + i*n >>> >>> but this is segfaulting. Do we need to use a stride? >> Sorry, I was too hasty. Yes, it seems like i*n should be correct, but >> it isn't; we are missing something simple and fundamental here. 
I don't >> see it immediately, and won't be able to look at it for a while. > > OK, the code at > > > svn co https://matplotlib.svn.sourceforge.net/svnroot/matplotlib/trunk/py4science/examples/pyrex/nnbf > > now passes the correctness tests and is approx ten time faster than > the numpy version. > > I borrowed the "raw_data" idiom from Anne's ckdtree.pyx, though I > don't really understand why it is different that what we were doing > with the data.data ptr. I am also not sure if I need to be doing any > memory management when I resize the buffer and reset raw_data.... > > """ > A brute force nearest neighbor routine with incremental add. The > internal array data structure grows as you add points > """ > > import numpy as np > cimport numpy as np > > cdef extern from "math.h": > float sqrt(float) > > cdef inline int is_neighbor(int n, double*row, double*pp, double d2max): > """ > return 1 if the sum-of-squares of n length array row[j]-pp[j] <= d2max > """ > cdef int j > cdef double d, d2 > > d2 = 0. > > for j in range(n): > d = row[j] - pp[j] > d2 += d*d > if d2>d2max: > return 0 > return 1 > > cdef class NNBF: > cdef readonly object data > cdef double* raw_data > cdef readonly int n, numrows, numpoints > > def __init__(self, n): > """ > create a buffer to hold n dimensional points > """ > cdef np.ndarray[double, ndim=2] inner_data > > > self.n = n > self.numrows = 10000 > # XXX how to create empty as contiguous w/o copy? > data = np.empty((self.numrows, self.n), dtype=np.float) > self.data = np.ascontiguousarray(data, dtype=np.float) > inner_data = self.data > self.raw_data = inner_data.data > self.numpoints = 0 > > > def add(NNBF self, object point): > """ > add a point to the buffer, grow if necessary > """ > cdef np.ndarray[double, ndim=2] inner_data > cdef np.ndarray[double, ndim=1] pp > pp = np.array(point).astype(np.float) > > > self.data[self.numpoints] = pp > self.numpoints += 1 > if self.numpoints==self.numrows: > ## XXX do I need to do memory management here, eg free > ## raw_data if I were using it? 
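> ## (A note on the XXX above, not from the original post: no manual
> ## free should be needed. raw_data is a borrowed pointer into the
> ## buffer owned by self.data; rebinding self.data below drops the old
> ## array so numpy frees its buffer, and raw_data is refreshed right
> ## after, so it never dangles.)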
> self.numrows *= 2 > newdata = np.empty((self.numrows, self.n), np.float) > newdata[:self.numpoints] = self.data > self.data = np.ascontiguousarray(newdata, dtype=np.float) > inner_data = self.data > self.raw_data = inner_data.data > > > def get_data(NNBF self): > """ > return a copy of data added so far as a numpoints x n array > """ > return self.data[:self.numpoints] > > > def find_neighbors(NNBF self, object point, double radius): > """ > return a list of indices into data which are within radius > from point > """ > cdef int i, neighbor, n > cdef double d2max > cdef np.ndarray[double, ndim=1] pp > > # avoid python array indexing in the inner loop > if len(point)!=self.n: > raise ValueError('Expected a length %d vector'%self.n) > > pp = np.asarray(point).astype(np.float) > > d2max = radius*radius > neighbors = [] > > # don't do a python lookup inside the loop > n = self.n > > for i in range(self.numpoints): > neighbor = is_neighbor( > n, > self.raw_data + i*n, > pp.data, > d2max) > > # if the number of points in the cluster is small, the > # python list performance should not kill us > if neighbor: > neighbors.append(i) > > return neighbors > > def find_neighbors_numpy(self, point, radius): > """ > do a plain ol numpy lookup to compare performance and output > > *data* is a numpoints x numdims array > *point* is a numdims length vector > radius is the max distance distance > > return an array of indices into data which are within radius > """ > data = self.get_data() > distance = data - point > r = np.sqrt((distance*distance).sum(axis=1)) > return np.nonzero(r<=radius)[0] > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion -- Dag Sverre From cournape at gmail.com Thu Jan 8 15:26:17 2009 From: cournape at gmail.com (David Cournapeau) Date: Fri, 9 Jan 2009 05:26:17 +0900 Subject: [Numpy-discussion] change made to test_print.py In-Reply-To: <49665E0B.3030009@stsci.edu> References: <49662BDF.7000806@stsci.edu> <5b8d13220901081115w19176d93y4cb4fea55aae1a53@mail.gmail.com> <49665407.5030505@stsci.edu> <5b8d13220901081148ge59c6a5x8bb929f312f6b9c3@mail.gmail.com> <49665E0B.3030009@stsci.edu> Message-ID: <5b8d13220901081226g1a0883ecg2d8e55d7b86bc17@mail.gmail.com> On Fri, Jan 9, 2009 at 5:11 AM, Christopher Hanley wrote: > David Cournapeau wrote: >> On Fri, Jan 9, 2009 at 4:29 AM, Christopher Hanley wrote: >>> David Cournapeau wrote: >>>> On Fri, Jan 9, 2009 at 1:37 AM, Christopher Hanley wrote: >>>>> Hi, >>>>> >>>>> I've committed the following change to test_print.py to fix one of the >>>>> tests. >>>>> >>>> Hi Christopher, >>>> >>>> Please do not modify those tests - they are supposed to fail, >>>> >>>> David >>>> _______________________________________________ >>>> Numpy-discussion mailing list >>>> Numpy-discussion at scipy.org >>>> http://projects.scipy.org/mailman/listinfo/numpy-discussion >>> Hi David, >>> >>> Sorry. Should these tests be generating a "known failures" then? >> >> No. The problem are known, and are being fixed (in a branch). Since >> the problem is only in the development trunk, I don't see any problem >> with having failures for some time, >> >> David >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion > > I would disagree. 
If you were to attempt the following: > > n = numpy.test() > n.wasSuccessful() > > You expect the result to be 'True'. If not it is necessary to find out > why. Right now the following occurs: > > >>> n.wasSuccessful() > False > > I have no way of knowing that you wanted those tests to fail unless you > have them marked as KNOWNFAIL. Since we use numpy in our production > systems I need to determine why numpy is failing. We track the changes > on the trunk because we need to know how changes will effect our code > prior to our customers downloading the latest numpy release. I don't understand: you can't expect the trunk to always work. We try not to break it - but sometimes it does not work. Personally, I don't like knownfailure much anyway: I feel like it is too easy to tag one test known failure, and then nobody cares about it anymore. Those formatting problems were already problems before - the tests only show the problem, it does not cause the problem, so I don't understand why it is so important: a 100 % running test suite with a problem which is not shown or a 95 % running test suite with the problem is the same thing; the code in numpy itself is exactly the same. David From robert.kern at gmail.com Thu Jan 8 15:32:51 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 8 Jan 2009 15:32:51 -0500 Subject: [Numpy-discussion] change made to test_print.py In-Reply-To: <5b8d13220901081226g1a0883ecg2d8e55d7b86bc17@mail.gmail.com> References: <49662BDF.7000806@stsci.edu> <5b8d13220901081115w19176d93y4cb4fea55aae1a53@mail.gmail.com> <49665407.5030505@stsci.edu> <5b8d13220901081148ge59c6a5x8bb929f312f6b9c3@mail.gmail.com> <49665E0B.3030009@stsci.edu> <5b8d13220901081226g1a0883ecg2d8e55d7b86bc17@mail.gmail.com> Message-ID: <3d375d730901081232m70d17347k16301fe8daf99591@mail.gmail.com> On Thu, Jan 8, 2009 at 15:26, David Cournapeau wrote: > Personally, I don't like knownfailure much anyway: I feel like it is > too easy to tag one test known failure, and then nobody cares about it > anymore. Those formatting problems were already problems before - the > tests only show the problem, it does not cause the problem, so I don't > understand why it is so important: a 100 % running test suite with a > problem which is not shown or a 95 % running test suite with the > problem is the same thing; the code in numpy itself is exactly the > same. Don't check in failing tests without using knownfailure. Unit tests are used by others to determine whether or not *they* broke things or whether their installation failed. By checking in a failing test, you are sending others on a wild goose chase trying to figure out what they did wrong when they didn't. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From robert.kern at gmail.com Thu Jan 8 15:33:28 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 8 Jan 2009 15:33:28 -0500 Subject: [Numpy-discussion] change made to test_print.py In-Reply-To: <49662BDF.7000806@stsci.edu> References: <49662BDF.7000806@stsci.edu> Message-ID: <3d375d730901081233j57897718i91f17828cb73834f@mail.gmail.com> On Thu, Jan 8, 2009 at 11:37, Christopher Hanley wrote: > Hi, > > I've committed the following change to test_print.py to fix one of the > tests. 
> > Index: test_print.py > =================================================================== > --- test_print.py (revision 6302) > +++ test_print.py (working copy) > @@ -154,7 +154,7 @@ > else: > locale.setlocale(locale.LC_NUMERIC, 'FRENCH') > > - assert_equal(str(tp(1.2)), str(float(1.2)), > + assert_equal(locale.format("%f",tp(1.2)), > locale.format("%f",float(1.2)), Note that this does not test anything. %f coerces the input to a Python float anyways. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From chanley at stsci.edu Thu Jan 8 15:36:37 2009 From: chanley at stsci.edu (Christopher Hanley) Date: Thu, 08 Jan 2009 15:36:37 -0500 Subject: [Numpy-discussion] change made to test_print.py In-Reply-To: <5b8d13220901081226g1a0883ecg2d8e55d7b86bc17@mail.gmail.com> References: <49662BDF.7000806@stsci.edu> <5b8d13220901081115w19176d93y4cb4fea55aae1a53@mail.gmail.com> <49665407.5030505@stsci.edu> <5b8d13220901081148ge59c6a5x8bb929f312f6b9c3@mail.gmail.com> <49665E0B.3030009@stsci.edu> <5b8d13220901081226g1a0883ecg2d8e55d7b86bc17@mail.gmail.com> Message-ID: <496663D5.4090005@stsci.edu> David Cournapeau wrote: > On Fri, Jan 9, 2009 at 5:11 AM, Christopher Hanley wrote: >> David Cournapeau wrote: >>> On Fri, Jan 9, 2009 at 4:29 AM, Christopher Hanley wrote: >>>> David Cournapeau wrote: >>>>> On Fri, Jan 9, 2009 at 1:37 AM, Christopher Hanley wrote: >>>>>> Hi, >>>>>> >>>>>> I've committed the following change to test_print.py to fix one of the >>>>>> tests. >>>>>> >>>>> Hi Christopher, >>>>> >>>>> Please do not modify those tests - they are supposed to fail, >>>>> >>>>> David >>>>> _______________________________________________ >>>>> Numpy-discussion mailing list >>>>> Numpy-discussion at scipy.org >>>>> http://projects.scipy.org/mailman/listinfo/numpy-discussion >>>> Hi David, >>>> >>>> Sorry. Should these tests be generating a "known failures" then? >>> No. The problem are known, and are being fixed (in a branch). Since >>> the problem is only in the development trunk, I don't see any problem >>> with having failures for some time, >>> >>> David >>> _______________________________________________ >>> Numpy-discussion mailing list >>> Numpy-discussion at scipy.org >>> http://projects.scipy.org/mailman/listinfo/numpy-discussion >> I would disagree. If you were to attempt the following: >> >> n = numpy.test() >> n.wasSuccessful() >> >> You expect the result to be 'True'. If not it is necessary to find out >> why. Right now the following occurs: >> >> >>> n.wasSuccessful() >> False >> >> I have no way of knowing that you wanted those tests to fail unless you >> have them marked as KNOWNFAIL. Since we use numpy in our production >> systems I need to determine why numpy is failing. We track the changes >> on the trunk because we need to know how changes will effect our code >> prior to our customers downloading the latest numpy release. > > I don't understand: you can't expect the trunk to always work. We try > not to break it - but sometimes it does not work. > > Personally, I don't like knownfailure much anyway: I feel like it is > too easy to tag one test known failure, and then nobody cares about it > anymore. 
Those formatting problems were already problems before - the > tests only show the problem, it does not cause the problem, so I don't > understand why it is so important: a 100 % running test suite with a > problem which is not shown or a 95 % running test suite with the > problem is the same thing; the code in numpy itself is exactly the > same. > > David > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion I do not expect the trunk to always work. I even expect it to have bugs. However, I do not expect there to be test failures for known reasons that result in wasSuccessful() returning false. This is a bad programming practice. It creates work for people trying to figure out what is wrong when the answer is already know. Chris -- Christopher Hanley Senior Systems Software Engineer Space Telescope Science Institute 3700 San Martin Drive Baltimore MD, 21218 (410) 338-4338 From reakinator at gmail.com Thu Jan 8 15:43:35 2009 From: reakinator at gmail.com (Rich E) Date: Thu, 8 Jan 2009 21:43:35 +0100 Subject: [Numpy-discussion] help with typemapping a C function to use numpy arrays In-Reply-To: References: Message-ID: I am using %includ "sms.h", which is what is wrapping all my functions. Without doing this, I have to hand-wrap every function in the header file! Is there a way to exclude certain definitions from my c header file when using %include, so that I can hand wrap them instead? On Thu, Jan 8, 2009 at 2:13 AM, Egor Zindy wrote: > Hello Rich, > > This is very strange. I got to test my example again, as long as you don't > do a > %include "dftmagnitude.h" > somewhere in the dftmagnitude.i, it's perfectly possible to do a > %rename (sms_spectrumMag) my_spectrumMag; > (see dftmagnitude3.zip attached in my previous mail and this one). > > So things for you to check: > * does the simple dftmagnitude3.zip compile on your system? > * what version of SWIG are you using? (I used 1.3.36 provided with cygwin) > * do you have a %include statement somewhere in your own .i file? > > Matthieu, if you read this, there's a complete example provided in > dftmagnitude3.zip. > * Wrapped function sms_spectrumMag in dftmagnitude.c and .h > * SWIG wrapper dftmagnitude.i uses %inline and %rename statements > * Example uses a modified numpy.i (see the previous mails in the thread). > * test example provided in test_dftmagnitude.py > > Haven't tested it under Linux, but under winxp/cygwin/mingw32, the following > works for me (in cygwin): > > $ python setup_dftmagnitude.py build -cmingw32 ; mv > build/lib.win32-2.5/_dftmagnitude.pyd . 
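> # (aside, not in the original mail: the command above builds the
> # extension with mingw32 and moves it out of build/ so the test
> # script in the same directory can import it)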
> $ python test_dftmagnitude.py > > Regards, > Egor > > -- > My Python: > $ python -i > Python 2.5.4 (r254:67916, Dec 23 2008, 15:10:54) [MSC v.1310 32 bit (Intel)] > on win32 > > My SWIG: > $ swig -version > > SWIG Version 1.3.36 > > Compiled with g++ [i686-pc-cygwin] > Please see http://www.swig.org for reporting bugs and further information > > > > > On Thu, Jan 8, 2009 at 1:43 AM, Rich E wrote: >> >> Here is my example, trying to wrap the function sms_spectrumMag that >> we have been dealing with: >> >> %apply (int DIM1, float* IN_ARRAY1) {(int sizeInArray, float* pInArray)}; >> %apply (int DIM1, float* INPLACE_ARRAY1) {(int sizeOutArray, float* >> pOutArray)}; >> >> %inline %{ >> >> void my_spectrumMag( int sizeInArray, float *pInArray, int >> sizeOutArray, float *pOutArray) >> { >> sms_spectrumMag(sizeOutArray, pInArray, pOutArray); >> } >> >> %} >> >> >> at this point, have the new function my_spectrumMag that wraps >> sms_spectrumMag() and provides arguments that can be typemapped using >> numpy.i Now, I don't want to have to call the function >> my_spectrumMag() in python, I want to use the original name, I would >> like to call the function as: >> >> sms_spectrumMag(numpyArray1, numpyArray2) >> >> But, trying to %rename my_spectrumMag to sms_spectrumMag does not >> work, the original sms_spectrumMag gets called in python instead. >> Trying to %ignore the original function first as follows removes the >> sms_spectrumMag completely from the module and I am left with >> my_spectrumMag: >> >> %ignore sms_spectrumMag; >> %rename (sms_spectrumMag) my_spectrumMag; >> >> >> Do you see my problem? >> >> >> On Wed, Jan 7, 2009 at 8:58 AM, Matthieu Brucher >> wrote: >> > 2009/1/6 Rich E : >> >> This helped immensely. I feel like I am getting close to being able >> >> to accomplish what I would like with SWIG: producing a python module >> >> that can be very 'python-like', while co-existing with the c library >> >> that is very 'c-like'. >> >> >> >> There is one question still remaining though, is it possible to make >> >> the wrapped function have the same name still? Using either >> >> my_spectrumMag or spectrumMag means I have to create a number of >> >> inconsistencies between the python module and the c library. It is >> >> ideal to ignore (%ignore?) the c sms_spectrumMag and instead use the >> >> wrapped one, with the same name. But my attempts at doing this so far >> >> have not compiled because of name conflictions. >> > >> > Ok course you can. The function is renamed only if you say so. Perhaps >> > can you provide a small example of what doesn't work at the moment ? >> > >> >> Thanks for the help, I think you are doing great things with this >> >> numpy interface/typemaps system. >> > >> > Matthieu >> > -- >> > Information System Engineer, Ph.D. 
>> > Website: http://matthieu-brucher.developpez.com/ >> > Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92 >> > LinkedIn: http://www.linkedin.com/in/matthieubrucher >> > _______________________________________________ >> > Numpy-discussion mailing list >> > Numpy-discussion at scipy.org >> > http://projects.scipy.org/mailman/listinfo/numpy-discussion >> > >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > From bevan07 at gmail.com Thu Jan 8 18:59:03 2009 From: bevan07 at gmail.com (Bevan Jenkins) Date: Thu, 8 Jan 2009 23:59:03 +0000 (UTC) Subject: [Numpy-discussion] Accumulate values that are below threshold References: <9457e7c80901072236ua4d0667j66fe0ed96f9fccdb@mail.gmail.com> Message-ID: St?fan van der Walt sun.ac.za> writes: > > Hi Bevan > > Since the number of output elements are unknown, I don't think you can > implement this efficiently using arrays. If your dataset isn't too > large, a for-loop should do the trick. Otherwise, you may have to run > your code through Cython, which optimises for-loops around Python > lists. > > thresh = 1.0 > carry = 0 > output = [] > for idx, val in data: > carry += val > if (carry - thresh) >= -1e-15: > output.append((idx, carry)) > carry = 0 > > The comparison line above, "(carry - thresh0 >= -1e-15", may look > strange -- it basically just does "carry >= thresh". For some reason > I don't quite understand, when accumulating floats, it sometimes > happens that "1.0 != 1.0", so I use 1e-15 as protection. > > Regards > St?fan > > 2009/1/8 Bevan Jenkins gmail.com>: Stefan, Thanks for your solution, it does exactly what i require. At the moment I am just running through proof of concept stuff so the dataset is small. If I start to run into issues with the real data, then that might be the push I need to look into Cython. Bevan From mexicalex at yahoo.com Thu Jan 8 21:15:24 2009 From: mexicalex at yahoo.com (Alexandra Geddes) Date: Thu, 8 Jan 2009 18:15:24 -0800 (PST) Subject: [Numpy-discussion] checking array for NaN values. Message-ID: <494682.97803.qm@web51511.mail.re2.yahoo.com> Is there an easy way to check an array for NaN values? 'all(array)' regards NaN as true (because it's not 0). I've tried all(array != nan) which didn't work either. When i call locations which i know are NaN, it returns -1.#IND. But if i try to call all(array != -1.#IND), python interprets the # as the start of a comment. Any thoughts? thanks, alex. Alexandra Geddes UC Davis From josef.pktd at gmail.com Thu Jan 8 21:34:49 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 8 Jan 2009 21:34:49 -0500 Subject: [Numpy-discussion] checking array for NaN values. In-Reply-To: <494682.97803.qm@web51511.mail.re2.yahoo.com> References: <494682.97803.qm@web51511.mail.re2.yahoo.com> Message-ID: <1cd32cbb0901081834w443f7e9va12231a49392769a@mail.gmail.com> On Thu, Jan 8, 2009 at 9:15 PM, Alexandra Geddes wrote: > Is there an easy way to check an array for NaN values? 'all(array)' regards NaN as true (because it's not 0). I've tried all(array != nan) which didn't work either. When i call locations which i know are NaN, it returns -1.#IND. But if i try to call all(array != -1.#IND), python interprets the # as the start of a comment. 
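(Before the reply below: the reason all(array != nan) cannot work is IEEE-754 semantics, under which NaN compares unequal to everything, itself included, so the elementwise != test is true everywhere. A quick illustration, with values assumed only for demonstration:

    >>> import numpy as np
    >>> a = np.array([1.0, np.nan])
    >>> a != np.nan
    array([ True,  True], dtype=bool)
    >>> np.isnan(a)
    array([False,  True], dtype=bool)

np.isnan, used in the reply that follows, is the reliable test.)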
> I use >>> np.any(np.isnan(np.array([1,2,np.nan]))) True Josef From cournape at gmail.com Fri Jan 9 00:21:48 2009 From: cournape at gmail.com (David Cournapeau) Date: Fri, 9 Jan 2009 14:21:48 +0900 Subject: [Numpy-discussion] change made to test_print.py In-Reply-To: <496663D5.4090005@stsci.edu> References: <49662BDF.7000806@stsci.edu> <5b8d13220901081115w19176d93y4cb4fea55aae1a53@mail.gmail.com> <49665407.5030505@stsci.edu> <5b8d13220901081148ge59c6a5x8bb929f312f6b9c3@mail.gmail.com> <49665E0B.3030009@stsci.edu> <5b8d13220901081226g1a0883ecg2d8e55d7b86bc17@mail.gmail.com> <496663D5.4090005@stsci.edu> Message-ID: <5b8d13220901082121h5806285x3422756eeec2a512@mail.gmail.com> On Fri, Jan 9, 2009 at 5:36 AM, Christopher Hanley wrote: > > I do not expect the trunk to always work. I even expect it to have > bugs. However, I do not expect there to be test failures for known > reasons that result in wasSuccessful() returning false. This is a bad > programming practice. It creates work for people trying to figure out > what is wrong when the answer is already know. Well, I don't agree it is bad practice: it is not ideal, yes, but I don't think using KnownFailure is much better. My rationale being that known failures are almost never worked on because it does not bug anyone anymore, and it is very easy to forget about them - AFAICS, most numpy/scipy known failures have never been worked on after being tagged as such. I don't think we have a good system for those cases, be it known failure - or just failing. I will tag them as known failure, since I am the only one against it, though :) David From cournape at gmail.com Fri Jan 9 00:27:46 2009 From: cournape at gmail.com (David Cournapeau) Date: Fri, 9 Jan 2009 14:27:46 +0900 Subject: [Numpy-discussion] change made to test_print.py In-Reply-To: <5b8d13220901082121h5806285x3422756eeec2a512@mail.gmail.com> References: <49662BDF.7000806@stsci.edu> <5b8d13220901081115w19176d93y4cb4fea55aae1a53@mail.gmail.com> <49665407.5030505@stsci.edu> <5b8d13220901081148ge59c6a5x8bb929f312f6b9c3@mail.gmail.com> <49665E0B.3030009@stsci.edu> <5b8d13220901081226g1a0883ecg2d8e55d7b86bc17@mail.gmail.com> <496663D5.4090005@stsci.edu> <5b8d13220901082121h5806285x3422756eeec2a512@mail.gmail.com> Message-ID: <5b8d13220901082127n2fca120foe043f72b709cf9ec@mail.gmail.com> On Fri, Jan 9, 2009 at 2:21 PM, David Cournapeau wrote: > On Fri, Jan 9, 2009 at 5:36 AM, Christopher Hanley wrote: >> >> I do not expect the trunk to always work. I even expect it to have >> bugs. However, I do not expect there to be test failures for known >> reasons that result in wasSuccessful() returning false. This is a bad >> programming practice. It creates work for people trying to figure out >> what is wrong when the answer is already know. > > Well, I don't agree it is bad practice: it is not ideal, yes, but I > don't think using KnownFailure is much better. My rationale being that > known failures are almost never worked on because it does not bug > anyone anymore, and it is very easy to forget about them - AFAICS, > most numpy/scipy known failures have never been worked on after being > tagged as such. I don't think we have a good system for those cases, > be it known failure - or just failing. 
> > I will tag them as known failure, since I am the only one against it, though :) Done in r6308 - please tell me if something still does not work as expected, David From ezindy at gmail.com Fri Jan 9 01:05:10 2009 From: ezindy at gmail.com (Egor Zindy) Date: Fri, 9 Jan 2009 15:05:10 +0900 Subject: [Numpy-discussion] help with typemapping a C function to use numpy arrays In-Reply-To: References: Message-ID: Hello Rich, I know what you mean. %inclusion of header files saves a lot of effort! So, I had another play with the code (what holiday this turned out to be;) and as long as the declarations in the .i file are made in the right order, it should be possible to: * %include the header file * %ignore a sms_ function * %rename the function my_ to sms_ * %inline the my_ function I changed the .i file (attached) and re-ran the test, it works. Again, this is on my XP/cygwin/mingw32 system, so it could need some tuning on a different system! In all this, not sure where is best to put the %exception statement, but placement shouldn't be critical, because it concerns the my_ function rather than the original (or renamed) sms_ function. Regards, Egor On Fri, Jan 9, 2009 at 5:43 AM, Rich E wrote: > I am using %includ "sms.h", which is what is wrapping all my > functions. Without doing this, I have to hand-wrap every function in > the header file! > > Is there a way to exclude certain definitions from my c header file > when using %include, so that I can hand wrap them instead? > > On Thu, Jan 8, 2009 at 2:13 AM, Egor Zindy wrote: > > Hello Rich, > > > > This is very strange. I got to test my example again, as long as you > don't > > do a > > %include "dftmagnitude.h" > > somewhere in the dftmagnitude.i, it's perfectly possible to do a > > %rename (sms_spectrumMag) my_spectrumMag; > > (see dftmagnitude3.zip attached in my previous mail and this one). > > > > So things for you to check: > > * does the simple dftmagnitude3.zip compile on your system? > > * what version of SWIG are you using? (I used 1.3.36 provided with > cygwin) > > * do you have a %include statement somewhere in your own .i file? > > > > Matthieu, if you read this, there's a complete example provided in > > dftmagnitude3.zip. > > * Wrapped function sms_spectrumMag in dftmagnitude.c and .h > > * SWIG wrapper dftmagnitude.i uses %inline and %rename statements > > * Example uses a modified numpy.i (see the previous mails in the > thread). > > * test example provided in test_dftmagnitude.py > > > > Haven't tested it under Linux, but under winxp/cygwin/mingw32, the > following > > works for me (in cygwin): > > > > $ python setup_dftmagnitude.py build -cmingw32 ; mv > > build/lib.win32-2.5/_dftmagnitude.pyd . 
> > $ python test_dftmagnitude.py > > > > Regards, > > Egor > > > > -- > > My Python: > > $ python -i > > Python 2.5.4 (r254:67916, Dec 23 2008, 15:10:54) [MSC v.1310 32 bit > (Intel)] > > on win32 > > > > My SWIG: > > $ swig -version > > > > SWIG Version 1.3.36 > > > > Compiled with g++ [i686-pc-cygwin] > > Please see http://www.swig.org for reporting bugs and further > information > > > > > > > > > > On Thu, Jan 8, 2009 at 1:43 AM, Rich E wrote: > >> > >> Here is my example, trying to wrap the function sms_spectrumMag that > >> we have been dealing with: > >> > >> %apply (int DIM1, float* IN_ARRAY1) {(int sizeInArray, float* > pInArray)}; > >> %apply (int DIM1, float* INPLACE_ARRAY1) {(int sizeOutArray, float* > >> pOutArray)}; > >> > >> %inline %{ > >> > >> void my_spectrumMag( int sizeInArray, float *pInArray, int > >> sizeOutArray, float *pOutArray) > >> { > >> sms_spectrumMag(sizeOutArray, pInArray, pOutArray); > >> } > >> > >> %} > >> > >> > >> at this point, have the new function my_spectrumMag that wraps > >> sms_spectrumMag() and provides arguments that can be typemapped using > >> numpy.i Now, I don't want to have to call the function > >> my_spectrumMag() in python, I want to use the original name, I would > >> like to call the function as: > >> > >> sms_spectrumMag(numpyArray1, numpyArray2) > >> > >> But, trying to %rename my_spectrumMag to sms_spectrumMag does not > >> work, the original sms_spectrumMag gets called in python instead. > >> Trying to %ignore the original function first as follows removes the > >> sms_spectrumMag completely from the module and I am left with > >> my_spectrumMag: > >> > >> %ignore sms_spectrumMag; > >> %rename (sms_spectrumMag) my_spectrumMag; > >> > >> > >> Do you see my problem? > >> > >> > >> On Wed, Jan 7, 2009 at 8:58 AM, Matthieu Brucher > >> wrote: > >> > 2009/1/6 Rich E : > >> >> This helped immensely. I feel like I am getting close to being able > >> >> to accomplish what I would like with SWIG: producing a python module > >> >> that can be very 'python-like', while co-existing with the c library > >> >> that is very 'c-like'. > >> >> > >> >> There is one question still remaining though, is it possible to make > >> >> the wrapped function have the same name still? Using either > >> >> my_spectrumMag or spectrumMag means I have to create a number of > >> >> inconsistencies between the python module and the c library. It is > >> >> ideal to ignore (%ignore?) the c sms_spectrumMag and instead use the > >> >> wrapped one, with the same name. But my attempts at doing this so > far > >> >> have not compiled because of name conflictions. > >> > > >> > Ok course you can. The function is renamed only if you say so. Perhaps > >> > can you provide a small example of what doesn't work at the moment ? > >> > > >> >> Thanks for the help, I think you are doing great things with this > >> >> numpy interface/typemaps system. > >> > > >> > Matthieu > >> > -- > >> > Information System Engineer, Ph.D. 
> >> > Website: http://matthieu-brucher.developpez.com/ > >> > Blogs: http://matt.eifelle.com and > http://blog.developpez.com/?blog=92 > >> > LinkedIn: http://www.linkedin.com/in/matthieubrucher > >> > _______________________________________________ > >> > Numpy-discussion mailing list > >> > Numpy-discussion at scipy.org > >> > http://projects.scipy.org/mailman/listinfo/numpy-discussion > >> > > >> _______________________________________________ > >> Numpy-discussion mailing list > >> Numpy-discussion at scipy.org > >> http://projects.scipy.org/mailman/listinfo/numpy-discussion > > > > > > _______________________________________________ > > Numpy-discussion mailing list > > Numpy-discussion at scipy.org > > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: dftmagnitudei.zip Type: application/zip Size: 781 bytes Desc: not available URL: From stefan at sun.ac.za Fri Jan 9 02:19:09 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Fri, 9 Jan 2009 09:19:09 +0200 Subject: [Numpy-discussion] change made to test_print.py In-Reply-To: <5b8d13220901082121h5806285x3422756eeec2a512@mail.gmail.com> References: <49662BDF.7000806@stsci.edu> <5b8d13220901081115w19176d93y4cb4fea55aae1a53@mail.gmail.com> <49665407.5030505@stsci.edu> <5b8d13220901081148ge59c6a5x8bb929f312f6b9c3@mail.gmail.com> <49665E0B.3030009@stsci.edu> <5b8d13220901081226g1a0883ecg2d8e55d7b86bc17@mail.gmail.com> <496663D5.4090005@stsci.edu> <5b8d13220901082121h5806285x3422756eeec2a512@mail.gmail.com> Message-ID: <9457e7c80901082319v28122044h14f625e62cdf0bd9@mail.gmail.com> 2009/1/9 David Cournapeau : > On Fri, Jan 9, 2009 at 5:36 AM, Christopher Hanley wrote: >> >> I do not expect the trunk to always work. I even expect it to have >> bugs. However, I do not expect there to be test failures for known >> reasons that result in wasSuccessful() returning false. This is a bad >> programming practice. It creates work for people trying to figure out >> what is wrong when the answer is already know. > > Well, I don't agree it is bad practice: it is not ideal, yes, but I > don't think using KnownFailure is much better. My rationale being that > known failures are almost never worked on because it does not bug > anyone anymore, and it is very easy to forget about them - AFAICS, > most numpy/scipy known failures have never been worked on after being > tagged as such. I don't think we have a good system for those cases, > be it known failure - or just failing. I agree with you point of view, but I also have sympathy for Cristopher's situation. I thought a solution to both problems would be if we could find an easy way of executing all skipped tests as if they were never decorated. Turns out nose already has this functionality: nosetests numpy --no-skip I think we should urge developers to run the test suite this way, so that we remain aware of failures, even if they are decorated. Hope that helps, St?fan From animator333 at yahoo.com Fri Jan 9 02:19:53 2009 From: animator333 at yahoo.com (Prashant Saxena) Date: Fri, 9 Jan 2009 12:49:53 +0530 (IST) Subject: [Numpy-discussion] replace array values efficiently Message-ID: <100425.79053.qm@web94914.mail.in2.yahoo.com> Hi, I am new to numpy and getting my hands on slowly. How do you replace integers from strings in an integer array. 
(1D)

For example:

array = [1, 1, 1, 2, 3, 3, 4]

replace 1 with "apple"
replace 2 with "cherry"
replace 3 with "mango"
replace 4 with "banana"

I know the general solution, but I am looking for an efficient way, supported by numpy/scipy, to do this kind of conversion as fast as possible.

Thanks

Prashant

From robert.kern at gmail.com Fri Jan 9 02:29:17 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 9 Jan 2009 01:29:17 -0600
Subject: [Numpy-discussion] change made to test_print.py
In-Reply-To: <9457e7c80901082319v28122044h14f625e62cdf0bd9@mail.gmail.com>
References: <49662BDF.7000806@stsci.edu> <5b8d13220901081115w19176d93y4cb4fea55aae1a53@mail.gmail.com> <49665407.5030505@stsci.edu> <5b8d13220901081148ge59c6a5x8bb929f312f6b9c3@mail.gmail.com> <49665E0B.3030009@stsci.edu> <5b8d13220901081226g1a0883ecg2d8e55d7b86bc17@mail.gmail.com> <496663D5.4090005@stsci.edu> <5b8d13220901082121h5806285x3422756eeec2a512@mail.gmail.com> <9457e7c80901082319v28122044h14f625e62cdf0bd9@mail.gmail.com>
Message-ID: <3d375d730901082329q448c8c86nab1ac1204fd14b1f@mail.gmail.com>

On Fri, Jan 9, 2009 at 01:19, Stéfan van der Walt wrote:
> I think we should urge developers to run the test suite this way, so
> that we remain aware of failures, even if they are decorated.

I don't think we should use unit tests as a bug tracker. We have Trac. Each known-failing test (or group of related tests) should have a ticket prioritized and scheduled appropriately.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From robert.kern at gmail.com Fri Jan 9 02:33:22 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 9 Jan 2009 01:33:22 -0600
Subject: [Numpy-discussion] replace array values efficiently
In-Reply-To: <100425.79053.qm@web94914.mail.in2.yahoo.com>
References: <100425.79053.qm@web94914.mail.in2.yahoo.com>
Message-ID: <3d375d730901082333p40c5cf01hd034ddb5efef4cac@mail.gmail.com>

On Fri, Jan 9, 2009 at 01:19, Prashant Saxena wrote:
> Hi,
>
> I am new to numpy and getting my hands on slowly.
>
> How do you replace integers from strings in an integer array. (1D)
>
> For example:
>
> array = [1, 1, 1, 2, 3, 3, 4]
>
> replace 1 with "apple"
> replace 2 with "cherry"
> replace 3 with "mango"
> replace 4 with "banana"
>
> I know the general solution, but I am looking for an efficient way,
> supported by numpy/scipy, to do this kind of conversion as fast as possible.

I'd actually use a dictionary for this:

In [1]: replacements = {1: 'apple', 2: 'cherry', 3: 'mango', 4: 'banana'}

In [2]: map(replacements.get, [1,1,1,2,3,3,4])
Out[2]: ['apple', 'apple', 'apple', 'cherry', 'mango', 'mango', 'banana']

But if you really want to use numpy for this:

In [3]: from numpy import *

In [4]: replacements_array = array([None, 'apple', 'cherry', 'mango', 'banana'], dtype=object)

In [5]: replacements_array[[1,1,1,2,3,3,4]]
Out[5]: array([apple, apple, apple, cherry, mango, mango, banana], dtype=object)

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth."
-- Umberto Eco From david at ar.media.kyoto-u.ac.jp Fri Jan 9 02:19:46 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Fri, 09 Jan 2009 16:19:46 +0900 Subject: [Numpy-discussion] change made to test_print.py In-Reply-To: <9457e7c80901082319v28122044h14f625e62cdf0bd9@mail.gmail.com> References: <49662BDF.7000806@stsci.edu> <5b8d13220901081115w19176d93y4cb4fea55aae1a53@mail.gmail.com> <49665407.5030505@stsci.edu> <5b8d13220901081148ge59c6a5x8bb929f312f6b9c3@mail.gmail.com> <49665E0B.3030009@stsci.edu> <5b8d13220901081226g1a0883ecg2d8e55d7b86bc17@mail.gmail.com> <496663D5.4090005@stsci.edu> <5b8d13220901082121h5806285x3422756eeec2a512@mail.gmail.com> <9457e7c80901082319v28122044h14f625e62cdf0bd9@mail.gmail.com> Message-ID: <4966FA92.3010405@ar.media.kyoto-u.ac.jp> St?fan van der Walt wrote: > 2009/1/9 David Cournapeau : > >> On Fri, Jan 9, 2009 at 5:36 AM, Christopher Hanley wrote: >> >>> I do not expect the trunk to always work. I even expect it to have >>> bugs. However, I do not expect there to be test failures for known >>> reasons that result in wasSuccessful() returning false. This is a bad >>> programming practice. It creates work for people trying to figure out >>> what is wrong when the answer is already know. >>> >> Well, I don't agree it is bad practice: it is not ideal, yes, but I >> don't think using KnownFailure is much better. My rationale being that >> known failures are almost never worked on because it does not bug >> anyone anymore, and it is very easy to forget about them - AFAICS, >> most numpy/scipy known failures have never been worked on after being >> tagged as such. I don't think we have a good system for those cases, >> be it known failure - or just failing. >> > > I agree with you point of view, but I also have sympathy for > Cristopher's situation. Yes, it is not a black and white situation - I first misunderstood Christopher situation because of the given context of tracking numpy changes. I can see why it is annoying - but it gives me important information (like for example the fact that solaris does not have the same formatting issues than linux and mac os X thanks to recent bug reports). As Robert said, BTS is supposedly a better system for this for this kind of things - but at least for me, trac is so slow and painful to use that I try to avoid it as much as possible. David From stefan at sun.ac.za Fri Jan 9 02:37:32 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Fri, 9 Jan 2009 09:37:32 +0200 Subject: [Numpy-discussion] Replacement for numpy.distutils.config.try_run Message-ID: <9457e7c80901082337q7d1ba180ge52f64914a912036@mail.gmail.com> Hi, In distutils/config.py line 32 I see: "Usage of try_run is deprecated: please do not \n" \ "use it anymore, and avoid configuration checks \n" \ "involving running executable on the target machine.\n" \ What is the recommended way of doing configuration checks now? 
Regards St?fan From david at ar.media.kyoto-u.ac.jp Fri Jan 9 02:32:37 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Fri, 09 Jan 2009 16:32:37 +0900 Subject: [Numpy-discussion] Replacement for numpy.distutils.config.try_run In-Reply-To: <9457e7c80901082337q7d1ba180ge52f64914a912036@mail.gmail.com> References: <9457e7c80901082337q7d1ba180ge52f64914a912036@mail.gmail.com> Message-ID: <4966FD95.8000809@ar.media.kyoto-u.ac.jp> St?fan van der Walt wrote: > Hi, > > In distutils/config.py line 32 I see: > > "Usage of try_run is deprecated: please do not \n" \ > "use it anymore, and avoid configuration checks \n" \ > "involving running executable on the target machine.\n" \ > > What is the recommended way of doing configuration checks now? > It only refers to try_run kind of checks (that is when you need to run an executable on the machine). I added this when I had some problems running it on windows 64 bits with python 2.6 - According to Martin Loewis, try_run was not supposed to work everywhere, in particular in cross-compilation contexts (which is relatively current for windows in the context 32 vs 64 vs itanium). It happened that in that particular case where it is used in numpy, I found a way around it, but I would like to get rid of it completely at some point. David From stefan at sun.ac.za Fri Jan 9 02:51:03 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Fri, 9 Jan 2009 09:51:03 +0200 Subject: [Numpy-discussion] change made to test_print.py In-Reply-To: <3d375d730901082329q448c8c86nab1ac1204fd14b1f@mail.gmail.com> References: <49662BDF.7000806@stsci.edu> <5b8d13220901081115w19176d93y4cb4fea55aae1a53@mail.gmail.com> <49665407.5030505@stsci.edu> <5b8d13220901081148ge59c6a5x8bb929f312f6b9c3@mail.gmail.com> <49665E0B.3030009@stsci.edu> <5b8d13220901081226g1a0883ecg2d8e55d7b86bc17@mail.gmail.com> <496663D5.4090005@stsci.edu> <5b8d13220901082121h5806285x3422756eeec2a512@mail.gmail.com> <9457e7c80901082319v28122044h14f625e62cdf0bd9@mail.gmail.com> <3d375d730901082329q448c8c86nab1ac1204fd14b1f@mail.gmail.com> Message-ID: <9457e7c80901082351y6719b467p6fd156b4d9792c13@mail.gmail.com> 2009/1/9 Robert Kern : > On Fri, Jan 9, 2009 at 01:19, St?fan van der Walt wrote: > >> I think we should urge developers to run the test suite this way, so >> that we remain aware of failures, even if they are decorated. > > I don't think we should use unit tests as a bug tracker. We have Trac. > Each known-failing test (or group of related tests) should have a > ticket prioritized and scheduled appropriately. Yup. Point is, an implementation comes with tests, and sometimes those tests break. We then have two choices: remove the tests or mark them as known failures. Since they are meant to indicate regressions, it would make little sense to remove them, hence these decorators. My suggestion was to use the --no-skip flag to make these decorators invisible to the developers, lest failures be forgotten. 
Stéfan

From stefan at sun.ac.za Fri Jan 9 03:04:05 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Fri, 9 Jan 2009 10:04:05 +0200
Subject: [Numpy-discussion] change made to test_print.py
In-Reply-To: <4966FA92.3010405@ar.media.kyoto-u.ac.jp>
References: <49662BDF.7000806@stsci.edu> <5b8d13220901081115w19176d93y4cb4fea55aae1a53@mail.gmail.com> <49665407.5030505@stsci.edu> <5b8d13220901081148ge59c6a5x8bb929f312f6b9c3@mail.gmail.com> <49665E0B.3030009@stsci.edu> <5b8d13220901081226g1a0883ecg2d8e55d7b86bc17@mail.gmail.com> <496663D5.4090005@stsci.edu> <5b8d13220901082121h5806285x3422756eeec2a512@mail.gmail.com> <9457e7c80901082319v28122044h14f625e62cdf0bd9@mail.gmail.com> <4966FA92.3010405@ar.media.kyoto-u.ac.jp>
Message-ID: <9457e7c80901090004s5e0bb313web2ff117b8a7b8ec@mail.gmail.com>

2009/1/9 David Cournapeau :
> As Robert said, BTS is supposedly a better system for this for this kind
> of things - but at least for me, trac is so slow and painful to use that
> I try to avoid it as much as possible.

We are running Trac 10.2 from November 2006, so it is quite possible that some of the speed issues have been addressed in the meantime.

Cheers
Stéfan

From stefan at sun.ac.za Fri Jan 9 03:05:19 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Fri, 9 Jan 2009 10:05:19 +0200
Subject: [Numpy-discussion] Replacement for numpy.distutils.config.try_run
In-Reply-To: <4966FD95.8000809@ar.media.kyoto-u.ac.jp>
References: <9457e7c80901082337q7d1ba180ge52f64914a912036@mail.gmail.com> <4966FD95.8000809@ar.media.kyoto-u.ac.jp>
Message-ID: <9457e7c80901090005lc1911dve26e19b9793ea061@mail.gmail.com>

2009/1/9 David Cournapeau :
>> It happened that in that particular case where it is used in numpy, I
>> found a way around it, but I would like to get rid of it completely at
>> some point.

What do you suggest as workarounds?

Stéfan

From david at ar.media.kyoto-u.ac.jp Fri Jan 9 03:00:36 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Fri, 09 Jan 2009 17:00:36 +0900
Subject: [Numpy-discussion] Replacement for numpy.distutils.config.try_run
In-Reply-To: <9457e7c80901090005lc1911dve26e19b9793ea061@mail.gmail.com>
References: <9457e7c80901082337q7d1ba180ge52f64914a912036@mail.gmail.com> <4966FD95.8000809@ar.media.kyoto-u.ac.jp> <9457e7c80901090005lc1911dve26e19b9793ea061@mail.gmail.com>
Message-ID: <49670424.7010108@ar.media.kyoto-u.ac.jp>

Stéfan van der Walt wrote:
> 2009/1/9 David Cournapeau :
>> It happened that in that particular case where it is used in numpy, I
>> found a way around it, but I would like to get rid of it completely at
>> some point.
>
> What do you suggest as workarounds?
>

What about not using tests which need to run on the target platform :) It is not always easy, but in numpy's case it is simple, at least in principle (the numscons build does not run any test code, for example): try_run is used to check whether some preprocessor defines are available (see numpy/random and numpy/core), which is not necessary. The autobook specifically mentions that code which needs to run on target platforms should be avoided, since it breaks cross-compilation; with python distutils, it breaks in exactly those cases.
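(Concretely, the compile-only style of check David describes can be sketched against the standard distutils config command, which numpy.distutils subclasses; the macro and header names below are purely illustrative, not numpy's actual checks:

    def has_macro(config_cmd, macro, header):
        # Compile a tiny program; nothing ever executes on the
        # host/target, so cross-compilation keeps working.
        body = ("#include <%s>\n"
                "#ifndef %s\n"
                "#error %s is not defined\n"
                "#endif\n"
                "int main(void) { return 0; }\n") % (header, macro, macro)
        return config_cmd.try_compile(body)

try_cpp, try_compile and try_link are all safe in this sense, because their output never has to run.)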
cheers, David From stefan at sun.ac.za Fri Jan 9 03:31:25 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Fri, 9 Jan 2009 10:31:25 +0200 Subject: [Numpy-discussion] Replacement for numpy.distutils.config.try_run In-Reply-To: <49670424.7010108@ar.media.kyoto-u.ac.jp> References: <9457e7c80901082337q7d1ba180ge52f64914a912036@mail.gmail.com> <4966FD95.8000809@ar.media.kyoto-u.ac.jp> <9457e7c80901090005lc1911dve26e19b9793ea061@mail.gmail.com> <49670424.7010108@ar.media.kyoto-u.ac.jp> Message-ID: <9457e7c80901090031i3be213cfsc3182a678bddd2f9@mail.gmail.com> 2009/1/9 David Cournapeau : >> What do you suggest as workarounds? > > What about not using tests which need to run on the target platform :) Let me simplify the question. How do you detect the version of the local Fortran compiler without executing the compiler? Or is that OK, and you'd simply like to avoid compiling and running code? St?fan From robert.kern at gmail.com Fri Jan 9 03:37:02 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 9 Jan 2009 02:37:02 -0600 Subject: [Numpy-discussion] Replacement for numpy.distutils.config.try_run In-Reply-To: <9457e7c80901090031i3be213cfsc3182a678bddd2f9@mail.gmail.com> References: <9457e7c80901082337q7d1ba180ge52f64914a912036@mail.gmail.com> <4966FD95.8000809@ar.media.kyoto-u.ac.jp> <9457e7c80901090005lc1911dve26e19b9793ea061@mail.gmail.com> <49670424.7010108@ar.media.kyoto-u.ac.jp> <9457e7c80901090031i3be213cfsc3182a678bddd2f9@mail.gmail.com> Message-ID: <3d375d730901090037n3b3ac24bp9781b7c4cd24af1c@mail.gmail.com> On Fri, Jan 9, 2009 at 02:31, St?fan van der Walt wrote: > 2009/1/9 David Cournapeau : >>> What do you suggest as workarounds? >> >> What about not using tests which need to run on the target platform :) > > Let me simplify the question. How do you detect the version of the > local Fortran compiler without executing the compiler? try_run() is not the right thing to call for such a purpose. Use FCompiler.get_version(). -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From david at ar.media.kyoto-u.ac.jp Fri Jan 9 03:21:47 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Fri, 09 Jan 2009 17:21:47 +0900 Subject: [Numpy-discussion] Replacement for numpy.distutils.config.try_run In-Reply-To: <9457e7c80901090031i3be213cfsc3182a678bddd2f9@mail.gmail.com> References: <9457e7c80901082337q7d1ba180ge52f64914a912036@mail.gmail.com> <4966FD95.8000809@ar.media.kyoto-u.ac.jp> <9457e7c80901090005lc1911dve26e19b9793ea061@mail.gmail.com> <49670424.7010108@ar.media.kyoto-u.ac.jp> <9457e7c80901090031i3be213cfsc3182a678bddd2f9@mail.gmail.com> Message-ID: <4967091B.2080007@ar.media.kyoto-u.ac.jp> St?fan van der Walt wrote: > 2009/1/9 David Cournapeau : > >>> What do you suggest as workarounds? >>> >> What about not using tests which need to run on the target platform :) >> > > Let me simplify the question. How do you detect the version of the > local Fortran compiler without executing the compiler? Or is that OK, > and you'd simply like to avoid compiling and running code? > Ah, sorry, I used the autotools vocabulary you may not be familiar with: when cross compiling, you have at least two platforms, build and host/target (host/target is the same unless you build cross-compilers). If I build foobar on linux for windows, linux is the build, windows the host/target. 
Anything which runs on the host/target platform cannot work in this context; anything on the build platform of course can - which generally includes compilers, etc. Basically, assuming a working compiler, pre-processing, compiling, or linking test code snippets is OK. Running is not.

cheers,

David

From stefan at sun.ac.za Fri Jan 9 04:27:22 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Fri, 9 Jan 2009 11:27:22 +0200 Subject: [Numpy-discussion] Replacement for numpy.distutils.config.try_run In-Reply-To: <3d375d730901090037n3b3ac24bp9781b7c4cd24af1c@mail.gmail.com> References: <9457e7c80901082337q7d1ba180ge52f64914a912036@mail.gmail.com> <4966FD95.8000809@ar.media.kyoto-u.ac.jp> <9457e7c80901090005lc1911dve26e19b9793ea061@mail.gmail.com> <49670424.7010108@ar.media.kyoto-u.ac.jp> <9457e7c80901090031i3be213cfsc3182a678bddd2f9@mail.gmail.com> <3d375d730901090037n3b3ac24bp9781b7c4cd24af1c@mail.gmail.com> Message-ID: <9457e7c80901090127p31fde40at17cf2338e1339341@mail.gmail.com>

2009/1/9 Robert Kern :
> try_run() is not the right thing to call for such a purpose. Use
> FCompiler.get_version().

That was just an example. What I want to do is run something like "pkg-config blah" and parse the output, but I get the idea from David's post that that is OK.

Cheers
Stéfan

From robert.kern at gmail.com Fri Jan 9 04:30:01 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 9 Jan 2009 03:30:01 -0600 Subject: [Numpy-discussion] Replacement for numpy.distutils.config.try_run In-Reply-To: <9457e7c80901090127p31fde40at17cf2338e1339341@mail.gmail.com> References: <9457e7c80901082337q7d1ba180ge52f64914a912036@mail.gmail.com> <4966FD95.8000809@ar.media.kyoto-u.ac.jp> <9457e7c80901090005lc1911dve26e19b9793ea061@mail.gmail.com> <49670424.7010108@ar.media.kyoto-u.ac.jp> <9457e7c80901090031i3be213cfsc3182a678bddd2f9@mail.gmail.com> <3d375d730901090037n3b3ac24bp9781b7c4cd24af1c@mail.gmail.com> <9457e7c80901090127p31fde40at17cf2338e1339341@mail.gmail.com> Message-ID: <3d375d730901090130j5f53c6f3v7343ac73cd694890@mail.gmail.com>

On Fri, Jan 9, 2009 at 03:27, Stéfan van der Walt wrote:
> 2009/1/9 Robert Kern :
>> try_run() is not the right thing to call for such a purpose. Use
>> FCompiler.get_version().
>
> That was just an example. What I want to do is run something like
> "pkg-config blah" and parse the output, but I get the idea from
> David's post that that is OK.

But that's not what try_run() does. Use exec_command() for general purpose running of programs. try_run() compiles a program from source and runs the result.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From nicolas.roux at st.com Fri Jan 9 07:04:36 2009 From: nicolas.roux at st.com (Nicolas ROUX) Date: Fri, 9 Jan 2009 13:04:36 +0100 Subject: [Numpy-discussion] Numpy performance vs Matlab. In-Reply-To: <496501E1.2060400@molden.no> Message-ID: <002d01c97252$716774a0$e7ad810a@gnb.st.com>

Hi!

Thanks a lot for your fast/detailed reply. A very good point for Numpy ;-) I spent all my time trying to prepare my testcase to better share with you; that's why I didn't reply quickly.

I understand the weakness of the missing JIT compiler in Python vs Matlab, which is why I investigated numpy vectorization/broadcasting (hoping to find a cool way to write our code in fast Numpy).

I used the page http://www.scipy.org/PerformancePython to write my code efficiently in Numpy.
While doing it I found one issue. To have pretty code, I created p0 and p1 arrays of indexes. In "test8" I wished to see the commented line working, which is not the case. Having to use "ix_" is not pretty enough, and it seems not to work in higher dimensions. Why is the commented line not working?

############################################
def test8():
    m = 1024
    n = 512
    Out = numpy.zeros((m,n))
    In = numpy.zeros((m,n))
    p0 = numpy.ogrid[0:m]
    p1 = numpy.ogrid[0:n]
    Out[0:m,0:n] = In[0:m,0:n]
    #Out[p0,p1] = In[p0,p1] #This doesn't work
    Out[numpy.ix_(p0,p1)] = In[numpy.ix_(p0,p1)]
############################################

What is maybe not clear in the above code is that I don't want to predefine all possible ogrids/vectors. The number of possible ogrids/vectors is big if I need to define them all, and the vector definitions become more painful.

So the Numpy vector style is fine if I can write something like:

Out[p0,p1] = In[p0,p1]     #2 dimensions case

and

Out[p0,p1,1] = In[p0,p1,1] #3 dimensions case

But it is not fine if I have to add ".ix_()" or to multiply the number of vector definitions. Below is an example with 3 dimensions instead of 2.

############################################
def test9():
    m = 1024
    n = 512
    Out = numpy.zeros((m,n,3))
    In = numpy.zeros((m,n,3))
    p0 = numpy.ogrid[0:m]
    p1 = numpy.ogrid[0:n]
    Out[0:m,0:n,2] = In[0:m,0:n,2]
    #Out[p0,p1,2] = In[p0,p1,2]
    Out[numpy.ix_(p0,p1,2)] = In[numpy.ix_(p0,p1,2)]
############################################

Thanks again for your support ;-)

Cheers,
Nicolas.

From ndbecker2 at gmail.com Fri Jan 9 07:05:02 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 09 Jan 2009 07:05:02 -0500 Subject: [Numpy-discussion] memmap from fd? Message-ID:

I'm working on interfacing to a custom FPGA board. The kernel driver exposes the FPGA memory via mmap.

It might be nice to use numpy memmap to read/write data. One issue is that I think I will need to create the memmap array from an fd, not a file name. The reason is I wrote the driver to only allow 1 exclusive open, and I already have it open for other reasons. Any chance to create a memmap array from an fd?

From robert.kern at gmail.com Fri Jan 9 07:10:40 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 9 Jan 2009 06:10:40 -0600 Subject: [Numpy-discussion] memmap from fd? In-Reply-To: References: Message-ID: <3d375d730901090410h3b5f57caqe8a6b639809047a4@mail.gmail.com>

On Fri, Jan 9, 2009 at 06:05, Neal Becker wrote:
> I'm working on interfacing to a custom FPGA board. The kernel driver exposes the FPGA memory via mmap.
>
> It might be nice to use numpy memmap to read/write data. One issue is that I think I will need to create the memmap array from an fd, not a file name. The reason is I wrote the driver to only allow 1 exclusive open, and I already have it open for other reasons. Any chance to create a memmap array from an fd?

Use os.fdopen(fd) to create a file object which can be passed to the memmap constructor.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From ndbecker2 at gmail.com Fri Jan 9 08:26:00 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 09 Jan 2009 08:26 -0500 Subject: [Numpy-discussion] memmap from fd? References: <3d375d730901090410h3b5f57caqe8a6b639809047a4@mail.gmail.com> Message-ID:

Robert Kern wrote:

> On Fri, Jan 9, 2009 at 06:05, Neal Becker wrote:
>> I'm working on interfacing to a custom FPGA board.
The kernel driver
>> exposes the FPGA memory via mmap.
>>
>> It might be nice to use numpy memmap to read/write data. One issue is
>> that I think I will need to create the memmap array from an fd, not a file
>> name. The reason is I wrote the driver to only allow 1 exclusive open,
>> and I already have it open for other reasons. Any chance to create a
>> memmap array from an fd?
>
> Use os.fdopen(fd) to create a file object which can be passed to the
> memmap constructor.
>

Thanks! I'm assuming in this case I can ignore comments about flushing the data to disk? If an assignment to a slice of a memmap array writes through the mmap, there should be no need for any flushing.

From sturla at molden.no Fri Jan 9 08:50:29 2009 From: sturla at molden.no (Sturla Molden) Date: Fri, 9 Jan 2009 14:50:29 +0100 (CET) Subject: [Numpy-discussion] Numpy performance vs Matlab. In-Reply-To: <002d01c97252$716774a0$e7ad810a@gnb.st.com> References: <002d01c97252$716774a0$e7ad810a@gnb.st.com> Message-ID: <696a2c95645b74e165e0d196a6f6582e.squirrel@webmail.uio.no>

> I understand the weakness of the missing JIT compiler in Python vs Matlab,
> which is why I investigated numpy vectorization/broadcasting
> (hoping to find a cool way to write our code in fast Numpy).
>
> I used the page http://www.scipy.org/PerformancePython to write my code
> efficiently in Numpy.
> While doing it I found one issue.
>
> To have pretty code, I created p0 and p1 arrays of indexes.

I must admit I don't quite understand what you are trying to do, and what your problem is. If you just want to do

Out[:,:] = In[:,:]

there is no need for meshgrids (ogrid), for-loops, or whatever. It is brain dead to use nested for-loops or ogrid for this purpose in NumPy. It is equally foolish to use nested for loops or meshgrid for this purpose in Matlab. If you do, I would seriously question your competence.

By the way, you can index ogrid with more than one dimension:

p = numpy.ogrid[:m, :n]
Out[p] = In[p]

From ndbecker2 at gmail.com Fri Jan 9 09:08:48 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 09 Jan 2009 09:08:48 -0500 Subject: [Numpy-discussion] memmap from fd? References: <3d375d730901090410h3b5f57caqe8a6b639809047a4@mail.gmail.com> Message-ID:

Robert Kern wrote:

> On Fri, Jan 9, 2009 at 06:05, Neal Becker wrote:
>> I'm working on interfacing to a custom FPGA board. The kernel driver
>> exposes the FPGA memory via mmap.
>>
>> It might be nice to use numpy memmap to read/write data. One issue is
>> that I think I will need to create the memmap array from an fd, not a file
>> name. The reason is I wrote the driver to only allow 1 exclusive open,
>> and I already have it open for other reasons. Any chance to create a
>> memmap array from an fd?
>
> Use os.fdopen(fd) to create a file object which can be passed to the
> memmap constructor.
>

Looks like this is not going to work without some change to memmap. The problem is, I need read/write access. The only choice in memmap is 'w+'. But this does:

if (mode == 'w+') and shape is None:
    raise ValueError, "shape must be given"

fid.seek(0,2)

My device has hijacked 'read' to mean something entirely different than you might expect. The seek call invokes 'read'.

It looks like the purpose of this code is to find the size of the mappable area. The best solution, I think, is just to throw it away. Consistent with mmap semantics, attempting access outside the mappable area should cause an error - but I don't think there is any reliable way to know the length of the mappable area a priori.

Any thoughts?
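For the plain-file case, the os.fdopen() route discussed above looks roughly like this. A minimal sketch: the device path, size and dtype are made-up placeholders, and a device like Neal's would still trip over the size probe he describes:

import os
import numpy

fd = os.open('/dev/fpga0', os.O_RDWR)     # hypothetical device node
f = os.fdopen(fd, 'r+b')                  # wrap the existing fd in a file object
# mode 'r+' gives read/write access; shape must be passed explicitly
# because the driver reports no useful file size
m = numpy.memmap(f, dtype=numpy.uint8, mode='r+', shape=(1 << 20,))
m[0:4] = [1, 2, 3, 4]                     # assignments write straight through the mapping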
From nouiz at nouiz.org Fri Jan 9 09:25:43 2009 From: nouiz at nouiz.org (=?ISO-8859-1?Q?Fr=E9d=E9ric_Bastien?=) Date: Fri, 9 Jan 2009 09:25:43 -0500 Subject: [Numpy-discussion] inplace matrix multiplication Message-ID: <2d1d7fe70901090625k53fe9415uf177e2d80645f12c@mail.gmail.com>

Hi,

I would like to know how I can make a call to the blas function gemm in numpy. I need a multiply and accumulate for matrices, and I don't want to allocate a new matrix each time I do it.

thanks for your time

Frederic Bastien

From nicolas.roux at st.com Fri Jan 9 09:56:08 2009 From: nicolas.roux at st.com (Nicolas ROUX) Date: Fri, 9 Jan 2009 15:56:08 +0100 Subject: [Numpy-discussion] Numpy performance vs Matlab. In-Reply-To: <696a2c95645b74e165e0d196a6f6582e.squirrel@webmail.uio.no> Message-ID: <012801c9726a$67af6ea0$e7ad810a@gnb.st.com>

Sorry, my previous mail was probably not clear. It followed the thread we had before, so it carried some context from that discussion. I simplified the code to focus only on what I need, rather than bother you with the full code.

I wrote below a code closer to what I need, where you will agree that vectorization/broadcasting is needed to avoid nested loops. As I wrote in the 1st mail (added at the end), what is important is to keep the code not too ugly due to vectorization syntax. (In the code below I try to demonstrate that vectorized/broadcast code can be as readable as a doubly nested loop.) The real code we have is even more complex, processing each array element using its 5x5 neighbourhood instead of 3x3.

######################################################
def test6():
    w = 3096
    h = 2048
    a = numpy.zeros((h,w)) #Normally loaded with real data
    b = numpy.zeros((h,w,3))
    w0 = numpy.ogrid[0:w-2]
    w1 = numpy.ogrid[1:w-1]
    w2 = numpy.ogrid[2:w]
    h0 = numpy.ogrid[0:h-2]
    h1 = numpy.ogrid[1:h-1]
    h2 = numpy.ogrid[2:h]
    p00, p10, p20 = [h0,w0], [h1,w0],[h2,w0]
    p01, p11, p21 = [h0,w1], [h1,w1],[h2,w1]
    p02, p12, p22 = [h0,w2], [h1,w2],[h2,w2]
    b[p11,1] = a[p11] + 1.23*a[p22] \
        - numpy.min([a[p11]-a[p00], a[p11]-a[p01], a[p11]-a[p02],
                     a[p11]-a[p10], a[p11]-a[p12], a[p11]-a[p20],
                     a[p11]-a[p21], a[p11]-a[p22]]) \
        + 0.123*numpy.max([a[p11]-a[p00], a[p11]-a[p01], a[p11]-a[p02],
                           a[p11]-a[p10], a[p11]-a[p12], a[p11]-a[p20],
                           a[p11]-a[p21], a[p11]-a[p22]])
######################################################

The code above is the one I wish to write, but it is not working. I hope you now better understand my issue and its context ;-) Did I miss something?

Thanks for your help.

Cheers,
Nicolas.

> I understand the weakness of the missing JIT compiler in Python vs Matlab,
> which is why I investigated numpy vectorization/broadcasting
> (hoping to find a cool way to write our code in fast Numpy).
>
> I used the page http://www.scipy.org/PerformancePython to write my code
> efficiently in Numpy.
> While doing it I found one issue.
>
> To have pretty code, I created p0 and p1 arrays of indexes.

I must admit I don't quite understand what you are trying to do, and what your problem is. If you just want to do

Out[:,:] = In[:,:]

there is no need for meshgrids (ogrid), for-loops, or whatever. It is brain dead to use nested for-loops or ogrid for this purpose in NumPy. It is equally foolish to use nested for loops or meshgrid for this purpose in Matlab. If you do, I would seriously question your competence.
By the way, you can index ogrid with more than one dimension:

p = numpy.ogrid[:m, :n]
Out[p] = In[p]

===========================================================================================

Hi, I need help ;-)

I have here a testcase which works much faster in Matlab than Numpy. The following code takes less than 0.9 sec in Matlab, but 21 sec in Python. Numpy is 24 times slower than Matlab!

The big trouble I have is that a large team of people within my company is ready to replace Matlab by Numpy/Scipy/Matplotlib, but I have to demonstrate that this kind of Python code executes with the same performance as Matlab, without writing C extensions. This is becoming a critical point for us.

This is a testcase that people would like to see working without any code restructuring. The reasons are:
- this way of writing is fairly natural.
- the original code which showed me the Matlab/Numpy performance differences is much more complex, and can't benefit from broadcasting or other numpy tips (I can give this code later).

...So I really need to use the code below, without restructuring.
Numpy/Python code:
#####################################################################
import numpy
import time
print "Start test \n"
dim = 3000
a = numpy.zeros((dim,dim,3))
start = time.clock()
for i in range(dim):
    for j in range(dim):
        a[i,j,0] = a[i,j,1]
        a[i,j,2] = a[i,j,0]
        a[i,j,1] = a[i,j,2]
end = time.clock() - start
print "Test done, %f sec" % end
#####################################################################

Matlab code:
#####################################################################
'Start test'
dim = 3000;
tic;
a = zeros(dim,dim,3);
for i = 1:dim
    for j = 1:dim
        a(i,j,1) = a(i,j,2);
        a(i,j,2) = a(i,j,1);
        a(i,j,3) = a(i,j,3);
    end
end
toc
'Test done'
#####################################################################

Any idea? Did I miss something?

Thanks a lot in advance for your help.

Cheers,
Nicolas.

From sturla at molden.no Fri Jan 9 10:46:43 2009 From: sturla at molden.no (Sturla Molden) Date: Fri, 9 Jan 2009 16:46:43 +0100 (CET) Subject: [Numpy-discussion] Numpy performance vs Matlab. In-Reply-To: <012801c9726a$67af6ea0$e7ad810a@gnb.st.com> References: <012801c9726a$67af6ea0$e7ad810a@gnb.st.com> Message-ID: <8c051b18e6c219c05fcc3c126f1d566d.squirrel@webmail.uio.no>

> I simplified the code to focus only on what I need, rather than bother you
> with the full code.

def test():
    w = 3096
    h = 2048
    a = numpy.zeros((h,w), order='F') #Normally loaded with real data
    b = numpy.zeros((h,w,3), order='F')
    w0 = slice(0,w-2)
    w1 = slice(1,w-1)
    w2 = slice(2,w)
    h0 = slice(0,h-2)
    h1 = slice(1,h-1)
    h2 = slice(2,h)
    p00, p10, p20 = [h0,w0], [h1,w0], [h2,w0]
    p01, p11, p21 = [h0,w1], [h1,w1], [h2,w1]
    p02, p12, p22 = [h0,w2], [h1,w2], [h2,w2]
    b[p11 + [1]] = a[p11] + 1.23*a[p22] \
        - numpy.min([a[p11]-a[p00], a[p11]-a[p01], a[p11]-a[p02],
                     a[p11]-a[p10], a[p11]-a[p12], a[p11]-a[p20],
                     a[p11]-a[p21], a[p11]-a[p22]]) \
        + 0.123*numpy.max([a[p11]-a[p00], a[p11]-a[p01], a[p11]-a[p02],
                           a[p11]-a[p10], a[p11]-a[p12], a[p11]-a[p20],
                           a[p11]-a[p21], a[p11]-a[p22]])

Does this work for you?

From ndbecker2 at gmail.com Fri Jan 9 11:59:58 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 09 Jan 2009 11:59:58 -0500 Subject: [Numpy-discussion] memmap problem Message-ID:

I modified memmap.py to avoid the issues with needing to read. It is working, but I am seeing these:

m
Exception exceptions.EnvironmentError: (22, 'Invalid argument') in ignored
Exception exceptions.EnvironmentError: (22, 'Invalid argument') in ignored
Out[22]: eos_memmap([ 0, 0, 0, ..., 255, 255, 255], dtype=uint8)

print m[0:4]
Exception exceptions.EnvironmentError: (22, 'Invalid argument') in ignored
[0 0 0 0]

What's that about?

From nicolas.roux at st.com Fri Jan 9 12:32:50 2009 From: nicolas.roux at st.com (Nicolas ROUX) Date: Fri, 9 Jan 2009 18:32:50 +0100 Subject: [Numpy-discussion] Numpy performance vs Matlab. In-Reply-To: <8c051b18e6c219c05fcc3c126f1d566d.squirrel@webmail.uio.no> Message-ID: <015501c97280$4c036100$e7ad810a@gnb.st.com>

Thanks!

-1- The code style is good, and the performance vs Matlab is good.
With 400x400:
Matlab = 1.56 sec (with nested "for" loop, so no optimization)
Numpy = 0.99 sec (with broadcasting)

-2- Now with the code below I get strange results.
With w=h=400:
- Using "slice" => 0.99 sec
- Using "numpy.ogrid" => 0.01 sec

With w=400 and h=300:
- Using "slice" => 0.719 sec
- Using "numpy.ogrid" => broadcast ERROR !
The last broadcast error is:
"ValueError: shape mismatch: objects cannot be broadcast to a single shape"

#######################################################
def test():
    w = 400

    if 1: #---Case with different w and h
        h = 300
    else: #---Case with same w and h
        h = 400

    a = numpy.zeros((h,w)) #Normally loaded with real data
    b = numpy.zeros((h,w,3))

    if 1: #---Case with SLICE
        w0 = slice(0,w-2)
        w1 = slice(1,w-1)
        w2 = slice(2,w)
        h0 = slice(0,h-2)
        h1 = slice(1,h-1)
        h2 = slice(2,h)
    else: #---Case with OGRID
        w0 = numpy.ogrid[0:w-2]
        w1 = numpy.ogrid[1:w-1]
        w2 = numpy.ogrid[2:w]
        h0 = numpy.ogrid[0:h-2]
        h1 = numpy.ogrid[1:h-1]
        h2 = numpy.ogrid[2:h]

    p00, p01, p02 = [h0,w0], [h0,w1],[h0,w2]
    p10, p11, p12 = [h1,w0], [h1,w1],[h1,w2]
    p20, p21, p22 = [h2,w0], [h2,w1],[h2,w2]

    b[p11+[1]] = a[p11] - numpy.min([a[p11]-a[p00],
                                     a[p11]-a[p01],
                                     a[p11]-a[p02],
                                     a[p11]-a[p10],
                                     a[p11]-a[p12],
                                     a[p11]-a[p20],
                                     a[p11]-a[p21],
                                     a[p11]-a[p22]]) \
        + 0.123*numpy.max([a[p11]-a[p00],
                           a[p11]-a[p01],
                           a[p11]-a[p02],
                           a[p11]-a[p10],
                           a[p11]-a[p12],
                           a[p11]-a[p20],
                           a[p11]-a[p21],
                           a[p11]-a[p22]])
#######################################################

It seems "ogrid" got better performance, but broadcasting is not working any more. Do you think it's normal that broadcasting is not possible using ogrid and different w & h? Did I miss any row/column mismatch?

Thanks.
Cheers,
Nicolas

-----Original Message----- From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of Sturla Molden Sent: Friday, January 09, 2009 4:47 PM To: Discussion of Numerical Python Subject: Re: [Numpy-discussion] Numpy performance vs Matlab.

> I simplified the code to focus only on what I need, rather than bother you
> with the full code.

def test():
    w = 3096
    h = 2048
    a = numpy.zeros((h,w), order='F') #Normally loaded with real data
    b = numpy.zeros((h,w,3), order='F')
    w0 = slice(0,w-2)
    w1 = slice(1,w-1)
    w2 = slice(2,w)
    h0 = slice(0,h-2)
    h1 = slice(1,h-1)
    h2 = slice(2,h)
    p00, p10, p20 = [h0,w0], [h1,w0], [h2,w0]
    p01, p11, p21 = [h0,w1], [h1,w1], [h2,w1]
    p02, p12, p22 = [h0,w2], [h1,w2], [h2,w2]
    b[p11 + [1]] = a[p11] + 1.23*a[p22] \
        - numpy.min([a[p11]-a[p00], a[p11]-a[p01], a[p11]-a[p02],
                     a[p11]-a[p10], a[p11]-a[p12], a[p11]-a[p20],
                     a[p11]-a[p21], a[p11]-a[p22]]) \
        + 0.123*numpy.max([a[p11]-a[p00], a[p11]-a[p01], a[p11]-a[p02],
                           a[p11]-a[p10], a[p11]-a[p12], a[p11]-a[p20],
                           a[p11]-a[p21], a[p11]-a[p22]])

Does this work for you?

From pjssilva at ime.usp.br Fri Jan 9 13:45:19 2009 From: pjssilva at ime.usp.br (Paulo J. S. Silva) Date: Fri, 09 Jan 2009 15:45:19 -0300 Subject: [Numpy-discussion] 2-D function and meshgrid Message-ID: <1231526719.28818.19.camel@trinity>

Hello,

I have a function that receives an array of shape (2,) and returns a number (a function from R^2 -> R). It basically looks like this:

def weirdDistance(x):
    return dot(dot(weirdMatrix, x), x)

(weirdMatrix is a "global" (2,2) array)

I want to see its level sets in the box [0, 1] x [0, 1], hence I have to create a meshgrid and then compute it at each point of the mesh:

x = linspace(0, 1, 200)
y = x.copy()
X, Y = meshgrid(x, y)

My problem is how to actually compute the function at each point of the mesh. I have come up with two solutions. One very short and clear, but slow, and another longer and more convoluted (it has a loop, I hate loops in numpy code), but faster.
Does anyone know a "no explicit-loops" and fast solution? Solution1: def myDistance(a, b): return weirdDistance(np.array((a, b))) vecDistance = np.vectorize(myDistance) return vecDistance(X, Y) Solution 2: nPoints = X.size result = np.zeros(nPoints) points = np.array( [X.ravel(), Y.ravel()] ).T for i in xrange(nPoints): result[i] = weirdDistance(points[i]) result = result.reshape(X.shape) Of course, the first one is slow because the myDistance function creates an array at each call. The second one, even with a loop, avoids the array creations. Best, Paulo From Chris.Barker at noaa.gov Fri Jan 9 14:13:36 2009 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Fri, 09 Jan 2009 11:13:36 -0800 Subject: [Numpy-discussion] Numpy performance vs Matlab. In-Reply-To: <015501c97280$4c036100$e7ad810a@gnb.st.com> References: <015501c97280$4c036100$e7ad810a@gnb.st.com> Message-ID: <4967A1E0.2040208@noaa.gov> Nicolas ROUX wrote: > -2- Now with the code below I have strange result. > With w=h=400: > With w=400 and h=300: > - Using "numpy.ogrid", => broadcast ERROR ! > > The last broadcast error is: > "ValueError: shape mismatch: objects cannot be broadcast to a single shape" This is probably a broadcasting error, which means your result with w=h is probably wrong. My advice: Always test with non-square arrays, to make sure you are broadcasting as you expect. Don't test with "zeros" or "ones" for your test data -- so you can look at the results and see if you are getting what you expect. I often use something like: a = numpy.arange(w*h).reshape((h,w)) I also test with very small dimensions so that I can easily print the results of each step as I develop the code With this approach you will probably figure out what's going on. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From sturla at molden.no Fri Jan 9 14:44:03 2009 From: sturla at molden.no (Sturla Molden) Date: Fri, 9 Jan 2009 20:44:03 +0100 (CET) Subject: [Numpy-discussion] Numpy performance vs Matlab. In-Reply-To: <015501c97280$4c036100$e7ad810a@gnb.st.com> References: <015501c97280$4c036100$e7ad810a@gnb.st.com> Message-ID: > -2- Now with the code below I have strange result. > With w=h=400: > - Using "slice" => 0.99 sec > - Using "numpy.ogrid" => 0.01 sec It is not equivalent. The ogrid version only uses diagonal elements, and does less work. > It seems "ogrid" got better performance, but broadcasting is not working > any > more. Broadcasting is working, but not the way you think. Ogrid is not a faster alternative to slicing. You have the same in Matlab. You can index with a slice, an array of indices, or an array of booleans. If you are going to use the second alternative, the shape of the index arrays -- in each dimension -- must equal that of the output. You cannot use a "meshgrid" with different shaped arrays of x, y and z indices. NumPy is no different from Matlab here. From charlesr.harris at gmail.com Fri Jan 9 15:37:50 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 9 Jan 2009 13:37:50 -0700 Subject: [Numpy-discussion] 2-D function and meshgrid In-Reply-To: <1231526719.28818.19.camel@trinity> References: <1231526719.28818.19.camel@trinity> Message-ID: On Fri, Jan 9, 2009 at 11:45 AM, Paulo J. S. Silva wrote: > Hello, > > I have a function that receives a array of shape (2,) and returns a > number (a function from R^2 -> R). 
It basically looks like this:
> >
> > def weirdDistance(x):
> > return dot(dot(weirdMatrix, x), x)
> >
> > (weirdMatrix is a "global" (2,2) array)
> >
> > I want to see its level sets in the box [0, 1] x [0, 1], hence I have to
> > create a meshgrid and then compute it at each point of the mesh:
> >
> > x = linspace(0, 1, 200)
> > y = x.copy()
> > X, Y = meshgrid(x, y)
> >
> > My problem is how to actually compute the function at each point of the
> > mesh. I have come up with two solutions. One very short and clear, but
> > slow, and another longer and more convoluted (it has a loop, I hate
> > loops in numpy code), but faster. Does anyone know a "no explicit-loops"
> > and fast solution?
> >
> > Solution 1:
> >
> > def myDistance(a, b):
> > return weirdDistance(np.array((a, b)))
> > vecDistance = np.vectorize(myDistance)
> > return vecDistance(X, Y)
> >
> > Solution 2:
> >
> > nPoints = X.size
> > result = np.zeros(nPoints)
> > points = np.array( [X.ravel(), Y.ravel()] ).T
> > for i in xrange(nPoints):
> > result[i] = weirdDistance(points[i])
> > result = result.reshape(X.shape)
> >
> > Of course, the first one is slow because the myDistance function creates
> > an array at each call. The second one, even with a loop, avoids the
> > array creations.
>

Try (an IDLE 1.2.4 session):

>>> import numpy as np
>>> pts = np.random.rand(5,2)
>>> mat = np.random.rand(2,2)
>>> res = (np.dot(pts,mat)*pts).sum(axis=1)
>>> res
array([ 0.63018561, 0.30829864, 0.23173343, 1.79972127, 0.69498856])
>>> for row in pts : np.dot(row,np.dot(mat,row))

0.63018560596590589
0.30829864146737423
0.23173343333294744
1.7997212735553192
0.69498855520540959

Chuck

From robert.kern at gmail.com Fri Jan 9 15:53:48 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 9 Jan 2009 14:53:48 -0600 Subject: [Numpy-discussion] memmap problem In-Reply-To: References: Message-ID: <3d375d730901091253n597a294au13d9c32ef76463e5@mail.gmail.com>

On Fri, Jan 9, 2009 at 10:59, Neal Becker wrote:
> I modified memmap.py to avoid the issues with needing to read. It is working, but I am seeing these:
>
> m
> Exception exceptions.EnvironmentError: (22, 'Invalid argument') in ignored
> Exception exceptions.EnvironmentError: (22, 'Invalid argument') in ignored
> Out[22]: eos_memmap([ 0, 0, 0, ..., 255, 255, 255], dtype=uint8)
>
> print m[0:4]
> Exception exceptions.EnvironmentError: (22, 'Invalid argument') in ignored
> [0 0 0 0]
>
> What's that about?

Can you show us your modifications? You may need to also modify __array_finalize__(), _close(), and __del__().

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From robert.kern at gmail.com Fri Jan 9 16:10:04 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 9 Jan 2009 15:10:04 -0600 Subject: [Numpy-discussion] memmap from fd? In-Reply-To: References: <3d375d730901090410h3b5f57caqe8a6b639809047a4@mail.gmail.com> Message-ID: <3d375d730901091310v27033d5co1fbdc71098b9e672@mail.gmail.com>

On Fri, Jan 9, 2009 at 08:08, Neal Becker wrote:
> Robert Kern wrote:
>
>> On Fri, Jan 9, 2009 at 06:05, Neal Becker wrote:
>>> I'm working on interfacing to a custom FPGA board. The kernel driver
>>> exposes the FPGA memory via mmap.
>>>
>>> It might be nice to use numpy memmap to read/write data. One issue is
>>> that I think I will need to create the memmap array from a fd, not a file
The reason is I wrote the driver to only allow 1 exclusive open, >>> and I already have it open for other reasons. Any chance to create a >>> memmap array from a fd? >> >> Use os.fdopen(fd) to create a file object which can be passed to the >> memmap constructor. >> > > Looks like this is not going to work without some change to memmap. The problem is, I need read/write access. The only choice in memmap is 'w+'. 'r+' is for reading and writing. > But this does: > if (mode == 'w+') and shape is None: > raise ValueError, "shape must be given" > > fid.seek(0,2) > > My device has hijacked 'read' to mean something entirely different than you might expect. The seek call invokes 'read'. > > It looks like the purpose of this code is to find the size of the mappable area. The best solution I think is just throw it away. We can't. We need it. > Consistent with mmap semantics, attempting access outside the mappable area should cause and error - but I don't think there is any reliable way to know the length of the mappable area apriori. For regular files, that seems to me to be fairly reliable. Why isn't it? -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From robert.kern at gmail.com Fri Jan 9 16:20:59 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 9 Jan 2009 15:20:59 -0600 Subject: [Numpy-discussion] Numpy performance vs Matlab. In-Reply-To: <015501c97280$4c036100$e7ad810a@gnb.st.com> References: <8c051b18e6c219c05fcc3c126f1d566d.squirrel@webmail.uio.no> <015501c97280$4c036100$e7ad810a@gnb.st.com> Message-ID: <3d375d730901091320x5bab1fabs9bfe740f10914f08@mail.gmail.com> On Fri, Jan 9, 2009 at 11:32, Nicolas ROUX wrote: > Thanks ! > > -1- The code style is good and the performance vs matlab is good. > With 400x400: > Matlab = 1.56 sec (with nested "for" loop, so no optimization) > Numpy = 0.99 sec (with broadcasting) > > > -2- Now with the code below I have strange result. > With w=h=400: > - Using "slice" => 0.99 sec > - Using "numpy.ogrid" => 0.01 sec > > With w=400 and h=300: > - Using "slice", => 0.719sec > - Using "numpy.ogrid", => broadcast ERROR ! 
> > The last broadcast error is: > "ValueError: shape mismatch: objects cannot be broadcast to a single shape" > > ####################################################### > def test(): > w = 400 > > if 1: #---Case with different w and h > h = 300 > else: #---Case with same w and h > h = 400 > > a = numpy.zeros((h,w)) #Normally loaded with real data > b = numpy.zeros((h,w,3)) > > if 1: #---Case with SLICE > w0 = slice(0,w-2) > w1 = slice(1,w-1) > w2 = slice(2,w) > h0 = slice(0,h-2) > h1 = slice(1,h-1) > h2 = slice(2,h) > else: #---Case with OGRID > w0 = numpy.ogrid[0:w-2] > w1 = numpy.ogrid[1:w-1] > w2 = numpy.ogrid[2:w] > h0 = numpy.ogrid[0:h-2] > h1 = numpy.ogrid[1:h-1] > h2 = numpy.ogrid[2:h] > > p00, p01, p02 = [h0,w0], [h0,w1],[h0,w2] > p10, p11, p12 = [h1,w0], [h1,w1],[h1,w2] > p20, p21, p22 = [h2,w0], [h2,w1],[h2,w2] > > b[p11+[1]] = a[p11] - numpy.min([a[p11]-a[p00], > a[p11]-a[p01], > a[p11]-a[p02], > a[p11]-a[p10], > a[p11]-a[p12], > a[p11]-a[p20], > a[p11]-a[p21], > a[p11]-a[p22]]) \ > + 0.123*numpy.max([a[p11]-a[p00], > a[p11]-a[p01], > a[p11]-a[p02], > a[p11]-a[p10], > a[p11]-a[p12], > a[p11]-a[p20], > a[p11]-a[p21], > a[p11]-a[p22]]) > ####################################################### > > It seems "ogrid" got better performance, but broadcasting is not working any > more. > Do you think it's normal that broadcast is not possible using ogrid and > different w & h ? > Did I missed any row/colomn missmatch ??? There are several things wrong. Please read this document for information about how indexing works in numpy. http://docs.scipy.org/doc/numpy/user/basics.indexing.html But basically, you want slices. Using ogrid correctly will be slower. FWIW, ogrid with only one argument is fairly pointless. ogrid is intended to be used with multiple dimensions. If you just need one argument, use arange() or linspace(). With multiple arguments, ogrid will align the arrays such that they can be broadcasted as you expect. Lets take a look at some examples: In [1]: from numpy import * In [2]: ogrid[0:5] Out[2]: array([0, 1, 2, 3, 4]) In [3]: ogrid[0:6] Out[3]: array([0, 1, 2, 3, 4, 5]) Two 1D arrays. Now, if you follow the discussion in the indexing document, you know that if I were to use these as index arrays, one for each axis, the indexing mechanism will try to iterate over them in parallel. Since they have incompatible shapes, this will fail. Instead, if you put both arguments into ogrid: In [4]: ogrid[0:5, 0:6] Out[4]: [array([[0], [1], [2], [3], [4]]), array([[0, 1, 2, 3, 4, 5]])] We get the kind of arrays you need. These shapes are compatible, through broadcasting, and together form the indices to select out the part of the matrix you are interested in. However, just using the slices on the matrix instead of passing the slices through ogrid is faster. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." 
-- Umberto Eco From robert.kern at gmail.com Fri Jan 9 16:31:36 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 9 Jan 2009 15:31:36 -0600 Subject: [Numpy-discussion] inplace matrix multiplication In-Reply-To: <2d1d7fe70901090625k53fe9415uf177e2d80645f12c@mail.gmail.com> References: <2d1d7fe70901090625k53fe9415uf177e2d80645f12c@mail.gmail.com> Message-ID: <3d375d730901091331u26a0d6dcw8dffca7349fec96@mail.gmail.com> On Fri, Jan 9, 2009 at 08:25, Fr?d?ric Bastien wrote: > Hi, > > I would like to know how I can make a call to the blas function gemm in > numpy. I need a multiply and accumulate for matrix and I don't want to > allocate a new matrix each time I do it. You can't in numpy. With scipy.linalg.fblas.dgemm() and the right arguments, you can. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From Chris.Barker at noaa.gov Fri Jan 9 16:40:25 2009 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Fri, 09 Jan 2009 13:40:25 -0800 Subject: [Numpy-discussion] Numpy performance vs Matlab. In-Reply-To: <3d375d730901091320x5bab1fabs9bfe740f10914f08@mail.gmail.com> References: <8c051b18e6c219c05fcc3c126f1d566d.squirrel@webmail.uio.no> <015501c97280$4c036100$e7ad810a@gnb.st.com> <3d375d730901091320x5bab1fabs9bfe740f10914f08@mail.gmail.com> Message-ID: <4967C449.9060508@noaa.gov> Robert Kern wrote: > Instead, if you put both arguments into ogrid: > > In [4]: ogrid[0:5, 0:6] > Out[4]: > [array([[0], > [1], > [2], > [3], > [4]]), > array([[0, 1, 2, 3, 4, 5]])] > > We get the kind of arrays you need. These shapes are compatible, > through broadcasting, and together form the indices to select out the > part of the matrix you are interested in. > > However, just using the slices on the matrix instead of passing the > slices through ogrid is faster. So what is ogrid useful for? Just curious... -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From robert.kern at gmail.com Fri Jan 9 16:47:04 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 9 Jan 2009 15:47:04 -0600 Subject: [Numpy-discussion] Numpy performance vs Matlab. In-Reply-To: <4967C449.9060508@noaa.gov> References: <8c051b18e6c219c05fcc3c126f1d566d.squirrel@webmail.uio.no> <015501c97280$4c036100$e7ad810a@gnb.st.com> <3d375d730901091320x5bab1fabs9bfe740f10914f08@mail.gmail.com> <4967C449.9060508@noaa.gov> Message-ID: <3d375d730901091347s77f0a9f7l4898c8d9bd77b90b@mail.gmail.com> On Fri, Jan 9, 2009 at 15:40, Christopher Barker wrote: > So what is ogrid useful for? > > Just curious... Floating point grids. x, y = ogrid[0:1:101j, 0:1:101j] -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From sturla at molden.no Fri Jan 9 16:49:27 2009 From: sturla at molden.no (Sturla Molden) Date: Fri, 9 Jan 2009 22:49:27 +0100 (CET) Subject: [Numpy-discussion] Numpy performance vs Matlab. 
In-Reply-To: <4967C449.9060508@noaa.gov> References: <8c051b18e6c219c05fcc3c126f1d566d.squirrel@webmail.uio.no> <015501c97280$4c036100$e7ad810a@gnb.st.com> <3d375d730901091320x5bab1fabs9bfe740f10914f08@mail.gmail.com> <4967C449.9060508@noaa.gov> Message-ID:

>> However, just using the slices on the matrix instead of passing the
>> slices through ogrid is faster.
>
> So what is ogrid useful for?

For the same problems where you would use meshgrid in Matlab. Certain graphics problems, for example; e.g. evaluating a surface z = f(x,y) over a grid of x,y values.

From Chris.Barker at noaa.gov Fri Jan 9 17:04:37 2009 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Fri, 09 Jan 2009 14:04:37 -0800 Subject: [Numpy-discussion] Numpy performance vs Matlab. In-Reply-To: References: <8c051b18e6c219c05fcc3c126f1d566d.squirrel@webmail.uio.no> <015501c97280$4c036100$e7ad810a@gnb.st.com> <3d375d730901091320x5bab1fabs9bfe740f10914f08@mail.gmail.com> <4967C449.9060508@noaa.gov> Message-ID: <4967C9F5.8070704@noaa.gov>

Sturla Molden wrote:
> For the same problems where you would use meshgrid in Matlab.

well, I used to use meshgrid a lot because MATLAB could not do broadcasting. Which is probably why the OP has been trying to use it.

A note for the docs:

The docs refer to ogrid and nd_grid, and as far as I can tell, they are the same thing, but it's confusing. Actually, as I look more, ogrid is an nd_grid with the sparse parameter set to True -- anyway, still confusing!

Also, what is the "o" for -- the mnemonic might be helpful

thanks,
-Chris

--
Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov

From reakinator at gmail.com Fri Jan 9 17:11:04 2009 From: reakinator at gmail.com (Rich E) Date: Fri, 9 Jan 2009 23:11:04 +0100 Subject: [Numpy-discussion] help with typemapping a C function to use numpy arrays In-Reply-To: References: Message-ID:

Well I see it works, however with one change: the %apply typemaps need to be done before %include'ing the header file, or else nothing in that header file will automatically get typemapped (only the functions that are written using %inline will be typemapped, which in the case of the example you wrote, is all there is. This isn't the case in the library I am writing; there are about 20-30 others that don't need to be written inline).

One comment though, upon looking at the wrapped c file: there are two definitions for sms_spectrumMag(), one that expects pointers to c arrays as arguments, and then one later that expects numpy arrays as arguments. Upon testing the function in python, it seems there is no conflict; supplying just a numpy array works perfectly fine. I don't understand how python is interpreting this, so I cannot foresee any problems; I'm just bringing it up in case others can.

So now I can wrap the functions in a way where they are very useful in python without messing up the c code. Great, thank you Egor for the ongoing help! Sorry it took your vacation, but you helped me spend mine the way I wanted to (successful programming, hehe).

cheers,
Rich

On Fri, Jan 9, 2009 at 7:05 AM, Egor Zindy wrote:
> Hello Rich,
>
> I know what you mean. %inclusion of header files saves a lot of effort!
So,
> I had another play with the code (what holiday this turned out to be;) and
> as long as the declarations in the .i file are made in the right order, it
> should be possible to:
> * %include the header file
> * %ignore a sms_ function
> * %rename the function my_ to sms_
> * %inline the my_ function
>
> I changed the .i file (attached) and re-ran the test, it works. Again, this
> is on my XP/cygwin/mingw32 system, so it could need some tuning on a
> different system!
>
> In all this, not sure where is best to put the %exception statement, but
> placement shouldn't be critical, because it concerns the my_ function rather
> than the original (or renamed) sms_ function.
>
> Regards,
> Egor
>
> On Fri, Jan 9, 2009 at 5:43 AM, Rich E wrote:
>>
>> I am using %include "sms.h", which is what is wrapping all my
>> functions. Without doing this, I have to hand-wrap every function in
>> the header file!
>>
>> Is there a way to exclude certain definitions from my c header file
>> when using %include, so that I can hand wrap them instead?
>>
>> On Thu, Jan 8, 2009 at 2:13 AM, Egor Zindy wrote:
>> > Hello Rich,
>> >
>> > This is very strange. I got to test my example again, as long as you
>> > don't
>> > $ python test_dftmagnitude.py >> > >> > Regards, >> > Egor >> > >> > -- >> > My Python: >> > $ python -i >> > Python 2.5.4 (r254:67916, Dec 23 2008, 15:10:54) [MSC v.1310 32 bit >> > (Intel)] >> > on win32 >> > >> > My SWIG: >> > $ swig -version >> > >> > SWIG Version 1.3.36 >> > >> > Compiled with g++ [i686-pc-cygwin] >> > Please see http://www.swig.org for reporting bugs and further >> > information >> > >> > >> > >> > >> > On Thu, Jan 8, 2009 at 1:43 AM, Rich E wrote: >> >> >> >> Here is my example, trying to wrap the function sms_spectrumMag that >> >> we have been dealing with: >> >> >> >> %apply (int DIM1, float* IN_ARRAY1) {(int sizeInArray, float* >> >> pInArray)}; >> >> %apply (int DIM1, float* INPLACE_ARRAY1) {(int sizeOutArray, float* >> >> pOutArray)}; >> >> >> >> %inline %{ >> >> >> >> void my_spectrumMag( int sizeInArray, float *pInArray, int >> >> sizeOutArray, float *pOutArray) >> >> { >> >> sms_spectrumMag(sizeOutArray, pInArray, pOutArray); >> >> } >> >> >> >> %} >> >> >> >> >> >> at this point, have the new function my_spectrumMag that wraps >> >> sms_spectrumMag() and provides arguments that can be typemapped using >> >> numpy.i Now, I don't want to have to call the function >> >> my_spectrumMag() in python, I want to use the original name, I would >> >> like to call the function as: >> >> >> >> sms_spectrumMag(numpyArray1, numpyArray2) >> >> >> >> But, trying to %rename my_spectrumMag to sms_spectrumMag does not >> >> work, the original sms_spectrumMag gets called in python instead. >> >> Trying to %ignore the original function first as follows removes the >> >> sms_spectrumMag completely from the module and I am left with >> >> my_spectrumMag: >> >> >> >> %ignore sms_spectrumMag; >> >> %rename (sms_spectrumMag) my_spectrumMag; >> >> >> >> >> >> Do you see my problem? >> >> >> >> >> >> On Wed, Jan 7, 2009 at 8:58 AM, Matthieu Brucher >> >> wrote: >> >> > 2009/1/6 Rich E : >> >> >> This helped immensely. I feel like I am getting close to being able >> >> >> to accomplish what I would like with SWIG: producing a python module >> >> >> that can be very 'python-like', while co-existing with the c library >> >> >> that is very 'c-like'. >> >> >> >> >> >> There is one question still remaining though, is it possible to make >> >> >> the wrapped function have the same name still? Using either >> >> >> my_spectrumMag or spectrumMag means I have to create a number of >> >> >> inconsistencies between the python module and the c library. It is >> >> >> ideal to ignore (%ignore?) the c sms_spectrumMag and instead use the >> >> >> wrapped one, with the same name. But my attempts at doing this so >> >> >> far >> >> >> have not compiled because of name conflictions. >> >> > >> >> > Ok course you can. The function is renamed only if you say so. >> >> > Perhaps >> >> > can you provide a small example of what doesn't work at the moment ? >> >> > >> >> >> Thanks for the help, I think you are doing great things with this >> >> >> numpy interface/typemaps system. >> >> > >> >> > Matthieu >> >> > -- >> >> > Information System Engineer, Ph.D. 
>> >> > Website: http://matthieu-brucher.developpez.com/ >> >> > Blogs: http://matt.eifelle.com and >> >> > http://blog.developpez.com/?blog=92 >> >> > LinkedIn: http://www.linkedin.com/in/matthieubrucher >> >> > _______________________________________________ >> >> > Numpy-discussion mailing list >> >> > Numpy-discussion at scipy.org >> >> > http://projects.scipy.org/mailman/listinfo/numpy-discussion >> >> > >> >> _______________________________________________ >> >> Numpy-discussion mailing list >> >> Numpy-discussion at scipy.org >> >> http://projects.scipy.org/mailman/listinfo/numpy-discussion >> > >> > >> > _______________________________________________ >> > Numpy-discussion mailing list >> > Numpy-discussion at scipy.org >> > http://projects.scipy.org/mailman/listinfo/numpy-discussion >> > >> > > > From robert.kern at gmail.com Fri Jan 9 17:12:02 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 9 Jan 2009 16:12:02 -0600 Subject: [Numpy-discussion] Numpy performance vs Matlab. In-Reply-To: <4967C9F5.8070704@noaa.gov> References: <8c051b18e6c219c05fcc3c126f1d566d.squirrel@webmail.uio.no> <015501c97280$4c036100$e7ad810a@gnb.st.com> <3d375d730901091320x5bab1fabs9bfe740f10914f08@mail.gmail.com> <4967C449.9060508@noaa.gov> <4967C9F5.8070704@noaa.gov> Message-ID: <3d375d730901091412j3d82867dn85d21ab4e9da0818@mail.gmail.com> On Fri, Jan 9, 2009 at 16:04, Christopher Barker wrote: > Sturla Molden wrote: >> For the same problems where you would use meshgrid in Matlab. > > well, I used to use meshgrid a lot because MATLAB could not do > broadcasting. Which is probably why the OP has been trying to use it. > > A note for the docs: > > The docs refer to ogrid and nd_grid, and as far as I can tell, they are > the same thing, but it's confusing. actuall, as I look more, ogrid is a > nd_grid with the sparse parameter set to True -- anyway, still confusing! I'm not sure why. The docstring seems fairly clear about this. > Also, what is the "o" for -- the mnemonic might be helpful "open". mgrid and ogrid are instantiations of the nd_grid class with different parameters. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From sturla at molden.no Fri Jan 9 17:27:14 2009 From: sturla at molden.no (Sturla Molden) Date: Fri, 9 Jan 2009 23:27:14 +0100 (CET) Subject: [Numpy-discussion] Numpy performance vs Matlab. In-Reply-To: <4967C9F5.8070704@noaa.gov> References: <8c051b18e6c219c05fcc3c126f1d566d.squirrel@webmail.uio.no> <015501c97280$4c036100$e7ad810a@gnb.st.com> <3d375d730901091320x5bab1fabs9bfe740f10914f08@mail.gmail.com> <4967C449.9060508@noaa.gov> <4967C9F5.8070704@noaa.gov> Message-ID: <142c3d71d99a9a9c8e22c174e3786c0a.squirrel@webmail.uio.no> > Sturla Molden wrote: >> For the same problems where you would use meshgrid in Matlab. > > well, I used to use meshgrid a lot because MATLAB could not do > broadcasting. Which is probably why the OP has been trying to use it. mgrid and ogrid are both meshgrids, with ogrid having a sparse representation (i.e. it uses less memory). There was no need for broadcasting in the OP's code. It just seemed a bit confused. Regards, Sturla Molden From pjssilva at ime.usp.br Fri Jan 9 21:10:37 2009 From: pjssilva at ime.usp.br (Paulo J. S. 
Silva) Date: Fri, 09 Jan 2009 23:10:37 -0300 Subject: [Numpy-discussion] 2-D function and meshgrid In-Reply-To: References: <1231526719.28818.19.camel@trinity> Message-ID: <1231553437.10173.27.camel@trinity>

Chuck,

Thanks, your version is much faster. I would prefer a solution that doesn't force me to re-implement weirdDistance (as my two solutions did). But the function is so simple that it is easier just to re-write it for speed as you did.

By the way, I came up with one more solution that looks more Pythonic and does not need to re-write weirdDistance (and hence can be used in more complicated cases). It is also a tad faster than the fastest solution from my first post:

Solution 3

points = np.vstack( [x.ravel(), y.ravel()] ).T
results = np.array([weirdDistance(p) for p in points])
return results.reshape(x.shape)

(This is basically solution 2 using list comprehensions to make the code clearer)

best,

Paulo

> > Try
> > IDLE 1.2.4
> >>> import numpy as np
> >>> pts = np.random.rand(5,2)
> >>> mat = np.random.rand(2,2)
> >>> res = (np.dot(pts,mat)*pts).sum(axis=1)
> >>> res
> array([ 0.63018561, 0.30829864, 0.23173343, 1.79972127, 0.69498856])
> >>> for row in pts : np.dot(row,np.dot(mat,row))
>
> 0.63018560596590589
> 0.30829864146737423
> 0.23173343333294744
> 1.7997212735553192
> 0.69498855520540959
>
> Chuck
>

From ezindy at gmail.com Fri Jan 9 21:29:52 2009 From: ezindy at gmail.com (Egor Zindy) Date: Sat, 10 Jan 2009 11:29:52 +0900 Subject: [Numpy-discussion] help with typemapping a C function to use numpy arrays In-Reply-To: References: Message-ID:

Hello again!

On Sat, Jan 10, 2009 at 7:11 AM, Rich E wrote:
> Well I see it works, however with one change: the %apply typemaps need
> to be done before %include'ing the header file, or else nothing in
> that header file will automatically get typemapped (only the functions
> that are written using %inline will be typemapped, which in the case
> of the example you wrote, is all there is. This isn't the case in the
> library I am writing, there are about 20-30 others that don't need to
> be written inline).

There were no typemaps in the example you gave, so it's nice to know you found where to put the typemap definition lines with respect to everything else (this should definitely go in the cookbook!)

> One comment though, upon looking at the wrapped c file: there are two
> definitions for sms_spectrumMag(), one that expects pointers to c
> arrays as arguments, and then one later that expects numpy arrays as
> arguments.

I just had a look at the dftmagnitude_wrap.c and I'm confused. If you understand what's going on, maybe you can shed some light? The definition for python is contained here:

static PyMethodDef SwigMethods[] = {
    { (char *)"sms_spectrumMag", _wrap_sms_spectrumMag, METH_VARARGS, NULL},
    { NULL, NULL, 0, NULL }
};

The function name on the python side is "sms_spectrumMag". The _wrap_sms_spectrumMag function then calls either _wrap_sms_spectrumMag__SWIG_1 (argc == 1) or _wrap_sms_spectrumMag__SWIG_0 (argc == 3) depending on

argc = (int)PyObject_Length(args);

.... This is where things become complicated... _wrap_sms_spectrumMag__SWIG_1 calls my_spectrumMag, the inlined function, but _wrap_sms_spectrumMag__SWIG_0 doesn't! _wrap_sms_spectrumMag__SWIG_0 calls sms_spectrumMag directly with the 3 arguments converted from python: int, float * and float *! Isn't this what we were trying to avoid in the first place? Is there any way to instruct SWIG not to wrap the function directly?
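For reference, the Python-side call we are talking about looks something like this. A minimal sketch only: the array sizes are made up, the call shape follows Rich's earlier sms_spectrumMag(numpyArray1, numpyArray2) example, and which overload the dispatcher actually picks is exactly the open question above:

import numpy
import dftmagnitude   # the test module built in this thread

spec = numpy.zeros(8, dtype=numpy.float32)   # input spectrum (placeholder data)
mag = numpy.zeros(4, dtype=numpy.float32)    # output buffer, filled in place
# With numpy arrays as arguments, the typemapped (%inline'd) overload is
# the one that should be selected rather than the raw 3-argument C entry.
dftmagnitude.sms_spectrumMag(spec, mag)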
> Upon testing the function in python, it seems there is no > conflict; supplying just a numpy array works perfectly fine. I dont' > understand how python is interpreting this so I cannot foresee any > problems, I'm just bringing it up in case others can. > I don't understand either. I thought I did, though! My understanding is that the function name in python is just a string and can be anything instructed by the %rename statement. What I don't understand is that SWIG still tries to wrap the original function, even though we are only interested in wrapping the %inline-d one. > > So now I can wrap the functions in a way where they are very useful in > python without messing up the c code. Yup, if you call the python function with the intended number of arguments (and don't end-up calling the wrapped original function). > Great, thank you Egor for the > ongoing help! Sorry it took your vacation, but you helped me spend > mine the way I wanted to (successful programming, hehe). Don't mention it! I call that a "productive" holiday ;-) Regards, Egor > > cheers, > Rich > > On Fri, Jan 9, 2009 at 7:05 AM, Egor Zindy wrote: > > Hello Rich, > > > > I know what you mean. %inclusion of header files saves a lot of effort! > So, > > I had another play with the code (what holiday this turned out to be;) > and > > as long as the declarations in the .i file are made in the right order, > it > > should be possible to: > > * %include the header file > > * %ignore a sms_ function > > * %rename the function my_ to sms_ > > * %inline the my_ function > > > > I changed the .i file (attached) and re-ran the test, it works. Again, > this > > is on my XP/cygwin/mingw32 system, so it could need some tuning on a > > different system! > > > > In all this, not sure where is best to put the %exception statement, but > > placement shouldn't be critical, because it concerns the my_ function > rather > > than the original (or renamed) sms_ function. > > > > Regards, > > Egor > > > > On Fri, Jan 9, 2009 at 5:43 AM, Rich E wrote: > >> > >> I am using %includ "sms.h", which is what is wrapping all my > >> functions. Without doing this, I have to hand-wrap every function in > >> the header file! > >> > >> Is there a way to exclude certain definitions from my c header file > >> when using %include, so that I can hand wrap them instead? > >> > >> On Thu, Jan 8, 2009 at 2:13 AM, Egor Zindy wrote: > >> > Hello Rich, > >> > > >> > This is very strange. I got to test my example again, as long as you > >> > don't > >> > do a > >> > %include "dftmagnitude.h" > >> > somewhere in the dftmagnitude.i, it's perfectly possible to do a > >> > %rename (sms_spectrumMag) my_spectrumMag; > >> > (see dftmagnitude3.zip attached in my previous mail and this one). > >> > > >> > So things for you to check: > >> > * does the simple dftmagnitude3.zip compile on your system? > >> > * what version of SWIG are you using? (I used 1.3.36 provided with > >> > cygwin) > >> > * do you have a %include statement somewhere in your own .i file? > >> > > >> > Matthieu, if you read this, there's a complete example provided in > >> > dftmagnitude3.zip. > >> > * Wrapped function sms_spectrumMag in dftmagnitude.c and .h > >> > * SWIG wrapper dftmagnitude.i uses %inline and %rename statements > >> > * Example uses a modified numpy.i (see the previous mails in the > >> > thread). 
> >> > * test example provided in test_dftmagnitude.py > >> > > >> > Haven't tested it under Linux, but under winxp/cygwin/mingw32, the > >> > following > >> > works for me (in cygwin): > >> > > >> > $ python setup_dftmagnitude.py build -cmingw32 ; mv > >> > build/lib.win32-2.5/_dftmagnitude.pyd . > >> > $ python test_dftmagnitude.py > >> > > >> > Regards, > >> > Egor > >> > > >> > -- > >> > My Python: > >> > $ python -i > >> > Python 2.5.4 (r254:67916, Dec 23 2008, 15:10:54) [MSC v.1310 32 bit > >> > (Intel)] > >> > on win32 > >> > > >> > My SWIG: > >> > $ swig -version > >> > > >> > SWIG Version 1.3.36 > >> > > >> > Compiled with g++ [i686-pc-cygwin] > >> > Please see http://www.swig.org for reporting bugs and further > >> > information > >> > > >> > > >> > > >> > > >> > On Thu, Jan 8, 2009 at 1:43 AM, Rich E wrote: > >> >> > >> >> Here is my example, trying to wrap the function sms_spectrumMag that > >> >> we have been dealing with: > >> >> > >> >> %apply (int DIM1, float* IN_ARRAY1) {(int sizeInArray, float* > >> >> pInArray)}; > >> >> %apply (int DIM1, float* INPLACE_ARRAY1) {(int sizeOutArray, float* > >> >> pOutArray)}; > >> >> > >> >> %inline %{ > >> >> > >> >> void my_spectrumMag( int sizeInArray, float *pInArray, int > >> >> sizeOutArray, float *pOutArray) > >> >> { > >> >> sms_spectrumMag(sizeOutArray, pInArray, pOutArray); > >> >> } > >> >> > >> >> %} > >> >> > >> >> > >> >> at this point, I have the new function my_spectrumMag that wraps > >> >> sms_spectrumMag() and provides arguments that can be typemapped using > >> >> numpy.i Now, I don't want to have to call the function > >> >> my_spectrumMag() in python, I want to use the original name, I would > >> >> like to call the function as: > >> >> > >> >> sms_spectrumMag(numpyArray1, numpyArray2) > >> >> > >> >> But, trying to %rename my_spectrumMag to sms_spectrumMag does not > >> >> work, the original sms_spectrumMag gets called in python instead. > >> >> Trying to %ignore the original function first as follows removes the > >> >> sms_spectrumMag completely from the module and I am left with > >> >> my_spectrumMag: > >> >> > >> >> %ignore sms_spectrumMag; > >> >> %rename (sms_spectrumMag) my_spectrumMag; > >> >> > >> >> > >> >> Do you see my problem? > >> >> > >> >> > >> >> On Wed, Jan 7, 2009 at 8:58 AM, Matthieu Brucher > >> >> wrote: > >> >> > 2009/1/6 Rich E : > >> >> >> This helped immensely. I feel like I am getting close to being able > >> >> >> to accomplish what I would like with SWIG: producing a python module > >> >> >> that can be very 'python-like', while co-existing with the c library > >> >> >> that is very 'c-like'. > >> >> >> > >> >> >> There is one question still remaining though, is it possible to make > >> >> >> the wrapped function have the same name still? Using either > >> >> >> my_spectrumMag or spectrumMag means I have to create a number of > >> >> >> inconsistencies between the python module and the c library. It is > >> >> >> ideal to ignore (%ignore?) the c sms_spectrumMag and instead use the > >> >> >> wrapped one, with the same name. But my attempts at doing this so > >> >> >> far > >> >> >> have not compiled because of name conflicts. > >> >> > > >> >> > Of course you can. The function is renamed only if you say so. > >> >> > Perhaps you can > >> >> > provide a small example of what doesn't work at the moment? > >> >> > > >> >> >> Thanks for the help, I think you are doing great things with this > >> >> >> numpy interface/typemaps system.
> >> >> > > >> >> > Matthieu > >> >> > -- > >> >> > Information System Engineer, Ph.D. > >> >> > Website: http://matthieu-brucher.developpez.com/ > >> >> > Blogs: http://matt.eifelle.com and > >> >> > http://blog.developpez.com/?blog=92 > >> >> > LinkedIn: http://www.linkedin.com/in/matthieubrucher From ndbecker2 at gmail.com Fri Jan 9 23:15:27 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 09 Jan 2009 23:15:27 -0500 Subject: [Numpy-discussion] memmap from fd? References: <3d375d730901090410h3b5f57caqe8a6b639809047a4@mail.gmail.com> <3d375d730901091310v27033d5co1fbdc71098b9e672@mail.gmail.com> Message-ID: Robert Kern wrote: > On Fri, Jan 9, 2009 at 08:08, Neal Becker wrote: >> Robert Kern wrote: >> >>> On Fri, Jan 9, 2009 at 06:05, Neal Becker wrote: >>>> I'm working on interfacing to a custom FPGA board. The kernel driver >>>> exposes the FPGA memory via mmap. >>>> >>>> It might be nice to use numpy memmap to read/write data. One issue is >>>> that I think I will need to create the memmap array from a fd, not a >>>> file >>>> name. The reason is I wrote the driver to only allow 1 exclusive open, >>>> and I already have it open for other reasons. Any chance to create a >>>> memmap array from a fd? >>> >>> Use os.fdopen(fd) to create a file object which can be passed to the >>> memmap constructor. >>> >> >> Looks like this is not going to work without some change to memmap. The >> problem is, I need read/write access. The only choice in memmap is 'w+'. > > 'r+' is for reading and writing. > >> But this does: >> if (mode == 'w+') and shape is None: >> raise ValueError, "shape must be given" >> >> fid.seek(0,2) >> >> My device has hijacked 'read' to mean something entirely different than >> you might expect. The seek call invokes 'read'. >> >> It looks like the purpose of this code is to find the size of the >> mappable area. The best solution, I think, is just to throw it away. > > We can't. We need it. > >> Consistent with mmap semantics, attempting access outside the mappable >> area should cause an error - but I don't think there is any reliable way >> to know the length of the mappable area a priori. > > For regular files, that seems to me to be fairly reliable. Why isn't it? > Because I'm not mmapping a file. I'm mmapping a device. It exposes the memory of the FPGA board as seen on the PCI bus. You just have to know the size. From ndbecker2 at gmail.com Sat Jan 10 13:41:59 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Sat, 10 Jan 2009 13:41:59 -0500 Subject: [Numpy-discussion] memmap problem References: <3d375d730901091253n597a294au13d9c32ef76463e5@mail.gmail.com> Message-ID: Robert Kern wrote: > On Fri, Jan 9, 2009 at 10:59, Neal Becker wrote: >> I modified memmap.py to avoid the issues with needing to read.
It is >> working, but I am seeing these: >> >> m >> Exception exceptions.EnvironmentError: (22, 'Invalid argument') in <bound >> method eos_memmap.__del__ of eos_memmap([255, 255, 255], dtype=uint8)> >> ignored Exception exceptions.EnvironmentError: (22, 'Invalid argument') >> in <bound method eos_memmap.__del__ of eos_memmap([...], >> dtype=uint8)> ignored >> Out[22]: eos_memmap([ 0, 0, 0, ..., 255, 255, 255], dtype=uint8) >> >> print m[0:4] >> Exception exceptions.EnvironmentError: (22, 'Invalid argument') in <bound >> method eos_memmap.__del__ of eos_memmap([0, 0, 0, 0], dtype=uint8)> >> ignored >> [0 0 0 0] >> >> What's that about? > > Can you show us your modifications? You may need to also modify > __array_finalize__(), _close(), and __del__(). > Problem was with __del__ calling flush. For whatever reason, flush is bad on my device. I simply removed the call. From robert.kern at gmail.com Sat Jan 10 20:41:47 2009 From: robert.kern at gmail.com (Robert Kern) Date: Sat, 10 Jan 2009 19:41:47 -0600 Subject: [Numpy-discussion] memmap from fd? In-Reply-To: References: <3d375d730901090410h3b5f57caqe8a6b639809047a4@mail.gmail.com> <3d375d730901091310v27033d5co1fbdc71098b9e672@mail.gmail.com> Message-ID: <3d375d730901101741p22facd24qe6b391a0b023bd6f@mail.gmail.com> On Fri, Jan 9, 2009 at 22:15, Neal Becker wrote: > Robert Kern wrote: > >> On Fri, Jan 9, 2009 at 08:08, Neal Becker wrote: >>> Robert Kern wrote: >>> >>>> On Fri, Jan 9, 2009 at 06:05, Neal Becker wrote: >>>>> I'm working on interfacing to a custom FPGA board. The kernel driver >>>>> exposes the FPGA memory via mmap. >>>>> >>>>> It might be nice to use numpy memmap to read/write data. One issue is >>>>> that I think I will need to create the memmap array from a fd, not a >>>>> file >>>>> name. The reason is I wrote the driver to only allow 1 exclusive open, >>>>> and I already have it open for other reasons. Any chance to create a >>>>> memmap array from a fd? >>>> >>>> Use os.fdopen(fd) to create a file object which can be passed to the >>>> memmap constructor. >>>> >>> >>> Looks like this is not going to work without some change to memmap. The >>> problem is, I need read/write access. The only choice in memmap is 'w+'. >> >> 'r+' is for reading and writing. >> >>> But this does: >>> if (mode == 'w+') and shape is None: >>> raise ValueError, "shape must be given" >>> >>> fid.seek(0,2) >>> >>> My device has hijacked 'read' to mean something entirely different than >>> you might expect. The seek call invokes 'read'. >>> >>> It looks like the purpose of this code is to find the size of the >>> mappable area. The best solution, I think, is just to throw it away. >> >> We can't. We need it. >> >>> Consistent with mmap semantics, attempting access outside the mappable >>> area should cause an error - but I don't think there is any reliable way >>> to know the length of the mappable area a priori. >> >> For regular files, that seems to me to be fairly reliable. Why isn't it? >> > Because I'm not mmapping a file. I'm mmapping a device. It exposes the memory of the FPGA board as seen on the PCI bus. You just have to know the size. That's a fair point. However, given the wider range of issues that you had in trying to get memmap to work with your device, perhaps we should just state that the memmap class is intended for the common use case of regular files. The detection of the existing file size is quite useful in that case. For special files where seek() et al.
might have different semantics, one should either subclass memmap to get the behavior you need, or simply use frombuffer() or ndarray() on the mmap object directly. The two reasons for the memmap class are the auto-length-detection and tracking the lifetime of the open file. The former you don't need (and can't do), and the latter isn't that hard to do yourself. Does that sound reasonable? -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From bryan at cole.uklinux.net Sun Jan 11 04:41:31 2009 From: bryan at cole.uklinux.net (Bryan Cole) Date: Sun, 11 Jan 2009 09:41:31 +0000 Subject: [Numpy-discussion] TypeError in dtype.__eq__() Message-ID: <1231666889.11536.42.camel@pc2.cole.uklinux.net> Dtype objects throw an exception if compared for equality against other objects. e.g. >>> import numpy >>> numpy.dtype('uint32')==1 Traceback (most recent call last): File "", line 1, in TypeError: data type not understood >>> After some googling, I think python wisdom (given in the Python docs) says that equality testing should never throw an exception, but return the NotImplemented singleton where the comparison does not make "sense". Throwing an exception breaks membership testing for lists/tuples (using the 'in' operator implicitly calls __eq__()). My real-world example is serialisation of numpy arrays using PyYAML. This module should serialise any picklable object including ndarrays. However, it performs a test on each object: if data in [None, ()]: blah... This fails on the dtype object of the array. What's the consensus on this? Is the current dtype behaviour broken? cheers, BC From robert.kern at gmail.com Sun Jan 11 04:49:11 2009 From: robert.kern at gmail.com (Robert Kern) Date: Sun, 11 Jan 2009 03:49:11 -0600 Subject: [Numpy-discussion] TypeError in dtype.__eq__() In-Reply-To: <1231666889.11536.42.camel@pc2.cole.uklinux.net> References: <1231666889.11536.42.camel@pc2.cole.uklinux.net> Message-ID: <3d375d730901110149l4d8cf182raa1f9af346849f50@mail.gmail.com> On Sun, Jan 11, 2009 at 03:41, Bryan Cole wrote: > Dtype objects throw an exception if compared for equality against other > objects. e.g. > >>>> import numpy >>>> numpy.dtype('uint32')==1 > Traceback (most recent call last): > File "", line 1, in > TypeError: data type not understood >>>> > > After some googling, I think python wisdom (given in the Python docs) > says that equality testing should never throw an exception, but return > the NotImplemented singleton where the comparison does not make "sense". > Throwing an exception breaks membership testing for lists/tuples (using > the 'in' operator implicitly calls __eq__()). > > My real-world example is serialisation of numpy arrays using PyYAML. > This module should serialise any picklable object including ndarrays. > However, it performs a test on each object: > > if data in [None, ()]: blah... > > This fails on the dtype object of the array. > > What's the consensus on this? Is the current dtype behaviour broken? It's suboptimal, certainly. Feel free to fix it. However, also note that with ndarray's rich comparisons, such membership testing will fail with ndarrays, too. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth."
-- Umberto Eco From bryan at cole.uklinux.net Sun Jan 11 11:19:51 2009 From: bryan at cole.uklinux.net (Bryan Cole) Date: Sun, 11 Jan 2009 16:19:51 +0000 Subject: [Numpy-discussion] TypeError in dtype.__eq__() In-Reply-To: <3d375d730901110149l4d8cf182raa1f9af346849f50@mail.gmail.com> References: <1231666889.11536.42.camel@pc2.cole.uklinux.net> <3d375d730901110149l4d8cf182raa1f9af346849f50@mail.gmail.com> Message-ID: <1231690790.11536.64.camel@pc2.cole.uklinux.net> > > > > What's the consensus on this? Is the current dtype behaviour broken? > > It's suboptimal, certainly. Feel free to fix it. Thank you. > However, also note > that with ndarray's rich comparisons, such membership testing will > fail with ndarrays, too. This poses a similarly big problem. I can't understand this behaviour either: >>> a = numpy.array([1.2,3.4]) >>> a == 6.5 array([False, False], dtype=bool) >>> a == numpy.array([1,2]) array([False, False], dtype=bool) >>> a == "foo" False >>> a in [1,2,3] Traceback (most recent call last): File "", line 1, in ValueError: The truth value of an array with more than one element is ambiguous. Use a.any() or a.all() Surely this is a bug?! This membership testing operation is quite reasonable. If list.__contains__ is calling ndarray.__eq__(), this ought to succeed. Why does this fail? (and I fail to see how any() or all() can resolve the situation). I need to go check what list.__contains__() is doing... BC From aisaac at american.edu Sun Jan 11 15:55:28 2009 From: aisaac at american.edu (Alan G Isaac) Date: Sun, 11 Jan 2009 15:55:28 -0500 Subject: [Numpy-discussion] coding style: citations Message-ID: <496A5CC0.8050605@american.edu> The docstring standard at http://projects.scipy.org/scipy/numpy/wiki/CodingStyleGuidelines#docstring-standard suggests a citation reference format that is not compatible with reStructuredText. Quoting from http://docutils.sourceforge.net/docs/ref/rst/restructuredtext.html#citations: Citations are identical to footnotes except that they use only non-numeric labels such as [note] or [GVR2001]. In other words, the standard currently promotes a conflation of notes and citations. Another reason for avoiding the style currently in the docstring standard is that it renders impossible the generation of citations using a citation database. (See, e.g., the BibTeX-like facilities offered by bibstuff's bib4txt: http://code.google.com/p/bibstuff/.) Alan Isaac From stefan at sun.ac.za Sun Jan 11 16:13:01 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Sun, 11 Jan 2009 23:13:01 +0200 Subject: [Numpy-discussion] coding style: citations In-Reply-To: <496A5CC0.8050605@american.edu> References: <496A5CC0.8050605@american.edu> Message-ID: <9457e7c80901111313o4866abc3w6155bf53c89ddd5d@mail.gmail.com> Hi Alan 2009/1/11 Alan G Isaac : > The docstring standard at > http://projects.scipy.org/scipy/numpy/wiki/CodingStyleGuidelines#docstring-standard > suggests a citation reference format that is not compatible > with reStructuredText. Quoting from > http://docutils.sourceforge.net/docs/ref/rst/restructuredtext.html#citations: > > Citations are identical to footnotes except that they use > only non-numeric labels such as [note] or [GVR2001]. > > In other words, the standard currently promotes a conflation of > notes and citations. Thank you for your feedback. Yes, this is a problem. In a way, RestructuredText is partially to blame for not providing numerical citation markup.
If we can come up with a good way of generating the reference keys, we can still change the current format. Maybe the library you mention below can automatically generate such keys? I'd like to take that responsibility away from documentation writers, since it is not always obvious how to generate consistent keys. Regards Stéfan > > Another reason for avoiding the style currently in the docstring > standard is that it renders impossible the generation of citations > using a citation database. (See, e.g., the BibTeX-like facilities offered > by bibstuff's bib4txt: http://code.google.com/p/bibstuff/.) > > Alan Isaac From bryan at cole.uklinux.net Sun Jan 11 16:13:24 2009 From: bryan at cole.uklinux.net (Bryan Cole) Date: Sun, 11 Jan 2009 21:13:24 +0000 Subject: [Numpy-discussion] TypeError in dtype.__eq__() In-Reply-To: <1231690790.11536.64.camel@pc2.cole.uklinux.net> References: <1231666889.11536.42.camel@pc2.cole.uklinux.net> <3d375d730901110149l4d8cf182raa1f9af346849f50@mail.gmail.com> <1231690790.11536.64.camel@pc2.cole.uklinux.net> Message-ID: <1231708403.11536.99.camel@pc2.cole.uklinux.net> > > However, also note > > that with ndarray's rich comparisons, such membership testing will > > fail with ndarrays, too. > > This poses a similarly big problem. I can't understand this behaviour > either: OK, I can now. After equality testing each item, the result must be cast to bool. This is where the truth-testing comes in. Testing array membership in lists does work, just so long as there are no numerical objects in the list. Only equality comparisons which return another (bool) ndarray will cause the exception. Thus, the error is not triggered in PyYAML. FWIW, I don't like the policy of forbidding truth testing on bare arrays. I'd prefer that ndarray.__nonzero__() implicitly call ndarray.all(). For my use-cases, this is what I want >90% of the time. Where any() is required, this must be made explicit. However, I understand the logic behind raising an exception and presumably others prefer this. Ultimately, I'm forced to conclude that the idea that membership testing should always succeed was bogus. Bummer. BC From aisaac at american.edu Sun Jan 11 17:05:10 2009 From: aisaac at american.edu (Alan G Isaac) Date: Sun, 11 Jan 2009 17:05:10 -0500 Subject: [Numpy-discussion] coding style: citations In-Reply-To: <9457e7c80901111313o4866abc3w6155bf53c89ddd5d@mail.gmail.com> References: <496A5CC0.8050605@american.edu> <9457e7c80901111313o4866abc3w6155bf53c89ddd5d@mail.gmail.com> Message-ID: <496A6D16.5050501@american.edu> On 1/11/2009 4:13 PM Stéfan van der Walt apparently wrote: > Thank you for your feedback. Yes, this is a problem. In a way, > RestructuredText is partially to blame for not providing numerical > citation markup. I do not agree. I cannot think of any bibliography tool that uses numerical citation reference keys. Naturally there are many that *substitute* numerical citation references for the cite key used by the author. This is a *style* question that should be automated (i.e., not be a responsibility of the author). E.g., LaTeX provides a variety of styles for cite key substitution. > If we can come up with a good way of generating the > reference keys, we can still change the current format. This would really involve the following.
Create a searchable database of citations and an interface for adding to it. Unique keys would be generated by your algorithm of choice when an entry is added. Authors would be asked to use only references in the database. Desirable for a book. Desirable for documentation? > Maybe the library you mention below can automatically generate such > keys? I'd like to take that responsibility away from documentation > writers, since it is not always obvious how to generate consistent > keys. Numerical keys will clearly *not* be consistent. The same key will refer to different citations on different pages, and key width will not be uniform. In addition, numerical keys are not informative when encountered by the reader. I would prefer [last1.last2-2009-sja]_ where sja is "standard journal abbreviation" and last names are ASCII (e.g., ö -> o). All lower case. This is very informative and easy for all users. It also means the key is valid for both HTML and XML uses (e.g., as a name or id). But to answer your question, bibstuff includes biblabel.py, which can produce keys for a bibtex database (styled as you like). The problem of setting up the database remains. Alan Isaac From robert.kern at gmail.com Sun Jan 11 19:48:11 2009 From: robert.kern at gmail.com (Robert Kern) Date: Sun, 11 Jan 2009 18:48:11 -0600 Subject: [Numpy-discussion] TypeError in dtype.__eq__() In-Reply-To: <1231708403.11536.99.camel@pc2.cole.uklinux.net> References: <1231666889.11536.42.camel@pc2.cole.uklinux.net> <3d375d730901110149l4d8cf182raa1f9af346849f50@mail.gmail.com> <1231690790.11536.64.camel@pc2.cole.uklinux.net> <1231708403.11536.99.camel@pc2.cole.uklinux.net> Message-ID: <3d375d730901111648o7f86a040g51381fe7ea77525f@mail.gmail.com> On Sun, Jan 11, 2009 at 15:13, Bryan Cole wrote: > >> > However, also note >> > that with ndarray's rich comparisons, such membership testing will >> > fail with ndarrays, too. >> >> This poses a similarly big problem. I can't understand this behaviour >> either: > > OK, I can now. After equality testing each item, the result must be cast > to bool. This is where the truth-testing comes in. > > Testing array membership in lists does work, just so long as there are > no numerical objects in the list. Only equality comparisons which > return another (bool) ndarray will cause the exception. Thus, the error > is not triggered in PyYAML. > > FWIW, I don't like the policy of forbidding truth testing on bare > arrays. I'd prefer that ndarray.__nonzero__() implicitly call > ndarray.all(). For my use-cases, this is what I want >90% of the time. > Where any() is required, this must be made explicit. However, I > understand the logic behind raising an exception and presumably others > prefer this. That almost works for == but doesn't for !=. We used to use .any() in Numeric (if any element is nonzero, then the array is nonzero). Many people were confused. There was a lot of broken code out there. Moving to numpy, we decided to prevent that kind of silently broken code. In any case, it still wouldn't work. Because the arguments are broadcast against each other, you can have two differently shaped arrays compare equal. E.g. (ones([5,3]) == 1.0).all() == True. > Ultimately, I'm forced to conclude that the idea that membership testing > should always succeed was bogus. Bummer. Yup. Sorry. It's an unfortunate effect of allowing rich comparisons.
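To make the failure mode concrete, here is a short self-contained session (a sketch; the Python 2 syntax matches the era, and any numpy of this vintage should behave the same way):

import numpy as np

a = np.array([1.2, 3.4])

# Elementwise comparison returns a boolean array, not a single bool:
print a == 6.5                         # [False False]

# list.__contains__ calls __eq__ and then bool() on the result,
# which is where the ValueError comes from:
try:
    a in [1, 2, 3]
except ValueError, e:
    print e                            # truth value ... is ambiguous

# Broadcasting is why bool(x == y) cannot simply mean "all equal":
# differently shaped operands can still compare elementwise equal.
print (np.ones((5, 3)) == 1.0).all()   # True

The last line is the point about broadcasting above: the two operands do not even have the same shape, yet every elementwise comparison is True.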
-- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From ndbecker2 at gmail.com Sun Jan 11 21:23:01 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Sun, 11 Jan 2009 21:23:01 -0500 Subject: [Numpy-discussion] memmap from fd? References: <3d375d730901090410h3b5f57caqe8a6b639809047a4@mail.gmail.com> <3d375d730901091310v27033d5co1fbdc71098b9e672@mail.gmail.com> <3d375d730901101741p22facd24qe6b391a0b023bd6f@mail.gmail.com> Message-ID: Robert Kern wrote: .... > > That's a fair point. However, given the wider range of issues that you > had in trying to get memmap to work with your device, perhaps we > should just state that the memmap class is intended for the common use > case of regular files. The detection of the existing file size is > quite useful in that case. For special files where seek() et al. might > have different semantics, one should either subclass memmap to get the > behavior you need, or simply use frombuffer() or ndarray() on the mmap > object directly. The two reasons for the memmap class are the > auto-length-detection and tracking the lifetime of the open file. The > former you don't need (and can't do), and the latter isn't that hard > to do yourself. > > Does that sound reasonable? > Hmm. frombuffer sounds nice, but python mmap doesn't expose buffer interface (I just added a feature request for this). I suppose I could write my own mmap module. I couldn't find the reference to ndarray(). Any pointer? From robert.kern at gmail.com Sun Jan 11 21:45:39 2009 From: robert.kern at gmail.com (Robert Kern) Date: Sun, 11 Jan 2009 20:45:39 -0600 Subject: [Numpy-discussion] memmap from fd? In-Reply-To: References: <3d375d730901090410h3b5f57caqe8a6b639809047a4@mail.gmail.com> <3d375d730901091310v27033d5co1fbdc71098b9e672@mail.gmail.com> <3d375d730901101741p22facd24qe6b391a0b023bd6f@mail.gmail.com> Message-ID: <3d375d730901111845vf02f364g87f78dbeeea292bf@mail.gmail.com> On Sun, Jan 11, 2009 at 20:23, Neal Becker wrote: > Robert Kern wrote: > > .... >> >> That's a fair point. However, given the wider range of issues that you >> had in trying to get memmap to work with your device, perhaps we >> should just state that the memmap class is intended for the common use >> case of regular files. The detection of the existing file size is >> quite useful in that case. For special files where seek() et al. might >> have different semantics, one should either subclass memmap to get the >> behavior you need, or simply use frombuffer() or ndarray() on the mmap >> object directly. The two reasons for the memmap class are the >> auto-length-detection and tracking the lifetime of the open file. The >> former you don't need (and can't do), and the latter isn't that hard >> to do yourself. >> >> Does that sound reasonable? > > Hmm. frombuffer sounds nice, but python mmap doesn't expose buffer interface (I just added a feature request for this). I suppose I could write my own mmap module. Yeah, it does. Why do you think it doesn't? If it didn't work, the memmap class wouldn't work. > I couldn't find the reference to ndarray(). Any pointer? http://docs.scipy.org/doc/numpy/reference/generated/numpy.ndarray.html The buffer= argument would be the mmap object. Basically, most of what memmap.__new__() does is to set up the arguments for ndarray.__new__(). 
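For concreteness, a sketch of that approach (the device path, region size, and dtype below are made-up placeholders; for a device like this you have to know them up front, as discussed above):

import mmap
import os

import numpy as np

NBYTES = 4096                               # assumed size of the mappable region
fd = os.open('/dev/mydevice', os.O_RDWR)    # hypothetical device node
m = mmap.mmap(fd, NBYTES)

# Construct the array directly over the mmap buffer ...
a = np.ndarray(shape=(NBYTES,), dtype=np.uint8, buffer=m)

# ... or equivalently through frombuffer:
b = np.frombuffer(m, dtype=np.uint8)

a[0] = 255    # writes go straight through to the mapped memory

Either way, you are then responsible for keeping m (and the underlying fd) alive for as long as the array is in use; that is the lifetime tracking the memmap class would otherwise do for you.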
-- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From stefan at sun.ac.za Mon Jan 12 02:35:15 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Mon, 12 Jan 2009 09:35:15 +0200 Subject: [Numpy-discussion] coding style: citations In-Reply-To: <496A6D16.5050501@american.edu> References: <496A5CC0.8050605@american.edu> <9457e7c80901111313o4866abc3w6155bf53c89ddd5d@mail.gmail.com> <496A6D16.5050501@american.edu> Message-ID: <9457e7c80901112335h1439dd35k3b971585837db22c@mail.gmail.com> 2009/1/12 Alan G Isaac : > This would really involve the following. > Create a searchable database of citations > and an interface for adding to it. > Unique keys would be generated by your > algorithm of choice when an entry is added. > Authors would be asked to use only references > in the database. > > Desirable for a book. Desirable for documentation? In documentation, you want the reference to appear in the docstring itself. Our docstrings double as the content of a book, which is why it may be easier to extract the bibliography from the docstrings, rather than populating the docstrings from a central bibliography. > Numerical keys will clearly *not* be consistent. > The same key will refer to different citations > on different pages, and key width will not be > uniform. We automatically renumber the citations to take care of this. > In addition, numerical keys are > not informative when encountered by the reader. > I would prefer [last1.last2-2009-sja]_ where > sja is "standard journal abbreviation" and > last names are ASCII (e.g., ö -> o). I agree. > But to answer your question, bibstuff includes > biblabel.py, which can produce keys for a bibtex > database (styled as you like). The problem of > setting up the database remains. We can add an interface to the documentation editor, where a person pastes the BibTeX reference, and it returns the appropriate key to use in the docs. Maybe you can think of a more intuitive interface, even. As long as we have a consistent way of generating keys, I'd gladly use them. Regards Stéfan From ndbecker2 at gmail.com Mon Jan 12 08:10:55 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Mon, 12 Jan 2009 08:10:55 -0500 Subject: [Numpy-discussion] memmap from fd? References: <3d375d730901090410h3b5f57caqe8a6b639809047a4@mail.gmail.com> <3d375d730901091310v27033d5co1fbdc71098b9e672@mail.gmail.com> <3d375d730901101741p22facd24qe6b391a0b023bd6f@mail.gmail.com> <3d375d730901111845vf02f364g87f78dbeeea292bf@mail.gmail.com> Message-ID: Robert Kern wrote: ... >> >> Hmm. frombuffer sounds nice, but python mmap doesn't expose the buffer >> interface (I just added a feature request for this). I suppose I could >> write my own mmap module. > > Yeah, it does. Why do you think it doesn't? If it didn't work, the > memmap class wouldn't work. > My mistake. From jh at physics.ucf.edu Mon Jan 12 09:08:54 2009 From: jh at physics.ucf.edu (jh at physics.ucf.edu) Date: Mon, 12 Jan 2009 09:08:54 -0500 Subject: [Numpy-discussion] coding style: citations In-Reply-To: (numpy-discussion-request@scipy.org) References: Message-ID: > ... citations ... BibTeX Please ensure that, whatever you come up with, it is *very intuitive* for the writer to enter the citation data, or we will quickly lose citations, and possibly writers.
For example, I would not have them type in BibTeX entries, but rather provide boxes where they can enter the title, author, etc. and have the system generate the BibTeX. Not everyone uses BibTeX, and the rules about braces, what fields are required, etc., are not totally trivial. Perhaps there's a web site out there that does this already, if BibTeX is to be our internal format (which I'm not at all opposed to; I use it myself). If a large majority of our current writers thinks BibTeX is ok, then I'll change my mind. We should poll them on the acceptability of any proposed system before implementing it. For citation keys, what's wrong with good old author-year format? Most scientific journals use it (Abt 1985). --jh-- Abt, H. 1985. Harold Abt used to publish surveys of things like citations when he was ApJ editor in the 1980s but I'm making this one up just to demonstrate the format except he'd never allow an article title in a citation because he wanted to save ink which everyone hated so other journals keep the titles and we should too. ApJ 123:1-23. From Eric.Lebigot at normalesup.org Mon Jan 12 09:21:21 2009 From: Eric.Lebigot at normalesup.org (Eric LEBIGOT) Date: Mon, 12 Jan 2009 15:21:21 +0100 (CET) Subject: [Numpy-discussion] Fast function application on list of 2D points? Message-ID: Hello, What is the fastest way of applying a function on a list of 2D points? More specifically, I have a list of 2D points, and some do not meet some criteria and must be rejected. Even more specifically, the filter only lets through points whose x coordinate satisfies some condition, _and_ whose y coordinate satisfies another condition (maybe there is room for optimization here?). Currently, I use points = numpy.apply_along_axis(filter, axis = 1, arr = points) but this creates a bottleneck in my program (array arr may contain 1 million points, for instance). Is there anything that could be faster? Any suggestion would be much appreciated! EOL From pjssilva at ime.usp.br Mon Jan 12 09:33:08 2009 From: pjssilva at ime.usp.br (Paulo J. S. Silva) Date: Mon, 12 Jan 2009 11:33:08 -0300 Subject: [Numpy-discussion] Fast function application on list of 2D points? In-Reply-To: References: Message-ID: <1231770788.6170.3.camel@trinity> Why don't you create a mask to select only the points in the array that satisfy the conditions on the x and y coordinates? For example, the code below applies filter only to the values that have x coordinate bigger than 0.7 and y coordinate smaller than 0.3: mask = numpy.logical_and(points[:,0] > 0.7, points[:,1] < 0.3) points = numpy.apply_along_axis(filter, axis = 1, arr = points[mask,:]) best, Paulo Em Seg, 2009-01-12 às 15:21 +0100, Eric LEBIGOT escreveu: > Hello, > > What is the fastest way of applying a function on a list of 2D points? More > specifically, I have a list of 2D points, and some do not meet some criteria > and must be rejected. Even more specifically, the filter only lets through > points whose x coordinate satisfies some condition, _and_ whose y coordinate > satisfies another condition (maybe there is room for optimization here?).
> > EOL > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From pjssilva at ime.usp.br Mon Jan 12 09:50:05 2009 From: pjssilva at ime.usp.br (Paulo J. S. Silva) Date: Mon, 12 Jan 2009 11:50:05 -0300 Subject: [Numpy-discussion] Fast function application on list of 2D points? In-Reply-To: <1231770788.6170.3.camel@trinity> References: <1231770788.6170.3.camel@trinity> Message-ID: <1231771805.6170.11.camel@trinity> Eric, You question raised my attention due to a recent post of mine related to the same kind of problem. I was solving it without using apply_along_axis (due to ignorance). However I tried to use apply_along_axis to solve my problem and it proved to be very slow. Try the following: ----------- import numpy as np import time def filter(x): return x.sum() a = np.random.random((2, 1000000)) # Apply filter to all points, version 1 t = time.clock() sums1 = np.apply_along_axis(filter, axis=0, arr=a) print 'Elapsed time', time.clock() - t # Apply filter to all points, version 2 t = time.clock() sums2 = np.array([filter(p) for p in a.T]) print 'Elapsed time', time.clock() - t print sums1 == sums2 ------------ In my computer the first version takes more than 6.5 longer than the second. However the version 2 is using list comprehensions instead of a numpy function. I would expected it to be slower. It looks like apply_along_axis is creating many temporary arrays. Eric, it looks like you should try something along the second version above and see if it is faster in your case too. Paulo From aisaac at american.edu Mon Jan 12 12:41:17 2009 From: aisaac at american.edu (Alan G Isaac) Date: Mon, 12 Jan 2009 12:41:17 -0500 Subject: [Numpy-discussion] coding style: citations In-Reply-To: References: Message-ID: <496B80BD.8050102@american.edu> On 1/12/2009 9:08 AM jh at physics.ucf.edu apparently wrote: > For citation keys, what's wrong with good old author-year format? > Most scientific journals use it (Abt 1985). > Abt, H. 1985. Harold Abt used to publish surveys of things like > citations when he was ApJ editor in the 1980s but I'm making this > one up just to demonstrate the format except he'd never allow an > article title in a citation because he wanted to save ink which > everyone hated so other journals keep the titles and we should too. > ApJ 123:1-23. There are two problems. 1. The only real problem is that it is not a reST citation, so links to the citation will not be generated, and you cannot control the citation formatting separately. You can still of course use normal internal targets: See `(Abt 1985)`_. Some more text, and later ... .. _(Abt 1985): Abt, H. 1985. Harold Abt used to publish surveys of things like citations when he was ApJ editor in the 1980s but I'm making this one up just to demonstrate the format except he'd never allow an article title in a citation because he wanted to save ink which everyone hated so other journals keep the titles and we should too. ApJ 123:1-23. 2. The failure to use citation keys implies that there will be neither a citation database nor consistent formatting. (Say, if the docs become a book.) I do not suppose this is *currently* a real problem. 
Cheers, Alan Isaac From jh at physics.ucf.edu Mon Jan 12 14:31:45 2009 From: jh at physics.ucf.edu (jh at physics.ucf.edu) Date: Mon, 12 Jan 2009 14:31:45 -0500 Subject: [Numpy-discussion] coding style: citations In-Reply-To: (numpy-discussion-request@scipy.org) References: Message-ID: Well, best to have the full functionality. These things are intended to be both book components and stand-alone pages. Citation formats are a religious war anyway. --jh-- From tom.denniston at alum.dartmouth.org Mon Jan 12 14:41:24 2009 From: tom.denniston at alum.dartmouth.org (Tom Denniston) Date: Mon, 12 Jan 2009 13:41:24 -0600 Subject: [Numpy-discussion] Is there anyway to seed random numbers without global state? Message-ID: I know how to seed and generate random numbers using: numpy.random.seed and numpy.random.rand The problem is that the seeding of the random numbers is global, which I would think would make it non-thread-safe, as well as having all the other annoyances of global state, like having to set the seed and set it back when done. I would think the ideal would be to be able to build random number objects where each has a seed, so that each object's seed is independent. Does such a thing exist in numpy or scipy? Does this even make sense to people? Is there something about random number generation that makes it inherently a global thing? --Tom From robert.kern at gmail.com Mon Jan 12 14:50:20 2009 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 12 Jan 2009 13:50:20 -0600 Subject: [Numpy-discussion] Is there anyway to seed random numbers without global state?
In-Reply-To: References: Message-ID: <3d375d730901121150o2e3d5448tf9d22d5cc106ff27@mail.gmail.com> On Mon, Jan 12, 2009 at 13:41, Tom Denniston wrote: > I know how to seed and generate random numbers using: > numpy.random.seed and numpy.random.rand > > The problem is the seeding of the random numbers is global which I > would think would make it non-thread safe as well as having all the > other annoyances of global state like having so set the seed and set > it back when done. > > I would think the ideal would be to be able to build random number > objects where each has a seed so that each object's seed is > independent. Does such a thing exist in numpy or scipy. Does this > even make sense to people? Is there something about random number > generation that makes it inherently a global thing? from numpy.random import RandomState prng = RandomState(myseed) prng.standard_normal() prng.uniform(low, high) ... All of the "functions" in numpy.random are just aliases to the methods on a global RandomState object provided for convenience. Whenever you need to control the seed, you should explicitly instantiate a RandomState object and pass it around. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From gael.varoquaux at normalesup.org Mon Jan 12 14:52:01 2009 From: gael.varoquaux at normalesup.org (Gael Varoquaux) Date: Mon, 12 Jan 2009 20:52:01 +0100 Subject: [Numpy-discussion] Is there anyway to seed random numbers without global state? In-Reply-To: References: Message-ID: <20090112195201.GA10627@phare.normalesup.org> On Mon, Jan 12, 2009 at 01:41:24PM -0600, Tom Denniston wrote: > I would think the ideal would be to be able to build random number > objects where each has a seed so that each object's seed is > independent. You mean something like: In [1]: import numpy as np In [2]: rs = np.random.RandomState(seed=3) In [3]: rs.rand() Out[3]: 0.12671989922202853 :) Ga?l From pav at iki.fi Mon Jan 12 16:51:41 2009 From: pav at iki.fi (Pauli Virtanen) Date: Mon, 12 Jan 2009 21:51:41 +0000 (UTC) Subject: [Numpy-discussion] formatting issues, locale and co References: <49570E2B.2000608@ar.media.kyoto-u.ac.jp> <3d375d730812272246l78ac1e45u207370fd9d0ac765@mail.gmail.com> <49571F6F.4070102@ar.media.kyoto-u.ac.jp> <5b8d13220812282038n626c44aep89756d7f0fbe7930@mail.gmail.com> <5b8d13220812302011i5cfb76acma1527bdcbf7a83ee@mail.gmail.com> Message-ID: Wed, 31 Dec 2008 13:11:02 +0900, David Cournapeau wrote: [clip] > Thank you for working on this, Pauli. The problem on windows may not be > specific to windows: the difference really is whether the formatting is > done by python or the C runtime. It just happens that on Linux and Mac > OS X, the strings are the same - but it could be different on other OS. > I have not looked into C99, whether this is standardized or not (the > size of exponent is, but I don't know about nan and inf). C99 appears to specify for *printf (case-insensitive) nan, +nan, -nan, nan(WHATEVER), -nan(WHATEVER) inf, infinity, +inf, +infinity, -inf, -infinity The fromfile/fromstring code now in `fix_float_format` branch recognizes all of these, independently of the platform. There are also some roundtrip tests that check that this plays along as expected with your new float formatting code. > We should also change pretty print of arrays, I think - although it is a > change and may break things. 
Since that's how python represents the > numbers, I guess we will have to change at some point. It'd be nice to make also this behave as the rest of the float formatting. However, the current formatting of "Inf", "-Inf", "NaN" is OK as far as C99 is concerned. -- Pauli Virtanen From Eric.Lebigot at normalesup.org Tue Jan 13 04:23:20 2009 From: Eric.Lebigot at normalesup.org (Eric LEBIGOT) Date: Tue, 13 Jan 2009 10:23:20 +0100 (CET) Subject: [Numpy-discussion] Fast function application on list of 2D points? In-Reply-To: References: Message-ID: Thank you so much for the suggestion, Paulo! Selecting 2D points in a list by creating an array 'mask' of booleans and then using arr[mask, :] is indeed really fast compared to using numpy.apply_along_axis(), in my case (simple "larger than" tests on individual coordinates). I had not realized that you could do "arr[mask, :]": this works great! EOL PS: here are the speed tests I've done on the selection of 2D points from a list, with the following results: filter0: 107.2 s filter1: 0.3 s filter2: 9.7 s filter3: 0.6 s obtained with: #!/usr/bin/env python import numpy def filter0(points): """ Returns only those points that match the filter. """ def filter(p): return (p[0] > 0.5) and (p[1] < 0.5) return points[numpy.apply_along_axis(filter, axis = 1, arr = points)] def filter1(points): """ Returns only those points that match the filter. """ mask = (points[:, 0] > 0.5) & (points[:, 1] < 0.5) return points[mask, :] def filter2(points): """ Returns only those points that match the filter. """ return numpy.array([p for p in points if ((p[0] > 0.5) and p[1] < 0.5)]) def filter3(points): """ Returns only those points that match the filter. """ mask = (points[:, 0] > 0.5) points = points[mask, :] mask = points[:, 1] < 0.5 return points[mask, :] if __name__ == '__main__': import timeit # We generate many random points: NUM_PTS = 1000000 points = numpy.random.random((NUM_PTS, 2)) # We make sure that all the filters give the same result: #print "Initial points:" #print points #print "Filtered points:" #print filter0(points) #print filter1(points) #print filter2(points) #print filter3(points) for filter_num in range(4): func_name = "filter%d" % filter_num t = timeit.Timer("%s(points)" % func_name, "from __main__ import %s, points" % func_name) print "%s: %.1f s" % (func_name, t.timeit(number = 3)) > Date: Mon, 12 Jan 2009 11:33:08 -0300 > From: "Paulo J. S. Silva" > Subject: Re: [Numpy-discussion] Fast function application on list of > 2D points? > To: Discussion of Numerical Python > Message-ID: <1231770788.6170.3.camel at trinity> > Content-Type: text/plain; charset="UTF-8" > > Why you don't create a mask to select only the points in array that > satisfies the condition on x and y coordinates. For example the code > below applies filter only to the values that have x coordinate bigger > than 0.7 and y coordinate smaller than 0.3: > > mask = numpy.logical_and(points[:,0] > 0.7, points[:,1] < 0.3) > points = numpy.apply_along_axis(filter, axis = 1, arr = points[mask,:]) > > best, > > Paulo > > Em Seg, 2009-01-12 ?s 15:21 +0100, Eric LEBIGOT escreveu: >> Hello, >> >> What is the fastest way of applying a function on a list of 2D points? More >> specifically, I have a list of 2D points, and some do not meet some criteria >> and must be rejected. Even more specifically, the filter only lets through >> points whose x coordinate satisfies some condition, _and_ whose y coordinates >> satisfies another condition (maybe is there room for optimization, here?). 
>> >> Currently, I use >> >> points = numpy.apply_along_axis(filter, axis = 1, arr = points) >> >> but this creates a bottleneck in my program (array arr may contains 1 million >> points, for instance). >> >> Is there anything that could be faster? >> >> Any suggestion would be much appreciated! >> >> EOL From scott.sinclair.za at gmail.com Tue Jan 13 06:48:08 2009 From: scott.sinclair.za at gmail.com (Scott Sinclair) Date: Tue, 13 Jan 2009 13:48:08 +0200 Subject: [Numpy-discussion] ndarray.resize method and reference counting Message-ID: <6a17e9ee0901130348o4781525ck4fb88072b05d352f@mail.gmail.com> Hi, I'm confused by the following: >>> import numpy as np >>> np.__version__ '1.3.0.dev6116' # I expect this >>> x = np.eye(3) >>> x.resize((5,5)) >>> x = np.eye(3) >>> y = x >>> x.resize((5,5)) Traceback (most recent call last): File "", line 1, in ValueError: cannot resize an array that has been referenced or is referencing another array in this way. Use the resize function # I don't expect this >>> x = np.eye(3) >>> x array([[ 1., 0., 0.], [ 0., 1., 0.], [ 0., 0., 1.]]) >>> x.resize((5,5), refcheck=True) Traceback (most recent call last): File "", line 1, in ValueError: cannot resize an array that has been referenced or is referencing another array in this way. Use the resize function >>> x.resize((5,5), refcheck=False) >>> x array([[ 1., 0., 0., 0., 1.], [ 0., 0., 0., 1., 0.], [ 0., 0., 0., 0., 0.], [ 0., 0., 0., 0., 0.], [ 0., 0., 0., 0., 0.]]) Is there a reference counting bug, or am I misunderstanding something about how Python works when I type a variable's name at the prompt? Cheers, Scott From stefan at sun.ac.za Tue Jan 13 08:24:27 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Tue, 13 Jan 2009 15:24:27 +0200 Subject: [Numpy-discussion] ndarray.resize method and reference counting In-Reply-To: <6a17e9ee0901130348o4781525ck4fb88072b05d352f@mail.gmail.com> References: <6a17e9ee0901130348o4781525ck4fb88072b05d352f@mail.gmail.com> Message-ID: <9457e7c80901130524k17e6015dh83e40319957320cb@mail.gmail.com> Hi Scott I can't reproduce the problem below. Would you please send a self-contained snippet? Note that, in Python, "_" is a special variable that always points to the last result. In IPython there are several others. Cheers St?fan 2009/1/13 Scott Sinclair : > # I don't expect this >>>> x = np.eye(3) >>>> x > array([[ 1., 0., 0.], > [ 0., 1., 0.], > [ 0., 0., 1.]]) >>>> x.resize((5,5), refcheck=True) > Traceback (most recent call last): > File "", line 1, in > ValueError: cannot resize an array that has been referenced or is referencing > another array in this way. Use the resize function >>>> x.resize((5,5), refcheck=False) From igorsyl at gmail.com Tue Jan 13 08:37:12 2009 From: igorsyl at gmail.com (Igor Sylvester) Date: Tue, 13 Jan 2009 07:37:12 -0600 Subject: [Numpy-discussion] PyExc_IOError Message-ID: Has anyone had problems with dynamically loading PyExc_IOError when embedding a cython module? 
While calling init below: in main.c: void init() { Py_Initialize(); initmymodule(); do_something(); Py_Finalize(); } in module.c: cdef public void do_something(): import exceptions print exceptions.IOError # this prints <type 'exceptions.IOError'>, so I don't know what to make of the error below import numpy # exception is thrown here, and numpy library is in pythonpath Exception: Exception exceptions.ImportError: '/usr/lib/python2.5/lib-dynload/time.so: undefined symbol: PyExc_IOError' in 'mymodule.myfunction' ignored I'm not yet sure which library in numpy causes the exception but this example is easy to reproduce. I built it against Python 2.5.1. compile: gcc -c -I/usr/lib/python2.5/site-packages/numpy-1.0.4.0002-py2.5-linux-x86_64.egg/numpy/core/include -I/usr/bin/python/2.5.1/include/python2.5 -fPIC -fno-omit-frame-pointer -pthread -fexceptions -ansi -g main.c compile line for the other c-file is identical to main.c's link: gcc -g -pthread -shared main.o module.o -L/usr/bin/python/2.5.1/lib -lpython2.5 -ldl -lm -lpthread -lutil -lstdc++ I can import the module from python without changing any environment variables. Thanks. From nouiz at nouiz.org Tue Jan 13 09:27:37 2009 From: nouiz at nouiz.org (=?ISO-8859-1?Q?Fr=E9d=E9ric_Bastien?=) Date: Tue, 13 Jan 2009 09:27:37 -0500 Subject: [Numpy-discussion] inplace matrix multiplication In-Reply-To: <3d375d730901091331u26a0d6dcw8dffca7349fec96@mail.gmail.com> References: <2d1d7fe70901090625k53fe9415uf177e2d80645f12c@mail.gmail.com> <3d375d730901091331u26a0d6dcw8dffca7349fec96@mail.gmail.com> Message-ID: <2d1d7fe70901130627j477c3c26pf55c14783000af80@mail.gmail.com> Thanks, and sorry for the delayed reply. Fred On Fri, Jan 9, 2009 at 4:31 PM, Robert Kern wrote: > On Fri, Jan 9, 2009 at 08:25, Frédéric Bastien wrote: > > Hi, > > > > I would like to know how I can make a call to the blas function gemm in > > numpy. I need a multiply and accumulate for matrices and I don't want to > > allocate a new matrix each time I do it. > > You can't in numpy. With scipy.linalg.fblas.dgemm() and the right > arguments, you can. > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." > -- Umberto Eco From aisaac at american.edu Tue Jan 13 09:31:09 2009 From: aisaac at american.edu (Alan G Isaac) Date: Tue, 13 Jan 2009 09:31:09 -0500 Subject: [Numpy-discussion] coding style: citations In-Reply-To: <9457e7c80901112335h1439dd35k3b971585837db22c@mail.gmail.com> References: <496A5CC0.8050605@american.edu> <9457e7c80901111313o4866abc3w6155bf53c89ddd5d@mail.gmail.com> <496A6D16.5050501@american.edu> <9457e7c80901112335h1439dd35k3b971585837db22c@mail.gmail.com> Message-ID: <496CA5AD.7090301@american.edu> > 2009/1/12 Alan G Isaac : >> Numerical keys will clearly *not* be consistent. >> The same key will refer to different citations >> on different pages, and key width will not be >> uniform. On 1/12/2009 2:35 AM Stéfan van der Walt apparently wrote: > We automatically renumber the citations to take care of this.
But this just creates a different inconsistency: different numbers refer to the same article, and additionally that article citation might be formatted in so many different ways that it is difficult to determine that it is in fact the same article. There really is no substitute for using real cite keys and a central database of citations. Alan Isaac From stefan at sun.ac.za Tue Jan 13 09:44:33 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Tue, 13 Jan 2009 16:44:33 +0200 Subject: [Numpy-discussion] coding style: citations In-Reply-To: <496CA5AD.7090301@american.edu> References: <496A5CC0.8050605@american.edu> <9457e7c80901111313o4866abc3w6155bf53c89ddd5d@mail.gmail.com> <496A6D16.5050501@american.edu> <9457e7c80901112335h1439dd35k3b971585837db22c@mail.gmail.com> <496CA5AD.7090301@american.edu> Message-ID: <9457e7c80901130644l1cfcdab1m7698f0d5b17ce59e@mail.gmail.com> 2009/1/13 Alan G Isaac : > There really is no substitute for using real cite keys > and a central database of citations. How do you propose getting the citations into the docstrings? Each docstring needs a copy of its references, and those references may be to specific pages. I don't recall ReST supporting the LaTeX "[key, p. 13--40]" syntax. So, there's some work to be done. Come forward with either an implementation or a solid technical proposal, and I'll gladly take it further. St?fan From scott.sinclair.za at gmail.com Tue Jan 13 09:45:36 2009 From: scott.sinclair.za at gmail.com (Scott Sinclair) Date: Tue, 13 Jan 2009 16:45:36 +0200 Subject: [Numpy-discussion] ndarray.resize method and reference counting In-Reply-To: <9457e7c80901130524k17e6015dh83e40319957320cb@mail.gmail.com> References: <6a17e9ee0901130348o4781525ck4fb88072b05d352f@mail.gmail.com> <9457e7c80901130524k17e6015dh83e40319957320cb@mail.gmail.com> Message-ID: <6a17e9ee0901130645n41fcbc4ja11796c63d342685@mail.gmail.com> I thought it was a self contained snippet ;-) Here's another attempt that shows "_" is the cause of my confusion. >>> import numpy as np >>> x = np.eye(3) >>> x array([[ 1., 0., 0.], [ 0., 1., 0.], [ 0., 0., 1.]]) >>> x.resize((5,5)) Traceback (most recent call last): File "", line 1, in ValueError: cannot resize an array that has been referenced or is referencing another array in this way. Use the resize function >>> _ array([[ 1., 0., 0.], [ 0., 1., 0.], [ 0., 0., 1.]]) Thanks for the help, Scott 2009/1/13 St?fan van der Walt : > Hi Scott > > I can't reproduce the problem below. Would you please send a > self-contained snippet? > > Note that, in Python, "_" is a special variable that always points to > the last result. In IPython there are several others. > > Cheers > St?fan > > 2009/1/13 Scott Sinclair : >> # I don't expect this >>>>> x = np.eye(3) >>>>> x >> array([[ 1., 0., 0.], >> [ 0., 1., 0.], >> [ 0., 0., 1.]]) >>>>> x.resize((5,5), refcheck=True) >> Traceback (most recent call last): >> File "", line 1, in >> ValueError: cannot resize an array that has been referenced or is referencing >> another array in this way. 
>> Use the resize function
>>>>> x.resize((5,5), refcheck=False)
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>

From aisaac at american.edu  Tue Jan 13 13:20:59 2009
From: aisaac at american.edu (Alan G Isaac)
Date: Tue, 13 Jan 2009 13:20:59 -0500
Subject: [Numpy-discussion] coding style: citations
In-Reply-To: <9457e7c80901130644l1cfcdab1m7698f0d5b17ce59e@mail.gmail.com>
References: <496A5CC0.8050605@american.edu> <9457e7c80901111313o4866abc3w6155bf53c89ddd5d@mail.gmail.com> <496A6D16.5050501@american.edu> <9457e7c80901112335h1439dd35k3b971585837db22c@mail.gmail.com> <496CA5AD.7090301@american.edu> <9457e7c80901130644l1cfcdab1m7698f0d5b17ce59e@mail.gmail.com>
Message-ID: <496CDB8B.8010603@american.edu>

> 2009/1/13 Alan G Isaac :
>> There really is no substitute for using real cite keys
>> and a central database of citations.

On 1/13/2009 9:44 AM Stéfan van der Walt apparently wrote:
> How do you propose getting the citations into the docstrings? Each
> docstring needs a copy of its references, and those references may be
> to specific pages. I don't recall ReST supporting the LaTeX "[key, p.
> 13--40]" syntax. So, there's some work to be done. Come forward with
> either an implementation or a solid technical proposal, and I'll
> gladly take it further.

Originally I was just pointing out a problem. Proposing a good solution requires some discussion. The problem has also changed because of a suggestion that the docs should be usable for book compilation, and I am very uncertain how that is conceived.

The key question is: should each docstring actually contain its citation, or can there be a page of citations?

What I can offer to help with is this: automatic generation of citation keys, and automatic formatting of citations. This could be used to assist docstring authors. With no database, this means that if you write a front end to accept citation info, we can spit out reST-formatted citations plus a rule-based cite key. If the bib info is added to a database, it could be a unique rule-based cite key.

But really, all I originally wanted to do was to call for an end to the uninformative use of reST footnotes in the place of reST citations with informative cite keys.

fwiw,
Alan Isaac

PS You can always write [last-2009-sja]_, p.9. But you are right that reST does not allow "extra" info in the brackets with the cite key. More generally, it does not provide for treating the cite key as substitutable in generated documents. This is a big difference from LaTeX, but recall in reST the source is meant to be a human-readable document. (Still, I'd like to see this functionality added.)

From timmichelsen at gmx-topmail.de  Tue Jan 13 13:55:16 2009
From: timmichelsen at gmx-topmail.de (Tim Michelsen)
Date: Tue, 13 Jan 2009 19:55:16 +0100
Subject: [Numpy-discussion] coding style: citations
In-Reply-To: <496CDB8B.8010603@american.edu>
References: <496A5CC0.8050605@american.edu> <9457e7c80901111313o4866abc3w6155bf53c89ddd5d@mail.gmail.com> <496A6D16.5050501@american.edu> <9457e7c80901112335h1439dd35k3b971585837db22c@mail.gmail.com> <496CA5AD.7090301@american.edu> <9457e7c80901130644l1cfcdab1m7698f0d5b17ce59e@mail.gmail.com> <496CDB8B.8010603@american.edu>
Message-ID:

Hello,
I really like the discussion here.

> Originally I was just pointing out a problem.
> Proposing a good solution requires some discussion.
> The problem has also changed because of a suggestion
> that the docs should be usable for book compilation,
> and I am very uncertain how that is conceived.
>
> The key question is: should each docstring actually
> contain its citation, or can there be a page of
> citations?

See, many of us may have:
a) a folder with all PDF documents of literature or articles
b) a file which is used to administer the above, be it Bibtex, EndNote, etc.
c) different texts which use a) with the help of b)
d) docstrings with references

In order to unify b) and d) (= reduce duplicated work) I decided to keep my citations in one file (references.rst) and reference these from the docstrings. This works quite well.

Please have a look at:
http://bitbucket.org/birkenfeld/sphinx/issue/63/make-sphinx-read-bibtex-files-for

And the example Sphinx project at:
#3 - http://bitbucket.org/birkenfeld/sphinx/issue/63/make-sphinx-read-bibtex-files-for#comment-3397

I would appreciate very much to receive any feedback on my approach.

Hope that this helps.

Kind regards,
Timmie

From aisaac at american.edu  Tue Jan 13 14:36:34 2009
From: aisaac at american.edu (Alan G Isaac)
Date: Tue, 13 Jan 2009 14:36:34 -0500
Subject: [Numpy-discussion] coding style: citations
In-Reply-To:
References: <496A5CC0.8050605@american.edu> <9457e7c80901111313o4866abc3w6155bf53c89ddd5d@mail.gmail.com> <496A6D16.5050501@american.edu> <9457e7c80901112335h1439dd35k3b971585837db22c@mail.gmail.com> <496CA5AD.7090301@american.edu> <9457e7c80901130644l1cfcdab1m7698f0d5b17ce59e@mail.gmail.com> <496CDB8B.8010603@american.edu>
Message-ID: <496CED42.1020109@american.edu>

On 1/13/2009 1:55 PM Tim Michelsen apparently wrote:
> Please have a look at:
> http://bitbucket.org/birkenfeld/sphinx/issue/63/make-sphinx-read-bibtex-files-for
>
> And the example Sphinx project at:
> #3 -
> http://bitbucket.org/birkenfeld/sphinx/issue/63/make-sphinx-read-bibtex-files-for#comment-3397
>
> I would appreciate very much to receive any feedback on my approach.

If I understand, you are using bib4txt.py to do this? Then I can offer to help if you run into problems.

Note that bib4txt can do citation reference substitution. I cannot tell if you are doing that separately.

You did not say specifically what the docstring problem is, but the obvious difficulty is the following: in a document a reference points to a citation *in that document*, while in a docstring it is not clear whether you want that. (I.e., should the citation text be in the docstring or in a file of citations from all the docstrings?)
Alan Isaac

From robert.kern at gmail.com  Tue Jan 13 14:46:50 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 13 Jan 2009 13:46:50 -0600
Subject: [Numpy-discussion] coding style: citations
In-Reply-To: <496CED42.1020109@american.edu>
References: <496A5CC0.8050605@american.edu> <9457e7c80901111313o4866abc3w6155bf53c89ddd5d@mail.gmail.com> <496A6D16.5050501@american.edu> <9457e7c80901112335h1439dd35k3b971585837db22c@mail.gmail.com> <496CA5AD.7090301@american.edu> <9457e7c80901130644l1cfcdab1m7698f0d5b17ce59e@mail.gmail.com> <496CDB8B.8010603@american.edu> <496CED42.1020109@american.edu>
Message-ID: <3d375d730901131146p41af0160q51414e6042249ea1@mail.gmail.com>

On Tue, Jan 13, 2009 at 13:36, Alan G Isaac wrote:
> On 1/13/2009 1:55 PM Tim Michelsen apparently wrote:
>> Please have a look at:
>> http://bitbucket.org/birkenfeld/sphinx/issue/63/make-sphinx-read-bibtex-files-for
>>
>> And the example Sphinx project at:
>> #3 -
>> http://bitbucket.org/birkenfeld/sphinx/issue/63/make-sphinx-read-bibtex-files-for#comment-3397
>>
>> I would appreciate very much to receive any feedback on my approach.
>
> If I understand, you are using bib4txt.py to do this?
> Then I can offer to help if you run into problems.
>
> Note that bib4txt can do citation reference substitution.
> I cannot tell if you are doing that separately.
>
> You did not say specifically what the docstring problem
> is, but the obvious difficulty is the following:
> in a document a reference points to a citation *in that document*,
> while in a docstring it is not clear whether you want that.
> (I.e., should the citation text be in the docstring or
> in a file of citations from all the docstrings?)

In the docstring.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From timmichelsen at gmx-topmail.de  Tue Jan 13 15:15:18 2009
From: timmichelsen at gmx-topmail.de (Tim Michelsen)
Date: Tue, 13 Jan 2009 21:15:18 +0100
Subject: [Numpy-discussion] coding style: citations
In-Reply-To: <496CED42.1020109@american.edu>
References: <496A5CC0.8050605@american.edu> <9457e7c80901111313o4866abc3w6155bf53c89ddd5d@mail.gmail.com> <496A6D16.5050501@american.edu> <9457e7c80901112335h1439dd35k3b971585837db22c@mail.gmail.com> <496CA5AD.7090301@american.edu> <9457e7c80901130644l1cfcdab1m7698f0d5b17ce59e@mail.gmail.com> <496CDB8B.8010603@american.edu> <496CED42.1020109@american.edu>
Message-ID:

> If I understand, you are using bib4txt.py to do this?

Yes, I use bib4txt.py. The extension just runs bib4txt.py over all documents to extract the citations. Then it creates the reference file. The bibtex file is also converted to ReSt using your tools.

But since bibstuff needs the citation reference to be in a string object, I cannot process the docstrings in order to create the literature file with only the referenced citations. I therefore just include all references as a workaround.

> Then I can offer to help if you run into problems.

The problem above is one where you, Georg Brandl, or anyone else familiar with the way Sphinx processes the docstrings could help.

> Note that bib4txt can do citation reference substitution.
> I cannot tell if you are doing that separately.

No, I am not doing this. I create my reference keys in JabRef. The program is cross-platform and has an automatic bibtex key generator which works according to your defined pattern.
=> This is just the answer to your question on reference formatting. The scipy/numpy community just needs to agree on a common reference pattern. Say: author_year or authoretal_shorttitle_year, and JabRef or any other program can take care of the rest.

> You did not say specifically what the docstring problem
> is, but the obvious difficulty is the following:
> in a document a reference points to a citation *in that document*,
> while in a docstring it is not clear whether you want that.
> (I.e., should the citation text be in the docstring or
> in a file of citations from all the docstrings?)

Please take a look at the zip file from the bug report at bitbucket. I am keen to see what you think. You have more X-perience and may suggest some improvements. It comes with a todo / readme.

Nice to see that my efforts are of use for you.

Kind regards,
Timmie

From millman at berkeley.edu  Wed Jan 14 02:21:26 2009
From: millman at berkeley.edu (Jarrod Millman)
Date: Tue, 13 Jan 2009 23:21:26 -0800
Subject: [Numpy-discussion] remove need for site.cfg on default
Message-ID:

Due to the fact that I was tired of adding site.cfg to scipy and numpy when building on Fedora and Ubuntu systems, as well as a scipy ticket (http://scipy.org/scipy/numpy/ticket/985), I decided to try and add default system paths to numpy.distutils. You can find out more details on this ticket:
http://projects.scipy.org/scipy/scipy/ticket/817

I would like to have as many people test this as possible. So I would like everyone who has had to build numpy/scipy with a site.cfg, despite having installed all the dependencies in the system default locations, to test this. If you would, please update to the numpy trunk and build numpy and scipy without your old site.cfg. Regardless of whether it works or not, I would appreciate it if you could let me know.

Thanks,
Jarrod

From asbach at ient.rwth-aachen.de  Wed Jan 14 06:17:40 2009
From: asbach at ient.rwth-aachen.de (Mark Asbach)
Date: Wed, 14 Jan 2009 12:17:40 +0100
Subject: [Numpy-discussion] Trying to implement the array interface
Message-ID:

Hi there,

I'm currently extending the Python wrapper for the Open Computer Vision Library (opencv) with the goal of interfacing numerical libraries as seamlessly as possible. Unfortunately, it doesn't seem to be that easy ;-)

What I've done so far:

- Added an __array_interface__ property to the Python side of OpenCV data structures (matrices and images) that uses version 3 of the interface definition and supplies the keys 'version', 'shape', 'typestr', 'data' and in some cases 'strides' when we have a non-contiguous memory layout. I think I'm compatible with http://numpy.scipy.org/array_interface.shtml .

- Added parsing of the __array_interface__ of Python objects passed to OpenCV methods. I'm a bit unsure of how to use the C/C++ side (array struct) and if I can expect it to be there (for example: I don't provide one with OpenCV). Since I intend to keep OpenCV independent of numpy, calling functions from numpy.h is not an option, as far as I can see.

The stuff described above is in the head revision of OpenCV, accessible via "svn co https://opencvlibrary.svn.sourceforge.net/svnroot/opencvlibrary/trunk/opencv".

I've tried using the following packages with OpenCV this way:

- numpy (1.0.4): everything works as expected. This is the most important library for OpenCV users, so this is a good sign.
- pylab/matplotlib (0.91.2): seems to use numpy / scipy-core. Everything okay.
- PIL (1.1.6): the array interface (Python side) doesn't adhere to the definition -> no 'version' key, 'data' is a string, not a tuple holding the pointer. What to do with this?
- Numeric (24.2): I can create arrays from OpenCV datatypes and print them. Converting to other types however always yields 'Cannot convert scalar to float' or 'a float is required'. Strange :-/ Numeric.array instances also don't carry an __array_interface__. I can however convert by using numpy.arrays as an intermediate step.
- Gnuplot (1.7): uses Numeric, so doesn't work either
- pymat: didn't check. Seems to use Numeric, test results cover Numeric 23 and Matlab 6.5 only, so this package might be dead?
- numarray: didn't check. Is there still any relevance of this package?

Best,

Mark

--
Mark Asbach
Institut für Nachrichtentechnik, RWTH Aachen University
http://www.ient.rwth-aachen.de/cms/team/m_asbach
-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/pkcs7-signature
Size: 4403 bytes
Desc: not available
URL:

From gael.varoquaux at normalesup.org  Wed Jan 14 08:05:18 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Wed, 14 Jan 2009 14:05:18 +0100
Subject: [Numpy-discussion] Trying to implement the array interface
In-Reply-To:
References:
Message-ID: <20090114130518.GA21830@phare.normalesup.org>

On Wed, Jan 14, 2009 at 12:17:40PM +0100, Mark Asbach wrote:
> - Added an __array_interface__ property to the Python side of OpenCV data
> structures (matrices and images) [...]
> - Added parsing of the __array_interface__ of Python objects passed to OpenCV
> methods. [...]
> I've tried using the following packages with OpenCV this way:
> - numpy (1.0.4): everything works as expected. This is the most important
> library for OpenCV users, so this is a good sign. [...]

_Fantastic_! I read from this e-mail that you are having a few difficulties moving forward, but what you have already achieved is really fantastic and will definitely be very useful to many people.

Sorry, I can't help you with the problems you are having, but I wanted to thank you for this effort.

Gaël
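For anyone implementing the exporting side, a toy version-3 exporter fits in a few lines of pure Python. This is only a hedged sketch: the class, its shape and its typestr are invented for illustration, and the buffer is borrowed from a numpy array rather than from real OpenCV storage:

    import numpy as np

    class ToyImage(object):
        """Pretend image that exposes its pixel buffer via __array_interface__."""
        def __init__(self, h, w):
            self._buf = np.zeros((h, w), dtype=np.uint8)  # stand-in for a C-side buffer

        @property
        def __array_interface__(self):
            return {
                'version': 3,
                'shape': self._buf.shape,
                'typestr': '|u1',                        # unsigned 8-bit, no byte order
                'data': (self._buf.ctypes.data, False),  # (address, read-only flag)
            }

    img = ToyImage(4, 6)
    a = np.asarray(img)   # numpy consumes the interface and shares the memory
    a[0, 0] = 255         # the change is visible through img._buf as well

A real exporter for non-contiguous images would add the optional 'strides' key, exactly as described in the post above.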
From matthieu.brucher at gmail.com  Wed Jan 14 08:37:55 2009
From: matthieu.brucher at gmail.com (Matthieu Brucher)
Date: Wed, 14 Jan 2009 14:37:55 +0100
Subject: [Numpy-discussion] Trying to implement the array interface
In-Reply-To:
References:
Message-ID:

2009/1/14 Mark Asbach :
> Hi there,
>
> I'm currently extending the Python wrapper for the Open Computer Vision
> Library (opencv) with the goal of interfacing numerical libraries as seamlessly
> as possible. Unfortunately, it doesn't seem to be that easy ;-)
>
> What I've done so far:
>
> - Added an __array_interface__ property to the Python side of OpenCV data
> structures (matrices and images) that uses version 3 of the interface
> definition and supplies the keys 'version', 'shape', 'typestr', 'data' and
> in some cases 'strides' when we have a non-contiguous memory layout. I think
> I'm compatible with http://numpy.scipy.org/array_interface.shtml .

Great! How did you do it? I've written something like this before, but it was always very complex :|

> - Added parsing of the __array_interface__ of Python objects passed to OpenCV
> methods. I'm a bit unsure of how to use the C/C++ side (array struct) and if
> I can expect it to be there (for example: I don't provide one with OpenCV).
> Since I intend to keep OpenCV independent of numpy, calling functions from
> numpy.h is not an option, as far as I can see.

Perhaps you can tell SWIG to extract that information from the numpy array? A more complex numpy.i would probably be enough.

> - PIL (1.1.6): the array interface (Python side) doesn't adhere to the
> definition -> no 'version' key, 'data' is a string, not a tuple holding the
> pointer. What to do with this?

Perhaps you can force it to pass numpy.asarray(image)?

Matthieu
--
Information System Engineer, Ph.D.
Website: http://matthieu-brucher.developpez.com/
Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92
LinkedIn: http://www.linkedin.com/in/matthieubrucher

From oliphant at enthought.com  Wed Jan 14 09:20:03 2009
From: oliphant at enthought.com (Travis E. Oliphant)
Date: Wed, 14 Jan 2009 08:20:03 -0600
Subject: [Numpy-discussion] Trying to implement the array interface
In-Reply-To:
References:
Message-ID: <496DF493.6070502@enthought.com>

Mark Asbach wrote:
> Hi there,
>
> I'm currently extending the Python wrapper for the Open Computer
> Vision Library (opencv) with the goal of interfacing numerical libraries
> as seamlessly as possible. Unfortunately, it doesn't seem to be that
> easy ;-)
>
> What I've done so far:
>
> - Added an __array_interface__ property to the Python side of OpenCV
> data structures (matrices and images) that uses version 3 of the
> interface definition and supplies the keys 'version', 'shape',
> 'typestr', 'data' and in some cases 'strides' when we have a
> non-contiguous memory layout. I think I'm compatible with
> http://numpy.scipy.org/array_interface.shtml .

Great. This is a good first step.

> - Added parsing of the __array_interface__ of Python objects passed to
> OpenCV methods. I'm a bit unsure of how to use the C/C++ side (array
> struct) and if I can expect it to be there (for example: I don't
> provide one with OpenCV). Since I intend to keep OpenCV independent of
> numpy, calling functions from numpy.h is not an option, as far as I
> can see.

You can't expect the __array_struct__ property to be there, but if it is, it allows you to get all the information you need with one attribute lookup (albeit in C) instead of many.

>
> The stuff described above is in the head revision of OpenCV,
> accessible via "svn co
> https://opencvlibrary.svn.sourceforge.net/svnroot/opencvlibrary/trunk/opencv".
>
> I've tried using the following packages with OpenCV this way:
> - numpy (1.0.4): everything works as expected. This is the most
> important library for OpenCV users, so this is a good sign.
> - pylab/matplotlib (0.91.2): seems to use numpy / scipy-core.
> Everything okay.
> - PIL (1.1.6): the array interface (Python side) doesn't adhere to the
> definition -> no 'version' key, 'data' is a string, not a tuple holding
> the pointer. What to do with this?

That is probably true. I've worked a bit with PIL to get things to work, but haven't followed the project lately to see where it is at. One difficulty is that the PIL memory layout can be quite different from a NumPy array, and so that would be why the "data" is a string. The best reference for implementing a consumer of the interface is to look in the NumPy source code for where it grabs the __array_interface__ and/or __array_struct__ attribute and makes use of the data found there.

> - Numeric (24.2): I can create arrays from OpenCV datatypes and print
> them. Converting to other types however always yields 'Cannot convert
> scalar to float' or 'a float is required'. Strange :-/ Numeric.array
> instances also don't carry an __array_interface__.
I can however > convert by using numpy.arrays as intermediate step. I believe the __array_struct__ property was used instead. You can implement either __array_struct__ or __array_interface__ or both as an exporter. Thus, a consumer that wants to see everything has to consume both. I'm not sure I understand the error you are getting exactly. > - Gnuplot (1.7): uses Numeric, so doesn't work as well > - pymat: didn't check. Seems to use Numeric, test results cover > Numeric 23 and Matlab 6.5 only, so this package might be dead? I don't remember pymat very well. > - numarray: didn't check. Is there still any relevance of this package? A few tools still use it, but it is deprecated (as is Numeric of course). I'm glad to see you using the __array_interface__ because it will allow you to share memory with numXXX arrays. I'm sure you are also aware of the new buffer protocol in Python 2.6 and Python 3.0. This is the approach to take for the future. NumPy, of course, will support the __array_interface__ and __array_struct__ properties for some time. I'm hoping that a new release of NumPy will also support the new Python buffer interface. But, as of yet, it does not. Best regards, -Travis From millman at berkeley.edu Wed Jan 14 09:34:39 2009 From: millman at berkeley.edu (Jarrod Millman) Date: Wed, 14 Jan 2009 06:34:39 -0800 Subject: [Numpy-discussion] Trying to implement the array interface In-Reply-To: References: Message-ID: On Wed, Jan 14, 2009 at 3:17 AM, Mark Asbach wrote: > I'm currently extending the Python wrapper for the Open Computer Vision > Library (opencv) with the goal to interface numerical libraries as seemless > as possible. Unfortunately, it doesn't seem to be that easy ;-) This is really great. Thanks for working on this. > - pymat: didn't check. Seems to use Numeric, test results cover Numeric 23 > and Matlab 6.5 only, so this package might be dead? It is pretty much dead. Take a look at the mlabwrap scikit: http://scikits.appspot.com/mlabwrap mlabwrap hasn't been updated since 2007, but it works pretty well, supports numpy, and is much better than pymat. Jarrod From jackchungchiehyu at googlemail.com Wed Jan 14 14:57:50 2009 From: jackchungchiehyu at googlemail.com (Jack Yu) Date: Wed, 14 Jan 2009 19:57:50 +0000 (UTC) Subject: [Numpy-discussion] Complete LAPACK needed (Frank Lagor) References: <9fddf64a0811011030t92d10f2q8419b3348821bd10@mail.gmail.com> Message-ID: Frank Lagor seas.upenn.edu> writes: > > Thanks so much for your help, David. I'm sorry I did not receive your posts previously -- I have the digest mode on and there is a bit of a delay. I'll try to change my options next time I post a request. > Thanks so much again,Frank > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > Hello, I'm trying to run a python script in Condor(http://www.cs.wisc.edu/condor/) on a cluster at uni. This script runs fine when I run it normally, just by typing 'python', followed by the name of the script. 
However, after writing a condor submission file for it, and submitting it as a condor job, it fails at the importing of numpy, giving:

========================================
Traceback (most recent call last):
  File "/home/spxjcy/mycondor_stuff/CountApples/xCountApples.py", line 9, in
    import numpy
  File "/opt/bin/python/lib/python2.5/site-packages/numpy/__init__.py", line 103, in
    import linalg
  File "/opt/bin/python/lib/python2.5/site-packages/numpy/linalg/__init__.py", line 4, in
    from linalg import *
  File "/opt/bin/python/lib/python2.5/site-packages/numpy/linalg/linalg.py", line 29, in
    from numpy.linalg import lapack_lite
ImportError: liblapack.so.3: cannot open shared object file: No such file or directory
========================================

After reading your thread, I realised that your problem was not the same, but the closest I could find. I tried 'numpy.test()', nothing fails. On 'ldd lapack_lite.so' I get:

========================================
liblapack.so.3 => /usr/lib64/liblapack.so.3 (0x00002aaaaabd6000)
libblas.so.3 => /usr/lib64/libblas.so.3 (0x00002aaaab1c2000)
libg2c.so.0 => /usr/lib64/libg2c.so.0 (0x00002aaaab314000)
libm.so.6 => /lib64/libm.so.6 (0x00002aaaab435000)
libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00002aaaab5bb000)
libc.so.6 => /lib64/libc.so.6 (0x00002aaaab6c7000)
/lib64/ld-linux-x86-64.so.2 (0x0000555555554000)
========================================

I'm not entirely sure what this means, but it looks like there is a file called liblapack.so.3. Another thing I have observed is that other modules which came with python, like 'os', 're', etc., all import fine in condor, so I'm guessing that this has something to do with numpy. Please, if you have any ideas about solving this, do say.

thanks,
Jack

From dsdale24 at gmail.com  Wed Jan 14 20:29:34 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Wed, 14 Jan 2009 20:29:34 -0500
Subject: [Numpy-discussion] any news concerning numpy for python-2.6 on windows?
Message-ID:

Hello,

I've been developing a numpy based package that I'd like to share with this list, but I need to test it on windows first. Is there any news concerning numpy installers for python-2.6 on windows, and is there any chance of installing on Vista?

Thank you,
Darren
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From david at ar.media.kyoto-u.ac.jp  Wed Jan 14 20:35:39 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Thu, 15 Jan 2009 10:35:39 +0900
Subject: [Numpy-discussion] any news concerning numpy for python-2.6 on windows?
In-Reply-To:
References:
Message-ID: <496E92EB.2000800@ar.media.kyoto-u.ac.jp>

Darren Dale wrote:
> Hello,
>
> I've been developing a numpy based package that I'd like to share with
> this list, but I need to test it on windows first. Is there any news
> concerning numpy installers for python-2.6 on windows, and is there
> any chance of installing on Vista?

There will be 2.6 installers for numpy 1.3.
I am not aware of any vista problems as long as python itself and numpy are installed for all users - I would like to know about them otherwise,

David

From slaunger at gmail.com  Thu Jan 15 05:48:23 2009
From: slaunger at gmail.com (Kim Hansen)
Date: Thu, 15 Jan 2009 11:48:23 +0100
Subject: [Numpy-discussion] How to handle fields less than one byte in a recarray
Message-ID:

Hi Numpy forum

Let me start out with a generic example:

In [3]: test_byte_str = "".join(["\x12\x03", "\x23\x05", "\x35\x08"])

In [4]: desc = dtype({'names' : ["HIGHlow", "HIGH + low"], 'formats': [uint8, uint8]})

In [5]: r = rec.fromstring(test_byte_str, dtype=desc)

In [6]: r[0]
Out[6]: (18, 3)

In [7]: r['HIGHlow']
Out[7]: array([18, 35, 53], dtype=uint8)

In [8]: high = r['HIGHlow'] >> 4

In [9]: high
Out[9]: array([1, 2, 3], dtype=uint8)

In [10]: low = r['HIGHlow'] & 0x0F

In [11]: low
Out[11]: array([2, 3, 5], dtype=uint8)

In [12]: high_plus_low = high + low

In [13]: high_plus_low
Out[13]: array([3, 5, 8], dtype=uint8)

In [14]: r["HIGH + low"]
Out[14]: array([3, 5, 8], dtype=uint8)

So I have here an array where the first byte actually represents two fields, HIGH and low, where HIGH is the four most significant bits and low is the four least significant bits. As you can see in the example, I can figure out how to get an array of HIGH and low by postprocessing the record array, but I would really like to encapsulate this logic somehow inside the record array (or in an instance of a derived class), such that I could access the arrays directly as r["HIGH"] and r["low"], or perhaps even better wrap it all up inside a class RWrapper, such that

r = RWrapper(test_byte_str)
r.low

would work, but not r.HIGHlow. How do I implement that in a Pythonic way? My real application is considerably more complicated, but I think this describes the main problem.

From gael.varoquaux at normalesup.org  Thu Jan 15 05:54:55 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Thu, 15 Jan 2009 11:54:55 +0100
Subject: [Numpy-discussion] Build errors
Message-ID: <20090115105455.GB32576@phare.normalesup.org>

OK, here we go for the stupid questions showing that I really don't understand building well:

I am building numpy on a Mandriva x86_64. I built an optimized ATLAS, and added the relevant lines to the site.cfg so that numpy does find the libraries.
But now I get the following error at build:

/usr/bin/ld: /volatile/varoquau/usr/lib//libptcblas.a(cblas_dptgemm.o): relocation R_X86_64_32 against `a local symbol' can not be used when making a shared object; recompile with -fPIC
/volatile/varoquau/usr/lib//libptcblas.a: could not read symbols: Bad value
collect2: ld returned 1 exit status

I must confess I really have no clue what this means, or how to solve it.

Cheers,

Gaël

From michael.abshoff at googlemail.com  Thu Jan 15 06:11:22 2009
From: michael.abshoff at googlemail.com (Michael Abshoff)
Date: Thu, 15 Jan 2009 03:11:22 -0800
Subject: [Numpy-discussion] Build errors
In-Reply-To: <20090115105455.GB32576@phare.normalesup.org>
References: <20090115105455.GB32576@phare.normalesup.org>
Message-ID: <496F19DA.5080204@gmail.com>

Gael Varoquaux wrote:

Hi Gael,

> OK, here we go for the stupid questions showing that I really don't
> understand building well:
>
> I am building numpy on a Mandriva x86_64. I built an optimized ATLAS, and
> added the relevant lines to the site.cfg so that numpy does find the
> libraries. But now I get the following error at build:
>
> /usr/bin/ld: /volatile/varoquau/usr/lib//libptcblas.a(cblas_dptgemm.o):
> relocation R_X86_64_32 against `a local symbol' can not be used when
> making a shared object; recompile with -fPIC
> /volatile/varoquau/usr/lib//libptcblas.a: could not read symbols: Bad
> value
> collect2: ld returned 1 exit status
>
> I must confess I really have no clue what this means, or how to solve it.

You need to build a dynamic version of ATLAS, or alternatively make ATLAS use -fPIC during compilation when building the static libs. Note that AFAIK ATLAS 3.8.2's make install does not copy over the dynamic libs, but you should easily be able to copy them over manually. The patch I am using is

--- Make.top.orig	2009-01-01 19:20:21.000000000 -0800
+++ Make.top	2008-03-20 02:26:35.000000000 -0700
@@ -298,5 +298,11 @@
 	- chmod 0644 $(INSTdir)/libf77blas.a
 	- cp $(LIBdir)/libptcblas.a $(INSTdir)/.
 	- cp $(LIBdir)/libptf77blas.a $(INSTdir)/.
+	- cp $(LIBdir)/libatlas.so $(INSTdir)/.
+	- cp $(LIBdir)/libcblas.so $(INSTdir)/.
+	- cp $(LIBdir)/libf77blas.so $(INSTdir)/.
+	- cp $(LIBdir)/liblapack.so $(INSTdir)/.
+	- chmod 0644 $(INSTdir)/libatlas.so $(INSTdir)/liblapack.so \
+	  $(INSTdir)/libcblas.so $(INSTdir)/libcblas.so
 	- chmod 0644 $(INSTdir)/libptcblas.a $(INSTdir)/libptf77blas.a

but you would need to add the appropriate lines for the multi-threaded libs, too. The install issue is fixed in ATLAS 3.9.x AFAIK.

> Cheers,
>
> Gaël

Cheers,

Michael

> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>

From dsdale24 at gmail.com  Thu Jan 15 07:46:37 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Thu, 15 Jan 2009 07:46:37 -0500
Subject: [Numpy-discussion] any news concerning numpy for python-2.6 on windows?
In-Reply-To: <496E92EB.2000800@ar.media.kyoto-u.ac.jp>
References: <496E92EB.2000800@ar.media.kyoto-u.ac.jp>
Message-ID:

On Wed, Jan 14, 2009 at 8:35 PM, David Cournapeau <david at ar.media.kyoto-u.ac.jp> wrote:
> Darren Dale wrote:
> > Hello,
> >
> > I've been developing a numpy based package that I'd like to share with
> > this list, but I need to test it on windows first. Is there any news
> > concerning numpy installers for python-2.6 on windows, and is there
> > any chance of installing on Vista?
>
> There will be 2.6 installers for numpy 1.3. I am not aware of any vista
> problems as long as python itself and numpy are installed for all users
> - I would like to know about them otherwise,

Thanks David. Any ideas about the release date? http://projects.scipy.org/scipy/numpy/roadmap is not up to date.

Darren
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From gael.varoquaux at normalesup.org  Thu Jan 15 07:55:14 2009
From: gael.varoquaux at normalesup.org (Gael Varoquaux)
Date: Thu, 15 Jan 2009 13:55:14 +0100
Subject: [Numpy-discussion] Build errors
In-Reply-To: <496F19DA.5080204@gmail.com>
References: <20090115105455.GB32576@phare.normalesup.org> <496F19DA.5080204@gmail.com>
Message-ID: <20090115125514.GA26690@phare.normalesup.org>

On Thu, Jan 15, 2009 at 03:11:22AM -0800, Michael Abshoff wrote:
> You need to build a dynamic version of ATLAS, or alternatively make ATLAS
> use -fPIC during compilation when building the static libs. Note that AFAIK
> ATLAS 3.8.2's make install does not copy over the dynamic libs, but you
> should easily be able to copy them over manually.

Of course.
I was being brain dead. I did this last time, and this time I forgot. I had also forgotten to optimise lapack... Doing things too quickly.

Thanks,

Gaël

From numpy-discussion at maubp.freeserve.co.uk  Thu Jan 15 07:51:50 2009
From: numpy-discussion at maubp.freeserve.co.uk (Peter)
Date: Thu, 15 Jan 2009 12:51:50 +0000
Subject: [Numpy-discussion] any news concerning numpy for python-2.6 on windows?
In-Reply-To: <496E92EB.2000800@ar.media.kyoto-u.ac.jp>
References: <496E92EB.2000800@ar.media.kyoto-u.ac.jp>
Message-ID: <320fb6e00901150451t122871dfv1e3d6aad486bba11@mail.gmail.com>

On Thu, Jan 15, 2009 at 1:35 AM, David Cournapeau wrote:
>
> There will be 2.6 installers for numpy 1.3. I am not aware of any vista
> problems as long as python itself and numpy are installed for all users
> - I would like to know about them otherwise,
>
> David

Are there any plans to back-port the required fixes to the numpy 1.2 branch in order to provide numpy 1.2 on Windows? If numpy 1.3 is due out fairly shortly, this isn't so pressing I suppose.

However, having numpy 1.2 available would be helpful for Windows users, especially if it allows them a more gradual transition from numpy 1.1 to numpy 1.3, as I'm sure there will be a few deprecations/API changes to deal with.

Thanks,

Peter

From cournape at gmail.com  Thu Jan 15 08:36:13 2009
From: cournape at gmail.com (David Cournapeau)
Date: Thu, 15 Jan 2009 22:36:13 +0900
Subject: [Numpy-discussion] any news concerning numpy for python-2.6 on windows?
In-Reply-To: <320fb6e00901150451t122871dfv1e3d6aad486bba11@mail.gmail.com>
References: <496E92EB.2000800@ar.media.kyoto-u.ac.jp> <320fb6e00901150451t122871dfv1e3d6aad486bba11@mail.gmail.com>
Message-ID: <5b8d13220901150536o47da2e7ve6fcdb7b61f36a3e@mail.gmail.com>

On Thu, Jan 15, 2009 at 9:51 PM, Peter wrote:
> On Thu, Jan 15, 2009 at 1:35 AM, David Cournapeau
> wrote:
>>
>> There will be 2.6 installers for numpy 1.3. I am not aware of any vista
>> problems as long as python itself and numpy are installed for all users
>> - I would like to know about them otherwise,
>>
>> David
>
> Are there any plans to back-port the required fixes to the numpy 1.2
> branch in order to provide numpy 1.2 on Windows? If numpy 1.3 is due
> out fairly shortly, this isn't so pressing I suppose.

No, the changes required are too significant, unfortunately. In principle, it is of course possible, but that would take too much time.

>
> However, having numpy 1.2 available would be helpful for Windows
> users, especially if it allows them a more gradual transition from
> numpy 1.1 to numpy 1.3, as I'm sure there will be a few
> deprecations/API changes to deal with.

Numpy 1.2 is already available on windows - only not for python 2.6.

David

From ndbecker2 at gmail.com  Thu Jan 15 09:27:04 2009
From: ndbecker2 at gmail.com (Neal Becker)
Date: Thu, 15 Jan 2009 09:27:04 -0500
Subject: [Numpy-discussion] limit function
Message-ID:

Is there a function to apply a limit to an array? I want to (efficiently) do:

y = x if x < limit, otherwise limit

From cimrman3 at ntc.zcu.cz  Thu Jan 15 09:31:01 2009
From: cimrman3 at ntc.zcu.cz (Robert Cimrman)
Date: Thu, 15 Jan 2009 15:31:01 +0100
Subject: [Numpy-discussion] limit function
In-Reply-To:
References:
Message-ID: <496F48A5.6050407@ntc.zcu.cz>

Neal Becker wrote:
> Is there a function to apply a limit to an array? I want to (efficiently) do:
>
> y = x if x < limit, otherwise limit

What about np.clip?

r.
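For completeness, a quick sketch of both spellings (untested here): clip bounds on both sides, while minimum gives exactly the one-sided behaviour asked for.

    >>> import numpy as np
    >>> x = np.array([1., 5., 12.])
    >>> np.minimum(x, 10.0)    # y = x where x < limit, otherwise limit
    array([  1.,   5.,  10.])
    >>> np.clip(x, 0., 10.)    # two-sided variant; also available as x.clip(...)
    array([  1.,   5.,  10.])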
From cournape at gmail.com  Thu Jan 15 09:32:32 2009
From: cournape at gmail.com (David Cournapeau)
Date: Thu, 15 Jan 2009 23:32:32 +0900
Subject: [Numpy-discussion] limit function
In-Reply-To:
References:
Message-ID: <5b8d13220901150632p67ea871by582c280f6107055c@mail.gmail.com>

On Thu, Jan 15, 2009 at 11:27 PM, Neal Becker wrote:
> Is there a function to apply a limit to an array? I want to (efficiently) do:
>
> y = x if x < limit, otherwise limit

Would clip do? For native types, it should be relatively fast, in particular if you use the out argument.

David

From dsdale24 at gmail.com  Thu Jan 15 10:18:07 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Thu, 15 Jan 2009 10:18:07 -0500
Subject: [Numpy-discussion] remove need for site.cfg on default
In-Reply-To:
References:
Message-ID:

Hi Jarrod,

On Wed, Jan 14, 2009 at 2:21 AM, Jarrod Millman wrote:
> Due to the fact that I was tired of adding site.cfg to scipy and numpy
> when building on Fedora and Ubuntu systems, as well as a scipy ticket
> (http://scipy.org/scipy/numpy/ticket/985), I decided to try and add
> default system paths to numpy.distutils. You can find out more
> details on this ticket:
> http://projects.scipy.org/scipy/scipy/ticket/817
>
> I would like to have as many people test this as possible. So I would
> like everyone who has had to build numpy/scipy with a site.cfg,
> despite having installed all the dependencies in the system default
> locations, to test this. If you would, please update to the numpy
> trunk and build numpy and scipy without your old site.cfg. Regardless
> of whether it works or not, I would appreciate it if you could let me
> know.

I tested on Gentoo, which does some non-standard stuff in an attempt to be able to switch between implementations. The libraries in /usr/lib are symlinks to the implementations in /usr/lib/blas/, and you can have various implementations like atlas, reference, or threaded-atlas, so I can run a program called eselect that lets me switch between the implementations. For some reason, gentoo renames the f77blas libs to simply blas. As a result, I need to make a minor change to numpy/distutils/system_info.py:

$ svn diff
Index: numpy/distutils/system_info.py
===================================================================
--- numpy/distutils/system_info.py      (revision 6323)
+++ numpy/distutils/system_info.py      (working copy)
@@ -90,7 +90,7 @@
 [atlas]
 library_dirs = /usr/lib/3dnow:/usr/lib/3dnow/atlas
 # for overriding the names of the atlas libraries
-atlas_libs = lapack, f77blas, cblas, atlas
+atlas_libs = lapack, blas, cblas, atlas

 [x11]
 library_dirs = /usr/X11R6/lib
@@ -893,7 +893,7 @@
 class atlas_info(system_info):
     section = 'atlas'
     dir_env_var = 'ATLAS'
-    _lib_names = ['f77blas','cblas']
+    _lib_names = ['blas','cblas']
     if sys.platform[:7]=='freebsd':
         _lib_atlas = ['atlas_r']
         _lib_lapack = ['alapack_r']
@@ -1000,7 +1000,7 @@
         self.set_info(**info)

 class atlas_blas_info(atlas_info):
-    _lib_names = ['f77blas','cblas']
+    _lib_names = ['blas','cblas']

     def calc_info(self):
         lib_dirs = self.get_lib_dirs()

On my system, the symlinks point to the threaded atlas implementation. I ran the full tests and all tests passed except the 9 known failures.
In case you are interested, here is the layout on gentoo:

$ slocate liblapack
/usr/lib64/liblapack.so.0
/usr/lib64/liblapack.a
/usr/lib64/lapack/atlas/liblapack.so.0
/usr/lib64/lapack/atlas/liblapack.a
/usr/lib64/lapack/atlas/liblapack.la
/usr/lib64/lapack/atlas/liblapack.so
/usr/lib64/lapack/atlas/liblapack.so.0.0.0
/usr/lib64/lapack/reference/liblapack.so.0
/usr/lib64/lapack/reference/liblapack.a
/usr/lib64/lapack/reference/liblapack.la
/usr/lib64/lapack/reference/liblapack.so
/usr/lib64/lapack/reference/liblapack.so.0.0.0
/usr/lib64/liblapack.so

$ slocate lib*blas:
/usr/lib64/blas
/usr/lib64/blas/gsl
/usr/lib64/blas/gsl/cblas.pc
/usr/lib64/blas/atlas
/usr/lib64/blas/atlas/libcblas.so.0.0.0
/usr/lib64/blas/atlas/libcblas.la
/usr/lib64/blas/atlas/libcblas.so
/usr/lib64/blas/atlas/libblas.so.0
/usr/lib64/blas/atlas/libcblas.so.0
/usr/lib64/blas/atlas/libblas.la
/usr/lib64/blas/atlas/libblas.so
/usr/lib64/blas/atlas/cblas.pc
/usr/lib64/blas/atlas/libcblas.a
/usr/lib64/blas/atlas/libblas.so.0.0.0
/usr/lib64/blas/atlas/blas.pc
/usr/lib64/blas/atlas/libblas.a
/usr/lib64/blas/reference
/usr/lib64/blas/reference/libblas.so.0
/usr/lib64/blas/reference/libblas.la
/usr/lib64/blas/reference/libblas.so
/usr/lib64/blas/reference/libblas.so.0.0.0
/usr/lib64/blas/reference/blas.pc
/usr/lib64/blas/reference/libblas.a
/usr/lib64/blas/threaded-atlas
/usr/lib64/blas/threaded-atlas/libcblas.so.0.0.0
/usr/lib64/blas/threaded-atlas/libcblas.la
/usr/lib64/blas/threaded-atlas/libcblas.so
/usr/lib64/blas/threaded-atlas/libblas.so.0
/usr/lib64/blas/threaded-atlas/libcblas.so.0
/usr/lib64/blas/threaded-atlas/libblas.la
/usr/lib64/blas/threaded-atlas/libblas.so
/usr/lib64/blas/threaded-atlas/cblas.pc
/usr/lib64/blas/threaded-atlas/libcblas.a
/usr/lib64/blas/threaded-atlas/libblas.so.0.0.0
/usr/lib64/blas/threaded-atlas/blas.pc
/usr/lib64/blas/threaded-atlas/libblas.a
/usr/lib64/perl5/site_perl/5.8.8/x86_64-linux/cblas.ph
/usr/lib64/libcblas.so
/usr/lib64/pkgconfig/cblas.pc
/usr/lib64/pkgconfig/blas.pc
/usr/lib64/libgslcblas.a
/usr/lib64/libblas.so.0
/usr/lib64/libcblas.so.0
/usr/lib64/libblas.so
/usr/lib64/libcblas.a
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From sturla at molden.no  Thu Jan 15 11:17:49 2009
From: sturla at molden.no (Sturla Molden)
Date: Thu, 15 Jan 2009 17:17:49 +0100
Subject: [Numpy-discussion] error handling with f2py?
Message-ID: <496F61AD.8080902@molden.no>

Is it possible to make f2py raise an exception if a fortran routine signals an error?

If I e.g. have

subroutine foobar(a, ierr)

Can I get an exception automatically raised if ierr != 0?

Sturla Molden

From gwg at emss.co.za  Thu Jan 15 14:45:46 2009
From: gwg at emss.co.za (George Goussard)
Date: Thu, 15 Jan 2009 19:45:46 +0000 (UTC)
Subject: [Numpy-discussion] Singular Matrix problem with Matplotlib in Numpy (Windows - AMD64)
References: <15B34CD0955E484689D667626E6456D5011C8E787E@london.emss.co.za> <5b8d13220812080943g69d4c670jabd6aef66d336e29@mail.gmail.com> <4948D139.4020703@gmail.com>
Message-ID:

Hello.

I was supposed to give more information, but after about two months (I am a Python newbie, newbie to Matplotlib and newbie to Numpy, plus a vacation in between)... I finally tracked down the problem. However... I see in Matplotlib 0.98.5.2 the correction was made by somebody else. (Cry, sob, sob sob). The problem was in the file MPL_isnan.h on line 26. Consider the issue closed.

Thanks.

George.
From bolme1234 at comcast.net  Thu Jan 15 17:55:38 2009
From: bolme1234 at comcast.net (David Bolme)
Date: Thu, 15 Jan 2009 15:55:38 -0700
Subject: [Numpy-discussion] Help with interpolating missing values from a 3D scanner
In-Reply-To:
References:
Message-ID:

I am working on a face recognition system using 3D data from a special 3D imaging system. For those interested, the data comes from the FRGC 2004 dataset. The problem I am having is that for some pixels the scanner fails to capture depth information. The result is that the image has missing values. There are small regions on the face such as eyebrows and eyes that are missing the depth information. I would like to fill in these regions by interpolating from nearby pixels, but I am not sure of the best way to do that.

I currently have two arrays:

* a floating point array with depth information (missing data is encoded as a large negative number, -999999.0)
* a boolean array that is a missing data mask

I have some ideas on how to solve this problem, but I am hoping that someone on this list may have more experience with this type of missing data problem.

* Are there any utilities in scipy/numpy designed for this type of missing data problem?
* If not, does anyone have suggestions on how I should proceed?

From robert.kern at gmail.com  Thu Jan 15 19:14:47 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 15 Jan 2009 18:14:47 -0600
Subject: [Numpy-discussion] Help with interpolating missing values from a 3D scanner
In-Reply-To:
References:
Message-ID: <3d375d730901151614h2c3b8478y9900bba97dbd8381@mail.gmail.com>

On Thu, Jan 15, 2009 at 16:55, David Bolme wrote:
>
> I am working on a face recognition system using 3D data from a special 3D
> imaging system. For those interested, the data comes from the FRGC
> 2004 dataset. The problem I am having is that for some pixels the
> scanner fails to capture depth information. The result is that the
> image has missing values. There are small regions on the face such as
> eyebrows and eyes that are missing the depth information. I would
> like to fill in these regions by interpolating from nearby pixels, but I
> am not sure of the best way to do that.
>
> I currently have two arrays:
>
> * a floating point array with depth information (missing data is encoded
> as a large negative number, -999999.0)
> * a boolean array that is a missing data mask
>
> I have some ideas on how to solve this problem, but I am hoping that
> someone on this list may have more experience with this type of
> missing data problem.
>
> * Are there any utilities in scipy/numpy designed for this type of
> missing data problem?

You could toss it into the natural neighbor interpolator in scikits.delaunay. It's designed for interpolating scattered (X,Y) points onto a grid, but it works fine for interpolating a regular grid with missing values, too. Similarly, scipy.interpolate.Rbf should work, too.

> * If not, does anyone have suggestions on how I should proceed?

Another approach (that you would have to code yourself) is to take a Gaussian smoothing kernel of an appropriate size, center it over each missing pixel, then average the known pixels under the kernel using the kernel as a weighting factor. Place that average value into the missing pixel. This is actually fairly similar to the Rbf method above, but will probably be more efficient since you know that the points are all gridded.
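In code, that single weighted-average pass might look something like this (an untested sketch using scipy.ndimage; the tolerance guard is my own addition):

    import numpy as np
    from scipy.ndimage import gaussian_filter

    def fill_missing(depth, missing, sigma=3.0):
        """Fill masked pixels with a Gaussian-weighted average of known pixels."""
        data = np.where(missing, 0.0, depth)    # known values, zeros in the holes
        weight = (~missing).astype(np.float64)  # 1 where known, 0 where missing
        num = gaussian_filter(data, sigma)      # kernel-weighted sum of known values
        den = gaussian_filter(weight, sigma)    # total kernel weight that was known
        filled = depth.copy()
        fix = missing & (den > 1e-12)           # avoid 0/0 deep inside large holes
        filled[fix] = num[fix] / den[fix]
        return filled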
If you have sizable contiguous regions of missing pixels like the eyes, you may want to iterate that process, only including the last iteration's values for the missing pixels in the average, too. Iterate until the deltas between iterations are within a desirable tolerance.

Why iterate? If you have sizable regions of missing pixels, you'll get a fair bit of noise in the center since their values will be controlled only by a few distant pixels at the edge of your kernel. Just moving to the next missing pixel might get an entirely new set of known pixels. Iterating spreads the "good" information from the edges of the missing region into the center. This is roughly akin to solving a PDE over the missing region using the known pixels as boundary conditions. I have no particular references for this approach, but I imagine you can dig up something in the literature about PDE-based image processing.

This question came up at the first Open Training we held at Enthought. Mark Marlett brought up the iteration approach. I pooh-poohed it at the time, preferring the RBF analogy of the single-pass approach, but if you need largish holes to be smoothed out, I think it should work better.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From rod.stephenson at gmail.com  Thu Jan 15 22:24:29 2009
From: rod.stephenson at gmail.com (Rodney Stephenson)
Date: Fri, 16 Jan 2009 13:24:29 +1000
Subject: [Numpy-discussion] Long integer 'in' arange()
Message-ID: <49708A8D.2414.CD0ED76@rod.stephenson.transcritical.com>

There was some recent discussion about the (in)efficiency of 'in' when applied to a large xrange. I was wondering how numpy handled this and found the following:

>>> from numpy import *
>>> u = arange(2**26)
>>> 2**24 in u
True
>>> 2**31 in u
False
>>> 2**32 in u
True

Hmmmm...

>>> u = arange(10, 2**26)
>>> 1 in u
False
>>> 2**37 in u
False
>>> 0 in u
False
>>> u[0]=0
>>> 2**37 in u
True
>>>

So I presume that the cast of the long to int for the test is done by just taking the last 32 bits! Bug or feature? Python silently turns ints into longs when they overflow, so one should be careful...

From stefan at sun.ac.za  Thu Jan 15 23:37:13 2009
From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=)
Date: Fri, 16 Jan 2009 06:37:13 +0200
Subject: [Numpy-discussion] Help with interpolating missing values from a 3D scanner
In-Reply-To: <3d375d730901151614h2c3b8478y9900bba97dbd8381@mail.gmail.com>
References: <3d375d730901151614h2c3b8478y9900bba97dbd8381@mail.gmail.com>
Message-ID: <9457e7c80901152037v6d4138efq7a0ff5464b6b7c19@mail.gmail.com>

2009/1/16 Robert Kern :
> of the missing region into the center. This is roughly akin to solving
> a PDE over the missing region using the known pixels as boundary
> conditions. I have no particular references for this approach, but I
> imagine you can dig up something in the literature about PDE-based
> image processing.
This technique is known as image inpainting:

http://www.iua.upf.es/~mbertalmio/restoration.html

Regards
Stéfan

From cournape at gmail.com  Thu Jan 15 23:53:09 2009
From: cournape at gmail.com (David Cournapeau)
Date: Fri, 16 Jan 2009 13:53:09 +0900
Subject: [Numpy-discussion] remove need for site.cfg on default
In-Reply-To:
References:
Message-ID: <5b8d13220901152053s5d1bf483jf3773d9ed17ef285@mail.gmail.com>

On Fri, Jan 16, 2009 at 12:18 AM, Darren Dale wrote:
> Hi Jarrod,
>
> On Wed, Jan 14, 2009 at 2:21 AM, Jarrod Millman wrote:
>>
>> Due to the fact that I was tired of adding site.cfg to scipy and numpy
>> when building on Fedora and Ubuntu systems, as well as a scipy ticket
>> (http://scipy.org/scipy/numpy/ticket/985), I decided to try and add
>> default system paths to numpy.distutils. You can find out more
>> details on this ticket:
>> http://projects.scipy.org/scipy/scipy/ticket/817
>>
>> I would like to have as many people test this as possible. So I would
>> like everyone who has had to build numpy/scipy with a site.cfg,
>> despite having installed all the dependencies in the system default
>> locations, to test this. If you would please update to the numpy
>> trunk and build numpy and scipy without your old site.cfg. Regardless
>> of whether it works or not, I would appreciate it if you could let me
>> know.
>
> I tested on Gentoo, which does some non-standard stuff in an attempt to be
> able to switch between implementations. The libraries in /usr/lib are
> symlinks to the implementations in /usr/lib/blas/, and you can have various
> implementations like atlas, reference, or threaded-atlas, so I can run a
> program called eselect that lets me switch between the implementations. For
> some reason, gentoo renames the f77blas libs to simply blas.

That really baffles me. Why do distributions think it is ok to randomly rename libraries? They certainly could handle an eselect system without renaming the libraries, so that they avoid breaking the software depending on them.

David

From scott.sinclair.za at gmail.com  Fri Jan 16 02:42:55 2009
From: scott.sinclair.za at gmail.com (Scott Sinclair)
Date: Fri, 16 Jan 2009 09:42:55 +0200
Subject: [Numpy-discussion] Help with interpolating missing values from a 3D scanner
In-Reply-To: <3d375d730901151614h2c3b8478y9900bba97dbd8381@mail.gmail.com>
References: <3d375d730901151614h2c3b8478y9900bba97dbd8381@mail.gmail.com>
Message-ID: <6a17e9ee0901152342l533ec36cp24ac8918be1e3bf3@mail.gmail.com>

> 2009/1/16 Robert Kern :
> On Thu, Jan 15, 2009 at 16:55, David Bolme wrote:
>>
>> I am working on a face recognition system using 3D data from a special 3D
>> imaging system. For those interested, the data comes from the FRGC
>> 2004 dataset. The problem I am having is that for some pixels the
>> scanner fails to capture depth information. The result is that the
>> image has missing values. There are small regions on the face such as
>> eyebrows and eyes that are missing the depth information. I would
>> like to fill in these regions by interpolating from nearby pixels, but I
>> am not sure of the best way to do that.
>
> Another approach (that you would have to code yourself) is to take a
> Gaussian smoothing kernel of an appropriate size, center it over each
> missing pixel, then average the known pixels under the kernel using
> the kernel as a weighting factor. Place that average value into the
> missing pixel. This is actually fairly similar to the Rbf method
> above, but will probably be more efficient since you know that the
> points are all gridded.
You might try using Rbf with a window of known pixels centred on your missing pixels. You'll automatically get a smoothing kernel that weights nearer known pixel values more heavily; the behaviour of the kernel depends on the basis function you choose (so it's similar to the Gaussian smoothing idea).

The reason for using a window is efficiency. Rbf will be grossly inefficient if you feed it all of the known pixels in your image as known values. Using a window will gain efficiency without significantly changing your result, because very distant known pixel values contribute little to the result anyway.

The iteration of image inpainting also sounds like a useful extension.

Cheers,
Scott

From slaunger at gmail.com  Fri Jan 16 06:02:09 2009
From: slaunger at gmail.com (Kim Hansen)
Date: Fri, 16 Jan 2009 12:02:09 +0100
Subject: [Numpy-discussion] How to make "lazy" derived arrays in a recarray view of a memmap on large files
Message-ID:

Hi numpy forum

I need to efficiently handle some large (300 MB) record-like binary files, where some data fields are less than a byte and thus cannot be mapped in a record dtype immediately. I would like to be able to access these derived arrays in a memory-efficient manner, but I cannot figure out how to achieve this. My application of the derived arrays would never be to do operations on the entire array, rather to iterate over some selected elements and do something with them - operations which seem well suited for doing on demand.

I wrote a related post yesterday, which I have not received any response on. I am now posting again using another, perhaps clearer, example which I believe describes my problem spot on:

from numpy import *
# Python.exe memory use here: 8.14 MB
desc = dtype([("baconandeggs", "<u1"),  # HIGH nibble = bacon, low nibble = eggs
              ("spam", "<u2"),          # the fields after the first byte are
              ("parrots", "<u4")])      # assumed here for illustration
index = memmap("baconeggsspamparrots.idx", dtype=desc, mode="r").view(recarray)
# A derived array, which is resource demanding:
index.bacon = index.baconandeggs >> 4
# python.exe memory use: 595 MB! Not surprising but how to do better??
# Another derived array, which is resource demanding:
index.eggs = index.baconandeggs & 0x0F
# python.exe memory usage is now 731 MB!

What I'd like to do is implement a class, LazyBaconEggsSpamParrots, which encapsulates the derived arrays such that I could do

besp = LazyBaconEggsSpamParrots("baconeggsspamparrots.idx")
for b in besp.bacon:  # Iterate lazily
    spam(b)
# Only derive the 1000 needed elements, don't do all 1000000
dosomething(besp.bacon[1000000:1001000])

I envision the class would look something like this:

class LazyBaconEggsSpamParrots(object):

    def __init__(self, filename):
        desc = dtype([("baconandeggs", "<u1"),
                      ("spam", "<u2"),    # assumed, as above
                      ("parrots", "<u4")])
        self._data = memmap(filename, dtype=desc, mode="r").view(recarray)
        # Too resource demanding to derive up front:
        # self.bacon = self._data.baconandeggs >> 4
        # self.eggs = self._data.baconandeggs & 0x0F

    def __getattr__(self, attr_name):
        if attr_name == "bacon":
            pass  # return bacon in an on-demand manner, but how?
        elif attr_name == "eggs":
            pass  # return eggs in an on-demand manner, but how?
        else:
            # If the name is not a data attribute treat it as a normal
            # non-existing attribute - raise AttributeError
            raise AttributeError(attr_name)

but how to do the lazy part of it?

-- Kim

From faltet at pytables.org  Fri Jan 16 07:00:25 2009
From: faltet at pytables.org (Francesc Alted)
Date: Fri, 16 Jan 2009 13:00:25 +0100
Subject: [Numpy-discussion] ANN: Numexpr 1.1, an efficient array evaluator
Message-ID: <200901161300.26130.faltet@pytables.org>

========================
Announcing Numexpr 1.1
========================

Numexpr is a fast numerical expression evaluator for NumPy. With it, expressions that operate on arrays (like "3*a+4*b") are accelerated and use less memory than doing the same calculation in Python.

The expected speed-ups for Numexpr with respect to NumPy are between 0.95x and 15x, with 3x or 4x being typical values.
The strided and unaligned case has been optimized too, so if the expression contains such arrays, the speed-up can increase significantly. Of course, you will need to operate with large arrays (typically larger than the cache size of your CPU) to see these improvements in performance.

This release is mainly intended to bring in the improvements made to the Numexpr version integrated in PyTables. So, this standalone version of Numexpr will benefit from the well-tested PyTables version that has been in production for more than a year now. In case you want to know in more detail what has changed in this version, have a look at ``RELEASE_NOTES.txt`` in the tarball.

Where can I find Numexpr?
=========================

The project is hosted at Google code in:

http://code.google.com/p/numexpr/

Share your experience
=====================

Let us know of any bugs, suggestions, gripes, kudos, etc. you may have.

Enjoy!

--
Francesc Alted
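A minimal usage sketch (assuming the package imports as numexpr; the speed-up claims above apply to large arrays):

    >>> import numpy as np
    >>> import numexpr as ne
    >>> a = np.random.rand(1000000)
    >>> b = np.random.rand(1000000)
    >>> c = ne.evaluate("3*a + 4*b")   # same values as 3*a + 4*b, computed in one pass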
In-Reply-To: <496F61AD.8080902@molden.no>
References: <496F61AD.8080902@molden.no>
Message-ID: <41228.172.17.0.4.1232111778.squirrel@cens.ioc.ee>

On Thu, January 15, 2009 6:17 pm, Sturla Molden wrote:
> Is it possible to make f2py raise an exception if a fortran routine
> signals an error?
>
> If I e.g. have
>
> subroutine foobar(a, ierr)
>
> Can I get an exception automatically raised if ierr != 0?

Yes, for that you need to provide your own fortran call code using the f2py callstatement construct. The initial fortran call code can be obtained from the f2py generated module.c file, for instance. An example follows below:

Fortran file foo.f:
-------------------
      subroutine foo(a, ierr)
      integer a
      integer ierr
      if (a.gt.10) then
        ierr = 2
      else
        if (a.gt.5) then
          ierr = 1
        else
          ierr = 0
        end if
      end if
      end

Generated (f2py -m m foo.f) and then modified signature file m.pyf:
-------------------------------------------------------------------
! -*- f90 -*-
! Note: the context of this file is case sensitive.

python module m ! in
  interface ! in :m
    subroutine foo(a,ierr) ! in :m:foo.f
      integer :: a
      integer :: ierr
      intent (in, out) a
      intent (hide) ierr
      callstatement '''
        (*f2py_func)(&a, &ierr);
        if (ierr==1) {
          PyErr_SetString(PyExc_ValueError, "a is gt 5");
        }
        if (ierr==2) {
          PyErr_SetString(PyExc_ValueError, "a is gt 10");
        }
      '''
    end subroutine foo
  end interface
end python module m
! This file was auto-generated with f2py (version:2_5618).
! See http://cens.ioc.ee/projects/f2py2e/

Build the extension module and use from python:
-----------------------------------------------
$ f2py -c m.pyf foo.f
$ python
>>> import m
>>> m.foo(30)
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
/home/pearu/test/f2py/exc/<ipython console> in <module>()
ValueError: a is gt 10
>>> m.foo(6)
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
/home/pearu/test/f2py/exc/<ipython console> in <module>()
ValueError: a is gt 5
>>> m.foo(4)
4

HTH, Pearu

From gwg at emss.co.za Fri Jan 16 09:07:48 2009
From: gwg at emss.co.za (George)
Date: Fri, 16 Jan 2009 14:07:48 +0000 (UTC)
Subject: [Numpy-discussion] Singular Matrix problem with Matplitlib in Numpy (Windows - AMD64)
References: <15B34CD0955E484689D667626E6456D5011C8E787E@london.emss.co.za> <5b8d13220812080943g69d4c670jabd6aef66d336e29@mail.gmail.com> <4948D139.4020703@gmail.com>
Message-ID:

Hello. I am terribly sorry. I was mistaken last night. I had the latest Matplotlib version 0.98.5.2 and I thought the bug was fixed, but it hasn't been. Let me explain.

In the file MPL_isnan.h line 26 there is a declaration:

typedef long int MPL_Int64

This is fine for Linux 64-bit, but NOT for Windows XP 64-bit! For Windows the declaration should be:

typedef long long MPL_Int64

This bug has caused me a LOT of late nights, and last night was one of them. The declaration is correct for Linux 64-bit, and I guess Matplotlib was developed on Linux because of this declaration. That is also why I thought the bug was fixed, but this morning I realised that I was looking at the wrong console.

So, in summary: for Matplotlib 0.98.5.2 and Numpy 1.2.1 to work without any problems when compiling and using them on Windows XP 64-bit with the AMD64 compile environment, change line 26 in the file MPL_isnan.h from long int to long long.

I also previously suggested switching MKL and ACML etc., but with this change everything is fine. One can choose any math library and it works. Writing a small test application using sizeof on different platforms highlights the problem.
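Such a test can be sketched in a couple of lines of Python with ctypes (illustrative only):

import ctypes

# On 64-bit Linux (LP64 model) a C 'long' is 8 bytes, but on 64-bit
# Windows (LLP64 model) it is only 4; 'long long' is 8 bytes on both.
print "sizeof(long int)  =", ctypes.sizeof(ctypes.c_long)
print "sizeof(long long) =", ctypes.sizeof(ctypes.c_longlong)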
Thanks.

George.

From jdh2358 at gmail.com Fri Jan 16 09:16:18 2009
From: jdh2358 at gmail.com (John Hunter)
Date: Fri, 16 Jan 2009 08:16:18 -0600
Subject: [Numpy-discussion] Singular Matrix problem with Matplitlib in Numpy (Windows - AMD64)
In-Reply-To:
References: <15B34CD0955E484689D667626E6456D5011C8E787E@london.emss.co.za> <5b8d13220812080943g69d4c670jabd6aef66d336e29@mail.gmail.com> <4948D139.4020703@gmail.com>
Message-ID: <88e473830901160616t3fb33e3v826ebaa395062cf5@mail.gmail.com>

Andrew, since you are the original author of the isnan port, could you patch the branch and the trunk to take care of this?

JDH

On Fri, Jan 16, 2009 at 8:07 AM, George wrote:
> Hello.
>
> I am terribly sorry. I was mistaken last night. I had the latest Matplotlib
> version 0.98.5.2 and I thought the bug was fixed, but it hasn't been. Let me explain.
>
> In the file MPL_isnan.h line 26 there is a declaration:
>
> typedef long int MPL_Int64
>
> This is fine for Linux 64-bit, but NOT for Windows XP 64-bit! For Windows the
> declaration should be:
>
> typedef long long MPL_Int64
>
> This bug has caused me a LOT of late nights, and last night was one of them. The
> declaration is correct for Linux 64-bit, and I guess Matplotlib was developed on
> Linux because of this declaration. That is also why I thought the bug was fixed,
> but this morning I realised that I was looking at the wrong console.
>
> So, in summary: for Matplotlib 0.98.5.2 and Numpy 1.2.1 to work without any
> problems when compiling and using them on Windows XP 64-bit with the AMD64
> compile environment, change line 26 in the file MPL_isnan.h from long int to
> long long.
>
> I also previously suggested switching MKL and ACML etc., but with this change
> everything is fine. One can choose any math library and it works.
>
> Writing a small test application using sizeof on different platforms highlights
> the problem.
>
> Thanks.
>
> George.
>
>
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>

From gregor.thalhammer at gmail.com Fri Jan 16 09:39:57 2009
From: gregor.thalhammer at gmail.com (Gregor Thalhammer)
Date: Fri, 16 Jan 2009 15:39:57 +0100
Subject: [Numpy-discussion] ANN: Numexpr 1.1, an efficient array evaluator
In-Reply-To: <200901161300.26130.faltet@pytables.org>
References: <200901161300.26130.faltet@pytables.org>
Message-ID: <49709C3D.1010608@googlemail.com>

Francesc Alted schrieb:
> Numexpr is a fast numerical expression evaluator for NumPy. With it,
> expressions that operate on arrays (like "3*a+4*b") are accelerated
> and use less memory than doing the same calculation in Python.
>
> The expected speed-ups of Numexpr with respect to NumPy are between 0.95x
> and 15x, with 3x or 4x being typical values. The strided and unaligned
> case has been optimized too, so if the expression contains such arrays,
> the speed-up can increase significantly. Of course, you will need to
> operate with large arrays (typically larger than the cache size of your
> CPU) to see these improvements in performance.
>
Just recently I had a more detailed look at numexpr. Clever idea, easy to use! I can affirm a typical performance gain of 3x if you work on large arrays (>100k entries).

I also gave a try to the vector math library (VML), contained in Intel's Math Kernel Library.
This offers a fast implementation of mathematical functions, operating on arrays. First I implemented a C extension, providing new ufuncs. This gave me a big performance gain, e.g., 2.3x (5x) for sin, 6x (10x) for exp, 7x (15x) for pow, and 3x (6x) for division (no gain for add, sub, mul). The values in parentheses apply if I allow VML to use several threads and to employ both cores of my Intel Core2Duo computer. For large arrays (100M entries) this performance gain is reduced because of limited memory bandwidth.

At this point I stumbled across numexpr and modified it to use the VML functions. For sufficiently long and complex numerical expressions I could get the maximum performance also for large arrays. Together with VML, numexpr seems to be an extremely powerful way to get optimum performance. I would like to see numexpr extended to (optionally) make use of fast vectorized math functions. There is one but: VML supports (at the moment) only math on contiguous arrays. On a first try I didn't understand how to enforce this limitation in numexpr.

I also gave a quick try to the equivalent vector math library of AMD, acml_mv. I only tried sin and log; they gave me the same performance (on an Intel processor!) as Intel's VML.

I was also playing around with the block size in numexpr. What is the rationale that led to the current block size of 128? Especially with VML, a larger block size of 4096 instead of 128 allowed VML's multithreading to be used efficiently.

> Share your experience
> =====================
>
> Let us know of any bugs, suggestions, gripes, kudos, etc. you may
> have.
>
I was missing the support for single precision floats.

Great work!

Gregor

From josef.pktd at gmail.com Fri Jan 16 10:51:19 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Fri, 16 Jan 2009 10:51:19 -0500
Subject: [Numpy-discussion] numpy.testing.asserts and masked array
Message-ID: <1cd32cbb0901160751l235bd4dfudfaaa3d221346214@mail.gmail.com>

I have a regression result with masked arrays that produces a masked array output, estm5.yhat, and I want to test equality to the benchmark case, estm1.yhat, with the asserts in numpy.testing, but I am getting strange results. Checking for equality directly returns True, but the asserts in numpy.testing think that they are not equal. It seems the asserts use one of the masked (random) values in the comparison. What's the trick to assert_almost_equal for masked arrays?
Josef

>>> np.all(estm5.yhat == estm1.yhat)
True

and they look the same:

>>> estm5.yhat.T
masked_array(data =
 [[-- 3.72175928422 4.44050187015 3.86191008624 7.54173762712 3.52929761101
  5.00953694359 5.19214323965 4.71500145194 8.22336216275 7.8884375775
  5.24556560163 6.81814135969 7.65416592877 3.7608331097]],
 mask =
 [[ True False False False False False False False False False False False
  False False False]],
 fill_value=1e+020)

>>> estm1.yhat.T
masked_array(data =
 [[-- 3.72175928422 4.44050187015 3.86191008624 7.54173762712 3.52929761101
  5.00953694359 5.19214323965 4.71500145194 8.22336216275 7.8884375775
  5.24556560163 6.81814135969 7.65416592877 3.7608331097]],
 mask =
 [[ True False False False False False False False False False False False
  False False False]],
 fill_value=1e+020)

but using any np.testing assert fails:

>>> np.testing.assert_array_almost_equal(estm5.yhat, estm1.yhat)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
    np.testing.assert_array_almost_equal(estm5.yhat, estm1.yhat)
  File "C:\Programs\Python25\lib\site-packages\numpy\testing\utils.py", line 310, in assert_array_almost_equal
    header='Arrays are not almost equal')
  File "C:\Programs\Python25\lib\site-packages\numpy\testing\utils.py", line 299, in assert_array_compare
    raise ValueError(msg)
ValueError:
Arrays are not almost equal
 x: array([[ 6.1104881 ],
       [ 3.72175928],
       [ 4.44050187],...
 y: array([[ NaN],
       [ 3.72175928],
       [ 4.44050187],...

>>> np.testing.assert_almost_equal(estm5.yhat, estm1.yhat)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
    np.testing.assert_almost_equal(estm5.yhat, estm1.yhat)
  File "C:\Programs\Python25\lib\site-packages\numpy\testing\utils.py", line 208, in assert_almost_equal
    return assert_array_almost_equal(actual, desired, decimal, err_msg)
  File "C:\Programs\Python25\lib\site-packages\numpy\testing\utils.py", line 310, in assert_array_almost_equal
    header='Arrays are not almost equal')
  File "C:\Programs\Python25\lib\site-packages\numpy\testing\utils.py", line 299, in assert_array_compare
    raise ValueError(msg)
ValueError:
Arrays are not almost equal
 x: array([[ 6.1104881 ],
       [ 3.72175928],
       [ 4.44050187],...
 y: array([[ NaN],
       [ 3.72175928],
       [ 4.44050187],...

>>> np.testing.assert_equal(estm5.yhat, estm1.yhat)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
    np.testing.assert_equal(estm5.yhat, estm1.yhat)
  File "C:\Programs\Python25\lib\site-packages\numpy\testing\utils.py", line 180, in assert_equal
    return assert_array_equal(actual, desired, err_msg, verbose)
  File "C:\Programs\Python25\lib\site-packages\numpy\testing\utils.py", line 303, in assert_array_equal
    verbose=verbose, header='Arrays are not equal')
  File "C:\Programs\Python25\lib\site-packages\numpy\testing\utils.py", line 295, in assert_array_compare
    raise AssertionError(msg)
AssertionError:
Arrays are not equal
(mismatch 100.0%)
 x: array([[ 6.1104881 ],
       [ 3.72175928],
       [ 4.44050187],...
 y: array([[ NaN],
       [ 3.72175928],
       [ 4.44050187],...
>>> estm5.yhat == estm1.yhat
masked_array(data =
 [[--]
 [True]
 [True]
 [True]
 [True]
 [True]
 [True]
 [True]
 [True]
 [True]
 [True]
 [True]
 [True]
 [True]
 [True]],
 mask =
 [[ True]
 [False]
 [False]
 [False]
 [False]
 [False]
 [False]
 [False]
 [False]
 [False]
 [False]
 [False]
 [False]
 [False]
 [False]],
 fill_value=1e+020)

From pgmdevlist at gmail.com Fri Jan 16 10:59:22 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Fri, 16 Jan 2009 10:59:22 -0500
Subject: [Numpy-discussion] numpy.testing.asserts and masked array
In-Reply-To: <1cd32cbb0901160751l235bd4dfudfaaa3d221346214@mail.gmail.com>
References: <1cd32cbb0901160751l235bd4dfudfaaa3d221346214@mail.gmail.com>
Message-ID:

On Jan 16, 2009, at 10:51 AM, josef.pktd at gmail.com wrote:
> I have a regression result with masked arrays that produces a masked
> array output, estm5.yhat, and I want to test equality to the benchmark
> case, estm1.yhat, with the asserts in numpy.testing, but I am getting
> strange results.
> ...
> What's the trick to assert_almost_equal for masked arrays?

Use numpy.ma.testutils.assert_almost_equal instead.

From faltet at pytables.org Fri Jan 16 11:04:51 2009
From: faltet at pytables.org (Francesc Alted)
Date: Fri, 16 Jan 2009 17:04:51 +0100
Subject: [Numpy-discussion] ANN: Numexpr 1.1, an efficient array evaluator
In-Reply-To:
References:
Message-ID: <200901161704.51777.faltet@pytables.org>

A Friday 16 January 2009, jh at physics.ucf.edu escrigué:
> Hi Francesc,
>
> > Numexpr is a fast numerical expression evaluator for NumPy. With
> > it, expressions that operate on arrays (like "3*a+4*b") are
> > accelerated and use less memory than doing the same calculation in
> > Python.
>
> Please pardon my ignorance as I know this project has been around for
> a while. It looks very exciting, but either it's cumbersome, or
> I'm not understanding exactly what's being fixed. If you can
> accelerate evaluation, why not just integrate the faster math into
> numpy, rather than having two packages? Or is this something that is
> only an advantage when the expression is given as a string (and why
> is that the case)? It would be helpful if you could put the answer
> on your web page and in your standard release blurb in some compact
> form. I guess what I'm really looking for when I read one of those is
> a quick answer to the question "should I look into this?".

Well, there is a link in the project page to the "Overview" section of the wiki, but perhaps it is a bit hidden. I've added some blurb as you suggested to the main page and another link to the "Overview" wiki page. Hope that, by reading the new blurb, you can see why it accelerates expression evaluation with regard to NumPy. If not, tell me and I will try to come up with something more comprehensible.

> Right
> now, I'm not quite sure whether the problem you are solving is merely
> the case of expressions-in-strings, and there is no advantage for
> expressions-in-code, or whether your expressions-in-strings are
> faster than numpy's expressions-in-code. In either case, it would
> appear this would be a good addition to the numpy core, and it's past
> 1.0, so why keep it separate? Even if there is value in having a
> non-numpy version, is there not also value in accelerating numpy by
> default?

Having the expression encapsulated in a string has the advantage that you exactly know the part of the code that you want to parse and accelerate. Making NumPy understand parts of the Python code that can be accelerated sounds more like a true JIT for Python, and this is something that is not trivial at all (although, with the advent of PyPy, some efforts in this direction are appearing [1]).

[1] http://www.enthought.com/~ischnell/paper.html
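To make the contrast concrete, here is a sketch of the two styles (illustrative only):

import numpy as np
import numexpr as ne

a = np.random.rand(1000000)
b = np.random.rand(1000000)

# Plain NumPy: each intermediate (3*a, 4*b, their sum) is a full
# million-element temporary array.
c1 = 3*a + 4*b

# Numexpr: the string is compiled once and evaluated block by block,
# so the temporaries stay small enough to live in cache.
c2 = ne.evaluate("3*a + 4*b")

assert np.allclose(c1, c2)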
Cheers,

-- Francesc Alted

From sturla at molden.no Fri Jan 16 11:15:06 2009
From: sturla at molden.no (Sturla Molden)
Date: Fri, 16 Jan 2009 17:15:06 +0100
Subject: [Numpy-discussion] error handling with f2py?
In-Reply-To: <41228.172.17.0.4.1232111778.squirrel@cens.ioc.ee>
References: <496F61AD.8080902@molden.no> <41228.172.17.0.4.1232111778.squirrel@cens.ioc.ee>
Message-ID: <4970B28A.6010106@molden.no>

On 1/16/2009 2:16 PM, Pearu Peterson wrote:
> Yes, for that you need to provide your own fortran call code
> using the f2py callstatement construct. The initial fortran call
> code can be obtained from the f2py generated module.c file,
> for instance.

Thank you, Pearu :) f2py is really a wonderful tool.

Sturla Molden

From haase at msg.ucsf.edu Fri Jan 16 11:20:03 2009
From: haase at msg.ucsf.edu (Sebastian Haase)
Date: Fri, 16 Jan 2009 17:20:03 +0100
Subject: [Numpy-discussion] ANN: Numexpr 1.1, an efficient array evaluator
In-Reply-To: <200901161704.51777.faltet@pytables.org>
References: <200901161704.51777.faltet@pytables.org>
Message-ID:

Hi Francesc,
this is a wonderful project! I was just wondering if you would/could support single precision float arrays? In 3+D image analysis we generally don't have enough memory to afford double precision, and we could save ourselves lots of extra C (or Cython) coding if we could use numexpr ;-)

Thanks,
Sebastian Haase

On Fri, Jan 16, 2009 at 5:04 PM, Francesc Alted wrote:
> A Friday 16 January 2009, jh at physics.ucf.edu escrigué:
>> Hi Francesc,
>>
>> > Numexpr is a fast numerical expression evaluator for NumPy. With
>> > it, expressions that operate on arrays (like "3*a+4*b") are
>> > accelerated and use less memory than doing the same calculation in
>> > Python.
>>
>> Please pardon my ignorance as I know this project has been around for
>> a while. It looks very exciting, but either it's cumbersome, or
>> I'm not understanding exactly what's being fixed. If you can
>> accelerate evaluation, why not just integrate the faster math into
>> numpy, rather than having two packages? Or is this something that is
>> only an advantage when the expression is given as a string (and why
>> is that the case)? It would be helpful if you could put the answer
>> on your web page and in your standard release blurb in some compact
>> form. I guess what I'm really looking for when I read one of those is
>> a quick answer to the question "should I look into this?".
>
> Well, there is a link in the project page to the "Overview" section of
> the wiki, but perhaps it is a bit hidden. I've added some blurb as you
> suggested to the main page and another link to the "Overview" wiki page.
> Hope that, by reading the new blurb, you can see why it accelerates
> expression evaluation with regard to NumPy. If not, tell me and I will
> try to come up with something more comprehensible.
>
>> Right
>> now, I'm not quite sure whether the problem you are solving is merely
>> the case of expressions-in-strings, and there is no advantage for
>> expressions-in-code, or whether your expressions-in-strings are
>> faster than numpy's expressions-in-code. In either case, it would
>> appear this would be a good addition to the numpy core, and it's past
>> 1.0, so why keep it separate?
>> Even if there is value in having a
>> non-numpy version, is there not also value in accelerating numpy by
>> default?
>
> Having the expression encapsulated in a string has the advantage that
> you exactly know the part of the code that you want to parse and
> accelerate. Making NumPy understand parts of the Python code that
> can be accelerated sounds more like a true JIT for Python, and this is
> something that is not trivial at all (although, with the advent of PyPy,
> some efforts in this direction are appearing [1]).
>
> [1] http://www.enthought.com/~ischnell/paper.html
>
> Cheers,
>
> --
> Francesc Alted
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>

From josef.pktd at gmail.com Fri Jan 16 11:28:08 2009
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Fri, 16 Jan 2009 11:28:08 -0500
Subject: [Numpy-discussion] numpy.testing.asserts and masked array
In-Reply-To:
References: <1cd32cbb0901160751l235bd4dfudfaaa3d221346214@mail.gmail.com>
Message-ID: <1cd32cbb0901160828y23252234s2dd34f9cf123f062@mail.gmail.com>

On Fri, Jan 16, 2009 at 10:59 AM, Pierre GM wrote:
>
> On Jan 16, 2009, at 10:51 AM, josef.pktd at gmail.com wrote:
>
>> I have a regression result with masked arrays that produces a masked
>> array output, estm5.yhat, and I want to test equality to the benchmark
>> case, estm1.yhat, with the asserts in numpy.testing, but I am getting
>> strange results.
>> ...
>> What's the trick to assert_almost_equal for masked arrays?
>
> Use numpy.ma.testutils.assert_almost_equal instead.

Thanks, when I looked at the test files for mstats, I forgot to check the imports.

Josef

From faltet at pytables.org Fri Jan 16 11:48:34 2009
From: faltet at pytables.org (Francesc Alted)
Date: Fri, 16 Jan 2009 17:48:34 +0100
Subject: [Numpy-discussion] ANN: Numexpr 1.1, an efficient array evaluator
In-Reply-To: <49709C3D.1010608@googlemail.com>
References: <200901161300.26130.faltet@pytables.org> <49709C3D.1010608@googlemail.com>
Message-ID: <200901161748.34558.faltet@pytables.org>

A Friday 16 January 2009, Gregor Thalhammer escrigué:
> I also gave a try to the vector math library (VML), contained in
> Intel's Math Kernel Library. This offers a fast implementation of
> mathematical functions, operating on arrays. First I implemented a C
> extension, providing new ufuncs. This gave me a big performance gain,
> e.g., 2.3x (5x) for sin, 6x (10x) for exp, 7x (15x) for pow, and 3x
> (6x) for division (no gain for add, sub, mul).

Wow, pretty nice speed-ups indeed! In fact I was thinking of including support for threading in Numexpr (I don't think it would be too difficult, but let's see). BTW, do you know how VML is able to achieve a speedup of 6x for a sin() function? I suppose this is because they are using SSE instructions, but are these also available for 64-bit double precision items?
> I would like to see numexpr extended to (optionally) make use of fast
> vectorized math functions.

Well, if you can provide the code, I'd be glad to include it in numexpr. The only requirement is that the VML must be optional during the build of the package.

> There is one but: VML supports (at the moment) only math on contiguous
> arrays. On a first try I didn't understand how to enforce this
> limitation in numexpr.

No problem. At the end of numexpr/necompiler.py you will see some code like:

    # All the opcodes can deal with strided arrays directly as
    # long as they are unidimensional (strides in other
    # dimensions are dealt with within the extension), so we don't
    # need a copy for the strided case.
    if not b.flags.aligned:
        ...

which you can replace with something like:

    # need a copy for the strided case.
    if VML_available and not b.flags.contiguous:
        b = b.copy()
    elif not b.flags.aligned:
        ...

That would be enough to ensure that all the arrays are contiguous when they hit numexpr's virtual machine. That being said, it is a shame that VML does not have support for strided/unaligned arrays. They are quite common beasts, especially when you work with heterogeneous arrays (aka record arrays).

> I also gave a quick try to the equivalent vector math library of AMD,
> acml_mv. I only tried sin and log; they gave me the same performance
> (on an Intel processor!) as Intel's VML.
>
> I was also playing around with the block size in numexpr. What is the
> rationale that led to the current block size of 128? Especially with
> VML, a larger block size of 4096 instead of 128 allowed VML's
> multithreading to be used efficiently.

Experimentation. Back in 2006 David found that 128 was optimal for the processors available at that time. With Numexpr 1.1 my experiments show that 256 is a better value for current Core2 processors and most expressions in our benchmark bed (see the benchmarks/ directory); hence, 256 is the new value for the chunksize in 1.1. However, bear in mind that 256 has to be multiplied by the itemsize of each array, so the chunksize is currently 2048 bytes for 64-bit items (int64 or float64) and 4096 for double precision complex arrays, which are probably the sizes that have to be compared with VML.

> > Share your experience
> > =====================
> >
> > Let us know of any bugs, suggestions, gripes, kudos, etc. you may
> > have.
>
> I was missing the support for single precision floats.

Yeah. This is because nobody has implemented it before, but it is completely doable.

> Great work!

You are welcome! And thanks for the excellent feedback too! Hope we can have a VML-aware numexpr anytime soon ;-)

Cheers,

-- Francesc Alted

From faltet at pytables.org Fri Jan 16 12:00:26 2009
From: faltet at pytables.org (Francesc Alted)
Date: Fri, 16 Jan 2009 18:00:26 +0100
Subject: [Numpy-discussion] ANN: Numexpr 1.1, an efficient array evaluator
In-Reply-To:
References: <200901161704.51777.faltet@pytables.org>
Message-ID: <200901161800.26988.faltet@pytables.org>

A Friday 16 January 2009, Sebastian Haase escrigué:
> Hi Francesc,
> this is a wonderful project! I was just wondering if you would/could
> support single precision float arrays?

As I said before, it is doable, but I don't know if I will have enough time to implement this myself.
> In 3+D image analysis we generally don't have enough memory to afford
> double precision, and we could save ourselves lots of extra C (or
> Cython) coding if we could use numexpr ;-)

Well, one of the ideas that I have been toying with for a long time is to give Numexpr the capability to work with PyTables disk-based objects. That way, you would be able to evaluate potentially complex expressions using data that is completely on disk. But this might be something completely different from what you are talking about.

Cheers,

-- Francesc Alted

From dagss at student.matnat.uio.no Fri Jan 16 12:35:59 2009
From: dagss at student.matnat.uio.no (Dag Sverre Seljebotn)
Date: Fri, 16 Jan 2009 18:35:59 +0100
Subject: [Numpy-discussion] ANN: Numexpr 1.1, an efficient array evaluator
In-Reply-To: <200901161704.51777.faltet@pytables.org>
References: <200901161704.51777.faltet@pytables.org>
Message-ID: <4970C57F.9030007@student.matnat.uio.no>

Francesc Alted wrote:
> A Friday 16 January 2009, jh at physics.ucf.edu escrigué:
>> Right
>> now, I'm not quite sure whether the problem you are solving is merely
>> the case of expressions-in-strings, and there is no advantage for
>> expressions-in-code, or whether your expressions-in-strings are
>> faster than numpy's expressions-in-code. In either case, it would
>> appear this would be a good addition to the numpy core, and it's past
>> 1.0, so why keep it separate? Even if there is value in having a
>> non-numpy version, is there not also value in accelerating numpy by
>> default?
>
> Having the expression encapsulated in a string has the advantage that
> you exactly know the part of the code that you want to parse and
> accelerate. Making NumPy understand parts of the Python code that
> can be accelerated sounds more like a true JIT for Python, and this is
> something that is not trivial at all (although, with the advent of PyPy,
> some efforts in this direction are appearing [1]).

A full compiler/JIT isn't needed; there's another route: one could use the Numexpr methodology together with a symbolic expression framework (like SymPy or the one in Sage), i.e. operator overloads and lazy expressions.

Combining NumExpr with a symbolic manipulation engine would be very cool IMO. Unfortunately I don't have time myself (and I understand that you don't either; I'm just mentioning it).

Example using pseudo-Sage-like syntax:

a = np.arange(bignum)
b = np.arange(bignum)
x, y = sage.var("x, y")
expr = sage.integrate(x + y, x)
z = expr(x=a, y=b)  # z = a**2/2 + b*a, but Numexpr-enabled

-- Dag Sverre

From strawman at astraw.com Fri Jan 16 13:38:01 2009
From: strawman at astraw.com (Andrew Straw)
Date: Fri, 16 Jan 2009 10:38:01 -0800
Subject: [Numpy-discussion] Singular Matrix problem with Matplitlib in Numpy (Windows - AMD64)
In-Reply-To: <88e473830901160616t3fb33e3v826ebaa395062cf5@mail.gmail.com>
References: <15B34CD0955E484689D667626E6456D5011C8E787E@london.emss.co.za> <5b8d13220812080943g69d4c670jabd6aef66d336e29@mail.gmail.com> <4948D139.4020703@gmail.com> <88e473830901160616t3fb33e3v826ebaa395062cf5@mail.gmail.com>
Message-ID: <4970D409.8010805@astraw.com>

John Hunter wrote:
> Andrew, since you are the original author of the isnan port, could you
> patch the branch and the trunk to take care of this?

Done in r6791 and r6792. Sorry for the trouble. Now I just hope we don't get a problem with "long long", although now if _ISOC99_SOURCE is defined, we'll preferentially use "int64_t" out of <stdint.h>, so I should think this is more portable on sane platforms.
This is one of many reasons why I stick to Python...

-Andrew

> JDH
>
> On Fri, Jan 16, 2009 at 8:07 AM, George wrote:
>> Hello.
>>
>> I am terribly sorry. I was mistaken last night. I had the latest Matplotlib
>> version 0.98.5.2 and I thought the bug was fixed, but it hasn't been. Let me explain.
>>
>> In the file MPL_isnan.h line 26 there is a declaration:
>>
>> typedef long int MPL_Int64
>>
>> This is fine for Linux 64-bit, but NOT for Windows XP 64-bit! For Windows the
>> declaration should be:
>>
>> typedef long long MPL_Int64
>>
>> This bug has caused me a LOT of late nights, and last night was one of them. The
>> declaration is correct for Linux 64-bit, and I guess Matplotlib was developed on
>> Linux because of this declaration. That is also why I thought the bug was fixed,
>> but this morning I realised that I was looking at the wrong console.
>>
>> So, in summary: for Matplotlib 0.98.5.2 and Numpy 1.2.1 to work without any
>> problems when compiling and using them on Windows XP 64-bit with the AMD64
>> compile environment, change line 26 in the file MPL_isnan.h from long int to
>> long long.
>>
>> I also previously suggested switching MKL and ACML etc., but with this change
>> everything is fine. One can choose any math library and it works.
>>
>> Writing a small test application using sizeof on different platforms highlights
>> the problem.
>>
>> Thanks.
>>
>> George.
>>
>>
>> _______________________________________________
>> Numpy-discussion mailing list
>> Numpy-discussion at scipy.org
>> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>>

From gregor.thalhammer at gmail.com Fri Jan 16 14:35:37 2009
From: gregor.thalhammer at gmail.com (Gregor Thalhammer)
Date: Fri, 16 Jan 2009 20:35:37 +0100
Subject: [Numpy-discussion] ANN: Numexpr 1.1, an efficient array evaluator
In-Reply-To: <200901161748.34558.faltet@pytables.org>
References: <200901161300.26130.faltet@pytables.org> <49709C3D.1010608@googlemail.com> <200901161748.34558.faltet@pytables.org>
Message-ID: <4970E189.8000906@googlemail.com>

Francesc Alted schrieb:
> A Friday 16 January 2009, Gregor Thalhammer escrigué:
>
>> I also gave a try to the vector math library (VML), contained in
>> Intel's Math Kernel Library. This offers a fast implementation of
>> mathematical functions, operating on arrays. First I implemented a C
>> extension, providing new ufuncs. This gave me a big performance gain,
>> e.g., 2.3x (5x) for sin, 6x (10x) for exp, 7x (15x) for pow, and 3x
>> (6x) for division (no gain for add, sub, mul).
>>
>
> Wow, pretty nice speed-ups indeed! In fact I was thinking of including
> support for threading in Numexpr (I don't think it would be too
> difficult, but let's see). BTW, do you know how VML is able to achieve
> a speedup of 6x for a sin() function? I suppose this is because they
> are using SSE instructions, but are these also available for 64-bit
> double precision items?
>
I am not an expert on SSE instructions, but to my knowledge there is (in the Core 2 architecture) no SSE instruction to calculate the sin. But it seems possible to (approximately) calculate a sin with a couple of multiplication/addition instructions (and these exist in SSE for 64-bit floats). Intel (and AMD) seem to use a more clever algorithm, more efficiently implemented than the standard one.

> Well, if you can provide the code, I'd be glad to include it in numexpr.
> The only requirement is that the VML must be optional during the build
> of the package.
>
Yes, I will try to provide you with a polished version of my changes, making them optional.

>> There is one but: VML supports (at the moment) only math on contiguous
>> arrays. On a first try I didn't understand how to enforce this
>> limitation in numexpr.
>>
>
> No problem. At the end of numexpr/necompiler.py you will see some
> code like:
>
>     # All the opcodes can deal with strided arrays directly as
>     # long as they are unidimensional (strides in other
>     # dimensions are dealt with within the extension), so we don't
>     # need a copy for the strided case.
>     if not b.flags.aligned:
>         ...
>
> which you can replace with something like:
>
>     # need a copy for the strided case.
>     if VML_available and not b.flags.contiguous:
>         b = b.copy()
>     elif not b.flags.aligned:
>         ...
>
> That would be enough to ensure that all the arrays are contiguous
> when they hit numexpr's virtual machine.
>
Ah, I see, that's not difficult. I thought the copying was done in the virtual machine. (didn't read all the code ...)

> That being said, it is a shame that VML does not have support for
> strided/unaligned arrays. They are quite common beasts, especially when
> you work with heterogeneous arrays (aka record arrays).
>
I have the impression that you can already feel happy if these mathematical libraries support a C interface, not only Fortran. At least the Intel VML provides functions to pack/unpack strided arrays, which seem to work on a broader parameter range than specified (also zero or negative step sizes).

>> I also gave a quick try to the equivalent vector math library of AMD,
>> acml_mv. I only tried sin and log; they gave me the same performance
>> (on an Intel processor!) as Intel's VML.
>>
>> I was also playing around with the block size in numexpr. What is the
>> rationale that led to the current block size of 128? Especially with
>> VML, a larger block size of 4096 instead of 128 allowed VML's
>> multithreading to be used efficiently.
>>
>
> Experimentation. Back in 2006 David found that 128 was optimal for the
> processors available at that time. With Numexpr 1.1 my experiments
> show that 256 is a better value for current Core2 processors and most
> expressions in our benchmark bed (see the benchmarks/ directory); hence,
> 256 is the new value for the chunksize in 1.1. However, bear in mind
> that 256 has to be multiplied by the itemsize of each array, so the
> chunksize is currently 2048 bytes for 64-bit items (int64 or float64)
> and 4096 for double precision complex arrays, which are probably the
> sizes that have to be compared with VML.
>
So the optimum block size might depend on the type of expression and on whether VML functions are used. One question: the block size is set by a #define; is performance significantly poorer if you use a variable instead? That would be more flexible, especially for testing and tuning.

>> I was missing the support for single precision floats.
>>
>
> Yeah. This is because nobody has implemented it before, but it is
> completely doable.
>

From bolme1234 at comcast.net Fri Jan 16 15:20:37 2009
From: bolme1234 at comcast.net (David Bolme)
Date: Fri, 16 Jan 2009 13:20:37 -0700
Subject: [Numpy-discussion] Help with interpolating missing values from a 3D scanner
In-Reply-To: <6a17e9ee0901152342l533ec36cp24ac8918be1e3bf3@mail.gmail.com>
References: <3d375d730901151614h2c3b8478y9900bba97dbd8381@mail.gmail.com> <6a17e9ee0901152342l533ec36cp24ac8918be1e3bf3@mail.gmail.com>
Message-ID: <6D8BCD33-1CC6-4373-96E8-F1D054C32548@comcast.net>

Thanks for all the ideas. I think I will look into the scikits.delaunay, Rbf, or Gaussian smoothing approaches. My best idea is similar to the Gaussian smoothing. Anyway, all of the missing data gaps seem to be small enough that I expect any of these methods to accomplish my purpose. I have read some of the work on PDEs and inpainting, but I think it is overkill for this particular application. I will let you know how it works.

Thanks again,

Dave
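A minimal sketch of that masked Gaussian-smoothing idea (illustrative only; the function and its arguments are made up here, and `mask` is assumed to be True where pixels are known):

import numpy as np
from scipy import ndimage

def fill_missing(img, mask, sigma=2.0, passes=3):
    """Fill unknown pixels by normalized Gaussian smoothing."""
    known = mask.astype(np.float64)
    filled = np.where(mask, img, 0.0)
    for _ in range(passes):
        num = ndimage.gaussian_filter(filled, sigma)
        den = ndimage.gaussian_filter(known, sigma)
        estimate = num / np.maximum(den, 1e-12)  # normalized convolution
        filled = np.where(mask, img, estimate)   # keep known pixels fixed
        known = np.maximum(known, den)           # grow support into the gaps
    return filled

Each pass averages only over pixels that carry information, so small gaps are filled from their borders inward while the known pixels stay untouched.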
I think I will look into the scikits.delaunay, Rbf, or gaussian smoothing approach. My best idea is similar to the Gaussian smoothing. Anyway, all of the missing data gaps seem to be small enough that I expect any of these methods to accomplish my purpose. I have read some of the work on PDE's and in- painting but I think it is overkill for this particular application. I will let you know how it works. Thanks again, Dave From matthew.brett at gmail.com Fri Jan 16 19:07:30 2009 From: matthew.brett at gmail.com (Matthew Brett) Date: Fri, 16 Jan 2009 19:07:30 -0500 Subject: [Numpy-discussion] Please don't use google code for hosting Message-ID: <1e2af89e0901161607x277d68f6u8f7e897aa5e46c77@mail.gmail.com> Hi, I am just visiting colleagues in the Cuban Neuroscience Center, and of course I'm trying to persuade them that Python and open-source are the way forward. This is made more difficult because several projects - for example pyglet - have their repositories on Google code. Google, unlike any other hoster I've come across, bans code downloads from Cuban addresses with a message that I am trying to download from 'a forbidden country'. I assume they are playing safe with code export regulations. This breaks easy_install and direct downloads. I hope you agree with me that it serves nobody's interest to reduce collaboration with Cuban scientists on free software. So, please, if you are considering google code for hosting, consider other options. Best, Matthew From bioinformed at gmail.com Fri Jan 16 19:24:56 2009 From: bioinformed at gmail.com (Kevin Jacobs ) Date: Fri, 16 Jan 2009 19:24:56 -0500 Subject: [Numpy-discussion] Please don't use google code for hosting In-Reply-To: <1e2af89e0901161607x277d68f6u8f7e897aa5e46c77@mail.gmail.com> References: <1e2af89e0901161607x277d68f6u8f7e897aa5e46c77@mail.gmail.com> Message-ID: <2e1434c10901161624n3aeef0e0y489e4e13938a749e@mail.gmail.com> On Fri, Jan 16, 2009 at 7:07 PM, Matthew Brett wrote: > So, please, if you are considering google code for hosting, consider > other options. > Seems odd that you'd post that from a gmail account. I do sympathize with your suggestion, but I don't have a better alternative to Google code for my project. Maybe it would be best to address this limitation with Google directly. -Kevin Jacobs -------------- next part -------------- An HTML attachment was scrubbed... URL: From alan at ajackson.org Fri Jan 16 20:17:57 2009 From: alan at ajackson.org (Alan Jackson) Date: Fri, 16 Jan 2009 19:17:57 -0600 Subject: [Numpy-discussion] Please don't use google code for hosting In-Reply-To: <2e1434c10901161624n3aeef0e0y489e4e13938a749e@mail.gmail.com> References: <1e2af89e0901161607x277d68f6u8f7e897aa5e46c77@mail.gmail.com> <2e1434c10901161624n3aeef0e0y489e4e13938a749e@mail.gmail.com> Message-ID: <20090116191757.3cb22ed7@ajackson.org> On Fri, 16 Jan 2009 19:24:56 -0500 "Kevin Jacobs " wrote: > On Fri, Jan 16, 2009 at 7:07 PM, Matthew Brett wrote: > > > So, please, if you are considering google code for hosting, consider > > other options. > > > > Seems odd that you'd post that from a gmail account. I do sympathize with > your suggestion, but I don't have a better alternative to Google code for my > project. Maybe it would be best to address this limitation with Google > directly. 
I have to worry a bit about export control laws at work, and I don't remember what Cuba's status is, but I do know that for a small number of countries - North Korea, Iran, and maybe Cuba - exporting anything of value is a potentially serious crime with prison time, so I can fully appreciate why Google would be careful. The US does prosecute, and they have levied amazingly large fines against companies in recent years.

Quite possibly those restrictions for Cuba will be loosened in the next few months, from what I have read recently.

--
-----------------------------------------------------------------------
| Alan K. Jackson          | To see a World in a Grain of Sand        |
| alan at ajackson.org        | And a Heaven in a Wild Flower,           |
| www.ajackson.org         | Hold Infinity in the palm of your hand   |
| Houston, Texas           | And Eternity in an hour. - Blake         |
-----------------------------------------------------------------------

From robert.kern at gmail.com Fri Jan 16 20:27:33 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 16 Jan 2009 19:27:33 -0600
Subject: [Numpy-discussion] Please don't use google code for hosting
In-Reply-To: <1e2af89e0901161607x277d68f6u8f7e897aa5e46c77@mail.gmail.com>
References: <1e2af89e0901161607x277d68f6u8f7e897aa5e46c77@mail.gmail.com>
Message-ID: <3d375d730901161727q6a90b54i9942492abd719c29@mail.gmail.com>

On Fri, Jan 16, 2009 at 18:07, Matthew Brett wrote:
> Hi,
>
> I am just visiting colleagues in the Cuban Neuroscience Center, and of
> course I'm trying to persuade them that Python and open-source are the
> way forward.
>
> This is made more difficult because several projects - for example
> pyglet - have their repositories on Google code. Google, unlike any
> other hoster I've come across, bans code downloads from Cuban
> addresses with a message that I am trying to download from 'a
> forbidden country'. I assume they are playing safe with code export
> regulations.
>
> This breaks easy_install and direct downloads.
>
> I hope you agree with me that it serves nobody's interest to reduce
> collaboration with Cuban scientists on free software.
>
> So, please, if you are considering google code for hosting, consider
> other options.

As a workaround, you can ask the authors of the code your colleagues are interested in to place their release tarballs on pypi in addition to Google Code (the caveat being the 10MB/file limit imposed by the admins. Complain to them, too!). For SVN access, you can probably set up a bzr mirror on launchpad for them.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From simpson at math.toronto.edu Fri Jan 16 22:14:16 2009
From: simpson at math.toronto.edu (Gideon Simpson)
Date: Fri, 16 Jan 2009 22:14:16 -0500
Subject: [Numpy-discussion] tensor contractions
Message-ID: <058F21EC-8B7B-4F68-8772-0AE48151FFD0@math.toronto.edu>

Suppose I have a 3d array, A, with dimensions 2 x 2 x N, and a 2d 2 x N array, u. I interpret A as N 2x2 matrices and u as N 2d vectors. Suppose I want to apply the mth matrix to the mth vector, i.e.

A[:, :, m] u[:, m] = v[:, m]

Aside from doing

A[0,0,:]*u[0,:] + A[0,1,:]*u[1,:] = v[0,:]

and

A[1,0,:]*u[0,:] + A[1,1,:]*u[1,:] = v[1,:]

is there a smart way to perform this computation?
-gideon

From ted.horst at earthlink.net Fri Jan 16 23:54:42 2009
From: ted.horst at earthlink.net (Ted Horst)
Date: Fri, 16 Jan 2009 22:54:42 -0600
Subject: [Numpy-discussion] ANN: Numexpr 1.1, an efficient array evaluator
In-Reply-To: <200901161748.34558.faltet@pytables.org>
References: <200901161300.26130.faltet@pytables.org> <49709C3D.1010608@googlemail.com> <200901161748.34558.faltet@pytables.org>
Message-ID: <76014571-3BFF-44B4-BB8A-04DC4B5F513C@earthlink.net>

Note that Apple has a similar library called vForce. I think these libraries use several techniques and are not necessarily dependent on SSE. The Apple versions appear to only support float and double (no complex), and I don't see anything about strided arrays.

At one point I thought there was talk of adding support for vForce into the respective ufuncs. I don't know if anybody followed up on that.

On 2009-01-16, at 10:48, Francesc Alted wrote:
> Wow, pretty nice speed-ups indeed! In fact I was thinking of including
> support for threading in Numexpr (I don't think it would be too
> difficult, but let's see). BTW, do you know how VML is able to achieve
> a speedup of 6x for a sin() function? I suppose this is because they
> are using SSE instructions, but are these also available for 64-bit
> double precision items?

From olivier.grisel at ensta.org Sat Jan 17 01:37:55 2009
From: olivier.grisel at ensta.org (Olivier Grisel)
Date: Sat, 17 Jan 2009 07:37:55 +0100
Subject: [Numpy-discussion] ANN: Numexpr 1.1, an efficient array evaluator
In-Reply-To: <4970E189.8000906@googlemail.com>
References: <200901161300.26130.faltet@pytables.org> <49709C3D.1010608@googlemail.com> <200901161748.34558.faltet@pytables.org> <4970E189.8000906@googlemail.com>
Message-ID:

2009/1/16 Gregor Thalhammer:
> I am not an expert on SSE instructions, but to my knowledge there is
> (in the Core 2 architecture) no SSE instruction to calculate the sin.
> But it seems possible to (approximately) calculate a sin with a
> couple of multiplication/addition instructions (and these exist in SSE
> for 64-bit floats). Intel (and AMD) seem to use a more clever algorithm,

Here is the lib I use for SSE implementations of the transcendental functions: http://gruntthepeon.free.fr/ssemath/ (only single precision floats, though).

--
Olivier

From cournape at gmail.com Sat Jan 17 04:50:15 2009
From: cournape at gmail.com (David Cournapeau)
Date: Sat, 17 Jan 2009 18:50:15 +0900
Subject: [Numpy-discussion] ANN: Numexpr 1.1, an efficient array evaluator
In-Reply-To: <4970E189.8000906@googlemail.com>
References: <200901161300.26130.faltet@pytables.org> <49709C3D.1010608@googlemail.com> <200901161748.34558.faltet@pytables.org> <4970E189.8000906@googlemail.com>
Message-ID: <5b8d13220901170150g448682ebhd5615a5080ecd04@mail.gmail.com>

On Sat, Jan 17, 2009 at 4:35 AM, Gregor Thalhammer wrote:
> Francesc Alted schrieb:
>> A Friday 16 January 2009, Gregor Thalhammer escrigué:
>>
>>> I also gave a try to the vector math library (VML), contained in
>>> Intel's Math Kernel Library. This offers a fast implementation of
>>> mathematical functions, operating on arrays.
>>> First I implemented a C extension, providing new ufuncs. This gave
>>> me a big performance gain, e.g., 2.3x (5x) for sin, 6x (10x) for exp,
>>> 7x (15x) for pow, and 3x (6x) for division (no gain for add, sub, mul).
>>
>> Wow, pretty nice speed-ups indeed! In fact I was thinking of including
>> support for threading in Numexpr (I don't think it would be too
>> difficult, but let's see). BTW, do you know how VML is able to achieve
>> a speedup of 6x for a sin() function? I suppose this is because they
>> are using SSE instructions, but are these also available for 64-bit
>> double precision items?
>>
> I am not an expert on SSE instructions, but to my knowledge there is
> (in the Core 2 architecture) no SSE instruction to calculate the sin.
> But it seems possible to (approximately) calculate a sin with a
> couple of multiplication/addition instructions (and these exist in SSE
> for 64-bit floats). Intel (and AMD) seem to use a more clever algorithm,
> more efficiently implemented than the standard one.

Generally, transcendental functions are not sped up by being implemented in hardware. There is no special algorithm: you implement them as you would in C, using Taylor expansions or other known polynomial expansions, except you use SIMD to evaluate those polynomials. You can also use table lookup, which can be pretty fast while keeping full precision for trigonometric functions. musicdsp.org has some of those (take care: a lot of those tricks do not give full precision - they are used for music synthesis, where full precision is rarely needed and speed is of utmost importance): http://www.musicdsp.org

There were some examples on freescale.com, in full precision, but I can't find them anymore. For some functions you can get almost one order of magnitude faster transcendental functions (at full precision), but it is a lot of work to make sure they work as expected in a cross-platform way (even limiting to one CPU arch when using asm, there are differences between compilers which make this rather difficult).
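As a toy illustration of that polynomial approach (a sketch only; real libraries add argument reduction and carefully tuned minimax coefficients rather than a plain Taylor series):

import numpy as np

# sin(x) ~ x - x**3/6 + x**5/120 - x**7/5040, evaluated with Horner's
# scheme in x**2 -- the kind of straight-line kernel SIMD vectorizes well.
# Only valid on a reduced range such as [-pi/4, pi/4].
def poly_sin(x):
    x2 = x * x
    p = -1.0 / 5040
    p = p * x2 + 1.0 / 120
    p = p * x2 - 1.0 / 6
    p = p * x2 + 1.0
    return p * x

x = np.linspace(-np.pi/4, np.pi/4, 9)
print abs(poly_sin(x) - np.sin(x)).max()   # roughly 3e-7 on this range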
From grh at mur.at Sat Jan 17 05:05:11 2009
From: grh at mur.at (Georg Holzmann)
Date: Sat, 17 Jan 2009 11:05:11 +0100
Subject: [Numpy-discussion] Please don't use google code for hosting
In-Reply-To: <1e2af89e0901161607x277d68f6u8f7e897aa5e46c77@mail.gmail.com>
References: <1e2af89e0901161607x277d68f6u8f7e897aa5e46c77@mail.gmail.com>
Message-ID: <4971AD57.2000605@mur.at>

Hello!

Thanks, this is interesting! Do you also know what the situation is with sourceforge/launchpad/trac... and other popular hosting systems? Do they also have these restrictions?

LG
Georg

Matthew Brett schrieb:
> Hi,
>
> I am just visiting colleagues in the Cuban Neuroscience Center, and of
> course I'm trying to persuade them that Python and open-source are the
> way forward.
>
> This is made more difficult because several projects - for example
> pyglet - have their repositories on Google code. Google, unlike any
> other hoster I've come across, bans code downloads from Cuban
> addresses with a message that I am trying to download from 'a
> forbidden country'. I assume they are playing safe with code export
> regulations.
>
> This breaks easy_install and direct downloads.
>
> I hope you agree with me that it serves nobody's interest to reduce
> collaboration with Cuban scientists on free software.
>
> So, please, if you are considering google code for hosting, consider
> other options.
>
> Best,
>
> Matthew
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From dsdale24 at gmail.com Sat Jan 17 23:06:11 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Sat, 17 Jan 2009 23:06:11 -0500
Subject: [Numpy-discussion] question about memory ownership
Message-ID:

Hello,

Would someone be so kind as to explain how to create an ndarray subclass that owns its own memory? I think RealisticInfoArray at http://docs.scipy.org/doc/numpy/user/basics.subclassing.html#basics-subclassing does not own its own memory. Do you have to call ndarray.__new__ directly, or is there another way?

Thanks,
Darren

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From robert.kern at gmail.com Sat Jan 17 23:23:38 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Sat, 17 Jan 2009 22:23:38 -0600
Subject: [Numpy-discussion] question about memory ownership
In-Reply-To:
References:
Message-ID: <3d375d730901172023g18ea9ea2wd13b78533c5a4b95@mail.gmail.com>

On Sat, Jan 17, 2009 at 22:06, Darren Dale wrote:
> Hello,
>
> Would someone be so kind as to explain how to create an ndarray subclass
> that owns its own memory? I think RealisticInfoArray at
> http://docs.scipy.org/doc/numpy/user/basics.subclassing.html#basics-subclassing
> does not own its own memory. Do you have to call ndarray.__new__ directly,
> or is there another way?

Does it matter? Unless something actually digs down to grab the base object, the RealisticInfoArray object is the only thing keeping it alive. When it goes away, the underlying array goes away.

But take a look at the implementation of memmap for a subclass that calls ndarray.__new__ directly.

--
Robert Kern

From dsdale24 at gmail.com Sat Jan 17 23:35:54 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Sat, 17 Jan 2009 23:35:54 -0500
Subject: [Numpy-discussion] question about memory ownership
In-Reply-To: <3d375d730901172023g18ea9ea2wd13b78533c5a4b95@mail.gmail.com>
References: <3d375d730901172023g18ea9ea2wd13b78533c5a4b95@mail.gmail.com>
Message-ID:

On Sat, Jan 17, 2009 at 11:23 PM, Robert Kern wrote:
> Does it matter? Unless something actually digs down to grab the
> base object, the RealisticInfoArray object is the only thing keeping
> it alive. When it goes away, the underlying array goes away.
>
> But take a look at the implementation of memmap for a subclass that
> calls ndarray.__new__ directly.

I'm still working on my physical quantities package, and I want to prevent something like the following:

q = [1,2,3,4]*kg*m**2/s**2
qq = q[:2]
qq.units = BTU

That would give me qq in units of British thermal units, but it would also modify the magnitude of q[:2]. It seems the most straightforward solution is to simply disallow in-place modification of units when the array doesn't own its own memory.

Darren

-------------- next part --------------
An HTML attachment was scrubbed...
URL:
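A minimal sketch of such a guard (the `Quantity` class and its method name here are hypothetical, made up for illustration):

import numpy as np

class Quantity(np.ndarray):
    # hypothetical guard: refuse in-place unit changes on views
    def set_units(self, new_units):
        if not self.flags['OWNDATA']:
            raise ValueError("array is a view; changing units in-place "
                             "would silently rescale its parent")
        # ... rescale the data and record new_units here ...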
From robert.kern at gmail.com Sat Jan 17 23:44:05 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Sat, 17 Jan 2009 22:44:05 -0600
Subject: [Numpy-discussion] question about memory ownership
In-Reply-To:
References: <3d375d730901172023g18ea9ea2wd13b78533c5a4b95@mail.gmail.com>
Message-ID: <3d375d730901172044uf3fd4b2k612167ad2db7affe@mail.gmail.com>

On Sat, Jan 17, 2009 at 22:35, Darren Dale wrote:
> I'm still working on my physical quantities package, and I want to prevent
> something like the following:
>
> q = [1,2,3,4]*kg*m**2/s**2
> qq = q[:2]
> qq.units = BTU
>
> That would give me qq in units of British thermal units, but it would also
> modify the magnitude of q[:2]. It seems the most straightforward solution
> is to simply disallow in-place modification of units when the array doesn't
> own its own memory.

Ah. I'd take a functional approach to unit conversion, then. You can't modify units in-place. Memory can't be much of a concern since the conversion code will have temporaries.

But you could check that .base is an instance of your quantity class. If it's a regular array, go ahead and modify in-place.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From simpson at math.toronto.edu Sun Jan 18 00:30:21 2009
From: simpson at math.toronto.edu (Gideon Simpson)
Date: Sun, 18 Jan 2009 00:30:21 -0500
Subject: [Numpy-discussion] efficient usage of tensordot
Message-ID: <9A822472-3C2D-4EE3-88E4-D2C8528868E1@math.toronto.edu>

This is related to a question I posted earlier. Suppose I have array A with dimensions n x m x l and array x with dimensions m x l. Interpret this as an array of l n x m matrices and an array of l m-dimensional vectors. I wish to compute the matrix-vector product A[:,:,k] x[:,k] for each k = 0, ..., l-1. I discovered that I could accomplish this with the command

np.diagonal(np.tensordot(A, x, axes=(1,0)), axis1=1, axis2=2)

The tensordot command gives me

A_{ijk} x_{jl} = C_{ikl}

and the diagonal command grabs the entries in array C where k=l. Is this the "optimal" way to make this calculation in numpy? It certainly makes for nice, clean code, but is it the fastest I can get?

-gideon
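For comparison, a sketch of that tensordot/diagonal route against a direct broadcasting product (the latter avoids building the n x l x l intermediate that tensordot forms before diagonal throws most of it away):

import numpy as np

n, m, l = 2, 3, 5
A = np.random.rand(n, m, l)
x = np.random.rand(m, l)

# tensordot contracts j, giving C[i,k,l'] = sum_j A[i,j,k]*x[j,l'];
# diagonal then keeps only the k == l' entries.
v1 = np.diagonal(np.tensordot(A, x, axes=(1, 0)), axis1=1, axis2=2)

# broadcasting: multiply elementwise and sum over j directly
v2 = (A * x[np.newaxis, :, :]).sum(axis=1)

assert np.allclose(v1, v2)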
From nadavh at visionsense.com Sun Jan 18 03:38:31 2009 From: nadavh at visionsense.com (Nadav Horesh) Date: Sun, 18 Jan 2009 10:38:31 +0200 Subject: [Numpy-discussion] efficient usage of tensordot References: <9A822472-3C2D-4EE3-88E4-D2C8528868E1@math.toronto.edu> Message-ID: <710F2847B0018641891D9A216027636029C3BD@ex3.envision.co.il> This is not the first time this issue is raised here. You may try this piece of code, which may take less memory: (A*x).sum(axis=1).T Nadav -----Original Message----- From: numpy-discussion-bounces at scipy.org on behalf of Gideon Simpson Sent: Sunday, 18 January 2009 07:30 To: Discussion of Numerical Python Subject: [Numpy-discussion] efficient usage of tensordot This is related to a question I posted earlier. Suppose I have array A with dimensions n x m x l and array x with dimensions m x l. Interpret this as an array of l nxm matrices and an array of l m-dimensional vectors. I wish to compute the matrix-vector product A[:,:,k] x[:,k] for each k = 0, ..., l-1. I discovered that I could accomplish this with the command np.diagonal(np.tensordot(A, x, axes=(1,0)), axis1=1, axis2=2) The tensordot command gives me A_{ijk}x_{jl} = C_{ikl} And the diagonal command grabs the entries in array C where k=l. Is this the "optimal" way to make this calculation in numpy? It certainly makes for nice, clean code, but is it the fastest I can get? -gideon _______________________________________________ Numpy-discussion mailing list Numpy-discussion at scipy.org http://projects.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 3412 bytes Desc: not available URL: From dsdale24 at gmail.com Sun Jan 18 08:17:53 2009 From: dsdale24 at gmail.com (Darren Dale) Date: Sun, 18 Jan 2009 08:17:53 -0500 Subject: [Numpy-discussion] question about memory ownership In-Reply-To: <3d375d730901172044uf3fd4b2k612167ad2db7affe@mail.gmail.com> References: <3d375d730901172023g18ea9ea2wd13b78533c5a4b95@mail.gmail.com> <3d375d730901172044uf3fd4b2k612167ad2db7affe@mail.gmail.com> Message-ID: On Sat, Jan 17, 2009 at 11:44 PM, Robert Kern wrote: > On Sat, Jan 17, 2009 at 22:35, Darren Dale wrote: > > > > On Sat, Jan 17, 2009 at 11:23 PM, Robert Kern > wrote: > >> > >> On Sat, Jan 17, 2009 at 22:06, Darren Dale wrote: > >> > Hello, > >> > > >> > Would someone be so kind as to explain how to create an ndarray subclass > >> > that owns its own memory? I think RealisticInfoArray at > >> > > >> > http://docs.scipy.org/doc/numpy/user/basics.subclassing.html#basics-subclassing > >> > does not own its own memory, do you have to call ndarray.__new__ > >> > directly, > >> > or is there another way? > >> > >> Does it matter? Unless if something actually digs down to grab the > >> base object, the RealisticInfoArray object is the only thing keeping > >> it alive. When it goes away, the underlying array goes away. > >> > >> But take a look at the implementation of memmap for a subclass that > >> calls ndarray.__new__ directly. > > > > > > I'm still working on my physical quantities package, and I want to prevent > > something like the following: > > > > q=[1,2,3,4]*kg*m**2/s**2 > > qq=q[:2] > > qq.units=BTU > > > > That would give me qq in units of british thermal units, but it would also > > modify the magnitude of q[:2]. It seems the most straight forward solution > > is to simply disallow in-place modification of units when the array doesn't > > own its own memory. > > Ah.
I'd take a functional approach to unit conversion, then. You can't > modify units in-place. Memory can't be much of a concern since the > conversion code will have temporaries. > > But you could check that .base is an instance of your quantity class. > If it's a regular array, go ahead and modify in-place. > Thanks for the advice. I'd like to look into the second option, since the former would not allow other in-place operations like *= /= **/. Hopefully the package will be fit to share with this list before too long. Darren -------------- next part -------------- An HTML attachment was scrubbed... URL: From bolme1234 at comcast.net Sun Jan 18 13:31:25 2009 From: bolme1234 at comcast.net (David Bolme) Date: Sun, 18 Jan 2009 11:31:25 -0700 Subject: [Numpy-discussion] Help with interpolating missing values from a 3D scanner In-Reply-To: References: Message-ID: I have implemented an iterative gaussian smoothing approach that is working well for my purposes. My approach uses a median filter to populate the initial values and then runs a few passes with gaussian smoothing. This works very well for the missing values that I care about within the face region. I also came across an error when I tried to use the Rbf class. I was hoping that I could just input all of the data that I have and have a quick and easy solution. I expect this would work if ran the Rbf on just a small image tile near the missing data region. I am not sure if this is worthy of a bug report. When I tried to create an RBF from the full image I got this error: Traceback (most recent call last): File "/Users/bolme/Documents/workspace/pyvision/src/pyvision/types/ RangeImage.py", line 258, in ri.populateMissingData() File "/Users/bolme/Documents/workspace/pyvision/src/pyvision/types/ RangeImage.py", line 184, in populateMissingData it.Rbf(x[mask],y[mask],z[mask]) File "/Library/Python/2.5/site-packages/scipy-0.7.0.dev4645-py2.5- macosx-10.3-i386.egg/scipy/interpolate/rbf.py", line 129, in __init__ r = self._call_norm(self.xi, self.xi) File "/Library/Python/2.5/site-packages/scipy-0.7.0.dev4645-py2.5- macosx-10.3-i386.egg/scipy/interpolate/rbf.py", line 144, in _call_norm return self.norm(x1, x2) File "/Library/Python/2.5/site-packages/scipy-0.7.0.dev4645-py2.5- macosx-10.3-i386.egg/scipy/interpolate/rbf.py", line 54, in _euclidean_norm return sqrt( ((x1 - x2)**2).sum(axis=0) ) ValueError: broadcast dimensions too large. This is probably because I tried to input the full 640X480 image. Too much data. x[mask], y[mask], and z[mask] are a one dimensional arrays with approximately 100,000 elements. I am trying to predict z. It would be nice to have a more descriptive error message. From jh at physics.ucf.edu Sun Jan 18 16:21:42 2009 From: jh at physics.ucf.edu (jh at physics.ucf.edu) Date: Sun, 18 Jan 2009 16:21:42 -0500 Subject: [Numpy-discussion] ANN: Numexpr 1.1, an efficient array evaluator In-Reply-To: <200901161704.51777.faltet@pytables.org> (message from Francesc Alted on Fri, 16 Jan 2009 17:04:51 +0100) References: <200901161704.51777.faltet@pytables.org> Message-ID: Francesc Alted wrote: > > > Numexpr is a fast numerical expression evaluator for NumPy. With > > > it, expressions that operate on arrays (like "3*a+4*b") are > > > accelerated and use less memory than doing the same calculation in > > > Python. > > > Please pardon my ignorance as I know this project has been around for > > a while. It this looks very exciting, but either it's cumbersome, or > > I'm not understanding exactly what's being fixed. 
If you can > > accelerate evaluation, why not just integrate the faster math into > > numpy, rather than having two packages? Or is this something that is > > only an advantage when the expression is given as a string (and why > > is that the case)? It would be helpful if you could put the answer > > on your web page and in your standard release blurb in some compact > > form. I guess what I'm really looking for when I read one of those is > > a quick answer to the question "should I look into this?". > Well, there is a link in the project page to the "Overview" section of > the wiki, but perhaps is a bit hidden. I've added some blurb as you > suggested in the main page an another link to the "Overview" wiki page. > Hope that, by reading the new blurb, you can see why it accelerates > expression evaluation with regard to NumPy. If not, tell me and will > try to come with something more comprehensible. I did see the overview. The addition you made is great but it's so far down that many won't get to it. Even in its section, the meat of it is below three paragraphs that most users won't care about and many won't understand. I've posted some notes on writing intros in Developer_Zone. In the following, I've reordered the page to address the questions of potential users first, edited it a bit, and fixed the example to conform to our doc standards (and 128->256; hope that was right). See what you think... ** Description: The numexpr package evaluates multiple-operator array expressions many times faster than numpy can. It accepts the expression as a string, analyzes it, rewrites it more efficiently, and compiles it to faster Python code on the fly. It's the next best thing to writing the expression in C and compiling it with an optimizing compiler (as scipy.weave does), but requires no compiler at runtime. Using it is simple: >>> import numpy as np >>> import numexpr as ne >>> a = np.arange(10) >>> b = np.arange(0, 20, 2) >>> c = ne.evaluate("2*a+3*b") >>> c array([ 0, 8, 16, 24, 32, 40, 48, 56, 64, 72]) ** Why does it work? There are two extremes to array expression evaluation. Each binary operation can run separately over the array elements and return a temporary array. This is what NumPy does: 2*a + 3*b uses three temporary arrays as large as a or b. This strategy wastes memory (a problem if the arrays are large). It is also not a good use of CPU cache memory because the results of 2*a and 3*b will not be in cache for the final addition if the arrays are large. The other extreme is to loop over each element: for i in xrange(len(a)): c[i] = 2*a[i] + 3*b[i] This conserves memory and is good for the cache, but on each iteration Python must check the type of each operand and select the correct routine for each operation. All but the first such checks are wasted, as the input arrays are not changing. numexpr uses an in-between approach. Arrays are handled in chunks (the first pass uses 256 elements). As Python code, it looks something like this: for i in xrange(0, len(a), 256): r0 = a[i:i+256] r1 = b[i:i+256] multiply(r0, 2, r2) multiply(r1, 3, r3) add(r2, r3, r2) c[i:i+256] = r2 The 3-argument form of add() stores the result in the third argument, instead of allocating a new array. This achieves a good balance between cache and branch prediction. The virtual machine is written entirely in C, which makes it faster than the Python above. 
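As a concrete illustration, here is a runnable rendering of the chunked loop sketched above; the output buffers are preallocated (the three-argument ufunc forms write into existing arrays), and the final short chunk is handled explicitly:

import numpy as np

a = np.arange(100000, dtype=float)
b = np.arange(0, 200000, 2, dtype=float)
c = np.empty_like(a)

chunk = 256
r2 = np.empty(chunk)
r3 = np.empty(chunk)
for i in range(0, len(a), chunk):
    r0 = a[i:i+chunk]
    r1 = b[i:i+chunk]
    n = len(r0)                        # the last chunk may be shorter
    np.multiply(r0, 2, r2[:n])
    np.multiply(r1, 3, r3[:n])
    np.add(r2[:n], r3[:n], r2[:n])
    c[i:i+chunk] = r2[:n]

assert np.allclose(c, 2*a + 3*b)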
** Supported Operators (unchanged) ** Supported Functions (unchanged, but capitalize 'F') ** Usage Notes (no need to repeat the example) Numexpr's principal routine is: evaluate(ex, local_dict=None, global_dict=None, **kwargs) ex is a string forming an expression, like "2*a+3*b". The values for a and b will by default be taken from the calling function's frame (through the use of sys._getframe()). Alternatively, they can be specified using the local_dict or global_dict` arguments, or passed as keyword arguments. Expressions are cached, so reuse is fast. Arrays or scalars are allowed for the variables, which must be of type 8-bit boolean (bool), 32-bit signed integer (int), 64-bit signed integer (long), double-precision floating point number (float), 2x64-bit, double-precision complex number (complex) or raw string of bytes (str). The arrays must all be the same size. ** Building (unchanged, but move down since it's standard and most users will only do this once, if ever) ** Implementation Notes (rest of current How It Works section) ** Credits --jh-- From robert.kern at gmail.com Mon Jan 19 04:27:02 2009 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 19 Jan 2009 03:27:02 -0600 Subject: [Numpy-discussion] question about memory ownership In-Reply-To: References: <3d375d730901172023g18ea9ea2wd13b78533c5a4b95@mail.gmail.com> <3d375d730901172044uf3fd4b2k612167ad2db7affe@mail.gmail.com> Message-ID: <3d375d730901190127med2cf85x350efe72111c6d82@mail.gmail.com> On Sun, Jan 18, 2009 at 07:17, Darren Dale wrote: > Thanks for the advice. I'd like to look into the second option, since the > former would not allow other in-place operations like *= /= **/. That is a good use case. > Hopefully > the package will be fit to share with this list before too long. Great! -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From yakov.keselman at gmail.com Mon Jan 19 04:49:12 2009 From: yakov.keselman at gmail.com (Yakov Keselman) Date: Mon, 19 Jan 2009 01:49:12 -0800 Subject: [Numpy-discussion] How to make "lazy" derived arrays in a recarray view of a memmap on large files In-Reply-To: References: Message-ID: The problem, as I understand it, is this: you have a large array and you want to define objects that (1) behave like arrays; (2) are derived from the large array (can be computed from it); (3) should not take much space if only small portions of the large array are ever referenced. A simple solution that would satisfy (2) and (3) would be to define a method that uses indices and returns real arrays, so that instead of saying "besp.bacon[1000000:1001000]" you'd say "besp.bacon(1000000:1001000)". You'd do >>4 inside the method only on the portion of the original array that you're interested in. A solution that would also satisfy (1) would need to implement array-like methods on your newly-defined class. You can probably use http://docs.python.org/reference/datamodel.html#emulating-container-types as your starting point. If your code is meant only for yourself, I'd suggest to go with the inconvenient but working method approach. If other people will want to use your class in an array-like manner, you'd have to properly define all the functionality that people would expect from an array. Hope this helps. 
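= Yakov

A rough sketch of the method-based approach described above, using the names from this thread; the '<u1' packed dtype is an assumption (one byte per record, bacon in the high nibble and eggs in the low one, consistent with the >>4 and &0x0F operations quoted below):

import numpy as np

class LazyBaconEggs(object):
    def __init__(self, filename):
        desc = np.dtype([("baconandeggs", "<u1")])   # assumed packed layout
        self._data = np.memmap(filename, dtype=desc, mode='r')

    def bacon(self, start, stop):
        # Read and unpack only the requested slice, never the whole array.
        return self._data["baconandeggs"][start:stop] >> 4

    def eggs(self, start, stop):
        return self._data["baconandeggs"][start:stop] & 0x0F

# besp = LazyBaconEggs("baconeggsspamparrots.idx")
# dosomething(besp.bacon(1000000, 1001000))   # ~1000 elements derived on demand

Note the parentheses: per the suggestion above, the derived field is a method call on an index range, so only the requested portion is ever shifted or masked.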
On 1/16/09, Kim Hansen wrote: > Hi numpy forum > > I need to efficiently handle some large (300 MB) recordlike binary > files, where some data fields are less than a byte and thus cannot be > mapped in a record dtype immediately. > > I would like to be able to access these derived arrays in a memory > efficient manner but I cannot figure out how to achieve this. > > My application of the derived arrays would never be to do operations on > the entire array, rather iterate over some selected elements and do > something about it - operations which seem well suited for doing on > demand > > I wrote a related post yesterday, which I have not received any > response on. I am now posting again using another and perhaps > clearer example which I believe describes my problem spot on > > from numpy import * > > # Python.exe memory use here: 8.14 MB > desc = dtype([("baconandeggs", "<u1")]) > index = memmap("g:/id-2008-10-25-17-ver4.idx", dtype = desc, > mode="r").view(recarray) > # The index file is very large, contains 292 MB of data > # Python.exe memory use: 8.16 MB, only 20 kB extra for memmap mapped to > recarray > > # The following instant operation takes a few secs working on 3*10^8 > elements > # How can I derive new array in a lazy/ondemand/memmap manner? > index.bacon = index.baconandeggs >> 4 > # python.exe memory use: 595 MB! Not surprising but how to do better?? > > # Another derived array, which is resource demanding > index.eggs = index.baconandeggs & 0x0F > # python.exe memory usage is now 731 MB! > > What I'd like to do is implement a class, LazyBaconEggsSpamParrots, > which encapsulates the > derived arrays > > such that I could do > > besp = LazyBaconEggsSpamParrots("baconeggsspamparrots.idx") > for b in besp.bacon: #Iterate lazy > spam(b) > #Only derive the 1000 needed elements, don't do all 1000000 > dosomething(besp.bacon[1000000:1001000]) > > I envision the class would look something like this > > class LazyBaconEggsSpamParrots(object): > > def __init__(self, filename): > desc = dtype([("baconandeggs", "<u1"), > ("spam", "<u4"), > ("parrots", "<u4")]) > self._data = memmap(filename, dtype=desc, mode='r').view(recarray) > # Expose the one-to-one data directly > self.spam = self._data.spam > self.parrots = self._data.parrots > # This would work but costs way too much memory > # self.bacon = self._data.baconandeggs >> 4 > # self.eggs = self._data.baconandeggs & 0x0F > > def __getattr__(self, attr_name): > if attr_name == "bacon": > # return bacon in an on demand manner, but how? > elif attr_name == "eggs": > # return eggs in an on demand manner, but how? > else: > # If the name is not a data attribute treat it as a normal > # non-existing attribute - raise AttributeError > raise AttributeError > > but how to do the lazy part of it? > > -- Kim > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From slaunger at gmail.com Mon Jan 19 05:23:01 2009 From: slaunger at gmail.com (Kim Hansen) Date: Mon, 19 Jan 2009 11:23:01 +0100 Subject: [Numpy-discussion] How to make "lazy" derived arrays in a recarray view of a memmap on large files In-Reply-To: References: Message-ID: Hi Yakov, Thank you for your kind advice. I ended up doing something simpler and less arcane.
I first read the baconandeggs interleaved data from a memmap, create a new writeable memmap based on an unpacked dtype, where bacon and eggs are in two different '<u1' fields, and then work on those unpacked arrays directly. -- Kim On 1/19/09, Yakov Keselman wrote: > The problem, as I understand it, is this: > > you have a large array and you want to define objects that (1) behave > like arrays; (2) are derived from the large array (can be computed > from it); (3) should not take much space if only small portions of the > large array are ever referenced. > > A simple solution that would satisfy (2) and (3) would be to define a > method that uses indices and returns real arrays, so that instead of > saying "besp.bacon[1000000:1001000]" you'd say > "besp.bacon(1000000:1001000)". You'd do >>4 inside the method only on > the portion of the original array that you're interested in. > > A solution that would also satisfy (1) would need to implement > array-like methods on your newly-defined class. You can probably use > http://docs.python.org/reference/datamodel.html#emulating-container-types > as your starting point. > > If your code is meant only for yourself, I'd suggest to go with the > inconvenient but working method approach. If other people will want to > use your class in an array-like manner, you'd have to properly define > all the functionality that people would expect from an array. > > Hope this helps. > > = Yakov > > > On 1/16/09, Kim Hansen wrote: >> Hi numpy forum >> >> I need to efficiently handle some large (300 MB) recordlike binary >> files, where some data fields are less than a byte and thus cannot be >> mapped in a record dtype immediately. >> >> I would like to be able to access these derived arrays in a memory >> efficient manner but I cannot figure out how to achieve this. >> >> My application of the derived arrays would never be to do operations on >> the entire array, rather iterate over some selected elements and do >> something about it - operations which seem well suited for doing on >> demand >> >> I wrote a related post yesterday, which I have not received any >> response on. I am now posting again using another and perhaps >> clearer example which I believe describes my problem spot on >> >> from numpy import * >> >> # Python.exe memory use here: 8.14 MB >> desc = dtype([("baconandeggs", "<u1")]) >> index = memmap("g:/id-2008-10-25-17-ver4.idx", dtype = desc, >> mode="r").view(recarray) >> # The index file is very large, contains 292 MB of data >> # Python.exe memory use: 8.16 MB, only 20 kB extra for memmap mapped to >> recarray >> >> # The following instant operation takes a few secs working on 3*10^8 >> elements >> # How can I derive new array in a lazy/ondemand/memmap manner? >> index.bacon = index.baconandeggs >> 4 >> # python.exe memory use: 595 MB! Not surprising but how to do better?? >> >> # Another derived array, which is resource demanding >> index.eggs = index.baconandeggs & 0x0F >> # python.exe memory usage is now 731 MB!
>> >> What I'd like to do is implement a class, LazyBaconEggsSpamParrots, >> which encapsulates the >> derived arrays >> >> such that I could do >> >> besp = LazyBaconEggsSpamParrots("baconeggsspamparrots.idx") >> for b in besp.bacon: #Iterate lazy >> spam(b) >> #Only derive the 1000 needed elements, don't do all 1000000 >> dosomething(besp.bacon[1000000:1001000]) >> >> I envision the class would look something like this >> >> class LazyBaconEggsSpamParrots(object): >> >> def __init__(self, filename): >> desc = dtype([("baconandeggs", "<u1"), >> ("spam", "<u4"), >> ("parrots", "<u4")]) >> self._data = memmap(filename, dtype=desc, mode='r').view(recarray) >> # Expose the one-to-one data directly >> self.spam = self._data.spam >> self.parrots = self._data.parrots >> # This would work but costs way too much memory >> # self.bacon = self._data.baconandeggs >> 4 >> # self.eggs = self._data.baconandeggs & 0x0F >> >> def __getattr__(self, attr_name): >> if attr_name == "bacon": >> # return bacon in an on demand manner, but how? >> elif attr_name == "eggs": >> # return eggs in an on demand manner, but how? >> else: >> # If the name is not a data attribute treat it as a normal >> # non-existing attribute - raise AttributeError >> raise AttributeError >> >> but how to do the lazy part of it? >> >> -- Kim >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion >> > > -- > Not to laugh, not to lament, not to curse, but to understand. -- Spinoza > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From meine at informatik.uni-hamburg.de Mon Jan 19 05:34:47 2009 From: meine at informatik.uni-hamburg.de (Hans Meine) Date: Mon, 19 Jan 2009 11:34:47 +0100 Subject: [Numpy-discussion] new incremental statistics project In-Reply-To: References: Message-ID: <200901191134.47779.meine@informatik.uni-hamburg.de> On Friday 19 December 2008 03:27:12 Bradford Cross wrote: > This is a new project I just released. > > I know it is C#, but some of the design and idioms would be nice in > numpy/scipy for working with discrete event simulators, time series, and > event stream processing. > > http://code.google.com/p/incremental-statistics/ Hi, do you know about the boost accumulators project? It's still in boost's sandbox, but I love its design, and it provides a large number of well-documented, mathematically sound estimators for variance, mean, etc.: http://boost-sandbox.sourceforge.net/libs/accumulators/doc/html/index.html Just a heads-up, in case someone finds this useful here. (Don't know about people's fondness of boost and/or C++ here.) Greetings, Hans From matthew.brett at gmail.com Mon Jan 19 14:05:42 2009 From: matthew.brett at gmail.com (Matthew Brett) Date: Mon, 19 Jan 2009 14:05:42 -0500 Subject: [Numpy-discussion] Please don't use google code for hosting In-Reply-To: <2e1434c10901161624n3aeef0e0y489e4e13938a749e@mail.gmail.com> References: <1e2af89e0901161607x277d68f6u8f7e897aa5e46c77@mail.gmail.com> <2e1434c10901161624n3aeef0e0y489e4e13938a749e@mail.gmail.com> Message-ID: <1e2af89e0901191105r3a2468c2r660d92030f1a059c@mail.gmail.com> Hi, > Seems odd that you'd post that from a gmail account. I'm not saying Google in general is bad. I'm just suggesting that, in the particular case of Google code, it would allow greater openness if you use something else.
> I do sympathize with > your suggestion, but I don't have a better alternative to Google code for my > project. I sympathize too! > Maybe it would be best to address this limitation with Google > directly. I'd like to change Google's policy, but I doubt my suggestion will change their minds. I'm most interested in making sure the code is freely available (to everyone). Best, Matthew From matthew.brett at gmail.com Mon Jan 19 14:07:18 2009 From: matthew.brett at gmail.com (Matthew Brett) Date: Mon, 19 Jan 2009 14:07:18 -0500 Subject: [Numpy-discussion] Please don't use google code for hosting In-Reply-To: <3d375d730901161727q6a90b54i9942492abd719c29@mail.gmail.com> References: <1e2af89e0901161607x277d68f6u8f7e897aa5e46c77@mail.gmail.com> <3d375d730901161727q6a90b54i9942492abd719c29@mail.gmail.com> Message-ID: <1e2af89e0901191107g1e4cbe2fvc03876fa105a00ce@mail.gmail.com> Hi, > As a workaround, you can ask the authors of the code your colleagues > are interested in to place their release tarballs on pypi in addition > to Google Code (the caveat being the 10MB/file limit imposed by the > admins. Complain to them, too!). For SVN access, you can probably set > up a bzr mirror on launchpad for them. Thanks - very useful - and - it would be very good if those of us hosting on google code could also post to pypi - solving most of the current problem. See you, Matthew From matthew.brett at gmail.com Mon Jan 19 14:20:30 2009 From: matthew.brett at gmail.com (Matthew Brett) Date: Mon, 19 Jan 2009 14:20:30 -0500 Subject: [Numpy-discussion] Please don't use google code for hosting In-Reply-To: <4971AD57.2000605@mur.at> References: <1e2af89e0901161607x277d68f6u8f7e897aa5e46c77@mail.gmail.com> <4971AD57.2000605@mur.at> Message-ID: <1e2af89e0901191120k6b2880betb7e43c2607c1336d@mail.gmail.com> Hi, > Do you also know how the situation is with sourceforge/launchpad/trac... > and other popular hosting systems ? > Do they also have these restrictions ? I've not noticed any problems with sourceforge, nor launchpad - I'm using them regularly from here. You'd hope that was the case for launchpad - when my browser started this morning, it reminded me of the Ubuntu / Canonical mission - http://www.canonical.com/aboutus 1) delivering the world's best free software platform 2) ensuring its availability to everyone ... Best, Matthew From faltet at pytables.org Mon Jan 19 15:04:21 2009 From: faltet at pytables.org (Francesc Alted) Date: Mon, 19 Jan 2009 21:04:21 +0100 Subject: [Numpy-discussion] ANN: Numexpr 1.1, an efficient array evaluator In-Reply-To: References: <200901161704.51777.faltet@pytables.org> Message-ID: <200901192104.22227.faltet@pytables.org> A Sunday 18 January 2009, jh at physics.ucf.edu escrigu?: > Francesc Alted wrote: > > > > Numexpr is a fast numerical expression evaluator for NumPy. > > > > With it, expressions that operate on arrays (like "3*a+4*b") > > > > are accelerated and use less memory than doing the same > > > > calculation in Python. > > > > > > Please pardon my ignorance as I know this project has been around > > > for a while. It this looks very exciting, but either it's > > > cumbersome, or I'm not understanding exactly what's being fixed. > > > If you can accelerate evaluation, why not just integrate the > > > faster math into numpy, rather than having two packages? Or is > > > this something that is only an advantage when the expression is > > > given as a string (and why is that the case)? 
It would be > > > helpful if you could put the answer on your web page and in your > > > standard release blurb in some compact form. I guess what I'm > > > really looking for when I read one of those is a quick answer to > > > the question "should I look into this?". > > > > Well, there is a link in the project page to the "Overview" section > > of the wiki, but perhaps is a bit hidden. I've added some blurb as > > you suggested in the main page an another link to the "Overview" > > wiki page. Hope that, by reading the new blurb, you can see why it > > accelerates expression evaluation with regard to NumPy. If not, > > tell me and will try to come with something more comprehensible. > > I did see the overview. The addition you made is great but it's so > far down that many won't get to it. Even in its section, the meat of > it is below three paragraphs that most users won't care about and > many won't understand. I've posted some notes on writing intros in > Developer_Zone. > > In the following, I've reordered the page to address the questions of > potential users first, edited it a bit, and fixed the example to > conform to our doc standards (and 128->256; hope that was right). > See what you think... [clip] That's great! I've heavily changed the docs on the project site. I've followed your advices in most of places, but not always (a `Building` section has to be always high on a manual, IMHO). Thanks a lot for your contribution! -- Francesc Alted From jonathan.taylor at utoronto.ca Mon Jan 19 15:43:55 2009 From: jonathan.taylor at utoronto.ca (Jonathan Taylor) Date: Mon, 19 Jan 2009 15:43:55 -0500 Subject: [Numpy-discussion] Testing for close to zero? Message-ID: <463e11f90901191243u2fc1c50u3110420c2af859d5@mail.gmail.com> Hi, When solving a quadratic equation I get that alpha = -3.78336776728e-31 which I believe to be far below machine precision: finfo(float).eps 2.2204460492503131e-16 But an if statement like: if alpha == 0: ... does not catch this. Is there a better way to check for things that are essentially zero or should I really be using if np.abs(alpha) < finfo(float).eps: ... ? Thanks for any help. Jonathan. From robert.kern at gmail.com Mon Jan 19 15:55:39 2009 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 19 Jan 2009 14:55:39 -0600 Subject: [Numpy-discussion] Testing for close to zero? In-Reply-To: <463e11f90901191243u2fc1c50u3110420c2af859d5@mail.gmail.com> References: <463e11f90901191243u2fc1c50u3110420c2af859d5@mail.gmail.com> Message-ID: <3d375d730901191255o13677721v574a69e8670ffad1@mail.gmail.com> On Mon, Jan 19, 2009 at 14:43, Jonathan Taylor wrote: > Hi, > > When solving a quadratic equation I get that alpha = > -3.78336776728e-31 which I believe to be far below machine precision: > > finfo(float).eps > 2.2204460492503131e-16 > > But an if statement like: > > if alpha == 0: > ... > > does not catch this. Is there a better way to check for things that > are essentially zero or should I really be using > > if np.abs(alpha) < finfo(float).eps: > ... Almost. You should scale eps by some estimate of the size of the problem. Exactly how you should do this depends on the problem, though. Errors accumulate in different ways depending on the operations you perform on the numbers. Multiplying eps by max(abs(array_of_inputs)) is probably a reasonable starting point. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." 
-- Umberto Eco From jh at physics.ucf.edu Mon Jan 19 16:31:57 2009 From: jh at physics.ucf.edu (jh at physics.ucf.edu) Date: Mon, 19 Jan 2009 16:31:57 -0500 Subject: [Numpy-discussion] ANN: Numexpr 1.1, an efficient array evaluator In-Reply-To: <200901192104.22227.faltet@pytables.org> (message from Francesc Alted on Mon, 19 Jan 2009 21:04:21 +0100) References: <200901161704.51777.faltet@pytables.org> <200901192104.22227.faltet@pytables.org> Message-ID: Thanks! I think this will help the package attract a lot of users. A couple of housekeeping things: on http://code.google.com/p/numexpr: What it is? -> What is it? or What it is (no question mark) on http://code.google.com/p/numexpr/wiki/Overview: The last example got incorporated as straight text somehow. In firefox, the first code example runs into the pastel boxes on the right for modest-width browsers. This is a common problem with firefox, but I think it comes from improper HTML code that IE somehow deals with, rather than non-standard behavior in firefox. One thing I'd add is a benchmark example against numpy. Make it simple, so that people can copy and modify the benchmark code to test their own performance improvements. I added an entry for it on the Topical Software list. Please check it out and modify as you see fit --jh-- From h5py at alfven.org Mon Jan 19 19:59:03 2009 From: h5py at alfven.org (Andrew Collette) Date: Mon, 19 Jan 2009 16:59:03 -0800 Subject: [Numpy-discussion] ANN: Numexpr 1.1, an efficient array evaluator In-Reply-To: <200901161300.26130.faltet@pytables.org> References: <200901161300.26130.faltet@pytables.org> Message-ID: Hi Francesc, Looks like a cool project! However, I'm not able to achieve the advertised speed-ups. I wrote a simple script to try three approaches to this kind of problem: 1) Native Python code (i.e. will try to do everything at once using temp arrays) 2) Straightforward numexpr evaluation 3) Simple "chunked" evaluation using array.flat views. (This solves the memory problem and allows the use of arbitrary Python expressions). I've attached the script; here's the output for the expression "63 + (a*b) + (c**2) + sin(b)" along with a few combinations of shapes/dtypes. As expected, using anything other than "f8" (double) results in a performance penalty. Surprisingly, it seems that using chunks via array.flat results in similar performance for f8, and even better performance for other dtypes. (100, 100, 100) f4 (average of 10 runs) Simple: 0.155238199234 Numexpr: 0.278440499306 Chunked: 0.166213512421 (100, 100, 100) f8 (average of 10 runs) Simple: 0.241649699211 Numexpr: 0.192837905884 Chunked: 0.183888602257 (100, 100, 100, 10) f4 (average of 10 runs) Simple: 1.56741549969 Numexpr: 3.40679829121 Chunked: 1.83729870319 (100, 100, 100) i4 (average of 10 runs) Simple: 0.206279683113 Numexpr: 0.210431909561 Chunked: 0.182894086838 FYI, the current tar file (1.1-1) has a glitch related to the VERSION file; I added to the bug report at google code. Andrew Collette On Fri, Jan 16, 2009 at 4:00 AM, Francesc Alted wrote: > ======================== > Announcing Numexpr 1.1 > ======================== > > Numexpr is a fast numerical expression evaluator for NumPy. With it, > expressions that operate on arrays (like "3*a+4*b") are accelerated > and use less memory than doing the same calculation in Python. > > The expected speed-ups for Numexpr respect to NumPy are between 0.95x > and 15x, being 3x or 4x typical values. 
The strided and unaligned > case has been optimized too, so if the expresion contains such arrays, > the speed-up can increase significantly. Of course, you will need to > operate with large arrays (typically larger than the cache size of your > CPU) to see these improvements in performance. > > This release is mainly intended to put in sync some of the > improvements that had the Numexpr version integrated in PyTables. > So, this standalone version of Numexpr will benefit of the well tested > PyTables' version that has been in production for more than a year now. > > In case you want to know more in detail what has changed in this > version, have a look at ``RELEASE_NOTES.txt`` in the tarball. > > > Where I can find Numexpr? > ========================= > > The project is hosted at Google code in: > > http://code.google.com/p/numexpr/ > > > Share your experience > ===================== > > Let us know of any bugs, suggestions, gripes, kudos, etc. you may > have. > > > Enjoy! > > -- > Francesc Alted > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- A non-text attachment was scrubbed... Name: exprtest.py Type: text/x-python Size: 2077 bytes Desc: not available URL: From jonathan.taylor at utoronto.ca Mon Jan 19 21:23:43 2009 From: jonathan.taylor at utoronto.ca (Jonathan Taylor) Date: Mon, 19 Jan 2009 21:23:43 -0500 Subject: [Numpy-discussion] Testing for close to zero? In-Reply-To: <3d375d730901191255o13677721v574a69e8670ffad1@mail.gmail.com> References: <463e11f90901191243u2fc1c50u3110420c2af859d5@mail.gmail.com> <3d375d730901191255o13677721v574a69e8670ffad1@mail.gmail.com> Message-ID: <463e11f90901191823r4f778efdwd8373025cbc8182b@mail.gmail.com> Interesting. That makes sense and I suppose that also explains why there is no function to do this sort of thing for you. Jon. On Mon, Jan 19, 2009 at 3:55 PM, Robert Kern wrote: > On Mon, Jan 19, 2009 at 14:43, Jonathan Taylor > wrote: >> Hi, >> >> When solving a quadratic equation I get that alpha = >> -3.78336776728e-31 which I believe to be far below machine precision: >> >> finfo(float).eps >> 2.2204460492503131e-16 >> >> But an if statement like: >> >> if alpha == 0: >> ... >> >> does not catch this. Is there a better way to check for things that >> are essentially zero or should I really be using >> >> if np.abs(alpha) < finfo(float).eps: >> ... > > Almost. You should scale eps by some estimate of the size of the > problem. Exactly how you should do this depends on the problem, > though. Errors accumulate in different ways depending on the > operations you perform on the numbers. Multiplying eps by > max(abs(array_of_inputs)) is probably a reasonable starting point. > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." > -- Umberto Eco > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From charlesr.harris at gmail.com Mon Jan 19 23:09:00 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 19 Jan 2009 21:09:00 -0700 Subject: [Numpy-discussion] Testing for close to zero? 
In-Reply-To: <463e11f90901191823r4f778efdwd8373025cbc8182b@mail.gmail.com> References: <463e11f90901191243u2fc1c50u3110420c2af859d5@mail.gmail.com> <3d375d730901191255o13677721v574a69e8670ffad1@mail.gmail.com> <463e11f90901191823r4f778efdwd8373025cbc8182b@mail.gmail.com> Message-ID: On Mon, Jan 19, 2009 at 7:23 PM, Jonathan Taylor < jonathan.taylor at utoronto.ca> wrote: > Interesting. That makes sense and I suppose that also explains why > there is no function to do this sort of thing for you. > A combination of relative and absolute errors is another common solution, i.e., test against relerr*max(abs(array_of_inputs)) + abserr. In cases like this relerr is typically eps and abserr tends to be something like 1e-12, which keeps you from descending towards zero any further than you need to. Chuck > > Jon. > > On Mon, Jan 19, 2009 at 3:55 PM, Robert Kern > wrote: > > On Mon, Jan 19, 2009 at 14:43, Jonathan Taylor > > wrote: > >> Hi, > >> > >> When solving a quadratic equation I get that alpha = > >> -3.78336776728e-31 which I believe to be far below machine precision: > >> > >> finfo(float).eps > >> 2.2204460492503131e-16 > >> > >> But an if statement like: > >> > >> if alpha == 0: > >> ... > >> > >> does not catch this. Is there a better way to check for things that > >> are essentially zero or should I really be using > >> > >> if np.abs(alpha) < finfo(float).eps: > >> ... > > > > Almost. You should scale eps by some estimate of the size of the > > problem. Exactly how you should do this depends on the problem, > > though. Errors accumulate in different ways depending on the > > operations you perform on the numbers. Multiplying eps by > > max(abs(array_of_inputs)) is probably a reasonable starting point. > > > > > -- > > Robert Kern > > > > "I have come to believe that the whole world is an enigma, a harmless > > enigma that is made terrible by our own mad attempt to interpret it as > > though it had an underlying truth." > > -- Umberto Eco > > _______________________________________________ > > Numpy-discussion mailing list > > Numpy-discussion at scipy.org > > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Mon Jan 19 23:17:02 2009 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 19 Jan 2009 22:17:02 -0600 Subject: [Numpy-discussion] Testing for close to zero? In-Reply-To: References: <463e11f90901191243u2fc1c50u3110420c2af859d5@mail.gmail.com> <3d375d730901191255o13677721v574a69e8670ffad1@mail.gmail.com> <463e11f90901191823r4f778efdwd8373025cbc8182b@mail.gmail.com> Message-ID: <3d375d730901192017n2a378a5aybd8e239e5e7f97@mail.gmail.com> On Mon, Jan 19, 2009 at 22:09, Charles R Harris wrote: > > > On Mon, Jan 19, 2009 at 7:23 PM, Jonathan Taylor > wrote: >> >> Interesting. That makes sense and I suppose that also explains why >> there is no function to do this sort of thing for you. > > A combination of relative and absolute errors is another common solution, > i.e., test against relerr*max(abs(array_of_inputs)) + abserr. In cases like > this relerr is typically eps and abserr tends to be something like 1e-12, > which keeps you from descending towards zero any further than you need to. I don't think the absolute error term is appropriate in this case. 
If all of my inputs are of the size 1e-12, I would expect a result of 1e-14 to be significantly far from 0. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From charlesr.harris at gmail.com Tue Jan 20 00:36:02 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 19 Jan 2009 22:36:02 -0700 Subject: [Numpy-discussion] Testing for close to zero? In-Reply-To: <3d375d730901192017n2a378a5aybd8e239e5e7f97@mail.gmail.com> References: <463e11f90901191243u2fc1c50u3110420c2af859d5@mail.gmail.com> <3d375d730901191255o13677721v574a69e8670ffad1@mail.gmail.com> <463e11f90901191823r4f778efdwd8373025cbc8182b@mail.gmail.com> <3d375d730901192017n2a378a5aybd8e239e5e7f97@mail.gmail.com> Message-ID: On Mon, Jan 19, 2009 at 9:17 PM, Robert Kern wrote: > On Mon, Jan 19, 2009 at 22:09, Charles R Harris > wrote: > > > > > > On Mon, Jan 19, 2009 at 7:23 PM, Jonathan Taylor > > wrote: > >> > >> Interesting. That makes sense and I suppose that also explains why > >> there is no function to do this sort of thing for you. > > > > A combination of relative and absolute errors is another common solution, > > i.e., test against relerr*max(abs(array_of_inputs)) + abserr. In cases > like > > this relerr is typically eps and abserr tends to be something like 1e-12, > > which keeps you from descending towards zero any further than you need > to. > > I don't think the absolute error term is appropriate in this case. If > all of my inputs are of the size 1e-12, I would expect a result of > 1e-14 to be significantly far from 0. > Sure, that's why you *chose* constants appropriate to the problem. As to this case, I don't know what the quadratic is or what methods are being used to solve it, or even if the methods are appropriate. So the comment was general and I think many numeric methods for solving equations use some variant of the combination. For instance, the 1D zero finders in Scipy use it. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Tue Jan 20 00:48:34 2009 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 19 Jan 2009 23:48:34 -0600 Subject: [Numpy-discussion] Testing for close to zero? In-Reply-To: References: <463e11f90901191243u2fc1c50u3110420c2af859d5@mail.gmail.com> <3d375d730901191255o13677721v574a69e8670ffad1@mail.gmail.com> <463e11f90901191823r4f778efdwd8373025cbc8182b@mail.gmail.com> <3d375d730901192017n2a378a5aybd8e239e5e7f97@mail.gmail.com> Message-ID: <3d375d730901192148o7da2cc55s788b9e92987167ae@mail.gmail.com> On Mon, Jan 19, 2009 at 23:36, Charles R Harris wrote: > > On Mon, Jan 19, 2009 at 9:17 PM, Robert Kern wrote: >> >> On Mon, Jan 19, 2009 at 22:09, Charles R Harris >> wrote: >> > >> > >> > On Mon, Jan 19, 2009 at 7:23 PM, Jonathan Taylor >> > wrote: >> >> >> >> Interesting. That makes sense and I suppose that also explains why >> >> there is no function to do this sort of thing for you. >> > >> > A combination of relative and absolute errors is another common >> > solution, >> > i.e., test against relerr*max(abs(array_of_inputs)) + abserr. In cases >> > like >> > this relerr is typically eps and abserr tends to be something like >> > 1e-12, >> > which keeps you from descending towards zero any further than you need >> > to. >> >> I don't think the absolute error term is appropriate in this case. 
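The distinction can be made concrete. In the sketch below, the relative-only test scales eps by the algorithm's inputs, while the mixed test adds an absolute floor for comparing two bare numbers in ignorance of those inputs (all thresholds illustrative):

import numpy as np

eps = np.finfo(float).eps
inputs = np.array([2.0, -1.5, 0.5])      # coefficients fed to the algorithm
alpha = -3.78336776728e-31               # computed result to test against zero

# Relative-only test, scaled from the inputs:
print(abs(alpha) < eps * np.max(np.abs(inputs)))            # True

# Mixed relative + absolute test on two bare numbers:
relerr, abserr = eps, 1e-12
x, y = alpha, 0.0
print(abs(x - y) <= relerr * max(abs(x), abs(y)) + abserr)  # True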
If >> all of my inputs are of the size 1e-12, I would expect a result of >> 1e-14 to be significantly far from 0. > > Sure, that's why you *chose* constants appropriate to the problem. But that's what eps*max(abs(array_of_inputs)) is supposed to do. In the formulation that you are using (e.g. that of assert_arrays_almost_equal()), the absolute error comes into play when you are comparing two numbers in ignorance of the processes that created them. The relative error in that formula is being adjusted by the size of the two numbers (*not* the inputs to the algorithm). The two numbers may be close to 0, but the relevant inputs to the algorithm may be ~1, let's say. In that case, you need the absolute error term to provide the scale information that is otherwise not present in the comparison. But if you know what the inputs to the calculation were, you can estimate the scale factor for the relative tolerance directly (rigorously, if you've done the numerical analysis) and the absolute tolerance is supernumerary. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From charlesr.harris at gmail.com Tue Jan 20 01:21:19 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 19 Jan 2009 23:21:19 -0700 Subject: [Numpy-discussion] Testing for close to zero? In-Reply-To: <3d375d730901192148o7da2cc55s788b9e92987167ae@mail.gmail.com> References: <463e11f90901191243u2fc1c50u3110420c2af859d5@mail.gmail.com> <3d375d730901191255o13677721v574a69e8670ffad1@mail.gmail.com> <463e11f90901191823r4f778efdwd8373025cbc8182b@mail.gmail.com> <3d375d730901192017n2a378a5aybd8e239e5e7f97@mail.gmail.com> <3d375d730901192148o7da2cc55s788b9e92987167ae@mail.gmail.com> Message-ID: On Mon, Jan 19, 2009 at 10:48 PM, Robert Kern wrote: > On Mon, Jan 19, 2009 at 23:36, Charles R Harris > wrote: > > > > On Mon, Jan 19, 2009 at 9:17 PM, Robert Kern > wrote: > >> > >> On Mon, Jan 19, 2009 at 22:09, Charles R Harris > >> wrote: > >> > > >> > > >> > On Mon, Jan 19, 2009 at 7:23 PM, Jonathan Taylor > >> > wrote: > >> >> > >> >> Interesting. That makes sense and I suppose that also explains why > >> >> there is no function to do this sort of thing for you. > >> > > >> > A combination of relative and absolute errors is another common > >> > solution, > >> > i.e., test against relerr*max(abs(array_of_inputs)) + abserr. In cases > >> > like > >> > this relerr is typically eps and abserr tends to be something like > >> > 1e-12, > >> > which keeps you from descending towards zero any further than you need > >> > to. > >> > >> I don't think the absolute error term is appropriate in this case. If > >> all of my inputs are of the size 1e-12, I would expect a result of > >> 1e-14 to be significantly far from 0. > > > > Sure, that's why you *chose* constants appropriate to the problem. > > But that's what eps*max(abs(array_of_inputs)) is supposed to do. > > In the formulation that you are using (e.g. that of > assert_arrays_almost_equal()), the absolute error comes into play when > you are comparing two numbers in ignorance of the processes that > created them. The relative error in that formula is being adjusted by > the size of the two numbers (*not* the inputs to the algorithm). The > two numbers may be close to 0, but the relevant inputs to the > algorithm may be ~1, let's say. 
In that case, you need the absolute > error term to provide the scale information that is otherwise not > present in the comparison. > > But if you know what the inputs to the calculation were, you can > estimate the scale factor for the relative tolerance directly > (rigorously, if you've done the numerical analysis) and the absolute > tolerance is supernumerary. So you do bisection on an oddball curve, 512 iterations later you hit zero... Or you do numeric integration where there is lots of cancellation. These problems aren't new and the mixed method for tolerance is quite standard and has been for many years. I don't see why you want to argue about it, if you don't like the combined method, set the absolute error to zero, problem solved. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Tue Jan 20 01:26:22 2009 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 20 Jan 2009 00:26:22 -0600 Subject: [Numpy-discussion] Testing for close to zero? In-Reply-To: References: <463e11f90901191243u2fc1c50u3110420c2af859d5@mail.gmail.com> <3d375d730901191255o13677721v574a69e8670ffad1@mail.gmail.com> <463e11f90901191823r4f778efdwd8373025cbc8182b@mail.gmail.com> <3d375d730901192017n2a378a5aybd8e239e5e7f97@mail.gmail.com> <3d375d730901192148o7da2cc55s788b9e92987167ae@mail.gmail.com> Message-ID: <3d375d730901192226t32f35a1dta4409dca68f3656a@mail.gmail.com> On Tue, Jan 20, 2009 at 00:21, Charles R Harris wrote: > > On Mon, Jan 19, 2009 at 10:48 PM, Robert Kern wrote: >> >> On Mon, Jan 19, 2009 at 23:36, Charles R Harris >> wrote: >> > >> > On Mon, Jan 19, 2009 at 9:17 PM, Robert Kern >> > wrote: >> >> >> >> On Mon, Jan 19, 2009 at 22:09, Charles R Harris >> >> wrote: >> >> > >> >> > >> >> > On Mon, Jan 19, 2009 at 7:23 PM, Jonathan Taylor >> >> > wrote: >> >> >> >> >> >> Interesting. That makes sense and I suppose that also explains why >> >> >> there is no function to do this sort of thing for you. >> >> > >> >> > A combination of relative and absolute errors is another common >> >> > solution, >> >> > i.e., test against relerr*max(abs(array_of_inputs)) + abserr. In >> >> > cases >> >> > like >> >> > this relerr is typically eps and abserr tends to be something like >> >> > 1e-12, >> >> > which keeps you from descending towards zero any further than you >> >> > need >> >> > to. >> >> >> >> I don't think the absolute error term is appropriate in this case. If >> >> all of my inputs are of the size 1e-12, I would expect a result of >> >> 1e-14 to be significantly far from 0. >> > >> > Sure, that's why you *chose* constants appropriate to the problem. >> >> But that's what eps*max(abs(array_of_inputs)) is supposed to do. >> >> In the formulation that you are using (e.g. that of >> assert_arrays_almost_equal()), the absolute error comes into play when >> you are comparing two numbers in ignorance of the processes that >> created them. The relative error in that formula is being adjusted by >> the size of the two numbers (*not* the inputs to the algorithm). The >> two numbers may be close to 0, but the relevant inputs to the >> algorithm may be ~1, let's say. In that case, you need the absolute >> error term to provide the scale information that is otherwise not >> present in the comparison. >> >> But if you know what the inputs to the calculation were, you can >> estimate the scale factor for the relative tolerance directly >> (rigorously, if you've done the numerical analysis) and the absolute >> tolerance is supernumerary. 
> > > So you do bisection on an oddball curve, 512 iterations later you hit > zero... Or you do numeric integration where there is lots of cancellation. > These problems aren't new and the mixed method for tolerance is quite > standard and has been for many years. I don't see why you want to argue > about it, if you don't like the combined method, set the absolute error to > zero, problem solved. I think we're talking about different things. I'm talking about the way to estimate a good value for the absolute error. My array_of_inputs was not the values that you are comparing to zero, but the inputs to the algorithm that created the value you are comparing to zero. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From charlesr.harris at gmail.com Tue Jan 20 01:47:33 2009 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 19 Jan 2009 23:47:33 -0700 Subject: [Numpy-discussion] Testing for close to zero? In-Reply-To: <3d375d730901192226t32f35a1dta4409dca68f3656a@mail.gmail.com> References: <463e11f90901191243u2fc1c50u3110420c2af859d5@mail.gmail.com> <3d375d730901191255o13677721v574a69e8670ffad1@mail.gmail.com> <463e11f90901191823r4f778efdwd8373025cbc8182b@mail.gmail.com> <3d375d730901192017n2a378a5aybd8e239e5e7f97@mail.gmail.com> <3d375d730901192148o7da2cc55s788b9e92987167ae@mail.gmail.com> <3d375d730901192226t32f35a1dta4409dca68f3656a@mail.gmail.com> Message-ID: On Mon, Jan 19, 2009 at 11:26 PM, Robert Kern wrote: > On Tue, Jan 20, 2009 at 00:21, Charles R Harris > wrote: > > > > On Mon, Jan 19, 2009 at 10:48 PM, Robert Kern > wrote: > >> > >> On Mon, Jan 19, 2009 at 23:36, Charles R Harris > >> wrote: > >> > > >> > On Mon, Jan 19, 2009 at 9:17 PM, Robert Kern > >> > wrote: > >> >> > >> >> On Mon, Jan 19, 2009 at 22:09, Charles R Harris > >> >> wrote: > >> >> > > >> >> > > >> >> > On Mon, Jan 19, 2009 at 7:23 PM, Jonathan Taylor > >> >> > wrote: > >> >> >> > >> >> >> Interesting. That makes sense and I suppose that also explains > why > >> >> >> there is no function to do this sort of thing for you. > >> >> > > >> >> > A combination of relative and absolute errors is another common > >> >> > solution, > >> >> > i.e., test against relerr*max(abs(array_of_inputs)) + abserr. In > >> >> > cases > >> >> > like > >> >> > this relerr is typically eps and abserr tends to be something like > >> >> > 1e-12, > >> >> > which keeps you from descending towards zero any further than you > >> >> > need > >> >> > to. > >> >> > >> >> I don't think the absolute error term is appropriate in this case. If > >> >> all of my inputs are of the size 1e-12, I would expect a result of > >> >> 1e-14 to be significantly far from 0. > >> > > >> > Sure, that's why you *chose* constants appropriate to the problem. > >> > >> But that's what eps*max(abs(array_of_inputs)) is supposed to do. > >> > >> In the formulation that you are using (e.g. that of > >> assert_arrays_almost_equal()), the absolute error comes into play when > >> you are comparing two numbers in ignorance of the processes that > >> created them. The relative error in that formula is being adjusted by > >> the size of the two numbers (*not* the inputs to the algorithm). The > >> two numbers may be close to 0, but the relevant inputs to the > >> algorithm may be ~1, let's say. 
In that case, you need the absolute > >> error term to provide the scale information that is otherwise not > >> present in the comparison. > >> > >> But if you know what the inputs to the calculation were, you can > >> estimate the scale factor for the relative tolerance directly > >> (rigorously, if you've done the numerical analysis) and the absolute > >> tolerance is supernumerary. > > > > > > So you do bisection on an oddball curve, 512 iterations later you hit > > zero... Or you do numeric integration where there is lots of > cancellation. > > These problems aren't new and the mixed method for tolerance is quite > > standard and has been for many years. I don't see why you want to argue > > about it, if you don't like the combined method, set the absolute error > to > > zero, problem solved. > > I think we're talking about different things. I'm talking about the > way to estimate a good value for the absolute error. My > array_of_inputs was not the values that you are comparing to zero, but > the inputs to the algorithm that created the value you are comparing > to zero. > Ah. But that won't generally work for polynomials, they are too ill conditioned with respect to the coefficients. Even quadratics solved using the standard formula with the +/- can be ill conditioned. And that isn't to mention that the zeros are scale invariant, i.e., you can multiply the whole equation by some ginormous number and the zeros will remain the same. It's fun for a rainy day to check the scale invariance of the zero estimates of various solution algorithms. On the other hand, that method of estimating the error might work for integrals if the result scales with the input parameters. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From nwagner at iam.uni-stuttgart.de Tue Jan 20 04:17:03 2009 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Tue, 20 Jan 2009 10:17:03 +0100 Subject: [Numpy-discussion] Examples for numpy.genfromtxt Message-ID: Hi all, Where can I find some sophisticated examples for the usage of numpy.genfromtxt ? Nils From faltet at pytables.org Tue Jan 20 09:13:01 2009 From: faltet at pytables.org (Francesc Alted) Date: Tue, 20 Jan 2009 15:13:01 +0100 Subject: [Numpy-discussion] ANN: Numexpr 1.1, an efficient array evaluator In-Reply-To: References: <200901161300.26130.faltet@pytables.org> Message-ID: <200901201513.01432.faltet@pytables.org> A Tuesday 20 January 2009, Andrew Collette escrigu?: > Hi Francesc, > > Looks like a cool project! However, I'm not able to achieve the > advertised speed-ups. I wrote a simple script to try three > approaches to this kind of problem: > > 1) Native Python code (i.e. will try to do everything at once using > temp arrays) 2) Straightforward numexpr evaluation > 3) Simple "chunked" evaluation using array.flat views. (This solves > the memory problem and allows the use of arbitrary Python > expressions). > > I've attached the script; here's the output for the expression > "63 + (a*b) + (c**2) + sin(b)" > along with a few combinations of shapes/dtypes. As expected, using > anything other than "f8" (double) results in a performance penalty. > Surprisingly, it seems that using chunks via array.flat results in > similar performance for f8, and even better performance for other > dtypes. [clip] Well, there were two issues there. The first one is that when transcendental functions are used (like sin() above), the bottleneck is on the CPU instead of memory bandwidth, so numexpr speedups are not so high as usual. 
The other issue was an actual bug in the numexpr code that forced a copy of all multidimensional arrays (I normally only use unidimensional arrays for doing benchmarks). This has been fixed in trunk (r39). So, with the fix on, the timings are:

(100, 100, 100) f4 (average of 10 runs)
Simple: 0.0426136016846
Numexpr: 0.11350851059
Chunked: 0.0635252952576
(100, 100, 100) f8 (average of 10 runs)
Simple: 0.119254398346
Numexpr: 0.10092959404
Chunked: 0.128384995461

The speed-up is now a mere 20% (for f8), but at least it is not slower. With the patches that Georg recently contributed for using Intel's VML, the acceleration is a bit better:

(100, 100, 100) f4 (average of 10 runs)
Simple: 0.0417867898941
Numexpr: 0.0944641113281
Chunked: 0.0636183023453
(100, 100, 100) f8 (average of 10 runs)
Simple: 0.120059680939
Numexpr: 0.0832288980484
Chunked: 0.128114104271

i.e. the speed-up is around 45% (for f8). Moreover, if I get rid of the sin() function and use the expression:

"63 + (a*b) + (c**2) + b"

I get:

(100, 100, 100) f4 (average of 10 runs)
Simple: 0.0119329929352
Numexpr: 0.0198570966721
Chunked: 0.0338240146637
(100, 100, 100) f8 (average of 10 runs)
Simple: 0.0255623102188
Numexpr: 0.00832500457764
Chunked: 0.0340095996857

which has a 3.1x speedup (for f8).

> FYI, the current tar file (1.1-1) has a glitch related to the VERSION
> file; I added to the bug report at google code.

Thanks. Will focus on that asap. Mmm, seems like there is stuff enough for another release of numexpr. I'll try to do it soon.

Cheers,

--
Francesc Alted

From fperez.net at gmail.com Tue Jan 20 13:56:06 2009
From: fperez.net at gmail.com (Fernando Perez)
Date: Tue, 20 Jan 2009 10:56:06 -0800
Subject: [Numpy-discussion] Please don't use google code for hosting
In-Reply-To: <1e2af89e0901191120k6b2880betb7e43c2607c1336d@mail.gmail.com>
References: <1e2af89e0901161607x277d68f6u8f7e897aa5e46c77@mail.gmail.com> <4971AD57.2000605@mur.at> <1e2af89e0901191120k6b2880betb7e43c2607c1336d@mail.gmail.com>
Message-ID:

On Mon, Jan 19, 2009 at 11:20 AM, Matthew Brett wrote:
> Hi,
>
>> Do you also know how the situation is with sourceforge/launchpad/trac... and other popular hosting systems? Do they also have these restrictions?
>
> I've not noticed any problems with sourceforge, nor launchpad - I'm using them regularly from here. You'd hope that was the case for launchpad - when my browser started this morning, it reminded me of the Ubuntu / Canonical mission - http://www.canonical.com/aboutus
>
> 1) delivering the world's best free software platform
> 2) ensuring its availability to everyone

Also, remember that launchpad makes it pretty trivial to set up a bzr branch off an external SVN or other repo. So it should be very easy to create launchpad projects that track existing google code ones you are interested in. Not arguing with your original point, just providing more info on a viable workaround in the meantime, especially since it's quite likely that projects already on google code will not switch hosts due to this issue.

Best,

f

From h5py at alfven.org Tue Jan 20 14:21:12 2009
From: h5py at alfven.org (Andrew Collette)
Date: Tue, 20 Jan 2009 11:21:12 -0800
Subject: [Numpy-discussion] ANN: Numexpr 1.1, an efficient array evaluator
In-Reply-To: <200901201513.01432.faltet@pytables.org>
References: <200901161300.26130.faltet@pytables.org> <200901201513.01432.faltet@pytables.org>
Message-ID:

Works much, much better with the current svn version.
:) Numexpr now outperforms everything except the "simple" technique, and then only for small data sets. Along the lines you mentioned I noticed that simply changing from a shape of (100*100*100,) to (100, 100, 100) results in nearly a factor of 2 worse performance, a factor which seems constant when changing the size of the data set. Is this related to the way numexpr handles broadcasting rules? It would seem the memory contents should be identical for these two cases. Andrew On Tue, Jan 20, 2009 at 6:13 AM, Francesc Alted wrote: > A Tuesday 20 January 2009, Andrew Collette escrigu?: >> Hi Francesc, >> >> Looks like a cool project! However, I'm not able to achieve the >> advertised speed-ups. I wrote a simple script to try three >> approaches to this kind of problem: >> >> 1) Native Python code (i.e. will try to do everything at once using >> temp arrays) 2) Straightforward numexpr evaluation >> 3) Simple "chunked" evaluation using array.flat views. (This solves >> the memory problem and allows the use of arbitrary Python >> expressions). >> >> I've attached the script; here's the output for the expression >> "63 + (a*b) + (c**2) + sin(b)" >> along with a few combinations of shapes/dtypes. As expected, using >> anything other than "f8" (double) results in a performance penalty. >> Surprisingly, it seems that using chunks via array.flat results in >> similar performance for f8, and even better performance for other >> dtypes. > [clip] > > Well, there were two issues there. The first one is that when > transcendental functions are used (like sin() above), the bottleneck is > on the CPU instead of memory bandwidth, so numexpr speedups are not so > high as usual. The other issue was an actual bug in the numexpr code > that forced a copy of all multidimensional arrays (I normally only use > undimensional arrays for doing benchmarks). This has been fixed in > trunk (r39). > > So, with the fix on, the timings are: > > (100, 100, 100) f4 (average of 10 runs) > Simple: 0.0426136016846 > Numexpr: 0.11350851059 > Chunked: 0.0635252952576 > (100, 100, 100) f8 (average of 10 runs) > Simple: 0.119254398346 > Numexpr: 0.10092959404 > Chunked: 0.128384995461 > > The speed-up is now a mere 20% (for f8), but at least it is not slower. > With the patches that recently contributed Georg for using Intel's VML, > the acceleration is a bit better: > > (100, 100, 100) f4 (average of 10 runs) > Simple: 0.0417867898941 > Numexpr: 0.0944641113281 > Chunked: 0.0636183023453 > (100, 100, 100) f8 (average of 10 runs) > Simple: 0.120059680939 > Numexpr: 0.0832288980484 > Chunked: 0.128114104271 > > i.e. the speed-up is around 45% (for f8). > > Moreover, if I get rid of the sin() function and use the expresion: > > "63 + (a*b) + (c**2) + b" > > I get: > > (100, 100, 100) f4 (average of 10 runs) > Simple: 0.0119329929352 > Numexpr: 0.0198570966721 > Chunked: 0.0338240146637 > (100, 100, 100) f8 (average of 10 runs) > Simple: 0.0255623102188 > Numexpr: 0.00832500457764 > Chunked: 0.0340095996857 > > which has a 3.1x speedup (for f8). > >> FYI, the current tar file (1.1-1) has a glitch related to the VERSION >> file; I added to the bug report at google code. > > Thanks. Will focus on that asap. Mmm, seems like there is stuff enough > for another release of numexpr. I'll try to do it soon. 
> > Cheers, > > -- > Francesc Alted > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From pgmdevlist at gmail.com Tue Jan 20 15:01:30 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Tue, 20 Jan 2009 15:01:30 -0500 Subject: [Numpy-discussion] Examples for numpy.genfromtxt In-Reply-To: References: Message-ID: Till I write some proper doc, you can check the examples in tests/ test_io (TestFromTxt suitcase) On Jan 20, 2009, at 4:17 AM, Nils Wagner wrote: > Hi all, > > Where can I find some sophisticated examples for the usage > of numpy.genfromtxt ? > > > Nils > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From timmichelsen at gmx-topmail.de Mon Jan 19 13:26:22 2009 From: timmichelsen at gmx-topmail.de (Tim Michelsen) Date: Mon, 19 Jan 2009 19:26:22 +0100 Subject: [Numpy-discussion] Please don't use google code for hosting In-Reply-To: <4971AD57.2000605@mur.at> References: <1e2af89e0901161607x277d68f6u8f7e897aa5e46c77@mail.gmail.com> <4971AD57.2000605@mur.at> Message-ID: Hello, last year there has been a discussion on this on the OSGEO list about the same issue. You may check oggeo.discuss at Gmane or Nabble for it. Kind regards, Timmie From robert.kern at gmail.com Tue Jan 20 18:32:16 2009 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 20 Jan 2009 17:32:16 -0600 Subject: [Numpy-discussion] Please don't use google code for hosting In-Reply-To: References: <1e2af89e0901161607x277d68f6u8f7e897aa5e46c77@mail.gmail.com> <4971AD57.2000605@mur.at> Message-ID: <3d375d730901201532o1ffc623fl57fe73b16e8f39d3@mail.gmail.com> On Mon, Jan 19, 2009 at 12:26, Tim Michelsen wrote: > Hello, > last year there has been a discussion on this on the OSGEO list about > the same issue. > > You may check oggeo.discuss at Gmane or Nabble for it. I must say that Wilfred L. Guerin's opinions on the subject were quite entertaining and either the result of paranoid delusions or a certain Mark V. Shaney. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From ndbecker2 at gmail.com Tue Jan 20 21:09:28 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Tue, 20 Jan 2009 21:09:28 -0500 Subject: [Numpy-discussion] python numpy code many times slower than c++ Message-ID: I tried a little experiment, implementing some code in numpy (usually I build modules in c++ to interface to python). Since these operations are all large vectors, I hoped it would be reasonably efficient. The code in question is simple. It is a model of an amplifier, modeled by it's AM/AM and AM/PM characteristics. The function in question is the __call__ operator. The test program plots a spectrum, calling this operator 1024 times each time with a vector of 4096. Any ideas? The code is not too big, so I'll try to attach it. -------------- next part -------------- A non-text attachment was scrubbed... Name: ampl.py Type: text/x-python Size: 2961 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: linear_interp.py Type: text/x-python Size: 851 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... 
Name: plot_spectrum.py Type: text/x-python Size: 4618 bytes Desc: not available URL: From robert.kern at gmail.com Tue Jan 20 21:16:39 2009 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 20 Jan 2009 20:16:39 -0600 Subject: [Numpy-discussion] python numpy code many times slower than c++ In-Reply-To: References: Message-ID: <3d375d730901201816k1c2d3d24v759f1ff34f1eb0b3@mail.gmail.com> 2009/1/20 Neal Becker : > I tried a little experiment, implementing some code in numpy (usually I > build modules in c++ to interface to python). Since these operations are > all large vectors, I hoped it would be reasonably efficient. > > The code in question is simple. It is a model of an amplifier, modeled by > it's AM/AM and AM/PM characteristics. > > The function in question is the __call__ operator. The test program plots a > spectrum, calling this operator 1024 times each time with a vector of 4096. > > Any ideas? The code is not too big, so I'll try to attach it. Any chance you can make it self-contained? -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From robert.kern at gmail.com Tue Jan 20 21:22:52 2009 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 20 Jan 2009 20:22:52 -0600 Subject: [Numpy-discussion] python numpy code many times slower than c++ In-Reply-To: References: Message-ID: <3d375d730901201822g5a7e5e9fi3d341502b0d59878@mail.gmail.com> 2009/1/20 Neal Becker : > I tried a little experiment, implementing some code in numpy (usually I > build modules in c++ to interface to python). Since these operations are > all large vectors, I hoped it would be reasonably efficient. > > The code in question is simple. It is a model of an amplifier, modeled by > it's AM/AM and AM/PM characteristics. > > The function in question is the __call__ operator. The test program plots a > spectrum, calling this operator 1024 times each time with a vector of 4096. If you want to find out what lines in that function are taking the most time, you can try my line_profiler module: http://www.enthought.com/~rkern/cgi-bin/hgwebdir.cgi/line_profiler/ That might give us a better idea in the absence of a self-contained example. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From ndbecker2 at gmail.com Tue Jan 20 21:44:23 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Tue, 20 Jan 2009 21:44:23 -0500 Subject: [Numpy-discussion] python numpy code many times slower than c++ References: <3d375d730901201822g5a7e5e9fi3d341502b0d59878@mail.gmail.com> Message-ID: Robert Kern wrote: > 2009/1/20 Neal Becker : >> I tried a little experiment, implementing some code in numpy (usually I >> build modules in c++ to interface to python). Since these operations are >> all large vectors, I hoped it would be reasonably efficient. >> >> The code in question is simple. It is a model of an amplifier, modeled >> by it's AM/AM and AM/PM characteristics. >> >> The function in question is the __call__ operator. The test program >> plots a spectrum, calling this operator 1024 times each time with a >> vector of 4096. 
> > If you want to find out what lines in that function are taking the > most time, you can try my line_profiler module: > > http://www.enthought.com/~rkern/cgi-bin/hgwebdir.cgi/line_profiler/ > > That might give us a better idea in the absence of a self-contained > example. > Sounds interesting, I'll give that a try. But, not sure how to use it. If my main script is plot_spectrum.py, and I want to profile the ampl.__call__ function (defined in ampl.py), what do I need to do? I tried running kernprof.py plot_spectrum.py having added @profile decorators in ampl.py, but that didn't work: File "../mod/ampl.py", line 43, in ampl @profile NameError: name 'profile' is not defined From robert.kern at gmail.com Tue Jan 20 21:52:41 2009 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 20 Jan 2009 20:52:41 -0600 Subject: [Numpy-discussion] python numpy code many times slower than c++ In-Reply-To: References: <3d375d730901201822g5a7e5e9fi3d341502b0d59878@mail.gmail.com> Message-ID: <3d375d730901201852t255389abt3fbc1b6a1594d4ef@mail.gmail.com> On Tue, Jan 20, 2009 at 20:44, Neal Becker wrote: > Robert Kern wrote: > >> 2009/1/20 Neal Becker : >>> I tried a little experiment, implementing some code in numpy (usually I >>> build modules in c++ to interface to python). Since these operations are >>> all large vectors, I hoped it would be reasonably efficient. >>> >>> The code in question is simple. It is a model of an amplifier, modeled >>> by it's AM/AM and AM/PM characteristics. >>> >>> The function in question is the __call__ operator. The test program >>> plots a spectrum, calling this operator 1024 times each time with a >>> vector of 4096. >> >> If you want to find out what lines in that function are taking the >> most time, you can try my line_profiler module: >> >> http://www.enthought.com/~rkern/cgi-bin/hgwebdir.cgi/line_profiler/ >> >> That might give us a better idea in the absence of a self-contained >> example. >> > Sounds interesting, I'll give that a try. But, not sure how to use it. > > If my main script is plot_spectrum.py, and I want to profile the > ampl.__call__ function (defined in ampl.py), what do I need to do? I tried > running kernprof.py plot_spectrum.py having added @profile decorators in > ampl.py, but that didn't work: > File "../mod/ampl.py", line 43, in ampl > @profile > NameError: name 'profile' is not defined kernprof.py --line-by-line plot_spectrum.py -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From ndbecker2 at gmail.com Tue Jan 20 21:57:13 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Tue, 20 Jan 2009 21:57:13 -0500 Subject: [Numpy-discussion] python numpy code many times slower than c++ References: <3d375d730901201822g5a7e5e9fi3d341502b0d59878@mail.gmail.com> Message-ID: Robert Kern wrote: > 2009/1/20 Neal Becker : >> I tried a little experiment, implementing some code in numpy (usually I >> build modules in c++ to interface to python). Since these operations are >> all large vectors, I hoped it would be reasonably efficient. >> >> The code in question is simple. It is a model of an amplifier, modeled >> by it's AM/AM and AM/PM characteristics. >> >> The function in question is the __call__ operator. The test program >> plots a spectrum, calling this operator 1024 times each time with a >> vector of 4096. 
> > If you want to find out what lines in that function are taking the > most time, you can try my line_profiler module: > > http://www.enthought.com/~rkern/cgi-bin/hgwebdir.cgi/line_profiler/ > > That might give us a better idea in the absence of a self-contained > example. > I see the problem. Thanks for the great profiler! You ought to make this more widely known. It seems the big chunks of time are used in data conversion between numpy and my own vectors classes. Mine are wrappers around boost::ublas. The conversion must be falling back on a very inefficient method since there is no special code to handle numpy vectors. Not sure what is the best solution. It would be _great_ if I could make boost::python objects that export a buffer interface, but I have absolutely no idea how to do this (and so far noone else has volunteered any info on this). From robert.kern at gmail.com Tue Jan 20 22:00:14 2009 From: robert.kern at gmail.com (Robert Kern) Date: Tue, 20 Jan 2009 21:00:14 -0600 Subject: [Numpy-discussion] python numpy code many times slower than c++ In-Reply-To: References: <3d375d730901201822g5a7e5e9fi3d341502b0d59878@mail.gmail.com> Message-ID: <3d375d730901201900s8e07d1ds3e19435566d02d7@mail.gmail.com> On Tue, Jan 20, 2009 at 20:57, Neal Becker wrote: > I see the problem. Thanks for the great profiler! You ought to make this > more widely known. I'll be making a release shortly. > It seems the big chunks of time are used in data conversion between numpy > and my own vectors classes. Mine are wrappers around boost::ublas. The > conversion must be falling back on a very inefficient method since there is no > special code to handle numpy vectors. > > Not sure what is the best solution. It would be _great_ if I could make > boost::python objects that export a buffer interface, but I have absolutely > no idea how to do this (and so far noone else has volunteered any info on > this). Who's not volunteering information, boost::python or us? -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From tjhnson at gmail.com Wed Jan 21 02:25:41 2009 From: tjhnson at gmail.com (T J) Date: Tue, 20 Jan 2009 23:25:41 -0800 Subject: [Numpy-discussion] python numpy code many times slower than c++ In-Reply-To: References: <3d375d730901201822g5a7e5e9fi3d341502b0d59878@mail.gmail.com> Message-ID: On Tue, Jan 20, 2009 at 6:57 PM, Neal Becker wrote: > It seems the big chunks of time are used in data conversion between numpy > and my own vectors classes. Mine are wrappers around boost::ublas. The > conversion must be falling back on a very inefficient method since there is no > special code to handle numpy vectors. > > Not sure what is the best solution. It would be _great_ if I could make > boost::python objects that export a buffer interface, but I have absolutely > no idea how to do this (and so far noone else has volunteered any info on > this). > I'm not sure if I've understood everything here, but I think that pyublas provides exactly what you need. 
http://tiker.net/doc/pyublas/ From robert.kern at gmail.com Wed Jan 21 03:23:09 2009 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 21 Jan 2009 02:23:09 -0600 Subject: [Numpy-discussion] python numpy code many times slower than c++ In-Reply-To: References: <3d375d730901201822g5a7e5e9fi3d341502b0d59878@mail.gmail.com> Message-ID: <3d375d730901210023u56a260a9k24646e3fb9f41d70@mail.gmail.com> On Tue, Jan 20, 2009 at 20:57, Neal Becker wrote: > I see the problem. Thanks for the great profiler! You ought to make this > more widely known. http://pypi.python.org/pypi/line_profiler -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From faltet at pytables.org Wed Jan 21 06:41:34 2009 From: faltet at pytables.org (Francesc Alted) Date: Wed, 21 Jan 2009 12:41:34 +0100 Subject: [Numpy-discussion] ANN: Numexpr 1.1, an efficient array evaluator In-Reply-To: References: <200901161300.26130.faltet@pytables.org> <200901201513.01432.faltet@pytables.org> Message-ID: <200901211241.35007.faltet@pytables.org> A Tuesday 20 January 2009, Andrew Collette escrigu?: > Works much, much better with the current svn version. :) Numexpr now > outperforms everything except the "simple" technique, and then only > for small data sets. Correct. This is because of the cost of parsing the expression and initializing the virtual machine. However, as soon as the sizes of the operands exceeds the cache of your processor you are starting to see the improvement in performance. > Along the lines you mentioned I noticed that simply changing from a > shape of (100*100*100,) to (100, 100, 100) results in nearly a factor > of 2 worse performance, a factor which seems constant when changing > the size of the data set. Sorry, but I cannot reproduce this. When using the expression: "63 + (a*b) + (c**2) + b" I get on my machine (Core2 at 3 GHz, running openSUSE Linux 11.1): 1000000 f8 (average of 10 runs) Simple: 0.0278068065643 Numexpr: 0.00839750766754 Chunked: 0.0266514062881 (100, 100, 100) f8 (average of 10 runs) Simple: 0.0277318000793 Numexpr: 0.00851640701294 Chunked: 0.0346593856812 and these are the expected results (i.e. no change in performance due to multidimensional arrays). Even for larger arrays, I don't see nothing unexpected: 10000000 f8 (average of 10 runs) Simple: 0.334054994583 Numexpr: 0.110022115707 Chunked: 0.29678030014 (100, 100, 100, 10) f8 (average of 10 runs) Simple: 0.339299607277 Numexpr: 0.111632704735 Chunked: 0.375299096107 Can you tell us which platforms are you using? > Is this related to the way numexpr handles > broadcasting rules? It would seem the memory contents should be > identical for these two cases. > > Andrew > > On Tue, Jan 20, 2009 at 6:13 AM, Francesc Alted wrote: > > A Tuesday 20 January 2009, Andrew Collette escrigu?: > >> Hi Francesc, > >> > >> Looks like a cool project! However, I'm not able to achieve the > >> advertised speed-ups. I wrote a simple script to try three > >> approaches to this kind of problem: > >> > >> 1) Native Python code (i.e. will try to do everything at once > >> using temp arrays) 2) Straightforward numexpr evaluation > >> 3) Simple "chunked" evaluation using array.flat views. (This > >> solves the memory problem and allows the use of arbitrary Python > >> expressions). 
> >> > >> I've attached the script; here's the output for the expression > >> "63 + (a*b) + (c**2) + sin(b)" > >> along with a few combinations of shapes/dtypes. As expected, > >> using anything other than "f8" (double) results in a performance > >> penalty. Surprisingly, it seems that using chunks via array.flat > >> results in similar performance for f8, and even better performance > >> for other dtypes. > > > > [clip] > > > > Well, there were two issues there. The first one is that when > > transcendental functions are used (like sin() above), the > > bottleneck is on the CPU instead of memory bandwidth, so numexpr > > speedups are not so high as usual. The other issue was an actual > > bug in the numexpr code that forced a copy of all multidimensional > > arrays (I normally only use undimensional arrays for doing > > benchmarks). This has been fixed in trunk (r39). > > > > So, with the fix on, the timings are: > > > > (100, 100, 100) f4 (average of 10 runs) > > Simple: 0.0426136016846 > > Numexpr: 0.11350851059 > > Chunked: 0.0635252952576 > > (100, 100, 100) f8 (average of 10 runs) > > Simple: 0.119254398346 > > Numexpr: 0.10092959404 > > Chunked: 0.128384995461 > > > > The speed-up is now a mere 20% (for f8), but at least it is not > > slower. With the patches that recently contributed Georg for using > > Intel's VML, the acceleration is a bit better: > > > > (100, 100, 100) f4 (average of 10 runs) > > Simple: 0.0417867898941 > > Numexpr: 0.0944641113281 > > Chunked: 0.0636183023453 > > (100, 100, 100) f8 (average of 10 runs) > > Simple: 0.120059680939 > > Numexpr: 0.0832288980484 > > Chunked: 0.128114104271 > > > > i.e. the speed-up is around 45% (for f8). > > > > Moreover, if I get rid of the sin() function and use the expresion: > > > > "63 + (a*b) + (c**2) + b" > > > > I get: > > > > (100, 100, 100) f4 (average of 10 runs) > > Simple: 0.0119329929352 > > Numexpr: 0.0198570966721 > > Chunked: 0.0338240146637 > > (100, 100, 100) f8 (average of 10 runs) > > Simple: 0.0255623102188 > > Numexpr: 0.00832500457764 > > Chunked: 0.0340095996857 > > > > which has a 3.1x speedup (for f8). > > > >> FYI, the current tar file (1.1-1) has a glitch related to the > >> VERSION file; I added to the bug report at google code. > > > > Thanks. Will focus on that asap. Mmm, seems like there is stuff > > enough for another release of numexpr. I'll try to do it soon. > > > > Cheers, > > > > -- > > Francesc Alted > > _______________________________________________ > > Numpy-discussion mailing list > > Numpy-discussion at scipy.org > > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion -- Francesc Alted From malkarouri at yahoo.co.uk Wed Jan 21 06:55:36 2009 From: malkarouri at yahoo.co.uk (Muhammad Alkarouri) Date: Wed, 21 Jan 2009 11:55:36 +0000 (GMT) Subject: [Numpy-discussion] numpy and bitwise arrays? Message-ID: <923458.60721.qm@web27904.mail.ukl.yahoo.com> Hi everyone, I need to use a bitwise array, and I wanted to check what is the common practice using numpy. I expect to read binary strings (like '0011000001') of equal length from file and to save and manipulate them using numpy. I know that numpy have good implementations of bitwise operations, but I was wondering how to encode the data in the first place. As a Python long? As an int ndarray? Which int type? 
How to save and load such data in such a way that when I load it the information that it is a bit array is stored? Any help would be appreciated.

Regards,
Muhammad Alkarouri

From ndbecker2 at gmail.com Wed Jan 21 07:27:04 2009
From: ndbecker2 at gmail.com (Neal Becker)
Date: Wed, 21 Jan 2009 07:27:04 -0500
Subject: [Numpy-discussion] python numpy code many times slower than c++
References: <3d375d730901201822g5a7e5e9fi3d341502b0d59878@mail.gmail.com>
Message-ID:

T J wrote:
> On Tue, Jan 20, 2009 at 6:57 PM, Neal Becker wrote:
>> It seems the big chunks of time are used in data conversion between numpy and my own vectors classes. Mine are wrappers around boost::ublas. The conversion must be falling back on a very inefficient method since there is no special code to handle numpy vectors.
>>
>> Not sure what is the best solution. It would be _great_ if I could make boost::python objects that export a buffer interface, but I have absolutely no idea how to do this (and so far no one else has volunteered any info on this).
>
> I'm not sure if I've understood everything here, but I think that pyublas provides exactly what you need.
>
> http://tiker.net/doc/pyublas/

It might if I had used this for all of my c++ code, but I have a big library of c++ wrapped code that doesn't use pyublas. Pyublas takes numpy objects from python and allows the use of c++ ublas on it (without conversion). Most of my code doesn't use numpy, it uses plain ublas to represent vectors, and ublas handles storage. I can only interface to/from numpy with conversion. I'm interested in pyublas, but devel seems very quiet for a while.

From nwagner at iam.uni-stuttgart.de Wed Jan 21 07:29:18 2009
From: nwagner at iam.uni-stuttgart.de (Nils Wagner)
Date: Wed, 21 Jan 2009 13:29:18 +0100
Subject: [Numpy-discussion] savetxt and blanks
Message-ID:

Hi all,

Is it possible to add a certain number of blanks behind the second column in connection with savetxt?

from numpy.random import rand
from numpy import savetxt
A = rand(100,2)
savetxt('noblanks.dat',A,fmt='%10.2f %10.2f')

Nils

From ndbecker2 at gmail.com Wed Jan 21 07:28:50 2009
From: ndbecker2 at gmail.com (Neal Becker)
Date: Wed, 21 Jan 2009 07:28:50 -0500
Subject: [Numpy-discussion] python numpy code many times slower than c++
References: <3d375d730901201822g5a7e5e9fi3d341502b0d59878@mail.gmail.com> <3d375d730901201900s8e07d1ds3e19435566d02d7@mail.gmail.com>
Message-ID:

Robert Kern wrote:
> On Tue, Jan 20, 2009 at 20:57, Neal Becker wrote:
>
>> I see the problem. Thanks for the great profiler! You ought to make this more widely known.
>
> I'll be making a release shortly.
>
>> It seems the big chunks of time are used in data conversion between numpy and my own vectors classes. Mine are wrappers around boost::ublas. The conversion must be falling back on a very inefficient method since there is no special code to handle numpy vectors.
>>
>> Not sure what is the best solution. It would be _great_ if I could make boost::python objects that export a buffer interface, but I have absolutely no idea how to do this (and so far no one else has volunteered any info on this).
>
> Who's not volunteering information, boost::python or us?

I've asked on python.c++, the home of boost::python and friends. I've spent a little time on it myself, but I think this job requires great knowledge of python c api as well as the mysteries of boost::python.
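[Aside, not from the thread, on Nils' savetxt question above: one possible approach is to put the blanks into the row format itself. This sketch assumes that savetxt writes a multi-specifier fmt string verbatim for each row, so trailing blanks survive in the output file:

import numpy as np
from numpy.random import rand

A = rand(100, 2)
# three trailing blanks after the second column, carried by the format
np.savetxt('blanks.dat', A, fmt='%10.2f %10.2f   ')

]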
From stefan at sun.ac.za Wed Jan 21 07:49:32 2009
From: stefan at sun.ac.za (Stéfan van der Walt)
Date: Wed, 21 Jan 2009 14:49:32 +0200
Subject: [Numpy-discussion] numpy and bitwise arrays?
In-Reply-To: <923458.60721.qm@web27904.mail.ukl.yahoo.com>
References: <923458.60721.qm@web27904.mail.ukl.yahoo.com>
Message-ID: <9457e7c80901210449n57e65d0awcd259f33f4ba1923@mail.gmail.com>

Hi Muhammad

2009/1/21 Muhammad Alkarouri :
> I need to use a bitwise array, and I wanted to check what is the common practice using numpy.

You can also take a look at Ilan Schnell's bitarray:

http://pypi.python.org/pypi/bitarray/

Cheers
Stéfan

From sturla at molden.no Wed Jan 21 08:38:34 2009
From: sturla at molden.no (Sturla Molden)
Date: Wed, 21 Jan 2009 14:38:34 +0100
Subject: [Numpy-discussion] python numpy code many times slower than c++
In-Reply-To:
References: <3d375d730901201822g5a7e5e9fi3d341502b0d59878@mail.gmail.com>
Message-ID: <4977255A.8010404@molden.no>

On 1/21/2009 1:27 PM, Neal Becker wrote:
> It might if I had used this for all of my c++ code, but I have a big library of c++ wrapped code that doesn't use pyublas. Pyublas takes numpy objects from python and allows the use of c++ ublas on it (without conversion).

If you can get a pointer (as integer) to your C++ data, and the shape and dtype is known, you may use this (rather unsafe) 'fromaddress' hack:

http://www.mail-archive.com/numpy-discussion at scipy.org/msg04974.html

import numpy

def fromaddress(address, dtype, shape, strides=None):
    """ Create a numpy array from an integer address, a dtype
    or dtype string, a shape tuple, and possibly strides.
    Make sure dtype is a dtype, not just "f" or whatever.
    """
    dtype = numpy.dtype(dtype)
    class Dummy(object):
        pass
    d = Dummy()
    d.__array_interface__ = dict(
        data = (address, False),
        typestr = dtype.str,
        descr = dtype.descr,
        shape = shape,
        strides = strides,
        version = 3,
    )
    return numpy.asarray(d)

Example:

>>> a = numpy.zeros(10)
>>> a
array([ 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.])
>>> a.__array_interface__
{'descr': [('', '<f8')], 'data': (20388752, False), 'typestr': '<f8', 'shape': (10,), 'strides': None, 'version': 3}
>>> b = fromaddress(20388752, numpy.float64, (10,))
>>> b
array([ 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.])
>>> b[0] = 1.0
>>> a
array([ 1., 0., 0., 0., 0., 0., 0., 0., 0., 0.])

Sturla Molden

From sturla at molden.no Wed Jan 21 08:45:36 2009
From: sturla at molden.no (Sturla Molden)
Date: Wed, 21 Jan 2009 14:45:36 +0100
Subject: [Numpy-discussion] python numpy code many times slower than c++
In-Reply-To: <4977255A.8010404@molden.no>
References: <3d375d730901201822g5a7e5e9fi3d341502b0d59878@mail.gmail.com> <4977255A.8010404@molden.no>
Message-ID: <49772700.8030504@molden.no>

On 1/21/2009 2:38 PM, Sturla Molden wrote:
> If you can get a pointer (as integer) to your C++ data, and the shape and dtype is known, you may use this (rather unsafe) 'fromaddress' hack:

And the opposite: if you need to get the address referenced by an ndarray, you can do this:

def addressof(arr):
    return arr.__array_interface__['data'][0]

Then you will have to cast this unsigned integer to a pointer type in C++. Note that arr.data returns a buffer.
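[Aside, not from the thread: a quick round trip combining the two helpers above. This assumes fromaddress() and addressof() exactly as defined in the previous two messages, and only sketches the aliasing behaviour:

import numpy

a = numpy.zeros(5)
addr = addressof(a)                           # buffer address as a plain integer
b = fromaddress(addr, numpy.float64, a.shape)
b[0] = 42.0                                   # b aliases a's memory, so...
assert a[0] == 42.0                           # ...the write is visible through a

]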
Sturla Molden From lists_ravi at lavabit.com Wed Jan 21 09:37:36 2009 From: lists_ravi at lavabit.com (Ravi) Date: Wed, 21 Jan 2009 09:37:36 -0500 Subject: [Numpy-discussion] python numpy code many times slower than c++ In-Reply-To: References: Message-ID: <200901210937.36222.lists_ravi@lavabit.com> Hi Neal, On Wednesday 21 January 2009 07:27:04 Neal Becker wrote: > It might if I had used this for all of my c++ code, but I have a big > library of c++ wrapped code that doesn't use pyublas. ?Pyublas takes numpy > objects from python and allows the use of c++ ublas on it (without > conversion). > > Most of my code doesn't use numpy, it uses plain ublas to represent > vectors, and ublas handles storage. ?I can only interface to/from numpy > with conversion. I pointed out my code to you on c++-sig[1] a while back that solves precisely this problem. You found a bug with memory management that I fixed in the updated code. Does that still not work for you? Regards, Ravi [1] http://mail.python.org/pipermail/cplusplus-sig/2008-October/013825.html From malkarouri at yahoo.co.uk Wed Jan 21 10:21:24 2009 From: malkarouri at yahoo.co.uk (Muhammad Alkarouri) Date: Wed, 21 Jan 2009 15:21:24 +0000 (GMT) Subject: [Numpy-discussion] numpy and bitwise arrays? In-Reply-To: <9457e7c80901210449n57e65d0awcd259f33f4ba1923@mail.gmail.com> Message-ID: <752361.79246.qm@web27903.mail.ukl.yahoo.com> --- On Wed, 21/1/09, St?fan van der Walt wrote: > From: St?fan van der Walt ... > You can also take a look at Ilan Schnell's bitarray: > > http://pypi.python.org/pypi/bitarray/ Looks good to me. Thanks for the suggestion. Muhammad From ndbecker2 at gmail.com Wed Jan 21 10:22:36 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Wed, 21 Jan 2009 10:22:36 -0500 Subject: [Numpy-discussion] python numpy code many times slower than c++ References: <200901210937.36222.lists_ravi@lavabit.com> Message-ID: Ravi wrote: > Hi Neal, > > On Wednesday 21 January 2009 07:27:04 Neal Becker wrote: >> It might if I had used this for all of my c++ code, but I have a big >> library of c++ wrapped code that doesn't use pyublas. ?Pyublas takes >> numpy objects from python and allows the use of c++ ublas on it (without >> conversion). >> >> Most of my code doesn't use numpy, it uses plain ublas to represent >> vectors, and ublas handles storage. ?I can only interface to/from numpy >> with conversion. > > I pointed out my code to you on c++-sig[1] a while back that solves > precisely this problem. You found a bug with memory management that I > fixed in the updated code. Does that still not work for you? > > Regards, > Ravi > > [1] > [http://mail.python.org/pipermail/cplusplus-sig/2008-October/013825.html Thanks for reminding me about this! Do you have a current version of the code? I grabbed the files from the above message, but I see some additional subsequent messages with more patches. From ndbecker2 at gmail.com Wed Jan 21 10:38:44 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Wed, 21 Jan 2009 10:38:44 -0500 Subject: [Numpy-discussion] python numpy code many times slower than c++ References: <200901210937.36222.lists_ravi@lavabit.com> Message-ID: Ravi wrote: > Hi Neal, > > On Wednesday 21 January 2009 07:27:04 Neal Becker wrote: >> It might if I had used this for all of my c++ code, but I have a big >> library of c++ wrapped code that doesn't use pyublas. ?Pyublas takes >> numpy objects from python and allows the use of c++ ublas on it (without >> conversion). 
>> >> Most of my code doesn't use numpy, it uses plain ublas to represent >> vectors, and ublas handles storage. ?I can only interface to/from numpy >> with conversion. > > I pointed out my code to you on c++-sig[1] a while back that solves > precisely this problem. You found a bug with memory management that I > fixed in the updated code. Does that still not work for you? > > Regards, > Ravi > > [1] > [http://mail.python.org/pipermail/cplusplus-sig/2008-October/013825.html Do you know if this code will work with strided vectors? If I pass a slice: u = array (...) F (u[::2]) What happens? From oliphant at enthought.com Wed Jan 21 10:41:45 2009 From: oliphant at enthought.com (Travis E. Oliphant) Date: Wed, 21 Jan 2009 09:41:45 -0600 Subject: [Numpy-discussion] numpy and bitwise arrays? In-Reply-To: <752361.79246.qm@web27903.mail.ukl.yahoo.com> References: <752361.79246.qm@web27903.mail.ukl.yahoo.com> Message-ID: <49774239.4020609@enthought.com> Muhammad Alkarouri wrote: > --- On Wed, 21/1/09, St?fan van der Walt wrote: > > >> From: St?fan van der Walt >> > ... > >> You can also take a look at Ilan Schnell's bitarray: >> >> http://pypi.python.org/pypi/bitarray/ >> > > Looks good to me. Thanks for the suggestion. > You might also make use of the NumPy functions: packbits unpackbits fromfile Read the bits in as uint8 data using fromfile. Then, you can manipulate them either using bit twiddling or with indexing operations after unpacking to boolean arrays. -Travis From lists_ravi at lavabit.com Wed Jan 21 11:30:03 2009 From: lists_ravi at lavabit.com (Ravi) Date: Wed, 21 Jan 2009 11:30:03 -0500 Subject: [Numpy-discussion] python numpy code many times slower than c++ In-Reply-To: References: <200901210937.36222.lists_ravi@lavabit.com> Message-ID: <200901211130.04473.lists_ravi@lavabit.com> On Wednesday 21 January 2009 10:22:36 Neal Becker wrote: > > [http://mail.python.org/pipermail/cplusplus-sig/2008-October/013825.html > > Thanks for reminding me about this! > > Do you have a current version of the code? ?I grabbed the files from the > above message, but I see some additional subsequent messages with more > patches. That is the latest publicly posted code. Since then, there is just one minor patch (attached) which enables use of row-major (c-contiguous) arrays. This does *not* work with strided arrays which would be a fair bit of effort to support. Further, you will have to work with the numpy iterator interface, which, while well-designed, is a great illustration of the effort required to support OO programming in an non-OO language, and is pretty tedious to map to the ublas storage iterator interface. If you do implement it, I would very much like to take a look at it. Regards, Ravi -------------- next part -------------- A non-text attachment was scrubbed... Name: fix_column_major.patch Type: text/x-patch Size: 5446 bytes Desc: not available URL: From dsdale24 at gmail.com Wed Jan 21 11:34:07 2009 From: dsdale24 at gmail.com (Darren Dale) Date: Wed, 21 Jan 2009 11:34:07 -0500 Subject: [Numpy-discussion] strange multiplication behavior with numpy.float64 and ndarray subclass Message-ID: I have a simple test script here that multiplies an ndarray subclass with another number. Can anyone help me understand why each of these combinations returns a new instance of MyArray: mine = MyArray() print type(np.float32(1)*mine) print type(mine*np.float32(1)) print type(mine*np.float64(1)) print type(1*mine) print type(mine*1) but this one returns a np.float64 instance? 
print type(np.float64(1)*mine)

Here is the full script:

import numpy as np

class MyArray(np.ndarray):

    __array_priority__ = 20

    def __new__(cls):
        return np.asarray(1).view(cls).copy()

    def __repr__(self):
        return 'my_array'

    __str__ = __repr__

    def __mul__(self, other):
        return super(MyArray, self).__mul__(other)

    def __rmul__(self, other):
        return super(MyArray, self).__rmul__(other)

mine = MyArray()
print type(np.float32(1)*mine)
print type(mine*np.float32(1))
print type(mine*np.float64(1))
print type(1*mine)
print type(mine*1)
print type(np.float64(1)*mine)

Thanks,
Darren
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From pgmdevlist at gmail.com Wed Jan 21 11:43:53 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Wed, 21 Jan 2009 11:43:53 -0500
Subject: [Numpy-discussion] strange multiplication behavior with numpy.float64 and ndarray subclass
In-Reply-To:
References:
Message-ID:

On Jan 21, 2009, at 11:34 AM, Darren Dale wrote:
> I have a simple test script here that multiplies an ndarray subclass with another number. Can anyone help me understand why each of these combinations returns a new instance of MyArray:
>
> mine = MyArray()
> print type(np.float32(1)*mine)
> print type(mine*np.float32(1))
> print type(mine*np.float64(1))
> print type(1*mine)
> print type(mine*1)
>
> but this one returns a np.float64 instance?

FYI, that's the same behavior as observed in ticket #826. A first thread addressed that issue
http://www.mail-archive.com/numpy-discussion at scipy.org/msg13235.html
But so far, no answer has been suggested. Any help welcome.

From dsdale24 at gmail.com Wed Jan 21 12:05:23 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Wed, 21 Jan 2009 12:05:23 -0500
Subject: [Numpy-discussion] strange multiplication behavior with numpy.float64 and ndarray subclass
In-Reply-To:
References:
Message-ID:

On Wed, Jan 21, 2009 at 11:43 AM, Pierre GM wrote:
> On Jan 21, 2009, at 11:34 AM, Darren Dale wrote:
> > I have a simple test script here that multiplies an ndarray subclass with another number. Can anyone help me understand why each of these combinations returns a new instance of MyArray:
> >
> > mine = MyArray()
> > print type(np.float32(1)*mine)
> > print type(mine*np.float32(1))
> > print type(mine*np.float64(1))
> > print type(1*mine)
> > print type(mine*1)
> >
> > but this one returns a np.float64 instance?
>
> FYI, that's the same behavior as observed in ticket #826. A first thread addressed that issue
> http://www.mail-archive.com/numpy-discussion at scipy.org/msg13235.html
> But so far, no answer has been suggested.
> Any help welcome.

I don't understand why __array_priority__ is not being respected here. Ticket 826 lists the component as numpy.ma, it seems the problem is in numpy.core. I think the severity of the ticket should be increased. But I wasn't able to view the ticket, I keep getting an "internal server error".
-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From pgmdevlist at gmail.com Wed Jan 21 12:26:22 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Wed, 21 Jan 2009 12:26:22 -0500
Subject: [Numpy-discussion] strange multiplication behavior with numpy.float64 and ndarray subclass
In-Reply-To:
References:
Message-ID:

> I don't understand why __array_priority__ is not being respected here. Ticket 826 lists the component as numpy.ma, it seems the problem is in numpy.core. I think the severity of the ticket should be increased.
> But I wasn't able to view the ticket, I keep getting an "internal server error".

Ticket #826 bumped.

From Chris.Barker at noaa.gov Wed Jan 21 12:44:29 2009
From: Chris.Barker at noaa.gov (Christopher Barker)
Date: Wed, 21 Jan 2009 09:44:29 -0800
Subject: [Numpy-discussion] python numpy code many times slower than c++
In-Reply-To:
References:
Message-ID: <49775EFD.9000806@noaa.gov>

Neal Becker wrote:
> I tried a little experiment, implementing some code in numpy

It sounds like you've found your core issue, but a couple comments:

> from numpy import *

I'm convinced that "import *" is a bad idea. I think the "standard" syntax is now "import numpy as np"

> from math import pi

numpy already has pi -- I find I never need math, if I'm using numpy.

def db_to_volt (db):
    return 10**(0.05*db)
...
class ampl (object):
...
ampl_interp = linear_interp (vectorize (db_to_volt) (pin), db_to_volt (pout))

you shouldn't need to use vectorize here -- db_to_volt already takes array input. vectorize could kill performance, in fact.

ampl_interp = linear_interp(db_to_volt(pin), db_to_volt(pout))

should work fine. also, if you want maximum performance, you can eliminate extraneous array creation in functions like that by:

1) using numexpr (see recent posts about it)

2) writing uglier code that explicitly passes in the output arrays:

def db_to_volt (db):
    a = 0.05*db
    return np.power(10, a, a)

This will only help for large arrays, and help more for more complex functions.

A minor style nit: I found it remarkably hard to read your code because of the spaces before the open parens for function calls:

func (arg1, arg2)

It's not just me: PEP 8 makes it very clear:

"""
Whitespace in Expressions and Statements

Pet Peeves

Avoid extraneous whitespace in the following situations:

- Immediately before the open parenthesis that starts the argument list of a function call:

  Yes: spam(1)
  No: spam (1)
"""

http://www.python.org/dev/peps/pep-0008/

I imagine you've used that style for years for lots of code, but I couldn't help myself!

-Chris

--
Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R (206) 526-6959 voice
7600 Sand Point Way NE (206) 526-6329 fax
Seattle, WA 98115 (206) 526-6317 main reception

Chris.Barker at noaa.gov

From dsdale24 at gmail.com Wed Jan 21 13:07:59 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Wed, 21 Jan 2009 13:07:59 -0500
Subject: [Numpy-discussion] strange multiplication behavior with numpy.float64 and ndarray subclass
In-Reply-To:
References:
Message-ID:

On Wed, Jan 21, 2009 at 12:26 PM, Pierre GM wrote:
> > I don't understand why __array_priority__ is not being respected here. Ticket 826 lists the component as numpy.ma, it seems the problem is in numpy.core. I think the severity of the ticket should be increased. But I wasn't able to view the ticket, I keep getting an "internal server error".
>
> Ticket #826 bumped.

Just an additional bit of context. I'm working on a subclass that handles physical quantities, and issue 826 causes a quantity to be converted to a dimensionless magnitude.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: From wesmckinn at gmail.com Wed Jan 21 13:13:00 2009 From: wesmckinn at gmail.com (Wes McKinney) Date: Wed, 21 Jan 2009 13:13:00 -0500 Subject: [Numpy-discussion] python numpy code many times slower than c++ In-Reply-To: <3d375d730901210023u56a260a9k24646e3fb9f41d70@mail.gmail.com> References: <3d375d730901201822g5a7e5e9fi3d341502b0d59878@mail.gmail.com> <3d375d730901210023u56a260a9k24646e3fb9f41d70@mail.gmail.com> Message-ID: <6c476c8a0901211013k42858881rc9b3703a341b415f@mail.gmail.com> Robert-- this is a great little piece of code, I already think it will be a part of my workflow. However, I seem to be getting negative % time taken on the more time consuming lines, perhaps getting some overflow? Thanks a lot, Wes On Wed, Jan 21, 2009 at 3:23 AM, Robert Kern wrote: > On Tue, Jan 20, 2009 at 20:57, Neal Becker wrote: > > > I see the problem. Thanks for the great profiler! You ought to make > this > > more widely known. > > http://pypi.python.org/pypi/line_profiler > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." > -- Umberto Eco > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From simpson at math.toronto.edu Wed Jan 21 13:53:15 2009 From: simpson at math.toronto.edu (Gideon Simpson) Date: Wed, 21 Jan 2009 13:53:15 -0500 Subject: [Numpy-discussion] failure Message-ID: Installing on a Sun machine with Red Hat linux, I got the following error: ====================================================================== FAIL: test_umath.TestComplexFunctions.test_against_cmath ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/local/nonsystem/simpson/lib/python2.5/site-packages/nose/ case.py", line 182, in runTest self.test(*self.arg) File "/usr/local/nonsystem/simpson/lib/python2.5/site-packages/ numpy/core/tests/test_umath.py", line 268, in test_against_cmath assert abs(a - b) < atol, "%s %s: %s; cmath: %s"%(fname,p,a,b) AssertionError: arcsinh -2j: (-1.31695789692-1.57079632679j); cmath: (1.31695789692-1.57079632679j) ---------------------------------------------------------------------- Ran 1740 tests in 9.839s FAILED (KNOWNFAIL=1, failures=1) How would you recommend I troubleshoot this? How seriously should I take it? This is with a fresh Python 2.5.4 installation too. -gideon From h5py at alfven.org Wed Jan 21 13:55:39 2009 From: h5py at alfven.org (Andrew Collette) Date: Wed, 21 Jan 2009 10:55:39 -0800 Subject: [Numpy-discussion] ANN: Numexpr 1.1, an efficient array evaluator In-Reply-To: <200901211241.35007.faltet@pytables.org> References: <200901161300.26130.faltet@pytables.org> <200901201513.01432.faltet@pytables.org> <200901211241.35007.faltet@pytables.org> Message-ID: Hi, I get identical results for both shapes now; I manually removed the "numexpr-1.1.1.dev-py2.5-linux-i686.egg" folder in site-packages and reinstalled. I suppose there must have been a stale set of files somewhere. Andrew Collette On Wed, Jan 21, 2009 at 3:41 AM, Francesc Alted wrote: > A Tuesday 20 January 2009, Andrew Collette escrigu?: >> Works much, much better with the current svn version. 
:) Numexpr now >> outperforms everything except the "simple" technique, and then only >> for small data sets. > > Correct. This is because of the cost of parsing the expression and > initializing the virtual machine. However, as soon as the sizes of the > operands exceeds the cache of your processor you are starting to see > the improvement in performance. > >> Along the lines you mentioned I noticed that simply changing from a >> shape of (100*100*100,) to (100, 100, 100) results in nearly a factor >> of 2 worse performance, a factor which seems constant when changing >> the size of the data set. > > Sorry, but I cannot reproduce this. When using the expression: > > "63 + (a*b) + (c**2) + b" > > I get on my machine (Core2 at 3 GHz, running openSUSE Linux 11.1): > > 1000000 f8 (average of 10 runs) > Simple: 0.0278068065643 > Numexpr: 0.00839750766754 > Chunked: 0.0266514062881 > > (100, 100, 100) f8 (average of 10 runs) > Simple: 0.0277318000793 > Numexpr: 0.00851640701294 > Chunked: 0.0346593856812 > > and these are the expected results (i.e. no change in performance due to > multidimensional arrays). Even for larger arrays, I don't see nothing > unexpected: > > 10000000 f8 (average of 10 runs) > Simple: 0.334054994583 > Numexpr: 0.110022115707 > Chunked: 0.29678030014 > > (100, 100, 100, 10) f8 (average of 10 runs) > Simple: 0.339299607277 > Numexpr: 0.111632704735 > Chunked: 0.375299096107 > > Can you tell us which platforms are you using? > >> Is this related to the way numexpr handles >> broadcasting rules? It would seem the memory contents should be >> identical for these two cases. >> >> Andrew >> >> On Tue, Jan 20, 2009 at 6:13 AM, Francesc Alted > wrote: >> > A Tuesday 20 January 2009, Andrew Collette escrigu?: >> >> Hi Francesc, >> >> >> >> Looks like a cool project! However, I'm not able to achieve the >> >> advertised speed-ups. I wrote a simple script to try three >> >> approaches to this kind of problem: >> >> >> >> 1) Native Python code (i.e. will try to do everything at once >> >> using temp arrays) 2) Straightforward numexpr evaluation >> >> 3) Simple "chunked" evaluation using array.flat views. (This >> >> solves the memory problem and allows the use of arbitrary Python >> >> expressions). >> >> >> >> I've attached the script; here's the output for the expression >> >> "63 + (a*b) + (c**2) + sin(b)" >> >> along with a few combinations of shapes/dtypes. As expected, >> >> using anything other than "f8" (double) results in a performance >> >> penalty. Surprisingly, it seems that using chunks via array.flat >> >> results in similar performance for f8, and even better performance >> >> for other dtypes. >> > >> > [clip] >> > >> > Well, there were two issues there. The first one is that when >> > transcendental functions are used (like sin() above), the >> > bottleneck is on the CPU instead of memory bandwidth, so numexpr >> > speedups are not so high as usual. The other issue was an actual >> > bug in the numexpr code that forced a copy of all multidimensional >> > arrays (I normally only use undimensional arrays for doing >> > benchmarks). This has been fixed in trunk (r39). >> > >> > So, with the fix on, the timings are: >> > >> > (100, 100, 100) f4 (average of 10 runs) >> > Simple: 0.0426136016846 >> > Numexpr: 0.11350851059 >> > Chunked: 0.0635252952576 >> > (100, 100, 100) f8 (average of 10 runs) >> > Simple: 0.119254398346 >> > Numexpr: 0.10092959404 >> > Chunked: 0.128384995461 >> > >> > The speed-up is now a mere 20% (for f8), but at least it is not >> > slower. 
With the patches that recently contributed Georg for using >> > Intel's VML, the acceleration is a bit better: >> > >> > (100, 100, 100) f4 (average of 10 runs) >> > Simple: 0.0417867898941 >> > Numexpr: 0.0944641113281 >> > Chunked: 0.0636183023453 >> > (100, 100, 100) f8 (average of 10 runs) >> > Simple: 0.120059680939 >> > Numexpr: 0.0832288980484 >> > Chunked: 0.128114104271 >> > >> > i.e. the speed-up is around 45% (for f8). >> > >> > Moreover, if I get rid of the sin() function and use the expresion: >> > >> > "63 + (a*b) + (c**2) + b" >> > >> > I get: >> > >> > (100, 100, 100) f4 (average of 10 runs) >> > Simple: 0.0119329929352 >> > Numexpr: 0.0198570966721 >> > Chunked: 0.0338240146637 >> > (100, 100, 100) f8 (average of 10 runs) >> > Simple: 0.0255623102188 >> > Numexpr: 0.00832500457764 >> > Chunked: 0.0340095996857 >> > >> > which has a 3.1x speedup (for f8). >> > >> >> FYI, the current tar file (1.1-1) has a glitch related to the >> >> VERSION file; I added to the bug report at google code. >> > >> > Thanks. Will focus on that asap. Mmm, seems like there is stuff >> > enough for another release of numexpr. I'll try to do it soon. >> > >> > Cheers, >> > >> > -- >> > Francesc Alted >> > _______________________________________________ >> > Numpy-discussion mailing list >> > Numpy-discussion at scipy.org >> > http://projects.scipy.org/mailman/listinfo/numpy-discussion >> >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion > > > > -- > Francesc Alted > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > From ndbecker2 at gmail.com Wed Jan 21 13:55:49 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Wed, 21 Jan 2009 13:55:49 -0500 Subject: [Numpy-discussion] python numpy code many times slower than c++ References: <200901210937.36222.lists_ravi@lavabit.com> <200901211130.04473.lists_ravi@lavabit.com> Message-ID: Ravi wrote: > On Wednesday 21 January 2009 10:22:36 Neal Becker wrote: >> > [http://mail.python.org/pipermail/cplusplus-sig/2008- October/013825.html >> >> Thanks for reminding me about this! >> >> Do you have a current version of the code? ?I grabbed the files from the >> above message, but I see some additional subsequent messages with more >> patches. > > That is the latest publicly posted code. Since then, there is just one > minor patch (attached) which enables use of row-major (c-contiguous) > arrays. > > This does *not* work with strided arrays which would be a fair bit of > effort to support. Further, you will have to work with the numpy iterator > interface, which, while well-designed, is a great illustration of the > effort required to support OO programming in an non-OO language, and is > pretty tedious to map to the ublas storage iterator interface. If you do > implement it, I would very much like to take a look at it. > > Regards, > Ravi I'm only interested in simple strided 1-d vectors. In that case, I think your code already works. If you have c++ code using the iterator interface, the iterators dereference will use (*array )[index]. This will use operator[], which will call PyArray_GETPTR. So I think this will obey strides. Unfortunately, it will also be slow. I suggest something like the enclosed. I have done some simple tests, and it seems to work. 
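[Aside, not from the thread: what the strided case discussed above looks like from the numpy side -- a small illustration, separate from the attached files:

import numpy as np

u = np.arange(10, dtype=np.float64)
v = u[::2]                   # a view, not a copy
print u.strides, v.strides   # (8,) (16,) -- the slice doubles the stride
print v.base is u            # True: v borrows u's memory

]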
-------------- next part -------------- A non-text attachment was scrubbed... Name: numpy.hpp Type: text/x-c++hdr Size: 9909 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: test1.cc Type: text/x-c++src Size: 1359 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: testsum.py Type: text/x-python Size: 246 bytes Desc: not available URL: From ndbecker2 at gmail.com Wed Jan 21 14:57:59 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Wed, 21 Jan 2009 14:57:59 -0500 Subject: [Numpy-discussion] python numpy code many times slower than c++ References: <200901210937.36222.lists_ravi@lavabit.com> <200901211130.04473.lists_ravi@lavabit.com> Message-ID: Ravi wrote: > On Wednesday 21 January 2009 10:22:36 Neal Becker wrote: >> > [http://mail.python.org/pipermail/cplusplus-sig/2008-October/013825.html >> >> Thanks for reminding me about this! >> >> Do you have a current version of the code? I grabbed the files from the >> above message, but I see some additional subsequent messages with more >> patches. > > That is the latest publicly posted code. Since then, there is just one > minor patch (attached) which enables use of row-major (c-contiguous) > arrays. > > This does *not* work with strided arrays, which would be a fair bit of > effort to support. Further, you will have to work with the numpy iterator > interface, which, while well-designed, is a great illustration of the > effort required to support OO programming in a non-OO language, and is > pretty tedious to map to the ublas storage iterator interface. If you do > implement it, I would very much like to take a look at it. > > Regards, > Ravi It seems your code works fine for my usual style: ublas::vector func (numpy::array_from_py::type const&) But not for a function that modifies its arg in-place (& instead of const&): void func (numpy::array_from_py::type &) This gives: ArgumentError: Python argument types in test1.double(numpy.ndarray) did not match C++ signature: double(boost::numeric::ublas::vector, numpy::detail::numpy_storage_array > > {lvalue}) My instinct is to ignore it, because I think I don't need it, but do you have a workaround? From bpederse at gmail.com Wed Jan 21 15:14:44 2009 From: bpederse at gmail.com (Brent Pedersen) Date: Wed, 21 Jan 2009 12:14:44 -0800 Subject: [Numpy-discussion] genfromtxt Message-ID: hi, i'm using the new genfromtxt stuff in numpy svn, looks great, pierre and anyone else who contributed. is there a way to have the header commented and still be able to have it recognized as the header? e.g. #gender age weight M 21 72.100000 F 35 58.330000 M 33 21.99 if i use np.loadtxt or genfromtxt, it tries to use the 2nd row (the first non-commented one) as the header. and i get an array like: array([('M', 21, 72.099999999999994), ('F', 35, 58.329999999999998), ('M', 33, 21.989999999999998)], dtype=[('f0', '|S1'), ('f1', ' References: Message-ID: Brent, Currently, no, you won't be able to retrieve the header if it's commented. I'll see what I can do. P. From lists_ravi at lavabit.com Wed Jan 21 15:37:33 2009 From: lists_ravi at lavabit.com (Ravi) Date: Wed, 21 Jan 2009 15:37:33 -0500 Subject: [Numpy-discussion] python numpy code many times slower than c++ In-Reply-To: References: <200901211130.04473.lists_ravi@lavabit.com> Message-ID: <200901211537.33966.lists_ravi@lavabit.com> On Wednesday 21 January 2009 13:55:49 Neal Becker wrote: > I'm only interested in simple strided 1-d vectors. 
In that case, I think > your code already works. If you have C++ code using the iterator > interface, dereferencing the iterators will use (*array)[index]. This will > use operator[], which will call PyArray_GETPTR. So I think this will obey > strides. You are right. I had forgotten that I had simple strided vectors working. > Unfortunately, it will also be slow. I suggest something like the > enclosed. I have done some simple tests, and it seems to work. I wonder why PyArray_GETPTR1 is slow. Is it because of the implied integer multiplication? Unfortunately, your approach means that iterators can become invalid if the underlying array is resized to a larger size. Hmmh, perhaps we could make this configurable at compile-time ... Thanks for the code. Could you provide some benchmarks on the relative speeds of the two approaches? Regards, Ravi From ndbecker2 at gmail.com Wed Jan 21 15:53:37 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Wed, 21 Jan 2009 15:53:37 -0500 Subject: [Numpy-discussion] python numpy code many times slower than c++ References: <200901211130.04473.lists_ravi@lavabit.com> <200901211537.33966.lists_ravi@lavabit.com> Message-ID: Ravi wrote: > On Wednesday 21 January 2009 13:55:49 Neal Becker wrote: >> I'm only interested in simple strided 1-d vectors. In that case, I think >> your code already works. If you have C++ code using the iterator >> interface, dereferencing the iterators will use (*array)[index]. This >> will use operator[], which will call PyArray_GETPTR. So I think this >> will obey strides. > > You are right. I had forgotten that I had simple strided vectors working. > >> Unfortunately, it will also be slow. I suggest something like the >> enclosed. I have done some simple tests, and it seems to work. > > I wonder why PyArray_GETPTR1 is slow. Is it because of the implied integer > multiplication? Unfortunately, your approach means that iterators can > become invalid if the underlying array is resized to a larger size. Hmmh, > perhaps we could make this configurable at compile-time ... > > Thanks for the code. Could you provide some benchmarks on the relative > speeds of the two approaches? > > Regards, > Ravi Do you know about pyublas? This is the same issue we ran into there. I did not benchmark the code you sent me. I was just going by my experience with pyublas. I guess a benchmark would be a good idea. From lists_ravi at lavabit.com Wed Jan 21 16:04:47 2009 From: lists_ravi at lavabit.com (Ravi) Date: Wed, 21 Jan 2009 16:04:47 -0500 Subject: [Numpy-discussion] python numpy code many times slower than c++ In-Reply-To: References: <200901211130.04473.lists_ravi@lavabit.com> Message-ID: <200901211604.48347.lists_ravi@lavabit.com> On Wednesday 21 January 2009 14:57:59 Neal Becker wrote: > ublas::vector func (numpy::array_from_py::type const&) > > But not for a function that modifies its arg in-place (& instead of const&): > > void func (numpy::array_from_py::type &) ^^^^ Use void func (numpy::array_from_py::type ) Why does this work? It is a tradeoff I had to make; I chose to use Python conventions rather than C++ conventions. Essentially, what is passed back to you is a reference to the numpy array. Any copies you make of it are actually copies of the reference, not of the actual array. This simplifies the code quite a bit while maintaining the reference semantics that Python programmers use. See dump_vec in decco.cc (the example module) for an example. 
Regards, Ravi From ndbecker2 at gmail.com Wed Jan 21 16:13:59 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Wed, 21 Jan 2009 16:13:59 -0500 Subject: [Numpy-discussion] line_profiler suggestion Message-ID: Would be handy to not have to add/remove @profile to switch between profiling/normal operation. When run without profiler, @profile is redefined to do nothing. Possible? From robert.kern at gmail.com Wed Jan 21 16:34:05 2009 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 21 Jan 2009 15:34:05 -0600 Subject: [Numpy-discussion] line_profiler suggestion In-Reply-To: References: Message-ID: <3d375d730901211334y39eff180s7af08dedbfc71e25@mail.gmail.com> On Wed, Jan 21, 2009 at 15:13, Neal Becker wrote: > Would be handy to not have to add/remove @profile to switch between > profiling/normal operation. > > When run without profiler, @profile is redefined to do nothing. Possible? I could add a --noop flag to kernprof, which basically tells it to insert a do-nothing profile() decorator. I'm not sure what the use case is, though. Are you switching back and forth frequently? -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From ndbecker2 at gmail.com Wed Jan 21 16:34:35 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Wed, 21 Jan 2009 16:34:35 -0500 Subject: [Numpy-discussion] python numpy code many times slower than c++ References: <200901211130.04473.lists_ravi@lavabit.com> <200901211537.33966.lists_ravi@lavabit.com> Message-ID: Neal Becker wrote: > Ravi wrote: > >> On Wednesday 21 January 2009 13:55:49 Neal Becker wrote: >>> I'm only interested in simple strided 1-d vectors. In that case, I >>> think your code already works. If you have C++ code using the iterator >>> interface, dereferencing the iterators will use (*array)[index]. This >>> will use operator[], which will call PyArray_GETPTR. So I think this >>> will obey strides. >> >> You are right. I had forgotten that I had simple strided vectors working. >> >>> Unfortunately, it will also be slow. I suggest something like the >>> enclosed. I have done some simple tests, and it seems to work. >> >> I wonder why PyArray_GETPTR1 is slow. Is it because of the implied >> integer multiplication? Unfortunately, your approach means that iterators >> can become invalid if the underlying array is resized to a larger size. >> Hmmh, perhaps we could make this configurable at compile-time ... >> Iterators almost always become invalid under those sorts of changes, so I don't think that's a surprise. GETPTR1 has to do: PyArray_STRIDES(obj)[0] There are several memory references there, and I don't think the compiler can assume that this value doesn't change from one access to another - so it can't be cached. That said, I have tried a few benchmarks. Surprisingly, I'm not seeing any difference in a few quick tests. I do have one cosmetic patch for you. 
This will shut up gcc, which otherwise gives the longest warning message ever about an unused variable:

--- numpy.new.orig/numpyregister.hpp 2009-01-21 15:59:00.000000000 -0500
+++ numpy.new/numpyregister.hpp 2009-01-21 14:11:00.000000000 -0500
@@ -257,7 +257,8 @@
   storage_t *the_storage = reinterpret_cast( data );
   void *memory_chunk = the_storage->storage.bytes;
   array_storage_t dd( obj );
-  array_t *v = new ( memory_chunk ) array_t( dd.size(), dd );
+  //array_t *v = new ( memory_chunk ) array_t( dd.size(), dd );
+  new ( memory_chunk ) array_t( dd.size(), dd );
   data->convertible = memory_chunk;
 }

From robert.kern at gmail.com Wed Jan 21 16:36:18 2009 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 21 Jan 2009 15:36:18 -0600 Subject: [Numpy-discussion] python numpy code many times slower than c++ In-Reply-To: <6c476c8a0901211013k42858881rc9b3703a341b415f@mail.gmail.com> References: <3d375d730901201822g5a7e5e9fi3d341502b0d59878@mail.gmail.com> <3d375d730901210023u56a260a9k24646e3fb9f41d70@mail.gmail.com> <6c476c8a0901211013k42858881rc9b3703a341b415f@mail.gmail.com> Message-ID: <3d375d730901211336l55d13ffev35f6980cef57be35@mail.gmail.com> On Wed, Jan 21, 2009 at 12:13, Wes McKinney wrote: > Robert-- this is a great little piece of code, I already think it will be a > part of my workflow. However, I seem to be getting negative % time taken on > the more time consuming lines, perhaps getting some overflow? That's odd. Can you send me the code (perhaps offlist) or at least the .lprof file? -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From ndbecker2 at gmail.com Wed Jan 21 16:35:58 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Wed, 21 Jan 2009 16:35:58 -0500 Subject: [Numpy-discussion] line_profiler suggestion References: <3d375d730901211334y39eff180s7af08dedbfc71e25@mail.gmail.com> Message-ID: Robert Kern wrote: > On Wed, Jan 21, 2009 at 15:13, Neal Becker wrote: >> Would be handy to not have to add/remove @profile to switch between >> profiling/normal operation. >> >> When run without profiler, @profile is redefined to do nothing. >> Possible? > > I could add a --noop flag to kernprof, which basically tells it to > insert a do-nothing profile() decorator. I'm not sure what the use > case is, though. Are you switching back and forth frequently? > That won't help, the idea is to not have to edit the code adding/removing @profile in order to run _without_ kernprof. From robert.kern at gmail.com Wed Jan 21 16:44:26 2009 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 21 Jan 2009 15:44:26 -0600 Subject: [Numpy-discussion] line_profiler suggestion In-Reply-To: References: <3d375d730901211334y39eff180s7af08dedbfc71e25@mail.gmail.com> Message-ID: <3d375d730901211344y59747aa9qdb189e38698a00ff@mail.gmail.com> On Wed, Jan 21, 2009 at 15:35, Neal Becker wrote: > Robert Kern wrote: > >> On Wed, Jan 21, 2009 at 15:13, Neal Becker wrote: >>> Would be handy to not have to add/remove @profile to switch between >>> profiling/normal operation. >>> >>> When run without profiler, @profile is redefined to do nothing. >>> Possible? >> >> I could add a --noop flag to kernprof, which basically tells it to >> insert a do-nothing profile() decorator. I'm not sure what the use >> case is, though. Are you switching back and forth frequently? 
>> > That won't help, the idea is to not have to edit the code adding/removing > @profile in order to run _without_ kernprof. In your code, you can conditionally insert a profile() decorator into the __builtins__ if one isn't already there. You'll have to do that before you import anything. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From robert.kern at gmail.com Wed Jan 21 20:52:37 2009 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 21 Jan 2009 19:52:37 -0600 Subject: [Numpy-discussion] line_profiler suggestion In-Reply-To: <3d375d730901211344y59747aa9qdb189e38698a00ff@mail.gmail.com> References: <3d375d730901211334y39eff180s7af08dedbfc71e25@mail.gmail.com> <3d375d730901211344y59747aa9qdb189e38698a00ff@mail.gmail.com> Message-ID: <3d375d730901211752n58f0846cq45dfbe08a605cf0e@mail.gmail.com> On Wed, Jan 21, 2009 at 15:44, Robert Kern wrote: > On Wed, Jan 21, 2009 at 15:35, Neal Becker wrote: >> Robert Kern wrote: >> >>> On Wed, Jan 21, 2009 at 15:13, Neal Becker wrote: >>>> Would be handy to not have to add/remove @profile to switch between >>>> profiling/normal operation. >>>> >>>> When run without profiler, @profile is redefined to do nothing. >>>> Possible? >>> >>> I could add a --noop flag to kernprof, which basically tells it to >>> insert a do-nothing profile() decorator. I'm not sure what the use >>> case is, though. Are you switching back and forth frequently? >>> >> That won't help, the idea is to not have to edit the code adding/removing >> @profile in order to run _without_ kernprof. > > In your code, you can conditionally insert a profile() decorator into > the __builtins__ if one isn't already there. You'll have to do that > before you import anything. Alternately, you could do this in a sitecustomize.py file, but then your code will only work on your machine. All of these workarounds are probably little, if any, improvement over the current situation. The answer to your original question should probably be "No". Perhaps a better workflow for you would be the capability to specify the desired functions in, say, a text file rather than by modifying the code. Currently, the trace hook determines whether or not to time a line by looking at the code object it is from. The LineProfiler has a set of the desired code objects and tests membership by identity. The @profile decorator populates this set by grabbing the code objects from the function objects. This is fast and robust. It would be possible (but difficult to make robust) to instead look at things like .co_filename and .co_name to determine whether or not to time a line. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." 
-- Umberto Eco From ellisonbg.net at gmail.com Wed Jan 21 23:24:23 2009 From: ellisonbg.net at gmail.com (Brian Granger) Date: Wed, 21 Jan 2009 20:24:23 -0800 Subject: [Numpy-discussion] No tests found for numpy 1.2.0 Message-ID: <6ce0ac130901212024n611ea768v84682af73565d350@mail.gmail.com> Just installed numpy 1.2.0 and got this: $ python -c 'import numpy; numpy.test()' Running unit tests for numpy-1.2.0-py2.5-macosx-10.5-i386.egg.numpy NumPy version 1.2.0 NumPy is installed in /Users/bgranger/Library/Python/2.5/site-packages/numpy-1.2.0-py2.5-macosx-10.5-i386.egg/numpy Python version 2.5.1 (r251:54863, Jul 23 2008, 11:00:16) [GCC 4.0.1 (Apple Inc. build 5465)] nose version 0.10.4 ---------------------------------------------------------------------- Ran 0 tests in 0.017s OK Any thoughts on this? It seems to work though. I have nose 0.10.4 installed. From robert.kern at gmail.com Wed Jan 21 23:36:30 2009 From: robert.kern at gmail.com (Robert Kern) Date: Wed, 21 Jan 2009 22:36:30 -0600 Subject: [Numpy-discussion] No tests found for numpy 1.2.0 In-Reply-To: <6ce0ac130901212024n611ea768v84682af73565d350@mail.gmail.com> References: <6ce0ac130901212024n611ea768v84682af73565d350@mail.gmail.com> Message-ID: <3d375d730901212036i7d7dda05j4234b17bf9635efc@mail.gmail.com> On Wed, Jan 21, 2009 at 22:24, Brian Granger wrote: > Just installed numpy 1.2.0 and got this: > > $ python -c 'import numpy; numpy.test()' > Running unit tests for numpy-1.2.0-py2.5-macosx-10.5-i386.egg.numpy easy_install sets the executable bit on files and nose ignores executable files. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From ellisonbg.net at gmail.com Wed Jan 21 23:42:21 2009 From: ellisonbg.net at gmail.com (Brian Granger) Date: Wed, 21 Jan 2009 20:42:21 -0800 Subject: [Numpy-discussion] No tests found for numpy 1.2.0 In-Reply-To: <3d375d730901212036i7d7dda05j4234b17bf9635efc@mail.gmail.com> References: <6ce0ac130901212024n611ea768v84682af73565d350@mail.gmail.com> <3d375d730901212036i7d7dda05j4234b17bf9635efc@mail.gmail.com> Message-ID: <6ce0ac130901212042m26afca11jf40faf330d68703a@mail.gmail.com> > easy_install sets the executable bit on files and nose ignores executable files. Thanks Robert. I knew about this, but had never been bitten by it yet. Oh the joy! Brian From pgmdevlist at gmail.com Thu Jan 22 00:39:21 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Thu, 22 Jan 2009 00:39:21 -0500 Subject: [Numpy-discussion] genfromtxt In-Reply-To: References: Message-ID: Brent, Mind trying r6330 and let me know if it works for you ? Make sure that you use names=True to detect a header. P. From bpederse at gmail.com Thu Jan 22 01:50:28 2009 From: bpederse at gmail.com (Brent Pedersen) Date: Wed, 21 Jan 2009 22:50:28 -0800 Subject: [Numpy-discussion] genfromtxt In-Reply-To: References: Message-ID: On Wed, Jan 21, 2009 at 9:39 PM, Pierre GM wrote: > Brent, > Mind trying r6330 and let me know if it works for you ? Make sure that > you use names=True to detect a header. > P. > yes, works perfectly. thanks! 
-brent From hanni.ali at gmail.com Thu Jan 22 02:46:11 2009 From: hanni.ali at gmail.com (Hanni Ali) Date: Thu, 22 Jan 2009 07:46:11 +0000 Subject: [Numpy-discussion] python numpy code many times slower than c++ In-Reply-To: <3d375d730901211336l55d13ffev35f6980cef57be35@mail.gmail.com> References: <3d375d730901201822g5a7e5e9fi3d341502b0d59878@mail.gmail.com> <3d375d730901210023u56a260a9k24646e3fb9f41d70@mail.gmail.com> <6c476c8a0901211013k42858881rc9b3703a341b415f@mail.gmail.com> <3d375d730901211336l55d13ffev35f6980cef57be35@mail.gmail.com> Message-ID: <789d27b10901212346g2a23c28kebf0af9f229f5840@mail.gmail.com> I have been using your profiler extensively, and it has contributed to significant improvements in the application I work on, largely because the line-by-line breakdown makes it easy to select the next part of the code to optimize. So firstly, many thanks for writing it. However, back to my point: Wes, I have also experienced timing oddities, in particular on virtual machines (MS Hyper-V has very poor processor timings; the older MS VM works fine though). I believe the negative timings arise when the CPU (be it virtual or possibly physical) deviates from its standard performance, or rather from the timer unit taken initially. Would this make sense to you, Robert? Hanni 2009/1/21 Robert Kern > On Wed, Jan 21, 2009 at 12:13, Wes McKinney wrote: > > Robert-- this is a great little piece of code, I already think it will be > a > > part of my workflow. However, I seem to be getting negative % time taken > on > > the more time consuming lines, perhaps getting some overflow? > > That's odd. Can you send me the code (perhaps offlist) or at least the > .lprof file? > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." > -- Umberto Eco > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Thu Jan 22 02:52:32 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 22 Jan 2009 01:52:32 -0600 Subject: [Numpy-discussion] python numpy code many times slower than c++ In-Reply-To: <789d27b10901212346g2a23c28kebf0af9f229f5840@mail.gmail.com> References: <3d375d730901201822g5a7e5e9fi3d341502b0d59878@mail.gmail.com> <3d375d730901210023u56a260a9k24646e3fb9f41d70@mail.gmail.com> <6c476c8a0901211013k42858881rc9b3703a341b415f@mail.gmail.com> <3d375d730901211336l55d13ffev35f6980cef57be35@mail.gmail.com> <789d27b10901212346g2a23c28kebf0af9f229f5840@mail.gmail.com> Message-ID: <3d375d730901212352l112e8be8q89fd636696869d5a@mail.gmail.com> On Thu, Jan 22, 2009 at 01:46, Hanni Ali wrote: > I have been using your profiler extensively, and it has contributed to > significant improvements in the application I work on, largely because the > line-by-line breakdown makes it easy to select the next part of the code > to optimize. So firstly, many thanks for writing it. My pleasure. > However, back to my point: Wes, I have also experienced timing oddities, in > particular on virtual machines (MS Hyper-V has very poor processor timings; > the older MS VM works fine though). 
I believe the negative timings arise > when the CPU (be it virtual or possibly physical) deviates from its standard > performance, or rather from the timer unit taken initially. Would this make > sense to you, Robert? Can you try using cProfile with lots of calls to empty functions? I'm using the same timer functions as cProfile. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From cournape at gmail.com Thu Jan 22 05:04:34 2009 From: cournape at gmail.com (David Cournapeau) Date: Thu, 22 Jan 2009 19:04:34 +0900 Subject: [Numpy-discussion] new incremental statistics project In-Reply-To: <200901191134.47779.meine@informatik.uni-hamburg.de> References: <200901191134.47779.meine@informatik.uni-hamburg.de> Message-ID: <5b8d13220901220204k41220b50gfbf3268865a128b@mail.gmail.com> On Mon, Jan 19, 2009 at 7:34 PM, Hans Meine wrote: > On Friday 19 December 2008 03:27:12 Bradford Cross wrote: >> This is a new project I just released. >> >> I know it is C#, but some of the design and idioms would be nice in >> numpy/scipy for working with discrete event simulators, time series, and >> event stream processing. >> >> http://code.google.com/p/incremental-statistics/ > > Hi, do you know about the boost accumulators project? > > It's still in boost's sandbox, but I love its design, and it provides a large > number of well-documented, mathematically sound estimators for variance, mean, > etc.: > http://boost-sandbox.sourceforge.net/libs/accumulators/doc/html/index.html > > Just a heads-up, in case someone finds this useful here. > (Don't know about people's fondness of boost and/or C++ here.) Not a boost/C++ fan, but I like those projects. Incremental statistics have several advantages (besides the obvious one of getting an online estimate when the data arrive sequentially): they can be much more memory friendly in a Python context (for example, if you want to compute statistics for billions of samples, you could do it in mini-batches, and an incremental framework can help here), and they can often converge faster than an offline version if you have all the data. I am not yet clear on how pervasive those techniques are - I have looked at several papers which prove the convergence of several well-known algorithms, and implemented some of them (in particular an online EM algorithm for online estimation of mixtures of Gaussians, with Bayesian variations for sequential model comparison), and I would have expected them to be better known. I may just not be that familiar with the concerned fields, though. cheers, David From nwagner at iam.uni-stuttgart.de Thu Jan 22 05:15:48 2009 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Thu, 22 Jan 2009 11:15:48 +0100 Subject: [Numpy-discussion] array manipulation Message-ID: Hi all, what is the best way to check if the entries (integers) of an array are stored in ascending order? Nils From cimrman3 at ntc.zcu.cz Thu Jan 22 05:23:43 2009 From: cimrman3 at ntc.zcu.cz (Robert Cimrman) Date: Thu, 22 Jan 2009 11:23:43 +0100 Subject: [Numpy-discussion] array manipulation In-Reply-To: References: Message-ID: <4978492F.10204@ntc.zcu.cz> Nils Wagner wrote: > Hi all, > > what is the best way to check if the entries (integers) > of an array are stored in ascending order? Hi Nils, Try np.alltrue( ar[1:] > ar[:-1] ). r. 
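(For the record, two equivalent spellings; the array here is made up, and np.diff makes the strict vs. non-strict distinction explicit:)

import numpy as np

ar = np.array([0, 1, 1, 3])
print np.alltrue(ar[1:] > ar[:-1])  # False: this is a *strictly* ascending test
print np.all(np.diff(ar) >= 0)      # True: non-decreasing, repeats allowed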
From nwagner at iam.uni-stuttgart.de Thu Jan 22 05:32:28 2009 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Thu, 22 Jan 2009 11:32:28 +0100 Subject: [Numpy-discussion] array manipulation In-Reply-To: <4978492F.10204@ntc.zcu.cz> References: <4978492F.10204@ntc.zcu.cz> Message-ID: On Thu, 22 Jan 2009 11:23:43 +0100 Robert Cimrman wrote: > Nils Wagner wrote: >> Hi all, >> >> what is the best way to check if the entries (integers) >> of an array are stored in ascending order? > > Hi Nils, > > Try np.alltrue( ar[1:] > ar[:-1] ). > > r. Thank you! Nils From nwagner at iam.uni-stuttgart.de Thu Jan 22 06:49:48 2009 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Thu, 22 Jan 2009 12:49:48 +0100 Subject: [Numpy-discussion] header option for savetxt Message-ID: Hi all, Is it possible to add a header option for savetxt? Nils From chris at simplistix.co.uk Thu Jan 22 07:02:01 2009 From: chris at simplistix.co.uk (Chris Withers) Date: Thu, 22 Jan 2009 12:02:01 +0000 Subject: [Numpy-discussion] Tutorial on working with Excel files in Python (without COM and cross platform!) at PyConUS 2009 Message-ID: <49786039.2020101@simplistix.co.uk> Hi All, Too many people in the Python community think the only way to work with Excel files in Python is using COM on Windows. To try and correct this, I'm giving a tutorial at this year's PyCon in Chicago on Wednesday, 25th March that will cover working with Excel files in Python using the pure-python libraries xlrd, xlwt and xlutils. I'll be looking to cover:

- Reading Excel Files
  Including formatting, unicode, dates and formulae.

- Writing Excel Files
  Including formatting with easyxf and things like freeze panes, print areas, etc.

- Filtering Excel Files
  A run through of the structure of xlutils.filter and some examples to show you how it works.

- Workshop for your problems

I'm hoping anyone who attends will get a lot out of this! If you're planning on attending and have a particular problem you'd like to work on in this part of the tutorial, please drop me an email and I'll try and make sure I come prepared! All you need for the tutorial is a working knowledge of Excel and Python, with a laptop as an added benefit, and to be at PyCon this year: http://us.pycon.org I look forward to seeing you all there! Chris -- Simplistix - Content Management, Zope & Python Consulting - http://www.simplistix.co.uk From dsdale24 at gmail.com Thu Jan 22 09:15:37 2009 From: dsdale24 at gmail.com (Darren Dale) Date: Thu, 22 Jan 2009 09:15:37 -0500 Subject: [Numpy-discussion] numpy.array and subok kwarg Message-ID: Hello, I have a test script:

import numpy as np

class MyArray(np.ndarray):
    __array_priority__ = 20

    def __new__(cls):
        return np.asarray(1).view(cls).copy()

    def __repr__(self):
        return 'my_array'

    __str__ = __repr__

    def __mul__(self, other):
        return super(MyArray, self).__mul__(other)

    def __rmul__(self, other):
        return super(MyArray, self).__rmul__(other)

mine = MyArray()
print type(np.array(mine, dtype='f'))

The type returned by np.array is ndarray, unless I specifically set subok=True, in which case I get a MyArray. The default value of subok is True, so I don't understand why I have to specify subok unless I want it to be False. Is my subclass missing something important? Thanks, Darren -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From pgmdevlist at gmail.com Thu Jan 22 10:53:45 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Thu, 22 Jan 2009 10:53:45 -0500 Subject: [Numpy-discussion] numpy.array and subok kwarg In-Reply-To: References: Message-ID: <29F30CE2-DED7-4A3C-8C1F-3057205E0A74@gmail.com> Darren, > The type returned by np.array is ndarray, unless I specifically set > subok=True, in which case I get a MyArray. The default value of > subok is True, so I don't understand why I have to specify subok > unless I want it to be False. Is my subclass missing something > important? Blame the doc: the default for subok in array is False, as is explicit in the _array_fromobject C function (in multiarray). So no, you're not doing anything wrong. Note that by default subok=True for numpy.ma.array. From reakinator at gmail.com Thu Jan 22 11:45:51 2009 From: reakinator at gmail.com (Rich E) Date: Thu, 22 Jan 2009 17:45:51 +0100 Subject: [Numpy-discussion] default float type of array not accepted by SWIG wrapped C functions Message-ID: Hi all, I have a SWIG wrapped C library that uses 32bit floating point arrays, using the numpy.i typemapping system for passing the arrays. For every array that I make, I have to convert it using astype('float32'), else Python complains that I tried to pass a double-precision array. Is there any way to set the default floating point precision to 32bit, in Python or in the SWIG interface file? regards, Rich From wfspotz at sandia.gov Thu Jan 22 12:09:01 2009 From: wfspotz at sandia.gov (Spotz, William F) Date: Thu, 22 Jan 2009 10:09:01 -0700 Subject: [Numpy-discussion] default float type of array not accepted by SWIG wrapped C functions In-Reply-To: References: Message-ID: <875E04698300DB4FA52B4219ABA6039B126E9D05BD@ES02SNLNT.srn.sandia.gov> Rich, Basic Python only supports double precision floats, so that is not an option. NumPy does not have, as far as I know, a way to set the default precision, although it might be a reasonable request. As for the SWIG interface file, almost anything is possible. Can you give an example of a function prototype you are wrapping, the %apply directive you use and an example of Python code accessing it? -Bill ________________________________________ From: numpy-discussion-bounces at scipy.org [numpy-discussion-bounces at scipy.org] On Behalf Of Rich E [reakinator at gmail.com] Sent: Thursday, January 22, 2009 11:45 AM To: Discussion of Numerical Python Subject: [Numpy-discussion] default float type of array not accepted by SWIG wrapped C functions Hi all, I have a SWIG wrapped C library that uses 32bit floating point arrays, using the numpy.i typemapping system for passing the arrays. For every array that I make, I have to convert it using astype('float32'), else Python complains that I tried to pass a double-precision array. Is there any way to set the default floating point precision to 32bit, in Python or in the SWIG interface file? 
regards, Rich _______________________________________________ Numpy-discussion mailing list Numpy-discussion at scipy.org http://projects.scipy.org/mailman/listinfo/numpy-discussion From nwagner at iam.uni-stuttgart.de Thu Jan 22 12:27:47 2009 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Thu, 22 Jan 2009 18:27:47 +0100 Subject: [Numpy-discussion] Failures in test_recfunctions.py Message-ID: >>> numpy.__version__ '1.3.0.dev6331' ====================================================================== FAIL: Test the ignoremask option of find_duplicates ---------------------------------------------------------------------- Traceback (most recent call last): File "/home/nwagner/local/lib64/python2.6/site-packages/numpy/lib/tests/test_recfunctions.py", line 186, in test_find_duplicates_ignoremask assert_equal(test[-1], control) File "/home/nwagner/local/lib64/python2.6/site-packages/numpy/ma/testutils.py", line 121, in assert_equal return assert_array_equal(actual, desired, err_msg) File "/home/nwagner/local/lib64/python2.6/site-packages/numpy/ma/testutils.py", line 193, in assert_array_equal header='Arrays are not equal') File "/home/nwagner/local/lib64/python2.6/site-packages/numpy/ma/testutils.py", line 186, in assert_array_compare verbose=verbose, header=header) File "/home/nwagner/local/lib64/python2.6/site-packages/numpy/testing/utils.py", line 295, in assert_array_compare raise AssertionError(msg) AssertionError: Arrays are not equal (mismatch 33.3333333333%) x: array([0, 1, 3, 4, 2, 6]) y: array([0, 1, 3, 4, 6, 2]) ====================================================================== FAIL: Test zip_descr ---------------------------------------------------------------------- Traceback (most recent call last): File "/home/nwagner/local/lib64/python2.6/site-packages/numpy/lib/tests/test_recfunctions.py", line 34, in test_zip_descr np.dtype([('', ' From pgmdevlist at gmail.com Thu Jan 22 12:53:39 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Thu, 22 Jan 2009 12:53:39 -0500 Subject: [Numpy-discussion] Failures in test_recfunctions.py In-Reply-To: References: Message-ID: <56A4AC64-CDD0-425E-83BB-F9820212142A@gmail.com> Interesting. The tests pass on my machine OS X, Python version 2.5.4 (r254:67916, Dec 29 2008, 17:02:44) [GCC 4.0.1 (Apple Inc. build 5488)] nose version 0.10.4 For > File > "/home/nwagner/local/lib64/python2.6/site-packages/numpy/lib/tests/ > test_recfunctions.py", > line 34, in test_zip_descr > np.dtype([('', ' > ====================================================================== > FAIL: Test the ignoremask option of find_duplicates > ---------------------------------------------------------------------- > Traceback (most recent call last): > File > "/home/nwagner/local/lib64/python2.6/site-packages/numpy/lib/tests/ > test_recfunctions.py", > line 186, in test_find_duplicates_ignoremask > assert_equal(test[-1], control) > > (mismatch 33.3333333333%) > x: array([0, 1, 3, 4, 2, 6]) > y: array([0, 1, 3, 4, 6, 2]) > there's obviously a machine-dependent element somewhere. I'd blame argsort: the last 2 indices that are switched correspond to the masked elements in the input of the test. Note that the result is basically correct. I should have access to a linux box, I'll see what I can do. 
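(If the flipped indices do come from unstable sorting of the tied masked entries, a stable sort kind should make the test platform-independent; a small made-up illustration, assuming that is indeed the cause:)

import numpy as np

a = np.array([2, 0, 2, 1])
print np.argsort(a)                    # default quicksort: order of the tied 2's is unspecified
print np.argsort(a, kind='mergesort')  # mergesort is stable: ties keep input order -> [1 3 0 2]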
From nwagner at iam.uni-stuttgart.de Thu Jan 22 13:31:09 2009 From: nwagner at iam.uni-stuttgart.de (Nils Wagner) Date: Thu, 22 Jan 2009 19:31:09 +0100 Subject: [Numpy-discussion] Failures in test_recfunctions.py In-Reply-To: <56A4AC64-CDD0-425E-83BB-F9820212142A@gmail.com> References: <56A4AC64-CDD0-425E-83BB-F9820212142A@gmail.com> Message-ID: On Thu, 22 Jan 2009 12:53:39 -0500 Pierre GM wrote: > Interesting. > The tests pass on my machine > OS X, > Python version 2.5.4 (r254:67916, Dec 29 2008, 17:02:44) > [GCC 4.0.1 > (Apple Inc. build 5488)] > nose version 0.10.4 > > For >> File >> "/home/nwagner/local/lib64/python2.6/site-packages/numpy/lib/tests/ >> test_recfunctions.py", >> line 34, in test_zip_descr >> np.dtype([('', ' > I guess I can change ' > For: >> >> ====================================================================== >> FAIL: Test the ignoremask option of find_duplicates >> ---------------------------------------------------------------------- >> Traceback (most recent call last): >> File >> "/home/nwagner/local/lib64/python2.6/site-packages/numpy/lib/tests/ >> test_recfunctions.py", >> line 186, in test_find_duplicates_ignoremask >> assert_equal(test[-1], control) >> >> (mismatch 33.3333333333%) >> x: array([0, 1, 3, 4, 2, 6]) >> y: array([0, 1, 3, 4, 6, 2]) >> > > there's obviously a machine-dependent element somewhere. > I'd blame > argsort: the last 2 indices that are switched correspond > to the masked > elements in the input of the test. Note that the result > is basically > correct. > I should have access to a linux box, I'll see what I can > do. > Hi Pierre, Thank you. Works for me. Ran 1881 tests in 12.565s OK (KNOWNFAIL=9) Nils From pgmdevlist at gmail.com Thu Jan 22 13:40:32 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Thu, 22 Jan 2009 13:40:32 -0500 Subject: [Numpy-discussion] Failures in test_recfunctions.py In-Reply-To: References: <56A4AC64-CDD0-425E-83BB-F9820212142A@gmail.com> Message-ID: <9F1B944B-A9B8-410F-A3A5-DFA66ED34099@gmail.com> On Jan 22, 2009, at 1:31 PM, Nils Wagner wrote: >> > Hi Pierre, > > Thank you. Works for me. You're welcome, thanks for reporting! From haase at msg.ucsf.edu Thu Jan 22 16:37:45 2009 From: haase at msg.ucsf.edu (Sebastian Haase) Date: Thu, 22 Jan 2009 22:37:45 +0100 Subject: [Numpy-discussion] have to use matlab: what is the equiv. to pdb.pm() Message-ID: Hi, I have to debug a matlab program -- is there a way to do "post mortem" debugging? (I don't even know if this term even exists outside of python ;-) ((I want to know some local variable values and find out why our program crashes when "there are no particles found"...)) Thanks, Sebastian Haase PS: This is obviously somewhat OT From matthew.brett at gmail.com Thu Jan 22 17:47:43 2009 From: matthew.brett at gmail.com (Matthew Brett) Date: Thu, 22 Jan 2009 14:47:43 -0800 Subject: [Numpy-discussion] have to use matlab: what is the equiv. to pdb.pm() In-Reply-To: References: Message-ID: <1e2af89e0901221447o157d1490ka3eac7d418cb3039@mail.gmail.com> Did you say "matlab"? ;-) > I have to debug a matlab program -- is there a way to do "post > mortem" debugging? 
> (I don't even know if this term even exists outside of python ;-) > ((I want to know some local variable values and find out why our > program crashes when "there are no particles found"...)) I think you want: >> dbstop if error >> my_script Best, Matthew From wesmckinn at gmail.com Thu Jan 22 18:00:52 2009 From: wesmckinn at gmail.com (Wes McKinney) Date: Thu, 22 Jan 2009 18:00:52 -0500 Subject: [Numpy-discussion] python numpy code many times slower than c++ In-Reply-To: <3d375d730901212352l112e8be8q89fd636696869d5a@mail.gmail.com> References: <3d375d730901201822g5a7e5e9fi3d341502b0d59878@mail.gmail.com> <3d375d730901210023u56a260a9k24646e3fb9f41d70@mail.gmail.com> <6c476c8a0901211013k42858881rc9b3703a341b415f@mail.gmail.com> <3d375d730901211336l55d13ffev35f6980cef57be35@mail.gmail.com> <789d27b10901212346g2a23c28kebf0af9f229f5840@mail.gmail.com> <3d375d730901212352l112e8be8q89fd636696869d5a@mail.gmail.com> Message-ID: <6c476c8a0901221500h4b950f75ob1508ef6fbcf56c5@mail.gmail.com>

import cProfile

def f():
    pass

def g():
    for i in xrange(1000000):
        f()

cProfile.run("g()")

>test.py
1000003 function calls in 1.225 CPU seconds

Ordered by: standard name

 ncalls  tottime  percall  cumtime  percall filename:lineno(function)
      1    0.000    0.000    1.225    1.225 :1()
1000000    0.464    0.000    0.464    0.000 test.py:3(f)
      1    0.761    0.761    1.225    1.225 test.py:6(g)
      1    0.000    0.000    0.000    0.000 {method 'disable' of '_lsprof.Profiler' objects}

Running this with line_profiler:

Timer unit: 2.9485e-010 s

File: test.py
Function: g at line 9
Total time: 0.855075 s

Line #      Hits         Time  Per Hit   % Time  Line Contents
==============================================================
     9                                           @profiler
    10                                           def g():
    11   1000001   1844697930   1844.7     63.6      for i in xrange(1000000):
    12   1000000   1055333053   1055.3     36.4          f()

Which is what I would expect. Hmm On Thu, Jan 22, 2009 at 2:52 AM, Robert Kern wrote: > On Thu, Jan 22, 2009 at 01:46, Hanni Ali wrote: > > I have been using your profiler extensively, and it has contributed to > > significant improvements in the application I work on, largely because the > > line-by-line breakdown makes it easy to select the next part of the code > > to optimize. So firstly, many thanks for writing it. > > My pleasure. > > > However, back to my point: Wes, I have also experienced timing oddities, > > in particular on virtual machines (MS Hyper-V has very poor processor > > timings; the older MS VM works fine though). I believe the negative > > timings arise when the CPU (be it virtual or possibly physical) deviates > > from its standard performance, or rather from the timer unit taken > > initially. Would this make sense to you, Robert? > > Can you try using cProfile with lots of calls to empty functions? I'm > using the same timer functions as cProfile. > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." > -- Umberto Eco > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From robert.kern at gmail.com Thu Jan 22 18:03:35 2009 From: robert.kern at gmail.com (Robert Kern) Date: Thu, 22 Jan 2009 17:03:35 -0600 Subject: [Numpy-discussion] python numpy code many times slower than c++ In-Reply-To: <6c476c8a0901221500h4b950f75ob1508ef6fbcf56c5@mail.gmail.com> References: <3d375d730901201822g5a7e5e9fi3d341502b0d59878@mail.gmail.com> <3d375d730901210023u56a260a9k24646e3fb9f41d70@mail.gmail.com> <6c476c8a0901211013k42858881rc9b3703a341b415f@mail.gmail.com> <3d375d730901211336l55d13ffev35f6980cef57be35@mail.gmail.com> <789d27b10901212346g2a23c28kebf0af9f229f5840@mail.gmail.com> <3d375d730901212352l112e8be8q89fd636696869d5a@mail.gmail.com> <6c476c8a0901221500h4b950f75ob1508ef6fbcf56c5@mail.gmail.com> Message-ID: <3d375d730901221503ufd3fc6eu513c9e82d068c36a@mail.gmail.com> On Thu, Jan 22, 2009 at 17:00, Wes McKinney wrote: > import cProfile > > def f(): > pass > > def g(): > for i in xrange(1000000): > f() > > cProfile.run("g()") > >>test.py > 1000003 function calls in 1.225 CPU seconds > > Ordered by: standard name > > ncalls tottime percall cumtime percall filename:lineno(function) > 1 0.000 0.000 1.225 1.225 :1() > 1000000 0.464 0.000 0.464 0.000 test.py:3(f) > 1 0.761 0.761 1.225 1.225 test.py:6(g) > 1 0.000 0.000 0.000 0.000 {method 'disable' of > '_lsprof.Profiler' objects} > > Running this with line_profiler: > > Timer unit: 2.9485e-010 s > > File: test.py > Function: g at line 9 > Total time: 0.855075 s > > Line # Hits Time Per Hit % Time Line Contents > ============================================================== > 9 @profiler > 10 def g(): > 11 1000001 1844697930 1844.7 63.6 for i in > xrange(1000000): > 12 1000000 1055333053 1055.3 36.4 f() > > Which is what I would expect. Hmm What platform are you on? -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." 
-- Umberto Eco From wesmckinn at gmail.com Thu Jan 22 18:09:05 2009 From: wesmckinn at gmail.com (Wes McKinney) Date: Thu, 22 Jan 2009 18:09:05 -0500 Subject: [Numpy-discussion] python numpy code many times slower than c++ In-Reply-To: <3d375d730901221503ufd3fc6eu513c9e82d068c36a@mail.gmail.com> References: <3d375d730901201822g5a7e5e9fi3d341502b0d59878@mail.gmail.com> <3d375d730901210023u56a260a9k24646e3fb9f41d70@mail.gmail.com> <6c476c8a0901211013k42858881rc9b3703a341b415f@mail.gmail.com> <3d375d730901211336l55d13ffev35f6980cef57be35@mail.gmail.com> <789d27b10901212346g2a23c28kebf0af9f229f5840@mail.gmail.com> <3d375d730901212352l112e8be8q89fd636696869d5a@mail.gmail.com> <6c476c8a0901221500h4b950f75ob1508ef6fbcf56c5@mail.gmail.com> <3d375d730901221503ufd3fc6eu513c9e82d068c36a@mail.gmail.com> Message-ID: <6c476c8a0901221509w7f454840j51eeec98fdac4c16@mail.gmail.com> Windows XP, Pentium D, Python 2.5.2 On Thu, Jan 22, 2009 at 6:03 PM, Robert Kern wrote: > On Thu, Jan 22, 2009 at 17:00, Wes McKinney wrote: > > import cProfile > > > > def f(): > > pass > > > > def g(): > > for i in xrange(1000000): > > f() > > > > cProfile.run("g()") > > > >>test.py > > 1000003 function calls in 1.225 CPU seconds > > > > Ordered by: standard name > > > > ncalls tottime percall cumtime percall filename:lineno(function) > > 1 0.000 0.000 1.225 1.225 :1() > > 1000000 0.464 0.000 0.464 0.000 test.py:3(f) > > 1 0.761 0.761 1.225 1.225 test.py:6(g) > > 1 0.000 0.000 0.000 0.000 {method 'disable' of > > '_lsprof.Profiler' objects} > > > > Running this with line_profiler: > > > > Timer unit: 2.9485e-010 s > > > > File: test.py > > Function: g at line 9 > > Total time: 0.855075 s > > > > Line # Hits Time Per Hit % Time Line Contents > > ============================================================== > > 9 @profiler > > 10 def g(): > > 11 1000001 1844697930 1844.7 63.6 for i in > > xrange(1000000): > > 12 1000000 1055333053 1055.3 36.4 f() > > > > Which is what I would expect. Hmm > > What platform are you on? > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." > -- Umberto Eco > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From rw247 at astro.columbia.edu Thu Jan 22 19:28:37 2009 From: rw247 at astro.columbia.edu (Ross Williamson) Date: Fri, 23 Jan 2009 13:28:37 +1300 Subject: [Numpy-discussion] range in numpy where Message-ID: <6EE54750-9AEB-40A3-AD35-D2C1063E0770@astro.columbia.edu> Hi All I want to get the indices of values in an array. Normally WHERE works fine for one conditional statement but it does not work for two - i.e. a = array([0,1,2,3,4,5,6,7,8,9]) ind, = where(a > 5) Works fine but if I wanted: ind = where((a > 5) and (a<8)) Then it bugs out with the following message: "The truth value of an array with more than one element is ambiguous" Any ideas? 
Cheers Ross From efiring at hawaii.edu Thu Jan 22 19:57:02 2009 From: efiring at hawaii.edu (Eric Firing) Date: Thu, 22 Jan 2009 14:57:02 -1000 Subject: [Numpy-discussion] range in numpy where In-Reply-To: <6EE54750-9AEB-40A3-AD35-D2C1063E0770@astro.columbia.edu> References: <6EE54750-9AEB-40A3-AD35-D2C1063E0770@astro.columbia.edu> Message-ID: <497915DE.809@hawaii.edu> Ross Williamson wrote: > Hi All > > I want to get the indices of values in an array. Normally WHERE > works fine for one conditional statement but it does not work for two > - i.e. > > a = array([0,1,2,3,4,5,6,7,8,9]) > > ind, = where(a > 5) > > Works fine but if I wanted: > > ind = where((a > 5) and (a<8)) Python/numpy wart: for fairly fundamental Python reasons, "and" does not work elementwise, so you have to use the "&" operator instead: ind = where((a > 5) & (a<8)) Same for "or" and "|". Parentheses as you have them are essential because of the precedence of the bit operators. Eric > > Then it bugs out with the following message: > > "The truth value of an array with more than one element is ambiguous" > > Any ideas? > > Cheers > > Ross > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From strawman at astraw.com Thu Jan 22 20:38:41 2009 From: strawman at astraw.com (Andrew Straw) Date: Thu, 22 Jan 2009 17:38:41 -0800 Subject: [Numpy-discussion] N-D array interface page is out of date Message-ID: <49791FA1.3020803@astraw.com> Hi, I just noticed that the N-D array interface page is outdated and doesn't mention the buffer interface that is standard with Python 2.6 and Python 3.0: http://numpy.scipy.org/array_interface.shtml This page is linked to from http://numpy.scipy.org/ I suggest, at the minimum, modifying the page with really annoying blinking red letters at the top (or other suitable warning) that this is deprecated and that people should use http://www.python.org/dev/peps/pep-3118/ instead. From scott.sinclair.za at gmail.com Fri Jan 23 00:44:11 2009 From: scott.sinclair.za at gmail.com (Scott Sinclair) Date: Fri, 23 Jan 2009 07:44:11 +0200 Subject: [Numpy-discussion] numpy.array and subok kwarg In-Reply-To: <29F30CE2-DED7-4A3C-8C1F-3057205E0A74@gmail.com> References: <29F30CE2-DED7-4A3C-8C1F-3057205E0A74@gmail.com> Message-ID: <6a17e9ee0901222144k4dc4f4c5o38d883602d85218@mail.gmail.com> > 2009/1/22 Pierre GM : > Darren, > > >> The type returned by np.array is ndarray, unless I specifically set >> subok=True, in which case I get a MyArray. The default value of >> subok is True, so I don't understand why I have to specify subok >> unless I want it to be False. Is my subclass missing something >> important? > > Blame the doc: the default for subok in array is False, as is explicit in > the _array_fromobject C function (in multiarray). So no, you're not > doing anything wrong. Note that by default subok=True for > numpy.ma.array. Corrected in the doc-editor. Cheers, Scott From sole at esrf.fr Fri Jan 23 02:11:21 2009 From: sole at esrf.fr (V. Armando Sole) Date: Fri, 23 Jan 2009 08:11:21 +0100 Subject: [Numpy-discussion] Numpy unexpected (for me) behaviour Message-ID: <5.2.0.9.1.20090123075859.03438b10@pop.esrf.fr> Hello, In an effort to eliminate for loops, I have arrived at the following situation. Through vectorial logical operations I generate a set of indices for which the contents of an array have to be incremented. 
My problem can be reduced to the following:

#This works
import numpy
a = numpy.zeros(10)
b = numpy.ones(4, numpy.int)

for i in b:
    a[i] += 1
#a[1] contains 4 at the end

#This does not work
import numpy
a = numpy.zeros(10)
b = numpy.ones(4, numpy.int)
a[b] += 1
#a[1] contains 1 at the end

Is that a bug or a feature? Is there a way I can achieve the first result without a for loop? In my application the difference is a factor 10 in execution time (1000 seconds instead of 100 ...) Thanks, Armando From matthew.brett at gmail.com Fri Jan 23 02:39:16 2009 From: matthew.brett at gmail.com (Matthew Brett) Date: Fri, 23 Jan 2009 02:39:16 -0500 Subject: [Numpy-discussion] Numpy unexpected (for me) behaviour In-Reply-To: <5.2.0.9.1.20090123075859.03438b10@pop.esrf.fr> References: <5.2.0.9.1.20090123075859.03438b10@pop.esrf.fr> Message-ID: <1e2af89e0901222339y75526929tdb63c0c4fb18484@mail.gmail.com> Hi, > #This does not work > import numpy > a=numpy.zeros(10) > b=numpy.ones(4, numpy.int) > a[b] += 1 The problem here is that you are setting a[1] to a[1]+1. I think you want:

import numpy
a = numpy.zeros(10)
b = numpy.ones(4, numpy.bool)
a[b] += 1

Best, Matthew From robert.kern at gmail.com Fri Jan 23 02:42:40 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 23 Jan 2009 01:42:40 -0600 Subject: [Numpy-discussion] Numpy unexpected (for me) behaviour In-Reply-To: <1e2af89e0901222339y75526929tdb63c0c4fb18484@mail.gmail.com> References: <5.2.0.9.1.20090123075859.03438b10@pop.esrf.fr> <1e2af89e0901222339y75526929tdb63c0c4fb18484@mail.gmail.com> Message-ID: <3d375d730901222342p4ef7d902hc747b2a587d5a335@mail.gmail.com> On Fri, Jan 23, 2009 at 01:39, Matthew Brett wrote: > Hi, > >> #This does not work >> import numpy >> a=numpy.zeros(10) >> b=numpy.ones(4, numpy.int) >> a[b] += 1 > > The problem here is that you are setting a[1] to a[1]+1. > > I think you want: > > import numpy > a=numpy.zeros(10) > b=numpy.ones(4, numpy.bool) > a[b] += 1 Judging from his for loop, he does want the integer array. He's doing something like histogramming. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From robert.kern at gmail.com Fri Jan 23 02:44:14 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 23 Jan 2009 01:44:14 -0600 Subject: [Numpy-discussion] Numpy unexpected (for me) behaviour In-Reply-To: <5.2.0.9.1.20090123075859.03438b10@pop.esrf.fr> References: <5.2.0.9.1.20090123075859.03438b10@pop.esrf.fr> Message-ID: <3d375d730901222344g3c7274fdg183c8356885915b1@mail.gmail.com> On Fri, Jan 23, 2009 at 01:11, V. Armando Sole wrote: > Hello, > > In an effort to eliminate for loops, I have arrived at the following situation. > > Through vectorial logical operations I generate a set of indices for which > the contents of an array have to be incremented. My problem can be reduced > to the following: > > #This works > import numpy > a=numpy.zeros(10) > b=numpy.ones(4, numpy.int) > > for i in b: > a[i] += 1 > #a[1] contains 4 at the end > > > #This does not work > import numpy > a=numpy.zeros(10) > b=numpy.ones(4, numpy.int) > a[b] += 1 > > #a[1] contains 1 at the end > > Is that a bug or a feature? It is an inevitable consequence of several features interacting together. Basically, Python expands "a[b] += 1" into this:

c = a[b]
d = c.__iadd__(1)
a[b] = d

The array c doesn't know that it was created by indexing a, so it can't do the accumulation you want. 
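(The same expansion, spelled out with Armando's values; this only restates the explanation above:)

import numpy as np

a = np.zeros(10)
b = np.ones(4, np.int)   # four copies of the index 1

c = a[b]                 # gather: c is a new 4-element array of zeros
c += 1                   # the in-place add happens on the copy
a[b] = c                 # scatter: index 1 is written once, with the value 1.0

print a[1]               # 1.0, not 4.0: duplicate indices collapse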
> Is there a way I can achieve the first result > without a for loop? In my application the difference is a factor 10 in > execution time (1000 seconds instead of 100 ...)

In [6]: bincount?
Type:             builtin_function_or_method
Base Class:
String Form:
Namespace:        Interactive
Docstring:
    bincount(x,weights=None)

    Return the number of occurrences of each value in x.

    x must be a list of non-negative integers. The output, b[i],
    represents the number of times that i is found in x. If weights
    is specified, every occurrence of i at a position p contributes
    weights[p] instead of 1.

    See also: histogram, digitize, unique.

-- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From matthew.brett at gmail.com Fri Jan 23 02:52:40 2009 From: matthew.brett at gmail.com (Matthew Brett) Date: Thu, 22 Jan 2009 23:52:40 -0800 Subject: [Numpy-discussion] Numpy unexpected (for me) behaviour In-Reply-To: <3d375d730901222342p4ef7d902hc747b2a587d5a335@mail.gmail.com> References: <5.2.0.9.1.20090123075859.03438b10@pop.esrf.fr> <1e2af89e0901222339y75526929tdb63c0c4fb18484@mail.gmail.com> <3d375d730901222342p4ef7d902hc747b2a587d5a335@mail.gmail.com> Message-ID: <1e2af89e0901222352i1f447a97r692aeed3ff2f1250@mail.gmail.com> > Judging from his for loop, he does want the integer array. He's doing > something like histogramming. Yup, thanks, just goes to show that it's not good to send emails after a glass of wine late at night with slight jetlag. Matthew From sole at esrf.fr Fri Jan 23 03:10:41 2009 From: sole at esrf.fr (V. Armando Sole) Date: Fri, 23 Jan 2009 09:10:41 +0100 Subject: [Numpy-discussion] Numpy unexpected (for me) behaviour In-Reply-To: <3d375d730901222344g3c7274fdg183c8356885915b1@mail.gmail.co m> References: <5.2.0.9.1.20090123075859.03438b10@pop.esrf.fr> <5.2.0.9.1.20090123075859.03438b10@pop.esrf.fr> Message-ID: <5.2.0.9.1.20090123085405.033d3878@pop.esrf.fr> At 01:44 23/01/2009 -0600, Robert Kern wrote: >It is an inevitable consequence of several features interacting >together. Basically, Python expands "a[b] += 1" into this: > > c = a[b] > d = c.__iadd__(1) > a[b] = d > >The array c doesn't know that it was created by indexing a, >so it can't do the accumulation you want. Well, "inevitable" would not be the word I would use. I would have expected a different behaviour between a[b] = a[b] + 1 and a[b] += 1 In the second case Python (or numpy) does not need to generate an intermediate array and could be doing in-place operations. > > Is there a way I can achieve the first result > > without a for loop? In my application the difference is a factor 10 in > > execution time (1000 seconds instead of 100 ...) > >In [6]: bincount? >Type: builtin_function_or_method >Base Class: >String Form: >Namespace: Interactive >Docstring: > bincount(x,weights=None) > > Return the number of occurrences of each value in x. > > x must be a list of non-negative integers. The output, b[i], > represents the number of times that i is found in x. If weights > is specified, every occurrence of i at a position p contributes > weights[p] instead of 1. > > See also: histogram, digitize, unique. Indeed what I am doing is very close to histogramming. Unfortunately, what I have to add is not just one, but a value. I have a set of scattered points (x,y,z) and values corresponding to those points. 
My goal is to get a regular grid in which, in each voxel, I sum the values of the points falling in it. I guess I will have to write a tiny C extension. I had expected the += syntax would trigger the in-place operation.

Best regards,

Armando

From robert.kern at gmail.com  Fri Jan 23 03:17:03 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 23 Jan 2009 02:17:03 -0600
Subject: [Numpy-discussion] Numpy unexpected (for me) behaviour
In-Reply-To: <5.2.0.9.1.20090123085405.033d3878@pop.esrf.fr>
References: <5.2.0.9.1.20090123075859.03438b10@pop.esrf.fr>
	<5.2.0.9.1.20090123085405.033d3878@pop.esrf.fr>
Message-ID: <3d375d730901230017g36aca5bdx6cfdf6fe504df17e@mail.gmail.com>

On Fri, Jan 23, 2009 at 02:10, V. Armando Sole wrote:
> At 01:44 23/01/2009 -0600, Robert Kern wrote:
>>It is an inevitable consequence of several features interacting
>>together. Basically, Python expands "a[b] += 1" into this:
>>
>>  c = a[b]
>>  d = c.__iadd__(1)
>>  a[b] = d
>>
>>Basically, the array c doesn't know that it was created by indexing a,
>>so it can't do the accumulation you want.
>
> Well, inevitable would not be the word I would use. I would have expected a
> different behaviour between a[b] = a[b] + 1 and a[b] += 1
>
> In the second case python (or numpy) does not need to generate an
> intermediate array and could be doing in-place operations.

I'm sorry, but that is how Python implements a[b] += 1. We have no control over it. There is simply no hook to implement that statement as an atomic unit. Instead, we have three hooks, __getitem__, __setitem__, and __iadd__. __iadd__ is orthogonal to the former two, so it can't know that it is operating in that context.

>> > Is there a way I can achieve the first result
>> > without a for loop? In my application the difference is a factor 10 in
>> > execution time (1000 seconds instead of 100 ...)
>>
>>In [6]: bincount?
>>Type:           builtin_function_or_method
>>Base Class:     <type 'builtin_function_or_method'>
>>String Form:    <built-in function bincount>
>>Namespace:      Interactive
>>Docstring:
>>    bincount(x,weights=None)
>>
>>    Return the number of occurrences of each value in x.
>>
>>    x must be a list of non-negative integers. The output, b[i],
>>    represents the number of times that i is found in x. If weights
>>    is specified, every occurrence of i at a position p contributes
>>    weights[p] instead of 1.
>>
>>    See also: histogram, digitize, unique.
>
> Indeed what I am doing is very close to histogramming.
>
> Unfortunately what I have to add is not just one but a value. I have a set of
> scattered points (x,y,z) and values corresponding to those points. My goal
> is to get a regular grid in which, in each voxel, I sum the values of the
> points falling in it.

That's what the weights input is for.

> I guess I will have to write a tiny C extension. I
> had expected the += syntax would trigger the in-place operation.

It does, just a different one than you want.

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth."
-- Umberto Eco From robert.kern at gmail.com Fri Jan 23 03:19:59 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 23 Jan 2009 02:19:59 -0600 Subject: [Numpy-discussion] python numpy code many times slower than c++ In-Reply-To: <6c476c8a0901221509w7f454840j51eeec98fdac4c16@mail.gmail.com> References: <3d375d730901210023u56a260a9k24646e3fb9f41d70@mail.gmail.com> <6c476c8a0901211013k42858881rc9b3703a341b415f@mail.gmail.com> <3d375d730901211336l55d13ffev35f6980cef57be35@mail.gmail.com> <789d27b10901212346g2a23c28kebf0af9f229f5840@mail.gmail.com> <3d375d730901212352l112e8be8q89fd636696869d5a@mail.gmail.com> <6c476c8a0901221500h4b950f75ob1508ef6fbcf56c5@mail.gmail.com> <3d375d730901221503ufd3fc6eu513c9e82d068c36a@mail.gmail.com> <6c476c8a0901221509w7f454840j51eeec98fdac4c16@mail.gmail.com> Message-ID: <3d375d730901230019uf982fedr1ce359de8a389db5@mail.gmail.com> On Thu, Jan 22, 2009 at 17:09, Wes McKinney wrote: > Windows XP, Pentium D, Python 2.5.2 I can replicate the negative numbers on my Windows VM. I'll take a look at it. Wrote profile results to foo.py.lprof Timer unit: 4.17601e-010 s File: foo.py Function: f at line 1 Total time: -3.02963 s Line # Hits Time Per Hit % Time Line Contents ============================================================== 1 @profile 2 def f(): 3 1000001 -1456737621 -1456.7 20.1 for i in xrange(1000000): 4 1000000 -1540435131 -1540.4 21.2 1+1 5 1000000 -1522306067 -1522.3 21.0 1+1 6 1000000 -1177199444 -1177.2 16.2 1+1 7 1000000 -1558164209 -1558.2 21.5 1+1 -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From timmichelsen at gmx-topmail.de Thu Jan 22 11:10:33 2009 From: timmichelsen at gmx-topmail.de (Tim Michelsen) Date: Thu, 22 Jan 2009 17:10:33 +0100 Subject: [Numpy-discussion] coding style: citations In-Reply-To: <496A5CC0.8050605@american.edu> References: <496A5CC0.8050605@american.edu> Message-ID: Hello Allan, Stefan and others, did you already come to a conclusion regarding this cite topic? Did you try to run the bibtext extension for Sphinx? If so, please update the documentation guidelines. Regards, Timmie From stefan at sun.ac.za Fri Jan 23 07:39:05 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Fri, 23 Jan 2009 14:39:05 +0200 Subject: [Numpy-discussion] coding style: citations In-Reply-To: References: <496A5CC0.8050605@american.edu> Message-ID: <9457e7c80901230439h217be1f6g2905b584b2282982@mail.gmail.com> Hi Tim 2009/1/22 Tim Michelsen : > did you already come to a conclusion regarding this cite topic? > > Did you try to run the bibtext extension for Sphinx? I haven't tried it. One difficulty is that each docstring needs to be self-contained, i.e., it must include its own references. If you want to maintain a central reference list, this list must either be generated from the docstrings or, alternatively, the docstrings must be built from the list. I don't think the latter is an option really (since it would require massive merges whenever a reference is updated). What it then comes down to, is that there is work to be done in the documentation editor before we modify the standard. Unfortunately, I am flat broke when it comes to free time, but hopefully some other people have more time to invest in the issue. 
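Concretely, a self-contained docstring carries its own references inline, along these lines (a sketch only; the function and the reference entry are placeholders, not from any numpy module):

import numpy as np

def window_sum(x, width):
    """
    Sum `x` over a sliding window of the given width.

    Parameters
    ----------
    x : ndarray
        Input values.
    width : int
        Window width.

    References
    ----------
    .. [1] A. Author, "A placeholder reference kept inside the
           docstring itself", Journal of Examples, 1(1):1-10, 1900.
    """
    # A moving-window sum via convolution with a box of ones.
    return np.convolve(x, np.ones(width), mode='valid')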
Regards,
Stéfan

From faltet at pytables.org  Fri Jan 23 08:46:02 2009
From: faltet at pytables.org (Francesc Alted)
Date: Fri, 23 Jan 2009 14:46:02 +0100
Subject: [Numpy-discussion] ANN: Numexpr 1.1.1 available
Message-ID: <200901231446.02962.faltet@pytables.org>

==========================
 Announcing Numexpr 1.1.1
==========================

Numexpr is a fast numerical expression evaluator for NumPy. With it, expressions that operate on arrays (like "3*a+4*b") are accelerated and use less memory than doing the same calculation in Python.

This is a bug-fix release, where the following issues have been addressed:

- The case for multidimensional array operands is properly accelerated now. Added a new benchmark (based on a script provided by Andrew Collette, thanks!) for easily testing this case in the future. Closes #12.

- Added a fix to keep the caches in numexpr from growing too much. The dictionary caches are now always kept below 256 entries. Closes #11.

- The VERSION file is correctly copied now (support for it was not present in the 1.1 tar file, don't know exactly why). Closes #8.

- The package is now accessible from PyPI, so that people from 'evil' countries can access it too.

In case you want to know in more detail what has changed in this version, have a look at ``RELEASE_NOTES.txt`` in the tarball.

Where can I find Numexpr?
=========================

The project is hosted at Google code in:

http://code.google.com/p/numexpr/

And you can get the packages from PyPI as well:

http://pypi.python.org/pypi

How does it work?
=================

See:

http://code.google.com/p/numexpr/wiki/Overview

for a detailed description by the original author of Numexpr (David M. Cooke).

Share your experience
=====================

Let us know of any bugs, suggestions, gripes, kudos, etc. you may have.

Enjoy!

-- 
Francesc Alted

From dsdale24 at gmail.com  Fri Jan 23 11:53:00 2009
From: dsdale24 at gmail.com (Darren Dale)
Date: Fri, 23 Jan 2009 11:53:00 -0500
Subject: [Numpy-discussion] strange multiplication behavior with
	numpy.float64 and ndarray subclass
In-Reply-To: 
References: 
Message-ID: 

On Wed, Jan 21, 2009 at 1:07 PM, Darren Dale wrote:
>
> On Wed, Jan 21, 2009 at 12:26 PM, Pierre GM wrote:
>
>> > I don't understand why __array_priority__ is not being respected
>> > here. Ticket 826 lists the component as numpy.ma, it seems the
>> > problem is in numpy.core. I think the severity of the ticket should
>> > be increased. But I wasn't able to view the ticket, I keep getting an
>> > "internal server error".
>>
>> Ticket #826 bumped.
>>
>
> Just an additional bit of context. I'm working on a subclass that handles
> physical quantities, and issue 826 causes a quantity to be converted to a
> dimensionless magnitude.

I wonder if this issue is appearing in other places as well. Many of the ndarray methods work without modification on my Quantity subclass, but the methods that produce scalars do not. For instance, __getitem__ yields a dimensionless number when called with an integer index, but it yields another Quantity if called with a range, so I have to reimplement __getitem__ so it yields a quantity for single indices. tolist, min, max, and mean are the same way. Is there an ndarray attribute I should be using to tell the superclass what the desired type is (aside from __array_priority__)?

Thanks,
Darren

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From aisaac at american.edu  Fri Jan 23 11:58:36 2009
From: aisaac at american.edu (Alan G Isaac)
Date: Fri, 23 Jan 2009 11:58:36 -0500
Subject: [Numpy-discussion] coding style: citations
In-Reply-To: 
References: <496A5CC0.8050605@american.edu>
Message-ID: <4979F73C.40900@american.edu>

Tim Michelsen wrote:
> did you already come to a conclusion regarding this cite topic?
> Did you try to run the bibtext extension for Sphinx?
> If so, please update the documentation guidelines.

I hope we reached agreement that the documentation should use reST citations and not reST footnotes. You can use bib4txt to process *all* the modules Sphinx will process and extract the citation references and construct a reference list. I explained why it is undesirable to use `+` or `:` in the citation references.

I am not currently a Sphinx user. What I can offer is to make useful changes to bibstuff (including bib4txt) if these are specified.

Alan Isaac

From rmay31 at gmail.com  Fri Jan 23 16:31:23 2009
From: rmay31 at gmail.com (Ryan May)
Date: Fri, 23 Jan 2009 15:31:23 -0600
Subject: [Numpy-discussion] Pattern for reading non-simple binary files
Message-ID: <497A372B.7000808@gmail.com>

Hi,

I'm trying to read in data from a binary-formatted file. I have the data format (available at: http://www1.ncdc.noaa.gov/pub/data/documentlibrary/tddoc/td7000.pdf if you're really curious), but it's not what I would consider simple, with a lot of different blocks and messages, some that are optional and some that have different formats depending on the data type. My question is, has anyone dealt with data like this using numpy? Have you found a good pattern for how to construct a numpy dtype dynamically to decode the different parts of the file appropriately as you go along?

Any insight would be appreciated.

Ryan

-- 
Ryan May
Graduate Research Assistant
School of Meteorology
University of Oklahoma

From robert.kern at gmail.com  Fri Jan 23 17:07:54 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 23 Jan 2009 16:07:54 -0600
Subject: [Numpy-discussion] Pattern for reading non-simple binary files
In-Reply-To: <497A372B.7000808@gmail.com>
References: <497A372B.7000808@gmail.com>
Message-ID: <3d375d730901231407p61849805v9eff1bb8ab476b3e@mail.gmail.com>

On Fri, Jan 23, 2009 at 15:31, Ryan May wrote:
> Hi,
>
> I'm trying to read in data from a binary-formatted file. I have the data
> format (available at:
> http://www1.ncdc.noaa.gov/pub/data/documentlibrary/tddoc/td7000.pdf if you're
> really curious), but it's not what I would consider simple, with a lot of
> different blocks and messages, some that are optional and some that have
> different formats depending on the data type. My question is, has anyone dealt
> with data like this using numpy?

Yes!

> Have you found a good pattern for how to
> construct a numpy dtype dynamically to decode the different parts of the file
> appropriately as you go along?

I use mmap and create numpy arrays for each block using the ndarray constructor with the appropriate offset parameter. There isn't much of a pattern for constructing the dtypes except to use constructor functions.

Good luck!

-- 
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth."
-- Umberto Eco From efiring at hawaii.edu Fri Jan 23 17:17:44 2009 From: efiring at hawaii.edu (Eric Firing) Date: Fri, 23 Jan 2009 12:17:44 -1000 Subject: [Numpy-discussion] Pattern for reading non-simple binary files In-Reply-To: <497A372B.7000808@gmail.com> References: <497A372B.7000808@gmail.com> Message-ID: <497A4208.2060506@hawaii.edu> Ryan May wrote: > Hi, > > I'm trying to read in a data from a binary-formatted file. I have the data > format, (available at: > http://www1.ncdc.noaa.gov/pub/data/documentlibrary/tddoc/td7000.pdf if you're > really curious), but it's not what I would consider simple, with a lot of > different blocks and messages, some that are optional and some that have > different formats depending on the data type. My question is, has anyone dealt > with data like this using numpy? Have you found a good pattern for how to > construct a numpy dtype dynamically to decode the different parts of the file > appropriately as you go along? > > Any insight would be appreciated. > > Ryan > Ryan, http://currents.soest.hawaii.edu/hg/hgwebdir.cgi/pycurrents/file/d7c5c9aac32d/adcp/rdiraw.py#l1 This gives an example of reading several related and rather complex binary file types generated by (oceanographic) acoustic Doppler current profilers. I have not looked at the format you are dealing with, so I don't know if the methods I used are applicable to your case. Eric From millman at berkeley.edu Fri Jan 23 20:29:31 2009 From: millman at berkeley.edu (Jarrod Millman) Date: Fri, 23 Jan 2009 17:29:31 -0800 Subject: [Numpy-discussion] failure In-Reply-To: References: Message-ID: On Wed, Jan 21, 2009 at 10:53 AM, Gideon Simpson wrote: > ====================================================================== > FAIL: test_umath.TestComplexFunctions.test_against_cmath > ---------------------------------------------------------------------- > Traceback (most recent call last): > File "/usr/local/nonsystem/simpson/lib/python2.5/site-packages/nose/ > case.py", line 182, in runTest > self.test(*self.arg) > File "/usr/local/nonsystem/simpson/lib/python2.5/site-packages/ > numpy/core/tests/test_umath.py", line 268, in test_against_cmath > assert abs(a - b) < atol, "%s %s: %s; cmath: %s"%(fname,p,a,b) > AssertionError: arcsinh -2j: (-1.31695789692-1.57079632679j); cmath: > (1.31695789692-1.57079632679j) > > ---------------------------------------------------------------------- > Ran 1740 tests in 9.839s > > FAILED (KNOWNFAIL=1, failures=1) > > > How would you recommend I troubleshoot this? How seriously should I > take it? > > This is with a fresh Python 2.5.4 installation too. I think it is a problem with cmath and should probably be marked as a knownfailure: http://bugs.python.org/issue1381 From simpson at math.toronto.edu Sat Jan 24 11:21:31 2009 From: simpson at math.toronto.edu (Gideon Simpson) Date: Sat, 24 Jan 2009 11:21:31 -0500 Subject: [Numpy-discussion] numpy and the ACML Message-ID: Does anyone have a guide on how to get numpy to use the ACML as its blas/lapack backend? -gideon From njwilson23 at gmail.com Sat Jan 24 13:34:08 2009 From: njwilson23 at gmail.com (Nat Wilson) Date: Sat, 24 Jan 2009 13:34:08 -0500 Subject: [Numpy-discussion] Having trouble installing Numpy on OS X Message-ID: <4BC33BAA-17F6-4BA4-9452-43FDC39DAED9@gmail.com> Would anyone be willing to help me interpret an error while trying to build and install Numpy? I've searched around, and haven't seen this elsewhere. I've been running into this wall for about half the day now. 
I've tried reinstalling Python, using numpy 1.2.0 and 1.2.1.

I have Python 2.6.1, running on OS X 10.4.11, with a G4 PPC processor.

Here's the print out:

Ganymede:~/Desktop/numpy-1.2.1 username$ python setup.py build
Running from numpy source directory.
Traceback (most recent call last):
  File "setup.py", line 96, in <module>
    setup_package()
  File "setup.py", line 68, in setup_package
    from numpy.distutils.core import setup
  File "/Users/username/Desktop/numpy/numpy/distutils/__init__.py", line 6, in <module>
  File "/Users/username/Desktop/numpy/numpy/distutils/ccompiler.py", line 11, in <module>
  File "/Users/username/Desktop/numpy/numpy/distutils/log.py", line 7, in <module>
  File "/Users/username/Desktop/numpy/numpy/distutils/misc_util.py", line 8, in <module>
  File "/Library/Frameworks/Python.framework/Versions/2.6/lib/python2.6/tempfile.py", line 34, in <module>
    from random import Random as _Random
  File "/Library/Frameworks/Python.framework/Versions/2.6/lib/python2.6/random.py", line 871, in <module>
    _inst = Random()
  File "/Library/Frameworks/Python.framework/Versions/2.6/lib/python2.6/random.py", line 96, in __init__
    self.seed(x)
  File "/Library/Frameworks/Python.framework/Versions/2.6/lib/python2.6/random.py", line 115, in seed
    super(Random, self).seed(a)
SystemError: error return without exception set

Any ideas? I've had numpy/scipy installed in the past, but recently had to wipe everything and start from scratch. Everything should be pretty clean right now. Am I missing something obvious?

Thanks,
NJW

From nadavh at visionsense.com  Sat Jan 24 15:08:35 2009
From: nadavh at visionsense.com (Nadav Horesh)
Date: Sat, 24 Jan 2009 22:08:35 +0200
Subject: [Numpy-discussion] numpy and the ACML
References: 
Message-ID: <710F2847B0018641891D9A216027636029C3DF@ex3.envision.co.il>

You have to edit setup.cfg:

* add the directory where the acml libraries reside to the library dir path list
* add the acml libraries under the blas and lapack sections

  Nadav

-----Original Message-----
From: numpy-discussion-bounces at scipy.org on behalf of Gideon Simpson
Sent: Sat 24-January-09 18:21
To: Discussion of Numerical Python
Subject: [Numpy-discussion] numpy and the ACML

Does anyone have a guide on how to get numpy to use the ACML as its blas/lapack backend?

-gideon

_______________________________________________
Numpy-discussion mailing list
Numpy-discussion at scipy.org
http://projects.scipy.org/mailman/listinfo/numpy-discussion

-------------- next part --------------
A non-text attachment was scrubbed...
Name: winmail.dat
Type: application/ms-tnef
Size: 2878 bytes
Desc: not available
URL: 

From simpson at math.toronto.edu  Sat Jan 24 15:26:17 2009
From: simpson at math.toronto.edu (Gideon Simpson)
Date: Sat, 24 Jan 2009 15:26:17 -0500
Subject: [Numpy-discussion] numpy and the ACML
In-Reply-To: <710F2847B0018641891D9A216027636029C3DF@ex3.envision.co.il>
References: <710F2847B0018641891D9A216027636029C3DF@ex3.envision.co.il>
Message-ID: <4242995C-7284-4654-B858-3D6C36460D71@math.toronto.edu>

Nadav-

That doesn't quite seem to work for me. I added:

[blas_opt]
library_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/lib
include_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/include
libraries = acml

[lapack_opt]
library_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/lib
include_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/include
libraries = acml

to my site.cfg with no luck. Somewhere else, people indicated that the ACML lacked a CBLAS which was necessary to make this work.
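One quick way to see which BLAS/LAPACK numpy's configuration machinery actually finds, without watching a full build -- a sketch, assuming the numpy.distutils.system_info API (it honors the same site.cfg):

from numpy.distutils.system_info import get_info

# An empty dict means the library was not found; otherwise the dict
# lists the libraries and library_dirs that a build would use.
print(get_info('blas_opt'))
print(get_info('lapack_opt'))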
-gideon On Jan 24, 2009, at 3:08 PM, Nadav Horesh wrote: > You have setup.cfg: > * set add the directory where acml libraries reside to the library > dir path list > * add acmllibraries under blas and lapack sections > > Nadav > > > -----????? ??????----- > ???: numpy-discussion-bounces at scipy.org ??? Gideon Simpson > ????: ? 24-?????-09 18:21 > ??: Discussion of Numerical Python > ????: [Numpy-discussion] numpy and the ACML > > Does anyone have a guide on how to get numpy to use the ACML as its > blas/lapack backend? > > -gideon > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From ondrej at certik.cz Sat Jan 24 16:00:58 2009 From: ondrej at certik.cz (Ondrej Certik) Date: Sat, 24 Jan 2009 13:00:58 -0800 Subject: [Numpy-discussion] whohas command Message-ID: <85b5c3130901241300v4b9bfdactc2962a9460ee297b@mail.gmail.com> Hi, I just discovered a new whohas command in Debian, that can show a package in virtually all important distributions (Arch, Debian, Fedora, Gentoo, openSUSE, Slackware (and linuxpackages.net), Source Mage, Ubuntu, FreeBSD, NetBSD, OpenBSD, Fink and MacPorts distributions). So for example SymPy is in a lot of them already: $ whohas sympy Arch python-sympy-git 20090121-1 unsupported http://aur.archlinux.org/packages.php?ID=23341 MacPorts py-sympy 0.5.15 http://trac.macports.org/browser/trunk/dports/python/py-sympy/Portfile MacPorts py25-sympy 0.6.3 http://trac.macports.org/browser/trunk/dports/python/py25-sympy/Portfile FreeBSD py25-sympy 0.6.3 math http://www.freebsd.org/cgi/pds.cgi?ports/math/py25-sympy NetBSD py-sympy 0.5.15 math http://pkgsrc.se/math/py-sympy Ubuntu python-sympy 0.4.2-1 universe http://packages.ubuntu.com/gutsy/python-sympy Ubuntu python-sympy 0.5.8-1 universe http://packages.ubuntu.com/hardy/python-sympy Ubuntu python-sympy 0.5.15-1~hardy1 universe http://packages.ubuntu.com/hardy-backports/python-sympy Ubuntu python-sympy 0.5.15-1 universe http://packages.ubuntu.com/intrepid/python-sympy Ubuntu python-sympy 0.6.3-1 universe http://packages.ubuntu.com/jaunty/python-sympy Debian python-sympy 0.6.1-1 testing http://packages.debian.org/lenny/python-sympy Fink sympy-py24 0.5.6-1 http://pdb.finkproject.org/pdb/package.php/sympy-py24 Fink sympy-py25 0.5.6-1 http://pdb.finkproject.org/pdb/package.php/sympy-py25 Gentoo sympy 0.6.3 http://gentoo-portage.com/dev-python/sympy Gentoo sympy 0.6.2 http://gentoo-portage.com/dev-python/sympy openSUSE python-sympy 0.6.3 science http://packages.opensuse-community.org/packageinfo.jsp?checksum=a2258d3deaee5f411456f22ea1ab6bb7ec95e12c&distro=openSUSE_111 It's also interesting to click on the links to get to the pages of sympy for each distribution. 
And numpy: $ whohas numpy Arch python-numexpr 1.1.1-1 unsupported http://aur.archlinux.org/packages.php?ID=23380 Arch python-numpy 1.2.1-3 community http://aur.archlinux.org/packages.php?ID=8497 Arch scikits-audiolab-svn 1491-1 unsupported http://aur.archlinux.org/packages.php?ID=21018 FreeBSD py25-numpy 1.2.1,1 math http://www.freebsd.org/cgi/pds.cgi?ports/math/py25-numpy FreeBSD py25-symeig 1.4_1 math http://www.freebsd.org/cgi/pds.cgi?ports/math/py25-symeig NetBSD py-numpy 1.1.0 wip http://pkgsrc.se/wip/py-numpy NetBSD py-numpy 1.1.0 math http://pkgsrc.se/math/py-numpy Ubuntu python-numpy 1:1.0.1-8ubuntu1 universe http://packages.ubuntu.com/feisty/python-numpy Ubuntu python-numpy 1:1.0.3-1ubuntu2 universe http://packages.ubuntu.com/gutsy/python-numpy Ubuntu python-numpy 1:1.0.4-6ubuntu3 universe http://packages.ubuntu.com/hardy/python-numpy Ubuntu python-numpy 1:1.1.1-1 http://packages.ubuntu.com/intrepid/python-numpy Ubuntu python-numpy 1:1.1.1-2ubuntu1 http://packages.ubuntu.com/jaunty/python-numpy Ubuntu python-numpy-dbg 1:1.0.1-8ubuntu1 universe http://packages.ubuntu.com/feisty/python-numpy-dbg Ubuntu python-numpy-dbg 1:1.0.3-1ubuntu2 universe http://packages.ubuntu.com/gutsy/python-numpy-dbg Ubuntu python-numpy-dbg 1:1.0.4-6ubuntu3 universe http://packages.ubuntu.com/hardy/python-numpy-dbg Ubuntu python-numpy-dbg 1:1.1.1-1 http://packages.ubuntu.com/intrepid/python-numpy-dbg Ubuntu python-numpy-dbg 1:1.1.1-2ubuntu1 http://packages.ubuntu.com/jaunty/python-numpy-dbg Ubuntu python-numpy-dev 1:1.0.1-8ubuntu1 universe http://packages.ubuntu.com/feisty/python-numpy-dev Ubuntu python-numpy-dev 1:1.0.3-1ubuntu2 universe http://packages.ubuntu.com/gutsy/python-numpy-dev Ubuntu python-numpy-dev http://packages.ubuntu.com/hardy/python-numpy-dev Ubuntu python-numpy-doc 1:1.0.1-8ubuntu1 universe http://packages.ubuntu.com/feisty/python-numpy-doc Ubuntu python-numpy-doc 1:1.0.3-1ubuntu2 universe http://packages.ubuntu.com/gutsy/python-numpy-doc Ubuntu python-numpy-doc 1:1.0.4-6ubuntu3 universe http://packages.ubuntu.com/hardy/python-numpy-doc Ubuntu python-numpy-doc 1:1.1.1-1 http://packages.ubuntu.com/intrepid/python-numpy-doc Ubuntu python-numpy-doc 1:1.1.1-2ubuntu1 http://packages.ubuntu.com/jaunty/python-numpy-doc Ubuntu python-numpy-ext 1:1.0.1-8ubuntu1 universe http://packages.ubuntu.com/feisty/python-numpy-ext MacPorts py-numpy 1.2.1 http://trac.macports.org/browser/trunk/dports/python/py-numpy/Portfile MacPorts py25-numpy 1.2.1 http://trac.macports.org/browser/trunk/dports/python/py25-numpy/Portfile MacPorts py26-numpy 1.2.1 http://trac.macports.org/browser/trunk/dports/python/py26-numpy/Portfile Debian python-numpy 1:1.0.1-1 stable http://packages.debian.org/etch/python-numpy Debian python-numpy-dev 1:1.0.1-1 stable http://packages.debian.org/etch/python-numpy-dev Debian python-numpy-doc 1:1.0.1-1 stable http://packages.debian.org/etch/python-numpy-doc Debian python-numpy-ext 1:1.0.1-1 stable http://packages.debian.org/etch/python-numpy-ext Debian python-numpy 1:1.1.0-3 testing http://packages.debian.org/lenny/python-numpy Debian python-numpy-dbg 1:1.1.0-3 testing http://packages.debian.org/lenny/python-numpy-dbg Debian python-numpy-dev testing http://packages.debian.org/lenny/python-numpy-dev Debian python-numpy-doc 1:1.1.0-3 testing http://packages.debian.org/lenny/python-numpy-doc Debian python-numpy-ext 1:1.1.0-3 testing http://packages.debian.org/lenny/python-numpy-ext Fink dap-py25 2.2.6.6-1 http://pdb.finkproject.org/pdb/package.php/dap-py25 Fink dap-py26 2.2.6.6-1 
http://pdb.finkproject.org/pdb/package.php/dap-py26 Fink pytables-py24 2.0.3-1 http://pdb.finkproject.org/pdb/package.php/pytables-py24 Fink pytables-py25 2.0.3-1 http://pdb.finkproject.org/pdb/package.php/pytables-py25 Fink scipy-core-py23 1.1.1-1 http://pdb.finkproject.org/pdb/package.php/scipy-core-py23 Fink scipy-core-py24 1.1.1-1 http://pdb.finkproject.org/pdb/package.php/scipy-core-py24 Fink scipy-core-py25 1.1.1-1 http://pdb.finkproject.org/pdb/package.php/scipy-core-py25 Fink scipy-core-py26 1.1.1-1 http://pdb.finkproject.org/pdb/package.php/scipy-core-py26 Gentoo numpy 1.2.1 http://gentoo-portage.com/dev-python/numpy Gentoo numpy 1.1.1 http://gentoo-portage.com/dev-python/numpy Gentoo numpy 1.0.4-r2 http://gentoo-portage.com/dev-python/numpy openSUSE python-numpy 1.2.1 packman http://packages.opensuse-community.org/packageinfo.jsp?checksum=f3f9a6df3da6c6996c98ec15bb42373c4eeb6b2d&distro=openSUSE_111 openSUSE python-numpy-debuginfo 1.2.1 packman http://packages.opensuse-community.org/packageinfo.jsp?checksum=fc6af8293ae8f29fd40263dcee69374255d348a7&distro=openSUSE_111 openSUSE python-numpy-debugsource 1.2.1 packman http://packages.opensuse-community.org/packageinfo.jsp?checksum=68c2cade8c8f2e85c1b3a67bad39982ef1c14e0b&distro=openSUSE_111 openSUSE python-numpy 1.2.1 games http://packages.opensuse-community.org/packageinfo.jsp?checksum=dd9071d749be90039cf433498023f4370de62ea7&distro=openSUSE_111 openSUSE python-numpy 1.2.0 hamradio http://packages.opensuse-community.org/packageinfo.jsp?checksum=574963a98baede6f6628290431aeab75fcc6c967&distro=openSUSE_111 openSUSE python-numpy-devel 1.2.0 hamradio http://packages.opensuse-community.org/packageinfo.jsp?checksum=528c925b6969de7b3d43556da107568a4473123c&distro=openSUSE_111 openSUSE python-numpy 1.2.1 ferichter/o http://packages.opensuse-community.org/packageinfo.jsp?checksum=22490d0f9d03fb443f1c97d2b8ac7f49f950469d&distro=openSUSE_111 openSUSE python-numpy-devel 1.2.1 ferichter/o http://packages.opensuse-community.org/packageinfo.jsp?checksum=03018ce7d36dd90dc900b087f3f543fb0dc0192b&distro=openSUSE_111 openSUSE python-numpy 1.2.1 MadCAD/open http://packages.opensuse-community.org/packageinfo.jsp?checksum=73f10047cc2a49b41e56dd7fea84d3e5c9efd735&distro=openSUSE_111 openSUSE python-numpy-devel 1.2.1 MadCAD/open http://packages.opensuse-community.org/packageinfo.jsp?checksum=0642cae514f7dda2745086907e4a9cdc0b7e5b49&distro=openSUSE_111 openSUSE python-numpy 1.2.1 werner2101/ http://packages.opensuse-community.org/packageinfo.jsp?checksum=99c6af1da69823faf16738df79d320e12ed99a30&distro=openSUSE_111 openSUSE python-numpy-devel 1.2.1 werner2101/ http://packages.opensuse-community.org/packageinfo.jsp?checksum=0cd24f243e5d315ca0bd2a60032517888d12025f&distro=openSUSE_111 openSUSE python-numpy 1.2.1 science http://packages.opensuse-community.org/packageinfo.jsp?checksum=1f1197793b043115249652d9c88d20466e3855ea&distro=openSUSE_111 openSUSE python-numpy-devel 1.2.1 science http://packages.opensuse-community.org/packageinfo.jsp?checksum=4ad91bcb9f9caa4a4aa668ffdd9d0c5949d823d7&distro=openSUSE_111 openSUSE python-numpy 1.2.1 Education http://packages.opensuse-community.org/packageinfo.jsp?checksum=a82655e40db6afe3525a497fd016a164f24a4488&distro=openSUSE_111 openSUSE python-numpy-devel 1.2.1 Education http://packages.opensuse-community.org/packageinfo.jsp?checksum=0bd33a0c8c89e1ce52f4e4a1a2070a1e4ba7f631&distro=openSUSE_111 openSUSE python-numpy-debuginfo 1.2.1 Education 
http://packages.opensuse-community.org/packageinfo.jsp?checksum=d079da08ab504d73bb69ca5a22095af546db00fe&distro=openSUSE_111 openSUSE python-numpy-debugsource 1.2.1 Education http://packages.opensuse-community.org/packageinfo.jsp?checksum=ba9fd6e9352859a79ce51cc4b09119950a767aee&distro=openSUSE_111 openSUSE python-numpy 1.2.1 anubisg1/op http://packages.opensuse-community.org/packageinfo.jsp?checksum=10d1efb12656b48e82065ea6228c3095e87f612c&distro=openSUSE_111 openSUSE python-numpy-debuginfo 1.2.1 anubisg1/op http://packages.opensuse-community.org/packageinfo.jsp?checksum=6439aa8a322bf6a28c80120fa4f653de33a012a0&distro=openSUSE_111 openSUSE python-numpy-debugsource 1.2.1 anubisg1/op http://packages.opensuse-community.org/packageinfo.jsp?checksum=ebc7ecee464db5487d28414597989e02124ef6f3&distro=openSUSE_111 openSUSE python-numpy 1.2.1 Education http://packages.opensuse-community.org/packageinfo.jsp?checksum=f47f40d1377ef66f2fe8a6bf6b9551235508ea5a&distro=openSUSE_111 openSUSE python-numpy-debuginfo 1.2.1 Education http://packages.opensuse-community.org/packageinfo.jsp?checksum=b027d798f58c06e0814f65484562e3db3632108b&distro=openSUSE_111 openSUSE python-numpy-debugsource 1.2.1 Education http://packages.opensuse-community.org/packageinfo.jsp?checksum=d8c4d6aa5bcdf9aa3d946d87249c503a4219638a&distro=openSUSE_111 openSUSE python-numpy 1.2.1 thomas-schr http://packages.opensuse-community.org/packageinfo.jsp?checksum=412bc59b5f64b2a7e17c47f493f3764789944311&distro=openSUSE_111 openSUSE python-numpy-devel 1.2.1 thomas-schr http://packages.opensuse-community.org/packageinfo.jsp?checksum=22b703695a8d00f2f4cea0981a02011ea89d7744&distro=openSUSE_111 openSUSE python-numpy 1.2.0 cyberorg http://packages.opensuse-community.org/packageinfo.jsp?checksum=569e248766d4671c7cb7e5ea204040b52449d317&distro=openSUSE_111 openSUSE python-numpy-devel 1.2.0 cyberorg http://packages.opensuse-community.org/packageinfo.jsp?checksum=bc8e52b109de2be297e75562590e46e286fb8402&distro=openSUSE_111 openSUSE python-numpy-devel 1.2.1 science http://packages.opensuse-community.org/packageinfo.jsp?checksum=5d70fb2f3881a4e0621b9748d9b9e44a1c430d30&distro=openSUSE_111 openSUSE python-numpy 1.2.1 science http://packages.opensuse-community.org/packageinfo.jsp?checksum=84e24a6f7ff3228ac8a2e4a411548a15b38c6a1d&distro=openSUSE_111 Source Mage numpy 1.2.1 test Source Mage numpy 1.2.1 stable Fedora numpy 1.2.0-1 2.2M 30-10-2008 OpenBSD py-numpy 1.0.4p0 http://www.openbsd.org/4.4_packages/i386/py-numpy-1.0.4p0.tgz-long.html Which reminds me that I still haven't packaged the new numpy for Debian (any help is appreciated). Ondrej From pav at iki.fi Sat Jan 24 16:05:19 2009 From: pav at iki.fi (Pauli Virtanen) Date: Sat, 24 Jan 2009 21:05:19 +0000 (UTC) Subject: [Numpy-discussion] numpy and the ACML References: <710F2847B0018641891D9A216027636029C3DF@ex3.envision.co.il> <4242995C-7284-4654-B858-3D6C36460D71@math.toronto.edu> Message-ID: Sat, 24 Jan 2009 15:26:17 -0500, Gideon Simpson wrote: > Nadav- > > That doesn't quite seem to work for me. I added: > > [blas_opt] > library_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/lib > include_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/include > libraries = acml > > [lapack_opt] > library_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/lib > include_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/include > libraries = acml > > to my site.cfg with no luck. Somewhere else, people indicated that the > ACML lacked a CBLAS which was necessary to make this work. 
Yep, IIRC you needed also CBLAS. (I think I got Numpy & Scipy linked against ACML at some point, but it's been a while and I've forgotten details...) There's a CBLAS here: http://www.netlib.org/blas/blast-forum/cblas.tgz So, I think you need to compile it & link it with ACML, and add it in site.cfg with ACML libs. -- Pauli Virtanen From jdgleeson at mac.com Sat Jan 24 16:38:29 2009 From: jdgleeson at mac.com (John Gleeson) Date: Sat, 24 Jan 2009 14:38:29 -0700 Subject: [Numpy-discussion] Having trouble installing Numpy on OS X In-Reply-To: <4BC33BAA-17F6-4BA4-9452-43FDC39DAED9@gmail.com> References: <4BC33BAA-17F6-4BA4-9452-43FDC39DAED9@gmail.com> Message-ID: On 2009-01-24, at 11:34 AM, Nat Wilson wrote: > Would anyone be willing to help me interpret an error while trying to > build and install Numpy? I've searched around, and haven't seen this > elsewhere. > > I've been running into this wall for about half the day now. I've > tried reinstalling Python, using numpy 1.2.0 and 1.2.1. > > I have Python 2.6.1, running on OS X 10.4.11, with a G4 PPC processor. > Have you tried setting the environment variable MACOSX_DEPLOYMENT_TARGET=10.4 before building? From njwilson23 at gmail.com Sat Jan 24 16:58:53 2009 From: njwilson23 at gmail.com (Nat Wilson) Date: Sat, 24 Jan 2009 16:58:53 -0500 Subject: [Numpy-discussion] Having trouble installing Numpy on OS X In-Reply-To: References: <4BC33BAA-17F6-4BA4-9452-43FDC39DAED9@gmail.com> Message-ID: <6B994850-E077-49BA-BFEE-6F95A2A4C731@gmail.com> I had not, but that doesn't seem to make any difference. By the way, I'm using gfortran 4.2.3 as I do this. Nat On Jan 24, 2009, at 4:38 PM, John Gleeson wrote: > > On 2009-01-24, at 11:34 AM, Nat Wilson wrote: > >> Would anyone be willing to help me interpret an error while trying to >> build and install Numpy? I've searched around, and haven't seen this >> elsewhere. >> >> I've been running into this wall for about half the day now. I've >> tried reinstalling Python, using numpy 1.2.0 and 1.2.1. >> >> I have Python 2.6.1, running on OS X 10.4.11, with a G4 PPC >> processor. >> > > Have you tried setting the environment variable > > MACOSX_DEPLOYMENT_TARGET=10.4 > > before building? > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From jswhit at fastmail.fm Sat Jan 24 17:24:34 2009 From: jswhit at fastmail.fm (Jeff Whitaker) Date: Sat, 24 Jan 2009 15:24:34 -0700 Subject: [Numpy-discussion] Having trouble installing Numpy on OS X In-Reply-To: <4BC33BAA-17F6-4BA4-9452-43FDC39DAED9@gmail.com> References: <4BC33BAA-17F6-4BA4-9452-43FDC39DAED9@gmail.com> Message-ID: <497B9522.4040005@fastmail.fm> Nat Wilson wrote: > Would anyone be willing to help me interpret an error while trying to > build and install Numpy? I've searched around, and haven't seen this > elsewhere. > > I've been running into this wall for about half the day now. I've > tried reinstalling Python, using numpy 1.2.0 and 1.2.1. > > I have Python 2.6.1, running on OS X 10.4.11, with a G4 PPC processor. > > Here's the print out: > > Ganymede:~/Desktop/numpy-1.2.1 username$ python setup.py build > Running from numpy source directory. 
> Traceback (most recent call last): > File "setup.py", line 96, in > setup_package() > File "setup.py", line 68, in setup_package > from numpy.distutils.core import setup > File "/Users/username/Desktop/numpy/numpy/distutils/__init__.py", > line 6, in > File "/Users/username/Desktop/numpy/numpy/distutils/ccompiler.py", > line 11, in > File "/Users/username/Desktop/numpy/numpy/distutils/log.py", line > 7, in > File "/Users/username/Desktop/numpy/numpy/distutils/misc_util.py", > line 8, in > File "/Library/Frameworks/Python.framework/Versions/2.6/lib/ > python2.6/tempfile.py", line 34, in > from random import Random as _Random > File "/Library/Frameworks/Python.framework/Versions/2.6/lib/ > python2.6/random.py", line 871, in > _inst = Random() > File "/Library/Frameworks/Python.framework/Versions/2.6/lib/ > python2.6/random.py", line 96, in __init__ > self.seed(x) > File "/Library/Frameworks/Python.framework/Versions/2.6/lib/ > python2.6/random.py", line 115, in seed > super(Random, self).seed(a) > SystemError: error return without exception set > > Any ideas? > I've had numpy/scipy installed in the past, but recently had to wipe > everything and start from scratch. Everything should be pretty clean > right now. Am I missing something obvious? > > Thanks, > NJW > > numpy 1.2 doesn't work with python 2.6. You'll either need to revert to python 2.5 or get the latest svn numpy (which still may have some python 2.6 glitches). -Jeff From jswhit at fastmail.fm Sat Jan 24 17:25:24 2009 From: jswhit at fastmail.fm (Jeff Whitaker) Date: Sat, 24 Jan 2009 15:25:24 -0700 Subject: [Numpy-discussion] Having trouble installing Numpy on OS X In-Reply-To: <4BC33BAA-17F6-4BA4-9452-43FDC39DAED9@gmail.com> References: <4BC33BAA-17F6-4BA4-9452-43FDC39DAED9@gmail.com> Message-ID: <497B9554.2080003@fastmail.fm> Nat Wilson wrote: > Would anyone be willing to help me interpret an error while trying to > build and install Numpy? I've searched around, and haven't seen this > elsewhere. > > I've been running into this wall for about half the day now. I've > tried reinstalling Python, using numpy 1.2.0 and 1.2.1. > > I have Python 2.6.1, running on OS X 10.4.11, with a G4 PPC processor. > > Here's the print out: > > Ganymede:~/Desktop/numpy-1.2.1 username$ python setup.py build > Running from numpy source directory. > Traceback (most recent call last): > File "setup.py", line 96, in > setup_package() > File "setup.py", line 68, in setup_package > from numpy.distutils.core import setup > File "/Users/username/Desktop/numpy/numpy/distutils/__init__.py", > line 6, in > File "/Users/username/Desktop/numpy/numpy/distutils/ccompiler.py", > line 11, in > File "/Users/username/Desktop/numpy/numpy/distutils/log.py", line > 7, in > File "/Users/username/Desktop/numpy/numpy/distutils/misc_util.py", > line 8, in > File "/Library/Frameworks/Python.framework/Versions/2.6/lib/ > python2.6/tempfile.py", line 34, in > from random import Random as _Random > File "/Library/Frameworks/Python.framework/Versions/2.6/lib/ > python2.6/random.py", line 871, in > _inst = Random() > File "/Library/Frameworks/Python.framework/Versions/2.6/lib/ > python2.6/random.py", line 96, in __init__ > self.seed(x) > File "/Library/Frameworks/Python.framework/Versions/2.6/lib/ > python2.6/random.py", line 115, in seed > super(Random, self).seed(a) > SystemError: error return without exception set > > Any ideas? > I've had numpy/scipy installed in the past, but recently had to wipe > everything and start from scratch. 
Everything should be pretty clean > right now. Am I missing something obvious? > > Thanks, > NJW > Nat: numpy 1.2.x doesn't work with python 2.6. You'll either need to revert to python 2.5 or get the latest svn numpy (which still may have some python 2.6 glitches). -Jeff From simpson at math.toronto.edu Sat Jan 24 17:43:15 2009 From: simpson at math.toronto.edu (Gideon Simpson) Date: Sat, 24 Jan 2009 17:43:15 -0500 Subject: [Numpy-discussion] numpy and the ACML In-Reply-To: References: <710F2847B0018641891D9A216027636029C3DF@ex3.envision.co.il> <4242995C-7284-4654-B858-3D6C36460D71@math.toronto.edu> Message-ID: <478644F8-4530-4ECE-BCB3-1900BE29FAE8@math.toronto.edu> I've tried building CBLAS, which seems to run properly by itself, but numpy is still having difficulty. I've got the following in my site.cfg: [blas_opt] library_dirs = /usr/local/nonsystem/simpson/CBLAS/lib/LINUX:/usr/local/ nonsystem/simpson/acml4.2.0/gfortran64/lib include_dirs = /usr/local/nonsystem/simpson/CBLAS/include:/usr/local/ nonsystem/simpson/acml4.2.0/gfortran64/include libraries = cblas, acml [lapack_opt] library_dirs = /usr/local/nonsystem/simpson/CBLAS/lib/LINUX:/usr/local/ nonsystem/simpson/acml4.2.0/gfortran64/lib include_dirs = /usr/local/nonsystem/simpson/CBLAS/include:/usr/local/ nonsystem/simpson/acml4.2.0/gfortran64/include libraries = cblas, acml I also created a symbolic link in /usr/local/nonsystem/simpson/CBLAS/ lib/LINUX, from cblas_LINUX.a to libcblas.a. Is there an easier way to check if numpy is locating the libs other than doing python setup.py build, and looking at the output? -gideon On Jan 24, 2009, at 4:05 PM, Pauli Virtanen wrote: > Sat, 24 Jan 2009 15:26:17 -0500, Gideon Simpson wrote: > >> Nadav- >> >> That doesn't quite seem to work for me. I added: >> >> [blas_opt] >> library_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/lib >> include_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/ >> include >> libraries = acml >> >> [lapack_opt] >> library_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/lib >> include_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/ >> include >> libraries = acml >> >> to my site.cfg with no luck. Somewhere else, people indicated that >> the >> ACML lacked a CBLAS which was necessary to make this work. > > Yep, IIRC you needed also CBLAS. (I think I got Numpy & Scipy linked > against ACML at some point, but it's been a while and I've forgotten > details...) There's a CBLAS here: > > http://www.netlib.org/blas/blast-forum/cblas.tgz > > So, I think you need to compile it & link it with ACML, and add it in > site.cfg with ACML libs. > > -- > Pauli Virtanen > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From rmay31 at gmail.com Sat Jan 24 17:58:12 2009 From: rmay31 at gmail.com (Ryan May) Date: Sat, 24 Jan 2009 16:58:12 -0600 Subject: [Numpy-discussion] Bug with mafromtxt Message-ID: <497B9D04.1050706@gmail.com> Pierre, I've found what I consider to be a bug in the new mafromtxt (though apparently it existed in earlier versions as well). If you have an entire column of data in a file that contains only masked data, and try to get mafromtxt to automatically choose the dtype, the dtype gets selected to be object type. In this case, I'd think the better behavior would be float, but I'm not sure how hard it would be to make this the case. 
Here's a test case: import numpy as np from StringIO import StringIO s = StringIO('1 2 3\n4 5 6\n') a = np.mafromtxt(s, missing='2,5', dtype=None) print a.dtype Ryan -- Ryan May Graduate Research Assistant School of Meteorology University of Oklahoma From robert.kern at gmail.com Sat Jan 24 18:09:41 2009 From: robert.kern at gmail.com (Robert Kern) Date: Sat, 24 Jan 2009 17:09:41 -0600 Subject: [Numpy-discussion] Having trouble installing Numpy on OS X In-Reply-To: <4BC33BAA-17F6-4BA4-9452-43FDC39DAED9@gmail.com> References: <4BC33BAA-17F6-4BA4-9452-43FDC39DAED9@gmail.com> Message-ID: <3d375d730901241509i69f0b8f2w802aae635a7446f1@mail.gmail.com> On Sat, Jan 24, 2009 at 12:34, Nat Wilson wrote: > Would anyone be willing to help me interpret an error while trying to > build and install Numpy? I've searched around, and haven't seen this > elsewhere. > File "/Library/Frameworks/Python.framework/Versions/2.6/lib/ > python2.6/tempfile.py", line 34, in > from random import Random as _Random > File "/Library/Frameworks/Python.framework/Versions/2.6/lib/ > python2.6/random.py", line 871, in > _inst = Random() > File "/Library/Frameworks/Python.framework/Versions/2.6/lib/ > python2.6/random.py", line 96, in __init__ > self.seed(x) > File "/Library/Frameworks/Python.framework/Versions/2.6/lib/ > python2.6/random.py", line 115, in seed > super(Random, self).seed(a) > SystemError: error return without exception set > > Any ideas? It looks unrelated to numpy. Can you import the random module in other situations? -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From pgmdevlist at gmail.com Sat Jan 24 18:10:22 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Sat, 24 Jan 2009 18:10:22 -0500 Subject: [Numpy-discussion] Bug with mafromtxt In-Reply-To: <497B9D04.1050706@gmail.com> References: <497B9D04.1050706@gmail.com> Message-ID: <1D1E7999-8D16-46C8-8A98-96B55FF8E5D5@gmail.com> Ryan, Thanks for reporting. An idea would be to force the dtype of the masked column to the largest dtype of the other columns (in your example, that would be int). I'll try to see how easily it can be done early next week. Meanwhile, you can always give an explicit dtype at creation. On Jan 24, 2009, at 5:58 PM, Ryan May wrote: > Pierre, > > I've found what I consider to be a bug in the new mafromtxt (though > apparently it > existed in earlier versions as well). If you have an entire column > of data in a > file that contains only masked data, and try to get mafromtxt to > automatically > choose the dtype, the dtype gets selected to be object type. In > this case, I'd > think the better behavior would be float, but I'm not sure how hard > it would be > to make this the case. 
Here's a test case: > > import numpy as np > from StringIO import StringIO > s = StringIO('1 2 3\n4 5 6\n') > a = np.mafromtxt(s, missing='2,5', dtype=None) > print a.dtype > > Ryan > > -- > Ryan May > Graduate Research Assistant > School of Meteorology > University of Oklahoma > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From gnurser at googlemail.com Sat Jan 24 18:18:31 2009 From: gnurser at googlemail.com (George Nurser) Date: Sat, 24 Jan 2009 23:18:31 +0000 Subject: [Numpy-discussion] numpy and the ACML In-Reply-To: <478644F8-4530-4ECE-BCB3-1900BE29FAE8@math.toronto.edu> References: <710F2847B0018641891D9A216027636029C3DF@ex3.envision.co.il> <4242995C-7284-4654-B858-3D6C36460D71@math.toronto.edu> <478644F8-4530-4ECE-BCB3-1900BE29FAE8@math.toronto.edu> Message-ID: <1d1e6ea70901241518p19abffcdt230cd24d58f74422@mail.gmail.com> I did manage to get it working. I remember that both libcblas.a (or a link to it) and libacml.so had to be in the same directory. Also I had to comment out lines 399-400 of setup.py: # if ('NO_ATLAS_INFO',1) in blas_info.get('define_macros',[]): # return None # dotblas needs ATLAS, Fortran compiled blas will not be sufficient In my site.cfg I have [blas] blas_libs = cblas, acml library_dirs = /noc/users/agn/ext/AMD64/acml/ifort64/lib include_dirs = /noc/users/agn/ext/AMD64/acml/ifort64/include [lapack] language = f77 lapack_libs = acml library_dirs = /noc/users/agn/ext/AMD64/acml/ifort64/lib include_dirs = /noc/users/agn/ext/AMD64/acml/ifort64/include Both libcblas.a (or a link to it) and libacml.so are in /noc/users/agn/ext/AMD64/acml/ifort64/lib HTH. George. 2009/1/24 Gideon Simpson : > I've tried building CBLAS, which seems to run properly by itself, but > numpy is still having difficulty. I've got the following in my > site.cfg: > > [blas_opt] > library_dirs = /usr/local/nonsystem/simpson/CBLAS/lib/LINUX:/usr/local/ > nonsystem/simpson/acml4.2.0/gfortran64/lib > include_dirs = /usr/local/nonsystem/simpson/CBLAS/include:/usr/local/ > nonsystem/simpson/acml4.2.0/gfortran64/include > libraries = cblas, acml > > [lapack_opt] > library_dirs = /usr/local/nonsystem/simpson/CBLAS/lib/LINUX:/usr/local/ > nonsystem/simpson/acml4.2.0/gfortran64/lib > include_dirs = /usr/local/nonsystem/simpson/CBLAS/include:/usr/local/ > nonsystem/simpson/acml4.2.0/gfortran64/include > libraries = cblas, acml > > I also created a symbolic link in /usr/local/nonsystem/simpson/CBLAS/ > lib/LINUX, from cblas_LINUX.a to libcblas.a. > > > Is there an easier way to check if numpy is locating the libs other > than doing python setup.py build, and looking at the output? > -gideon > > On Jan 24, 2009, at 4:05 PM, Pauli Virtanen wrote: > >> Sat, 24 Jan 2009 15:26:17 -0500, Gideon Simpson wrote: >> >>> Nadav- >>> >>> That doesn't quite seem to work for me. I added: >>> >>> [blas_opt] >>> library_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/lib >>> include_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/ >>> include >>> libraries = acml >>> >>> [lapack_opt] >>> library_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/lib >>> include_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/ >>> include >>> libraries = acml >>> >>> to my site.cfg with no luck. Somewhere else, people indicated that >>> the >>> ACML lacked a CBLAS which was necessary to make this work. >> >> Yep, IIRC you needed also CBLAS. 
(I think I got Numpy & Scipy linked >> against ACML at some point, but it's been a while and I've forgotten >> details...) There's a CBLAS here: >> >> http://www.netlib.org/blas/blast-forum/cblas.tgz >> >> So, I think you need to compile it & link it with ACML, and add it in >> site.cfg with ACML libs. >> >> -- >> Pauli Virtanen >> >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From rmay31 at gmail.com Sat Jan 24 18:23:02 2009 From: rmay31 at gmail.com (Ryan May) Date: Sat, 24 Jan 2009 17:23:02 -0600 Subject: [Numpy-discussion] Bug with mafromtxt In-Reply-To: <1D1E7999-8D16-46C8-8A98-96B55FF8E5D5@gmail.com> References: <497B9D04.1050706@gmail.com> <1D1E7999-8D16-46C8-8A98-96B55FF8E5D5@gmail.com> Message-ID: <497BA2D6.1040508@gmail.com> Pierre GM wrote: > Ryan, > Thanks for reporting. An idea would be to force the dtype of the > masked column to the largest dtype of the other columns (in your > example, that would be int). I'll try to see how easily it can be done > early next week. Meanwhile, you can always give an explicit dtype at > creation. Ok, thanks. I've dug a little further, and it seems like the problem is that a column of all missing values ends up as a column of all None's. When you create a (masked) array from a list of None's, you end up with an object array. On one hand I'd love for things to behave differently in this case, but on the other I understand why things work this way. Ryan > > On Jan 24, 2009, at 5:58 PM, Ryan May wrote: > >> Pierre, >> >> I've found what I consider to be a bug in the new mafromtxt (though >> apparently it >> existed in earlier versions as well). If you have an entire column >> of data in a >> file that contains only masked data, and try to get mafromtxt to >> automatically >> choose the dtype, the dtype gets selected to be object type. In >> this case, I'd >> think the better behavior would be float, but I'm not sure how hard >> it would be >> to make this the case. 
Here's a test case: >> >> import numpy as np >> from StringIO import StringIO >> s = StringIO('1 2 3\n4 5 6\n') >> a = np.mafromtxt(s, missing='2,5', dtype=None) >> print a.dtype >> >> Ryan >> >> -- >> Ryan May >> Graduate Research Assistant >> School of Meteorology >> University of Oklahoma >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion -- Ryan May Graduate Research Assistant School of Meteorology University of Oklahoma From simpson at math.toronto.edu Sat Jan 24 18:43:50 2009 From: simpson at math.toronto.edu (Gideon Simpson) Date: Sat, 24 Jan 2009 18:43:50 -0500 Subject: [Numpy-discussion] numpy and the ACML In-Reply-To: <1d1e6ea70901241518p19abffcdt230cd24d58f74422@mail.gmail.com> References: <710F2847B0018641891D9A216027636029C3DF@ex3.envision.co.il> <4242995C-7284-4654-B858-3D6C36460D71@math.toronto.edu> <478644F8-4530-4ECE-BCB3-1900BE29FAE8@math.toronto.edu> <1d1e6ea70901241518p19abffcdt230cd24d58f74422@mail.gmail.com> Message-ID: <659C9323-3FF1-49C5-B5DF-583084EB429D@math.toronto.edu> That's not working for me. Any thoughts on how to troubleshoot it? -gideon On Jan 24, 2009, at 6:18 PM, George Nurser wrote: > I did manage to get it working. > I remember that both libcblas.a (or a link to it) and libacml.so had > to be in the same directory. > > Also I had to comment out lines 399-400 of setup.py: > # if ('NO_ATLAS_INFO',1) in > blas_info.get('define_macros',[]): > # return None # dotblas needs ATLAS, Fortran compiled > blas will not be sufficient > > In my site.cfg I have > [blas] > blas_libs = cblas, acml > library_dirs = /noc/users/agn/ext/AMD64/acml/ifort64/lib > include_dirs = /noc/users/agn/ext/AMD64/acml/ifort64/include > > [lapack] > language = f77 > lapack_libs = acml > library_dirs = /noc/users/agn/ext/AMD64/acml/ifort64/lib > include_dirs = /noc/users/agn/ext/AMD64/acml/ifort64/include > > Both libcblas.a (or a link to it) and libacml.so are in > /noc/users/agn/ext/AMD64/acml/ifort64/lib > > HTH. George. > > 2009/1/24 Gideon Simpson : >> I've tried building CBLAS, which seems to run properly by itself, but >> numpy is still having difficulty. I've got the following in my >> site.cfg: >> >> [blas_opt] >> library_dirs = /usr/local/nonsystem/simpson/CBLAS/lib/LINUX:/usr/ >> local/ >> nonsystem/simpson/acml4.2.0/gfortran64/lib >> include_dirs = /usr/local/nonsystem/simpson/CBLAS/include:/usr/local/ >> nonsystem/simpson/acml4.2.0/gfortran64/include >> libraries = cblas, acml >> >> [lapack_opt] >> library_dirs = /usr/local/nonsystem/simpson/CBLAS/lib/LINUX:/usr/ >> local/ >> nonsystem/simpson/acml4.2.0/gfortran64/lib >> include_dirs = /usr/local/nonsystem/simpson/CBLAS/include:/usr/local/ >> nonsystem/simpson/acml4.2.0/gfortran64/include >> libraries = cblas, acml >> >> I also created a symbolic link in /usr/local/nonsystem/simpson/CBLAS/ >> lib/LINUX, from cblas_LINUX.a to libcblas.a. >> >> >> Is there an easier way to check if numpy is locating the libs other >> than doing python setup.py build, and looking at the output? >> -gideon >> >> On Jan 24, 2009, at 4:05 PM, Pauli Virtanen wrote: >> >>> Sat, 24 Jan 2009 15:26:17 -0500, Gideon Simpson wrote: >>> >>>> Nadav- >>>> >>>> That doesn't quite seem to work for me. 
I added: >>>> >>>> [blas_opt] >>>> library_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/ >>>> lib >>>> include_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/ >>>> include >>>> libraries = acml >>>> >>>> [lapack_opt] >>>> library_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/ >>>> lib >>>> include_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/ >>>> include >>>> libraries = acml >>>> >>>> to my site.cfg with no luck. Somewhere else, people indicated that >>>> the >>>> ACML lacked a CBLAS which was necessary to make this work. >>> >>> Yep, IIRC you needed also CBLAS. (I think I got Numpy & Scipy linked >>> against ACML at some point, but it's been a while and I've forgotten >>> details...) There's a CBLAS here: >>> >>> http://www.netlib.org/blas/blast-forum/cblas.tgz >>> >>> So, I think you need to compile it & link it with ACML, and add it >>> in >>> site.cfg with ACML libs. >>> >>> -- >>> Pauli Virtanen >>> >>> _______________________________________________ >>> Numpy-discussion mailing list >>> Numpy-discussion at scipy.org >>> http://projects.scipy.org/mailman/listinfo/numpy-discussion >> >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion >> > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From njwilson23 at gmail.com Sat Jan 24 18:58:41 2009 From: njwilson23 at gmail.com (Nat Wilson) Date: Sat, 24 Jan 2009 18:58:41 -0500 Subject: [Numpy-discussion] Having trouble installing Numpy on OS X In-Reply-To: <3d375d730901241509i69f0b8f2w802aae635a7446f1@mail.gmail.com> References: <4BC33BAA-17F6-4BA4-9452-43FDC39DAED9@gmail.com> <3d375d730901241509i69f0b8f2w802aae635a7446f1@mail.gmail.com> Message-ID: Ah, no, I can't import it. What does this mean? As far as I know, I built Python as instructed. Python 2.6.1 (r261:67515, Jan 24 2009, 16:08:37) [GCC 4.0.0 20041026 (Apple Computer, Inc. build 4061)] on darwin Type "help", "copyright", "credits" or "license" for more information. >>> from random import Random as _Random Traceback (most recent call last): File "", line 1, in File "/Library/Frameworks/Python.framework/Versions/2.6/lib/ python2.6/random.py", line 871, in _inst = Random() File "/Library/Frameworks/Python.framework/Versions/2.6/lib/ python2.6/random.py", line 96, in __init__ self.seed(x) File "/Library/Frameworks/Python.framework/Versions/2.6/lib/ python2.6/random.py", line 115, in seed super(Random, self).seed(a) SystemError: error return without exception set Nat On Jan 24, 2009, at 6:09 PM, Robert Kern wrote: > On Sat, Jan 24, 2009 at 12:34, Nat Wilson > wrote: >> Would anyone be willing to help me interpret an error while trying to >> build and install Numpy? I've searched around, and haven't seen this >> elsewhere. 
> >> File "/Library/Frameworks/Python.framework/Versions/2.6/lib/ >> python2.6/tempfile.py", line 34, in >> from random import Random as _Random >> File "/Library/Frameworks/Python.framework/Versions/2.6/lib/ >> python2.6/random.py", line 871, in >> _inst = Random() >> File "/Library/Frameworks/Python.framework/Versions/2.6/lib/ >> python2.6/random.py", line 96, in __init__ >> self.seed(x) >> File "/Library/Frameworks/Python.framework/Versions/2.6/lib/ >> python2.6/random.py", line 115, in seed >> super(Random, self).seed(a) >> SystemError: error return without exception set >> >> Any ideas? > > It looks unrelated to numpy. Can you import the random module in other > situations? > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." > -- Umberto Eco > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From robert.kern at gmail.com Sat Jan 24 19:12:20 2009 From: robert.kern at gmail.com (Robert Kern) Date: Sat, 24 Jan 2009 18:12:20 -0600 Subject: [Numpy-discussion] Having trouble installing Numpy on OS X In-Reply-To: References: <4BC33BAA-17F6-4BA4-9452-43FDC39DAED9@gmail.com> <3d375d730901241509i69f0b8f2w802aae635a7446f1@mail.gmail.com> Message-ID: <3d375d730901241612o2c74b650hdb01535a86aa3269@mail.gmail.com> On Sat, Jan 24, 2009 at 17:58, Nat Wilson wrote: > Ah, no, I can't import it. > > What does this mean? As far as I know, I built Python as instructed. I'm not sure. Can you show me what os.urandom(16) gives you? -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From njwilson23 at gmail.com Sat Jan 24 19:31:36 2009 From: njwilson23 at gmail.com (Nat Wilson) Date: Sat, 24 Jan 2009 19:31:36 -0500 Subject: [Numpy-discussion] Having trouble installing Numpy on OS X In-Reply-To: <3d375d730901241612o2c74b650hdb01535a86aa3269@mail.gmail.com> References: <4BC33BAA-17F6-4BA4-9452-43FDC39DAED9@gmail.com> <3d375d730901241509i69f0b8f2w802aae635a7446f1@mail.gmail.com> <3d375d730901241612o2c74b650hdb01535a86aa3269@mail.gmail.com> Message-ID: <0D5AA8F9-AD2B-4681-A976-90F69733E149@gmail.com> It throws this out. Python 2.6.1 (r261:67515, Jan 24 2009, 16:08:37) [GCC 4.0.0 20041026 (Apple Computer, Inc. build 4061)] on darwin Type "help", "copyright", "credits" or "license" for more information. >>> import os >>> os.urandom(16) '\xe0;n\x8a*\xb4\x08N\x80<\xef\x9b*\x06\x1b\xc4' >>> Nat On Jan 24, 2009, at 7:12 PM, Robert Kern wrote: > On Sat, Jan 24, 2009 at 17:58, Nat Wilson > wrote: >> Ah, no, I can't import it. >> >> What does this mean? As far as I know, I built Python as instructed. > > I'm not sure. Can you show me what os.urandom(16) gives you? > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." 
> -- Umberto Eco > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From robert.kern at gmail.com Sat Jan 24 19:41:49 2009 From: robert.kern at gmail.com (Robert Kern) Date: Sat, 24 Jan 2009 18:41:49 -0600 Subject: [Numpy-discussion] Having trouble installing Numpy on OS X In-Reply-To: <0D5AA8F9-AD2B-4681-A976-90F69733E149@gmail.com> References: <4BC33BAA-17F6-4BA4-9452-43FDC39DAED9@gmail.com> <3d375d730901241509i69f0b8f2w802aae635a7446f1@mail.gmail.com> <3d375d730901241612o2c74b650hdb01535a86aa3269@mail.gmail.com> <0D5AA8F9-AD2B-4681-A976-90F69733E149@gmail.com> Message-ID: <3d375d730901241641j31ef965ai6fcd43cd5bd0cbad@mail.gmail.com> On Sat, Jan 24, 2009 at 18:31, Nat Wilson wrote: > It throws this out. > > Python 2.6.1 (r261:67515, Jan 24 2009, 16:08:37) > [GCC 4.0.0 20041026 (Apple Computer, Inc. build 4061)] on darwin > Type "help", "copyright", "credits" or "license" for more information. > >>> import os > >>> os.urandom(16) > '\xe0;n\x8a*\xb4\x08N\x80<\xef\x9b*\x06\x1b\xc4' > >>> Well, looking at the C code for random_seed(), I don't see a way for it to return NULL without having an exception set (assuming that the Python API calls aren't buggy). Except maybe the assert() call in there. When you built your Python, are you sure that -DNDEBUG was being used? -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From njwilson23 at gmail.com Sat Jan 24 20:13:25 2009 From: njwilson23 at gmail.com (Nat Wilson) Date: Sat, 24 Jan 2009 20:13:25 -0500 Subject: [Numpy-discussion] Having trouble installing Numpy on OS X In-Reply-To: <3d375d730901241641j31ef965ai6fcd43cd5bd0cbad@mail.gmail.com> References: <4BC33BAA-17F6-4BA4-9452-43FDC39DAED9@gmail.com> <3d375d730901241509i69f0b8f2w802aae635a7446f1@mail.gmail.com> <3d375d730901241612o2c74b650hdb01535a86aa3269@mail.gmail.com> <0D5AA8F9-AD2B-4681-A976-90F69733E149@gmail.com> <3d375d730901241641j31ef965ai6fcd43cd5bd0cbad@mail.gmail.com> Message-ID: <99AB1E04-D4A2-4CBE-A0AE-08D0FE3D1CE7@gmail.com> To be honest, I really don't know. I followed the directions in /Mac/ README for a framework install. How would I check this? Thanks a lot for sticking with me, Nat On Jan 24, 2009, at 7:41 PM, Robert Kern wrote: > On Sat, Jan 24, 2009 at 18:31, Nat Wilson > wrote: >> It throws this out. >> >> Python 2.6.1 (r261:67515, Jan 24 2009, 16:08:37) >> [GCC 4.0.0 20041026 (Apple Computer, Inc. build 4061)] on darwin >> Type "help", "copyright", "credits" or "license" for more >> information. >>>>> import os >>>>> os.urandom(16) >> '\xe0;n\x8a*\xb4\x08N\x80<\xef\x9b*\x06\x1b\xc4' >>>>> > > Well, looking at the C code for random_seed(), I don't see a way for > it to return NULL without having an exception set (assuming that the > Python API calls aren't buggy). Except maybe the assert() call in > there. When you built your Python, are you sure that -DNDEBUG was > being used? > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." 
> -- Umberto Eco > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From robert.kern at gmail.com Sat Jan 24 20:24:00 2009 From: robert.kern at gmail.com (Robert Kern) Date: Sat, 24 Jan 2009 19:24:00 -0600 Subject: [Numpy-discussion] Having trouble installing Numpy on OS X In-Reply-To: <99AB1E04-D4A2-4CBE-A0AE-08D0FE3D1CE7@gmail.com> References: <4BC33BAA-17F6-4BA4-9452-43FDC39DAED9@gmail.com> <3d375d730901241509i69f0b8f2w802aae635a7446f1@mail.gmail.com> <3d375d730901241612o2c74b650hdb01535a86aa3269@mail.gmail.com> <0D5AA8F9-AD2B-4681-A976-90F69733E149@gmail.com> <3d375d730901241641j31ef965ai6fcd43cd5bd0cbad@mail.gmail.com> <99AB1E04-D4A2-4CBE-A0AE-08D0FE3D1CE7@gmail.com> Message-ID: <3d375d730901241724t5bcc0895s852fadc572da02d5@mail.gmail.com> On Sat, Jan 24, 2009 at 19:13, Nat Wilson wrote: > To be honest, I really don't know. I followed the directions in /Mac/ > README for a framework install. > > How would I check this? Rebuild and look at the compiler command lines as they go by. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From robert.kern at gmail.com Sat Jan 24 20:26:51 2009 From: robert.kern at gmail.com (Robert Kern) Date: Sat, 24 Jan 2009 19:26:51 -0600 Subject: [Numpy-discussion] Having trouble installing Numpy on OS X In-Reply-To: <99AB1E04-D4A2-4CBE-A0AE-08D0FE3D1CE7@gmail.com> References: <4BC33BAA-17F6-4BA4-9452-43FDC39DAED9@gmail.com> <3d375d730901241509i69f0b8f2w802aae635a7446f1@mail.gmail.com> <3d375d730901241612o2c74b650hdb01535a86aa3269@mail.gmail.com> <0D5AA8F9-AD2B-4681-A976-90F69733E149@gmail.com> <3d375d730901241641j31ef965ai6fcd43cd5bd0cbad@mail.gmail.com> <99AB1E04-D4A2-4CBE-A0AE-08D0FE3D1CE7@gmail.com> Message-ID: <3d375d730901241726o2e31d450h28b8728096bef18a@mail.gmail.com> On Sat, Jan 24, 2009 at 19:13, Nat Wilson wrote: > To be honest, I really don't know. I followed the directions in /Mac/ > README for a framework install. > > How would I check this? Another possibility for tracking down the problem would be to start up Python under gdb, add a breakpoint for the function random_seed() then step through. That would help you find out the code path that makes it return NULL without setting the exception. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From michael.abshoff at googlemail.com Sat Jan 24 21:04:15 2009 From: michael.abshoff at googlemail.com (Michael Abshoff) Date: Sat, 24 Jan 2009 18:04:15 -0800 Subject: [Numpy-discussion] Having trouble installing Numpy on OS X In-Reply-To: <3d375d730901241641j31ef965ai6fcd43cd5bd0cbad@mail.gmail.com> References: <4BC33BAA-17F6-4BA4-9452-43FDC39DAED9@gmail.com> <3d375d730901241509i69f0b8f2w802aae635a7446f1@mail.gmail.com> <3d375d730901241612o2c74b650hdb01535a86aa3269@mail.gmail.com> <0D5AA8F9-AD2B-4681-A976-90F69733E149@gmail.com> <3d375d730901241641j31ef965ai6fcd43cd5bd0cbad@mail.gmail.com> Message-ID: <497BC89F.4090001@gmail.com> Robert Kern wrote: > On Sat, Jan 24, 2009 at 18:31, Nat Wilson wrote: >> It throws this out. >> >> Python 2.6.1 (r261:67515, Jan 24 2009, 16:08:37) >> [GCC 4.0.0 20041026 (Apple Computer, Inc. 
build 4061)] on darwin >> Type "help", "copyright", "credits" or "license" for more information. >> >>> import os >> >>> os.urandom(16) >> '\xe0;n\x8a*\xb4\x08N\x80<\xef\x9b*\x06\x1b\xc4' >> >>> > > Well, looking at the C code for random_seed(), I don't see a way for > it to return NULL without having an exception set (assuming that the > Python API calls aren't buggy). Except maybe the assert() call in > there. When you built your Python, are you sure that -DNDEBUG was > being used? > Well, the gcc used to compiler Python is rather ancient and that gcc release by Apple has the reputation to be "buggier than a Florida swamp in July" and at least for building Sage it is blacklisted. So I would suggest updating gcc by using some more recent XCode and trying again. Cheers, Michael From simpson at math.toronto.edu Sat Jan 24 23:47:45 2009 From: simpson at math.toronto.edu (Gideon Simpson) Date: Sat, 24 Jan 2009 23:47:45 -0500 Subject: [Numpy-discussion] glibc error Message-ID: <5A8A2038-9BAE-441B-9A5A-0760296E3F40@math.toronto.edu> Having built an up to date lapack and ATLAS against gcc 4.3.2, I tried installing numpy 1.2.1 on Python 2.5.4. When testing I get: Python 2.5.4 (r254:67916, Jan 24 2009, 00:27:20) [GCC 4.3.2] on linux2 Type "help", "copyright", "credits" or "license" for more information. >>> import numpy >>> numpy.test() Running unit tests for numpy NumPy version 1.2.1 NumPy is installed in /usr/local/nonsystem/simpson/lib/python2.5/site- packages/numpy Python version 2.5.4 (r254:67916, Jan 24 2009, 00:27:20) [GCC 4.3.2] nose version 0.10.4 ..........................................................................................................................................................................................................................................................................................................................................................................................................................................F ................K ........................................................................ ........................................................................ ................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................*** glibc detected *** python: free(): invalid next size (fast): 0x000000001196b550 *** I then have to kill python to get control again. 
-gideon From david at ar.media.kyoto-u.ac.jp Sat Jan 24 23:37:56 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Sun, 25 Jan 2009 13:37:56 +0900 Subject: [Numpy-discussion] glibc error In-Reply-To: <5A8A2038-9BAE-441B-9A5A-0760296E3F40@math.toronto.edu> References: <5A8A2038-9BAE-441B-9A5A-0760296E3F40@math.toronto.edu> Message-ID: <497BECA4.8080301@ar.media.kyoto-u.ac.jp> Gideon Simpson wrote: > Having built an up to date lapack and ATLAS against gcc 4.3.2, I tried > installing numpy 1.2.1 on Python 2.5.4. When testing I get: > > Python 2.5.4 (r254:67916, Jan 24 2009, 00:27:20) > [GCC 4.3.2] on linux2 > Type "help", "copyright", "credits" or "license" for more information. > >>> import numpy > >>> numpy.test() > Running unit tests for numpy > NumPy version 1.2.1 > NumPy is installed in /usr/local/nonsystem/simpson/lib/python2.5/site- > packages/numpy > Python version 2.5.4 (r254:67916, Jan 24 2009, 00:27:20) [GCC 4.3.2] > nose version 0.10.4 > ..........................................................................................................................................................................................................................................................................................................................................................................................................................................F > ................K > ........................................................................ > ........................................................................ > ................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................*** glibc > detected *** python: free(): invalid next size (fast): > 0x000000001196b550 *** > Can you rerun the test verbosely ? python -c "import numpy; numpy.test(verbose=10)" This glibc message is generally a symptom of serious memory corruption - which is why only the OS can stop it at that point. cheers, David From robert.kern at gmail.com Sat Jan 24 23:54:22 2009 From: robert.kern at gmail.com (Robert Kern) Date: Sat, 24 Jan 2009 22:54:22 -0600 Subject: [Numpy-discussion] glibc error In-Reply-To: <5A8A2038-9BAE-441B-9A5A-0760296E3F40@math.toronto.edu> References: <5A8A2038-9BAE-441B-9A5A-0760296E3F40@math.toronto.edu> Message-ID: <3d375d730901242054h43f5abdbj308c4a6cee204589@mail.gmail.com> On Sat, Jan 24, 2009 at 22:47, Gideon Simpson wrote: > Having built an up to date lapack and ATLAS against gcc 4.3.2, I tried > installing numpy 1.2.1 on Python 2.5.4. When testing I get: Run numpy.test(verbose=2) in order to have nose print out the name of the method it is running before actually running it. 
This will let us figure out which test is causing the failure. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From hoytak at cs.ubc.ca Sat Jan 24 23:58:15 2009 From: hoytak at cs.ubc.ca (Hoyt Koepke) Date: Sat, 24 Jan 2009 20:58:15 -0800 Subject: [Numpy-discussion] glibc error In-Reply-To: <5A8A2038-9BAE-441B-9A5A-0760296E3F40@math.toronto.edu> References: <5A8A2038-9BAE-441B-9A5A-0760296E3F40@math.toronto.edu> Message-ID: <4db580fd0901242058r6324d68btc917a27933ef5658@mail.gmail.com> Not sure what the problem is in your case, but I had the same error a while back. For some reason, ATLAS thought it should be compiled as 64 bit but numpy as 32 bit. . It went away after I passed the -b 32 flag to configure in ATLAS (I think). Thus that's one thing to check. Also, make sure you're using ATLAS 3.8.x, as I had problems with the 3.9 series (don't know if that would be an issue still). --Hoyt On Sat, Jan 24, 2009 at 8:47 PM, Gideon Simpson wrote: > Having built an up to date lapack and ATLAS against gcc 4.3.2, I tried > installing numpy 1.2.1 on Python 2.5.4. When testing I get: > > Python 2.5.4 (r254:67916, Jan 24 2009, 00:27:20) > [GCC 4.3.2] on linux2 > Type "help", "copyright", "credits" or "license" for more information. > >>> import numpy > >>> numpy.test() > Running unit tests for numpy > NumPy version 1.2.1 > NumPy is installed in /usr/local/nonsystem/simpson/lib/python2.5/site- > packages/numpy > Python version 2.5.4 (r254:67916, Jan 24 2009, 00:27:20) [GCC 4.3.2] > nose version 0.10.4 > ..........................................................................................................................................................................................................................................................................................................................................................................................................................................F > ................K > ........................................................................ > ........................................................................ > ................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................*** glibc > detected *** python: free(): invalid next size (fast): > 0x000000001196b550 *** > > > I then have to kill python to get control again. 
> -gideon > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -- ++++++++++++++++++++++++++++++++++++++++++++++++ + Hoyt Koepke + University of Washington Department of Statistics + http://www.stat.washington.edu/~hoytak/ + hoytak at gmail.com ++++++++++++++++++++++++++++++++++++++++++ From david at ar.media.kyoto-u.ac.jp Sat Jan 24 23:48:32 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Sun, 25 Jan 2009 13:48:32 +0900 Subject: [Numpy-discussion] glibc error In-Reply-To: <4db580fd0901242058r6324d68btc917a27933ef5658@mail.gmail.com> References: <5A8A2038-9BAE-441B-9A5A-0760296E3F40@math.toronto.edu> <4db580fd0901242058r6324d68btc917a27933ef5658@mail.gmail.com> Message-ID: <497BEF20.4000309@ar.media.kyoto-u.ac.jp> Hoyt Koepke wrote: > Not sure what the problem is in your case, but I had the same error a > while back. For some reason, ATLAS thought it should be compiled as > 64 bit but numpy as 32 bit. . It went away after I passed the -b 32 > flag to configure in ATLAS (I think). Thus that's one thing to check. > Also, make sure you're using ATLAS 3.8.x, as I had problems with the > 3.9 series (don't know if that would be an issue still). > Actually, I would advise using only 3.8.2. Previous versions had bugs for some core routines used by numpy (at least 3.8.0 did). I am a bit surprised that a 64 bits-built atlas would be runnable at all in a 32 bits binary - I would expect the link phase to fail if two different object formats are linked together. cheers, David From simpson at math.toronto.edu Sun Jan 25 00:05:34 2009 From: simpson at math.toronto.edu (Gideon Simpson) Date: Sun, 25 Jan 2009 00:05:34 -0500 Subject: [Numpy-discussion] glibc error In-Reply-To: <497BECA4.8080301@ar.media.kyoto-u.ac.jp> References: <5A8A2038-9BAE-441B-9A5A-0760296E3F40@math.toronto.edu> <497BECA4.8080301@ar.media.kyoto-u.ac.jp> Message-ID: <1BF61BD8-ED27-44A9-9A58-E7602869E964@math.toronto.edu> Rerunning the tests with verbosity, it dies at: test_single (test_linalg.TestSolve) ... ok Ticket #652 ... *** glibc detected *** python: free(): invalid next size (fast): 0x0000000001e284e0 *** I'm using ATLAS 3.8.2 and lapack 3.2. ATLAS and lapack were all built with the -m64 flag. -gideon On Jan 24, 2009, at 11:37 PM, David Cournapeau wrote: > Gideon Simpson wrote: >> Having built an up to date lapack and ATLAS against gcc 4.3.2, I >> tried >> installing numpy 1.2.1 on Python 2.5.4. When testing I get: >> >> Python 2.5.4 (r254:67916, Jan 24 2009, 00:27:20) >> [GCC 4.3.2] on linux2 >> Type "help", "copyright", "credits" or "license" for more >> information. >>>>> import numpy >>>>> numpy.test() >> Running unit tests for numpy >> NumPy version 1.2.1 >> NumPy is installed in /usr/local/nonsystem/simpson/lib/python2.5/ >> site- >> packages/numpy >> Python version 2.5.4 (r254:67916, Jan 24 2009, 00:27:20) [GCC 4.3.2] >> nose version 0.10.4 >> ..........................................................................................................................................................................................................................................................................................................................................................................................................................................F >> ................K >> ........................................................................ 
>> ........................................................................ >> ................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................*** glibc >> detected *** python: free(): invalid next size (fast): >> 0x000000001196b550 *** >> > > Can you rerun the test verbosely ? > > python -c "import numpy; numpy.test(verbose=10)" > > This glibc message is generally a symptom of serious memory > corruption - > which is why only the OS can stop it at that point. > > cheers, > > David > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From david at ar.media.kyoto-u.ac.jp Sat Jan 24 23:57:53 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Sun, 25 Jan 2009 13:57:53 +0900 Subject: [Numpy-discussion] glibc error In-Reply-To: <1BF61BD8-ED27-44A9-9A58-E7602869E964@math.toronto.edu> References: <5A8A2038-9BAE-441B-9A5A-0760296E3F40@math.toronto.edu> <497BECA4.8080301@ar.media.kyoto-u.ac.jp> <1BF61BD8-ED27-44A9-9A58-E7602869E964@math.toronto.edu> Message-ID: <497BF151.7020209@ar.media.kyoto-u.ac.jp> Gideon Simpson wrote: > Rerunning the tests with verbosity, it dies at: > > test_single (test_linalg.TestSolve) ... ok > Ticket #652 ... *** glibc detected *** python: free(): invalid next > size (fast): 0x0000000001e284e0 *** > > I'm using ATLAS 3.8.2 and lapack 3.2. > Lapack 3.2 may be the problem. There have been interface changes in some functions related to LU factorization in LAPACK 3.2 - I am not sure that those are the ones used by eig, but since many people have run the test suite with numpy 1.2.1 and atlas 3.8.2, I would look at that first. 
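If you want to confirm which BLAS/LAPACK a given numpy actually picked up before rebuilding, numpy records its build configuration -- a quick check using the standard numpy calls, nothing ATLAS-specific (the output depends on your install):

>>> import numpy
>>> numpy.show_config()
>>> from numpy.distutils.system_info import get_info
>>> get_info('lapack_opt')

The first call prints what was found at build time; the second shows what system_info resolves right now, which helps catch the case where the test run is picking up a different LAPACK than the one just built.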
cheers, David From matthew.brett at gmail.com Sun Jan 25 04:35:25 2009 From: matthew.brett at gmail.com (Matthew Brett) Date: Sun, 25 Jan 2009 01:35:25 -0800 Subject: [Numpy-discussion] Suggested change for NaN, Infs float->int conversion Message-ID: <1e2af89e0901250135h57aa07b6w52afde98a0060f71@mail.gmail.com> Hi, When converting arrays from float to ints, I notice that NaNs, Infs, and -Infs all get the minimum integer value: >>> flts = np.array([np.nan, np.inf, -np.inf]) >>> flts.astype(np.int16) array([-32768, -32768, -32768], dtype=int16) However, setting NaNs into integer arrays gives a value of 0 >>> ints = np.array([1]) >>> ints.dtype dtype('int32') >>> ints[0] = np.nan >>> ints array([0]) whereas Inf or -Inf raise an error (as Josef pointed out recently): >>> ints[0] = np.inf ------------------------------------------------------------ Traceback (most recent call last): File "<stdin>", line 1, in <module> OverflowError: cannot convert float infinity to long >>> ints[0] = -np.inf ------------------------------------------------------------ Traceback (most recent call last): File "<stdin>", line 1, in <module> OverflowError: cannot convert float infinity to long Matlab seems more consistent and sensible here: >> flts = [NaN Inf -Inf]; >> int32(flts) ans = 0 2147483647 -2147483648 >> ints = int32([1 1 1]); >> ints(:) = [NaN Inf -Inf] ints = 0 2147483647 -2147483648 Is there a route to change towards the Matlab behavior? Or at least make numpy behavior self-consistent? Best, Matthew From michael.abshoff at googlemail.com Sun Jan 25 05:46:10 2009 From: michael.abshoff at googlemail.com (Michael Abshoff) Date: Sun, 25 Jan 2009 02:46:10 -0800 Subject: [Numpy-discussion] glibc error In-Reply-To: <497BEF20.4000309@ar.media.kyoto-u.ac.jp> References: <5A8A2038-9BAE-441B-9A5A-0760296E3F40@math.toronto.edu> <4db580fd0901242058r6324d68btc917a27933ef5658@mail.gmail.com> <497BEF20.4000309@ar.media.kyoto-u.ac.jp> Message-ID: <497C42F2.60707@gmail.com> David Cournapeau wrote: > Hoyt Koepke wrote: > Actually, I would advise using only 3.8.2. Previous versions had bugs > for some core routines used by numpy (at least 3.8.0 did). I am a bit > surprised that a 64 bits-built atlas would be runnable at all in a 32 > bits binary - I would expect the link phase to fail if two different > object formats are linked together. Linking 32 and 64 bit ELF objects together in an extension will fail on any system but OSX, where the ld will happily link together anything. Since that linker also does missing symbol lookup at runtime you will see some surprising distutils bugs when you thought that the build went perfectly, e.g. scipy 0.6 would not use the fortran compiler I would tell it to use, but one extension would use gfortran instead of sage_fortran when it was available in $PATH. sage_fortran would just inject an "-m64" into the options and call gfortran. But with a few fortran objects being 32 bit some extensions in scipy would fail to import, and it took me quite a while to track this one down. I haven't had time to test 0.7rc2 yet, but hopefully will do so in the next day or two.
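If you suspect this kind of 32/64 bit mismatch, one way to check the extension modules directly is to read the ELF header from Python -- a rough sketch with a throwaway helper, assuming a Linux/ELF system (byte 4 of the header is EI_CLASS: 1 means 32-bit, 2 means 64-bit):

def elf_class(path):
    # The first four bytes are the ELF magic; byte 4 is EI_CLASS.
    f = open(path, 'rb')
    ident = f.read(5)
    f.close()
    if ident[:4] != '\x7fELF':
        return 'not an ELF object'
    return {1: '32-bit', 2: '64-bit'}.get(ord(ident[4]), 'unknown')

# e.g., after "import numpy":
# elf_class(numpy.core.multiarray.__file__)

On OSX the extensions are Mach-O rather than ELF, so there you would reach for "file" or "otool" instead.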
> cheers, > > David Cheers, Michael > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From gnurser at googlemail.com Sun Jan 25 06:34:38 2009 From: gnurser at googlemail.com (George Nurser) Date: Sun, 25 Jan 2009 11:34:38 +0000 Subject: [Numpy-discussion] numpy and the ACML In-Reply-To: <659C9323-3FF1-49C5-B5DF-583084EB429D@math.toronto.edu> References: <710F2847B0018641891D9A216027636029C3DF@ex3.envision.co.il> <4242995C-7284-4654-B858-3D6C36460D71@math.toronto.edu> <478644F8-4530-4ECE-BCB3-1900BE29FAE8@math.toronto.edu> <1d1e6ea70901241518p19abffcdt230cd24d58f74422@mail.gmail.com> <659C9323-3FF1-49C5-B5DF-583084EB429D@math.toronto.edu> Message-ID: <1d1e6ea70901250334me46f6bds1a12ede1e5405ae@mail.gmail.com> Hmmm. I'm at a bit of a loss to understand why it isn't working for you. How is it failing ?-- is the output of python setup.py build simply giving blas not found, or is it something more complex? Did you remove the previous build directory, and the previous install directory? Have you changed the site.cfg in the numpy directory where setup.py resides? -- this should be the important one -- and also in numpy/distutils? --this shouldn't matter, but might be worth a try. I'm no expert at debugging, but the class that's doing the work in finding these libraries is deep in distutils: Configuration in numpy/distutils/misc_util.py George. 2009/1/24 Gideon Simpson : > That's not working for me. Any thoughts on how to troubleshoot it? > -gideon > > On Jan 24, 2009, at 6:18 PM, George Nurser wrote: > >> I did manage to get it working. >> I remember that both libcblas.a (or a link to it) and libacml.so had >> to be in the same directory. >> >> Also I had to comment out lines 399-400 of setup.py: >> # if ('NO_ATLAS_INFO',1) in >> blas_info.get('define_macros',[]): >> # return None # dotblas needs ATLAS, Fortran compiled >> blas will not be sufficient >> >> In my site.cfg I have >> [blas] >> blas_libs = cblas, acml >> library_dirs = /noc/users/agn/ext/AMD64/acml/ifort64/lib >> include_dirs = /noc/users/agn/ext/AMD64/acml/ifort64/include >> >> [lapack] >> language = f77 >> lapack_libs = acml >> library_dirs = /noc/users/agn/ext/AMD64/acml/ifort64/lib >> include_dirs = /noc/users/agn/ext/AMD64/acml/ifort64/include >> >> Both libcblas.a (or a link to it) and libacml.so are in >> /noc/users/agn/ext/AMD64/acml/ifort64/lib >> >> HTH. George. >> >> 2009/1/24 Gideon Simpson : >>> I've tried building CBLAS, which seems to run properly by itself, but >>> numpy is still having difficulty. I've got the following in my >>> site.cfg: >>> >>> [blas_opt] >>> library_dirs = /usr/local/nonsystem/simpson/CBLAS/lib/LINUX:/usr/ >>> local/ >>> nonsystem/simpson/acml4.2.0/gfortran64/lib >>> include_dirs = /usr/local/nonsystem/simpson/CBLAS/include:/usr/local/ >>> nonsystem/simpson/acml4.2.0/gfortran64/include >>> libraries = cblas, acml >>> >>> [lapack_opt] >>> library_dirs = /usr/local/nonsystem/simpson/CBLAS/lib/LINUX:/usr/ >>> local/ >>> nonsystem/simpson/acml4.2.0/gfortran64/lib >>> include_dirs = /usr/local/nonsystem/simpson/CBLAS/include:/usr/local/ >>> nonsystem/simpson/acml4.2.0/gfortran64/include >>> libraries = cblas, acml >>> >>> I also created a symbolic link in /usr/local/nonsystem/simpson/CBLAS/ >>> lib/LINUX, from cblas_LINUX.a to libcblas.a. 
>>> >>> >>> Is there an easier way to check if numpy is locating the libs other >>> than doing python setup.py build, and looking at the output? >>> -gideon >>> >>> On Jan 24, 2009, at 4:05 PM, Pauli Virtanen wrote: >>> >>>> Sat, 24 Jan 2009 15:26:17 -0500, Gideon Simpson wrote: >>>> >>>>> Nadav- >>>>> >>>>> That doesn't quite seem to work for me. I added: >>>>> >>>>> [blas_opt] >>>>> library_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/ >>>>> lib >>>>> include_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/ >>>>> include >>>>> libraries = acml >>>>> >>>>> [lapack_opt] >>>>> library_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/ >>>>> lib >>>>> include_dirs = /usr/local/nonsystem/simpson/acml4.2.0/gfortran64/ >>>>> include >>>>> libraries = acml >>>>> >>>>> to my site.cfg with no luck. Somewhere else, people indicated that >>>>> the >>>>> ACML lacked a CBLAS which was necessary to make this work. >>>> >>>> Yep, IIRC you needed also CBLAS. (I think I got Numpy & Scipy linked >>>> against ACML at some point, but it's been a while and I've forgotten >>>> details...) There's a CBLAS here: >>>> >>>> http://www.netlib.org/blas/blast-forum/cblas.tgz >>>> >>>> So, I think you need to compile it & link it with ACML, and add it >>>> in >>>> site.cfg with ACML libs. >>>> >>>> -- >>>> Pauli Virtanen >>>> >>>> _______________________________________________ >>>> Numpy-discussion mailing list >>>> Numpy-discussion at scipy.org >>>> http://projects.scipy.org/mailman/listinfo/numpy-discussion >>> >>> _______________________________________________ >>> Numpy-discussion mailing list >>> Numpy-discussion at scipy.org >>> http://projects.scipy.org/mailman/listinfo/numpy-discussion >>> >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > From stefan at sun.ac.za Sun Jan 25 08:03:22 2009 From: stefan at sun.ac.za (=?ISO-8859-1?Q?St=E9fan_van_der_Walt?=) Date: Sun, 25 Jan 2009 15:03:22 +0200 Subject: [Numpy-discussion] Suggested change for NaN, Infs float->int conversion In-Reply-To: <1e2af89e0901250135h57aa07b6w52afde98a0060f71@mail.gmail.com> References: <1e2af89e0901250135h57aa07b6w52afde98a0060f71@mail.gmail.com> Message-ID: <9457e7c80901250503g454763c8m9c7017d9bc85b688@mail.gmail.com> 2009/1/25 Matthew Brett : > When converting arrays from float to ints, I notice that NaNs, Infs, > and -Infs all get the minimum integer value: > >>>> flts = np.array([np.nan, np.inf, -np.inf]) >>>> flts.astype(np.int16) > array([-32768, -32768, -32768], dtype=int16) > > However, setting NaNs into integer arrays gives a value of 0 I find this behaviour surprising too. 
There is a ticket at http://projects.scipy.org/scipy/numpy/ticket/980 Cheers Stéfan From josef.pktd at gmail.com Sun Jan 25 08:04:27 2009 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sun, 25 Jan 2009 08:04:27 -0500 Subject: [Numpy-discussion] Suggested change for NaN, Infs float->int conversion In-Reply-To: <1e2af89e0901250135h57aa07b6w52afde98a0060f71@mail.gmail.com> References: <1e2af89e0901250135h57aa07b6w52afde98a0060f71@mail.gmail.com> Message-ID: <1cd32cbb0901250504p3f4f2e30tb7571f7f02320f09@mail.gmail.com> On Sun, Jan 25, 2009 at 4:35 AM, Matthew Brett wrote: > Hi, > > When converting arrays from float to ints, I notice that NaNs, Infs, > and -Infs all get the minimum integer value: > >>>> flts = np.array([np.nan, np.inf, -np.inf]) >>>> flts.astype(np.int16) > array([-32768, -32768, -32768], dtype=int16) > > However, setting NaNs into integer arrays gives a value of 0 > >>>> ints = np.array([1]) >>>> ints.dtype > dtype('int32') >>>> ints[0] = np.nan >>>> ints > array([0]) > > whereas Inf or -Inf raise an error (as Josef pointed out recently): > >>> ints[0] = np.inf > ------------------------------------------------------------ > Traceback (most recent call last): > File "<stdin>", line 1, in <module> > OverflowError: cannot convert float infinity to long > >>>> ints[0] = -np.inf > ------------------------------------------------------------ > Traceback (most recent call last): > File "<stdin>", line 1, in <module> > OverflowError: cannot convert float infinity to long > > Matlab seems more consistent and sensible here: > >>> flts = [NaN Inf -Inf]; >>> int32(flts) > > ans = > > 0 2147483647 -2147483648 > >>> ints = int32([1 1 1]); >>> ints(:) = [NaN Inf -Inf] > > ints = > > 0 2147483647 -2147483648 > > Is there a route to change towards the Matlab behavior? Or at least > make numpy behavior self-consistent? > > Best, > > Matthew As we discussed in another thread, I think that the silent conversion of nan to zero should not be done, since it is too much of a source of errors. Users should be forced to set nans to a valid number explicitly. Josef From simpson at math.toronto.edu Sun Jan 25 10:44:15 2009 From: simpson at math.toronto.edu (Gideon Simpson) Date: Sun, 25 Jan 2009 10:44:15 -0500 Subject: [Numpy-discussion] glibc error In-Reply-To: <497C42F2.60707@gmail.com> References: <5A8A2038-9BAE-441B-9A5A-0760296E3F40@math.toronto.edu> <4db580fd0901242058r6324d68btc917a27933ef5658@mail.gmail.com> <497BEF20.4000309@ar.media.kyoto-u.ac.jp> <497C42F2.60707@gmail.com> Message-ID: Rebuilding the library against ATLAS 3.8.2 with lapack 3.1.1 seems to have done the trick. I do get one failure: ====================================================================== FAIL: test_umath.TestComplexFunctions.test_against_cmath ---------------------------------------------------------------------- Traceback (most recent call last): File "/usr/local/nonsystem/simpson/lib/python2.5/site-packages/nose/ case.py", line 182, in runTest self.test(*self.arg) File "/usr/local/nonsystem/simpson/lib/python2.5/site-packages/ numpy/core/tests/test_umath.py", line 268, in test_against_cmath assert abs(a - b) < atol, "%s %s: %s; cmath: %s"%(fname,p,a,b) AssertionError: arcsinh -2j: (-1.31695789692-1.57079632679j); cmath: (1.31695789692-1.57079632679j) ---------------------------------------------------------------------- -gideon On Jan 25, 2009, at 5:46 AM, Michael Abshoff wrote: > David Cournapeau wrote: >> Hoyt Koepke wrote: > > > >> Actually, I would advise using only 3.8.2.
Previous versions had bugs >> for some core routines used by numpy (at least 3.8.0 did). I am a bit >> surprised that a 64 bits-built atlas would be runnable at all in a 32 >> bits binary - I would expect the link phase to fail if two different >> object formats are linked together. > > Linking 32 and 64 bit ELF objects together in an extension will fail > on > any system but OSX, where the ld will happily link together anything. > Since that linker also does missing symbol lookup at runtime you will > see some surprising distutils bugs when you thought that the build > went > perfectly, e.g. scipy 0.6 would not use the fortran compiler I would > tell it to use, but one extension would use gfortran instead of > sage_fortran when it was available in $PATH. sage_fortran would > just inject an "-m64" into the options and call gfortran. But with a > few > fortran objects being 32 bit some extensions in scipy would fail to > import, and it took me quite a while to track this one down. I haven't > had time to test 0.7rc2 yet, but hopefully will do so in the next > day or > two. > >> cheers, >> >> David > > Cheers, > > Michael > >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion >> > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From njwilson23 at gmail.com Sun Jan 25 12:13:31 2009 From: njwilson23 at gmail.com (Nat Wilson) Date: Sun, 25 Jan 2009 12:13:31 -0500 Subject: [Numpy-discussion] Having trouble installing Numpy on OS X In-Reply-To: <497BC89F.4090001@gmail.com> References: <4BC33BAA-17F6-4BA4-9452-43FDC39DAED9@gmail.com> <3d375d730901241509i69f0b8f2w802aae635a7446f1@mail.gmail.com> <3d375d730901241612o2c74b650hdb01535a86aa3269@mail.gmail.com> <0D5AA8F9-AD2B-4681-A976-90F69733E149@gmail.com> <3d375d730901241641j31ef965ai6fcd43cd5bd0cbad@mail.gmail.com> <497BC89F.4090001@gmail.com> Message-ID: <36A4F388-1C99-44CB-B8A1-5210F8EF7E1F@gmail.com> I ended by giving up and grabbing the binary. I'll get 2.5 for Numpy too. I did grab Xcode 2.5, which has gcc 4.0.1. I didn't feel up to manually doing a complete change to the latest gcc, at least while my needs are being met. Thanks, Nat On Jan 24, 2009, at 9:04 PM, Michael Abshoff wrote: > Robert Kern wrote: >> On Sat, Jan 24, 2009 at 18:31, Nat Wilson >> wrote: >>> It throws this out. >>> >>> Python 2.6.1 (r261:67515, Jan 24 2009, 16:08:37) >>> [GCC 4.0.0 20041026 (Apple Computer, Inc. build 4061)] on darwin >>> Type "help", "copyright", "credits" or "license" for more >>> information. >>>>>> import os >>>>>> os.urandom(16) >>> '\xe0;n\x8a*\xb4\x08N\x80<\xef\x9b*\x06\x1b\xc4' >>>>>> >> >> Well, looking at the C code for random_seed(), I don't see a way for >> it to return NULL without having an exception set (assuming that the >> Python API calls aren't buggy). Except maybe the assert() call in >> there. When you built your Python, are you sure that -DNDEBUG was >> being used? >> > > Well, the gcc used to compile Python is rather ancient and that gcc > release by Apple has the reputation to be "buggier than a Florida > swamp > in July" and at least for building Sage it is blacklisted. So I would > suggest updating gcc by using some more recent XCode and trying again.
> > Cheers, > > Michael > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From pav at iki.fi Sun Jan 25 12:14:01 2009 From: pav at iki.fi (Pauli Virtanen) Date: Sun, 25 Jan 2009 17:14:01 +0000 (UTC) Subject: [Numpy-discussion] glibc error References: <5A8A2038-9BAE-441B-9A5A-0760296E3F40@math.toronto.edu> <4db580fd0901242058r6324d68btc917a27933ef5658@mail.gmail.com> <497BEF20.4000309@ar.media.kyoto-u.ac.jp> <497C42F2.60707@gmail.com> Message-ID: Sun, 25 Jan 2009 10:44:15 -0500, Gideon Simpson wrote: > Rebuilding the library against ATLAS 3.8.2 with lapack 3.1.1 seems to > have done the trick. I do get one failure: > > ====================================================================== > FAIL: test_umath.TestComplexFunctions.test_against_cmath > ---------------------------------------------------------------------- > Traceback (most recent call last): > File "/usr/local/nonsystem/simpson/lib/python2.5/site-packages/nose/ > case.py", line 182, in runTest > self.test(*self.arg) > File "/usr/local/nonsystem/simpson/lib/python2.5/site-packages/ > numpy/core/tests/test_umath.py", line 268, in test_against_cmath > assert abs(a - b) < atol, "%s %s: %s; cmath: %s"%(fname,p,a,b) > AssertionError: arcsinh -2j: (-1.31695789692-1.57079632679j); cmath: > (1.31695789692-1.57079632679j) Same as http://scipy.org/scipy/numpy/ticket/977, fixed in trunk. -- Pauli Virtanen From pgmdevlist at gmail.com Sun Jan 25 18:45:25 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Sun, 25 Jan 2009 18:45:25 -0500 Subject: [Numpy-discussion] Academic citation ? Message-ID: <6553F040-65E9-4F4A-9469-1E0035FCA2CA@gmail.com> All, What is the most up-to-date way to cite Numpy and Scipy in an academic journal ? Thanks a lot in advance P. From dwf at cs.toronto.edu Sun Jan 25 20:17:47 2009 From: dwf at cs.toronto.edu (David Warde-Farley) Date: Sun, 25 Jan 2009 20:17:47 -0500 Subject: [Numpy-discussion] Academic citation ? In-Reply-To: <6553F040-65E9-4F4A-9469-1E0035FCA2CA@gmail.com> References: <6553F040-65E9-4F4A-9469-1E0035FCA2CA@gmail.com> Message-ID: <81C1C801-A65F-4BA5-A52F-71ECAC1A6EA6@cs.toronto.edu> I believe this is what you're looking for: http://www.scipy.org/Citing_SciPy On 25-Jan-09, at 6:45 PM, Pierre GM wrote: > All, > What is the most up-to-date way to cite Numpy and Scipy in an academic > journal ? > Thanks a lot in advance > P. > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From pgmdevlist at gmail.com Sun Jan 25 20:46:21 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Sun, 25 Jan 2009 20:46:21 -0500 Subject: [Numpy-discussion] Academic citation ? In-Reply-To: <81C1C801-A65F-4BA5-A52F-71ECAC1A6EA6@cs.toronto.edu> References: <6553F040-65E9-4F4A-9469-1E0035FCA2CA@gmail.com> <81C1C801-A65F-4BA5-A52F-71ECAC1A6EA6@cs.toronto.edu> Message-ID: <2DDDAA63-A025-401D-BC2F-67FFD5C30587@gmail.com> David, Thanks, but that's only part of what I need. I could also refer to Travis O's paper in Computing in Science and Engineering, but I wondered whether there wasn't something more up-to-date. So, other answers are still welcome. P. 
On Jan 25, 2009, at 8:17 PM, David Warde-Farley wrote: > I believe this is what you're looking for: > > http://www.scipy.org/Citing_SciPy > > > On 25-Jan-09, at 6:45 PM, Pierre GM wrote: > >> All, >> What is the most up-to-date way to cite Numpy and Scipy in an >> academic >> journal ? >> Thanks a lot in advance >> P. >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From pgmdevlist at gmail.com Sun Jan 25 20:50:06 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Sun, 25 Jan 2009 20:50:06 -0500 Subject: [Numpy-discussion] How to remove pages from the wiki ? Message-ID: <54423B0F-E1DC-497F-ADCA-506318804B73@gmail.com> All, How can I remove pages from the wiki ? I ran into some that are clearly unrelated to numpy, scipy or programming... Thanks a lot in advance, P From hanni.ali at gmail.com Mon Jan 26 03:08:35 2009 From: hanni.ali at gmail.com (Hanni Ali) Date: Mon, 26 Jan 2009 08:08:35 +0000 Subject: [Numpy-discussion] List of Lists in C Message-ID: <789d27b10901260008u56d74cf3x2ab34b0b4817bebf@mail.gmail.com> Hi, Quick question, I've been doing a fair bit of extension writing in C recently, but wondered how best to implement: >>> l = [[]] * 5 to create a list of a given length, with each slot holding the desired initial value. A loop seems the straightforward approach, but I would have thought there was a more efficient way... Currently I just do it in Python and pass through the already initialized list as it seems perfectly efficient. Cheers, Hanni -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthieu.brucher at gmail.com Mon Jan 26 03:18:17 2009 From: matthieu.brucher at gmail.com (Matthieu Brucher) Date: Mon, 26 Jan 2009 09:18:17 +0100 Subject: [Numpy-discussion] List of Lists in C In-Reply-To: <789d27b10901260008u56d74cf3x2ab34b0b4817bebf@mail.gmail.com> References: <789d27b10901260008u56d74cf3x2ab34b0b4817bebf@mail.gmail.com> Message-ID: Hi, Remember that you are using the same list in each element of the outer list. If you don't want this, use [[] for i in range(5)]. I don't think there is another way in C either (anything else would be too complicated). Matthieu 2009/1/26 Hanni Ali : > Hi, > > Quick question, I've been doing a fair bit of extension writing in C > recently, but wondered how best to implement: > >>>> l = [[]] * 5 > > to create a list of a given length, with each slot holding the desired > initial value. > > A loop seems the straightforward approach, but I would have thought there was > a more efficient way... > > > Currently I just do it in Python and pass through the already initialized > list as it seems perfectly efficient. > > Cheers, > > Hanni > > > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > -- Information System Engineer, Ph.D.
Website: http://matthieu-brucher.developpez.com/ Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92 LinkedIn: http://www.linkedin.com/in/matthieubrucher From hanni.ali at gmail.com Mon Jan 26 03:47:25 2009 From: hanni.ali at gmail.com (Hanni Ali) Date: Mon, 26 Jan 2009 08:47:25 +0000 Subject: [Numpy-discussion] List of Lists in C In-Reply-To: References: <789d27b10901260008u56d74cf3x2ab34b0b4817bebf@mail.gmail.com> Message-ID: <789d27b10901260047v72b4a0fbp4607558c47b5c02f@mail.gmail.com> Yes fair point, but when it's an empty list and new elements are replaced with a new list instance it's fine, especially as [[]]*100000 is significantly faster than [[] for i in xrange(100000)] as I was previously doing. In fact I think that's partly answered my question: [[]]*x must create a list of pointers pointing at the same list, rather than [[] for i in xrange(100000)], which must create a list of new separate list instances. Hence the significant difference in speed. Hanni 2009/1/26 Matthieu Brucher > Hi, > > Remember that you are using the same list in each element of the > outer list. If you don't want this, use [[] for i in range(5)]. I > don't think there is another way in C either (anything else would be > too complicated). > > Matthieu > > 2009/1/26 Hanni Ali : > > Hi, > > > > Quick question, I've been doing a fair bit of extension writing in C > > recently, but wondered how best to implement: > > > >>>> l = [[]] * 5 > > > > to create a list of a given length, with each slot holding the desired > > initial value. > > > > A loop seems the straightforward approach, but I would have thought there > was > > a more efficient way... > > > > > > Currently I just do it in Python and pass through the already initialized > > list as it seems perfectly efficient. > > > > Cheers, > > > > Hanni > > > > > > > > _______________________________________________ > > Numpy-discussion mailing list > > Numpy-discussion at scipy.org > > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > > > > > > > -- > Information System Engineer, Ph.D. > Website: http://matthieu-brucher.developpez.com/ > Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92 > LinkedIn: http://www.linkedin.com/in/matthieubrucher > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthieu.brucher at gmail.com Mon Jan 26 03:50:49 2009 From: matthieu.brucher at gmail.com (Matthieu Brucher) Date: Mon, 26 Jan 2009 09:50:49 +0100 Subject: [Numpy-discussion] List of Lists in C In-Reply-To: <789d27b10901260047v72b4a0fbp4607558c47b5c02f@mail.gmail.com> References: <789d27b10901260008u56d74cf3x2ab34b0b4817bebf@mail.gmail.com> <789d27b10901260047v72b4a0fbp4607558c47b5c02f@mail.gmail.com> Message-ID: 2009/1/26 Hanni Ali : > Yes fair point, but when it's an empty list and new elements are replaced > with a new list instance it's fine, especially as [[]]*100000 is > significantly faster than [[] for i in xrange(100000)] as I was previously > doing. > > Hanni
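(A quick timeit comparison backs this up -- rough numbers only, they will vary by machine and Python version:

>>> import timeit
>>> timeit.Timer('[[]] * 100000').timeit(100)
>>> timeit.Timer('[[] for i in xrange(100000)]').timeit(100)

The first statement allocates one empty list plus a block of pointers to it; the second builds 100000 separate list objects.)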
I agree. Fewer memory allocations and initializations, thus more speed. Matthieu -- Information System Engineer, Ph.D. Website: http://matthieu-brucher.developpez.com/ Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92 LinkedIn: http://www.linkedin.com/in/matthieubrucher From hanni.ali at gmail.com Mon Jan 26 04:01:57 2009 From: hanni.ali at gmail.com (Hanni Ali) Date: Mon, 26 Jan 2009 09:01:57 +0000 Subject: [Numpy-discussion] List of Lists in C In-Reply-To: References: <789d27b10901260008u56d74cf3x2ab34b0b4817bebf@mail.gmail.com> <789d27b10901260047v72b4a0fbp4607558c47b5c02f@mail.gmail.com> Message-ID: <789d27b10901260101vc3b6091pace52f263835109c@mail.gmail.com> Correct, however other areas of the application expect an empty list to be present, otherwise I would have used None. 2009/1/26 Matthieu Brucher > 2009/1/26 Hanni Ali : > > Yes fair point, but when it's an empty list and new elements are replaced > > with a new list instance it's fine, especially as [[]]*100000 is > > significantly faster than [[] for i in xrange(100000)] as I was > previously > > doing. > > In this case, why do you put a list in it in the first place ? You > could put None, and it would be safer ;) > > > In fact I think that's partly answered my question: [[]]*x must create a > list > > of pointers pointing at the same list, rather than [[] for i in > > xrange(100000)], which must create a list of new separate list instances. > > Hence the significant difference in speed. > > > > Hanni > > I agree. Fewer memory allocations and initializations, thus more speed. > > Matthieu > -- > Information System Engineer, Ph.D. > Website: http://matthieu-brucher.developpez.com/ > Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92 > LinkedIn: http://www.linkedin.com/in/matthieubrucher > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From boogaloojb at yahoo.fr Mon Jan 26 09:59:35 2009 From: boogaloojb at yahoo.fr (Jean-Baptiste Rudant) Date: Mon, 26 Jan 2009 14:59:35 +0000 (GMT) Subject: [Numpy-discussion] Array of matrices - Inverse and dot Message-ID: <932307.78047.qm@web28512.mail.ukl.yahoo.com> Hello, I would like to operate in an easy and efficient way (without a Python loop) with arrays of matrices. Suppose a and b are some arrays of N1*N2 matrices of size 3*3; I would like to calculate inv_a and dot_ab, which would be arrays of N1*N2 (3*3)-matrices, such that: inv_a[i, j] = np.linalg.inv(a[i, j]) dot_ab[i, j] = np.dot(a[i, j], b[i, j]) (where a and b could be: N1 = 5 N2 = 6 a = np.random.random((N1, N2, 3, 3)) b = np.random.random((N1, N2, 3, 3)) ). Thank you, (Sorry to ask the list: I think this is quite basic stuff, but searching for "array of matrices" on Google didn't help me much.) Jean-Baptiste Rudant -------------- next part -------------- An HTML attachment was scrubbed... URL:
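As a rough sketch of what broadcasting can do for the dot part, using the shapes from the post above (untested against any particular numpy version; flat and inv_flat are just scratch names, and the intermediate array costs extra memory):

import numpy as np

N1, N2 = 5, 6
a = np.random.random((N1, N2, 3, 3))
b = np.random.random((N1, N2, 3, 3))

# dot_ab[i, j] == np.dot(a[i, j], b[i, j]): insert axes so that a
# carries the (m, k) indices and b the (k, n) indices, then sum over
# the shared k axis.
dot_ab = (a[..., :, :, np.newaxis] * b[..., np.newaxis, :, :]).sum(axis=-2)

# numpy.linalg.inv only accepts a single 2-d array at the time of
# writing, so for the inverses loop once over the flattened leading axes:
flat = a.reshape(-1, 3, 3)
inv_flat = np.empty_like(flat)
for k in range(flat.shape[0]):
    inv_flat[k] = np.linalg.inv(flat[k])
inv_a = inv_flat.reshape(a.shape)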
From cournape at gmail.com Mon Jan 26 10:18:33 2009 From: cournape at gmail.com (David Cournapeau) Date: Tue, 27 Jan 2009 00:18:33 +0900 Subject: [Numpy-discussion] Array of matrices - Inverse and dot In-Reply-To: <932307.78047.qm@web28512.mail.ukl.yahoo.com> References: <932307.78047.qm@web28512.mail.ukl.yahoo.com> Message-ID: <5b8d13220901260718h2bffb1aap4be4e4cf1feefbbf@mail.gmail.com> On Mon, Jan 26, 2009 at 11:59 PM, Jean-Baptiste Rudant wrote: > Hello, > I would like to operate in an easy and efficient way (without a Python > loop) with arrays of matrices. > Suppose a and b are some arrays of N1*N2 matrices of size 3*3; I would like > to calculate inv_a and dot_ab, which would be arrays of N1*N2 > (3*3)-matrices, such that: If you only care about 3*3 matrices, I would consider using the "developed" (fully written-out) formula of a 3x3 matrix inverse coded in numpy. numpy.linalg.inv will be quite slow for such small matrices anyway (all the machinery to call the underlying C code being almost certainly the bottleneck). David From boogaloojb at yahoo.fr Mon Jan 26 10:25:54 2009 From: boogaloojb at yahoo.fr (Jean-Baptiste Rudant) Date: Mon, 26 Jan 2009 15:25:54 +0000 (GMT) Subject: [Numpy-discussion] Re : Array of matrices - Inverse and dot References: <932307.78047.qm@web28512.mail.ukl.yahoo.com> <5b8d13220901260718h2bffb1aap4be4e4cf1feefbbf@mail.gmail.com> Message-ID: <344533.96607.qm@web28516.mail.ukl.yahoo.com> Thank you, but my example was bad. I have to deal with matrices which can be 100*100. ________________________________ De : David Cournapeau À : Discussion of Numerical Python Envoyé
le : Lundi, 26 Janvier 2009, 16h18mn 33s > Objet : Re: [Numpy-discussion] Array of matrices - Inverse and dot > > On Mon, Jan 26, 2009 at 11:59 PM, Jean-Baptiste Rudant > wrote: >> Hello, >> I would like to operate in an easy and efficient way (without python >> loop) with arrays of matrices. >> Suppose a and b are some arrays of N1*N2 matrices of size 3*3, I would >> like >> to calculate inv_a and dot_ab, which would be arrays of N1*N2 >> (3*3)-matrices, such as : > > If you only care about 3*3 matrices, I would consider using > "developed" formula of a 3x3 matrix inverse coded in numpy. numpy.inv > will be quite slow for such small matrices anyway (all the machinery > to call the underlying C code being almost certainly the bottleneck). > > David There is numpy.tensordot and numpy.tensorinv which looks like it might be doing what you want, but I never used it. Josef From jh at physics.ucf.edu Mon Jan 26 10:56:27 2009 From: jh at physics.ucf.edu (jh at physics.ucf.edu) Date: Mon, 26 Jan 2009 10:56:27 -0500 Subject: [Numpy-discussion] Academic citation ? In-Reply-To: (numpy-discussion-request@scipy.org) References: Message-ID: > What is the most up-to-date way to cite Numpy and Scipy in an academic > journal ? Cite our conference articles here: http://conference.scipy.org/proceedings/SciPy2008/index.html It would be nice if someone involved in the proceedings could post a bibtex on the citations page. And link the citations page to...something...easily navigated to from the front page. This brings up a related point: When someone goes to scipy.org, there is no way to navigate to conferences.scipy.org from scipy.org except by finding the link buried in the intro text. Ipython and all the whatever.scipy.org domains, except for docs.scipy.org, are completely absent; you have to know about them to find them. I don't even know where to find a complete list of these. They should all have a presence on at least the front page and maybe the navigation. --jh-- From gael.varoquaux at normalesup.org Mon Jan 26 11:10:25 2009 From: gael.varoquaux at normalesup.org (Gael Varoquaux) Date: Mon, 26 Jan 2009 17:10:25 +0100 Subject: [Numpy-discussion] Academic citation ? In-Reply-To: References: Message-ID: <20090126161025.GH1894@phare.normalesup.org> On Mon, Jan 26, 2009 at 10:56:27AM -0500, jh at physics.ucf.edu wrote: > http://conference.scipy.org/proceedings/SciPy2008/index.html > It would be nice if someone involved in the proceedings could post a > bibtex on the citations page. You want a bibtex entry that would be for all the proceedings? That may be an idea. I wouldn't agree that this is similar to citing the numpy or scipy authors, as Travis O and a few other did a huge amount of work, to get these packages running, and they are not even in the proceedings. In the long run, it would be great if the proceedings reflected well the on-going contributions to the community. Right now things are still in movement. > And link the citations page to...something...easily navigated to from > the front page. That's easy. > This brings up a related point: > When someone goes to scipy.org, there is no way to navigate to > conferences.scipy.org from scipy.org except by finding the link buried > in the intro text. Ipython and all the whatever.scipy.org domains, > except for docs.scipy.org, are completely absent; you have to know > about them to find them. I don't even know where to find a complete > list of these. They should all have a presence on at least the front > page and maybe the navigation. 
Yes, that's a big issue. The front page needs to be completely reworked, and probably not put as a wiki. Stefan had started some work on that, but he has a thesis to write, right now. Ga?l From pgmdevlist at gmail.com Mon Jan 26 11:27:29 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Mon, 26 Jan 2009 11:27:29 -0500 Subject: [Numpy-discussion] Academic citation ? In-Reply-To: References: Message-ID: <915BBEE3-0FC8-48D5-937E-F957F77D373E@gmail.com> JH, Thx for the links, but I'm afraid I need something more basic than that. For example, I'm referring to Python as: van Rossum, G. and Drake, F. L. (eds), 2006. Python Reference Manual, Python Software Foundation,. http://docs.python.org/ref/ref.html. I could indeed use http://www.scipy.org/Citing_SciPy to cite Scipy (although the citation is incomplete), and define something similar for Numpy... Or refer to the "Computing in Science and Engineering" special issue. I'm just a bit surprised there's no official standard. Thx, P. On Jan 26, 2009, at 10:56 AM, jh at physics.ucf.edu wrote: >> What is the most up-to-date way to cite Numpy and Scipy in an >> academic >> journal ? > > Cite our conference articles here: > > http://conference.scipy.org/proceedings/SciPy2008/index.html > > It would be nice if someone involved in the proceedings could post a > bibtex on the citations page. And link the citations page > to...something...easily navigated to from the front page. > > This brings up a related point: > > When someone goes to scipy.org, there is no way to navigate to > conferences.scipy.org from scipy.org except by finding the link buried > in the intro text. Ipython and all the whatever.scipy.org domains, > except for docs.scipy.org, are completely absent; you have to know > about them to find them. I don't even know where to find a complete > list of these. They should all have a presence on at least the front > page and maybe the navigation. > > --jh-- > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From aisaac at american.edu Mon Jan 26 12:43:29 2009 From: aisaac at american.edu (Alan G Isaac) Date: Mon, 26 Jan 2009 12:43:29 -0500 Subject: [Numpy-discussion] Academic citation ? In-Reply-To: <915BBEE3-0FC8-48D5-937E-F957F77D373E@gmail.com> References: <915BBEE3-0FC8-48D5-937E-F957F77D373E@gmail.com> Message-ID: <497DF641.10905@american.edu> @MANUAL{ascher.dubois.hinsen.hugunin.oliphant-1999-np, author = {Ascher, David and Paul F. Dubois and Konrad Hinsen and James Hugunin and Travis Oliphant}, year = 1999, title = {Numerical Python}, edition = {UCRL-MA-128569}, address = {Livermore, CA}, organization = {Lawrence Livermore National Laboratory}, keywords = {numpy} } @ARTICLE{dubois.hinsen.hugunin-1996-cp, author = {Dubois, Paul F. 
and Konrad Hinsen and James Hugunin}, year = {1996}, title = {Numerical Python}, journal = {Computers in Physics}, volume = 10, number = 3, month = {May/June}, keywords = {numpy} } @ARTICLE{dubois-1999-cse, author = {Dubois, Paul F.}, year = 1999, title = {Extending Python with Fortran}, journal = {Computing Science and Engineering}, volume = 1, number = 5, month = {Sep/Oct}, pages = {66--73}, keywords = {numpy} } @MANUAL{oliphant-2006-guide, author = {Oliphant, Travis E.}, year = 2006, title = {Guide to NumPy}, month = mar, address = {Provo, UT}, institution = {Brigham Young University}, url = {http://www.tramy.us/}, keywords = {numpy} } From pgmdevlist at gmail.com Mon Jan 26 12:48:53 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Mon, 26 Jan 2009 12:48:53 -0500 Subject: [Numpy-discussion] Academic citation ? In-Reply-To: <497DF641.10905@american.edu> References: <915BBEE3-0FC8-48D5-937E-F957F77D373E@gmail.com> <497DF641.10905@american.edu> Message-ID: <235F36F8-9CCA-4E31-9128-7C6CE099CBAB@gmail.com> Alan, Awesome, thx! Now, what about the very last one ? > @MANUAL{oliphant-2006-guide, > author = {Oliphant, Travis E.}, > year = 2006, > title = {Guide to NumPy}, > month = mar, > address = {Provo, UT}, > institution = {Brigham Young University}, > url = {http://www.tramy.us/}, > keywords = {numpy} > } Shouldn't we refer to the new doc.scipy.org instead of Travis' site ? From tpk at kraussfamily.org Mon Jan 26 13:18:09 2009 From: tpk at kraussfamily.org (Tom K.) Date: Mon, 26 Jan 2009 10:18:09 -0800 (PST) Subject: [Numpy-discussion] Array of matrices - Inverse and dot In-Reply-To: <932307.78047.qm@web28512.mail.ukl.yahoo.com> References: <932307.78047.qm@web28512.mail.ukl.yahoo.com> Message-ID: <21670624.post@talk.nabble.com> Jean-Baptiste Rudant wrote: > > I would like to operate in an easy and efficient way (without python loop) > with arrays of matrices. > > Suppose a and b are some arrays of N1*N2 matrices of size 3*3, I would > like to calculate inv_a and dot_ab, which would be arrays of N1*N2 > (3*3)-matrices, such as : > > inv_a[i, j] = np.linalg.inv(a[i, j]) > dot_ab[i, j] = np.dot(a[i, j], b[i, j]) > > (where a and b could be : > N1 = 5 > N2 = 6 > a = np.random((N1, N2, 3, 3) > b = np.random((N1, N2, 3, 3) > ). > Here's a one-liner: numpy.array(map(numpy.dot, a, b)) that works for matrix multiply if a, b are (n, 3, 3). Could do the same for linalg.inv. This comes up a lot in OFDM MIMO systems so I wrote C++ code for complex matrix multiply (arbitrary size), 2x2 complex inverse and 2x2 complex matrix singular values, and then swigged it. I know a colleague at work has extended this work to arbitrary size inverse. I wish I could share the code but it is something developed for my company which has fairly strict policies about posting these things (not worth the red-tape)... I vote "+1" for such features in Numpy. I haven't looked too much "under the hood" of numpy so I am not sure how you would do it or how hard it would be. Regards, Tom K. -- View this message in context: http://www.nabble.com/Array-of-matrices---Inverse-and-dot-tp21666949p21670624.html Sent from the Numpy-discussion mailing list archive at Nabble.com. From aisaac at american.edu Mon Jan 26 14:10:46 2009 From: aisaac at american.edu (Alan G Isaac) Date: Mon, 26 Jan 2009 14:10:46 -0500 Subject: [Numpy-discussion] Academic citation ? 
In-Reply-To: <235F36F8-9CCA-4E31-9128-7C6CE099CBAB@gmail.com> References: <915BBEE3-0FC8-48D5-937E-F957F77D373E@gmail.com> <497DF641.10905@american.edu> <235F36F8-9CCA-4E31-9128-7C6CE099CBAB@gmail.com> Message-ID: <497E0AB6.9010303@american.edu> On 1/26/2009 12:48 PM Pierre GM apparently wrote: > Shouldn't we refer to the new doc.scipy.org instead of Travis' site ? Not in my opinion: Travis wrote a book, which is what is being cited. The docs link is in fact to the same tramy site. I must add that IMO, it would be a courtesy for the docs to actually link to http://www.tramy.us/ rather than directly to the PDF. I for one urge my students to make the suggested donation when they download the book, and I am disappointed that the docs page renders this suggestion invisible. Alan From pav at iki.fi Mon Jan 26 14:37:33 2009 From: pav at iki.fi (Pauli Virtanen) Date: Mon, 26 Jan 2009 19:37:33 +0000 (UTC) Subject: [Numpy-discussion] Academic citation ? References: <915BBEE3-0FC8-48D5-937E-F957F77D373E@gmail.com> <497DF641.10905@american.edu> <235F36F8-9CCA-4E31-9128-7C6CE099CBAB@gmail.com> <497E0AB6.9010303@american.edu> Message-ID: Mon, 26 Jan 2009 14:10:46 -0500, Alan G Isaac wrote: > On 1/26/2009 12:48 PM Pierre GM apparently wrote: >> Shouldn't we refer to the new doc.scipy.org instead of Travis' site ? > > > Not in my opinion: Travis wrote a book, which is what is being cited. > > The docs link is in fact to the same tramy site. > > I must add that IMO, it would be a courtesy for the docs to actually > link to http://www.tramy.us/ rather than directly to the PDF. I for one > urge my students to make the suggested donation when they download the > book, and I am disappointed that the docs page renders this suggestion > invisible. Fixed, no need to be dissappointed any more. It actually didn't cross my mind that there was a reason not to link directly to the PDF file, but I see now that there is. -- Pauli Virtanen From arokem at berkeley.edu Mon Jan 26 14:48:58 2009 From: arokem at berkeley.edu (Ariel Rokem) Date: Mon, 26 Jan 2009 11:48:58 -0800 Subject: [Numpy-discussion] Usage of numpy.where and pylab.find with strings Message-ID: Hi - I am trying to find a string in a list with strings and have come across the following state of affairs: In [228]: subjects Out[228]: ['KAA', 'CCS', 'EJS', 'MNM', 'JHS', 'LJL', 'DVA', 'FCL', 'CNC', 'KFM', 'APM', 'GMC'] In [229]: subjects[0] Out[229]: 'KAA' In [230]: subjects[0] == 'KAA' Out[230]: True In [231]: np.where(subjects == 'KAA') Out[231]: () In [232]: pylab.find(subjects == 'KAA') Out[232]: array([], dtype=int32) It doesn't seem to matter if I make the list into an array: In [233]: np.array(subjects) Out[233]: array(['KAA', 'CCS', 'EJS', 'MNM', 'JHS', 'LJL', 'DVA', 'FCL', 'CNC', 'KFM', 'APM', 'GMC'], dtype='|S3') In [234]: pylab.find(subjects == 'KAA') Out[234]: array([], dtype=int32) In [235]: np.where(subjects == 'KAA') Out[235]: () What am I doing wrong? What does it mean that the dtype is IS3? Thanks a bunch -- Ariel -------------- next part -------------- An HTML attachment was scrubbed... 
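The behaviour in this question can be pinned down with a few lines: comparing a plain Python list to a string yields a single False rather than an elementwise result, so where() and find() have nothing to index; converting to an array first restores the expected semantics. A small reproduction sketch:

import numpy as np

subjects = ['KAA', 'CCS', 'EJS']
subjects == 'KAA'                  # False: list-to-string comparison, not elementwise
subjects = np.array(subjects)      # dtype becomes '|S3'
subjects == 'KAA'                  # array([ True, False, False], dtype=bool)
np.where(subjects == 'KAA')        # (array([0]),)

The '|S3' in the repr (easily misread as 'IS3') is the dtype code for fixed-width, length-3 byte strings; the leading '|' is the byte-order character, meaning byte order is not applicable.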
URL: From robert.kern at gmail.com Mon Jan 26 15:00:07 2009 From: robert.kern at gmail.com (Robert Kern) Date: Mon, 26 Jan 2009 14:00:07 -0600 Subject: [Numpy-discussion] Usage of numpy.where and pylab.find with strings In-Reply-To: References: Message-ID: <3d375d730901261200s70c34a2fjfa33729f899d531f@mail.gmail.com> On Mon, Jan 26, 2009 at 13:48, Ariel Rokem wrote: > Hi - I am trying to find a string in a list with strings and have come > across the following state of affairs: > In [228]: subjects > Out[228]: > ['KAA', > 'CCS', > 'EJS', > 'MNM', > 'JHS', > 'LJL', > 'DVA', > 'FCL', > 'CNC', > 'KFM', > 'APM', > 'GMC'] > In [229]: subjects[0] > Out[229]: 'KAA' > In [230]: subjects[0] == 'KAA' > Out[230]: True > In [231]: np.where(subjects == 'KAA') > Out[231]: () > In [232]: pylab.find(subjects == 'KAA') > Out[232]: array([], dtype=int32) Well, this will never work. Python lists don't broadcast like numpy arrays. > It doesn't seem to matter if I make the list into an array: > In [233]: np.array(subjects) > Out[233]: > array(['KAA', 'CCS', 'EJS', 'MNM', 'JHS', 'LJL', 'DVA', 'FCL', 'CNC', > 'KFM', 'APM', 'GMC'], > dtype='|S3') > In [234]: pylab.find(subjects == 'KAA') > Out[234]: array([], dtype=int32) > In [235]: np.where(subjects == 'KAA') > Out[235]: () > What am I doing wrong? subjects is still a list. You did not assign the array to that name. > What does it mean that the dtype is IS3? All of your elements are length-3 strings. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From kwgoodman at gmail.com Mon Jan 26 15:00:29 2009 From: kwgoodman at gmail.com (Keith Goodman) Date: Mon, 26 Jan 2009 12:00:29 -0800 Subject: [Numpy-discussion] Usage of numpy.where and pylab.find with strings In-Reply-To: References: Message-ID: On Mon, Jan 26, 2009 at 11:48 AM, Ariel Rokem wrote: > Hi - I am trying to find a string in a list with strings and have come > across the following state of affairs: > In [228]: subjects > Out[228]: > ['KAA', > 'CCS', > 'EJS', > 'MNM', > 'JHS', > 'LJL', > 'DVA', > 'FCL', > 'CNC', > 'KFM', > 'APM', > 'GMC'] > In [229]: subjects[0] > Out[229]: 'KAA' > In [230]: subjects[0] == 'KAA' > Out[230]: True > In [231]: np.where(subjects == 'KAA') > Out[231]: () > In [232]: pylab.find(subjects == 'KAA') > Out[232]: array([], dtype=int32) > It doesn't seem to matter if I make the list into an array: > In [233]: np.array(subjects) > Out[233]: > array(['KAA', 'CCS', 'EJS', 'MNM', 'JHS', 'LJL', 'DVA', 'FCL', 'CNC', > 'KFM', 'APM', 'GMC'], > dtype='|S3') > In [234]: pylab.find(subjects == 'KAA') > Out[234]: array([], dtype=int32) > In [235]: np.where(subjects == 'KAA') > Out[235]: () > What am I doing wrong? What does it mean that the dtype is IS3? I think you made a typo. Try changing line 233 to subjects = np.array(subjects) >> type(subjects) >> np.where(subjects == 'KAA') () >> np.where(np.asarray(subjects) == 'KAA') (array([0]),) From aisaac at american.edu Mon Jan 26 15:02:07 2009 From: aisaac at american.edu (Alan G Isaac) Date: Mon, 26 Jan 2009 15:02:07 -0500 Subject: [Numpy-discussion] Academic citation ? 
In-Reply-To: References: <915BBEE3-0FC8-48D5-937E-F957F77D373E@gmail.com> <497DF641.10905@american.edu> <235F36F8-9CCA-4E31-9128-7C6CE099CBAB@gmail.com> <497E0AB6.9010303@american.edu> Message-ID: <497E16BF.4080807@american.edu> On 1/26/2009 2:37 PM Pauli Virtanen apparently wrote: > Fixed, no need to be dissappointed any more. Thanks! Alan PS I hope "disappointed" did not sound so strong as to be discourteous. Email is tricky, and I tend to write mine with dangerous speed. From pgmdevlist at gmail.com Mon Jan 26 16:07:00 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Mon, 26 Jan 2009 16:07:00 -0500 Subject: [Numpy-discussion] Bug with mafromtxt In-Reply-To: <497BA2D6.1040508@gmail.com> References: <497B9D04.1050706@gmail.com> <1D1E7999-8D16-46C8-8A98-96B55FF8E5D5@gmail.com> <497BA2D6.1040508@gmail.com> Message-ID: <96627BB8-85FD-424D-81AE-A7B7288EC9AF@gmail.com> On Jan 24, 2009, at 6:23 PM, Ryan May wrote: >> > > Ok, thanks. I've dug a little further, and it seems like the > problem is that a > column of all missing values ends up as a column of all None's. > When you create > a (masked) array from a list of None's, you end up with an object > array. On one > hand I'd love for things to behave differently in this case, but on > the other I > understand why things work this way. Ryan, Mind giving r6434 a try? As usual, don't hesitate to report any problem. From rmay31 at gmail.com Mon Jan 26 16:26:43 2009 From: rmay31 at gmail.com (Ryan May) Date: Mon, 26 Jan 2009 15:26:43 -0600 Subject: [Numpy-discussion] Bug with mafromtxt In-Reply-To: <96627BB8-85FD-424D-81AE-A7B7288EC9AF@gmail.com> References: <497B9D04.1050706@gmail.com> <1D1E7999-8D16-46C8-8A98-96B55FF8E5D5@gmail.com> <497BA2D6.1040508@gmail.com> <96627BB8-85FD-424D-81AE-A7B7288EC9AF@gmail.com> Message-ID: <497E2A93.8070102@gmail.com> Pierre GM wrote: > On Jan 24, 2009, at 6:23 PM, Ryan May wrote: >> Ok, thanks. I've dug a little further, and it seems like the >> problem is that a >> column of all missing values ends up as a column of all None's. >> When you create >> a (masked) array from a list of None's, you end up with an object >> array. On one >> hand I'd love for things to behave differently in this case, but on >> the other I >> understand why things work this way. > > Ryan, > Mind giving r6434 a try? As usual, don't hesitate to report any problem. Works great! Thanks for the quick fix. I had racked my brain on how to go about fixing this cleanly, but this is far simpler than what I would have done. It makes sense, since all I really needed for the masked column was something *other* than object. Thanks a lot, Ryan -- Ryan May Graduate Research Assistant School of Meteorology University of Oklahoma From arokem at berkeley.edu Mon Jan 26 18:18:42 2009 From: arokem at berkeley.edu (Ariel Rokem) Date: Mon, 26 Jan 2009 15:18:42 -0800 Subject: [Numpy-discussion] Usage of numpy.where and pylab.find with strings In-Reply-To: References: Message-ID: Doh! That's embarrassing! Thanks! 
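For readers skimming the mafromtxt exchange above: the object-dtype trap Ryan describes is easy to reproduce in isolation, and the fix amounts to never letting an all-missing column reach the array constructor as a list of None. A sketch, independent of the genfromtxt/mafromtxt internals:

import numpy as np

column = [None, None, None]              # a column where every value was missing
print(np.array(column).dtype)            # object -- what the bug produced
print(np.ma.masked_all(3, dtype=float))  # [-- -- --]: a fully masked float column instead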
On Jan 26, 2009, at 12:00 PM, Keith Goodman wrote: > On Mon, Jan 26, 2009 at 11:48 AM, Ariel Rokem > wrote: >> Hi - I am trying to find a string in a list with strings and have >> come >> across the following state of affairs: >> In [228]: subjects >> Out[228]: >> ['KAA', >> 'CCS', >> 'EJS', >> 'MNM', >> 'JHS', >> 'LJL', >> 'DVA', >> 'FCL', >> 'CNC', >> 'KFM', >> 'APM', >> 'GMC'] >> In [229]: subjects[0] >> Out[229]: 'KAA' >> In [230]: subjects[0] == 'KAA' >> Out[230]: True >> In [231]: np.where(subjects == 'KAA') >> Out[231]: () >> In [232]: pylab.find(subjects == 'KAA') >> Out[232]: array([], dtype=int32) >> It doesn't seem to matter if I make the list into an array: >> In [233]: np.array(subjects) >> Out[233]: >> array(['KAA', 'CCS', 'EJS', 'MNM', 'JHS', 'LJL', 'DVA', 'FCL', 'CNC', >> 'KFM', 'APM', 'GMC'], >> dtype='|S3') >> In [234]: pylab.find(subjects == 'KAA') >> Out[234]: array([], dtype=int32) >> In [235]: np.where(subjects == 'KAA') >> Out[235]: () >> What am I doing wrong? What does it mean that the dtype is IS3? > > I think you made a typo. Try changing line 233 to subjects = > np.array(subjects) > >>> type(subjects) > >>> np.where(subjects == 'KAA') > () >>> np.where(np.asarray(subjects) == 'KAA') > (array([0]),) > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From cycomanic at gmail.com Mon Jan 26 19:26:21 2009 From: cycomanic at gmail.com (Jochen) Date: Tue, 27 Jan 2009 13:26:21 +1300 Subject: [Numpy-discussion] numpy ndarray questions Message-ID: <1233015981.4180.39.camel@phy.auckland.ac.nz> Hi all, I just wrote ctypes bindings to fftw3 (see http://projects.scipy.org/pipermail/scipy-user/2009-January/019557.html for the post to scipy). Now I have a couple of numpy related questions: In order to be able to use simd instructions I create an ndarray subclass, which uses fftw_malloc to allocate the memory and fftw_free to free the memory when the array is deleted. This works fine for inplace operations however if someone does something like this: a = fftw3.AlignedArray(1024,complex) a = a+1 a.ctypes.data points to a different memory location (this is actually an even bigger problem when executing fftw plans), however type(a) still gives me . I think I understand the reason for this is that python does a copy internally (it does not call the __copy__ or __deepcopy__ methods?). Is there a way that I get a different object type? Or even better is there a way to prevent operations like a=a+1 or make them automatically in-place operations? I realise that I could change the __add__ ... methods but that would also prevent b=a+1 operations. My second comment is with respect to the documentation for numpy.ctypeslib.ndpointer The documentation says: An ndpointer instance is used to describe an ndarray in restypes and argtypes specifications. however if I specify a restype to a function e.g. clib.malloc.restype = ctypeslib.ndpointer(flags='contiguous,aligned',shape=shape,dtype=dtype) Calling clib.malloc(1024) results in a TypeError: TypeError: default __new__ takes no parameters Am I correct in assuming that the documentation is incorrect and ndpointer can only be used for argtypes? Or am I missing something? 
Cheers Jochen From rmay31 at gmail.com Mon Jan 26 20:25:44 2009 From: rmay31 at gmail.com (Ryan May) Date: Mon, 26 Jan 2009 19:25:44 -0600 Subject: [Numpy-discussion] numpy ndarray questions In-Reply-To: <1233015981.4180.39.camel@phy.auckland.ac.nz> References: <1233015981.4180.39.camel@phy.auckland.ac.nz> Message-ID: <497E6298.7070708@gmail.com> Jochen wrote: > Hi all, > > I just wrote ctypes bindings to fftw3 (see > http://projects.scipy.org/pipermail/scipy-user/2009-January/019557.html > for the post to scipy). > Now I have a couple of numpy related questions: > > In order to be able to use simd instructions I > create an ndarray subclass, which uses fftw_malloc to allocate the > memory and fftw_free to free the memory when the array is deleted. This > works fine for inplace operations however if someone does something like > this: > > a = fftw3.AlignedArray(1024,complex) > > a = a+1 > > a.ctypes.data points to a different memory location (this is actually an > even bigger problem when executing fftw plans), however > type(a) still gives me . This might help some: http://www.scipy.org/Subclasses Ryan -- Ryan May Graduate Research Assistant School of Meteorology University of Oklahoma From cycomanic at gmail.com Mon Jan 26 20:46:16 2009 From: cycomanic at gmail.com (Jochen) Date: Tue, 27 Jan 2009 14:46:16 +1300 Subject: [Numpy-discussion] numpy ndarray questions In-Reply-To: <497E6298.7070708@gmail.com> References: <1233015981.4180.39.camel@phy.auckland.ac.nz> <497E6298.7070708@gmail.com> Message-ID: <1233020776.20296.4.camel@phy.auckland.ac.nz> On Mon, 2009-01-26 at 19:25 -0600, Ryan May wrote: > Jochen wrote: > > Hi all, > > > > I just wrote ctypes bindings to fftw3 (see > > http://projects.scipy.org/pipermail/scipy-user/2009-January/019557.html > > for the post to scipy). > > Now I have a couple of numpy related questions: > > > > In order to be able to use simd instructions I > > create an ndarray subclass, which uses fftw_malloc to allocate the > > memory and fftw_free to free the memory when the array is deleted. This > > works fine for inplace operations however if someone does something like > > this: > > > > a = fftw3.AlignedArray(1024,complex) > > > > a = a+1 > > > > a.ctypes.data points to a different memory location (this is actually an > > even bigger problem when executing fftw plans), however > > type(a) still gives me . > > This might help some: > > http://www.scipy.org/Subclasses > > Ryan > Thanks, I had read about __array_finalize__, but not about __array_wrap__. I'm not quite sure if I understand how I can use this to get around my problem, can you explain? Cheers Jochen From david at ar.media.kyoto-u.ac.jp Mon Jan 26 22:49:17 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Tue, 27 Jan 2009 12:49:17 +0900 Subject: [Numpy-discussion] numpy ndarray questions In-Reply-To: <1233015981.4180.39.camel@phy.auckland.ac.nz> References: <1233015981.4180.39.camel@phy.auckland.ac.nz> Message-ID: <497E843D.6080005@ar.media.kyoto-u.ac.jp> Jochen wrote: > Hi all, > > I just wrote ctypes bindings to fftw3 (see > http://projects.scipy.org/pipermail/scipy-user/2009-January/019557.html > for the post to scipy). > Now I have a couple of numpy related questions: > > In order to be able to use simd instructions I > create an ndarray subclass, which uses fftw_malloc to allocate the > memory and fftw_free to free the memory when the array is deleted. 
This > works fine for inplace operations however if someone does something like > this: > > a = fftw3.AlignedArray(1024,complex) > > a = a+1 > > a.ctypes.data points to a different memory location (this is actually an > even bigger problem when executing fftw plans), however > type(a) still gives me . > I can't comment about subclassing ndarrays, but I can give you a hint about aligned allocator problem: you could maintain two list of cached plans, automatically detect whether your arrays are aligned or not, and use the appropriate list of plans; one list is for aligned arrays, one for unaligned. Before removing support for fftw, I played with some C++ code to do exactly that. You can tell fftw to create plans for unaligned arrays by using FFTW_UNALIGNED flag: http://projects.scipy.org/scipy/scipy/browser/branches/refactor_fft/scipy/fftpack/backends/fftw3/src/zfft.cxx cheers, David From cycomanic at gmail.com Mon Jan 26 23:29:05 2009 From: cycomanic at gmail.com (Jochen) Date: Tue, 27 Jan 2009 17:29:05 +1300 Subject: [Numpy-discussion] numpy ndarray questions In-Reply-To: <497E843D.6080005@ar.media.kyoto-u.ac.jp> References: <1233015981.4180.39.camel@phy.auckland.ac.nz> <497E843D.6080005@ar.media.kyoto-u.ac.jp> Message-ID: <1233030545.20296.57.camel@phy.auckland.ac.nz> On Tue, 2009-01-27 at 12:49 +0900, David Cournapeau wrote: > Jochen wrote: > > Hi all, > > > > I just wrote ctypes bindings to fftw3 (see > > http://projects.scipy.org/pipermail/scipy-user/2009-January/019557.html > > for the post to scipy). > > Now I have a couple of numpy related questions: > > > > In order to be able to use simd instructions I > > create an ndarray subclass, which uses fftw_malloc to allocate the > > memory and fftw_free to free the memory when the array is deleted. This > > works fine for inplace operations however if someone does something like > > this: > > > > a = fftw3.AlignedArray(1024,complex) > > > > a = a+1 > > > > a.ctypes.data points to a different memory location (this is actually an > > even bigger problem when executing fftw plans), however > > type(a) still gives me . > > > > I can't comment about subclassing ndarrays, but I can give you a hint > about aligned allocator problem: you could maintain two list of cached > plans, automatically detect whether your arrays are aligned or not, and > use the appropriate list of plans; one list is for aligned arrays, one > for unaligned. Before removing support for fftw, I played with some C++ > code to do exactly that. You can tell fftw to create plans for unaligned > arrays by using FFTW_UNALIGNED flag: > > http://projects.scipy.org/scipy/scipy/browser/branches/refactor_fft/scipy/fftpack/backends/fftw3/src/zfft.cxx > > cheers, > > David Hi David, I have actually kept more closely to the fftw way of doing things, i.e. I create a plan python object from two arrays, it also stores the two arrays to prevent someone to delete the original arrays and then executing the plan. You can then either execute the plan object, which executes the fftw_plan, or you can call the fftw guru method fftw_execute_dft(inarray, outarray), where I don't do any checking on the arrays, to keep the performance high. I agree that such an approach is not quite as intuitive as x=fft(y), but for my application (propagation equations) I usually perform a large number of FFTs on two arrays, so I just opted for staying close to the fftw way of doing things. 
So my problem is not really so much with the alignment, it's more with executing the plans, even if someone has created a plan from two unaligned arrays, if he does something like x=x+1 somewhere in the calculations and later executes a plan which was created using x, x will not change because it's memory is not pointing to the original location. I was looking for some way to prevent this. Cheers Jochen > ______________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From david at ar.media.kyoto-u.ac.jp Mon Jan 26 23:28:43 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Tue, 27 Jan 2009 13:28:43 +0900 Subject: [Numpy-discussion] numpy ndarray questions In-Reply-To: <1233030545.20296.57.camel@phy.auckland.ac.nz> References: <1233015981.4180.39.camel@phy.auckland.ac.nz> <497E843D.6080005@ar.media.kyoto-u.ac.jp> <1233030545.20296.57.camel@phy.auckland.ac.nz> Message-ID: <497E8D7B.9080409@ar.media.kyoto-u.ac.jp> Jochen wrote: > On Tue, 2009-01-27 at 12:49 +0900, David Cournapeau wrote: > >> Jochen wrote: >> >>> Hi all, >>> >>> I just wrote ctypes bindings to fftw3 (see >>> http://projects.scipy.org/pipermail/scipy-user/2009-January/019557.html >>> for the post to scipy). >>> Now I have a couple of numpy related questions: >>> >>> In order to be able to use simd instructions I >>> create an ndarray subclass, which uses fftw_malloc to allocate the >>> memory and fftw_free to free the memory when the array is deleted. This >>> works fine for inplace operations however if someone does something like >>> this: >>> >>> a = fftw3.AlignedArray(1024,complex) >>> >>> a = a+1 >>> >>> a.ctypes.data points to a different memory location (this is actually an >>> even bigger problem when executing fftw plans), however >>> type(a) still gives me . >>> >>> >> I can't comment about subclassing ndarrays, but I can give you a hint >> about aligned allocator problem: you could maintain two list of cached >> plans, automatically detect whether your arrays are aligned or not, and >> use the appropriate list of plans; one list is for aligned arrays, one >> for unaligned. Before removing support for fftw, I played with some C++ >> code to do exactly that. You can tell fftw to create plans for unaligned >> arrays by using FFTW_UNALIGNED flag: >> >> http://projects.scipy.org/scipy/scipy/browser/branches/refactor_fft/scipy/fftpack/backends/fftw3/src/zfft.cxx >> >> cheers, >> >> David >> > > Hi David, > > I have actually kept more closely to the fftw way of doing things, i.e. > I create a plan python object from two arrays, it also stores the two > arrays to prevent someone to delete the original arrays and then > executing the plan. I am not sure I follow you when you say the "fftw way": you can avoid having to store the arrays altogether while still using fftw plans, there is nothing "unfftw" about using plans that way. I think trying to guarantee that your arrays data buffers won't change is more complicated. 
David From cycomanic at gmail.com Tue Jan 27 00:03:08 2009 From: cycomanic at gmail.com (Jochen) Date: Tue, 27 Jan 2009 18:03:08 +1300 Subject: [Numpy-discussion] numpy ndarray questions In-Reply-To: <497E8D7B.9080409@ar.media.kyoto-u.ac.jp> References: <1233015981.4180.39.camel@phy.auckland.ac.nz> <497E843D.6080005@ar.media.kyoto-u.ac.jp> <1233030545.20296.57.camel@phy.auckland.ac.nz> <497E8D7B.9080409@ar.media.kyoto-u.ac.jp> Message-ID: <1233032588.20296.82.camel@phy.auckland.ac.nz> On Tue, 2009-01-27 at 13:28 +0900, David Cournapeau wrote: > Jochen wrote: > > On Tue, 2009-01-27 at 12:49 +0900, David Cournapeau wrote: > > > >> Jochen wrote: > >> > >>> Hi all, > >>> > >>> I just wrote ctypes bindings to fftw3 (see > >>> http://projects.scipy.org/pipermail/scipy-user/2009-January/019557.html > >>> for the post to scipy). > >>> Now I have a couple of numpy related questions: > >>> > >>> In order to be able to use simd instructions I > >>> create an ndarray subclass, which uses fftw_malloc to allocate the > >>> memory and fftw_free to free the memory when the array is deleted. This > >>> works fine for inplace operations however if someone does something like > >>> this: > >>> > >>> a = fftw3.AlignedArray(1024,complex) > >>> > >>> a = a+1 > >>> > >>> a.ctypes.data points to a different memory location (this is actually an > >>> even bigger problem when executing fftw plans), however > >>> type(a) still gives me . > >>> > >>> > >> I can't comment about subclassing ndarrays, but I can give you a hint > >> about aligned allocator problem: you could maintain two list of cached > >> plans, automatically detect whether your arrays are aligned or not, and > >> use the appropriate list of plans; one list is for aligned arrays, one > >> for unaligned. Before removing support for fftw, I played with some C++ > >> code to do exactly that. You can tell fftw to create plans for unaligned > >> arrays by using FFTW_UNALIGNED flag: > >> > >> http://projects.scipy.org/scipy/scipy/browser/branches/refactor_fft/scipy/fftpack/backends/fftw3/src/zfft.cxx > >> > >> cheers, > >> > >> David > >> > > > > Hi David, > > > > I have actually kept more closely to the fftw way of doing things, i.e. > > I create a plan python object from two arrays, it also stores the two > > arrays to prevent someone to delete the original arrays and then > > executing the plan. > > I am not sure I follow you when you say the "fftw way": you can avoid > having to store the arrays altogether while still using fftw plans, > there is nothing "unfftw" about using plans that way. I think trying to > guarantee that your arrays data buffers won't change is more complicated. > > David Sorry maybe I wasn't quite clear, what I mean by the "fftw way" is creating a plan and executing the plan, instead of doing x=fft(y). As you say the problem comes when you execute a plan on arrays which don't exist anymore, which causes python to segfault (I'm talking about using fftw_execute() not fftw_execute_dft). So yes essentially my problem is trying to ensure that the buffer does not change. BTW memmap arrays have the same problem if I create a memmap array and later do something like a=a+1 all later changes will not be written to the file. BTW I just answered to you in the other thread on scipy-users as well, I'll try to just keep the rest of my posts here so people don't have to look at two lists if they want to follow things. 
Cheers Jochen > ______ > _________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From david at ar.media.kyoto-u.ac.jp Mon Jan 26 23:54:23 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Tue, 27 Jan 2009 13:54:23 +0900 Subject: [Numpy-discussion] numpy ndarray questions In-Reply-To: <1233032588.20296.82.camel@phy.auckland.ac.nz> References: <1233015981.4180.39.camel@phy.auckland.ac.nz> <497E843D.6080005@ar.media.kyoto-u.ac.jp> <1233030545.20296.57.camel@phy.auckland.ac.nz> <497E8D7B.9080409@ar.media.kyoto-u.ac.jp> <1233032588.20296.82.camel@phy.auckland.ac.nz> Message-ID: <497E937F.9020408@ar.media.kyoto-u.ac.jp> Jochen wrote: > On Tue, 2009-01-27 at 13:28 +0900, David Cournapeau wrote: > >> Jochen wrote: >> >>> On Tue, 2009-01-27 at 12:49 +0900, David Cournapeau wrote: >>> >>> >>>> Jochen wrote: >>>> >>>> >>>>> Hi all, >>>>> >>>>> I just wrote ctypes bindings to fftw3 (see >>>>> http://projects.scipy.org/pipermail/scipy-user/2009-January/019557.html >>>>> for the post to scipy). >>>>> Now I have a couple of numpy related questions: >>>>> >>>>> In order to be able to use simd instructions I >>>>> create an ndarray subclass, which uses fftw_malloc to allocate the >>>>> memory and fftw_free to free the memory when the array is deleted. This >>>>> works fine for inplace operations however if someone does something like >>>>> this: >>>>> >>>>> a = fftw3.AlignedArray(1024,complex) >>>>> >>>>> a = a+1 >>>>> >>>>> a.ctypes.data points to a different memory location (this is actually an >>>>> even bigger problem when executing fftw plans), however >>>>> type(a) still gives me . >>>>> >>>>> >>>>> >>>> I can't comment about subclassing ndarrays, but I can give you a hint >>>> about aligned allocator problem: you could maintain two list of cached >>>> plans, automatically detect whether your arrays are aligned or not, and >>>> use the appropriate list of plans; one list is for aligned arrays, one >>>> for unaligned. Before removing support for fftw, I played with some C++ >>>> code to do exactly that. You can tell fftw to create plans for unaligned >>>> arrays by using FFTW_UNALIGNED flag: >>>> >>>> http://projects.scipy.org/scipy/scipy/browser/branches/refactor_fft/scipy/fftpack/backends/fftw3/src/zfft.cxx >>>> >>>> cheers, >>>> >>>> David >>>> >>>> >>> Hi David, >>> >>> I have actually kept more closely to the fftw way of doing things, i.e. >>> I create a plan python object from two arrays, it also stores the two >>> arrays to prevent someone to delete the original arrays and then >>> executing the plan. >>> >> I am not sure I follow you when you say the "fftw way": you can avoid >> having to store the arrays altogether while still using fftw plans, >> there is nothing "unfftw" about using plans that way. I think trying to >> guarantee that your arrays data buffers won't change is more complicated. >> >> David >> > > Sorry maybe I wasn't quite clear, what I mean by the "fftw way" is > creating a plan and executing the plan, instead of doing x=fft(y). I guess I was not very clear, because my suggestion has nothing to do with the above. > As > you say the problem comes when you execute a plan on arrays which don't > exist anymore, which causes python to segfault (I'm talking about using > fftw_execute() not fftw_execute_dft). So yes essentially my problem is > trying to ensure that the buffer does not change. 
I agree that if you just use fftw_execute, you will have segfaults: I am just not sure that your problem is to ensure that the buffer does not change :) You can instead handle relatively easily the case where the buffers are not aligned, while still using fftw API. The fftw_execute is just not an appropriate API for python code, IMHO. Another solution would be to work on numpy itself, so that it use aligned buffers, but that's obviously much longer term - that's just one of those things on my TODO list that I never took the time to do, David From cycomanic at gmail.com Tue Jan 27 00:43:29 2009 From: cycomanic at gmail.com (Jochen) Date: Tue, 27 Jan 2009 18:43:29 +1300 Subject: [Numpy-discussion] numpy ndarray questions In-Reply-To: <497E937F.9020408@ar.media.kyoto-u.ac.jp> References: <1233015981.4180.39.camel@phy.auckland.ac.nz> <497E843D.6080005@ar.media.kyoto-u.ac.jp> <1233030545.20296.57.camel@phy.auckland.ac.nz> <497E8D7B.9080409@ar.media.kyoto-u.ac.jp> <1233032588.20296.82.camel@phy.auckland.ac.nz> <497E937F.9020408@ar.media.kyoto-u.ac.jp> Message-ID: <1233035009.20296.98.camel@phy.auckland.ac.nz> On Tue, 2009-01-27 at 13:54 +0900, David Cournapeau wrote: > Jochen wrote: > > On Tue, 2009-01-27 at 13:28 +0900, David Cournapeau wrote: > > > >> Jochen wrote: > >> > >>> On Tue, 2009-01-27 at 12:49 +0900, David Cournapeau wrote: > >>> > >>> > >>>> Jochen wrote: > >>>> > >>>> > >>>>> Hi all, > >>>>> > >>>>> I just wrote ctypes bindings to fftw3 (see > >>>>> http://projects.scipy.org/pipermail/scipy-user/2009-January/019557.html > >>>>> for the post to scipy). > >>>>> Now I have a couple of numpy related questions: > >>>>> > >>>>> In order to be able to use simd instructions I > >>>>> create an ndarray subclass, which uses fftw_malloc to allocate the > >>>>> memory and fftw_free to free the memory when the array is deleted. This > >>>>> works fine for inplace operations however if someone does something like > >>>>> this: > >>>>> > >>>>> a = fftw3.AlignedArray(1024,complex) > >>>>> > >>>>> a = a+1 > >>>>> > >>>>> a.ctypes.data points to a different memory location (this is actually an > >>>>> even bigger problem when executing fftw plans), however > >>>>> type(a) still gives me . > >>>>> > >>>>> > >>>>> > >>>> I can't comment about subclassing ndarrays, but I can give you a hint > >>>> about aligned allocator problem: you could maintain two list of cached > >>>> plans, automatically detect whether your arrays are aligned or not, and > >>>> use the appropriate list of plans; one list is for aligned arrays, one > >>>> for unaligned. Before removing support for fftw, I played with some C++ > >>>> code to do exactly that. You can tell fftw to create plans for unaligned > >>>> arrays by using FFTW_UNALIGNED flag: > >>>> > >>>> http://projects.scipy.org/scipy/scipy/browser/branches/refactor_fft/scipy/fftpack/backends/fftw3/src/zfft.cxx > >>>> > >>>> cheers, > >>>> > >>>> David > >>>> > >>>> > >>> Hi David, > >>> > >>> I have actually kept more closely to the fftw way of doing things, i.e. > >>> I create a plan python object from two arrays, it also stores the two > >>> arrays to prevent someone to delete the original arrays and then > >>> executing the plan. > >>> > >> I am not sure I follow you when you say the "fftw way": you can avoid > >> having to store the arrays altogether while still using fftw plans, > >> there is nothing "unfftw" about using plans that way. I think trying to > >> guarantee that your arrays data buffers won't change is more complicated. 
> >> > >> David > >> > > > > Sorry maybe I wasn't quite clear, what I mean by the "fftw way" is > > creating a plan and executing the plan, instead of doing x=fft(y). > > I guess I was not very clear, because my suggestion has nothing to do > with the above. > > > As > > you say the problem comes when you execute a plan on arrays which don't > > exist anymore, which causes python to segfault (I'm talking about using > > fftw_execute() not fftw_execute_dft). So yes essentially my problem is > > trying to ensure that the buffer does not change. > > I agree that if you just use fftw_execute, you will have segfaults: I am > just not sure that your problem is to ensure that the buffer does not > change :) You can instead handle relatively easily the case where the > buffers are not aligned, while still using fftw API. The fftw_execute is > just not an appropriate API for python code, IMHO. Ah ok, I think I understand you now :). I agree that using fftw_execute is not the most pythonic way of doing things. However every other way I could think of, you would need to do a number of checks and possibly create new plans, as well as keeping old plans around when doing a fft. (I believe that's what you did in the old fftw bindings in scipy?). So I decided to create fftw bindings first for maximum performance (I know, just doing the whole calculation in c is probably more appropriate if you want maximum performance ;), it also fits my needs. However I've been thinking about ways to make the whole thing more intuitive. At least now I can compare how much overhead I'm adding if I start to do checks in order to make the handling easier :) > > Another solution would be to work on numpy itself, so that it use > aligned buffers, but that's obviously much longer term - that's just one > of those things on my TODO list that I never took the time to do, > I agree that would be nice, I'd think a couple of operations would probably benefit from that. > David > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion From david at ar.media.kyoto-u.ac.jp Tue Jan 27 00:46:32 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Tue, 27 Jan 2009 14:46:32 +0900 Subject: [Numpy-discussion] numpy ndarray questions In-Reply-To: <1233035009.20296.98.camel@phy.auckland.ac.nz> References: <1233015981.4180.39.camel@phy.auckland.ac.nz> <497E843D.6080005@ar.media.kyoto-u.ac.jp> <1233030545.20296.57.camel@phy.auckland.ac.nz> <497E8D7B.9080409@ar.media.kyoto-u.ac.jp> <1233032588.20296.82.camel@phy.auckland.ac.nz> <497E937F.9020408@ar.media.kyoto-u.ac.jp> <1233035009.20296.98.camel@phy.auckland.ac.nz> Message-ID: <497E9FB8.5060900@ar.media.kyoto-u.ac.jp> Jochen wrote: > On Tue, 2009-01-27 at 13:54 +0900, David Cournapeau wrote: > >> Jochen wrote: >> >>> On Tue, 2009-01-27 at 13:28 +0900, David Cournapeau wrote: >>> >>> >>>> Jochen wrote: >>>> >>>> >>>>> On Tue, 2009-01-27 at 12:49 +0900, David Cournapeau wrote: >>>>> >>>>> >>>>> >>>>>> Jochen wrote: >>>>>> >>>>>> >>>>>> >>>>>>> Hi all, >>>>>>> >>>>>>> I just wrote ctypes bindings to fftw3 (see >>>>>>> http://projects.scipy.org/pipermail/scipy-user/2009-January/019557.html >>>>>>> for the post to scipy). 
>>>>>>> Now I have a couple of numpy related questions: >>>>>>> >>>>>>> In order to be able to use simd instructions I >>>>>>> create an ndarray subclass, which uses fftw_malloc to allocate the >>>>>>> memory and fftw_free to free the memory when the array is deleted. This >>>>>>> works fine for inplace operations however if someone does something like >>>>>>> this: >>>>>>> >>>>>>> a = fftw3.AlignedArray(1024,complex) >>>>>>> >>>>>>> a = a+1 >>>>>>> >>>>>>> a.ctypes.data points to a different memory location (this is actually an >>>>>>> even bigger problem when executing fftw plans), however >>>>>>> type(a) still gives me . >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>> I can't comment about subclassing ndarrays, but I can give you a hint >>>>>> about aligned allocator problem: you could maintain two list of cached >>>>>> plans, automatically detect whether your arrays are aligned or not, and >>>>>> use the appropriate list of plans; one list is for aligned arrays, one >>>>>> for unaligned. Before removing support for fftw, I played with some C++ >>>>>> code to do exactly that. You can tell fftw to create plans for unaligned >>>>>> arrays by using FFTW_UNALIGNED flag: >>>>>> >>>>>> http://projects.scipy.org/scipy/scipy/browser/branches/refactor_fft/scipy/fftpack/backends/fftw3/src/zfft.cxx >>>>>> >>>>>> cheers, >>>>>> >>>>>> David >>>>>> >>>>>> >>>>>> >>>>> Hi David, >>>>> >>>>> I have actually kept more closely to the fftw way of doing things, i.e. >>>>> I create a plan python object from two arrays, it also stores the two >>>>> arrays to prevent someone to delete the original arrays and then >>>>> executing the plan. >>>>> >>>>> >>>> I am not sure I follow you when you say the "fftw way": you can avoid >>>> having to store the arrays altogether while still using fftw plans, >>>> there is nothing "unfftw" about using plans that way. I think trying to >>>> guarantee that your arrays data buffers won't change is more complicated. >>>> >>>> David >>>> >>>> >>> Sorry maybe I wasn't quite clear, what I mean by the "fftw way" is >>> creating a plan and executing the plan, instead of doing x=fft(y). >>> >> I guess I was not very clear, because my suggestion has nothing to do >> with the above. >> >> >>> As >>> you say the problem comes when you execute a plan on arrays which don't >>> exist anymore, which causes python to segfault (I'm talking about using >>> fftw_execute() not fftw_execute_dft). So yes essentially my problem is >>> trying to ensure that the buffer does not change. >>> >> I agree that if you just use fftw_execute, you will have segfaults: I am >> just not sure that your problem is to ensure that the buffer does not >> change :) You can instead handle relatively easily the case where the >> buffers are not aligned, while still using fftw API. The fftw_execute is >> just not an appropriate API for python code, IMHO. >> > > Ah ok, I think I understand you now :). I agree that using fftw_execute > is not the most pythonic way of doing things. However every other way I > could think of, you would need to do a number of checks and possibly > create new plans, as well as keeping old plans around when doing a fft. > Yes, but I don't see the problem with keeping old plans/creating new ones. It was a pain in C++, because I had to handle many backends, not just fftw, but in your case, doing it in python is not very difficult I think. > (I believe that's what you did in the old fftw bindings in scipy?). that's what was done in scipy.fftpack originally, but not by me. 
> So I > decided to create fftw bindings first for maximum performance (I know, > just doing the whole calculation in c is probably more appropriate if > you want maximum performance ;), it also fits my needs. Caching plans will not affect performances; on the contrary, you will be able to use more aggressively optimized plans if you add the ability to save/load plans to the disk (using wisdow mechanism). Checking for aligned arrays is one line in python, and should not affect much performances, specially if you do fft on big arrays, David From cycomanic at gmail.com Tue Jan 27 01:31:37 2009 From: cycomanic at gmail.com (Jochen) Date: Tue, 27 Jan 2009 19:31:37 +1300 Subject: [Numpy-discussion] numpy ndarray questions In-Reply-To: <497E9FB8.5060900@ar.media.kyoto-u.ac.jp> References: <1233015981.4180.39.camel@phy.auckland.ac.nz> <497E843D.6080005@ar.media.kyoto-u.ac.jp> <1233030545.20296.57.camel@phy.auckland.ac.nz> <497E8D7B.9080409@ar.media.kyoto-u.ac.jp> <1233032588.20296.82.camel@phy.auckland.ac.nz> <497E937F.9020408@ar.media.kyoto-u.ac.jp> <1233035009.20296.98.camel@phy.auckland.ac.nz> <497E9FB8.5060900@ar.media.kyoto-u.ac.jp> Message-ID: <1233037897.20296.109.camel@phy.auckland.ac.nz> On Tue, 2009-01-27 at 14:46 +0900, David Cournapeau wrote: > Jochen wrote: > > On Tue, 2009-01-27 at 13:54 +0900, David Cournapeau wrote: > > > >> Jochen wrote: > >> > >>> On Tue, 2009-01-27 at 13:28 +0900, David Cournapeau wrote: > >>> > >>> > >>>> Jochen wrote: > >>>> > >>>> > >>>>> On Tue, 2009-01-27 at 12:49 +0900, David Cournapeau wrote: > >>>>> > >>>>> > >>>>> > >>>>>> Jochen wrote: > >>>>>> > >>>>>> > >>>>>> > >>>>>>> Hi all, > >>>>>>> > >>>>>>> I just wrote ctypes bindings to fftw3 (see > >>>>>>> http://projects.scipy.org/pipermail/scipy-user/2009-January/019557.html > >>>>>>> for the post to scipy). > >>>>>>> Now I have a couple of numpy related questions: > >>>>>>> > >>>>>>> In order to be able to use simd instructions I > >>>>>>> create an ndarray subclass, which uses fftw_malloc to allocate the > >>>>>>> memory and fftw_free to free the memory when the array is deleted. This > >>>>>>> works fine for inplace operations however if someone does something like > >>>>>>> this: > >>>>>>> > >>>>>>> a = fftw3.AlignedArray(1024,complex) > >>>>>>> > >>>>>>> a = a+1 > >>>>>>> > >>>>>>> a.ctypes.data points to a different memory location (this is actually an > >>>>>>> even bigger problem when executing fftw plans), however > >>>>>>> type(a) still gives me . > >>>>>>> > >>>>>>> > >>>>>>> > >>>>>>> > >>>>>> I can't comment about subclassing ndarrays, but I can give you a hint > >>>>>> about aligned allocator problem: you could maintain two list of cached > >>>>>> plans, automatically detect whether your arrays are aligned or not, and > >>>>>> use the appropriate list of plans; one list is for aligned arrays, one > >>>>>> for unaligned. Before removing support for fftw, I played with some C++ > >>>>>> code to do exactly that. You can tell fftw to create plans for unaligned > >>>>>> arrays by using FFTW_UNALIGNED flag: > >>>>>> > >>>>>> http://projects.scipy.org/scipy/scipy/browser/branches/refactor_fft/scipy/fftpack/backends/fftw3/src/zfft.cxx > >>>>>> > >>>>>> cheers, > >>>>>> > >>>>>> David > >>>>>> > >>>>>> > >>>>>> > >>>>> Hi David, > >>>>> > >>>>> I have actually kept more closely to the fftw way of doing things, i.e. > >>>>> I create a plan python object from two arrays, it also stores the two > >>>>> arrays to prevent someone to delete the original arrays and then > >>>>> executing the plan. 
> >>>>> > >>>>> > >>>> I am not sure I follow you when you say the "fftw way": you can avoid > >>>> having to store the arrays altogether while still using fftw plans, > >>>> there is nothing "unfftw" about using plans that way. I think trying to > >>>> guarantee that your arrays data buffers won't change is more complicated. > >>>> > >>>> David > >>>> > >>>> > >>> Sorry maybe I wasn't quite clear, what I mean by the "fftw way" is > >>> creating a plan and executing the plan, instead of doing x=fft(y). > >>> > >> I guess I was not very clear, because my suggestion has nothing to do > >> with the above. > >> > >> > >>> As > >>> you say the problem comes when you execute a plan on arrays which don't > >>> exist anymore, which causes python to segfault (I'm talking about using > >>> fftw_execute() not fftw_execute_dft). So yes essentially my problem is > >>> trying to ensure that the buffer does not change. > >>> > >> I agree that if you just use fftw_execute, you will have segfaults: I am > >> just not sure that your problem is to ensure that the buffer does not > >> change :) You can instead handle relatively easily the case where the > >> buffers are not aligned, while still using fftw API. The fftw_execute is > >> just not an appropriate API for python code, IMHO. > >> > > > > Ah ok, I think I understand you now :). I agree that using fftw_execute > > is not the most pythonic way of doing things. However every other way I > > could think of, you would need to do a number of checks and possibly > > create new plans, as well as keeping old plans around when doing a fft. > > > > Yes, but I don't see the problem with keeping old plans/creating new > ones. It was a pain in C++, because I had to handle many backends, not > just fftw, but in your case, doing it in python is not very difficult I > think. > > > (I believe that's what you did in the old fftw bindings in scipy?). > > that's what was done in scipy.fftpack originally, but not by me. > > > So I > > decided to create fftw bindings first for maximum performance (I know, > > just doing the whole calculation in c is probably more appropriate if > > you want maximum performance ;), it also fits my needs. > > Caching plans will not affect performances; on the contrary, you will be > able to use more aggressively optimized plans if you add the ability to > save/load plans to the disk (using wisdow mechanism). Checking for > aligned arrays is one line in python, and should not affect much > performances, specially if you do fft on big arrays, > I agree checking for aligned arrays does not affect performance much, I guess the lookup mechanism for the plans is what would take up the most time IMO. I'll definitely look into this though :) Cheers Jochen From sturla at molden.no Tue Jan 27 06:04:46 2009 From: sturla at molden.no (Sturla Molden) Date: Tue, 27 Jan 2009 12:04:46 +0100 Subject: [Numpy-discussion] numpy ndarray questions In-Reply-To: <1233032588.20296.82.camel@phy.auckland.ac.nz> References: <1233015981.4180.39.camel@phy.auckland.ac.nz> <497E843D.6080005@ar.media.kyoto-u.ac.jp> <1233030545.20296.57.camel@phy.auckland.ac.nz> <497E8D7B.9080409@ar.media.kyoto-u.ac.jp> <1233032588.20296.82.camel@phy.auckland.ac.nz> Message-ID: <497EEA4E.9030009@molden.no> On 1/27/2009 6:03 AM, Jochen wrote: > BTW memmap arrays have > the same problem > if I create a memmap array and later do something like > a=a+1 > all later changes will not be written to the file. = is Python's rebinding operator. a = a + 1 rebinds a to a different object. 
As for ndarrays, I'd like to point out the difference between

a[:] = a + 1

and

a = a + 1

S.M.

From sturla at molden.no Tue Jan 27 06:37:45 2009
From: sturla at molden.no (Sturla Molden)
Date: Tue, 27 Jan 2009 12:37:45 +0100
Subject: [Numpy-discussion] numpy ndarray questions
In-Reply-To: <1233015981.4180.39.camel@phy.auckland.ac.nz>
References: <1233015981.4180.39.camel@phy.auckland.ac.nz>
Message-ID: <497EF209.1010403@molden.no>

On 1/27/2009 1:26 AM, Jochen wrote:

> a = fftw3.AlignedArray(1024,complex)
>
> a = a+1

= used this way is not assignment, it is name binding.

It is easy to use functions like fftw_malloc with NumPy:

import ctypes
import numpy

fftw_malloc = ctypes.cdll.fftw.fftw_malloc
fftw_malloc.argtypes = [ctypes.c_ulong,]
fftw_malloc.restype = ctypes.c_ulong

def aligned_array(N, dtype):
    d = dtype()
    address = fftw_malloc(N * d.nbytes) # restype = ctypes.c_ulong
    if (address = 0): raise MemoryError, 'fftw_malloc returned NULL'
    class Dummy(object): pass
    d = Dummy()
    d.__array_interface__ = {
        'data' : (address, False),
        'typestr' : dtype.str,
        'descr' : dtype.descr,
        'shape' : (N,),
        'strides' : None,   # None means C contiguous
        'version' : 3
    }
    return numpy.asarray(d)

If you have to check for a particular alignment before calling fftw, that is trivial as well:

def is_aligned(array, boundary):
    address = array.__array_interface__['data'][0]
    return not(address % boundary)

> there a way that I get a different object type? Or even better is there
> a way to prevent operations like a=a+1 or make them automatically
> in-place operations?

a = a + 1 # rebinds the name 'a' to another array

a[:] = a + 1 # fills a with the result of a + 1

This has to do with Python syntax, not NumPy per se. You cannot overload the behaviour of Python's name binding operator (=).

Sturla Molden

From sturla at molden.no Tue Jan 27 06:40:09 2009
From: sturla at molden.no (Sturla Molden)
Date: Tue, 27 Jan 2009 12:40:09 +0100
Subject: [Numpy-discussion] numpy ndarray questions
In-Reply-To: <497EF209.1010403@molden.no>
References: <1233015981.4180.39.camel@phy.auckland.ac.nz> <497EF209.1010403@molden.no>
Message-ID: <497EF299.6040705@molden.no>

On 1/27/2009 12:37 PM, Sturla Molden wrote:

> address = fftw_malloc(N * d.nbytes) # restype = ctypes.c_ulong
> if (address = 0):

if (address == 0):
    raise MemoryError, 'fftw_malloc returned NULL'

Sorry for the typo.

S.M.

From sturla at molden.no Tue Jan 27 07:07:21 2009
From: sturla at molden.no (Sturla Molden)
Date: Tue, 27 Jan 2009 13:07:21 +0100
Subject: [Numpy-discussion] numpy ndarray questions
In-Reply-To: <497EF209.1010403@molden.no>
References: <1233015981.4180.39.camel@phy.auckland.ac.nz> <497EF209.1010403@molden.no>
Message-ID: <497EF8F9.2070106@molden.no>

On 1/27/2009 12:37 PM, Sturla Molden wrote:

> It is easy to use functions like fftw_malloc with NumPy:

Besides this, if I were to write a wrapper for FFTW in Python, I would consider wrapping FFTW's Fortran interface with f2py. It is probably safer, as well as faster, than using ctypes. It would also allow the FFTW library to be linked statically to the Python extension, avoiding DLL hell.

S.M.
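The name-binding point is easy to verify by watching the buffer address directly; only the slice assignment writes into the memory that a plan (or a memmap) is holding on to. A small demonstration, using a plain ndarray rather than the fftw3 subclass:

import numpy as np

a = np.zeros(1024, dtype=complex)
before = a.__array_interface__['data'][0]

a[:] = a + 1    # in-place fill: same buffer, so a plan built on it stays valid
assert a.__array_interface__['data'][0] == before

a = a + 1       # name binding: 'a' now refers to a freshly allocated array
assert a.__array_interface__['data'][0] != before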
From klemm at phys.ethz.ch Tue Jan 27 07:24:38 2009
From: klemm at phys.ethz.ch (Hanno Klemm)
Date: Tue, 27 Jan 2009 13:24:38 +0100
Subject: [Numpy-discussion] Slicing and structured arrays question
Message-ID: 

Hi,

I have the following question, that I could not find an answer to in the
example list, or by googling:

I have a record array with dtype such as:

dtype([('times', '<f8'), ...])

I would now like to calculate the mean and median for each of the
properties 'props'. Is there a way to do this similarly to a conventional
array with

a[:,2:].mean(axis=0)

or do I have to use a loop over the names of the properties?

Hanno

From sturla at molden.no Tue Jan 27 08:16:48 2009
From: sturla at molden.no (Sturla Molden)
Date: Tue, 27 Jan 2009 14:16 +0100
Subject: [Numpy-discussion] numpy ndarray questions
In-Reply-To: <497EF209.1010403@molden.no>
References: <1233015981.4180.39.camel@phy.auckland.ac.nz> <497EF209.1010403@molden.no>
Message-ID: <497F0930.2000202@molden.no>

On 1/27/2009 12:37 PM, Sturla Molden wrote:

> import ctypes
> import numpy
>
> fftw_malloc = ctypes.cdll.fftw.fftw_malloc
> fftw_malloc.argtypes = [ctypes.c_ulong,]
> fftw_malloc.restype = ctypes.c_ulong
>
> def aligned_array(N, dtype):
>     d = dtype()
>     address = fftw_malloc(N * d.nbytes) # restype = ctypes.c_ulong
>     if (address = 0): raise MemoryError, 'fftw_malloc returned NULL'
>     class Dummy(object): pass
>     d = Dummy()
>     d.__array_interface__ = {
>         'data' : (address, False),
>         'typestr' : numpy.dtype(dtype).str,
>         'descr' : numpy.dtype(dtype).descr,
>         'shape' : (N,),
>         'strides' : None,
>         'version' : 3
>         }
>     return numpy.asarray(d)

Or if you just want to use numpy, aligning to a 16 byte boundary
can be done like this:

def aligned_array(N, dtype):
    d = dtype()
    tmp = numpy.empty(N * d.nbytes + 16, dtype=numpy.uint8)
    address = tmp.__array_interface__['data'][0]
    offset = (16 - address % 16) % 16
    return tmp[offset:offset+N*d.nbytes].view(dtype=dtype)

S.M.

From hanni.ali at gmail.com Tue Jan 27 09:45:51 2009
From: hanni.ali at gmail.com (Hanni Ali)
Date: Tue, 27 Jan 2009 14:45:51 +0000
Subject: [Numpy-discussion] PyArray_Zeros
Message-ID: <789d27b10901270645j6992b680v8beca876b3a761a2@mail.gmail.com>

Hi,

I have been having trouble with the PyArray_Zeros/PyArray_ZEROS
functions. I cannot seem to create an array using these functions.

resultArray = PyArray_ZEROS(otherArray->nd, otherArray->dimensions, NPY_DOUBLE, 0);

I would have thought this would have created an array the same shape as
the otherArray, just filled with zeros... But I seem to get an error.

Am I doing something obviously wrong?

Hanni

From oliphant at enthought.com Tue Jan 27 10:06:52 2009
From: oliphant at enthought.com (Travis Oliphant)
Date: Tue, 27 Jan 2009 09:06:52 -0600
Subject: [Numpy-discussion] Slicing and structured arrays question
In-Reply-To: 
References: 
Message-ID: <497F230C.8030401@enthought.com>

Hanno Klemm wrote:
> Hi,
> I have the following question, that I could not find an answer to in the
> example list, or by googling:
>
> I have a record array with dtype such as:
>
> dtype([('times', '<f8'), ...])
>
> I would now like to calculate the mean and median for each of the
> properties 'props'. Is there a way to do this similarly to a conventional
> array with
>
> a[:,2:].mean(axis=0)
>
> or do I have to use a loop over the names of the properties?
You must first view the array as a float array in order to calculate the
mean. This should work:

b = a.view(float)
b.shape = (-1,6)
b[:,2:].mean(axis=0)

-Travis

From cycomanic at gmail.com Tue Jan 27 14:19:39 2009
From: cycomanic at gmail.com (Jochen)
Date: Wed, 28 Jan 2009 08:19:39 +1300
Subject: [Numpy-discussion] numpy ndarray questions
In-Reply-To: <497EF209.1010403@molden.no>
References: <1233015981.4180.39.camel@phy.auckland.ac.nz> <497EF209.1010403@molden.no>
Message-ID: <1233083979.20296.122.camel@phy.auckland.ac.nz>

On Tue, 2009-01-27 at 12:37 +0100, Sturla Molden wrote:
> On 1/27/2009 1:26 AM, Jochen wrote:
>
> > a = fftw3.AlignedArray(1024,complex)
> >
> > a = a+1
>
> = used this way is not assignment, it is name binding.
>
> It is easy to use functions like fftw_malloc with NumPy:
>
> import ctypes
> import numpy
>
> fftw_malloc = ctypes.cdll.fftw.fftw_malloc
> fftw_malloc.argtypes = [ctypes.c_ulong,]
> fftw_malloc.restype = ctypes.c_ulong
>
> def aligned_array(N, dtype):
>     d = dtype()
>     address = fftw_malloc(N * d.nbytes) # restype = ctypes.c_ulong
>     if (address = 0): raise MemoryError, 'fftw_malloc returned NULL'
>     class Dummy(object): pass
>     d = Dummy()
>     d.__array_interface__ = {
>         'data' : (address, False),
>         'typestr' : numpy.dtype(dtype).str,
>         'descr' : numpy.dtype(dtype).descr,
>         'shape' : (N,),
>         'strides' : None,
>         'version' : 3
>         }
>     return numpy.asarray(d)

I actually do it slightly differently, because I want to free the memory
using fftw_free. So I'm subclassing ndarray, and in the __new__ function
of the subclass I create a buffer object from fftw_malloc using
PyBuffer_FromReadWriteMemory and pass that to ndarray.__new__. In
__del__ I check if the .base is a buffer object and do an fftw_free.

> If you have to check for a particular alignment before calling fftw,
> that is trivial as well:
>
> def is_aligned(array, boundary):
>     address = array.__array_interface__['data'][0]
>     return not(address % boundary)

Yes, I knew that. BTW, do you know of any systems which need alignment
to boundaries other than 16 bytes for SIMD operations?

> > there a way that I get a different object type? Or even better is there
> > a way to prevent operations like a=a+1 or make them automatically
> > in-place operations?
>
> a = a + 1    # rebinds the name 'a' to another array
>
> a[:] = a + 1 # fills a with the result of a + 1

I knew about this one, this is what I have been doing.

> This has to do with Python syntax, not NumPy per se. You cannot overload
> the behaviour of Python's name binding operator (=).

Yes, I actually thought about it some more yesterday when going home and
realised that this would really not be possible. I guess I just have to
document it clearly so that people don't run into it.

Cheers
Jochen
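A rough sketch of the subclass approach Jochen describes (Python 2 era;
the library loading and all names here are assumptions for illustration,
not his actual code):

import ctypes
import numpy as np

fftw = ctypes.CDLL('libfftw3.so')              # assumed library name
fftw.fftw_malloc.restype = ctypes.c_void_p
fftw.fftw_malloc.argtypes = [ctypes.c_size_t]
fftw.fftw_free.argtypes = [ctypes.c_void_p]

# Python 2 C-API call that wraps raw memory in a buffer object
_frombuf = ctypes.pythonapi.PyBuffer_FromReadWriteMemory
_frombuf.restype = ctypes.py_object
_frombuf.argtypes = [ctypes.c_void_p, ctypes.c_ssize_t]

class FFTWAlignedArray(np.ndarray):
    def __new__(cls, shape, dtype=complex):
        dtype = np.dtype(dtype)
        nbytes = int(np.prod(shape)) * dtype.itemsize
        address = fftw.fftw_malloc(nbytes)
        if not address:
            raise MemoryError('fftw_malloc returned NULL')
        obj = np.ndarray.__new__(cls, shape, dtype=dtype,
                                 buffer=_frombuf(address, nbytes))
        obj._fftw_address = address
        return obj

    def __del__(self):
        # only the instance whose .base is the raw buffer owns the fftw
        # allocation; views have another ndarray as their .base
        if isinstance(self.base, buffer):
            fftw.fftw_free(self._fftw_address)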
From cycomanic at gmail.com Tue Jan 27 14:23:44 2009
From: cycomanic at gmail.com (Jochen)
Date: Wed, 28 Jan 2009 08:23:44 +1300
Subject: [Numpy-discussion] numpy ndarray questions
In-Reply-To: <497F0930.2000202@molden.no>
References: <1233015981.4180.39.camel@phy.auckland.ac.nz> <497EF209.1010403@molden.no> <497F0930.2000202@molden.no>
Message-ID: <1233084224.20296.125.camel@phy.auckland.ac.nz>

On Tue, 2009-01-27 at 14:16 +0100, Sturla Molden wrote:
> Or if you just want to use numpy, aligning to a 16 byte boundary
> can be done like this:
>
> def aligned_array(N, dtype):
>     d = dtype()
>     tmp = numpy.empty(N * d.nbytes + 16, dtype=numpy.uint8)
>     address = tmp.__array_interface__['data'][0]
>     offset = (16 - address % 16) % 16
>     return tmp[offset:offset+N*d.nbytes].view(dtype=dtype)
>
> S.M.

Ah, I didn't think about doing it in python, cool thanks.

From rmay31 at gmail.com Tue Jan 27 14:37:51 2009
From: rmay31 at gmail.com (Ryan May)
Date: Tue, 27 Jan 2009 13:37:51 -0600
Subject: [Numpy-discussion] recfunctions.stack_arrays
Message-ID: <497F628F.1020604@gmail.com>

Pierre (or anyone else who cares to chime in),

I'm using stack_arrays to combine data from two different files into a
single array. In one of these files, the data from one entire record comes
back missing, which, thanks to your recent change, ends up having a
boolean dtype. There is actual data for this same field in the 2nd file,
so it ends up having the dtype of float64. When I try to combine the two
arrays, I end up with the following traceback:

data = stack_arrays((old_data, data))
  File "/home/rmay/.local/lib64/python2.5/site-packages/metpy/cbook.py", line 260, in stack_arrays
    output = ma.masked_all((np.sum(nrecords),), newdescr)
  File "/home/rmay/.local/lib64/python2.5/site-packages/numpy/ma/extras.py", line 79, in masked_all
    a = masked_array(np.empty(shape, dtype),
ValueError: two fields with the same name

Which is unsurprising. Do you think there is any reasonable way to get
stack_arrays() to find a common dtype for fields with the same name? Or
another suggestion on how to approach this? If you think coercing one/both
of the fields to a common dtype is the way to go, just point me to a
function that could figure out the dtype and I'll try to put together a
patch.

Thanks,
Ryan
P.S. Thanks so much for your work on putting those utility functions in
recfunctions.py. It makes it so much easier to have these functions
available in the library itself rather than needing to reinvent the wheel
over and over.

--
Ryan May
Graduate Research Assistant
School of Meteorology
University of Oklahoma

From pgmdevlist at gmail.com Tue Jan 27 15:03:39 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Tue, 27 Jan 2009 15:03:39 -0500
Subject: [Numpy-discussion] recfunctions.stack_arrays
In-Reply-To: <497F628F.1020604@gmail.com>
References: <497F628F.1020604@gmail.com>
Message-ID: <4BC6B814-42C8-4695-A14F-E14A02F504F4@gmail.com>

[Some background: we're talking about numpy.lib.recfunctions, a set of
functions to manipulate structured arrays]

Ryan,
If the two files have the same structure, you can use that fact and
specify the dtype of the output directly with the dtype parameter of
mafromtxt. That way, you're sure that the two arrays will have the same
dtype. If you don't know the structure beforehand, you could try to load
one array and use its dtype as input of mafromtxt to load the second one.

Now, we could also try to modify stack_arrays so that it would take the
largest dtype when several fields have the same name. I'm not completely
satisfied by this approach, as it makes dtype conversions under the hood.
Maybe we could provide the functionality as an option (w/ a
forced_conversion boolean input parameter)?

I'm a bit surprised by the error message you get. If I try:

>>> a = ma.array([(1,2,3)], mask=[(0,1,0)], dtype=[('a',int), ('b',bool), ('c',float)])
>>> b = ma.array([(4, 5, 6)], dtype=[('a', int), ('b', float), ('c', float)])
>>> test = np.stack_arrays((a, b))

I get a TypeError instead (the field 'b' doesn't have the same type in a
and b). Now, I get the 'two fields w/ the same name' when I use
np.merge_arrays (with the flatten option). Could you send a small example?

> P.S. Thanks so much for your work on putting those utility functions in
> recfunctions.py. It makes it so much easier to have these functions
> available in the library itself rather than needing to reinvent the
> wheel over and over.

Indeed. Note that most of the job had been done by John Hunter and the
matplotlib developers in their matplotlib.mlab module, so you should thank
them and not me. I just cleaned up some of the functions.

From nwagner at iam.uni-stuttgart.de Tue Jan 27 15:11:56 2009
From: nwagner at iam.uni-stuttgart.de (Nils Wagner)
Date: Tue, 27 Jan 2009 21:11:56 +0100
Subject: [Numpy-discussion] make latex in numpy/doc failed
Message-ID: 

Hi all,

a make latex in numpy/doc failed with

...
Intersphinx hit: PyObject http://docs.python.org/dev/c-api/structures.html
writing... Sphinx error:
too many nesting section levels for LaTeX, at heading: numpy.ma.MaskedArray.__lt__
make: *** [latex] Fehler 1

I am using sphinx v0.5.1
BTW, make html works fine here.
Nils

From sturla at molden.no Tue Jan 27 15:26:48 2009
From: sturla at molden.no (Sturla Molden)
Date: Tue, 27 Jan 2009 21:26:48 +0100 (CET)
Subject: [Numpy-discussion] numpy ndarray questions
In-Reply-To: <1233084224.20296.125.camel@phy.auckland.ac.nz>
References: <1233015981.4180.39.camel@phy.auckland.ac.nz> <497EF209.1010403@molden.no> <497F0930.2000202@molden.no> <1233084224.20296.125.camel@phy.auckland.ac.nz>
Message-ID: <3d8df38227c4312367d440c39f67c0de.squirrel@webmail.uio.no>

> On Tue, 2009-01-27 at 14:16 +0100, Sturla Molden wrote:
>> def aligned_array(N, dtype):
>>     d = dtype()
>>     tmp = numpy.empty(N * d.nbytes + 16, dtype=numpy.uint8)
>>     address = tmp.__array_interface__['data'][0]
>>     offset = (16 - address % 16) % 16
>>     return tmp[offset:offset+N*d.nbytes].view(dtype=dtype)

> Ah, I didn't think about doing it in python, cool thanks.

Doing it from Python means you don't have to worry about manually
deallocating the array afterwards.

It seems the issue of 16-byte alignment has to do with efficient data
alignment for SIMD instructions (SSE, MMX, etc.). So this is not just an
FFTW issue.

I would just put a check for 16-byte alignment in the wrapper, and raise
an exception (e.g. MemoryError) if the array is not aligned properly.
Raising an exception will inform the user of the problem. I would not
attempt to make a local copy if the array is erroneously aligned. That is
my personal preference.

Sturla Molden

From rmay31 at gmail.com Tue Jan 27 16:23:35 2009
From: rmay31 at gmail.com (Ryan May)
Date: Tue, 27 Jan 2009 15:23:35 -0600
Subject: [Numpy-discussion] recfunctions.stack_arrays
In-Reply-To: <4BC6B814-42C8-4695-A14F-E14A02F504F4@gmail.com>
References: <497F628F.1020604@gmail.com> <4BC6B814-42C8-4695-A14F-E14A02F504F4@gmail.com>
Message-ID: <497F7B57.3020709@gmail.com>

Pierre GM wrote:
> [Some background: we're talking about numpy.lib.recfunctions, a set of
> functions to manipulate structured arrays]
>
> Ryan,
> If the two files have the same structure, you can use that fact and
> specify the dtype of the output directly with the dtype parameter of
> mafromtxt. That way, you're sure that the two arrays will have the
> same dtype. If you don't know the structure beforehand, you could try
> to load one array and use its dtype as input of mafromtxt to load the
> second one.

I could force the dtype. However, since the flexibility is there in
mafromtxt, I'd like to avoid hard-coding the dtype, so I don't have to
worry about updating the code if the file format ever changes (this parses
live data).

> Now, we could also try to modify stack_arrays so that it would take
> the largest dtype when several fields have the same name. I'm not
> completely satisfied by this approach, as it makes dtype conversions
> under the hood. Maybe we could provide the functionality as an option
> (w/ a forced_conversion boolean input parameter)?

I definitely wouldn't advocate magic by default, but I think it would be
nice to be able to get the functionality if one wanted to. There is one
problem I noticed, however. I found common_type and lib.mintypecode, but
both raise errors when trying to find a dtype to match both bool and
float. I don't know if there's another function somewhere that would work
for what I want.

> I'm a bit surprised by the error message you get.
> If I try:
>
> >>> a = ma.array([(1,2,3)], mask=[(0,1,0)], dtype=[('a',int), ('b',bool), ('c',float)])
> >>> b = ma.array([(4, 5, 6)], dtype=[('a', int), ('b', float), ('c', float)])
> >>> test = np.stack_arrays((a, b))
>
> I get a TypeError instead (the field 'b' doesn't have the same type in a
> and b). Now, I get the 'two fields w/ the same name' when I use
> np.merge_arrays (with the flatten option). Could you send a small example?

Apparently, I get my error as a result of my use of titles in the dtype to
store an alternate name for the field. (If you're not familiar with
titles, they're nice because you can get fields by either name, so for the
following example, a['a'] and a['A'] both return array([1]).) The
following version of your case gives me the ValueError:

>>> from numpy.lib.recfunctions import stack_arrays
>>> a = ma.array([(1,2,3)], mask=[(0,1,0)], dtype=[(('a','A'),int), (('b','B'),bool), (('c','C'),float)])
>>> b = ma.array([(4,5,6)], dtype=[(('a','A'),int), (('b','B'),float), (('c','C'),float)])
>>> stack_arrays((a,b))
ValueError: two fields with the same name

As a side question, do you have some local mods to your numpy SVN so that
some of the functions in recfunctions are available in numpy's top level?
On mine, I can't get to them except by importing them from
numpy.lib.recfunctions. I don't see any mention of recfunctions in
lib/__init__.py.

Ryan

--
Ryan May
Graduate Research Assistant
School of Meteorology
University of Oklahoma

From pgmdevlist at gmail.com Tue Jan 27 17:00:42 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Tue, 27 Jan 2009 17:00:42 -0500
Subject: [Numpy-discussion] recfunctions.stack_arrays
In-Reply-To: <497F7B57.3020709@gmail.com>
References: <497F628F.1020604@gmail.com> <4BC6B814-42C8-4695-A14F-E14A02F504F4@gmail.com> <497F7B57.3020709@gmail.com>
Message-ID: <290D06CA-57FD-46B2-B653-82822620EB3E@gmail.com>

On Jan 27, 2009, at 4:23 PM, Ryan May wrote:

> I definitely wouldn't advocate magic by default, but I think it would
> be nice to be able to get the functionality if one wanted to.

OK. Put on the TODO list.

> There is one problem I noticed, however. I found common_type and
> lib.mintypecode, but both raise errors when trying to find a dtype to
> match both bool and float. I don't know if there's another function
> somewhere that would work for what I want.

I'm not familiar with these functions, I'll check that.

> Apparently, I get my error as a result of my use of titles in the
> dtype to store an alternate name for the field.

Ah OK. You found a bug. There's a frustrating feature of dtypes:
dtype.names doesn't always match [_[0] for _ in dtype.descr].

> As a side question, do you have some local mods to your numpy SVN so
> that some of the functions in recfunctions are available in numpy's
> top level?

Probably. I used the develop option of setuptools to install numpy in a
virtual environment.

> On mine, I can't get to them except by importing them from
> numpy.lib.recfunctions. I don't see any mention of recfunctions in
> lib/__init__.py.

Well, till some problems are ironed out, I'm not really in favor of
advertising them too much...
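To make that dtype quirk concrete, a small sketch reusing the titled
fields from Ryan's example:

>>> import numpy as np
>>> dt = np.dtype([(('a','A'), int), (('b','B'), bool), (('c','C'), float)])
>>> dt.names
('A', 'B', 'C')
>>> [f[0] for f in dt.descr]
[('a', 'A'), ('b', 'B'), ('c', 'C')]

With titles present, the first element of each descr entry is the
(title, name) pair rather than the plain field name, which is exactly the
mismatch Pierre mentions.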
From mcolonno at gmail.com Tue Jan 27 20:05:07 2009 From: mcolonno at gmail.com (Michael Colonno) Date: Tue, 27 Jan 2009 17:05:07 -0800 Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers Message-ID: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com> Hi ~ I'm trying to build numpy (hopefully eventually scipy with the same setup) with the Intel compilers (and Intel MKL) on the WinXP 64-bit platform. Finding / linking to the Intel MKL seems to be successful (see below) but I have an issue with the settings defined somewhere in the various setup scripts (can't find where). Per the output below, the Intel compilers on Windows are looking for ".obj" object files rather than the Linux-style ".o" files. I'd also like to get rid of the /L and -L flags (no longer supported in Intel C++ v. 11.0 it seems) but this just throws a warning and does not seem to cause any problems. Can anyone point me to the python file(s) I need to edit to modify the .o object file setting to .obj? (The file "_configtest.obj" is created.) Once operational, I'll pass along all of my config info for anyone else building in this environment. Thanks! ~Mike C. Output from build: >>python setup.py config --compiler=intel --fcompiler=intelem install Running from numpy source directory. Forcing DISTUTILS_USE_SDK=1 F2PY Version 2_5972 blas_opt_info: blas_mkl_info: FOUND: libraries = ['mkl_em64t', 'mkl_dll'] library_dirs = ['C:\\Program Files (x86)\\Intel\\Compiler\\11.0\\ 061\\cpp\\m kl\\em64t\\lib'] define_macros = [('SCIPY_MKL_H', None)] include_dirs = ['C:\\Program Files (x86)\\Intel\\Compiler\\11.0\\061\\cpp\\m kl\\include'] FOUND: libraries = ['mkl_em64t', 'mkl_dll'] library_dirs = ['C:\\Program Files (x86)\\Intel\\Compiler\\11.0\\061\\cpp\\m kl\\em64t\\lib'] define_macros = [('SCIPY_MKL_H', None)] include_dirs = ['C:\\Program Files (x86)\\Intel\\Compiler\\11.0\\061\\cpp\\m kl\\include'] lapack_opt_info: lapack_mkl_info: mkl_info: FOUND: libraries = ['mkl_em64t', 'mkl_dll'] library_dirs = ['C:\\Program Files (x86)\\Intel\\Compiler\\11.0\\061\\cpp\\m kl\\em64t\\lib'] define_macros = [('SCIPY_MKL_H', None)] include_dirs = ['C:\\Program Files (x86)\\Intel\\Compiler\\11.0\\061\\cpp\\m kl\\include'] FOUND: libraries = ['mkl_lapack', 'mkl_em64t', 'mkl_dll'] library_dirs = ['C:\\Program Files (x86)\\Intel\\Compiler\\11.0\\061\\cpp\\m kl\\em64t\\lib'] define_macros = [('SCIPY_MKL_H', None)] include_dirs = ['C:\\Program Files (x86)\\Intel\\Compiler\\11.0\\061\\cpp\\m kl\\include'] FOUND: libraries = ['mkl_lapack', 'mkl_em64t', 'mkl_dll'] library_dirs = ['C:\\Program Files (x86)\\Intel\\Compiler\\11.0\\061\\cpp\\m kl\\em64t\\lib'] define_macros = [('SCIPY_MKL_H', None)] include_dirs = ['C:\\Program Files (x86)\\Intel\\Compiler\\11.0\\061\\cpp\\m kl\\include'] running config running install running build running config_cc unifing config_cc, config, build_clib, build_ext, build commands --compiler opti ons running config_fc unifing config_fc, config, build_clib, build_ext, build commands --fcompiler opt ions running build_src building py_modules sources creating build creating build\src.win32-2.5 creating build\src.win32-2.5\numpy creating build\src.win32-2.5\numpy\distutils building extension "numpy.core.multiarray" sources creating build\src.win32-2.5\numpy\core Generating build\src.win32-2.5\numpy\core\include/numpy\config.h Could not locate executable icc Could not locate executable ecc Ignoring "MSVCCompiler instance has no attribute '_MSVCCompiler__root'" (I think it is msvccompiler.py bug) customize 
IntelEM64TFCompiler Found executable C:\Program Files (x86)\Intel\Compiler\11.0\061\fortran\Bin\inte l64\ifort.exe Found executable C:\Program Files (x86)\Intel\Compiler\11.0\061\fortran\Bin\inte l64\ifort.exe C compiler: icl compile options: '-IC:\Python25\include -Inumpy\core\src -Inumpy\core\include -I C:\Python25\include -IC:\Python25\PC -c' icl: _configtest.c Found executable C:\Program Files (x86)\Intel\Compiler\11.0\061\cpp\Bin\intel64\ icl.exe icl _configtest.o -LC:\Python25\lib -LC:\ -LC:\Python25\libs -o _configtest Intel(R) C++ Intel(R) 64 Compiler Professional for applications running on Intel (R) 64, Version 11.0 Build 20080930 Package ID: w_cproc_p_11.0.061 Copyright (C) 1985-2008 Intel Corporation. All rights reserved. icl: command line warning #10161: unrecognized source type '_configtest.o'; obje ct file assumed icl: command line warning #10006: ignoring unknown option '/LC:\Python25\lib' icl: command line warning #10006: ignoring unknown option '/LC:\' icl: command line warning #10006: ignoring unknown option '/LC:\Python25\libs' ipo: warning #11009: file format not recognized for _configtest.o Microsoft (R) Incremental Linker Version 9.00.30729.01 Copyright (C) Microsoft Corporation. All rights reserved. -out:_configtest.exe _configtest.o LINK : fatal error LNK1181: cannot open input file '_configtest.o' failure. removing: _configtest.c _configtest.o Traceback (most recent call last): File "setup.py", line 96, in setup_package() File "setup.py", line 89, in setup_package configuration=configuration ) File "C:\Documents and Settings\mike\Desktop\Software\Python\numpy-1.2.1\numpy \distutils\core.py", line 184, in setup return old_setup(**new_attr) File "C:\Python25\lib\distutils\core.py", line 151, in setup dist.run_commands() File "C:\Python25\lib\distutils\dist.py", line 974, in run_commands self.run_command(cmd) File "C:\Python25\lib\distutils\dist.py", line 994, in run_command cmd_obj.run() File "C:\Documents and Settings\mike\Desktop\Software\Python\numpy-1.2.1\numpy \distutils\command\install.py", line 49, in run r = old_install.run(self) File "C:\Python25\lib\distutils\command\install.py", line 506, in run self.run_command('build') File "C:\Python25\lib\distutils\cmd.py", line 333, in run_command self.distribution.run_command(command) File "C:\Python25\lib\distutils\dist.py", line 994, in run_command cmd_obj.run() File "C:\Documents and Settings\mike\Desktop\Software\Python\numpy-1.2.1\numpy \distutils\command\build.py", line 37, in run old_build.run(self) File "C:\Python25\lib\distutils\command\build.py", line 112, in run self.run_command(cmd_name) File "C:\Python25\lib\distutils\cmd.py", line 333, in run_command self.distribution.run_command(command) File "C:\Python25\lib\distutils\dist.py", line 994, in run_command cmd_obj.run() File "C:\Documents and Settings\mike\Desktop\Software\Python\numpy-1.2.1\numpy \distutils\command\build_src.py", line 130, in run self.build_sources() File "C:\Documents and Settings\mike\Desktop\Software\Python\numpy-1.2.1\numpy \distutils\command\build_src.py", line 147, in build_sources self.build_extension_sources(ext) File "C:\Documents and Settings\mike\Desktop\Software\Python\numpy-1.2.1\numpy \distutils\command\build_src.py", line 250, in build_extension_sources sources = self.generate_sources(sources, ext) File "C:\Documents and Settings\mike\Desktop\Software\Python\numpy-1.2.1\numpy \distutils\command\build_src.py", line 307, in generate_sources source = func(extension, build_dir) File "numpy\core\setup.py", line 87, in 
generate_config_h
    raise SystemError,"Failed to test configuration. "\
SystemError: Failed to test configuration. See previous error messages for
more information.

From david at ar.media.kyoto-u.ac.jp Tue Jan 27 21:39:21 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Wed, 28 Jan 2009 11:39:21 +0900
Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers
In-Reply-To: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com>
References: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com>
Message-ID: <497FC559.4000801@ar.media.kyoto-u.ac.jp>

Michael Colonno wrote:
> Hi ~
>
> I'm trying to build numpy (hopefully eventually scipy with the same
> setup) with the Intel compilers (and Intel MKL) on the WinXP 64-bit
> platform. Finding / linking to the Intel MKL seems to be successful
> (see below)

Unfortunately, at this stage, it does not say much about linking:
distutils looks for files, and does not do any sanity check beyond that.

> but I have an issue with the settings defined somewhere in the various
> setup scripts (can't find where). Per the output below, the Intel
> compilers on Windows are looking for ".obj" object files rather than
> the Linux-style ".o" files.

I think the problem is simply that intel support in numpy for the C
compiler is limited to unix. At least, a quick look at the sources did
not give much information about windows support: --compiler=intel does
call for the unix version (icc, and this is called first as you can see
in your log). I am actually puzzled: where is icl.exe coming from? grep
icl gives nothing in numpy, and nothing in python - did you by any chance
build python itself with the Intel compiler?

cheers,

David

From mcolonno at gmail.com Tue Jan 27 22:31:16 2009
From: mcolonno at gmail.com (Michael Colonno)
Date: Tue, 27 Jan 2009 19:31:16 -0800
Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers
In-Reply-To: <497FC559.4000801@ar.media.kyoto-u.ac.jp>
References: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com> <497FC559.4000801@ar.media.kyoto-u.ac.jp>
Message-ID: <9cca9e840901271931s1706de0egd28382e4125644a@mail.gmail.com>

Thanks for your response. I manually edited one of the python files
(ccompiler.py I think) to change icc.exe to icl.exe. (This is a trick I
used to use to get F2PY to compile on Windows platforms.) Since icl is a
drop-in replacement for the visual studio compiler / linker, I'd like to
edit the python files configuring this (msvc) but I could not find
anything(?) If you could point me towards the config file(s) for the
visual studio compiler (which I'm assuming are configured for the Windows
file extensions already) I could likely make some headway.

Thanks,
~Mike C.
From david at ar.media.kyoto-u.ac.jp Wed Jan 28 04:06:00 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Wed, 28 Jan 2009 18:06:00 +0900
Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers
In-Reply-To: <9cca9e840901271931s1706de0egd28382e4125644a@mail.gmail.com>
References: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com> <497FC559.4000801@ar.media.kyoto-u.ac.jp> <9cca9e840901271931s1706de0egd28382e4125644a@mail.gmail.com>
Message-ID: <49801FF8.8040805@ar.media.kyoto-u.ac.jp>

Michael Colonno wrote:
> Thanks for your response. I manually edited one of the python files
> (ccompiler.py I think) to change icc.exe to icl.exe. (This is a trick
> I used to use to get F2PY to compile on Windows platforms.) Since icl
> is a drop-in replacement for the visual studio compiler / linker, I'd
> like to edit the python files configuring this (msvc) but I could not
> find anything(?) If you could point me towards the config file(s) for
> the visual studio compiler (which I'm assuming are configured for the
> Windows file extensions already) I could likely make some headway.

Unfortunately, the code for our building process is difficult to grasp -
there is a lot of magic. Everything is in numpy/distutils. Basically, you
need to create a new compiler, a bit like intelccompiler.py, but for
Windows. I unfortunately can't help you more ATM, since I don't know the
intel compiler on Windows.

cheers,

David
From mcolonno at gmail.com Wed Jan 28 11:18:47 2009
From: mcolonno at gmail.com (Michael Colonno)
Date: Wed, 28 Jan 2009 08:18:47 -0800
Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers
In-Reply-To: <49801FF8.8040805@ar.media.kyoto-u.ac.jp>
References: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com> <497FC559.4000801@ar.media.kyoto-u.ac.jp> <9cca9e840901271931s1706de0egd28382e4125644a@mail.gmail.com> <49801FF8.8040805@ar.media.kyoto-u.ac.jp>
Message-ID: <9cca9e840901280818l2106e710q16599bf1423c3caf@mail.gmail.com>

I think this is doable; thankfully the Intel compilers on Windows and
Linux are very similar in behavior. The exact same build scripts *should*
work fine provided the file extensions (.o --> .obj) and flags (-L, etc.)
are modified. In terms of syntax this should be an easy thing to do (it
was with the free-standing F2PY), but I will need some help navigating
through the magic you refer to. I will put some effort into this and
write back if I hit a roadblock.

As an aside: how were the Windows 32-bit installers created? Is it
possible to recreate this process changing the target arch --> x64?

Thanks again,
~Mike C.

On Wed, Jan 28, 2009 at 1:06 AM, David Cournapeau
<david at ar.media.kyoto-u.ac.jp> wrote:
> Everything is in numpy/distutils. Basically, you need to create a new
> compiler, a bit like intelccompiler.py, but for Windows.
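For reference, a rough sketch of what such a Windows-side Intel compiler
class might look like (the class name and options are illustrative
guesses, not actual numpy.distutils code):

from distutils.msvccompiler import MSVCCompiler

class IntelCCompilerW(MSVCCompiler):
    """Hypothetical Intel C compiler on Windows: keep the MSVC
    conventions (.obj objects, /-style flags) but invoke icl.exe."""
    compiler_type = 'intelw'

    def initialize(self):
        # let MSVCCompiler discover the VS environment, then swap compilers
        MSVCCompiler.initialize(self)
        self.cc = 'icl.exe'
        self.compile_options = ['/nologo', '/O2', '/MD', '/W3']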
From timmichelsen at gmx-topmail.de Wed Jan 28 15:56:04 2009
From: timmichelsen at gmx-topmail.de (Timmie)
Date: Wed, 28 Jan 2009 20:56:04 +0000 (UTC)
Subject: [Numpy-discussion] optimise operation in array with datetime objects
Message-ID: 

Hello,
I have an array of datetime objects. What is the most efficient way of
creating a new array with only the hours or minutes out of it?

Here is an example:

### imports
import numpy as np
import datetime as dt

### create some data
d = dt.datetime.now()
dates_li = []
count = 0
for i in range(0, 24):
    count = count + 1
    fact = count * 3600
    date_new = d + dt.timedelta(0, fact)
    print date_new
    dates_li.append(date_new)

### the array with datetime objects
dates_array = np.array(dates_li)

### this is the loop I would like to optimize:
### looping over arrays is considered inefficient.
### what could be a better way?
hours_array = dates_array.copy()
for i in range(0, dates_array.size):
    hours_array[i] = dates_array[i].hour
hours_array = hours_array.astype('int')
####

Thanks in advance for any hints!

Kind regards,
Timmie

From pgmdevlist at gmail.com Wed Jan 28 16:25:19 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Wed, 28 Jan 2009 16:25:19 -0500
Subject: [Numpy-discussion] optimise operation in array with datetime objects
In-Reply-To: 
References: 
Message-ID: <1DE6106D-8096-4F94-A9C1-39979D408039@gmail.com>

On Jan 28, 2009, at 3:56 PM, Timmie wrote:

> ### this is the loop I would like to optimize:
> ### looping over arrays is considered inefficient.
> ### what could be a better way?
> hours_array = dates_array.copy()
> for i in range(0, dates_array.size):
>     hours_array[i] = dates_array[i].hour

You could try:
np.fromiter((_.hour for _ in dates_li), dtype=np.int)
or
np.array([_.hour for _ in dates_li], dtype=np.int)

From timmichelsen at gmx-topmail.de Wed Jan 28 17:43:52 2009
From: timmichelsen at gmx-topmail.de (Timmie)
Date: Wed, 28 Jan 2009 22:43:52 +0000 (UTC)
Subject: [Numpy-discussion] optimise operation in array with datetime objects
References: <1DE6106D-8096-4F94-A9C1-39979D408039@gmail.com>
Message-ID: 

Thanks!

>> ### this is the loop I would like to optimize:
>> ### looping over arrays is considered inefficient.
>> ### what could be a better way?
>> hours_array = dates_array.copy()
>> for i in range(0, dates_array.size):
>>     hours_array[i] = dates_array[i].hour
>
> You could try:
> np.fromiter((_.hour for _ in dates_li), dtype=np.int)
> or
> np.array([_.hour for _ in dates_li], dtype=np.int)

I used dates_li only for the preparation of example data.

So let's suppose I have the array "dates_array" returned from a function.
How can the last part be improved:

hours_array = dates_array.copy()
for i in range(0, dates_array.size):
    hours_array[i] = dates_array[i].hour

Or is such a loop acceptable from the point of calculation efficiency?

Thanks and greetings,
Timmie

From pgmdevlist at gmail.com Wed Jan 28 17:49:53 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Wed, 28 Jan 2009 17:49:53 -0500
Subject: [Numpy-discussion] optimise operation in array with datetime objects
In-Reply-To: 
References: <1DE6106D-8096-4F94-A9C1-39979D408039@gmail.com>
Message-ID: 

On Jan 28, 2009, at 5:43 PM, Timmie wrote:

>> You could try:
>> np.fromiter((_.hour for _ in dates_li), dtype=np.int)
>> or
>> np.array([_.hour for _ in dates_li], dtype=np.int)
>
> I used dates_li only for the preparation of example data.
>
> So let's suppose I have the array "dates_array" returned from a function.
Just use dates_array instead of dates_li, then.

> hours_array = dates_array.copy()
> for i in range(0, dates_array.size):
>     hours_array[i] = dates_array[i].hour

* What's the point of making a copy of dates_array? dates_array is an
ndarray of objects, right? And you want to take the hours, so you should
have an ndarray of integers for hours_array.
* The issue I have with this part is that you have several calls to
__getitem__ at each iteration. It might be faster to create hours_array
as a block:
hours_array = np.array([_.hour for _ in dates_array], dtype=np.int)

From cournape at gmail.com Wed Jan 28 19:36:08 2009
From: cournape at gmail.com (David Cournapeau)
Date: Thu, 29 Jan 2009 09:36:08 +0900
Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers
In-Reply-To: <9cca9e840901280818l2106e710q16599bf1423c3caf@mail.gmail.com>
References: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com> <497FC559.4000801@ar.media.kyoto-u.ac.jp> <9cca9e840901271931s1706de0egd28382e4125644a@mail.gmail.com> <49801FF8.8040805@ar.media.kyoto-u.ac.jp> <9cca9e840901280818l2106e710q16599bf1423c3caf@mail.gmail.com>
Message-ID: <5b8d13220901281636y2fe1b55dt3cb59dd46b43b1d0@mail.gmail.com>

On Thu, Jan 29, 2009 at 1:18 AM, Michael Colonno wrote:
> I think this is doable; thankfully the Intel compilers on Windows and
> Linux are very similar in behavior.

The problem is that distutils does not abstract this kind of thing: you
have a CCompiler class, and a subclass Unix C Compiler, and then Intel
Compiler. OTOH, the MS compiler is its own class which inherits from
CCompiler - all windows specifics are encoded in this class. So I am
afraid you will need to recreate all this class implementation for the
Intel C Compiler, because contrary to the Linux case, nothing is
abstracted for windows.

> As an aside: how were the Windows 32-bit installers created?

With the mingw compiler.

> Is it possible to recreate this process changing the target arch --> x64?

If you can build numpy with the Intel compiler, building the installer
should be a no-brainer.

cheers,

David

From david at ar.media.kyoto-u.ac.jp Wed Jan 28 21:11:02 2009
From: david at ar.media.kyoto-u.ac.jp (David Cournapeau)
Date: Thu, 29 Jan 2009 11:11:02 +0900
Subject: [Numpy-discussion] A buildbot farm with shell access - for free ?
Message-ID: <49811036.5030701@ar.media.kyoto-u.ac.jp>

Hi,

Just saw that on one ML:

http://www.snakebite.org/
http://mail.python.org/pipermail/python-committers/2009-January/000331.html

Bottom line: it looks like there is a set of machines which were donated
to the PSF for buildbot *with shell access* so that people can fix
problems appearing on some platforms. If you look at the email, there are
some 'exotic' machines that mere mortals cannot have access to (like
Tru64 on Itanium: to quote the email "massive quad Itanium 2 RX-5670s,
chock full of 73GB 15k disks and no less than 78GB of RAM between the two
servers; 32GB in one and 46GB in the other"). There are also windows
machines available.

It is said in the email that this is reserved to the python project, and
prominent python projects like Twisted and Django. Would it be ok to try
to be qualified as a prominent python project as well?

cheers,

David

From charlesr.harris at gmail.com Wed Jan 28 21:31:52 2009
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Wed, 28 Jan 2009 19:31:52 -0700
Subject: [Numpy-discussion] A buildbot farm with shell access - for free ?
In-Reply-To: <49811036.5030701@ar.media.kyoto-u.ac.jp>
References: <49811036.5030701@ar.media.kyoto-u.ac.jp>
Message-ID: 

On Wed, Jan 28, 2009 at 7:11 PM, David Cournapeau
<david at ar.media.kyoto-u.ac.jp> wrote:
> It is said in the email that this is reserved to the python project, and
> prominent python projects like Twisted and Django. Would it be ok to try
> to be qualified as a prominent python project as well ?

Ohhh... I love buildbots. Go for it.

Chuck

From eads at soe.ucsc.edu Wed Jan 28 22:20:47 2009
From: eads at soe.ucsc.edu (Damian Eads)
Date: Wed, 28 Jan 2009 19:20:47 -0800
Subject: [Numpy-discussion] A buildbot farm with shell access - for free ?
In-Reply-To: <49811036.5030701@ar.media.kyoto-u.ac.jp>
References: <49811036.5030701@ar.media.kyoto-u.ac.jp>
Message-ID: <91b4b1ab0901281920w9d68fd7wa4e58eaa2ee8d52a@mail.gmail.com>

Sounds like a great idea!

On 1/28/09, David Cournapeau <david at ar.media.kyoto-u.ac.jp> wrote:
> Would it be ok to try to be qualified as a prominent python project as
> well ?

--
Sent from my mobile device

-----------------------------------------------------
Damian Eads                             Ph.D. Student
Jack Baskin School of Engineering, UCSC        E2-489
1156 High Street                 Machine Learning Lab
Santa Cruz, CA 95064    http://www.soe.ucsc.edu/~eads

From millman at berkeley.edu Wed Jan 28 22:55:08 2009
From: millman at berkeley.edu (Jarrod Millman)
Date: Wed, 28 Jan 2009 19:55:08 -0800
Subject: [Numpy-discussion] A buildbot farm with shell access - for free ?
In-Reply-To: <49811036.5030701@ar.media.kyoto-u.ac.jp>
References: <49811036.5030701@ar.media.kyoto-u.ac.jp>
Message-ID: 

On Wed, Jan 28, 2009 at 6:11 PM, David Cournapeau wrote:
> It is said in the email that this is reserved to the python project, and
> prominent python projects like Twisted and Django.
> Would it be ok to try to be qualified as a prominent python project as
> well ?

That would be great.

From simpson at math.toronto.edu Wed Jan 28 23:59:18 2009
From: simpson at math.toronto.edu (Gideon Simpson)
Date: Wed, 28 Jan 2009 23:59:18 -0500
Subject: [Numpy-discussion] convolution axis
Message-ID: <71D5E992-7F18-40DF-9C6F-C3164D07F9F2@math.toronto.edu>

Is there an easy way to perform convolutions along a particular axis of
an array?

-gideon

From pgmdevlist at gmail.com Thu Jan 29 00:28:46 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Thu, 29 Jan 2009 00:28:46 -0500
Subject: [Numpy-discussion] Documentation: objects.inv ?
Message-ID: <0BFBC3A1-60EA-4E65-8E16-6C881976A129@gmail.com>

All,
Is there an objects.inv lying around for the numpy reference guide, or
should I start one from scratch?
Thx a lot in advance.
P.

From f.yw at hotmail.com Thu Jan 29 00:52:42 2009
From: f.yw at hotmail.com (frank wang)
Date: Wed, 28 Jan 2009 22:52:42 -0700
Subject: [Numpy-discussion] help on fast slicing on a grid
In-Reply-To: 
References: <49811036.5030701@ar.media.kyoto-u.ac.jp>
Message-ID: 

Hi,

I have to build a grid with 256 points by the command:

a = arange(-15,16,2)
L = len(a)
cnstl = a.reshape(L,1)+1j*a

My problem is that I have a big data array that contains the data around
the points in cnstl. I want to slice each point to the closest cnstl point
and also compute the error. The decision condition is the midpoint between
two points on the x and y axes. I can do it in a for loop. Since Python
and numpy have a lot of magic, I want to find an efficient way to do it.
This problem arises from QAM 256 modulation.

Thanks

Frank

From robert.kern at gmail.com Thu Jan 29 00:57:16 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 28 Jan 2009 23:57:16 -0600
Subject: [Numpy-discussion] help on fast slicing on a grid
In-Reply-To: 
References: <49811036.5030701@ar.media.kyoto-u.ac.jp>
Message-ID: <3d375d730901282157l4797a93ex293330d5c7d057c0@mail.gmail.com>

On Wed, Jan 28, 2009 at 23:52, frank wang wrote:
>
> Hi,
>
> I have to build a grid with 256 points by the command:
> a = arange(-15,16,2)
> L = len(a)
> cnstl = a.reshape(L,1)+1j*a
>
> My problem is that I have a big data array that contains the data around
> the points in cnstl. I want to slice each point to the closest cnstl
> point and also compute the error. The decision condition is the midpoint
> between two points on the x and y axes. I can do it in a for loop. Since
> Python and numpy have a lot of magic, I want to find an efficient way to
> do it. This problem arises from QAM 256 modulation.

Can you show us the for loop? I'm not really sure what you want to compute.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From f.yw at hotmail.com Thu Jan 29 01:09:01 2009
From: f.yw at hotmail.com (frank wang)
Date: Wed, 28 Jan 2009 23:09:01 -0700
Subject: [Numpy-discussion] help on fast slicing on a grid
In-Reply-To: <3d375d730901282157l4797a93ex293330d5c7d057c0@mail.gmail.com>
References: <49811036.5030701@ar.media.kyoto-u.ac.jp> <3d375d730901282157l4797a93ex293330d5c7d057c0@mail.gmail.com>
Message-ID: 

Here is the for loop that I am thinking about.
Also, I do not know whether the where command can handle the complicated
logic. The where command basically finds the data in the square around
the point cnstl[j]. Let the data array be qam with size N:

Out = X
error = X
for i in arange(N):
    for j in arange(L):
        aa = np.where((real(X) < real(cnstl[j])+1) & (real(X) > real(cnstl[j])-1) &
                      (imag(X) < imag(cnstl[j])+1) & (imag(X) > imag(cnstl[j])-1))
        Out[aa] = cnstl[j]
        error[aa] = abs(X)**2 - abs(cnstl[j])**2

Thanks

Frank

From robert.kern at gmail.com Thu Jan 29 01:15:48 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Thu, 29 Jan 2009 00:15:48 -0600
Subject: [Numpy-discussion] help on fast slicing on a grid
In-Reply-To: 
References: <49811036.5030701@ar.media.kyoto-u.ac.jp> <3d375d730901282157l4797a93ex293330d5c7d057c0@mail.gmail.com>
Message-ID: <3d375d730901282215t169158ccvfc406bc30a9cdd9a@mail.gmail.com>

On Thu, Jan 29, 2009 at 00:09, frank wang wrote:
> Here is the for loop that I am thinking about. Also, I do not know
> whether the where command can handle the complicated logic.
> The where command basically finds the data in the square around the
> point cnstl[j].

cnstl is a 2D array from your previous description.

> Let the data array be qam with size N

I don't see qam anywhere. Did you mean X?

> Out = X
> error = X

Don't you want something like zeros_like(X) for these?

> for i in arange(N):
>     for j in arange(L):
>         aa = np.where((real(X) < real(cnstl[j])+1) & (real(X) > real(cnstl[j])-1) &
>                       (imag(X) < imag(cnstl[j])+1) & (imag(X) > imag(cnstl[j])-1))
>         Out[aa] = cnstl[j]
>         error[aa] = abs(X)**2 - abs(cnstl[j])**2

I'm still confused. Can you show me a complete, working script with
possibly fake data?

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From nadavh at visionsense.com Thu Jan 29 01:18:33 2009
From: nadavh at visionsense.com (Nadav Horesh)
Date: Thu, 29 Jan 2009 08:18:33 +0200
Subject: [Numpy-discussion] convolution axis
References: <71D5E992-7F18-40DF-9C6F-C3164D07F9F2@math.toronto.edu>
Message-ID: <710F2847B0018641891D9A216027636029C3FE@ex3.envision.co.il>

There are at least two options:
1. use convolve1d from numpy.numarray.nd_image (or scipy.ndimage)
2. use scipy.signal.convolve and adjust the dimensions of the convolution
kernel to align it along the desired axis.

  Nadav

-----Original Message-----
From: numpy-discussion-bounces at scipy.org on behalf of Gideon Simpson
Sent: Thu, 29 Jan 2009 06:59
To: Discussion of Numerical Python
Subject: [Numpy-discussion] convolution axis

Is there an easy way to perform convolutions along a particular axis of
an array?

-gideon

From f.yw at hotmail.com Thu Jan 29 01:28:47 2009
From: f.yw at hotmail.com (frank wang)
Date: Wed, 28 Jan 2009 23:28:47 -0700
Subject: [Numpy-discussion] help on fast slicing on a grid
In-Reply-To: <3d375d730901282215t169158ccvfc406bc30a9cdd9a@mail.gmail.com>
References: <49811036.5030701@ar.media.kyoto-u.ac.jp> <3d375d730901282157l4797a93ex293330d5c7d057c0@mail.gmail.com> <3d375d730901282215t169158ccvfc406bc30a9cdd9a@mail.gmail.com>
Message-ID: 

Hi, Bob,

Thanks for your help. I am sorry for my typo. The qam array is the X
array in my example. cnstl is a complex array containing the points'
(x,y) coordinates. I will try to make a workable example. Also I will try
to find out the zeros_like function. However, I guess that zeros_like(X)
will create an array the same size as X. If it is, then the two lines
Out=X and error=X should be Out=zeros_like(X) and error=zeros_like(X).

Also, can the where command handle the logic?

aa = np.where((real(X) < real(cnstl[j])+1) & (real(X) > real(cnstl[j])-1) &
              (imag(X) < imag(cnstl[j])+1) & (imag(X) > imag(cnstl[j])-1))

For example, if cnstl[j]=3+1j*5, then the where command is the same as:

aa = np.where((real(X)<4) & (real(X)>2) & (imag(X)<6) & (imag(X)>4))

Thanks

Frank
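For what it's worth, the slicing can be vectorized with no loop over
constellation points at all, because the grid is just the odd integers in
[-15, 15]; a sketch assuming the cnstl grid defined earlier in the thread
and keeping frank's error metric:

import numpy as np

def slice_qam(X):
    # snap each sample to the nearest odd integer on both axes,
    # clipping to the outermost constellation points at +/-15
    re = np.clip(2*np.floor(X.real/2.0) + 1, -15, 15)
    im = np.clip(2*np.floor(X.imag/2.0) + 1, -15, 15)
    out = re + 1j*im
    error = np.abs(X)**2 - np.abs(out)**2  # frank's error definition
    return out, error

This produces the decision for every sample at once, so neither the
double loop nor np.where is needed.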
From scott.sinclair.za at gmail.com Thu Jan 29 01:41:54 2009
From: scott.sinclair.za at gmail.com (Scott Sinclair)
Date: Thu, 29 Jan 2009 08:41:54 +0200
Subject: [Numpy-discussion] make latex in numpy/doc failed
In-Reply-To: 
References: 
Message-ID: <6a17e9ee0901282241l10e808aauf2b0acbb7fae0596@mail.gmail.com>

> 2009/1/27 Nils Wagner :
> a make latex in numpy/doc failed with
>
> ...
> Intersphinx hit: PyObject
> http://docs.python.org/dev/c-api/structures.html
> writing... Sphinx error:
> too many nesting section levels for LaTeX, at heading:
> numpy.ma.MaskedArray.__lt__
> make: *** [latex] Fehler 1
>
> I am using sphinx v0.5.1
> BTW, make html works fine here.

I see this problem too. It used to work, and I don't think I've changed
anything on my system.

Python 2.5.2 (r252:60911, Oct 5 2008, 19:24:49)
[GCC 4.3.2] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> import numpy
>>> numpy.__version__
'1.3.0.dev6335'
>>> import sphinx
>>> sphinx.__version__
'0.5.1'

Should I file a ticket, or just let whoever has to build the docs for the
next release sort it out when the time comes?

Cheers,
Scott

From pav at iki.fi Thu Jan 29 03:17:28 2009
From: pav at iki.fi (Pauli Virtanen)
Date: Thu, 29 Jan 2009 08:17:28 +0000 (UTC)
Subject: [Numpy-discussion] Documentation: objects.inv ?
References: <0BFBC3A1-60EA-4E65-8E16-6C881976A129@gmail.com>
Message-ID: 

Thu, 29 Jan 2009 00:28:46 -0500, Pierre GM wrote:
> Is there an objects.inv lying around for the numpy reference guide, or
> should I start one from scratch ?

It's automatically generated by Sphinx, and can be found at

http://docs.scipy.org/doc/numpy/objects.inv

Let's make the promise that it shall be found there in the future, too.

--
Pauli Virtanen

From pgmdevlist at gmail.com Thu Jan 29 03:58:39 2009
From: pgmdevlist at gmail.com (Pierre GM)
Date: Thu, 29 Jan 2009 03:58:39 -0500
Subject: [Numpy-discussion] Documentation: objects.inv ?
In-Reply-To: 
References: <0BFBC3A1-60EA-4E65-8E16-6C881976A129@gmail.com>
Message-ID: 

On Jan 29, 2009, at 3:17 AM, Pauli Virtanen wrote:
> Thu, 29 Jan 2009 00:28:46 -0500, Pierre GM wrote:
>> Is there an objects.inv lying around for the numpy reference guide, or
>> should I start one from scratch ?
>
> It's automatically generated by Sphinx, and can be found at
>
> http://docs.scipy.org/doc/numpy/objects.inv
>
> Let's make the promise that it shall be found there in the future, too.

Got it, thanks a lot.
Pauli, how often is the documentation on docs.scipy.org updated from SVN?

Thx again
P.

From scott.sinclair.za at gmail.com Thu Jan 29 05:03:48 2009
From: scott.sinclair.za at gmail.com (Scott Sinclair)
Date: Thu, 29 Jan 2009 12:03:48 +0200
Subject: [Numpy-discussion] Documentation: objects.inv ?
From scott.sinclair.za at gmail.com Thu Jan 29 05:03:48 2009
From: scott.sinclair.za at gmail.com (Scott Sinclair)
Date: Thu, 29 Jan 2009 12:03:48 +0200
Subject: [Numpy-discussion] Documentation: objects.inv ?
In-Reply-To: References: <0BFBC3A1-60EA-4E65-8E16-6C881976A129@gmail.com>
Message-ID: <6a17e9ee0901290203i2dbb1638jc9b456de3cfae04a@mail.gmail.com>

> 2009/1/29 Pierre GM :
> Pauli, how often is the documentation on docs.scipy.org updated from
> SVN ?

My understanding is the following:

SVN -> doc-wiki - updated once daily at around 10:00 (UTC?).
doc-wiki -> SVN - infrequently, when someone applies one or more doc patches produced from the doc-wiki to SVN.
Documentation on docs.scipy.org - infrequently, when someone builds the docs and posts them.

Cheers,
Scott

From watson.jim at gmail.com Thu Jan 29 05:11:20 2009
From: watson.jim at gmail.com (James Watson)
Date: Thu, 29 Jan 2009 10:11:20 +0000
Subject: [Numpy-discussion] porting NumPy to Python 3
Message-ID:

Hi, I am interested in contributing to the port of NumPy to Python 3. Who should I coordinate this effort with? I have started at the Python end of the problem (as opposed to http://www.scipy.org/Python3k), e.g. I have several patches to get 2to3 to work on NumPy's Python source code.

James.

From william at resolversystems.com Thu Jan 29 06:43:48 2009
From: william at resolversystems.com (William Reade)
Date: Thu, 29 Jan 2009 11:43:48 +0000
Subject: [Numpy-discussion] Ironclad 0.8 released
Message-ID: <49819674.3040202@resolversystems.com>

Hi all

I'm delighted to announce the release of Ironclad v0.8 -- the all-singing, all-dancing CPython API compatibility layer for IronPython -- available now from http://code.google.com/p/ironclad/ . Notable improvements over the last release include:

* Ironclad is now a neatly self-contained package -- just copy to your site-packages and 'import ironclad'.
* No more silly requirement to call ironclad.shutdown() when you're finished.
* A few performance improvements.
* Over 900 NumPy tests now pass: in fact, almost all the tests from the core, fft, lib, linalg, ma, oldnumeric and random subpackages.
* Over half the .pyds distributed with CPython 2.5 now import cleanly; some of them appear to actually work, including _hashlib and _elementtree.

Ironclad grows more stable and mature with every release, and I urge IronPython users to try it out and share their impressions: feedback, whether positive or negative, is always welcomed.

Cheers
William

From lists at cheimes.de Thu Jan 29 08:36:07 2009
From: lists at cheimes.de (Christian Heimes)
Date: Thu, 29 Jan 2009 14:36:07 +0100
Subject: [Numpy-discussion] A buildbot farm with shell access - for free ?
In-Reply-To: <49811036.5030701@ar.media.kyoto-u.ac.jp>
References: <49811036.5030701@ar.media.kyoto-u.ac.jp>
Message-ID:

David Cournapeau wrote:
> It is said in the email that this is reserved to the python project, and
> prominent python projects like Twisted and Django. Would it be ok to try
> to be qualified as a prominent python project as well ?

Give it some time. Nobody - not even the Python core devs - have access to the machines. It's going to take at least several weeks to get the infrastructure running and to establish a policy.

From my perspective NumPy and Sage both count as prominent Python projects. Heck, you are in competition with tools like Matlab and you ain't looking bad! Furthermore you could make better use of the machines than Django because you are using way more C extensions and esoteric libraries.

I recommend you subscribe to the snakebite list and bring up your interest. You got my +1 already. For now the list is snakebite-list at googlegroups.com but it will move to another server (probably Python.org) eventually.
Christian From mcolonno at gmail.com Thu Jan 29 10:57:10 2009 From: mcolonno at gmail.com (Michael Colonno) Date: Thu, 29 Jan 2009 07:57:10 -0800 Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers In-Reply-To: <5b8d13220901281636y2fe1b55dt3cb59dd46b43b1d0@mail.gmail.com> References: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com> <497FC559.4000801@ar.media.kyoto-u.ac.jp> <9cca9e840901271931s1706de0egd28382e4125644a@mail.gmail.com> <49801FF8.8040805@ar.media.kyoto-u.ac.jp> <9cca9e840901280818l2106e710q16599bf1423c3caf@mail.gmail.com> <5b8d13220901281636y2fe1b55dt3cb59dd46b43b1d0@mail.gmail.com> Message-ID: <9cca9e840901290757y10350efftfbdb8e4204c16795@mail.gmail.com> OK, some progress here. I have two questions: 1) Let's forget about the Intel compiler(s) for the moment and focus on a standard Windows build. Python 2.6 comes with two classes in distutils: msvccompiler.py and msvc9compiler.py. In reading through these, it appears msvc9compiler.py is ideally suited for what I'm trying to do (use VS 2008 tools). Is it possible to select this via command line flags with config --compiler=xxx? Choosing "msvc" *seems* to go for msvccompiler.py (I'm just tyring to trace the magic as things build). 2) when using the standard msvc setup, things do seem to work re: setting up the build environemnt (see below). Now, the VS compiler kicks out a syntax error (output copied below). Any thoughts? This looks like a peculiarity of the VS compiler but I'm not sure. (I tend to prefer the Intel C compiler because it is more "Linux-like" and seems to be happier with cross-platform code.) Thanks, ~Mike C. ----------------------- [copying.... output edited for bevity] running build_ext No module named msvccompiler in numpy.distutils; trying from distutils customize MSVCCompiler customize MSVCCompiler using build_ext building 'numpy.core.multiarray' extension compiling C sources creating build\temp.win-amd64-2.6 creating build\temp.win-amd64-2.6\Release creating build\temp.win-amd64-2.6\Release\numpy creating build\temp.win-amd64-2.6\Release\numpy\core creating build\temp.win-amd64-2.6\Release\numpy\core\src C:\Program Files (x86)\Microsoft Visual Studio 9.0\VC\BIN\amd64\cl.exe /c /nolog o /Ox /MD /W3 /GS- /DNDEBUG -Ibuild\src.win-amd64-2.6\numpy\core\src -Inumpy\cor e\include -Ibuild\src.win-amd64-2.6\numpy\core\include/numpy -Inumpy\core\src -I numpy\core\include -IC:\Python26\include -IC:\Python26\PC /Tcnumpy\core\src\mult iarraymodule.c /Fobuild\temp.win-amd64-2.6\Release\numpy\core\src\multiarraymodu le.obj C:\Program Files (x86)\Microsoft Visual Studio 9.0\VC\BIN\amd64\link.exe /DLL /n ologo /INCREMENTAL:NO /LIBPATH:C:\Python26\libs /LIBPATH:C:\Python26\PCbuild\amd 64 /EXPORT:initmultiarray build\temp.win-amd64-2.6\Release\numpy\core\src\multia rraymodule.obj /OUT:build\lib.win-amd64-2.6\numpy\core\multiarray.pyd /IMPLIB:bu ild\temp.win-amd64-2.6\Release\numpy\core\src\multiarray.lib /MANIFESTFILE:build \temp.win-amd64-2.6\Release\numpy\core\src\multiarray.pyd.manifest mt.exe -nologo -manifest build\temp.win-amd64-2.6\Release\numpy\core\src\multiar ray.pyd.manifest -outputresource:build\lib.win-amd64-2.6\numpy\core\multiarray.p yd;2 building 'numpy.core.umath' extension compiling C sources creating build\temp.win-amd64-2.6\Release\build creating build\temp.win-amd64-2.6\Release\build\src.win-amd64-2.6 creating build\temp.win-amd64-2.6\Release\build\src.win-amd64-2.6\numpy creating build\temp.win-amd64-2.6\Release\build\src.win-amd64-2.6\numpy\core creating 
build\temp.win-amd64-2.6\Release\build\src.win-amd64-2.6\numpy\core\src C:\Program Files (x86)\Microsoft Visual Studio 9.0\VC\BIN\amd64\cl.exe /c /nolog o /Ox /MD /W3 /GS- /DNDEBUG -Ibuild\src.win-amd64-2.6\numpy\core\src -Inumpy\cor e\include -Ibuild\src.win-amd64-2.6\numpy\core\include/numpy -Inumpy\core\src -I numpy\core\include -IC:\Python26\include -IC:\Python26\PC /Tcbuild\src.win-amd64 -2.6\numpy\core\src\umathmodule.c /Fobuild\temp.win-amd64-2.6\Release\build\src. win-amd64-2.6\numpy\core\src\umathmodule.obj umathmodule.c numpy\core\src\umathmodule.c.src(64) : error C2059: syntax error : 'type' numpy\core\src\umathmodule.c.src(70) : error C2059: syntax error : 'type' numpy\core\src\ufuncobject.c(1704) : warning C4244: '=' : conversion from 'npy_i ntp' to 'int', possible loss of data numpy\core\src\ufuncobject.c(2425) : warning C4244: '=' : conversion from 'npy_i ntp' to 'int', possible loss of data error: Command "C:\Program Files (x86)\Microsoft Visual Studio 9.0\VC\BIN\amd64\ cl.exe /c /nologo /Ox /MD /W3 /GS- /DNDEBUG -Ibuild\src.win-amd64-2.6\numpy\core \src -Inumpy\core\include -Ibuild\src.win-amd64-2.6\numpy\core\include/numpy -In umpy\core\src -Inumpy\core\include -IC:\Python26\include -IC:\Python26\PC /Tcbui ld\src.win-amd64-2.6\numpy\core\src\umathmodule.c /Fobuild\temp.win-amd64-2.6\Re lease\build\src.win-amd64-2.6\numpy\core\src\umathmodule.obj" failed with exit s tatus 2 ----------------------- On Wed, Jan 28, 2009 at 4:36 PM, David Cournapeau wrote: > On Thu, Jan 29, 2009 at 1:18 AM, Michael Colonno > wrote: > > I think this is doable; thankfully the Intel compilers on Windows and > > Linux are very similar in behavior. > > The problem is that distutils does not abstract this kind of things: > you have a CCompiler class, and a subclass Unix C Compiler, and then > Intel Compiler. OTOH, the MS compiler is its own class which inherit > from CCompiler - all windows specifics are encoded in this class. So I > am afraid you will need to recreate all this class implementation for > Intel C Compiler, because contrary to the Linux case, nothing is > abstracted for windows. > > > As an aside: how were the Windows 32-bit installers created? > > With mingw compiler. > > > Is > > it possible to recreate this process changing the target arch --> x64? > > If you can build numpy with the Intel compiler, building the installer > should be a no-brainer. > > cheers, > > David > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From david at ar.media.kyoto-u.ac.jp Thu Jan 29 10:59:28 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Fri, 30 Jan 2009 00:59:28 +0900 Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers In-Reply-To: <9cca9e840901290757y10350efftfbdb8e4204c16795@mail.gmail.com> References: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com> <497FC559.4000801@ar.media.kyoto-u.ac.jp> <9cca9e840901271931s1706de0egd28382e4125644a@mail.gmail.com> <49801FF8.8040805@ar.media.kyoto-u.ac.jp> <9cca9e840901280818l2106e710q16599bf1423c3caf@mail.gmail.com> <5b8d13220901281636y2fe1b55dt3cb59dd46b43b1d0@mail.gmail.com> <9cca9e840901290757y10350efftfbdb8e4204c16795@mail.gmail.com> Message-ID: <4981D260.1040903@ar.media.kyoto-u.ac.jp> Michael Colonno wrote: > OK, some progress here. 
> I have two questions: 1) Let's forget about the Intel compiler(s) for
> the moment and focus on a standard Windows build. Python 2.6 comes with
> two classes in distutils: msvccompiler.py and msvc9compiler.py. In
> reading through these, it appears msvc9compiler.py is ideally suited
> for what I'm trying to do (use VS 2008 tools). Is it possible to select
> this via command line flags with config --compiler=xxx?

No - python-distutils normally builds extensions with the same compiler as the one used to build python itself. Which means VS 2008 for python 2.6, VS 2003 .Net for 2.5 (except for 64 bits which uses a variant of VS 2005). You *cannot* build an extension with VS 2008 for a python built with VS 2003, for various fundamental reasons.

> Any thoughts? This looks like a peculiarity of the VS compiler but
> I'm not sure. (I tend to prefer the Intel C compiler because it is
> more "Linux-like" and seems to be happier with cross-platform code.)

Most numpy/scipy developers use gcc compilers on most platforms, so sometimes some things which do not pass with another compiler slip in. It is possible that this is the case here - but I could build numpy with VS 2008 on windows x64 a few weeks ago, so it should only be a relatively small regression. I will look at it,

David

From simpson at math.toronto.edu Thu Jan 29 11:22:05 2009
From: simpson at math.toronto.edu (Gideon Simpson)
Date: Thu, 29 Jan 2009 11:22:05 -0500
Subject: [Numpy-discussion] convolution axis
In-Reply-To: <710F2847B0018641891D9A216027636029C3FE@ex3.envision.co.il>
References: <71D5E992-7F18-40DF-9C6F-C3164D07F9F2@math.toronto.edu> <710F2847B0018641891D9A216027636029C3FE@ex3.envision.co.il>
Message-ID:

The first option doesn't accept complex data.

-gideon

On Jan 29, 2009, at 1:18 AM, Nadav Horesh wrote:
> There are at least two options:
> 1. use convolve1d from numpy.numarray.nd_image (or scipy.ndimage)
> 2. use scipy.signal.convolve and adjust the dimensions of the
> convolution kernel to align it along the desired axis.
>
> Nadav
>
> -----Original Message-----
> From: numpy-discussion-bounces at scipy.org on behalf of Gideon Simpson
> Sent: Thu 29-January-09 06:59
> To: Discussion of Numerical Python
> Subject: [Numpy-discussion] convolution axis
>
> Is there an easy way to perform convolutions along a particular axis
> of an array?
>
> -gideon
>
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

_______________________________________________
Numpy-discussion mailing list
Numpy-discussion at scipy.org
http://projects.scipy.org/mailman/listinfo/numpy-discussion
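(scipy.signal.convolve does handle complex input, so option 2 remains available; a small sketch of the kernel-reshaping trick Nadav describes, with fake data and 'same' mode picked arbitrarily:)

import numpy as np
from scipy import signal

a = np.random.randn(4, 6) + 1j * np.random.randn(4, 6)  # complex 2-D data
k = np.array([1.0, 2.0, 1.0])                            # 1-D kernel

# give the kernel a singleton dimension so it lies along one axis
along_axis1 = signal.convolve(a, k[np.newaxis, :], mode='same')  # axis 1
along_axis0 = signal.convolve(a, k[:, np.newaxis], mode='same')  # axis 0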
From mcolonno at gmail.com Thu Jan 29 11:44:10 2009
From: mcolonno at gmail.com (Michael Colonno)
Date: Thu, 29 Jan 2009 08:44:10 -0800
Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers
In-Reply-To: <4981D260.1040903@ar.media.kyoto-u.ac.jp>
References: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com> <497FC559.4000801@ar.media.kyoto-u.ac.jp> <9cca9e840901271931s1706de0egd28382e4125644a@mail.gmail.com> <49801FF8.8040805@ar.media.kyoto-u.ac.jp> <9cca9e840901280818l2106e710q16599bf1423c3caf@mail.gmail.com> <5b8d13220901281636y2fe1b55dt3cb59dd46b43b1d0@mail.gmail.com> <9cca9e840901290757y10350efftfbdb8e4204c16795@mail.gmail.com> <4981D260.1040903@ar.media.kyoto-u.ac.jp>
Message-ID: <9cca9e840901290844g1a0f43e9r772f869c2c785d7b@mail.gmail.com>

OK, I think I have a much better understanding of how this all gets assembled now. I've got the build environment using both Intel compilers (C++ and Fortran) and linked to the Intel MKL. Using the Intel C compiler (icl.exe, more "gcc-like") as a replacement for cl.exe (it supports the same options, flags, and such, thankfully), the build proceeds further but eventually kicks out with the syntax / parsing error copied below. Let me know what you think. Thanks!

~Mike C.

----------------------------------

[edited]
building 'numpy.core.umath' extension
compiling C sources
creating build\temp.win-amd64-2.6\Release\build
creating build\temp.win-amd64-2.6\Release\build\src.win-amd64-2.6
creating build\temp.win-amd64-2.6\Release\build\src.win-amd64-2.6\numpy
creating build\temp.win-amd64-2.6\Release\build\src.win-amd64-2.6\numpy\core
creating build\temp.win-amd64-2.6\Release\build\src.win-amd64-2.6\numpy\core\src
C:\Program Files (x86)\Intel\Compiler\11.0\061\cpp\Bin\intel64\icl.exe /c /nologo /Ox /MD /W3 /GS- /DNDEBUG -Ibuild\src.win-amd64-2.6\numpy\core\src -Inumpy\core\include -Ibuild\src.win-amd64-2.6\numpy\core\include/numpy -Inumpy\core\src -Inumpy\core\include -IC:\Python26\include -IC:\Python26\PC /Tcbuild\src.win-amd64-2.6\numpy\core\src\umathmodule.c /Fobuild\temp.win-amd64-2.6\Release\build\src.win-amd64-2.6\numpy\core\src\umathmodule.obj
umathmodule.c
numpy\core\src\umathmodule.c.src(64): error: expected an identifier
static float frexpf(float x, int * i)
^
numpy\core\src\umathmodule.c.src(64): error: expected a ")"
static float frexpf(float x, int * i)
^
numpy\core\src\umathmodule.c.src(65): error: expected a ";"
{
^
numpy\core\src\umathmodule.c.src(84): warning #12: parsing restarts here after previous syntax error
double log1p(double);
^
numpy\core\include\numpy/ufuncobject.h(358): warning #177: function "generate_overflow_error" was declared but never referenced
static void generate_overflow_error(void) {
^
compilation aborted for build\src.win-amd64-2.6\numpy\core\src\umathmodule.c (code 2)
error: Command "C:\Program Files (x86)\Intel\Compiler\11.0\061\cpp\Bin\intel64\icl.exe /c /nologo /Ox /MD /W3 /GS- /DNDEBUG -Ibuild\src.win-amd64-2.6\numpy\core\src -Inumpy\core\include -Ibuild\src.win-amd64-2.6\numpy\core\include/numpy -Inumpy\core\src -Inumpy\core\include -IC:\Python26\include -IC:\Python26\PC /Tcbuild\src.win-amd64-2.6\numpy\core\src\umathmodule.c /Fobuild\temp.win-amd64-2.6\Release\build\src.win-amd64-2.6\numpy\core\src\umathmodule.obj" failed with exit status 2

----------------------------------

On Thu, Jan 29, 2009 at 7:59 AM, David Cournapeau <david at ar.media.kyoto-u.ac.jp> wrote:
> Michael Colonno wrote:
> > OK, some progress here.
(I tend to prefer the Intel C compiler because it is > > more "Linux-like" and seems to be happier with cross-platform code.) > > Most numpy/scipy developers use gcc compilers on most platforms, so > sometimes some things which do not pass with another compiler slip in. > It is possible that this is the case here - but I could build numpy with > VS 2008 on windows x64 a few weeks ago, so it should only be a > relatively small regression. I will look at it, > > David > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthieu.brucher at gmail.com Thu Jan 29 11:50:21 2009 From: matthieu.brucher at gmail.com (Matthieu Brucher) Date: Thu, 29 Jan 2009 17:50:21 +0100 Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers In-Reply-To: <9cca9e840901290844g1a0f43e9r772f869c2c785d7b@mail.gmail.com> References: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com> <497FC559.4000801@ar.media.kyoto-u.ac.jp> <9cca9e840901271931s1706de0egd28382e4125644a@mail.gmail.com> <49801FF8.8040805@ar.media.kyoto-u.ac.jp> <9cca9e840901280818l2106e710q16599bf1423c3caf@mail.gmail.com> <5b8d13220901281636y2fe1b55dt3cb59dd46b43b1d0@mail.gmail.com> <9cca9e840901290757y10350efftfbdb8e4204c16795@mail.gmail.com> <4981D260.1040903@ar.media.kyoto-u.ac.jp> <9cca9e840901290844g1a0f43e9r772f869c2c785d7b@mail.gmail.com> Message-ID: > numpy\core\src\umathmodule.c.src(64): error: expected an identifier > static float frexpf(float x, int * i) > ^ > > numpy\core\src\umathmodule.c.src(64): error: expected a ")" > static float frexpf(float x, int * i) > ^ > > numpy\core\src\umathmodule.c.src(65): error: expected a ";" > { > ^ Perhaps a macro that is replaced? Visual Studio defines min, max as macros. Maybe it is the same for frexpf? Could you check in the preprocessed C++ file ? /E to generate it Intel Compiler, I think. Matthieu -- Information System Engineer, Ph.D. Website: http://matthieu-brucher.developpez.com/ Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92 LinkedIn: http://www.linkedin.com/in/matthieubrucher From mcolonno at gmail.com Thu Jan 29 12:43:35 2009 From: mcolonno at gmail.com (Michael Colonno) Date: Thu, 29 Jan 2009 09:43:35 -0800 Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers In-Reply-To: References: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com> <497FC559.4000801@ar.media.kyoto-u.ac.jp> <9cca9e840901271931s1706de0egd28382e4125644a@mail.gmail.com> <49801FF8.8040805@ar.media.kyoto-u.ac.jp> <9cca9e840901280818l2106e710q16599bf1423c3caf@mail.gmail.com> <5b8d13220901281636y2fe1b55dt3cb59dd46b43b1d0@mail.gmail.com> <9cca9e840901290757y10350efftfbdb8e4204c16795@mail.gmail.com> <4981D260.1040903@ar.media.kyoto-u.ac.jp> <9cca9e840901290844g1a0f43e9r772f869c2c785d7b@mail.gmail.com> Message-ID: <9cca9e840901290943i3b34863bu80125745e4ac9716@mail.gmail.com> Not sure if it's a VS thing since I get different compile errors with either the MS or Intel C compiler under the same environment (Visual Studio for x64 console). Let me start with a more basic question: I'm using the package available on SourceForge (1.2.1, 2008-10-29 14:36). If it can be confirmed this package builds happily via VS 2008 x64 console, can I get a look at the build log and/or configuration? I should be able to duplicate this with identical tools (I hope). 
If it makes a difference I will try to grab the latest & greatest source instead. Thanks, ~Mike C. On Thu, Jan 29, 2009 at 8:50 AM, Matthieu Brucher < matthieu.brucher at gmail.com> wrote: > > numpy\core\src\umathmodule.c.src(64): error: expected an identifier > > static float frexpf(float x, int * i) > > ^ > > > > numpy\core\src\umathmodule.c.src(64): error: expected a ")" > > static float frexpf(float x, int * i) > > ^ > > > > numpy\core\src\umathmodule.c.src(65): error: expected a ";" > > { > > ^ > > Perhaps a macro that is replaced? Visual Studio defines min, max as > macros. Maybe it is the same for frexpf? Could you check in the > preprocessed C++ file ? /E to generate it Intel Compiler, I think. > > Matthieu > -- > Information System Engineer, Ph.D. > Website: http://matthieu-brucher.developpez.com/ > Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92 > LinkedIn: http://www.linkedin.com/in/matthieubrucher > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cournape at gmail.com Thu Jan 29 13:05:36 2009 From: cournape at gmail.com (David Cournapeau) Date: Fri, 30 Jan 2009 03:05:36 +0900 Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers In-Reply-To: <9cca9e840901290943i3b34863bu80125745e4ac9716@mail.gmail.com> References: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com> <9cca9e840901271931s1706de0egd28382e4125644a@mail.gmail.com> <49801FF8.8040805@ar.media.kyoto-u.ac.jp> <9cca9e840901280818l2106e710q16599bf1423c3caf@mail.gmail.com> <5b8d13220901281636y2fe1b55dt3cb59dd46b43b1d0@mail.gmail.com> <9cca9e840901290757y10350efftfbdb8e4204c16795@mail.gmail.com> <4981D260.1040903@ar.media.kyoto-u.ac.jp> <9cca9e840901290844g1a0f43e9r772f869c2c785d7b@mail.gmail.com> <9cca9e840901290943i3b34863bu80125745e4ac9716@mail.gmail.com> Message-ID: <5b8d13220901291005s4907a6dag3ae3fa1bba799ca0@mail.gmail.com> On Fri, Jan 30, 2009 at 2:43 AM, Michael Colonno wrote: > Not sure if it's a VS thing since I get different compile errors with > either the MS or Intel C compiler under the same environment (Visual Studio > for x64 console). Let me start with a more basic question: I'm using the > package available on SourceForge (1.2.1, 2008-10-29 14:36). If it can be > confirmed this package builds happily via VS 2008 x64 console, can I get a > look at the build log and/or configuration? I should be able to duplicate > this with identical tools (I hope). If it makes a difference I will try to > grab the latest & greatest source instead. Oh, you should forget about 1.2.1 for windows x64. A lot of fixes have been applied since then, and in any case, I won't try to fix 1.2.1 build problems. You should definitely use svn if you care about windows x64 - I hope 1.3.0 will be fully supported on that platform. David From timmichelsen at gmx-topmail.de Thu Jan 29 18:46:31 2009 From: timmichelsen at gmx-topmail.de (Tim Michelsen) Date: Fri, 30 Jan 2009 00:46:31 +0100 Subject: [Numpy-discussion] Ironclad 0.8 released In-Reply-To: <49819674.3040202@resolversystems.com> References: <49819674.3040202@resolversystems.com> Message-ID: <49823FD7.3050101@gmx-topmail.de> Hello William, > * A few performance improvements. > * Over 900 NumPy tests now pass: in fact, almost all the tests from the > core, fft, lib, linalg, ma, oldnumeric and random subpackages. 
I have some questions here for the science interested. I hope that they are not too specific: * Can you also use scipy with IronClad/ResolverOne (RO)? * Can you recognize arrays with datetime objects in RO? * How is datetime information handled in RO? * What about rpy? If all the above were working in RO, it would be terrific! > Ironclad grows more stable and mature with every release, and I urge > IronPython users to try it out and share their impressions: feedback, > whether positive or negative, is always welcomed. You may find some information here: http://groups.google.com/group/python-excel/msg/eb349c229696c7f7 I would also recommend this nice & straightforward video http://www.resolversystems.com/screencasts/numpy/ Kind regards, Timmie From mcolonno at gmail.com Thu Jan 29 18:54:35 2009 From: mcolonno at gmail.com (Michael Colonno) Date: Thu, 29 Jan 2009 15:54:35 -0800 Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers In-Reply-To: <5b8d13220901291005s4907a6dag3ae3fa1bba799ca0@mail.gmail.com> References: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com> <49801FF8.8040805@ar.media.kyoto-u.ac.jp> <9cca9e840901280818l2106e710q16599bf1423c3caf@mail.gmail.com> <5b8d13220901281636y2fe1b55dt3cb59dd46b43b1d0@mail.gmail.com> <9cca9e840901290757y10350efftfbdb8e4204c16795@mail.gmail.com> <4981D260.1040903@ar.media.kyoto-u.ac.jp> <9cca9e840901290844g1a0f43e9r772f869c2c785d7b@mail.gmail.com> <9cca9e840901290943i3b34863bu80125745e4ac9716@mail.gmail.com> <5b8d13220901291005s4907a6dag3ae3fa1bba799ca0@mail.gmail.com> Message-ID: <9cca9e840901291554t3e480605ye379f38e1c0c68db@mail.gmail.com> OK, I'm on the cusp of success now. I updated to the most recent code and get to a sequence of errors surround math functions. This must be due to linking the MKL libs. I tried various combinations (including those recommended by the MKL docs) but could not get around the errors. Looks like a missing header file(?) In any event, I hope this can be fixed by getting the right into in site.cfg. Currently I have: [mkl] include_dirs = C:\Program Files (x86)\Intel\Compiler\11.0\061\cpp\mkl\include library_dirs = C:\Program Files (x86)\Intel\Compiler\11.0\061\cpp\mkl\em64t\lib mkl_libs = mkl_em64t, mkl_dll, mkl_core # various combinations of other tried... lapack_libs = mkl_lapack Below is the relevant portion of the output. During config the libraries check in as "FOUND" so I'm pretty sure the paths above are good. Thanks, ~Mike C. 
--------------------------- [edited] numpy\core\src\umath_funcs.inc.src(557): warning #266: function "sinhl" declared implicitly shr = sinhl(xr); ^ numpy\core\src\umath_funcs.inc.src(558): warning #266: function "coshl" declared implicitly chr = coshl(xr); ^ numpy\core\src\umath_loops.inc.src(913): warning #266: function "floorl" declare d implicitly *((longdouble *)op1) = floorl(in1/in2); ^ numpy\core\src\umath_loops.inc.src(923): warning #266: function "fmodl" declared implicitly const longdouble res = fmodl(in1,in2); ^ numpy\core\src\umath_loops.inc.src(1003): warning #266: function "modfl" declare d implicitly *((longdouble *)op1) = modfl(in1, (longdouble *)op2); ^ numpy\core\src\umath_loops.inc.src(1207): warning #266: function "fabsf" declare d implicitly if (fabsf(in1i) <= fabsf(in1r)) { ^ numpy\core\src\umath_loops.inc.src(1110): warning #266: function "floorl" declar ed implicitly ((longdouble *)op1)[0] = floorl((in1r*in2r + in1i*in2i)/d); ^ numpy\core\src\umath_loops.inc.src(1207): warning #266: function "fabsl" declare d implicitly if (fabsl(in1i) <= fabsl(in1r)) { ^ numpy\core\src\umath_loops.inc.src(1246): warning #266: function "sqrtl" declare d implicitly *((longdouble *)op1) = sqrtl(in1r*in1r + in1i*in1i); ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(244): error : identifier "acosl" is undefined arccos_data[2] = (void *) acosl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(253): error : identifier "acoshf" is undefined arccosh_data[0] = (void *) acoshf; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(257): error : identifier "acoshl" is undefined arccosh_data[2] = (void *) acoshl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(270): error : identifier "asinl" is undefined arcsin_data[2] = (void *) asinl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(279): error : identifier "asinhf" is undefined arcsinh_data[0] = (void *) asinhf; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(283): error : identifier "asinhl" is undefined arcsinh_data[2] = (void *) asinhl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(296): error : identifier "atanl" is undefined arctan_data[2] = (void *) atanl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(309): error : identifier "atan2l" is undefined arctan2_data[2] = (void *) atan2l; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(312): error : identifier "atanhf" is undefined arctanh_data[0] = (void *) atanhf; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(316): error : identifier "atanhl" is undefined arctanh_data[2] = (void *) atanhl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(335): error : identifier "ceill" is undefined ceil_data[2] = (void *) ceill; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(343): error : identifier "cosl" is undefined cos_data[2] = (void *) cosl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(356): error : identifier "coshl" is undefined cosh_data[2] = (void *) coshl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(385): error : identifier "expl" is undefined exp_data[2] = (void *) expl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(394): error : identifier "exp2f" is undefined exp2_data[0] = (void *) exp2f; ^ 
build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(396): error : identifier "exp2" is undefined exp2_data[1] = (void *) exp2; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(398): error : identifier "exp2l" is undefined exp2_data[2] = (void *) exp2l; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(401): error : identifier "expm1f" is undefined expm1_data[0] = (void *) expm1f; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(405): error : identifier "expm1l" is undefined expm1_data[2] = (void *) expm1l; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(414): error : identifier "fabsf" is undefined fabs_data[0] = (void *) fabsf; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(418): error : identifier "fabsl" is undefined fabs_data[2] = (void *) fabsl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(425): error : identifier "floorl" is undefined floor_data[2] = (void *) floorl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(438): error : identifier "fmodl" is undefined fmod_data[12] = (void *) fmodl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(441): error : identifier "hypotf" is undefined hypot_data[0] = (void *) hypotf; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(445): error : identifier "hypotl" is undefined hypot_data[2] = (void *) hypotl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(456): error : identifier "logl" is undefined log_data[2] = (void *) logl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(469): error : identifier "log10l" is undefined log10_data[2] = (void *) log10l; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(478): error : identifier "log1pf" is undefined log1p_data[0] = (void *) log1pf; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(482): error : identifier "log1pl" is undefined log1p_data[2] = (void *) log1pl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(491): error : identifier "log2f" is undefined log2_data[0] = (void *) log2f; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(493): error : identifier "log2" is undefined log2_data[1] = (void *) log2; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(495): error : identifier "log2l" is undefined log2_data[2] = (void *) log2l; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(534): error : identifier "powl" is undefined power_data[12] = (void *) powl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(564): error : identifier "rintf" is undefined rint_data[0] = (void *) rintf; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(566): error : identifier "rint" is undefined rint_data[1] = (void *) rint; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(568): error : identifier "rintl" is undefined rint_data[2] = (void *) rintl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(581): error : identifier "sinl" is undefined sin_data[2] = (void *) sinl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(594): error : identifier "sinhl" is undefined sinh_data[2] = (void *) sinhl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(607): error : identifier "sqrtl" is undefined sqrt_data[2] = (void 
*) sqrtl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(624): error : identifier "tanl" is undefined tan_data[2] = (void *) tanl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(637): error : identifier "tanhl" is undefined tanh_data[2] = (void *) tanhl; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(648): error : identifier "truncf" is undefined trunc_data[0] = (void *) truncf; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(650): error : identifier "trunc" is undefined trunc_data[1] = (void *) trunc; ^ build\src.win-amd64-2.6\numpy\core\include/numpy/__umath_generated.c(652): error : identifier "truncl" is undefined trunc_data[2] = (void *) truncl; ^ numpy\core\include\numpy/ufuncobject.h(372): warning #177: function "generate_ov erflow_error" was declared but never referenced static void generate_overflow_error(void) { ^ numpy\core\src\umath_funcs.inc.src(70): warning #177: function "exp2_1mf" was de clared but never referenced exp2_1mf(float x) ^ numpy\core\src\umath_funcs.inc.src(70): warning #177: function "exp2_1m" was dec lared but never referenced exp2_1m(double x) ^ numpy\core\src\umath_funcs.inc.src(70): warning #177: function "exp2_1ml" was de clared but never referenced exp2_1ml(longdouble x) ^ compilation aborted for build\src.win-amd64-2.6\numpy\core\src\umathmodule.c (co de 2) error: Command "C:\Program Files (x86)\Intel\Compiler\11.0\061\cpp\Bin\intel64\i cl.exe /c /nologo /Ox /MD /W3 /GS- /DNDEBUG -Ibuild\src.win-amd64-2.6\numpy\core \src -Inumpy\core\include -Ibuild\src.win-amd64-2.6\numpy\core\include/numpy -In umpy\core\src -Inumpy\core\include -IC:\Python26\include -IC:\Python26\PC /Tcbui ld\src.win-amd64-2.6\numpy\core\src\umathmodule.c /Fobuild\temp.win-amd64-2.6\Re lease\build\src.win-amd64-2.6\numpy\core\src\umathmodule.obj" failed with exit s tatus 2 -------------- next part -------------- An HTML attachment was scrubbed... URL: From timmichelsen at gmx-topmail.de Thu Jan 29 19:04:00 2009 From: timmichelsen at gmx-topmail.de (Tim Michelsen) Date: Fri, 30 Jan 2009 01:04:00 +0100 Subject: [Numpy-discussion] Ironclad 0.8 released In-Reply-To: <49819674.3040202@resolversystems.com> References: <49819674.3040202@resolversystems.com> Message-ID: Hello William, once again. I just noticed that Resolver One can only import data from Excel. In science, the common low level data exchange format is ASCII text like: CSV or tab separated. You may consider adding this. I did not find any installation instructions contained in the docs for the IronClad binaries. Where am I to put the files? Can Resolver One also recognize the python installation which is already installed on the system? Thanks in advance for answering! 
Kind regards, Timmie From mcolonno at gmail.com Thu Jan 29 19:41:41 2009 From: mcolonno at gmail.com (Michael Colonno) Date: Thu, 29 Jan 2009 16:41:41 -0800 Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers In-Reply-To: <9cca9e840901291554t3e480605ye379f38e1c0c68db@mail.gmail.com> References: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com> <9cca9e840901280818l2106e710q16599bf1423c3caf@mail.gmail.com> <5b8d13220901281636y2fe1b55dt3cb59dd46b43b1d0@mail.gmail.com> <9cca9e840901290757y10350efftfbdb8e4204c16795@mail.gmail.com> <4981D260.1040903@ar.media.kyoto-u.ac.jp> <9cca9e840901290844g1a0f43e9r772f869c2c785d7b@mail.gmail.com> <9cca9e840901290943i3b34863bu80125745e4ac9716@mail.gmail.com> <5b8d13220901291005s4907a6dag3ae3fa1bba799ca0@mail.gmail.com> <9cca9e840901291554t3e480605ye379f38e1c0c68db@mail.gmail.com> Message-ID: <9cca9e840901291641n1729b64dm59f6bb5b4896f44b@mail.gmail.com> This is the first failure in the build log: _configtest.c _configtest.c(1): catastrophic error: could not open source file "inttypes.h" #include ^ compilation aborted for _configtest.c (code 4) failure. I don't have this file anywhere on my system; it appears to be a gcc header file. Any suggestions? Thanks, ~Mike C. -------------- next part -------------- An HTML attachment was scrubbed... URL: From william at resolversystems.com Thu Jan 29 22:42:29 2009 From: william at resolversystems.com (William Reade) Date: Fri, 30 Jan 2009 03:42:29 +0000 Subject: [Numpy-discussion] Ironclad 0.8 released In-Reply-To: <49823FD7.3050101@gmx-topmail.de> References: <49819674.3040202@resolversystems.com> <49823FD7.3050101@gmx-topmail.de> Message-ID: <0004ED08-A5AE-4F74-B59A-BF71193F41C1@resolversystems.com> Hi Tim Thank you for your interest! To answer your various questions order of approximately increasing complexity: Most importantly: the currently available version of Resolver One does not have numpy support built in, and for various reasons I wouldn't advise trying to integrate Ironclad with v1.3; please wait for 1.4 :-). I just downloaded R and rpy2; they don't work with Ironclad at the moment, and I'm not yet sure why. Ironclad shouldn't require any special installation: just copy the ironclad package (containing __init__.py, python25.dll and ironclad.dll) to wherever you normally put packages for IronPython (I personally have my IRONPYTHONPATH pointing to the appropriate subdirectories of a standard Python 2.5 install). Clearly my documentation is currently inadequate; I'll have to work on that for the next release. RO uses .NET DateTime objects by default, but it's perfectly possible to use python datetime objects instead; I can confirm that you can put both those datatypes into numpy arrays. While RO doesn't have an explicit CSV/TSV import option, you should be able to paste data in those formats directly into your worksheets; and, as soon as we have numpy integration out in the wild, you should be able to create ndarrays directly from your data files. Scipy support is coming along surprisingly nicely; I'd sort of forgotten about it until a couple of people asked me about it earlier today, and it turns out -- to my surprise and delight -- that it took very little work to get the SVN head running over 800 of the scipy tests (even if you can't just 'import scipy' without errors). Sadly, a lot of the modules seem to use scipy.sparse, which I can't import... yet ;-). So, we haven't built any scipy support into Resolver One yet, but hopefully we'll be able to do so without too much pain. 
Finally, I'm glad you like my screencast :-). I hope I haven't missed anything; let me know if I have, or if I can help with anything else. William On 29 Jan 2009, at 23:46, Tim Michelsen wrote: > Hello William, > >> * A few performance improvements. >> * Over 900 NumPy tests now pass: in fact, almost all the tests >> from the core, fft, lib, linalg, ma, oldnumeric and random >> subpackages. > I have some questions here for the science interested. I hope that > they are not too specific: > > * Can you also use scipy with IronClad/ResolverOne (RO)? > * Can you recognize arrays with datetime objects in RO? > * How is datetime information handled in RO? > * What about rpy? > > If all the above were working in RO, it would be terrific! > >> Ironclad grows more stable and mature with every release, and I >> urge IronPython users to try it out and share their impressions: >> feedback, whether positive or negative, is always welcomed. > You may find some information here: > http://groups.google.com/group/python-excel/msg/eb349c229696c7f7 > > I would also recommend this nice & straightforward video > http://www.resolversystems.com/screencasts/numpy/ > > Kind regards, > Timmie From cournape at gmail.com Fri Jan 30 02:20:31 2009 From: cournape at gmail.com (David Cournapeau) Date: Fri, 30 Jan 2009 16:20:31 +0900 Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers In-Reply-To: <9cca9e840901291641n1729b64dm59f6bb5b4896f44b@mail.gmail.com> References: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com> <5b8d13220901281636y2fe1b55dt3cb59dd46b43b1d0@mail.gmail.com> <9cca9e840901290757y10350efftfbdb8e4204c16795@mail.gmail.com> <4981D260.1040903@ar.media.kyoto-u.ac.jp> <9cca9e840901290844g1a0f43e9r772f869c2c785d7b@mail.gmail.com> <9cca9e840901290943i3b34863bu80125745e4ac9716@mail.gmail.com> <5b8d13220901291005s4907a6dag3ae3fa1bba799ca0@mail.gmail.com> <9cca9e840901291554t3e480605ye379f38e1c0c68db@mail.gmail.com> <9cca9e840901291641n1729b64dm59f6bb5b4896f44b@mail.gmail.com> Message-ID: <5b8d13220901292320t74f796ceg919d224aa87d1ed@mail.gmail.com> On Fri, Jan 30, 2009 at 9:41 AM, Michael Colonno wrote: > This is the first failure in the build log: > > _configtest.c > _configtest.c(1): catastrophic error: could not open source file > "inttypes.h" > #include > ^ > > compilation aborted for _configtest.c (code 4) > failure. That's strange, because configtest.c means it is a configuration test, and failure just means the tested feature is not available on the platform - it should certainly not stop the build process. Is this with Intel compiler ? I think I remembered having seen some bug either in python or Intel Compiler in the error return code when launching intel compiler in a subprocess. I just checked the last svn version on windows 64: it builds OK with compiler version 15 (either VS 2008 for 64 bits, which I don't have access to, or with the Platform SDK 6.1, which is free). 
David

From hanni.ali at gmail.com Fri Jan 30 07:32:48 2009
From: hanni.ali at gmail.com (Hanni Ali)
Date: Fri, 30 Jan 2009 12:32:48 +0000
Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers
In-Reply-To: <9cca9e840901290757y10350efftfbdb8e4204c16795@mail.gmail.com>
References: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com> <497FC559.4000801@ar.media.kyoto-u.ac.jp> <9cca9e840901271931s1706de0egd28382e4125644a@mail.gmail.com> <49801FF8.8040805@ar.media.kyoto-u.ac.jp> <9cca9e840901280818l2106e710q16599bf1423c3caf@mail.gmail.com> <5b8d13220901281636y2fe1b55dt3cb59dd46b43b1d0@mail.gmail.com> <9cca9e840901290757y10350efftfbdb8e4204c16795@mail.gmail.com>
Message-ID: <789d27b10901300432q28226381x829575e8b1fa23fe@mail.gmail.com>

I have been meaning to chip in but so far hadn't got to it, so here goes.

In response to this particular issue, I currently use numpy (1.2.1) built with msvc VS 2008 by simply commenting out these definitions in numpy\core\src\umathmodule.c.src. That works just fine and allows me to use the built-in lapack_lite that comes with numpy on 64-bit windows, no problem.

I have spent many hours working on compiling a different lapack/blas implementation for windows with numpy, so far with no joy, so I would be very pleased if we can finally figure this out. I have previously posted this link on the list:

http://icl.cs.utk.edu/lapack-for-windows/index.html

Using this package, the Intel Visual Fortran compiler and the msvc C compiler, I have managed to get most of numpy to compile against lapack/blas, but the process still trips up at linking with the following message:

warning: build_ext: extension 'numpy.linalg.lapack_lite' has Fortran libraries but no Fortran linker found, using default linker

It complains about missing external symbols.
Creating library build\temp.win-amd64-2.6\Release\numpy\linalg\ lapack_lite.li b and object build\temp.win-amd64-2.6\Release\numpy\linalg\lapack_lite.exp lapack_litemodule.obj : error LNK2019: unresolved external symbol dgeev_ referen ced in function lapack_lite_dgeev lapack_litemodule.obj : error LNK2019: unresolved external symbol dsyevd_ refere nced in function lapack_lite_dsyevd lapack_litemodule.obj : error LNK2019: unresolved external symbol zheevd_ refere nced in function lapack_lite_zheevd lapack_litemodule.obj : error LNK2019: unresolved external symbol dgelsd_ refere nced in function lapack_lite_dgelsd lapack_litemodule.obj : error LNK2019: unresolved external symbol dgesv_ referen ced in function lapack_lite_dgesv lapack_litemodule.obj : error LNK2019: unresolved external symbol dgesdd_ refere nced in function lapack_lite_dgesdd lapack_litemodule.obj : error LNK2019: unresolved external symbol dgetrf_ refere nced in function lapack_lite_dgetrf lapack_litemodule.obj : error LNK2019: unresolved external symbol dpotrf_ refere nced in function lapack_lite_dpotrf lapack_litemodule.obj : error LNK2019: unresolved external symbol dgeqrf_ refere nced in function lapack_lite_dgeqrf lapack_litemodule.obj : error LNK2019: unresolved external symbol dorgqr_ refere nced in function lapack_lite_dorgqr lapack_litemodule.obj : error LNK2019: unresolved external symbol zgeev_ referen ced in function lapack_lite_zgeev lapack_litemodule.obj : error LNK2019: unresolved external symbol zgelsd_ refere nced in function lapack_lite_zgelsd lapack_litemodule.obj : error LNK2019: unresolved external symbol zgesv_ referen ced in function lapack_lite_zgesv lapack_litemodule.obj : error LNK2019: unresolved external symbol zgesdd_ refere nced in function lapack_lite_zgesdd lapack_litemodule.obj : error LNK2019: unresolved external symbol zgetrf_ refere nced in function lapack_lite_zgetrf lapack_litemodule.obj : error LNK2019: unresolved external symbol zpotrf_ refere nced in function lapack_lite_zpotrf lapack_litemodule.obj : error LNK2019: unresolved external symbol zgeqrf_ refere nced in function lapack_lite_zgeqrf lapack_litemodule.obj : error LNK2019: unresolved external symbol zungqr_ refere nced in function lapack_lite_zungqr build\lib.win-amd64-2.6\numpy\linalg\lapack_lite.pyd : fatal error LNK1120: 18 u nresolved externals error: Command "C:\Program Files (x86)\Microsoft Visual Studio 9.0\VC\BIN\amd64\ link.exe /DLL /nologo /INCREMENTAL:NO /LIBPATH:"C:\Program Files (x86)\Universit y Of Tennessee\LAPACK 3.1.1\lib\x64" /LIBPATH:C:\Python26\libs /LIBPATH:C:\Pytho n26\PCbuild\amd64 LAPACK.lib BLAS.lib /EXPORT:initlapack_lite build\temp.win-amd 64-2.6\Release\numpy\linalg\lapack_litemodule.obj /OUT:build\lib.win-amd64-2.6\n umpy\linalg\lapack_lite.pyd /IMPLIB:build\temp.win-amd64-2.6\Release\numpy\linal g\lapack_lite.lib /MANIFESTFILE:build\temp.win-amd64-2.6\Release\numpy\linalg\la pack_lite.pyd.manifest" failed with exit status 1120 I suspect persuading setup.py to use the appropriate linker will sort this out, but I haven't had time to address what - I hope - could be the final hurdle. Hanni 2009/1/29 Michael Colonno > OK, some progress here. I have two questions: 1) Let's forget about the > Intel compiler(s) for the moment and focus on a standard Windows build. > Python 2.6 comes with two classes in distutils: msvccompiler.py and > msvc9compiler.py. In reading through these, it appears msvc9compiler.py is > ideally suited for what I'm trying to do (use VS 2008 tools). 
Is it possible > to select this via command line flags with config --compiler=xxx? Choosing > "msvc" *seems* to go for msvccompiler.py (I'm just tyring to trace the > magic as things build). 2) when using the standard msvc setup, things do > seem to work re: setting up the build environemnt (see below). Now, the VS > compiler kicks out a syntax error (output copied below). Any thoughts? This > looks like a peculiarity of the VS compiler but I'm not sure. (I tend to > prefer the Intel C compiler because it is more "Linux-like" and seems to be > happier with cross-platform code.) > > Thanks, > ~Mike C. > > ----------------------- > > [copying.... output edited for bevity] > > running build_ext > No module named msvccompiler in numpy.distutils; trying from distutils > customize MSVCCompiler > customize MSVCCompiler using build_ext > building 'numpy.core.multiarray' extension > compiling C sources > creating build\temp.win-amd64-2.6 > creating build\temp.win-amd64-2.6\Release > creating build\temp.win-amd64-2.6\Release\numpy > creating build\temp.win-amd64-2.6\Release\numpy\core > creating build\temp.win-amd64-2.6\Release\numpy\core\src > C:\Program Files (x86)\Microsoft Visual Studio 9.0\VC\BIN\amd64\cl.exe /c > /nolog > o /Ox /MD /W3 /GS- /DNDEBUG -Ibuild\src.win-amd64-2.6\numpy\core\src > -Inumpy\cor > e\include -Ibuild\src.win-amd64-2.6\numpy\core\include/numpy > -Inumpy\core\src -I > numpy\core\include -IC:\Python26\include -IC:\Python26\PC > /Tcnumpy\core\src\mult > iarraymodule.c > /Fobuild\temp.win-amd64-2.6\Release\numpy\core\src\multiarraymodu > le.obj > C:\Program Files (x86)\Microsoft Visual Studio 9.0\VC\BIN\amd64\link.exe > /DLL /n > ologo /INCREMENTAL:NO /LIBPATH:C:\Python26\libs > /LIBPATH:C:\Python26\PCbuild\amd > 64 /EXPORT:initmultiarray > build\temp.win-amd64-2.6\Release\numpy\core\src\multia > rraymodule.obj /OUT:build\lib.win-amd64-2.6\numpy\core\multiarray.pyd > /IMPLIB:bu > ild\temp.win-amd64-2.6\Release\numpy\core\src\multiarray.lib > /MANIFESTFILE:build > \temp.win-amd64-2.6\Release\numpy\core\src\multiarray.pyd.manifest > mt.exe -nologo -manifest > build\temp.win-amd64-2.6\Release\numpy\core\src\multiar > ray.pyd.manifest > -outputresource:build\lib.win-amd64-2.6\numpy\core\multiarray.p > yd;2 > building 'numpy.core.umath' extension > compiling C sources > creating build\temp.win-amd64-2.6\Release\build > creating build\temp.win-amd64-2.6\Release\build\src.win-amd64-2.6 > creating build\temp.win-amd64-2.6\Release\build\src.win-amd64-2.6\numpy > creating > build\temp.win-amd64-2.6\Release\build\src.win-amd64-2.6\numpy\core > creating > build\temp.win-amd64-2.6\Release\build\src.win-amd64-2.6\numpy\core\src > > C:\Program Files (x86)\Microsoft Visual Studio 9.0\VC\BIN\amd64\cl.exe /c > /nolog > o /Ox /MD /W3 /GS- /DNDEBUG -Ibuild\src.win-amd64-2.6\numpy\core\src > -Inumpy\cor > e\include -Ibuild\src.win-amd64-2.6\numpy\core\include/numpy > -Inumpy\core\src -I > numpy\core\include -IC:\Python26\include -IC:\Python26\PC > /Tcbuild\src.win-amd64 > -2.6\numpy\core\src\umathmodule.c > /Fobuild\temp.win-amd64-2.6\Release\build\src. 
> win-amd64-2.6\numpy\core\src\umathmodule.obj > umathmodule.c > numpy\core\src\umathmodule.c.src(64) : error C2059: syntax error : 'type' > numpy\core\src\umathmodule.c.src(70) : error C2059: syntax error : 'type' > numpy\core\src\ufuncobject.c(1704) : warning C4244: '=' : conversion from > 'npy_i > ntp' to 'int', possible loss of data > numpy\core\src\ufuncobject.c(2425) : warning C4244: '=' : conversion from > 'npy_i > ntp' to 'int', possible loss of data > error: Command "C:\Program Files (x86)\Microsoft Visual Studio > 9.0\VC\BIN\amd64\ > cl.exe /c /nologo /Ox /MD /W3 /GS- /DNDEBUG > -Ibuild\src.win-amd64-2.6\numpy\core > \src -Inumpy\core\include > -Ibuild\src.win-amd64-2.6\numpy\core\include/numpy -In > umpy\core\src -Inumpy\core\include -IC:\Python26\include -IC:\Python26\PC > /Tcbui > ld\src.win-amd64-2.6\numpy\core\src\umathmodule.c > /Fobuild\temp.win-amd64-2.6\Re > lease\build\src.win-amd64-2.6\numpy\core\src\umathmodule.obj" failed with > exit s > tatus 2 > > ----------------------- > > > On Wed, Jan 28, 2009 at 4:36 PM, David Cournapeau wrote: > >> On Thu, Jan 29, 2009 at 1:18 AM, Michael Colonno >> wrote: >> > I think this is doable; thankfully the Intel compilers on Windows and >> > Linux are very similar in behavior. >> >> The problem is that distutils does not abstract this kind of things: >> you have a CCompiler class, and a subclass Unix C Compiler, and then >> Intel Compiler. OTOH, the MS compiler is its own class which inherit >> from CCompiler - all windows specifics are encoded in this class. So I >> am afraid you will need to recreate all this class implementation for >> Intel C Compiler, because contrary to the Linux case, nothing is >> abstracted for windows. >> >> > As an aside: how were the Windows 32-bit installers created? >> >> With mingw compiler. >> >> > Is >> > it possible to recreate this process changing the target arch --> x64? >> >> If you can build numpy with the Intel compiler, building the installer >> should be a no-brainer. >> >> cheers, >> >> David >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion >> > > > _______________________________________________ > Numpy-discussion mailing list > Numpy-discussion at scipy.org > http://projects.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndbecker2 at gmail.com Fri Jan 30 08:18:36 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 30 Jan 2009 08:18:36 -0500 Subject: [Numpy-discussion] minor improvment to ones Message-ID: A nit, but it would be nice if 'ones' could fill with a value other than 1. Maybe an optional val= keyword? From david at ar.media.kyoto-u.ac.jp Fri Jan 30 08:05:42 2009 From: david at ar.media.kyoto-u.ac.jp (David Cournapeau) Date: Fri, 30 Jan 2009 22:05:42 +0900 Subject: [Numpy-discussion] minor improvment to ones In-Reply-To: References: Message-ID: <4982FB26.1010509@ar.media.kyoto-u.ac.jp> Neal Becker wrote: > A nit, but it would be nice if 'ones' could fill with a value other than 1. > > Maybe an optional val= keyword? > What would be the advantage compared to fill ? 
I would guess ones and zeros are special because those two values are special (they can be defined for many types, as neutral elements for + and *), David From scott.sinclair.za at gmail.com Fri Jan 30 08:54:52 2009 From: scott.sinclair.za at gmail.com (Scott Sinclair) Date: Fri, 30 Jan 2009 15:54:52 +0200 Subject: [Numpy-discussion] minor improvment to ones In-Reply-To: <4982FB26.1010509@ar.media.kyoto-u.ac.jp> References: <4982FB26.1010509@ar.media.kyoto-u.ac.jp> Message-ID: <6a17e9ee0901300554j516b6005hd44c5978b92904ee@mail.gmail.com> > 2009/1/30 David Cournapeau : > Neal Becker wrote: >> A nit, but it would be nice if 'ones' could fill with a value other than 1. >> >> Maybe an optional val= keyword? > > What would be the advantage compared to fill ? I would guess ones and > zeros are special because those two values are special (they can be > defined for many types, as neutral elements for + and *), I couldn't find the numpy fill function, until my tiny brain realized you meant the ndarray method: http://docs.scipy.org/doc/numpy/reference/generated/numpy.ndarray.fill.html Cheers, Scott From chaos.proton at gmail.com Fri Jan 30 09:07:03 2009 From: chaos.proton at gmail.com (Grissiom) Date: Fri, 30 Jan 2009 22:07:03 +0800 Subject: [Numpy-discussion] minor improvment to ones In-Reply-To: <6a17e9ee0901300554j516b6005hd44c5978b92904ee@mail.gmail.com> References: <4982FB26.1010509@ar.media.kyoto-u.ac.jp> <6a17e9ee0901300554j516b6005hd44c5978b92904ee@mail.gmail.com> Message-ID: On Fri, Jan 30, 2009 at 21:54, Scott Sinclair wrote: > > 2009/1/30 David Cournapeau : > > Neal Becker wrote: > >> A nit, but it would be nice if 'ones' could fill with a value other than > 1. > >> > >> Maybe an optional val= keyword? > > > > What would be the advantage compared to fill ? I would guess ones and > > zeros are special because those two values are special (they can be > > defined for many types, as neutral elements for + and *), > > I couldn't find the numpy fill function, until my tiny brain realized > you meant the ndarray method: > > http://docs.scipy.org/doc/numpy/reference/generated/numpy.ndarray.fill.html > > Cheers, > Scott > Is fill function has any advantage over ones(size)*x ? -- Cheers, Grissiom -------------- next part -------------- An HTML attachment was scrubbed... URL: From sturla at molden.no Fri Jan 30 09:11:03 2009 From: sturla at molden.no (Sturla Molden) Date: Fri, 30 Jan 2009 15:11:03 +0100 Subject: [Numpy-discussion] minor improvment to ones In-Reply-To: References: Message-ID: <49830A77.3070507@molden.no> On 1/30/2009 2:18 PM, Neal Becker wrote: > A nit, but it would be nice if 'ones' could fill with a value other than 1. > > Maybe an optional val= keyword? > I am -1 on this. Ones should fill with ones, zeros should fill with zeros. Anything else is counter-intuitive. Calling numpy.ones to fill with fives makes no sense to me. But I would be +1 on having a function called numpy.values or numpy.fill that would create and fill an ndarray with arbitrary values. S.M. From matthieu.brucher at gmail.com Fri Jan 30 09:12:21 2009 From: matthieu.brucher at gmail.com (Matthieu Brucher) Date: Fri, 30 Jan 2009 15:12:21 +0100 Subject: [Numpy-discussion] minor improvment to ones In-Reply-To: References: <4982FB26.1010509@ar.media.kyoto-u.ac.jp> <6a17e9ee0901300554j516b6005hd44c5978b92904ee@mail.gmail.com> Message-ID: > Is fill function has any advantage over ones(size)*x ? No intermediate array (inplace) ? Matthieu -- Information System Engineer, Ph.D. 
Website: http://matthieu-brucher.developpez.com/ Blogs: http://matt.eifelle.com and http://blog.developpez.com/?blog=92 LinkedIn: http://www.linkedin.com/in/matthieubrucher

From sturla at molden.no Fri Jan 30 09:16:25 2009 From: sturla at molden.no (Sturla Molden) Date: Fri, 30 Jan 2009 15:16:25 +0100 Subject: [Numpy-discussion] minor improvment to ones In-Reply-To: References: <4982FB26.1010509@ar.media.kyoto-u.ac.jp> <6a17e9ee0901300554j516b6005hd44c5978b92904ee@mail.gmail.com> Message-ID: <49830BB9.8040504@molden.no>

On 1/30/2009 3:07 PM, Grissiom wrote: > Does the fill function have any advantage over ones(size)*x ?

You avoid filling with ones, all the multiplications and creating a temporary array. It can be done like this:

a = empty(size)
a[:] = x

Which would be slightly faster and more memory efficient.

S.M.

From ndbecker2 at gmail.com Fri Jan 30 09:22:46 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 30 Jan 2009 09:22:46 -0500 Subject: [Numpy-discussion] minor improvment to ones References: <4982FB26.1010509@ar.media.kyoto-u.ac.jp> <6a17e9ee0901300554j516b6005hd44c5978b92904ee@mail.gmail.com> Message-ID:

Right now there are 2 options to create an array of constant value:

1) empty (size); fill (val)

2) ones (size) * val

1 has the disadvantage of not being an expression, so can't be an arg to a function call. Also probably slower than create+fill @ same time. 2 is probably slower than create+fill @ same time.

Now what would be _really_ cool is a special array type that would represent a constant array without wasting memory. boost::ublas, for example, has this feature.

From gael.varoquaux at normalesup.org Fri Jan 30 09:28:42 2009 From: gael.varoquaux at normalesup.org (Gael Varoquaux) Date: Fri, 30 Jan 2009 15:28:42 +0100 Subject: [Numpy-discussion] minor improvment to ones In-Reply-To: References: <4982FB26.1010509@ar.media.kyoto-u.ac.jp> <6a17e9ee0901300554j516b6005hd44c5978b92904ee@mail.gmail.com> Message-ID: <20090130142842.GB23594@phare.normalesup.org>

On Fri, Jan 30, 2009 at 09:22:46AM -0500, Neal Becker wrote: > Now what would be _really_ cool is a special array type that would represent > a constant array without wasting memory. boost::ublas, for example, has > this feature.

What I would actually like is some kind of sparse masked array, which would be a generalization of what you are talking about (your case would be the trivial case of mask=False, and a given fill value).

Gaël

From chaos.proton at gmail.com Fri Jan 30 10:09:07 2009 From: chaos.proton at gmail.com (Grissiom) Date: Fri, 30 Jan 2009 23:09:07 +0800 Subject: [Numpy-discussion] minor improvment to ones In-Reply-To: <49830BB9.8040504@molden.no> References: <4982FB26.1010509@ar.media.kyoto-u.ac.jp> <6a17e9ee0901300554j516b6005hd44c5978b92904ee@mail.gmail.com> <49830BB9.8040504@molden.no> Message-ID:

On Fri, Jan 30, 2009 at 22:16, Sturla Molden wrote: > On 1/30/2009 3:07 PM, Grissiom wrote: > > > Does the fill function have any advantage over ones(size)*x ? > > You avoid filling with ones, all the multiplications and creating a > temporary array. It can be done like this: > > a = empty(size) > a[:] = x > > Which would be slightly faster and more memory efficient. >

Hmm, I +1 on this one. It's more pythonic ;)

-- Cheers, Grissiom -------------- next part -------------- An HTML attachment was scrubbed... URL:
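A minimal sketch of the kind of filled() helper being floated in this thread, wrapping the empty-plus-fill idiom so that it can be used as an expression (the name filled is hypothetical here; it is not an existing numpy function):

import numpy as np

def filled(shape, value, dtype=None):
    # allocate uninitialized memory, then fill it in place -- the same
    # pattern ones() and zeros() use internally
    a = np.empty(shape, dtype=dtype)
    a.fill(value)
    return a

print filled((2, 3), 5.0)   # usable directly as an argument to a function call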
From rmay31 at gmail.com Fri Jan 30 10:19:14 2009 From: rmay31 at gmail.com (Ryan May) Date: Fri, 30 Jan 2009 09:19:14 -0600 Subject: [Numpy-discussion] minor improvment to ones In-Reply-To: <49830A77.3070507@molden.no> References: <49830A77.3070507@molden.no> Message-ID: <49831A72.2060400@gmail.com>

Sturla Molden wrote: > On 1/30/2009 2:18 PM, Neal Becker wrote: >> A nit, but it would be nice if 'ones' could fill with a value other than 1. >> >> Maybe an optional val= keyword? >> > > I am -1 on this. Ones should fill with ones, zeros should fill with > zeros. Anything else is counter-intuitive. Calling numpy.ones to fill > with fives makes no sense to me. But I would be +1 on having a function > called numpy.values or numpy.fill that would create and fill an ndarray > with arbitrary values.

I completely agree here.

Ryan

-- Ryan May Graduate Research Assistant School of Meteorology University of Oklahoma

From sturla at molden.no Fri Jan 30 10:35:55 2009 From: sturla at molden.no (Sturla Molden) Date: Fri, 30 Jan 2009 16:35:55 +0100 Subject: [Numpy-discussion] minor improvment to ones In-Reply-To: References: <4982FB26.1010509@ar.media.kyoto-u.ac.jp> <6a17e9ee0901300554j516b6005hd44c5978b92904ee@mail.gmail.com> Message-ID: <49831E5B.8000907@molden.no>

On 1/30/2009 3:22 PM, Neal Becker wrote: > Now what would be _really_ cool is a special array type that would represent > a constant array without wasting memory.

Which again is a special case of something even cooler: lazy evaluation. This would require arrays to have immutable buffers. Then an expression like "a*x + y" would result in a symbolic representation of a * x.buffer + y.buffer. Then the array could evaluate itself (it could even use numexpr or a JIT compiler) when needed. The scheme would be very fragile and complicated if arrays were allowed to have mutable data buffers like those of numpy.

Sturla Molden

From david.froger.info at gmail.com Fri Jan 30 11:03:44 2009 From: david.froger.info at gmail.com (David Froger) Date: Fri, 30 Jan 2009 17:03:44 +0100 Subject: [Numpy-discussion] example reading binary Fortran file Message-ID:

Hi, my question is about reading Fortran binary files (oh no, this question again...). Until now, I was using struct's unpack like this:

def lread(f, fourBeginning, fourEnd, *arrays):
    """Read a Fortran binary file in little-endian order."""
    from struct import unpack
    if fourBeginning: f.seek(4,1)
    for array in arrays:
        for elt in xrange(array.size):
            transpose(array).flat[elt] = unpack(array.dtype.char, f.read(array.itemsize))[0]
    if fourEnd: f.seek(4,1)

After googling, I read that fopen and npfile were deprecated, and that we should use numpy.fromfile and ndarray.tofile, but despite the documentation, the cookbook, the mailing list and google, I don't succeed in making a simple example work. Considering the simple Fortran code below, what is the Python script to read the three arrays? What if my pc is little-endian and the file big-endian? I think it would be a good idea to put the Fortran array-writing code and the Python array-reading script in the cookbook, and maybe a page to help people coming from Fortran get started with Python?
Best, David Froger

program makeArray
  implicit none
  integer,parameter:: nx=10,ny=20
  real(4),dimension(nx,ny):: ux,uy,p
  integer :: i,j
  open(11,file='uxuyp.bin',form='unformatted')
  do i = 1,nx
    do j = 1,ny
      ux(i,j) = real(i*j)
      uy(i,j) = real(i)/real(j)
      p (i,j) = real(i) + real(j)
    enddo
  enddo
  write(11) ux,uy
  write(11) p
  close(11)
end program makeArray

-------------- next part -------------- An HTML attachment was scrubbed... URL:

From sturla at molden.no Fri Jan 30 11:23:45 2009 From: sturla at molden.no (Sturla Molden) Date: Fri, 30 Jan 2009 17:23:45 +0100 Subject: [Numpy-discussion] example reading binary Fortran file In-Reply-To: References: Message-ID: <49832991.1040208@molden.no>

On 1/30/2009 5:03 PM, David Froger wrote: > I think it would be a good idea to put the Fortran array-writing code > and the Python array-reading script in the cookbook, and maybe a page to > help people coming from Fortran get started with Python?

If you want to be completely safe, read the file in Fortran, then send it as an array to Python (use f2py). Aside from that, assuming your compiler only writes the raw bytes in Fortran order to the file: if you have real(4),dimension(nx,ny):: ux in Fortran, and write ux to disk, it could be retrieved like this in NumPy:

ux = np.fromfile(nx*ny, dtype=np.float32).view((nx,ny), order='F')

assuming real(kind=4) is equivalent to np.float32.

Sturla Molden

From ndbecker2 at gmail.com Fri Jan 30 11:26:00 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 30 Jan 2009 11:26 -0500 Subject: [Numpy-discussion] concatenate trouble Message-ID:

What's the problem here?

print np.concatenate (np.ones (10, dtype=complex), np.zeros (10, dtype=complex))
TypeError: only length-1 arrays can be converted to Python scalars

From sturla at molden.no Fri Jan 30 11:27:08 2009 From: sturla at molden.no (Sturla Molden) Date: Fri, 30 Jan 2009 17:27:08 +0100 Subject: [Numpy-discussion] example reading binary Fortran file In-Reply-To: <49832991.1040208@molden.no> References: <49832991.1040208@molden.no> Message-ID: <49832A5C.6040000@molden.no>

On 1/30/2009 5:23 PM, Sturla Molden wrote: > ux = np.fromfile(nx*ny, dtype=np.float32).view((nx,ny), order='F')

oops.. this should be

ux = np.fromfile(file, count=nx*ny, dtype=np.float32).view((nx,ny), order='F')

S.M.

From cimrman3 at ntc.zcu.cz Fri Jan 30 11:28:59 2009 From: cimrman3 at ntc.zcu.cz (Robert Cimrman) Date: Fri, 30 Jan 2009 17:28:59 +0100 Subject: [Numpy-discussion] concatenate trouble In-Reply-To: References: Message-ID: <49832ACB.4000109@ntc.zcu.cz>

Neal Becker wrote: > What's the problem here? > > print np.concatenate (np.ones (10, dtype=complex), np.zeros (10, > dtype=complex)) > TypeError: only length-1 arrays can be converted to Python scalars

You should enclose the arrays you concatenate into a tuple: np.concatenate((a,b)).

r.

From sturla at molden.no Fri Jan 30 11:31:57 2009 From: sturla at molden.no (Sturla Molden) Date: Fri, 30 Jan 2009 17:31:57 +0100 Subject: [Numpy-discussion] example reading binary Fortran file In-Reply-To: <49832A5C.6040000@molden.no> References: <49832991.1040208@molden.no> <49832A5C.6040000@molden.no> Message-ID: <49832B7D.80607@molden.no>

On 1/30/2009 5:27 PM, Sturla Molden wrote: > On 1/30/2009 5:23 PM, Sturla Molden wrote: > >> ux = np.fromfile(nx*ny, dtype=np.float32).view((nx,ny), order='F') > > oops..
this should be > > ux = np.fromfile(file, count=nx*ny, dtype=np.float32).view((nx,ny), > order='F') fu*k ux = np.fromfile(file, count=nx*ny, dtype=np.float32).reshape((nx,ny), order='F') Sorry for the previous typos, it's Friday and soon weekend... Sturla Molden From gary.pajer at gmail.com Fri Jan 30 12:20:13 2009 From: gary.pajer at gmail.com (Gary Pajer) Date: Fri, 30 Jan 2009 12:20:13 -0500 Subject: [Numpy-discussion] Data file format choice. Message-ID: <88fe22a0901300920m6eb1153at49c89b0227b3c5ce@mail.gmail.com> It's time for me to select a data format. My data are (more or less) spectra ( a couple of thousand samples), six channels, each channel running around 10 Hz, collecting for a minute or so. Plus all the settings on the instrument. I don't see any significant differences between netCDF4 and HDF5. Similarly, I don't see significant differences between pytables and h5py. Does one play better with numpy? What are the best numpy solutions for netCDF4? Can anyone provide thoughts, pros and cons, etc, that I can mull over? -gary -------------- next part -------------- An HTML attachment was scrubbed... URL: From Chris.Barker at noaa.gov Fri Jan 30 12:24:14 2009 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Fri, 30 Jan 2009 09:24:14 -0800 Subject: [Numpy-discussion] minor improvment to ones In-Reply-To: <49831E5B.8000907@molden.no> References: <4982FB26.1010509@ar.media.kyoto-u.ac.jp> <6a17e9ee0901300554j516b6005hd44c5978b92904ee@mail.gmail.com> <49831E5B.8000907@molden.no> Message-ID: <498337BE.7020209@noaa.gov> > On 1/30/2009 3:22 PM, Neal Becker wrote: > >> Now what would be _really_ cool is a special array type that would represent >> a constant array without wasting memory. Can't you do that with scary stride tricks? I think I remember some discussion of this a while back. -Chris -- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov From rmay31 at gmail.com Fri Jan 30 12:26:16 2009 From: rmay31 at gmail.com (Ryan May) Date: Fri, 30 Jan 2009 11:26:16 -0600 Subject: [Numpy-discussion] minor improvment to ones In-Reply-To: <498337BE.7020209@noaa.gov> References: <4982FB26.1010509@ar.media.kyoto-u.ac.jp> <6a17e9ee0901300554j516b6005hd44c5978b92904ee@mail.gmail.com> <49831E5B.8000907@molden.no> <498337BE.7020209@noaa.gov> Message-ID: <49833838.30909@gmail.com> Christopher Barker wrote: >> On 1/30/2009 3:22 PM, Neal Becker wrote: >> >>> Now what would be _really_ cool is a special array type that would represent >>> a constant array without wasting memory. > > Can't you do that with scary stride tricks? I think I remember some > discussion of this a while back. I think that's right, but at that point, what gain is that over using a regular constant and relying on numpy's broadcasting? Ryan -- Ryan May Graduate Research Assistant School of Meteorology University of Oklahoma From Chris.Barker at noaa.gov Fri Jan 30 12:30:34 2009 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Fri, 30 Jan 2009 09:30:34 -0800 Subject: [Numpy-discussion] example reading binary Fortran file In-Reply-To: <49832991.1040208@molden.no> References: <49832991.1040208@molden.no> Message-ID: <4983393A.5010702@noaa.gov> > If you want to be completely safe, read the file in Fortran, then send > it as an array to Python (use f2py). 
Aside from that, assuming your > compiler only writes the raw bytes in Fortran order to the file:

Careful -- the last time I read a Fortran-written binary file, I found that the various structures (is that what you call them in Fortran?) were padded with, I think, 4 bytes. I did it by reading through the file bit by bit and parsing it out with the struct module. Not very efficient, but easy to control. In that case, there were a bunch of things written, where a few values described the size of the next array, then the array, etc. If you've got a single array, it'll be easier. You might start that way, and then, when you've got it figured out, translate to fromfile().

-Chris

-- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov
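A bare-bones sketch of that struct-based approach, for a single record of float32 data -- this assumes little-endian values and 4-byte record markers around each record, both of which are compiler-dependent:

import struct

f = open('uxuyp.bin', 'rb')
(nbytes,) = struct.unpack('<i', f.read(4))              # leading record marker
values = struct.unpack('<%df' % (nbytes // 4), f.read(nbytes))
f.read(4)                                               # skip the trailing marker
f.close()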
From raik.gruenberg at crg.es Fri Jan 30 12:47:16 2009 From: raik.gruenberg at crg.es (Raik Gruenberg) Date: Fri, 30 Jan 2009 18:47:16 +0100 Subject: [Numpy-discussion] puzzle: generate index with many ranges Message-ID: <49833D24.7060705@crg.es>

Hi there, perhaps someone has a bright idea for this one: I want to concatenate ranges of numbers into a single array (for indexing). So I have generated an array "a" with starting positions, for example:

a = [4, 0, 11]

I have an array b with stop positions:

b = [11, 4, 15]

and I would like to generate an index array that takes 4..11, then 0..4, then 11..15. In reality, a and b have 10000+ elements and the arrays to be "sliced" are very large, so I want to avoid any for loops etc. Any idea how this could be done? I thought some combination of *repeat* and adding of *arange* should do the trick but just cannot nail it down. Thanks in advance for any hints!

Greetings, Raik

From david.froger.info at gmail.com Fri Jan 30 12:59:02 2009 From: david.froger.info at gmail.com (David Froger) Date: Fri, 30 Jan 2009 18:59:02 +0100 Subject: [Numpy-discussion] example reading binary Fortran file In-Reply-To: <4983393A.5010702@noaa.gov> References: <49832991.1040208@molden.no> <4983393A.5010702@noaa.gov> Message-ID:

Thanks Sturla and Christopher. Yes, with the Fortran code:

!=========================================
program makeArray
  implicit none
  integer,parameter:: nx=2,ny=5
  real(4),dimension(nx,ny):: ux,uy,p
  integer :: i,j
  open(11,file='uxuyp.bin',form='unformatted')
  do i = 1,nx
    do j = 1,ny
      ux(i,j) = 100 + j+(i-1)*10
      uy(i,j) = 200. + j+(i-1)*10
      p (i,j) = 300. + j+(i-1)*10
    enddo
  enddo
  write(11) ux,uy
  write(11) p
  close(11)
  write(*,*) 'ux='
  do i = 1,nx
    write(*,*) ( ux(i,j) , j =1,ny)
  enddo
  write(*,*)
  write(*,*) 'uy='
  do i = 1,nx
    write(*,*) ( uy(i,j) , j =1,ny)
  enddo
  write(*,*)
  write(*,*) 'p='
  do i = 1,nx
    write(*,*) ( p(i,j) , j =1,ny)
  enddo
  write(*,*)
end program makeArray
!=========================================

the size of the 'uxuyp.bin' file is:

4bytes + ux_bytes + uy_bytes + 4bytes + 4bytes + p_bytes + 4bytes
= 4 + nx*ny*4 + nx*ny*4 + 4 + 4 + nx*ny*4 + 4 = 136 bytes

the arrays are:

ux=
 101.00000 102.00000 103.00000 104.00000 105.00000
 111.00000 112.00000 113.00000 114.00000 115.00000

uy=
 201.00000 202.00000 203.00000 204.00000 205.00000
 211.00000 212.00000 213.00000 214.00000 215.00000

p=
 301.00000 302.00000 303.00000 304.00000 305.00000
 311.00000 312.00000 313.00000 314.00000 315.00000

and with the Python script:

#===============================================
import numpy as np

nx,ny = 2,5

fourBytes = np.fromfile('uxuyp.bin', count=1, dtype=np.float32)
ux = np.fromfile('uxuyp.bin', count=nx*ny, dtype=np.float32).reshape((nx,ny), order='F')

print ux
#===============================================

I get:

[[ 1.12103877e-43 1.11000000e+02 1.12000000e+02 1.13000000e+02 1.14000000e+02]
 [ 1.01000000e+02 1.02000000e+02 1.03000000e+02 1.04000000e+02 1.05000000e+02]]

This function does the trick, but is it optimized?

#===============================================
def lread(f, fourBeginning, fourEnd, *arrays):
    """Read a Fortran binary file in little-endian order."""
    from struct import unpack
    if fourBeginning: f.seek(4,1)
    for array in arrays:
        for elt in xrange(array.size):
            transpose(array).flat[elt] = unpack(array.dtype.char, f.read(array.itemsize))[0]
    if fourEnd: f.seek(4,1)
#===============================================

-------------- next part -------------- An HTML attachment was scrubbed... URL:
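Putting the pieces together, a possible fromfile-based replacement for lread -- a sketch that assumes 4-byte record markers around each record and native little-endian data, both of which are compiler-dependent:

import numpy as np

def read_record(f, dtype=np.float32):
    # one Fortran sequential record: 4-byte length, payload, 4-byte length
    n = int(np.fromfile(f, dtype=np.int32, count=1)[0])
    data = np.fromfile(f, dtype=dtype, count=n // np.dtype(dtype).itemsize)
    np.fromfile(f, dtype=np.int32, count=1)      # skip the trailing marker
    return data

nx, ny = 2, 5
f = open('uxuyp.bin', 'rb')
uxuy = read_record(f)                            # first record holds ux, then uy
ux = uxuy[:nx*ny].reshape((nx, ny), order='F')
uy = uxuy[nx*ny:].reshape((nx, ny), order='F')
p = read_record(f).reshape((nx, ny), order='F')
f.close()

Note that the stray 1.12103877e-43 in the output above is exactly an 80-byte record length (the leading 4-byte marker) being misread as a float32.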
From Jim.Vickroy at noaa.gov Fri Jan 30 13:01:15 2009 From: Jim.Vickroy at noaa.gov (Jim Vickroy) Date: Fri, 30 Jan 2009 11:01:15 -0700 Subject: [Numpy-discussion] puzzle: generate index with many ranges In-Reply-To: <49833D24.7060705@crg.es> References: <49833D24.7060705@crg.es> Message-ID: <4983406B.60301@noaa.gov>

Raik Gruenberg wrote: > Hi there, > > perhaps someone has a bright idea for this one: > > I want to concatenate ranges of numbers into a single array (for indexing). So I > have generated an array "a" with starting positions, for example: > > a = [4, 0, 11] > > I have an array b with stop positions: > > b = [11, 4, 15] > > and I would like to generate an index array that takes 4..11, then 0..4, then > 11..15. >

Does this help?

Python 2.5.4 (r254:67916, Dec 23 2008, 15:10:54) [MSC v.1310 32 bit (Intel)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> a = [4, 0, 11]
>>> b = [11, 4, 15]
>>> zip(a,b)
[(4, 11), (0, 4), (11, 15)]
>>>

Apologies if I'm stating the obvious.

-- jv

> In reality, a and b have 10000+ elements and the arrays to be "sliced" are very > large, so I want to avoid any for loops etc. Any idea how this could be done? I > thought some combination of *repeat* and adding of *arange* should do the trick > but just cannot nail it down. > > Thanks in advance for any hints! > > Greetings, > Raik

From sebastian.walter at gmail.com Fri Jan 30 13:03:00 2009 From: sebastian.walter at gmail.com (Sebastian Walter) Date: Fri, 30 Jan 2009 19:03:00 +0100 Subject: [Numpy-discussion] using numpy functions on an array of objects Message-ID:

Hey, what is the best solution to get this code working? Does anyone have a good idea?

------------------ test.py -----------------------------------------------
import numpy
import numpy.linalg

class afloat:
    def __init__(self,x):
        self.x = x
    def __add__(self,rhs):
        return self.x + rhs.x
    def sin(self):
        return numpy.sin(self.x)
    def inv(self):
        return numpy.linalg.inv(self.x)
    def trace(self):
        return 0

y = afloat(numpy.eye(3))
z = afloat(numpy.ones(3))
print y + z                # works
print numpy.sin(y)         # works
print numpy.trace(y)       # doesn't work...???
print numpy.linalg.inv(y)  # doesn't work ...???
-------------------- end test.py --------------------------------

=== Explanation why I need that ===
I have the following problem. I need to do numerical calculations on generalized versions of real numbers, in particular with truncated Taylor polynomials. I've implemented that as a class that I called TC. To define what the multiplication of two Taylor polynomials is, I use operator overloading. Additionally, I need to compute sine, cosine, exp, etc. of Taylor polynomials. For that, I can use the numpy functions. Numpy is apparently smart enough to call the member function sin(self) of my class afloat when it realizes that the argument of numpy.sin is not a known type. This is really great. However, some functions are not as smart. Among them: trace, inv, dot. As a workaround, I could do this:

def inv(X):
    if X.__class__ == afloat:
        return X.inv()
    else:
        return numpy.linalg.inv(X)

This is somewhat OK, but I'd like to use already existing Python code that uses Numpy internally. So I have to rely on numpy.linalg.inv(X) calling X.inv() when X is an object of my class.

best regards, Sebastian
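A slightly more general form of Sebastian's workaround is to dispatch on whether the object provides its own method, rather than on the exact class -- a sketch, not something numpy.linalg does for you:

import numpy

def inv(X):
    # plain ndarrays have no .inv() method, so they fall through to numpy.linalg
    if hasattr(X, 'inv'):
        return X.inv()
    return numpy.linalg.inv(X)

Existing library code would still have to call such a wrapper instead of numpy.linalg.inv directly, which is exactly the limitation Sebastian points out.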
From raik.gruenberg at crg.es Fri Jan 30 13:11:10 2009 From: raik.gruenberg at crg.es (Raik Gruenberg) Date: Fri, 30 Jan 2009 19:11:10 +0100 Subject: [Numpy-discussion] puzzle: generate index with many ranges In-Reply-To: <4983406B.60301@noaa.gov> References: <49833D24.7060705@crg.es> <4983406B.60301@noaa.gov> Message-ID: <498342BE.4080002@crg.es>

Jim Vickroy wrote: > Raik Gruenberg wrote: >> Hi there, >> >> perhaps someone has a bright idea for this one: >> >> I want to concatenate ranges of numbers into a single array (for indexing). So I >> have generated an array "a" with starting positions, for example: >> >> a = [4, 0, 11] >> >> I have an array b with stop positions: >> >> b = [11, 4, 15] >> >> and I would like to generate an index array that takes 4..11, then 0..4, then >> 11..15. >> > Does this help? > > >>> a = [4, 0, 11] > >>> b = [11, 4, 15] > >>> zip(a,b) > [(4, 11), (0, 4), (11, 15)] > >>>

Mhm, I got this far. But how do I get from here to a single index array

[ 4, 5, 6, ... 10, 0, 1, 2, 3, 11, 12, 13, 14 ] ?

Greetings, Raik

> Apologies if I'm stating the obvious. > > -- jv >> In reality, a and b have 10000+ elements and the arrays to be "sliced" are very >> large, so I want to avoid any for loops etc. Any idea how this could be done? I >> thought some combination of *repeat* and adding of *arange* should do the trick >> but just cannot nail it down. >> >> Thanks in advance for any hints!

-- ________________________________ Dr. Raik Gruenberg http://www.raiks.de/contact.html ________________________________

From rmay31 at gmail.com Fri Jan 30 13:11:57 2009 From: rmay31 at gmail.com (Ryan May) Date: Fri, 30 Jan 2009 12:11:57 -0600 Subject: [Numpy-discussion] example reading binary Fortran file In-Reply-To: References: <49832991.1040208@molden.no> <4983393A.5010702@noaa.gov> Message-ID: <498342ED.4060907@gmail.com>

David Froger wrote: > import numpy as np > > nx,ny = 2,5 > > fourBytes = np.fromfile('uxuyp.bin', count=1, dtype=np.float32) > ux = np.fromfile('uxuyp.bin', count=nx*ny, dtype=np.float32).reshape((nx,ny), order='F') > > print ux > > This function does the trick, but is it optimized? > > def lread(f, fourBeginning, fourEnd, *arrays): > """Read a Fortran binary file in little-endian order.""" > from struct import unpack > if fourBeginning: f.seek(4,1) > for array in arrays: > for elt in xrange(array.size): > transpose(array).flat[elt] = unpack(array.dtype.char, f.read(array.itemsize))[0] > if fourEnd: f.seek(4,1)

I'm not sure whether or not it's optimized, but I can tell you that the "mystery" 4 bytes are the number of bytes it wrote out, followed by that number of bytes of data.

Ryan

-- Ryan May Graduate Research Assistant School of Meteorology University of Oklahoma

From jswhit at fastmail.fm Fri Jan 30 12:31:25 2009 From: jswhit at fastmail.fm (Jeff Whitaker) Date: Fri, 30 Jan 2009 10:31:25 -0700 Subject: [Numpy-discussion] Data file format choice. In-Reply-To: <88fe22a0901300920m6eb1153at49c89b0227b3c5ce@mail.gmail.com> References: <88fe22a0901300920m6eb1153at49c89b0227b3c5ce@mail.gmail.com> Message-ID: <4983396D.6030903@fastmail.fm>

Gary Pajer wrote: > It's time for me to select a data format. > > My data are (more or less) spectra (a couple of thousand samples), six channels, each channel running around 10 Hz, collecting for a minute or so. Plus all the settings on the instrument. > > I don't see any significant differences between netCDF4 and HDF5.

Gary: netCDF4 is just a thin wrapper on top of HDF5 1.8 - think of it as a higher level API.

> Similarly, I don't see significant differences between pytables and > h5py. Does one play better with numpy?

pytables has been around longer and is well-tested, has nice pythonic features, but files you write with it may not be readable by C or fortran clients. h5py works only with python 2.5/2.6, and writes 'vanilla' hdf5 files readable by anybody.

> What are the best numpy solutions for netCDF4?

There's only one that I know of - http://code.google.com/p/netcdf4-python.

-Jeff

-- Jeffrey S.
Whitaker Phone : (303)497-6313 Meteorologist FAX : (303)497-6449 NOAA/OAR/PSD R/PSD1 Email : Jeffrey.S.Whitaker at noaa.gov 325 Broadway Office : Skaggs Research Cntr 1D-113 Boulder, CO, USA 80303-3328 Web : http://tinyurl.com/5telg From Jim.Vickroy at noaa.gov Fri Jan 30 13:37:41 2009 From: Jim.Vickroy at noaa.gov (Jim Vickroy) Date: Fri, 30 Jan 2009 11:37:41 -0700 Subject: [Numpy-discussion] puzzle: generate index with many ranges In-Reply-To: <498342BE.4080002@crg.es> References: <49833D24.7060705@crg.es> <4983406B.60301@noaa.gov> <498342BE.4080002@crg.es> Message-ID: <498348F5.50506@noaa.gov> Raik Gruenberg wrote: > Jim Vickroy wrote: > >> Raik Gruenberg wrote: >> >>> Hi there, >>> >>> perhaps someone has a bright idea for this one: >>> >>> I want to concatenate ranges of numbers into a single array (for indexing). So I >>> have generated an array "a" with starting positions, for example: >>> >>> a = [4, 0, 11] >>> >>> I have an array b with stop positions: >>> >>> b = [11, 4, 15] >>> >>> and I would like to generate an index array that takes 4..11, then 0..4, then >>> 11..15. >>> >>> >> Does this help? >> >> Python 2.5.4 (r254:67916, Dec 23 2008, 15:10:54) [MSC v.1310 32 bit >> (Intel)] on win32 >> Type "help", "copyright", "credits" or "license" for more information. >> >>> a = [4, 0, 11] >> >>> b = [11, 4, 15] >> >>> zip(a,b) >> [(4, 11), (0, 4), (11, 15)] >> >>> >> > > Mhm, I got this far. But how do I get from here to a single index array > > [ 4, 5, 6, ... 10, 0, 1, 2, 3, 11, 12, 13, 14 ] ? > > not sure I understand your goal ... is this what you want: >>> [range(i,j) for i,j in zip(a,b)] [[4, 5, 6, 7, 8, 9, 10], [0, 1, 2, 3], [11, 12, 13, 14]] >>> > Greetings > Raik > > > > > > >> Apologies if I'm stating the obvious. >> >> -- jv >> >>> In reality, a and b have 10000+ elements and the arrays to be "sliced" are very >>> large so I want to avoid any for loops etc. Any idea how this could be done? I >>> thought some combination of *repeat* and adding of *arange* should do the trick >>> but just cannot nail it down. >>> >>> Thanks in advance for any hints! >>> >>> Greetings, >>> Raik >>> >>> >>> _______________________________________________ >>> Numpy-discussion mailing list >>> Numpy-discussion at scipy.org >>> http://projects.scipy.org/mailman/listinfo/numpy-discussion >>> >>> >> _______________________________________________ >> Numpy-discussion mailing list >> Numpy-discussion at scipy.org >> http://projects.scipy.org/mailman/listinfo/numpy-discussion >> >> >> > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From pgmdevlist at gmail.com Fri Jan 30 13:44:46 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Fri, 30 Jan 2009 13:44:46 -0500 Subject: [Numpy-discussion] puzzle: generate index with many ranges In-Reply-To: <498342BE.4080002@crg.es> References: <49833D24.7060705@crg.es> <4983406B.60301@noaa.gov> <498342BE.4080002@crg.es> Message-ID: <7D622E3E-D52F-43F1-9379-1531A44792F8@gmail.com> On Jan 30, 2009, at 1:11 PM, Raik Gruenberg wrote: > > Mhm, I got this far. But how do I get from here to a single index > array > > [ 4, 5, 6, ... 10, 0, 1, 2, 3, 11, 12, 13, 14 ] ? 
np.concatenate([np.arange(aa,bb) for (aa,bb) in zip(a,b)])

From raik.gruenberg at crg.es Fri Jan 30 13:53:57 2009 From: raik.gruenberg at crg.es (Raik Gruenberg) Date: Fri, 30 Jan 2009 19:53:57 +0100 Subject: [Numpy-discussion] puzzle: generate index with many ranges In-Reply-To: <7D622E3E-D52F-43F1-9379-1531A44792F8@gmail.com> References: <49833D24.7060705@crg.es> <4983406B.60301@noaa.gov> <498342BE.4080002@crg.es> <7D622E3E-D52F-43F1-9379-1531A44792F8@gmail.com> Message-ID: <49834CC5.4080509@crg.es>

Pierre GM wrote: > On Jan 30, 2009, at 1:11 PM, Raik Gruenberg wrote: > >> Mhm, I got this far. But how do I get from here to a single index >> array >> >> [ 4, 5, 6, ... 10, 0, 1, 2, 3, 11, 12, 13, 14 ] ? > > np.concatenate([np.arange(aa,bb) for (aa,bb) in zip(a,b)])

exactly! Now, the question was: is there a way to do this using only numpy functions (sum, repeat, ...), that is, without any python "for" loop? Sorry about being so insistent on this one but, in my experience, eliminating those for loops makes a huge difference in terms of speed. The zip is probably also quite costly on a very large data set. Thanks!

Raik

-- ________________________________ Dr. Raik Gruenberg http://www.raiks.de/contact.html ________________________________
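For reference, a loop-free construction with exactly those building blocks (repeat, cumsum, arange) is possible -- a small sketch using Raik's example values, producing the index he lists:

import numpy as np

a = np.array([4, 0, 11])
b = np.array([11, 4, 15])

lens = b - a                                           # length of each range
offsets = np.concatenate(([0], np.cumsum(lens)[:-1]))  # start of each range in the output
idx = np.arange(lens.sum()) + np.repeat(a - offsets, lens)
# idx -> [ 4  5  6  7  8  9 10  0  1  2  3 11 12 13 14]

The idea is to build one long arange and then shift each segment by a per-segment constant, which repeat() supplies without any Python loop.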
From f.yw at hotmail.com Fri Jan 30 13:58:21 2009 From: f.yw at hotmail.com (frank wang) Date: Fri, 30 Jan 2009 11:58:21 -0700 Subject: [Numpy-discussion] help on fast slicing on a grid In-Reply-To: <3d375d730901282215t169158ccvfc406bc30a9cdd9a@mail.gmail.com> References: <49811036.5030701@ar.media.kyoto-u.ac.jp> <3d375d730901282157l4797a93ex293330d5c7d057c0@mail.gmail.com> <3d375d730901282215t169158ccvfc406bc30a9cdd9a@mail.gmail.com> Message-ID:

I have created a test example for the question using a for loop and hope someone can help me find a fast solution. My data set is about 2000000 points. However, I have a problem running the code: the Out[i]=cnstl[j] line gives me an error that says:

In [107]: Out[0]=cnstl[0]
---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
C:\Frank_share\qamslicer.py in ()
----> 1 2 3 4 5
TypeError: can't convert complex to float; use abs(z)
In [108]: cnstl.dtype
Out[108]: dtype('complex128')

I do not know why, since my data is complex128 already. Can anyone help me figure out why? Thanks

Frank

from numpy import *
a = arange(-15,16,2)
cnstl=a.reshape(16,1)+1j*a
cnstl=cnstl.reshape(256,1)

X = array([1.4 + 1j*2.7, -4.9 + 1j*8.3])
Out = array(X)
error = array(X)
for i in xrange(2):
    for j in xrange(256):
        a0 = real(X[i]) < (real(cnstl[j])+1)
        a1 = real(X[i]) > (real(cnstl[j])-1)
        a2 = imag(X[i]) > (imag(cnstl[j])-1)
        a3 = imag(X[i]) < (imag(cnstl[j])+1)
        if (a0 & a1 & a2 & a3):
            Out[i] = cnstl[j]
            error[i] = X[i] - cnstl[j]

From: f.yw at hotmail.com To: numpy-discussion at scipy.org Subject: RE: [Numpy-discussion] help on fast slicing on a grid Date: Wed, 28 Jan 2009 23:28:47 -0700

Hi, Bob, Thanks for your help. I am sorry for my typo. The qam array is the X array in my example. cnstl is a complex array containing the (x,y) grid points. I will try to make a workable example. Also I will try to find out about the zeros_like function. However, I guess that zeros_like(X) will create an array the same size as X. If it is, then the two lines Out=X and error=X should be Out=zeros_like(X) and error=zeros_like(X). Also, can the where command handle the combined logic?

aa = np.where((real(X)<real(cnstl[j])+1) & (real(X)>real(cnstl[j])-1) & (imag(X)<imag(cnstl[j])+1) & (imag(X)>imag(cnstl[j])-1))

For example, if cnstl[j]=3+1j*5, then the where command is the same as:

aa = np.where((real(X)<4) & (real(X)>2) & (imag(X)<6) & (imag(X)>4))

Thanks

Frank

> Date: Thu, 29 Jan 2009 00:15:48 -0600 > From: robert.kern at gmail.com > To: numpy-discussion at scipy.org > Subject: Re: [Numpy-discussion] help on fast slicing on a grid > > On Thu, Jan 29, 2009 at 00:09, frank wang wrote: > > Here is the for loop that I am thinking about. Also, I do not know whether the > > where commands can handle the complicated logic. > > The where command basically finds the data in the square around the point > > cnstl[j]. > > cnstl is a 2D array from your previous description. > > > Let the data array be qam with size N > > I don't see qam anywhere. Did you mean X? > > > Out = X > > error = X > > Don't you want something like zeros_like(X) for these? > > > for i in arange(N): > > for j in arange(L): > > aa = np.where((real(X)<real(cnstl[j])+1) & (real(X)>real(cnstl[j])-1) & (imag(X)<imag(cnstl[j])+1) & (imag(X)>imag(cnstl[j])-1)) > > Out[aa]=cnstl[j] > > error[aa]=abs(X)**2 - abs(cnstl[j])**2 > > I'm still confused. Can you show me a complete, working script with > possibly fake data? > > -- > Robert Kern > > "I have come to believe that the whole world is an enigma, a harmless > enigma that is made terrible by our own mad attempt to interpret it as > though it had an underlying truth." > -- Umberto Eco

-------------- next part -------------- An HTML attachment was scrubbed... URL:
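Since cnstl is a regular grid of the odd integers from -15 to 15, the nearest constellation point can also be computed directly instead of searched for. A sketch of a vectorized slicer (not from the thread; points exactly on a box boundary are resolved slightly differently than by the strict inequalities above):

import numpy as np

def qam_slicer(X, lim=15):
    # round each axis to the nearest odd integer, clipped to the grid
    def nearest_odd(v):
        return np.clip(2*np.floor(v/2.0) + 1, -lim, lim)
    out = nearest_odd(X.real) + 1j*nearest_odd(X.imag)
    return out, X - out

X = np.array([1.4 + 2.7j, -4.9 + 8.3j])
out, error = qam_slicer(X)
# out   -> [ 1.+3.j -5.+9.j]
# error -> [ 0.4-0.3j  0.1-0.7j]

This avoids both Python loops entirely, which matters for a 2000000-point data set.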
From pgmdevlist at gmail.com Fri Jan 30 14:06:26 2009 From: pgmdevlist at gmail.com (Pierre GM) Date: Fri, 30 Jan 2009 14:06:26 -0500 Subject: [Numpy-discussion] puzzle: generate index with many ranges In-Reply-To: <49834CC5.4080509@crg.es> References: <49833D24.7060705@crg.es> <4983406B.60301@noaa.gov> <498342BE.4080002@crg.es> <7D622E3E-D52F-43F1-9379-1531A44792F8@gmail.com> <49834CC5.4080509@crg.es> Message-ID: <78A727A0-B07B-4086-9126-7805ACFBE242@gmail.com>

On Jan 30, 2009, at 1:53 PM, Raik Gruenberg wrote: > exactly! Now, the question was: is there a way to do this using only > numpy functions (sum, repeat, ...), that is, without any python "for" loop?

Can't really see it right now. Make np.arange(max(b)) and take the slices you need? But you still have to look in 2 arrays to find the beginning and end of slices, so...

> Sorry about being so insistent on this one but, in my experience, eliminating > those for loops makes a huge difference in terms of speed. The zip is probably > also quite costly on a very large data set.

yeah, but it's in a list comprehension, which may make things a tad faster. If you prefer, use itertools.izip instead of zip, but I wonder where the advantage would be. Anyway, are you sure this particular part is your bottleneck? You know the saying about premature optimization...

From Chris.Barker at noaa.gov Fri Jan 30 14:18:10 2009 From: Chris.Barker at noaa.gov (Christopher Barker) Date: Fri, 30 Jan 2009 11:18:10 -0800 Subject: [Numpy-discussion] using numpy functions on an array of objects In-Reply-To: References: Message-ID: <49835272.1060704@noaa.gov>

I think you want to subclass an ndarray here. It's a bit tricky to do so, but if you look in the wiki and these mailing list archives, you'll find advice on how to do it.

-Chris

-- Christopher Barker, Ph.D. Oceanographer Emergency Response Division NOAA/NOS/OR&R (206) 526-6959 voice 7600 Sand Point Way NE (206) 526-6329 fax Seattle, WA 98115 (206) 526-6317 main reception Chris.Barker at noaa.gov

From faltet at pytables.org Fri Jan 30 14:42:23 2009 From: faltet at pytables.org (Francesc Alted) Date: Fri, 30 Jan 2009 20:42:23 +0100 Subject: [Numpy-discussion] Data file format choice. In-Reply-To: <4983396D.6030903@fastmail.fm> References: <88fe22a0901300920m6eb1153at49c89b0227b3c5ce@mail.gmail.com> <4983396D.6030903@fastmail.fm> Message-ID: <200901302042.23334.faltet@pytables.org>

A Friday 30 January 2009, Jeff Whitaker escrigué: > Gary Pajer wrote: > > It's time for me to select a data format. > > ... > > I don't see any significant differences between netCDF4 and HDF5. > > Gary: netCDF4 is just a thin wrapper on top of HDF5 1.8 - think of > it as a higher level API. > > > Similarly, I don't see significant differences between pytables and > > h5py. Does one play better with numpy? > > pytables has been around longer and is well-tested, has nice pythonic > features, but files you write with it may not be readable by C or > fortran clients.

Just to be clear: PyTables will only write pickled objects to file if it is not possible to reasonably represent them as native HDF5 objects. If you try to save NumPy objects or regular Python scalars, they are effectively written as native HDF5 objects (see [1]).

Regarding a comparison with h5py (disclaimer: I'm the main author of PyTables), I'd say that h5py is designed as a direct map to NumPy array capabilities, but doesn't try to go further. Also, it is worth noting that h5py offers access to the low-level HDF5 functions, which can be great if you want to get deeper into HDF5 intricacies. For its part, PyTables doesn't try to go this low-level and, besides supporting general NumPy objects, is more focused on implementing advanced features that are normally only available in database-oriented approaches, like enumerated types, flexible query iterators for tables (the on-disk equivalent to recarrays), indexing (only in the Pro version), do/undo features or natural naming (for an enhanced interactive experience). PyTables also tries hard to be a high-performance interface to HDF5, implementing niceties like internal LRU caches for nodes, automatic chunksizes for the datasets, and using numexpr internally to accelerate queries.

Finally, although h5py is relatively recent, I'm really impressed by the work that Andrew has already done and, in fact, I'm looking forward to backporting some of the h5py features (like general NumPy-like fancy indexing capabilities) to PyTables. At any rate, it is clear that the h5py/PyTables combination will benefit users, with the only drawback that they have to choose their preferred API to HDF5 (or they can use both, which could really be a lot of fun ;-). NetCDF4-based interfaces are also probably a good approach and, as they are based on HDF5, compatibility is ensured.

HTH,

[1] http://www.pytables.org/docs/manual/ch04.html#id2553542

-- Francesc Alted
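As a concrete illustration of the higher-level style being discussed, a minimal PyTables 2.x sketch for the kind of data Gary describes -- the file layout, names, and sizes here are invented for the example:

import numpy as np
import tables

spectra = np.zeros((600, 6, 2048), dtype=np.float32)   # frames x channels x samples

h5 = tables.openFile('spectra.h5', mode='w')
run = h5.createGroup('/', 'run001', 'one acquisition run')
h5.createArray(run, 'spectra', spectra, 'raw spectra')
run._v_attrs.sample_rate_hz = 10.0                     # instrument settings as attributes
h5.close()

A file written this way contains only native HDF5 arrays and attributes, so it avoids the pickling caveat Jeff mentions and stays readable from h5py or C/Fortran HDF5 clients.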
From sturla at molden.no Fri Jan 30 15:06:11 2009 From: sturla at molden.no (Sturla Molden) Date: Fri, 30 Jan 2009 21:06:11 +0100 (CET) Subject: [Numpy-discussion] example reading binary Fortran file In-Reply-To: <4983393A.5010702@noaa.gov> References: <49832991.1040208@molden.no> <4983393A.5010702@noaa.gov> Message-ID: <6df541a5c8d6ecf26b6e38f404958401.squirrel@webmail.uio.no>

> Careful -- the last time I read a Fortran-written binary file, I found > that the various structures (is that what you call them in Fortran?) > were padded with, I think, 4 bytes.

That is precisely why I suggested using f2py. If you let Fortran read the file (be careful to use the same compiler!), it is easy to pass the data on to Python. Otherwise, you will have to figure out how your Fortran program writes the file, i.e. what padding, metainformation, etc. are used. If you switch Fortran compiler, or even compiler version from the same vendor, you must start over again. On the other hand, an f2py'd solution just requires a recompile.

For my own work, I just make sure NEVER to do any I/O in Fortran! It is asking for trouble. I leave the I/O to Python or C, where it belongs. That way I know what data are written and what data are read.

S. M.

From faltet at pytables.org Fri Jan 30 15:30:59 2009 From: faltet at pytables.org (Francesc Alted) Date: Fri, 30 Jan 2009 21:30:59 +0100 Subject: [Numpy-discussion] NumPy distutils limitation? Message-ID: <200901302130.59483.faltet@pytables.org>

Hi, Gregor and I are trying to add support for Intel's VML (Vector Mathematical Library) to numexpr. For this, we are trying to make use of the machinery in NumPy's distutils so as to be able to discover where the MKL (the package that contains VML) is located. The libraries that we need to link with are: mkl_gf_lp64, mkl_gnu_thread, mkl_core *and* iomp5 (Intel's OpenMP library).

The problem is that I have installed MKL as part of the Intel compiler for Unix. In this setup, most of the libraries are in one place, namely:

/opt/intel/Compiler/11.0/074/mkl/lib/em64t/

However, the OpenMP library is in another directory:

/opt/intel/Compiler/11.0/074/lib/intel64

So, I need to specify *two* directories to get the complete set of libraries. My first attempt was setting a site.cfg like:

[DEFAULT]
#libraries = gfortran

[mkl]
library_dirs = /opt/intel/Compiler/11.0/074/mkl/lib/em64t/:/opt/intel/Compiler/11.0/074/lib/intel64
include_dirs = /opt/intel/Compiler/11.0/074/mkl/include/
mkl_libs = mkl_gf_lp64, mkl_gnu_thread, mkl_core, iomp5

Unfortunately, distutils complains and says that it cannot find the complete set of libraries:

mkl_info:
  libraries mkl_gf_lp64,mkl_gnu_thread,mkl_core,iomp5 not found in /opt/intel/Compiler/11.0/074/mkl/lib/em64t/
  libraries mkl_gf_lp64,mkl_gnu_thread,mkl_core,iomp5 not found in /opt/intel/Compiler/11.0/074/lib/intel64
  NOT AVAILABLE

After some debugging of the problem, it seems that distutils needs to find *all* the required libraries in *one* single directory. As iomp5 is in a different directory, distutils thinks that the requirements are not fulfilled.
I've solved this by requiring iomp5 to be found in the DEFAULT section. Something like:

[DEFAULT]
library_dirs = /opt/intel/Compiler/11.0/074/lib/intel64
#libraries = gfortran, iomp5
libraries = iomp5

[mkl]
library_dirs = /opt/intel/Compiler/11.0/074/mkl/lib/em64t/
include_dirs = /opt/intel/Compiler/11.0/074/mkl/include/
mkl_libs = mkl_gf_lp64, mkl_gnu_thread, mkl_core

However, if one needed to specify several other directories in the DEFAULT section for finding other hypothetical necessary libraries (like gfortran or others), we may run into the same problem as above. My question is: is there an elegant way to handle this problem, or is it a limitation of the current distutils? If the latter, it would be nice if that could be solved in a future version, so that several libraries can be found in *several* directories.

Thanks,

-- Francesc Alted

From robert.kern at gmail.com Fri Jan 30 15:31:07 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 30 Jan 2009 14:31:07 -0600 Subject: [Numpy-discussion] minor improvment to ones In-Reply-To: <49833838.30909@gmail.com> References: <4982FB26.1010509@ar.media.kyoto-u.ac.jp> <6a17e9ee0901300554j516b6005hd44c5978b92904ee@mail.gmail.com> <49831E5B.8000907@molden.no> <498337BE.7020209@noaa.gov> <49833838.30909@gmail.com> Message-ID: <3d375d730901301231n19cb4748i56f3273d492b1b3d@mail.gmail.com>

On Fri, Jan 30, 2009 at 11:26, Ryan May wrote: > Christopher Barker wrote: >>> On 1/30/2009 3:22 PM, Neal Becker wrote: >>> >>>> Now what would be _really_ cool is a special array type that would represent >>>> a constant array without wasting memory. >> >> Can't you do that with scary stride tricks? I think I remember some >> discussion of this a while back. > > I think that's right, but at that point, what gain is that over using a regular > constant and relying on numpy's broadcasting?

The filled array may be providing some of the shape information that's not in the other arrays.

-- Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From robert.kern at gmail.com Fri Jan 30 15:35:59 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 30 Jan 2009 14:35:59 -0600 Subject: [Numpy-discussion] minor improvment to ones In-Reply-To: References: <4982FB26.1010509@ar.media.kyoto-u.ac.jp> <6a17e9ee0901300554j516b6005hd44c5978b92904ee@mail.gmail.com> Message-ID: <3d375d730901301235vebebc65u8b12b8ff9e656664@mail.gmail.com>

On Fri, Jan 30, 2009 at 08:22, Neal Becker wrote: > Right now there are 2 options to create an array of constant value: > > 1) empty (size); fill (val) > > 2) ones (size) * val > > 1 has the disadvantage of not being an expression, so can't be an arg to a > function call.

So wrap it in a function.

> Also probably slower than create+fill @ same time

Only marginally. In any case, (1) is exactly how ones() and zeros() are implemented. I would be +1 on a patch that adds a filled() function along the lines of ones() and zeros(), but I'm -1 on adding this functionality to ones() or zeros().

> 2 is probably slower than create+fill @ same time > > Now what would be _really_ cool is a special array type that would represent > a constant array without wasting memory. boost::ublas, for example, has > this feature.
In [2]: from numpy.lib.stride_tricks import as_strided In [3]: def hollow_filled(shape, value, dtype=None): ...: x = asarray(value, dtype=dtype) ...: return as_strided(x, shape, [0]*len(shape)) ...: In [5]: hollow_filled([2,3,4], 5) Out[5]: array([[[5, 5, 5, 5], [5, 5, 5, 5], [5, 5, 5, 5]], [[5, 5, 5, 5], [5, 5, 5, 5], [5, 5, 5, 5]]]) In [6]: hollow_filled([2,3,4], 5.0) Out[6]: array([[[ 5., 5., 5., 5.], [ 5., 5., 5., 5.], [ 5., 5., 5., 5.]], [[ 5., 5., 5., 5.], [ 5., 5., 5., 5.], [ 5., 5., 5., 5.]]]) -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From robert.kern at gmail.com Fri Jan 30 15:37:52 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 30 Jan 2009 14:37:52 -0600 Subject: [Numpy-discussion] NumPy distutils limitation? In-Reply-To: <200901302130.59483.faltet@pytables.org> References: <200901302130.59483.faltet@pytables.org> Message-ID: <3d375d730901301237l2da7f964jb14c972f5a52e0e8@mail.gmail.com> On Fri, Jan 30, 2009 at 14:30, Francesc Alted wrote: > My question is, is there an elegant way to handle this problem, or it > is a limitation of the current distutils? Probably the latter. > If the later, it would be > nice it that could be solved in a future version, and several libraries > can be found in *several* directories. Patches are welcome. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From robert.kern at gmail.com Fri Jan 30 15:52:27 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 30 Jan 2009 14:52:27 -0600 Subject: [Numpy-discussion] help on fast slicing on a grid In-Reply-To: References: <49811036.5030701@ar.media.kyoto-u.ac.jp> <3d375d730901282157l4797a93ex293330d5c7d057c0@mail.gmail.com> <3d375d730901282215t169158ccvfc406bc30a9cdd9a@mail.gmail.com> Message-ID: <3d375d730901301252j14628de9ye143b9d1aa3f18b5@mail.gmail.com> On Fri, Jan 30, 2009 at 12:58, frank wang wrote: > I have created a test example for the question using for loop and hope > someone can help me to get fast solution. My data set is about 2000000 data. > > However, I have the problem to run the code, the Out[i]=cnstl[j] line gives > me error says: > > In [107]: Out[0]=cnstl[0] > --------------------------------------------------------------------------- > TypeError Traceback (most recent call last) > C:\Frank_share\qamslicer.py in () > ----> 1 > 2 > 3 > 4 > 5 > TypeError: can't convert complex to float; use abs(z) > In [108]: cnstl.dtype > Out[108]: dtype('complex128') > > I do not know why that my data is complex128 already. Can anyone help to > figure why? It's an odd error message, certainly. The root of the problem is that you are attempting to put a (1,)-shaped array into a scalar. You don't want to do that. By the way, you don't want to use the variable name Out in IPython. It's already used to capture the output. 
> Thanks > > Frank > > from numpy import * > a = arange(-15,16,2) > cnstl=a.reshape(16,1)+1j*a > cnstl=cnstl.reshape(256,1) Change that line to cnstl = cnstl.ravel() > X = array([1.4 + 1j*2.7, -4.9 + 1j*8.3]) > Out = array(X) > error =array(X) > for i in xrange(2): > for j in xrange(256): > a0 = real(X[i]) < (real(cnstl[j])+1) > a1 = real(X[i]) > (real(cnstl[j])-1) > a2 = imag(X[i]) > (imag(cnstl[j])-1) > a3 = imag(X[i]) < (imag(cnstl[j])+1) > if (a0 & a1 & a2 &a3): > Out[i] = cnstl[j] > error[i] = X[i] - cnstl[j] After reindenting this correctly, I get the following results: In [22]: out Out[22]: array([ 1.+3.j, -5.+9.j]) In [23]: error Out[23]: array([ 0.4-0.3j, 0.1-0.7j]) Are those correct? -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From robert.kern at gmail.com Fri Jan 30 16:05:55 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 30 Jan 2009 15:05:55 -0600 Subject: [Numpy-discussion] using numpy functions on an array of objects In-Reply-To: <49835272.1060704@noaa.gov> References: <49835272.1060704@noaa.gov> Message-ID: <3d375d730901301305r6eb3cf11p5f4e8a0641bbf65c@mail.gmail.com> On Fri, Jan 30, 2009 at 13:18, Christopher Barker wrote: > I think you want to subclass an ndarray here. It's a bit tricky to so, > but if you look in the wiki and these mailing list archives, you'll find > advise on how to do it. That still won't work. numpy.linalg.inv() simply does a particular algorithm on float and complex arrays and nothing else. -- Robert Kern "I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco From rlw at stsci.edu Fri Jan 30 17:20:01 2009 From: rlw at stsci.edu (Rick White) Date: Fri, 30 Jan 2009 17:20:01 -0500 Subject: [Numpy-discussion] puzzle: generate index with many ranges In-Reply-To: References: Message-ID: <5A3AA485-6FDB-4F02-96C7-D77DCFBAEFF6@stsci.edu> Here's a technique that works: Python 2.4.2 (#5, Nov 21 2005, 23:08:11) [GCC 4.0.0 20041026 (Apple Computer, Inc. build 4061)] on darwin Type "help", "copyright", "credits" or "license" for more information. >>> import numpy as np >>> a = np.array([0,4,0,11]) >>> b = np.array([-1,11,4,15]) >>> rangelen = b-a+1 >>> cumlen = rangelen.cumsum() >>> c = np.arange(cumlen[-1],dtype=np.int32) >>> c += np.repeat(a[1:]-c[cumlen[0:-1]], rangelen[1:]) >>> print c [ 4 5 6 7 8 9 10 11 0 1 2 3 4 11 12 13 14 15] The basic idea is that the difference of your desired output from a simple range is an array with a bunch of constant values appended together, and that is what repeat() does. I'm assuming that you'll never have b < a. Notice the slight ugliness of prepending the elements at the beginning so that the cumsum starts with zero. (Maybe there is a cleaner way to do that.) This does create a second array (via the repeat) that is the same length as the result. If that uses too much memory, you could break up the repeat and update of c into segments using a loop. (You wouldn't need a loop for every a,b element -- do a bunch in each iteration.) -- Rick Raik Gruenberg wrote: > Hi there, > > perhaps someone has a bright idea for this one: > > I want to concatenate ranges of numbers into a single array (for > indexing). 
So I > have generated an array "a" with starting positions, for example: > > a = [4, 0, 11] > > I have an array b with stop positions: > > b = [11, 4, 15] > > and I would like to generate an index array that takes 4..11, then > 0..4, then > 11..15. > > In reality, a and b have 10000+ elements and the arrays to be > "sliced" are very > large so I want to avoid any for loops etc. Any idea how this could > be done? I > thought some combination of *repeat* and adding of *arange* should > do the trick > but just cannot nail it down. > > Thanks in advance for any hints! > > Greetings, > Raik From ndbecker2 at gmail.com Fri Jan 30 17:32:06 2009 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 30 Jan 2009 17:32:06 -0500 Subject: [Numpy-discussion] minor improvment to ones References: <4982FB26.1010509@ar.media.kyoto-u.ac.jp> <6a17e9ee0901300554j516b6005hd44c5978b92904ee@mail.gmail.com> <3d375d730901301235vebebc65u8b12b8ff9e656664@mail.gmail.com> Message-ID: Robert Kern wrote: > On Fri, Jan 30, 2009 at 08:22, Neal Becker wrote: >> Right now there are 2 options to create an array of constant value: >> >> 1) empty (size); fill (val) >> >> 2) ones (size) * val >> >> 1 has disadvantage of not being an expression, so can't be an arg to a >> function call. > > So wrap it in a function. > >> Also probably slower than create+fill @ same time > > Only marginally. In any case, (1) is exactly how ones() and zeros() > are implemented. I would be +1 on a patch that adds a filled() > function along the lines of ones() and zeros(), but I'm -1 on adding > this functionality to ones() or zeros(). > >> 2 is probably slower than create+fill @ same time >> >> Now what would be _really_ cool is a special array type that would >> represent >> a constant array without wasting memory. boost::ublas, for example, has >> this feature. > > In [2]: from numpy.lib.stride_tricks import as_strided > > In [3]: def hollow_filled(shape, value, dtype=None): > ...: x = asarray(value, dtype=dtype) > ...: return as_strided(x, shape, [0]*len(shape)) > ...: > > In [5]: hollow_filled([2,3,4], 5) > Out[5]: > array([[[5, 5, 5, 5], > [5, 5, 5, 5], > [5, 5, 5, 5]], > > [[5, 5, 5, 5], > [5, 5, 5, 5], > [5, 5, 5, 5]]]) > > In [6]: hollow_filled([2,3,4], 5.0) > Out[6]: > array([[[ 5., 5., 5., 5.], > [ 5., 5., 5., 5.], > [ 5., 5., 5., 5.]], > > [[ 5., 5., 5., 5.], > [ 5., 5., 5., 5.], > [ 5., 5., 5., 5.]]]) > Where can I find doc on stride_tricks? Nothing here: http://docs.scipy.org/doc/numpy/search.html?q=stride_tricks&check_keywords=yes&area=default From robert.kern at gmail.com Fri Jan 30 17:40:29 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 30 Jan 2009 16:40:29 -0600 Subject: [Numpy-discussion] minor improvment to ones In-Reply-To: References: <4982FB26.1010509@ar.media.kyoto-u.ac.jp> <6a17e9ee0901300554j516b6005hd44c5978b92904ee@mail.gmail.com> <3d375d730901301235vebebc65u8b12b8ff9e656664@mail.gmail.com> Message-ID: <3d375d730901301440h50027dbdi62f48db49875ef24@mail.gmail.com> On Fri, Jan 30, 2009 at 16:32, Neal Becker wrote: > Robert Kern wrote: > >> On Fri, Jan 30, 2009 at 08:22, Neal Becker wrote: >>> Right now there are 2 options to create an array of constant value: >>> >>> 1) empty (size); fill (val) >>> >>> 2) ones (size) * val >>> >>> 1 has disadvantage of not being an expression, so can't be an arg to a >>> function call. >> >> So wrap it in a function. >> >>> Also probably slower than create+fill @ same time >> >> Only marginally. In any case, (1) is exactly how ones() and zeros() >> are implemented. 
I would be +1 on a patch that adds a filled() > function along the lines of ones() and zeros(), but I'm -1 on adding > this functionality to ones() or zeros(). > >> 2 is probably slower than create+fill @ same time >> >> Now what would be _really_ cool is a special array type that would represent >> a constant array without wasting memory. boost::ublas, for example, has >> this feature. > > In [2]: from numpy.lib.stride_tricks import as_strided > > In [3]: def hollow_filled(shape, value, dtype=None): > ...:     x = asarray(value, dtype=dtype) > ...:     return as_strided(x, shape, [0]*len(shape)) > ...: > > In [5]: hollow_filled([2,3,4], 5) > Out[5]: > array([[[5, 5, 5, 5], > [5, 5, 5, 5], > [5, 5, 5, 5]], > > [[5, 5, 5, 5], > [5, 5, 5, 5], > [5, 5, 5, 5]]]) > > In [6]: hollow_filled([2,3,4], 5.0) > Out[6]: > array([[[ 5., 5., 5., 5.], > [ 5., 5., 5., 5.], > [ 5., 5., 5., 5.]], > > [[ 5., 5., 5., 5.], > [ 5., 5., 5., 5.], > [ 5., 5., 5., 5.]]])

Where can I find doc on stride_tricks? Nothing here: http://docs.scipy.org/doc/numpy/search.html?q=stride_tricks&check_keywords=yes&area=default

From robert.kern at gmail.com Fri Jan 30 17:40:29 2009 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 30 Jan 2009 16:40:29 -0600 Subject: [Numpy-discussion] minor improvment to ones In-Reply-To: References: <4982FB26.1010509@ar.media.kyoto-u.ac.jp> <6a17e9ee0901300554j516b6005hd44c5978b92904ee@mail.gmail.com> <3d375d730901301235vebebc65u8b12b8ff9e656664@mail.gmail.com> Message-ID: <3d375d730901301440h50027dbdi62f48db49875ef24@mail.gmail.com>

On Fri, Jan 30, 2009 at 16:32, Neal Becker wrote: > Robert Kern wrote: >> [...the exchange above, quoting the filled() discussion and the >> hollow_filled example in full...] > > Where can I find doc on stride_tricks?

Source is always the best place. as_strided is not exposed as such since you can cause segfaults with it if you have a bug. Rather, it's useful for devs to make tools that, once debugged, can't cause segfaults.

> Nothing here: > http://docs.scipy.org/doc/numpy/search.html?q=stride_tricks&check_keywords=yes&area=default

Use this search box to search the development version of the docs: http://docs.scipy.org/numpy/search/

-- Robert Kern

"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." -- Umberto Eco

From david.froger.info at gmail.com Fri Jan 30 17:53:06 2009 From: david.froger.info at gmail.com (David Froger) Date: Fri, 30 Jan 2009 23:53:06 +0100 Subject: [Numpy-discussion] example reading binary Fortran file In-Reply-To: <6df541a5c8d6ecf26b6e38f404958401.squirrel@webmail.uio.no> References: <49832991.1040208@molden.no> <4983393A.5010702@noaa.gov> <6df541a5c8d6ecf26b6e38f404958401.squirrel@webmail.uio.no> Message-ID:

ok for f2py!

> Otherwise, you will have to figure out how your Fortran program writes the > file, i.e. what padding, metainformation, etc. are used. If you > switch Fortran compiler, or even compiler version from the same vendor, > you must start over again.

In my experience, I have never had this kind of problem. I just have to convert files between big/little endian with uswap (http://linux.die.net/man/1/uswap), but I have never seen a Fortran program write data differently depending on the compiler.

> For my own work, I just make sure NEVER to do any I/O in Fortran! It is > asking for trouble. I leave the I/O to Python or C, where it belongs. That > way I know what data are written and what data are read.

Unfortunately, binary files are mandatory in the context I work in. I use a scientific code written in Fortran to compute fluid dynamics. Typically the simulation is run on a supercomputer and generates gigabytes and gigabytes of data, so we must use the binary format, which requires less storage. Then I like to post-process the data using Python and Gnuplot.py. That's why I'm looking for a fast, easy and 'standard' way to read binary Fortran files. (I think many people have the same need.)

David

-------------- next part -------------- An HTML attachment was scrubbed...
From david.froger.info at gmail.com  Fri Jan 30 17:53:06 2009
From: david.froger.info at gmail.com (David Froger)
Date: Fri, 30 Jan 2009 23:53:06 +0100
Subject: [Numpy-discussion] example reading binary Fortran file
In-Reply-To: <6df541a5c8d6ecf26b6e38f404958401.squirrel@webmail.uio.no>
References: <49832991.1040208@molden.no> <4983393A.5010702@noaa.gov> <6df541a5c8d6ecf26b6e38f404958401.squirrel@webmail.uio.no>
Message-ID: 

ok for f2py!

> Otherwise, you will have to figure out how your Fortran program writes the
> file. I.e. what padding, metainformation, etc. that are used. If you
> switch Fortran compiler, or even compiler version from the same vendor,
> you must start over again.

In my experience, I have never had this kind of problem. I sometimes
have to convert files between big and little endian with uswap
(http://linux.die.net/man/1/uswap), but I have never seen a Fortran
program write data differently depending on the compiler.

> For my own work, I just makes sure NEVER to do any I/O in Fortran! It is
> asking for trouble. I leave the I/O to Python or C, where it belongs. That
> way I know what data are written and what data are read.

Unfortunately, binary files are mandatory in the context I work in. I
use a scientific code written in Fortran to compute fluid dynamics.
Typically the simulation runs on a supercomputer and generates
gigabytes and gigabytes of data, so we must use a binary format, which
takes up less storage. Then I like to post-process the data using
Python and Gnuplot.py. That's why I'm looking for a performant, easy
and 'standard' way to read binary Fortran files. (I think many people
have the same need.)

David

From ellisonbg.net at gmail.com  Fri Jan 30 19:13:16 2009
From: ellisonbg.net at gmail.com (Brian Granger)
Date: Fri, 30 Jan 2009 16:13:16 -0800
Subject: [Numpy-discussion] numpy.distutils and shared libraries
Message-ID: <6ce0ac130901301613i53be53bs2607c6fd86cc4222@mail.gmail.com>

Hi,

2 questions about numpy.distutils:

1.

I am using config.add_library to build a c++ library that I will link
into some Cython extensions. This is working fine and generating a .a
library for me. However, I need a shared library instead. Is this
possible with numpy.distutils or will I need something like numscons?

2.

When calling add_library, what is the difference between the depends
and headers arguments?

Thanks!

Brian

From robert.kern at gmail.com  Fri Jan 30 19:21:30 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 30 Jan 2009 18:21:30 -0600
Subject: [Numpy-discussion] numpy.distutils and shared libraries
In-Reply-To: <6ce0ac130901301613i53be53bs2607c6fd86cc4222@mail.gmail.com>
References: <6ce0ac130901301613i53be53bs2607c6fd86cc4222@mail.gmail.com>
Message-ID: <3d375d730901301621l3391ea97id48ca8d674c257f4@mail.gmail.com>

On Fri, Jan 30, 2009 at 18:13, Brian Granger wrote:
> Hi,
>
> 2 questions about numpy.distutils:
>
> 1.
>
> I am using config.add_library to build a c++ library that I will link
> into some Cython extensions. This is working fine and generating a .a
> library for me. However, I need a shared library instead. Is this
> possible with numpy.distutils or will I need something like numscons?

numscons, or you can adapt the code from OOF2 and contribute it to
numpy.distutils.

> 2.
>
> When calling add_library, what is the difference between the depends
> and headers arguments?

headers get installed via the distutils install_headers command. This
will install the headers ... somewhere. Not exactly sure.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco

From ellisonbg.net at gmail.com  Fri Jan 30 19:31:15 2009
From: ellisonbg.net at gmail.com (Brian Granger)
Date: Fri, 30 Jan 2009 16:31:15 -0800
Subject: [Numpy-discussion] numpy.distutils and shared libraries
In-Reply-To: <3d375d730901301621l3391ea97id48ca8d674c257f4@mail.gmail.com>
References: <6ce0ac130901301613i53be53bs2607c6fd86cc4222@mail.gmail.com> <3d375d730901301621l3391ea97id48ca8d674c257f4@mail.gmail.com>
Message-ID: <6ce0ac130901301631w74793303v1d8800a32a93e75a@mail.gmail.com>

>> I am using config.add_library to build a c++ library that I will link
>> into some Cython extensions. This is working fine and generating a .a
>> library for me. However, I need a shared library instead. Is this
>> possible with numpy.distutils or will I need something like numscons?
>
> numscons, or you can adapt the code from OOF2 and contribute it to
> numpy.distutils.

OK, thanks, I hadn't seen the stuff in OOF2.

>> When calling add_library, what is the difference between the depends
>> and headers arguments?
>
> headers get installed via the distutils install_headers command. This
> will install the headers ... somewhere. Not exactly sure.

Again, thanks, that helps.

Cheers,

Brian

> --
> Robert Kern
>
> "I have come to believe that the whole world is an enigma, a harmless
> enigma that is made terrible by our own mad attempt to interpret it as
> though it had an underlying truth."
>   -- Umberto Eco
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>
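A minimal sketch of the setup.py pattern being discussed, for context.
The package and file names ('mypkg', src/core.cxx, etc.) are
placeholders, and note this builds the static .a that Brian describes,
not a shared library:

from numpy.distutils.misc_util import Configuration
from numpy.distutils.core import setup

def configuration(parent_package='', top_path=None):
    config = Configuration('mypkg', parent_package, top_path)
    # add_library produces a static library; 'depends' lists files
    # that trigger a rebuild when touched, while 'headers' are handled
    # by the distutils install_headers command
    config.add_library('mycore',
                       sources=['src/core.cxx'],
                       depends=['src/core.h'])
    # the extension then links against the static library by name
    config.add_extension('_mycore',
                         sources=['src/_mycore.c'],
                         libraries=['mycore'])
    return config

if __name__ == '__main__':
    setup(configuration=configuration)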
From cournape at gmail.com  Fri Jan 30 21:15:08 2009
From: cournape at gmail.com (David Cournapeau)
Date: Sat, 31 Jan 2009 11:15:08 +0900
Subject: [Numpy-discussion] NumPy distutils limitation?
In-Reply-To: <200901302130.59483.faltet@pytables.org>
References: <200901302130.59483.faltet@pytables.org>
Message-ID: <5b8d13220901301815p61a29089g59f21db5722bb7c0@mail.gmail.com>

On Sat, Jan 31, 2009 at 5:30 AM, Francesc Alted wrote:
> Hi,
>
> Gregor and I are trying to add support for Intel's VML (Vector
> Mathematical Library) in numexpr. For this, we are trying to make use
> of the weaponry in NumPy's distutils so as to be able to discover
> where the MKL (the package that contains VML) is located. The
> libraries that we need to link with are: mkl_gf_lp64, mkl_gnu_thread,
> mkl_core *and* iomp5 (Intel's OpenMP library).
>
> The problem is that I have installed MKL as part of the Intel compiler
> for Unix. In this setup, most of the libraries are in one place,
> namely:
>
> /opt/intel/Compiler/11.0/074/mkl/lib/em64t/
>
> However, the OpenMP library is in another directory:
>
> /opt/intel/Compiler/11.0/074/lib/intel64
>
> So, I need to specify *two* directories to get the complete set of
> libraries. My first attempt was setting a site.cfg like:
>
> [DEFAULT]
> #libraries = gfortran
>
> [mkl]
> library_dirs = /opt/intel/Compiler/11.0/074/mkl/lib/em64t/:/opt/intel/Compiler/11.0/074/lib/intel64
> include_dirs = /opt/intel/Compiler/11.0/074/mkl/include/
> mkl_libs = mkl_gf_lp64, mkl_gnu_thread, mkl_core, iomp5
>
> Unfortunately, distutils complains and says that it cannot find the
> complete set of libraries:
>
> mkl_info:
>     libraries mkl_gf_lp64,mkl_gnu_thread,mkl_core,iomp5 not found
>     in /opt/intel/Compiler/11.0/074/mkl/lib/em64t/
>     libraries mkl_gf_lp64,mkl_gnu_thread,mkl_core,iomp5 not found
>     in /opt/intel/Compiler/11.0/074/lib/intel64
>     NOT AVAILABLE
>
> After some debugging of the problem, it seems that distutils needs to
> find *all* the required libraries in *one* single directory.

Yes.

> My question is, is there an elegant way to handle this problem, or is
> it a limitation of the current distutils?

The "elegant" solution is to make symlinks on unix.

> If the latter, it would be
> nice if that could be solved in a future version, so that several
> libraries can be found in *several* directories.

Unfortunately, this would mean rewriting system_info, because this
assumption is deeply ingrained in the whole design. Personally, I
think the whole idea of looking for library files is not the right one
- all the other tools I know of (autoconf, scons, jam, cmake) try to
link a code snippet instead. But doing it in distutils won't be fun.

David
From gruben at bigpond.net.au  Fri Jan 30 19:00:20 2009
From: gruben at bigpond.net.au (Gary Ruben)
Date: Sat, 31 Jan 2009 11:00:20 +1100
Subject: [Numpy-discussion] example reading binary Fortran file
In-Reply-To: 
References: 
Message-ID: <49839494.9040104@bigpond.net.au>

The only time I've done this, I used numpy.fromfile exactly as follows.
The file had a header followed by a number of records (one float
followed by 128 complex numbers), requiring separate calls of
numpy.fromfile to read each part. The only strange part about this was
that the Fortran code was only supposed to be outputting 3 header
fields but was adding an extra integer field for some unknown reason.
I used a binary file viewer to work out the actual format of the file.
To get at just the complex data in the records, I viewed the data as a
recarray. Hopefully it's reasonably self-explanatory:

---
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.cm as cm

f = open("field.dat", 'rb')

# Read header
a = np.fromfile(f, np.dtype([('f1','<
[the rest of the listing did not survive in the archive]

> Hi,
>
> My question is about reading Fortran binary file (oh no this question
> again...)
>
> After googling, I read that fopen and npfile were deprecated, and we
> should use numpy.fromfile and ndarray.tofile, but despite the
> documentation, the cookbook, the mailing list and google, I don't
> succeed in making a simple example. Considering the simple Fortran
> code below, what is the Python script to read the four arrays? What
> about if my pc is little endian and the file big endian?
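Gary's listing is cut off in the archive. A generic sketch of the same
numpy.fromfile approach for Fortran unformatted sequential files; the
4-byte record markers and the little-endian '<' prefixes below are
compiler conventions, not guarantees, as discussed earlier in the
thread:

import numpy as np

def read_record(f, dtype):
    # one Fortran unformatted sequential record: a 4-byte byte count,
    # the payload, then the same byte count repeated
    dtype = np.dtype(dtype)
    n = np.fromfile(f, np.dtype('<i4'), 1)[0]
    data = np.fromfile(f, dtype, n // dtype.itemsize)
    n2 = np.fromfile(f, np.dtype('<i4'), 1)[0]
    assert n == n2, "record markers disagree; wrong layout assumed"
    return data

# For a file like Gary's (hypothetical layout: an int32 header record,
# then records of one float32 followed by 128 complex64 values):
#
# f = open("field.dat", 'rb')
# header = read_record(f, '<i4')
# rec = read_record(f, np.dtype([('f1', '<f4'), ('data', '<c8', 128)]))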
From f.yw at hotmail.com  Fri Jan 30 23:41:57 2009
From: f.yw at hotmail.com (frank wang)
Date: Fri, 30 Jan 2009 21:41:57 -0700
Subject: [Numpy-discussion] help on fast slicing on a grid
In-Reply-To: <3d375d730901301252j14628de9ye143b9d1aa3f18b5@mail.gmail.com>
References: <49811036.5030701@ar.media.kyoto-u.ac.jp> <3d375d730901282157l4797a93ex293330d5c7d057c0@mail.gmail.com> <3d375d730901282215t169158ccvfc406bc30a9cdd9a@mail.gmail.com> <3d375d730901301252j14628de9ye143b9d1aa3f18b5@mail.gmail.com>
Message-ID: 

Thanks for the correction. I will learn the ravel() function since I do
not know it. Moving from the Matlab world into python is tricky
sometimes.

Your output

> In [22]: out
> Out[22]: array([ 1.+3.j, -5.+9.j])
>
> In [23]: error
> Out[23]: array([ 0.4-0.3j, 0.1-0.7j])

is the correct answer.

However, if my data set is large, this solution takes a long time to
run. Is there any python/numpy magic to speed it up?

Thanks

Frank

> Date: Fri, 30 Jan 2009 14:52:27 -0600
> From: robert.kern at gmail.com
> To: numpy-discussion at scipy.org
> Subject: Re: [Numpy-discussion] help on fast slicing on a grid
>
> On Fri, Jan 30, 2009 at 12:58, frank wang wrote:
> > I have created a test example for the question using for loop and hope
> > someone can help me to get fast solution. My data set is about 2000000 data.
> >
> > However, I have the problem to run the code, the Out[i]=cnstl[j] line gives
> > me error says:
> >
> > In [107]: Out[0]=cnstl[0]
> > ---------------------------------------------------------------------------
> > TypeError                     Traceback (most recent call last)
> > C:\Frank_share\qamslicer.py in ()
> > ----> 1
> >       2
> >       3
> >       4
> >       5
> > TypeError: can't convert complex to float; use abs(z)
> >
> > In [108]: cnstl.dtype
> > Out[108]: dtype('complex128')
> >
> > I do not know why that my data is complex128 already. Can anyone help to
> > figure why?
>
> It's an odd error message, certainly. The root of the problem is that
> you are attempting to put a (1,)-shaped array into a scalar. You don't
> want to do that.
>
> By the way, you don't want to use the variable name Out in IPython.
> It's already used to capture the output.
>
> > Thanks
> >
> > Frank
> >
> > from numpy import *
> > a = arange(-15,16,2)
> > cnstl=a.reshape(16,1)+1j*a
> > cnstl=cnstl.reshape(256,1)
>
> Change that line to
>
> cnstl = cnstl.ravel()
>
> > X = array([1.4 + 1j*2.7, -4.9 + 1j*8.3])
> > Out = array(X)
> > error = array(X)
> > for i in xrange(2):
> >     for j in xrange(256):
> >         a0 = real(X[i]) < (real(cnstl[j])+1)
> >         a1 = real(X[i]) > (real(cnstl[j])-1)
> >         a2 = imag(X[i]) > (imag(cnstl[j])-1)
> >         a3 = imag(X[i]) < (imag(cnstl[j])+1)
> >         if (a0 & a1 & a2 & a3):
> >             Out[i] = cnstl[j]
> >             error[i] = X[i] - cnstl[j]
>
> After reindenting this correctly, I get the following results:
>
> In [22]: out
> Out[22]: array([ 1.+3.j, -5.+9.j])
>
> In [23]: error
> Out[23]: array([ 0.4-0.3j, 0.1-0.7j])
>
> Are those correct?
>
> --
> Robert Kern
>
> "I have come to believe that the whole world is an enigma, a harmless
> enigma that is made terrible by our own mad attempt to interpret it as
> though it had an underlying truth."
>   -- Umberto Eco
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From robert.kern at gmail.com  Sat Jan 31 00:08:35 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 30 Jan 2009 23:08:35 -0600
Subject: [Numpy-discussion] help on fast slicing on a grid
In-Reply-To: 
References: <49811036.5030701@ar.media.kyoto-u.ac.jp> <3d375d730901282157l4797a93ex293330d5c7d057c0@mail.gmail.com> <3d375d730901282215t169158ccvfc406bc30a9cdd9a@mail.gmail.com> <3d375d730901301252j14628de9ye143b9d1aa3f18b5@mail.gmail.com>
Message-ID: <3d375d730901302108v6ebc3215jd1d7871c43af1e51@mail.gmail.com>

On Fri, Jan 30, 2009 at 22:41, frank wang wrote:
> Thanks for the correction. I will learn the ravel() function since I do not
> know it. Moving from the Matlab world into python is tricky sometimes.
>
> Your output
>
>> In [22]: out
>> Out[22]: array([ 1.+3.j, -5.+9.j])
>>
>> In [23]: error
>> Out[23]: array([ 0.4-0.3j, 0.1-0.7j])
>
> is the correct answer.
>
> However, if my data set is large, this solution takes a long time to run.
> Is there any python/numpy magic to speed it up?

from numpy import *

a = arange(-15,16,2)
cnstl = a[:,newaxis] + 1j*a
cnstl = cnstl.ravel()
X = array([1.4 + 1j*2.7, -3.9 + 1j*8.3])

out = around((X + 1+1j) / 2.0) * 2.0 - (1+1j)
error = X - out

print out
print error

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco
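One caveat worth noting about the one-liner above (a sketch, not from
the thread itself): around() maps any input to the nearest odd-integer
point, including points that lie outside the 256-QAM grid. If X can
fall outside +/-15, clipping each axis keeps the sliced symbol on the
constellation:

import numpy as np

a = np.arange(-15, 16, 2)
X = np.array([1.4 + 2.7j, -3.9 + 8.3j, 17.2 - 16.1j])  # last point is off-grid

out = np.around((X + (1+1j)) / 2.0) * 2.0 - (1+1j)
# clip the real and imaginary parts separately to the grid edges
out = (np.clip(out.real, a.min(), a.max())
       + 1j * np.clip(out.imag, a.min(), a.max()))
error = X - out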
From f.yw at hotmail.com  Sat Jan 31 02:26:25 2009
From: f.yw at hotmail.com (frank wang)
Date: Sat, 31 Jan 2009 00:26:25 -0700
Subject: [Numpy-discussion] help on fast slicing on a grid
In-Reply-To: <3d375d730901302108v6ebc3215jd1d7871c43af1e51@mail.gmail.com>
References: <49811036.5030701@ar.media.kyoto-u.ac.jp> <3d375d730901282157l4797a93ex293330d5c7d057c0@mail.gmail.com> <3d375d730901282215t169158ccvfc406bc30a9cdd9a@mail.gmail.com> <3d375d730901301252j14628de9ye143b9d1aa3f18b5@mail.gmail.com> <3d375d730901302108v6ebc3215jd1d7871c43af1e51@mail.gmail.com>
Message-ID: 

Hi, Bob,

Thanks. This solution works great. It really helps me a lot.

Frank

> Date: Fri, 30 Jan 2009 23:08:35 -0600
> From: robert.kern at gmail.com
> To: numpy-discussion at scipy.org
> Subject: Re: [Numpy-discussion] help on fast slicing on a grid
>
> On Fri, Jan 30, 2009 at 22:41, frank wang wrote:
> > Thanks for the correction. I will learn the ravel() function since I do not
> > know it. Moving from the Matlab world into python is tricky sometimes.
> >
> > Your output
> >
> >> In [22]: out
> >> Out[22]: array([ 1.+3.j, -5.+9.j])
> >>
> >> In [23]: error
> >> Out[23]: array([ 0.4-0.3j, 0.1-0.7j])
> >
> > is the correct answer.
> >
> > However, if my data set is large, this solution takes a long time to run.
> > Is there any python/numpy magic to speed it up?
>
> from numpy import *
>
> a = arange(-15,16,2)
> cnstl = a[:,newaxis] + 1j*a
> cnstl = cnstl.ravel()
> X = array([1.4 + 1j*2.7, -3.9 + 1j*8.3])
>
> out = around((X + 1+1j) / 2.0) * 2.0 - (1+1j)
> error = X - out
>
> print out
> print error
>
> --
> Robert Kern
>
> "I have come to believe that the whole world is an enigma, a harmless
> enigma that is made terrible by our own mad attempt to interpret it as
> though it had an underlying truth."
>   -- Umberto Eco
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion

From sebastian.walter at gmail.com  Sat Jan 31 11:30:58 2009
From: sebastian.walter at gmail.com (Sebastian Walter)
Date: Sat, 31 Jan 2009 17:30:58 +0100
Subject: [Numpy-discussion] using numpy functions on an array of objects
In-Reply-To: <3d375d730901301305r6eb3cf11p5f4e8a0641bbf65c@mail.gmail.com>
References: <49835272.1060704@noaa.gov> <3d375d730901301305r6eb3cf11p5f4e8a0641bbf65c@mail.gmail.com>
Message-ID: 

Wouldn't it be nice to have numpy a little more generic?
All that would be needed is a little check of the arguments.

If I do:
numpy.trace(4)
shouldn't numpy be smart enough to regard the 4 as a 1x1 array?
numpy.sin(4) works!

and if
x = my_class(4)
wouldn't it be nice if
numpy.trace(x)
would call
x.trace() ?
numpy.sin(my_class(4)) works!

Wouldn't it be nice if numpy worked a little more consistently?
Is this worth a ticket? Or am I missing something here?

On Fri, Jan 30, 2009 at 10:05 PM, Robert Kern wrote:
> On Fri, Jan 30, 2009 at 13:18, Christopher Barker wrote:
>> I think you want to subclass an ndarray here. It's a bit tricky to do
>> so, but if you look in the wiki and these mailing list archives,
>> you'll find advice on how to do it.
>
> That still won't work. numpy.linalg.inv() simply does a particular
> algorithm on float and complex arrays and nothing else.
>
> --
> Robert Kern
>
> "I have come to believe that the whole world is an enigma, a harmless
> enigma that is made terrible by our own mad attempt to interpret it as
> though it had an underlying truth."
>   -- Umberto Eco
> _______________________________________________
> Numpy-discussion mailing list
> Numpy-discussion at scipy.org
> http://projects.scipy.org/mailman/listinfo/numpy-discussion
>

From Mike.Colonno at spacex.com  Sat Jan 31 14:14:28 2009
From: Mike.Colonno at spacex.com (Mike Colonno)
Date: Sat, 31 Jan 2009 11:14:28 -0800
Subject: [Numpy-discussion] Building on WinXP 64-bit, Intel Compilers
In-Reply-To: <657E769D35612D4CAB1FEAACB0A92BB503094E5F3C@MAIL2.spacex.corp>
References: <9cca9e840901271705s21803633u67be3ed8bad54835@mail.gmail.com> <497FC559.4000801@ar.media.kyoto-u.ac.jp> <9cca9e840901271931s1706de0egd28382e4125644a@mail.gmail.com> <49801FF8.8040805@ar.media.kyoto-u.ac.jp> <9cca9e840901280818l2106e710q16599bf1423c3caf@mail.gmail.com> <5b8d13220901281636y2fe1b55dt3cb59dd46b43b1d0@mail.gmail.com> <9cca9e840901290757y10350efftfbdb8e4204c16795@mail.gmail.com> <789d27b10901300432q28226381x829575e8b1fa23fe@mail.gmail.com> <9cca9e840901300731y69ee6f49ra758ba4f8ccced69@mail.gmail.com>, <9cca9e840901311048q21afab1cv7351403f8c95f3af@mail.gmail.com>, <657E769D35612D4CAB1FEAACB0A92BB503094E5F3C@MAIL2.spacex.corp>
Message-ID: <657E769D35612D4CAB1FEAACB0A92BB503094E5F3F@MAIL2.spacex.corp>

Maybe I declared success too soon... The build / install of numpy seems
to work (using MS C++ / Intel Fortran), but upon trying to import numpy
I get:

>>> from numpy import *
Traceback (most recent call last):
  File "C:\", line 1, in <module>
  File "C:\Python26\Lib\site-packages\numpy\__init__.py", line 130, in <module>
    import add_newdocs
  File "C:\Python26\Lib\site-packages\numpy\add_newdocs.py", line 9, in <module>
    from lib import add_newdoc
  File "C:\Python26\Lib\site-packages\numpy\lib\__init__.py", line 161, in <module>
    from polynomial import *
  File "C:\Python26\Lib\site-packages\numpy\lib\polynomial.py", line 18, in <module>
    from numpy.linalg import eigvals, lstsq
  File "C:\Python26\Lib\site-packages\numpy\linalg\__init__.py", line 47, in <module>
    from linalg import *
  File "C:\Python26\Lib\site-packages\numpy\linalg\linalg.py", line 22, in <module>
    from numpy.linalg import lapack_lite
ImportError: DLL load failed: Invalid access to memory location.

To make matters more interesting, a second attempt leads to something
different:

>>> from numpy import *
Traceback (most recent call last):
  File "C:\", line 1, in <module>
  File "C:\Python26\Lib\site-packages\numpy\__init__.py", line 130, in <module>
    import add_newdocs
  File "C:\Python26\Lib\site-packages\numpy\add_newdocs.py", line 9, in <module>
    from lib import add_newdoc
  File "C:\Python26\Lib\site-packages\numpy\lib\__init__.py", line 161, in <module>
    from polynomial import *
  File "C:\Python26\Lib\site-packages\numpy\lib\polynomial.py", line 11, in <module>
    import numpy.core.numeric as NX
AttributeError: 'module' object has no attribute 'core'

Let me know if anyone has any thoughts here.

Thanks for the help,

~Mike C.
From jonovik at gmail.com  Sat Jan 31 12:59:47 2009
From: jonovik at gmail.com (Jon Olav Vik)
Date: Sat, 31 Jan 2009 17:59:47 +0000 (UTC)
Subject: [Numpy-discussion] ndarray.resize method and reference counting
References: <6a17e9ee0901130348o4781525ck4fb88072b05d352f@mail.gmail.com> <9457e7c80901130524k17e6015dh83e40319957320cb@mail.gmail.com> <6a17e9ee0901130645n41fcbc4ja11796c63d342685@mail.gmail.com>
Message-ID: 

Scott Sinclair <... at gmail.com> writes:

> >>> import numpy as np
> >>> x = np.eye(3)
> >>> x
> array([[ 1.,  0.,  0.],
>        [ 0.,  1.,  0.],
>        [ 0.,  0.,  1.]])
> >>> x.resize((5,5))
> Traceback (most recent call last):
>   File "<stdin>", line 1, in <module>
> ValueError: cannot resize an array that has been referenced or is referencing
> another array in this way. Use the resize function

I'm having the same problem, and have resigned myself to using
"x = resize(x, newshape)" rather than "x.resize(newshape)", as
suggested in the error message. Anything else becomes almost
impossible to debug. Basically, you must never step into a function
that does x.resize(). Part of the reason is explained here:
http://article.gmane.org/gmane.comp.python.numeric.general/5461

In my experience, reference counting behaves differently between:

* non-interactive running and simple IPython "run" of scripts (works
  as intended)
* interactive entering of commands in IPython (entering "x" at prompt
  "In [n]" defines an alias _n)
* debugging with "run -d" in IPython (symbols in inner scopes seem to
  get extra references to them)

See the example below (using IPython 0.8.1/Python 2.5.2/Numpy
1.1.0/Linux) for illustration.

Hope this helps,
Jon Olav

====
In [10]: cat test.py
import numpy, sys

x = numpy.zeros(3)
print sys.getrefcount(x)
x.resize(2)
x
print sys.getrefcount(x)
x.resize(1)

def test():
    y = numpy.zeros(3)
    print sys.getrefcount(y)
    y.resize(2)
    y
    print sys.getrefcount(y)
    y.resize(1)

test()

In [11]: run test.py
2
2
2
2
=====

Non-interactively, getrefcount() returns 2 on all four occasions.
Entering the same commands interactively, the refcount of x increases
by 4 on entering "x" at the prompt. Inside a function call there is no
problem, though.

=====
In [2]: import numpy, sys

In [3]: x = numpy.zeros(3)

In [4]: print sys.getrefcount(x)
2

In [5]: x.resize(2)

In [6]: x
Out[6]: array([ 0.,  0.])

In [7]: print sys.getrefcount(x)
6

In [8]: x.resize(1)
---------------------------------------------------------------------------
ValueError: cannot resize an array that has been referenced or is referencing
another array in this way. Use the resize function

In [9]: def test():
   ...:     y = numpy.zeros(3)
   ...:     print sys.getrefcount(y)
   ...:     y.resize(2)
   ...:     y
   ...:     print sys.getrefcount(y)
   ...:     y.resize(1)
   ...:

In [10]: test()
2
2
=====

When debugging, however, the x.resize() works fine, while the
y.resize() inside the test() function fails. (A workaround is to step
over the function call, using "n" rather than "s" at line 19.)

====
In [5]: run -d test.py
ipdb> s
1---> 1 import numpy, sys
----> 3 x = numpy.zeros(3)
----> 4 print sys.getrefcount(x)
2
----> 5 x.resize(2)
----> 6 x
----> 7 print sys.getrefcount(x)
2
----> 8 x.resize(1)
---> 10 def test():
---> 19 test()
ipdb> s
---> 10 def test():
---> 11     y = numpy.zeros(3)
---> 12     print sys.getrefcount(y)
3
---> 13     y.resize(2)
ValueError: 'cannot resize an array that has been referenced or is referencing
\nanother array in this way. Use the resize function'
=====
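For completeness, a sketch of the two escape hatches (not from Jon
Olav's message). numpy.resize() returns a fresh array, so the reference
check never fires, but note that it pads by repeating the input
cyclically, whereas the in-place method zero-fills. The in-place method
can also be told to skip the check via refcheck, which is only safe
when nothing else (a debugger frame, an interactive alias) really holds
a reference to the buffer:

import numpy as np

x = np.eye(3)
x = np.resize(x, (5, 5))     # new array; old data repeated to fill 25 slots

y = np.zeros(3)
y.resize(2, refcheck=False)  # in-place, skipping the reference-count check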
From robert.kern at gmail.com  Sat Jan 31 18:24:52 2009
From: robert.kern at gmail.com (Robert Kern)
Date: Sat, 31 Jan 2009 17:24:52 -0600
Subject: [Numpy-discussion] using numpy functions on an array of objects
In-Reply-To: 
References: <49835272.1060704@noaa.gov> <3d375d730901301305r6eb3cf11p5f4e8a0641bbf65c@mail.gmail.com>
Message-ID: <3d375d730901311524l70a05603vd06bea11d30e5675@mail.gmail.com>

On Sat, Jan 31, 2009 at 10:30, Sebastian Walter wrote:
> Wouldn't it be nice to have numpy a little more generic?
> All that would be needed is a little check of the arguments.
>
> If I do:
> numpy.trace(4)
> shouldn't numpy be smart enough to regard the 4 as a 1x1 array?

Why? It's not a 1x1 array. It's a scalar. If you want a 1x1 array,
give it a 1x1 array.

> numpy.sin(4) works!

Yes, numpy.sin() operates on scalars in addition to arrays.

> and if
> x = my_class(4)
>
> wouldn't it be nice if
>
> numpy.trace(x)
> would call
> x.trace() ?
>
> numpy.sin(my_class(4)) works!
>
> Wouldn't it be nice if numpy worked a little more consistently?
> Is this worth a ticket? Or am I missing something here?

numpy.sin() is a ufunc. Unary ufuncs will call the method of the same
name on objects in an object array (or the scalar itself if given an
object scalar). For example:

In [8]: class MyClass(object):
   ...:     def __init__(self, x):
   ...:         self.x = x
   ...:     def __repr__(self):
   ...:         return 'MyClass(%r)' % (self.x,)
   ...:     def sin(self):
   ...:         return MyClass(self.x+1)
   ...:

In [9]: sin(MyClass(4))
Out[9]: MyClass(5)

In [10]: sin([MyClass(4), MyClass(5)])
Out[10]: array([MyClass(5), MyClass(6)], dtype=object)

You'll notice that numpy.sin() does not try to call the list.sin()
method when given the list. It interprets it as an object array, and
calls the MyClass.sin() method on each of the elements.

numpy.trace() is not a unary ufunc. It's just a function that operates
on (N>=2)-D arrays. You simply couldn't apply the same rules as
numpy.sin(). Otherwise, it would try to call the .trace() method on
each of the objects in your container, and obviously you can't
implement trace that way. Having numpy.trace(x) simply call x.trace()
would not be making numpy more consistent.

Now, that said, the implementation of numpy.trace(x, *args) is
actually simply asarray(x).trace(*args). That should probably be
asanyarray(x) in order to allow ndarray subclasses. But this only
works because ndarray.trace() already exists. Making every function in
numpy check for a method first is just not going to happen.

--
Robert Kern

"I have come to believe that the whole world is an enigma, a harmless
enigma that is made terrible by our own mad attempt to interpret it as
though it had an underlying truth."
  -- Umberto Eco
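A sketch of the asanyarray() variant Robert suggests (illustrative
only, not what numpy shipped at the time):

import numpy as np

def trace(a, offset=0, axis1=0, axis2=1, dtype=None, out=None):
    # asanyarray() passes ndarray subclasses through unchanged, so a
    # subclass that overrides .trace() is dispatched to; plain nested
    # lists are still converted and use the ndarray method
    return np.asanyarray(a).trace(offset, axis1, axis2, dtype, out)

# trace([[1, 2], [3, 4]]) -> 5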