From cournape at gmail.com Fri Aug 1 01:46:06 2014
From: cournape at gmail.com (David Cournapeau)
Date: Fri, 1 Aug 2014 14:46:06 +0900
Subject: [Numpy-discussion] Remove numpy/compat/_inspect.py ?
In-Reply-To: References: Message-ID: 

The docstring at the beginning of the module is still relevant AFAIK: it
was about decreasing import times. See
http://mail.scipy.org/pipermail/numpy-discussion/2009-October/045981.html

On Fri, Aug 1, 2014 at 10:27 AM, Charles R Harris wrote:

> Hi All,
>
> The _inspect.py module looks like a numpy version of the python inspect
> module. ISTR that it was a workaround for problems with the early python
> versions, but that would have been back in 2009.
>
> Thoughts?
>
> Chuck
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From olivier.grisel at ensta.org Fri Aug 1 04:20:29 2014
From: olivier.grisel at ensta.org (Olivier Grisel)
Date: Fri, 1 Aug 2014 10:20:29 +0200
Subject: [Numpy-discussion] OSX wheels for older numpy versions on pypi
In-Reply-To: References: Message-ID: 

2014-07-31 22:40 GMT+02:00 Matthew Brett :
>
> Sure, I built and uploaded:
>
> scipy-0.12.0 py27
> scipy-0.13.0 py27, 33, 34
>
> Are there any others you need?

Thanks, this is already great.

--
Olivier
http://twitter.com/ogrisel - http://github.com/ogrisel

From charlesr.harris at gmail.com Fri Aug 1 07:57:47 2014
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 1 Aug 2014 05:57:47 -0600
Subject: [Numpy-discussion] Remove numpy/compat/_inspect.py ?
In-Reply-To: References: Message-ID: 

On Thu, Jul 31, 2014 at 11:46 PM, David Cournapeau wrote:

> The docstring at the beginning of the module is still relevant AFAIK: it
> was about decreasing import times. See
> http://mail.scipy.org/pipermail/numpy-discussion/2009-October/045981.html
>
>
> On Fri, Aug 1, 2014 at 10:27 AM, Charles R Harris <
> charlesr.harris at gmail.com> wrote:
>
>> Hi All,
>>
>> The _inspect.py module looks like a numpy version of the python inspect
>> module. ISTR that it was a workaround for problems with the early python
>> versions, but that would have been back in 2009.
>>
>> Thoughts?
>>
>>
It's only used in one function.

def get_object_signature(obj):
    """
    Get the signature from obj
    """
    try:
        sig = formatargspec(*getargspec(obj))
    except TypeError as errmsg:
        sig = ''
#         msg = "Unable to retrieve the signature of %s '%s'\n"\
#               "(Initial error message: %s)"
#         warnings.warn(msg % (type(obj),
#                              getattr(obj, '__name__', '???'),
#                              errmsg))
    return sig

Where a local import would do as well. It also has bugs, so evidently isn't
called often ;)

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From robert.kern at gmail.com Fri Aug 1 08:37:51 2014
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 1 Aug 2014 13:37:51 +0100
Subject: [Numpy-discussion] Remove numpy/compat/_inspect.py ?
In-Reply-To: References: Message-ID: 

On Fri, Aug 1, 2014 at 12:57 PM, Charles R Harris wrote:
>
> On Thu, Jul 31, 2014 at 11:46 PM, David Cournapeau
> wrote:
>>
>> The docstring at the beginning of the module is still relevant AFAIK: it
>> was about decreasing import times.
See
>> http://mail.scipy.org/pipermail/numpy-discussion/2009-October/045981.html
>>
>>
>> On Fri, Aug 1, 2014 at 10:27 AM, Charles R Harris
>> wrote:
>>>
>>> Hi All,
>>>
>>> The _inspect.py module looks like a numpy version of the python inspect
>>> module. ISTR that it was a workaround for problems with the early python
>>> versions, but that would have been back in 2009.
>>>
>>> Thoughts?
>>>
>
> It's only used in one function.

Yes, one function that is called at startup, so no, a local import of
the stdlib inspect module would not help.

> def get_object_signature(obj):
>     """
>     Get the signature from obj
>     """
>     try:
>         sig = formatargspec(*getargspec(obj))
>     except TypeError as errmsg:
>         sig = ''
> #         msg = "Unable to retrieve the signature of %s '%s'\n"\
> #               "(Initial error message: %s)"
> #         warnings.warn(msg % (type(obj),
> #                              getattr(obj, '__name__', '???'),
> #                              errmsg))
>     return sig
>
> Where a local import would do as well. It also has bugs, so evidently isn't
> called often ;)

What bugs? Any bugs relevant to the objects that
get_object_signature() is called with? It does not have to work for
anything else but those.

--
Robert Kern

From charlesr.harris at gmail.com Fri Aug 1 09:03:38 2014
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 1 Aug 2014 07:03:38 -0600
Subject: [Numpy-discussion] Remove numpy/compat/_inspect.py ?
In-Reply-To: References: Message-ID: 

On Fri, Aug 1, 2014 at 6:37 AM, Robert Kern wrote:

> On Fri, Aug 1, 2014 at 12:57 PM, Charles R Harris
> wrote:
> >
> > On Thu, Jul 31, 2014 at 11:46 PM, David Cournapeau
> > wrote:
> >>
> >> The docstring at the beginning of the module is still relevant AFAIK: it
> >> was about decreasing import times. See
> >>
> http://mail.scipy.org/pipermail/numpy-discussion/2009-October/045981.html
> >>
> >>
> >> On Fri, Aug 1, 2014 at 10:27 AM, Charles R Harris
> >> wrote:
> >>>
> >>> Hi All,
> >>>
> >>> The _inspect.py module looks like a numpy version of the python
> inspect
> >>> module. ISTR that it was a workaround for problems with the early
> python
> >>> versions, but that would have been back in 2009.
> >>>
> >>> Thoughts?
> >>>
> >
> > It's only used in one function.
>
> Yes, one function that is called at startup, so no, a local import of
> the stdlib inspect module would not help.
>
> > def get_object_signature(obj):
> >     """
> >     Get the signature from obj
> >     """
> >     try:
> >         sig = formatargspec(*getargspec(obj))
> >     except TypeError as errmsg:
> >         sig = ''
> > #         msg = "Unable to retrieve the signature of %s '%s'\n"\
> > #               "(Initial error message: %s)"
> > #         warnings.warn(msg % (type(obj),
> > #                              getattr(obj, '__name__', '???'),
> > #                              errmsg))
> >     return sig
> >
> > Where a local import would do as well. It also has bugs, so evidently
> isn't
> > called often ;)
>
> What bugs? Any bugs relevant to the objects that
> get_object_signature() is called with? It does not have to work for
> anything else but those.
>

Undefined variables in getargs. The only two functions used from the
module are very small and could simply be brought into `ma/core.py`. The
python inspect module is used elsewhere...

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From charlesr.harris at gmail.com Fri Aug 1 09:54:06 2014
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 1 Aug 2014 07:54:06 -0600
Subject: [Numpy-discussion] Remove numpy/compat/_inspect.py ?
In-Reply-To: References: Message-ID: 

On Fri, Aug 1, 2014 at 7:03 AM, Charles R Harris wrote:

>
>
>
> On Fri, Aug 1, 2014 at 6:37 AM, Robert Kern wrote:
>
>> On Fri, Aug 1, 2014 at 12:57 PM, Charles R Harris
>> wrote:
>> >
>> > On Thu, Jul 31, 2014 at 11:46 PM, David Cournapeau
>> > wrote:
>> >>
>> >> The docstring at the beginning of the module is still relevant AFAIK:
>> it
>> >> was about decreasing import times. See
>> >>
>> http://mail.scipy.org/pipermail/numpy-discussion/2009-October/045981.html
>> >>
>> >>
>> >> On Fri, Aug 1, 2014 at 10:27 AM, Charles R Harris
>> >> wrote:
>> >>>
>> >>> Hi All,
>> >>>
>> >>> The _inspect.py module looks like a numpy version of the python
>> inspect
>> >>> module. ISTR that it was a workaround for problems with the early
>> python
>> >>> versions, but that would have been back in 2009.
>> >>>
>> >>> Thoughts?
>> >>>
>> >
>> > It's only used in one function.
>>
>> Yes, one function that is called at startup, so no, a local import of
>> the stdlib inspect module would not help.
>>
>> > def get_object_signature(obj):
>> >     """
>> >     Get the signature from obj
>> >     """
>> >     try:
>> >         sig = formatargspec(*getargspec(obj))
>> >     except TypeError as errmsg:
>> >         sig = ''
>> > #         msg = "Unable to retrieve the signature of %s '%s'\n"\
>> > #               "(Initial error message: %s)"
>> > #         warnings.warn(msg % (type(obj),
>> > #                              getattr(obj, '__name__', '???'),
>> > #                              errmsg))
>> >     return sig
>> >
>> > Where a local import would do as well. It also has bugs, so evidently
>> isn't
>> > called often ;)
>>
>> What bugs? Any bugs relevant to the objects that
>> get_object_signature() is called with? It does not have to work for
>> anything else but those.
>>
>
> Undefined variables in getargs. The only two functions used from the
> module are very small and could simply be brought into `ma/core.py`. The
> python inspect module is used elsewhere...
>

Importing inspect looks to take about 500 ns on my machine, although it is
hard to be exact, as I suspect the file is sitting in the file cache. Would
probably be slower with hard disks. But as the inspect module is already
imported elsewhere, the python interpreter should also have it cached.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From robert.kern at gmail.com Fri Aug 1 09:59:53 2014
From: robert.kern at gmail.com (Robert Kern)
Date: Fri, 1 Aug 2014 14:59:53 +0100
Subject: [Numpy-discussion] Remove numpy/compat/_inspect.py ?
In-Reply-To: References: Message-ID: 

On Fri, Aug 1, 2014 at 2:54 PM, Charles R Harris wrote:

> Importing inspect looks to take about 500 ns on my machine, although it is
> hard to be exact, as I suspect the file is sitting in the file cache. Would
> probably be slower with hard disks.

Or where site-packages is on NFS.

> But as the inspect module is already
> imported elsewhere, the python interpreter should also have it cached.

Not on a normal import it's not.

>>> import numpy
>>> import sys
>>> sys.modules['inspect']
Traceback (most recent call last):
File "", line 1, in
KeyError: 'inspect'

You should feel free to remove whatever parts of `_inspect` are not
being used and to move the parts that are closer to where they are
used if you feel compelled to. Please do not replace the current uses
of `_inspect` with `inspect`.

--
Robert Kern

From charlesr.harris at gmail.com Fri Aug 1 10:23:39 2014
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 1 Aug 2014 08:23:39 -0600
Subject: [Numpy-discussion] Remove numpy/compat/_inspect.py ?
In-Reply-To: References: Message-ID: On Fri, Aug 1, 2014 at 7:59 AM, Robert Kern wrote: > On Fri, Aug 1, 2014 at 2:54 PM, Charles R Harris > wrote: > > > Importing inspect looks to take about 500 ns on my machine. Although It > is > > hard to be exact, as I suspect the file is sitting in the file cache. > Would > > probably be slower with hard disks. > > Or where site-packages is on NFS. > > > But as the inspect module is already > > imported elsewhere, the python interpreter should also have it cached. > > Not on a normal import it's not. > > >>> import numpy > >>> import sys > >>> sys.modules['inspect'] > Traceback (most recent call last): > File "", line 1, in > KeyError: 'inspect' > There are two lazy imports of inspect. > > You should feel free to remove whatever parts of `_inspect` are not > being used and to move the parts that are closer to where they are > used if you feel compelled to. Please do not replace the current uses > of `_inspect` with `inspect`. > It is used in just one place. Is importing inspect so much slower than all the other imports we do? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Fri Aug 1 10:29:01 2014 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 1 Aug 2014 15:29:01 +0100 Subject: [Numpy-discussion] Remove numpy/compat/_inspect.py ? In-Reply-To: References: Message-ID: On Fri, Aug 1, 2014 at 3:23 PM, Charles R Harris wrote: > > On Fri, Aug 1, 2014 at 7:59 AM, Robert Kern wrote: >> >> On Fri, Aug 1, 2014 at 2:54 PM, Charles R Harris >> wrote: >> >> > Importing inspect looks to take about 500 ns on my machine. Although It >> > is >> > hard to be exact, as I suspect the file is sitting in the file cache. >> > Would >> > probably be slower with hard disks. >> >> Or where site-packages is on NFS. >> >> > But as the inspect module is already >> > imported elsewhere, the python interpreter should also have it cached. >> >> Not on a normal import it's not. >> >> >>> import numpy >> >>> import sys >> >>> sys.modules['inspect'] >> Traceback (most recent call last): >> File "", line 1, in >> KeyError: 'inspect' > > There are two lazy imports of inspect. Sure, but get_object_signature() is called unlazily when numpy is imported. >> You should feel free to remove whatever parts of `_inspect` are not >> being used and to move the parts that are closer to where they are >> used if you feel compelled to. Please do not replace the current uses >> of `_inspect` with `inspect`. > > It is used in just one place. So? That one place is always called whenever numpy is imported. > Is importing inspect so much slower than all > the other imports we do? Yeah, it's pretty bad. -- Robert Kern From charlesr.harris at gmail.com Fri Aug 1 11:23:51 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 1 Aug 2014 09:23:51 -0600 Subject: [Numpy-discussion] Remove numpy/compat/_inspect.py ? In-Reply-To: References: Message-ID: On Fri, Aug 1, 2014 at 8:29 AM, Robert Kern wrote: > On Fri, Aug 1, 2014 at 3:23 PM, Charles R Harris > wrote: > > > > On Fri, Aug 1, 2014 at 7:59 AM, Robert Kern > wrote: > >> > >> On Fri, Aug 1, 2014 at 2:54 PM, Charles R Harris > >> wrote: > >> > >> > Importing inspect looks to take about 500 ns on my machine. Although > It > >> > is > >> > hard to be exact, as I suspect the file is sitting in the file cache. > >> > Would > >> > probably be slower with hard disks. > >> > >> Or where site-packages is on NFS. 
> >> > >> > But as the inspect module is already > >> > imported elsewhere, the python interpreter should also have it cached. > >> > >> Not on a normal import it's not. > >> > >> >>> import numpy > >> >>> import sys > >> >>> sys.modules['inspect'] > >> Traceback (most recent call last): > >> File "", line 1, in > >> KeyError: 'inspect' > > > > There are two lazy imports of inspect. > > Sure, but get_object_signature() is called unlazily when numpy is imported. > > >> You should feel free to remove whatever parts of `_inspect` are not > >> being used and to move the parts that are closer to where they are > >> used if you feel compelled to. Please do not replace the current uses > >> of `_inspect` with `inspect`. > > > > It is used in just one place. > > So? That one place is always called whenever numpy is imported. > > > Is importing inspect so much slower than all > > the other imports we do? > > Yeah, it's pretty bad. > > The buggy code is for tuple parameter unpacking, a path that is not exercised and a feature not in python 3. So... is it safe to excise that nasty bit of code, or does Enthought make use of the numpy _inspect module? The other (fixable) error is in formatargvalues, which is not in __all__ and not used as far as I can tell. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From robert.kern at gmail.com Fri Aug 1 11:29:24 2014 From: robert.kern at gmail.com (Robert Kern) Date: Fri, 1 Aug 2014 16:29:24 +0100 Subject: [Numpy-discussion] Remove numpy/compat/_inspect.py ? In-Reply-To: References: Message-ID: On Fri, Aug 1, 2014 at 4:23 PM, Charles R Harris wrote: > > On Fri, Aug 1, 2014 at 8:29 AM, Robert Kern wrote: >> >> On Fri, Aug 1, 2014 at 3:23 PM, Charles R Harris >> wrote: >> > >> > On Fri, Aug 1, 2014 at 7:59 AM, Robert Kern >> > wrote: >> >> >> >> On Fri, Aug 1, 2014 at 2:54 PM, Charles R Harris >> >> wrote: >> >> >> >> > Importing inspect looks to take about 500 ns on my machine. Although >> >> > It >> >> > is >> >> > hard to be exact, as I suspect the file is sitting in the file cache. >> >> > Would >> >> > probably be slower with hard disks. >> >> >> >> Or where site-packages is on NFS. >> >> >> >> > But as the inspect module is already >> >> > imported elsewhere, the python interpreter should also have it >> >> > cached. >> >> >> >> Not on a normal import it's not. >> >> >> >> >>> import numpy >> >> >>> import sys >> >> >>> sys.modules['inspect'] >> >> Traceback (most recent call last): >> >> File "", line 1, in >> >> KeyError: 'inspect' >> > >> > There are two lazy imports of inspect. >> >> Sure, but get_object_signature() is called unlazily when numpy is >> imported. >> >> >> You should feel free to remove whatever parts of `_inspect` are not >> >> being used and to move the parts that are closer to where they are >> >> used if you feel compelled to. Please do not replace the current uses >> >> of `_inspect` with `inspect`. >> > >> > It is used in just one place. >> >> So? That one place is always called whenever numpy is imported. >> >> > Is importing inspect so much slower than all >> > the other imports we do? >> >> Yeah, it's pretty bad. >> > > The buggy code is for tuple parameter unpacking, a path that is not > exercised and a feature not in python 3. So... is it safe to excise that > nasty bit of code, "You should feel free to remove whatever parts of `_inspect` are not being used." > or does Enthought make use of the numpy _inspect module? No, of course not. It's _private for a reason. 
> The other (fixable) error is in formatargvalues, which is not in __all__ and > not used as far as I can tell. -- Robert Kern From charlesr.harris at gmail.com Fri Aug 1 11:35:55 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 1 Aug 2014 09:35:55 -0600 Subject: [Numpy-discussion] Remove numpy/compat/_inspect.py ? In-Reply-To: References: Message-ID: On Fri, Aug 1, 2014 at 9:29 AM, Robert Kern wrote: > On Fri, Aug 1, 2014 at 4:23 PM, Charles R Harris > wrote: > > > > On Fri, Aug 1, 2014 at 8:29 AM, Robert Kern > wrote: > >> > >> On Fri, Aug 1, 2014 at 3:23 PM, Charles R Harris > >> wrote: > >> > > >> > On Fri, Aug 1, 2014 at 7:59 AM, Robert Kern > >> > wrote: > >> >> > >> >> On Fri, Aug 1, 2014 at 2:54 PM, Charles R Harris > >> >> wrote: > >> >> > >> >> > Importing inspect looks to take about 500 ns on my machine. > Although > >> >> > It > >> >> > is > >> >> > hard to be exact, as I suspect the file is sitting in the file > cache. > >> >> > Would > >> >> > probably be slower with hard disks. > >> >> > >> >> Or where site-packages is on NFS. > >> >> > >> >> > But as the inspect module is already > >> >> > imported elsewhere, the python interpreter should also have it > >> >> > cached. > >> >> > >> >> Not on a normal import it's not. > >> >> > >> >> >>> import numpy > >> >> >>> import sys > >> >> >>> sys.modules['inspect'] > >> >> Traceback (most recent call last): > >> >> File "", line 1, in > >> >> KeyError: 'inspect' > >> > > >> > There are two lazy imports of inspect. > >> > >> Sure, but get_object_signature() is called unlazily when numpy is > >> imported. > >> > >> >> You should feel free to remove whatever parts of `_inspect` are not > >> >> being used and to move the parts that are closer to where they are > >> >> used if you feel compelled to. Please do not replace the current uses > >> >> of `_inspect` with `inspect`. > >> > > >> > It is used in just one place. > >> > >> So? That one place is always called whenever numpy is imported. > >> > >> > Is importing inspect so much slower than all > >> > the other imports we do? > >> > >> Yeah, it's pretty bad. > >> > > > > The buggy code is for tuple parameter unpacking, a path that is not > > exercised and a feature not in python 3. So... is it safe to excise that > > nasty bit of code, > > "You should feel free to remove whatever parts of `_inspect` are not > being used." > > > or does Enthought make use of the numpy _inspect module? > > No, of course not. It's _private for a reason. > > > The other (fixable) error is in formatargvalues, which is not in __all__ > and > > not used as far as I can tell. > > There is a missing import of the disassembler, `dis`, which I suspect would add substantially to the import time. So it looks like the easy path is to excise the code. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From cournape at gmail.com Fri Aug 1 16:11:49 2014 From: cournape at gmail.com (David Cournapeau) Date: Sat, 2 Aug 2014 05:11:49 +0900 Subject: [Numpy-discussion] Remove numpy/compat/_inspect.py ? In-Reply-To: References: Message-ID: On Fri, Aug 1, 2014 at 11:23 PM, Charles R Harris wrote: > > > > On Fri, Aug 1, 2014 at 7:59 AM, Robert Kern wrote: > >> On Fri, Aug 1, 2014 at 2:54 PM, Charles R Harris >> wrote: >> >> > Importing inspect looks to take about 500 ns on my machine. Although >> It is >> > hard to be exact, as I suspect the file is sitting in the file cache. >> Would >> > probably be slower with hard disks. >> >> Or where site-packages is on NFS. 
>> >> > But as the inspect module is already
>> > imported elsewhere, the python interpreter should also have it cached.
>>
>> Not on a normal import it's not.
>>
>> >>> import numpy
>> >>> import sys
>> >>> sys.modules['inspect']
>> Traceback (most recent call last):
>> File "", line 1, in
>> KeyError: 'inspect'
>>
>
> There are two lazy imports of inspect.

Sure, but get_object_signature() is called unlazily when numpy is imported.

>> You should feel free to remove whatever parts of `_inspect` are not
>> being used and to move the parts that are closer to where they are
>> used if you feel compelled to. Please do not replace the current uses
>> of `_inspect` with `inspect`.
>
> It is used in just one place.

So? That one place is always called whenever numpy is imported.

> Is importing inspect so much slower than all
> the other imports we do?

Yeah, it's pretty bad.

-- 
Robert Kern

From charlesr.harris at gmail.com Fri Aug 1 11:23:51 2014
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 1 Aug 2014 09:23:51 -0600
Subject: [Numpy-discussion] Remove numpy/compat/_inspect.py ?
In-Reply-To: References: Message-ID: 

On Fri, Aug 1, 2014 at 8:29 AM, Robert Kern wrote:

> On Fri, Aug 1, 2014 at 3:23 PM, Charles R Harris
> wrote:
> >
> > On Fri, Aug 1, 2014 at 7:59 AM, Robert Kern
> > wrote:
> >>
> >> On Fri, Aug 1, 2014 at 2:54 PM, Charles R Harris
> >> wrote:
> >>
> >> > Importing inspect looks to take about 500 ns on my machine, although
> >> > it is
> >> > hard to be exact, as I suspect the file is sitting in the file cache.
> >> > Would
> >> > probably be slower with hard disks.
> >>
> >> Or where site-packages is on NFS.
> So the hack certainly still makes sense; one just needs to fix whatever
> needs fixing (I am still not sure what's broken for the very specific
> use case that code was bundled for).
>
>
I'm not sure a one-time hit of 17 ms is worth fighting for ;) The problems
were that both the `string` and `dis` modules were used without importing
them. Evidently those code paths were never traversed, so I removed the
code using `dis` and raised an error there instead; it was for parsing
tuple arguments. The string.join was fixed using the string method.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From cournape at gmail.com Fri Aug 1 22:22:19 2014
From: cournape at gmail.com (David Cournapeau)
Date: Sat, 2 Aug 2014 11:22:19 +0900
Subject: [Numpy-discussion] Remove numpy/compat/_inspect.py ?
In-Reply-To: References: Message-ID: 

On Sat, Aug 2, 2014 at 11:17 AM, Charles R Harris wrote:

>
>
>
> On Fri, Aug 1, 2014 at 8:01 PM, David Cournapeau
> wrote:
>
>> On my machine, if I use inspect instead of _inspect in
>> numpy.compat.__init__, the import time increases ~ 25 % (from 82 ms to 99
>> ms).
>>
>> So the hack certainly still makes sense; one just needs to fix whatever
>> needs fixing (I am still not sure what's broken for the very specific
>> use case that code was bundled for).
>>
>>
> I'm not sure a one-time hit of 17 ms is worth fighting for ;) The problems
> were that both the `string` and `dis` modules were used without importing
> them.
>

Don't fix what ain't broken ;)

The 17 ms is not what matters, the % is. People regularly complain about
import times, and a 25 % increase in import time is significant (the above
timings are on my new macbook with SSD and 16 Gb RAM -- figures will easily
be an order of magnitude worse in common situations with slower computers,
slower HDD, NFS, etc...)

David

From charlesr.harris at gmail.com Fri Aug 1 22:36:52 2014
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 1 Aug 2014 20:36:52 -0600
Subject: [Numpy-discussion] Remove numpy/compat/_inspect.py ?
In-Reply-To: References: Message-ID: 

On Fri, Aug 1, 2014 at 8:22 PM, David Cournapeau wrote:

>
>
>
> On Sat, Aug 2, 2014 at 11:17 AM, Charles R Harris <
> charlesr.harris at gmail.com> wrote:
>
>>
>>
>>
>> On Fri, Aug 1, 2014 at 8:01 PM, David Cournapeau
>> wrote:
>>
>>> On my machine, if I use inspect instead of _inspect in
>>> numpy.compat.__init__, the import time increases ~ 25 % (from 82 ms to 99
>>> ms).
>>>
>>> So the hack certainly still makes sense; one just needs to fix whatever
>>> needs fixing (I am still not sure what's broken for the very specific
>>> use case that code was bundled for).
>>>
>>>
>> I'm not sure a one-time hit of 17 ms is worth fighting for ;) The
>> problems were that both the `string` and `dis` modules were used without
>> importing them.
>>
>
> Don't fix what ain't broken ;)
>
> The 17 ms is not what matters, the % is. People regularly complain about
> import times, and a 25 % increase in import time is significant (the above
> timings are on my new macbook with SSD and 16 Gb RAM -- figures will easily
> be an order of magnitude worse in common situations with slower computers,
> slower HDD, NFS, etc...)
>

Be interesting to compare times. Could you send along the code you used?
My machine is similar except it is a desktop with 2 SSDs in raid 0.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From cournape at gmail.com Sat Aug 2 00:42:38 2014
From: cournape at gmail.com (David Cournapeau)
Date: Sat, 2 Aug 2014 13:42:38 +0900
Subject: [Numpy-discussion] Remove numpy/compat/_inspect.py ?
In-Reply-To: References: Message-ID: 

On Sat, Aug 2, 2014 at 11:36 AM, Charles R Harris wrote:

>
>
>
> On Fri, Aug 1, 2014 at 8:22 PM, David Cournapeau
> wrote:
>
>>
>>
>>
>> On Sat, Aug 2, 2014 at 11:17 AM, Charles R Harris <
>> charlesr.harris at gmail.com> wrote:
>>
>>>
>>>
>>>
>>> On Fri, Aug 1, 2014 at 8:01 PM, David Cournapeau
>>> wrote:
>>>
>>>> On my machine, if I use inspect instead of _inspect in
>>>> numpy.compat.__init__, the import time increases ~ 25 % (from 82 ms to 99
>>>> ms).
>>>>
>>>> So the hack certainly still makes sense; one just needs to fix whatever
>>>> needs fixing (I am still not sure what's broken for the very specific
>>>> use case that code was bundled for).
>>>>
>>>>
>>> I'm not sure a one-time hit of 17 ms is worth fighting for ;) The
>>> problems were that both the `string` and `dis` modules were used without
>>> importing them.
>>>
>>
>> Don't fix what ain't broken ;)
>>
>> The 17 ms is not what matters, the % is. People regularly complain about
>> import times, and a 25 % increase in import time is significant (the above
>> timings are on my new macbook with SSD and 16 Gb RAM -- figures will easily
>> be an order of magnitude worse in common situations with slower computers,
>> slower HDD, NFS, etc...)
>>
>
> Be interesting to compare times. Could you send along the code you used?
> My machine is similar except it is a desktop with 2 SSDs in raid 0.
>

I just hacked numpy.compat.__init__ to use inspect instead of _inspect:

diff --git a/numpy/compat/__init__.py b/numpy/compat/__init__.py
index 5b371f5..57f6d7f 100644
--- a/numpy/compat/__init__.py
+++ b/numpy/compat/__init__.py
@@ -10,11 +10,11 @@ extensions, which may be included for the following
reasons:
 """
 from __future__ import division, absolute_import, print_function

-from . import _inspect
+import inspect as _inspect
 from . import py3k
-from ._inspect import getargspec, formatargspec
+from inspect import getargspec, formatargspec
 from .py3k import *

 __all__ = []
-__all__.extend(_inspect.__all__)
+__all__.extend(["getargspec", "formatargspec"])
 __all__.extend(py3k.__all__)

David
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From charlesr.harris at gmail.com Sat Aug 2 11:18:05 2014
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Sat, 2 Aug 2014 09:18:05 -0600
Subject: [Numpy-discussion] Remove numpy/compat/_inspect.py ?
In-Reply-To: References: Message-ID: 

On Fri, Aug 1, 2014 at 10:42 PM, David Cournapeau wrote:

>
>
>
> On Sat, Aug 2, 2014 at 11:36 AM, Charles R Harris <
> charlesr.harris at gmail.com> wrote:
>
>>
>>
>>
>> On Fri, Aug 1, 2014 at 8:22 PM, David Cournapeau
>> wrote:
>>
>>>
>>>
>>>
>>> On Sat, Aug 2, 2014 at 11:17 AM, Charles R Harris <
>>> charlesr.harris at gmail.com> wrote:
>>>
>>>>
>>>>
>>>>
>>>> On Fri, Aug 1, 2014 at 8:01 PM, David Cournapeau
>>>> wrote:
>>>>
>>>>> On my machine, if I use inspect instead of _inspect in
>>>>> numpy.compat.__init__, the import time increases ~ 25 % (from 82 ms to 99
>>>>> ms).
>>>>>
>>>>> So the hack certainly still makes sense; one just needs to fix whatever
>>>>> needs fixing (I am still not sure what's broken for the very specific
>>>>> use case that code was bundled for).
>>>>> >>>>> >>>> I'm not sure a one time hit of 17 ms is worth fighting for ;) The >>>> problems were that both the `string` and `dis` modules were used without >>>> importing them. >>>> >>> >>> Don't fix what ain't broken ;) >>> >>> The 17 ms is not what matters, the % is. People regularly complain about >>> import times, and 25 % increase in import time is significant (the above >>> timing are on my new macbook with SSD and 16 Gb RAM -- figures will easily >>> be 1 order of magnitude worse in common situations with slower computers, >>> slower HDD, NFS, etc...) >>> >> >> Be interesting to compare times. Could you send along the code you used? >> My machine is similar except it is a desktop with 2 SSDs in raid 0. >> > > I just hacked numpy.lib.__init__ to use inspect instead of _inspect: > > diff --git a/numpy/compat/__init__.py b/numpy/compat/__init__.py > index 5b371f5..57f6d7f 100644 > --- a/numpy/compat/__init__.py > +++ b/numpy/compat/__init__.py > @@ -10,11 +10,11 @@ extensions, which may be included for the following > reasons: > """ > from __future__ import division, absolute_import, print_function > > -from . import _inspect > +import inspect as _inspect > from . import py3k > -from ._inspect import getargspec, formatargspec > +from inspect import getargspec, formatargspec > from .py3k import * > > __all__ = [] > -__all__.extend(_inspect.__all__) > +__all__.extend(["getargspec", "formatargspec"]) > __all__.extend(py3k.__all__) > > I was more interested in how you timed it. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Sat Aug 2 13:14:19 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 2 Aug 2014 11:14:19 -0600 Subject: [Numpy-discussion] It lives! Or at least is not undead Message-ID: charris at localhost [matmul (master)]$ python3.5 Python 3.5.0a0 (default:4425024f2e01, Aug 2 2014, 10:10:31) [GCC 4.8.3 20140624 (Red Hat 4.8.3-1)] on linux Type "help", "copyright", "credits" or "license" for more information. >>> import numpy as np >>> import testing >>> a = np.ones(3).view(testing.marray) >>> a at 3 marray([ 3., 3., 3.]) >>> 3 at a marray([ 3., 3., 3.]) >>> a at np.eye(3) marray([ 1., 1., 1.]) >>> np.eye(3)@a marray([ 1., 1., 1.]) This was just for quick experimentation. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Sat Aug 2 17:33:33 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 2 Aug 2014 15:33:33 -0600 Subject: [Numpy-discussion] Class to experiment with '@' Message-ID: Hi All, I've attached a subclass of ndarray that implements the new '@' operator for experimentation and comment. It is only intended for playing with that operator and may not work for other things. You will need to install python 3.5.0a1 to play with it. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: matmul.py Type: text/x-python Size: 955 bytes Desc: not available URL: From charlesr.harris at gmail.com Sun Aug 3 15:26:18 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 3 Aug 2014 13:26:18 -0600 Subject: [Numpy-discussion] Class to experiment with '@' In-Reply-To: References: Message-ID: Oops, corrected version attached. I think we need generalized functions in linalg for mat x mat, mat x vec, and vec x mat, and versions that also work for object arrays. 
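In einsum terms, the three products would look something like the sketch
below (a rough sketch with made-up helper names, for illustration only,
not the contents of the attached file):

import numpy as np

def matmat(a, b):
    # matrix @ matrix: (..., i, j) x (..., j, k) -> (..., i, k)
    return np.einsum('...ij,...jk->...ik', a, b)

def matvec(a, b):
    # matrix @ vector: (..., i, j) x (..., j) -> (..., i)
    return np.einsum('...ij,...j->...i', a, b)

def vecmat(a, b):
    # vector @ matrix: (..., i) x (..., i, j) -> (..., j)
    return np.einsum('...i,...ij->...j', a, b)
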
I've used einsum in the attached file for the products, but it doesn't
work for object arrays.

This work is mainly to have a prototype version for the matmul operator
that can be used to verify behavior and write some tests.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: matmul.py
Type: text/x-python
Size: 929 bytes
Desc: not available
URL: 

From charlesr.harris at gmail.com Sun Aug 3 17:48:06 2014
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Sun, 3 Aug 2014 15:48:06 -0600
Subject: [Numpy-discussion] Class to experiment with '@'
In-Reply-To: References: Message-ID: 

On Sun, Aug 3, 2014 at 1:26 PM, Charles R Harris wrote:

>
>
> Oops, corrected version attached. I think we need generalized functions in
> linalg for mat x mat, mat x vec, and vec x mat, and versions that also work
> for object arrays. I've used einsum in the attached file for the products,
> but it doesn't work for object arrays.
>
> This work is mainly to have a prototype version for the matmul operator
> that can be used to verify behavior and write some tests.
>

Another fix :(

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: matmul.py
Type: text/x-python
Size: 937 bytes
Desc: not available
URL: 

From t.b.poole at gmail.com Mon Aug 4 11:47:50 2014
From: t.b.poole at gmail.com (tpoole)
Date: Mon, 4 Aug 2014 08:58... (PDT)
Subject: [Numpy-discussion] Requesting Code Review of weighted covariance ENH
Message-ID: <1407167270713-38292.post@n7.nabble.com>

Hi everyone,

I've added the ability to handle weighted data in a covariance calculation,
in a similar manner to that already implemented for the calculation of a
weighted average.

https://github.com/tpoole/numpy/compare/weighted_cov

Could an experienced someone please look over my changes before I submit a
pull request?

Validation of the formula can be found in "Exponential smoothing weighted
correlations", F. Pozzi, T. Matteo, and T. Aste, Eur. Phys. J. B 85, 175
(2012), though it is unfortunately paywalled, and in "An Analysis of
WinCross, SPSS, and Mentor Procedures for Estimating the Variance of a
Weighted Mean", A. Madansky and H. G. B. Alexander,
www.analyticalgroup.com/download/weighted_variance.pdf for the "effective
number of samples".

Thanks,

Tom

--
View this message in context: http://numpy-discussion.10968.n7.nabble.com/Requesting-Code-Review-of-weighted-covariance-ENH-tp38292.html
Sent from the Numpy-discussion mailing list archive at Nabble.com.

From jtaylor.debian at googlemail.com Mon Aug 4 18:05:43 2014
From: jtaylor.debian at googlemail.com (Julian Taylor)
Date: Tue, 05 Aug 2014 00:05:43 +0200
Subject: [Numpy-discussion] last call for numpy 1.8.2 bugfixes
Message-ID: <53E003B7.3090100@googlemail.com>

hi,
as numpy 1.9 is going to be a relatively hard upgrade (the indexing changes
expose a couple of bugs in third-party packages, and there are a lot of
small incompatibilities), I will create a numpy 1.8.2 release
tomorrow with a couple of important or hard-to-work-around bugfixes.

The most important bugfix is fixing the wrong results that partition with
multiple selections could produce if the selections ended up in an equal
range, see https://github.com/numpy/numpy/issues/4836 (if the crash is
still unreproducible, help appreciated).

the rest of the fixes are small ones listed below.
If I have missed one or you consider one of the fixes too invasive for a
bugfix release please speak up now.
As the number of fixes is small I will skip a release candidate.

Make fftpack._raw_fft threadsafe
https://github.com/numpy/numpy/issues/4656

Prevent division by zero
https://github.com/numpy/numpy/issues/650

Fix lack of NULL check in array_richcompare
https://github.com/numpy/numpy/issues/4613

incorrect argument order to _copyto in np.nanmax, np.nanmin
https://github.com/numpy/numpy/issues/4628

Hold GIL for types with fields, fixes
https://github.com/numpy/numpy/issues/4642

svd ufunc typo
https://github.com/numpy/numpy/issues/4733

check alignment of strides for byteswap
https://github.com/numpy/numpy/issues/4774

add missing elementsize alignment check for simd reductions
https://github.com/numpy/numpy/issues/4853

ifort has issues with optimization flag /O2
https://github.com/numpy/numpy/issues/4602

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 819 bytes
Desc: OpenPGP digital signature
URL: 

From matthew.brett at gmail.com Mon Aug 4 18:09:39 2014
From: matthew.brett at gmail.com (Matthew Brett)
Date: Mon, 4 Aug 2014 15:09:39 -0700
Subject: [Numpy-discussion] last call for numpy 1.8.2 bugfixes
In-Reply-To: <53E003B7.3090100@googlemail.com>
References: <53E003B7.3090100@googlemail.com>
Message-ID: 

Hi,

On Mon, Aug 4, 2014 at 3:05 PM, Julian Taylor
wrote:
> hi,
> as numpy 1.9 is going to be a relatively hard upgrade (the indexing changes
> expose a couple of bugs in third-party packages, and there are a lot of
> small incompatibilities), I will create a numpy 1.8.2 release
> tomorrow with a couple of important or hard-to-work-around bugfixes.
>
> The most important bugfix is fixing the wrong results that partition with
> multiple selections could produce if the selections ended up in an equal
> range, see https://github.com/numpy/numpy/issues/4836 (if the crash is
> still unreproducible, help appreciated).
>
> the rest of the fixes are small ones listed below.
> If I have missed one or you consider one of the fixes too invasive for a
> bugfix release please speak up now.
> As the number of fixes is small I will skip a release candidate.
>
>
> Make fftpack._raw_fft threadsafe
> https://github.com/numpy/numpy/issues/4656
>
> Prevent division by zero
> https://github.com/numpy/numpy/issues/650
>
> Fix lack of NULL check in array_richcompare
> https://github.com/numpy/numpy/issues/4613
>
> incorrect argument order to _copyto in np.nanmax, np.nanmin
> https://github.com/numpy/numpy/issues/4628
>
> Hold GIL for types with fields, fixes
> https://github.com/numpy/numpy/issues/4642
>
> svd ufunc typo
> https://github.com/numpy/numpy/issues/4733
>
> check alignment of strides for byteswap
> https://github.com/numpy/numpy/issues/4774
>
> add missing elementsize alignment check for simd reductions
> https://github.com/numpy/numpy/issues/4853
>
> ifort has issues with optimization flag /O2
> https://github.com/numpy/numpy/issues/4602

Any chance of an RC to give us some time to test?
Cheers,

Matthew

From jtaylor.debian at googlemail.com Mon Aug 4 18:12:50 2014
From: jtaylor.debian at googlemail.com (Julian Taylor)
Date: Tue, 05 Aug 2014 00:12:50 +0200
Subject: [Numpy-discussion] last call for numpy 1.8.2 bugfixes
In-Reply-To: 
References: <53E003B7.3090100@googlemail.com>
Message-ID: <53E00562.5070602@googlemail.com>

On 05.08.2014 00:09, Matthew Brett wrote:
> Hi,
>
> On Mon, Aug 4, 2014 at 3:05 PM, Julian Taylor
> wrote:
>> hi,
>> as numpy 1.9 is going to be a relatively hard upgrade (the indexing changes
>> expose a couple of bugs in third-party packages, and there are a lot of
>> small incompatibilities), I will create a numpy 1.8.2 release
>> tomorrow with a couple of important or hard-to-work-around bugfixes.
>>...
>
> Any chance of an RC to give us some time to test?
>

I hope I have only selected fixes that are safe and do not require an RC.
sure we could do one, but if there are issues we can also just make a
quick 1.8.3 release follow-up.

the main backport PR is: https://github.com/numpy/numpy/pull/4949
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 819 bytes
Desc: OpenPGP digital signature
URL: 

From njs at pobox.com Mon Aug 4 18:25:04 2014
From: njs at pobox.com (Nathaniel Smith)
Date: Mon, 4 Aug 2014 23:25:04 +0100
Subject: [Numpy-discussion] last call for numpy 1.8.2 bugfixes
In-Reply-To: <53E00562.5070602@googlemail.com>
References: <53E003B7.3090100@googlemail.com> <53E00562.5070602@googlemail.com>
Message-ID: 

On Mon, Aug 4, 2014 at 11:12 PM, Julian Taylor
wrote:
> On 05.08.2014 00:09, Matthew Brett wrote:
>> Hi,
>>
>> On Mon, Aug 4, 2014 at 3:05 PM, Julian Taylor
>> wrote:
>>> hi,
>>> as numpy 1.9 is going to be a relatively hard upgrade (the indexing changes
>>> expose a couple of bugs in third-party packages, and there are a lot of
>>> small incompatibilities), I will create a numpy 1.8.2 release
>>> tomorrow with a couple of important or hard-to-work-around bugfixes.
>>>...
>>
>> Any chance of an RC to give us some time to test?
>>
>
> I hope I have only selected fixes that are safe and do not require an RC.
> sure we could do one, but if there are issues we can also just make a
> quick 1.8.3 release follow-up.
>
> the main backport PR is: https://github.com/numpy/numpy/pull/4949

It's probably better to just make an RC if it's not too much trouble...
it's always possible to misjudge what issues will arise; if there's a
real but non-catastrophic issue, then 1.8.2 will remain in use even if
1.8.3 is released afterwards, forcing downstream libraries to work
around the issues; and just in general it's good to have and follow
standard processes, because special cases lead to errors.

-n

--
Nathaniel J.
Smith
Postdoctoral researcher - Informatics - University of Edinburgh
http://vorpus.org

From matthew.brett at gmail.com Mon Aug 4 18:27:38 2014
From: matthew.brett at gmail.com (Matthew Brett)
Date: Mon, 4 Aug 2014 15:27:38 -0700
Subject: [Numpy-discussion] last call for numpy 1.8.2 bugfixes
In-Reply-To: 
References: <53E003B7.3090100@googlemail.com> <53E00562.5070602@googlemail.com>
Message-ID: 

On Mon, Aug 4, 2014 at 3:25 PM, Nathaniel Smith wrote:
> On Mon, Aug 4, 2014 at 11:12 PM, Julian Taylor
> wrote:
>> On 05.08.2014 00:09, Matthew Brett wrote:
>>> Hi,
>>>
>>> On Mon, Aug 4, 2014 at 3:05 PM, Julian Taylor
>>> wrote:
>>>> hi,
>>>> as numpy 1.9 is going to be a relatively hard upgrade (the indexing changes
>>>> expose a couple of bugs in third-party packages, and there are a lot of
>>>> small incompatibilities), I will create a numpy 1.8.2 release
>>>> tomorrow with a couple of important or hard-to-work-around bugfixes.
>>>>...
>>>
>>> Any chance of an RC to give us some time to test?
>>>
>>
>> I hope I have only selected fixes that are safe and do not require an RC.
>> sure we could do one, but if there are issues we can also just make a
>> quick 1.8.3 release follow-up.

A few days to test would be fine, I'd prefer an RC too,

Cheers,

Matthew

From jtaylor.debian at googlemail.com Mon Aug 4 18:46:14 2014
From: jtaylor.debian at googlemail.com (Julian Taylor)
Date: Tue, 05 Aug 2014 00:46:14 +0200
Subject: [Numpy-discussion] last call for numpy 1.8.2 bugfixes
In-Reply-To: 
References: <53E003B7.3090100@googlemail.com> <53E00562.5070602@googlemail.com>
Message-ID: <53E00D36.3080500@googlemail.com>

On 05.08.2014 00:27, Matthew Brett wrote:
> On Mon, Aug 4, 2014 at 3:25 PM, Nathaniel Smith wrote:
>> On Mon, Aug 4, 2014 at 11:12 PM, Julian Taylor
>> wrote:
>>> On 05.08.2014 00:09, Matthew Brett wrote:
>>>> Hi,
>>>>
>>>> On Mon, Aug 4, 2014 at 3:05 PM, Julian Taylor
>>>> wrote:
>>>>> hi,
>>>>> as numpy 1.9 is going to be a relatively hard upgrade (the indexing changes
>>>>> expose a couple of bugs in third-party packages, and there are a lot of
>>>>> small incompatibilities), I will create a numpy 1.8.2 release
>>>>> tomorrow with a couple of important or hard-to-work-around bugfixes.
>>>>> ...
>>>>
>>>> Any chance of an RC to give us some time to test?
>>>>
>>>
>>> I hope I have only selected fixes that are safe and do not require an RC.
>>> sure we could do one, but if there are issues we can also just make a
>>> quick 1.8.3 release follow-up.
>
> A few days to test would be fine, I'd prefer an RC too,
>

alright, I'll make an RC tomorrow and plan for release this weekend then.

From debruinjj at gmail.com Tue Aug 5 08:58:38 2014
From: debruinjj at gmail.com (Jurgens de Bruin)
Date: Tue, 5 Aug 2014 14:58:38 +0200
Subject: [Numpy-discussion] Array2 subset of array1
Message-ID: 

Hi,

I am new to numpy so any help would be greatly appreciated.

I have two arrays:

array1 = np.arange(1,100+1)
array2 = np.arange(1,50+1)

How can I calculate/determine if array2 is a subset of array1 (falls within
array1)?

Something like: array2 in array1 = TRUE for the case above.

Thanks
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From njs at pobox.com Tue Aug 5 09:15:18 2014
From: njs at pobox.com (Nathaniel Smith)
Date: Tue, 5 Aug 2014 14:15:18 +0100
Subject: [Numpy-discussion] Array2 subset of array1
In-Reply-To: References: Message-ID: 

On Tue, Aug 5, 2014 at 1:58 PM, Jurgens de Bruin wrote:
> Hi,
>
> I am new to numpy so any help would be greatly appreciated.
> I have two arrays:
>
> array1 = np.arange(1,100+1)
> array2 = np.arange(1,50+1)
>
> How can I calculate/determine if array2 is a subset of array1 (falls within
> array1)?
>
> Something like: array2 in array1 = TRUE for the case above.

Does this work?

np.in1d(array2, array1)

See:
http://docs.scipy.org/doc/numpy/reference/routines.set.html

(Note that while in1d does the best it can, set operations on arrays
will usually be slower than if you used a more appropriate data type
like 'set' or 'dict'.)

-n

--
Nathaniel J. Smith
Postdoctoral researcher - Informatics - University of Edinburgh
http://vorpus.org

From hoogendoorn.eelco at gmail.com Tue Aug 5 11:29:00 2014
From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn)
Date: Tue, 5 Aug 2014 17:29:00 +0200
Subject: [Numpy-discussion] Array2 subset of array1
In-Reply-To: 
References: 
Message-ID: 

np.all(np.in1d(array2, array1))

On Tue, Aug 5, 2014 at 2:58 PM, Jurgens de Bruin wrote:

> Hi,
>
> I am new to numpy so any help would be greatly appreciated.
>
> I have two arrays:
>
> array1 = np.arange(1,100+1)
> array2 = np.arange(1,50+1)
>
> How can I calculate/determine if array2 is a subset of array1 (falls
> within array1)?
>
> Something like: array2 in array1 = TRUE for the case above.
>
> Thanks
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From sebastian at sipsolutions.net Tue Aug 5 12:33:50 2014
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Tue, 05 Aug 2014 18:33:50 +0200
Subject: [Numpy-discussion] Array2 subset of array1
In-Reply-To: 
References: 
Message-ID: <1407256430.3568.6.camel@sebastian-t440>

On Di, 2014-08-05 at 14:58 +0200, Jurgens de Bruin wrote:
> Hi,
>
> I am new to numpy so any help would be greatly appreciated.
>
> I have two arrays:
>
> array1 = np.arange(1,100+1)
> array2 = np.arange(1,50+1)
>
> How can I calculate/determine if array2 is a subset of array1 (falls
> within array1)?
>
> Something like: array2 in array1 = TRUE for the case above.
>

Just to be clear. You are looking for the whole of array2 (as a
block/subarray of array1) as far as I understand. And there is no obvious
numpy way to do this. Depending on your array sizes, you could blow up the
first array from (N,) to (N-M+1,M) and then check if any row matches
completely. There may be better tricks available though, especially if
array1 is large.

- Sebastian

> Thanks
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

From hoogendoorn.eelco at gmail.com Tue Aug 5 12:59:41 2014
From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn)
Date: Tue, 5 Aug 2014 18:59:41 +0200
Subject: [Numpy-discussion] Array2 subset of array1
In-Reply-To: <1407256430.3568.6.camel@sebastian-t440>
References: <1407256430.3568.6.camel@sebastian-t440>
Message-ID: 

ah yes, that may indeed be what you want.

depending on your datatype, you could access the underlying raw data as a
string. b.tostring() in a.tostring() sort of works; but isn't entirely
safe, as you may have false positive matches which aren't aligned to your
datatype. using str.find in combination with dtype.itemsize could solve
that problem; though it isn't the most elegant solution, I'd say.
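
a rough sketch of that idea (untested, with a made-up helper name; assumes
both arrays are 1-D, contiguous, and share a dtype):

import numpy as np

def contains_block(a, b):
    # search for the raw bytes of b inside the raw bytes of a,
    # keeping only matches that are aligned to the itemsize
    sa, sb = a.tostring(), b.tostring()
    step = a.dtype.itemsize
    start = sa.find(sb)
    while start != -1:
        if start % step == 0:  # aligned match, not a false positive
            return True
        start = sa.find(sb, start + 1)
    return False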
also note that you need to check for identical datatypes and memory layout for this to guarantee correct results. On Tue, Aug 5, 2014 at 6:33 PM, Sebastian Berg wrote: > On Di, 2014-08-05 at 14:58 +0200, Jurgens de Bruin wrote: > > Hi, > > > > I am new to numpy so any help would be greatly appreciated. > > > > I have two arrays: > > > > array1 = np.arange(1,100+1) > > array2 = np.arange(1,50+1) > > > > How can I calculate/determine if array2 is a subset of array1 (falls > > within array 1) > > > > Something like : array2 in array1 = TRUE for the case above. > > > > Just to be clear. You are looking for the whole of array1 (as a > block/subarray) as far as I understand. And there is no obvious numpy > way to do this. Depending on your array sizes, you could blow up the > first array from (N,) to (N-M+1,M) and then check if any row matches > completely. There may be better tricks available though, especially if > array1 is large. > > - Sebastian > > > Thank > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jtaylor.debian at googlemail.com Tue Aug 5 15:45:02 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Tue, 05 Aug 2014 21:45:02 +0200 Subject: [Numpy-discussion] ANN: NumPy 1.8.2 release candidate Message-ID: <53E1343E.7020805@googlemail.com> Hello, I am pleased to announce the first release candidate for numpy 1.8.2, a pure bugfix release for the 1.8.x series. https://sourceforge.net/projects/numpy/files/NumPy/1.8.2rc1/ If no regressions show up the final release is planned this weekend. The upgrade is recommended for all users of the 1.8.x series. Following issues have been fixed: * gh-4836: partition produces wrong results for multiple selections in equal ranges * gh-4656: Make fftpack._raw_fft threadsafe * gh-4628: incorrect argument order to _copyto in in np.nanmax, np.nanmin * gh-4613: Fix lack of NULL check in array_richcompare * gh-4642: Hold GIL for converting dtypes types with fields * gh-4733: fix np.linalg.svd(b, compute_uv=False) * gh-4853: avoid unaligned simd load on reductions on i386 * gh-4774: avoid unaligned access for strided byteswap * gh-650: Prevent division by zero when creating arrays from some buffers * gh-4602: ifort has issues with optimization flag O2, use O1 Source tarballs, windows installers and release notes can be found at https://sourceforge.net/projects/numpy/files/NumPy/1.8.2rc1/ Cheers, Julian Taylor -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 819 bytes Desc: OpenPGP digital signature URL: From cgohlke at uci.edu Tue Aug 5 16:32:50 2014 From: cgohlke at uci.edu (Christoph Gohlke) Date: Tue, 05 Aug 2014 13:32:50 -0700 Subject: [Numpy-discussion] ANN: NumPy 1.8.2 release candidate In-Reply-To: <53E1343E.7020805@googlemail.com> References: <53E1343E.7020805@googlemail.com> Message-ID: <53E13F72.8070501@uci.edu> On 8/5/2014 12:45 PM, Julian Taylor wrote: > Hello, > > I am pleased to announce the first release candidate for numpy 1.8.2, a > pure bugfix release for the 1.8.x series. 
> https://sourceforge.net/projects/numpy/files/NumPy/1.8.2rc1/ > > If no regressions show up the final release is planned this weekend. > The upgrade is recommended for all users of the 1.8.x series. > > Following issues have been fixed: > * gh-4836: partition produces wrong results for multiple selections in > equal ranges > * gh-4656: Make fftpack._raw_fft threadsafe > * gh-4628: incorrect argument order to _copyto in in np.nanmax, np.nanmin > * gh-4613: Fix lack of NULL check in array_richcompare > * gh-4642: Hold GIL for converting dtypes types with fields > * gh-4733: fix np.linalg.svd(b, compute_uv=False) > * gh-4853: avoid unaligned simd load on reductions on i386 > * gh-4774: avoid unaligned access for strided byteswap > * gh-650: Prevent division by zero when creating arrays from some buffers > * gh-4602: ifort has issues with optimization flag O2, use O1 > > Source tarballs, windows installers and release notes can be found at > https://sourceforge.net/projects/numpy/files/NumPy/1.8.2rc1/ > > Cheers, > Julian Taylor > Hello, thank you. Looks good. All builds and tests pass on Windows (using msvc/MKL). Any chance gh-4722 can make it into the release? Fix seg fault converting empty string to object Christoph From jtaylor.debian at googlemail.com Tue Aug 5 16:57:17 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Tue, 05 Aug 2014 22:57:17 +0200 Subject: [Numpy-discussion] ANN: NumPy 1.8.2 release candidate In-Reply-To: <53E13F72.8070501@uci.edu> References: <53E1343E.7020805@googlemail.com> <53E13F72.8070501@uci.edu> Message-ID: <53E1452D.5090001@googlemail.com> On 05.08.2014 22:32, Christoph Gohlke wrote: > On 8/5/2014 12:45 PM, Julian Taylor wrote: >> Hello, >> >> I am pleased to announce the first release candidate for numpy 1.8.2, a >> pure bugfix release for the 1.8.x series. >> https://sourceforge.net/projects/numpy/files/NumPy/1.8.2rc1/ >> >> If no regressions show up the final release is planned this weekend. >> The upgrade is recommended for all users of the 1.8.x series. >> >> Following issues have been fixed: >> * gh-4836: partition produces wrong results for multiple selections in >> equal ranges >> * gh-4656: Make fftpack._raw_fft threadsafe >> * gh-4628: incorrect argument order to _copyto in in np.nanmax, np.nanmin >> * gh-4613: Fix lack of NULL check in array_richcompare >> * gh-4642: Hold GIL for converting dtypes types with fields >> * gh-4733: fix np.linalg.svd(b, compute_uv=False) >> * gh-4853: avoid unaligned simd load on reductions on i386 >> * gh-4774: avoid unaligned access for strided byteswap >> * gh-650: Prevent division by zero when creating arrays from some buffers >> * gh-4602: ifort has issues with optimization flag O2, use O1 >> >> Source tarballs, windows installers and release notes can be found at >> https://sourceforge.net/projects/numpy/files/NumPy/1.8.2rc1/ >> >> Cheers, >> Julian Taylor >> > > Hello, > > thank you. Looks good. All builds and tests pass on Windows (using > msvc/MKL). > > Any chance gh-4722 can make it into the release? > Fix seg fault converting empty string to object > > thanks, I missed that one, pretty simple, I'll add it to the final release. -------------- next part -------------- A non-text attachment was scrubbed... 
Name: signature.asc Type: application/pgp-signature Size: 819 bytes Desc: OpenPGP digital signature URL: From matthew.brett at gmail.com Tue Aug 5 17:27:14 2014 From: matthew.brett at gmail.com (Matthew Brett) Date: Tue, 5 Aug 2014 14:27:14 -0700 Subject: [Numpy-discussion] ANN: NumPy 1.8.2 release candidate In-Reply-To: <53E1452D.5090001@googlemail.com> References: <53E1343E.7020805@googlemail.com> <53E13F72.8070501@uci.edu> <53E1452D.5090001@googlemail.com> Message-ID: Hi, On Tue, Aug 5, 2014 at 1:57 PM, Julian Taylor wrote: > On 05.08.2014 22:32, Christoph Gohlke wrote: >> On 8/5/2014 12:45 PM, Julian Taylor wrote: >>> Hello, >>> >>> I am pleased to announce the first release candidate for numpy 1.8.2, a >>> pure bugfix release for the 1.8.x series. >>> https://sourceforge.net/projects/numpy/files/NumPy/1.8.2rc1/ >>> >>> If no regressions show up the final release is planned this weekend. >>> The upgrade is recommended for all users of the 1.8.x series. >>> >>> Following issues have been fixed: >>> * gh-4836: partition produces wrong results for multiple selections in >>> equal ranges >>> * gh-4656: Make fftpack._raw_fft threadsafe >>> * gh-4628: incorrect argument order to _copyto in in np.nanmax, np.nanmin >>> * gh-4613: Fix lack of NULL check in array_richcompare >>> * gh-4642: Hold GIL for converting dtypes types with fields >>> * gh-4733: fix np.linalg.svd(b, compute_uv=False) >>> * gh-4853: avoid unaligned simd load on reductions on i386 >>> * gh-4774: avoid unaligned access for strided byteswap >>> * gh-650: Prevent division by zero when creating arrays from some buffers >>> * gh-4602: ifort has issues with optimization flag O2, use O1 >>> >>> Source tarballs, windows installers and release notes can be found at >>> https://sourceforge.net/projects/numpy/files/NumPy/1.8.2rc1/ >>> >>> Cheers, >>> Julian Taylor >>> >> >> Hello, >> >> thank you. Looks good. All builds and tests pass on Windows (using >> msvc/MKL). >> >> Any chance gh-4722 can make it into the release? >> Fix seg fault converting empty string to object >> >> > > thanks, I missed that one, pretty simple, I'll add it to the final release. OSX wheels built and tested and uploaded OK : http://wheels.scikit-image.org https://travis-ci.org/matthew-brett/numpy-atlas-binaries/builds/31747958 Will test against the scipy stack later on today. Cheers, Matthew From derek at astro.physik.uni-goettingen.de Tue Aug 5 19:27:09 2014 From: derek at astro.physik.uni-goettingen.de (Derek Homeier) Date: Wed, 6 Aug 2014 01:27:09 +0200 Subject: [Numpy-discussion] ANN: NumPy 1.8.2 release candidate In-Reply-To: References: <53E1343E.7020805@googlemail.com> <53E13F72.8070501@uci.edu> <53E1452D.5090001@googlemail.com> Message-ID: <418B93F6-DEC6-4104-A690-D75DA6B10C7D@astro.physik.uni-goettingen.de> On 5 Aug 2014, at 11:27 pm, Matthew Brett wrote: > OSX wheels built and tested and uploaded OK : > > http://wheels.scikit-image.org > > https://travis-ci.org/matthew-brett/numpy-atlas-binaries/builds/31747958 > > Will test against the scipy stack later on today. Built and tested against the Fink Python installation under OSX. 
Seems to resolve one of a couple of f2py test errors appearing with 1.8.1
on Python 3.3 and 3.4:

======================================================================
ERROR: test_return_real.TestCReturnReal.test_all
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/sw/lib/python3.4/site-packages/nose/case.py", line 382, in setUp
    try_run(self.inst, ('setup', 'setUp'))
  File "/sw/lib/python3.4/site-packages/nose/util.py", line 470, in try_run
    return func()
  File "/sw/lib/python3.4/site-packages/numpy/f2py/tests/util.py", line 348, in setUp
    module_name=self.module_name)
  File "/sw/lib/python3.4/site-packages/numpy/f2py/tests/util.py", line 74, in wrapper
    memo[key] = func(*a, **kw)
  File "/sw/lib/python3.4/site-packages/numpy/f2py/tests/util.py", line 163, in build_code
    module_name=module_name)
  File "/sw/lib/python3.4/site-packages/numpy/f2py/tests/util.py", line 74, in wrapper
    memo[key] = func(*a, **kw)
  File "/sw/lib/python3.4/site-packages/numpy/f2py/tests/util.py", line 144, in build_module
    __import__(module_name)
ImportError: No module named 'c_ext_return_real'

is gone on 3.4 now but still present on 3.3. Two errors of this kind
(with different numbers) remain:

======================================================================
ERROR: test_return_real.TestF90ReturnReal.test_all
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/sw/lib/python3.4/site-packages/nose/case.py", line 382, in setUp
    try_run(self.inst, ('setup', 'setUp'))
  File "/sw/lib/python3.4/site-packages/nose/util.py", line 470, in try_run
    return func()
  File "/sw/lib/python3.4/site-packages/numpy/f2py/tests/util.py", line 348, in setUp
    module_name=self.module_name)
  File "/sw/lib/python3.4/site-packages/numpy/f2py/tests/util.py", line 74, in wrapper
    memo[key] = func(*a, **kw)
  File "/sw/lib/python3.4/site-packages/numpy/f2py/tests/util.py", line 163, in build_code
    module_name=module_name)
  File "/sw/lib/python3.4/site-packages/numpy/f2py/tests/util.py", line 74, in wrapper
    memo[key] = func(*a, **kw)
  File "/sw/lib/python3.4/site-packages/numpy/f2py/tests/util.py", line 144, in build_module
    __import__(module_name)
ImportError: No module named '_test_ext_module_5415'

NumPy version 1.8.2rc1
NumPy is installed in /sw/lib/python3.4/site-packages/numpy
Python version 3.4.1 (default, Aug 3 2014, 21:02:44) [GCC 4.2.1 Compatible Apple LLVM 5.1 (clang-503.0.40)]
nose version 1.3.3

Cheers,
Derek

From matthew.brett at gmail.com Tue Aug 5 20:46:21 2014
From: matthew.brett at gmail.com (Matthew Brett)
Date: Tue, 5 Aug 2014 17:46:21 -0700
Subject: [Numpy-discussion] ANN: NumPy 1.8.2 release candidate
In-Reply-To: 
References: <53E1343E.7020805@googlemail.com> <53E13F72.8070501@uci.edu>
 <53E1452D.5090001@googlemail.com>
Message-ID: 

Hi,

On Tue, Aug 5, 2014 at 2:27 PM, Matthew Brett wrote:
> Hi,
>
> On Tue, Aug 5, 2014 at 1:57 PM, Julian Taylor
> wrote:
>> On 05.08.2014 22:32, Christoph Gohlke wrote:
>>> On 8/5/2014 12:45 PM, Julian Taylor wrote:
>>>> Hello,
>>>>
>>>> I am pleased to announce the first release candidate for numpy 1.8.2, a
>>>> pure bugfix release for the 1.8.x series.
>>>> https://sourceforge.net/projects/numpy/files/NumPy/1.8.2rc1/
>>>>
>>>> If no regressions show up the final release is planned this weekend.
>>>> The upgrade is recommended for all users of the 1.8.x series.
>>>> >>>> Following issues have been fixed: >>>> * gh-4836: partition produces wrong results for multiple selections in >>>> equal ranges >>>> * gh-4656: Make fftpack._raw_fft threadsafe >>>> * gh-4628: incorrect argument order to _copyto in in np.nanmax, np.nanmin >>>> * gh-4613: Fix lack of NULL check in array_richcompare >>>> * gh-4642: Hold GIL for converting dtypes types with fields >>>> * gh-4733: fix np.linalg.svd(b, compute_uv=False) >>>> * gh-4853: avoid unaligned simd load on reductions on i386 >>>> * gh-4774: avoid unaligned access for strided byteswap >>>> * gh-650: Prevent division by zero when creating arrays from some buffers >>>> * gh-4602: ifort has issues with optimization flag O2, use O1 >>>> >>>> Source tarballs, windows installers and release notes can be found at >>>> https://sourceforge.net/projects/numpy/files/NumPy/1.8.2rc1/ >>>> >>>> Cheers, >>>> Julian Taylor >>>> >>> >>> Hello, >>> >>> thank you. Looks good. All builds and tests pass on Windows (using >>> msvc/MKL). >>> >>> Any chance gh-4722 can make it into the release? >>> Fix seg fault converting empty string to object >>> >>> >> >> thanks, I missed that one, pretty simple, I'll add it to the final release. > > OSX wheels built and tested and uploaded OK : > > http://wheels.scikit-image.org > > https://travis-ci.org/matthew-brett/numpy-atlas-binaries/builds/31747958 OSX wheel tested OK against current scipy stack for system Python, python.org Python, homebrew, macports: https://travis-ci.org/matthew-brett/scipy-stack-osx-testing/builds/31756325 Cheers, Matthew From charlesr.harris at gmail.com Tue Aug 5 21:19:00 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Tue, 5 Aug 2014 19:19:00 -0600 Subject: [Numpy-discussion] Preliminary thoughts on implementing __matmul__ Message-ID: Hi All, I've been looking to implement the "@" operator from Python 3.5. Looking at the current implementation of the dot function, it only uses a vector inner product, which is either that defined in arraytypes.c.src or a version using cblas defined in _dotblas for the float, cfloat, double, cdouble types. I note that the versions defined in arraytypes.c.src include all the numeric types plus boolean, datetime, timedelta, and object. I'm not clear why datetime and timedelta should have dot products, except perhaps for scalar multiplication. The boolean version has the advantage that it can short circuit. I also note that all the operations proposed for "@" can easily be done with einsum except for objects. So I'm wondering if one easy way to implement the functions is to extend einsum to work with objects and make it use blas when available. Another thing that may be worth looking into would be some way to multiply by the complex conjugate, as that is easy to implement at the low level. I'd welcome any thoughts as to how that might be done. Anyway, I'm just looking for a discussion and ideas here. Any input is welcome. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Wed Aug 6 08:31:36 2014 From: njs at pobox.com (Nathaniel Smith) Date: Wed, 6 Aug 2014 13:31:36 +0100 Subject: [Numpy-discussion] Preliminary thoughts on implementing __matmul__ In-Reply-To: References: Message-ID: On Wed, Aug 6, 2014 at 2:19 AM, Charles R Harris wrote: > Hi All, > > I've been looking to implement the "@" operator from Python 3.5. 
Looking at > the current implementation of the dot function, it only uses a vector inner > product, which is either that defined in arraytypes.c.src or a version using > cblas defined in _dotblas for the float, cfloat, double, cdouble types. I > note that the versions defined in arraytypes.c.src include all the numeric > types plus boolean, datetime, timedelta, and object. I'm not clear why > datetime and timedelta should have dot products, except perhaps for scalar > multiplication. I guess numeric @ timedelta is at least well-defined, but dot products on datetime make no sense -- datetimes do not support +! One thing we should keep in mind as well is how to allow user-defined dtypes to provide efficient matmul implementations. > The boolean version has the advantage that it can short > circuit. I also note that all the operations proposed for "@" can easily be > done with einsum except for objects. So I'm wondering if one easy way to > implement the functions is to extend einsum to work with objects and make it > use blas when available. Those do seem like nice features regardless of what we do for @ :-). I think the other obvious strategy to consider, is defining a 'dot' gufunc, with semantics identical to @. (This would be useful for backcompat as well: adding/dropping compatibility with older python versions would be as simple as mechanically replacing a @ b with newdot(a, b) or vice-versa.) This would require one new feature in the gufunc machinery: support for "optional core axes", to get the right semantics for 1d arrays. OTOH this would also be useful in general because there are other gufuncs that want to handle 1d arrays the same way @ does -- e.g., 'solve' variants. This would automatically solve both the user-defined dtype problem (ufuncs already allow for new loops to be registered) and the third-party array type problem (via __numpy_ufunc__). > Another thing that may be worth looking into would be some way to multiply > by the complex conjugate, as that is easy to implement at the low level. I'd > welcome any thoughts as to how that might be done. One idea that's come up before was to define a complex-conjugate dtype, which would allow .H to be a view on the original array. A simpler solution would be to define a specialized conjdot gufunc. -n -- Nathaniel J. Smith Postdoctoral researcher - Informatics - University of Edinburgh http://vorpus.org From jaime.frio at gmail.com Wed Aug 6 10:32:56 2014 From: jaime.frio at gmail.com (=?UTF-8?Q?Jaime_Fern=C3=A1ndez_del_R=C3=ADo?=) Date: Wed, 6 Aug 2014 07:32:56 -0700 Subject: [Numpy-discussion] Preliminary thoughts on implementing __matmul__ In-Reply-To: References: Message-ID: On Wed, Aug 6, 2014 at 5:31 AM, Nathaniel Smith wrote: > I think the other obvious strategy to consider, is defining a 'dot' > gufunc, with semantics identical to @. (This would be useful for > backcompat as well: adding/dropping compatibility with older python > versions would be as simple as mechanically replacing a @ b with > newdot(a, b) or vice-versa.) This would require one new feature in the > gufunc machinery: support for "optional core axes", to get the right > semantics for 1d arrays. Can you elaborate on what those optional core axes would look like? If I am understanding you correctly, this is what now is solved by having more than one gufunc defined, and choosing which one to use based on the input's shapes in a thin Python wrapper. There are several examples in the linalg module you are certainly well aware of. 
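To make that concrete, a thin dispatch wrapper of the kind linalg uses
might look roughly like this -- a sketch only, with einsum one-liners
standing in for the four compiled gufuncs (none of these names exist in
numpy):

import numpy as np

# Stand-ins for four fixed-signature gufuncs; einsum reproduces the
# core semantics and broadcasts any leading (stacked) dimensions.
def matmat(a, b): return np.einsum('...ij,...jk->...ik', a, b)
def matvec(a, b): return np.einsum('...ij,...j->...i', a, b)
def vecmat(a, b): return np.einsum('...j,...jk->...k', a, b)
def vecvec(a, b): return np.einsum('...j,...j->...', a, b)

def newdot(a, b):
    # Dispatch on core dimensionality, '@'-style: a 1-d operand is
    # treated as a vector, everything else as a (stack of) matrices.
    a, b = np.asarray(a), np.asarray(b)
    if a.ndim == 0 or b.ndim == 0:
        raise TypeError("scalar operands are not allowed")
    if a.ndim == 1 and b.ndim == 1:
        return vecvec(a, b)
    if a.ndim == 1:
        return vecmat(a, b)
    if b.ndim == 1:
        return matvec(a, b)
    return matmat(a, b)

Note that matmat as written broadcasts the leading (stacked)
dimensions, which is the '@' behaviour; np.dot of two >2d arrays
instead pairs every combination of stacked axes, ufunc.outer-style.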
Say you could define the matmul signature as "(i)j,j(k)->(ik)", with dimensions in parenthesis being "optional." Say we modified the gufunc machinery to detect which optional core axes are present and which not. It seems to me that you would then still need to write 4 traditional gufuncs (ij,jk->ik, j,jk->k, ij,j->i, j,j->) and dispatch to one of them. I haven't thought it through, but are there really a set of universal dispatch rules that will apply to any optional core axes problem? Would we not be losing flexibility in doing so? When I looked into gufuncs several months ago, what I missed was a way of defining signatures like n,m->n*(n-1), which would come in very handy if computing all pairwise distances. You can work around this by making the signature n,m->p and always calling the gufunc from a Python wrapper that passes in an out parameter of the right shape. But if someone gets a hold of the gufunc handle and calls it directly without an out parameter, the p defaults to 1 and you are probably in for a big crash. So it would be nice if you could provide a pointer to a function to produce the output shape based on the inputs'. On my wish list for gufunc signatures there is also frozen dimensions, e.g. a gufunc to compute greater circle distances on a sphere can be defined as m,m->, but m has to be 2, and since you don't typically want to be raising errors in the kernel, a Python wrapper is once more necessary. And again an unwrapped call to the gufunc is potentially catastrophic. Sorry for hijacking the thread, but I wouldn't mind spending some time working on expanding this functionality to include the optional axes and my wish-list, if the whole thing makes sense. Jaime -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Wed Aug 6 11:32:42 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 6 Aug 2014 09:32:42 -0600 Subject: [Numpy-discussion] Preliminary thoughts on implementing __matmul__ In-Reply-To: References: Message-ID: On Wed, Aug 6, 2014 at 8:32 AM, Jaime Fern?ndez del R?o < jaime.frio at gmail.com> wrote: > On Wed, Aug 6, 2014 at 5:31 AM, Nathaniel Smith wrote: > >> I think the other obvious strategy to consider, is defining a 'dot' >> gufunc, with semantics identical to @. (This would be useful for >> backcompat as well: adding/dropping compatibility with older python >> versions would be as simple as mechanically replacing a @ b with >> newdot(a, b) or vice-versa.) This would require one new feature in the >> gufunc machinery: support for "optional core axes", to get the right >> semantics for 1d arrays. > > > Can you elaborate on what those optional core axes would look like? If I > am understanding you correctly, this is what now is solved by having more > than one gufunc defined, and choosing which one to use based on the input's > shapes in a thin Python wrapper. There are several examples in the linalg > module you are certainly well aware of. > > Say you could define the matmul signature as "(i)j,j(k)->(ik)", with > dimensions in parenthesis being "optional." Say we modified the gufunc > machinery to detect which optional core axes are present and which not. It > seems to me that you would then still need to write 4 traditional gufuncs > (ij,jk->ik, j,jk->k, ij,j->i, j,j->) and dispatch to one of them. I haven't > thought it through, but are there really a set of universal dispatch rules > that will apply to any optional core axes problem? 
Would we not be losing > flexibility in doing so? > > When I looked into gufuncs several months ago, what I missed was a way of > defining signatures like n,m->n*(n-1), which would come in very handy if > computing all pairwise distances. You can work around this by making the > signature n,m->p and always calling the gufunc from a Python wrapper that > passes in an out parameter of the right shape. But if someone gets a hold > of the gufunc handle and calls it directly without an out parameter, the p > defaults to 1 and you are probably in for a big crash. So it would be nice > if you could provide a pointer to a function to produce the output shape > based on the inputs'. > > On my wish list for gufunc signatures there is also frozen dimensions, > e.g. a gufunc to compute greater circle distances on a sphere can be > defined as m,m->, but m has to be 2, and since you don't typically want to > be raising errors in the kernel, a Python wrapper is once more necessary. > And again an unwrapped call to the gufunc is potentially catastrophic. > > Sorry for hijacking the thread, but I wouldn't mind spending some time > working on expanding this functionality to include the optional axes and my > wish-list, if the whole thing makes sense. > Should also mention that we don't have the ability to operate on stacked vectors because they can't be identified by dimension info. One workaround is to add dummy dimensions where needed, another is to add two flags, row and col, and set them appropriately. Two flags are needed for backward compatibility, i.e., both false is a traditional array. Note that adding dummy dimensions can lead to '[[...]]' scalars. Working with stacked vectors isn't part of the '@' PEP. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Wed Aug 6 12:14:38 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 6 Aug 2014 10:14:38 -0600 Subject: [Numpy-discussion] Preliminary thoughts on implementing __matmul__ In-Reply-To: References: Message-ID: On Wed, Aug 6, 2014 at 9:32 AM, Charles R Harris wrote: > > > > On Wed, Aug 6, 2014 at 8:32 AM, Jaime Fern?ndez del R?o < > jaime.frio at gmail.com> wrote: > >> On Wed, Aug 6, 2014 at 5:31 AM, Nathaniel Smith wrote: >> >>> I think the other obvious strategy to consider, is defining a 'dot' >>> gufunc, with semantics identical to @. (This would be useful for >>> backcompat as well: adding/dropping compatibility with older python >>> versions would be as simple as mechanically replacing a @ b with >>> newdot(a, b) or vice-versa.) This would require one new feature in the >>> gufunc machinery: support for "optional core axes", to get the right >>> semantics for 1d arrays. >> >> >> Can you elaborate on what those optional core axes would look like? If I >> am understanding you correctly, this is what now is solved by having more >> than one gufunc defined, and choosing which one to use based on the input's >> shapes in a thin Python wrapper. There are several examples in the linalg >> module you are certainly well aware of. >> >> Say you could define the matmul signature as "(i)j,j(k)->(ik)", with >> dimensions in parenthesis being "optional." Say we modified the gufunc >> machinery to detect which optional core axes are present and which not. It >> seems to me that you would then still need to write 4 traditional gufuncs >> (ij,jk->ik, j,jk->k, ij,j->i, j,j->) and dispatch to one of them. 
I haven't
>> thought it through, but are there really a set of universal dispatch rules
>> that will apply to any optional core axes problem? Would we not be losing
>> flexibility in doing so?
>>
>> When I looked into gufuncs several months ago, what I missed was a way of
>> defining signatures like n,m->n*(n-1), which would come in very handy if
>> computing all pairwise distances. You can work around this by making the
>> signature n,m->p and always calling the gufunc from a Python wrapper that
>> passes in an out parameter of the right shape. But if someone gets a hold
>> of the gufunc handle and calls it directly without an out parameter, the p
>> defaults to 1 and you are probably in for a big crash. So it would be nice
>> if you could provide a pointer to a function to produce the output shape
>> based on the inputs'.
>>
>> On my wish list for gufunc signatures there is also frozen dimensions,
>> e.g. a gufunc to compute great circle distances on a sphere can be
>> defined as m,m->, but m has to be 2, and since you don't typically want to
>> be raising errors in the kernel, a Python wrapper is once more necessary.
>> And again an unwrapped call to the gufunc is potentially catastrophic.
>>
>> Sorry for hijacking the thread, but I wouldn't mind spending some time
>> working on expanding this functionality to include the optional axes and my
>> wish-list, if the whole thing makes sense.
>>
>
> Should also mention that we don't have the ability to operate on stacked
> vectors because they can't be identified by dimension info. One workaround
> is to add dummy dimensions where needed, another is to add two flags, row
> and col, and set them appropriately. Two flags are needed for backward
> compatibility, i.e., both false is a traditional array. Note that adding
> dummy dimensions can lead to '[[...]]' scalars. Working with stacked
> vectors isn't part of the '@' PEP.
>

Transpose doesn't work with stacked arrays, so it would also be useful to
have a function for that.
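In the meantime the usual idiom is to swap the last two axes, which
leaves the stacked dimensions alone; a quick sketch:

import numpy as np

a = np.arange(24).reshape(2, 3, 4)   # a stack of two 3x4 matrices
at = np.swapaxes(a, -1, -2)          # per-matrix transpose, shape (2, 4, 3)

Plain a.T would instead reverse all three axes to (4, 3, 2).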
Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From charlesr.harris at gmail.com Wed Aug 6 16:42:34 2014
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Wed, 6 Aug 2014 14:42:34 -0600
Subject: [Numpy-discussion] How to give feedback to github
Message-ID: 

Does anyone know how to complain about features to github? The new author
selection list for PRs is practically useless as 1) it only lists authors
belonging to the project and 2) it doesn't list the number of PRs for each
author. The old list was far more useful.

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From chris.barker at noaa.gov Wed Aug 6 17:05:19 2014
From: chris.barker at noaa.gov (Chris Barker)
Date: Wed, 6 Aug 2014 14:05:19 -0700
Subject: [Numpy-discussion] Preliminary thoughts on implementing __matmul__
In-Reply-To: 
References: 
Message-ID: 

On Wed, Aug 6, 2014 at 8:32 AM, Charles R Harris wrote:

> Should also mention that we don't have the ability to operate on stacked
> vectors because they can't be identified by dimension info. One workaround
> is to add dummy dimensions where needed, another is to add two flags, row
> and col, and set them appropriately.
>

I've thought for ages that if you want to naturally do linear algebra, you
need to capture the concept of a row and column vector as distinct from
each other and from (1,n) and (n,1) shape arrays. So:

+1

-Chris

-- 

Christopher Barker, Ph.D.
Oceanographer

Emergency Response Division
NOAA/NOS/OR&R            (206) 526-6959   voice
7600 Sand Point Way NE   (206) 526-6329   fax
Seattle, WA  98115       (206) 526-6317   main reception

Chris.Barker at noaa.gov
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From fperez.net at gmail.com Wed Aug 6 18:04:03 2014
From: fperez.net at gmail.com (Fernando Perez)
Date: Wed, 6 Aug 2014 15:04:03 -0700
Subject: [Numpy-discussion] How to give feedback to github
In-Reply-To: 
References: 
Message-ID: 

The form at:

https://github.com/contact

or simply email support at github.com are the options. I've used it a couple
of times and they've been responsive.

Cheers

f

On Wed, Aug 6, 2014 at 1:42 PM, Charles R Harris wrote:

> Does anyone know how to complain about features to github? The new author
> selection list for PRs is practically useless as 1) it only lists authors
> belonging to the project and 2) it doesn't list the number of PRs for each
> author. The old list was far more useful.
>
> Chuck
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

-- 
Fernando Perez (@fperez_org; http://fperez.org)
fperez.net-at-gmail: mailing lists only (I ignore this when swamped!)
fernando.perez-at-berkeley: contact me here for any direct mail
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From alan.isaac at gmail.com Wed Aug 6 18:45:30 2014
From: alan.isaac at gmail.com (Alan G Isaac)
Date: Wed, 06 Aug 2014 18:45:30 -0400
Subject: [Numpy-discussion] Preliminary thoughts on implementing __matmul__
In-Reply-To: 
References: 
Message-ID: <53E2B00A.4000801@gmail.com>

> On Wed, Aug 6, 2014 at 8:32 AM, Charles R Harris wrote:
>> Should also mention that we don't have the ability to
>> operate on stacked vectors because they can't be
>> identified by dimension info. One
>> workaround is to add dummy dimensions where needed,
>> another is to add two flags, row and col, and set them
>> appropriately.

On 8/6/2014 5:05 PM, Chris Barker wrote:
> I've thought for ages that if you want to naturally do
> linear algebra, you need to capture the concept of a row
> and column vector as distinct
> from each other and from (1,n) and (n,1) shape arrays. So:

It seems to me that although it may sound trivial to "add two flags",
this is a fundamental conceptual change, and I hope it will not go
forward without extensive discussion.

To aid users like me who might want to think about this, can you please
suggest for exploration a language that has adopted this approach?
(Ideally, where the decision is considered a good one.)

Thank you,
Alan Isaac

From njs at pobox.com Wed Aug 6 18:57:57 2014
From: njs at pobox.com (Nathaniel Smith)
Date: Wed, 6 Aug 2014 23:57:57 +0100
Subject: [Numpy-discussion] Preliminary thoughts on implementing __matmul__
In-Reply-To: 
References: 
Message-ID: 

On Wed, Aug 6, 2014 at 4:32 PM, Charles R Harris wrote:
> Should also mention that we don't have the ability to operate on stacked
> vectors because they can't be identified by dimension info. One workaround
> is to add dummy dimensions where needed, another is to add two flags, row
> and col, and set them appropriately. Two flags are needed for backward
> compatibility, i.e., both false is a traditional array.

It's possible I could be convinced to like this, but it would take a
substantial amount of convincing :-). It seems like a pretty big
violation of orthogonality/"one obvious way"/etc.
to have two totally different ways of representing row/column vectors. -- Nathaniel J. Smith Postdoctoral researcher - Informatics - University of Edinburgh http://vorpus.org From charlesr.harris at gmail.com Wed Aug 6 19:24:59 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 6 Aug 2014 17:24:59 -0600 Subject: [Numpy-discussion] Preliminary thoughts on implementing __matmul__ In-Reply-To: References: Message-ID: On Wed, Aug 6, 2014 at 4:57 PM, Nathaniel Smith wrote: > On Wed, Aug 6, 2014 at 4:32 PM, Charles R Harris > wrote: > > Should also mention that we don't have the ability to operate on stacked > > vectors because they can't be identified by dimension info. One > workaround > > is to add dummy dimensions where needed, another is to add two flags, row > > and col, and set them appropriately. Two flags are needed for backward > > compatibility, i.e., both false is a traditional array. > > It's possible I could be convinced to like this, but it would take a > substantial amount of convincing :-). It seems like a pretty big > violation of orthogonality/"one obvious way"/etc. to have two totally > different ways of representing row/column vectors. > > The '@' operator supports matrix stacks, so it would seem we also need to support vector stacks. The new addition would only be effective with the '@' operator. The main problem I see with flags is that adding them would require an extensive audit of the C code to make sure they were preserved. Another option, already supported to a large extent, is to have row and col classes inheriting from ndarray that add nothing, except for a possible new transpose type function/method. I did mock up such a class just for fun, and also added a 'dyad' function. If we really don't care to support stacked vectors we can get by without adding anything. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Wed Aug 6 19:27:01 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 6 Aug 2014 17:27:01 -0600 Subject: [Numpy-discussion] Preliminary thoughts on implementing __matmul__ In-Reply-To: References: Message-ID: On Wed, Aug 6, 2014 at 5:24 PM, Charles R Harris wrote: > > > > On Wed, Aug 6, 2014 at 4:57 PM, Nathaniel Smith wrote: > >> On Wed, Aug 6, 2014 at 4:32 PM, Charles R Harris >> wrote: >> > Should also mention that we don't have the ability to operate on stacked >> > vectors because they can't be identified by dimension info. One >> workaround >> > is to add dummy dimensions where needed, another is to add two flags, >> row >> > and col, and set them appropriately. Two flags are needed for backward >> > compatibility, i.e., both false is a traditional array. >> >> It's possible I could be convinced to like this, but it would take a >> substantial amount of convincing :-). It seems like a pretty big >> violation of orthogonality/"one obvious way"/etc. to have two totally >> different ways of representing row/column vectors. >> >> > The '@' operator supports matrix stacks, so it would seem we also need to > support vector stacks. The new addition would only be effective with the > '@' operator. The main problem I see with flags is that adding them would > require an extensive audit of the C code to make sure they were preserved. > Another option, already supported to a large extent, is to have row and col > classes inheriting from ndarray that add nothing, except for a possible new > transpose type function/method. 
I did mock up such a class just for fun, > and also added a 'dyad' function. If we really don't care to support > stacked vectors we can get by without adding anything. > > Note that the '@' PEP is not compatible with current 'dot' for arrays with more than two dimensions and for scalars. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Wed Aug 6 19:33:13 2014 From: njs at pobox.com (Nathaniel Smith) Date: Thu, 7 Aug 2014 00:33:13 +0100 Subject: [Numpy-discussion] Preliminary thoughts on implementing __matmul__ In-Reply-To: References: Message-ID: On Thu, Aug 7, 2014 at 12:24 AM, Charles R Harris wrote: > > On Wed, Aug 6, 2014 at 4:57 PM, Nathaniel Smith wrote: >> >> On Wed, Aug 6, 2014 at 4:32 PM, Charles R Harris >> wrote: >> > Should also mention that we don't have the ability to operate on stacked >> > vectors because they can't be identified by dimension info. One >> > workaround >> > is to add dummy dimensions where needed, another is to add two flags, >> > row >> > and col, and set them appropriately. Two flags are needed for backward >> > compatibility, i.e., both false is a traditional array. >> >> It's possible I could be convinced to like this, but it would take a >> substantial amount of convincing :-). It seems like a pretty big >> violation of orthogonality/"one obvious way"/etc. to have two totally >> different ways of representing row/column vectors. >> > > The '@' operator supports matrix stacks, so it would seem we also need to > support vector stacks. The new addition would only be effective with the '@' > operator. The main problem I see with flags is that adding them would > require an extensive audit of the C code to make sure they were preserved. > Another option, already supported to a large extent, is to have row and col > classes inheriting from ndarray that add nothing, except for a possible new > transpose type function/method. I did mock up such a class just for fun, and > also added a 'dyad' function. If we really don't care to support stacked > vectors we can get by without adding anything. It's possible you could convince me that this is a good idea, but I'm starting at like -0.95 :-). Wouldn't it be vastly simpler to just have np.linalg.matvec, matmat, vecvec or something (each of which are single-liners in terms of @), rather than deal with two different ways of representing row/column vectors everywhere? -n -- Nathaniel J. Smith Postdoctoral researcher - Informatics - University of Edinburgh http://vorpus.org From charlesr.harris at gmail.com Wed Aug 6 19:41:46 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 6 Aug 2014 17:41:46 -0600 Subject: [Numpy-discussion] Preliminary thoughts on implementing __matmul__ In-Reply-To: References: Message-ID: On Wed, Aug 6, 2014 at 5:33 PM, Nathaniel Smith wrote: > On Thu, Aug 7, 2014 at 12:24 AM, Charles R Harris > wrote: > > > > On Wed, Aug 6, 2014 at 4:57 PM, Nathaniel Smith wrote: > >> > >> On Wed, Aug 6, 2014 at 4:32 PM, Charles R Harris > >> wrote: > >> > Should also mention that we don't have the ability to operate on > stacked > >> > vectors because they can't be identified by dimension info. One > >> > workaround > >> > is to add dummy dimensions where needed, another is to add two flags, > >> > row > >> > and col, and set them appropriately. Two flags are needed for backward > >> > compatibility, i.e., both false is a traditional array. 
> >> > >> It's possible I could be convinced to like this, but it would take a > >> substantial amount of convincing :-). It seems like a pretty big > >> violation of orthogonality/"one obvious way"/etc. to have two totally > >> different ways of representing row/column vectors. > >> > > > > The '@' operator supports matrix stacks, so it would seem we also need to > > support vector stacks. The new addition would only be effective with the > '@' > > operator. The main problem I see with flags is that adding them would > > require an extensive audit of the C code to make sure they were > preserved. > > Another option, already supported to a large extent, is to have row and > col > > classes inheriting from ndarray that add nothing, except for a possible > new > > transpose type function/method. I did mock up such a class just for fun, > and > > also added a 'dyad' function. If we really don't care to support stacked > > vectors we can get by without adding anything. > > It's possible you could convince me that this is a good idea, but I'm > starting at like -0.95 :-). Wouldn't it be vastly simpler to just have > np.linalg.matvec, matmat, vecvec or something (each of which are > single-liners in terms of @), rather than deal with two different ways > of representing row/column vectors everywhere? > > Sure, but matvec and vecvec would not be supported by '@' except when vec was 1d because there is no way to distinguish a stack of vectors from a matrix or a stack of matrices. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Wed Aug 6 19:51:18 2014 From: njs at pobox.com (Nathaniel Smith) Date: Thu, 7 Aug 2014 00:51:18 +0100 Subject: [Numpy-discussion] Preliminary thoughts on implementing __matmul__ In-Reply-To: References: Message-ID: On 7 Aug 2014 00:41, "Charles R Harris" wrote: > > On Wed, Aug 6, 2014 at 5:33 PM, Nathaniel Smith wrote: >> >> On Thu, Aug 7, 2014 at 12:24 AM, Charles R Harris >> wrote: >> > >> > On Wed, Aug 6, 2014 at 4:57 PM, Nathaniel Smith wrote: >> >> >> >> On Wed, Aug 6, 2014 at 4:32 PM, Charles R Harris >> >> wrote: >> >> > Should also mention that we don't have the ability to operate on stacked >> >> > vectors because they can't be identified by dimension info. One >> >> > workaround >> >> > is to add dummy dimensions where needed, another is to add two flags, >> >> > row >> >> > and col, and set them appropriately. Two flags are needed for backward >> >> > compatibility, i.e., both false is a traditional array. >> >> >> >> It's possible I could be convinced to like this, but it would take a >> >> substantial amount of convincing :-). It seems like a pretty big >> >> violation of orthogonality/"one obvious way"/etc. to have two totally >> >> different ways of representing row/column vectors. >> >> >> > >> > The '@' operator supports matrix stacks, so it would seem we also need to >> > support vector stacks. The new addition would only be effective with the '@' >> > operator. The main problem I see with flags is that adding them would >> > require an extensive audit of the C code to make sure they were preserved. >> > Another option, already supported to a large extent, is to have row and col >> > classes inheriting from ndarray that add nothing, except for a possible new >> > transpose type function/method. I did mock up such a class just for fun, and >> > also added a 'dyad' function. If we really don't care to support stacked >> > vectors we can get by without adding anything. 
>> >> It's possible you could convince me that this is a good idea, but I'm >> starting at like -0.95 :-). Wouldn't it be vastly simpler to just have >> np.linalg.matvec, matmat, vecvec or something (each of which are >> single-liners in terms of @), rather than deal with two different ways >> of representing row/column vectors everywhere? >> > > Sure, but matvec and vecvec would not be supported by '@' except when vec was 1d because there is no way to distinguish a stack of vectors from a matrix or a stack of matrices. Yes. But @ can never be magic - either people will have to write something extra to flip these flags on their array objects, or they'll have to write something extra to describe which operation they want. @ was never intended to cover every case, just the simple-but-super-common ones that dot covers, plus a few more (simple broadcasting). We have np.add even though + exists too... -n -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Wed Aug 6 20:00:52 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Wed, 6 Aug 2014 18:00:52 -0600 Subject: [Numpy-discussion] Preliminary thoughts on implementing __matmul__ In-Reply-To: References: Message-ID: On Wed, Aug 6, 2014 at 5:51 PM, Nathaniel Smith wrote: > On 7 Aug 2014 00:41, "Charles R Harris" wrote: > > > > On Wed, Aug 6, 2014 at 5:33 PM, Nathaniel Smith wrote: > >> > >> On Thu, Aug 7, 2014 at 12:24 AM, Charles R Harris > >> wrote: > >> > > >> > On Wed, Aug 6, 2014 at 4:57 PM, Nathaniel Smith > wrote: > >> >> > >> >> On Wed, Aug 6, 2014 at 4:32 PM, Charles R Harris > >> >> wrote: > >> >> > Should also mention that we don't have the ability to operate on > stacked > >> >> > vectors because they can't be identified by dimension info. One > >> >> > workaround > >> >> > is to add dummy dimensions where needed, another is to add two > flags, > >> >> > row > >> >> > and col, and set them appropriately. Two flags are needed for > backward > >> >> > compatibility, i.e., both false is a traditional array. > >> >> > >> >> It's possible I could be convinced to like this, but it would take a > >> >> substantial amount of convincing :-). It seems like a pretty big > >> >> violation of orthogonality/"one obvious way"/etc. to have two totally > >> >> different ways of representing row/column vectors. > >> >> > >> > > >> > The '@' operator supports matrix stacks, so it would seem we also > need to > >> > support vector stacks. The new addition would only be effective with > the '@' > >> > operator. The main problem I see with flags is that adding them would > >> > require an extensive audit of the C code to make sure they were > preserved. > >> > Another option, already supported to a large extent, is to have row > and col > >> > classes inheriting from ndarray that add nothing, except for a > possible new > >> > transpose type function/method. I did mock up such a class just for > fun, and > >> > also added a 'dyad' function. If we really don't care to support > stacked > >> > vectors we can get by without adding anything. > >> > >> It's possible you could convince me that this is a good idea, but I'm > >> starting at like -0.95 :-). Wouldn't it be vastly simpler to just have > >> np.linalg.matvec, matmat, vecvec or something (each of which are > >> single-liners in terms of @), rather than deal with two different ways > >> of representing row/column vectors everywhere? 
> >> > > > > Sure, but matvec and vecvec would not be supported by '@' except when > vec was 1d because there is no way to distinguish a stack of vectors from a > matrix or a stack of matrices. > > Yes. But @ can never be magic - either people will have to write something > extra to flip these flags on their array objects, or they'll have to write > something extra to describe which operation they want. @ was never intended > to cover every case, just the simple-but-super-common ones that dot covers, > plus a few more (simple broadcasting). We have np.add even though + exists > too... > I don't expect stacked matrices/vectors to be used often, although there are some areas that might make heavy use of them, so I think we could live with the simple implementation, it's just a bit of a wart when there is broadcasting of arrays. Just to be clear, the '@' broadcasting differs from the dot broadcasting, agreed? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Wed Aug 6 20:05:22 2014 From: njs at pobox.com (Nathaniel Smith) Date: Thu, 7 Aug 2014 01:05:22 +0100 Subject: [Numpy-discussion] Preliminary thoughts on implementing __matmul__ In-Reply-To: References: Message-ID: On Thu, Aug 7, 2014 at 1:00 AM, Charles R Harris wrote: > > Just to be clear, the '@' broadcasting differs from > the dot broadcasting, agreed? Right, np.dot does the equivalent of ufunc.outer (i.e., not broadcasting at all), while @ broadcasts. -n -- Nathaniel J. Smith Postdoctoral researcher - Informatics - University of Edinburgh http://vorpus.org From hoogendoorn.eelco at gmail.com Thu Aug 7 03:40:44 2014 From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn) Date: Thu, 7 Aug 2014 09:40:44 +0200 Subject: [Numpy-discussion] Preliminary thoughts on implementing __matmul__ In-Reply-To: References: Message-ID: I don't expect stacked matrices/vectors to be used often, although there are some areas that might make heavy use of them, so I think we could live with the simple implementation, it's just a bit of a wart when there is broadcasting of arrays. Just to be clear, the '@' broadcasting differs from the dot broadcasting, agreed? This lack of elegance and unity combined with frankly; a lack of utility, made me plead @ is a bad idea in the first place; but I guess I lost that debate... On Thu, Aug 7, 2014 at 2:00 AM, Charles R Harris wrote: > > > > On Wed, Aug 6, 2014 at 5:51 PM, Nathaniel Smith wrote: > >> On 7 Aug 2014 00:41, "Charles R Harris" >> wrote: >> > >> > On Wed, Aug 6, 2014 at 5:33 PM, Nathaniel Smith wrote: >> >> >> >> On Thu, Aug 7, 2014 at 12:24 AM, Charles R Harris >> >> wrote: >> >> > >> >> > On Wed, Aug 6, 2014 at 4:57 PM, Nathaniel Smith >> wrote: >> >> >> >> >> >> On Wed, Aug 6, 2014 at 4:32 PM, Charles R Harris >> >> >> wrote: >> >> >> > Should also mention that we don't have the ability to operate on >> stacked >> >> >> > vectors because they can't be identified by dimension info. One >> >> >> > workaround >> >> >> > is to add dummy dimensions where needed, another is to add two >> flags, >> >> >> > row >> >> >> > and col, and set them appropriately. Two flags are needed for >> backward >> >> >> > compatibility, i.e., both false is a traditional array. >> >> >> >> >> >> It's possible I could be convinced to like this, but it would take a >> >> >> substantial amount of convincing :-). It seems like a pretty big >> >> >> violation of orthogonality/"one obvious way"/etc. 
to have two >> totally >> >> >> different ways of representing row/column vectors. >> >> >> >> >> > >> >> > The '@' operator supports matrix stacks, so it would seem we also >> need to >> >> > support vector stacks. The new addition would only be effective with >> the '@' >> >> > operator. The main problem I see with flags is that adding them would >> >> > require an extensive audit of the C code to make sure they were >> preserved. >> >> > Another option, already supported to a large extent, is to have row >> and col >> >> > classes inheriting from ndarray that add nothing, except for a >> possible new >> >> > transpose type function/method. I did mock up such a class just for >> fun, and >> >> > also added a 'dyad' function. If we really don't care to support >> stacked >> >> > vectors we can get by without adding anything. >> >> >> >> It's possible you could convince me that this is a good idea, but I'm >> >> starting at like -0.95 :-). Wouldn't it be vastly simpler to just have >> >> np.linalg.matvec, matmat, vecvec or something (each of which are >> >> single-liners in terms of @), rather than deal with two different ways >> >> of representing row/column vectors everywhere? >> >> >> > >> > Sure, but matvec and vecvec would not be supported by '@' except when >> vec was 1d because there is no way to distinguish a stack of vectors from a >> matrix or a stack of matrices. >> >> Yes. But @ can never be magic - either people will have to write >> something extra to flip these flags on their array objects, or they'll have >> to write something extra to describe which operation they want. @ was never >> intended to cover every case, just the simple-but-super-common ones that >> dot covers, plus a few more (simple broadcasting). We have np.add even >> though + exists too... >> > I don't expect stacked matrices/vectors to be used often, although there > are some areas that might make heavy use of them, so I think we could live > with the simple implementation, it's just a bit of a wart when there is > broadcasting of arrays. Just to be clear, the '@' broadcasting differs from > the dot broadcasting, agreed? > > Chuck > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From sebastian at sipsolutions.net Thu Aug 7 05:40:53 2014 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Thu, 07 Aug 2014 11:40:53 +0200 Subject: [Numpy-discussion] Preliminary thoughts on implementing __matmul__ In-Reply-To: References: Message-ID: <1407404453.6447.3.camel@sebastian-t440> On Mi, 2014-08-06 at 14:05 -0700, Chris Barker wrote: > On Wed, Aug 6, 2014 at 8:32 AM, Charles R Harris > wrote: > Should also mention that we don't have the ability to operate > on stacked vectors because they can't be identified by > dimension info. One workaround is to add dummy dimensions > where needed, another is to add two flags, row and col, and > set them appropriately. > > > I've thought for ages that if you want to naturally do linear algebra, > you need to capture the concept of a row and column vector as distinct > from each-other and from (1,n) and (n,1) shape arrays. So: > As a first thought I am against flags. We have dot, and vdot, which ideally would at some point do stacked matrix-matrix and stacked vector-vector (albeit vdot does complex conjugation). 
vector-matrix and
matrix-vector would require the user to use (1, n) or (n, 1) matrices.
If someone can convince me that this is a big deal, flags might be the
only option, though...

- Sebastian

>
> +1
>
> -Chris
>
> --
>
> Christopher Barker, Ph.D.
> Oceanographer
>
> Emergency Response Division
> NOAA/NOS/OR&R            (206) 526-6959   voice
> 7600 Sand Point Way NE   (206) 526-6329   fax
> Seattle, WA  98115       (206) 526-6317   main reception
>
> Chris.Barker at noaa.gov
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

From Nicolas.Rougier at inria.fr Thu Aug 7 07:16:13 2014
From: Nicolas.Rougier at inria.fr (Nicolas P. Rougier)
Date: Thu, 7 Aug 2014 13:16:13 +0200
Subject: [Numpy-discussion] Inverted indices
Message-ID: 

Hi,

I've a small problem for which I cannot find a solution and I'm quite sure
there is an obvious one:

I've an array Z (any dtype) with some data.
I've a (sorted) array I (of integer, same size as Z) that tells me the
index of Z[i] (if necessary, the index can be stored in Z).

Now, I have an arbitrary sequence S of indices (in the sense of I), how do
I build the corresponding data ?

Here is a small example:

Z = [(0,0), (1,1), (2,2), (3,3), (4,4)]
I = [0, 20, 23, 24, 37]

S = [ 20,20,0,24]
-> Result should be [(1,1), (1,1), (0,0),(3,3)]

S = [15,15]
-> Wrong (15 not in I) but ideally, I would like this to be converted to [(0,0), (0,0)]

Any idea ?

Nicolas

From stefan at sun.ac.za Thu Aug 7 07:31:12 2014
From: stefan at sun.ac.za (=?UTF-8?Q?St=C3=A9fan_van_der_Walt?=)
Date: Thu, 7 Aug 2014 13:31:12 +0200
Subject: [Numpy-discussion] Inverted indices
In-Reply-To: 
References: 
Message-ID: 

Hi Nicolas

On Thu, Aug 7, 2014 at 1:16 PM, Nicolas P. Rougier wrote:
> Here is a small example:
>
> Z = [(0,0), (1,1), (2,2), (3,3), (4,4)]
> I = [0, 20, 23, 24, 37]
>
> S = [ 20,20,0,24]
> -> Result should be [(1,1), (1,1), (0,0),(3,3)]
>
> S = [15,15]
> -> Wrong (15 not in I) but ideally, I would like this to be converted to [(0,0), (0,0)]

First try:

Z = np.array([(0,0), (1,1), (2,2), (3,3), (4,4)])
I = np.array([0, 20, 23, 24, 37])
S = np.array([ 20,20,0,24,15])

out = np.zeros((len(S), len(Z[0])))
mask = (S[:, np.newaxis] == I)
item, coord = np.where(mask)
out[item, :] = Z[coord]

Perhaps there's a neater way of doing it!

Stéfan

From Nicolas.Rougier at inria.fr Thu Aug 7 07:54:10 2014
From: Nicolas.Rougier at inria.fr (Nicolas P. Rougier)
Date: Thu, 7 Aug 2014 13:54:10 +0200
Subject: [Numpy-discussion] Inverted indices
In-Reply-To: 
References: 
Message-ID: <3C707DA6-D3B1-41BB-BA37-27EB524BA9AE@inria.fr>

Nice ! Thanks Stéfan. I will add it to the numpy 100 problems.

Nicolas

On 07 Aug 2014, at 13:31, Stéfan van der Walt wrote:

> Hi Nicolas
>
> On Thu, Aug 7, 2014 at 1:16 PM, Nicolas P. Rougier
> wrote:
>> Here is a small example:
>>
>> Z = [(0,0), (1,1), (2,2), (3,3), (4,4)]
>> I = [0, 20, 23, 24, 37]
>>
>> S = [ 20,20,0,24]
>> -> Result should be [(1,1), (1,1), (0,0),(3,3)]
>>
>> S = [15,15]
>> -> Wrong (15 not in I) but ideally, I would like this to be converted to [(0,0), (0,0)]
>
> First try:
>
> Z = np.array([(0,0), (1,1), (2,2), (3,3), (4,4)])
> I = np.array([0, 20, 23, 24, 37])
> S = np.array([ 20,20,0,24,15])
>
> out = np.zeros((len(S), len(Z[0])))
> mask = (S[:, np.newaxis] == I)
> item, coord = np.where(mask)
> out[item, :] = Z[coord]
>
> Perhaps there's a neater way of doing it!
>
> Stéfan
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

From gregor.thalhammer at gmail.com Thu Aug 7 07:59:05 2014
From: gregor.thalhammer at gmail.com (Gregor Thalhammer)
Date: Thu, 7 Aug 2014 13:59:05 +0200
Subject: [Numpy-discussion] Inverted indices
In-Reply-To: 
References: 
Message-ID: <4928D13D-0208-49A8-AA05-A5D4BAFEE950@gmail.com>

Am 07.08.2014 um 13:16 schrieb Nicolas P. Rougier :

>
> Hi,
>
> I've a small problem for which I cannot find a solution and I'm quite sure there is an obvious one:
>
> I've an array Z (any dtype) with some data.
> I've a (sorted) array I (of integer, same size as Z) that tells me the index of Z[i] (if necessary, the index can be stored in Z).
>
> Now, I have an arbitrary sequence S of indices (in the sense of I), how do I build the corresponding data ?
>
> Here is a small example:
>
> Z = [(0,0), (1,1), (2,2), (3,3), (4,4)]
> I = [0, 20, 23, 24, 37]
>
> S = [ 20,20,0,24]
> -> Result should be [(1,1), (1,1), (0,0),(3,3)]
>
> S = [15,15]
> -> Wrong (15 not in I) but ideally, I would like this to be converted to [(0,0), (0,0)]
>
>
> Any idea ?
>

If I is sorted, I would propose to use a bisection algorithm, faster than linear search:

Z = array([(0,0), (1,1), (2,2), (3,3), (4,4)])
I = array([0, 20, 23, 24, 37])
S = array([ 20,20,0,24,15,27])

a = zeros(S.shape, dtype=int)
b = a + I.shape[0]-1
for i in range(int(log2(I.shape[0]))+2):
    c = (a+b)>>1
    sel = I[c]<=S
    a[sel] = c[sel]
    b[~sel] = c[~sel]

Z[c]

If I[c] != S, then there is no corresponding index entry in I to match S.

Gregor

From gregor.thalhammer at gmail.com Thu Aug 7 08:04:38 2014
From: gregor.thalhammer at gmail.com (Gregor Thalhammer)
Date: Thu, 7 Aug 2014 14:04:38 +0200
Subject: [Numpy-discussion] Inverted indices
In-Reply-To: <4928D13D-0208-49A8-AA05-A5D4BAFEE950@gmail.com>
References: <4928D13D-0208-49A8-AA05-A5D4BAFEE950@gmail.com>
Message-ID: <9988E53C-EC30-44A7-8E0E-B83AA707CB83@gmail.com>

Am 07.08.2014 um 13:59 schrieb Gregor Thalhammer :

>
> Am 07.08.2014 um 13:16 schrieb Nicolas P. Rougier :
>
>>
>> Hi,
>>
>> I've a small problem for which I cannot find a solution and I'm quite sure there is an obvious one:
>>
>> I've an array Z (any dtype) with some data.
>> I've a (sorted) array I (of integer, same size as Z) that tells me the index of Z[i] (if necessary, the index can be stored in Z).
>>
>> Now, I have an arbitrary sequence S of indices (in the sense of I), how do I build the corresponding data ?
>>
>> Here is a small example:
>>
>> Z = [(0,0), (1,1), (2,2), (3,3), (4,4)]
>> I = [0, 20, 23, 24, 37]
>>
>> S = [ 20,20,0,24]
>> -> Result should be [(1,1), (1,1), (0,0),(3,3)]
>>
>> S = [15,15]
>> -> Wrong (15 not in I) but ideally, I would like this to be converted to [(0,0), (0,0)]
>>
>>
>> Any idea ?
>>
>
> If I is sorted, I would propose to use a bisection algorithm, faster than linear search:
>
> Z = array([(0,0), (1,1), (2,2), (3,3), (4,4)])
> I = array([0, 20, 23, 24, 37])
> S = array([ 20,20,0,24,15,27])
>
> a = zeros(S.shape, dtype=int)
> b = a + I.shape[0]-1
> for i in range(int(log2(I.shape[0]))+2):
>     c = (a+b)>>1
>     sel = I[c]<=S
>     a[sel] = c[sel]
>     b[~sel] = c[~sel]

or even simpler:
c = searchsorted(I, S)
Z[c]

Gregor

>
> Z[c]
>
> If I[c] != S, then there is no corresponding index entry in I to match S.
>
> Gregor
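To also get the fallback Nicolas asked for (unmatched entries mapping to
Z[0]), a small sketch on top of searchsorted -- assuming I is sorted and
Z[0] is the designated fallback value:

import numpy as np

Z = np.array([(0,0), (1,1), (2,2), (3,3), (4,4)])
I = np.array([0, 20, 23, 24, 37])
S = np.array([20, 20, 0, 24, 15])

c = np.searchsorted(I, S)
c = np.clip(c, 0, len(I) - 1)   # guard insertion points past the end
c[I[c] != S] = 0                # anything not found falls back to Z[0]
print(Z[c])                     # [[1 1] [1 1] [0 0] [3 3] [0 0]]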
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From Nicolas.Rougier at inria.fr Thu Aug 7 08:09:43 2014
From: Nicolas.Rougier at inria.fr (Nicolas P. Rougier)
Date: Thu, 7 Aug 2014 14:09:43 +0200
Subject: [Numpy-discussion] Inverted indices
In-Reply-To: <9988E53C-EC30-44A7-8E0E-B83AA707CB83@gmail.com>
References: <4928D13D-0208-49A8-AA05-A5D4BAFEE950@gmail.com>
 <9988E53C-EC30-44A7-8E0E-B83AA707CB83@gmail.com>
Message-ID: <8B49A50B-BF7E-418D-A2D3-73E16A9FF21F@inria.fr>

Oh thanks, I would never have imagined a one-line solution...

Here are the benchmarks:

In [2]: %timeit stefan(S)
100000 loops, best of 3: 10.8 µs per loop

In [3]: %timeit gregor(S)
10000 loops, best of 3: 48.1 µs per loop

In [4]: %timeit gregor2(S)
100000 loops, best of 3: 3.23 µs per loop

Nicolas

On 07 Aug 2014, at 14:04, Gregor Thalhammer wrote:

>
> Am 07.08.2014 um 13:59 schrieb Gregor Thalhammer :
>
>>
>> Am 07.08.2014 um 13:16 schrieb Nicolas P. Rougier :
>>
>>>
>>> Hi,
>>>
>>> I've a small problem for which I cannot find a solution and I'm quite sure there is an obvious one:
>>>
>>> I've an array Z (any dtype) with some data.
>>> I've a (sorted) array I (of integer, same size as Z) that tells me the index of Z[i] (if necessary, the index can be stored in Z).
>>>
>>> Now, I have an arbitrary sequence S of indices (in the sense of I), how do I build the corresponding data ?
>>>
>>> Here is a small example:
>>>
>>> Z = [(0,0), (1,1), (2,2), (3,3), (4,4)]
>>> I = [0, 20, 23, 24, 37]
>>>
>>> S = [ 20,20,0,24]
>>> -> Result should be [(1,1), (1,1), (0,0),(3,3)]
>>>
>>> S = [15,15]
>>> -> Wrong (15 not in I) but ideally, I would like this to be converted to [(0,0), (0,0)]
>>>
>>>
>>> Any idea ?
>>>
>>
>> If I is sorted, I would propose to use a bisection algorithm, faster than linear search:
>>
>> Z = array([(0,0), (1,1), (2,2), (3,3), (4,4)])
>> I = array([0, 20, 23, 24, 37])
>> S = array([ 20,20,0,24,15,27])
>>
>> a = zeros(S.shape, dtype=int)
>> b = a + I.shape[0]-1
>> for i in range(int(log2(I.shape[0]))+2):
>>     c = (a+b)>>1
>>     sel = I[c]<=S
>>     a[sel] = c[sel]
>>     b[~sel] = c[~sel]
>
> or even simpler:
> c = searchsorted(I, S)
> Z[c]
>
> Gregor
>
>> Z[c]
>>
>> If I[c] != S, then there is no corresponding index entry in I to match S.
>>
>> Gregor
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

From cjw at ncf.ca Thu Aug 7 08:17:20 2014
From: cjw at ncf.ca (cjw)
Date: Thu, 07 Aug 2014 08:17:20 -0400
Subject: [Numpy-discussion] Preliminary thoughts on implementing __matmul__
In-Reply-To: <1407404453.6447.3.camel@sebastian-t440>
References: <1407404453.6447.3.camel@sebastian-t440>
Message-ID: <53E36E50.6020105@ncf.ca>

On 07/08/2014 5:40 AM, Sebastian Berg wrote:
> On Mi, 2014-08-06 at 14:05 -0700, Chris Barker wrote:
>> On Wed, Aug 6, 2014 at 8:32 AM, Charles R Harris
>> wrote:
>>         Should also mention that we don't have the ability to operate
>>         on stacked vectors because they can't be identified by
>>         dimension info. One workaround is to add dummy dimensions
>>         where needed, another is to add two flags, row and col, and
>>         set them appropriately.
>>
>>
>> I've thought for ages that if you want to naturally do linear algebra,
>> you need to capture the concept of a row and column vector as distinct
>> from each other and from (1,n) and (n,1) shape arrays. So:
It's a pity that these ideas weren't incorporated into the Numarray
implementation. The treatment of the scalar was also questionable.
There's time to fix these things.
>> > As a first thought I am against flags. We have dot, and vdot, which > ideally would at some point do stacked matrix-matrix and stacked > vector-vector (albeit vdot does complex conjugation). vector-matrix and > matrix-vector would require the user to use (1, n) or (n, 1) matrices. > If someone can convince me that this is a big deal, flags might be the > only option, though... That's a basic question: Is dot a big deal? Unfortunately, this wasn't examined carefully enough last Spring. > > - Sebastian Colin W. > >> +1 >> >> >> -Chris >> >> >> >> -- >> >> Christopher Barker, Ph.D. >> Oceanographer >> >> Emergency Response Division >> NOAA/NOS/OR&R (206) 526-6959 voice >> 7600 Sand Point Way NE (206) 526-6329 fax >> Seattle, WA 98115 (206) 526-6317 main reception >> >> Chris.Barker at noaa.gov >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From nouiz at nouiz.org Thu Aug 7 12:54:38 2014 From: nouiz at nouiz.org (=?UTF-8?B?RnLDqWTDqXJpYyBCYXN0aWVu?=) Date: Thu, 7 Aug 2014 12:54:38 -0400 Subject: [Numpy-discussion] ANN: NumPy 1.8.2 release candidate In-Reply-To: References: <53E1343E.7020805@googlemail.com> <53E13F72.8070501@uci.edu> <53E1452D.5090001@googlemail.com> Message-ID: All Theano tests work. thanks! Fred On Tue, Aug 5, 2014 at 8:46 PM, Matthew Brett wrote: > Hi, > > On Tue, Aug 5, 2014 at 2:27 PM, Matthew Brett > wrote: > > Hi, > > > > On Tue, Aug 5, 2014 at 1:57 PM, Julian Taylor > > wrote: > >> On 05.08.2014 22:32, Christoph Gohlke wrote: > >>> On 8/5/2014 12:45 PM, Julian Taylor wrote: > >>>> Hello, > >>>> > >>>> I am pleased to announce the first release candidate for numpy 1.8.2, > a > >>>> pure bugfix release for the 1.8.x series. > >>>> https://sourceforge.net/projects/numpy/files/NumPy/1.8.2rc1/ > >>>> > >>>> If no regressions show up the final release is planned this weekend. > >>>> The upgrade is recommended for all users of the 1.8.x series. > >>>> > >>>> Following issues have been fixed: > >>>> * gh-4836: partition produces wrong results for multiple selections in > >>>> equal ranges > >>>> * gh-4656: Make fftpack._raw_fft threadsafe > >>>> * gh-4628: incorrect argument order to _copyto in in np.nanmax, > np.nanmin > >>>> * gh-4613: Fix lack of NULL check in array_richcompare > >>>> * gh-4642: Hold GIL for converting dtypes types with fields > >>>> * gh-4733: fix np.linalg.svd(b, compute_uv=False) > >>>> * gh-4853: avoid unaligned simd load on reductions on i386 > >>>> * gh-4774: avoid unaligned access for strided byteswap > >>>> * gh-650: Prevent division by zero when creating arrays from some > buffers > >>>> * gh-4602: ifort has issues with optimization flag O2, use O1 > >>>> > >>>> Source tarballs, windows installers and release notes can be found at > >>>> https://sourceforge.net/projects/numpy/files/NumPy/1.8.2rc1/ > >>>> > >>>> Cheers, > >>>> Julian Taylor > >>>> > >>> > >>> Hello, > >>> > >>> thank you. Looks good. All builds and tests pass on Windows (using > >>> msvc/MKL). > >>> > >>> Any chance gh-4722 can make it into the release? > >>> Fix seg fault converting empty string to object > >>> > >>> > >> > >> thanks, I missed that one, pretty simple, I'll add it to the final > release. 
> > > > OSX wheels built and tested and uploaded OK : > > > > http://wheels.scikit-image.org > > > > https://travis-ci.org/matthew-brett/numpy-atlas-binaries/builds/31747958 > > OSX wheel tested OK against current scipy stack for system Python, > python.org Python, homebrew, macports: > > https://travis-ci.org/matthew-brett/scipy-stack-osx-testing/builds/31756325 > > Cheers, > > Matthew > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Thu Aug 7 22:32:21 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 7 Aug 2014 20:32:21 -0600 Subject: [Numpy-discussion] OpenBLAS and dotblas Message-ID: Hi All, It looks like numpy dot only uses BLAS if ATLAS is present, see numpy/core/setup.py. Has anyone done the mods needed to use OpenBLAS? What is the current status of using OpenBLAS with numpy? Also, I'm thinking of moving linalg/lapack_lite/blas_lite.c down into the core. Thoughts? Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Thu Aug 7 23:42:15 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Thu, 7 Aug 2014 21:42:15 -0600 Subject: [Numpy-discussion] OpenBLAS and dotblas In-Reply-To: References: Message-ID: On Thu, Aug 7, 2014 at 8:32 PM, Charles R Harris wrote: > Hi All, > > It looks like numpy dot only uses BLAS if ATLAS is present, see > numpy/core/setup.py. Has anyone done the mods needed to use OpenBLAS? What > is the current status of using OpenBLAS with numpy? > NVM, the comments and "NO_ATLAS_INFO" are respectively wrong and deceptive. Ugh. > > Also, I'm thinking of moving linalg/lapack_lite/blas_lite.c down into the > core. Thoughts? > > Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From kikocorreoso at gmail.com Fri Aug 8 03:31:59 2014 From: kikocorreoso at gmail.com (Kiko) Date: Fri, 8 Aug 2014 09:31:59 +0200 Subject: [Numpy-discussion] Calculation of a hessian Message-ID: Hi all, I am trying to calculate a Hessian. I am using numdifftools for this ( https://pypi.python.org/pypi/Numdifftools). My question is, is it possible to make it using pure numpy?. The actual code is like this: *import numdifftools as nd* *import numpy as np* *def log_likelihood(params):* * sum1 = 0; sum2 = 0* * mu = params[0]; sigma = params[1]; xi = params[2]* * for z in data:* * x = 1 + xi * ((z-mu)/sigma)* * sum1 += np.log(x)* * sum2 += x**(-1.0/xi)* * return -((-len(data) * np.log(sigma)) - (1 + 1/xi)*sum1 - sum2) # negated so we can use 'minimum'* *kk = nd.Hessian(log_likelihood)* Thanks in advance. -------------- next part -------------- An HTML attachment was scrubbed... URL: From jgomezdans at gmail.com Fri Aug 8 05:51:41 2014 From: jgomezdans at gmail.com (Jose Gomez-Dans) Date: Fri, 8 Aug 2014 10:51:41 +0100 Subject: [Numpy-discussion] Calculation of a hessian In-Reply-To: References: Message-ID: Your function looks fairly simple to differentiate by hand, but if you have access to the gradient (or you estimate it numerically using scipy...), this function might do the job: def hessian ( x, the_func, epsilon=1e-8): """Numerical approximation to the Hessian Parameters ------------ x: array-like The evaluation point the_func: function The function. 
We assume that the function returns the function value and the associated gradient as the second return element epsilon: float The size of the step """ N = x.size h = np.zeros((N,N)) df_0 = the_func ( x )[1] for i in xrange(N): xx0 = 1.*x[i] x[i] = xx0 + epsilon df_1 = the_func ( x )[1] h[i,:] = (df_1 - df_0)/epsilon x[i] = xx0 return h Jose On 8 August 2014 08:31, Kiko wrote: > Hi all, > > I am trying to calculate a Hessian. I am using numdifftools for this ( > https://pypi.python.org/pypi/Numdifftools). > > My question is, is it possible to make it using pure numpy?. > > The actual code is like this: > > > *import numdifftools as nd* > *import numpy as np* > > *def log_likelihood(params):* > * sum1 = 0; sum2 = 0* > * mu = params[0]; sigma = params[1]; xi = params[2]* > * for z in data:* > * x = 1 + xi * ((z-mu)/sigma)* > * sum1 += np.log(x)* > * sum2 += x**(-1.0/xi)* > * return -((-len(data) * np.log(sigma)) - (1 + 1/xi)*sum1 - sum2) # > negated so we can use 'minimum'* > > *kk = nd.Hessian(log_likelihood)* > > Thanks in advance. > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From hoogendoorn.eelco at gmail.com Fri Aug 8 10:37:48 2014 From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn) Date: Fri, 8 Aug 2014 16:37:48 +0200 Subject: [Numpy-discussion] Calculation of a hessian In-Reply-To: References: Message-ID: Do it in pure numpy? How about copying the source of numdifftools? What exactly is the obstacle to using numdifftools? There seem to be no licensing issues. In my experience, its a crafty piece of work; and calculating a hessian correctly, accounting for all kinds of nasty floating point issues, is no walk in the park. Even if an analytical derivative isn't too big a pain in the ass to implement, there is a good chance that what numdifftools does is more numerically stable (though in all likelihood much slower). The only good reason for a specialized solution I can think of is speed; but be aware what you are trading it in for. If speed is your major concern though, you really cant go wrong with Theano. http://deeplearning.net/software/theano/library/gradient.html#theano.gradient.hessian -------------- next part -------------- An HTML attachment was scrubbed... URL: From sturla.molden at gmail.com Fri Aug 8 18:31:09 2014 From: sturla.molden at gmail.com (Sturla Molden) Date: Fri, 8 Aug 2014 22:31:09 +0000 (UTC) Subject: [Numpy-discussion] OpenBLAS and dotblas References: Message-ID: <1675599749429229671.817278sturla.molden-gmail.com@news.gmane.org> Charles R Harris wrote: > It looks like numpy dot only uses BLAS if ATLAS is present, see > numpy/core/setup.py. Has anyone done the mods needed to use OpenBLAS? What > is the current status of using OpenBLAS with numpy? I thought it also uses BLAS if MKL or Accerate Framework is present, but I am not sure about OpenBLAS, ACML or Cray libsci. Sturla From matthew.brett at gmail.com Fri Aug 8 21:41:21 2014 From: matthew.brett at gmail.com (Matthew Brett) Date: Fri, 8 Aug 2014 18:41:21 -0700 Subject: [Numpy-discussion] Help - numpy / scipy binary compatibility Message-ID: Hi, I would be very happy of some help trying to work out a numpy package binary incompatibility. 
I'm trying to work out what's happening for this ticket: https://github.com/scipy/scipy/issues/3863 which I summarized at the end: https://github.com/scipy/scipy/issues/3863#issuecomment-51669861 but basically, we're getting these errors: RuntimeWarning: numpy.dtype size changed, may indicate binary incompatibility I now realize I am lost in the world of numpy / scipy etc binary compatibility, I'd really like some advice. In this case numpy == 1.8.1 scipy == 0.14.0 - compiled against numpy 1.5.1 scikit-learn == 0.15.1 compiled against numpy 1.6.0 Can y'all see any potential problem with those dependencies in binary builds? The relevant scipy Cython c files seem to guard against raising this error by doing not-strict checks of the e.g. numpy dtype, so I am confused how these errors come about. Can anyone give any pointers? Cheers, Matthew From cournape at gmail.com Fri Aug 8 23:04:57 2014 From: cournape at gmail.com (David Cournapeau) Date: Sat, 9 Aug 2014 12:04:57 +0900 Subject: [Numpy-discussion] Help - numpy / scipy binary compatibility In-Reply-To: References: Message-ID: On Sat, Aug 9, 2014 at 10:41 AM, Matthew Brett wrote: > Hi, > > I would be very happy of some help trying to work out a numpy package > binary incompatibility. > > I'm trying to work out what's happening for this ticket: > > https://github.com/scipy/scipy/issues/3863 > > which I summarized at the end: > > https://github.com/scipy/scipy/issues/3863#issuecomment-51669861 > > but basically, we're getting these errors: > > RuntimeWarning: numpy.dtype size changed, may indicate binary > incompatibility > > I now realize I am lost in the world of numpy / scipy etc binary > compatibility, I'd really like some advice. In this case > > numpy == 1.8.1 > scipy == 0.14.0 - compiled against numpy 1.5.1 > scikit-learn == 0.15.1 compiled against numpy 1.6.0 > > Can y'all see any potential problem with those dependencies in binary > builds? > > The relevant scipy Cython c files seem to guard against raising this > error by doing not-strict checks of the e.g. numpy dtype, so I am > confused how these errors come about. Can anyone give any pointers? > Assuming the message is not bogus, I would try import von_mises with a venv containing numpy 1.5.1, then 1.6.0, etc... to detect when the change happened. David > Cheers, > > Matthew > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From cmkleffner at gmail.com Sat Aug 9 08:15:16 2014 From: cmkleffner at gmail.com (Carl Kleffner) Date: Sat, 9 Aug 2014 14:15:16 +0200 Subject: [Numpy-discussion] OpenBLAS and dotblas In-Reply-To: <1675599749429229671.817278sturla.molden-gmail.com@news.gmane.org> References: <1675599749429229671.817278sturla.molden-gmail.com@news.gmane.org> Message-ID: numpy dot is using BLAS with OpenBLAS. Tested on Linux and Windows (see https://bitbucket.org/carlkl/mingw-w64-for-python/downloads) Regards Carl 2014-08-09 0:31 GMT+02:00 Sturla Molden : > Charles R Harris wrote: > > > It looks like numpy dot only uses BLAS if ATLAS is present, see > > numpy/core/setup.py. Has anyone done the mods needed to use OpenBLAS? > What > > is the current status of using OpenBLAS with numpy? > > I thought it also uses BLAS if MKL or Accerate Framework is present, but I > am not sure about OpenBLAS, ACML or Cray libsci. 
> > Sturla > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jtaylor.debian at googlemail.com Sat Aug 9 08:38:02 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Sat, 09 Aug 2014 14:38:02 +0200 Subject: [Numpy-discussion] ANN: NumPy 1.8.2 bugfix release Message-ID: <53E6162A.8050809@googlemail.com> Hello, I am pleased to announce the release of NumPy 1.8.2, a pure bugfix release for the 1.8.x series. https://sourceforge.net/projects/numpy/files/NumPy/1.8.2/ The upgrade is recommended for all users of the 1.8.x series. Following issues have been fixed: * gh-4836: partition produces wrong results for multiple selections in equal ranges * gh-4656: Make fftpack._raw_fft threadsafe * gh-4628: incorrect argument order to _copyto in in np.nanmax, np.nanmin * gh-4642: Hold GIL for converting dtypes types with fields * gh-4733: fix np.linalg.svd(b, compute_uv=False) * gh-4853: avoid unaligned simd load on reductions on i386 * gh-4722: Fix seg fault converting empty string to object * gh-4613: Fix lack of NULL check in array_richcompare * gh-4774: avoid unaligned access for strided byteswap * gh-650: Prevent division by zero when creating arrays from some buffers * gh-4602: ifort has issues with optimization flag O2, use O1 The source distributions have been uploaded to PyPI. The Windows installers, documentation and release notes can be found at: https://sourceforge.net/projects/numpy/files/NumPy/1.8.2/ Cheers, Julian Taylor -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 819 bytes Desc: OpenPGP digital signature URL: From charlesr.harris at gmail.com Sat Aug 9 10:28:54 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 9 Aug 2014 08:28:54 -0600 Subject: [Numpy-discussion] OpenBLAS and dotblas In-Reply-To: References: <1675599749429229671.817278sturla.molden-gmail.com@news.gmane.org> Message-ID: On Sat, Aug 9, 2014 at 6:15 AM, Carl Kleffner wrote: > numpy dot is using BLAS with OpenBLAS. Tested on Linux and Windows (see > https://bitbucket.org/carlkl/mingw-w64-for-python/downloads) > > Regards > > Carl > > > 2014-08-09 0:31 GMT+02:00 Sturla Molden : > > Charles R Harris wrote: >> >> > It looks like numpy dot only uses BLAS if ATLAS is present, see >> > numpy/core/setup.py. Has anyone done the mods needed to use OpenBLAS? >> What >> > is the current status of using OpenBLAS with numpy? >> >> I thought it also uses BLAS if MKL or Accerate Framework is present, but I >> am not sure about OpenBLAS, ACML or Cray libsci. >> >> Yeah, I figured that out, there is a comment in dotblas that says not, but checking how things are configured, it turns out they should be good. The original problem seems to have been that dotblas requires cblas and can't work with fortran blas. OTOH, linalg uses the f2c interface and, I presume, can link to fortran libraries. It would be nice to unify the two at some point. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From matti.picus at gmail.com Sat Aug 9 15:35:34 2014 From: matti.picus at gmail.com (Matti Picus) Date: Sat, 09 Aug 2014 22:35:34 +0300 Subject: [Numpy-discussion] NumPy-Discussion OpenBLAS and dotblas In-Reply-To: References: Message-ID: <53E67806.70602@gmail.com> Hi. 
I am working on numpy in pypy. It would be much more challenging for me if you merged more code into the core of numpy, that means even more must be duplicated when using a different underlying ndarray implementation. If you are thinking of touching linalg/lapack_lite/blas_lite, I would prefer a simpler model that uses ndarray as a simple storage container, does not use cpython-specifc capi extentions with reference counting, and interfaces with python via cffi[0]. Matti [0] https://pypi.python.org/pypi/cffi/ On 8/08/2014 8:00 PM, numpy-discussion-request at scipy.org wrote: > Message: 1 > Date: Thu, 7 Aug 2014 20:32:21 -0600 > From: Charles R Harris > Subject: [Numpy-discussion] OpenBLAS and dotblas > To: numpy-discussion > Message-ID: > > Content-Type: text/plain; charset="utf-8" > > Hi All, > > It looks like numpy dot only uses BLAS if ATLAS is present, see > numpy/core/setup.py. Has anyone done the mods needed to use OpenBLAS? What > is the current status of using OpenBLAS with numpy? > > Also, I'm thinking of moving linalg/lapack_lite/blas_lite.c down into the > core. Thoughts? > > Chuck > -------------- next part -------------- > An HTML attachment was scrubbed... > URL: http://mail.scipy.org/pipermail/numpy-discussion/attachments/20140807/848ccce2/attachment-0001.html > > From njs at pobox.com Sat Aug 9 16:11:19 2014 From: njs at pobox.com (Nathaniel Smith) Date: Sat, 9 Aug 2014 21:11:19 +0100 Subject: [Numpy-discussion] NumPy-Discussion OpenBLAS and dotblas In-Reply-To: <53E67806.70602@gmail.com> References: <53E67806.70602@gmail.com> Message-ID: On Sat, Aug 9, 2014 at 8:35 PM, Matti Picus wrote: > Hi. I am working on numpy in pypy. It would be much more challenging for > me if you merged more code into the core of numpy, Hi Matti, I can definitely see how numpy changes cause trouble for you, and sympathize. But, can you elaborate on what kind of changes would make your life easier *that also* help make numpy proper better in their own right? Because unfortunately, I don't see how we can reasonably pass up on improvements to numpy if the only justification is to make numpypy's life easier. (I'd also love to see pypy become usable for general numerical work, but not only is it not there now, I don't see how numpypy will ultimately get us there even if we do help it along -- almost none of the ecosystem can get by numpy's python-level APIs alone.) But obviously if there are changes that are mutually beneficial, well then, that's a lot easier to justify :-) -n -- Nathaniel J. Smith Postdoctoral researcher - Informatics - University of Edinburgh http://vorpus.org From charlesr.harris at gmail.com Sat Aug 9 16:11:29 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 9 Aug 2014 14:11:29 -0600 Subject: [Numpy-discussion] NumPy-Discussion OpenBLAS and dotblas In-Reply-To: <53E67806.70602@gmail.com> References: <53E67806.70602@gmail.com> Message-ID: On Sat, Aug 9, 2014 at 1:35 PM, Matti Picus wrote: > Hi. I am working on numpy in pypy. It would be much more challenging for > me if you merged more code into the core of numpy, that means even more > must be duplicated when using a different underlying ndarray > implementation. > If you are thinking of touching linalg/lapack_lite/blas_lite, I would > prefer a simpler model that uses ndarray as a simple storage container, > does not use cpython-specifc capi extentions with reference counting, > and interfaces with python via cffi[0]. > Matti > > [0] https://pypi.python.org/pypi/cffi/ > Could you be more specific? 
Numpy is pretty tightly bound to the cpython interface, changing that would require a fundamental rethink/redesign/redo Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From sturla.molden at gmail.com Sat Aug 9 17:48:20 2014 From: sturla.molden at gmail.com (Sturla Molden) Date: Sat, 09 Aug 2014 23:48:20 +0200 Subject: [Numpy-discussion] OpenBLAS and dotblas In-Reply-To: References: <1675599749429229671.817278sturla.molden-gmail.com@news.gmane.org> Message-ID: On 09/08/14 16:28, Charles R Harris wrote: > Yeah, I figured that out, there is a comment in dotblas that says not, > but checking how things are configured, it turns out they should be > good. The original problem seems to have been that dotblas requires > cblas and can't work with fortran blas. OTOH, linalg uses the f2c > interface and, I presume, can link to fortran libraries. It would be > nice to unify the two at some point. Right. At least ACML does not implement cblas, so I guess this one is not used then. Does anyone have a comprehensive list of which BLAS libraries are actually used for _dotblas? What is used for numpy.linalg I also don't know. Ideally we should use BLAS and LAPACK if it is present. (I think it does, but I am not sure.) Then there is the issue of C vs. Fortran order input: _dotblas can handle this correctly, but I don't think numpy.linalg is fast for C ordered arrays. But it is often possible to avoid this algorithmically. E.g. call LQ instead of QR if numpy.linalg.qr is invoked with C arrays. (There is little hope of optimizing scipy.linalg this way because of f2py, but numpy.linalg is easier to fix.) And then there is the GIL. Should numpy.linalg release it? Why or why not? Usually we can assume BLAS and LAPACK to be thread-safe, but it depends on the Fortran compiler. (I am not sure about f2c'd lapack_lite and blas_lite, though. f2c is famous for generating code that is not re-entrant.) _dotblas just assumes the BLAS is reentrant, even though it might not be. But we could also just specify that NumPy requires a re-entrant BLAS and LAPACK, and those that don't have one are self to blame for any trouble caused by multithreading. As for Fortran BLAS: Accelerate does not officially support Fortran BLAS or Fortran LAPACK, but there are wrappers for it in SciPy. If I remember correctly it has CLAPACK (not LAPACKE) with f2c ABI. It seems there are quite few things that need fixing or improvement... Sturla From matthew.brett at gmail.com Sat Aug 9 20:23:54 2014 From: matthew.brett at gmail.com (Matthew Brett) Date: Sat, 9 Aug 2014 17:23:54 -0700 Subject: [Numpy-discussion] ANN: NumPy 1.8.2 bugfix release In-Reply-To: <53E6162A.8050809@googlemail.com> References: <53E6162A.8050809@googlemail.com> Message-ID: On Sat, Aug 9, 2014 at 5:38 AM, Julian Taylor wrote: > Hello, > > I am pleased to announce the release of NumPy 1.8.2, a > pure bugfix release for the 1.8.x series. > https://sourceforge.net/projects/numpy/files/NumPy/1.8.2/ > The upgrade is recommended for all users of the 1.8.x series. 
> > Following issues have been fixed: > * gh-4836: partition produces wrong results for multiple selections in > equal ranges > * gh-4656: Make fftpack._raw_fft threadsafe > * gh-4628: incorrect argument order to _copyto in in np.nanmax, np.nanmin > * gh-4642: Hold GIL for converting dtypes types with fields > * gh-4733: fix np.linalg.svd(b, compute_uv=False) > * gh-4853: avoid unaligned simd load on reductions on i386 > * gh-4722: Fix seg fault converting empty string to object > * gh-4613: Fix lack of NULL check in array_richcompare > * gh-4774: avoid unaligned access for strided byteswap > * gh-650: Prevent division by zero when creating arrays from some buffers > * gh-4602: ifort has issues with optimization flag O2, use O1 > > > The source distributions have been uploaded to PyPI. The Windows > installers, documentation and release notes can be found at: > https://sourceforge.net/projects/numpy/files/NumPy/1.8.2/ OSX wheels now also up on pypi, please let us know of any problems, Cheers, Matthew From kikocorreoso at gmail.com Mon Aug 11 02:02:48 2014 From: kikocorreoso at gmail.com (Kiko) Date: Mon, 11 Aug 2014 08:02:48 +0200 Subject: [Numpy-discussion] Calculation of a hessian In-Reply-To: References: Message-ID: 2014-08-08 11:51 GMT+02:00 Jose Gomez-Dans : > Your function looks fairly simple to differentiate by hand, but if you > have access to the gradient (or you estimate it numerically using > scipy...), this function might do the job: > > def hessian ( x, the_func, epsilon=1e-8): > """Numerical approximation to the Hessian > Parameters > ------------ > x: array-like > The evaluation point > the_func: function > The function. We assume that the function returns the function > value and > the associated gradient as the second return element > epsilon: float > The size of the step > """ > > N = x.size > h = np.zeros((N,N)) > df_0 = the_func ( x )[1] > for i in xrange(N): > xx0 = 1.*x[i] > x[i] = xx0 + epsilon > df_1 = the_func ( x )[1] > h[i,:] = (df_1 - df_0)/epsilon > x[i] = xx0 > return h > > Jose > > Hi Jos?, Thanks for the answer. My idea would be to generalise the calculation of the Hessian, not just to differentiate the example I posted and I was wondering if Numpy/Scipy already had something similar to that provided by NumDiffTools. Thanks again. > > On 8 August 2014 08:31, Kiko wrote: > >> Hi all, >> >> I am trying to calculate a Hessian. I am using numdifftools for this ( >> https://pypi.python.org/pypi/Numdifftools). >> >> My question is, is it possible to make it using pure numpy?. >> >> The actual code is like this: >> >> >> *import numdifftools as nd* >> *import numpy as np* >> >> *def log_likelihood(params):* >> * sum1 = 0; sum2 = 0* >> * mu = params[0]; sigma = params[1]; xi = params[2]* >> * for z in data:* >> * x = 1 + xi * ((z-mu)/sigma)* >> * sum1 += np.log(x)* >> * sum2 += x**(-1.0/xi)* >> * return -((-len(data) * np.log(sigma)) - (1 + 1/xi)*sum1 - sum2) # >> negated so we can use 'minimum'* >> >> *kk = nd.Hessian(log_likelihood)* >> >> Thanks in advance. >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From kikocorreoso at gmail.com Mon Aug 11 02:05:14 2014 From: kikocorreoso at gmail.com (Kiko) Date: Mon, 11 Aug 2014 08:05:14 +0200 Subject: [Numpy-discussion] Calculation of a hessian In-Reply-To: References: Message-ID: 2014-08-08 16:37 GMT+02:00 Eelco Hoogendoorn : > Do it in pure numpy? How about copying the source of numdifftools? > Of course it is a solution. I was just wondering if it exist something similar in the numpy/scipy packages so I do not have to use a new third party library to do that. > What exactly is the obstacle to using numdifftools? There seem to be no > licensing issues. In my experience, its a crafty piece of work; and > calculating a hessian correctly, accounting for all kinds of nasty floating > point issues, is no walk in the park. Even if an analytical derivative > isn't too big a pain in the ass to implement, there is a good chance that > what numdifftools does is more numerically stable (though in all likelihood > much slower). > > The only good reason for a specialized solution I can think of is speed; > but be aware what you are trading it in for. If speed is your major concern > though, you really cant go wrong with Theano. > > > http://deeplearning.net/software/theano/library/gradient.html#theano.gradient.hessian > > Thanks, it seems that NumDiffTools is the way to go. > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Mon Aug 11 02:39:12 2014 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 11 Aug 2014 08:39:12 +0200 Subject: [Numpy-discussion] Help - numpy / scipy binary compatibility In-Reply-To: References: Message-ID: On Sat, Aug 9, 2014 at 5:04 AM, David Cournapeau wrote: > > > > On Sat, Aug 9, 2014 at 10:41 AM, Matthew Brett > wrote: > >> Hi, >> >> I would be very happy of some help trying to work out a numpy package >> binary incompatibility. >> >> I'm trying to work out what's happening for this ticket: >> >> https://github.com/scipy/scipy/issues/3863 >> >> which I summarized at the end: >> >> https://github.com/scipy/scipy/issues/3863#issuecomment-51669861 >> >> but basically, we're getting these errors: >> >> RuntimeWarning: numpy.dtype size changed, may indicate binary >> incompatibility >> >> I now realize I am lost in the world of numpy / scipy etc binary >> compatibility, I'd really like some advice. In this case >> >> numpy == 1.8.1 >> scipy == 0.14.0 - compiled against numpy 1.5.1 >> scikit-learn == 0.15.1 compiled against numpy 1.6.0 >> >> Can y'all see any potential problem with those dependencies in binary >> builds? >> >> The relevant scipy Cython c files seem to guard against raising this >> error by doing not-strict checks of the e.g. numpy dtype, so I am >> confused how these errors come about. Can anyone give any pointers? >> > > Assuming the message is not bogus, I would try import von_mises with a > venv containing numpy 1.5.1, then 1.6.0, etc... to detect when the change > happened. > That should be a recent change in 1.8.1, either because the dtype size did actually change or because the silencing of this message in NoseTester (numpy.testing) is not effective anymore. Note that the warning is too agressive, because it triggers both on ABI breaks and on backwards-compatible extensions to dtype. That's why it's filtered in numpy.testing. 
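For completeness, a bare-bones pure-numpy Hessian built on the standard four-point central-difference stencil, needing only the objective function itself (no gradient). Treat it as a sketch under a fixed step size: it has none of the adaptive step-size selection and error control that make numdifftools robust, which is exactly the trade-off discussed elsewhere in this thread:

import numpy as np

def hessian_fd(f, x, eps=1e-5):
    # Central-difference Hessian of a scalar function f at point x.
    # Minimal sketch: fixed step, no error control, O(N**2) evaluations.
    x = np.asarray(x, dtype=float)
    n = x.size
    h = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            ei = np.zeros(n); ei[i] = eps
            ej = np.zeros(n); ej[j] = eps
            # d2f/dxi dxj ~ 4-point central-difference stencil
            h[i, j] = (f(x + ei + ej) - f(x + ei - ej)
                       - f(x - ei + ej) + f(x - ei - ej)) / (4.0 * eps**2)
    return h

Symmetrizing the result afterwards with (h + h.T) / 2 can knock down some of the floating-point noise.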
This was reported to Cython a while ago, not sure they've fixed the issue in the meantime or not. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Mon Aug 11 02:41:56 2014 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Mon, 11 Aug 2014 08:41:56 +0200 Subject: [Numpy-discussion] Help - numpy / scipy binary compatibility In-Reply-To: References: Message-ID: On Mon, Aug 11, 2014 at 8:39 AM, Ralf Gommers wrote: > > > > On Sat, Aug 9, 2014 at 5:04 AM, David Cournapeau > wrote: > >> >> >> >> On Sat, Aug 9, 2014 at 10:41 AM, Matthew Brett >> wrote: >> >>> Hi, >>> >>> I would be very happy of some help trying to work out a numpy package >>> binary incompatibility. >>> >>> I'm trying to work out what's happening for this ticket: >>> >>> https://github.com/scipy/scipy/issues/3863 >>> >>> which I summarized at the end: >>> >>> https://github.com/scipy/scipy/issues/3863#issuecomment-51669861 >>> >>> but basically, we're getting these errors: >>> >>> RuntimeWarning: numpy.dtype size changed, may indicate binary >>> incompatibility >>> >>> I now realize I am lost in the world of numpy / scipy etc binary >>> compatibility, I'd really like some advice. In this case >>> >>> numpy == 1.8.1 >>> scipy == 0.14.0 - compiled against numpy 1.5.1 >>> scikit-learn == 0.15.1 compiled against numpy 1.6.0 >>> >>> Can y'all see any potential problem with those dependencies in binary >>> builds? >>> >>> The relevant scipy Cython c files seem to guard against raising this >>> error by doing not-strict checks of the e.g. numpy dtype, so I am >>> confused how these errors come about. Can anyone give any pointers? >>> >> >> Assuming the message is not bogus, I would try import von_mises with a >> venv containing numpy 1.5.1, then 1.6.0, etc... to detect when the change >> happened. >> > > That should be a recent change in 1.8.1, either because the dtype size did > actually change or because the silencing of this message in NoseTester > (numpy.testing) is not effective anymore. > > Note that the warning is too agressive, because it triggers both on ABI > breaks and on backwards-compatible extensions to dtype. That's why it's > filtered in numpy.testing. This was reported to Cython a while ago, not > sure they've fixed the issue in the meantime or not. > Never mind, saw that you already figured out that it's due to scikit-learn: https://github.com/scipy/scipy/issues/3863 Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthew.brett at gmail.com Mon Aug 11 02:53:19 2014 From: matthew.brett at gmail.com (Matthew Brett) Date: Sun, 10 Aug 2014 23:53:19 -0700 Subject: [Numpy-discussion] Help - numpy / scipy binary compatibility In-Reply-To: References: Message-ID: Hi, On Sun, Aug 10, 2014 at 11:41 PM, Ralf Gommers wrote: > > > > On Mon, Aug 11, 2014 at 8:39 AM, Ralf Gommers > wrote: >> >> >> >> >> On Sat, Aug 9, 2014 at 5:04 AM, David Cournapeau >> wrote: >>> >>> >>> >>> >>> On Sat, Aug 9, 2014 at 10:41 AM, Matthew Brett >>> wrote: >>>> >>>> Hi, >>>> >>>> I would be very happy of some help trying to work out a numpy package >>>> binary incompatibility. 
>>>> >>>> I'm trying to work out what's happening for this ticket: >>>> >>>> https://github.com/scipy/scipy/issues/3863 >>>> >>>> which I summarized at the end: >>>> >>>> https://github.com/scipy/scipy/issues/3863#issuecomment-51669861 >>>> >>>> but basically, we're getting these errors: >>>> >>>> RuntimeWarning: numpy.dtype size changed, may indicate binary >>>> incompatibility >>>> >>>> I now realize I am lost in the world of numpy / scipy etc binary >>>> compatibility, I'd really like some advice. In this case >>>> >>>> numpy == 1.8.1 >>>> scipy == 0.14.0 - compiled against numpy 1.5.1 >>>> scikit-learn == 0.15.1 compiled against numpy 1.6.0 >>>> >>>> Can y'all see any potential problem with those dependencies in binary >>>> builds? >>>> >>>> The relevant scipy Cython c files seem to guard against raising this >>>> error by doing not-strict checks of the e.g. numpy dtype, so I am >>>> confused how these errors come about. Can anyone give any pointers? >>> >>> >>> Assuming the message is not bogus, I would try import von_mises with a >>> venv containing numpy 1.5.1, then 1.6.0, etc... to detect when the change >>> happened. >> >> >> That should be a recent change in 1.8.1, either because the dtype size did >> actually change or because the silencing of this message in NoseTester >> (numpy.testing) is not effective anymore. >> >> Note that the warning is too agressive, because it triggers both on ABI >> breaks and on backwards-compatible extensions to dtype. That's why it's >> filtered in numpy.testing. This was reported to Cython a while ago, not sure >> they've fixed the issue in the meantime or not. > > > Never mind, saw that you already figured out that it's due to scikit-learn: > https://github.com/scipy/scipy/issues/3863 Yes, sorry, I should have reported back to the list - the problem was that sklearn is removing all the warnings filters that numpy installs. For reference, no, Cython left the warnings are they were. As you implied, Cython raises these warnings (if not filtered by numpy) if the various numpy C structs have increased size since compile time, including numpy dtype ufuncs and ndarrays. The structs get bigger when we add to the entries in the struct; this is backwards compatible but not forwards compatible. Because I got very confused I wrote this little piece of code to show current compiled and in-memory sizes of dtypes and ufuncs: https://github.com/matthew-brett/npsizes Giving (result of ./npreport): Numpy version: 1.5.1 dtype: static size 80; memory size 80 ndarray: static size 80; memory size 80 ufunc: static size 144; memory size 144 Numpy version: 1.6.0 dtype: static size 80; memory size 80 ndarray: static size 80; memory size 80 ufunc: static size 144; memory size 144 Numpy version: 1.7.1 dtype: static size 88; memory size 88 ndarray: static size 80; memory size 80 ufunc: static size 176; memory size 176 Numpy version: 1.8.1 dtype: static size 88; memory size 88 ndarray: static size 80; memory size 80 ufunc: static size 192; memory size 192 on OSX.... 
Cheers, Matthew From jtaylor.debian at googlemail.com Mon Aug 11 03:30:30 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Mon, 11 Aug 2014 09:30:30 +0200 Subject: [Numpy-discussion] Help - numpy / scipy binary compatibility In-Reply-To: References: Message-ID: <53E87116.1040107@googlemail.com> On 11.08.2014 08:53, Matthew Brett wrote: > Hi, > > On Sun, Aug 10, 2014 at 11:41 PM, Ralf Gommers wrote: >> >> >> >> On Mon, Aug 11, 2014 at 8:39 AM, Ralf Gommers >> wrote: >>> >>> >>> >>> >>> On Sat, Aug 9, 2014 at 5:04 AM, David Cournapeau >>> wrote: >>>> >>>> >>>> >>>> >>>> On Sat, Aug 9, 2014 at 10:41 AM, Matthew Brett >>>> wrote: >>>>> >>>>> Hi, >>>>> >>>>> I would be very happy of some help trying to work out a numpy package >>>>> binary incompatibility. >>>>> >>>>> I'm trying to work out what's happening for this ticket: >>>>> >>>>> https://github.com/scipy/scipy/issues/3863 >>>>> >>>>> which I summarized at the end: >>>>> >>>>> https://github.com/scipy/scipy/issues/3863#issuecomment-51669861 >>>>> >>>>> but basically, we're getting these errors: >>>>> >>>>> RuntimeWarning: numpy.dtype size changed, may indicate binary >>>>> incompatibility >>>>> >>>>> I now realize I am lost in the world of numpy / scipy etc binary >>>>> compatibility, I'd really like some advice. In this case >>>>> >>>>> numpy == 1.8.1 >>>>> scipy == 0.14.0 - compiled against numpy 1.5.1 >>>>> scikit-learn == 0.15.1 compiled against numpy 1.6.0 >>>>> >>>>> Can y'all see any potential problem with those dependencies in binary >>>>> builds? >>>>> >>>>> The relevant scipy Cython c files seem to guard against raising this >>>>> error by doing not-strict checks of the e.g. numpy dtype, so I am >>>>> confused how these errors come about. Can anyone give any pointers? >>>> >>>> >>>> Assuming the message is not bogus, I would try import von_mises with a >>>> venv containing numpy 1.5.1, then 1.6.0, etc... to detect when the change >>>> happened. >>> >>> >>> That should be a recent change in 1.8.1, either because the dtype size did >>> actually change or because the silencing of this message in NoseTester >>> (numpy.testing) is not effective anymore. >>> >>> Note that the warning is too agressive, because it triggers both on ABI >>> breaks and on backwards-compatible extensions to dtype. That's why it's >>> filtered in numpy.testing. This was reported to Cython a while ago, not sure >>> they've fixed the issue in the meantime or not. >> >> >> Never mind, saw that you already figured out that it's due to scikit-learn: >> https://github.com/scipy/scipy/issues/3863 > > Yes, sorry, I should have reported back to the list - the problem was > that sklearn is removing all the warnings filters that numpy installs. > > For reference, no, Cython left the warnings are they were. > > As you implied, Cython raises these warnings (if not filtered by > numpy) if the various numpy C structs have increased size since > compile time, including numpy dtype ufuncs and ndarrays. The structs > get bigger when we add to the entries in the struct; this is backwards > compatible but not forwards compatible. should we deprecate use of the ufunc and dtype structures? Or are the internals of them used too much outside of numpy? I am thinking about changing the ufunc size yet again for 1.10, and it already has far too many members third parties should probably have even seen. 
From kwmsmith at gmail.com Mon Aug 11 17:09:58 2014 From: kwmsmith at gmail.com (Kurt Smith) Date: Mon, 11 Aug 2014 16:09:58 -0500 Subject: [Numpy-discussion] ANN: DistArray 0.5 release Message-ID: =============================================== DistArray 0.5 release =============================================== **Mailing list:** distarray at googlegroups.com **Documentation:** http://distarray.readthedocs.org **License:** Three-clause BSD **Python versions:** 2.7, 3.3, and 3.4 **OS support:** \*nix and Mac OS X What is DistArray? ------------------ DistArray aims to bring the ease-of-use of NumPy to data-parallel high-performance computing. It provides distributed multi-dimensional NumPy arrays, distributed ufuncs, and distributed IO capabilities. It can efficiently interoperate with external distributed libraries like Trilinos. DistArray works with NumPy and builds on top of it in a flexible and natural way. 0.5 Release ----------- Noteworthy improvements in this release include: * closer alignment with NumPy's API, * support for Python 3.4 (existing support for Python 2.7 and 3.3), * a performance-oriented MPI-only mode for deployment on clusters and supercomputers, * a way to register user-defined functions to be callable locally on worker processes, * more consistent naming of sub-packages, * testing with MPICH2 (already tested against OpenMPI), * improved and expanded examples, * installed version testable via ``distarray.test()``, and * performance and scaling improvements. With this release, DistArray ready for real-world testing and deployment. The project is still evolving rapidly and we appreciate the continued input from the larger scientific-Python community. Existing features ----------------- DistArray: * supports NumPy-like slicing, reductions, and ufuncs on distributed multidimensional arrays; * has a client-engine process design -- data resides on the worker processes, commands are initiated from master; * allows full control over what is executed on the worker processes and integrates transparently with the master process; * allows direct communication between workers, bypassing the master process for scalability; * integrates with IPython.parallel for interactive creation and exploration of distributed data; * supports distributed ufuncs (currently without broadcasting); * builds on and leverages MPI via MPI4Py in a transparent and user-friendly way; * has basic support for unstructured arrays; * supports user-controllable array distributions across workers (block, cyclic, block-cyclic, and unstructured) on a per-axis basis; * has a straightforward API to control how an array is distributed; * has basic plotting support for visualization of array distributions; * separates the array?s distribution from the array?s data -- useful for slicing, reductions, redistribution, broadcasting, and other operations; * implements distributed random arrays; * supports ``.npy``-like flat-file IO and hdf5 parallel IO (via ``h5py``); leverages MPI-based IO parallelism in an easy-to-use and transparent way; and * supports the distributed array protocol [protocol]_, which allows independently developed parallel libraries to share distributed arrays without copying, analogous to the PEP-3118 new buffer protocol. 
Planned features and roadmap ---------------------------- Near-term features and improvements include: * array re-distribution capabilities; * lazy evaluation and deferred computation for latency hiding; * interoperation with Trilinos [Trilinos]_; and * distributed broadcasting support. The longer-term roadmap includes: * Integration with other packages [petsc]_ that subscribe to the distributed array protocol [protocol]_; * Distributed fancy indexing; * Out-of-core computations; * Support for distributed sorting and other non-trivial distributed algorithms; and * End-user control over communication and temporary array creation, and other performance aspects of distributed computations. History and funding ------------------- Brian Granger started DistArray as a NASA-funded SBIR project in 2008. Enthought picked it up as part of a DOE Phase II SBIR [SBIR]_ to provide a generally useful distributed array package. It builds on NumPy, MPI, MPI4Py, IPython, IPython.parallel, and interfaces with the Trilinos suite of distributed HPC solvers (via PyTrilinos [Trilinos]_). This material is based upon work supported by the Department of Energy under Award Number DE-SC0007699. This report was prepared as an account of work sponsored by an agency of the United States Government. Neither the United States Government nor any agency thereof, nor any of their employees, makes any warranty, express or implied, or assumes any legal liability or responsibility for the accuracy, completeness, or usefulness of any information, apparatus, product, or process disclosed, or represents that its use would not infringe privately owned rights. Reference herein to any specific commercial product, process, or service by trade name, trademark, manufacturer, or otherwise does not necessarily constitute or imply its endorsement, recommendation, or favoring by the United States Government or any agency thereof. The views and opinions of authors expressed herein do not necessarily state or reflect those of the United States Government or any agency thereof. .. [protocol] http://distributed-array-protocol.readthedocs.org/en/rel-0.10.0/ .. [Trilinos] http://trilinos.org/ .. [petsc] http://www.mcs.anl.gov/petsc/ .. [SBIR] http://www.sbir.gov/sbirsearch/detail/410257 -------------- next part -------------- An HTML attachment was scrubbed... URL: From matti.picus at gmail.com Mon Aug 11 17:46:19 2014 From: matti.picus at gmail.com (Matti Picus) Date: Tue, 12 Aug 2014 00:46:19 +0300 Subject: [Numpy-discussion] NumPy-Discussion OpenBLAS and dotblas In-Reply-To: References: Message-ID: <53E939AB.4010305@gmail.com> Hi Nathaniel. Thanks for your prompt reply. I think numpy is a wonderful project, and you all do a great job moving it forward. If you ask what would my vision for maturing numpy, I would like to see a grouping of linalg matrix-operation functionality into a python level package, exactly the opposite of more tightly tying linalg into the core of numpy. The orthagonality would allow goups like PyOpenCL to reuse the matrix operations on data located off the CPU's RAM, just to give one example; and make it easier for non-numpy developers to create a complete replacement of lapack with other implementations. Much of the linalg package would of course be implemented in c or fortran, but the interface to ndarray would use the well-established idea of contiguous matrices with shapes, strides, and a single memory store, supporting only numeric number types. 
I suggested cffi since it provides a convienent and efficient interface to ndarray. Thus python could remain as a thin wrapper over the calls out to c-based libraries much like lapack_lite does today, but at the python level rather that the capi level. Yes, a python-based interface would slows the code down a bit, but I would argue that 1. the current state of lapack_litemodule.c and umath_linalg.c.src, with its myriad of compile-time macros and complex code paths, scares people away from contributing to the ongoing maintenance of the library while tying the code very closely to the lapack routines, and 2. matrices larger than 3x3 or so should be spending most of the computation time in the underlying lapack/blas library irregardless of whether the interface is python-based or capi-based. Matti On 10/08/2014 8:00 PM, numpy-discussion-request at scipy.org wrote: > > Date: Sat, 9 Aug 2014 21:11:19 +0100 > From: Nathaniel Smith > Subject: Re: [Numpy-discussion] NumPy-Discussion OpenBLAS and dotblas > To: Discussion of Numerical Python > > On Sat, Aug 9, 2014 at 8:35 PM, Matti Picus wrote: >> Hi. I am working on numpy in pypy. It would be much more challenging for >> me if you merged more code into the core of numpy, > Hi Matti, > > I can definitely see how numpy changes cause trouble for you, and > sympathize. But, can you elaborate on what kind of changes would make > your life easier *that also* help make numpy proper better in their > own right? Because unfortunately, I don't see how we can reasonably > pass up on improvements to numpy if the only justification is to make > numpypy's life easier. (I'd also love to see pypy become usable for > general numerical work, but not only is it not there now, I don't see > how numpypy will ultimately get us there even if we do help it along > -- almost none of the ecosystem can get by numpy's python-level APIs > alone.) But obviously if there are changes that are mutually > beneficial, well then, that's a lot easier to justify :-) > > -n From sturla.molden at gmail.com Tue Aug 12 10:06:15 2014 From: sturla.molden at gmail.com (Sturla Molden) Date: Tue, 12 Aug 2014 14:06:15 +0000 (UTC) Subject: [Numpy-discussion] NumPy-Discussion OpenBLAS and dotblas References: <53E939AB.4010305@gmail.com> Message-ID: <794697273429544473.299975sturla.molden-gmail.com@news.gmane.org> Matti Picus wrote: > Thanks for your prompt reply. I think numpy is a wonderful project, and > you all do a great job moving it forward. > If you ask what would my vision for maturing numpy, I would like to see > a grouping of linalg matrix-operation functionality into a python level > package, exactly the opposite of more tightly tying linalg into the core > of numpy. But with the @ operator in Python 3.5 it makes sence to have both matrix multiplication and linear algebra solvers in the core of NumP. Just consider: A @ B A.LazyInverse @ B A @ B.LazyInverse (Lazy matrix inversion with O(1) complexity does not exist right now, but could be added in the future.) To implement this efficiently, we need BLAS and LAPACK in the core of NumPy. It does not mean there would not be a linalg namespace for LU, SVD, et al. Sturla From njs at pobox.com Tue Aug 12 10:26:09 2014 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 12 Aug 2014 15:26:09 +0100 Subject: [Numpy-discussion] NumPy-Discussion OpenBLAS and dotblas In-Reply-To: <53E939AB.4010305@gmail.com> References: <53E939AB.4010305@gmail.com> Message-ID: Hi Matt, On Mon, Aug 11, 2014 at 10:46 PM, Matti Picus wrote: > Hi Nathaniel. 
> Thanks for your prompt reply. I think numpy is a wonderful project, and you > all do a great job moving it forward. > If you ask what would my vision for maturing numpy, I would like to see a > grouping of linalg matrix-operation functionality into a python level > package, exactly the opposite of more tightly tying linalg into the core of > numpy. As I understood it (though I admit Chuck was pretty terse, maybe he'll correct me :-)), what he was proposing was basically just a build system reorganization -- it's much easier to call between C functions that are in the same Python module than C functions that are in different modules, so we end up with lots of boilerplate gunk for the latter. I don't think it would involve any tighter coupling than we already have in practice. > The orthagonality would allow goups like PyOpenCL to reuse the matrix > operations on data located off the CPU's RAM, just to give one example; and > make it easier for non-numpy developers to create a complete replacement of > lapack with other implementations. I guess I don't really understand what you're suggesting. If we have a separate package that is the same as current np.linalg, then how does that allow PyOpenCL to suddenly run the np.linalg code on the GPU? What kind of re-use are you envisioning? The important kind of re-use that comes to mind for me is that I should be able to write code that can accept either a RAM matrix or a GPU matrix and works the same. But the key feature to enable this is that there should be a single API that works on both types of objects -- e.g. np.dot(a, b) should work even if a, b are on the GPU. But this is exactly what __numpy_ufunc__ is designed to enable, and that has nothing to do with splitting linalg off into a separate package... And of course if someone has a better idea about how to implement lapack, then they should do that work in the numpy repo so everyone can benefit, not go off and reimplement their own version from scratch that no-one will use :-). > Much of the linalg package would of > course be implemented in c or fortran, but the interface to ndarray would > use the well-established idea of contiguous matrices with shapes, strides, > and a single memory store, supporting only numeric number types. It's actually possible today for third-party users to add support for third-party dtypes to most linalg operations, b/c most linalg operations are implemented using the numpy ufunc machinery. > I suggested cffi since it provides a convienent and efficient interface to > ndarray. Thus python could remain as a thin wrapper over the calls out to > c-based libraries much like lapack_lite does today, but at the python level > rather that the capi level. > Yes, a python-based interface would slows the code down a bit, but I would > argue that > 1. the current state of lapack_litemodule.c and umath_linalg.c.src, with its > myriad of compile-time macros and complex code paths, scares people away > from contributing to the ongoing maintenance of the library while tying the > code very closely to the lapack routines, and I agree that simple is better than complex, but I don't see how moving those macros and code paths into a separate package decreases complexity. If anything it would increase complexity, because now we have two repos instead of one, two release schedules instead of one, and n^2 combinations of (linalg version, numpy version) to test against. -n > 2. 
matrices larger than 3x3 or so should be spending most of the computation > time in the underlying lapack/blas library irregardless of whether the > interface is python-based or capi-based. > Matti > > On 10/08/2014 8:00 PM, numpy-discussion-request at scipy.org wrote: >> >> >> Date: Sat, 9 Aug 2014 21:11:19 +0100 >> From: Nathaniel Smith >> Subject: Re: [Numpy-discussion] NumPy-Discussion OpenBLAS and dotblas >> To: Discussion of Numerical Python >> >> >> On Sat, Aug 9, 2014 at 8:35 PM, Matti Picus wrote: >>> >>> Hi. I am working on numpy in pypy. It would be much more challenging for >>> me if you merged more code into the core of numpy, >> >> Hi Matti, >> >> >> I can definitely see how numpy changes cause trouble for you, and >> sympathize. But, can you elaborate on what kind of changes would make >> your life easier *that also* help make numpy proper better in their >> own right? Because unfortunately, I don't see how we can reasonably >> pass up on improvements to numpy if the only justification is to make >> numpypy's life easier. (I'd also love to see pypy become usable for >> general numerical work, but not only is it not there now, I don't see >> how numpypy will ultimately get us there even if we do help it along >> -- almost none of the ecosystem can get by numpy's python-level APIs >> alone.) But obviously if there are changes that are mutually >> beneficial, well then, that's a lot easier to justify :-) >> >> -n > > -- Nathaniel J. Smith Postdoctoral researcher - Informatics - University of Edinburgh http://vorpus.org From warren.weckesser at gmail.com Tue Aug 12 11:35:43 2014 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Tue, 12 Aug 2014 11:35:43 -0400 Subject: [Numpy-discussion] New function `count_unique` to generate contingency tables. Message-ID: I created a pull request (https://github.com/numpy/numpy/pull/4958) that defines the function `count_unique`. `count_unique` generates a contingency table from a collection of sequences. For example, In [7]: x = [1, 1, 1, 1, 2, 2, 2, 2, 2] In [8]: y = [3, 4, 3, 3, 3, 4, 5, 5, 5] In [9]: (xvals, yvals), counts = count_unique(x, y) In [10]: xvals Out[10]: array([1, 2]) In [11]: yvals Out[11]: array([3, 4, 5]) In [12]: counts Out[12]: array([[3, 1, 0], [1, 1, 3]]) It can be interpreted as a multi-argument generalization of `np.unique(x, return_counts=True)`. It overlaps with Pandas' `crosstab`, but I think this is a pretty fundamental counting operation that fits in numpy. Matlab's `crosstab` (http://www.mathworks.com/help/stats/crosstab.html) and R's `table` perform the same calculation (with a few more bells and whistles). For comparison, here's Pandas' `crosstab` (same `x` and `y` as above): In [28]: import pandas as pd In [29]: xs = pd.Series(x) In [30]: ys = pd.Series(y) In [31]: pd.crosstab(xs, ys) Out[31]: col_0 3 4 5 row_0 1 3 1 0 2 1 1 3 And here is R's `table`: > x <- c(1,1,1,1,2,2,2,2,2) > y <- c(3,4,3,3,3,4,5,5,5) > table(x, y) y x 3 4 5 1 3 1 0 2 1 1 3 Is there any interest in adding this (or some variation of it) to numpy? Warren -------------- next part -------------- An HTML attachment was scrubbed... URL: From warren.weckesser at gmail.com Tue Aug 12 11:57:57 2014 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Tue, 12 Aug 2014 11:57:57 -0400 Subject: [Numpy-discussion] New function `count_unique` to generate contingency tables. 
In-Reply-To:
References:
Message-ID:

On Tue, Aug 12, 2014 at 11:35 AM, Warren Weckesser <
warren.weckesser at gmail.com> wrote:

> I created a pull request (https://github.com/numpy/numpy/pull/4958) that
> defines the function `count_unique`.  `count_unique` generates a
> contingency table from a collection of sequences.  For example,
>
> In [7]: x = [1, 1, 1, 1, 2, 2, 2, 2, 2]
>
> In [8]: y = [3, 4, 3, 3, 3, 4, 5, 5, 5]
>
> In [9]: (xvals, yvals), counts = count_unique(x, y)
>
> In [10]: xvals
> Out[10]: array([1, 2])
>
> In [11]: yvals
> Out[11]: array([3, 4, 5])
>
> In [12]: counts
> Out[12]:
> array([[3, 1, 0],
>        [1, 1, 3]])
>
>
> It can be interpreted as a multi-argument generalization of `np.unique(x,
> return_counts=True)`.
>
> It overlaps with Pandas' `crosstab`, but I think this is a pretty
> fundamental counting operation that fits in numpy.
>
> Matlab's `crosstab` (http://www.mathworks.com/help/stats/crosstab.html)
> and R's `table` perform the same calculation (with a few more bells and
> whistles).
>
>
> For comparison, here's Pandas' `crosstab` (same `x` and `y` as above):
>
> In [28]: import pandas as pd
>
> In [29]: xs = pd.Series(x)
>
> In [30]: ys = pd.Series(y)
>
> In [31]: pd.crosstab(xs, ys)
> Out[31]:
> col_0  3  4  5
> row_0
> 1      3  1  0
> 2      1  1  3
>
>
> And here is R's `table`:
>
> > x <- c(1,1,1,1,2,2,2,2,2)
> > y <- c(3,4,3,3,3,4,5,5,5)
> > table(x, y)
>    y
> x   3 4 5
>   1 3 1 0
>   2 1 1 3
>
>
> Is there any interest in adding this (or some variation of it) to numpy?
>
>
> Warren
>

While searching StackOverflow in the numpy tag for "count unique", I just
discovered that I basically reinvented Eelco Hoogendoorn's code in his
answer to
http://stackoverflow.com/questions/10741346/numpy-frequency-counts-for-unique-values-in-an-array.
Nice one, Eelco!

Warren

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From hoogendoorn.eelco at gmail.com  Tue Aug 12 12:17:09 2014
From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn)
Date: Tue, 12 Aug 2014 18:17:09 +0200
Subject: [Numpy-discussion] New function `count_unique` to generate
	contingency tables.
In-Reply-To:
References:
Message-ID:

Thanks. Prompted by that stackoverflow question, and similar problems I
had to deal with myself, I started working on a much more general
extension to numpy's functionality in this space. Like you noted, things
get a little panda-y, but I think there is a lot of pandas' functionality
that could or should be part of the numpy core, a robust set of grouping
operations in particular.

see pastebin here:
http://pastebin.com/c5WLWPbp

I've posted about it on this list before, but without apparent interest;
and I haven't gotten around to getting this up to professional standards
yet either. But there is a lot more that could be done in this direction.

Note that the count functionality in the stackoverflow answer is
relatively indirect and inefficient, using the inverse_index and such. A
much more efficient method is obtained by the code used here.

On Tue, Aug 12, 2014 at 5:57 PM, Warren Weckesser <
warren.weckesser at gmail.com> wrote:

>
>
>
> On Tue, Aug 12, 2014 at 11:35 AM, Warren Weckesser <
> warren.weckesser at gmail.com> wrote:
>
>> I created a pull request (https://github.com/numpy/numpy/pull/4958) that
>> defines the function `count_unique`.  `count_unique` generates a
>> contingency table from a collection of sequences.
For example, >> >> In [7]: x = [1, 1, 1, 1, 2, 2, 2, 2, 2] >> >> In [8]: y = [3, 4, 3, 3, 3, 4, 5, 5, 5] >> >> In [9]: (xvals, yvals), counts = count_unique(x, y) >> >> In [10]: xvals >> Out[10]: array([1, 2]) >> >> In [11]: yvals >> Out[11]: array([3, 4, 5]) >> >> In [12]: counts >> Out[12]: >> array([[3, 1, 0], >> [1, 1, 3]]) >> >> >> It can be interpreted as a multi-argument generalization of `np.unique(x, >> return_counts=True)`. >> >> It overlaps with Pandas' `crosstab`, but I think this is a pretty >> fundamental counting operation that fits in numpy. >> >> Matlab's `crosstab` (http://www.mathworks.com/help/stats/crosstab.html) >> and R's `table` perform the same calculation (with a few more bells and >> whistles). >> >> >> For comparison, here's Pandas' `crosstab` (same `x` and `y` as above): >> >> In [28]: import pandas as pd >> >> In [29]: xs = pd.Series(x) >> >> In [30]: ys = pd.Series(y) >> >> In [31]: pd.crosstab(xs, ys) >> Out[31]: >> col_0 3 4 5 >> row_0 >> 1 3 1 0 >> 2 1 1 3 >> >> >> And here is R's `table`: >> >> > x <- c(1,1,1,1,2,2,2,2,2) >> > y <- c(3,4,3,3,3,4,5,5,5) >> > table(x, y) >> y >> x 3 4 5 >> 1 3 1 0 >> 2 1 1 3 >> >> >> Is there any interest in adding this (or some variation of it) to numpy? >> >> >> Warren >> >> > > While searching StackOverflow in the numpy tag for "count unique", I just > discovered that I basically reinvented Eelco Hoogendoorn's code in his > answer to > http://stackoverflow.com/questions/10741346/numpy-frequency-counts-for-unique-values-in-an-array. > Nice one, Eelco! > > Warren > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From joferkington at gmail.com Tue Aug 12 12:33:16 2014 From: joferkington at gmail.com (Joe Kington) Date: Tue, 12 Aug 2014 11:33:16 -0500 Subject: [Numpy-discussion] New function `count_unique` to generate contingency tables. In-Reply-To: References: Message-ID: On Tue, Aug 12, 2014 at 11:17 AM, Eelco Hoogendoorn < hoogendoorn.eelco at gmail.com> wrote: > Thanks. Prompted by that stackoverflow question, and similar problems I > had to deal with myself, I started working on a much more general extension > to numpy's functionality in this space. Like you noted, things get a little > panda-y, but I think there is a lot of panda's functionality that could or > should be part of the numpy core, a robust set of grouping operations in > particular. > > see pastebin here: > http://pastebin.com/c5WLWPbp > On a side note, this is related to a pull request of mine from awhile back: https://github.com/numpy/numpy/pull/3584 There was a lot of disagreement on the mailing list about what to call a "unique slices along a given axis" function, so I wound up closing the pull request pending more discussion. At any rate, I think it's a useful thing to have in "base" numpy. -------------- next part -------------- An HTML attachment was scrubbed... URL: From hoogendoorn.eelco at gmail.com Tue Aug 12 12:51:10 2014 From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn) Date: Tue, 12 Aug 2014 18:51:10 +0200 Subject: [Numpy-discussion] New function `count_unique` to generate contingency tables. In-Reply-To: References: Message-ID: ah yes, that's also an issue I was trying to deal with. 
The semantics I prefer in this type of operator is (as a default) to have
every array be treated as a sequence of keys, so when calling
unique(arr_2d), you'd get unique rows, unless you pass axis=None, in which
case the array is flattened.

I also agree that the extension you propose here is useful; but ideally,
with a little more discussion on these subjects we can converge on an even
more comprehensive overhaul.

On Tue, Aug 12, 2014 at 6:33 PM, Joe Kington wrote:

>
>
>
> On Tue, Aug 12, 2014 at 11:17 AM, Eelco Hoogendoorn <
> hoogendoorn.eelco at gmail.com> wrote:
>
>> Thanks. Prompted by that stackoverflow question, and similar problems I
>> had to deal with myself, I started working on a much more general extension
>> to numpy's functionality in this space. Like you noted, things get a little
>> panda-y, but I think there is a lot of pandas' functionality that could or
>> should be part of the numpy core, a robust set of grouping operations in
>> particular.
>>
>> see pastebin here:
>> http://pastebin.com/c5WLWPbp
>>
>
> On a side note, this is related to a pull request of mine from awhile
> back: https://github.com/numpy/numpy/pull/3584
>
> There was a lot of disagreement on the mailing list about what to call a
> "unique slices along a given axis" function, so I wound up closing the pull
> request pending more discussion.
>
> At any rate, I think it's a useful thing to have in "base" numpy.
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From charlesr.harris at gmail.com  Tue Aug 12 13:50:21 2014
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Tue, 12 Aug 2014 11:50:21 -0600
Subject: [Numpy-discussion] NumPy-Discussion OpenBLAS and dotblas
In-Reply-To:
References: <53E939AB.4010305@gmail.com>
Message-ID:

On Tue, Aug 12, 2014 at 8:26 AM, Nathaniel Smith wrote:

> Hi Matt,
>
> On Mon, Aug 11, 2014 at 10:46 PM, Matti Picus
> wrote:
> > Hi Nathaniel.
> > Thanks for your prompt reply. I think numpy is a wonderful project, and
> you
> > all do a great job moving it forward.
> > If you ask what my vision for maturing numpy would be, I would like to
> see a
> > grouping of linalg matrix-operation functionality into a python level
> > package, exactly the opposite of more tightly tying linalg into the core
> of
> > numpy.
>
> As I understood it (though I admit Chuck was pretty terse, maybe he'll
> correct me :-)), what he was proposing was basically just a build
> system reorganization -- it's much easier to call between C functions
> that are in the same Python module than C functions that are in
> different modules, so we end up with lots of boilerplate gunk for the
> latter. I don't think it would involve any tighter coupling than we
> already have in practice.
>

I'm trying to think of the correct sequence of moves. Here are my current
thoughts.

   - Move _dotblas down into multiarray
      1. When there is cblas, add cblas implementations of descr->f->dot.
      2. Reimplement API matrixproduct2
      3. Make ndarray.dot a first class method and use it for numpy.dot.
   - Implement matmul
      1. Add matrixmultiply (matmul?) to the numpy API
      2. Implement __matmul__ method.
      3. Add functions to linalg for stacked vectors.
      4. Make sure __matmul__ works with __numpy_ufunc__
   - Consider using blas_lite instead of cblas, but that is now independent
   of the previous steps.
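(As a rough illustration of what step 2 under "Implement matmul" means at
the Python level -- a toy sketch only, with a made-up wrapper class, and
assuming an interpreter that has grown the `@` operator per PEP 465; for
ndarray itself the binding would of course live at the C level, next to
wherever the dot implementation ends up after the move:)

import numpy as np

class Wrapped(object):
    """Hypothetical container, used only to illustrate the dispatch."""
    def __init__(self, data):
        self.data = np.asarray(data)

    def __matmul__(self, other):
        # `self @ other` dispatches here; delegate to the existing dot.
        return Wrapped(np.dot(self.data, getattr(other, 'data', other)))

    def __rmatmul__(self, other):
        # `other @ self`, used when `other` does not handle the operation.
        return Wrapped(np.dot(np.asarray(other), self.data))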
Chuck

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From sturla.molden at gmail.com  Tue Aug 12 14:08:47 2014
From: sturla.molden at gmail.com (Sturla Molden)
Date: Tue, 12 Aug 2014 18:08:47 +0000 (UTC)
Subject: [Numpy-discussion] NumPy-Discussion OpenBLAS and dotblas
References: <53E939AB.4010305@gmail.com>
Message-ID: <1001481297429559625.786097sturla.molden-gmail.com@news.gmane.org>

Charles R Harris wrote:

> - Consider using blas_lite instead of cblas, but that is now independent
> of the previous steps.

It should also be possible to build reference cblas on top of blas_lite.
(Or just create a wrapper for the parts of cblas we need.)

Sturla

From sturla.molden at gmail.com  Tue Aug 12 15:24:28 2014
From: sturla.molden at gmail.com (Sturla Molden)
Date: Tue, 12 Aug 2014 19:24:28 +0000 (UTC)
Subject: [Numpy-discussion] NumPy-Discussion OpenBLAS and dotblas
References: <53E939AB.4010305@gmail.com>
Message-ID: <1197348250429560368.571367sturla.molden-gmail.com@news.gmane.org>

Charles R Harris wrote:

> - Move _dotblas down into multiarray
>    1. When there is cblas, add cblas implementations of descr->f->dot.
>    2. Reimplement API matrixproduct2
>    3. Make ndarray.dot a first class method and use it for numpy.dot.
> - Implement matmul
>    1. Add matrixmultiply (matmul?) to the numpy API
>    2. Implement __matmul__ method.
>    3. Add functions to linalg for stacked vectors.
>    4. Make sure __matmul__ works with __numpy_ufunc__
> - Consider using blas_lite instead of cblas, but that is now independent
> of the previous steps.

We could consider having a linalg._linalg module that just stores BLAS and
LAPACK function pointer values as read-only integer attributes. This way we
could move _dotblas into the core without actually having linalg in the
core. linalg._linalg would just sit there and own BLAS and LAPACK, and no
other part of NumPy would need build dependencies on these libraries.

When _dotblas is imported it just imports linalg._linalg and reads whatever
function pointer value it needs. It would also make it possible to remove
BLAS and LAPACK build dependencies from SciPy, as long as we export most or
all of BLAS and LAPACK.

Sturla

From matthew.brett at gmail.com  Tue Aug 12 15:24:39 2014
From: matthew.brett at gmail.com (Matthew Brett)
Date: Tue, 12 Aug 2014 12:24:39 -0700
Subject: [Numpy-discussion] Fwd: We need help working on code coverage for Cython code
In-Reply-To:
References:
Message-ID:

Hi,

Sorry for those of you also on the scikit-image mailing list - but here
again I'm asking for help to get coverage working for Cython code.

Over on another mailing list, we've hit a big problem trying to work out
coverage on a large amount of Cython code.

As y'all probably know, there's no automated way of checking code coverage
on Cython code at the moment. The Cython developers have done some work on
this [1] but it is currently stalled for lack of developer time to work on
it.

We'd really like to get this working, and the Cython developers have
offered to help get this started.

Can anyone help us out by

a) joining an interactive discussion for 15 minutes or so with the Cython
developers to get us started
b) helping with a short burst of coding that will follow (we estimate a
few days)

I think this is something many of us need, and it would also be a thank
you to the Cython team for their work, which we all use so much.
Cheers,

Matthew

[1] http://trac.cython.org/cython_trac/ticket/815

From ralf.gommers at gmail.com  Tue Aug 12 15:32:58 2014
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Tue, 12 Aug 2014 21:32:58 +0200
Subject: [Numpy-discussion] NumPy-Discussion OpenBLAS and dotblas
In-Reply-To: <1197348250429560368.571367sturla.molden-gmail.com@news.gmane.org>
References: <53E939AB.4010305@gmail.com>
	<1197348250429560368.571367sturla.molden-gmail.com@news.gmane.org>
Message-ID:

On Tue, Aug 12, 2014 at 9:24 PM, Sturla Molden wrote:

> Charles R Harris wrote:
>
> > - Move _dotblas down into multiarray
> >    1. When there is cblas, add cblas implementations of descr->f->dot.
> >    2. Reimplement API matrixproduct2
> >    3. Make ndarray.dot a first class method and use it for numpy.dot.
> > - Implement matmul
> >    1. Add matrixmultiply (matmul?) to the numpy API
> >    2. Implement __matmul__ method.
> >    3. Add functions to linalg for stacked vectors.
> >    4. Make sure __matmul__ works with __numpy_ufunc__
> > - Consider using blas_lite instead of cblas, but that is now independent
> > of the previous steps.
>
> We could consider having a linalg._linalg module that just stores BLAS and
> LAPACK function pointer values as read-only integer attributes. This way we
> could move _dotblas into the core without actually having linalg in the
> core. linalg._linalg would just sit there and own BLAS and LAPACK, and no
> other part of NumPy would need build dependencies on these libraries.

Note that those dependencies are optional now.

> When _dotblas is imported it just imports linalg._linalg and reads whatever
> function pointer value it needs. It would also make it possible to remove
> BLAS and LAPACK build dependencies from SciPy, as long as we export most or
> all of BLAS and LAPACK.

That's not possible. The only way you can do that is move the hard
dependency on BLAS & LAPACK to numpy, which we don't want to do.

Ralf

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From stefan at sun.ac.za  Tue Aug 12 15:43:16 2014
From: stefan at sun.ac.za (Stéfan van der Walt)
Date: Tue, 12 Aug 2014 21:43:16 +0200
Subject: [Numpy-discussion] Fwd: We need help working on code coverage for Cython code
In-Reply-To:
References:
Message-ID:

Hi Matthew

On Tue, Aug 12, 2014 at 9:24 PM, Matthew Brett wrote:
> The Cython developers have done some work on this [1] but it is
> currently stalled for lack of developer time to work on it.

It looks like we can help them with the rest of the work once the
lnotab PR is merged; is that correct?

Stéfan

From matthew.brett at gmail.com  Tue Aug 12 15:49:04 2014
From: matthew.brett at gmail.com (Matthew Brett)
Date: Tue, 12 Aug 2014 12:49:04 -0700
Subject: [Numpy-discussion] Fwd: We need help working on code coverage for Cython code
In-Reply-To:
References:
Message-ID:

Hi,

On Tue, Aug 12, 2014 at 12:43 PM, Stéfan van der Walt wrote:
> Hi Matthew
>
> On Tue, Aug 12, 2014 at 9:24 PM, Matthew Brett wrote:
>> The Cython developers have done some work on this [1] but it is
>> currently stalled for lack of developer time to work on it.
>
> It looks like we can help them with the rest of the work once the
> lnotab PR is merged; is that correct?

My very vague impression is that Stefan B thinks of the lnotab PR as
part of the process of getting the work done, so that merging would
only be worthwhile if it was pretty clear that the rest of the work
would happen as well. We could ask again on the Cython list...
Cheers,

Matthew

From stefan at sun.ac.za  Tue Aug 12 15:52:36 2014
From: stefan at sun.ac.za (Stéfan van der Walt)
Date: Tue, 12 Aug 2014 21:52:36 +0200
Subject: [Numpy-discussion] Fwd: We need help working on code coverage for Cython code
In-Reply-To:
References:
Message-ID:

Hi Matthew

On Tue, Aug 12, 2014 at 9:49 PM, Matthew Brett wrote:
> My very vague impression is that Stefan B thinks of the lnotab PR as
> part of the process of getting the work done, so that merging would
> only be worthwhile if it was pretty clear that the rest of the work
> would happen as well. We could ask again on the Cython list...

I think a clear roadmap with small targets would help--few of us know
Cython in much depth, so it would help if we could avoid any duplicate
effort/research.

Thank you for putting this issue on the radar.

Stéfan

From matthew.brett at gmail.com  Tue Aug 12 16:15:36 2014
From: matthew.brett at gmail.com (Matthew Brett)
Date: Tue, 12 Aug 2014 13:15:36 -0700
Subject: [Numpy-discussion] Fwd: We need help working on code coverage for Cython code
In-Reply-To:
References:
Message-ID:

Hi,

On Tue, Aug 12, 2014 at 12:52 PM, Stéfan van der Walt wrote:
> Hi Matthew
>
> On Tue, Aug 12, 2014 at 9:49 PM, Matthew Brett wrote:
>> My very vague impression is that Stefan B thinks of the lnotab PR as
>> part of the process of getting the work done, so that merging would
>> only be worthwhile if it was pretty clear that the rest of the work
>> would happen as well. We could ask again on the Cython list...
>
> I think a clear roadmap with small targets would help--few of us know
> Cython in much depth, so it would help if we could avoid any duplicate
> effort/research.

The first step we thought of was having a group live conversation of
some sort with the Cython developers to get an idea of what work needs
doing. So, I think the first question is - who would be up for joining
that?

From stefan at sun.ac.za  Tue Aug 12 16:27:06 2014
From: stefan at sun.ac.za (Stéfan van der Walt)
Date: Tue, 12 Aug 2014 22:27:06 +0200
Subject: [Numpy-discussion] Fwd: We need help working on code coverage for Cython code
In-Reply-To:
References:
Message-ID:

On Tue, Aug 12, 2014 at 10:15 PM, Matthew Brett wrote:
> The first step we thought of was having a group live conversation of
> some sort with the Cython developers to get an idea of what work needs
> doing. So, I think the first question is - who would be up for
> joining that?

I'd be up for that. Also, perhaps some key Cython players would be at
EuroSciPy, then we can discuss it in person?

Stéfan

From sturla.molden at gmail.com  Tue Aug 12 16:53:13 2014
From: sturla.molden at gmail.com (Sturla Molden)
Date: Tue, 12 Aug 2014 20:53:13 +0000 (UTC)
Subject: [Numpy-discussion] NumPy-Discussion OpenBLAS and dotblas
References: <53E939AB.4010305@gmail.com>
	<1197348250429560368.571367sturla.molden-gmail.com@news.gmane.org>
Message-ID: <458662559429569529.743536sturla.molden-gmail.com@news.gmane.org>

Ralf Gommers wrote:

> That's not possible. The only way you can do that is move the hard
> dependency on BLAS & LAPACK to numpy, which we don't want to do.

But NumPy already depends on BLAS and LAPACK, right?
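(For anyone following along who wants to see what their own NumPy build
actually linked against: np.show_config() prints the BLAS/LAPACK sections
detected at build time -- the exact output varies by build, and a build
without an optimized BLAS reports NOT AVAILABLE for those sections:)

import numpy as np

# Prints e.g. the atlas/openblas/mkl sections detected at build time, or
# NOT AVAILABLE where numpy fell back to its bundled lapack_lite code.
np.show_config()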
From ralf.gommers at gmail.com  Tue Aug 12 17:02:06 2014
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Tue, 12 Aug 2014 23:02:06 +0200
Subject: [Numpy-discussion] NumPy-Discussion OpenBLAS and dotblas
In-Reply-To: <458662559429569529.743536sturla.molden-gmail.com@news.gmane.org>
References: <53E939AB.4010305@gmail.com>
	<1197348250429560368.571367sturla.molden-gmail.com@news.gmane.org>
	<458662559429569529.743536sturla.molden-gmail.com@news.gmane.org>
Message-ID:

On Tue, Aug 12, 2014 at 10:53 PM, Sturla Molden wrote:

> Ralf Gommers wrote:
>
> > That's not possible. The only way you can do that is move the hard
> > dependency on BLAS & LAPACK to numpy, which we don't want to do.
>
> But NumPy already depends on BLAS and LAPACK, right?
>

No. Numpy uses those libs when they're detected, but it falls back on its
own dot implementation if they're not found. From the first bullet under
http://scipy.org/scipylib/building/linux.html#generic-instructions: "BLAS
and LAPACK libraries (optional but strongly recommended for NumPy,
required for SciPy)".

BLAS/LAPACK are heavy dependencies that often give problems, which is why
you don't want to require them for the casual user that only needs numpy
arrays to make some plots, for example.

Ralf

>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From sturla.molden at gmail.com  Tue Aug 12 17:35:56 2014
From: sturla.molden at gmail.com (Sturla Molden)
Date: Tue, 12 Aug 2014 21:35:56 +0000 (UTC)
Subject: [Numpy-discussion] NumPy-Discussion OpenBLAS and dotblas
References: <53E939AB.4010305@gmail.com>
	<1197348250429560368.571367sturla.molden-gmail.com@news.gmane.org>
	<458662559429569529.743536sturla.molden-gmail.com@news.gmane.org>
Message-ID: <1031917335429572037.628140sturla.molden-gmail.com@news.gmane.org>

Ralf Gommers wrote:

> No. Numpy uses those libs when they're detected, but it falls back on its
> own dot implementation if they're not found. From the first bullet under
> http://scipy.org/scipylib/building/linux.html#generic-instructions:
> "BLAS and LAPACK libraries (optional but strongly recommended for NumPy,
> required for SciPy)".
>
> BLAS/LAPACK are heavy dependencies that often give problems, which is why
> you don't want to require them for the casual user that only needs numpy
> arrays to make some plots, for example.

Maybe we are not talking about the same thing, but isn't blas_lite.c and
lapack_lite.c more or less f2c'd versions of reference BLAS and reference
LAPACK?

Sturla

From robert.kern at gmail.com  Tue Aug 12 18:14:16 2014
From: robert.kern at gmail.com (Robert Kern)
Date: Tue, 12 Aug 2014 23:14:16 +0100
Subject: [Numpy-discussion] NumPy-Discussion OpenBLAS and dotblas
In-Reply-To: <1031917335429572037.628140sturla.molden-gmail.com@news.gmane.org>
References: <53E939AB.4010305@gmail.com>
	<1197348250429560368.571367sturla.molden-gmail.com@news.gmane.org>
	<458662559429569529.743536sturla.molden-gmail.com@news.gmane.org>
	<1031917335429572037.628140sturla.molden-gmail.com@news.gmane.org>
Message-ID:

On Tue, Aug 12, 2014 at 10:35 PM, Sturla Molden wrote:
> Ralf Gommers wrote:
>
>> No. Numpy uses those libs when they're detected, but it falls back on its
>> own dot implementation if they're not found. From the first bullet under
>> http://scipy.org/scipylib/building/linux.html#generic-instructions:
>> "BLAS and LAPACK libraries (optional but strongly recommended for NumPy,
>> required for SciPy)".
>>
>> BLAS/LAPACK are heavy dependencies that often give problems, which is why
>> you don't want to require them for the casual user that only needs numpy
>> arrays to make some plots, for example.
>
> Maybe we are not talking about the same thing, but isn't blas_lite.c and
> lapack_lite.c more or less f2c'd versions of reference BLAS and reference
> LAPACK?

Not all of them, no. Just the routines that numpy itself uses. Hence, "lite".

--
Robert Kern

From cjw at ncf.ca  Tue Aug 12 18:19:15 2014
From: cjw at ncf.ca (cjw)
Date: Tue, 12 Aug 2014 18:19:15 -0400
Subject: [Numpy-discussion] NumPy-Discussion OpenBLAS and dotblas
In-Reply-To:
References: <53E939AB.4010305@gmail.com>
Message-ID: <53EA92E3.7010600@ncf.ca>

Charles,

Nothing I've seen so far envisages disturbing the existing, in my opinion
flawed, Matrix Class. I trust that I have not missed anything.

Compilation is a complex process for a person unfamiliar with C. Anything
you could do to simplify that would be welcome.

Colin W.

On 12/08/2014 1:50 PM, Charles R Harris wrote:
>
> On Tue, Aug 12, 2014 at 8:26 AM, Nathaniel Smith
> wrote:
>
>     Hi Matt,
>
>     On Mon, Aug 11, 2014 at 10:46 PM, Matti Picus
>     wrote:
>     > Hi Nathaniel.
>     > Thanks for your prompt reply. I think numpy is a wonderful
>     project, and you
>     > all do a great job moving it forward.
>     > If you ask what my vision for maturing numpy would be, I would like
>     to see a
>     > grouping of linalg matrix-operation functionality into a python
>     level
>     > package, exactly the opposite of more tightly tying linalg into
>     the core of
>     > numpy.
>
>     As I understood it (though I admit Chuck was pretty terse, maybe he'll
>     correct me :-)), what he was proposing was basically just a build
>     system reorganization -- it's much easier to call between C functions
>     that are in the same Python module than C functions that are in
>     different modules, so we end up with lots of boilerplate gunk for the
>     latter. I don't think it would involve any tighter coupling than we
>     already have in practice.
>
> I'm trying to think of the correct sequence of moves. Here are my
> current thoughts.
>
>   * Move _dotblas down into multiarray
>      1. When there is cblas, add cblas implementations of descr->f->dot.
>      2. Reimplement API matrixproduct2
>      3. Make ndarray.dot a first class method and use it for numpy.dot.
>   * Implement matmul
>      1. Add matrixmultiply (matmul?) to the numpy API
>      2. Implement __matmul__ method.
>      3. Add functions to linalg for stacked vectors.
>      4. Make sure __matmul__ works with __numpy_ufunc__
>   * Consider using blas_lite instead of cblas, but that is now independent
>     of the previous steps.
>
> Chuck
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From charlesr.harris at gmail.com  Tue Aug 12 18:29:59 2014
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Tue, 12 Aug 2014 16:29:59 -0600
Subject: [Numpy-discussion] NumPy-Discussion OpenBLAS and dotblas
In-Reply-To: <53EA92E3.7010600@ncf.ca>
References: <53E939AB.4010305@gmail.com> <53EA92E3.7010600@ncf.ca>
Message-ID:

On Tue, Aug 12, 2014 at 4:19 PM, cjw wrote:

> Charles,
>
> Nothing I've seen so far envisages disturbing the existing, in my opinion
> flawed, Matrix Class.
>
> I trust that I have not missed anything.
>
> Compilation is a complex process for a person unfamiliar with C. Anything
> you could do to simplify that would be welcome.
>

We aren't talking about the matrix class, but rather the new '@' operator
to be used with arrays. The implementation of that operator could depend
on routines defined in other modules, as does the current dot method, or
we can move the overall implementation down into the multiarray module.
The question of using blas_lite came up because some things would be a bit
simpler if we didn't need to make the code depend on whether or not there
was a cblas library.

Chuck

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From sturla.molden at gmail.com  Tue Aug 12 19:47:53 2014
From: sturla.molden at gmail.com (Sturla Molden)
Date: Tue, 12 Aug 2014 23:47:53 +0000 (UTC)
Subject: [Numpy-discussion] NumPy-Discussion OpenBLAS and dotblas
References: <1197348250429560368.571367sturla.molden-gmail.com@news.gmane.org>
	<458662559429569529.743536sturla.molden-gmail.com@news.gmane.org>
	<1031917335429572037.628140sturla.molden-gmail.com@news.gmane.org>
Message-ID: <1793334435429579206.133284sturla.molden-gmail.com@news.gmane.org>

Robert Kern wrote:

>>> BLAS/LAPACK are heavy dependencies that often give problems, which is why
>>> you don't want to require them for the casual user that only needs numpy
>>> arrays to make some plots, for example.
>>
>> Maybe we are not talking about the same thing, but isn't blas_lite.c and
>> lapack_lite.c more or less f2c'd versions of reference BLAS and reference
>> LAPACK?
>
> Not all of them, no. Just the routines that numpy itself uses. Hence, "lite".

I thought it got the 'lite' name because Netlib up to LAPACK 3.1.1 had
packages named 'lapack-lite-3.1.1.tgz' in addition to 'lapack-3.1.1.tgz'.
(I am not sure what the differences between the packages were.)

The lapack_lite.c file looks rather complete, but it seems the build
process somehow extracts only parts of it.

Sturla

From robert.kern at gmail.com  Wed Aug 13 05:50:22 2014
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 13 Aug 2014 10:50:22 +0100
Subject: [Numpy-discussion] NumPy-Discussion OpenBLAS and dotblas
In-Reply-To: <1793334435429579206.133284sturla.molden-gmail.com@news.gmane.org>
References: <1197348250429560368.571367sturla.molden-gmail.com@news.gmane.org>
	<458662559429569529.743536sturla.molden-gmail.com@news.gmane.org>
	<1031917335429572037.628140sturla.molden-gmail.com@news.gmane.org>
	<1793334435429579206.133284sturla.molden-gmail.com@news.gmane.org>
Message-ID:

On Wed, Aug 13, 2014 at 12:47 AM, Sturla Molden wrote:
> Robert Kern wrote:
>
>>>> BLAS/LAPACK are heavy dependencies that often give problems, which is why
>>>> you don't want to require them for the casual user that only needs numpy
>>>> arrays to make some plots, for example.
>>>
>>> Maybe we are not talking about the same thing, but isn't blas_lite.c and
>>> lapack_lite.c more or less f2c'd versions of reference BLAS and reference
>>> LAPACK?
>> >> Not all of them, no. Just the routines that numpy itself uses. Hence, "lite". > > I thought it got the 'lite' name because Netlib up to LAPACK 3.1.1 had > packages named 'lapack-lite-3.1.1.tgz' in addition to 'lapack-3.1.1.tgz'. > (I am not sure what the differences between the packages were.) No. https://github.com/numpy/numpy/blob/master/numpy/linalg/lapack_lite/README > The lapack_lite.c file looks rather complete, but it seems the build > process somehow extracts only parts of it. I assume you mean dlapack_lite.c? It is incomplete. It is the end product of taking the full LAPACK 3.0 distribution, stripping out the routines that are not used in numpy, and f2cing the subset. Go ahead and look for the routines in LAPACK 3.0 systematically, and you will find many of them missing. -- Robert Kern From warren.weckesser at gmail.com Wed Aug 13 16:57:07 2014 From: warren.weckesser at gmail.com (Warren Weckesser) Date: Wed, 13 Aug 2014 16:57:07 -0400 Subject: [Numpy-discussion] New function `count_unique` to generate contingency tables. In-Reply-To: References: Message-ID: On Tue, Aug 12, 2014 at 12:51 PM, Eelco Hoogendoorn < hoogendoorn.eelco at gmail.com> wrote: > ah yes, that's also an issue I was trying to deal with. the semantics I > prefer in these type of operators, is (as a default), to have every array > be treated as a sequence of keys, so if calling unique(arr_2d), youd get > unique rows, unless you pass axis=None, in which case the array is > flattened. > > I also agree that the extension you propose here is useful; but ideally, > with a little more discussion on these subjects we can converge on an > even more comprehensive overhaul > > > On Tue, Aug 12, 2014 at 6:33 PM, Joe Kington > wrote: > >> >> >> >> On Tue, Aug 12, 2014 at 11:17 AM, Eelco Hoogendoorn < >> hoogendoorn.eelco at gmail.com> wrote: >> >>> Thanks. Prompted by that stackoverflow question, and similar problems I >>> had to deal with myself, I started working on a much more general extension >>> to numpy's functionality in this space. Like you noted, things get a little >>> panda-y, but I think there is a lot of panda's functionality that could or >>> should be part of the numpy core, a robust set of grouping operations in >>> particular. >>> >>> see pastebin here: >>> http://pastebin.com/c5WLWPbp >>> >> >> On a side note, this is related to a pull request of mine from awhile >> back: https://github.com/numpy/numpy/pull/3584 >> >> There was a lot of disagreement on the mailing list about what to call a >> "unique slices along a given axis" function, so I wound up closing the pull >> request pending more discussion. >> >> At any rate, I think it's a useful thing to have in "base" numpy. >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > Update: I renamed the function to `table` in the pull request: https://github.com/numpy/numpy/pull/4958 Warren -------------- next part -------------- An HTML attachment was scrubbed... URL: From ben.root at ou.edu Wed Aug 13 17:15:27 2014 From: ben.root at ou.edu (Benjamin Root) Date: Wed, 13 Aug 2014 17:15:27 -0400 Subject: [Numpy-discussion] New function `count_unique` to generate contingency tables. 
In-Reply-To:
References:
Message-ID:

The ever-wonderful pylab mode in matplotlib has a table function for
plotting a table of text in a plot. If I remember correctly, what would
happen is that matplotlib's table() function would simply obliterate
numpy's table function. This isn't a show-stopper, I just wanted to point
that out.

Personally, while I wasn't a particular fan of "count_unique" because I
wouldn't necessarily think of it when needing a contingency table, I do
like that it is verb-ish. "table()", in this sense, is not a verb. That
said, I am perfectly fine with it if you are fine with the name collision
in pylab mode.

On Wed, Aug 13, 2014 at 4:57 PM, Warren Weckesser <
warren.weckesser at gmail.com> wrote:

>
>
>
> On Tue, Aug 12, 2014 at 12:51 PM, Eelco Hoogendoorn <
> hoogendoorn.eelco at gmail.com> wrote:
>
>> ah yes, that's also an issue I was trying to deal with. The semantics I
>> prefer in this type of operator is (as a default) to have every array
>> be treated as a sequence of keys, so when calling unique(arr_2d), you'd get
>> unique rows, unless you pass axis=None, in which case the array is
>> flattened.
>>
>> I also agree that the extension you propose here is useful; but ideally,
>> with a little more discussion on these subjects we can converge on an
>> even more comprehensive overhaul.
>>
>>
>> On Tue, Aug 12, 2014 at 6:33 PM, Joe Kington
>> wrote:
>>
>>>
>>>
>>>
>>> On Tue, Aug 12, 2014 at 11:17 AM, Eelco Hoogendoorn <
>>> hoogendoorn.eelco at gmail.com> wrote:
>>>
>>>> Thanks. Prompted by that stackoverflow question, and similar problems I
>>>> had to deal with myself, I started working on a much more general extension
>>>> to numpy's functionality in this space. Like you noted, things get a little
>>>> panda-y, but I think there is a lot of pandas' functionality that could or
>>>> should be part of the numpy core, a robust set of grouping operations in
>>>> particular.
>>>>
>>>> see pastebin here:
>>>> http://pastebin.com/c5WLWPbp
>>>>
>>>
>>> On a side note, this is related to a pull request of mine from awhile
>>> back: https://github.com/numpy/numpy/pull/3584
>>>
>>> There was a lot of disagreement on the mailing list about what to call a
>>> "unique slices along a given axis" function, so I wound up closing the pull
>>> request pending more discussion.
>>>
>>> At any rate, I think it's a useful thing to have in "base" numpy.
>>>
>>> _______________________________________________
>>> NumPy-Discussion mailing list
>>> NumPy-Discussion at scipy.org
>>> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>>>
>>>
>>
>> _______________________________________________
>> NumPy-Discussion mailing list
>> NumPy-Discussion at scipy.org
>> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>>
>>
>
> Update: I renamed the function to `table` in the pull request:
> https://github.com/numpy/numpy/pull/4958
>
>
> Warren
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From warren.weckesser at gmail.com  Wed Aug 13 17:25:35 2014
From: warren.weckesser at gmail.com (Warren Weckesser)
Date: Wed, 13 Aug 2014 17:25:35 -0400
Subject: [Numpy-discussion] New function `count_unique` to generate
	contingency tables.
In-Reply-To: References: Message-ID: On Wed, Aug 13, 2014 at 5:15 PM, Benjamin Root wrote: > The ever-wonderful pylab mode in matplotlib has a table function for > plotting a table of text in a plot. If I remember correctly, what would > happen is that matplotlib's table() function will simply obliterate the > numpy's table function. This isn't a show-stopper, I just wanted to point > that out. > > Personally, while I wasn't a particular fan of "count_unique" because I > wouldn't necessarially think of it when needing a contingency table, I do > like that it is verb-ish. "table()", in this sense, is not a verb. That > said, I am perfectly fine with it if you are fine with the name collision > in pylab mode. > > Thanks for pointing that out. I only changed it to have something that sounded more table-ish, like the Pandas, R and Matlab functions. I won't update it right now, but if there is interest in putting it into numpy, I'll rename it to avoid the pylab conflict. Anything along the lines of `crosstab`, `xtable`, etc., would be fine with me. Warren > On Wed, Aug 13, 2014 at 4:57 PM, Warren Weckesser < > warren.weckesser at gmail.com> wrote: > >> >> >> >> On Tue, Aug 12, 2014 at 12:51 PM, Eelco Hoogendoorn < >> hoogendoorn.eelco at gmail.com> wrote: >> >>> ah yes, that's also an issue I was trying to deal with. the semantics I >>> prefer in these type of operators, is (as a default), to have every array >>> be treated as a sequence of keys, so if calling unique(arr_2d), youd get >>> unique rows, unless you pass axis=None, in which case the array is >>> flattened. >>> >>> I also agree that the extension you propose here is useful; but ideally, >>> with a little more discussion on these subjects we can converge on an >>> even more comprehensive overhaul >>> >>> >>> On Tue, Aug 12, 2014 at 6:33 PM, Joe Kington >>> wrote: >>> >>>> >>>> >>>> >>>> On Tue, Aug 12, 2014 at 11:17 AM, Eelco Hoogendoorn < >>>> hoogendoorn.eelco at gmail.com> wrote: >>>> >>>>> Thanks. Prompted by that stackoverflow question, and similar problems >>>>> I had to deal with myself, I started working on a much more general >>>>> extension to numpy's functionality in this space. Like you noted, things >>>>> get a little panda-y, but I think there is a lot of panda's functionality >>>>> that could or should be part of the numpy core, a robust set of grouping >>>>> operations in particular. >>>>> >>>>> see pastebin here: >>>>> http://pastebin.com/c5WLWPbp >>>>> >>>> >>>> On a side note, this is related to a pull request of mine from awhile >>>> back: https://github.com/numpy/numpy/pull/3584 >>>> >>>> There was a lot of disagreement on the mailing list about what to call >>>> a "unique slices along a given axis" function, so I wound up closing the >>>> pull request pending more discussion. >>>> >>>> At any rate, I think it's a useful thing to have in "base" numpy. 
>>>>
>>>> _______________________________________________
>>>> NumPy-Discussion mailing list
>>>> NumPy-Discussion at scipy.org
>>>> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>>>>
>>>
>>> _______________________________________________
>>> NumPy-Discussion mailing list
>>> NumPy-Discussion at scipy.org
>>> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>>>
>>
>> Update: I renamed the function to `table` in the pull request:
>> https://github.com/numpy/numpy/pull/4958
>>
>> Warren
>>
>> _______________________________________________
>> NumPy-Discussion mailing list
>> NumPy-Discussion at scipy.org
>> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>>
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From hoogendoorn.eelco at gmail.com  Wed Aug 13 18:17:37 2014
From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn)
Date: Thu, 14 Aug 2014 00:17:37 +0200
Subject: [Numpy-discussion] New function `count_unique` to generate
	contingency tables.
In-Reply-To:
References:
Message-ID:

It's pretty easy to implement this table functionality and more on top of
the code I linked above. I still think such a comprehensive overhaul of
arraysetops is worth discussing.

import numpy as np
import grouping

x = [1, 1, 1, 1, 2, 2, 2, 2, 2]
y = [3, 4, 3, 3, 3, 4, 5, 5, 5]
z = np.random.randint(0, 2, (9, 2))

def table(*keys):
    """
    desired table implementation, building on the index object
    cleaner, and more functionality
    performance should be the same
    """
    indices  = [grouping.as_index(k, axis=0) for k in keys]
    uniques  = [i.unique  for i in indices]
    inverses = [i.inverse for i in indices]
    shape    = [i.groups  for i in indices]
    t = np.zeros(shape, np.int)
    # scatter a 1 into t at each joint inverse index
    np.add.at(t, inverses, 1)
    return tuple(uniques), t

# here is how to use
print table(x, y)
# but we can use fancy keys as well; here a composite key and a row-key
print table((x, y), z)
# this effectively creates a sparse matrix equivalent of your desired table
print grouping.count((x, y))

On Wed, Aug 13, 2014 at 11:25 PM, Warren Weckesser <
warren.weckesser at gmail.com> wrote:

>
>
>
> On Wed, Aug 13, 2014 at 5:15 PM, Benjamin Root wrote:
>
>> The ever-wonderful pylab mode in matplotlib has a table function for
>> plotting a table of text in a plot. If I remember correctly, what would
>> happen is that matplotlib's table() function would simply obliterate
>> numpy's table function. This isn't a show-stopper, I just wanted to point
>> that out.
>>
>> Personally, while I wasn't a particular fan of "count_unique" because I
>> wouldn't necessarily think of it when needing a contingency table, I do
>> like that it is verb-ish. "table()", in this sense, is not a verb. That
>> said, I am perfectly fine with it if you are fine with the name collision
>> in pylab mode.
>>
>
> Thanks for pointing that out. I only changed it to have something that
> sounded more table-ish, like the Pandas, R and Matlab functions. I won't
> update it right now, but if there is interest in putting it into numpy,
> I'll rename it to avoid the pylab conflict. Anything along the lines of
> `crosstab`, `xtable`, etc., would be fine with me.
> > Warren > > > >> On Wed, Aug 13, 2014 at 4:57 PM, Warren Weckesser < >> warren.weckesser at gmail.com> wrote: >> >>> >>> >>> >>> On Tue, Aug 12, 2014 at 12:51 PM, Eelco Hoogendoorn < >>> hoogendoorn.eelco at gmail.com> wrote: >>> >>>> ah yes, that's also an issue I was trying to deal with. the semantics I >>>> prefer in these type of operators, is (as a default), to have every array >>>> be treated as a sequence of keys, so if calling unique(arr_2d), youd get >>>> unique rows, unless you pass axis=None, in which case the array is >>>> flattened. >>>> >>>> I also agree that the extension you propose here is useful; but >>>> ideally, with a little more discussion on these subjects we can converge on >>>> an even more comprehensive overhaul >>>> >>>> >>>> On Tue, Aug 12, 2014 at 6:33 PM, Joe Kington >>>> wrote: >>>> >>>>> >>>>> >>>>> >>>>> On Tue, Aug 12, 2014 at 11:17 AM, Eelco Hoogendoorn < >>>>> hoogendoorn.eelco at gmail.com> wrote: >>>>> >>>>>> Thanks. Prompted by that stackoverflow question, and similar problems >>>>>> I had to deal with myself, I started working on a much more general >>>>>> extension to numpy's functionality in this space. Like you noted, things >>>>>> get a little panda-y, but I think there is a lot of panda's functionality >>>>>> that could or should be part of the numpy core, a robust set of grouping >>>>>> operations in particular. >>>>>> >>>>>> see pastebin here: >>>>>> http://pastebin.com/c5WLWPbp >>>>>> >>>>> >>>>> On a side note, this is related to a pull request of mine from awhile >>>>> back: https://github.com/numpy/numpy/pull/3584 >>>>> >>>>> There was a lot of disagreement on the mailing list about what to call >>>>> a "unique slices along a given axis" function, so I wound up closing the >>>>> pull request pending more discussion. >>>>> >>>>> At any rate, I think it's a useful thing to have in "base" numpy. >>>>> >>>>> _______________________________________________ >>>>> NumPy-Discussion mailing list >>>>> NumPy-Discussion at scipy.org >>>>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>>>> >>>>> >>>> >>>> _______________________________________________ >>>> NumPy-Discussion mailing list >>>> NumPy-Discussion at scipy.org >>>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>>> >>>> >>> >>> Update: I renamed the function to `table` in the pull request: >>> https://github.com/numpy/numpy/pull/4958 >>> >>> >>> Warren >>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at scipy.org >>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>> >>> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... 
URL:

From ralf.gommers at gmail.com  Thu Aug 14 13:02:14 2014
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Thu, 14 Aug 2014 19:02:14 +0200
Subject: [Numpy-discussion] Fwd: We need help working on code coverage for Cython code
In-Reply-To:
References:
Message-ID:

On Tue, Aug 12, 2014 at 10:27 PM, Stéfan van der Walt wrote:

> On Tue, Aug 12, 2014 at 10:15 PM, Matthew Brett
> wrote:
> > The first step we thought of was having a group live conversation of
> > some sort with the Cython developers to get an idea of what work needs
> > doing. So, I think the first question is - who would be up for
> > joining that?
>
> I'd be up for that. Also, perhaps some key Cython players would be at
> EuroSciPy, then we can discuss it in person?
>

There are no Cython devs presenting, so likely they're not there at all.
If there's a clear todo-list / roadmap then EuroSciPy may be a good place
to find help with implementing though.

Ralf

> Stéfan
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From noel.pierre.andre at gmail.com  Thu Aug 14 15:07:38 2014
From: noel.pierre.andre at gmail.com (Pierre-Andre Noel)
Date: Thu, 14 Aug 2014 15:07:38 -0400
Subject: [Numpy-discussion] Proposed new feature for numpy.einsum: repeated
	output subscripts as diagonal
Message-ID: <53ED08FA.7040407@gmail.com>

(I created issue 4965 earlier today on this topic, and I have been
advised to email this mailing list to discuss whether it is a good
idea or not. I include my original post as-is, followed by additional
comments.)

I think that the following new feature would make `numpy.einsum` even
more powerful/useful/awesome than it already is. Moreover, the change
should not interfere with existing code, it would preserve the
"minimalistic" spirit of `numpy.einsum`, and the new functionality would
integrate in a seamless/intuitive manner for the users.

In short, the new feature would allow for repeated subscripts to appear
in the "output" part of the `subscripts` parameter (i.e., on the
right-hand side of `->`). The corresponding dimensions in the resulting
`ndarray` would only be filled along their diagonal, leaving the off
diagonal entries to the default value for this `dtype` (typically zero).
Note that the current behavior is to raise an exception when repeated
output subscripts are being used.

This is simplest to describe with an example involving the dual behavior
of `numpy.diag`.

```python
# Extracting the diagonal of a 2-D array.
A = arange(16).reshape(4,4)
print(diag(A))  # Output: [ 0 5 10 15 ]
print(einsum('ii->i', A))  # Same as previous line (current behavior).

# Constructing a diagonal 2-D array.
v = arange(4)
print(diag(v))  # Output: [[0 0 0 0] [0 1 0 0] [0 0 2 0] [0 0 0 3]]
print(einsum('i->ii', v))  # New behavior would be same as previous line.
# The current behavior of the previous line is to raise an exception.
```

In contrast to `numpy.diag`, the approach generalizes to higher
dimensions: `einsum('iii->i', A)` extracts the diagonal of a 3-D array,
and `einsum('i->iii', v)` would build a diagonal 3-D array.

The proposed behavior really starts to shine in more intricate cases.

```python
# Dummy values, these should be probabilities to make sense below.
P_w_ab = arange(24).reshape(3,2,4)
P_y_wxab = arange(144).reshape(3,3,2,2,4)

# With the proposed behavior, the following two lines should be equivalent.
P_xyz_ab = einsum('wab,xa,ywxab,zy->xyzab', P_w_ab, eye(2), P_y_wxab,
eye(3))
also_P_xyz_ab = einsum('wab,ywaab->ayyab', P_w_ab, P_y_wxab)
```

If this is not convincing enough, replace `eye(2)` by
`eye(P_w_ab.shape[1])` and replace `eye(3)` by `eye(P_y_wxab.shape[0])`,
then imagine more dimensions and repeated indices... The new notation
would allow for crisper code and reduce the opportunities for dumb
mistakes.

For those who wonder, the above computation amounts to
$P(X=x,Y=y,Z=z|A=a,B=b) = \sum_w P(W=w|A=a,B=b) P(X=x|A=a)
P(Y=y|W=w,X=x,A=a,B=b) P(Z=z|Y=y)$ with $P(X=x|A=a)=\delta_{xa}$ and
$P(Z=z|Y=y)=\delta_{zy}$ (using LaTeX notation, and $\delta_{ij}$ is
[Kronecker's delta](http://en.wikipedia.org/wiki/Kronecker_delta)).

(End of original post.)

I have been told by @jaimefrio that "The best way of getting a new
feature into numpy is putting it in yourself." Hence, if discussions
here do reveal that this is a good idea, then I may give it a try at
coding it myself. However, I currently know nothing of the inner
workings of numpy/ndarray/einsum, and I have higher priorities right
now. This means that it could take a long while before I contribute any
code, if I ever do. Hence, if anyone feels like doing it, feel free to
do so!

Also, I am aware that storing a lot of zeros in an `ndarray` may not, a
priori, be a desirable avenue. However, there are times when you have
to do it: think of `numpy.eye` as an example. In my case of application,
I use such diagonal structures in the initialization of an `ndarray`
which is later updated through an iterative process. After these
iterations, most of the zeros will be gone. Do other people see a use
for such capabilities?

Thank you for your time and have a nice day.

Sincerely,

Pierre-André Noël

From ben.root at ou.edu  Thu Aug 14 15:21:11 2014
From: ben.root at ou.edu (Benjamin Root)
Date: Thu, 14 Aug 2014 15:21:11 -0400
Subject: [Numpy-discussion] Proposed new feature for numpy.einsum: repeated
	output subscripts as diagonal
In-Reply-To: <53ED08FA.7040407@gmail.com>
References: <53ED08FA.7040407@gmail.com>
Message-ID:

You had me at Kronecker delta... :-) +1

On Thu, Aug 14, 2014 at 3:07 PM, Pierre-Andre Noel <
noel.pierre.andre at gmail.com> wrote:

> (I created issue 4965 earlier today on this topic, and I have been
> advised to email this mailing list to discuss whether it is a good
> idea or not. I include my original post as-is, followed by additional
> comments.)
>
> I think that the following new feature would make `numpy.einsum` even
> more powerful/useful/awesome than it already is. Moreover, the change
> should not interfere with existing code, it would preserve the
> "minimalistic" spirit of `numpy.einsum`, and the new functionality would
> integrate in a seamless/intuitive manner for the users.
>
> In short, the new feature would allow for repeated subscripts to appear
> in the "output" part of the `subscripts` parameter (i.e., on the
> right-hand side of `->`). The corresponding dimensions in the resulting
> `ndarray` would only be filled along their diagonal, leaving the off
> diagonal entries to the default value for this `dtype` (typically zero).
> Note that the current behavior is to raise an exception when repeated
> output subscripts are being used.
>
> This is simplest to describe with an example involving the dual behavior
> of `numpy.diag`.
>
> ```python
> # Extracting the diagonal of a 2-D array.
> A = arange(16).reshape(4,4)
> print(diag(A))  # Output: [ 0 5 10 15 ]
> print(einsum('ii->i', A))  # Same as previous line (current behavior).
>
> # Constructing a diagonal 2-D array.
> v = arange(4)
> print(diag(v))  # Output: [[0 0 0 0] [0 1 0 0] [0 0 2 0] [0 0 0 3]]
> print(einsum('i->ii', v))  # New behavior would be same as previous line.
> # The current behavior of the previous line is to raise an exception.
> ```
>
> In contrast to `numpy.diag`, the approach generalizes to higher
> dimensions: `einsum('iii->i', A)` extracts the diagonal of a 3-D array,
> and `einsum('i->iii', v)` would build a diagonal 3-D array.
>
> The proposed behavior really starts to shine in more intricate cases.
>
> ```python
> # Dummy values, these should be probabilities to make sense below.
> P_w_ab = arange(24).reshape(3,2,4)
> P_y_wxab = arange(144).reshape(3,3,2,2,4)
>
> # With the proposed behavior, the following two lines should be
> equivalent.
> P_xyz_ab = einsum('wab,xa,ywxab,zy->xyzab', P_w_ab, eye(2), P_y_wxab,
> eye(3))
> also_P_xyz_ab = einsum('wab,ywaab->ayyab', P_w_ab, P_y_wxab)
> ```
>
> If this is not convincing enough, replace `eye(2)` by
> `eye(P_w_ab.shape[1])` and replace `eye(3)` by `eye(P_y_wxab.shape[0])`,
> then imagine more dimensions and repeated indices... The new notation
> would allow for crisper code and reduce the opportunities for dumb
> mistakes.
>
> For those who wonder, the above computation amounts to
> $P(X=x,Y=y,Z=z|A=a,B=b) = \sum_w P(W=w|A=a,B=b) P(X=x|A=a)
> P(Y=y|W=w,X=x,A=a,B=b) P(Z=z|Y=y)$ with $P(X=x|A=a)=\delta_{xa}$ and
> $P(Z=z|Y=y)=\delta_{zy}$ (using LaTeX notation, and $\delta_{ij}$ is
> [Kronecker's delta](http://en.wikipedia.org/wiki/Kronecker_delta)).
>
> (End of original post.)
>
> I have been told by @jaimefrio that "The best way of getting a new
> feature into numpy is putting it in yourself." Hence, if discussions
> here do reveal that this is a good idea, then I may give it a try at
> coding it myself. However, I currently know nothing of the inner
> workings of numpy/ndarray/einsum, and I have higher priorities right
> now. This means that it could take a long while before I contribute any
> code, if I ever do. Hence, if anyone feels like doing it, feel free to
> do so!
>
> Also, I am aware that storing a lot of zeros in an `ndarray` may not, a
> priori, be a desirable avenue. However, there are times when you have
> to do it: think of `numpy.eye` as an example. In my case of application,
> I use such diagonal structures in the initialization of an `ndarray`
> which is later updated through an iterative process. After these
> iterations, most of the zeros will be gone. Do other people see a use
> for such capabilities?
>
> Thank you for your time and have a nice day.
>
> Sincerely,
>
> Pierre-André Noël
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From shoyer at gmail.com  Thu Aug 14 15:42:51 2014
From: shoyer at gmail.com (Stephan Hoyer)
Date: Thu, 14 Aug 2014 12:42:51 -0700
Subject: [Numpy-discussion] Proposed new feature for numpy.einsum: repeated
	output subscripts as diagonal
In-Reply-To:
References: <53ED08FA.7040407@gmail.com>
Message-ID:

I think this would be a very nice addition.
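(In the meantime, the construction can be emulated with a small stride
trick; a sketch, with a hypothetical helper name, that just spells out the
flat-index arithmetic for the main diagonal of an N-d cube:)

import numpy as np

def diag_nd(v, ndim):
    # Build the ndim-dimensional diagonal array that the proposed
    # einsum('i->ii...i', v) would return.
    v = np.asarray(v)
    n = len(v)
    out = np.zeros((n,) * ndim, dtype=v.dtype)
    # Element (i, i, ..., i) sits at flat index i * (1 + n + ... + n**(ndim-1)).
    step = (n**ndim - 1) // (n - 1) if n > 1 else 1
    out.reshape(-1)[::step] = v
    return out

# diag_nd(np.arange(4), 2) matches np.diag(np.arange(4)), and
# diag_nd(np.arange(4), 3) is the 3-D array that einsum('i->iii', v)
# would build.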
:-) +1 > > > On Thu, Aug 14, 2014 at 3:07 PM, Pierre-Andre Noel < > noel.pierre.andre at gmail.com> wrote: > >> (I created issue 4965 earlier today on this topic, and I have been >> advised to email to this mailing list to discuss whether it is a good >> idea or not. I include my original post as-is, followed by additional >> comments.) >> >> I think that the following new feature would make `numpy.einsum` even >> more powerful/useful/awesome than it already is. Moreover, the change >> should not interfere with existing code, it would preserve the >> "minimalistic" spirit of `numpy.einsum`, and the new functionality would >> integrate in a seamless/intuitive manner for the users. >> >> In short, the new feature would allow for repeated subscripts to appear >> in the "output" part of the `subscripts` parameter (i.e., on the >> right-hand side of `->`). The corresponding dimensions in the resulting >> `ndarray` would only be filled along their diagonal, leaving the off >> diagonal entries to the default value for this `dtype` (typically zero). >> Note that the current behavior is to raise an exception when repeated >> output subscripts are being used. >> >> This is simplest to describe with an example involving the dual behavior >> of `numpy.diag`. >> >> ```python >> # Extracting the diagonal of a 2-D array. >> A = arange(16).reshape(4,4) >> print(diag(A)) # Output: [ 0 5 10 15 ] >> print(einsum('ii->i', A)) # Same as previous line (current behavior). >> >> # Constructing a diagonal 2-D array. >> v = arange(4) >> print(diag(v)) # Output: [[0 0 0 0] [0 1 0 0] [0 0 2 0] [0 0 0 3]] >> print(einsum('i->ii', v)) # New behavior would be same as previous line. >> # The current behavior of the previous line is to raise an exception. >> ``` >> >> By opposition to `numpy.diag`, the approach generalizes to higher >> dimensions: `einsum('iii->i', A)` extracts the diagonal of a 3-D array, >> and `einsum('i->iii', v)` would build a diagonal 3-D array. >> >> The proposed behavior really starts to shine in more intricate cases. >> >> ```python >> # Dummy values, these should be probabilities to make sense below. >> P_w_ab = arange(24).reshape(3,2,4) >> P_y_wxab = arange(144).reshape(3,3,2,2,4) >> >> # With the proposed behavior, the following two lines should be >> equivalent. >> P_xyz_ab = einsum('wab,xa,ywxab,zy->xyzab', P_w_ab, eye(2), P_y_wxab, >> eye(3)) >> also_P_xyz_ab = einsum('wab,ywaab->ayyab', P_w_ab, P_y_wxab) >> ``` >> >> If this is not convincing enough, replace `eye(2)` by >> `eye(P_w_ab.shape[1])` and replace `eye(3)` by `eye(P_y_wxab.shape[0])`, >> then imagine more dimensions and repeated indices... The new notation >> would allow for crisper codes and reduce the opportunities for dumb >> mistakes. >> >> For those who wonder, the above computation amounts to >> $P(X=x,Y=y,Z=z|A=a,B=b) = \sum_w P(W=w|A=a,B=b) P(X=x|A=a) >> P(Y=y|W=w,X=x,A=a,B=b) P(Z=z|Y=y)$ with $P(X=x|A=a)=\delta_{xa}$ and >> $P(Z=z|Y=y)=\delta_{zy}$ (using LaTeX notation, and $\delta_{ij}$ is >> [Kronecker's delta](http://en.wikipedia.org/wiki/Kronecker_delta)). >> >> (End of original post.) >> >> I have been told by @jaimefrio that "The best way of getting a new >> feature into numpy is putting it in yourself." Hence, if discussions >> here do reveal that this is a good idea, then I may give a try at coding >> it myself. However, I currently know nothing of the inner workings of >> numpy/ndarray/einsum, and I have higher priorities right now. 
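For readers trying the examples, the `eye()`-based line already runs on current einsum; only the repeated-output-subscript spelling is new. A quick shape check, reusing the dummy values from the proposal:

```python
import numpy as np

P_w_ab = np.arange(24.).reshape(3, 2, 4)
P_y_wxab = np.arange(144.).reshape(3, 3, 2, 2, 4)

# The eye()-based spelling works today; 'wab,ywaab->ayyab' is the
# form the proposal would make equivalent to it.
P_xyz_ab = np.einsum('wab,xa,ywxab,zy->xyzab',
                     P_w_ab, np.eye(2), P_y_wxab, np.eye(3))
print(P_xyz_ab.shape)  # (2, 3, 3, 2, 4)
```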
From sebastian at sipsolutions.net  Fri Aug 15 09:46:33 2014
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Fri, 15 Aug 2014 15:46:33 +0200
Subject: [Numpy-discussion] Weighted Covariance/correlation
Message-ID: <1408110393.20638.16.camel@sebastian-t440>

Hi all,

Tom Poole has opened pull request https://github.com/numpy/numpy/pull/4960 to implement weights in np.cov (correlation can be added), somewhat picking up the effort started by Noel Dawe in https://github.com/numpy/numpy/pull/3864.

The pull request currently implements an accuracy-type `weights` keyword argument as the default, with a `repeat_weights` switch to use repeat-type weights instead (frequency-type weights are a special case of this, I think).

As far as I can see, the code is in a state where it can be tested. But since it is a new feature, the names and defaults are up for discussion, so maybe someone who might use such a feature has a preference. I know we had a short discussion about this before, but it was a while ago. For example, another option would be to have the two weights as two keyword arguments instead of a boolean switch.

Regards,

Sebastian

From sebastian at sipsolutions.net  Fri Aug 15 09:53:23 2014
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Fri, 15 Aug 2014 15:53:23 +0200
Subject: [Numpy-discussion] Proposed new feature for numpy.einsum: repeated output subscripts as diagonal
In-Reply-To:
References: <53ED08FA.7040407@gmail.com>
Message-ID: <1408110803.20638.18.camel@sebastian-t440>

On Do, 2014-08-14 at 12:42 -0700, Stephan Hoyer wrote:
> I think this would be a very nice addition.
>
> On Thu, Aug 14, 2014 at 12:21 PM, Benjamin Root <ben.root at ou.edu> wrote:
>         You had me at Kronecker delta... :-) +1
> [...]

Sounds good to me. I don't see a reason for not relaxing the restriction, unless there is some technical issue, but I doubt that.

- Sebastian

_______________________________________________
NumPy-Discussion mailing list
NumPy-Discussion at scipy.org
http://mail.scipy.org/mailman/listinfo/numpy-discussion

From hoogendoorn.eelco at gmail.com  Fri Aug 15 10:42:09 2014
From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn)
Date: Fri, 15 Aug 2014 16:42:09 +0200
Subject: [Numpy-discussion] Proposed new feature for numpy.einsum: repeated output subscripts as diagonal
In-Reply-To: <1408110803.20638.18.camel@sebastian-t440>
References: <53ED08FA.7040407@gmail.com> <1408110803.20638.18.camel@sebastian-t440>
Message-ID:

Agreed; this addition occurred to me as well. Note that the implementation should be straightforward: just allocate an enlarged array, use some striding logic to construct the relevant view, and let einsum's internals act on the view. Hopefully you won't even have to touch the guts of einsum at the C level, because I'd say that isn't for the faint of heart...

On Fri, Aug 15, 2014 at 3:53 PM, Sebastian Berg <sebastian at sipsolutions.net> wrote:

> Sounds good to me. I don't see a reason for not relaxing the
> restriction, unless there is some technical issue, but I doubt that.
> [...]

-------------- next part --------------
An HTML attachment was scrubbed...
URL:
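A minimal sketch of the approach described above, for the simplest proposed case `einsum('i->ii', v)`: allocate the zeroed result and write the contraction result through a strided view of its diagonal (the helper name is illustrative, not an actual numpy API):

```python
import numpy as np

def einsum_i_to_ii(v):  # hypothetical helper, not part of numpy
    # Emulates the proposed einsum('i->ii', v): zeros everywhere
    # except the diagonal, which receives the contraction result.
    n = v.shape[0]
    out = np.zeros((n, n), dtype=v.dtype)
    # In a C-contiguous (n, n) array, every (n + 1)-th element of the
    # flat view is a diagonal entry; ravel() is a writable view here.
    out.ravel()[::n + 1] = v
    return out

v = np.arange(4)
assert np.array_equal(einsum_i_to_ii(v), np.diag(v))
```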
From sebastian at sipsolutions.net  Fri Aug 15 11:01:46 2014
From: sebastian at sipsolutions.net (Sebastian Berg)
Date: Fri, 15 Aug 2014 17:01:46 +0200
Subject: [Numpy-discussion] Proposed new feature for numpy.einsum: repeated output subscripts as diagonal
In-Reply-To:
References: <53ED08FA.7040407@gmail.com> <1408110803.20638.18.camel@sebastian-t440>
Message-ID: <1408114906.20638.20.camel@sebastian-t440>

On Fr, 2014-08-15 at 16:42 +0200, Eelco Hoogendoorn wrote:
> Agreed; this addition occurred to me as well. Note that the
> implementation should be straightforward: just allocate an enlarged
> array, use some striding logic to construct the relevant view, and let
> einsum's internals act on the view.
> [...]

I am not sure that einsum isn't pure C :). But even if it is, it should be doing something identical already for duplicate indices on the inputs...

- Sebastian

_______________________________________________
NumPy-Discussion mailing list
NumPy-Discussion at scipy.org
http://mail.scipy.org/mailman/listinfo/numpy-discussion
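For reference, the asymmetry under discussion as it stands at the time of writing: repeated subscripts are already accepted on the input side, while repeating an output subscript raises (a ValueError in current releases):

```python
import numpy as np

A = np.arange(16).reshape(4, 4)
print(np.einsum('ii->i', A))   # [ 0  5 10 15], same as np.diag(A)

v = np.arange(4)
try:
    np.einsum('i->ii', v)      # the proposed extension
except ValueError as e:
    print('raises:', e)        # current behavior
```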
From hoogendoorn.eelco at gmail.com  Fri Aug 15 11:20:16 2014
From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn)
Date: Fri, 15 Aug 2014 17:20:16 +0200
Subject: [Numpy-discussion] Proposed new feature for numpy.einsum: repeated output subscripts as diagonal
In-Reply-To: <1408114906.20638.20.camel@sebastian-t440>
References: <53ED08FA.7040407@gmail.com> <1408110803.20638.18.camel@sebastian-t440> <1408114906.20638.20.camel@sebastian-t440>
Message-ID:

Well, there is the numpy-API C level, and then there is the arcane macro C level. The two might as well be completely different languages.

Indeed, it should be doing something similar for the inputs. Actually, I think I once wrote a wrapper around einsum/numexpr that performed this kind of generalized indexing... I'll see if I can dig that up.

On Fri, Aug 15, 2014 at 5:01 PM, Sebastian Berg <sebastian at sipsolutions.net> wrote:

> I am not sure that einsum isn't pure C :). But even if it is, it should
> be doing something identical already for duplicate indices on the
> inputs...
> [...]

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From charlesr.harris at gmail.com  Fri Aug 15 13:03:27 2014
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 15 Aug 2014 11:03:27 -0600
Subject: [Numpy-discussion] Question about OpenBLAS affinity setting.
Message-ID:

Hi All,

Seeking some discussion about setting the OpenBLAS affinity. Currently this is set when numeric.py is imported:

    try:
        # disables openblas affinity setting of the main thread that limits
        # python threads or processes to one core
        if 'OPENBLAS_MAIN_FREE' not in os.environ:
            os.environ['OPENBLAS_MAIN_FREE'] = '1'
        if 'GOTOBLAS_MAIN_FREE' not in os.environ:
            os.environ['GOTOBLAS_MAIN_FREE'] = '1'
        from ._dotblas import dot, vdot, inner
    except ImportError:
        ...

Note that the affinity is set whether or not the import of _dotblas fails, which it will if cblas was not found at build time. This all seems a bit hinky to me. If we are always going to set the affinity, it should be in the core/__init__.py file. If the setting is moved after the import, it will still happen if, say, ATLAS cblas is present. It seems to me that there should be a better, more transparent way to do this, especially as I'm in the process of moving some of the cblas implementations down into multiarray. One option is to make this part of the multiarray module import, but without a bit of work that would still depend only on cblas being detected during the build process, rather than on OpenBLAS.

Also, is GotoBLAS still a viable option?

Thoughts?

Chuck

-------------- next part --------------
An HTML attachment was scrubbed...
URL:
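A minimal sketch of one of the options above, assuming the setting simply moves next to the extension import (placement and wording are illustrative, not a patch against numpy):

```python
# Hypothetical top of numpy/core/__init__.py: set the affinity switches
# unconditionally, *before* any extension module that loads the BLAS
# shared library is imported.
import os

for _var in ('OPENBLAS_MAIN_FREE', 'GOTOBLAS_MAIN_FREE'):
    os.environ.setdefault(_var, '1')  # keep any user-provided value

from . import multiarray  # the BLAS reads the environment when loaded
```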
From fperez.net at gmail.com  Fri Aug 15 14:32:26 2014
From: fperez.net at gmail.com (Fernando Perez)
Date: Fri, 15 Aug 2014 11:32:26 -0700
Subject: [Numpy-discussion] Heads-up: Guido van Rossum on mypy-style type annotations...
Message-ID:

Hi folks,

[x-posting to the numba and numpy lists; discussion should be on the python-ideas list, where the python core folks actually reside]

Just a quick note that Guido has proposed to adopt the mypy model for type annotations in the language:

http://thread.gmane.org/gmane.comp.python.ideas/28619

This same heads-up was also sent out by Stefan Behnel to the cython ML; please feel free to pass it on to other communities that might be interested in this problem and can provide feedback to python-core while this is in the discussion phase.

Cheers,

f

--
Fernando Perez (@fperez_org; http://fperez.org)
fperez.net-at-gmail: mailing lists only (I ignore this when swamped!)
fernando.perez-at-berkeley: contact me here for any direct mail

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From hoogendoorn.eelco at gmail.com  Fri Aug 15 15:09:02 2014
From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn)
Date: Fri, 15 Aug 2014 21:09:02 +0200
Subject: [Numpy-discussion] Proposed new feature for numpy.einsum: repeated output subscripts as diagonal
In-Reply-To:
References: <53ED08FA.7040407@gmail.com> <1408110803.20638.18.camel@sebastian-t440> <1408114906.20638.20.camel@sebastian-t440>
Message-ID:

Here is a snippet I extracted from a project with similar aims (integrating the functionality of einsum and numexpr, actually). Not much to it, but in case someone needs a reminder on how to use striding tricks:

http://pastebin.com/kQNySjcj

On Fri, Aug 15, 2014 at 5:20 PM, Eelco Hoogendoorn <hoogendoorn.eelco at gmail.com> wrote:

> Well, there is the numpy-API C level, and then there is the arcane macro
> C level. The two might as well be completely different languages.
> [...]

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From andrea.gavana at gmail.com  Fri Aug 15 17:07:58 2014
From: andrea.gavana at gmail.com (Andrea Gavana)
Date: Fri, 15 Aug 2014 23:07:58 +0200
Subject: [Numpy-discussion] Heads-up: Guido van Rossum on mypy-style type annotations...
In-Reply-To:
References:
Message-ID:

On 15 August 2014 20:32, Fernando Perez wrote:

> Just a quick note that Guido has proposed to adopt the mypy model for
> type annotations in the language:
>
> http://thread.gmane.org/gmane.comp.python.ideas/28619
> [...]

And I thought it was a joke... Luckily it's optional :-) But the thread on Python-ideas is a good read, thanks for the info.

Andrea.

"Imagination Is The Only Weapon In The War Against Reality."
http://www.infinity77.net

# ------------------------------------------------------------- #
def ask_mailing_list_support(email):

    if mention_platform_and_version() and include_sample_app():
        send_message(email)
    else:
        install_malware()
        erase_hard_drives()
# ------------------------------------------------------------- #

-------------- next part --------------
An HTML attachment was scrubbed...
URL:

From andrea.gavana at gmail.com  Tue Aug 19 10:23:10 2014
From: andrea.gavana at gmail.com (Andrea Gavana)
Date: Tue, 19 Aug 2014 16:23:10 +0200
Subject: [Numpy-discussion] Numpy/Fortran puzzle (?)
Message-ID:

Hi All,

I have the following (very ugly) line of code:

    all_results = np.asarray([transm_hist[date_idx, :, idx_main_set[date_idx]]*main_flow[date_idx, 0:n_fluids]
                              for date_idx in xrange(n_dates)])

where transm_hist.shape = (n_dates, n_fluids, n_nodes), main_flow.shape = (n_dates, n_fluids), and idx_main_set is an array of integer indices with idx_main_set.shape = (n_dates,). The resulting array has all_results.shape = (n_dates, n_fluids).

Since that line of code is relatively slow when executed repeatedly, I thought I'd be smart and rewrite it in Fortran, then use f2py to wrap the subroutine. So I wrote this:

    subroutine matmul(transm_hist, idx_main_set, main_flow, all_results, &
                      n_dates, n_fluids, n_nodes)

        implicit none

        integer ( kind = 4 ), intent(in)  :: n_dates, n_fluids, n_nodes
        real    ( kind = 4 ), intent(in)  :: transm_hist(n_dates, n_fluids, n_nodes)
        real    ( kind = 4 ), intent(in)  :: main_flow(n_dates, n_fluids)
        integer ( kind = 4 ), intent(in)  :: idx_main_set(n_dates)
        real    ( kind = 4 ), intent(out) :: all_results(n_dates, n_fluids)

        integer ( kind = 4 ) i, node

        do i = 1, n_dates
            node = int(idx_main_set(i))
            all_results(i, :) = transm_hist(i, 1:n_fluids, node)*main_flow(i, 1:n_fluids)
        enddo

    end

Unfortunately, it appears that I am not getting quite the same results... I know it's a bit of a stretch with so little information, but does anyone have a suggestion on where the culprit might be? Maybe the elementwise multiplication is done differently in NumPy and Fortran, or I am misunderstanding what np.asarray is doing with the list comprehension above?

I appreciate any suggestion, including suggestions for improving the code. Thank you in advance.

Andrea.

"Imagination Is The Only Weapon In The War Against Reality."
http://www.infinity77.net

-------------- next part --------------
An HTML attachment was scrubbed...
URL:
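Two things may be worth double-checking in the snippet above (observations, not a confirmed diagnosis): `idx_main_set` presumably holds 0-based NumPy indices while the Fortran arrays here are 1-based, so `node` may need a `+ 1`; and `real (kind = 4)` is single precision while NumPy defaults to float64, which alone can produce small discrepancies. On the speed side, a sketch of a pure-NumPy alternative that removes the Python loop entirely (dummy shapes and data, just to check the equivalence):

```python
import numpy as np

n_dates, n_fluids, n_nodes = 5, 3, 7
transm_hist = np.random.rand(n_dates, n_fluids, n_nodes)
main_flow = np.random.rand(n_dates, n_fluids)
idx_main_set = np.random.randint(0, n_nodes, size=n_dates)

# Fancy indexing broadcasts (n_dates, 1) against (1, n_fluids) to pick
# transm_hist[d, f, idx_main_set[d]] for every date d and fluid f.
dates = np.arange(n_dates)[:, None]
fluids = np.arange(n_fluids)[None, :]
all_results = transm_hist[dates, fluids, idx_main_set[:, None]] * main_flow

# same values as the list-comprehension version from the post
check = np.asarray([transm_hist[d, :, idx_main_set[d]] * main_flow[d, :n_fluids]
                    for d in range(n_dates)])
assert np.allclose(all_results, check)
```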
URL:

From noel.pierre.andre at gmail.com  Wed Aug 20 09:26:09 2014
From: noel.pierre.andre at gmail.com (Pierre-Andre Noel)
Date: Wed, 20 Aug 2014 09:26:09 -0400
Subject: [Numpy-discussion] Proposed new feature for numpy.einsum: repeated output subscripts as diagonal
In-Reply-To:
References: <53ED08FA.7040407@gmail.com> <1408110803.20638.18.camel@sebastian-t440> <1408114906.20638.20.camel@sebastian-t440>
Message-ID: <53F4A1F1.6070305@gmail.com>

Thanks all for the feedback!

So there appears to be interest in this feature, and I think that I can implement it. However, it may take a while before I do so: I have other priorities right now.

In view of jaimefrio's comment on https://github.com/numpy/numpy/issues/4965 as well as Eelco Hoogendoorn's reply above, here is how I currently intend to implement the feature.

1. Implement a `diag_view` function that uses strides to make a view. The function would use subscripts in a way very similar to `einsum`, except that no commas are allowed and all indices appearing on one side of `->` must also appear on the other side. Like the current `einsum`, indices on the right-hand side of `->` cannot be repeated. For example, `B = diag_view('iij->ij', A)` returns a 2D view `B` of the 3D array `A` where the off-diagonal elements in the first two dimensions of `A` are inaccessible in `B`.

2. The edits to `einsum` itself should be minimal. For the purpose of the following, suppose that the subscripts have the form `lhs+'->'+rhs`, where `lhs` and `rhs` are character strings. To make sure that the current behavior of `einsum` is neither slowed down nor broken by the new functionality, I intend to limit edits to the point where an error would currently be raised due to repeated indices in `rhs`. The following outlines what would replace the current error-raising.

2.1. Extract from `rhs` the first occurrence of each index; call that `rhs_first_oc`.

2.2. If no `out` has been provided to `einsum`, allocate a zeroed-out `ndarray` of the appropriate size, including off-diagonal entries; call that `full_out`. If an `out` was provided to `einsum`, set `full_out = out`.

2.3. Set `diag_out = diag_view(rhs+'->'+rhs_first_oc, full_out)`.

2.4. Call `einsum(lhs+'->'+rhs_first_oc, [...], out=diag_out)`. This call is recursive, but the recursion should stop there.

2.5. Return `full_out`.

Note that if an `out` is provided to `einsum`, the off-diagonal entries are not zeroed out. This should be a documented "feature" of `einsum`.

A disadvantage of this approach is that the subscripts are parsed 2-4 times, depending how you count. However, for a large `ndarray` the bottleneck won't be there anyway.

Thanks again!

Pierre-André Noël

On 08/15/2014 03:09 PM, Eelco Hoogendoorn wrote:
> here is a snippet I extracted from a project with similar aims
> (integrating the functionality of einsum and numexpr, actually)
>
> Not much to it, but in case someone needs a reminder on how to use
> striding tricks: http://pastebin.com/kQNySjcj
> [...]

_______________________________________________
NumPy-Discussion mailing list
NumPy-Discussion at scipy.org
http://mail.scipy.org/mailman/listinfo/numpy-discussion
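A minimal sketch of step 1 of the plan above, assuming an `as_strided`-based implementation with a deliberately simplified parser (error handling omitted; only the name `diag_view` comes from the plan):

```python
import numpy as np
from numpy.lib.stride_tricks import as_strided

def diag_view(subscripts, arr):
    # Repeated lhs subscripts are collapsed onto their diagonal by
    # summing the corresponding strides into a single output axis.
    lhs, rhs = subscripts.split('->')
    assert arr.ndim == len(lhs) and set(lhs) == set(rhs)
    shape, strides = [], []
    for c in rhs:
        axes = [i for i, l in enumerate(lhs) if l == c]
        assert len({arr.shape[i] for i in axes}) == 1  # equal lengths
        shape.append(arr.shape[axes[0]])
        strides.append(sum(arr.strides[i] for i in axes))
    return as_strided(arr, shape=shape, strides=strides)

A = np.arange(27).reshape(3, 3, 3)
print(diag_view('iij->ij', A))  # row i is A[i, i, :]
v2 = np.arange(16).reshape(4, 4)
assert np.array_equal(diag_view('ii->i', v2), np.diag(v2))
```

The same view, being writable for a writable input, is what step 2.3 would hand to einsum as `diag_out`.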
_______________________________________________
NumPy-Discussion mailing list
NumPy-Discussion at scipy.org
http://mail.scipy.org/mailman/listinfo/numpy-discussion
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From jorisvandenbossche at gmail.com Mon Aug 18 07:18:21 2014
From: jorisvandenbossche at gmail.com (Joris Van den Bossche)
Date: Mon, 18 Aug 2014 13:18:21 +0200
Subject: [Numpy-discussion] Docs website down?
Message-ID: 

It seems the docs website of numpy and scipy (http://docs.scipy.org/doc/) is down. Is anyone looking at this? There is even already a stackoverflow question about it.

Best regards,
Joris
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From arman.eshaghi at gmail.com Tue Aug 19 04:18:28 2014
From: arman.eshaghi at gmail.com (Arman Eshaghi)
Date: Tue, 19 Aug 2014 12:48:28 +0430
Subject: [Numpy-discussion] Scipy.org has been unaccessible from UK and US
Message-ID: 

Hi everyone, I apologize if this is not the right place to report this: for some reason docs.scipy.org, including the numpy documentation, has been unavailable since yesterday. I have tried to access it from servers in the UK and US, and the problem seems to be on the scipy servers. scipy.org works fine though. Just wanted to give a heads up.

Thanks
Arman
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From dave.hirschfeld at gmail.com Wed Aug 20 04:39:05 2014
From: dave.hirschfeld at gmail.com (Dave Hirschfeld)
Date: Wed, 20 Aug 2014 08:39:05 +0000 (UTC)
Subject: [Numpy-discussion] Website down!
Message-ID: 

It seems that the docs website is down?
http://docs.scipy.org/doc/

-Dave

From charlesr.harris at gmail.com Sun Aug 17 17:37:07 2014
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Sun, 17 Aug 2014 15:37:07 -0600
Subject: [Numpy-discussion] What to do about vdot?
Message-ID: 

Hi All,

I've moved the cblas implementations of the dot and inner functions from _dotblas down into multiarray. The only reason to retain the _dotblas file at this point is the vdot function. I think that vdot belongs in the linalg module, but another option is to make it part of multiarray, where it sort of complements the inner function. Opinions?

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: From crist042 at umn.edu Wed Aug 20 21:34:05 2014 From: crist042 at umn.edu (James Crist) Date: Wed, 20 Aug 2014 20:34:05 -0500 Subject: [Numpy-discussion] Best way to broadcast a function from C Message-ID: All, I have a C function func that takes in scalar arguments, and an array of fixed dimension that is modified in place to provide the output. The prototype is something like: `void func(double a, double b, double c, double *arr);` I've wrapped this in Cython and called it from python with no problem. What I'd like to do now though is get it set up to broadcast over the input arguments and return a 3 dimensional array of the results. By this I mean a = array([1, 2, 3]) b = array([2.0, 3.0, 4.0]) c = array([3, 4, 5]) func(a, b, c) -> a 3d array containing the results of func for (a, b, c) = (1, 2.0, 3), (2, 3.0, 4), (3, 4.0, 5) I'm not sure if this would qualify as a ufunc, as the result of one function call isn't a scalar but an array, but the effect I'm looking for is similar. Ideally it would handle datatype conversions (in the above `a` and `c` aren't double, but `func` takes in double). It would also be awesome to allow an argument to be a scalar and not an array, and have it be broadcast as if it were. I'm just wondering what the best way for me to hook my code up to the internals of numpy and get this kind of behavior in an efficient way. I've read the "writing your own ufunc" part of the docs, but am unsure if what I'm looking for qualifies. Note that I can change the inner workings of `func` if this is required to achieve this behavior. Thanks! -------------- next part -------------- An HTML attachment was scrubbed... URL: From jaime.frio at gmail.com Thu Aug 21 01:24:22 2014 From: jaime.frio at gmail.com (=?UTF-8?Q?Jaime_Fern=C3=A1ndez_del_R=C3=ADo?=) Date: Wed, 20 Aug 2014 22:24:22 -0700 Subject: [Numpy-discussion] Proposed new feature for numpy.einsum: repeated output subscripts as diagonal In-Reply-To: <53F4A1F1.6070305@gmail.com> References: <53ED08FA.7040407@gmail.com> <1408110803.20638.18.camel@sebastian-t440> <1408114906.20638.20.camel@sebastian-t440> <53F4A1F1.6070305@gmail.com> Message-ID: On Wed, Aug 20, 2014 at 6:26 AM, Pierre-Andre Noel < noel.pierre.andre at gmail.com> wrote: > Thanks all for the feedback! > > So there appears to be interest for this feature, and I think that I can > implement it. However, it may take a while before I do so: I have other > priorities right now. > > In view of jaimefrio's comment on > https://github.com/numpy/numpy/issues/4965 as well as Eelco Hoogendoorn's > reply above, here is how I currently intend to implement the feature. > > 1. Implement a `diag_view` function that uses strides to make a view. The > function would use subscripts in a way very similar to `einsum`, except > that no commas are allowed and all indices appearing on one side of `->` > must also appear on the other side. Like the current `einsum`, indices on > the right-hand side of `->` cannot be repeated. For example, > `B=diag_view('iij->ij',A)` returns a 2D view `B` of the 3D array `A` where > the off-diagonal elements in the first two dimensions of `A` are > inaccessible in `B`. > > 2. The edits to `einsum` itself should be minimal. For the purpose of the > following, suppose that the indices have the form `lhs+'->'+rhs`, where > `lhs` and `rhs` are character strings. 
To make sure that the current > behavior of `einsum` is not slowed down nor broken by the new > functionality, I intend to limit edits to the point where an error would be > raised due to repeated indices in `rhs`. The following outlines what would > replace the current error-raising. > > 2.1 Extract from `rhs` the first occurrences of each indices; call > that `rhs_first_oc`. > > 2.2 If no `out` has been provided to `einsum`, allocate a zeroed out > `ndarray` of appropriate size, including off-diagonal entries; call that > `full_out`. If an `out` was provided to `einsum`, set `full_out=out`. > > 2.3 Set `diag_out=diag_view(rhs+'->'+rhs_first_oc,full_out)`. > > 2.4 Call `einsum(lhs+'->'+rhs_first_oc, [...], out=diag_out)`. This > call is recursive, but the recursion should stop there. > > 2.5 Return `full_out`. > I have looked a little into this, and I think there is an additional complication: if I understood the structure of the code correctly, `einsum`'s current entry point is the function `array_einsum` in `multiarraymodule.c`, which accepts two different input methods: the subscript one we have been discussing here, and another one that uses lists of axes after each operand. This second method gets translated into subscript notation by several functions in that same module: `einsum_sub_op_from_str`, `einsum_list_to_subscripts` and `einsum_sub_op_from_lists`, and then the C API einsum function, `PyArray_EinsteinSum` in `einsum.c.src`, which only understands the subscript notation, gets called. The simplest place to implement the changes you propose without any major rearchitecturing is therefore in `PyArray_EinsteinSum`. And while the flow you propose seems to me be correct, doing that at the C level will probably look somewhat different, e.g. you would probably let the iterator create an array with all the axes, and then remove the repeated ones from the iterator and modify the strides, instead of passing in a strided view with fewer axes. If you were planning on writing your code in a Python wrapper, you need to figure out how to keep the alternative syntax code path. Haven't given it much thought, but it doesn't look easy without rewriting a lot of stuff. I see either solution as way too much complication for the reward. And still see writing a function that does the opposite of your `diag_view`, and expecting the end user to chain a call to it to the call to einsum, as the simplest way of providing this functionality. Although if you can find the time and the motivation to do the big change, I am perfectly OK with it, of course! Jaime -- (\__/) ( O.o) ( > <) Este es Conejo. Copia a Conejo en tu firma y ay?dale en sus planes de dominaci?n mundial. -------------- next part -------------- An HTML attachment was scrubbed... URL: From jaakko.luttinen at aalto.fi Thu Aug 21 07:58:30 2014 From: jaakko.luttinen at aalto.fi (Jaakko Luttinen) Date: Thu, 21 Aug 2014 14:58:30 +0300 Subject: [Numpy-discussion] ANN: BayesPy 0.2 Message-ID: <53F5DEE6.5050204@aalto.fi> Dear all, I am pleased to announce the release of BayesPy version 0.2. BayesPy provides tools for Bayesian inference in Python. In particular, it implements variational message passing framework, which enables modular and efficient way to construct models and perform approximate posterior inference. Download: https://pypi.python.org/pypi/bayespy/ Documentation: http://www.bayespy.org Repository: https://github.com/bayespy/bayespy Comments, feedback and contributions are welcome. 
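For a flavor of the modular model construction described above, here is a minimal Gaussian example in the style of the project's introductory documentation (the node and method names are recalled from the 0.2-era docs and should be treated as assumptions, not a verified listing):

```python
import numpy as np
from bayespy.nodes import GaussianARD, Gamma
from bayespy.inference import VB

# Unknown mean and precision of a 1-D Gaussian, learned by variational
# message passing from ten observations.
data = np.random.normal(5, 10, size=(10,))
mu = GaussianARD(0, 1e-6)               # vague prior on the mean
tau = Gamma(1e-6, 1e-6)                 # vague prior on the precision
y = GaussianARD(mu, tau, plates=(10,))  # one plate per observation
y.observe(data)

Q = VB(y, mu, tau)    # variational Bayesian inference engine
Q.update(repeat=20)   # iterate the message-passing updates
```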
Best regards, Jaakko From xabart at gmail.com Thu Aug 21 15:26:20 2014 From: xabart at gmail.com (Xavier Barthelemy) Date: Thu, 21 Aug 2014 20:26:20 +0100 Subject: [Numpy-discussion] Numpy/Fortran puzzle (?) In-Reply-To: References: Message-ID: Hi Andrea You should add a dimension argument in your Fortran code, also you should write a f2py header in the same Fortran code. Remember, numpy memory is C order wise. You can specify in numpy the ordering of the matrices you pass when you create them. F2py automatically deals with matrices , but tends to mix dimensions when there are too many matrices. Manual declaration of dimensions should do the trick Xavier On 21/08/2014 2:07 am, "Andrea Gavana" wrote: > Hi All, > > I have the following (very ugly) line of code: > > all_results = np.asarray([transm_hist[date_idx, :, idx_main_set[date_idx] > ]*main_flow[date_idx, 0:n_fluids] for date_idx in xrange(n_dates)]) > > where transm_hist.shape = (n_dates, n_fluids, n_nodes), main_flow.shape = > (n_dates, n_fluids) and idx_main_set is an array containing integer indices > with idx_main_set.shape = (n_dates, ) . The resulting variable > all_results.shape = (n_dates, n_fluids) > > Since that line of code is relatively slow if done repeatedly, I thought > I'd be smart to rewrite it in Fortran and then use f2py to wrap the > subroutine. So I wrote this: > > subroutine matmul(transm_hist, idx_main_set, main_flow, all_results, & > n_dates, n_fluids, n_nodes) > > implicit none > > integer ( kind = 4 ), intent(in) :: n_dates, n_fluids, n_nodes > > real ( kind = 4 ), intent(in) :: transm_hist(n_dates, n_fluids, > n_nodes) > real ( kind = 4 ), intent(in) :: main_flow(n_dates, n_fluids) > integer ( kind = 4 ), intent(in) :: idx_main_set(n_dates) > real ( kind = 4 ), intent(out):: all_results(n_dates, n_fluids) > > integer (kind = 4) i, node > > do i = 1, n_dates > node = int(idx_main_set(i)) > all_results(i, :) = transm_hist(i, 1:n_fluids, node)*main_flow(i, > 1:n_fluids) > enddo > > end > > > Unfortunately, it appears that I am not getting out quite the same > results... I know it's a bit of a stretch with so little information, but > does anyone have a suggestion on where the culprit might be? Maybe the > elementwise multiplication is done differently in Numpy and Fortran, or I > am misunderstanding what the np.asarray is doing with the list > comprehension above? > > I appreciate any suggestion, which can also be related to improvement in > the code. Thank you in advance. > > Andrea. > > "Imagination Is The Only Weapon In The War Against Reality." > http://www.infinity77.net > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Thu Aug 21 17:50:03 2014 From: njs at pobox.com (Nathaniel Smith) Date: Thu, 21 Aug 2014 22:50:03 +0100 Subject: [Numpy-discussion] Best way to broadcast a function from C In-Reply-To: References: Message-ID: On Thu, Aug 21, 2014 at 2:34 AM, James Crist wrote: > All, > > I have a C function func that takes in scalar arguments, and an array of > fixed dimension that is modified in place to provide the output. The > prototype is something like: > > `void func(double a, double b, double c, double *arr);` > > I've wrapped this in Cython and called it from python with no problem. 
What > I'd like to do now though is get it set up to broadcast over the input > arguments and return a 3 dimensional array of the results. By this I mean > > a = array([1, 2, 3]) > b = array([2.0, 3.0, 4.0]) > c = array([3, 4, 5]) > > func(a, b, c) -> a 3d array containing the results of func for (a, b, c) = > (1, 2.0, 3), (2, 3.0, 4), (3, 4.0, 5) > > I'm not sure if this would qualify as a ufunc, as the result of one function > call isn't a scalar but an array, but the effect I'm looking for is similar. > Ideally it would handle datatype conversions (in the above `a` and `c` > aren't double, but `func` takes in double). It would also be awesome to > allow an argument to be a scalar and not an array, and have it be broadcast > as if it were. > > I'm just wondering what the best way for me to hook my code up to the > internals of numpy and get this kind of behavior in an efficient way. I've > read the "writing your own ufunc" part of the docs, but am unsure if what > I'm looking for qualifies. Note that I can change the inner workings of > `func` if this is required to achieve this behavior. I don't think it's currently possible to write a ufunc that maps scalars to fixed-length arrays. There's probably some not-too-terrible way to do this using the C nditer interface, but IIRC the docs aren't for that aren't very helpful. The simplest approach is just to do a bit of work at the Python level using the standard numpy API to do error-checking and set up your arrays in the way you want, and then pass them to the C implementation. This is not the best way to get a Right solution that will handle all the funky corner cases that ufuncs handle, but it's by far the fastest way to get something that's good enough for whatever you need Right Now. -n -- Nathaniel J. Smith Postdoctoral researcher - Informatics - University of Edinburgh http://vorpus.org From lists at onerussian.com Thu Aug 21 22:07:38 2014 From: lists at onerussian.com (Yaroslav Halchenko) Date: Thu, 21 Aug 2014 22:07:38 -0400 Subject: [Numpy-discussion] Just FYI: numpy-vbench was moved to another box, benchmarks are re-estimating Message-ID: <20140822020738.GI8145@onerussian.com> I have no stats on either anyone is looking at http://yarikoptic.github.io/numpy-vbench besides me at times, so I might be just crying into the wild: I have moved running of numpy-vbench on a bit newer/more powerful box, and that is why benchmark results are being reestimated (thus you might still find some spikes and more noise than before for a bit longer). It is a dual Quad-Core AMD Opteron(tm) Processor 2384 with 32GB of RAM, so if someone is keen on pushing the benchmarks limits -- be my guest ;) -- Yaroslav O. Halchenko, Ph.D. http://neuro.debian.net http://www.pymvpa.org http://www.fail2ban.org Research Scientist, Psychological and Brain Sciences Dept. Dartmouth College, 419 Moore Hall, Hinman Box 6207, Hanover, NH 03755 Phone: +1 (603) 646-9834 Fax: +1 (603) 646-1419 WWW: http://www.linkedin.com/in/yarik From Nicolas.Rougier at inria.fr Fri Aug 22 09:20:54 2014 From: Nicolas.Rougier at inria.fr (Nicolas P. Rougier) Date: Fri, 22 Aug 2014 15:20:54 +0200 Subject: [Numpy-discussion] np.unique with structured arrays Message-ID: <70ACF181-CA5A-4775-ADC1-B6C5E82032BF@inria.fr> Hello, I've found a strange behavior or I'm missing something obvious (or np.unique is not supposed to work with structured arrays). I'm trying to extract unique values from a simple structured array but it does not seem to work as expected. 
Here is a minimal script showing the problem: import numpy as np V = np.zeros(4, dtype=[("v", np.float32, 3)]) V["v"] = [ [0.5, 0.0, 1.0], [0.5, -1.e-16, 1.0], # [0.5, +1.e-16, 1.0] works [0.5, 0.0, -1.0], [0.5, -1.e-16, -1.0]] # [0.5, +1.e-16, -1.0]] works V_ = np.zeros_like(V) V_["v"][:,0] = V["v"][:,0].round(decimals=3) V_["v"][:,1] = V["v"][:,1].round(decimals=3) V_["v"][:,2] = V["v"][:,2].round(decimals=3) print np.unique(V_) [([0.5, 0.0, 1.0],) ([0.5, 0.0, -1.0],) ([0.5, -0.0, 1.0],) ([0.5, -0.0, -1.0],)] While I would have expected: [([0.5, 0.0, 1.0],) ([0.5, 0.0, -1.0],)] Can anyone confirm ? Nicolas -------------- next part -------------- An HTML attachment was scrubbed... URL: From jaime.frio at gmail.com Fri Aug 22 10:22:38 2014 From: jaime.frio at gmail.com (=?UTF-8?Q?Jaime_Fern=C3=A1ndez_del_R=C3=ADo?=) Date: Fri, 22 Aug 2014 07:22:38 -0700 Subject: [Numpy-discussion] np.unique with structured arrays In-Reply-To: <70ACF181-CA5A-4775-ADC1-B6C5E82032BF@inria.fr> References: <70ACF181-CA5A-4775-ADC1-B6C5E82032BF@inria.fr> Message-ID: I can confirm, the issue seems to be in sorting: >>> np.sort(V_) array([([0.5, 0.0, 1.0],), ([0.5, 0.0, -1.0],), ([0.5, -0.0, 1.0],), ([0.5, -0.0, -1.0],)], dtype=[('v', ' wrote: > > Hello, > > I've found a strange behavior or I'm missing something obvious (or > np.unique is not supposed to work with structured arrays). > > I'm trying to extract unique values from a simple structured array but it > does not seem to work as expected. > Here is a minimal script showing the problem: > > import numpy as np > > V = np.zeros(4, dtype=[("v", np.float32, 3)]) > V["v"] = [ [0.5, 0.0, 1.0], > [0.5, -1.e-16, 1.0], # [0.5, +1.e-16, 1.0] works > [0.5, 0.0, -1.0], > [0.5, -1.e-16, -1.0]] # [0.5, +1.e-16, -1.0]] works > V_ = np.zeros_like(V) > V_["v"][:,0] = V["v"][:,0].round(decimals=3) > V_["v"][:,1] = V["v"][:,1].round(decimals=3) > V_["v"][:,2] = V["v"][:,2].round(decimals=3) > > print np.unique(V_) > [([0.5, 0.0, 1.0],) ([0.5, 0.0, -1.0],) ([0.5, -0.0, 1.0],) ([0.5, -0.0, > -1.0],)] > > > While I would have expected: > > [([0.5, 0.0, 1.0],) ([0.5, 0.0, -1.0],)] > > > Can anyone confirm ? > > > Nicolas > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- (\__/) ( O.o) ( > <) Este es Conejo. Copia a Conejo en tu firma y ay?dale en sus planes de dominaci?n mundial. -------------- next part -------------- An HTML attachment was scrubbed... URL: From hoogendoorn.eelco at gmail.com Fri Aug 22 10:52:07 2014 From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn) Date: Fri, 22 Aug 2014 16:52:07 +0200 Subject: [Numpy-discussion] np.unique with structured arrays In-Reply-To: <70ACF181-CA5A-4775-ADC1-B6C5E82032BF@inria.fr> References: <70ACF181-CA5A-4775-ADC1-B6C5E82032BF@inria.fr> Message-ID: <53f758f0.c90cc30a.0d14.06c0@mx.google.com> It does not sound like an issue with unique, but rather like a matter of floating point equality and representation. Do the ' identical' elements pass an equality test? -----Original Message----- From: "Nicolas P. Rougier" Sent: ?22-?8-?2014 15:21 To: "Discussion of Numerical Python" Subject: [Numpy-discussion] np.unique with structured arrays Hello, I've found a strange behavior or I'm missing something obvious (or np.unique is not supposed to work with structured arrays). I'm trying to extract unique values from a simple structured array but it does not seem to work as expected. 
Here is a minimal script showing the problem: import numpy as np V = np.zeros(4, dtype=[("v", np.float32, 3)]) V["v"] = [ [0.5, 0.0, 1.0], [0.5, -1.e-16, 1.0], # [0.5, +1.e-16, 1.0] works [0.5, 0.0, -1.0], [0.5, -1.e-16, -1.0]] # [0.5, +1.e-16, -1.0]] works V_ = np.zeros_like(V) V_["v"][:,0] = V["v"][:,0].round(decimals=3) V_["v"][:,1] = V["v"][:,1].round(decimals=3) V_["v"][:,2] = V["v"][:,2].round(decimals=3) print np.unique(V_) [([0.5, 0.0, 1.0],) ([0.5, 0.0, -1.0],) ([0.5, -0.0, 1.0],) ([0.5, -0.0, -1.0],)] While I would have expected: [([0.5, 0.0, 1.0],) ([0.5, 0.0, -1.0],)] Can anyone confirm ? Nicolas -------------- next part -------------- An HTML attachment was scrubbed... URL: From hoogendoorn.eelco at gmail.com Fri Aug 22 10:54:50 2014 From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn) Date: Fri, 22 Aug 2014 16:54:50 +0200 Subject: [Numpy-discussion] np.unique with structured arrays In-Reply-To: References: <70ACF181-CA5A-4775-ADC1-B6C5E82032BF@inria.fr> Message-ID: <53f75993.eaeec20a.7dd4.0356@mx.google.com> Oh yeah this could be. Floating point equality and bitwise equality are not the same thing. -----Original Message----- From: "Jaime Fern?ndez del R?o" Sent: ?22-?8-?2014 16:22 To: "Discussion of Numerical Python" Subject: Re: [Numpy-discussion] np.unique with structured arrays I can confirm, the issue seems to be in sorting: >>> np.sort(V_) array([([0.5, 0.0, 1.0],), ([0.5, 0.0, -1.0],), ([0.5, -0.0, 1.0],), ([0.5, -0.0, -1.0],)], dtype=[('v', ' wrote: Hello, I've found a strange behavior or I'm missing something obvious (or np.unique is not supposed to work with structured arrays). I'm trying to extract unique values from a simple structured array but it does not seem to work as expected. Here is a minimal script showing the problem: import numpy as np V = np.zeros(4, dtype=[("v", np.float32, 3)]) V["v"] = [ [0.5, 0.0, 1.0], [0.5, -1.e-16, 1.0], # [0.5, +1.e-16, 1.0] works [0.5, 0.0, -1.0], [0.5, -1.e-16, -1.0]] # [0.5, +1.e-16, -1.0]] works V_ = np.zeros_like(V) V_["v"][:,0] = V["v"][:,0].round(decimals=3) V_["v"][:,1] = V["v"][:,1].round(decimals=3) V_["v"][:,2] = V["v"][:,2].round(decimals=3) print np.unique(V_) [([0.5, 0.0, 1.0],) ([0.5, 0.0, -1.0],) ([0.5, -0.0, 1.0],) ([0.5, -0.0, -1.0],)] While I would have expected: [([0.5, 0.0, 1.0],) ([0.5, 0.0, -1.0],)] Can anyone confirm ? Nicolas _______________________________________________ NumPy-Discussion mailing list NumPy-Discussion at scipy.org http://mail.scipy.org/mailman/listinfo/numpy-discussion -- (\__/) ( O.o) ( > <) Este es Conejo. Copia a Conejo en tu firma y ay?dale en sus planes de dominaci?n mundial. -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From jaime.frio at gmail.com Fri Aug 22 13:43:35 2014 From: jaime.frio at gmail.com (=?UTF-8?Q?Jaime_Fern=C3=A1ndez_del_R=C3=ADo?=) Date: Fri, 22 Aug 2014 10:43:35 -0700 Subject: [Numpy-discussion] np.unique with structured arrays In-Reply-To: <53f75993.eaeec20a.7dd4.0356@mx.google.com> References: <70ACF181-CA5A-4775-ADC1-B6C5E82032BF@inria.fr> <53f75993.eaeec20a.7dd4.0356@mx.google.com> Message-ID: structured arrays are of VOID dtype, but with a non-None names attribute: >>> V_.dtype.num 20 >>> V_.dtype.names ('v',) >>> V_.view(np.void).dtype.num 20 >>> V_.view(np.void).dtype.names >>> The comparison function uses the STRING comparison function if names is None, or a proper field by field comparison if not, see here: https://github.com/numpy/numpy/blob/master/numpy/core/src/multiarray/arraytypes.c.src#L2675 With a quick look at the source, the only fishy thing I see is that the original array has the sort axis moved to the end of the shape tuple, and is then copied into a contiguous array here: https://github.com/numpy/numpy/blob/master/numpy/core/src/multiarray/item_selection.c#L1151 But that new array should preserve the dtype unchanged, and hence the right compare function should be called. If no one with a better understanding of the internals spots it, I will try to further debug it over the weekend. Jaime On Fri, Aug 22, 2014 at 7:54 AM, Eelco Hoogendoorn < hoogendoorn.eelco at gmail.com> wrote: > Oh yeah this could be. Floating point equality and bitwise equality are > not the same thing. > ------------------------------ > From: Jaime Fern?ndez del R?o > Sent: ?22-?8-?2014 16:22 > > To: Discussion of Numerical Python > Subject: Re: [Numpy-discussion] np.unique with structured arrays > > I can confirm, the issue seems to be in sorting: > > >>> np.sort(V_) > array([([0.5, 0.0, 1.0],), ([0.5, 0.0, -1.0],), ([0.5, -0.0, 1.0],), > ([0.5, -0.0, -1.0],)], > dtype=[('v', ' > These I think are handled by the generic sort functions, and it looks like > the comparison function being used is the one for a VOID dtype with no > fields, so it is being done byte-wise, hence the problems with 0.0 and > -0.0. Not sure where exactly the bug is, though... > > Jaime > > > > On Fri, Aug 22, 2014 at 6:20 AM, Nicolas P. Rougier < > Nicolas.Rougier at inria.fr> wrote: > >> >> Hello, >> >> I've found a strange behavior or I'm missing something obvious (or >> np.unique is not supposed to work with structured arrays). >> >> I'm trying to extract unique values from a simple structured array but it >> does not seem to work as expected. >> Here is a minimal script showing the problem: >> >> import numpy as np >> >> V = np.zeros(4, dtype=[("v", np.float32, 3)]) >> V["v"] = [ [0.5, 0.0, 1.0], >> [0.5, -1.e-16, 1.0], # [0.5, +1.e-16, 1.0] works >> [0.5, 0.0, -1.0], >> [0.5, -1.e-16, -1.0]] # [0.5, +1.e-16, -1.0]] works >> V_ = np.zeros_like(V) >> V_["v"][:,0] = V["v"][:,0].round(decimals=3) >> V_["v"][:,1] = V["v"][:,1].round(decimals=3) >> V_["v"][:,2] = V["v"][:,2].round(decimals=3) >> >> print np.unique(V_) >> [([0.5, 0.0, 1.0],) ([0.5, 0.0, -1.0],) ([0.5, -0.0, 1.0],) ([0.5, -0.0, >> -1.0],)] >> >> >> While I would have expected: >> >> [([0.5, 0.0, 1.0],) ([0.5, 0.0, -1.0],)] >> >> >> Can anyone confirm ? >> >> >> Nicolas >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > > > -- > (\__/) > ( O.o) > ( > <) Este es Conejo. 
Copia a Conejo en tu firma y ay?dale en sus planes > de dominaci?n mundial. > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- (\__/) ( O.o) ( > <) Este es Conejo. Copia a Conejo en tu firma y ay?dale en sus planes de dominaci?n mundial. -------------- next part -------------- An HTML attachment was scrubbed... URL: From crist042 at umn.edu Fri Aug 22 19:40:06 2014 From: crist042 at umn.edu (James Crist) Date: Fri, 22 Aug 2014 18:40:06 -0500 Subject: [Numpy-discussion] Best way to broadcast a function from C In-Reply-To: References: Message-ID: I suspected as much. This is actually part of my work on numerical evaluation in SymPy. In its current state compilation to C and autowrapping *works*, but I think it could definitely be more versatile/efficient. Since numpy seemed to have solved the broadcasting/datatype issues internally I hoped I could reuse this. I'll look into nditer (as this is for a library, it's better to do it *right* than do it now), but right now it's looking like I may need to implement this functionality myself... Thanks, -Jim On Thu, Aug 21, 2014 at 4:50 PM, Nathaniel Smith wrote: > On Thu, Aug 21, 2014 at 2:34 AM, James Crist wrote: > > All, > > > > I have a C function func that takes in scalar arguments, and an array of > > fixed dimension that is modified in place to provide the output. The > > prototype is something like: > > > > `void func(double a, double b, double c, double *arr);` > > > > I've wrapped this in Cython and called it from python with no problem. > What > > I'd like to do now though is get it set up to broadcast over the input > > arguments and return a 3 dimensional array of the results. By this I mean > > > > a = array([1, 2, 3]) > > b = array([2.0, 3.0, 4.0]) > > c = array([3, 4, 5]) > > > > func(a, b, c) -> a 3d array containing the results of func for (a, b, c) > = > > (1, 2.0, 3), (2, 3.0, 4), (3, 4.0, 5) > > > > I'm not sure if this would qualify as a ufunc, as the result of one > function > > call isn't a scalar but an array, but the effect I'm looking for is > similar. > > Ideally it would handle datatype conversions (in the above `a` and `c` > > aren't double, but `func` takes in double). It would also be awesome to > > allow an argument to be a scalar and not an array, and have it be > broadcast > > as if it were. > > > > I'm just wondering what the best way for me to hook my code up to the > > internals of numpy and get this kind of behavior in an efficient way. > I've > > read the "writing your own ufunc" part of the docs, but am unsure if what > > I'm looking for qualifies. Note that I can change the inner workings of > > `func` if this is required to achieve this behavior. > > I don't think it's currently possible to write a ufunc that maps > scalars to fixed-length arrays. > > There's probably some not-too-terrible way to do this using the C > nditer interface, but IIRC the docs aren't for that aren't very > helpful. > > The simplest approach is just to do a bit of work at the Python level > using the standard numpy API to do error-checking and set up your > arrays in the way you want, and then pass them to the C > implementation. This is not the best way to get a Right solution that > will handle all the funky corner cases that ufuncs handle, but it's by > far the fastest way to get something that's good enough for whatever > you need Right Now. > > -n > > -- > Nathaniel J. 
Smith > Postdoctoral researcher - Informatics - University of Edinburgh > http://vorpus.org > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jaime.frio at gmail.com Fri Aug 22 20:50:26 2014 From: jaime.frio at gmail.com (=?UTF-8?Q?Jaime_Fern=C3=A1ndez_del_R=C3=ADo?=) Date: Fri, 22 Aug 2014 17:50:26 -0700 Subject: [Numpy-discussion] Best way to broadcast a function from C In-Reply-To: References: Message-ID: You can always write your own gufunc with signature '(),(),()->(a, a)', and write a Python wrapper that always call it with an `out=` parameter of shape (..., 3, 3), something along the lines of: def my_wrapper(a, b, c, out=None): if out is None: out = np.empty(np.broadcast(a,b,c).shape + (3, 3)) if out.shape[-2:] != (3, 3): raise ValueError("Wrong shape for 'out'") return my_gufunc(a, b, c, out=out) Writing your own gufunc is a little challenging, but not that hard with all the examples now available in the numpy.linalg code base, and it is a terribly powerful tool. Jaime On Fri, Aug 22, 2014 at 4:40 PM, James Crist wrote: > I suspected as much. This is actually part of my work on numerical > evaluation in SymPy. In its current state compilation to C and autowrapping > *works*, but I think it could definitely be more versatile/efficient. Since > numpy seemed to have solved the broadcasting/datatype issues internally I > hoped I could reuse this. > > I'll look into nditer (as this is for a library, it's better to do it > *right* than do it now), but right now it's looking like I may need to > implement this functionality myself... > > Thanks, > > -Jim > > > On Thu, Aug 21, 2014 at 4:50 PM, Nathaniel Smith wrote: > >> On Thu, Aug 21, 2014 at 2:34 AM, James Crist wrote: >> > All, >> > >> > I have a C function func that takes in scalar arguments, and an array of >> > fixed dimension that is modified in place to provide the output. The >> > prototype is something like: >> > >> > `void func(double a, double b, double c, double *arr);` >> > >> > I've wrapped this in Cython and called it from python with no problem. >> What >> > I'd like to do now though is get it set up to broadcast over the input >> > arguments and return a 3 dimensional array of the results. By this I >> mean >> > >> > a = array([1, 2, 3]) >> > b = array([2.0, 3.0, 4.0]) >> > c = array([3, 4, 5]) >> > >> > func(a, b, c) -> a 3d array containing the results of func for (a, b, >> c) = >> > (1, 2.0, 3), (2, 3.0, 4), (3, 4.0, 5) >> > >> > I'm not sure if this would qualify as a ufunc, as the result of one >> function >> > call isn't a scalar but an array, but the effect I'm looking for is >> similar. >> > Ideally it would handle datatype conversions (in the above `a` and `c` >> > aren't double, but `func` takes in double). It would also be awesome to >> > allow an argument to be a scalar and not an array, and have it be >> broadcast >> > as if it were. >> > >> > I'm just wondering what the best way for me to hook my code up to the >> > internals of numpy and get this kind of behavior in an efficient way. >> I've >> > read the "writing your own ufunc" part of the docs, but am unsure if >> what >> > I'm looking for qualifies. Note that I can change the inner workings of >> > `func` if this is required to achieve this behavior. 
>> >> I don't think it's currently possible to write a ufunc that maps >> scalars to fixed-length arrays. >> >> There's probably some not-too-terrible way to do this using the C >> nditer interface, but IIRC the docs aren't for that aren't very >> helpful. >> >> The simplest approach is just to do a bit of work at the Python level >> using the standard numpy API to do error-checking and set up your >> arrays in the way you want, and then pass them to the C >> implementation. This is not the best way to get a Right solution that >> will handle all the funky corner cases that ufuncs handle, but it's by >> far the fastest way to get something that's good enough for whatever >> you need Right Now. >> >> -n >> >> -- >> Nathaniel J. Smith >> Postdoctoral researcher - Informatics - University of Edinburgh >> http://vorpus.org >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- (\__/) ( O.o) ( > <) Este es Conejo. Copia a Conejo en tu firma y ay?dale en sus planes de dominaci?n mundial. -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Fri Aug 22 21:42:52 2014 From: njs at pobox.com (Nathaniel Smith) Date: Sat, 23 Aug 2014 02:42:52 +0100 Subject: [Numpy-discussion] Best way to broadcast a function from C In-Reply-To: References: Message-ID: On Sat, Aug 23, 2014 at 12:40 AM, James Crist wrote: > I suspected as much. This is actually part of my work on numerical > evaluation in SymPy. In its current state compilation to C and autowrapping > *works*, but I think it could definitely be more versatile/efficient. Since > numpy seemed to have solved the broadcasting/datatype issues internally I > hoped I could reuse this. > > I'll look into nditer (as this is for a library, it's better to do it > *right* than do it now), but right now it's looking like I may need to > implement this functionality myself... Ah, I see. Right, if this isn't a one-off for some specific project then disregard my advice from before :-). nditer might be perfect -- or might not. If you figure it out then we'd certainly appreciate any additions to the docs that you feel up to contributing! And similarly, we'd be happy to merge any enhancements you come up with back into numpy itself, e.g. to allow gufunc signatures to handle fixed-size output arrays. (This would also be useful for other cases, e.g. np.cross, which is currently not a gufunc but it would be nice if it were...) -n -- Nathaniel J. Smith Postdoctoral researcher - Informatics - University of Edinburgh http://vorpus.org From cleo21drakos at gmail.com Sat Aug 23 02:14:54 2014 From: cleo21drakos at gmail.com (Cleo Drakos) Date: Sat, 23 Aug 2014 15:14:54 +0900 Subject: [Numpy-discussion] Changing the numpy array into required shape Message-ID: Hello numpy users: I have 2d numpy array of 480 rows and 1440 columns as named by 'data' below: The first element belongs to (49.875S,179.875W), the second element belongs to (49.625S,179.625W),and the last element belongs to (49.875N,179.875E). 
import os, glob, gdal, numpy as np fname = '3B42RT.2014010606.7.bin' with open(fname, 'rb') as fi: fi.seek(2880,0) data = np.fromfile(fi,dtype=np.uint16,count=480*1440) data = data.byteswap() data = data.reshape(1440,480) How can I convert this numpy array so that its first element belongs to (49.875N,179.625W), i.e., upper left latitude and longitude respectively; and the last element belong to (49.625S,179.875E), i.e., lower right latitute and longitude respectively. I tried to rotate it, but I do not think it is correct. data = np.rot90(data,1) Have some of you experienced with this type of problem? The binary file I am using is here: ftp://trmmopen.gsfc.nasa.gov/pub/merged/3B42RT/3B42RT.2014010606.7.bin.gz cleo -------------- next part -------------- An HTML attachment was scrubbed... URL: From john_ladasky at sbcglobal.net Sat Aug 23 03:02:55 2014 From: john_ladasky at sbcglobal.net (John Ladasky) Date: Sat, 23 Aug 2014 00:02:55 -0700 Subject: [Numpy-discussion] Changing the numpy array into required shape In-Reply-To: References: Message-ID: <53F83C9F.8070202@sbcglobal.net> On 08/22/2014 11:14 PM, Cleo Drakos wrote: > > How can I convert this numpy array so that its first element belongs > to (49.875N,179.625W), i.e., upper left latitude and longitude > respectively; and the last element belong to (49.625S,179.875E), i.e., > lower right latitute and longitude respectively. > > I tried to rotate it, but I do not think it is correct. > I think that you want to use the numpy function flipud(). From nadavh at visionsense.com Sat Aug 23 03:30:55 2014 From: nadavh at visionsense.com (Nadav Horesh) Date: Sat, 23 Aug 2014 07:30:55 +0000 Subject: [Numpy-discussion] Changing the numpy array into required shape In-Reply-To: References: Message-ID: Replace data = data.byteswap() By data = data.byteswap()[::-1] Nadav On 23 Aug 2014 09:15, Cleo Drakos wrote: Hello numpy users: I have 2d numpy array of 480 rows and 1440 columns as named by 'data' below: The first element belongs to (49.875S,179.875W), the second element belongs to (49.625S,179.625W), and the last element belongs to (49.875N,179.875E). import os, glob, gdal, numpy as np fname = '3B42RT.2014010606.7.bin' with open(fname, 'rb') as fi: fi.seek(2880,0) data = np.fromfile(fi,dtype=np.uint16,count=480*1440) data = data.byteswap() data = data.reshape(1440,480) How can I convert this numpy array so that its first element belongs to (49.875N,179.625W), i.e., upper left latitude and longitude respectively; and the last element belong to (49.625S,179.875E), i.e., lower right latitute and longitude respectively. I tried to rotate it, but I do not think it is correct. data = np.rot90(data,1) Have some of you experienced with this type of problem? The binary file I am using is here:ftp://trmmopen.gsfc.nasa.gov/pub/merged/3B42RT/3B42RT.2014010606.7.bin.gz cleo -------------- next part -------------- An HTML attachment was scrubbed... URL: From t.b.poole at gmail.com Sun Aug 24 16:05:45 2014 From: t.b.poole at gmail.com (Tom Poole) Date: Sun, 24 Aug 2014 21:05:45 +0100 Subject: [Numpy-discussion] Weighted Covariance/correlation In-Reply-To: <1408110393.20638.16.camel@sebastian-t440> References: <1408110393.20638.16.camel@sebastian-t440> Message-ID: <196A73F5-AD29-42F0-875B-AAE170FCC6D7@gmail.com> Hi all, Any input to this? Last time it generated a fair bit of discussion, which I?ll summarise here. 
It's currently possible to calculate a weighted average using np.average, but the corresponding functionality does not exist for (co)variance or corrcoef calculations. In this case it's less straightforward, and we need to worry about what type of information the weights contain.

Repeat type weights are the easiest to explain. Here the variances of [x1, x2, x3] with weights [2, 1, 3] and of [x1, x1, x2, x3, x3, x3] are identical. For the Bessel correction the total number of samples is obtained by summing the weights. These weights do not have to be integer, and in this case the only important assumption is that their sum represents the total sample size.

The second type of weights are importances or accuracies. Here the weights represent the relative strength of contributions from each of the associated samples. Because this is a purely relative relation, there's no concrete information about the total number of samples. This has to be obtained from the effective sample size, given by (sum(weights)^2)/sum(weights^2).

I think the clearest way of providing both options is to have a boolean switch indicating if the weights represent repeat/frequency type information. I can't immediately see a good motivation for allowing both concurrently, and think this could cause confusion.

Tom

On 15 Aug 2014, at 14:46, Sebastian Berg wrote:

> Hi all,
>
> Tom Poole has opened pull request
> https://github.com/numpy/numpy/pull/4960 to implement weights into
> np.cov (correlation can be added), somewhat picking up the effort
> started by Noel Dawe in https://github.com/numpy/numpy/pull/3864.
>
> The pull request would currently implement an accuracy type `weights`
> keyword argument as default, but have a switch `repeat_weights` to use
> repeat type weights instead (frequency type are a special case of this
> I think).
>
> As far as I can see, the code is in a state that it can be tested. But
> since it is a new feature, the names/defaults are up for discussion, so
> maybe someone who might use such a feature has a preference. I know we
> had a short discussion about this before, but it was a while ago. For
> example another option would be to have the two weights as two keyword
> arguments, instead of a boolean switch.
>
> Regards,
>
> Sebastian
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion

From adrian.altenhoff at inf.ethz.ch Mon Aug 25 03:02:16 2014
From: adrian.altenhoff at inf.ethz.ch (Adrian Altenhoff)
Date: Mon, 25 Aug 2014 09:02:16 +0200
Subject: [Numpy-discussion] Bug in genfromtxt with usecols and converters
Message-ID: <53FADF78.6080507@inf.ethz.ch>

Hi,

I tried to load data from a csv file into numpy using genfromtxt. I need only a subset of the columns and want to apply some conversions to the data. Attached is a minimal script showing the error. In brief, I want to load columns 1, 2 and 4. But in the converter function for the 4th column, I get the 3rd value. The issue does not occur if I also load the 3rd column. Did I somehow misunderstand how the function is supposed to work, or is this indeed a bug? I'm using python 3.3.1 with numpy 1.8.1.

Regards
Adrian
-------------- next part --------------
A non-text attachment was scrubbed...
Name: test.py
Type: text/x-python
Size: 492 bytes
Desc: not available
URL: 

From derek at astro.physik.uni-goettingen.de Tue Aug 26 10:26:32 2014
From: derek at astro.physik.uni-goettingen.de (Derek Homeier)
Date: Tue, 26 Aug 2014 16:26:32 +0200
Subject: [Numpy-discussion] Bug in genfromtxt with usecols and converters
In-Reply-To: <53FADF78.6080507@inf.ethz.ch>
References: <53FADF78.6080507@inf.ethz.ch>
Message-ID: <8E2101C9-98D0-42FA-94D4-BD1FE530E858@astro.physik.uni-goettingen.de>

Hi Adrian,

> I tried to load data from a csv file into numpy using genfromtxt. I need
> only a subset of the columns and want to apply some conversions to the
> data. Attached is a minimal script showing the error.
> In brief, I want to load columns 1, 2 and 4. But in the converter
> function for the 4th column, I get the 3rd value. The issue does not
> occur if I also load the 3rd column.
> Did I somehow misunderstand how the function is supposed to work, or is
> this indeed a bug?

not sure whether to call it a bug; the error seems to arise before reading any actual data (even on reading from an empty string); when genfromtxt is checking the filling_values used to substitute missing or invalid data it is apparently testing on default testing values of 1 or -1 which your conversion scheme does not know about. Although I think it is rather the user's responsibility to provide valid converters, probably the documentation should at least be updated to make them aware of this requirement.
I see two possible fixes/workarounds: provide a keyword argument

filling_values=[0, 0, '1:1']

or add the default filling values to your relEnum dictionary, e.g.

{ ..., '-1': -1, '1': -1 }

Could you check if this works for your case?

HTH,
Derek

From adrian.altenhoff at inf.ethz.ch Tue Aug 26 12:21:30 2014
From: adrian.altenhoff at inf.ethz.ch (Adrian Altenhoff)
Date: Tue, 26 Aug 2014 18:21:30 +0200
Subject: [Numpy-discussion] Bug in genfromtxt with usecols and converters
In-Reply-To: <8E2101C9-98D0-42FA-94D4-BD1FE530E858@astro.physik.uni-goettingen.de>
References: <53FADF78.6080507@inf.ethz.ch> <8E2101C9-98D0-42FA-94D4-BD1FE530E858@astro.physik.uni-goettingen.de>
Message-ID: <53FCB40A.1040003@inf.ethz.ch>

Hi Derek,

thanks for your answer.
> not sure whether to call it a bug; the error seems to arise before reading any actual data
> (even on reading from an empty string); when genfromtxt is checking the filling_values used
> to substitute missing or invalid data it is apparently testing on default testing values of 1 or -1
> which your conversion scheme does not know about. Although I think it is rather the user's
> responsibility to provide valid converters, probably the documentation should at least be
> updated to make them aware of this requirement.
> I see two possible fixes/workarounds: provide a keyword argument
>
> filling_values=[0, 0, '1:1']
This workaround seems to work, but I doubt that the actual problem is the converter function I pass. The '-1' which is used as the testing value is the first_values entry from the 3rd column (line 1574 in npyio.py), but the converter is defined for column 4. By setting the filling_values to an array of length 3, this obviously makes the problem disappear. But I think if the first row is used, it should also use the values from the column for which the converter is defined.
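A minimal sketch reproducing the class of failure discussed in this thread; the data layout and `relEnum` mapping are invented stand-ins, since the actual test.py attachment was scrubbed from the archive:

```python
import numpy as np
from io import BytesIO

# Invented stand-in for the scrubbed test.py: the real column layout and
# relEnum mapping may differ.
relEnum = {'1:1': 0, '1:n': 1, 'n:1': 2}

data = BytesIO(b"a\t10\t-1\t1:1\n"
               b"b\t20\t-1\t1:n\n")

# usecols drops column 2, but the converter is registered for column 3.
# On numpy 1.8 the filling-value check probed the converter with column
# 2's first value ('-1'), raising KeyError: '-1'; passing filling_values
# explicitly (Derek's workaround) sidesteps that probe.
out = np.genfromtxt(data, delimiter='\t', usecols=(0, 1, 3), dtype=None,
                    converters={3: lambda rel: relEnum[rel.decode()]},
                    filling_values=[0, 0, '1:1'])
print(out)
```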
Best
Adrian

From derek at astro.physik.uni-goettingen.de Tue Aug 26 12:56:47 2014
From: derek at astro.physik.uni-goettingen.de (Derek Homeier)
Date: Tue, 26 Aug 2014 18:56:47 +0200
Subject: [Numpy-discussion] Bug in genfromtxt with usecols and converters
In-Reply-To: <53FCB40A.1040003@inf.ethz.ch>
References: <53FADF78.6080507@inf.ethz.ch> <8E2101C9-98D0-42FA-94D4-BD1FE530E858@astro.physik.uni-goettingen.de> <53FCB40A.1040003@inf.ethz.ch>
Message-ID: 

Hi Adrian,

>> not sure whether to call it a bug; the error seems to arise before reading any actual data
>> (even on reading from an empty string); when genfromtxt is checking the filling_values used
>> to substitute missing or invalid data it is apparently testing on default testing values of 1 or -1
>> which your conversion scheme does not know about. Although I think it is rather the user's
>> responsibility to provide valid converters, probably the documentation should at least be
>> updated to make them aware of this requirement.
>> I see two possible fixes/workarounds: provide a keyword argument
>>
>> filling_values=[0, 0, '1:1']
> This workaround seems to work, but I doubt that the actual problem is
> the converter function I pass. The '-1' which is used as the testing
> value is the first_values entry from the 3rd column (line 1574 in
> npyio.py), but the converter is defined for column 4. By setting the
> filling_values to an array of length 3, this obviously makes the
> problem disappear. But I think if the first row is used, it should
> also use the values from the column for which the converter is defined.

it is certainly related to the converter function, because a KeyError for the dictionary you provide is raised:

  File "test.py", line 13, in <module>
    3: lambda rel: relEnum[rel.decode()]})
  File "/sw/lib/python3.4/site-packages/numpy/lib/npyio.py", line 1581, in genfromtxt
    missing_values=missing_values[i],)
  File "/sw/lib/python3.4/site-packages/numpy/lib/_iotools.py", line 784, in update
    tester = func(testing_value or asbytes('1'))
  File "test.py", line 13, in <lambda>
    3: lambda rel: relEnum[rel.decode()]})
KeyError: '-1'

But you are right that the problem with using the first_values, which should of course be valid, somehow stems from the use of usecols: it seems that in the loop

for (i, conv) in user_converters.items():

i in user_converters and in usecols get out of sync. This certainly looks like a bug; the entire way of modifying i inside the loop appears a bit dangerous to me. I'll have a look if I can make this safer.

As long as your data don't actually contain any missing values, you might also simply use np.loadtxt.

Cheers,
Derek

From adrian.altenhoff at inf.ethz.ch Tue Aug 26 15:05:55 2014
From: adrian.altenhoff at inf.ethz.ch (Adrian Altenhoff)
Date: Tue, 26 Aug 2014 21:05:55 +0200
Subject: [Numpy-discussion] Bug in genfromtxt with usecols and converters
In-Reply-To: 
References: <53FADF78.6080507@inf.ethz.ch> <8E2101C9-98D0-42FA-94D4-BD1FE530E858@astro.physik.uni-goettingen.de> <53FCB40A.1040003@inf.ethz.ch>
Message-ID: <53FCDA93.7090700@inf.ethz.ch>

Hi Derek,

> But you are right that the problem with using the first_values, which should of course be valid,
> somehow stems from the use of usecols: it seems that in the loop
>
> for (i, conv) in user_converters.items():
>
> i in user_converters and in usecols get out of sync. This certainly looks like a bug; the entire way of
> modifying i inside the loop appears a bit dangerous to me. I'll have a look if I can make this safer.
Thanks.
> As long as your data don't actually contain any missing values, you might also simply use np.loadtxt.
Ok, wasn't aware of that function so far. I will try that!

Best wishes
Adrian

From scopatz at gmail.com Tue Aug 26 16:27:48 2014
From: scopatz at gmail.com (Anthony Scopatz)
Date: Tue, 26 Aug 2014 15:27:48 -0500
Subject: [Numpy-discussion] Return item rather than scalar for user defined types
Message-ID: 

Hello All,

Yesterday I opened PR #4889 to solve a problem I have been having w.r.t. xdress, and Nathaniel asked me to bring the issue up here. The PR itself is quite small (6 lines?) and is easy to review. The opening text of my PR is pasted below because I believe that it is a pretty good description of the issue.

But briefly, pulling user defined dtypes out of an array does not behave idiomatically, because you get a numpy scalar rather than a more representative Python object. For user-defined dtypes - which are typically more complex and possibly stateful than the builtin dtypes - I believe that it makes much more sense to get the actual Python representation back, a la the getitem() function. In fact, I think that this case also applies to the object dtype. However, changing that usage would likely break downstream code and would be inconsistent with how other builtin types are returned. In future major versions of numpy it would be ideal if the dtypes themselves could flag how they wished to be returned - either as a scalar or as the Python item.

Thoughts?

Be Well
Anthony

This updates what is effectively the __getitem__() method. For arrays whose dtype is a user defined type, you receive the return of that dtype's getitem() rather than a numpy scalar of the dtype. This allows the custom type to present a single Python API as well as an associated dtype. It also prevents users from having to subclass ndarray to get the appropriate behaviour.

For example, suppose that we have a dtype representing a C++ std::vector<int> and we had a numpy array of this dtype. From Python, it might look like:

>>> arr
array([array([0, 0, 0, 0, 0], dtype=int32),
       array([0, 1, 2, 3, 4], dtype=int32),
       array([0, 2, 4, 6, 8], dtype=int32)], dtype='xd_vector_int')

Without this PR, you'd have to do the following to access the most deeply nested elements:

>>> arr.item(2)[4]
8

This is because you cannot index a scalar:

>>> arr[2][4]
IndexError: invalid index to scalar variable

With this PR, the idiomatic expression is now allowable because arr[2] is the associated Python type:

>>> arr[2][4]
8

This is a pretty big deal for xdress, which creates many custom dtypes and provides a Python interface into those. See xdress/xdress#265 for what prompted this.

Thanks for considering!
-------------- next part --------------
An HTML attachment was scrubbed...
URL: From derek at astro.physik.uni-goettingen.de Wed Aug 27 08:11:32 2014 From: derek at astro.physik.uni-goettingen.de (Derek Homeier) Date: Wed, 27 Aug 2014 14:11:32 +0200 Subject: [Numpy-discussion] Bug in genfromtxt with usecols and converters In-Reply-To: <53FCDA93.7090700@inf.ethz.ch> References: <53FADF78.6080507@inf.ethz.ch> <8E2101C9-98D0-42FA-94D4-BD1FE530E858@astro.physik.uni-goettingen.de> <53FCB40A.1040003@inf.ethz.ch> <53FCDA93.7090700@inf.ethz.ch> Message-ID: <1EB811D8-710F-4997-BEF9-19385AE948D4@astro.physik.uni-goettingen.de> On 26 Aug 2014, at 09:05 pm, Adrian Altenhoff wrote: >> But you are right that the problem with using the first_values, which should of course be valid, >> somehow stems from the use of usecols, it seems that in that loop >> >> for (i, conv) in user_converters.items(): >> >> i in user_converters and in usecols get out of sync. This certainly looks like a bug, the entire way of >> modifying i inside the loop appears a bit dangerous to me. I?ll have look if I can make this safer. > Thanks. >> >> As long as your data don?t actually contain any missing values you might also simply use np.loadtxt. > Ok, wasn't aware of that function so far. I will try that! > It was first_values that needs to be addressed by the original indices. I have created a short test from your case and submitted a fix at https://github.com/numpy/numpy/pull/5006 Cheers, Derek From dphinnstuart at gmail.com Wed Aug 27 11:08:43 2014 From: dphinnstuart at gmail.com (phinn stuart) Date: Thu, 28 Aug 2014 00:08:43 +0900 Subject: [Numpy-discussion] Convert 3d NumPy array into 2d Message-ID: Hi everyone, how can I convert (1L, 480L, 1440L) shaped numpy array into (480L, 1440L)? Thanks in the advance. phinn -------------- next part -------------- An HTML attachment was scrubbed... URL: From Sebastian.Wagner.fl at ait.ac.at Wed Aug 27 11:12:05 2014 From: Sebastian.Wagner.fl at ait.ac.at (Wagner Sebastian) Date: Wed, 27 Aug 2014 15:12:05 +0000 Subject: [Numpy-discussion] Convert 3d NumPy array into 2d In-Reply-To: References: Message-ID: Hi, Our short example-data: >>> np.arange(10).reshape(1,5,2) array([[[0, 1], [2, 3], [4, 5], [6, 7], [8, 9]]]) Shape is (1,5,2) Two possibilies: >>> data.reshape(5,2) array([[0, 1], [2, 3], [4, 5], [6, 7], [8, 9]]) Or just: >>> data[0] array([[0, 1], [2, 3], [4, 5], [6, 7], [8, 9]]) From: numpy-discussion-bounces at scipy.org [mailto:numpy-discussion-bounces at scipy.org] On Behalf Of phinn stuart Sent: Mittwoch, 27. August 2014 17:09 To: python-list at python.org; scipy-user at scipy.org; numpy-discussion at scipy.org Subject: [Numpy-discussion] Convert 3d NumPy array into 2d Hi everyone, how can I convert (1L, 480L, 1440L) shaped numpy array into (480L, 1440L)? Thanks in the advance. phinn -------------- next part -------------- An HTML attachment was scrubbed... URL: From ben.root at ou.edu Wed Aug 27 11:15:54 2014 From: ben.root at ou.edu (Benjamin Root) Date: Wed, 27 Aug 2014 11:15:54 -0400 Subject: [Numpy-discussion] Convert 3d NumPy array into 2d In-Reply-To: References: Message-ID: There is also np.squeeze(), which will eliminate any singleton dimensions (but I personally hate using it because it can accidentally squeeze out dimensions that you didn't intend to squeeze when you have arbitrary input data). 
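A short sketch of that pitfall: a size-1 axis that carries real data disappears just like a packaging axis, whereas explicit indexing states which axis to drop:

```python
import numpy as np

batch = np.ones((1, 480, 1440))   # leading 1 is just packaging
single_row = np.ones((1, 1440))   # leading 1 is one real row of data

print(np.squeeze(batch).shape)       # (480, 1440) -- intended
print(np.squeeze(single_row).shape)  # (1440,)     -- a real axis vanished

print(batch[0].shape)                # (480, 1440) -- drops exactly one axis
```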
Ben Root On Wed, Aug 27, 2014 at 11:12 AM, Wagner Sebastian < Sebastian.Wagner.fl at ait.ac.at> wrote: > Hi, > > > > Our short example-data: > > >>> np.arange(10).reshape(1,5,2) > > array([[[0, 1], > > [2, 3], > > [4, 5], > > [6, 7], > > [8, 9]]]) > > Shape is (1,5,2) > > > > Two possibilies: > > >>> data.reshape(5,2) > > array([[0, 1], > > [2, 3], > > [4, 5], > > [6, 7], > > [8, 9]]) > > > > Or just: > > >>> data[0] > > array([[0, 1], > > [2, 3], > > [4, 5], > > [6, 7], > > [8, 9]]) > > > > > > *From:* numpy-discussion-bounces at scipy.org [mailto: > numpy-discussion-bounces at scipy.org] *On Behalf Of *phinn stuart > *Sent:* Mittwoch, 27. August 2014 17:09 > *To:* python-list at python.org; scipy-user at scipy.org; > numpy-discussion at scipy.org > *Subject:* [Numpy-discussion] Convert 3d NumPy array into 2d > > > > Hi everyone, how can I convert (1L, 480L, 1440L) shaped numpy array into > (480L, 1440L)? > > > > Thanks in the advance. > > > > phinn > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jtaylor.debian at googlemail.com Wed Aug 27 11:16:19 2014 From: jtaylor.debian at googlemail.com (Julian Taylor) Date: Wed, 27 Aug 2014 17:16:19 +0200 Subject: [Numpy-discussion] Convert 3d NumPy array into 2d In-Reply-To: References: Message-ID: <53FDF643.4050408@googlemail.com> On 27.08.2014 17:08, phinn stuart wrote: > Hi everyone, how can I convert (1L, 480L, 1440L) shaped numpy array into > (480L, 1440L)? > > Thanks in the advance. np.squeeze removes empty dimensions: In [2]: np.squeeze(np.ones((1,23,232))).shape Out[2]: (23, 232) From sebastian at sipsolutions.net Wed Aug 27 11:16:35 2014 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Wed, 27 Aug 2014 16:16:35 +0100 Subject: [Numpy-discussion] Convert 3d NumPy array into 2d In-Reply-To: References: Message-ID: <1409152595.13186.0.camel@sebastian-t440> On Do, 2014-08-28 at 00:08 +0900, phinn stuart wrote: > Hi everyone, how can I convert (1L, 480L, 1440L) shaped numpy array > into (480L, 1440L)? > Just slice it arr[0, ...] will do the trick. If you are daring, np.squeeze also works, or of course np.reshape. - Sebastian > > Thanks in the advance. > > phinn > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From jaime.frio at gmail.com Wed Aug 27 12:44:58 2014 From: jaime.frio at gmail.com (=?UTF-8?Q?Jaime_Fern=C3=A1ndez_del_R=C3=ADo?=) Date: Wed, 27 Aug 2014 09:44:58 -0700 Subject: [Numpy-discussion] Should concatenate broadcast shapes? 
Message-ID:

After reading this stackoverflow question:

http://stackoverflow.com/questions/25530223/append-a-list-at-the-end-of-each-row-of-2d-array

I was reminded that the `np.concatenate` family of functions does not broadcast the shapes of their inputs:

>>> import numpy as np
>>> a = np.arange(6).reshape(3, 2)
>>> b = np.arange(6, 8)
>>> np.concatenate((a, b), axis=1)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ValueError: all the input arrays must have same number of dimensions
>>> np.concatenate((a, b[None]), axis=1)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ValueError: all the input array dimensions except for the concatenation axis
must match exactly
>>> np.concatenate((a, np.tile(b[None], (a.shape[0], 1))), axis=1)
array([[0, 1, 6, 7],
       [2, 3, 6, 7],
       [4, 5, 6, 7]])

But there doesn't seem to be any fundamental reason why they shouldn't:

>>> from numpy.lib.stride_tricks import as_strided
>>> b_ = as_strided(b, (a.shape[0],)+b.shape, (0,)+b.strides)
>>> np.concatenate((a, b_), axis=1)
array([[0, 1, 6, 7],
       [2, 3, 6, 7],
       [4, 5, 6, 7]])

Is there any fundamental interface design reason why things are the way they are? Or is it simply that no one has implemented broadcasting for these functions? Without thinking much about it, I am +1 on doing this... At the least, it would probably be good to add a note to the docs explaining why broadcasting is not implemented.

Jaime

--
(\__/)
( O.o)
( > <) Este es Conejo. Copia a Conejo en tu firma y ayúdale en sus planes de dominación mundial.

From robert.kern at gmail.com Wed Aug 27 13:01:15 2014
From: robert.kern at gmail.com (Robert Kern)
Date: Wed, 27 Aug 2014 18:01:15 +0100
Subject: [Numpy-discussion] Should concatenate broadcast shapes?
In-Reply-To: References:
Message-ID:

On Wed, Aug 27, 2014 at 5:44 PM, Jaime Fernández del Río wrote:
> I was reminded that the `np.concatenate` family of functions does not
> broadcast the shapes of their inputs

In my experience, when I get that ValueError, it has usually been a legitimate error on my part and broadcasting would not have accomplished what I wanted. Typically, I forgot to transpose something. If we allowed broadcasting, my most common source of errors using these functions would silently do something unintended.

a = np.arange(6).reshape(3, 2)
b = np.arange(6, 9)  # b.shape == (3,)
# I *intend* to append b as a new column, but forget to make b.shape == (3, 1)
c = np.hstack([a, b])
# If hstack() doesn't broadcast, that will fail and show me my error.
# If it does broadcast, it "succeeds" but gives me something I didn't want:
array([[0, 1, 6, 7, 8],
       [2, 3, 6, 7, 8],
       [4, 5, 6, 7, 8]])

--
Robert Kern

From jaime.frio at gmail.com Wed Aug 27 13:02:52 2014
From: jaime.frio at gmail.com (Jaime Fernández del Río)
Date: Wed, 27 Aug 2014 10:02:52 -0700
Subject: [Numpy-discussion] Does a `mergesorted` function make sense?
Message-ID:

A request was opened in github to add a `merge` function to numpy that would merge two sorted 1d arrays into a single sorted 1d array. I have been playing around with that idea for a while, and have a branch in my numpy fork that adds a `mergesorted` function to `numpy.lib`:

https://github.com/jaimefrio/numpy/commit/ce5d480afecc989a36e5d2bf4ea1d1ba58a83b0a

I drew inspiration from C++ STL algorithms, and merged into a single function what merge, set_union, set_intersection, set_difference and set_symmetric_difference do there.

My first thought when implementing this was to not make it a public function, but use it under the hood to speed up some of the functions of `arraysetops.py`, which are now merging two already sorted arrays by doing `np.sort(np.concatenate((a, b)))`. I would need to revisit my testing, but the speed-ups weren't that great.

One other thing I saw value in for some of the `arraysetops.py` functions, but couldn't fully figure out, was in providing extra output aside from the merged arrays, either in the form of indices, or of boolean masks, indicating which items of the original arrays made it into the merged one, and/or where they ended up in it.

Since there is at least one other person out there that likes it, is there any more interest in such a function? If yes, any comments on what the proper interface for extra output should be? Although perhaps the best is to leave that out for starters and see what use people make of it, if any.

Jaime

--
(\__/)
( O.o)
( > <) Este es Conejo. Copia a Conejo en tu firma y ayúdale en sus planes de dominación mundial.

From jtaylor.debian at googlemail.com Wed Aug 27 13:07:24 2014
From: jtaylor.debian at googlemail.com (Julian Taylor)
Date: Wed, 27 Aug 2014 19:07:24 +0200
Subject: [Numpy-discussion] ANN: NumPy 1.9.0 release candidate 1 available
Message-ID: <53FE104C.2020006@googlemail.com>

Hello,

Almost punctually for EuroScipy we have finally managed to release the first release candidate of NumPy 1.9. We intend to only fix bugs until the final release, which we plan to do in the next 1-2 weeks.

In this release numerous performance improvements have been added. Most significantly, the indexing code has been rewritten to be several times faster for most cases, and the performance of using small arrays and scalars has almost doubled. Plenty of other functions have been improved too; nonzero, where, count_nonzero, floating point min/max, boolean argmin/argmax, searchsorted, triu/tril and masked sorting can be expected to perform significantly better in many cases.

Also, NumPy now releases the GIL for more functions. Most notably, the indexing now releases it, and the random module's state object has a private lock instead of using the GIL. This allows leveraging pure python threads more efficiently.

In order to make working with arrays containing NaN values easier, nanmedian and nanpercentile have been added, which ignore these values.
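For example (a quick illustration, not taken from the release notes):

import numpy as np

a = np.array([[1., np.nan, 3.],
              [4., 5., np.nan]])

np.nanmedian(a)                  # 3.5, the NaNs are ignored
np.nanpercentile(a, 50, axis=1)  # array([ 2. ,  4.5]), per-row medians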
These functions and the regular median and percentile now also support the generalized axis arguments that ufuncs already have; these allow reducing along multiple axes in one call.

Please see the release notes for all the details. Please also take note of the many small compatibility notes and deprecations in the notes.
https://github.com/numpy/numpy/blob/maintenance/1.9.x/doc/release/1.9.0-notes.rst

The source tarballs and win32 binaries can be downloaded here:
https://sourceforge.net/projects/numpy/files/NumPy/1.9.0rc1

Cheers,
Julian Taylor

From jaime.frio at gmail.com Wed Aug 27 13:12:59 2014
From: jaime.frio at gmail.com (Jaime Fernández del Río)
Date: Wed, 27 Aug 2014 10:12:59 -0700
Subject: [Numpy-discussion] Should concatenate broadcast shapes?
In-Reply-To: References:
Message-ID:

On Wed, Aug 27, 2014 at 10:01 AM, Robert Kern wrote:
> In my experience, when I get that ValueError, it has usually been a
> legitimate error on my part and broadcasting would not have
> accomplished what I wanted.

That makes sense, I kind of figured there had to be a reason. So though it may be beating a dead horse, perhaps adding a `broadcast=False` argument to the function would do the trick? No side effects unless you ask for them, in which case you had it coming...

Jaime

--
(\__/)
( O.o)
( > <) Este es Conejo. Copia a Conejo en tu firma y ayúdale en sus planes de dominación mundial.

From hoogendoorn.eelco at gmail.com Wed Aug 27 13:35:40 2014
From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn)
Date: Wed, 27 Aug 2014 19:35:40 +0200
Subject: [Numpy-discussion] Does a `mergesorted` function make sense?
In-Reply-To: References:
Message-ID:

It wouldn't hurt to have this function, but my intuition is that its use will be minimal. If you are already working with sorted arrays, you already have a flop cost of that order of magnitude, and the optimized merge saves you a factor of two at the very most. Using numpy means you are sacrificing factors of two and beyond relative to pure C left, right and center anyway, so if this kind of thing matters to you, you probably won't be working in numpy in the first place.
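For reference, a vectorized merge of two sorted arrays is already expressible with existing primitives, which bounds what a dedicated C implementation can add; a rough sketch (the function name and the tie handling are illustrative only):

import numpy as np

def merge_sorted(a, b):
    # insertion positions of b's elements within the merged output
    idx = np.searchsorted(a, b) + np.arange(len(b))
    out = np.empty(len(a) + len(b), dtype=np.result_type(a, b))
    out[idx] = b
    mask = np.ones(len(out), dtype=bool)
    mask[idx] = False
    out[mask] = a    # the remaining slots are filled by a, in order
    return out

merge_sorted(np.array([1, 3, 5, 7]), np.array([2, 3, 6]))
# array([1, 2, 3, 3, 5, 6, 7])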
That said, I share your interest in overhauling arraysetops. I see many opportunities for expanding its functionality. There is a question that amounts to 'how do I do group-by in numpy' on stackoverflow almost every week. That would have my top priority; but also things like extending np.unique to graph edges, or other more complex input, are very often useful to me.

I've written up a draft a while ago which accomplishes all of the above and more. It reimplements functions like np.unique around a common Index object. This index object encapsulates the precomputation (sorting) required for efficient set-ops on different datatypes, and provides a common interface to obtain the kind of information you are talking about (which is used extensively internally in the implementation of group_by, for instance).

i.e., this functionality allows you to write neat things like

group_by(randint(0,9,(100,2))).median(rand(100))

But I have the feeling much more could be done in this direction, and I feel this draft could really use a bit of back and forth. If we are going to completely rewrite arraysetops, we might as well do it right.
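For comparison, the same computation in plain numpy requires collapsing the composite keys to scalar ids first (a sketch; the packing trick assumes the key entries are below 9):

import numpy as np
from numpy.random import randint, rand

keys = randint(0, 9, (100, 2))   # composite keys: each row is one key
vals = rand(100)

# collapse each row to a single scalar id so np.unique can group them
ids = keys[:, 0] * 9 + keys[:, 1]
uniq, inv = np.unique(ids, return_inverse=True)
medians = np.array([np.median(vals[inv == i]) for i in range(len(uniq))])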
From jaime.frio at gmail.com Wed Aug 27 14:29:27 2014
From: jaime.frio at gmail.com (Jaime Fernández del Río)
Date: Wed, 27 Aug 2014 11:29:27 -0700
Subject: [Numpy-discussion] Does a `mergesorted` function make sense?
In-Reply-To: References:
Message-ID:

Hi Eelco,

I took a deeper look into your code a couple of weeks back. I don't think I have fully grasped what it allows completely, but I agree that some form of what you have there is highly desirable. Along the same lines, for some time I have been thinking that the right place for a `groupby` in numpy is as a method of ufuncs, so that `np.add.groupby(arr, groups)` would do a multidimensional version of `np.bincount(groups, weights=arr)`. You would then need a more powerful version of `np.unique` to produce the `groups`, but that is something that Joe Kington's old PR was very close to achieving, and it should probably be resurrected as well. But yes, there seems to be material for a NEP here, and some guidance from one of the numpy devs would be helpful in getting this somewhere.
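Something along these lines can already be emulated today; a minimal sketch of the 1d case (the `np.add.groupby` spelling itself is hypothetical):

import numpy as np

groups = np.array([0, 1, 0, 2, 1, 0])
arr = np.array([1., 2., 3., 4., 5., 6.])

np.bincount(groups, weights=arr)        # array([ 10.,   7.,   4.])

# the same via argsort + reduceat, which generalizes to any reducing ufunc
order = np.argsort(groups, kind='mergesort')   # stable sort keeps group order
g = groups[order]
starts = np.flatnonzero(np.r_[True, g[1:] != g[:-1]])
np.add.reduceat(arr[order], starts)     # array([ 10.,   7.,   4.])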
Jaime

--
(\__/)
( O.o)
( > <) Este es Conejo. Copia a Conejo en tu firma y ayúdale en sus planes de dominación mundial.

From hoogendoorn.eelco at gmail.com Wed Aug 27 15:27:06 2014
From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn)
Date: Wed, 27 Aug 2014 21:27:06 +0200
Subject: [Numpy-discussion] Does a `mergesorted` function make sense?
In-Reply-To: References:
Message-ID:

If I understand you correctly, the current implementation supports these operations. All reductions over groups (except for median) are performed through the corresponding ufunc (see GroupBy.reduce). This works on multidimensional arrays as well, although this broadcasting over the non-grouping axes is accomplished using np.vectorize. Actual vectorization only happens over the axis being grouped over, but this is usually a long axis. If it isn't, it is more efficient to perform a reduction by means of splitting the array by its groups first, and then map the iterable of groups over some reduction operation (as noted in the docstring of GroupBy.reduce).
From hoogendoorn.eelco at gmail.com Wed Aug 27 15:29:49 2014
From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn)
Date: Wed, 27 Aug 2014 21:29:49 +0200
Subject: [Numpy-discussion] Does a `mergesorted` function make sense?
In-Reply-To: References:
Message-ID:

f.i., this works as expected as well (100 keys of 1d int arrays and 100 values of 1d float arrays):

group_by(randint(0,4,(100,2))).mean(rand(100,2))
From hoogendoorn.eelco at gmail.com Wed Aug 27 15:38:47 2014
From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn)
Date: Wed, 27 Aug 2014 21:38:47 +0200
Subject: [Numpy-discussion] Does a `mergesorted` function make sense?
In-Reply-To: References:
Message-ID:

i.e., if the grouped axis is small but the other axes are not, you could write this, which avoids the python loop over the long axis that np.vectorize would otherwise perform:

import numpy as np
from grouping import group_by

keys = np.random.randint(0,4,10)
values = np.random.rand(10,2000)
for k,g in zip(*group_by(keys)(values)):
    print k, g.mean(0)
From jaime.frio at gmail.com Wed Aug 27 17:32:37 2014
From: jaime.frio at gmail.com (Jaime Fernández del Río)
Date: Wed, 27 Aug 2014 14:32:37 -0700
Subject: [Numpy-discussion] Does a `mergesorted` function make sense?
In-Reply-To: References:
Message-ID:

Yes, I was aware of that. But the point would be to provide true vectorization on those operations. The way I see it, numpy may not have to have a GroupBy implementation, but it should at least enable implementing one that is fast and efficient over any axis.
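For what it's worth, `ufunc.reduceat` can already be pointed along an arbitrary axis; a rough sketch of a grouped sum over axis 1:

import numpy as np

values = np.random.rand(5, 1000)                 # grouping along the long axis
keys = np.sort(np.random.randint(0, 4, 1000))    # contiguous groups
starts = np.flatnonzero(np.r_[True, keys[1:] != keys[:-1]])

sums = np.add.reduceat(values, starts, axis=1)   # shape (5, number_of_groups)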
Jaime

--
(\__/)
( O.o)
( > <) Este es Conejo. Copia a Conejo en tu firma y ayúdale en sus planes de dominación mundial.

From orion at cora.nwra.com Wed Aug 27 17:52:18 2014
From: orion at cora.nwra.com (Orion Poplawski)
Date: Wed, 27 Aug 2014 15:52:18 -0600
Subject: [Numpy-discussion] ANN: NumPy 1.9.0 release candidate 1 available
In-Reply-To: <53FE104C.2020006@googlemail.com>
References: <53FE104C.2020006@googlemail.com>
Message-ID: <53FE5312.6070808@cora.nwra.com>

On 08/27/2014 11:07 AM, Julian Taylor wrote:
> Almost punctually for EuroScipy we have finally managed to release the
> first release candidate of NumPy 1.9.

I'm seeing the following errors from setup.py:

non-existing path in 'numpy/f2py': 'docs'
non-existing path in 'numpy/f2py': 'f2py.1'
non-existing path in 'numpy/lib': 'benchmarks'

It would be nice if f2py.1 was installed in /usr/share/man/man1/f2py.1.
Filed as https://github.com/numpy/numpy/issues/5010

--
Orion Poplawski
Technical Manager                     303-415-9701 x222
NWRA, Boulder/CoRA Office             FAX: 303-415-9702
3380 Mitchell Lane                       orion at nwra.com
Boulder, CO 80301                   http://www.nwra.com

From davidmenhur at gmail.com Wed Aug 27 19:11:43 2014
From: davidmenhur at gmail.com (Daπid)
Date: Thu, 28 Aug 2014 01:11:43 +0200
Subject: [Numpy-discussion] Does a `mergesorted` function make sense?
In-Reply-To: References:
Message-ID:

On 27 August 2014 19:02, Jaime Fernández del Río wrote:
> Since there is at least one other person out there that likes it, is there
> any more interest in such a function? If yes, any comments on what the
> proper interface for extra output should be? Although perhaps the best is
> to leave that out for starters and see what use people make of it, if any.

I think a perhaps more useful thing would be to implement timsort. I understand it is capable of taking full advantage of partially sorted arrays, with the extra safety of not making the assumption that the individual arrays are sorted. This will also be useful for other real-world cases where the data is already partially sorted.

From hoogendoorn.eelco at gmail.com Wed Aug 27 19:49:19 2014
From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn)
Date: Thu, 28 Aug 2014 01:49:19 +0200
Subject: [Numpy-discussion] Does a `mergesorted` function make sense?
In-Reply-To: References:
Message-ID:

I just checked the docs on ufuncs, and it appears that's a solved problem now, since ufunc.reduceat now comes with an axis argument. Or maybe it already did when I wrote that, but I simply wasn't paying attention. Either way, the code is fully vectorized now, in both grouped and non-grouped axes. It's a lot of code, but all that happens for a grouping, other than some O(1) and O(n) stuff, is an argsort of the keys, and then the reduction itself, all fully vectorized.

Note that I sort the values first, and then use ufunc.reduceat on the groups. It would seem to me that ufunc.at should be more efficient, by avoiding this indirection, but testing very much revealed the opposite, for reasons unclear to me. Perhaps that's changed now as well.
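A minimal sketch of the two approaches being compared, for a grouped sum:

import numpy as np

keys = np.random.randint(0, 10, 100000)
vals = np.random.rand(100000)

# unbuffered scatter-add: no sorting needed, but historically slow
out_at = np.zeros(10)
np.add.at(out_at, keys, vals)

# argsort + reduceat: sort once, then one contiguous reduction per group
order = np.argsort(keys)
k = keys[order]
starts = np.flatnonzero(np.r_[True, k[1:] != k[:-1]])
out_reduceat = np.add.reduceat(vals[order], starts)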
>>>>> >>>>> _______________________________________________
>>>>> NumPy-Discussion mailing list
>>>>> NumPy-Discussion at scipy.org
>>>>> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>>>>>
>>>>>
>>>>
>>>
>>
>> _______________________________________________
>> NumPy-Discussion mailing list
>> NumPy-Discussion at scipy.org
>> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>>
>>
>
>
> --
> (\__/)
> ( O.o)
> ( > <) Este es Conejo. Copia a Conejo en tu firma y ayúdale en sus planes
> de dominación mundial.
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From charlesr.harris at gmail.com Thu Aug 28 01:00:23 2014
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Wed, 27 Aug 2014 23:00:23 -0600
Subject: [Numpy-discussion] ANN: NumPy 1.9.0 release candidate 1 available
In-Reply-To: <53FE5312.6070808@cora.nwra.com>
References: <53FE104C.2020006@googlemail.com> <53FE5312.6070808@cora.nwra.com>
Message-ID: 

On Wed, Aug 27, 2014 at 3:52 PM, Orion Poplawski wrote:

> On 08/27/2014 11:07 AM, Julian Taylor wrote:
> > Hello,
> >
> > Almost punctually for EuroScipy we have finally managed to release the
> > first release candidate of NumPy 1.9.
> > We intend to only fix bugs until the final release which we plan to do
> > in the next 1-2 weeks.
>
> I'm seeing the following errors from setup.py:
>
> non-existing path in 'numpy/f2py': 'docs'
> non-existing path in 'numpy/f2py': 'f2py.1'
> non-existing path in 'numpy/lib': 'benchmarks'
>

Hmm, benchmarks is long gone, and the f2py files also no longer exist.
Since this causes no problems on my system, I suspect something else. How
are you doing the install?

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From mogus.mochena at famu.edu Thu Aug 28 04:19:09 2014
From: mogus.mochena at famu.edu (Mochena, Mogus D.)
Date: Thu, 28 Aug 2014 08:19:09 +0000
Subject: [Numpy-discussion] Unsubscribe
Message-ID: 

Please unsubscribe me from this list.

________________________________
From: numpy-discussion-bounces at scipy.org [numpy-discussion-bounces at scipy.org] on behalf of Charles R Harris [charlesr.harris at gmail.com]
Sent: Thursday, August 28, 2014 1:00 AM
To: Discussion of Numerical Python
Subject: Re: [Numpy-discussion] ANN: NumPy 1.9.0 release candidate 1 available

On Wed, Aug 27, 2014 at 3:52 PM, Orion Poplawski > wrote:
On 08/27/2014 11:07 AM, Julian Taylor wrote:
> Hello,
>
> Almost punctually for EuroScipy we have finally managed to release the
> first release candidate of NumPy 1.9.
> We intend to only fix bugs until the final release which we plan to do
> in the next 1-2 weeks.

I'm seeing the following errors from setup.py:

non-existing path in 'numpy/f2py': 'docs'
non-existing path in 'numpy/f2py': 'f2py.1'
non-existing path in 'numpy/lib': 'benchmarks'

Hmm, benchmarks is long gone, and the f2py files also no longer exist. Since this causes no problems on my system, I suspect something else. How are you doing the install?

Chuck
-------------- next part --------------
An HTML attachment was scrubbed...
URL: From cmkleffner at gmail.com Thu Aug 28 16:08:22 2014 From: cmkleffner at gmail.com (Carl Kleffner) Date: Thu, 28 Aug 2014 22:08:22 +0200 Subject: [Numpy-discussion] ANN: NumPy 1.9.0 release candidate 1 available In-Reply-To: References: <53FE104C.2020006@googlemail.com> <53FE5312.6070808@cora.nwra.com> Message-ID: I put 4 wheels for numpy-1.9.0rc1 on https://bitbucket.org/carlkl/mingw-w64-for-python/downloads . All wheels are compiled with the mingw-w64 compiler and makes use of OpenBLAS (latest github version). The 32 bit versions still have some testing errors on corner cases for atan2 and hypot. Carl 2014-08-28 7:00 GMT+02:00 Charles R Harris : > > > > On Wed, Aug 27, 2014 at 3:52 PM, Orion Poplawski > wrote: > >> On 08/27/2014 11:07 AM, Julian Taylor wrote: >> > Hello, >> > >> > Almost punctually for EuroScipy we have finally managed to release the >> > first release candidate of NumPy 1.9. >> > We intend to only fix bugs until the final release which we plan to do >> > in the next 1-2 weeks. >> >> >> I'm seeing the following errors from setup.py: >> >> >> non-existing path in 'numpy/f2py': 'docs' >> non-existing path in 'numpy/f2py': 'f2py.1' >> >> non-existing path in 'numpy/lib': 'benchmarks' >> > > Hmm, benchmarks is long gone, and the f2py files also no longer exist. > Since this causes no problems on my system, I suspect something else. How > are you doing the install? > > Chuck > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jaime.frio at gmail.com Thu Aug 28 20:14:41 2014 From: jaime.frio at gmail.com (=?UTF-8?Q?Jaime_Fern=C3=A1ndez_del_R=C3=ADo?=) Date: Thu, 28 Aug 2014 17:14:41 -0700 Subject: [Numpy-discussion] PR added: frozen dimensions in gufunc signatures Message-ID: Hi, I have just sent a PR (https://github.com/numpy/numpy/pull/5015), adding the possibility of having frozen dimensions in gufunc signatures. As a proof of concept, I have added a `cross1d` gufunc to `numpy.core.umath_tests`: In [1]: import numpy as np In [2]: from numpy.core.umath_tests import cross1d In [3]: cross1d.signature Out[3]: '(3),(3)->(3)' In [4]: a = np.random.rand(1000, 3) In [5]: b = np.random.rand(1000, 3) In [6]: np.allclose(np.cross(a, b), cross1d(a, b)) Out[6]: True In [7]: %timeit np.cross(a, b) 10000 loops, best of 3: 76.1 us per loop In [8]: %timeit cross1d(a, b) 100000 loops, best of 3: 13.1 us per loop In [9]: c = np.random.rand(1000, 2) In [10]: d = np.random.rand(1000, 2) In [11]: cross1d(c, d) --------------------------------------------------------------------------- ValueError Traceback (most recent call last) in () ----> 1 cross1d(c, d) ValueError: cross1d: Operand 0 has a mismatch in its core dimension 0, with gufunc signature (3),(3)->(3) (size 2 is different from 3) The speed up over `np.cross` is nice, and while `np.cross` is not the best of examples, as it needs to handle more sizes, in many cases this will allow producing gufuncs that work without a Python wrapper redoing checks that are best left to the iterator, such as dimension sizes. It still needs tests, but before embarking on fully developing those, I wanted to make sure that there is an interest on this. I would also like to further enhance gufuncs providing computed dimensions, e.g. making it possible to e.g. 
define `pairwise_cross` with signature '(n, 3)->($m, 3)', where the $ indicates that m is a computed dimension, that would have to be calculated by a function passed to the gufunc constructor and stored in the gufunc object, based on the other core dimensions. In this case it would make $m be n*(n-1), so that all pairwise cross products between 3D vectors could be computed. The syntax with '$' is kind of crappy, so any suggestions on how to better express this in the signature are more than welcome, as well as any feedback on the merits (or lack of them) of implementing this. Jaime -- (\__/) ( O.o) ( > <) Este es Conejo. Copia a Conejo en tu firma y ay?dale en sus planes de dominaci?n mundial. -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Thu Aug 28 20:40:06 2014 From: njs at pobox.com (Nathaniel Smith) Date: Fri, 29 Aug 2014 01:40:06 +0100 Subject: [Numpy-discussion] PR added: frozen dimensions in gufunc signatures In-Reply-To: References: Message-ID: On Fri, Aug 29, 2014 at 1:14 AM, Jaime Fern?ndez del R?o wrote: > Hi, > > I have just sent a PR (https://github.com/numpy/numpy/pull/5015), adding the > possibility of having frozen dimensions in gufunc signatures. As a proof of > concept, I have added a `cross1d` gufunc to `numpy.core.umath_tests`: > > In [1]: import numpy as np > In [2]: from numpy.core.umath_tests import cross1d > > In [3]: cross1d.signature > Out[3]: '(3),(3)->(3)' > > In [4]: a = np.random.rand(1000, 3) > In [5]: b = np.random.rand(1000, 3) > > In [6]: np.allclose(np.cross(a, b), cross1d(a, b)) > Out[6]: True > > In [7]: %timeit np.cross(a, b) > 10000 loops, best of 3: 76.1 us per loop > > In [8]: %timeit cross1d(a, b) > 100000 loops, best of 3: 13.1 us per loop > > In [9]: c = np.random.rand(1000, 2) > In [10]: d = np.random.rand(1000, 2) > > In [11]: cross1d(c, d) > --------------------------------------------------------------------------- > ValueError Traceback (most recent call last) > in () > ----> 1 cross1d(c, d) > > ValueError: cross1d: Operand 0 has a mismatch in its core dimension 0, with > gufunc signature (3),(3)->(3) (size 2 is different from 3) > > The speed up over `np.cross` is nice, and while `np.cross` is not the best > of examples, as it needs to handle more sizes, in many cases this will allow > producing gufuncs that work without a Python wrapper redoing checks that are > best left to the iterator, such as dimension sizes. > > It still needs tests, but before embarking on fully developing those, I > wanted to make sure that there is an interest on this. > > I would also like to further enhance gufuncs providing computed dimensions, > e.g. making it possible to e.g. define `pairwise_cross` with signature '(n, > 3)->($m, 3)', where the $ indicates that m is a computed dimension, that > would have to be calculated by a function passed to the gufunc constructor > and stored in the gufunc object, based on the other core dimensions. In this > case it would make $m be n*(n-1), so that all pairwise cross products > between 3D vectors could be computed. > > The syntax with '$' is kind of crappy, so any suggestions on how to better > express this in the signature are more than welcome, as well as any feedback > on the merits (or lack of them) of implementing this. Some thoughts: When I first saw the PR my first reaction was that maybe we should be allowing more general hooks for a gufunc to choose its core dimensions. 
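In Python terms, such a hook might look something like the sketch
below (hypothetical, purely to illustrate the idea; nothing like the
actual C API in the PR):

def pairwise_cross_core_dims(input_core_shapes):
    # Hypothetical shape hook: given the core shapes of the inputs,
    # return the core shapes of the outputs. For the pairwise-cross
    # example with signature (n,3)->($m,3) it computes m = n*(n-1).
    (n, three), = input_core_shapes  # one input, core shape (n, 3)
    assert three == 3
    return [(n * (n - 1), 3)]  # one output core shape

print(pairwise_cross_core_dims([(4, 3)]))  # [(12, 3)]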
Reading the code convinced me that this is a relatively minimal enhancement over what we're currently doing, so your current PR looks fine to me. But, for your computed dimension idea I'm wondering if what we should do instead is just let a gufunc provide a C callback that looks at the input array dimensions and explicitly says somehow which dimensions it wants to treat as the core dimensions and what its output shapes will be. There's no rule that we have to extend the signature mini-language to be Turing complete, we can just use C :-). It would be good to have a better motivation for computed gufunc dimensions, though. Your "all pairwise cross products" example would be *much* better handled by implementing the .outer method for binary gufuncs: pairwise_cross(a) == cross.outer(a, a). This would make gufuncs more consistent with ufuncs, plus let you do all-pairwise-cross-products between two different sets of cross products, plus give us all-pairwise-matrix-products for free, etc. While you're messing around with the gufunc dimension matching logic, any chance we can tempt you to implement the "optional dimensions" needed to handle '@', solve, etc. elegantly? The rule would be that you can write something like (n?,k),(k,m?)->(n?,m?) and the ? dimensions are allowed to take on an additional value "nothing at all". If there's no dimension available in the input, then we act like it was reshaped to add a dimension with shape 1, and then in the output we squeeze this dimension out again. I guess the rules would be that (1) in the input, you can have ? dimensions at the beginning or the end of your shape, but not both at the same time, (2) any dimension that has a ? in one place must have it in all places, (3) when checking argument conformity, "nothing at all" only matches against "nothing at all", not against 1; this is because if we allowed (n?,m),(n?,m)->(n?,m) to be applied to two arrays with shapes (5,) and (1, 5), then it would be ambiguous whether the output should have shape (5,) or (1, 5). -n -- Nathaniel J. Smith Postdoctoral researcher - Informatics - University of Edinburgh http://vorpus.org From jaime.frio at gmail.com Fri Aug 29 04:55:00 2014 From: jaime.frio at gmail.com (=?UTF-8?Q?Jaime_Fern=C3=A1ndez_del_R=C3=ADo?=) Date: Fri, 29 Aug 2014 01:55:00 -0700 Subject: [Numpy-discussion] PR added: frozen dimensions in gufunc signatures In-Reply-To: References: Message-ID: On Thu, Aug 28, 2014 at 5:40 PM, Nathaniel Smith wrote: > Some thoughts: > > But, for your computed dimension idea I'm wondering if what we should > do instead is just let a gufunc provide a C callback that looks at the > input array dimensions and explicitly says somehow which dimensions it > wants to treat as the core dimensions and what its output shapes will > be. There's no rule that we have to extend the signature mini-language > to be Turing complete, we can just use C :-). > > It would be good to have a better motivation for computed gufunc > dimensions, though. Your "all pairwise cross products" example would > be *much* better handled by implementing the .outer method for binary > gufuncs: pairwise_cross(a) == cross.outer(a, a). This would make > gufuncs more consistent with ufuncs, plus let you do > all-pairwise-cross-products between two different sets of cross > products, plus give us all-pairwise-matrix-products for free, etc. > The outer for binary gufuncs sounds like a good idea. A reduce for binary gufuncs that allow it (like square matrix multiplication) would also be nice. 
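For what it's worth, the intended semantics of such an outer method
can already be sketched with broadcasting (illustrative only, relying
on the broadcasting np.cross rather than an actual gufunc method):

import numpy as np

# All pairwise cross products between two sets of 3-vectors, i.e.
# what a hypothetical cross.outer(a, b) could return.
a = np.random.rand(4, 3)
b = np.random.rand(5, 3)
pairwise = np.cross(a[:, None, :], b[None, :, :])  # shape (4, 5, 3)

assert pairwise.shape == (4, 5, 3)
assert np.allclose(pairwise[1, 2], np.cross(a[1], b[2]))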
But going back to the original question, the pairwise whatevers were just an example: one could come up with several others, e.g.: (m),(n)->($p),($q) with $p = m - n and $q = n - 1, could be (I think) the signature of a polynomial division gufunc (m),(n)->($p), with $p = m - n + 1, could be the signature of a convolution or correlation gufunc (m)->($n), with $n = m / 2, could be some form of downsampling gufunc > While you're messing around with the gufunc dimension matching logic, > any chance we can tempt you to implement the "optional dimensions" > needed to handle '@', solve, etc. elegantly? The rule would be that > you can write something like > (n?,k),(k,m?)->(n?,m?) > and the ? dimensions are allowed to take on an additional value > "nothing at all". If there's no dimension available in the input, then > we act like it was reshaped to add a dimension with shape 1, and then > in the output we squeeze this dimension out again. I guess the rules > would be that (1) in the input, you can have ? dimensions at the > beginning or the end of your shape, but not both at the same time, (2) > any dimension that has a ? in one place must have it in all places, > (3) when checking argument conformity, "nothing at all" only matches > against "nothing at all", not against 1; this is because if we allowed > (n?,m),(n?,m)->(n?,m) to be applied to two arrays with shapes (5,) and > (1, 5), then it would be ambiguous whether the output should have > shape (5,) or (1, 5). > I definitely do not mind taking a look into it. I need to think a little more about the rules to convince myself that there is a consistent set of them that we can use. I also thought there may be a performance concern, that you may want to have different implementations when dimensions are missing, not automatically add a 1 and then remove it. It doesn't seem to be the case with neither `np.dot` nor `np.solve`, so maybe I am being overly cautious. Thanks for your comments and ideas. I have a feeling there are some nice features hidden in here, but I can't seem to figure out what should they be on my own. Jaime -- (\__/) ( O.o) ( > <) Este es Conejo. Copia a Conejo en tu firma y ay?dale en sus planes de dominaci?n mundial. -------------- next part -------------- An HTML attachment was scrubbed... URL: From ndbecker2 at gmail.com Fri Aug 29 08:31:29 2014 From: ndbecker2 at gmail.com (Neal Becker) Date: Fri, 29 Aug 2014 08:31:29 -0400 Subject: [Numpy-discussion] ANN: NumPy 1.9.0 release candidate 1 available References: <53FE104C.2020006@googlemail.com> Message-ID: How do I run tests? python setup.py --help-commands claims 'test' is a command, but doesn't seem to work: python setup.py test Running from numpy source directory. /usr/lib64/python2.7/distutils/dist.py:267: UserWarning: Unknown distribution option: 'test_suite' warnings.warn(msg) usage: setup.py [global_opts] cmd1 [cmd1_opts] [cmd2 [cmd2_opts] ...] or: setup.py --help [cmd1 cmd2 ...] or: setup.py --help-commands or: setup.py cmd --help -- Those who don't understand recursion are doomed to repeat it From sebastian at sipsolutions.net Fri Aug 29 08:39:08 2014 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Fri, 29 Aug 2014 13:39:08 +0100 Subject: [Numpy-discussion] ANN: NumPy 1.9.0 release candidate 1 available In-Reply-To: References: <53FE104C.2020006@googlemail.com> Message-ID: <1409315948.10779.1.camel@sebastian-t440> On Fr, 2014-08-29 at 08:31 -0400, Neal Becker wrote: > How do I run tests? 
> > python setup.py --help-commands claims 'test' is a command, but doesn't seem to
> work:
>

There is a runtests script you can use, it should do the building, too.
Or just install and then run `np.test()` (or run nosetest manually on
the installed stuff).

- Sebastian

> python setup.py test
> Running from numpy source directory.
> /usr/lib64/python2.7/distutils/dist.py:267: UserWarning: Unknown distribution
> option: 'test_suite'
> warnings.warn(msg)
> usage: setup.py [global_opts] cmd1 [cmd1_opts] [cmd2 [cmd2_opts] ...]
> or: setup.py --help [cmd1 cmd2 ...]
> or: setup.py --help-commands
> or: setup.py cmd --help
>
>
> -- Those who don't understand recursion are doomed to repeat it
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> http://mail.scipy.org/mailman/listinfo/numpy-discussion
>

From ndbecker2 at gmail.com Fri Aug 29 08:45:56 2014
From: ndbecker2 at gmail.com (Neal Becker)
Date: Fri, 29 Aug 2014 08:45:56 -0400
Subject: [Numpy-discussion] ANN: NumPy 1.9.0 release candidate 1 available
References: <53FE104C.2020006@googlemail.com>
Message-ID: 

doesn't seem to work on fedora 20 x86_64
After python setup.py install --user, import fails:

python
Python 2.7.5 (default, Jun 25 2014, 10:19:55)
[GCC 4.8.2 20131212 (Red Hat 4.8.2-7)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> import numpy
Traceback (most recent call last):
File "", line 1, in
File "/home/nbecker/.local/lib/python2.7/site-packages/numpy/__init__.py", line 153, in
from . import add_newdocs
File "/home/nbecker/.local/lib/python2.7/site-packages/numpy/add_newdocs.py", line 13, in
from numpy.lib import add_newdoc
File "/home/nbecker/.local/lib/python2.7/site-packages/numpy/lib/__init__.py", line 8, in
from .type_check import *
File "/home/nbecker/.local/lib/python2.7/site-packages/numpy/lib/type_check.py", line 11, in
import numpy.core.numeric as _nx
File "/home/nbecker/.local/lib/python2.7/site-packages/numpy/core/__init__.py", line 11, in
from . import numeric
File "/home/nbecker/.local/lib/python2.7/site-packages/numpy/core/numeric.py", line 2721, in
_setdef()
File "/home/nbecker/.local/lib/python2.7/site-packages/numpy/core/numeric.py", line 2717, in _setdef
defval = [UFUNC_BUFSIZE_DEFAULT, ERR_DEFAULT2, None]
NameError: global name 'ERR_DEFAULT2' is not defined

--
-- Those who don't understand recursion are doomed to repeat it

From ndbecker2 at gmail.com Fri Aug 29 08:50:12 2014
From: ndbecker2 at gmail.com (Neal Becker)
Date: Fri, 29 Aug 2014 08:50:12 -0400
Subject: [Numpy-discussion] ANN: NumPy 1.9.0 release candidate 1 available
References: <53FE104C.2020006@googlemail.com>
Message-ID: 

OK, it's fixed by doing:

rm -rf ~/.local/lib/python2.7/site-packages/numpy*
python setup.py install --user

I guess something was not cleaned out from previous packages

From ben.root at ou.edu Fri Aug 29 09:26:47 2014
From: ben.root at ou.edu (Benjamin Root)
Date: Fri, 29 Aug 2014 09:26:47 -0400
Subject: [Numpy-discussion] ANN: NumPy 1.9.0 release candidate 1 available
In-Reply-To: 
References: <53FE104C.2020006@googlemail.com>
Message-ID: 

It is generally a good idea when switching between releases to execute "git
clean -fxd" prior to rebuilding. Admittedly, I don't know how cleaning out
that directory in .local could have impacted things. Go figure. Cheers!
Ben Root On Fri, Aug 29, 2014 at 8:50 AM, Neal Becker wrote: > OK, it's fixed by doing: > > rm -rf ~/.local/lib/python2.7/site-packages/numpy* > python setup.py install --user > > I guess something was not cleaned out from previous packages > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From orion at cora.nwra.com Fri Aug 29 12:34:33 2014 From: orion at cora.nwra.com (Orion Poplawski) Date: Fri, 29 Aug 2014 10:34:33 -0600 Subject: [Numpy-discussion] ANN: NumPy 1.9.0 release candidate 1 available In-Reply-To: References: <53FE104C.2020006@googlemail.com> <53FE5312.6070808@cora.nwra.com> Message-ID: <5400AB99.6010509@cora.nwra.com> On 08/27/2014 11:00 PM, Charles R Harris wrote: > > > > On Wed, Aug 27, 2014 at 3:52 PM, Orion Poplawski > wrote: > > On 08/27/2014 11:07 AM, Julian Taylor wrote: > > Hello, > > > > Almost punctually for EuroScipy we have finally managed to release the > > first release candidate of NumPy 1.9. > > We intend to only fix bugs until the final release which we plan to do > > in the next 1-2 weeks. > > > I'm seeing the following errors from setup.py: > > > non-existing path in 'numpy/f2py': 'docs' > non-existing path in 'numpy/f2py': 'f2py.1' > > non-existing path in 'numpy/lib': 'benchmarks' > > > Hmm, benchmarks is long gone, and the f2py files also no longer exist. Since > this causes no problems on my system, I suspect something else. How are you > doing the install? > > Chuck The references need to be removed from the appropriate setup.py files. f2py.1 is now in doc/f2py. -- Orion Poplawski Technical Manager 303-415-9701 x222 NWRA, Boulder/CoRA Office FAX: 303-415-9702 3380 Mitchell Lane orion at nwra.com Boulder, CO 80301 http://www.nwra.com From ben.root at ou.edu Fri Aug 29 22:10:34 2014 From: ben.root at ou.edu (Benjamin Root) Date: Fri, 29 Aug 2014 22:10:34 -0400 Subject: [Numpy-discussion] Can't seem to use np.insert() or np.append() for structured arrays Message-ID: Consider the following: a = np.array([(1, 'a'), (2, 'b'), (3, 'c')], dtype=[('foo', 'i'), ('bar', 'a1')]) b = np.append(a, (4, 'd')) Traceback (most recent call last): File "", line 1, in File "/home/ben/miniconda/lib/python2.7/site-packages/numpy/lib/function_base.py", line 3555, in append return concatenate((arr, values), axis=axis) TypeError: invalid type promotion b = np.insert(a, 4, (4, 'd')) Traceback (most recent call last): File "", line 1, in File "/home/ben/miniconda/lib/python2.7/site-packages/numpy/lib/function_base.py", line 3464, in insert new[slobj] = values ValueError: could not convert string to float: d In my original code snippet I was developing which has a more involved dtype, I actually got a different exception: b = np.append(a, c) Traceback (most recent call last): File "", line 1, in File "/home/ben/miniconda/lib/python2.7/site-packages/numpy/lib/function_base.py", line 3553, in append values = ravel(values) File "/home/ben/miniconda/lib/python2.7/site-packages/numpy/core/fromnumeric.py", line 1367, in ravel return asarray(a).ravel(order) File "/home/ben/miniconda/lib/python2.7/site-packages/numpy/core/numeric.py", line 460, in asarray return array(a, dtype, copy=False, order=order) ValueError: setting an array element with a sequence. 
Luckily, this works as a work-around: >>> b = np.append(a, np.array([(4, 'd')], dtype=a.dtype)) >>> b array([(1, 'a'), (2, 'b'), (3, 'c'), (4, 'd')], dtype=[('foo', 'i'), ('bar', 'S1')]) The same happens whether I enclose the value with square bracket or not. I suspect that this array type just wasn't considered when its checking logic was developed. This is with 1.8.2 from miniconda. Should we consider this a bug or are structured arrays just not expected to be modified like this? Cheers! Ben Root -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Fri Aug 29 22:29:47 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Fri, 29 Aug 2014 20:29:47 -0600 Subject: [Numpy-discussion] Can't seem to use np.insert() or np.append() for structured arrays In-Reply-To: References: Message-ID: On Fri, Aug 29, 2014 at 8:10 PM, Benjamin Root wrote: > Consider the following: > > a = np.array([(1, 'a'), (2, 'b'), (3, 'c')], dtype=[('foo', 'i'), ('bar', > 'a1')]) > b = np.append(a, (4, 'd')) > Traceback (most recent call last): > File "", line 1, in > File > "/home/ben/miniconda/lib/python2.7/site-packages/numpy/lib/function_base.py", > line 3555, in append > return concatenate((arr, values), axis=axis) > TypeError: invalid type promotion > b = np.insert(a, 4, (4, 'd')) > Traceback (most recent call last): > File "", line 1, in > File > "/home/ben/miniconda/lib/python2.7/site-packages/numpy/lib/function_base.py", > line 3464, in insert > new[slobj] = values > ValueError: could not convert string to float: d > > In my original code snippet I was developing which has a more involved > dtype, I actually got a different exception: > b = np.append(a, c) > Traceback (most recent call last): > File "", line 1, in > File > "/home/ben/miniconda/lib/python2.7/site-packages/numpy/lib/function_base.py", > line 3553, in append > values = ravel(values) > File > "/home/ben/miniconda/lib/python2.7/site-packages/numpy/core/fromnumeric.py", > line 1367, in ravel > return asarray(a).ravel(order) > File > "/home/ben/miniconda/lib/python2.7/site-packages/numpy/core/numeric.py", > line 460, in asarray > return array(a, dtype, copy=False, order=order) > ValueError: setting an array element with a sequence. > > Luckily, this works as a work-around: > >>> b = np.append(a, np.array([(4, 'd')], dtype=a.dtype)) > >>> b > array([(1, 'a'), (2, 'b'), (3, 'c'), (4, 'd')], > dtype=[('foo', 'i'), ('bar', 'S1')]) > > The same happens whether I enclose the value with square bracket or not. I > suspect that this array type just wasn't considered when its checking logic > was developed. This is with 1.8.2 from miniconda. Should we consider this a > bug or are structured arrays just not expected to be modified like this? > > Could be one of many bug reports related to assignment to structured types. Can you try using `x`? In [25]: x = array([(4, 'd')], dt)[0] In [26]: type(x) Out[26]: numpy.void In [27]: x Out[27]: (4, 'd') Chuck -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From sebastian at sipsolutions.net Sat Aug 30 04:04:40 2014 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Sat, 30 Aug 2014 09:04:40 +0100 Subject: [Numpy-discussion] Can't seem to use np.insert() or np.append() for structured arrays In-Reply-To: References: Message-ID: <1409385880.17692.1.camel@sebastian-t440> On Fr, 2014-08-29 at 22:10 -0400, Benjamin Root wrote: > Consider the following: > > a = np.array([(1, 'a'), (2, 'b'), (3, 'c')], dtype=[('foo', 'i'), > ('bar', 'a1')]) > > b = np.append(a, (4, 'd')) > Traceback (most recent call last): > File "", line 1, in > File > "/home/ben/miniconda/lib/python2.7/site-packages/numpy/lib/function_base.py", line 3555, in append > return concatenate((arr, values), axis=axis) > TypeError: invalid type promotion > b = np.insert(a, 4, (4, 'd')) > Traceback (most recent call last): > File "", line 1, in > File > "/home/ben/miniconda/lib/python2.7/site-packages/numpy/lib/function_base.py", line 3464, in insert > new[slobj] = values > ValueError: could not convert string to float: d > Ooops, nice bug in there, might have been me :) (will open a PR). - Sebastian > > In my original code snippet I was developing which has a more involved > dtype, I actually got a different exception: > b = np.append(a, c) > Traceback (most recent call last): > File "", line 1, in > File > "/home/ben/miniconda/lib/python2.7/site-packages/numpy/lib/function_base.py", line 3553, in append > values = ravel(values) > File > "/home/ben/miniconda/lib/python2.7/site-packages/numpy/core/fromnumeric.py", line 1367, in ravel > return asarray(a).ravel(order) > File > "/home/ben/miniconda/lib/python2.7/site-packages/numpy/core/numeric.py", line 460, in asarray > return array(a, dtype, copy=False, order=order) > ValueError: setting an array element with a sequence. > > > Luckily, this works as a work-around: > >>> b = np.append(a, np.array([(4, 'd')], dtype=a.dtype)) > >>> b > array([(1, 'a'), (2, 'b'), (3, 'c'), (4, 'd')], > dtype=[('foo', 'i'), ('bar', 'S1')]) > > > > The same happens whether I enclose the value with square bracket or > not. I suspect that this array type just wasn't considered when its > checking logic was developed. This is with 1.8.2 from miniconda. > Should we consider this a bug or are structured arrays just not > expected to be modified like this? > > > Cheers! 
> > Ben Root > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion From sebastian at sipsolutions.net Sat Aug 30 05:05:11 2014 From: sebastian at sipsolutions.net (Sebastian Berg) Date: Sat, 30 Aug 2014 10:05:11 +0100 Subject: [Numpy-discussion] Can't seem to use np.insert() or np.append() for structured arrays In-Reply-To: <1409385880.17692.1.camel@sebastian-t440> References: <1409385880.17692.1.camel@sebastian-t440> Message-ID: <1409389511.23364.1.camel@sebastian-t440> On Sa, 2014-08-30 at 09:04 +0100, Sebastian Berg wrote: > On Fr, 2014-08-29 at 22:10 -0400, Benjamin Root wrote: > > Consider the following: > > > > a = np.array([(1, 'a'), (2, 'b'), (3, 'c')], dtype=[('foo', 'i'), > > ('bar', 'a1')]) > > > > b = np.append(a, (4, 'd')) > > Traceback (most recent call last): > > File "", line 1, in > > File > > "/home/ben/miniconda/lib/python2.7/site-packages/numpy/lib/function_base.py", line 3555, in append > > return concatenate((arr, values), axis=axis) > > TypeError: invalid type promotion > > b = np.insert(a, 4, (4, 'd')) > > Traceback (most recent call last): > > File "", line 1, in > > File > > "/home/ben/miniconda/lib/python2.7/site-packages/numpy/lib/function_base.py", line 3464, in insert > > new[slobj] = values > > ValueError: could not convert string to float: d > > > Actually, for insert it is easy to fix (https://github.com/numpy/numpy/pull/5022), for append there are some difficulties, because the dtype is not forced to be the arrays dtype, but gotten from both the original and the appended value currently. - Sebastian > Ooops, nice bug in there, might have been me :) (will open a PR). > > - Sebastian > > > > > In my original code snippet I was developing which has a more involved > > dtype, I actually got a different exception: > > b = np.append(a, c) > > Traceback (most recent call last): > > File "", line 1, in > > File > > "/home/ben/miniconda/lib/python2.7/site-packages/numpy/lib/function_base.py", line 3553, in append > > values = ravel(values) > > File > > "/home/ben/miniconda/lib/python2.7/site-packages/numpy/core/fromnumeric.py", line 1367, in ravel > > return asarray(a).ravel(order) > > File > > "/home/ben/miniconda/lib/python2.7/site-packages/numpy/core/numeric.py", line 460, in asarray > > return array(a, dtype, copy=False, order=order) > > ValueError: setting an array element with a sequence. > > > > > > Luckily, this works as a work-around: > > >>> b = np.append(a, np.array([(4, 'd')], dtype=a.dtype)) > > >>> b > > array([(1, 'a'), (2, 'b'), (3, 'c'), (4, 'd')], > > dtype=[('foo', 'i'), ('bar', 'S1')]) > > > > > > > > The same happens whether I enclose the value with square bracket or > > not. I suspect that this array type just wasn't considered when its > > checking logic was developed. This is with 1.8.2 from miniconda. > > Should we consider this a bug or are structured arrays just not > > expected to be modified like this? > > > > > > Cheers! 
> > > > Ben Root > > > > _______________________________________________ > > NumPy-Discussion mailing list > > NumPy-Discussion at scipy.org > > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > From josef.pktd at gmail.com Sat Aug 30 13:43:27 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sat, 30 Aug 2014 13:43:27 -0400 Subject: [Numpy-discussion] inplace unary operations? Message-ID: Is there a way to negate a boolean, or to change the sign of a float inplace ? Josef random thoughts -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Sat Aug 30 13:45:47 2014 From: njs at pobox.com (Nathaniel Smith) Date: Sat, 30 Aug 2014 18:45:47 +0100 Subject: [Numpy-discussion] inplace unary operations? In-Reply-To: References: Message-ID: On Sat, Aug 30, 2014 at 6:43 PM, wrote: > Is there a way to negate a boolean, or to change the sign of a float inplace > ? np.logical_not(arr, out=arr) np.negative(arr, out=arr) -n -- Nathaniel J. Smith Postdoctoral researcher - Informatics - University of Edinburgh http://vorpus.org From ben.root at ou.edu Sat Aug 30 14:39:33 2014 From: ben.root at ou.edu (Benjamin Root) Date: Sat, 30 Aug 2014 14:39:33 -0400 Subject: [Numpy-discussion] inplace unary operations? In-Reply-To: References: Message-ID: Random thoughts are the best kinds of thoughts! I didn't even know there was a np.negative() function! I will keep this card up my sleeve at work for one of those save-the-day moments in optimization. Cheers! Ben Root On Sat, Aug 30, 2014 at 1:45 PM, Nathaniel Smith wrote: > On Sat, Aug 30, 2014 at 6:43 PM, wrote: > > Is there a way to negate a boolean, or to change the sign of a float > inplace > > ? > > np.logical_not(arr, out=arr) > np.negative(arr, out=arr) > > -n > > -- > Nathaniel J. Smith > Postdoctoral researcher - Informatics - University of Edinburgh > http://vorpus.org > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Sat Aug 30 16:33:46 2014 From: njs at pobox.com (Nathaniel Smith) Date: Sat, 30 Aug 2014 21:33:46 +0100 Subject: [Numpy-discussion] inplace unary operations? In-Reply-To: References: Message-ID: On Sat, Aug 30, 2014 at 7:39 PM, Benjamin Root wrote: > Random thoughts are the best kinds of thoughts! I didn't even know there was > a np.negative() function! Me neither, I had to look it up :-) -n -- Nathaniel J. Smith Postdoctoral researcher - Informatics - University of Edinburgh http://vorpus.org From josef.pktd at gmail.com Sun Aug 31 09:31:33 2014 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Sun, 31 Aug 2014 09:31:33 -0400 Subject: [Numpy-discussion] inplace unary operations? In-Reply-To: References: Message-ID: On Sat, Aug 30, 2014 at 1:45 PM, Nathaniel Smith wrote: > On Sat, Aug 30, 2014 at 6:43 PM, wrote: > > Is there a way to negate a boolean, or to change the sign of a float > inplace > > ? > > np.logical_not(arr, out=arr) > np.negative(arr, out=arr) > Thanks Nathaniel. np.negative might save a bit of memory and time when we have to negate the loglikelihood all the time. Josef > > -n > > -- > Nathaniel J. 
Smith > Postdoctoral researcher - Informatics - University of Edinburgh > http://vorpus.org > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From pierre at barbierdereuille.net Sun Aug 31 11:04:25 2014 From: pierre at barbierdereuille.net (Pierre Barbier de Reuille) Date: Sun, 31 Aug 2014 17:04:25 +0200 Subject: [Numpy-discussion] inplace unary operations? In-Reply-To: References: Message-ID: Just to point out another solution to change the sign: >>> arr *= -1 Both solutions take the same time on my computer. However, the boolean equivalent: >>> arr ^= True is a lot slower than using negative. My two cents ... -- Dr. Barbier de Reuille, Pierre Institute of Plant Sciences Altenbergrain 21, CH-3013 Bern, Switzerland http://www.botany.unibe.ch/associated/systemsx/index.php On 31 August 2014 15:31, wrote: > > > > On Sat, Aug 30, 2014 at 1:45 PM, Nathaniel Smith wrote: > >> On Sat, Aug 30, 2014 at 6:43 PM, wrote: >> > Is there a way to negate a boolean, or to change the sign of a float >> inplace >> > ? >> >> np.logical_not(arr, out=arr) >> np.negative(arr, out=arr) >> > > Thanks Nathaniel. > > np.negative might save a bit of memory and time when we have to negate the > loglikelihood all the time. > > Josef > > > >> >> -n >> >> -- >> Nathaniel J. Smith >> Postdoctoral researcher - Informatics - University of Edinburgh >> http://vorpus.org >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > http://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From hoogendoorn.eelco at gmail.com Sun Aug 31 15:48:33 2014 From: hoogendoorn.eelco at gmail.com (Eelco Hoogendoorn) Date: Sun, 31 Aug 2014 21:48:33 +0200 Subject: [Numpy-discussion] Does a `mergesorted` function make sense? In-Reply-To: References: Message-ID: Ive organized all code I had relating to this subject in a github repository . That should facilitate shooting around ideas. Ive also added more documentation and structure to make it easier to see what is going on. Hopefully we can converge on a common vision, and then improve the documentation and testing to make it worthy of including in the numpy master. Note that there is also a complete rewrite of the classic numpy.arraysetops, such that they are also generalized to more complex input, such as finding unique graph edges, and so on. You mentioned getting the numpy core developers involved; are they not subscribed to this mailing list? I wouldn't be surprised; youd hope there is a channel of discussion concerning development with higher signal to noise.... On Thu, Aug 28, 2014 at 1:49 AM, Eelco Hoogendoorn < hoogendoorn.eelco at gmail.com> wrote: > I just checked the docs on ufuncs, and it appears that's a solved problem > now, since ufunc.reduceat now comes with an axis argument. Or maybe it > already did when I wrote that, but I simply wasn't paying attention. Either > way, the code is fully vectorized now, in both grouped and non-grouped > axes. 
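A minimal, self-contained sketch of the argsort-plus-ufunc.reduceat
grouping pattern Eelco describes above (illustrative only; this is
not the API of the linked grouping repository):

import numpy as np

keys = np.random.randint(0, 4, 10)
values = np.random.rand(10, 2000)

# Stable sort by key, so that each group occupies a contiguous run.
order = np.argsort(keys, kind='mergesort')
sorted_keys = keys[order]
sorted_values = values[order]

# Index where each run of equal keys starts.
starts = np.flatnonzero(
    np.concatenate(([True], sorted_keys[1:] != sorted_keys[:-1])))
unique_keys = sorted_keys[starts]

# One vectorized reduction per group along the grouped axis.
sums = np.add.reduceat(sorted_values, starts, axis=0)
counts = np.diff(np.append(starts, len(sorted_keys)))
means = sums / counts[:, None]

for k, m in zip(unique_keys, means):
    assert np.allclose(m, values[keys == k].mean(0))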
Its a lot of code, but all that happens for a grouping other than > some O(1) and O(n) stuff is an argsort of the keys, and then the reduction > itself, all fully vectorized. > > Note that I sort the values first, and then use ufunc.reduceat on the > groups. It would seem to me that ufunc.at should be more efficient, by > avoiding this indirection, but testing very much revealed the opposite, for > reasons unclear to me. Perhaps that's changed now as well. > > > On Wed, Aug 27, 2014 at 11:32 PM, Jaime Fern?ndez del R?o < > jaime.frio at gmail.com> wrote: > >> Yes, I was aware of that. But the point would be to provide true >> vectorization on those operations. >> >> The way I see it, numpy may not have to have a GroupBy implementation, >> but it should at least enable implementing one that is fast and efficient >> over any axis. >> >> >> On Wed, Aug 27, 2014 at 12:38 PM, Eelco Hoogendoorn < >> hoogendoorn.eelco at gmail.com> wrote: >> >>> i.e, if the grouped axis is small but the other axes are not, you could >>> write this, which avoids the python loop over the long axis that >>> np.vectorize would otherwise perform. >>> >>> import numpy as np >>> from grouping import group_by >>> keys = np.random.randint(0,4,10) >>> values = np.random.rand(10,2000) >>> for k,g in zip(*group_by(keys)(values)): >>> print k, g.mean(0) >>> >>> >>> >>> >>> On Wed, Aug 27, 2014 at 9:29 PM, Eelco Hoogendoorn < >>> hoogendoorn.eelco at gmail.com> wrote: >>> >>>> f.i., this works as expected as well (100 keys of 1d int arrays and 100 >>>> values of 1d float arrays): >>>> >>>> group_by(randint(0,4,(100,2))).mean(rand(100,2)) >>>> >>>> >>>> On Wed, Aug 27, 2014 at 9:27 PM, Eelco Hoogendoorn < >>>> hoogendoorn.eelco at gmail.com> wrote: >>>> >>>>> If I understand you correctly, the current implementation supports >>>>> these operations. All reductions over groups (except for median) are >>>>> performed through the corresponding ufunc (see GroupBy.reduce). This works >>>>> on multidimensional arrays as well, although this broadcasting over the >>>>> non-grouping axes is accomplished using np.vectorize. Actual vectorization >>>>> only happens over the axis being grouped over, but this is usually a long >>>>> axis. If it isn't, it is more efficient to perform a reduction by means of >>>>> splitting the array by its groups first, and then map the iterable of >>>>> groups over some reduction operation (as noted in the docstring of >>>>> GroupBy.reduce). >>>>> >>>>> >>>>> On Wed, Aug 27, 2014 at 8:29 PM, Jaime Fern?ndez del R?o < >>>>> jaime.frio at gmail.com> wrote: >>>>> >>>>>> Hi Eelco, >>>>>> >>>>>> I took a deeper look into your code a couple of weeks back. I don't >>>>>> think I have fully grasped what it allows completely, but I agree that some >>>>>> form of what you have there is highly desirable. Along the same lines, for >>>>>> sometime I have been thinking that the right place for a `groupby` in numpy >>>>>> is as a method of ufuncs, so that `np.add.groupby(arr, groups)` would do a >>>>>> multidimensional version of `np.bincount(groups, weights=arr)`. You would >>>>>> then need a more powerful version of `np.unique` to produce the `groups`, >>>>>> but that is something that Joe Kington's old PR was very close to >>>>>> achieving, that should probably be resurrected as well. But yes, there >>>>>> seems to be material for a NEP here, and some guidance from one of the >>>>>> numpy devs would be helpful in getting this somewhere. 
>>>>>> >>>>>> Jaime >>>>>> >>>>>> >>>>>> On Wed, Aug 27, 2014 at 10:35 AM, Eelco Hoogendoorn < >>>>>> hoogendoorn.eelco at gmail.com> wrote: >>>>>> >>>>>>> It wouldn't hurt to have this function, but my intuition is that its >>>>>>> use will be minimal. If you are already working with sorted arrays, you >>>>>>> already have a flop cost on that order of magnitude, and the optimized >>>>>>> merge saves you a factor two at the very most. Using numpy means you are >>>>>>> sacrificing factors of two and beyond relative to pure C left right and >>>>>>> center anyway, so if this kind of thing matters to you, you probably wont >>>>>>> be working in numpy in the first place. >>>>>>> >>>>>>> That said, I share your interest in overhauling arraysetops. I see >>>>>>> many opportunities for expanding its functionality. There is a question >>>>>>> that amounts to 'how do I do group-by in numpy' on stackoverflow almost >>>>>>> every week. That would have my top priority, but also things like extending >>>>>>> np.unique to things like graph edges, or other more complex input, is very >>>>>>> often useful to me. >>>>>>> >>>>>>> Ive written up a draft a while ago >>>>>>> which accomplishes all of the above and more. It reimplements functions >>>>>>> like np.unique around a common Index object. This index object encapsulates >>>>>>> the precomputation (sorting) required for efficient set-ops on different >>>>>>> datatypes, and provides a common interface to obtain the kind of >>>>>>> information you are talking about (which is used extensively internally in >>>>>>> the implementation of group_by, for instance). >>>>>>> >>>>>>> ie, this functionality allows you to write neat things like >>>>>>> group_by(randint(0,9,(100,2))).median(rand(100)) >>>>>>> >>>>>>> But I have the feeling much more could be done in this direction, >>>>>>> and I feel this draft could really use a bit of back and forth. If we are >>>>>>> going to completely rewrite arraysetops, we might as well do it right. >>>>>>> >>>>>>> >>>>>>> On Wed, Aug 27, 2014 at 7:02 PM, Jaime Fern?ndez del R?o < >>>>>>> jaime.frio at gmail.com> wrote: >>>>>>> >>>>>>>> A request was open in github to add a `merge` function to numpy >>>>>>>> that would merge two sorted 1d arrays into a single sorted 1d array. I have >>>>>>>> been playing around with that idea for a while, and have a branch in my >>>>>>>> numpy fork that adds a `mergesorted` function to `numpy.lib`: >>>>>>>> >>>>>>>> >>>>>>>> https://github.com/jaimefrio/numpy/commit/ce5d480afecc989a36e5d2bf4ea1d1ba58a83b0a >>>>>>>> >>>>>>>> I drew inspiration from C++ STL algorithms, and merged into a >>>>>>>> single function what merge, set_union, set_intersection, set_difference and >>>>>>>> set_symmetric_difference do there. >>>>>>>> >>>>>>>> My first thought when implementing this was to not make it a public >>>>>>>> function, but use it under the hood to speed-up some of the functions of >>>>>>>> `arraysetops.py`, which are now merging two already sorted functions by >>>>>>>> doing `np.sort(np.concatenate((a, b)))`. I would need to revisit my >>>>>>>> testing, but the speed-ups weren't that great. >>>>>>>> >>>>>>>> One other thing I saw value in for some of the `arraysetops.py` >>>>>>>> functions, but couldn't fully figure out, was in providing extra output >>>>>>>> aside from the merged arrays, either in the form of indices, or of boolean >>>>>>>> masks, indicating which items of the original arrays made it into the >>>>>>>> merged one, and/or where did they end up in it. 
>>>>>>>> >>>>>>>> Since there is at least one other person out there that likes it, >>>>>>>> is there any more interest in such a function? If yes, any comments on what >>>>>>>> the proper interface for extra output should be? Although perhaps the best >>>>>>>> is to leave that out for starters and see what use people make of it, if >>>>>>>> any. >>>>>>>> >>>>>>>> Jaime >>>>>>>> >>>>>>>> -- >>>>>>>> (\__/) >>>>>>>> ( O.o) >>>>>>>> ( > <) Este es Conejo. Copia a Conejo en tu firma y ay?dale en sus >>>>>>>> planes de dominaci?n mundial. >>>>>>>> >>>>>>>> _______________________________________________ >>>>>>>> NumPy-Discussion mailing list >>>>>>>> NumPy-Discussion at scipy.org >>>>>>>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>>>>>>> >>>>>>>> >>>>>>> >>>>>>> _______________________________________________ >>>>>>> NumPy-Discussion mailing list >>>>>>> NumPy-Discussion at scipy.org >>>>>>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>>>>>> >>>>>>> >>>>>> >>>>>> >>>>>> -- >>>>>> (\__/) >>>>>> ( O.o) >>>>>> ( > <) Este es Conejo. Copia a Conejo en tu firma y ay?dale en sus >>>>>> planes de dominaci?n mundial. >>>>>> >>>>>> _______________________________________________ >>>>>> NumPy-Discussion mailing list >>>>>> NumPy-Discussion at scipy.org >>>>>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>>>>> >>>>>> >>>>> >>>> >>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at scipy.org >>> http://mail.scipy.org/mailman/listinfo/numpy-discussion >>> >>> >> >> >> -- >> (\__/) >> ( O.o) >> ( > <) Este es Conejo. Copia a Conejo en tu firma y ay?dale en sus planes >> de dominaci?n mundial. >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> http://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Sun Aug 31 22:36:28 2014 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sun, 31 Aug 2014 20:36:28 -0600 Subject: [Numpy-discussion] Does a `mergesorted` function make sense? In-Reply-To: References: Message-ID: On Sun, Aug 31, 2014 at 1:48 PM, Eelco Hoogendoorn < hoogendoorn.eelco at gmail.com> wrote: > Ive organized all code I had relating to this subject in a github > repository . > That should facilitate shooting around ideas. Ive also added more > documentation and structure to make it easier to see what is going on. > > Hopefully we can converge on a common vision, and then improve the > documentation and testing to make it worthy of including in the numpy > master. > > Note that there is also a complete rewrite of the classic > numpy.arraysetops, such that they are also generalized to more complex > input, such as finding unique graph edges, and so on. > > You mentioned getting the numpy core developers involved; are they not > subscribed to this mailing list? I wouldn't be surprised; youd hope there > is a channel of discussion concerning development with higher signal to > noise.... > > There are only about 2.5 of us at the moment. Those for whom this is an itch that need scratching should hash things out and make a PR. The main question for me is if it belongs in numpy, scipy, or somewhere else. Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: