From robbmcleod at gmail.com Wed Feb 1 03:28:12 2017 From: robbmcleod at gmail.com (Robert McLeod) Date: Wed, 1 Feb 2017 09:28:12 +0100 Subject: [Numpy-discussion] composing Euler rotation matrices In-Reply-To: <87d1f24ml4.fsf@otaria.sebmel.org> References: <87h94e4tkx.fsf@otaria.sebmel.org> <87d1f24ml4.fsf@otaria.sebmel.org> Message-ID: Instead of trying to decipher what someone wrote on Wikipedia, why don't you look at a working piece of source code? e.g. https://github.com/3dem/relion/blob/master/src/euler.cpp Robert On Wed, Feb 1, 2017 at 4:27 AM, Seb wrote: > On Tue, 31 Jan 2017 21:23:55 -0500, Joseph Fox-Rabinovitz wrote: > > Could you show what you are doing to get the statement "However, I cannot reproduce this matrix via composition; i.e. by multiplying the underlying rotation matrices."? I would guess something involving the `*` operator instead of `@`, but guessing probably won't help you solve your issue. > Sure, although composition is not something I can take credit for, as it's a well-described operation for generating linear transformations. It is the matrix multiplication of two or more transformation matrices. In the case of Euler transformations, it's matrices specifying rotations around 3 orthogonal axes by 3 given angles. I'm using `numpy.dot' to perform matrix multiplication on 2D arrays representing matrices. > However, it's not obvious from the link I provided what particular rotation matrices are multiplied and in what order (i.e. what composition) is used to arrive at the Z1Y2X3 rotation matrix shown. Perhaps I'm not understanding the conventions used therein. This is one of my attempts at reproducing that rotation matrix via composition:
>
> ---<---------------cut here---------------start--------------->---
> import numpy as np
>
> angles = np.radians(np.array([30, 20, 10]))
>
> def z1y2x3(alpha, beta, gamma):
>     """Z1Y2X3 rotation matrix given Euler angles"""
>     return np.array([[np.cos(alpha) * np.cos(beta),
>                       np.cos(alpha) * np.sin(beta) * np.sin(gamma) -
>                       np.cos(gamma) * np.sin(alpha),
>                       np.sin(alpha) * np.sin(gamma) +
>                       np.cos(alpha) * np.cos(gamma) * np.sin(beta)],
>                      [np.cos(beta) * np.sin(alpha),
>                       np.cos(alpha) * np.cos(gamma) +
>                       np.sin(alpha) * np.sin(beta) * np.sin(gamma),
>                       np.cos(gamma) * np.sin(alpha) * np.sin(beta) -
>                       np.cos(alpha) * np.sin(gamma)],
>                      [-np.sin(beta), np.cos(beta) * np.sin(gamma),
>                       np.cos(beta) * np.cos(gamma)]])
>
> euler_mat = z1y2x3(angles[0], angles[1], angles[2])
>
> ## Now via composition
>
> def rotation_matrix(theta, axis, active=False):
>     """Generate rotation matrix for a given axis
>
>     Parameters
>     ----------
>     theta: numeric
>         The angle (degrees) by which to perform the rotation.
>     axis: int
>         Axis around which to perform the rotation (x=0; y=1; z=2)
>     active: bool, optional
>         Whether to return the active transformation matrix.
>
>     Returns
>     -------
>     numpy.ndarray
>         3x3 rotation matrix
>     """
>     theta = np.radians(theta)
>     if axis == 0:
>         R_theta = np.array([[1, 0, 0],
>                             [0, np.cos(theta), -np.sin(theta)],
>                             [0, np.sin(theta), np.cos(theta)]])
>     elif axis == 1:
>         R_theta = np.array([[np.cos(theta), 0, np.sin(theta)],
>                             [0, 1, 0],
>                             [-np.sin(theta), 0, np.cos(theta)]])
>     else:
>         R_theta = np.array([[np.cos(theta), -np.sin(theta), 0],
>                             [np.sin(theta), np.cos(theta), 0],
>                             [0, 0, 1]])
>     if active:
>         R_theta = np.transpose(R_theta)
>     return R_theta
>
> ## The rotations are given as active
> xmat = rotation_matrix(angles[2], 0, active=True)
> ymat = rotation_matrix(angles[1], 1, active=True)
> zmat = rotation_matrix(angles[0], 2, active=True)
> ## The operation seems to imply this composition
> euler_comp_mat = np.dot(xmat, np.dot(ymat, zmat))
> ---<---------------cut here---------------end----------------->---
>
> I believe the matrices `euler_mat' and `euler_comp_mat' should be the same, but they aren't, so it's unclear to me what particular composition is meant to produce the matrix specified by this Z1Y2X3 transformation. What am I missing?
> -- Seb
> _______________________________________________ NumPy-Discussion mailing list NumPy-Discussion at scipy.org https://mail.scipy.org/mailman/listinfo/numpy-discussion
-- Robert McLeod, Ph.D. Center for Cellular Imaging and Nano Analytics (C-CINA) Biozentrum der Universität Basel Mattenstrasse 26, 4058 Basel Work: +41.061.387.3225 robert.mcleod at unibas.ch robert.mcleod at bsse.ethz.ch robbmcleod at gmail.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From naresh.p at okdollar.com Wed Feb 1 03:31:27 2017 From: naresh.p at okdollar.com (Naresh P) Date: Wed, 1 Feb 2017 15:01:27 +0630 Subject: [Numpy-discussion] Fwd: Installation Problem In-Reply-To: References: Message-ID: *Warm Regards,* *Naresh* *Developer* *Consumer Goods Myanmar Limited (CGM), No. 15, Junction Square Complex, 37-G, Pyay Road, 13041 Kamayut Township, Yangon, Myanmar.* *Tel: +959979867914* http://www.cg-m.com ---------- Forwarded message ---------- From: Date: Wed, Feb 1, 2017 at 2:59 PM Subject: Fwd: Installation Problem To: naresh.p at okdollar.com This is a members-only list. Your message has been automatically rejected, since it came from a non-member's email address. Please make sure to use the email account that you used to join this list. ---------- Forwarded message ---------- From: Naresh P To: numpy-discussion at scipy.org Cc: Date: Wed, 1 Feb 2017 14:59:08 +0630 Subject: Fwd: Installation Problem *Warm Regards,* *Naresh* *Developer* *Consumer Goods Myanmar Limited (CGM), No. 15, Junction Square Complex, 37-G, Pyay Road, 13041 Kamayut Township, Yangon, Myanmar.* *Tel: +959979867914* http://www.cg-m.com On Wed, Feb 1, 2017 at 2:54 PM, wrote: > This is a members-only list. Your message has been automatically > rejected, since it came from a non-member's email address. Please > make sure to use the email account that you used to join this list. > > ---------- Forwarded message ---------- > From: Naresh P > To: numpy-discussion at scipy.org > Cc: > Date: Wed, 1 Feb 2017 14:54:21 +0630 > Subject: Fwd: Installation Problem > Hi Team, > I still have that problem; can you please help me?
> > Let me know; I don't know how to get membership. > > Thanks > > *Warm Regards,* > *Naresh* > *Developer* > *Consumer Goods Myanmar Limited (CGM), No. 15, Junction Square Complex, 37-G, Pyay Road, 13041 Kamayut Township, Yangon, Myanmar.* > *Tel: +959979867914* > http://www.cg-m.com > > ---------- Forwarded message ---------- > From: Naresh P > Date: Tue, Jan 31, 2017 at 3:20 PM > Subject: Installation Problem > To: numpy-discussion at scipy.org > > Hi, > I tried many times, but it does not install; the message below is displayed. > Please help me. > > [image: Inline image 1] > > *Warm Regards,* > *Naresh* > *Developer* > *Consumer Goods Myanmar Limited (CGM), No. 15, Junction Square Complex, 37-G, Pyay Road, 13041 Kamayut Township, Yangon, Myanmar.* > *Tel: +959979867914* > http://www.cg-m.com > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image.png Type: image/png Size: 27658 bytes Desc: not available URL: From mmwoodman at gmail.com Wed Feb 1 03:55:31 2017 From: mmwoodman at gmail.com (Marmaduke Woodman) Date: Wed, 1 Feb 2017 09:55:31 +0100 Subject: [Numpy-discussion] ANN: xarray v0.9 released In-Reply-To: References: Message-ID: > On 1 Feb 2017, at 05:19, Stephan Hoyer wrote: > > This release includes five months worth of enhancements and bug fixes from 24 contributors, including some significant enhancements to the data model that are not fully backwards compatible. Looks very nice; is the API stable or are you waiting for a v1.0 release? Is there significant overhead compared to plain ndarray? From matthew.brett at gmail.com Wed Feb 1 04:42:15 2017 From: matthew.brett at gmail.com (Matthew Brett) Date: Wed, 1 Feb 2017 09:42:15 +0000 Subject: [Numpy-discussion] composing Euler rotation matrices In-Reply-To: References: <87h94e4tkx.fsf@otaria.sebmel.org> <87d1f24ml4.fsf@otaria.sebmel.org> Message-ID: Hi, On Wed, Feb 1, 2017 at 8:28 AM, Robert McLeod wrote: > Instead of trying to decipher what someone wrote on Wikipedia, why don't > you look at a working piece of source code? > > e.g. > > https://github.com/3dem/relion/blob/master/src/euler.cpp Also - have a look at https://pypi.python.org/pypi/transforms3d - and in particular you might get some use from symbolic versions of the transformations, e.g. here : https://github.com/matthew-brett/transforms3d/blob/master/transforms3d/derivations/eulerangles.py It's really easy to mix up the conventions, as I'm sure you know - see http://matthew-brett.github.io/transforms3d/reference/transforms3d.euler.html Cheers, Matthew From shoyer at gmail.com Wed Feb 1 12:33:51 2017 From: shoyer at gmail.com (Stephan Hoyer) Date: Wed, 1 Feb 2017 09:33:51 -0800 Subject: [Numpy-discussion] ANN: xarray v0.9 released In-Reply-To: References: Message-ID: On Wed, Feb 1, 2017 at 12:55 AM, Marmaduke Woodman wrote: > Looks very nice; is the API stable or are you waiting for a v1.0 release? > We are pretty close to full API stability but not quite there yet. Enough people are using xarray in production that breaking changes are made with serious caution (and deprecation cycles whenever feasible). The only major backwards-incompatible change planned is an overhaul of indexing to use labeled broadcasting and alignment: https://github.com/pydata/xarray/issues/974 There are a few other "nice to have" features for v1.0 but that's the only one that has the potential to change functionality in a way that we can't cleanly deprecate.
> Is there significant overhead compared to plain ndarray? Xarray is implemented in Python (not C), so it does have significant overhead for every operation. Adding two arrays takes ~100 us, rather than <1 us in NumPy. So you don't want to use it in your inner loop. That said, the overhead is independent of the size of the array. So if you work with large arrays, it is negligible. -------------- next part -------------- An HTML attachment was scrubbed... URL: From stuart at stuartreynolds.net Wed Feb 1 13:16:26 2017 From: stuart at stuartreynolds.net (Stuart Reynolds) Date: Wed, 1 Feb 2017 10:16:26 -0800 Subject: [Numpy-discussion] composing Euler rotation matrices In-Reply-To: References: <87h94e4tkx.fsf@otaria.sebmel.org> <87d1f24ml4.fsf@otaria.sebmel.org> Message-ID: [off topic] Nothing good ever comes from using Euler matrices. All the cool kids are using quaternions these days. They're (in some ways) simpler, can be interpolated easily, don't suffer from gimbal lock (discontinuity), and are not confused about which axis rotation is applied first (for Euler you must decide whether you want to apply x.y.z or z.y.x). They'd be a good addition to numpy. On Wed, Feb 1, 2017 at 1:42 AM, Matthew Brett wrote: > Hi, > > On Wed, Feb 1, 2017 at 8:28 AM, Robert McLeod > wrote: > > Instead of trying to decipher what someone wrote on Wikipedia, why don't > > you look at a working piece of source code? > > > > e.g. > > > > https://github.com/3dem/relion/blob/master/src/euler.cpp > > Also - have a look at https://pypi.python.org/pypi/transforms3d - and > in particular you might get some use from symbolic versions of the > transformations, e.g. here : > https://github.com/matthew-brett/transforms3d/blob/master/transforms3d/derivations/eulerangles.py > > It's really easy to mix up the conventions, as I'm sure you know - see > http://matthew-brett.github.io/transforms3d/reference/transforms3d.euler.html > > Cheers, > > Matthew > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From spluque at gmail.com Wed Feb 1 13:31:51 2017 From: spluque at gmail.com (Seb) Date: Wed, 01 Feb 2017 12:31:51 -0600 Subject: [Numpy-discussion] composing Euler rotation matrices References: <87h94e4tkx.fsf@otaria.sebmel.org> <87d1f24ml4.fsf@otaria.sebmel.org> Message-ID: <87poj1wync.fsf@gmail.com> On Wed, 1 Feb 2017 09:42:15 +0000, Matthew Brett wrote: > Hi, > On Wed, Feb 1, 2017 at 8:28 AM, Robert McLeod wrote: >> Instead of trying to decipher what someone wrote on Wikipedia, why >> don't you look at a working piece of source code? >> e.g. >> https://github.com/3dem/relion/blob/master/src/euler.cpp > Also - have a look at https://pypi.python.org/pypi/transforms3d - and > in particular you might get some use from symbolic versions of the > transformations, e.g. here : > https://github.com/matthew-brett/transforms3d/blob/master/transforms3d/derivations/eulerangles.py > It's really easy to mix up the conventions, as I'm sure you know - see > http://matthew-brett.github.io/transforms3d/reference/transforms3d.euler.html Thank you very much for providing this package. It looks like this is exactly what I was trying to do (learn). The symbolic versions really help show what is going on in the derivations sub-package by showing how each of the 9 matrix elements is found. I'll try to hack it to use active rotations.
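For completeness, here is a minimal sketch (using the standard right-handed, active rotation matrices; `z1y2x3' is the function from the script earlier in this thread) suggesting that the Z1Y2X3 matrix is the product Rz * Ry * Rx, i.e. the reverse of the x-then-y-then-z multiplication order tried above. Note also two pitfalls in that script: `rotation_matrix' converts to radians internally even though `angles' is already in radians, and its active=True branch transposes matrices that are already the active ones:

import numpy as np

alpha, beta, gamma = np.radians([30, 20, 10])

def Rx(t):  # active rotation about the x axis
    return np.array([[1, 0, 0],
                     [0, np.cos(t), -np.sin(t)],
                     [0, np.sin(t), np.cos(t)]])

def Ry(t):  # active rotation about the y axis
    return np.array([[np.cos(t), 0, np.sin(t)],
                     [0, 1, 0],
                     [-np.sin(t), 0, np.cos(t)]])

def Rz(t):  # active rotation about the z axis
    return np.array([[np.cos(t), -np.sin(t), 0],
                     [np.sin(t), np.cos(t), 0],
                     [0, 0, 1]])

# Z1Y2X3 corresponds to Rz . Ry . Rx (intrinsic z-y'-x'' rotations)
composed = np.dot(Rz(alpha), np.dot(Ry(beta), Rx(gamma)))
print(np.allclose(composed, z1y2x3(alpha, beta, gamma)))  # True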
-- Seb From boukhdhiramal at yahoo.fr Tue Feb 7 17:40:46 2017 From: boukhdhiramal at yahoo.fr (Boukhdhir Amal) Date: Tue, 7 Feb 2017 22:40:46 +0000 (UTC) Subject: [Numpy-discussion] zero values in the output of PyArray_AsCArray( References: <187263153.61218.1486507246009.ref@mail.yahoo.com> Message-ID: <187263153.61218.1486507246009@mail.yahoo.com> Hi, I am trying to access an array as a C type using the function 'PyArray_AsCArray'. The problem is that I am getting many 0 values in the resulting C array. Some of the indexed values are correct. This is my code:

static PyObject* cos_func_np(PyObject* self, PyObject* args)
{
    PyObject *in_array_object;
    PyObject *out_array;
    int** segs_2d_array;

    /* Parse single numpy array argument */
    if (!PyArg_ParseTuple(args, "O", &in_array_object))
        return NULL;

    int typenum = NPY_INT64;
    PyArray_Descr *descr;
    descr = PyArray_DescrFromType(typenum);

    npy_intp dims[2];
    PyArray_AsCArray(&in_array_object, (void**) &segs_2d_array, dims, 2, descr);

    printf("\n-segs_2d_array: %d --\n", segs_2d_array[1][5]);

    /* return Py_BuildValue("O", in_array_object); */
}

For example: segs_2d_array[0][0] and segs_2d_array[1][2] output the correct values; however, segs_2d_array[1][3] and segs_2d_array[1][5] are equal to zero. What is wrong with this code, please? -------------- next part -------------- An HTML attachment was scrubbed... URL: From charlesr.harris at gmail.com Mon Feb 13 16:01:37 2017 From: charlesr.harris at gmail.com (Charles R Harris) Date: Mon, 13 Feb 2017 14:01:37 -0700 Subject: [Numpy-discussion] Marten van Kerkwijk added to numpy team. Message-ID: Hi All, I'm pleased to welcome Marten to the numpy team. His reviews of PRs have been very useful in the past and I am happy that he has accepted our invitation to join the team. Cheers, Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Tue Feb 14 03:32:32 2017 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 14 Feb 2017 21:32:32 +1300 Subject: [Numpy-discussion] Marten van Kerkwijk added to numpy team. In-Reply-To: References: Message-ID: On Tue, Feb 14, 2017 at 10:01 AM, Charles R Harris < charlesr.harris at gmail.com> wrote: > Hi All, > > I'm pleased to welcome Marten to the numpy team. His reviews of PRs have > been very useful in the past and I am happy that he has accepted our > invitation to join the team. > Excellent, welcome Marten! Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Tue Feb 14 04:59:36 2017 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Tue, 14 Feb 2017 22:59:36 +1300 Subject: [Numpy-discussion] Building external c modules with mingw64 / numpy In-Reply-To: <243DBD016692E54EB12F37B87C66E70E815DB8@didag1> References: <243DBD016692E54EB12F37B87C66E70E815DB8@didag1> Message-ID: On Sat, Jan 21, 2017 at 9:23 PM, Schnizer, Pierre < pierre.schnizer at helmholtz-berlin.de> wrote: > Dear all, > > I built an external c-module (pygsl) using mingw 64 from > msys2 mingw64-gcc compiler. > > This build required some changes to numpy.distutils to get the > 'python setup.py config' > and > 'python setup.py build' > working.
> In this process I replaced 2 files in numpy.distutils from the numpy git repository:
> - numpy.distutils.misc_util.py, version ec0e046 on 14 Dec 2016
> - numpy.distutils.mingw32ccompiler.py, version ec0e046 on 14 Dec 2016
>
> mingw32ccompiler.py required modification to get it working:
> - a preprocessor had to be defined, as I am using setup.py config
> - specifying the runtime library search path to the linker
> - the include path of the vcruntime
>
> I attached a patch reflecting the changes I had to make to the file mingw32ccompiler.py. If this information is useful I am happy to answer questions.
Thanks for the patch Pierre. For future reference: a pull request on GitHub or a link to a Gist is preferred for us and usually gets you a quicker response. Regarding your question in the patch on including Python's install directory: that shouldn't be necessary, and I'd be wary of applying your patch without understanding why the current numpy.distutils code doesn't work for you. But if your patch works for you then it can't hurt I think. Cheers, Ralf
> Sincerely yours
> Pierre
>
> PS Version infos:
> Python: Python 3.6.0 (v3.6.0:41df79263a11, Dec 23 2016, 08:06:12) [MSC v.1900 64 bit (AMD64)] on win32
> Numpy:
> >> help(numpy.version)
> Help on module numpy.version in numpy:
> DATA
> full_version = '1.12.0'
> git_revision = '561f1accf861ad8606ea2dd723d2be2b09a2dffa'
> release = True
> short_version = '1.12.0'
> version = '1.12.0'
> gcc.exe (Rev2, Built by MSYS2 project) 6.2.0
> ------------------------------
> Helmholtz-Zentrum Berlin für Materialien und Energie GmbH
> Mitglied der Hermann von Helmholtz-Gemeinschaft Deutscher Forschungszentren e.V.
> Aufsichtsrat: Vorsitzender Dr. Karl Eugen Huthmacher, stv. Vorsitzende Dr. Jutta Koch-Unterseher
> Geschäftsführung: Prof. Dr. Anke Rita Kaysser-Pyzalla, Thomas Frederking
> Sitz Berlin, AG Charlottenburg, 89 HRB 5583
> Postadresse: Hahn-Meitner-Platz 1, D-14109 Berlin
> http://www.helmholtz-berlin.de
> _______________________________________________ NumPy-Discussion mailing list NumPy-Discussion at scipy.org https://mail.scipy.org/mailman/listinfo/numpy-discussion
-------------- next part -------------- An HTML attachment was scrubbed... URL: From larsson at cs.uchicago.edu Tue Feb 14 18:34:32 2017 From: larsson at cs.uchicago.edu (Gustav Larsson) Date: Tue, 14 Feb 2017 15:34:32 -0800 Subject: [Numpy-discussion] Proposal to support __format__ Message-ID: Hi everyone! I want to discuss adding support for __format__ in ndarray and I am willing to contribute code-wise once consensus has been reached. It was briefly discussed on GitHub two years ago (https://github.com/numpy/numpy/issues/5543) and I will re-iterate some of the points made there and build off of that. I have been thinking about this a lot in the last few weeks and my thoughts turned into a fairly fleshed-out proposal. The discussion should probably start more high-level, so I apologize if the level of detail is inappropriate at this point in time. I decided on a gist, since the email got too long and clear formatting helps: https://gist.github.com/gustavla/2783543be1204d2b5d368f6a1fb4d069 OK, those are my thoughts for now. What do you think?
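For concreteness, here is a tiny proof-of-concept of the element-wise idea (a sketch only; the subclass and its name are illustrative and not part of the proposal, which targets ndarray itself):

import numpy as np

class FormattableArray(np.ndarray):
    """Sketch: apply a format spec to every element via np.array2string."""
    def __format__(self, spec):
        if not spec:  # empty spec: keep today's str() behavior
            return str(self)
        return np.array2string(
            self, formatter={'all': lambda v: format(v, spec)})

x = np.array([12.3456, 0.1234]).view(FormattableArray)
print('{:8.2f}'.format(x))  # [   12.35     0.12]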
Cheers, Gustav From shoyer at gmail.com Tue Feb 14 18:59:49 2017 From: shoyer at gmail.com (Stephan Hoyer) Date: Tue, 14 Feb 2017 15:59:49 -0800 Subject: [Numpy-discussion] Proposal to support __format__ In-Reply-To: References: Message-ID: On Tue, Feb 14, 2017 at 3:34 PM, Gustav Larsson wrote: > Hi everyone! > > I want to discuss adding support for __format__ in ndarray and I am > willing to > contribute code-wise once consensus has been reached. It was briefly > discussed on GitHub two years ago (https://github.com/numpy/numpy/issues/5543) > and I will re-iterate some of the points made there and build off of that. I > have been thinking about this a lot in the last few weeks and my thoughts > turned > into a fairly fleshed-out proposal. The discussion should probably start > more > high-level, so I apologize if the level of detail is inappropriate at this > point in time. > > I decided on a gist, since the email got too long and clear formatting > helps: > > https://gist.github.com/gustavla/2783543be1204d2b5d368f6a1fb4d069 This is a lovely and clearly written document. Thanks for taking the time to think through this! I encourage you to submit it as a pull request to the NumPy repository as a "NumPy Enhancement Proposal", either now or after we've discussed it: https://docs.scipy.org/doc/numpy-dev/neps/index.html > OK, those are my thoughts for now. What do you think? > Two thoughts for now: 1. For object arrays, I would default to calling format on each element (your "map principle") rather than raising an error. 2. It's absolutely OK to leave functionality unimplemented and not immediately nail down every edge case. As a default, I would suggest raising errors whenever non-empty type specifications are provided rather than raising errors in every case. -------------- next part -------------- An HTML attachment was scrubbed... URL: From larsson at cs.uchicago.edu Tue Feb 14 20:35:23 2017 From: larsson at cs.uchicago.edu (Gustav Larsson) Date: Tue, 14 Feb 2017 17:35:23 -0800 Subject: [Numpy-discussion] Proposal to support __format__ In-Reply-To: References: Message-ID: > I encourage you to submit it as a pull request to the NumPy repository as > a "NumPy Enhancement Proposal", either now or after we've discussed it: > https://docs.scipy.org/doc/numpy-dev/neps/index.html OK, I will let it go through one iteration of comments and then I'll submit one. Thanks! 1. For object arrays, I would default to calling format on each element > (your "map principle") rather than raising an error. I'm glad you brought this up as a possibility. It might be possible, but there are some issues that would need to be resolved. First of all, {} and {:} always work and give the same result they currently do. So, this only affects the situation where the format spec is non-empty. I think there are two main issues: Heterogeneity: Let's say we have x = np.array([12.3, True, 'string', Foo(10)], dtype=np.object). Then, presumably {:.1f} should cause a ValueError since the string does not support format type 'f'. This could create a lot of ValueError land mines for the user. For x[:2] however it should work and produce something like [12.3 1.0]. Note, the "map principle" still can't be strictly true. Let's say we have an array with type object and mostly string-like elements.
Then {:5s} will still not produce exactly {:5s} element-wise, because the string representations need to be repr-based inside the array (otherwise it could break for newlines and things like that and produce spaces that make the boundary between elements ambiguous). This brings me to the next issue. Str vs. repr: If we have a homogeneous object array with type Foo and Foo implements __format__, it would be great if this worked. However, one issue is that Foo.__format__ might return things like newline (or spaces), which would break (or confuse) the printed output (unless it is made incredibly smart to support "vertical alignment"). This issue is essentially the same as for strings in general, which is why they use repr instead. I can think of two solutions: 1) Try to sanitize (or repr-ify) the string returned by __format__ somehow; 2) Put the responsibility on the user and simply let the rendering break if Foo.__format__ does not play well. 2. It's absolutely OK to leave functionality unimplemented and not > immediately nail down every edge case. As a default, I would suggest > raising errors whenever non-empty type specifications are provided rather > than raising errors in every case. > I agree. Gustav On Tue, Feb 14, 2017 at 3:59 PM, Stephan Hoyer wrote: > On Tue, Feb 14, 2017 at 3:34 PM, Gustav Larsson > wrote: > >> Hi everyone! >> >> I want to discuss adding support for __format__ in ndarray and I am >> willing to >> contribute code-wise once consensus has been reached. It was briefly >> discussed on GitHub two years ago (https://github.com/numpy/numpy/issues/5543) >> and I will re-iterate some of the points made there and build off of >> that. I >> have been thinking about this a lot in the last few weeks and my thoughts >> turned >> into a fairly fleshed-out proposal. The discussion should probably start >> more >> high-level, so I apologize if the level of detail is inappropriate at this >> point in time. >> >> I decided on a gist, since the email got too long and clear formatting >> helps: >> >> https://gist.github.com/gustavla/2783543be1204d2b5d368f6a1fb4d069 > > This is a lovely and clearly written document. Thanks for taking the time > to think through this! > > I encourage you to submit it as a pull request to the NumPy repository as > a "NumPy Enhancement Proposal", either now or after we've discussed it: > https://docs.scipy.org/doc/numpy-dev/neps/index.html > > >> OK, those are my thoughts for now. What do you think? >> > > Two thoughts for now: > 1. For object arrays, I would default to calling format on each element > (your "map principle") rather than raising an error. > 2. It's absolutely OK to leave functionality unimplemented and not > immediately nail down every edge case. As a default, I would suggest > raising errors whenever non-empty type specifications are provided rather > than raising errors in every case. > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From shoyer at gmail.com Tue Feb 14 20:55:21 2017 From: shoyer at gmail.com (Stephan Hoyer) Date: Tue, 14 Feb 2017 17:55:21 -0800 Subject: [Numpy-discussion] Proposal to support __format__ In-Reply-To: References: Message-ID: On Tue, Feb 14, 2017 at 5:35 PM, Gustav Larsson wrote: > 1.
For object arrays, I would default to calling format on each element >> (your "map principle") rather than raising an error. >> > > I'm glad you brought this up as a possibility. It might be possible, but > there are some issues that would need to be resolved. First of all, {} and > {:} always work and give the same result they currently do. So, this only > affects the situation where the format spec is non-empty. I think there are > two main issues: > > Heterogeneity: Let's say we have x = np.array([12.3, True, 'string', > Foo(10)], dtype=np.object). Then, presumably {:.1f} should cause a > ValueError since the string does not support format type 'f'. This could > create a lot of ValueError land mines for the user. > Things will absolutely break if you try to do complex operations on in-homogeneously typed arrays. I would put the onus on the user in such a case. > For x[:2] however it should work and produce something like [12.3 1.0]. > Note, the "map principle" still can't be strictly true. Let's say we have > an array with type object and mostly string-like elements. Then {:5s} will > still not produce exactly {:5s} element-wise, because the string > representations need to be repr-based inside the array (otherwise it could > break for newlines and things like that and produce spaces that make the > boundary between elements ambiguous). This brings me to the next issue. > Indeed, this will be a departure from the behavior without a format string, which just uses repr. In my mind, this is the strongest argument against using the map principle here, because there is a discontinuous shift between providing and not providing a format string. > Str vs. repr: If we have a homogeneous object array with type Foo and Foo > implements __format__, it would be great if this worked. However, one issue > is that Foo.__format__ might return things like newline (or spaces), which > would break (or confuse) the printed output (unless it is made incredibly > smart to support "vertical alignment"). This issue is essentially the same > as for strings in general, which is why they use repr instead. I can think > of two solutions: 1) Try to sanitize (or repr-ify) the string returned by > __format__ somehow; 2) Put the responsibility on the user and simply let > the rendering break if Foo.__format__ does not play well. > I wouldn't do anything fancy here to worry about line breaks. It's basically impossible to get this right for edge cases, so I would certainly put the responsibility on the user. On another note, about Python 2 vs 3: I would definitely take the approach of copying the Python 3 behavior on all versions of NumPy (when feasible) and not being too concerned about compatibility with format on Python 2. The future is Python 3. -------------- next part -------------- An HTML attachment was scrubbed... URL: From amit at haystackapp.net Tue Feb 14 23:24:20 2017 From: amit at haystackapp.net (Amit Bhosle) Date: Tue, 14 Feb 2017 20:24:20 -0800 Subject: [Numpy-discussion] ImportError: Importing the multiarray numpy extension module failed Message-ID: Hi, I'm struggling with a numpy issue and web search hasn't helped. I'm on Windows 10, and using Python27. I've tried reinstalling numpy, and also a few different versions, but without any luck. numpy was pulled in as a dependency of timezonefinder==1.5.7 that I need, and the numpy-1.12.0.dist-info distribution was installed. The error on my google-app-engine server's console is as below. Can someone please help? Thanks a bunch in advance.
AB File "timezonefinder\timezonefinder.py", line 8, in from numpy import array, empty, fromfile File "numpy\__init__.py", lin e 142, in from . import add_newdocs File "numpy\add_newdocs.py", line 13, in from numpy.lib import add_newdoc File "numpy\lib\__init__.py", line 8, in from .type_check import * File "numpy\lib\type_check.py ", line 11, in import numpy.core.numeric as _nx File "numpy\core\__init__.py" , line 24, in raise ImportError(msg) ImportError: Importing the multiarray numpy extension module failed. Most likely you are trying to import a failed build of numpy. If you're working with a numpy git repo, try `git clean -xdf` (removes all files not under version control). Otherwise reinstall numpy. -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Wed Feb 15 00:01:09 2017 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 14 Feb 2017 21:01:09 -0800 Subject: [Numpy-discussion] ImportError: Importing the multiarray numpy extension module failed In-Reply-To: References: Message-ID: On Tue, Feb 14, 2017 at 8:24 PM, Amit Bhosle wrote: > Hi, > > I'm struggling with a numpy issue and web search hasn't helped. I'm on > windows 10, and using Python27. > > I've tried reinstalling numpy, and also a few different versions, but > without any luck. > > numpy was pulled in as dependency of timezonefinder==1.5.7 that i need, and > the numpy-1.12.0.dist-info distribution was installed.. > > The error on my google-app-engine server's console is as below.. > Can someone pls help? Are you using the app engine "standard environment"? That's a very weird Python environment that forbids the installation of all packages that contain C code. This obviously includes numpy, and would explain your error. They do provide a pre-installed super-ancient version of numpy with some features removed, which might work for you if you force-uninstall numpy. Otherwise you might need to switch to the "flexible environment". -n -- Nathaniel J. Smith -- https://vorpus.org From m.h.vankerkwijk at gmail.com Wed Feb 15 11:03:51 2017 From: m.h.vankerkwijk at gmail.com (Marten van Kerkwijk) Date: Wed, 15 Feb 2017 11:03:51 -0500 Subject: [Numpy-discussion] Proposal to support __format__ In-Reply-To: References: Message-ID: Hi Gustav, This is great! A few quick comments (mostly echo-ing Stephan's). 1. You basically have a NEP already! Making a PR from it allows to give line-by-line comments, so would help! 2. Don't worry about supporting python2 specifics; just try to ensure it doesn't break; I would not say more about it! 3. On `set_printoptions` -- ideally, it will become possible to use this as a context (i.e., `with set_printoption(...)`). It might make sense to have an `override_format` keyword argument to it. 4. Otherwise, my main suggestion is to start small with the more obvious ones, and not worry too much about format validation, but rather about getting the simple ones to work well (e.g., for an object array, just apply the format given; if it doesn't work, it will error out on its own, which is OK). 5. One bit of detail: the "g" one does confuse me. 
All the best, Marten From matthew.brett at gmail.com Wed Feb 15 14:02:35 2017 From: matthew.brett at gmail.com (Matthew Brett) Date: Wed, 15 Feb 2017 19:02:35 +0000 Subject: [Numpy-discussion] PowerPC testing servers Message-ID: Hey, A recent post to the wheel-builders mailing list pointed out some links to places providing free PowerPC hosting for open source projects, if they agree to a submitted request: https://mail.python.org/pipermail/wheel-builders/2017-February/000257.html It would be good to get some testing going on these architectures. Shall we apply for hosting, as the numpy organization? Cheers, Matthew From ralf.gommers at gmail.com Wed Feb 15 14:37:06 2017 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Thu, 16 Feb 2017 08:37:06 +1300 Subject: [Numpy-discussion] PowerPC testing servers In-Reply-To: References: Message-ID: On Thu, Feb 16, 2017 at 8:02 AM, Matthew Brett wrote: > Hey, > > A recent post to the wheel-builders mailing list pointed out some > links to places providing free PowerPC hosting for open source > projects, if they agree to a submitted request: > > https://mail.python.org/pipermail/wheel-builders/2017-February/000257.html > > It would be good to get some testing going on these architectures. > Shall we apply for hosting, as the numpy organization? > Those are bare VMs it seems. Remembering the Buildbot and Mailman horrors, I think we should be very reluctant to take responsibility for maintaining CI on anything that's not hosted and can be controlled with a simple config file in our repo. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthew.brett at gmail.com Wed Feb 15 14:45:49 2017 From: matthew.brett at gmail.com (Matthew Brett) Date: Wed, 15 Feb 2017 19:45:49 +0000 Subject: [Numpy-discussion] PowerPC testing servers In-Reply-To: References: Message-ID: On Wed, Feb 15, 2017 at 7:37 PM, Ralf Gommers wrote: > On Thu, Feb 16, 2017 at 8:02 AM, Matthew Brett wrote: >> Hey, >> >> A recent post to the wheel-builders mailing list pointed out some >> links to places providing free PowerPC hosting for open source >> projects, if they agree to a submitted request: >> >> https://mail.python.org/pipermail/wheel-builders/2017-February/000257.html >> >> It would be good to get some testing going on these architectures. >> Shall we apply for hosting, as the numpy organization? > > Those are bare VMs it seems. Remembering the Buildbot and Mailman horrors, I > think we should be very reluctant to take responsibility for maintaining CI > on anything that's not hosted and can be controlled with a simple config > file in our repo. Not sure what you mean about mailman - maybe the Enthought servers we didn't have access to? For buildbot, I've been maintaining about 12 crappy old machines for about 7 years now [1] - I'm happy to do the same job for a couple of properly hosted PPC machines. At least we'd have some way of testing on these machines, if we get stuck - even if that involved spinning up a VM and installing the stuff we needed from the command line.
Cheers, Matthew [1] http://nipy.bic.berkeley.edu/buildslaves From ralf.gommers at gmail.com Wed Feb 15 14:55:33 2017 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Thu, 16 Feb 2017 08:55:33 +1300 Subject: [Numpy-discussion] PowerPC testing servers In-Reply-To: References: Message-ID: On Thu, Feb 16, 2017 at 8:45 AM, Matthew Brett wrote: > On Wed, Feb 15, 2017 at 7:37 PM, Ralf Gommers wrote: > > On Thu, Feb 16, 2017 at 8:02 AM, Matthew Brett wrote: > >> Hey, > >> > >> A recent post to the wheel-builders mailing list pointed out some > >> links to places providing free PowerPC hosting for open source > >> projects, if they agree to a submitted request: > >> > >> https://mail.python.org/pipermail/wheel-builders/2017-February/000257.html > >> > >> It would be good to get some testing going on these architectures. > >> Shall we apply for hosting, as the numpy organization? > > > > Those are bare VMs it seems. Remembering the Buildbot and Mailman horrors, I > > think we should be very reluctant to take responsibility for maintaining > > CI on anything that's not hosted and can be controlled with a simple config > > file in our repo. > > Not sure what you mean about mailman - maybe the Enthought servers we > didn't have access to? We did have access (for most of the time), it's just that no one is interested in putting in lots of hours on sysadmin duties. > For buildbot, I've been maintaining about 12 > crappy old machines for about 7 years now [1] - I'm happy to do the > same job for a couple of properly hosted PPC machines. That's awesome persistence. The NumPy and SciPy buildbots certainly weren't maintained like that, half of them were offline or broken for long periods usually. > At least we'd > have some way of testing on these machines, if we get stuck - even if > that involved spinning up a VM and installing the stuff we needed from > the command line. I do see the value of testing on more platforms of course. It's just about logistics/responsibilities. If you're saying that you'll do the maintenance, and want to apply for resources using the NumPy name, that's much better I think than making "the numpy devs" collectively responsible. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From matthew.brett at gmail.com Wed Feb 15 14:58:00 2017 From: matthew.brett at gmail.com (Matthew Brett) Date: Wed, 15 Feb 2017 19:58:00 +0000 Subject: [Numpy-discussion] PowerPC testing servers In-Reply-To: References: Message-ID: On Wed, Feb 15, 2017 at 7:55 PM, Ralf Gommers wrote: > On Thu, Feb 16, 2017 at 8:45 AM, Matthew Brett wrote: >> On Wed, Feb 15, 2017 at 7:37 PM, Ralf Gommers wrote: >> > On Thu, Feb 16, 2017 at 8:02 AM, Matthew Brett wrote: >> >> Hey, >> >> >> >> A recent post to the wheel-builders mailing list pointed out some >> >> links to places providing free PowerPC hosting for open source >> >> projects, if they agree to a submitted request: >> >> >> >> https://mail.python.org/pipermail/wheel-builders/2017-February/000257.html >> >> >> >> It would be good to get some testing going on these architectures. >> >> Shall we apply for hosting, as the numpy organization? >> > >> > Those are bare VMs it seems. Remembering the Buildbot and Mailman horrors, I >> > think we should be very reluctant to take responsibility for maintaining >> > CI on anything that's not hosted and can be controlled with a simple config >> > file in our repo.
>> Not sure what you mean about mailman - maybe the Enthought servers we >> didn't have access to? > > We did have access (for most of the time), it's just that no one is > interested in putting in lots of hours on sysadmin duties. > >> For buildbot, I've been maintaining about 12 >> crappy old machines for about 7 years now [1] - I'm happy to do the >> same job for a couple of properly hosted PPC machines. > > That's awesome persistence. The NumPy and SciPy buildbots certainly weren't > maintained like that, half of them were offline or broken for long periods > usually. Right - they do need persistence, and to have someone who takes responsibility for them. >> At least we'd >> have some way of testing on these machines, if we get stuck - even if >> that involved spinning up a VM and installing the stuff we needed from >> the command line. > > I do see the value of testing on more platforms of course. It's just about > logistics/responsibilities. If you're saying that you'll do the maintenance, > and want to apply for resources using the NumPy name, that's much better I > think than making "the numpy devs" collectively responsible. Yes, exactly. I'm happy to take responsibility for them, I just wanted to make sure that numpy devs could get at them if I'm not around for some reason. Matthew From evgeny.burovskiy at gmail.com Wed Feb 15 15:48:04 2017 From: evgeny.burovskiy at gmail.com (Evgeni Burovski) Date: Wed, 15 Feb 2017 23:48:04 +0300 Subject: [Numpy-discussion] ANN: scipy 0.19.0 release candidate 1 Message-ID: Hi, I'm pleased to announce the availability of the first release candidate for scipy 0.19.0. It contains contributions from 120 people over the course of seven months. Please try this release and report any issues on the GitHub tracker, https://github.com/scipy/scipy, or the scipy-dev mailing list. Source tarballs and release notes are available from GitHub releases, https://github.com/scipy/scipy/releases/tag/v0.19.0rc1 Please note that this is a source-only release. We do not provide Windows binaries for this release. OS X and Linux wheels will be provided for the final release. The current release schedule is:

22 Feb : 0.19.0rc2, if needed
09 Mar : 0.19.0 final

Thanks to everyone who contributed to this release! Cheers, Evgeni A part of the release notes follows below:

==========================
SciPy 0.19.0 Release Notes
==========================

.. note:: Scipy 0.19.0 is not released yet!

.. contents::

SciPy 0.19.0 is the culmination of seven months of hard work. It contains many new features, numerous bug-fixes, improved test coverage and better documentation. There have been a number of deprecations and API changes in this release, which are documented below. All users are encouraged to upgrade to this release, as there are a large number of bug-fixes and optimizations. Moreover, our development attention will now shift to bug-fix releases on the 0.19.x branch, and on adding new features on the master branch. This release requires Python 2.7 or 3.4-3.6 and NumPy 1.8.2 or greater.

Highlights of this release include:

- A unified foreign function interface layer, `scipy.LowLevelCallable` (see the sketch below).
- Cython API for scalar, typed versions of the universal functions from the `scipy.special` module, via `cimport scipy.special.cython_special`.
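A minimal usage sketch of the new callable interface (the shared library ``myfunc.so`` and its exported function ``f`` are hypothetical stand-ins for a user-compiled callback)::

    import ctypes
    from scipy import LowLevelCallable, integrate

    # Hypothetical user-compiled library exporting:
    #     double f(int n, double *xx)
    lib = ctypes.CDLL('./myfunc.so')
    lib.f.restype = ctypes.c_double
    lib.f.argtypes = (ctypes.c_int, ctypes.POINTER(ctypes.c_double))

    # Wrap the ctypes function pointer and pass it to a scipy routine.
    func = LowLevelCallable(lib.f)
    result, abserr = integrate.quad(func, 0.0, 1.0)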
New features
============

Foreign function interface improvements
---------------------------------------

`scipy.LowLevelCallable` provides a new unified interface for wrapping low-level compiled callback functions in the Python space. It supports Cython imported "api" functions, ctypes function pointers, CFFI function pointers, ``PyCapsules``, Numba jitted functions and more. See `gh-6509 <https://github.com/scipy/scipy/pull/6509>`_ for details.

`scipy.linalg` improvements
---------------------------

The function `scipy.linalg.solve` obtained two more keywords, ``assume_a`` and ``transposed``. The underlying LAPACK routines are replaced with "expert" versions and now can also be used to solve symmetric, hermitian and positive definite coefficient matrices. Moreover, ill-conditioned matrices now cause a warning to be emitted with the estimated condition number information. The old ``sym_pos`` keyword is kept for backwards compatibility reasons; however, it is identical to using ``assume_a='pos'``. Moreover, the ``debug`` keyword, which did nothing except print the ``overwrite_`` values, is deprecated.

The function `scipy.linalg.matrix_balance` was added to perform the so-called matrix balancing using the LAPACK xGEBAL routine family. This can be used to approximately equate the row and column norms through diagonal similarity transformations.

The functions `scipy.linalg.solve_continuous_are` and `scipy.linalg.solve_discrete_are` have numerically more stable algorithms. These functions can also solve generalized algebraic matrix Riccati equations. Moreover, both gained a ``balanced`` keyword to turn balancing on and off.

`scipy.spatial` improvements
----------------------------

`scipy.spatial.SphericalVoronoi.sort_vertices_of_regions` has been re-written in Cython to improve performance.

`scipy.spatial.SphericalVoronoi` can handle > 200 k points (at least 10 million) and has improved performance.

The function `scipy.spatial.distance.directed_hausdorff` was added to calculate the directed Hausdorff distance.

The ``count_neighbors`` method of `scipy.spatial.cKDTree` gained an ability to perform weighted pair counting via the new keywords ``weights`` and ``cumulative``. See `gh-5647 <https://github.com/scipy/scipy/pull/5647>`_ for details.

`scipy.ndimage` improvements
----------------------------

The callback function C API supports PyCapsules in Python 2.7.

Multidimensional filters now allow having different extrapolation modes for different axes.

`scipy.optimize` improvements
-----------------------------

The `scipy.optimize.basinhopping` global minimizer obtained a new keyword, `seed`, which can be used to seed the random number generator and obtain repeatable minimizations.

The keyword `sigma` in `scipy.optimize.curve_fit` was overloaded to also accept the covariance matrix of errors in the data.

`scipy.signal` improvements
---------------------------

The functions `scipy.signal.correlate` and `scipy.signal.convolve` have a new optional parameter `method`. The default value of `auto` estimates the faster of the two computation methods, the direct approach and the Fourier transform approach.

A new function has been added to choose the convolution/correlation method, `scipy.signal.choose_conv_method`, which may be appropriate if convolutions or correlations are performed on many arrays of the same size.

New functions have been added to calculate complex short-time Fourier transforms of an input signal, and to invert the transform to recover the original signal: `scipy.signal.stft` and `scipy.signal.istft`.
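For example, a clean analysis/synthesis round trip recovers the input signal (a minimal sketch; the default Hann window at 50% overlap satisfies the reconstruction condition)::

    >>> import numpy as np
    >>> from scipy.signal import stft, istft
    >>> x = np.random.randn(1024)
    >>> f, t, Zxx = stft(x, fs=1.0, nperseg=256)
    >>> t2, xrec = istft(Zxx, fs=1.0, nperseg=256)
    >>> np.allclose(x, xrec[:x.size])
    True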
This implementation also fixes the previously incorrect output of `scipy.signal.spectrogram` when complex output data were requested.

The function `scipy.signal.sosfreqz` was added to compute the frequency response from second-order sections.

The function `scipy.signal.unit_impulse` was added to conveniently generate an impulse function.

The function `scipy.signal.iirnotch` was added to design second-order IIR notch filters that can be used to remove a frequency component from a signal. The dual function `scipy.signal.iirpeak` was added to compute the coefficients of a second-order IIR peak (resonant) filter.

The function `scipy.signal.minimum_phase` was added to convert linear-phase FIR filters to minimum phase.

The functions `scipy.signal.upfirdn` and `scipy.signal.resample_poly` are now substantially faster when operating on some n-dimensional arrays when n > 1. The largest reduction in computation time is realized in cases where the size of the array is small (<1k samples or so) along the axis to be filtered.

`scipy.fftpack` improvements
----------------------------

Fast Fourier transform routines now accept `np.float16` inputs and upcast them to `np.float32`. Previously, they would raise an error.

`scipy.cluster` improvements
----------------------------

Methods ``"centroid"`` and ``"median"`` of `scipy.cluster.hierarchy.linkage` have been significantly sped up. Long-standing issues with using ``linkage`` on large input data (over 16 GB) have been resolved.

`scipy.sparse` improvements
---------------------------

The functions `scipy.sparse.save_npz` and `scipy.sparse.load_npz` were added, providing simple serialization for some sparse formats.

The `prune` method of classes `bsr_matrix`, `csc_matrix`, and `csr_matrix` was updated to reallocate backing arrays under certain conditions, reducing memory usage.

The methods `argmin` and `argmax` were added to classes `coo_matrix`, `csc_matrix`, `csr_matrix`, and `bsr_matrix`.

New function `scipy.sparse.csgraph.structural_rank` computes the structural rank of a graph with a given sparsity pattern.

New function `scipy.sparse.linalg.spsolve_triangular` solves a sparse linear system with a triangular left hand side matrix.

`scipy.special` improvements
----------------------------

Scalar, typed versions of universal functions from `scipy.special` are available in the Cython space via ``cimport`` from the new module `scipy.special.cython_special`. These scalar functions can be expected to be significantly faster than the universal functions for scalar arguments. See the `scipy.special` tutorial for details.

Better control over special-function errors is offered by the functions `scipy.special.geterr` and `scipy.special.seterr` and the context manager `scipy.special.errstate`.

The names of orthogonal polynomial root functions have been changed to be consistent with other functions relating to orthogonal polynomials. For example, `scipy.special.j_roots` has been renamed `scipy.special.roots_jacobi` for consistency with the related functions `scipy.special.jacobi` and `scipy.special.eval_jacobi`. To preserve back-compatibility the old names have been left as aliases.

The Wright Omega function is implemented as `scipy.special.wrightomega`.

`scipy.stats` improvements
--------------------------

The function `scipy.stats.weightedtau` was added. It provides a weighted version of Kendall's tau.

New class `scipy.stats.multinomial` implements the multinomial distribution.
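For example (a minimal usage sketch; the counts and probabilities are arbitrary)::

    >>> from scipy.stats import multinomial
    >>> rv = multinomial(8, [0.3, 0.2, 0.5])
    >>> rv.pmf([1, 3, 4])
    0.042...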
New class `scipy.stats.rv_histogram` constructs a continuous univariate distribution with a piecewise linear CDF from a binned data sample.

New class `scipy.stats.argus` implements the Argus distribution.

`scipy.interpolate` improvements
--------------------------------

New class `scipy.interpolate.BSpline` represents splines. ``BSpline`` objects contain knots and coefficients and can evaluate the spline. The format is consistent with FITPACK, so that one can do, for example::

    >>> t, c, k = splrep(x, y, s=0)
    >>> spl = BSpline(t, c, k)
    >>> np.allclose(spl(x), y)

``spl*`` functions, `scipy.interpolate.splev`, `scipy.interpolate.splint`, `scipy.interpolate.splder` and `scipy.interpolate.splantider`, accept both ``BSpline`` objects and ``(t, c, k)`` tuples for backwards compatibility.

For multidimensional splines, ``c.ndim > 1``, ``BSpline`` objects are consistent with piecewise polynomials, `scipy.interpolate.PPoly`. This means that ``BSpline`` objects are not immediately consistent with `scipy.interpolate.splprep`, and one *cannot* do ``>>> BSpline(*splprep([x, y])[0])``. Consult the `scipy.interpolate` test suite for examples of the precise equivalence.

In new code, prefer using ``scipy.interpolate.BSpline`` objects instead of manipulating ``(t, c, k)`` tuples directly.

New function `scipy.interpolate.make_interp_spline` constructs an interpolating spline given data points and boundary conditions.

New function `scipy.interpolate.make_lsq_spline` constructs a least-squares spline approximation given data points.

`scipy.integrate` improvements
------------------------------

Now `scipy.integrate.fixed_quad` supports vector-valued functions.

Deprecated features
===================

`scipy.interpolate.splmake`, `scipy.interpolate.spleval` and `scipy.interpolate.spline` are deprecated. The format used by `splmake/spleval` was inconsistent with `splrep/splev`, which was confusing to users.

`scipy.special.errprint` is deprecated. Improved functionality is available in `scipy.special.seterr`.

Backwards incompatible changes
==============================

The deprecated ``scipy.weave`` submodule was removed.

`scipy.spatial.distance.squareform` now returns arrays of the same dtype as the input, instead of always float64.

`scipy.special.errprint` now returns a boolean.

The function `scipy.signal.find_peaks_cwt` now returns an array instead of a list.

`scipy.stats.kendalltau` now computes the correct p-value in case the input contains ties. The p-value is also identical to that computed by `scipy.stats.mstats.kendalltau` and by R. If the input does not contain ties there is no change w.r.t. the previous implementation.

The function `scipy.linalg.block_diag` will not ignore zero-sized matrices anymore. Instead it will insert rows or columns of zeros of the appropriate size. See gh-4908 for more details.

Other changes
=============

SciPy wheels will now report their dependency on ``numpy`` on all platforms. This change was made because Numpy wheels are available, and because the pip upgrade behavior is finally changing for the better (use ``--upgrade-strategy=only-if-needed`` for ``pip >= 8.2``; that behavior will become the default in the next major version of ``pip``).

Numerical values returned by `scipy.interpolate.interp1d` with ``kind="cubic"`` and ``"quadratic"`` may change relative to previous scipy versions. If your code depended on specific numeric values (i.e., on implementation details of the interpolators), you may want to double-check your results.
Authors
=======

* @endolith * Max Argus + * Hervé Audren * Alessandro Pietro Bardelli + * Michael Benfield + * Felix Berkenkamp * Matthew Brett * Per Brodtkorb * Evgeni Burovski * Pierre de Buyl * CJ Carey * Brandon Carter + * Tim Cera * Klesk Chonkin * Christian Häggström + * Luca Citi * Peadar Coyle + * Daniel da Silva + * Greg Dooper + * John Draper + * drlvk + * David Ellis + * Yu Feng * Baptiste Fontaine + * Jed Frey + * Siddhartha Gandhi + * GiggleLiu + * Wim Glenn + * Akash Goel + * Ralf Gommers * Alexander Goncearenco + * Richard Gowers + * Alex Griffing * Radoslaw Guzinski + * Charles Harris * Callum Jacob Hays + * Ian Henriksen * Randy Heydon + * Lindsey Hiltner + * Gerrit Holl + * Hiroki IKEDA + * jfinkels + * Mher Kazandjian + * Thomas Keck + * keuj6 + * Kornel Kielczewski + * Sergey B Kirpichev + * Vasily Kokorev + * Eric Larson * Denis Laxalde * Gregory R. Lee * Josh Lefler + * Julien Lhermitte + * Evan Limanto + * Nikolay Mayorov * Geordie McBain + * Josue Melka + * Matthieu Melot * michaelvmartin15 + * Surhud More + * Brett M. Morris + * Chris Mutel + * Paul Nation * Andrew Nelson * David Nicholson + * Aaron Nielsen + * Joel Nothman * nrnrk + * Juan Nunez-Iglesias * Mikhail Pak + * Gavin Parnaby + * Thomas Pingel + * Ilhan Polat + * Aman Pratik + * Sebastian Pucilowski * Ted Pudlik * puenka + * Eric Quintero * Tyler Reddy * Joscha Reimer * Antonio Horta Ribeiro + * Edward Richards + * Roman Ring + * Rafael Rossi + * Colm Ryan + * Sami Salonen + * Alvaro Sanchez-Gonzalez + * Johannes Schmitz * Kari Schoonbee * Yurii Shevchuk + * Jonathan Siebert + * Jonathan Tammo Siebert + * Scott Sievert + * Sourav Singh * Byron Smith + * Srikiran + * Samuel St-Jean + * Yoni Teitelbaum + * Bhavika Tekwani * Martin Thoma * timbalam + * Svend Vanderveken + * Sebastiano Vigna + * Aditya Vijaykumar + * Santi Villalba + * Ze Vinicius * Pauli Virtanen * Matteo Visconti * Yusuke Watanabe + * Warren Weckesser * Phillip Weinberg + * Nils Werner * Jakub Wilk * Josh Wilson * wirew0rm + * David Wolever + * Nathan Woods * ybeltukov + * G Young * Evgeny Zhurko +

A total of 120 people contributed to this release. People with a "+" by their names contributed a patch for the first time. This list of names is automatically generated, and may not be fully complete. From larsson at cs.uchicago.edu Wed Feb 15 16:48:34 2017 From: larsson at cs.uchicago.edu (Gustav Larsson) Date: Wed, 15 Feb 2017 13:48:34 -0800 Subject: [Numpy-discussion] Proposal to support __format__ In-Reply-To: References: Message-ID: > This is great! Thanks! Glad to be met by enthusiasm about this. 1. You basically have a NEP already! Making a PR from it allows to > give line-by-line comments, so would help! I will do this soon. 2. Don't worry about supporting python2 specifics; just try to ensure > it doesn't break; I would not say more about it! Sounds good to me. 3. On `set_printoptions` -- ideally, it will become possible to use > this as a context (i.e., `with set_printoption(...)`). It might make > sense to have an `override_format` keyword argument to it. Having a `with np.printoptions(...)` context manager is a great idea. It does sound orthogonal to __format__ though, so it could be addressed separately. 4. Otherwise, my main suggestion is to start small with the more > obvious ones, and not worry too much about format validation, but > rather about getting the simple ones to work well (e.g., for an object > array, just apply the format given; if it doesn't work, it will error > out on its own, which is OK). Sounds good to me.
I was thinking of approaching the implementation by writing unit tests first and group them into different priority tiers. That way, the unit tests can go through another review before implementation gets going. I agree that __format__ doesn't have to check format validation if a ValueError is going to be raised anyway by sub-calls. 5. One bit of detail: the "g" one does confuse me. I will re-write this a bit to make it clearer. Basically, the 'g' with the mix of 'e'/'f' depending on max/min>1000 is all from the current numpy behavior, so it is not something I had much creative input on at all. Although, as it is written right now it may seem so. That is, the goal is to have {:} == {:g} for float arrays, analogous to how {:} == {:g} for built-in floats. Then, if the user departs a bit, like {:.2g}, it will simply be identical to calling np.set_printoptions(precision=2) first. Gustav On Wed, Feb 15, 2017 at 8:03 AM, Marten van Kerkwijk < m.h.vankerkwijk at gmail.com> wrote: > Hi Gustav, > > This is great! A few quick comments (mostly echo-ing Stephan's). > > 1. You basically have a NEP already! Making a PR from it allows to > give line-by-line comments, so would help! > > 2. Don't worry about supporting python2 specifics; just try to ensure > it doesn't break; I would not say more about it! > > 3. On `set_printoptions` -- ideally, it will become possible to use > this as a context (i.e., `with set_printoption(...)`). It might make > sense to have an `override_format` keyword argument to it. > > 4. Otherwise, my main suggestion is to start small with the more > obvious ones, and not worry too much about format validation, but > rather about getting the simple ones to work well (e.g., for an object > array, just apply the format given; if it doesn't work, it will error > out on its own, which is OK). > > 5. One bit of detail: the "g" one does confuse me. > > All the best, > > Marten > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ilhanpolat at gmail.com Wed Feb 15 17:05:00 2017 From: ilhanpolat at gmail.com (Ilhan Polat) Date: Wed, 15 Feb 2017 23:05:00 +0100 Subject: [Numpy-discussion] Proposal to support __format__ In-Reply-To: References: Message-ID: On the last item, do we really have to follow that strange, `d`,`g` and so on conventions on formatting? With all respect to the humongous historical baggage, I think that notation is pretty archaic and terminal like. If being pythonic is of a concern here, maybe it is better to use a more verbose syntax. Just throwing out an idea after 15 seconds of thought (so by no means an alternative suggestion) eng:6i5d -> engineering notation (always powers of ten of multiples of 3) 6 integral digits and 5 decimal digits. float (whatever the default is) float:4i2d (you get the idea) etc. FULL DISCLOSURE: I am a very displeased customer of `fprintf ` of matlab (and others) and this archaic formatting. I never got a hang of it so it might be the case that I don't quite get the rationale behind it and I almost always get it wrong. Maybe at least the rationale can be clarified. Lastly, repeating what others mentioned: thank you for this well prepared initiative On Wed, Feb 15, 2017 at 10:48 PM, Gustav Larsson wrote: > This is great! > > > Thanks! Glad to be met by enthusiasm about this. > > 1. You basically have a NEP already! 
Making a PR from it allows to >> give line-by-line comments, so would help! > > > I will do this soon. > > 2. Don't worry about supporting python2 specifics; just try to ensure >> it doesn't break; I would not say more about it! > > > Sounds good to me. > > 3. On `set_printoptions` -- ideally, it will become possible to use >> this as a context (i.e., `with set_printoption(...)`). It might make >> sense to have an `override_format` keyword argument to it. > > > Having a `with np.printoptions(...)` context manager is a great idea. It > does sound orthogonal to __format__ though, so it could be addressed > separately. > > 4. Otherwise, my main suggestion is to start small with the more >> obvious ones, and not worry too much about format validation, but >> rather about getting the simple ones to work well (e.g., for an object >> array, just apply the format given; if it doesn't work, it will error >> out on its own, which is OK). > > > Sounds good to me. I was thinking of approaching the implementation by > writing unit tests first and group them into different priority tiers. That > way, the unit tests can go through another review before implementation > gets going. I agree that __format__ doesn't have to check format validation > if a ValueError is going to be raised anyway by sub-calls. > > 5. One bit of detail: the "g" one does confuse me. > > > I will re-write this a bit to make it clearer. Basically, the 'g' with the > mix of 'e'/'f' depending on max/min>1000 is all from the current numpy > behavior, so it is not something I had much creative input on at all. > Although, as it is written right now it may seem so. That is, the goal is > to have {:} == {:g} for float arrays, analogous to how {:} == {:g} for > built-in floats. Then, if the user departs a bit, like {:.2g}, it will > simply be identical to calling np.set_printoptions(precision=2) first. > > Gustav > > On Wed, Feb 15, 2017 at 8:03 AM, Marten van Kerkwijk < > m.h.vankerkwijk at gmail.com> wrote: > >> Hi Gustav, >> >> This is great! A few quick comments (mostly echo-ing Stephan's). >> >> 1. You basically have a NEP already! Making a PR from it allows to >> give line-by-line comments, so would help! >> >> 2. Don't worry about supporting python2 specifics; just try to ensure >> it doesn't break; I would not say more about it! >> >> 3. On `set_printoptions` -- ideally, it will become possible to use >> this as a context (i.e., `with set_printoption(...)`). It might make >> sense to have an `override_format` keyword argument to it. >> >> 4. Otherwise, my main suggestion is to start small with the more >> obvious ones, and not worry too much about format validation, but >> rather about getting the simple ones to work well (e.g., for an object >> array, just apply the format given; if it doesn't work, it will error >> out on its own, which is OK). >> >> 5. One bit of detail: the "g" one does confuse me. >> >> All the best, >> >> Marten >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> https://mail.scipy.org/mailman/listinfo/numpy-discussion >> > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From nathan12343 at gmail.com Wed Feb 15 17:14:42 2017 From: nathan12343 at gmail.com (Nathan Goldbaum) Date: Wed, 15 Feb 2017 16:14:42 -0600 Subject: [Numpy-discussion] Proposal to support __format__ In-Reply-To: References: Message-ID: On Wed, Feb 15, 2017 at 4:05 PM, Ilhan Polat wrote: > On the last item, do we really have to follow that strange, `d`,`g` and so > on conventions on formatting? With all respect to the humongous historical > baggage, I think that notation is pretty archaic and terminal like. If > being pythonic is of a concern here, maybe it is better to use a more > verbose syntax. Just throwing out an idea after 15 seconds of thought (so > by no means an alternative suggestion) > > eng:6i5d -> engineering notation (always powers of ten of multiples of 3) > 6 integral digits and 5 decimal digits. > float (whatever the default is) > float:4i2d (you get the idea) > > etc. > > While I agree with you that printf format codes are arcane, unfortunately they need to be used here since they are supported by Python: https://docs.python.org/3.1/library/string.html#formatspec > > FULL DISCLOSURE: I am a very displeased customer of `fprintf ` of matlab > (and others) and this archaic formatting. I never got a hang of it so it > might be the case that I don't quite get the rationale behind it and I > almost always get it wrong. Maybe at least the rationale can be clarified. > > > Lastly, repeating what others mentioned: thank you for this well prepared > initiative > > > > > On Wed, Feb 15, 2017 at 10:48 PM, Gustav Larsson > wrote: > >> This is great! >> >> >> Thanks! Glad to be met by enthusiasm about this. >> >> 1. You basically have a NEP already! Making a PR from it allows to >>> give line-by-line comments, so would help! >> >> >> I will do this soon. >> >> 2. Don't worry about supporting python2 specifics; just try to ensure >>> it doesn't break; I would not say more about it! >> >> >> Sounds good to me. >> >> 3. On `set_printoptions` -- ideally, it will become possible to use >>> this as a context (i.e., `with set_printoption(...)`). It might make >>> sense to have an `override_format` keyword argument to it. >> >> >> Having a `with np.printoptions(...)` context manager is a great idea. It >> does sound orthogonal to __format__ though, so it could be addressed >> separately. >> >> 4. Otherwise, my main suggestion is to start small with the more >>> obvious ones, and not worry too much about format validation, but >>> rather about getting the simple ones to work well (e.g., for an object >>> array, just apply the format given; if it doesn't work, it will error >>> out on its own, which is OK). >> >> >> Sounds good to me. I was thinking of approaching the implementation by >> writing unit tests first and group them into different priority tiers. That >> way, the unit tests can go through another review before implementation >> gets going. I agree that __format__ doesn't have to check format validation >> if a ValueError is going to be raised anyway by sub-calls. >> >> 5. One bit of detail: the "g" one does confuse me. >> >> >> I will re-write this a bit to make it clearer. Basically, the 'g' with >> the mix of 'e'/'f' depending on max/min>1000 is all from the current numpy >> behavior, so it is not something I had much creative input on at all. >> Although, as it is written right now it may seem so. That is, the goal is >> to have {:} == {:g} for float arrays, analogous to how {:} == {:g} for >> built-in floats. 
Then, if the user departs a bit, like {:.2g}, it will >> simply be identical to calling np.set_printoptions(precision=2) first. >> >> Gustav >> >> On Wed, Feb 15, 2017 at 8:03 AM, Marten van Kerkwijk < >> m.h.vankerkwijk at gmail.com> wrote: >> >>> Hi Gustav, >>> >>> This is great! A few quick comments (mostly echo-ing Stephan's). >>> >>> 1. You basically have a NEP already! Making a PR from it allows to >>> give line-by-line comments, so would help! >>> >>> 2. Don't worry about supporting python2 specifics; just try to ensure >>> it doesn't break; I would not say more about it! >>> >>> 3. On `set_printoptions` -- ideally, it will become possible to use >>> this as a context (i.e., `with set_printoption(...)`). It might make >>> sense to have an `override_format` keyword argument to it. >>> >>> 4. Otherwise, my main suggestion is to start small with the more >>> obvious ones, and not worry too much about format validation, but >>> rather about getting the simple ones to work well (e.g., for an object >>> array, just apply the format given; if it doesn't work, it will error >>> out on its own, which is OK). >>> >>> 5. One bit of detail: the "g" one does confuse me. >>> >>> All the best, >>> >>> Marten >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at scipy.org >>> https://mail.scipy.org/mailman/listinfo/numpy-discussion >>> >> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> https://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From amit at haystackapp.net Wed Feb 15 17:16:43 2017 From: amit at haystackapp.net (Amit Bhosle) Date: Wed, 15 Feb 2017 14:16:43 -0800 Subject: [Numpy-discussion] ImportError: Importing the multiarray numpy extension module failed In-Reply-To: References: Message-ID: Hi Nathan, Thanks for the quick response. Yeah - looks like the Google app engine supports only 1.6.1.. Reverting to that version has fixed this issue. Thanks AB On Feb 14, 2017 21:01, "Nathaniel Smith" wrote: > On Tue, Feb 14, 2017 at 8:24 PM, Amit Bhosle wrote: > > Hi, > > > > I'm struggling with a numpy issue and web search hasn't helped. I'm on > > windows 10, and using Python27. > > > > I've tried reinstalling numpy, and also a few different versions, but > > without any luck. > > > > numpy was pulled in as dependency of timezonefinder==1.5.7 that i need, > and > > the numpy-1.12.0.dist-info distribution was installed.. > > > > The error on my google-app-engine server's console is as below.. > > Can someone pls help? > > Are you using the app engine "standard environment"? That's a very > weird Python environment that forbids the installation of all packages > that contain C code. This obviously includes numpy, and would explain > your error. They do provide a pre-installed super-ancient version of > numpy with some features removed, which might work for you if you > force-uninstall numpy. Otherwise you might need to switch to the > "flexible environment". > > -n > > -- > Nathaniel J. 
Smith -- https://vorpus.org > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > -------------- next part -------------- An HTML attachment was scrubbed... URL: From morph at debian.org Wed Feb 15 21:53:42 2017 From: morph at debian.org (Sandro Tosi) Date: Wed, 15 Feb 2017 21:53:42 -0500 Subject: [Numpy-discussion] PowerPC testing servers In-Reply-To: References: Message-ID: > A recent post to the wheel-builders mailing list pointed out some > links to places providing free PowerPC hosting for open source > projects, if they agree to a submitted request: The debian project has some powerpc machines (and we still build numpy on those boxes when i upload a new revision to our archives) and they also have hosts dedicated to let debian developers login and debug issues with their packages on that architecture. I can sponsor access to those machines for some of you, but it is not a place where you can host a CI instance. Just keep it in mind more broadly than powerpc, f.e. these are all the archs where numpy was built after the last upload https://buildd.debian.org/status/package.php?p=python-numpy&suite=unstable (the grayed out archs are the ones non release critical, so packages are built as best effort and if missing is not a big deal) -- Sandro "morph" Tosi My website: http://sandrotosi.me/ Me at Debian: http://wiki.debian.org/SandroTosi G+: https://plus.google.com/u/0/+SandroTosi From ralf.gommers at gmail.com Thu Feb 16 03:50:04 2017 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Thu, 16 Feb 2017 21:50:04 +1300 Subject: [Numpy-discussion] PowerPC testing servers In-Reply-To: References: Message-ID: On Thu, Feb 16, 2017 at 8:58 AM, Matthew Brett wrote: > On Wed, Feb 15, 2017 at 7:55 PM, Ralf Gommers > wrote: > > > > > > On Thu, Feb 16, 2017 at 8:45 AM, Matthew Brett > > wrote: > >> > >> On Wed, Feb 15, 2017 at 7:37 PM, Ralf Gommers > >> wrote: > >> > > >> > > >> > On Thu, Feb 16, 2017 at 8:02 AM, Matthew Brett < > matthew.brett at gmail.com> > >> > wrote: > >> >> > >> >> Hey, > >> >> > >> >> A recent post to the wheel-builders mailing list pointed out some > >> >> links to places providing free PowerPC hosting for open source > >> >> projects, if they agree to a submitted request: > >> >> > >> >> > >> >> https://mail.python.org/pipermail/wheel-builders/2017- > February/000257.html > >> >> > >> >> It would be good to get some testing going on these architectures. > >> >> Shall we apply for hosting, as the numpy organization? > >> > > >> > > >> > Those are bare VMs it seems. Remembering the Buildbot and Mailman > >> > horrors, I > >> > think we should be very reluctant to taking responsibility for > >> > maintaining > >> > CI on anything that's not hosted and can be controlled with a simple > >> > config > >> > file in our repo. > >> > >> Not sure what you mean about mailman - maybe the Enthought servers we > >> didn't have access to? > > > > > > We did have access (for most of the time), it's just that no one is > > interested in putting in lots of hours on sysadmin duties. > > > >> > >> For buildbot, I've been maintaining about 12 > >> crappy old machines for about 7 years now [1] - I'm happy to do the > >> same job for a couple of properly hosted PPC machines. > > > > > > That's awesome persistence. The NumPy and SciPy buildbots certainly > weren't > > maintained like that, half of them were offline or broken for long > periods > > usually. 
> > Right - they do need persistence, and to have someone who takes > responsibility for them. > > >> > >> At least we'd > >> have some way of testing for these machines, if we get stuck - even if > >> that involved spinning up a VM and installing the stuff we needed from > >> the command line. > > > > > > I do see the value of testing on more platforms of course. It's just > about > > logistics/responsibilities. If you're saying that you'll do the > maintenance, > > and want to apply for resources using the NumPy name, that's much better > I > > think then making "the numpy devs" collectively responsible. > > Yes, exactly. I'm happy to take responsibility for them, I just > wanted to make sure that numpy devs could get at them if I'm not > around for some reason. > In that case, +1 from me! Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Thu Feb 16 03:55:36 2017 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Thu, 16 Feb 2017 21:55:36 +1300 Subject: [Numpy-discussion] PowerPC testing servers In-Reply-To: References: Message-ID: On Thu, Feb 16, 2017 at 3:53 PM, Sandro Tosi wrote: > > A recent post to the wheel-builders mailing list pointed out some > > links to places providing free PowerPC hosting for open source > > projects, if they agree to a submitted request: > > The debian project has some powerpc machines (and we still build numpy > on those boxes when i upload a new revision to our archives) and they > also have hosts dedicated to let debian developers login and debug > issues with their packages on that architecture. I can sponsor access > to those machines for some of you, but it is not a place where you can > host a CI instance. > > Just keep it in mind more broadly than powerpc, f.e. these are all the > archs where numpy was built after the last upload > https://buildd.debian.org/status/package.php?p=python-numpy&suite=unstable > (the grayed out archs are the ones non release critical, so packages > are built as best effort and if missing is not a big deal) Thanks Sandro. It looks like even for the release-critical ones, it's just the build that has to succeed and failures are not detected? For example, armel is green but has 9 failures: https://buildd.debian.org/status/fetch.php?pkg=python-numpy&arch=armel&ver=1%3A1.12.0-2&stamp=1484889563&raw=0 Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From josef.pktd at gmail.com Thu Feb 16 12:52:16 2017 From: josef.pktd at gmail.com (josef.pktd at gmail.com) Date: Thu, 16 Feb 2017 12:52:16 -0500 Subject: [Numpy-discussion] PowerPC testing servers In-Reply-To: References: Message-ID: On Thu, Feb 16, 2017 at 3:55 AM, Ralf Gommers wrote: > > > On Thu, Feb 16, 2017 at 3:53 PM, Sandro Tosi wrote: > >> > A recent post to the wheel-builders mailing list pointed out some >> > links to places providing free PowerPC hosting for open source >> > projects, if they agree to a submitted request: >> >> The debian project has some powerpc machines (and we still build numpy >> on those boxes when i upload a new revision to our archives) and they >> also have hosts dedicated to let debian developers login and debug >> issues with their packages on that architecture. I can sponsor access >> to those machines for some of you, but it is not a place where you can >> host a CI instance. >> >> Just keep it in mind more broadly than powerpc, f.e. 
these are all the
>> archs where numpy was built after the last upload
>> https://buildd.debian.org/status/package.php?p=python-numpy&suite=unstable
>> (the grayed out archs are the ones non release critical, so packages
>> are built as best effort and if missing is not a big deal)
>
> Thanks Sandro. It looks like even for the release-critical ones, it's just
> the build that has to succeed and failures are not detected? For example,
> armel is green but has 9 failures:
> https://buildd.debian.org/status/fetch.php?pkg=python-numpy&arch=armel&ver=1%3A1.12.0-2&stamp=1484889563&raw=0
>
> Ralf
>

More general questions on this:

Are there any overviews of which packages in the python-for-science or
python-for-data-analysis areas work correctly on different platforms?
Are there any platforms/processors, besides the standard x32/x64, where
this is important?

For example, for statsmodels: in early releases of statsmodels, maybe 5
to 7 years ago, Yarik and I were still debugging problems on several
machines like ppc and s390x during Debian testing. Since then I haven't
heard much about specific problems.

The current status for statsmodels on Debian machines is pretty mixed.
On several of them some dependencies are not available, and in some
cases we have errors that might be caused by errors in dependencies,
e.g. cvxopt. The ppc64el test run for statsmodels has a large number of
failures, but checking scipy, it looks like it's also not working
properly:
https://buildd.debian.org/status/fetch.php?pkg=python-scipy&arch=ppc64el&ver=0.18.1-2&stamp=1477075663&raw=0
In those cases it would be impossible to start debugging if we had to
debug through the entire dependency chain.

CI testing for Windows, Apple and Linux, mainly on x64, seems to be
working pretty well, with some delays while version incompatibilities
are fixed. But anything that is not in a CI testing setup looks pretty
random to me.

(I'm mainly curious what the status of those machines is. I'm not
really eager to create more debugging work, but sometimes failures on a
machine point to code that is "fragile".)

Josef


>
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> https://mail.scipy.org/mailman/listinfo/numpy-discussion
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From morph at debian.org  Thu Feb 16 13:52:50 2017
From: morph at debian.org (Sandro Tosi)
Date: Thu, 16 Feb 2017 13:52:50 -0500
Subject: [Numpy-discussion] PowerPC testing servers
In-Reply-To: 
References: 
Message-ID: 

On Thu, Feb 16, 2017 at 3:55 AM, Ralf Gommers  wrote:
> Thanks Sandro. It looks like even for the release-critical ones, it's just
> the build that has to succeed and failures are not detected? For example,
> armel is green but has 9 failures:
> https://buildd.debian.org/status/fetch.php?pkg=python-numpy&arch=armel&ver=1%3A1.12.0-2&stamp=1484889563&raw=0

I made errors in the test suite non-fatal so that we could collect
the errors and then report them back.
Sadly, I'm currently lacking the time to report all the errors on those
archs; I will try to get to that soon.

--
Sandro "morph" Tosi
My website: http://sandrotosi.me/
Me at Debian: http://wiki.debian.org/SandroTosi
G+: https://plus.google.com/u/0/+SandroTosi

From robbmcleod at gmail.com  Fri Feb 17 06:15:05 2017
From: robbmcleod at gmail.com (Robert McLeod)
Date: Fri, 17 Feb 2017 12:15:05 +0100
Subject: [Numpy-discussion] ANN: NumExpr3 Alpha
Message-ID: 

Hi everyone,

I'm pleased to announce that a new branch of NumExpr has been developed
that will hopefully lead to a new major version release in the future.
You can find the branch on the PyData GitHub repository, and
installation is as follows:

    git clone https://github.com/pydata/numexpr.git
    cd numexpr
    git checkout numexpr-3.0
    python setup.py install

What's new?
==========

Faster
---------

The operations were re-written in such a way that gcc can auto-vectorize
the loops to use SIMD instructions. Each operation now has a strided and
an aligned branch, which improves performance on aligned arrays by
~ 40 %. The setup time for threads has been reduced by removing an
unnecessary abstraction layer, and various other minor refactorings have
improved thread scaling.

The combination of speed-ups means that NumExpr3 often runs 200-500 %
faster than NumExpr2.6 on a machine with AVX2 support. The break-even
point with NumPy is now roughly 64k-element arrays, compared to
256-512k elements for NE2.

Plots of comparative performance for NumPy versus NE2 versus NE3 over a
range of array sizes are available at:

http://entropyproduction.blogspot.ch/2017/02/introduction-to-numexpr-3-alpha.html

More NumPy Datatypes
--------------------------------

The program was refactored from an ASCII-encoded byte code to a struct
array, so that the operation space is now 65535 instead of 128. As such,
support for uint8, int8, uint16, int16, uint32, uint64, and complex64
data types was added.

NumExpr3 now uses NumPy 'safe' casting rules. If an operation doesn't
return the same result as NumPy, it's a bug. In the future other casting
styles will be added if there is a demand for them.


More complete function set
------------------------------------

With the enhanced operation space, almost the entire C++11 cmath
function set is supported (where the compiler library has them; only C99
is expected). Bitwise operations were also added for all integer
datatypes. There are now 436 operations/functions in NE3, with more to
come, compared to 190 in NE2.

A library enum has also been added to the op keys, which allows multiple
backend libraries to be linked to the interpreter and changed on a
per-expression basis, rather than picking between GNU std and Intel VML
at compile time, for example.


More complete Python language support
------------------------------------------------------

The Python compiler was re-written from scratch to use the CPython `ast`
module and a functional programming approach. As such, NE3 now compiles
a wider subset of the Python language. It supports multi-line evaluation
and assignment with named temporaries. The new compiler spends
considerably less time in Python to compile expressions, about 200 us
for 'a*b' compared to 550 us for NE2.
Compare for example:

    out_ne2 = ne2.evaluate( 'exp( -sin(2*a**2) - cos(2*b**2) - 2*a**2*b**2 )' )

to:

    neObj = NumExpr( '''a2 = a*a; b2 = b*b
    out_magic = exp( -sin(2*a2) - cos(2*b2) - 2*a2*b2 )''' )

This is a contrived example, but the multi-line approach will allow for
cleaner code and more sophisticated algorithms to be encapsulated in a
single NumExpr call. The convention is that intermediate assignment
targets are named temporaries if they do not exist in the calling frame,
and full assignment targets if they do, which provides a method for
multiple returns. Single-level de-referencing (e.g. `self.data`) is also
supported for increased convenience and cleaner code. Slicing still
needs to be performed above the ne3.evaluate() or ne3.NumExpr() call.


More maintainable
-------------------------

The code base was generally refactored to increase the prevalence of
single-point declarations, such that modifications don't require
extensive knowledge of the code. In NE2 a lot of code was generated by
the pre-processor using nested #defines. That has been replaced by an
object-oriented Python code generator called by setup.py, which
generates about 15k lines of C code with 1k lines of Python. The use of
generated code with defined line numbers makes debugging threaded code
simpler.

The generator also builds the autotest portion of the test submodule,
for checking equivalence between NumPy and NumExpr3 operations and
functions.


What's TODO compared to NE2?
------------------------------------------

* strided complex functions
* Intel VML support (less necessary now with gcc auto-vectorization)
* bytes and unicode support
* reductions (mean, sum, prod, std)


What I'm looking for feedback on
--------------------------------------------

* String arrays: How do you use them? How would unicode differ from
  bytes strings?
* Interface: We now have a more object-oriented interface underneath the
  familiar evaluate() interface. How would you like to use this
  interface? Francesc suggested generator support, as currently it's
  more difficult to use NumExpr within a loop than it should be.


Ideas for the future
-------------------------

* vectorize real functions (such as exp, sqrt, log) similar to the
  complex_functions.hpp vectorization.
* Add a keyword (likely 'yield') to indicate that a token is intended to
  be changed by a generator inside a loop with each call to NumExpr.run()

If you have any thoughts or find any issues, please don't hesitate to
open an issue at the GitHub repo. Although unit tests have been run over
the operation space, there are undoubtedly a number of bugs to squash.

Sincerely,

Robert

--
Robert McLeod, Ph.D.
Center for Cellular Imaging and Nano Analytics (C-CINA)
Biozentrum der Universität Basel
Mattenstrasse 26, 4058 Basel
Work: +41.061.387.3225
robert.mcleod at unibas.ch
robert.mcleod at bsse.ethz.ch
robbmcleod at gmail.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From faltet at gmail.com  Fri Feb 17 07:13:00 2017
From: faltet at gmail.com (Francesc Alted)
Date: Fri, 17 Feb 2017 13:13:00 +0100
Subject: [Numpy-discussion] ANN: NumExpr3 Alpha
In-Reply-To: 
References: 
Message-ID: 

Yay! This looks really exciting. Thanks for all the hard work!

Francesc

2017-02-17 12:15 GMT+01:00 Robert McLeod :

> Hi everyone,
>
> I'm pleased to announce that a new branch of NumExpr has been developed
> that will hopefully lead to a new major version release in the future.
You > can find the branch on the PyData github repository, and installation is as > follows: > > git clone https://github.com/pydata/numexpr.git > cd numexpr > git checkout numexpr-3.0 > python setup.py install > > What's new? > ========== > > Faster > --------- > > The operations were re-written in such a way that gcc can auto-vectorize > the loops to use SIMD instructions. Each operation now has a strided and > aligned branch, which improves performance on aligned arrays by ~ 40 %. The > setup time for threads has been reduced, by removing an unnecessary > abstraction layer, and various other minor re-factorizations, resulting in > improved thread scaling. > > The combination of speed-ups means that NumExpr3 often runs 200-500 % > faster than NumExpr2.6 on a machine with AVX2 support. The break-even point > with NumPy is now roughly arrays with 64k-elements, compared to > 256-512k-elements for NE2. > > Plot of comparative performance for NumPy versus NE2 versus NE3 over a > range of array sizes are available at: > > http://entropyproduction.blogspot.ch/2017/02/introduction- > to-numexpr-3-alpha.html > > More NumPy Datatypes > -------------------------------- > > The program was re-factorized from a ascii-encoded byte code to a struct > array, so that the operation space is now 65535 instead of 128. As such, > support for uint8, int8, uint16, int16, uint32, uint64, and complex64 data > types was added. > > NumExpr3 now uses NumPy 'safe' casting rules. If an operation doesn't > return the same result as NumPy, it's a bug. In the future other casting > styles will be added if there is a demand for them. > > > More complete function set > ------------------------------------ > > With the enhanced operation space, almost the entire C++11 cmath function > set is supported (if the compiler library has them; only C99 is expected). > Also bitwise operations were added for all integer datatypes. There are now > 436 operations/functions in NE3, with more to come, compared to 190 in NE2. > > Also a library-enum has been added to the op keys which allows multiple > backend libraries to be linked to the interpreter, and changed on a > per-expression basis, rather than picking between GNU std and Intel VML at > compile time, for example. > > > More complete Python language support > ------------------------------------------------------ > > The Python compiler was re-written from scratch to use the CPython `ast` > module and a functional programming approach. As such, NE3 now compiles a > wider subset of the Python language. It supports multi-line evaluation, and > assignment with named temporaries. The new compiler spends considerably > less time in Python to compile expressions, about 200 us for 'a*b' compared > to 550 us for NE2. > > Compare for example: > > out_ne2 = ne2.evaluate( 'exp( -sin(2*a**2) - cos(2*b**2) - > 2*a**2*b**2' ) > > to: > > neObj = NumExp( '''a2 = a*a; b2 = b*b > out_magic = exp( -sin(2*a2) - cos(2*b2) - 2*a2*b2''' ) > > This is a contrived example but the multi-line approach will allow for > cleaner code and more sophisticated algorithms to be encapsulated in a > single NumExpr call. The convention is that intermediate assignment targets > are named temporaries if they do not exist in the calling frame, and full > assignment targets if they do, which provides a method for multiple > returns. Single-level de-referencing (e.g. `self.data`) is also supported > for increased convenience and cleaner code. 
Slicing still needs to be > performed above the ne3.evaluate() or ne3.NumExpr() call. > > > More maintainable > ------------------------- > > The code base was generally refactored to increase the prevalence of > single-point declarations, such that modifications don't require extensive > knowledge of the code. In NE2 a lot of code was generated by the > pre-processor using nested #defines. That has been replaced by a > object-oriented Python code generator called by setup.py, which generates > about 15k lines of C code with 1k lines of Python. The use of generated > code with defined line numbers makes debugging threaded code simpler. > > The generator also builds the autotest portion of the test submodule, for > checking equivalence between NumPy and NumExpr3 operations and functions. > > > What's TODO compared to NE2? > ------------------------------------------ > > * strided complex functions > * Intel VML support (less necessary now with gcc auto-vectorization) > * bytes and unicode support > * reductions (mean, sum, prod, std) > > > What I'm looking for feedback on > -------------------------------------------- > > * String arrays: How do you use them? How would unicode differ from bytes > strings? > * Interface: We now have a more object-oriented interface underneath the > familiar > evaluate() interface. How would you like to use this interface? > Francesc suggested > generator support, as currently it's more difficult to use NumExpr > within a loop than > it should be. > > > Ideas for the future > ------------------------- > > * vectorize real functions (such as exp, sqrt, log) similar to the > complex_functions.hpp vectorization. > * Add a keyword (likely 'yield') to indicate that a token is intended to > be changed by a generator inside a loop with each call to NumExpr.run() > > If you have any thoughts or find any issues please don't hesitate to open > an issue at the Github repo. Although unit tests have been run over the > operation space there are undoubtedly a number of bugs to squash. > > Sincerely, > > Robert > > -- > Robert McLeod, Ph.D. > Center for Cellular Imaging and Nano Analytics (C-CINA) > Biozentrum der Universit?t Basel > Mattenstrasse 26, 4058 Basel > Work: +41.061.387.3225 <061%20387%2032%2025> > robert.mcleod at unibas.ch > robert.mcleod at bsse.ethz.ch > robbmcleod at gmail.com > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- Francesc Alted -------------- next part -------------- An HTML attachment was scrubbed... URL: From davidmenhur at gmail.com Fri Feb 17 10:34:11 2017 From: davidmenhur at gmail.com (=?UTF-8?B?RGHPgGlk?=) Date: Fri, 17 Feb 2017 16:34:11 +0100 Subject: [Numpy-discussion] ANN: NumExpr3 Alpha In-Reply-To: References: Message-ID: This is very nice indeed! On 17 February 2017 at 12:15, Robert McLeod wrote: > * bytes and unicode support > * reductions (mean, sum, prod, std) I use both a lot, maybe I can help you get them working. Also, regarding "Vectorization hasn't been done yet with cmath functions for real numbers (such as sqrt(), exp(), etc.), only for complex functions". What is the bottleneck? Is it in GCC or just someone has to sit down and adapt it? 
From pierre.schnizer at helmholtz-berlin.de Fri Feb 17 10:54:56 2017 From: pierre.schnizer at helmholtz-berlin.de (Schnizer, Pierre) Date: Fri, 17 Feb 2017 15:54:56 +0000 Subject: [Numpy-discussion] Building external c modules with mingw64 / numpy In-Reply-To: References: <243DBD016692E54EB12F37B87C66E70E815DB8@didag1> Message-ID: <243DBD016692E54EB12F37B87C66E70E819AB4@didag1> Dear Ralf, I made some further improvements as one problem was related to my setup file. I will use numpy git repository to cross check it and then report again. Sincerely yours Pierre Von: NumPy-Discussion [mailto:numpy-discussion-bounces at scipy.org] Im Auftrag von Ralf Gommers Gesendet: Dienstag, 14. Februar 2017 11:00 An: Discussion of Numerical Python Betreff: Re: [Numpy-discussion] Building external c modules with mingw64 / numpy On Sat, Jan 21, 2017 at 9:23 PM, Schnizer, Pierre > wrote: Dear all, I built an external c-module (pygsl) using mingw 64 from msys2 mingw64-gcc compiler. This built required some changes to numpy.distutils to get the ?python setup.py config? and ?python setup.py build? working. In this process I replaced 2 files in numpy.distutils from numpy git repository: - numpy.dist_utils.misc_utils.py version ec0e046 on 14 Dec 2016 - numpy.dist_utils. mingw32ccompiler.py version ec0e046 on 14 Dec 2016 mingw32ccompiler.py required to be modified to get it work ? preprocessor had to be defined as I am using setup.py config ? specifying the runtime library search path to the linker ? include path of the vcrtruntime I attached a patch reflecting the changes I had to make to file mingw32ccompile.py If this information is useful I am happy to answer questions Thanks for the patch Pierre. For future reference: a pull request on GitHub or a link to a Gist is preferred for us and usually gets you a response quicker. Regarding your question in the patch on including Python's install directory: that shouldn't be necessary, and I'd be wary of applying your patch without understanding why the current numpy.distutils code doesn't work for you. But if your patch works for you then it can't hurt I think. Cheers, Ralf Sincerely yours Pierre PS Version infos: Python: Python 3.6.0 (v3.6.0:41df79263a11, Dec 23 2016, 08:06:12) [MSC v.1900 64 bit (AMD64)] on win32 Numpy: >> help(numpy.version) Help on module numpy.version in numpy: DATA full_version = '1.12.0' git_revision = '561f1accf861ad8606ea2dd723d2be2b09a2dffa' release = True short_version = '1.12.0' version = '1.12.0' gcc.exe (Rev2, Built by MSYS2 project) 6.2.0 ________________________________ Helmholtz-Zentrum Berlin f?r Materialien und Energie GmbH Mitglied der Hermann von Helmholtz-Gemeinschaft Deutscher Forschungszentren e.V. Aufsichtsrat: Vorsitzender Dr. Karl Eugen Huthmacher, stv. Vorsitzende Dr. Jutta Koch-Unterseher Gesch?ftsf?hrung: Prof. Dr. Anke Rita Kaysser-Pyzalla, Thomas Frederking Sitz Berlin, AG Charlottenburg, 89 HRB 5583 Postadresse: Hahn-Meitner-Platz 1 D-14109 Berlin http://www.helmholtz-berlin.de _______________________________________________ NumPy-Discussion mailing list NumPy-Discussion at scipy.org https://mail.scipy.org/mailman/listinfo/numpy-discussion ________________________________ Helmholtz-Zentrum Berlin f?r Materialien und Energie GmbH Mitglied der Hermann von Helmholtz-Gemeinschaft Deutscher Forschungszentren e.V. Aufsichtsrat: Vorsitzender Dr. Karl Eugen Huthmacher, stv. Vorsitzende Dr. Jutta Koch-Unterseher Gesch?ftsf?hrung: Prof. Dr. 
Anke Rita Kaysser-Pyzalla, Thomas Frederking Sitz Berlin, AG Charlottenburg, 89 HRB 5583 Postadresse: Hahn-Meitner-Platz 1 D-14109 Berlin http://www.helmholtz-berlin.de -------------- next part -------------- An HTML attachment was scrubbed... URL: From robbmcleod at gmail.com Fri Feb 17 11:42:09 2017 From: robbmcleod at gmail.com (Robert McLeod) Date: Fri, 17 Feb 2017 17:42:09 +0100 Subject: [Numpy-discussion] ANN: NumExpr3 Alpha In-Reply-To: References: Message-ID: Hi David, Thanks for your comments, reply below the fold. On Fri, Feb 17, 2017 at 4:34 PM, Da?id wrote: > This is very nice indeed! > > On 17 February 2017 at 12:15, Robert McLeod wrote: > > * bytes and unicode support > > * reductions (mean, sum, prod, std) > > I use both a lot, maybe I can help you get them working. > > Also, regarding "Vectorization hasn't been done yet with cmath > functions for real numbers (such as sqrt(), exp(), etc.), only for > complex functions". What is the bottleneck? Is it in GCC or just > someone has to sit down and adapt it? I just haven't done it yet. Basically I'm moving from Switzerland to Canada in a week so this was the gap to push something out that's usable if not perfect. Rather I just import cmath functions, which are inlined but I suspect what's needed is to break them down into their components. For example, the complex arccos function looks like this: static void nc_acos( npy_intp n, npy_complex64 *x, npy_complex64 *r) { npy_complex64 a; for( npy_intp I = 0; I < n; I++ ) { a = x[I]; _inline_mul( x[I], x[I], r[I] ); _inline_sub( Z_1, r[I], r[I] ); _inline_sqrt( r[I], r[I] ); _inline_muli( r[I], r[I] ); _inline_add( a, r[I], r[I] ); _inline_log( r[I] , r[I] ); _inline_muli( r[I], r[I] ); _inline_neg( r[I], r[I]); } } I haven't sat down and inspected whether the cmath versions get vectorized, but there's not a huge speed difference between NE2 and 3 for such a function on float (but their is for complex), so my suspicion is they aren't. Another option would be to add a library such as Yeppp! as LIB_YEPPP or some other library that's faster than glib. For example the glib function "fma(a,b,c)" is slower than doing "a*b+c" in NE3, and that's not how it should be. Yeppp is also built with Python generating C code, so it could either be very easy or very hard. On bytes and unicode, I haven't seen examples for how people use it, so I'm not sure where to start. Since there's practically not a limitation on the number of operations now (the library is 1.3 MB now, compared to 1.2 MB for NE2 with gcc 5.4) the string functions could grow significantly from what we have in NE2. With regards to reductions, NumExpr never multi-threaded them, and could only do outer reductions, so in the end there was no speed advantage to be had compared to having NumPy do them on the result. I suspect the primary value there was in PyTables and Pandas where the expression had to do everything. One of the things I've moved away from in NE3 is doing output buffering (rather it pre-allocates the output array), so for reductions the understanding NumExpr has of broadcasting would have to be deeper. In any event contributions would certainly be welcome. Robert -- Robert McLeod, Ph.D. Center for Cellular Imaging and Nano Analytics (C-CINA) Biozentrum der Universit?t Basel Mattenstrasse 26, 4058 Basel Work: +41.061.387.3225 <061%20387%2032%2025> robert.mcleod at unibas.ch robert.mcleod at bsse.ethz.ch robbmcleod at gmail.com -------------- next part -------------- An HTML attachment was scrubbed... 
URL: From charlesr.harris at gmail.com Sat Feb 18 13:07:08 2017 From: charlesr.harris at gmail.com (Charles R Harris) Date: Sat, 18 Feb 2017 11:07:08 -0700 Subject: [Numpy-discussion] Eric Wieser added to NumPy team. Message-ID: Hi All, I'm pleased to welcome Eric to the NumPy team. There is a pile of pending PRs that grows every day and we are counting on Eric will help us keep it in check ;) Chuck -------------- next part -------------- An HTML attachment was scrubbed... URL: From jni.soma at gmail.com Sat Feb 18 22:19:19 2017 From: jni.soma at gmail.com (Juan Nunez-Iglesias) Date: Sun, 19 Feb 2017 14:19:19 +1100 Subject: [Numpy-discussion] ANN: NumExpr3 Alpha In-Reply-To: References: Message-ID: <7b857f99-49fb-452c-979b-10495b00faf4@Spark> Hi everyone, Thanks for this. It looks absolutely fantastic. I've been putting off using numexpr but it looks like I don't have a choice anymore. ;) Regarding feature requests, I've always found it off putting that I have to wrap my expressions in a string to speed them up. Has anyone explored the possibility of using Python 3.6's frame evaluation API to do this? I remember a vague discussion on this list a while back but I don't know whether anything came of it. Thanks! Juan. On 18 Feb 2017, 3:42 AM +1100, Robert McLeod , wrote: > Hi David, > > Thanks for your comments, reply below the fold. > > > On Fri, Feb 17, 2017 at 4:34 PM, Da?id wrote: > > > This is very nice indeed! > > > > > > On 17 February 2017 at 12:15, Robert McLeod wrote: > > > > * bytes and unicode support > > > > * reductions (mean, sum, prod, std) > > > > > > I use both a lot, maybe I can help you get them working. > > > > > > Also, regarding "Vectorization hasn't been done yet with cmath > > > functions for real numbers (such as sqrt(), exp(), etc.), only for > > > complex functions". What is the bottleneck? Is it in GCC or just > > > someone has to sit down and adapt it? > > > > I just haven't done it yet.? Basically I'm moving from Switzerland to Canada in a week so this was the gap to push something out that's usable if not perfect. Rather I just import cmath functions, which are inlined but I suspect what's needed is to break them down into their components. For example, the complex arccos function looks like this: > > > > static void > > nc_acos( npy_intp n, npy_complex64 *x, npy_complex64 *r) > > { > > ? ? npy_complex64 a; > > ? ? for( npy_intp I = 0; I < n; I++ ) { > > ? ? ? ? a = x[I]; > > ? ? ? ? _inline_mul( x[I], x[I], r[I] ); > > ? ? ? ? _inline_sub( Z_1, r[I], r[I] ); > > ? ? ? ? _inline_sqrt( r[I], r[I] ); > > ? ? ? ? _inline_muli( r[I], r[I] ); > > ? ? ? ? _inline_add( a, r[I], r[I] ); > > ? ? ? ? _inline_log( r[I] , r[I] ); > > ? ? ? ? _inline_muli( r[I], r[I] ); > > ? ? ? ? _inline_neg( r[I], r[I]); > > ? ? } > > } > > > > I haven't sat down and inspected whether the cmath versions get vectorized, but there's not a huge speed difference between NE2 and 3 for such a function on float (but their is for complex), so my suspicion is they aren't.? Another option would be to add a library such as Yeppp! as LIB_YEPPP or some other library that's faster than glib.? For example the glib function "fma(a,b,c)" is slower than doing "a*b+c" in NE3, and that's not how it should be.? Yeppp is also built with Python generating C code, so it could either be very easy or very hard. > > > > On bytes and unicode, I haven't seen examples for how people use it, so I'm not sure where to start. 
Since there's practically not a limitation on the number of operations now (the library is 1.3 MB now, compared to 1.2 MB for NE2 with gcc 5.4) the string functions could grow significantly from what we have in NE2. > > > > With regards to reductions, NumExpr never multi-threaded them, and could only do outer reductions, so in the end there was no speed advantage to be had compared to having NumPy do them on the result.? I suspect the primary value there was in PyTables and Pandas where the expression had to do everything.? One of the things I've moved away from in NE3 is doing output buffering (rather it pre-allocates the output array), so for reductions the understanding NumExpr has of broadcasting would have to be deeper. > > > > In any event contributions would certainly be welcome. > > > > Robert > > > -- > Robert McLeod, Ph.D. > Center for Cellular Imaging and Nano Analytics (C-CINA) > Biozentrum der Universit?t Basel > Mattenstrasse 26, 4058 Basel > Work: +41.061.387.3225 > robert.mcleod at unibas.ch > robert.mcleod at bsse.ethz.ch > robbmcleod at gmail.com > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Sun Feb 19 05:00:47 2017 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Sun, 19 Feb 2017 23:00:47 +1300 Subject: [Numpy-discussion] Eric Wieser added to NumPy team. In-Reply-To: References: Message-ID: On Sun, Feb 19, 2017 at 7:07 AM, Charles R Harris wrote: > Hi All, > > I'm pleased to welcome Eric to the NumPy team. There is a pile of pending > PRs that grows every day and we are counting on Eric will help us keep it > in check ;) > Welcome to the team Eric! Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From robbmcleod at gmail.com Sun Feb 19 08:41:18 2017 From: robbmcleod at gmail.com (Robert McLeod) Date: Sun, 19 Feb 2017 14:41:18 +0100 Subject: [Numpy-discussion] ANN: NumExpr3 Alpha In-Reply-To: <7b857f99-49fb-452c-979b-10495b00faf4@Spark> References: <7b857f99-49fb-452c-979b-10495b00faf4@Spark> Message-ID: Hi Juan, A guy on reddit suggested looking at SymPy for just such a thing. I know that Dask also represents its process as a graph. https://www.reddit.com/r/Python/comments/5um04m/numexpr3/ I'll think about it some more but it seems a little abstract still. To a certain extent the NE3 compiler already works this way. The compiler has a dictionary in which keys are `ast.Node` types, and each value is a function pointer, which knows how to handle that particular node. Providing an external interface to this would be the most natural extension. There's quite a few things to do before I would think about a functional interface. The things I mentioned in my original mail; pickling of the C-object so that it can be using within modules like `multiprocessing`; having a pre-allocated shared memory region shared among threads for temporaries and parameters, etc. If someone else wants to dabble in it they are welcome to. Robert On Sun, Feb 19, 2017 at 4:19 AM, Juan Nunez-Iglesias wrote: > Hi everyone, > > Thanks for this. It looks absolutely fantastic. I've been putting off > using numexpr but it looks like I don't have a choice anymore. ;) > > Regarding feature requests, I've always found it off putting that I have > to wrap my expressions in a string to speed them up. 
Has anyone explored > the possibility of using Python 3.6's frame evaluation API to do this? I > remember a vague discussion on this list a while back but I don't know > whether anything came of it. > > Thanks! > > Juan. > > On 18 Feb 2017, 3:42 AM +1100, Robert McLeod , > wrote: > > Hi David, > > Thanks for your comments, reply below the fold. > > On Fri, Feb 17, 2017 at 4:34 PM, Da?id wrote: > >> This is very nice indeed! >> >> On 17 February 2017 at 12:15, Robert McLeod wrote: >> > * bytes and unicode support >> > * reductions (mean, sum, prod, std) >> >> I use both a lot, maybe I can help you get them working. >> >> Also, regarding "Vectorization hasn't been done yet with cmath >> functions for real numbers (such as sqrt(), exp(), etc.), only for >> complex functions". What is the bottleneck? Is it in GCC or just >> someone has to sit down and adapt it? > > > I just haven't done it yet. Basically I'm moving from Switzerland to > Canada in a week so this was the gap to push something out that's usable if > not perfect. Rather I just import cmath functions, which are inlined but I > suspect what's needed is to break them down into their components. For > example, the complex arccos function looks like this: > > static void > nc_acos( npy_intp n, npy_complex64 *x, npy_complex64 *r) > { > npy_complex64 a; > for( npy_intp I = 0; I < n; I++ ) { > a = x[I]; > _inline_mul( x[I], x[I], r[I] ); > _inline_sub( Z_1, r[I], r[I] ); > _inline_sqrt( r[I], r[I] ); > _inline_muli( r[I], r[I] ); > _inline_add( a, r[I], r[I] ); > _inline_log( r[I] , r[I] ); > _inline_muli( r[I], r[I] ); > _inline_neg( r[I], r[I]); > } > } > > I haven't sat down and inspected whether the cmath versions get > vectorized, but there's not a huge speed difference between NE2 and 3 for > such a function on float (but their is for complex), so my suspicion is > they aren't. Another option would be to add a library such as Yeppp! as > LIB_YEPPP or some other library that's faster than glib. For example the > glib function "fma(a,b,c)" is slower than doing "a*b+c" in NE3, and that's > not how it should be. Yeppp is also built with Python generating C code, > so it could either be very easy or very hard. > > On bytes and unicode, I haven't seen examples for how people use it, so > I'm not sure where to start. Since there's practically not a limitation on > the number of operations now (the library is 1.3 MB now, compared to 1.2 MB > for NE2 with gcc 5.4) the string functions could grow significantly from > what we have in NE2. > > With regards to reductions, NumExpr never multi-threaded them, and could > only do outer reductions, so in the end there was no speed advantage to be > had compared to having NumPy do them on the result. I suspect the primary > value there was in PyTables and Pandas where the expression had to do > everything. One of the things I've moved away from in NE3 is doing output > buffering (rather it pre-allocates the output array), so for reductions the > understanding NumExpr has of broadcasting would have to be deeper. > > In any event contributions would certainly be welcome. > > Robert > > -- > Robert McLeod, Ph.D. 
> Center for Cellular Imaging and Nano Analytics (C-CINA)
> Biozentrum der Universität Basel
> Mattenstrasse 26, 4058 Basel
> Work: +41.061.387.3225
> robert.mcleod at unibas.ch
> robert.mcleod at bsse.ethz.ch
> robbmcleod at gmail.com
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> https://mail.scipy.org/mailman/listinfo/numpy-discussion
>
>
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> https://mail.scipy.org/mailman/listinfo/numpy-discussion
>

--
Robert McLeod, Ph.D.
Center for Cellular Imaging and Nano Analytics (C-CINA)
Biozentrum der Universität Basel
Mattenstrasse 26, 4058 Basel
Work: +41.061.387.3225
robert.mcleod at unibas.ch
robert.mcleod at bsse.ethz.ch
robbmcleod at gmail.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From m.h.vankerkwijk at gmail.com  Sun Feb 19 12:21:38 2017
From: m.h.vankerkwijk at gmail.com (Marten van Kerkwijk)
Date: Sun, 19 Feb 2017 12:21:38 -0500
Subject: [Numpy-discussion] ANN: NumExpr3 Alpha
In-Reply-To: 
References: <7b857f99-49fb-452c-979b-10495b00faf4@Spark>
Message-ID: 

Hi All,

Just a side note that at a smaller scale some of the benefits of
numexpr are coming to numpy: Julian Taylor has been working on
identifying temporary arrays in
https://github.com/numpy/numpy/pull/7997. Julian also commented
(https://github.com/numpy/numpy/pull/7997#issuecomment-246118772) that
with PEP 523 in python 3.6, this should indeed become a lot easier.

All the best,

Marten

From ashwin.pathak at students.iiit.ac.in  Mon Feb 20 13:32:35 2017
From: ashwin.pathak at students.iiit.ac.in (ashwin.pathak)
Date: Tue, 21 Feb 2017 00:02:35 +0530
Subject: [Numpy-discussion] Numpy Development Queries
Message-ID: <8cbf1c9b6c561724e2018d77faadae41@students.iiit.ac.in>

Hello all,
I am new to this organization and wanted to start with some easy-fix
issues to get some knowledge about the source code. However, the issues
under the easy-fix label have already been solved or someone is already
working on them. Can someone help me find such issues?

From ralf.gommers at gmail.com  Tue Feb 21 00:01:55 2017
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Tue, 21 Feb 2017 18:01:55 +1300
Subject: [Numpy-discussion] Numpy Development Queries
In-Reply-To: <8cbf1c9b6c561724e2018d77faadae41@students.iiit.ac.in>
References: <8cbf1c9b6c561724e2018d77faadae41@students.iiit.ac.in>
Message-ID: 

On Tue, Feb 21, 2017 at 7:32 AM, ashwin.pathak <
ashwin.pathak at students.iiit.ac.in> wrote:

> Hello all,
> I am new to this organization and wanted to start with some easy-fix
> issues to get some knowledge about the source code. However, the issues
> under the easy-fix label have already been solved or someone is already
> working on them. Can someone help me find such issues?
>

Hi Ashwin, welcome. I don't want to seem discouraging, but I do want to
explain that NumPy is significantly harder to get started on than SciPy
(which you've started on already) as a newcomer to the scientific Python
ecosystem. So I'd encourage you to spend some more time on the SciPy
issues - there are more easy-fix ones there, and the process of
contributing (pull requests, reviews, finding your way around the
codebase) is similar for the two projects.

Cheers,
Ralf
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From faltet at gmail.com  Tue Feb 21 04:10:04 2017
From: faltet at gmail.com (Francesc Alted)
Date: Tue, 21 Feb 2017 10:10:04 +0100
Subject: [Numpy-discussion] ANN: NumExpr3 Alpha
In-Reply-To: 
References: <7b857f99-49fb-452c-979b-10495b00faf4@Spark>
Message-ID: 

Yes, Julian is doing amazing work on getting rid of temporaries inside
NumPy. However, NumExpr still has the advantage of using multi-threading
right out of the box, as well as integration with Intel VML. Hopefully
these features will eventually arrive in NumPy, but meanwhile there is
still value in pushing NumExpr.

Francesc

2017-02-19 18:21 GMT+01:00 Marten van Kerkwijk :

> Hi All,
>
> Just a side note that at a smaller scale some of the benefits of
> numexpr are coming to numpy: Julian Taylor has been working on
> identifying temporary arrays in
> https://github.com/numpy/numpy/pull/7997. Julian also commented
> (https://github.com/numpy/numpy/pull/7997#issuecomment-246118772) that
> with PEP 523 in python 3.6, this should indeed become a lot easier.
>
> All the best,
>
> Marten
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> https://mail.scipy.org/mailman/listinfo/numpy-discussion
>

--
Francesc Alted
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From m.h.vankerkwijk at gmail.com  Tue Feb 21 13:52:05 2017
From: m.h.vankerkwijk at gmail.com (Marten van Kerkwijk)
Date: Tue, 21 Feb 2017 13:52:05 -0500
Subject: [Numpy-discussion] Could we simplify backporting?
Message-ID: 

Hi All,

In gh-8594, a question came up about how to mark things that should be
backported, and Chuck commented [1]:

> Our backport policy is still somewhat ad hoc, especially as I'm the only
one who has been doing releases. What I currently do is set the milestone
to the earlier version, so I will find the PR when looking for backports,
then do a backport, label it as such, set the milestone on the backported
version, and remove the milestone from the original. I'm not completely
happy with the process, so if you have better ideas I'd like to hear them.
One option I've considered is a `backported` label in addition to the
`backport` label, then use the latter for things to be backported.

It seems that continuing to set the milestone to a bug-release version
if required was a good idea; it is little effort for anyone and keeps
things clear. For the rest, might it be possible to make things more
automated? E.g., might it be possible to have some travis magic that
does a trial merge & test? Could one somehow merge to multiple
branches at the same time? I have no idea myself really how these
things work, but maybe some of you do!
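For concreteness, one rough sketch of such a trial-backport check (purely
hypothetical; no such NumPy or Travis setup exists, and the branch name
is made up):

```
# hypothetical CI helper: does this commit cherry-pick cleanly onto the
# maintenance branch? (assumes it runs inside a clean git checkout)
import subprocess

def applies_cleanly(commit, branch="maintenance/1.12.x"):
    subprocess.check_call(["git", "checkout", branch])
    # attempt the cherry-pick without creating a commit
    result = subprocess.run(["git", "cherry-pick", "--no-commit", commit])
    # clean up any sequencer state and staged changes before reporting;
    # errors from --quit when nothing is in progress are ignored
    subprocess.call(["git", "cherry-pick", "--quit"])
    subprocess.call(["git", "reset", "--hard"])
    return result.returncode == 0
```

An allowed-to-fail job could run something like this for PRs marked for
backport, and flag the ones that will need manual conflict resolution.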
All the best, Marten From matthew.brett at gmail.com Tue Feb 21 14:41:32 2017 From: matthew.brett at gmail.com (Matthew Brett) Date: Tue, 21 Feb 2017 11:41:32 -0800 Subject: [Numpy-discussion] PowerPC testing servers In-Reply-To: References: Message-ID: Hi, On Thu, Feb 16, 2017 at 12:50 AM, Ralf Gommers wrote: > > > On Thu, Feb 16, 2017 at 8:58 AM, Matthew Brett > wrote: >> >> On Wed, Feb 15, 2017 at 7:55 PM, Ralf Gommers >> wrote: >> > >> > >> > On Thu, Feb 16, 2017 at 8:45 AM, Matthew Brett >> > wrote: >> >> >> >> On Wed, Feb 15, 2017 at 7:37 PM, Ralf Gommers >> >> wrote: >> >> > >> >> > >> >> > On Thu, Feb 16, 2017 at 8:02 AM, Matthew Brett >> >> > >> >> > wrote: >> >> >> >> >> >> Hey, >> >> >> >> >> >> A recent post to the wheel-builders mailing list pointed out some >> >> >> links to places providing free PowerPC hosting for open source >> >> >> projects, if they agree to a submitted request: >> >> >> >> >> >> >> >> >> >> >> >> https://mail.python.org/pipermail/wheel-builders/2017-February/000257.html >> >> >> >> >> >> It would be good to get some testing going on these architectures. >> >> >> Shall we apply for hosting, as the numpy organization? >> >> > >> >> > >> >> > Those are bare VMs it seems. Remembering the Buildbot and Mailman >> >> > horrors, I >> >> > think we should be very reluctant to taking responsibility for >> >> > maintaining >> >> > CI on anything that's not hosted and can be controlled with a simple >> >> > config >> >> > file in our repo. >> >> >> >> Not sure what you mean about mailman - maybe the Enthought servers we >> >> didn't have access to? >> > >> > >> > We did have access (for most of the time), it's just that no one is >> > interested in putting in lots of hours on sysadmin duties. >> > >> >> >> >> For buildbot, I've been maintaining about 12 >> >> crappy old machines for about 7 years now [1] - I'm happy to do the >> >> same job for a couple of properly hosted PPC machines. >> > >> > >> > That's awesome persistence. The NumPy and SciPy buildbots certainly >> > weren't >> > maintained like that, half of them were offline or broken for long >> > periods >> > usually. >> >> Right - they do need persistence, and to have someone who takes >> responsibility for them. >> >> >> >> >> At least we'd >> >> have some way of testing for these machines, if we get stuck - even if >> >> that involved spinning up a VM and installing the stuff we needed from >> >> the command line. >> > >> > >> > I do see the value of testing on more platforms of course. It's just >> > about >> > logistics/responsibilities. If you're saying that you'll do the >> > maintenance, >> > and want to apply for resources using the NumPy name, that's much better >> > I >> > think then making "the numpy devs" collectively responsible. >> >> Yes, exactly. I'm happy to take responsibility for them, I just >> wanted to make sure that numpy devs could get at them if I'm not >> around for some reason. > > > In that case, +1 from me! OK - IBM have kindly given me access to a testing machine, via my own SSH public key. Would it make sense to have a Numpy key, with several people having access to the private key and passphrase? Cheers, Matthew From alex.rogozhnikov at yandex.ru Tue Feb 21 18:05:07 2017 From: alex.rogozhnikov at yandex.ru (Alex Rogozhnikov) Date: Wed, 22 Feb 2017 02:05:07 +0300 Subject: [Numpy-discussion] Fortran order in recarray. 
Message-ID: 

Hi,
a question about numpy.recarray:
There is an `order` parameter in the constructor
(https://docs.scipy.org/doc/numpy-1.10.1/reference/generated/numpy.recarray.html),
but it seems to have no effect:

import numpy

x = numpy.recarray(dtype=[('a', int), ('b', float)], shape=[1000], order='C')
y = numpy.recarray(dtype=[('a', int), ('b', float)], shape=[1000], order='F')

print numpy.array(x.ctypes.get_strides())  # [16]
print numpy.array(y.ctypes.get_strides())  # [16]

is this intended behavior or a bug?

Thanks,
Alex.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From chris.barker at noaa.gov  Tue Feb 21 18:10:23 2017
From: chris.barker at noaa.gov (Chris Barker)
Date: Tue, 21 Feb 2017 15:10:23 -0800
Subject: [Numpy-discussion] Fortran order in recarray.
In-Reply-To: 
References: 
Message-ID: 

On Tue, Feb 21, 2017 at 3:05 PM, Alex Rogozhnikov <
alex.rogozhnikov at yandex.ru> wrote:

> a question about numpy.recarray:
> There is an `order` parameter in the constructor
> (https://docs.scipy.org/doc/numpy-1.10.1/reference/generated/numpy.recarray.html),
> but it seems to have no effect:
> x = numpy.recarray(dtype=[('a', int), ('b', float)], shape=[1000],
> order='C')
>

you are creating a 1D array here -- there is no difference between Fortran
and C order for a 1D array. For 2D:

In [2]: x = numpy.recarray(dtype=[('a', int), ('b', float)], shape=[10,10],
order='C')

In [3]: x.strides
Out[3]: (160, 16)

In [4]: y = numpy.recarray(dtype=[('a', int), ('b', float)], shape=[10,10],
order='F')

In [5]: y.strides
Out[5]: (16, 160)

note the easier way to get the strides, too :-)

-CHB

--

Christopher Barker, Ph.D.
> Oceanographer > > Emergency Response Division > NOAA/NOS/OR&R (206) 526-6959 voice > 7600 Sand Point Way NE (206) 526-6329 fax > Seattle, WA 98115 (206) 526-6317 main reception > > Chris.Barker at noaa.gov _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From njs at pobox.com Tue Feb 21 19:53:07 2017 From: njs at pobox.com (Nathaniel Smith) Date: Tue, 21 Feb 2017 16:53:07 -0800 Subject: [Numpy-discussion] Fortran order in recarray. In-Reply-To: References: Message-ID: On Feb 21, 2017 3:24 PM, "Alex Rogozhnikov" wrote: Ah, got it. Thanks, Chris! I thought recarray can be only one-dimensional (like tables with named columns). Maybe it's better to ask directly what I was looking for: something that works like a table with named columns (but no labelling for rows), and keeps data (of different dtypes) in a column-by-column way (and this is numpy, not pandas). Is there such a magic thing? Well, that's what pandas is for... A dict of arrays? -n -------------- next part -------------- An HTML attachment was scrubbed... URL: From ralf.gommers at gmail.com Wed Feb 22 04:24:29 2017 From: ralf.gommers at gmail.com (Ralf Gommers) Date: Wed, 22 Feb 2017 22:24:29 +1300 Subject: [Numpy-discussion] Could we simplify backporting? In-Reply-To: References: Message-ID: On Wed, Feb 22, 2017 at 7:52 AM, Marten van Kerkwijk < m.h.vankerkwijk at gmail.com> wrote: > Hi All, > > In gh-8594, a question came up how to mark things that should be > backported and Chuck commented [1]: > > > Our backport policy is still somewhat ad hoc, especially as I the only > one who has been doing release. What I currently do is set the milestone to > the earlier version, so I will find the PR when looking for backports, then > do a backport, label it as such, set the milestone on the backported > version, and remove the milestone from the original. I'm not completely > happy with the process, so if you have better ideas I'd like to hear them. > One option I've considered is a `backported` label in addition to the > `backport` label, then use the latter for things to be backported. > I really don't like the double work and the large amount of noise coming from backporting every other PR to NumPy very quickly. For SciPy the policy is: - anyone can set the "backport-candidate" label - the release manager backports, usually a bunch in one go - only important fixes get backported (involves some judging, but things like silencing warnings, doc fixes, etc. are not important enough) This works well, and I'd hope that we can make the NumPy approach similar. It seems that continuing to set the milestone to a bug-release version > if required was a good idea; it is little effort to anyone and keeps > things clear. +1 For the rest, might it be possible to make things more > automated? E.g., might it be possible to have some travis magic that > does a trial merge & test? Not sure how you would deal with merge conflicts on cherry-picks in an automatic backport thingy? Could one somehow merge to multiple > branches at the same time? > Don't think so. Ralf -------------- next part -------------- An HTML attachment was scrubbed... URL: From alex.rogozhnikov at yandex.ru Wed Feb 22 06:45:14 2017 From: alex.rogozhnikov at yandex.ru (Alex Rogozhnikov) Date: Wed, 22 Feb 2017 14:45:14 +0300 Subject: [Numpy-discussion] Fortran order in recarray. 
In-Reply-To: 
References: 
Message-ID: <0902C347-89B2-41EE-9367-F0C7A4F864D4@yandex.ru>

Hi Nathaniel,

> pandas

yup, the idea was to have minimal pandas.DataFrame-like storage (which I
was using for a long time), but without irritating problems with its row
indexing and some other problems like interaction with matplotlib.

> A dict of arrays?

that's what I've started from and implemented, but at some point I decided
that I'm reinventing the wheel and numpy has something already. In
principle, I can ignore this 'column-oriented' storage requirement, but
potentially it may turn out to be quite slow-ish if the dtype's size is
large.

Suggestions are welcome.

Another strange question:
in general, it is considered that once a numpy.array is created, its shape
is not changed. But if I want to keep the same recarray and change its
dtype and/or shape, is there a way to do this?

Thanks,
Alex.

> On 22 Feb 2017, at 3:53, Nathaniel Smith wrote:
>
> On Feb 21, 2017 3:24 PM, "Alex Rogozhnikov" wrote:
> Ah, got it. Thanks, Chris!
> I thought recarray can be only one-dimensional (like tables with named
> columns).
>
> Maybe it's better to ask directly what I was looking for:
> something that works like a table with named columns (but no labelling
> for rows), and keeps data (of different dtypes) in a column-by-column way
> (and this is numpy, not pandas).
>
> Is there such a magic thing?
>
> Well, that's what pandas is for...
>
> A dict of arrays?
>
> -n
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion at scipy.org
> https://mail.scipy.org/mailman/listinfo/numpy-discussion
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From m.h.vankerkwijk at gmail.com  Wed Feb 22 08:49:59 2017
From: m.h.vankerkwijk at gmail.com (Marten van Kerkwijk)
Date: Wed, 22 Feb 2017 08:49:59 -0500
Subject: [Numpy-discussion] Could we simplify backporting?
In-Reply-To: 
References: 
Message-ID: 

Hi Ralf,

Yes, good to think about other policies. For astropy, we make the
decision by labelling with the bug-fix branch (with a policy that it
really should fix a bug), and inserting text in that release's bug-fix
notes (we really should automate that part...). Then, backporting is done
shortly before the bug-fix release and, as far as I can tell (not having
done it myself), outside of github. In rare cases with hard-to-resolve
merge conflicts, the original PR author gets a note asking for help.

As for a travis test: here I was mostly thinking of an allowed-to-fail
test that would at least alert one if backporting was going to be an
issue. I think travis runs again once one merges, correct? If so, on that
merge it could, in principle, do the backport too (if given enough
permission, of course; I'm not sure at all I'd want that, just pointing
out the possibility! E.g., it might trigger on a message in the merge
commit.).

All the best,

Marten

From faltet at gmail.com  Wed Feb 22 09:03:32 2017
From: faltet at gmail.com (Francesc Alted)
Date: Wed, 22 Feb 2017 15:03:32 +0100
Subject: [Numpy-discussion] Fortran order in recarray.
In-Reply-To: <0902C347-89B2-41EE-9367-F0C7A4F864D4@yandex.ru>
References: <0902C347-89B2-41EE-9367-F0C7A4F864D4@yandex.ru>
Message-ID: 

Hi Alex,

2017-02-22 12:45 GMT+01:00 Alex Rogozhnikov :

> Hi Nathaniel,
>
> > pandas
>
> yup, the idea was to have minimal pandas.DataFrame-like storage (which I
> was using for a long time), but without irritating problems with its row
> indexing and some other problems like interaction with matplotlib.
> > A dict of arrays?
>
> that's what I've started from and implemented, but at some point I
> decided that I'm reinventing the wheel and numpy has something already.
> In principle, I can ignore this 'column-oriented' storage requirement,
> but potentially it may turn out to be quite slow-ish if the dtype's size
> is large.
>
> Suggestions are welcome.

You may want to try bcolz:

https://github.com/Blosc/bcolz

bcolz is a columnar storage, basically as you require, but data is
compressed by default even when stored in-memory (although you can disable
compression if you want to).

> Another strange question:
> in general, it is considered that once a numpy.array is created, its
> shape is not changed.
> But if I want to keep the same recarray and change its dtype and/or
> shape, is there a way to do this?

You can change shapes of numpy arrays, but that usually involves copies of
the whole container. With bcolz you can change length and add/del columns
without copies. If your containers are large, it is better to inform bcolz
of its final estimated size. See:

http://bcolz.blosc.org/en/latest/opt-tips.html

Francesc

> Thanks,
> Alex.
>
>> On 22 Feb 2017, at 3:53, Nathaniel Smith wrote:
>>
>> On Feb 21, 2017 3:24 PM, "Alex Rogozhnikov" wrote:
>> Ah, got it. Thanks, Chris!
>> I thought recarray can be only one-dimensional (like tables with named
>> columns).
>>
>> Maybe it's better to ask directly what I was looking for:
>> something that works like a table with named columns (but no labelling
>> for rows), and keeps data (of different dtypes) in a column-by-column
>> way (and this is numpy, not pandas).
>>
>> Is there such a magic thing?
>>
>> Well, that's what pandas is for...
>>
>> A dict of arrays?
>>
>> -n
>> _______________________________________________
>> NumPy-Discussion mailing list
>> NumPy-Discussion at scipy.org
>> https://mail.scipy.org/mailman/listinfo/numpy-discussion

--
Francesc Alted
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From m.h.vankerkwijk at gmail.com  Wed Feb 22 09:31:54 2017
From: m.h.vankerkwijk at gmail.com (Marten van Kerkwijk)
Date: Wed, 22 Feb 2017 09:31:54 -0500
Subject: [Numpy-discussion] __numpy_ufunc__
In-Reply-To: 
References: 
Message-ID: 

Hi All,

I'd very much like to get `__array_ufunc__` in, and am willing to do some
work, but fear we need to get past the last sticking point. As I noted in
Chuck's PR [1], in python 3.6 there is now an explicit language change
[2], which I think is relevant:
```
It is now possible to set a special method to None to indicate that the
corresponding operation is not available. For example, if a class sets
__iter__() to None, the class is not iterable.
```
It seems to me entirely logical (but then it would, I suggested it
before...) that we allow opting out by setting `__array_ufunc__` to None;
in that case, binops return NotImplemented and ufuncs raise errors. (In
addition, or alternatively, one could allow setting `__array__` to None,
which would generally prevent something from being turned into an array
object.)

But I should note that I much prefer to get something in over waiting yet
another round! In astropy, there is now more and more clamouring to offer
options for pure ndarray functions where quantities are more logical,
because quantities are twice as slow -- this would instantly be solved
with __array_ufunc__...
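To make both directions concrete, a minimal sketch of what this could
look like under the proposal (hypothetical classes; the protocol is not
in a released numpy yet):

```
import numpy as np

class Wrapped:
    """Toy container that handles ufuncs via the proposed protocol."""
    def __init__(self, value):
        self.value = np.asarray(value)

    def __array_ufunc__(self, ufunc, method, *inputs, **kwargs):
        # unwrap any Wrapped inputs, defer to the ufunc, re-wrap the result
        unwrapped = [i.value if isinstance(i, Wrapped) else i for i in inputs]
        return Wrapped(getattr(ufunc, method)(*unwrapped, **kwargs))

class OptedOut:
    # opting out as suggested above: binops would return NotImplemented
    # and ufuncs would raise, instead of silently coercing to ndarray
    __array_ufunc__ = None
```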
If we can decide on this, then I'd gladly help with the remaining issues
(e.g., the `ndarray.__array_ufunc__` method, so super can be used).

All the best,

Marten

[1] https://github.com/numpy/numpy/pull/8247
[2] https://docs.python.org/3.6/whatsnew/3.6.html#other-language-changes

From alex.rogozhnikov at yandex.ru  Wed Feb 22 10:23:40 2017
From: alex.rogozhnikov at yandex.ru (Alex Rogozhnikov)
Date: Wed, 22 Feb 2017 18:23:40 +0300
Subject: [Numpy-discussion] Fortran order in recarray.
In-Reply-To: 
References: <0902C347-89B2-41EE-9367-F0C7A4F864D4@yandex.ru>
Message-ID: <917C8C1B-FA4C-4128-9991-82DC6AFC3EAA@yandex.ru>

Hi Francesc,
thanks a lot for your reply and for your impressive job on bcolz!

Bcolz seems to put its emphasis on compression, which is not of much
interest to me, but the *ctable* and chunked operations look very
appropriate to me now. (Of course, I'll need to test it a lot before I
can say this for sure; that's my current impression.)

The strongest concern with bcolz so far is that it seems to be completely
non-trivial to install on Windows systems, while pip provides binaries
for most (or all?) OSes for numpy.
I didn't build pip binary wheels myself, but is it hard / impossible to
cook pip-installable binaries?

> You can change shapes of numpy arrays, but that usually involves copies
> of the whole container.

sure, but this is ok for me, as I plan to organize column editing in
'batches', so this should require seldom copying.
It would be nice to see an example to understand how deep I need to go
inside numpy.

Cheers,
Alex.

> On 22 Feb 2017, at 17:03, Francesc Alted wrote:
>
> Hi Alex,
>
> 2017-02-22 12:45 GMT+01:00 Alex Rogozhnikov :
> Hi Nathaniel,
>
> > pandas
>
> yup, the idea was to have minimal pandas.DataFrame-like storage (which I
> was using for a long time), but without irritating problems with its row
> indexing and some other problems like interaction with matplotlib.
>
> > A dict of arrays?
>
> that's what I've started from and implemented, but at some point I
> decided that I'm reinventing the wheel and numpy has something already.
> In principle, I can ignore this 'column-oriented' storage requirement,
> but potentially it may turn out to be quite slow-ish if the dtype's size
> is large.
>
> Suggestions are welcome.
>
> You may want to try bcolz:
>
> https://github.com/Blosc/bcolz
>
> bcolz is a columnar storage, basically as you require, but data is
> compressed by default even when stored in-memory (although you can
> disable compression if you want to).
>
> Another strange question:
> in general, it is considered that once a numpy.array is created, its
> shape is not changed.
> But if I want to keep the same recarray and change its dtype and/or
> shape, is there a way to do this?
>
> You can change shapes of numpy arrays, but that usually involves copies
> of the whole container. With bcolz you can change length and add/del
> columns without copies. If your containers are large, it is better to
> inform bcolz of its final estimated size. See:
>
> http://bcolz.blosc.org/en/latest/opt-tips.html
>
> Francesc
>
> Thanks,
> Alex.
>
>> On 22 Feb 2017, at 3:53, Nathaniel Smith wrote:
>>
>> On Feb 21, 2017 3:24 PM, "Alex Rogozhnikov" wrote:
>> Ah, got it. Thanks, Chris!
>> I thought recarray can be only one-dimensional (like tables with named
>> columns).
>> >> Maybe it's better to ask directly what I was looking for: >> something that works like a table with named columns (but no labelling for rows), and keeps data (of different dtypes) in a column-by-column way (and this is numpy, not pandas). >> >> Is there such a magic thing? >> >> Well, that's what pandas is for... >> >> A dict of arrays? >> >> -n >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> https://mail.scipy.org/mailman/listinfo/numpy-discussion > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > > -- > Francesc Alted > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion -------------- next part -------------- An HTML attachment was scrubbed... URL: From kikocorreoso at gmail.com Wed Feb 22 10:30:18 2017 From: kikocorreoso at gmail.com (Kiko) Date: Wed, 22 Feb 2017 16:30:18 +0100 Subject: [Numpy-discussion] Fortran order in recarray. In-Reply-To: <917C8C1B-FA4C-4128-9991-82DC6AFC3EAA@yandex.ru> References: <0902C347-89B2-41EE-9367-F0C7A4F864D4@yandex.ru> <917C8C1B-FA4C-4128-9991-82DC6AFC3EAA@yandex.ru> Message-ID: 2017-02-22 16:23 GMT+01:00 Alex Rogozhnikov : > Hi Francesc, > thanks a lot for you reply and for your impressive job on bcolz! > > Bcolz seems to make stress on compression, which is not of much interest > for me, but the *ctable*, and chunked operations look very appropriate to > me now. (Of course, I'll need to test it much before I can say this for > sure, that's current impression). > > The strongest concern with bcolz so far is that it seems to be completely > non-trivial to install on windows systems, while pip provides binaries for > most (or all?) OS for numpy. > I didn't build pip binary wheels myself, but is it hard / impossible to > cook pip-installabel binaries? > http://www.lfd.uci.edu/~gohlke/pythonlibs/#bcolz Check if the link solves the issue with installing. > > ?You can change shapes of numpy arrays, but that usually involves copies > of the whole container. > > sure, but this is ok for me, as I plan to organize column editing in > 'batches', so this should require seldom copying. > It would be nice to see an example to understand how deep I need to go > inside numpy. > > Cheers, > Alex. > > > > > 22 ????. 2017 ?., ? 17:03, Francesc Alted ???????(?): > > Hi Alex, > > 2017-02-22 12:45 GMT+01:00 Alex Rogozhnikov : > >> Hi Nathaniel, >> >> >> pandas >> >> >> yup, the idea was to have minimal pandas.DataFrame-like storage (which I >> was using for a long time), >> but without irritating problems with its row indexing and some other >> problems like interaction with matplotlib. >> >> A dict of arrays? >> >> >> that's what I've started from and implemented, but at some point I >> decided that I'm reinventing the wheel and numpy has something already. In >> principle, I can ignore this 'column-oriented' storage requirement, but >> potentially it may turn out to be quite slow-ish if dtype's size is large. >> >> Suggestions are welcome. >> > > ?You may want to try bcolz: > > https://github.com/Blosc/bcolz > > bcolz is a columnar storage, basically as you require, but data is > compressed by default even when stored in-memory (although you can disable > compression if you want to).? 
> > > >> >> Another strange question: >> in general, it is considered that once numpy.array is created, it's shape >> not changed. >> But if i want to keep the same recarray and change it's dtype and/or >> shape, is there a way to do this? >> > > ?You can change shapes of numpy arrays, but that usually involves copies > of the whole container. With bcolz you can change length and add/del > columns without copies.? If your containers are large, it is better to > inform bcolz on its final estimated size. See: > > http://bcolz.blosc.org/en/latest/opt-tips.html > > ?Francesc? > > >> >> Thanks, >> Alex. >> >> >> >> 22 ????. 2017 ?., ? 3:53, Nathaniel Smith ???????(?): >> >> On Feb 21, 2017 3:24 PM, "Alex Rogozhnikov" >> wrote: >> >> Ah, got it. Thanks, Chris! >> I thought recarray can be only one-dimensional (like tables with named >> columns). >> >> Maybe it's better to ask directly what I was looking for: >> something that works like a table with named columns (but no labelling >> for rows), and keeps data (of different dtypes) in a column-by-column way >> (and this is numpy, not pandas). >> >> Is there such a magic thing? >> >> >> Well, that's what pandas is for... >> >> A dict of arrays? >> >> -n >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> https://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> https://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > > > -- > Francesc Alted > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From robbmcleod at gmail.com Wed Feb 22 10:34:00 2017 From: robbmcleod at gmail.com (Robert McLeod) Date: Wed, 22 Feb 2017 16:34:00 +0100 Subject: [Numpy-discussion] Fortran order in recarray. In-Reply-To: <917C8C1B-FA4C-4128-9991-82DC6AFC3EAA@yandex.ru> References: <0902C347-89B2-41EE-9367-F0C7A4F864D4@yandex.ru> <917C8C1B-FA4C-4128-9991-82DC6AFC3EAA@yandex.ru> Message-ID: Just as a note, Appveyor supports uploading modules to "public websites": https://packaging.python.org/appveyor/ The main issue I would see from this, is the PyPi has my password stored on my machine in a plain text file. I'm not sure whether there's a way to provide Appveyor with a SSH key instead. On Wed, Feb 22, 2017 at 4:23 PM, Alex Rogozhnikov < alex.rogozhnikov at yandex.ru> wrote: > Hi Francesc, > thanks a lot for you reply and for your impressive job on bcolz! > > Bcolz seems to make stress on compression, which is not of much interest > for me, but the *ctable*, and chunked operations look very appropriate to > me now. (Of course, I'll need to test it much before I can say this for > sure, that's current impression). > > The strongest concern with bcolz so far is that it seems to be completely > non-trivial to install on windows systems, while pip provides binaries for > most (or all?) OS for numpy. > I didn't build pip binary wheels myself, but is it hard / impossible to > cook pip-installabel binaries? > > ?You can change shapes of numpy arrays, but that usually involves copies > of the whole container. 
> > sure, but this is ok for me, as I plan to organize column editing in > 'batches', so this should require seldom copying. > It would be nice to see an example to understand how deep I need to go > inside numpy. > > Cheers, > Alex. > > > > > 22 ????. 2017 ?., ? 17:03, Francesc Alted ???????(?): > > Hi Alex, > > 2017-02-22 12:45 GMT+01:00 Alex Rogozhnikov : > >> Hi Nathaniel, >> >> >> pandas >> >> >> yup, the idea was to have minimal pandas.DataFrame-like storage (which I >> was using for a long time), >> but without irritating problems with its row indexing and some other >> problems like interaction with matplotlib. >> >> A dict of arrays? >> >> >> that's what I've started from and implemented, but at some point I >> decided that I'm reinventing the wheel and numpy has something already. In >> principle, I can ignore this 'column-oriented' storage requirement, but >> potentially it may turn out to be quite slow-ish if dtype's size is large. >> >> Suggestions are welcome. >> > > ?You may want to try bcolz: > > https://github.com/Blosc/bcolz > > bcolz is a columnar storage, basically as you require, but data is > compressed by default even when stored in-memory (although you can disable > compression if you want to).? > > > >> >> Another strange question: >> in general, it is considered that once numpy.array is created, it's shape >> not changed. >> But if i want to keep the same recarray and change it's dtype and/or >> shape, is there a way to do this? >> > > ?You can change shapes of numpy arrays, but that usually involves copies > of the whole container. With bcolz you can change length and add/del > columns without copies.? If your containers are large, it is better to > inform bcolz on its final estimated size. See: > > http://bcolz.blosc.org/en/latest/opt-tips.html > > ?Francesc? > > >> >> Thanks, >> Alex. >> >> >> >> 22 ????. 2017 ?., ? 3:53, Nathaniel Smith ???????(?): >> >> On Feb 21, 2017 3:24 PM, "Alex Rogozhnikov" >> wrote: >> >> Ah, got it. Thanks, Chris! >> I thought recarray can be only one-dimensional (like tables with named >> columns). >> >> Maybe it's better to ask directly what I was looking for: >> something that works like a table with named columns (but no labelling >> for rows), and keeps data (of different dtypes) in a column-by-column way >> (and this is numpy, not pandas). >> >> Is there such a magic thing? >> >> >> Well, that's what pandas is for... >> >> A dict of arrays? >> >> -n >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> https://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> https://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > > > -- > Francesc Alted > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- Robert McLeod, Ph.D. Center for Cellular Imaging and Nano Analytics (C-CINA) Biozentrum der Universit?t Basel Mattenstrasse 26, 4058 Basel Work: +41.061.387.3225 robert.mcleod at unibas.ch robert.mcleod at bsse.ethz.ch robbmcleod at gmail.com -------------- next part -------------- An HTML attachment was scrubbed... 
URL: 

From faltet at gmail.com  Wed Feb 22 10:37:07 2017
From: faltet at gmail.com (Francesc Alted)
Date: Wed, 22 Feb 2017 16:37:07 +0100
Subject: [Numpy-discussion] Fortran order in recarray.
In-Reply-To: 
References: <0902C347-89B2-41EE-9367-F0C7A4F864D4@yandex.ru>
 <917C8C1B-FA4C-4128-9991-82DC6AFC3EAA@yandex.ru>
Message-ID: 

2017-02-22 16:30 GMT+01:00 Kiko :

>
> 2017-02-22 16:23 GMT+01:00 Alex Rogozhnikov :
>
>> Hi Francesc,
>> thanks a lot for your reply and for your impressive job on bcolz!
>>
>> Bcolz seems to put its emphasis on compression, which is not of much
>> interest to me, but the *ctable* and chunked operations look very
>> appropriate to me now. (Of course, I'll need to test it a lot before I
>> can say this for sure; that's my current impression.)
>

You can disable compression for bcolz by default too:
http://bcolz.blosc.org/en/latest/defaults.html#list-of-default-values

>
>> The strongest concern with bcolz so far is that it seems to be
>> completely non-trivial to install on Windows systems, while pip provides
>> binaries for most (or all?) OSes for numpy.
>> I didn't build pip binary wheels myself, but is it hard / impossible to
>> cook pip-installable binaries?
>
> http://www.lfd.uci.edu/~gohlke/pythonlibs/#bcolz
> Check if the link solves the issue with installing.
>

Yeah. Also, there are binaries for conda:
http://bcolz.blosc.org/en/latest/install.html#installing-from-conda-forge

>
>> You can change shapes of numpy arrays, but that usually involves copies
>> of the whole container.
>>
>> sure, but this is ok for me, as I plan to organize column editing in
>> 'batches', so this should require seldom copying.
>> It would be nice to see an example to understand how deep I need to go
>> inside numpy.
>

Well, if copying is not a problem for you, then you can just create a new
numpy container and do the copy by yourself.

Francesc
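For the copy route with a structured array, a minimal sketch (field names
and sizes are made up):

```
import numpy as np

old = np.zeros(5, dtype=[('a', int), ('b', float)])
# allocate the new shape/dtype up front, then copy field by field
new = np.zeros(8, dtype=[('a', int), ('b', float), ('c', float)])
for name in old.dtype.names:
    new[name][:len(old)] = old[name]
```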
>> >> >>> >>> Thanks, >>> Alex. >>> >>> >>> >>> 22 ????. 2017 ?., ? 3:53, Nathaniel Smith ???????(?): >>> >>> On Feb 21, 2017 3:24 PM, "Alex Rogozhnikov" >>> wrote: >>> >>> Ah, got it. Thanks, Chris! >>> I thought recarray can be only one-dimensional (like tables with named >>> columns). >>> >>> Maybe it's better to ask directly what I was looking for: >>> something that works like a table with named columns (but no labelling >>> for rows), and keeps data (of different dtypes) in a column-by-column way >>> (and this is numpy, not pandas). >>> >>> Is there such a magic thing? >>> >>> >>> Well, that's what pandas is for... >>> >>> A dict of arrays? >>> >>> -n >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at scipy.org >>> https://mail.scipy.org/mailman/listinfo/numpy-discussion >>> >>> >>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at scipy.org >>> https://mail.scipy.org/mailman/listinfo/numpy-discussion >>> >>> >> >> >> -- >> Francesc Alted >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> https://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> https://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > -- Francesc Alted -------------- next part -------------- An HTML attachment was scrubbed... URL: From harrigan.matthew at gmail.com Wed Feb 22 10:38:45 2017 From: harrigan.matthew at gmail.com (Matthew Harrigan) Date: Wed, 22 Feb 2017 10:38:45 -0500 Subject: [Numpy-discussion] Fortran order in recarray. In-Reply-To: References: <0902C347-89B2-41EE-9367-F0C7A4F864D4@yandex.ru> <917C8C1B-FA4C-4128-9991-82DC6AFC3EAA@yandex.ru> Message-ID: Alex, Can you please post some code showing exactly what you are trying to do and any issues you are having, particularly the "irritating problems with its row indexing and some other problems" you quote above? On Wed, Feb 22, 2017 at 10:34 AM, Robert McLeod wrote: > Just as a note, Appveyor supports uploading modules to "public websites": > > https://packaging.python.org/appveyor/ > > The main issue I would see from this, is the PyPi has my password stored > on my machine in a plain text file. I'm not sure whether there's a way to > provide Appveyor with a SSH key instead. > > On Wed, Feb 22, 2017 at 4:23 PM, Alex Rogozhnikov < > alex.rogozhnikov at yandex.ru> wrote: > >> Hi Francesc, >> thanks a lot for you reply and for your impressive job on bcolz! >> >> Bcolz seems to make stress on compression, which is not of much interest >> for me, but the *ctable*, and chunked operations look very appropriate >> to me now. (Of course, I'll need to test it much before I can say this for >> sure, that's current impression). >> >> The strongest concern with bcolz so far is that it seems to be completely >> non-trivial to install on windows systems, while pip provides binaries for >> most (or all?) OS for numpy. >> I didn't build pip binary wheels myself, but is it hard / impossible to >> cook pip-installabel binaries? >> >> ?You can change shapes of numpy arrays, but that usually involves copies >> of the whole container. 
>> >> sure, but this is ok for me, as I plan to organize column editing in >> 'batches', so this should require seldom copying. >> It would be nice to see an example to understand how deep I need to go >> inside numpy. >> >> Cheers, >> Alex. >> >> >> >> >> 22 ????. 2017 ?., ? 17:03, Francesc Alted ???????(?): >> >> Hi Alex, >> >> 2017-02-22 12:45 GMT+01:00 Alex Rogozhnikov : >> >>> Hi Nathaniel, >>> >>> >>> pandas >>> >>> >>> yup, the idea was to have minimal pandas.DataFrame-like storage (which I >>> was using for a long time), >>> but without irritating problems with its row indexing and some other >>> problems like interaction with matplotlib. >>> >>> A dict of arrays? >>> >>> >>> that's what I've started from and implemented, but at some point I >>> decided that I'm reinventing the wheel and numpy has something already. In >>> principle, I can ignore this 'column-oriented' storage requirement, but >>> potentially it may turn out to be quite slow-ish if dtype's size is large. >>> >>> Suggestions are welcome. >>> >> >> ?You may want to try bcolz: >> >> https://github.com/Blosc/bcolz >> >> bcolz is a columnar storage, basically as you require, but data is >> compressed by default even when stored in-memory (although you can disable >> compression if you want to).? >> >> >> >>> >>> Another strange question: >>> in general, it is considered that once numpy.array is created, it's >>> shape not changed. >>> But if i want to keep the same recarray and change it's dtype and/or >>> shape, is there a way to do this? >>> >> >> ?You can change shapes of numpy arrays, but that usually involves copies >> of the whole container. With bcolz you can change length and add/del >> columns without copies.? If your containers are large, it is better to >> inform bcolz on its final estimated size. See: >> >> http://bcolz.blosc.org/en/latest/opt-tips.html >> >> ?Francesc? >> >> >>> >>> Thanks, >>> Alex. >>> >>> >>> >>> 22 ????. 2017 ?., ? 3:53, Nathaniel Smith ???????(?): >>> >>> On Feb 21, 2017 3:24 PM, "Alex Rogozhnikov" >>> wrote: >>> >>> Ah, got it. Thanks, Chris! >>> I thought recarray can be only one-dimensional (like tables with named >>> columns). >>> >>> Maybe it's better to ask directly what I was looking for: >>> something that works like a table with named columns (but no labelling >>> for rows), and keeps data (of different dtypes) in a column-by-column way >>> (and this is numpy, not pandas). >>> >>> Is there such a magic thing? >>> >>> >>> Well, that's what pandas is for... >>> >>> A dict of arrays? >>> >>> -n >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at scipy.org >>> https://mail.scipy.org/mailman/listinfo/numpy-discussion >>> >>> >>> >>> _______________________________________________ >>> NumPy-Discussion mailing list >>> NumPy-Discussion at scipy.org >>> https://mail.scipy.org/mailman/listinfo/numpy-discussion >>> >>> >> >> >> -- >> Francesc Alted >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> https://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> >> >> _______________________________________________ >> NumPy-Discussion mailing list >> NumPy-Discussion at scipy.org >> https://mail.scipy.org/mailman/listinfo/numpy-discussion >> >> > > > -- > Robert McLeod, Ph.D. 
> Center for Cellular Imaging and Nano Analytics (C-CINA) > Biozentrum der Universit?t Basel > Mattenstrasse 26, 4058 Basel > Work: +41.061.387.3225 <+41%2061%20387%2032%2025> > robert.mcleod at unibas.ch > robert.mcleod at bsse.ethz.ch > robbmcleod at gmail.com > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From harrigan.matthew at gmail.com Wed Feb 22 10:52:15 2017 From: harrigan.matthew at gmail.com (Matthew Harrigan) Date: Wed, 22 Feb 2017 10:52:15 -0500 Subject: [Numpy-discussion] Numpy Development Queries In-Reply-To: References: <8cbf1c9b6c561724e2018d77faadae41@students.iiit.ac.in> Message-ID: Ashwin, I don't know your background but perhaps it is similar to mine. I use numpy extensively in my day job and starting contributing to numpy a few months ago. From using numpy, I found some things that I thought should be added/improved. I researched them and the associated numpy code enough to be confident I would be capable of making the change (some of the C code is intense). Before I got too far along, I posted an issue and got feedback from the experts. Then I did it. On Tue, Feb 21, 2017 at 12:01 AM, Ralf Gommers wrote: > > > On Tue, Feb 21, 2017 at 7:32 AM, ashwin.pathak < > ashwin.pathak at students.iiit.ac.in> wrote: > >> Hello all, >> I am new to this organization and wanted to start with some easy-fix >> issues to get some knowledge about the soruce code. However, the issues >> under easy-fix labels have already been solved or someone is at it. Can >> someone help me find such issues? >> > > Hi Ashwin, welcome. I don't want to seem discouraging, but I do want to > explain that NumPy is significantly harder to get started on than SciPy > (which you've started on already) as a newcomer to the scientific Python > ecosystem. So I'd encourage you to spend some more time on the SciPy issues > - there are more easy-fix ones there, and the process of contributing (pull > requests, reviews, finding your way around the codebase) is similar for the > two projects. > > Cheers, > Ralf > > > > _______________________________________________ > NumPy-Discussion mailing list > NumPy-Discussion at scipy.org > https://mail.scipy.org/mailman/listinfo/numpy-discussion > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From alex.rogozhnikov at yandex.ru Wed Feb 22 11:57:57 2017 From: alex.rogozhnikov at yandex.ru (Alex Rogozhnikov) Date: Wed, 22 Feb 2017 19:57:57 +0300 Subject: [Numpy-discussion] Fortran order in recarray. In-Reply-To: References: <0902C347-89B2-41EE-9367-F0C7A4F864D4@yandex.ru> <917C8C1B-FA4C-4128-9991-82DC6AFC3EAA@yandex.ru> Message-ID: <04A70978-5E6B-43BB-B662-539DC018DAAA@yandex.ru> Hi Matthew, maybe it is not the best place to discuss problems of pandas, but to show that I am not missing something, let's consider a simple example. # simplest DataFrame x = pandas.DataFrame(dict(a=numpy.arange(10), b=numpy.arange(10, 20))) # simplest indexing. Can you predict results without looking at comments? x[:2] # returns two first rows, as expected x[[0, 1]] # returns copy of x, whole dataframe x[numpy.array(2)] # fails with IndexError: indices are out-of-bounds (can you guess why?) 
x[[0, 1], :] # unhashable type: list

Just in case: I know about .loc and .iloc, but when you write code with
many subroutines, you concentrate on numpy inputs, and at some point you
simply *forget* to convert some of the data you operated with to numpy,
and it *continues* to work, but it yields wrong results (you tested
everything, but you tested it with numpy inputs). Checking all the inputs
in each small subroutine is strange.

Ok, a bit more:

x[x['a'] > 5] # works as expected
x[x['a'] > 5, :] # 'Series' objects are mutable, thus they cannot be hashed
lookup = numpy.arange(10)
x[lookup[x['a']] > 5] # works as expected
x[lookup[x['a']] > 5, :] # TypeError: unhashable type: 'numpy.ndarray'

x[lookup]['a'] # IndexError
x['a'][lookup] # works as expected

Now let's go a bit further: train/test splitting the data for machine
learning (again, the most frequent operation)

from sklearn.model_selection import train_test_split
x1, x2 = train_test_split(x, random_state=42)
# compare the next operations with pandas.DataFrame
col = x1['a']
print col[:2] # first two elements
print col[[0, 1]] # doesn't fail (while there is no row with index 0), fills it with NaN
print col[numpy.arange(2)] # same as previous

print col[col > 4] # as expected
print col[col.values > 4] # as expected
print col.values[col > 4] # converts boolean to int, uses int indexing, but at least raises a warning

Mistakes caused by such silent misbehavior are not easy to detect (when
your data pipeline consists of several steps), it is quite hard to locate
the source of the problem, and it is almost impossible to be sure that
you indeed avoided all such caveats. Code review turns into a paranoid
process (if you care about the result, of course).

Things are even worse, because I've demonstrated this for my
installation, and probably if you run this with some other pandas
installation, you get some other results (those were really basic
operations). So things that worked ok in one version may work a different
way in another; this becomes completely intractable.

Pandas may be nice if you need a report, and you need to get it done
tomorrow. Then you'll throw away the code. When we initially used pandas
as the main data storage in yandex/rep, it looked like a good idea, but a
year later it was obvious this was a wrong decision. When you build a
data pipeline / research that should still work several years later
(using some other installation by someone else), usage of pandas should
be *minimal*.

That's why I am looking for a reliable pandas substitute, which should be:
- completely consistent with numpy and should fail when this wasn't
implemented / impossible
- fewer new abstractions; nobody wants to learn
one-more-way-to-manipulate-the-data, specifically other researchers
- it may be less convenient for interactive data munging
- in particular, fewer methods is ok
- written code should be interpretable, and hardly can be misinterpreted
- not super slow; 1-10 gigabyte datasets are a normal situation

Well, that's it.
Sorry for the long letter.

Alex.
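A minimal sketch of the kind of container described above (hypothetical
code, not an existing library) could be as simple as a dict of arrays
with deliberately narrow indexing:

```
import numpy as np

class StrictTable:
    """Named columns over plain ndarrays; numpy-only row selection."""
    def __init__(self, **columns):
        self.columns = {name: np.asarray(col) for name, col in columns.items()}
        if len({len(col) for col in self.columns.values()}) > 1:
            raise ValueError("all columns must have the same length")

    def __getitem__(self, name):
        # columns by string name only; anything else fails loudly
        if not isinstance(name, str):
            raise TypeError("use a column name; rows are selected with .take()")
        return self.columns[name]

    def take(self, indices):
        # rows via numpy semantics: boolean masks or integer arrays
        return StrictTable(**{name: col[indices]
                              for name, col in self.columns.items()})

t = StrictTable(a=np.arange(10), b=np.arange(10, 20))
t2 = t.take(t['a'] > 5)
```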
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 

From josef.pktd at gmail.com  Wed Feb 22 12:39:38 2017
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Wed, 22 Feb 2017 12:39:38 -0500
Subject: [Numpy-discussion] Fortran order in recarray.
From josef.pktd at gmail.com  Wed Feb 22 12:39:38 2017
From: josef.pktd at gmail.com (josef.pktd at gmail.com)
Date: Wed, 22 Feb 2017 12:39:38 -0500
Subject: [Numpy-discussion] Fortran order in recarray.
In-Reply-To: <04A70978-5E6B-43BB-B662-539DC018DAAA@yandex.ru>
References: <0902C347-89B2-41EE-9367-F0C7A4F864D4@yandex.ru>
 <917C8C1B-FA4C-4128-9991-82DC6AFC3EAA@yandex.ru>
 <04A70978-5E6B-43BB-B662-539DC018DAAA@yandex.ru>
Message-ID:

On Wed, Feb 22, 2017 at 11:57 AM, Alex Rogozhnikov <alex.rogozhnikov at yandex.ru> wrote:

> Hi Matthew,
> maybe this is not the best place to discuss problems of pandas, but to
> show that I am not missing something, let's consider a simple example.
>
> # simplest DataFrame
> x = pandas.DataFrame(dict(a=numpy.arange(10), b=numpy.arange(10, 20)))
> # simplest indexing. Can you predict the results without looking at the comments?
> x[:2]              # returns the first two rows, as expected
> x[[0, 1]]          # returns a copy of x, the whole dataframe
> x[numpy.array(2)]  # fails with IndexError: indices are out-of-bounds (can you guess why?)
> x[[0, 1], :]       # unhashable type: list
>
> just in case - I know about .loc and .iloc, but when you write code with
> many subroutines, you concentrate on numpy inputs, and at some point you
> simply *forget* to convert some of the data you operated on to numpy, and
> it *continues* to work, but it yields wrong results (you tested
> everything, but you tested it for numpy). Checking all the inputs in each
> small subroutine is strange.
>
> Ok, a bit more:
>
> x[x['a'] > 5]             # works as expected
> x[x['a'] > 5, :]          # 'Series' objects are mutable, thus they cannot be hashed
> lookup = numpy.arange(10)
> x[lookup[x['a']] > 5]     # works as expected
> x[lookup[x['a']] > 5, :]  # TypeError: unhashable type: 'numpy.ndarray'
>
> x[lookup]['a']  # IndexError
> x['a'][lookup]  # works as expected
>
> Now let's go a bit further: train/test splitting of the data for machine
> learning (again, a very frequent operation)
>
> from sklearn.model_selection import train_test_split
> x1, x2 = train_test_split(x, random_state=42)
> # compare the following operations on the pandas.DataFrame
> col = x1['a']
> print col[:2]               # first two elements
> print col[[0, 1]]           # doesn't fail (while there is no row with index 0), fills it with NaN
> print col[numpy.arange(2)]  # same as previous
>
> print col[col > 4]          # as expected
> print col[col.values > 4]   # as expected
> print col.values[col > 4]   # converts boolean to int, uses int indexing, but at least raises a warning
>
> Mistakes made through such silent misbehavior are not easy to detect
> (when your data pipeline consists of several steps), it is quite hard to
> locate the source of the problem, and it is almost impossible to be sure
> that you indeed avoided all such caveats. Code review turns into a
> paranoid process (if you care about the result, of course).
>
> Things are even worse, because I've demonstrated this for my
> installation, and probably if you run this with some other pandas
> installation, you get some other results (these were really basic
> operations). So things that worked OK in one version may work differently
> in another; this becomes completely intractable.
>
> Pandas may be nice if you need a report, and you need to get it done
> tomorrow. Then you'll throw away the code. When we initially used pandas
> as the main data storage in yandex/rep, it looked like a good idea, but a
> year later it was obvious this was a wrong decision. In cases where you
> build a data pipeline / research that should still be working several
> years later (using some other installation by someone else), the usage of
> pandas should be *minimal*.
>
> That's why I am looking for a reliable pandas substitute, which should be:
> - completely consistent with numpy, and it should fail when something
>   isn't implemented / is impossible
> - fewer new abstractions; nobody wants to learn
>   one-more-way-to-manipulate-the-data, specifically other researchers
> - it may be less convenient for interactive data munging
>   - in particular, fewer methods is ok
> - written code should be interpretable, and can hardly be misinterpreted
> - not super slow; 1-10 gigabyte datasets are a normal situation

Just on the pandas part:

statsmodels has supported pandas almost from the very beginning (or maybe
after 1.5 years), when the new pandas was still very young.

However, what I insisted on is that pandas stays in the wrapper/interface
code, and internally only numpy arrays are used. Besides the confusing
"magic" indexing of early pandas, there were a lot of details that
silently produced different results, e.g. default iteration on axis=1,
and ddof=1 in std and var instead of numpy's ddof=0.

Essentially, every interface corresponds to an np.asarray, but we store
the DataFrame information, mainly the index and column names, so we can
return the appropriate pandas object if a pandas object was used for the
input.

This has worked pretty well. Users can have their dataframes, and we have
pure numpy algorithms.

Recently we have started to use pandas inside a few functions or classes
that are less tightly integrated into the overall setup. We also use
pandas for some things that are not convenient or not available in numpy.
Our internal use of pandas groupby and similar will most likely increase
over time.
(One of the main issues we had was date and time indexes, because those
were a moving target in both numpy and pandas.)

One issue for computational efficiency that we do not control is whether
`asarray` creates a view or needs to make a copy: that depends on whether
the dtype and memory layout the user has in the data frame correspond to
what we need in the algorithms. If they match, then no copies should be
made except where explicitly needed.

The intention is to extend this over time to other array structures like
xarray and likely dask arrays.

Josef


> Well, that's it.
> Sorry for the large letter.
>
> Alex.
>
>> On 22 Feb 2017, at 18:38, Matthew Harrigan wrote:
>>
>> Alex,
>>
>> Can you please post some code showing exactly what you are trying to do
>> and any issues you are having, particularly the "irritating problems
>> with its row indexing and some other problems" you quote above?
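[Editor's example] A minimal sketch of the interface pattern described
above (the `demean` function is hypothetical, not statsmodels code):
convert at the boundary with np.asarray, compute on plain ndarrays, and
restore the index and column names on the way out.

    import numpy as np
    import pandas as pd

    def demean(data):
        """Subtract column means; pandas in, pandas out; numpy inside."""
        is_pandas = isinstance(data, pd.DataFrame)
        index = data.index if is_pandas else None
        columns = data.columns if is_pandas else None

        arr = np.asarray(data, dtype=float)  # a view if dtype/layout already match
        result = arr - arr.mean(axis=0)      # pure-numpy algorithm

        if is_pandas:
            return pd.DataFrame(result, index=index, columns=columns)
        return result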
From shoyer at gmail.com  Wed Feb 22 12:55:07 2017
From: shoyer at gmail.com (Stephan Hoyer)
Date: Wed, 22 Feb 2017 09:55:07 -0800
Subject: [Numpy-discussion] Fortran order in recarray.
In-Reply-To: <04A70978-5E6B-43BB-B662-539DC018DAAA@yandex.ru>
References: <0902C347-89B2-41EE-9367-F0C7A4F864D4@yandex.ru>
 <917C8C1B-FA4C-4128-9991-82DC6AFC3EAA@yandex.ru>
 <04A70978-5E6B-43BB-B662-539DC018DAAA@yandex.ru>
Message-ID:

On Wed, Feb 22, 2017 at 8:57 AM, Alex Rogozhnikov <alex.rogozhnikov at yandex.ru> wrote:

> Pandas may be nice if you need a report, and you need to get it done
> tomorrow. Then you'll throw away the code. When we initially used pandas
> as the main data storage in yandex/rep, it looked like a good idea, but a
> year later it was obvious this was a wrong decision. In cases where you
> build a data pipeline / research that should still be working several
> years later (using some other installation by someone else), the usage of
> pandas should be *minimal*.

The pandas development team (myself included) is well aware of these
issues. There are long-term plans/hopes to fix this, but there's a lot of
work to be done and some hard choices to make:
https://github.com/pandas-dev/pandas/issues/10000
https://github.com/pandas-dev/pandas/issues/13862

> That's why I am looking for a reliable pandas substitute, which should be:
> - completely consistent with numpy, and it should fail when something
>   isn't implemented / is impossible
> - fewer new abstractions; nobody wants to learn
>   one-more-way-to-manipulate-the-data, specifically other researchers
> - it may be less convenient for interactive data munging
>   - in particular, fewer methods is ok
> - written code should be interpretable, and can hardly be misinterpreted
> - not super slow; 1-10 gigabyte datasets are a normal situation

This has some overlap with our motivations for writing Xarray
(http://xarray.pydata.org), so I encourage you to take a look. It still
might be more complex than you're looking for, but we did try to clean up
the really ambiguous APIs from pandas, like indexing.

From alex.rogozhnikov at yandex.ru  Wed Feb 22 13:02:17 2017
From: alex.rogozhnikov at yandex.ru (Alex Rogozhnikov)
Date: Wed, 22 Feb 2017 21:02:17 +0300
Subject: [Numpy-discussion] Fortran order in recarray.
In-Reply-To:
References: <0902C347-89B2-41EE-9367-F0C7A4F864D4@yandex.ru>
 <917C8C1B-FA4C-4128-9991-82DC6AFC3EAA@yandex.ru>
 <04A70978-5E6B-43BB-B662-539DC018DAAA@yandex.ru>
Message-ID: <50E42B63-53F4-4D73-BAB5-97B1751C3D6B@yandex.ru>

> On 22 Feb 2017, at 20:39, josef.pktd at gmail.com wrote:
>
> Essentially, every interface corresponds to an np.asarray, but we store
> the DataFrame information, mainly the index and column names, so we can
> return the appropriate pandas object if a pandas object was used for the
> input.
Yes, it seems to be the best practice. But apart from libraries, there is
a lot of code for my research / my team's research, and we don't make such
checks all the time; moreover, many functions are intended to operate on
DataFrames (and use particular feature names). So the approach is not
fully applicable to research code, which is very diverse and has many
functions that are used only two or three times. It is irrational to make
all the code more complex to protect yourself from one library, because
all the benefit is lost (and as the user of a package you will need input
checks anyway, to protect against passing something inappropriate).
From alex.rogozhnikov at yandex.ru  Wed Feb 22 13:24:00 2017
From: alex.rogozhnikov at yandex.ru (Alex Rogozhnikov)
Date: Wed, 22 Feb 2017 21:24:00 +0300
Subject: [Numpy-discussion] Fortran order in recarray.
In-Reply-To:
References: <0902C347-89B2-41EE-9367-F0C7A4F864D4@yandex.ru>
 <917C8C1B-FA4C-4128-9991-82DC6AFC3EAA@yandex.ru>
 <04A70978-5E6B-43BB-B662-539DC018DAAA@yandex.ru>
Message-ID: <77BC87D6-D986-4E13-AC02-8303CD519A70@yandex.ru>

Hi Stephan,
thanks for the note. The progress over the last two years wasn't
impressive IMO, but I hope you'll manage. As you suggest, I'll have a look
at xarray too, since I see xarray.Dataset. I was sure that it doesn't work
with non-homogeneous data at all; clearly I need to refresh my opinion.

> On 22 Feb 2017, at 20:55, Stephan Hoyer wrote:
>
> This has some overlap with our motivations for writing Xarray
> (http://xarray.pydata.org), so I encourage you to take a look. It still
> might be more complex than you're looking for, but we did try to clean up
> the really ambiguous APIs from pandas, like indexing.
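[Editor's example] As a quick illustration of the point about
non-homogeneous data: an xarray Dataset can hold variables of different
dtypes along a shared dimension. A minimal sketch (names are made up):

    import numpy as np
    import xarray as xr

    ds = xr.Dataset({
        'a': ('row', np.arange(10)),             # integer column
        'b': ('row', np.linspace(0.0, 1.0, 10))  # float column
    })
    print(ds['a'].dtype, ds['b'].dtype)  # int64 float64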
From m.h.vankerkwijk at gmail.com  Wed Feb 22 17:27:58 2017
From: m.h.vankerkwijk at gmail.com (Marten van Kerkwijk)
Date: Wed, 22 Feb 2017 17:27:58 -0500
Subject: [Numpy-discussion] __numpy_ufunc__
In-Reply-To:
References:
Message-ID:

Hi Stephan,

Indeed, `__array_ufunc__` is None would be for classes that interact with
arrays only as if they were any other numeric type, and thus have no use
for ufuncs, but may need normal operations (astropy's `Unit` class is a
reasonably good example). Your example also makes clear that, indeed,
setting __array__ or __array_ufunc__ to None implies different things, so
concretely the proposal here is that if `__array_ufunc__` is None, ndarray
methods will return `NotImplemented`.

As an aside, I think that if we do not have something like that, we'll be
stuck with supporting `__array_priority__`. (Which is OK by me too, but it
might as well be a conscious choice.)

All the best,

Marten

From evgeny.burovskiy at gmail.com  Fri Feb 24 09:36:51 2017
From: evgeny.burovskiy at gmail.com (Evgeni Burovski)
Date: Fri, 24 Feb 2017 17:36:51 +0300
Subject: [Numpy-discussion] ANN: Scipy 0.19.0 release candidate 2
Message-ID:

Hi,

I'm pleased to announce the availability of the second release candidate
for Scipy 0.19.0. It contains contributions from 121 people over the
course of seven months.

The main difference from rc1 is that several Windows-specific issues have
been fixed (special thanks to Christoph Gohlke).

Please try this release and report any issues on the GitHub tracker,
https://github.com/scipy/scipy, or on the scipy-dev mailing list.

Source tarballs and release notes are available from the GitHub releases
page, https://github.com/scipy/scipy/releases/tag/v0.19.0rc2

We would appreciate it if you could both run the self-tests on your
hardware and test your code against this release. If no issues are
reported, this will graduate to Scipy 0.19.0 final on the 9th of March
2017.

Thanks to everyone who contributed to this release!
Cheers,

Evgeni


==========================
SciPy 0.19.0 Release Notes
==========================

.. note:: Scipy 0.19.0 is not released yet!

.. contents::

SciPy 0.19.0 is the culmination of seven months of hard work. It contains
many new features, numerous bug-fixes, improved test coverage and better
documentation. There have been a number of deprecations and API changes in
this release, which are documented below. All users are encouraged to
upgrade to this release, as there are a large number of bug-fixes and
optimizations. Moreover, our development attention will now shift to
bug-fix releases on the 0.19.x branch, and to adding new features on the
master branch.

This release requires Python 2.7 or 3.4-3.6 and NumPy 1.8.2 or greater.

Highlights of this release include:

- A unified foreign function interface layer, `scipy.LowLevelCallable`.
- Cython API for scalar, typed versions of the universal functions from
  the `scipy.special` module, via `cimport scipy.special.cython_special`.

New features
============

Foreign function interface improvements
---------------------------------------

`scipy.LowLevelCallable` provides a new unified interface for wrapping
low-level compiled callback functions in the Python space. It supports
Cython imported "api" functions, ctypes function pointers, CFFI function
pointers, ``PyCapsules``, Numba jitted functions and more. See
`gh-6509 `_ for details.

`scipy.linalg` improvements
---------------------------

The function `scipy.linalg.solve` obtained two more keywords, ``assume_a``
and ``transposed``. The underlying LAPACK routines are replaced with
"expert" versions and now can also be used to solve symmetric, hermitian
and positive definite coefficient matrices. Moreover, ill-conditioned
matrices now cause a warning to be emitted with the estimated condition
number information. The old ``sym_pos`` keyword is kept for backwards
compatibility reasons; however, it is identical to using
``assume_a='pos'``. Moreover, the ``debug`` keyword, which had no function
except printing the ``overwrite_`` values, is deprecated.

The function `scipy.linalg.matrix_balance` was added to perform the
so-called matrix balancing using the LAPACK xGEBAL routine family. This
can be used to approximately equate the row and column norms through
diagonal similarity transformations.

The functions `scipy.linalg.solve_continuous_are` and
`scipy.linalg.solve_discrete_are` have numerically more stable algorithms.
These functions can also solve generalized algebraic matrix Riccati
equations. Moreover, both gained a ``balanced`` keyword to turn balancing
on and off.

`scipy.spatial` improvements
----------------------------

`scipy.spatial.SphericalVoronoi.sort_vertices_of_regions` has been
re-written in Cython to improve performance.

`scipy.spatial.SphericalVoronoi` can handle > 200 k points (at least 10
million) and has improved performance.

The function `scipy.spatial.distance.directed_hausdorff` was added to
calculate the directed Hausdorff distance.

The ``count_neighbors`` method of `scipy.spatial.cKDTree` gained the
ability to perform weighted pair counting via the new keywords ``weights``
and ``cumulative``. See `gh-5647 `_ for details.

`scipy.spatial.distance.pdist` and `scipy.spatial.distance.cdist` now
support non-double custom metrics.
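[Editor's example] A small usage sketch of the new ``assume_a`` keyword
described above (illustrative; the matrix is constructed to be symmetric
positive definite)::

    import numpy as np
    from scipy.linalg import solve

    rng = np.random.RandomState(0)
    a = rng.rand(4, 4)
    spd = np.dot(a, a.T) + 4 * np.eye(4)   # symmetric positive definite
    b = rng.rand(4)

    x = solve(spd, b, assume_a='pos')      # uses the "expert" positive-definite path
    print(np.allclose(np.dot(spd, x), b))  # True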
`scipy.ndimage` improvements
----------------------------

The callback function C API supports PyCapsules in Python 2.7.

Multidimensional filters now allow having different extrapolation modes
for different axes.

`scipy.optimize` improvements
-----------------------------

The `scipy.optimize.basinhopping` global minimizer obtained a new keyword,
`seed`, which can be used to seed the random number generator and obtain
repeatable minimizations.

The keyword `sigma` in `scipy.optimize.curve_fit` was overloaded to also
accept the covariance matrix of errors in the data.

`scipy.signal` improvements
---------------------------

The functions `scipy.signal.correlate` and `scipy.signal.convolve` have a
new optional parameter `method`. The default value of `auto` estimates the
faster of the two computation methods, the direct approach and the Fourier
transform approach.

A new function has been added to choose the convolution/correlation
method, `scipy.signal.choose_conv_method`, which may be appropriate if
convolutions or correlations are performed on many arrays of the same
size.

New functions have been added to calculate complex short-time Fourier
transforms of an input signal, and to invert the transform to recover the
original signal: `scipy.signal.stft` and `scipy.signal.istft`. This
implementation also fixes the previously incorrect output of
`scipy.signal.spectrogram` when complex output data were requested.

The function `scipy.signal.sosfreqz` was added to compute the frequency
response from second-order sections.

The function `scipy.signal.unit_impulse` was added to conveniently
generate an impulse function.

The function `scipy.signal.iirnotch` was added to design second-order IIR
notch filters that can be used to remove a frequency component from a
signal. The dual function `scipy.signal.iirpeak` was added to compute the
coefficients of a second-order IIR peak (resonant) filter.

The function `scipy.signal.minimum_phase` was added to convert
linear-phase FIR filters to minimum phase.

The functions `scipy.signal.upfirdn` and `scipy.signal.resample_poly` are
now substantially faster when operating on some n-dimensional arrays when
n > 1. The largest reduction in computation time is realized in cases
where the size of the array is small (<1k samples or so) along the axis to
be filtered.

`scipy.fftpack` improvements
----------------------------

Fast Fourier transform routines now accept `np.float16` inputs and upcast
them to `np.float32`. Previously, they would raise an error.

`scipy.cluster` improvements
----------------------------

Methods ``"centroid"`` and ``"median"`` of
`scipy.cluster.hierarchy.linkage` have been significantly sped up.
Long-standing issues with using ``linkage`` on large input data (over 16
GB) have been resolved.

`scipy.sparse` improvements
---------------------------

The functions `scipy.sparse.save_npz` and `scipy.sparse.load_npz` were
added, providing simple serialization for some sparse formats.

The `prune` method of classes `bsr_matrix`, `csc_matrix`, and `csr_matrix`
was updated to reallocate backing arrays under certain conditions,
reducing memory usage.

The methods `argmin` and `argmax` were added to classes `coo_matrix`,
`csc_matrix`, `csr_matrix`, and `bsr_matrix`.

New function `scipy.sparse.csgraph.structural_rank` computes the
structural rank of a graph with a given sparsity pattern.

New function `scipy.sparse.linalg.spsolve_triangular` solves a sparse
linear system with a triangular left-hand-side matrix.
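[Editor's example] A round-trip sketch of the new serialization helpers
mentioned above (illustrative)::

    import numpy as np
    import scipy.sparse

    m = scipy.sparse.csr_matrix(np.eye(3))
    scipy.sparse.save_npz('matrix.npz', m)   # new in 0.19.0
    m2 = scipy.sparse.load_npz('matrix.npz')
    print(np.allclose(m.toarray(), m2.toarray()))  # True: round-trip preserved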
`scipy.special` improvements
----------------------------

Scalar, typed versions of universal functions from `scipy.special` are
available in the Cython space via ``cimport`` from the new module
`scipy.special.cython_special`. These scalar functions can be expected to
be significantly faster than the universal functions for scalar arguments.
See the `scipy.special` tutorial for details.

Better control over special-function errors is offered by the functions
`scipy.special.geterr` and `scipy.special.seterr` and the context manager
`scipy.special.errstate`.

The names of orthogonal polynomial root functions have been changed to be
consistent with other functions relating to orthogonal polynomials. For
example, `scipy.special.j_roots` has been renamed
`scipy.special.roots_jacobi` for consistency with the related functions
`scipy.special.jacobi` and `scipy.special.eval_jacobi`. To preserve
backward compatibility, the old names have been left as aliases.

The Wright Omega function is implemented as `scipy.special.wrightomega`.

`scipy.stats` improvements
--------------------------

The function `scipy.stats.weightedtau` was added. It provides a weighted
version of Kendall's tau.

New class `scipy.stats.multinomial` implements the multinomial
distribution.

New class `scipy.stats.rv_histogram` constructs a continuous univariate
distribution with a piecewise linear CDF from a binned data sample.

New class `scipy.stats.argus` implements the Argus distribution.

`scipy.interpolate` improvements
--------------------------------

New class `scipy.interpolate.BSpline` represents splines. ``BSpline``
objects contain knots and coefficients and can evaluate the spline. The
format is consistent with FITPACK, so that one can do, for example::

    >>> t, c, k = splrep(x, y, s=0)
    >>> spl = BSpline(t, c, k)
    >>> np.allclose(spl(x), y)

``spl*`` functions, `scipy.interpolate.splev`, `scipy.interpolate.splint`,
`scipy.interpolate.splder` and `scipy.interpolate.splantider`, accept both
``BSpline`` objects and ``(t, c, k)`` tuples for backwards compatibility.

For multidimensional splines, ``c.ndim > 1``, ``BSpline`` objects are
consistent with piecewise polynomials, `scipy.interpolate.PPoly`. This
means that ``BSpline`` objects are not immediately consistent with
`scipy.interpolate.splprep`, and one *cannot* do
``>>> BSpline(*splprep([x, y])[0])``. Consult the `scipy.interpolate` test
suite for examples of the precise equivalence.

In new code, prefer using ``scipy.interpolate.BSpline`` objects instead of
manipulating ``(t, c, k)`` tuples directly.

New function `scipy.interpolate.make_interp_spline` constructs an
interpolating spline given data points and boundary conditions.

New function `scipy.interpolate.make_lsq_spline` constructs a
least-squares spline approximation given data points.

`scipy.integrate` improvements
------------------------------

`scipy.integrate.fixed_quad` now supports vector-valued functions.

Deprecated features
===================

`scipy.interpolate.splmake`, `scipy.interpolate.spleval` and
`scipy.interpolate.spline` are deprecated. The format used by
`splmake/spleval` was inconsistent with `splrep/splev`, which was
confusing to users.

`scipy.special.errprint` is deprecated. Improved functionality is
available in `scipy.special.seterr`.

Calling `scipy.spatial.distance.pdist` or `scipy.spatial.distance.cdist`
with arguments not needed by the chosen metric is deprecated. Also,
metrics `"old_cosine"` and `"old_cos"` are deprecated.
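[Editor's example] Expanding the ``BSpline`` snippet above into a runnable
example (illustrative data)::

    import numpy as np
    from scipy.interpolate import splrep, BSpline

    x = np.linspace(0, 2 * np.pi, 20)
    y = np.sin(x)

    t, c, k = splrep(x, y, s=0)    # FITPACK-style knots, coefficients, degree
    spl = BSpline(t, c, k)         # the new object-oriented wrapper
    print(np.allclose(spl(x), y))  # True: the spline interpolates the data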
Backwards incompatible changes
==============================

The deprecated ``scipy.weave`` submodule was removed.

`scipy.spatial.distance.squareform` now returns arrays of the same dtype
as the input, instead of always float64.

`scipy.special.errprint` now returns a boolean.

The function `scipy.signal.find_peaks_cwt` now returns an array instead of
a list.

`scipy.stats.kendalltau` now computes the correct p-value in case the
input contains ties. The p-value is also identical to that computed by
`scipy.stats.mstats.kendalltau` and by R. If the input does not contain
ties there is no change w.r.t. the previous implementation.

The function `scipy.linalg.block_diag` will not ignore zero-sized matrices
anymore. Instead it will insert rows or columns of zeros of the
appropriate size. See gh-4908 for more details.

Other changes
=============

SciPy wheels will now report their dependency on ``numpy`` on all
platforms. This change was made because Numpy wheels are available, and
because the pip upgrade behavior is finally changing for the better (use
``--upgrade-strategy=only-if-needed`` for ``pip >= 8.2``; that behavior
will become the default in the next major version of ``pip``).

Numerical values returned by `scipy.interpolate.interp1d` with
``kind="cubic"`` and ``"quadratic"`` may change relative to previous scipy
versions. If your code depended on specific numeric values (i.e., on
implementation details of the interpolators), you may want to double-check
your results.

Authors
=======

* @endolith
* Max Argus +
* Hervé Audren
* Alessandro Pietro Bardelli +
* Michael Benfield +
* Felix Berkenkamp
* Matthew Brett
* Per Brodtkorb
* Evgeni Burovski
* Pierre de Buyl
* CJ Carey
* Brandon Carter +
* Tim Cera
* Klesk Chonkin
* Christian Häggström +
* Luca Citi
* Peadar Coyle +
* Daniel da Silva +
* Greg Dooper +
* John Draper +
* drlvk +
* David Ellis +
* Yu Feng
* Baptiste Fontaine +
* Jed Frey +
* Siddhartha Gandhi +
* GiggleLiu +
* Wim Glenn +
* Akash Goel +
* Christoph Gohlke
* Ralf Gommers
* Alexander Goncearenco +
* Richard Gowers +
* Alex Griffing
* Radoslaw Guzinski +
* Charles Harris
* Callum Jacob Hays +
* Ian Henriksen
* Randy Heydon +
* Lindsey Hiltner +
* Gerrit Holl +
* Hiroki IKEDA +
* jfinkels +
* Mher Kazandjian +
* Thomas Keck +
* keuj6 +
* Kornel Kielczewski +
* Sergey B Kirpichev +
* Vasily Kokorev +
* Eric Larson
* Denis Laxalde
* Gregory R. Lee
* Josh Lefler +
* Julien Lhermitte +
* Evan Limanto +
* Nikolay Mayorov
* Geordie McBain +
* Josue Melka +
* Matthieu Melot
* michaelvmartin15 +
* Surhud More +
* Brett M.
Morris + * Chris Mutel + * Paul Nation * Andrew Nelson * David Nicholson + * Aaron Nielsen + * Joel Nothman * nrnrk + * Juan Nunez-Iglesias * Mikhail Pak + * Gavin Parnaby + * Thomas Pingel + * Ilhan Polat + * Aman Pratik + * Sebastian Pucilowski * Ted Pudlik * puenka + * Eric Quintero * Tyler Reddy * Joscha Reimer * Antonio Horta Ribeiro + * Edward Richards + * Roman Ring + * Rafael Rossi + * Colm Ryan + * Sami Salonen + * Alvaro Sanchez-Gonzalez + * Johannes Schmitz * Kari Schoonbee * Yurii Shevchuk + * Jonathan Siebert + * Jonathan Tammo Siebert + * Scott Sievert + * Sourav Singh * Byron Smith + * Srikiran + * Samuel St-Jean + * Yoni Teitelbaum + * Bhavika Tekwani * Martin Thoma * timbalam + * Svend Vanderveken + * Sebastiano Vigna + * Aditya Vijaykumar + * Santi Villalba + * Ze Vinicius * Pauli Virtanen * Matteo Visconti * Yusuke Watanabe + * Warren Weckesser * Phillip Weinberg + * Nils Werner * Jakub Wilk * Josh Wilson * wirew0rm + * David Wolever + * Nathan Woods * ybeltukov + * G Young * Evgeny Zhurko + A total of 121 people contributed to this release. People with a "+" by their names contributed a patch for the first time. This list of names is automatically generated, and may not be fully complete. Issues closed for 0.19.0 - ------------------------ - - `#1767 `__: Function definitions in __fitpack.h should be moved. (Trac #1240) - - `#1774 `__: _kmeans chokes on large thresholds (Trac #1247) - - `#2089 `__: Integer overflows cause segfault in linkage function with large... - - `#2190 `__: Are odd-length window functions supposed to be always symmetrical?... - - `#2251 `__: solve_discrete_are in scipy.linalg does (sometimes) not solve... - - `#2580 `__: scipy.interpolate.UnivariateSpline (or a new superclass of it)... - - `#2592 `__: scipy.stats.anderson assumes gumbel_l - - `#3054 `__: scipy.linalg.eig does not handle infinite eigenvalues - - `#3160 `__: multinomial pmf / logpmf - - `#3904 `__: scipy.special.ellipj dn wrong values at quarter period - - `#4044 `__: Inconsistent code book initialization in kmeans - - `#4234 `__: scipy.signal.flattop documentation doesn't list a source for... - - `#4831 `__: Bugs in C code in __quadpack.h - - `#4908 `__: bug: unnessesary validity check for block dimension in scipy.sparse.block_diag - - `#4917 `__: BUG: indexing error for sparse matrix with ix_ - - `#4938 `__: Docs on extending ndimage need to be updated. - - `#5056 `__: sparse matrix element-wise multiplying dense matrix returns dense... - - `#5337 `__: Formula in documentation for correlate is wrong - - `#5537 `__: use OrderedDict in io.netcdf - - `#5750 `__: [doc] missing data index value in KDTree, cKDTree - - `#5755 `__: p-value computation in scipy.stats.kendalltau() in broken in... - - `#5757 `__: BUG: Incorrect complex output of signal.spectrogram - - `#5964 `__: ENH: expose scalar versions of scipy.special functions to cython - - `#6107 `__: scipy.cluster.hierarchy.single segmentation fault with 2**16... - - `#6278 `__: optimize.basinhopping should take a RandomState object - - `#6296 `__: InterpolatedUnivariateSpline: check_finite fails when w is unspecified - - `#6306 `__: Anderson-Darling bad results - - `#6314 `__: scipy.stats.kendaltau() p value not in agreement with R, SPSS... - - `#6340 `__: Curve_fit bounds and maxfev - - `#6377 `__: expm_multiply, complex matrices not working using start,stop,ect... - - `#6382 `__: optimize.differential_evolution stopping criterion has unintuitive... - - `#6391 `__: Global Benchmarking times out at 600s. 
- - `#6397 `__: mmwrite errors with large (but still 64-bit) integers - - `#6413 `__: scipy.stats.dirichlet computes multivariate gaussian differential... - - `#6428 `__: scipy.stats.mstats.mode modifies input - - `#6440 `__: Figure out ABI break policy for scipy.special Cython API - - `#6441 `__: Using Qhull for halfspace intersection : segfault - - `#6442 `__: scipy.spatial : In incremental mode volume is not recomputed - - `#6451 `__: Documentation for scipy.cluster.hierarchy.to_tree is confusing... - - `#6490 `__: interp1d (kind=zero) returns wrong value for rightmost interpolation... - - `#6521 `__: scipy.stats.entropy does *not* calculate the KL divergence - - `#6530 `__: scipy.stats.spearmanr unexpected NaN handling - - `#6541 `__: Test runner does not run scipy._lib/tests? - - `#6552 `__: BUG: misc.bytescale returns unexpected results when using cmin/cmax... - - `#6556 `__: RectSphereBivariateSpline(u, v, r) fails if min(v) >= pi - - `#6559 `__: Differential_evolution maxiter causing memory overflow - - `#6565 `__: Coverage of spectral functions could be improved - - `#6628 `__: Incorrect parameter name in binomial documentation - - `#6634 `__: Expose LAPACK's xGESVX family for linalg.solve ill-conditioned... - - `#6657 `__: Confusing documentation for `scipy.special.sph_harm` - - `#6676 `__: optimize: Incorrect size of Jacobian returned by `minimize(...,... - - `#6681 `__: add a new context manager to wrap `scipy.special.seterr` - - `#6700 `__: BUG: scipy.io.wavfile.read stays in infinite loop, warns on wav... - - `#6721 `__: scipy.special.chebyt(N) throw a 'TypeError' when N > 64 - - `#6727 `__: Documentation for scipy.stats.norm.fit is incorrect - - `#6764 `__: Documentation for scipy.spatial.Delaunay is partially incorrect - - `#6811 `__: scipy.spatial.SphericalVoronoi fails for large number of points - - `#6841 `__: spearmanr fails when nan_policy='omit' is set - - `#6869 `__: Currently in gaussian_kde, the logpdf function is calculated... - - `#6875 `__: SLSQP inconsistent handling of invalid bounds - - `#6876 `__: Python stopped working (Segfault?) with minimum/maximum filter... - - `#6889 `__: dblquad gives different results under scipy 0.17.1 and 0.18.1 - - `#6898 `__: BUG: dblquad ignores error tolerances - - `#6901 `__: Solving sparse linear systems in CSR format with complex values - - `#6903 `__: issue in spatial.distance.pdist docstring - - `#6917 `__: Problem in passing drop_rule to scipy.sparse.linalg.spilu - - `#6926 `__: signature mismatches for LowLevelCallable - - `#6961 `__: Scipy contains shebang pointing to /usr/bin/python and /bin/bash... 
- - `#6972 `__: BUG: special: `generate_ufuncs.py` is broken - - `#6984 `__: Assert raises test failure for test_ill_condition_warning - - `#6990 `__: BUG: sparse: Bad documentation of the `k` argument in `sparse.linalg.eigs` - - `#6991 `__: Division by zero in linregress() - - `#7011 `__: possible speed improvment in rv_continuous.fit() - - `#7015 `__: Test failure with Python 3.5 and numpy master Pull requests for 0.19.0 - ------------------------ - - `#2908 `__: Scipy 1.0 Roadmap - - `#3174 `__: add b-splines - - `#4606 `__: ENH: Add a unit impulse waveform function - - `#5608 `__: Adds keyword argument to choose faster convolution method - - `#5647 `__: ENH: Faster count_neighour in cKDTree / + weighted input data - - `#6021 `__: Netcdf append - - `#6058 `__: ENH: scipy.signal - Add stft and istft - - `#6059 `__: ENH: More accurate signal.freqresp for zpk systems - - `#6195 `__: ENH: Cython interface for special - - `#6234 `__: DOC: Fixed a typo in ward() help - - `#6261 `__: ENH: add docstring and clean up code for signal.normalize - - `#6270 `__: MAINT: special: add tests for cdflib - - `#6271 `__: Fix for scipy.cluster.hierarchy.is_isomorphic - - `#6273 `__: optimize: rewrite while loops as for loops - - `#6279 `__: MAINT: Bessel tweaks - - `#6291 `__: Fixes gh-6219: remove runtime warning from genextreme distribution - - `#6294 `__: STY: Some PEP8 and cleaning up imports in stats/_continuous_distns.py - - `#6297 `__: Clarify docs in misc/__init__.py - - `#6300 `__: ENH: sparse: Loosen input validation for `diags` with empty inputs - - `#6301 `__: BUG: standardizes check_finite behavior re optional weights,... - - `#6303 `__: Fixing example in _lazyselect docstring. - - `#6307 `__: MAINT: more improvements to gammainc/gammaincc - - `#6308 `__: Clarified documentation of hypergeometric distribution. - - `#6309 `__: BUG: stats: Improve calculation of the Anderson-Darling statistic. - - `#6315 `__: ENH: Descending order of x in PPoly - - `#6317 `__: ENH: stats: Add support for nan_policy to stats.median_test - - `#6321 `__: TST: fix a typo in test name - - `#6328 `__: ENH: sosfreqz - - `#6335 `__: Define LinregressResult outside of linregress - - `#6337 `__: In anderson test, added support for right skewed gumbel distribution. - - `#6341 `__: Accept several spellings for the curve_fit max number of function... - - `#6342 `__: DOC: cluster: clarify hierarchy.linkage usage - - `#6352 `__: DOC: removed brentq from its own 'see also' - - `#6362 `__: ENH: stats: Use explicit formulas for sf, logsf, etc in weibull... - - `#6369 `__: MAINT: special: add a comment to hyp0f1_complex - - `#6375 `__: Added the multinomial distribution. - - `#6387 `__: MAINT: special: improve accuracy of ellipj's `dn` at quarter... - - `#6388 `__: BenchmarkGlobal - getting it to work in Python3 - - `#6394 `__: ENH: scipy.sparse: add save and load functions for sparse matrices - - `#6400 `__: MAINT: moves global benchmark run from setup_cache to track_all - - `#6403 `__: ENH: seed kwd for basinhopping. Closes #6278 - - `#6404 `__: ENH: signal: added irrnotch and iirpeak functions. - - `#6406 `__: ENH: special: extend `sici`/`shichi` to complex arguments - - `#6407 `__: ENH: Window functions should not accept non-integer or negative... 
- - `#6408 `__: MAINT: _differentialevolution now uses _lib._util.check_random_state - - `#6427 `__: MAINT: Fix gmpy build & test that mpmath uses gmpy - - `#6439 `__: MAINT: ndimage: update callback function c api - - `#6443 `__: BUG: Fix volume computation in incremental mode - - `#6447 `__: Fixes issue #6413 - Minor documentation fix in the entropy function... - - `#6448 `__: ENH: Add halfspace mode to Qhull - - `#6449 `__: ENH: rtol and atol for differential_evolution termination fixes... - - `#6453 `__: DOC: Add some See Also links between similar functions - - `#6454 `__: DOC: linalg: clarify callable signature in `ordqz` - - `#6457 `__: ENH: spatial: enable non-double dtypes in squareform - - `#6459 `__: BUG: Complex matrices not handled correctly by expm_multiply... - - `#6465 `__: TST DOC Window docs, tests, etc. - - `#6469 `__: ENH: linalg: better handling of infinite eigenvalues in `eig`/`eigvals` - - `#6475 `__: DOC: calling interp1d/interp2d with NaNs is undefined - - `#6477 `__: Document magic numbers in optimize.py - - `#6481 `__: TST: Supress some warnings from test_windows - - `#6485 `__: DOC: spatial: correct typo in procrustes - - `#6487 `__: Fix Bray-Curtis formula in pdist docstring - - `#6493 `__: ENH: Add covariance functionality to scipy.optimize.curve_fit - - `#6494 `__: ENH: stats: Use log1p() to improve some calculations. - - `#6495 `__: BUG: Use MST algorithm instead of SLINK for single linkage clustering - - `#6497 `__: MRG: Add minimum_phase filter function - - `#6505 `__: reset scipy.signal.resample window shape to 1-D - - `#6507 `__: BUG: linkage: Raise exception if y contains non-finite elements - - `#6509 `__: ENH: _lib: add common machinery for low-level callback functions - - `#6520 `__: scipy.sparse.base.__mul__ non-numpy/scipy objects with 'shape'... - - `#6522 `__: Replace kl_div by rel_entr in entropy - - `#6524 `__: DOC: add next_fast_len to list of functions - - `#6527 `__: DOC: Release notes to reflect the new covariance feature in optimize.curve_fit - - `#6532 `__: ENH: Simplify _cos_win, document it, add symmetric/periodic arg - - `#6535 `__: MAINT: sparse.csgraph: updating old cython loops - - `#6540 `__: DOC: add to documentation of orthogonal polynomials - - `#6544 `__: TST: Ensure tests for scipy._lib are run by scipy.test() - - `#6546 `__: updated docstring of stats.linregress - - `#6553 `__: commited changes that I originally submitted for scipy.signal.cspline? - - `#6561 `__: BUG: modify signal.find_peaks_cwt() to return array and accept... - - `#6562 `__: DOC: Negative binomial distribution clarification - - `#6563 `__: MAINT: be more liberal in requiring numpy - - `#6567 `__: MAINT: use xrange for iteration in differential_evolution fixes... - - `#6572 `__: BUG: "sp.linalg.solve_discrete_are" fails for random data - - `#6578 `__: BUG: misc: allow both cmin/cmax and low/high params in bytescale - - `#6581 `__: Fix some unfortunate typos - - `#6582 `__: MAINT: linalg: make handling of infinite eigenvalues in `ordqz`... - - `#6585 `__: DOC: interpolate: correct seealso links to ndimage - - `#6588 `__: Update docstring of scipy.spatial.distance_matrix - - `#6592 `__: DOC: Replace 'first' by 'smallest' in mode - - `#6593 `__: MAINT: remove scipy.weave submodule - - `#6594 `__: DOC: distance.squareform: fix html docs, add note about dtype... 
- - `#6598 `__: [DOC] Fix incorrect error message in medfilt2d - - `#6599 `__: MAINT: linalg: turn a `solve_discrete_are` test back on - - `#6600 `__: DOC: Add SOS goals to roadmap - - `#6601 `__: DEP: Raise minimum numpy version to 1.8.2 - - `#6605 `__: MAINT: 'new' module is deprecated, don't use it - - `#6607 `__: DOC: add note on change in wheel dependency on numpy and pip... - - `#6609 `__: Fixes #6602 - Typo in docs - - `#6616 `__: ENH: generalization of continuous and discrete Riccati solvers... - - `#6621 `__: DOC: improve cluster.hierarchy docstrings. - - `#6623 `__: CS matrix prune method should copy data from large unpruned arrays - - `#6625 `__: DOC: special: complete documentation of `eval_*` functions - - `#6626 `__: TST: special: silence some deprecation warnings - - `#6631 `__: fix parameter name doc for discrete distributions - - `#6632 `__: MAINT: stats: change some instances of `special` to `sc` - - `#6633 `__: MAINT: refguide: py2k long integers are equal to py3k integers - - `#6638 `__: MAINT: change type declaration in cluster.linkage, prevent overflow - - `#6640 `__: BUG: fix issue with duplicate values used in cluster.vq.kmeans - - `#6641 `__: BUG: fix corner case in cluster.vq.kmeans for large thresholds - - `#6643 `__: MAINT: clean up truncation modes of dendrogram - - `#6645 `__: MAINT: special: rename `*_roots` functions - - `#6646 `__: MAINT: clean up mpmath imports - - `#6647 `__: DOC: add sqrt to Mahalanobis description for pdist - - `#6648 `__: DOC: special: add a section on `cython_special` to the tutorial - - `#6649 `__: ENH: Added scipy.spatial.distance.directed_hausdorff - - `#6650 `__: DOC: add Sphinx roles for DOI and arXiv links - - `#6651 `__: BUG: mstats: make sure mode(..., None) does not modify its input - - `#6652 `__: DOC: special: add section to tutorial on functions not in special - - `#6653 `__: ENH: special: add the Wright Omega function - - `#6656 `__: ENH: don't coerce input to double with custom metric in cdist... - - `#6658 `__: Faster/shorter code for computation of discordances - - `#6659 `__: DOC: special: make __init__ summaries and html summaries match - - `#6661 `__: general.rst: Fix a typo - - `#6664 `__: TST: Spectral functions' window correction factor - - `#6665 `__: [DOC] Conditions on v in RectSphereBivariateSpline - - `#6668 `__: DOC: Mention negative masses for center of mass - - `#6675 `__: MAINT: special: remove outdated README - - `#6677 `__: BUG: Fixes computation of p-values. - - `#6679 `__: BUG: optimize: return correct Jacobian for method 'SLSQP' in... - - `#6680 `__: ENH: Add structural rank to sparse.csgraph - - `#6686 `__: TST: Added Airspeed Velocity benchmarks for SphericalVoronoi - - `#6687 `__: DOC: add section "deciding on new features" to developer guide. - - `#6691 `__: ENH: Clearer error when fmin_slsqp obj doesn't return scalar - - `#6702 `__: TST: Added airspeed velocity benchmarks for scipy.spatial.distance.cdist - - `#6707 `__: TST: interpolate: test fitpack wrappers, not _impl - - `#6709 `__: TST: fix a number of test failures on 32-bit systems - - `#6711 `__: MAINT: move function definitions from __fitpack.h to _fitpackmodule.c - - `#6712 `__: MAINT: clean up wishlist in stats.morestats, and copyright statement. - - `#6715 `__: DOC: update the release notes with BSpline et al. - - `#6716 `__: MAINT: scipy.io.wavfile: No infinite loop when trying to read... - - `#6717 `__: some style cleanup - - `#6723 `__: BUG: special: cast to float before in-place multiplication in... 
- #6726: address performance regressions in interp1d
- #6728: DOC: made code examples in `integrate` tutorial copy-pasteable
- #6731: DOC: scipy.optimize: Added an example for wrapping complex-valued...
- #6732: MAINT: cython_special: remove `errprint`
- #6733: MAINT: special: fix some pyflakes warnings
- #6734: DOC: sparse.linalg: fixed matrix description in `bicgstab` doc
- #6737: BLD: update `cythonize.py` to detect changes in pxi files
- #6740: DOC: special: some small fixes to docstrings
- #6741: MAINT: remove dead code in interpolate.py
- #6742: BUG: fix ``linalg.block_diag`` to support zero-sized matrices.
- #6744: ENH: interpolate: make PPoly.from_spline accept BSpline objects
- #6746: DOC: special: clarify use of Condon-Shortley phase in `sph_harm`/`lpmv`
- #6750: ENH: sparse: avoid densification on broadcasted elem-wise mult
- #6751: sinm doc explained cosm
- #6753: ENH: special: allow for more fine-tuned error handling
- #6759: Move logsumexp and pade from scipy.misc to scipy.special and...
- #6761: ENH: argmax and argmin methods for sparse matrices
- #6762: DOC: Improve docstrings of sparse matrices
- #6763: ENH: Weighted tau
- #6768: ENH: cythonized spherical Voronoi region polygon vertex sorting
- #6770: Correction of Delaunay class' documentation
- #6775: ENH: Integrating LAPACK "expert" routines with conditioning warnings...
- #6776: MAINT: Removing the trivial f2py warnings
- #6777: DOC: Update rv_continuous.fit doc.
- #6778: MAINT: cluster.hierarchy: Improved wording of error msgs
- #6786: BLD: increase minimum Cython version to 0.23.4
- #6787: DOC: expand on ``linalg.block_diag`` changes in 0.19.0 release...
- #6789: ENH: Add further documentation for norm.fit
- #6790: MAINT: Fix a potential problem in nn_chain linkage algorithm
- #6791: DOC: Add examples to scipy.ndimage.fourier
- #6792: DOC: fix some numpydoc / Sphinx issues.
- #6793: MAINT: fix circular import after moving functions out of misc
- #6796: TST: test importing each submodule. Regression test for gh-6793.
- #6799: ENH: stats: Argus distribution
- #6801: ENH: stats: Histogram distribution
- #6803: TST: make sure tests for ``_build_utils`` are run.
- #6804: MAINT: more fixes in `loggamma`
- #6806: ENH: Faster linkage for 'centroid' and 'median' methods
- #6810: ENH: speed up upfirdn and resample_poly for n-dimensional arrays
- #6812: TST: Added ConvexHull asv benchmark code
- #6814: ENH: Different extrapolation modes for different dimensions in...
- #6826: Signal spectral window default fix
- #6828: BUG: SphericalVoronoi Space Complexity (Fixes #6811)
- #6830: RealData docstring correction
- #6834: DOC: Added reference for skewtest function. See #6829
- #6836: DOC: Added mode='mirror' in the docstring for the functions accepting...
- #6838: MAINT: sparse: start removing old BSR methods
- #6844: handle incompatible dimensions when input is not an ndarray in...
- #6847: Added maxiter to golden search.
- #6850: BUG: added check for optional param scipy.stats.spearmanr
- #6858: MAINT: Removing redundant tests
- #6861: DEP: Fix escape sequences deprecated in Python 3.6.
- #6862: DOC: dx should be float, not int
- #6863: updated documentation curve_fit
- #6866: DOC: added some documentation to j1 referring to spherical_jn
- #6867: DOC: cdist move long examples list into Notes section
- #6868: BUG: Make stats.mode return a ModeResult namedtuple on empty...
- #6871: Corrected documentation.
- #6874: ENH: gaussian_kde.logpdf based on logsumexp
- #6877: BUG: ndimage: guard against footprints of all zeros
- #6881: python 3.6
- #6885: Vectorized integrate.fixed_quad
- #6886: fixed typo
- #6891: TST: fix failures for linalg.dare/care due to tightened test...
- #6892: DOC: fix a bunch of Sphinx errors.
- #6894: TST: Added asv benchmarks for scipy.spatial.Voronoi
- #6908: BUG: Fix return dtype for complex input in spsolve
- #6909: ENH: fftpack: use float32 routines for float16 inputs.
- #6911: added min/max support to binned_statistic
- #6913: Fix 6875: SLSQP raise ValueError for all invalid bounds.
- #6914: DOCS: GH6903 updating docs of Spatial.distance.pdist
- #6916: MAINT: fix some issues for 32-bit Python
- #6924: BLD: update Bento build for scipy.LowLevelCallable
- #6932: ENH: Use OrderedDict in io.netcdf. Closes gh-5537
- #6933: BUG: fix LowLevelCallable issue on 32-bit Python.
- #6936: BUG: sparse: handle size-1 2D indexes correctly
- #6938: TST: fix test failures in special on 32-bit Python.
- #6939: Added attributes list to cKDTree docstring
- #6940: improve efficiency of dok_matrix.tocoo
- #6942: DOC: add link to liac-arff package in the io.arff docstring.
- #6943: MAINT: Docstring fixes and an additional test for linalg.solve
- #6944: DOC: Add example of odeint with a banded Jacobian to the integrate...
- #6946: ENH: hypergeom.logpmf in terms of betaln
- #6947: TST: speedup distance tests
- #6948: DEP: Deprecate the keyword "debug" from linalg.solve
- #6950: BUG: Correctly treat large integers in MMIO (fixes #6397)
- #6952: ENH: Minor user-friendliness cleanup in LowLevelCallable
- #6956: DOC: improve description of 'output' keyword for convolve
- #6957: ENH more informative error in sparse.bmat
- #6962: Shebang fixes
- #6964: DOC: note argmin/argmax addition
- #6965: BUG: Fix issues passing error tolerances in dblquad and tplquad.
- #6971: fix the docstring of signaltools.correlate
- #6973: Silence expected numpy warnings in scipy.ndimage.interpolation.zoom()
- #6975: BUG: special: fix regex in `generate_ufuncs.py`
- #6976: Update docstring for griddata
- #6978: Avoid division by zero in zoom factor calculation
- #6979: BUG: ARE solvers did not check the generalized case carefully
- #6985: ENH: sparse: add scipy.sparse.linalg.spsolve_triangular
- #6994: MAINT: spatial: updates to plotting utils
- #6995: DOC: Bad documentation of k in sparse.linalg.eigs See #6990
- #6997: TST: Changed the test with a less singular example
- #7000: DOC: clarify interp1d 'zero' argument
- #7007: BUG: Fix division by zero in linregress() for 2 data points
- #7009: BUG: Fix problem in passing drop_rule to scipy.sparse.linalg.spilu
- #7012: speed improvment in _distn_infrastructure.py
- #7014: Fix Typo: add a single quotation mark to fix a slight typo
- #7021: MAINT: stats: use machine constants from np.finfo, not machar
- #7026: MAINT: update .mailmap
- #7032: Fix layout of rv_histogram docs
- #7035: DOC: update 0.19.0 release notes
- #7036: ENH: Add more boundary options to signal.stft
- #7040: TST: stats: skip too slow tests
- #7042: MAINT: sparse: speed up setdiag tests
- #7043: MAINT: refactory and code cleaning Xdist
- #7053: Fix msvc 9 and 10 compile errors
- #7060: DOC: updated release notes with #7043 and #6656
- #7062: MAINT: Change defaut STFT boundary kwarg to "zeros"
- #7064: Fix ValueError: path is on mount 'X:', start on mount 'D:' on...
- #7067: TST: Fix PermissionError: [Errno 13] Permission denied on Windows
- #7068: TST: Fix UnboundLocalError: local variable 'data' referenced...
- #7069: Fix OverflowError: Python int too large to convert to C long...
- #7071: TST: silence RuntimeWarning for nan test of stats.spearmanr
- #7072: Fix OverflowError: Python int too large to convert to C long...
- #7084: TST: linalg: bump tolerance in test_falker

Checksums
=========

MD5
~~~

a1d4a08cb0408fda554787e502a99d89  scipy-0.19.0rc2.tar.gz
e5531359c06c6cccb544e0fcb728d832  scipy-0.19.0rc2.tar.xz
aad1a08d3eee196a6d3f850b61d8aec5  scipy-0.19.0rc2.zip

SHA256
~~~~~~

a9f978f8cc569138d16f0b115476da8355fd149cbacefed7f283682d52540d05  scipy-0.19.0rc2.tar.gz
643d47551d16d965efe4f16613ee71ac6396481df649df3ad3a63848776cc084  scipy-0.19.0rc2.tar.xz
e8f80a26a1089b35b7f410509fa703598a4cd74dd42636f22aa1b65085beaf9f  scipy-0.19.0rc2.zip

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.11 (GNU/Linux)

iQEcBAEBCAAGBQJYsEDoAAoJEIp0pQ0zQcu+UjAH/1gErT0trZxgNqtxEtIbQ0RP
V9EIzQ3kj1QJHx7KKgheswHT4Ee6LYhA5MXzMAIDRx2TIMzE3ByBktK/So6+8U96
jCkceIHMwHgVnzxA8lkoB6AILHWMaaN/jNKk0u9Y6IKwUY4S1kQkYysAlMOw3gaG
g1fOJzw1Bn4Yq0dEXw+tAAFRVPSAYKohJgc7U6AvW435VI3QmOqsCCbwh2ua1oN6
egaIBD0QBOGVYb7vlhOMiKPKKkIhEY2DVXnA7EVCiJMykq/dc+pReIL8YNZ+xI6F
WUNv1KUAmVfKXU5MfSwTcxLLH89HJ7WfE8asCmVpgavhqdk0i7D4zVzN20uGe18=
=f4kz
-----END PGP SIGNATURE-----
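A downloaded release artifact can be checked against the SHA256 table above
with coreutils; a minimal sketch, assuming the tarball sits in the current
working directory (the hash is copied from the table, and note the two
spaces between hash and filename that sha256sum's check format expects):

    echo "a9f978f8cc569138d16f0b115476da8355fd149cbacefed7f283682d52540d05  scipy-0.19.0rc2.tar.gz" | sha256sum -c -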
From evgeny.burovskiy at gmail.com Fri Feb 24 10:00:10 2017
From: evgeny.burovskiy at gmail.com (Evgeni Burovski)
Date: Fri, 24 Feb 2017 18:00:10 +0300
Subject: [Numpy-discussion] Could we simplify backporting?

> I really don't like the double work and the large amount of noise coming
> from backporting every other PR to NumPy very quickly. For SciPy the policy
> is:
> - anyone can set the "backport-candidate" label
> - the release manager backports, usually a bunch in one go
> - only important fixes get backported (involves some judging, but things
> like silencing warnings, doc fixes, etc. are not important enough)
>
> This works well, and I'd hope that we can make the NumPy approach similar.

Just to add to what Ralf is saying:

* people sometimes send PRs against maintenance branches instead of
master. In scipy we just label these as backport-candidate, and then
the RM sorts them out: which ones to forward port and which ones to
backport. This works OK on scipy scale (I had just trawled through a
half dozen or so). If numpy needs more backport activity, it might
make sense to have separate labels for backport-candidate and
needs-forward-port.

* A while ago Julian was advocating for some git magic of basing PRs
on the common merge base for master and maintenance branches, so that
a commit can be merged directly without a cherry-pick (I think). This
seems to be beyond common git-fu (beyond mine for sure!). What I did
in scipy, I just edited the commit messages after cherry-picking to
add a reference to the original PR a commit was cherry-picked from.

Cheers,

Evgeni

From jtaylor.debian at googlemail.com Fri Feb 24 10:11:38 2017
From: jtaylor.debian at googlemail.com (Julian Taylor)
Date: Fri, 24 Feb 2017 16:11:38 +0100
Subject: [Numpy-discussion] Could we simplify backporting?
Message-ID: <0379a08c-0029-5fc3-06e6-4574f2f8f586@googlemail.com>

On 24.02.2017 16:00, Evgeni Burovski wrote:
>> I really don't like the double work and the large amount of noise coming
>> from backporting every other PR to NumPy very quickly. For SciPy the policy
>> is:
>> - anyone can set the "backport-candidate" label
>> - the release manager backports, usually a bunch in one go
>> - only important fixes get backported (involves some judging, but things
>> like silencing warnings, doc fixes, etc. are not important enough)
>>
>> This works well, and I'd hope that we can make the NumPy approach similar.
>
> * A while ago Julian was advocating for some git magic of basing PRs
> on the common merge base for master and maintenance branches, so that
> a commit can be merged directly without a cherry-pick (I think). This
> seems to be beyond common git-fu (beyond mine for sure!). What I did
> in scipy, I just edited the commit messages after cherry-picking to
> add a reference to the original PR a commit was cherry-picked from.

from the bugfix branch:

git rebase --onto $(git merge-base master maintenance) HEAD^
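To spell out the two recipes in this thread, here is a minimal sketch; the
branch name `maintenance` follows Julian's command above, and the commit
hash is a placeholder, not a real numpy commit:

    # Cherry-pick route: -x appends a "(cherry picked from commit ...)"
    # line to the message, one way to keep the reference Evgeni adds by
    # hand after cherry-picking.
    git checkout maintenance            # placeholder maintenance branch
    git cherry-pick -x <commit-sha>     # placeholder commit hash

    # Julian's route: rebase the fix onto the common merge base of master
    # and the maintenance branch, so the same commit can then be merged
    # into both branches without any cherry-pick.
    git rebase --onto $(git merge-base master maintenance) HEAD^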
From charlesr.harris at gmail.com Fri Feb 24 10:22:43 2017
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Fri, 24 Feb 2017 08:22:43 -0700
Subject: [Numpy-discussion] Could we simplify backporting?

On Fri, Feb 24, 2017 at 8:00 AM, Evgeni Burovski wrote:

>> I really don't like the double work and the large amount of noise coming
>> from backporting every other PR to NumPy very quickly. For SciPy the policy
>> is:
>> - anyone can set the "backport-candidate" label
>> - the release manager backports, usually a bunch in one go
>> - only important fixes get backported (involves some judging, but things
>> like silencing warnings, doc fixes, etc. are not important enough)
>>
>> This works well, and I'd hope that we can make the NumPy approach similar.
>
> Just to add to what Ralf is saying:
>
> * people sometimes send PRs against maintenance branches instead of
> master. In scipy we just label these as backport-candidate, and then
> the RM sorts them out: which ones to forward port and which ones to
> backport. This works OK on scipy scale (I had just trawled through a
> half dozen or so). If numpy needs more backport activity, it might
> make sense to have separate labels for backport-candidate and
> needs-forward-port.
>
> * A while ago Julian was advocating for some git magic of basing PRs
> on the common merge base for master and maintenance branches, so that
> a commit can be merged directly without a cherry-pick (I think). This
> seems to be beyond common git-fu (beyond mine for sure!). What I did
> in scipy, I just edited the commit messages after cherry-picking to
> add a reference to the original PR a commit was cherry-picked from.

Cherry-picking is easier, especially when there are only a few backports
without conflicts.

Chuck

From matthew.brett at gmail.com Sat Feb 25 02:13:00 2017
From: matthew.brett at gmail.com (Matthew Brett)
Date: Fri, 24 Feb 2017 23:13:00 -0800
Subject: [Numpy-discussion] PowerPC testing servers

On Wed, Feb 15, 2017 at 6:53 PM, Sandro Tosi wrote:
>> A recent post to the wheel-builders mailing list pointed out some
>> links to places providing free PowerPC hosting for open source
>> projects, if they agree to a submitted request:
>
> The debian project has some powerpc machines (and we still build numpy
> on those boxes when I upload a new revision to our archives) and they
> also have hosts dedicated to let debian developers login and debug
> issues with their packages on that architecture. I can sponsor access
> to those machines for some of you, but it is not a place where you can
> host a CI instance.
>
> Just keep it in mind more broadly than powerpc, f.e. these are all the
> archs where numpy was built after the last upload
> https://buildd.debian.org/status/package.php?p=python-numpy&suite=unstable
> (the grayed out archs are the ones non release critical, so packages
> are built as best effort and if missing is not a big deal)

Numpy master now passes all tests on PPC64el. Still a couple of
remaining failures for PPC64 (big-endian):

* https://github.com/numpy/numpy/pull/8566
* https://github.com/numpy/numpy/issues/8325

Cheers,

Matthew
From jtaylor.debian at googlemail.com Sat Feb 25 05:17:55 2017
From: jtaylor.debian at googlemail.com (Julian Taylor)
Date: Sat, 25 Feb 2017 11:17:55 +0100
Subject: [Numpy-discussion] automatically avoiding temporary arrays
In-Reply-To: <283e3000-0b9c-886c-e322-1ff4d2e8cb26@googlemail.com>
References: <283e3000-0b9c-886c-e322-1ff4d2e8cb26@googlemail.com>

hi,
This PR has now been merged. As it has the potential to break stuff,
please test your applications, in particular ones that use cython and
C-extensions, against master.

It will only do something when working on large arrays (> 256 KiB) on
platforms providing the backtrace() function. So Linux and Mac, but
probably not Windows (though Windows could probably be added if someone
volunteers to write the equivalent winapi code).

If you compile from source and want to see for sure if it is doing
anything, you can set the NPY_ELIDE_DEBUG define in
./numpy/core/src/multiarray/temp_elide.c to 1 or 2 and it will print out
when it is eliding something.

cheers,
Julian

On 30.09.2016 15:38, Julian Taylor wrote:
> hi,
> Temporary arrays generated in expressions are expensive, as they imply
> extra memory bandwidth, which is the bottleneck in most numpy operations.
> For example:
>
> r = a + b + c
>
> creates the b + c temporary and then adds a to it.
> This can be rewritten to be more efficient using inplace operations:
>
> r = b + c
> r += a
>
> This saves some memory bandwidth and can speed up the operation by 50%
> for very large arrays, or even more if the inplace operation allows it to
> be completed entirely in the CPU cache.
>
> The problem is that inplace operations are a lot less readable, so they
> are often only used in well optimized code. But due to Python's
> refcounting semantics we can actually do some inplace conversions
> transparently.
> If an operand in python has a reference count of one it must be a
> temporary, so we can use it as the destination array. CPython itself does
> this optimization for string concatenations.
>
> In numpy we have the issue that we can be called from the C-API directly,
> where the reference count may be one for other reasons.
> To solve this we can check the backtrace until the python frame
> evaluation function. If there are only numpy and python functions in
> between that and our entry point, we should be able to elide the temporary.
>
> This PR implements this:
> https://github.com/numpy/numpy/pull/7997
>
> It currently only supports Linux with glibc (which has reliable
> backtraces via unwinding) and maybe MacOS depending on how good their
> backtrace is. On windows the backtrace APIs are different and I don't
> know them, but in theory it could also be done there.
>
> A problem is that checking the backtrace is quite expensive, so it should
> only be enabled when the involved arrays are large enough for it to be
> worthwhile. In my testing this seems to be around 180-300KiB sized
> arrays, basically where they start spilling out of the CPU L2 cache.
>
> I made a little crappy benchmark script to test this cutoff in this branch:
> https://github.com/juliantaylor/numpy/tree/elide-bench
>
> If you are interested you can run it with:
> python setup.py build_ext -j 4 --inplace
> ipython --profile=null check.ipy
>
> At the end it will plot the ratio between elided and non-elided runtime.
> It should get larger than one around 180KiB on most cpus.
>
> If no one points out some flaw in the approach, I'm hoping to get this
> into the next numpy version.
>
> cheers,
> Julian
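To make the transformation concrete, this is the rewrite the elision
machinery performs automatically, done here by hand; a small sketch, with
an arbitrary array size chosen to be well above the 256 KiB cutoff
mentioned above:

    import numpy as np

    n = 1000000                  # 8 MB per float64 array, above the cutoff
    a = np.full(n, 1.0)
    b = np.full(n, 2.0)
    c = np.full(n, 3.0)

    # Naive form: (a + b) is materialized as a temporary, then c is added.
    r1 = a + b + c

    # Hand-written in-place form: one temporary fewer, same result.
    r2 = a + b
    r2 += c

    assert np.array_equal(r1, r2)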
From charlesr.harris at gmail.com Sat Feb 25 10:21:42 2017
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Sat, 25 Feb 2017 08:21:42 -0700
Subject: [Numpy-discussion] Removal of some numpy files

Hi All,

While looking through the numpy tools directory I noticed some scripts
that look outdated and might be candidates for removal:

1. tools/numpy-macosx-installer/
2. tools/win32build/

Does anyone know if either of those is still relevant?

Cheers,

Chuck

From cournape at gmail.com Sat Feb 25 10:48:43 2017
From: cournape at gmail.com (David Cournapeau)
Date: Sat, 25 Feb 2017 15:48:43 +0000
Subject: [Numpy-discussion] Removal of some numpy files

tools/win32build is used to build the so-called superpack installers,
which we don't build anymore AFAIK.

tools/numpy-macosx-installer is used to build the .dmg for numpy (also
not used anymore AFAIK).

On Sat, Feb 25, 2017 at 3:21 PM, Charles R Harris wrote:

> While looking through the numpy tools directory I noticed some scripts
> that look outdated and might be candidates for removal:
>
> 1. tools/numpy-macosx-installer/
> 2. tools/win32build/
>
> Does anyone know if either of those is still relevant?

From matthew.brett at gmail.com Sat Feb 25 16:34:10 2017
From: matthew.brett at gmail.com (Matthew Brett)
Date: Sat, 25 Feb 2017 13:34:10 -0800
Subject: [Numpy-discussion] Removal of some numpy files

On Sat, Feb 25, 2017 at 7:48 AM, David Cournapeau wrote:
> tools/win32build is used to build the so-called superpack installers,
> which we don't build anymore AFAIK.
>
> tools/numpy-macosx-installer is used to build the .dmg for numpy (also
> not used anymore AFAIK).

No, we aren't using the .dmg script anymore; dmg installers have been
fully replaced by wheels.

Cheers,

Matthew

From charlesr.harris at gmail.com Sat Feb 25 17:22:19 2017
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Sat, 25 Feb 2017 15:22:19 -0700
Subject: [Numpy-discussion] Removal of some numpy files

On Sat, Feb 25, 2017 at 2:34 PM, Matthew Brett wrote:

> No, we aren't using the .dmg script anymore; dmg installers have been
> fully replaced by wheels.

I've put up a PR, #8695, to do this.

Chuck
From ben.v.root at gmail.com Mon Feb 27 13:43:23 2017
From: ben.v.root at gmail.com (Benjamin Root)
Date: Mon, 27 Feb 2017 13:43:23 -0500
Subject: [Numpy-discussion] automatically avoiding temporary arrays
References: <283e3000-0b9c-886c-e322-1ff4d2e8cb26@googlemail.com>

What's the timeline for the next release? I have the perfect use case for
this (a Haversine calculation on large arrays that takes up ~33% of one of
my processing scripts). However, to test it out, I have a huge dependency
mess to wade through first, and there are no resources devoted to that
project for at least a few weeks. I want to make sure I get feedback to
y'all.

Cheers!
Ben Root

On Sat, Feb 25, 2017 at 5:17 AM, Julian Taylor wrote:

> hi,
> This PR has now been merged. As it has the potential to break stuff,
> please test your applications, in particular ones that use cython and
> C-extensions, against master.
>
> It will only do something when working on large arrays (> 256 KiB) on
> platforms providing the backtrace() function. So Linux and Mac, but
> probably not Windows (though Windows could probably be added if someone
> volunteers to write the equivalent winapi code).
>
> If you compile from source and want to see for sure if it is doing
> anything, you can set the NPY_ELIDE_DEBUG define in
> ./numpy/core/src/multiarray/temp_elide.c to 1 or 2 and it will print out
> when it is eliding something.
>
> cheers,
> Julian
From charlesr.harris at gmail.com Mon Feb 27 14:27:48 2017
From: charlesr.harris at gmail.com (Charles R Harris)
Date: Mon, 27 Feb 2017 12:27:48 -0700
Subject: [Numpy-discussion] automatically avoiding temporary arrays
References: <283e3000-0b9c-886c-e322-1ff4d2e8cb26@googlemail.com>

On Mon, Feb 27, 2017 at 11:43 AM, Benjamin Root wrote:

> What's the timeline for the next release? I have the perfect use case for
> this (a Haversine calculation on large arrays that takes up ~33% of one of
> my processing scripts). However, to test it out, I have a huge dependency
> mess to wade through first, and there are no resources devoted to that
> project for at least a few weeks. I want to make sure I get feedback to
> y'all.

I'd like to branch 1.13.x at the end of March. The planned features that
still need to go in are the `__array_ufunc__` work and the `lapack_lite`
update. The first RC should not take much longer. I believe Matthew is
building wheels for testing on the fly, but I don't know where you can
find them.

Chuck

From matthew.brett at gmail.com Mon Feb 27 14:31:53 2017
From: matthew.brett at gmail.com (Matthew Brett)
Date: Mon, 27 Feb 2017 11:31:53 -0800
Subject: [Numpy-discussion] automatically avoiding temporary arrays

Hi,

On Mon, Feb 27, 2017 at 11:27 AM, Charles R Harris wrote:
> I'd like to branch 1.13.x at the end of March. The planned features that
> still need to go in are the `__array_ufunc__` work and the `lapack_lite`
> update. The first RC should not take much longer. I believe Matthew is
> building wheels for testing on the fly, but I don't know where you can
> find them.

Latest wheels at:
https://7933911d6844c6c53a7d-47bd50c35cd79bd838daf386af554a83.ssl.cf2.rackcdn.com/

PRE_URL=https://7933911d6844c6c53a7d-47bd50c35cd79bd838daf386af554a83.ssl.cf2.rackcdn.com
pip install -f $PRE_URL --pre numpy

Cheers,

Matthew
From ben.v.root at gmail.com Mon Feb 27 14:46:18 2017
From: ben.v.root at gmail.com (Benjamin Root)
Date: Mon, 27 Feb 2017 14:46:18 -0500
Subject: [Numpy-discussion] automatically avoiding temporary arrays

Ah, that wheel would be a huge help. Most of the packages I have as
dependencies for this project were compiled against v1.10, so I am hoping
that there won't be too big of a problem. Are the manylinux wheels still
compatible with CentOS5, or has the base image been bumped to CentOS6?

Cheers!
Ben Root

On Mon, Feb 27, 2017 at 2:31 PM, Matthew Brett wrote:

> Latest wheels at:
> https://7933911d6844c6c53a7d-47bd50c35cd79bd838daf386af554a83.ssl.cf2.rackcdn.com/
>
> PRE_URL=https://7933911d6844c6c53a7d-47bd50c35cd79bd838daf386af554a83.ssl.cf2.rackcdn.com
> pip install -f $PRE_URL --pre numpy

From matthew.brett at gmail.com Mon Feb 27 14:52:31 2017
From: matthew.brett at gmail.com (Matthew Brett)
Date: Mon, 27 Feb 2017 11:52:31 -0800
Subject: [Numpy-discussion] automatically avoiding temporary arrays

Hi,

On Mon, Feb 27, 2017 at 11:46 AM, Benjamin Root wrote:
> Ah, that wheel would be a huge help. Most of the packages I have as
> dependencies for this project were compiled against v1.10, so I am hoping
> that there won't be too big of a problem.

I don't think so - the problems generally arise when you try to
import a package compiled against a more recent version of numpy than
the one you have installed.

> Are the manylinux wheels still
> compatible with CentOS5, or has the base image been bumped to CentOS6?

Yup, still CentOS5. When it shifts to CentOS6, probably not very
soon, you'll likely see a version number bump to manylinux2
(manylinux1 is the current version).

Cheers,

Matthew

From matthew.brett at gmail.com Tue Feb 28 03:43:47 2017
From: matthew.brett at gmail.com (Matthew Brett)
Date: Tue, 28 Feb 2017 00:43:47 -0800
Subject: [Numpy-discussion] Daily numpy wheel builds - prefer to per-commit builds?

Hi,

I've been working to get daily travis-ci cron-job manylinux builds
working for numpy and scipy wheels. They are now working OK:

https://travis-ci.org/MacPython/numpy-wheels
https://travis-ci.org/MacPython/scipy-wheels
https://7933911d6844c6c53a7d-47bd50c35cd79bd838daf386af554a83.ssl.cf2.rackcdn.com

These are daily builds of the respective numpy and scipy master branches.

Numpy already has a system, kindly worked up by Olivier Grisel, which
uploads wheel builds from each travis-ci run. travis-ci uploads these
wheels for every commit, at the "travis-dev-wheels" container on
Rackspace, visible at
https://f66d8a5767b134cb96d3-4ffdece11fd3f72855e4665bc61c7445.ssl.cf2.rackcdn.com
. These builds are specific to the Linux they were built on - in this
case Ubuntu 12.04.

Some projects use these per-commit builds for testing compatibility
with development Numpy code.

Now I'm wondering what the relationship should be between the current
every-commit builds and the new wheel builds.
I think that we should prefer the new wheel builds and deprecate the
previous per-commit builds, because:

* the new manylinux wheels are self-contained, and so can be installed
  with pip without extra lines of `apt` installs;
* the manylinux builds work on any travis container, not just the
  current 12.04 container;
* manylinux builds should be faster, as they are linked against OpenBLAS;
* manylinux wheels are closer to the wheels we distribute for releases,
  and therefore more useful for testing against.

What do y'all think?

Cheers,

Matthew
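As a sketch of how a downstream project might pick these wheels up in its
own CI, reusing the pip incantation from earlier in this digest (the
container URL is the one quoted above; --pre lets pip select the
development-versioned builds):

    PRE_URL=https://7933911d6844c6c53a7d-47bd50c35cd79bd838daf386af554a83.ssl.cf2.rackcdn.com
    pip install --pre --upgrade -f $PRE_URL numpy
    python -c "import numpy; print(numpy.__version__)"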
From ralf.gommers at gmail.com Tue Feb 28 03:50:42 2017
From: ralf.gommers at gmail.com (Ralf Gommers)
Date: Tue, 28 Feb 2017 21:50:42 +1300
Subject: [Numpy-discussion] Daily numpy wheel builds - prefer to per-commit builds?

On Tue, Feb 28, 2017 at 9:43 PM, Matthew Brett wrote:

> Now I'm wondering what the relationship should be between the current
> every-commit builds and the new wheel builds.
>
> I think that we should prefer the new wheel builds and deprecate the
> previous per-commit builds [...]
>
> What do y'all think?

Uploading your daily Linux and OS X builds and just turning off uploads of
the per-commit builds sounds like an improvement to me.

Ralf

From cimrman3 at ntc.zcu.cz Tue Feb 28 05:51:05 2017
From: cimrman3 at ntc.zcu.cz (Robert Cimrman)
Date: Tue, 28 Feb 2017 11:51:05 +0100
Subject: [Numpy-discussion] ANN: SfePy 2017.1
Message-ID: <8f48385c-0507-1820-c8d4-8758c3b14c11@ntc.zcu.cz>

I am pleased to announce release 2017.1 of SfePy.

Description
-----------

SfePy (simple finite elements in Python) is a software for solving
systems of coupled partial differential equations by the finite element
method or by isogeometric analysis (limited support). It is distributed
under the new BSD license.

Home page: http://sfepy.org
Mailing list: http://groups.google.com/group/sfepy-devel
Git (source) repository, issue tracker: https://github.com/sfepy/sfepy

Highlights of this release
--------------------------

- spline-box parametrization of an arbitrary field
- conda-forge recipe (thanks to Daniel Wheeler)
- fixes for Python 3.6

For full release notes see http://docs.sfepy.org/doc/release_notes.html#id1
(rather long and technical).

Cheers,
Robert Cimrman

---

Contributors to this release in alphabetical order:

Siwei Chen
Robert Cimrman
Jan Heczko
Vladimir Lukes
Matyas Novak

From rmay31 at gmail.com Tue Feb 28 12:32:10 2017
From: rmay31 at gmail.com (Ryan May)
Date: Tue, 28 Feb 2017 10:32:10 -0700
Subject: [Numpy-discussion] Arrays and format()

Hi,

Can someone take a look at: https://github.com/numpy/numpy/issues/7978

The crux of the issue is that this:

# This works
a = "%0.3g" % np.array(2)
a
'2'

# This does not
a = "{0:0.3g}".format(np.array(2))
TypeError: non-empty format string passed to object.__format__

I've now hit this in my code. If someone can even point me in the general
direction of the code to dig into for this (please let it be python,
please let it be python...), I'll dig in more.

Ryan

--
Ryan May
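Until ndarray grows a __format__ implementation (a proposal is discussed
in the replies below), a couple of workarounds exist for the
zero-dimensional case; a sketch, assuming the array really is 0-d and
numeric:

    import numpy as np

    x = np.array(2)

    # Extract a native Python scalar first; .item() does this for 0-d arrays.
    print("{0:0.3g}".format(x.item()))   # -> 2

    # An explicit float() conversion also works for numeric 0-d arrays.
    print("{0:0.3g}".format(float(x)))   # -> 2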
From nathan12343 at gmail.com Tue Feb 28 12:38:02 2017
From: nathan12343 at gmail.com (Nathan Goldbaum)
Date: Tue, 28 Feb 2017 17:38:02 +0000
Subject: [Numpy-discussion] Arrays and format()

See this issue:

https://github.com/numpy/numpy/issues/5543

There was also a very thorough discussion of this recently on this
mailing list:

http://numpy-discussion.10968.n7.nabble.com/Proposal-to-support-format-td43931.html

On Tue, Feb 28, 2017 at 11:32 AM Ryan May wrote:

> Can someone take a look at: https://github.com/numpy/numpy/issues/7978
>
> The crux of the issue is that this:
>
> # This works
> a = "%0.3g" % np.array(2)
> a
> '2'
>
> # This does not
> a = "{0:0.3g}".format(np.array(2))
> TypeError: non-empty format string passed to object.__format__

From larsson at cs.uchicago.edu Tue Feb 28 14:59:27 2017
From: larsson at cs.uchicago.edu (Gustav Larsson)
Date: Tue, 28 Feb 2017 11:59:27 -0800
Subject: [Numpy-discussion] Arrays and format()

I am hoping to submit a PR for a __format__ numpy enhancement proposal
this weekend. It will be a slightly revised version of my original draft
posted here two weeks ago. Ryan, if you have any thoughts on the writeup
so far, I'd love to hear them.

On Tue, Feb 28, 2017 at 9:38 AM, Nathan Goldbaum wrote:

> See this issue:
>
> https://github.com/numpy/numpy/issues/5543
>
> There was also a very thorough discussion of this recently on this
> mailing list:
>
> http://numpy-discussion.10968.n7.nabble.com/Proposal-to-support-format-td43931.html

From rmay31 at gmail.com Tue Feb 28 15:32:05 2017
From: rmay31 at gmail.com (Ryan May)
Date: Tue, 28 Feb 2017 13:32:05 -0700
Subject: [Numpy-discussion] Arrays and format()

Gustav,

I had seen this discussion, but completely blanked when I posted my
problem. I looked over the proposal and nothing jumped out at me on a
quick read-through; it seemed straightforward and would meet my needs.
I'll try to carve out some time to think a bit more about it and let you
know if anything jumps out.

Ryan

On Tue, Feb 28, 2017 at 12:59 PM, Gustav Larsson wrote:

> I am hoping to submit a PR for a __format__ numpy enhancement proposal
> this weekend. It will be a slightly revised version of my original draft
> posted here two weeks ago. Ryan, if you have any thoughts on the writeup
> so far, I'd love to hear them.

--
Ryan May
From sebastiankaster at googlemail.com Tue Feb 28 16:47:16 2017
From: sebastiankaster at googlemail.com (Sebastian K)
Date: Tue, 28 Feb 2017 22:47:16 +0100
Subject: [Numpy-discussion] Numpy Overhead

Hello everyone,

I'm interested in the numpy project and have experimented a lot with the
numpy array. I'm wondering what is actually done that there is so much
overhead when I call a function in Numpy. What is the reason?
Thanks in advance.

Regards

Sebastian Kaster

From ben.v.root at gmail.com Tue Feb 28 17:03:18 2017
From: ben.v.root at gmail.com (Benjamin Root)
Date: Tue, 28 Feb 2017 17:03:18 -0500
Subject: [Numpy-discussion] Numpy Overhead

You are going to need to provide much more context than that. Overhead
compared to what? And where (io, cpu, etc.)? What are the size of your
arrays, and what sort of operations are you doing? Finally, how much
overhead are you seeing?

There can be all sorts of reasons for overhead, and some can easily be
mitigated, and others not so much.

Cheers!
Ben Root

On Tue, Feb 28, 2017 at 4:47 PM, Sebastian K wrote:

> I'm interested in the numpy project and have experimented a lot with the
> numpy array. I'm wondering what is actually done that there is so much
> overhead when I call a function in Numpy. What is the reason?

From sebastiankaster at googlemail.com Tue Feb 28 17:12:16 2017
From: sebastiankaster at googlemail.com (Sebastian K)
Date: Tue, 28 Feb 2017 23:12:16 +0100
Subject: [Numpy-discussion] Numpy Overhead

Thank you for your answer.

For example, a very simple algorithm is a matrix multiplication. I can see
that the heap peak is much higher for the numpy version in comparison to a
pure python 3 implementation.

The heap is measured with libmemusage from libc:

heap peak
    Maximum of all size arguments of malloc(3), all products
    of nmemb*size of calloc(3), all size arguments of
    realloc(3), length arguments of mmap(2), and new_size
    arguments of mremap(2).

Regards

Sebastian

On 28 Feb 2017 11:03 p.m., "Benjamin Root" wrote:

> You are going to need to provide much more context than that. Overhead
> compared to what? And where (io, cpu, etc.)? What are the size of your
> arrays, and what sort of operations are you doing? Finally, how much
> overhead are you seeing?
>
> There can be all sorts of reasons for overhead, and some can easily be
> mitigated, and others not so much.
>
> Cheers!
> Ben Root
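For measuring allocation peaks without libmemusage, the standard library's
tracemalloc module (Python 3.4+) is one alternative; a sketch — note that
it only sees allocations made through Python's allocator, so absolute
numbers will differ from malloc-level tools, but relative comparisons are
often what matters:

    import tracemalloc
    import numpy as np

    tracemalloc.start()
    a = np.ones((100, 100))
    b = np.ones((100, 100))
    c = a.dot(b)
    current, peak = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    print("current: %d bytes, peak: %d bytes" % (current, peak))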
From jfoxrabinovitz at gmail.com Tue Feb 28 17:17:33 2017
From: jfoxrabinovitz at gmail.com (Joseph Fox-Rabinovitz)
Date: Tue, 28 Feb 2017 17:17:33 -0500
Subject: [Numpy-discussion] Numpy Overhead

It would really help to see the code you are using in both cases, as well
as some heap usage numbers...

-Joe

On Tue, Feb 28, 2017 at 5:12 PM, Sebastian K wrote:

> For example, a very simple algorithm is a matrix multiplication. I can see
> that the heap peak is much higher for the numpy version in comparison to a
> pure python 3 implementation.
>
> The heap is measured with libmemusage from libc:
>
> heap peak
>     Maximum of all size arguments of malloc(3), all products
>     of nmemb*size of calloc(3), all size arguments of
>     realloc(3), length arguments of mmap(2), and new_size
>     arguments of mremap(2).
From matthew.brett at gmail.com Tue Feb 28 17:17:15 2017
From: matthew.brett at gmail.com (Matthew Brett)
Date: Tue, 28 Feb 2017 14:17:15 -0800
Subject: [Numpy-discussion] Numpy Overhead

Hi,

On Tue, Feb 28, 2017 at 2:12 PM, Sebastian K wrote:
> Thank you for your answer.
> For example, a very simple algorithm is a matrix multiplication. I can see
> that the heap peak is much higher for the numpy version in comparison to a
> pure python 3 implementation.

Could you post the exact code you're comparing?

I think you'll find that a naive Python 3 matrix multiplication method
is much, much slower than the same thing with Numpy, with arrays of
any reasonable size.

Cheers,

Matthew

From sebastiankaster at googlemail.com Tue Feb 28 17:57:49 2017
From: sebastiankaster at googlemail.com (Sebastian K)
Date: Tue, 28 Feb 2017 23:57:49 +0100
Subject: [Numpy-discussion] Numpy Overhead

Yes, it is true that the execution time is much faster with the numpy
function.

The code for the numpy version:

import numpy as np

def createMatrix(n):
    Matrix = np.empty(shape=(n,n), dtype='float64')
    for x in range(n):
        for y in range(n):
            Matrix[x, y] = 0.1 + ((x*y)%1000)/1000.0
    return Matrix


if __name__ == '__main__':
    n = getDimension()
    if n > 0:
        A = createMatrix(n)
        B = createMatrix(n)
        C = np.empty(shape=(n,n), dtype='float64')
        C = np.dot(A,B)

        #print(C)

In the pure python version I am just implementing the multiplication with
three for-loops.

Measured data with libmemusage:

dimension of matrix: 100x100
heap peak pure python3:   1060565
heap peak numpy function: 4917180

2017-02-28 23:17 GMT+01:00 Matthew Brett:

> Could you post the exact code you're comparing?
>
> I think you'll find that a naive Python 3 matrix multiplication method
> is much, much slower than the same thing with Numpy, with arrays of
> any reasonable size.
From jfoxrabinovitz at gmail.com Tue Feb 28 18:00:07 2017
From: jfoxrabinovitz at gmail.com (Joseph Fox-Rabinovitz)
Date: Tue, 28 Feb 2017 18:00:07 -0500
Subject: [Numpy-discussion] Numpy Overhead

For one thing, `C = np.empty(shape=(n,n), dtype='float64')` allocates 10^4
extra elements before being immediately discarded.

-Joe

On Tue, Feb 28, 2017 at 5:57 PM, Sebastian K wrote:

> The code for the numpy version:
>
> import numpy as np
>
> def createMatrix(n):
>     Matrix = np.empty(shape=(n,n), dtype='float64')
>     for x in range(n):
>         for y in range(n):
>             Matrix[x, y] = 0.1 + ((x*y)%1000)/1000.0
>     return Matrix
>
>
> if __name__ == '__main__':
>     n = getDimension()
>     if n > 0:
>         A = createMatrix(n)
>         B = createMatrix(n)
>         C = np.empty(shape=(n,n), dtype='float64')
>         C = np.dot(A,B)
>
>         #print(C)
>
> Measured data with libmemusage:
>
> dimension of matrix: 100x100
> heap peak pure python3:   1060565
> heap peak numpy function: 4917180

From sebastiankaster at googlemail.com Wed Mar 1 00:04:19 2017
From: sebastiankaster at googlemail.com (Sebastian K)
Date: Wed, 1 Mar 2017 00:04:19 +0100
Subject: [Numpy-discussion] Numpy Overhead

Yes, you are right. There is no need to add that line. I deleted it, but
the measured heap peak is still the same.

2017-03-01 0:00 GMT+01:00 Joseph Fox-Rabinovitz:

> For one thing, `C = np.empty(shape=(n,n), dtype='float64')` allocates 10^4
> extra elements before being immediately discarded.
From matthew.brett at gmail.com Tue Feb 28 18:18:07 2017
From: matthew.brett at gmail.com (Matthew Brett)
Date: Tue, 28 Feb 2017 15:18:07 -0800
Subject: [Numpy-discussion] Numpy Overhead

Hi,

On Tue, Feb 28, 2017 at 3:04 PM, Sebastian K wrote:
> Yes, you are right. There is no need to add that line. I deleted it, but
> the measured heap peak is still the same.

You're applying the naive matrix multiplication algorithm, which is
ideal for minimizing memory use during the computation, but terrible
for speed-related stuff like keeping values in the CPU cache:

https://en.wikipedia.org/wiki/Matrix_multiplication_algorithm

The Numpy version is likely calling into a highly optimized compiled
routine for matrix multiplication, which can load chunks of the
matrices at a time, to speed up computation. If you really need
minimum memory heap usage and don't care about the order(s) of
magnitude slowdown, then you might need to use the naive method,
maybe implemented in Cython / C.

Cheers,

Matthew
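For completeness, a side-by-side sketch of the two implementations being
compared in this thread, using the same fill pattern as the createMatrix
function above; the timings are indicative only, but at n = 100 the
BLAS-backed np.dot typically wins by several orders of magnitude, at the
cost of the larger working set Matthew describes:

    import time
    import numpy as np

    def matmul_naive(A, B):
        # Triple-loop product on lists of lists: minimal extra memory,
        # but no cache blocking, so it is very slow.
        n = len(A)
        C = [[0.0] * n for _ in range(n)]
        for i in range(n):
            for j in range(n):
                s = 0.0
                for k in range(n):
                    s += A[i][k] * B[k][j]
                C[i][j] = s
        return C

    n = 100
    A = [[0.1 + ((i * j) % 1000) / 1000.0 for j in range(n)] for i in range(n)]
    B = [[0.1 + ((i * j) % 1000) / 1000.0 for j in range(n)] for i in range(n)]

    t0 = time.time()
    C1 = matmul_naive(A, B)
    t1 = time.time()
    C2 = np.dot(np.array(A), np.array(B))
    t2 = time.time()

    print("naive: %.3fs  numpy: %.6fs" % (t1 - t0, t2 - t1))
    assert np.allclose(C1, C2)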
From sebastiankaster at googlemail.com Wed Mar 1 00:24:53 2017
From: sebastiankaster at googlemail.com (Sebastian K)
Date: Wed, 1 Mar 2017 00:24:53 +0100
Subject: [Numpy-discussion] Numpy Overhead

Thank you! That is the information I needed.

2017-03-01 0:18 GMT+01:00 Matthew Brett:

> You're applying the naive matrix multiplication algorithm, which is
> ideal for minimizing memory use during the computation, but terrible
> for speed-related stuff like keeping values in the CPU cache:
>
> https://en.wikipedia.org/wiki/Matrix_multiplication_algorithm
>
> The Numpy version is likely calling into a highly optimized compiled
> routine for matrix multiplication, which can load chunks of the
> matrices at a time, to speed up computation. If you really need
> minimum memory heap usage and don't care about the order(s) of
> magnitude slowdown, then you might need to use the naive method,
> maybe implemented in Cython / C.

From olivier.grisel at ensta.org Tue Feb 28 18:50:01 2017
From: olivier.grisel at ensta.org (Olivier Grisel)
Date: Wed, 1 Mar 2017 00:50:01 +0100
Subject: [Numpy-discussion] Daily numpy wheel builds - prefer to per-commit builds?

+1 as well.

--
Olivier

From njs at pobox.com Tue Feb 28 23:15:20 2017
From: njs at pobox.com (Nathaniel Smith)
Date: Tue, 28 Feb 2017 20:15:20 -0800
Subject: [Numpy-discussion] Numpy Overhead

On Feb 28, 2017 2:57 PM, "Sebastian K" wrote:

> Measured data with libmemusage:
>
> dimension of matrix: 100x100
> heap peak pure python3:   1060565
> heap peak numpy function: 4917180

4 megabytes is less than the memory needed just to load numpy :-). Try a
1000x1000 array (or even bigger), and I think you'll see more reasonable
results.

-n