[Numpy-discussion] performance solving system of equations in numpy and MATLAB
sturla.molden at gmail.com
Thu Dec 17 06:59:20 EST 2015
On 16/12/15 20:47, Derek Homeier wrote:
> Getting around 30 s wall time here on a not so recent 4-core iMac, so that would seem to fit
> (iirc Accelerate should actually largely be using the same machine code as MKL).
Yes, the same kernels, but not the same threadpool. Accelerate uses the
GCD, MKL uses Intel TBB and Intel OpenMP (both of them). GCD scales
better than TBB, even in Intel's own benchmarks. However, GCD uses a
kernel threadpool (accesible via kqueue) which is not fork-safe, whereas
MKL's threadpool is fork-safe (but will leak memory on fork).
More information about the NumPy-Discussion