Feb. 12, 2008
7:07 p.m.
On Feb 12, 2008 11:53 AM, Bruce Southey <bsouthey@gmail.com> wrote:
Hi,
I have a Opteron 248 (2.66GHz) that with gcc 4.1.0 (SUSE10.1?) that gives C qsort with C style compare: 0.650000 C qsort with Python style compare: 0.640000 NumPy newqsort: 0.360000
I did notice that -O3 was essential to get the performance gain as -O2 gave: C qsort with C style compare: 0.690000 C qsort with Python style compare: 0.700000 NumPy newqsort: 0.610000
Try -O2 -finline-functions, it should come in somewhere between -O2 and -O3 Chuck