[pypy-dev] Unicode encode/decode speed (cont)
Eleytherios Stamatogiannakis
estama at gmail.com
Mon Feb 18 17:20:30 CET 2013
We have found another (very simple) madIS query where PyPy is around
250x slower that CPython:
CPython: 314msec
PyPy: 1min 16sec
The query if you would like to test it yourself is the following:
select count(*) from (file 'some_big_text_file.txt' limit 100000);
To run it you'll need some big text file containing at least 100000 text
lines (we have run above query with a very big XML file). You can also
run above query with a lower limit (the behaviour will be the same) as such:
select count(*) from (file 'some_big_text_file.txt' limit 10000);
Be careful for the file to not have a csv, tsv, json, db or gz ending
because a different code path inside the "file" operator will be taken
than the one for simple text files.
l.
More information about the pypy-dev
mailing list