[pypy-dev] Unicode encode/decode speed (cont)

Eleytherios Stamatogiannakis estama at gmail.com
Mon Feb 18 17:20:30 CET 2013


We have found another (very simple) madIS query where PyPy is around 
250x slower that CPython:

CPython: 314msec
PyPy: 1min 16sec

The query if you would like to test it yourself is the following:

select  count(*)  from   (file  'some_big_text_file.txt' limit 100000);

To run it you'll need some big text file containing at least 100000 text 
lines (we have run above query with a very big XML file). You can also 
run above query with a lower limit (the behaviour will be the same) as such:

select  count(*)  from   (file  'some_big_text_file.txt' limit 10000);

Be careful for the file to not have a csv, tsv, json, db or gz ending 
because a different code path inside the "file" operator will be taken 
than the one for simple text files.

l.




More information about the pypy-dev mailing list