[New-bugs-announce] [issue15220] Reduce parsing overhead in email.feedparser.BufferedSubFile

R. David Murray report at bugs.python.org
Thu Jun 28 23:19:24 CEST 2012


New submission from R. David Murray <rdmurray at bitdance.com>:

The idea for the attached patch comes from the QNX development team.  In their measurements, replacing the re.split-plus-line-reassembly code in BufferedSubFile with str.splitlines provided a 30% reduction in email parsing time.  The code is also a lot more readable, which is a plus.

The patch is simple enough, and the improvement is large enough, that I'd like to apply this to all active branches.

----------
components: email
files: feedparser_performance.patch
keywords: patch
messages: 164295
nosy: barry, r.david.murray
priority: normal
severity: normal
stage: patch review
status: open
title: Reduce parsing overhead in email.feedparser.BufferedSubFile
type: performance
versions: Python 2.7, Python 3.2, Python 3.3
Added file: http://bugs.python.org/file26204/feedparser_performance.patch

_______________________________________
Python tracker <report at bugs.python.org>
<http://bugs.python.org/issue15220>
_______________________________________


More information about the New-bugs-announce mailing list