I've just seen the Introducing Python video, found in
This is a very interesting video, at least after you stop laughing. :-))
Jokes apart, it's indeed interesting to know how your mailing list
partners/programming partners/benevolent dictators/friends/whatever
look like when they're not ASCII characters. I'd advise it to anyone
who is part of that community and isn't able to attend meetings
and similar events in person.
Btw, Tim, your <wink>s will have a special meaning to me from
now on. ;-)
My nightly run of pybench went up from the usual 7590ms per
run to around 8200ms between Monday night and today. Can anyone
explain this?
new-style classes keep track of their subclasses through weak references, so
subclasses in general remain collectible.
consider this code (inspired by a recent comp.lang.python post):
import sys, gc
class MyMetaclass(type):
    def __init__(cls, name, bases, dict):
        super(MyMetaclass, cls).__init__(name, bases, dict)
        print 'initialized', cls.__name__
    if 'meta__del__' in sys.argv:
        def __del__(cls):
            print 'deleted', cls.__name__
class MyClass(object):
    if 'meta' in sys.argv:
        __metaclass__ = MyMetaclass
class Sub(MyClass):
    pass
del Sub
gc.collect() # force involved weak-refs to be cleared
print "MyClass subclasses", MyClass.__subclasses__()
print "garbage", gc.garbage
MyClass subclasses []
C:\exp\py-subclasses-gc>\transit\Py23\python test.py meta
MyClass subclasses []
Sub is likewise collectible, and collected, both with plain type as the
metaclass and with a MyMetaclass that lacks __del__, but:
C:\exp\py-subclasses-gc>\transit\Py23\python test.py meta meta__del__
MyClass subclasses [<class '__main__.Sub'>]
garbage [<class '__main__.Sub'>]
if MyMetaclass grows a __del__ method, Sub is no longer collectible ...
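For reference, the weak-ref bookkeeping itself is easy to observe on its own; a minimal sketch (Python 3 syntax, no metaclass __del__ involved):

```python
import gc

class Base:
    pass

class Sub(Base):
    pass

# Base only holds weak references to its subclasses...
subs_before = [c.__name__ for c in Base.__subclasses__()]

del Sub
gc.collect()  # ...so once the last strong reference is gone, Sub is collected
subs_after = [c.__name__ for c in Base.__subclasses__()]
```

With the weak refs cleared, Base.__subclasses__() comes back empty, which is what the first two runs above show.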
I've done a static analysis of the bytecodes from compiling the Python
Python 2.2 (#28, Dec 21 2001, 12:21:22) [MSC 32 bit (Intel)] on win32
Some stats about JUMP_IF_FALSE opcodes
Of the 2768 JUMP_IF_FALSE opcodes encountered, 2429 have a POP_TOP on
I'd like to propose that JUMP_IF_FALSE consume the top-of-stack.
Some stats about constants
50% of constant accesses are to a fixed set of 5 constants
rank  freq  cum%  const
   1  1277  18.7  None
   2   929  32.3  1
   3   741  43.1  0
   4   254  46.8  ''
   5   228  50.1  2
I'd like to propose the following opcodes be added
Some stats about the number of constants and locals used in functions
97% of functions use 16 or fewer constants
83% of functions use 8 or fewer constants
98% of functions use 16 or fewer locals
85% of functions use 8 or fewer locals
I'd like to propose the following opcodes be added (I suggest n=15)
Some stats about instruction traces
Please see the following links for detailed stats
The second file contains stats on instruction traces incorporating the
The score column, in both files, is computed by multiplying the
frequency by the length of the trace
I'd like to propose the following opcodes, which should reduce the number of
bytecode instructions used by 20%:
RETURN_FAST == LOAD_FAST, RETURN_VALUE
RETURN_CONST == LOAD_CONST, RETURN_VALUE
LOAD_FAST+1 == LOAD_FAST, LOAD_FAST
STORE_FAST+1 == STORE_FAST, STORE_FAST
POP_TOP+1 == POP_TOP, POP_TOP
POP_TOP+2 == POP_TOP, POP_TOP, POP_TOP
BRANCH_IF == COMPARE_OP, JUMP_IF_FALSE, POP_TOP
LOAD_FAST+1 and STORE_FAST+1 could be implemented as a 1 byte
instruction code followed by two nibbles encoding the local index
numbers. See above for a discussion of local variable index numbers.
BRANCH_IF could be implemented as a set of opcodes, one for each of the
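A rough sketch of how such pair-fusion could be done as a peephole pass over (opname, operand) pairs; the tuple encoding and fusion table here are illustrative, not CPython's actual bytecode format:

```python
# Fusion table: adjacent instruction pairs and the superinstruction that
# replaces them (names follow the proposal above).
FUSIONS = {
    ("LOAD_FAST", "RETURN_VALUE"): "RETURN_FAST",
    ("LOAD_CONST", "RETURN_VALUE"): "RETURN_CONST",
    ("LOAD_FAST", "LOAD_FAST"): "LOAD_FAST+1",
    ("POP_TOP", "POP_TOP"): "POP_TOP+1",
}

def fuse(code):
    """One left-to-right pass; a fused entry keeps both operands."""
    out, i = [], 0
    while i < len(code):
        if i + 1 < len(code) and (code[i][0], code[i + 1][0]) in FUSIONS:
            name = FUSIONS[(code[i][0], code[i + 1][0])]
            out.append((name, code[i][1], code[i + 1][1]))
            i += 2
        else:
            out.append(code[i])
            i += 1
    return out
```

In the real encoding, LOAD_FAST+1's two operands would be packed into the two nibbles described above rather than carried as separate tuple slots.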
I've decided to answer Guido's call for someone to take over
maintenance of the SRE code since it has started to fall into
disrepair. First a short introduction and then on with a question
that begs for some discussion on this list.
My name is Gary Herron. I've been using Python whenever possible for
about 8 years (and for most of the last year and a half I've been able
to choose Python almost exclusively -- lucky me). I've mostly lurked
around the python and python-dev lists, only occasionally offering
help or comments. Volunteering to maintain the SRE code seems like a
good opportunity to jump in and do something useful.
Now on with the questions at hand:
The first glance at the regular expression bug list and the _sre.c
code results in the observation that several of the bugs are related
to running over the recursion limit. The problem comes from using a
pattern containing ".*?" in a situation where it is expected to match
many thousands of characters. Each character matched by ".*?" causes
one level of recursion, quickly overflowing the recursion limit.
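Concretely, the failing shape looks like this sketch (sizes illustrative; on the recursive engine, a cap near 10000 levels made subjects like this fail):

```python
import re

# A non-greedy ".*?" expected to span tens of thousands of characters.
# On the 2003-era matcher, each character ".*?" consumed cost one level
# of C recursion; an iterative matcher handles this in constant stack.
text = "<" + "x" * 50000 + ">"
m = re.match(r"<.*?>", text)
```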
Question 1: Should we even consider these as bugs?
After all, the recursion limit is in place to prevent badly used re's
from crashing Python with a stack overflow. We could claim that the kinds
of patterns which cause heavy recursion are misuses of regular
expressions which are bound to fail when used on long strings. If
we take this route, something should be added to the documentation
which explains when excessive recursion is likely to bite.
Question 2: If we want to solve the problem (instead of just dodging
it) how should we proceed?
* Increasing the limit beyond the current 10000 is not really an
option for two reasons:
1. This doesn't solve the problem. One can always match on a
string purposely chosen to be long enough to overflow any limit.
2. A recent patch (browse "cvs log _sre.c" to find a reference)
actually lowered the limit from 10000 to 7500 for certain
64-bit machines which apparently suffered a stack overflow
before hitting 10000 recursion levels.
* An attempt to replace the hard-coded upper limit with a programmed
check of the stack space (see Misc/HISTORY for a reference to
PyOS_CheckStack) was added and then withdrawn for version 2.0.
Does anybody know the history of this? This would not really solve
the problem (especially on the 64 bit machines which could not even
hit 10000 levels of recursion), but it would push the recursion
limit to its highest possible value rather than some arbitrary one.
* Removing the recursion by the standard method of storing state in a
program managed stack and looping rather than recursing would push
the storage problem from the stack into the (probably much larger)
heap. I haven't looked at the code enough to judge if this is
feasible, but if it is, some limit would still remain. It would,
however, depend on available memory rather than stack space. And
still, the documentation should warn that certain naive patterns on
LONG strings could fail after wasting much time chewing through all
available memory.
* I notice that, unlike pattern ".*?", matching to pattern ".*" does
not recurse for each character matched. With only a few minutes of
looking at the code, I can't begin to guess if it is feasible to
make the former work like the latter without recursing.
Any comments? Remember that all the points under question 2 are worth
considering only if we decide we really ought to support things like
patterns using ".*?" to match many thousands of characters.
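To make the third bullet above concrete, here is the standard recursion-to-explicit-stack rewrite on a toy matcher supporting only literals, ".", and ".*?" (purely illustrative; SRE's real matcher state is much richer than a pair of indices):

```python
def match(pattern, text):
    """Prefix-match `pattern` against `text` using an explicit stack of
    (pattern_index, text_index) states instead of C-level recursion."""
    stack = [(0, 0)]
    while stack:
        pi, ti = stack.pop()
        if pi == len(pattern):
            return True  # whole pattern consumed: prefix match succeeds
        if pattern[pi:pi + 3] == ".*?":
            if ti < len(text):
                stack.append((pi, ti + 1))  # consume one char (tried later)
            stack.append((pi + 3, ti))      # non-greedy: try the rest first
        elif ti < len(text) and pattern[pi] in (".", text[ti]):
            stack.append((pi + 1, ti + 1))
        # otherwise this state is a dead end; pop the next alternative
    return False
```

The backtracking states that would have lived on the C stack now live on the heap, so the practical limit becomes available memory, as noted above.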
In advance of asking Guido to review and pronounce on PEP 305 and its
related code, I'd like to ask you to take a few minutes to review what we've
produced. There is the PEP, of course:
but there is also source code, a large number of test cases and a libref
section available in the CVS sandbox. Cliff Wells is working on a csvutils
module which will contain adaptations of the "sniffing" routines from his
Just do a "cvs up -dP ." in your nondist/sandbox directory to get the latest
version of everything. Feel free to review and/or comment on any or all of
it, but please, please post your comments to the csv@mail.mojam.com mailing
list. You can review our rather active correspondence at
or if you're really excited about CSV files, you can subscribe at
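For anyone who hasn't looked yet, the core API being proposed is small; a minimal sketch (shown in current Python syntax with an in-memory file):

```python
import csv
import io

buf = io.StringIO()
writer = csv.writer(buf)
writer.writerow(["name", "count"])
writer.writerow(["spam, eggs", 3])  # embedded comma is quoted automatically

buf.seek(0)
rows = list(csv.reader(buf))  # values come back as strings
```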
Jeremy Hylton wrote:
> If you are benchmarking various opcode effects, I'd recommend trying to
> revive the simple cycle counter instrumentation I did for Python 2.2. The
> idea is to use the Pentium cycle counter to measure the number of cycles
> spent on each trip through the mainloop.
For Linux >= 2.4 and an x86 CPU, oprofile will tell you (stochastically) how
many CPU cycles are spent on each x86 instruction.
> > > Speaking entirely from a point of ignorance, why are the source line #s
> > > not shown for frames that are implemented in modules loaded from
> > > zipimport?
> > Because the code printing the tracebacks doesn't know how to look
> > inside a zip file.
> Maybe, if the source file can't be found, it could
> decompile the bytecode?
Too clever by far. The peculiar way in which the comments disappear,
the fact that the code would be wrong if I used a (so-far-non-existent)
peephole optimizer to optimize my .pyc files... I'd rather show NO
line (so long as we still give file and line number) than try to
guess in an overly clever manner.
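(For what it's worth, the Python-level traceback module fetches source text through linecache, which quietly returns an empty string when the file can't be opened; a sketch:)

```python
import linecache

# traceback printing asks linecache for each source line; when the named
# file can't be read as a plain file -- e.g. it lives inside a zip archive,
# as far as the 2003-era machinery was concerned -- linecache returns ""
# and the source line is simply omitted from the traceback.
line = linecache.getline("/no/such/dir/module.py", 1)
```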
-- Michael Chermside