Mailman 3 January 2014 - Python-ideas

Re: [Python-ideas] Could the ast module's ASTs preserve source_length in addition to lineno and col_offset?
by Haoyi Li Jan. 31, 2014

Jan. 31, 2014

Nothing happened, I suppose. People in general thought it was a good idea but after looking at the python source code, I chickened out of implementing it in favor a dumb parse it till it works<https://github.com/lihaoyi/macropy/blob/master/macropy/core/exact_src.py> technique which sufficed for my purposes. On Fri, Jan 31, 2014 at 2:55 AM, Alexander Ivanov <alehander42(a)gmail.com>wrote: > What happened :? (I am also interested in getting source_length/col_last > kind of … [View More]info. Is there an alternative Python ast wrapper/library which > provides it?) > > On Friday, May 31, 2013 4:47:15 PM UTC+3, Nick Coghlan wrote: >> >> >> On 31 May 2013 20:00, "Haoyi Li" <haoy...(a)gmail.com> wrote: >> > >> > Ok, I'll give it a shot; I'm not familiar with the python codebase or >> build process, but i'll puzzle it out. Where's the place to go for help >> related to this sort of thing? python-dev? >> >> Check the developer guide at docs. python.org/devguide, and if you have >> any follow-up questions, sign up to the core-me...(a)python.org list. >> >> Cheers, >> Nick. >> >> > >> > >> > On Fri, May 31, 2013 at 1:04 AM, Nick Coghlan <ncog...(a)gmail.com> >> wrote: >> >> >> >> >> >> >> On 31 May 2013 14:28, "Haoyi Li" <haoy...(a)gmail.com> wrote: >> >> > >> >> > Anyone else have any thoughts about this? This seems like it would >> be a pretty straightforward thing to do, and I would be happy to go through >> the code and submit a patch. The only question is whether we want to do it >> in the first place; are there any reasons it can't/shouldn't be done that >> I'm not aware of? >> >> >> >> Seems reasonable to me, but would need to see a patch to give a >> definite yes or no. >> >> >> >> Cheers, >> >> Nick. >> >> >> >> > >> >> > >> >> > On Wed, May 29, 2013 at 8:09 PM, Steven D'Aprano < >> st...(a)pearwood.info> wrote: >> >> >> >> >> >> On 30/05/13 10:04, Haoyi Li wrote: >> >> >>> >> >> >>> I don't need to keep the source code, I just need a single integer >> for each >> >> >>> node. I would then be able to reconstruct the source snippet. >> >> >> >> >> >> >> >> >> And so you did say. Sorry for the noise. >> >> >> >> >> >> >> >> >> -- >> >> >> Steven >> >> >> _______________________________________________ >> >> >> Python-ideas mailing list >> >> >> Python...(a)python.org >> >> >> >> http://mail.python.org/mailman/listinfo/python-ideas >> >> > >> >> > >> >> > >> >> > _______________________________________________ >> >> > Python-ideas mailing list >> >> > Python...(a)python.org >> >> > http://mail.python.org/mailman/listinfo/python-ideas >> >> > >> > >> > >> >> [View Less]

1 0

__before__ and __after__ attributes for functions
by Suresh V. Jan. 31, 2014

Jan. 31, 2014

Can we add these two attributes for every function/method where each is a list of callables with the same arguments as the function/method itself? Pardon me if this has been discussed before. Pointers to past discussions (if any) appreciated. Suresh

11 27

statistics module in Python3.4
by Gregory P. Smith Jan. 30, 2014

Jan. 30, 2014

(resending my the original had the wrong list address in the cc for some reason) ---------- Forwarded message ---------- From: Gregory P. Smith <greg(a)krypto.org> Date: Thu, Jan 30, 2014 at 9:59 AM Subject: Re: [Python-ideas] statistics module in Python3.4 To: Wolfgang <wolfgang.maier(a)biologie.uni-freiburg.de> Cc: python-ideas(a)googlegroups.com, Steven D'Aprano <steve(a)pearwood.info>, Larry Hastings <larry(a)hastings.org> +cc Steve, the PEP 450 author On Mon, … [View More]Jan 27, 2014 at 9:41 AM, Wolfgang < wolfgang.maier(a)biologie.uni-freiburg.de> wrote: > Dear all, > I am still testing the new statistics module and I found two cases were > the behavior of the module seems suboptimal to me. > My most important concern is the module's internal _sum function and its > implications, the other one about passing Counter objects to module > functions. > > As for the first subject: > Specifically, I am not happy with the way the function handles different > types. Currently _coerce_types gets called for every element in the > function's input sequence and type conversion follows quite complicated > rules, and - what is worst - make the outcome of _sum() and thereby mean() > dependent on the order of items in the input sequence, e.g.: > > >>> mean((1,Fraction(2,3),1.0,Decimal(2.3),2.0, Decimal(5))) > 1.9944444444444445 > > >>> mean((1,Fraction(2,3),Decimal(2.3),1.0,2.0, Decimal(5))) > Traceback (most recent call last): > File "<pyshell#7>", line 1, in <module> > mean((1,Fraction(2,3),Decimal(2.3),1.0,2.0, Decimal(5))) > File "C:\Python33\statistics.py", line 369, in mean > return _sum(data)/n > File "C:\Python33\statistics.py", line 157, in _sum > T = _coerce_types(T, type(x)) > File "C:\Python33\statistics.py", line 327, in _coerce_types > raise TypeError('cannot coerce types %r and %r' % (T1, T2)) > TypeError: cannot coerce types <class 'fractions.Fraction'> and <class > 'decimal.Decimal'> > > (this is because when _sum iterates over the input type Fraction wins over > int, then float wins over Fraction and over everything else that follows in > the first example, but in the second case Fraction wins over int, but then > Fraction vs Decimal is undefined and throws an error). > > Confusing, isn't it? So here's the code of the _sum function: > > def _sum(data, start=0): > """_sum(data [, start]) -> value > > Return a high-precision sum of the given numeric data. If optional > argument ``start`` is given, it is added to the total. If ``data`` is > empty, ``start`` (defaulting to 0) is returned. > > > Examples > -------- > > >>> _sum([3, 2.25, 4.5, -0.5, 1.0], 0.75) > 11.0 > > Some sources of round-off error will be avoided: > > >>> _sum([1e50, 1, -1e50] * 1000) # Built-in sum returns zero. > 1000.0 > > Fractions and Decimals are also supported: > > >>> from fractions import Fraction as F > >>> _sum([F(2, 3), F(7, 5), F(1, 4), F(5, 6)]) > Fraction(63, 20) > > >>> from decimal import Decimal as D > >>> data = [D("0.1375"), D("0.2108"), D("0.3061"), D("0.0419")] > >>> _sum(data) > Decimal('0.6963') > > """ > > n, d = _exact_ratio(start) > T = type(start) > partials = {d: n} # map {denominator: sum of numerators} > # Micro-optimizations. > coerce_types = _coerce_types > exact_ratio = _exact_ratio > partials_get = partials.get > # Add numerators for each denominator, and track the "current" type. > for x in data: > T = _coerce_types(T, type(x)) > n, d = exact_ratio(x) > partials[d] = partials_get(d, 0) + n > if None in partials: > assert issubclass(T, (float, Decimal)) > assert not math.isfinite(partials[None]) > return T(partials[None]) > total = Fraction() > for d, n in sorted(partials.items()): > total += Fraction(n, d) > if issubclass(T, int): > assert total.denominator == 1 > return T(total.numerator) > if issubclass(T, Decimal): > return T(total.numerator)/total.denominator > return T(total) > > Internally, the function uses exact ratios for its calculations (which I > think is very nice) and only goes through all the pain of coercing types to > return > T(total.numerator)/total.denominator > where T is the final type resulting from the chain of conversions. > > I think a much cleaner (and probably faster) implementation would be to > gather first all the types in the input sequence, then decide what to > return in an input order independent way. > +1 Agreed that this would be cleaner given your example above. > My tentative implementation: > > def _sum2(data, start=None): > if start is not None: > t = set((type(start),)) > n, d = _exact_ratio(start) > else: > t = set() > n = 0 > d = 1 > partials = {d: n} # map {denominator: sum of numerators} > > # Micro-optimizations. > exact_ratio = _exact_ratio > partials_get = partials.get > > # Add numerators for each denominator, and build up a set of all types. > for x in data: > t.add(type(x)) > n, d = exact_ratio(x) > partials[d] = partials_get(d, 0) + n > T = _coerce_types(t) # decide which type to use based on set of all > types > if None in partials: > assert issubclass(T, (float, Decimal)) > assert not math.isfinite(partials[None]) > return T(partials[None]) > total = Fraction() > for d, n in sorted(partials.items()): > total += Fraction(n, d) > if issubclass(T, int): > assert total.denominator == 1 > return T(total.numerator) > if issubclass(T, Decimal): > return T(total.numerator)/total.denominator > return T(total) > > this leaves the re-implementation of _coerce_types. Personally, I'd prefer > something as simple as possible, maybe even: > > def _coerce_types (types): > if len(types) == 1: > return next(iter(types)) > return float > > , but that's just a suggestion. > > In this case then: > > >>> _sum2((1,Fraction(2,3),1.0,Decimal(2.3),2.0, Decimal(5)))/6 > 1.9944444444444445 > > >>> _sum2((1,Fraction(2,3),Decimal(2.3),1.0,2.0, Decimal(5)))/6 > 1.9944444444444445 > > lets check the examples from the _sum docstring just to be sure: > > >>> _sum2([3, 2.25, 4.5, -0.5, 1.0], 0.75) > 11.0 > > >>> _sum2([1e50, 1, -1e50] * 1000) # Built-in sum returns zero. > 1000.0 > > >>> from fractions import Fraction as F > >>> _sum2([F(2, 3), F(7, 5), F(1, 4), F(5, 6)]) > Fraction(63, 20) > > >>> from decimal import Decimal as D > >>> data = [D("0.1375"), D("0.2108"), D("0.3061"), D("0.0419")] > >>> _sum2(data) > Decimal('0.6963') > > > Now the second issue: > It is maybe more a matter of taste and concerns the effects of passing a > Counter() object to various functions in the module. > I know this is undocumented and it's probably the user's fault if he tries > that, but still: > > >>> from collections import Counter > >>> c=Counter((1,1,1,1,2,2,2,2,2,3,3,3,3)) > >>> c > Counter({1: 4, 2: 5, 3: 4}) > >>> mode(c) > 2 > Cool, mode knows how to work with Counters (interpreting them as frequency > tables) > > >>> median(c) > 2 > Looks good > > >>> mean(c) > 2.0 > Very well > > But the truth is that only mode really works as you may think and we were > just lucky with the other two: > >>> c=Counter((1,1,2)) > >>> mean(c) > 1.5 > oops > > >>> median(c) > 1.5 > hmm > > From a quick look at the code you can see that mode actually converts your > input to a Counter behind the scenes anyway, so it has no problem. > mean and median, on the other hand, are simply iterating over their input, > so if that input happens to be a mapping, they'll use just the keys. > > I think there are two simple ways to avoid this pitfall: > 1) add an explicit warning to the docs explaining this behavior or > 2) make mean and median do the same magic with Counters as mode does, i.e. > make them check for Counter as the input type and deal with it as if it > were a frequency table. I'd favor this behavior because it looks like > little extra code, but may be very useful in many situations. I'm not quite > sure whether maybe even all mappings should be treated that way? > I think this definitely needs documenting. Even if a behavior isn't settled on in time for 3.4 would it make sense to add some asserts to prevent passing a Counter to mean and median for the time being so that this could be improved in a later bugfix rather than becoming an odd behavior we need to maintain compatibility with in the future? It's very late in the release cycle so the best option for these kinds of changes may be to just document them as known issues and behaviors that we will or may fix in future releases. I think Steve and Larry should make the call on that. thanks for putting the new module through its paces! -gps [View Less]

1 0

Normalized Python
by anatoly techtonik Jan. 30, 2014

Jan. 30, 2014

Python is a cross-platform language, but I often find myself writing sections specific for Windows and for Linux and sometimes even OS setting specific code. In these moments I that Python is not more cross-platform that C, for example. What could be done? Normalized Python - a set of default, standard behaviors that backup common user expectations about cross-platform and system-independent behavior regardless of backward compatibility and code compatibility concerns. This is needed, for … [View More]

5 5

Need help designing subprocess API for Tulip
by Guido van Rossum Jan. 29, 2014

Jan. 29, 2014

If you're interested, please see us on the python-tulip mailing list at Google Groups. -- --Guido van Rossum (python.org/~guido)

1 0

str.rreplace
by Ram Rachum Jan. 25, 2014

Jan. 25, 2014

I propose implementing str.rreplace. (It'll be to str.replace what str.rsplit is to str.split.) What do you think?

17 31

data banks access using python with a Samsung Galaxy GNU.org FSF.org
by Jason Bursey Jan. 24, 2014

Jan. 24, 2014

For beginners; she knows saber from AMR

1 0

Re: [Python-ideas] Multi-statement anonymous functions
by musicdenotation＠gmail.com Jan. 23, 2014

Jan. 23, 2014

1. Mutable namespaces and variables are for computation processes like while or for loops. They are not for temporary variables (that is why classes and functions have their own scopes). 2. I want not to worry about name clashes.

3 2

Make max() stable
by אלעזר Jan. 21, 2014

Jan. 21, 2014

Hi all, Given several objects with the same key, max() returns the first one: >>> key = lambda x: 0 >>> max(1, 2, key=key) 1 This means it is not stable, at least according to the definition in "Elements of Programming" by Alexander Stepanov and Paul McJones (pg. 52): "Informally, an algorithm is stable if it respects the original order of equivalent objects. So if we think of minimum and maximum as selecting, respectively, the smallest and the second smallest … [View More]

13 25

Re: [Python-ideas] Add `n_threads` argument to `concurrent.futures.ProcessPoolExecutor`
by Andrew Barnert Jan. 21, 2014

Jan. 21, 2014

I slapped together a fork of concurrent/futures/process.py. It's named "procthreadex.py", and it just uses a ThreadPoolExecutor in the _process_worker function. You can get it at http://pastebin.com/Ba2KPYy3, and a test program skeleton at http://pastebin.com/ifwX6NaB. Maybe you can find a use case where ProcessThreadPoolExecutor(4, 4) outperforms ProcessPoolExecutor(16). (I haven't been able to.) >________________________________ > From: Ram Rachum <ram.rachum(a)gmail.… [View More]

2 1