[Python-Dev] Accessing mailing list archives

Victor Stinner vstinner at redhat.com
Tue Jul 31 20:30:49 EDT 2018

Feel free to modify the script to make your own statistics ;-)


2018-08-01 0:57 GMT+02:00 Michael Selik <mike at selik.org>:
> Would it be possible to normalize by the number of mailing list members and
> also by "active" members? The latter would be tricky to define.
> On Mon, Jul 30, 2018 at 3:29 PM Victor Stinner <vstinner at redhat.com> wrote:
>> Hi Bob,
>> I wrote a basic script to compute the number of emails per PEP. It
>> requires to download gzipped mbox files from the web page of archives per
>> month, then ungzip them:
>> https://github.com/vstinner/misc/blob/master/python/parse_mailman_mbox_peps.py
>> Results:
>> https://mail.python.org/pipermail/python-committers/2018-April/005310.html
>> Victor
>> Le lundi 30 juillet 2018, Bob Purvy <bpurvy at gmail.com> a écrit :
>> > hi all,
>> > I've been trying to figure out how to access the archives
>> > programmatically. I'm sure this is easy once you know, but googling various
>> > things hasn't worked.  What I want to do is graph the number of messages
>> > about PEP 572 by time.  (or has someone already done that?)
>> > I installed GNU Mailman, and downloaded the gzip'ed archives for a
>> > number of months and unzipped them, and I suspect that there's some way to
>> > get them all into a single database, but it hasn't jumped out at me.  If I
>> > count the "Message-ID" lines, the "Subject:" lines, and the "\nFrom " lines
>> > in one of those text files, I get slightly different numbers for each.
>> > Alternatively, they're maybe already in a database, and I just need API
>> > access to do the querying?  Can someone help me out?
>> > Bob _______________________________________________
>> Python-Dev mailing list
>> Python-Dev at python.org
>> https://mail.python.org/mailman/listinfo/python-dev
>> Unsubscribe:
>> https://mail.python.org/mailman/options/python-dev/mike%40selik.org

More information about the Python-Dev mailing list