Would it be possible to normalize by the number of mailing list members and also by "active" members? The latter would be tricky to define.
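
As a rough sketch of what I mean, assuming "active" just means "posted at least once that month" (my own working definition, nothing official), you could divide the on-topic message count by the number of distinct senders in the same month. The function name and the record layout below are made up for illustration. Normalizing by total membership would need subscriber counts from somewhere else, since the public archives only contain posted messages.

```python
from collections import defaultdict
from email.utils import parseaddr


def per_active_sender(records):
    """Return {month: on-topic messages per distinct sender in that month}.

    records is an iterable of (month, from_header, on_topic) tuples.
    "Active" is assumed to mean: sent at least one message that month.
    """
    topic = defaultdict(int)      # month -> number of on-topic messages
    senders = defaultdict(set)    # month -> set of sender addresses
    for month, from_header, on_topic in records:
        # parseaddr splits "Name <addr>" into (name, addr); keep the address.
        senders[month].add(parseaddr(from_header)[1].lower())
        if on_topic:
            topic[month] += 1
    return {month: topic[month] / len(addrs) for month, addrs in senders.items()}
```

The From lines in the downloadable archives may be obfuscated ("user at example.com"), so they would probably need their own cleanup before being passed in.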

On Mon, Jul 30, 2018 at 3:29 PM Victor Stinner <vstinner@redhat.com> wrote:
Hi Bob,

I wrote a basic script to compute the number of emails per PEP. It requires downloading the gzipped mbox files from the per-month archive page and then decompressing them:
https://github.com/vstinner/misc/blob/master/python/parse_mailman_mbox_peps.py
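
For readers who want the general idea without opening the link, here is a minimal sketch of the same approach. It is not the script above: the function names and the regular expression are illustrative assumptions. It decompresses one monthly archive and lets the standard-library mailbox module split it into messages, which is usually more reliable than counting "From ", "Subject:" or "Message-ID" lines by hand, since those strings can also appear inside message bodies.

```python
import gzip
import mailbox
import re
import shutil
from collections import Counter

# Matches "PEP 572", "PEP572", "pep-572" in a subject line (an assumed pattern).
PEP_RE = re.compile(r"\bPEP[ -]?(\d+)\b", re.IGNORECASE)


def gunzip(src, dst):
    """Decompress src (.gz) to dst; mailbox.mbox needs a plain file path."""
    with gzip.open(src, "rb") as fin, open(dst, "wb") as fout:
        shutil.copyfileobj(fin, fout)


def count_peps(mbox_path):
    """Return a Counter mapping PEP number -> messages whose Subject mentions it."""
    counts = Counter()
    for msg in mailbox.mbox(mbox_path, create=False):
        subject = str(msg.get("Subject", ""))
        # Use a set so a PEP repeated in one subject is counted once.
        for pep in {int(n) for n in PEP_RE.findall(subject)}:
            counts[pep] += 1
    return counts
```

Running count_peps() on each month's file and reading off counts[572] would give one data point per month. If an archive does not escape body lines that begin with "From ", the parser may split a message in two, so treat the counts as approximate.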

Results:
https://mail.python.org/pipermail/python-committers/2018-April/005310.html

Victor

On Monday, July 30, 2018, Bob Purvy <bpurvy@gmail.com> wrote:
> hi all,
> I've been trying to figure out how to access the archives programmatically. I'm sure this is easy once you know, but googling various things hasn't worked. What I want to do is graph the number of messages about PEP 572 over time. (Or has someone already done that?)
> I installed GNU Mailman, and downloaded the gzip'ed archives for a number of months and unzipped them, and I suspect that there's some way to get them all into a single database, but it hasn't jumped out at me.  If I count the "Message-ID" lines, the "Subject:" lines, and the "\nFrom " lines in one of those text files, I get slightly different numbers for each.
> Alternatively, they're maybe already in a database, and I just need API access to do the querying?  Can someone help me out?
> Bob