Parsing Email 'References' header.
Aspersieman
aspersieman at gmail.com
Thu May 8 08:53:10 EDT 2008
Hi
I have a python script that parses email headers to extract information
from them. I need to get the _last_ messageid in the 'References' field
(http://cr.yp.to/immhf/thread.html) to create a threaded view of these
emails (these messageid's are stored in a database).
Now, I can easily access the 'References' field using the python 'email'
module, but I need a regular expression to get the last messageid in the
'References' field.
Here's what I have so far:
<code>
rx_lastmesgid = re.compile(r"(<.+>$)")
lastmesgid = "".join( filter( rx_lastmesgid.match, parentid ) ) #
parentid's value is
eg:"<1 at mail.gmail.com><2 at mail.gmail.com><3 at mail.gmail.com><4 at mail.gmail.com><5 at mail.gmail.com>"
lastmesgid = "".join( filter( rx_lastmesgid.match, parentid ) )
</code>
I need it to return "<5 at mail.gmail.com>"
Can anyone help?
Thanks
Nicol
--
The three things to remember about Llamas:
1) They are harmless
2) They are deadly
3) They are made of lava, and thus nice to cuddle.
More information about the Python-list
mailing list