Using Google Search Appliance With Mailman Archives?
I'm in the process of migrating from ezmlm to mailman. So far everything is working great. I've even been able to migrate ezmlm list archives to mailman so that I can see the messages via the mailman web interface.
One of the goals of this migration is to be able to use a Google Search Appliance to search the list archives. What I've found is that the archive for each list is in /var/lib/mailman/archives/private/listname, and in this directory are
1) a directory for each month containing the messages
submitted during the month in HTML format.
2) a file for each month containing all the messages
for the month in text format concatenated together.
I'm trying to figure out the best way to search the archive with a GSA. I'm worried that if I search #1 I'll find what I want but it will be in HTML format which won't be very easy to read. If I search #2 I'll find what I want but I'll see the whole file, which will also contain a bunch of stuff I'm not looking for.
Has anybody worked through these issues with a GSA? I'd be interested in hearing how you did it.
Cordially, Jon Forrest
Jon Forrest wrote:
What I've found is that the archive for each list is in /var/lib/mailman/archives/private/listname, and in this directory are
a directory for each month containing the messages submitted during the month in HTML format.
a file for each month containing all the messages for the month in text format concatenated together.
I'm trying to figure out the best way to search the archive with a GSA. I'm worried that if I search #1 I'll find what I want but it will be in HTML format which won't be very easy to read.
Actually, the message portion of an HTML archive file is just plain text surrounded by <pre>, </pre> tags so it is easy to read.
Has anybody worked through these issues with a GSA? I'd be interested in hearing how you did it.
Not with a GSA, but see the FAQs at <http://wiki.list.org/x/MoA9> and <http://wiki.list.org/x/dYA9>.
-- Mark Sapiro <mark@msapiro.net> The highway is for gamblers, San Francisco Bay Area, California better use your sense - B. Dylan
participants (2)
-
Jon Forrest
-
Mark Sapiro