data:image/s3,"s3://crabby-images/5be3c/5be3cf95b0c37aef9bb6bd3d7d1934ee943e4e9e" alt=""
Has anyone created a script they'd like to share that kills off the sobig stuff sitting on hold?
I'll make one but if there's one already out there, it'd save me some time. I'm leaving on vacation Friday night. Lots to do 'till then...
--
Democracy is two wolves and a lamb voting on what to have for lunch. Liberty is a well-armed lamb contesting the vote.
data:image/s3,"s3://crabby-images/453c8/453c868146b839a25f378da575fd92bd89ea9f5c" alt=""
Hi,
Get auto_discard script here,
http://sourceforge.net/tracker/?func=detail&aid=636412&group_id=103&atid=300103
and place it in somewhere it can import paths.py (cron dir is suitable). Edit the expire parameter for shorter time and run the script (python auto_discard).
It will kill all the pending posts older than the expire parameter. (it doesn't sort sobig stuff though ...)
Phil Barnett wrote:
-- Tokio Kikuchi, tkikuchi@ is.kochi-u.ac.jp http://weather.is.kochi-u.ac.jp/
data:image/s3,"s3://crabby-images/5be3c/5be3cf95b0c37aef9bb6bd3d7d1934ee943e4e9e" alt=""
On Wednesday 20 August 2003 11:35 pm, Phil Barnett wrote:
In the grand tradition of replying to myself...
Here is the script I cobbled up.
It deletes the worst of the muck. I let cron run it hourly. Watch the wrap. All lines should start with /bin/grep.
45 * * * * /home/mailman/data/cleanup.sh
#! /bin/bash
cd /home/mailman/data
/bin/grep -H -m 1 -l -i '6.00.2600.0000' *.txt | /usr/bin/xargs rm >/dev/null 2>&1 /bin/grep -H -m 1 -l -i 'nigeria' *.txt | /usr/bin/xargs rm >/dev/null 2>&1 /bin/grep -H -m 1 -l -i 'winning notification' *.txt | /usr/bin/xargs rm
--
Democracy is two wolves and a lamb voting on what to have for lunch. Liberty is a well-armed lamb contesting the vote.
data:image/s3,"s3://crabby-images/79c3c/79c3c9b7eaaad86a55cfe060f19b8e899a10dce4" alt=""
--- Phil Barnett <philb@philb.us> wrote:
How do you go about getting .txt files in your /home/mailman/data dir ? All I see are binary pck files (which one could run dumpdb on, but was curious to know). I'm running mailman-2.1.2. BTW, does deleting those .pck files have any baring on anything else (is it dangerous) ? Meaning, are there any adverse affects in doing that (is there a db that cares or anything) ?
BTW: for those mailman users looking to incorporate spamassassin, do look into this site - it would be wonderful if mailman proper included these hooks natively in its future releases (or at a min include the info on the URL below on list.org or the FAQ).
http://www.daa.com.au/~james/articles/mailman-spamassassin/
One thing this setup is lacking is "learning" - ie. the ability for
spamassassin to play nicer with mailman (ie. to know what things you
discarded to build-up a more profound history affecting future action).
Regards,
- Nadim
Do you Yahoo!? Yahoo! SiteBuilder - Free, easy-to-use web site design software http://sitebuilder.yahoo.com
data:image/s3,"s3://crabby-images/50535/5053512c679a1bec3b1143c853c1feacdabaee83" alt=""
Take a look in cvs. There's a script called bin/discard which takes a list of file names and does proper discards on all those files. I'll be testing it on mail.python.org when 1) I dig out from the mail hell we've been in for the last few days, and 2) SF's CVS stops sucking <wink>.
-Barry
data:image/s3,"s3://crabby-images/d84b5/d84b5a5995488da5093d42196312f20a9bfdfdf6" alt=""
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1
baw> Take a look in cvs. There's a script called bin/discard
baw> which takes a list of file names and does proper discards on
baw> all those files. I'll be testing it on mail.python.org when
baw> 1) I dig out from the mail hell we've been in for the last
baw> few days, and 2) SF's CVS stops sucking <wink>.
I hope this is something where the choice of what to discard rests with the individual list owner/moderator.
Until now, with Mailman, content discrimination has been solely up to the individual list owner and his minions. IMHO it is good to make it easy for the individual list owners to do their job, but anything that requires content decisions to be made across all lists on a Mailman will be inappropriate for at least some, and probably many, mailing list operators. Consider a Mailman that carries many diverse lists including some that, for example, are directly involved with Nigerian affairs and Nigerian folk. Also, depending who operates Mailman and under what auspices and under what jurisdiction, an operator that exercises some discrimination based upon content may viewed as being responsible for content in general. For the sake of those operators who cannot take such responsibility, please continue to make any facilities for discriminating based upon content be tailored to the needs of individual lists and by the individual list owners.
Elsewhere recently there has been mention of Baysen spam filters or the like for Mailman. Again, to be generally useful, it would, IMHO, be necessary to allow these features to be tailored to the needs of individual lists and to continue to plaice all responsibility firmly with the individual list owner where that is the policy of the Mailman operator.
jam
-----BEGIN PGP SIGNATURE-----
iD8DBQE/WK/oUEvv1b/iXy8RAuNFAKCHAsjxKoi8KjvfVd4TQTyoYcNH9ACbBB5M 7n6h6vlje0h83mvozPghpPU= =NUsK -----END PGP SIGNATURE-----
data:image/s3,"s3://crabby-images/50535/5053512c679a1bec3b1143c853c1feacdabaee83" alt=""
On Fri, 2003-09-05 at 11:46, John A. Martin wrote:
I hope this is something where the choice of what to discard rests with the individual list owner/moderator.
The script is just a hack, and must be run by the site admin on the command line. It's just a way to mass discard a ton of crap. It works on file names so it's easy to focus it on a single list.
That's how my Spambayes patch to Mailman works. It hooks into the normal approval process. It'll likely need updating to the latest versions of Spambayes (which recently underwent an interface upheaval). I'd probably be inclined to put this on the feature list for Mailman 2.2.
-Barry
data:image/s3,"s3://crabby-images/d458d/d458d7f6bafff5188464d7d94163b73f52cfeb21" alt=""
On Friday 05 September 2003 18:16, Barry Warsaw wrote:
I'm using this patch (slightly adapted) on our servers since early april. It works almost seamlessly and its very effective but still lacks some UI controls, e.g. actually there's no way to train the filter TTW with a message which is not in the moderation queue. Also, if a message gets discarded or posted to the list, there's no way to recoved the pristine copy, the one that was inspected by the filter, and the only copy one can use is Decorate'ed, mangled and/or filled with unwanted Received headers. Hopefully I'm going to work again on this patch in the next few months.
-- Adde parvum parvo magnus acervus erit -- Ovidio
data:image/s3,"s3://crabby-images/5be3c/5be3cf95b0c37aef9bb6bd3d7d1934ee943e4e9e" alt=""
On Friday 05 September 2003 1:48 am, Nadim Shaikli wrote:
It only deletes the messages that are caught by my filters. In this case, it is:
# Lines that *start* with a '#' are comments. to: friend@public.com message-id: relay.comanche.denmark.eu from: list@listme.com from: .*@uplinkpro.com from: .*@lithesoft.com from: .*@paid4survey.net from: .*@freegift4u.com.* from: .*q-crystal.com.* subject: .*@Podtal.* subject: .*URGENT ASSISTANCE.* from: .*etoyshop.* from: .*bdavisa.* subject: .*new photos from my party.* Content-type: text/html Content-type: text/enriched Content-type: text/x-vcard Content-type: multipart/alternative Content-type: multipart/related Content-type: multipart/mixed Content-type: application/octet-stream Content-Type: text/html Content-Type: text/enriched Content-Type: text/x-vcard Content-Type: multipart/alternative Content-Type: multipart/related Content-Type: multipart/mixed Content-Type: application/octet-stream Content-Disposition: attachment from: .*@lehugo.com.br.* subject: .*Autoreply:.* Precedence: .*bulk.*
--
Democracy is two wolves and a lamb voting on what to have for lunch. Liberty is a well-armed lamb contesting the vote.
data:image/s3,"s3://crabby-images/5be3c/5be3cf95b0c37aef9bb6bd3d7d1934ee943e4e9e" alt=""
On Friday 05 September 2003 1:48 am, Nadim Shaikli wrote:
I forgot to answer this.
Yes, the html admin page complains that the mail is lost.
Hit the submit button and it cleans up after itself.
Doesn't appear to cause any problems.
--
Democracy is two wolves and a lamb voting on what to have for lunch. Liberty is a well-armed lamb contesting the vote.
data:image/s3,"s3://crabby-images/453c8/453c868146b839a25f378da575fd92bd89ea9f5c" alt=""
Hi,
Get auto_discard script here,
http://sourceforge.net/tracker/?func=detail&aid=636412&group_id=103&atid=300103
and place it in somewhere it can import paths.py (cron dir is suitable). Edit the expire parameter for shorter time and run the script (python auto_discard).
It will kill all the pending posts older than the expire parameter. (it doesn't sort sobig stuff though ...)
Phil Barnett wrote:
-- Tokio Kikuchi, tkikuchi@ is.kochi-u.ac.jp http://weather.is.kochi-u.ac.jp/
data:image/s3,"s3://crabby-images/5be3c/5be3cf95b0c37aef9bb6bd3d7d1934ee943e4e9e" alt=""
On Wednesday 20 August 2003 11:35 pm, Phil Barnett wrote:
In the grand tradition of replying to myself...
Here is the script I cobbled up.
It deletes the worst of the muck. I let cron run it hourly. Watch the wrap. All lines should start with /bin/grep.
45 * * * * /home/mailman/data/cleanup.sh
#! /bin/bash
cd /home/mailman/data
/bin/grep -H -m 1 -l -i '6.00.2600.0000' *.txt | /usr/bin/xargs rm >/dev/null 2>&1 /bin/grep -H -m 1 -l -i 'nigeria' *.txt | /usr/bin/xargs rm >/dev/null 2>&1 /bin/grep -H -m 1 -l -i 'winning notification' *.txt | /usr/bin/xargs rm
--
Democracy is two wolves and a lamb voting on what to have for lunch. Liberty is a well-armed lamb contesting the vote.
data:image/s3,"s3://crabby-images/79c3c/79c3c9b7eaaad86a55cfe060f19b8e899a10dce4" alt=""
--- Phil Barnett <philb@philb.us> wrote:
How do you go about getting .txt files in your /home/mailman/data dir ? All I see are binary pck files (which one could run dumpdb on, but was curious to know). I'm running mailman-2.1.2. BTW, does deleting those .pck files have any baring on anything else (is it dangerous) ? Meaning, are there any adverse affects in doing that (is there a db that cares or anything) ?
BTW: for those mailman users looking to incorporate spamassassin, do look into this site - it would be wonderful if mailman proper included these hooks natively in its future releases (or at a min include the info on the URL below on list.org or the FAQ).
http://www.daa.com.au/~james/articles/mailman-spamassassin/
One thing this setup is lacking is "learning" - ie. the ability for
spamassassin to play nicer with mailman (ie. to know what things you
discarded to build-up a more profound history affecting future action).
Regards,
- Nadim
Do you Yahoo!? Yahoo! SiteBuilder - Free, easy-to-use web site design software http://sitebuilder.yahoo.com
data:image/s3,"s3://crabby-images/50535/5053512c679a1bec3b1143c853c1feacdabaee83" alt=""
Take a look in cvs. There's a script called bin/discard which takes a list of file names and does proper discards on all those files. I'll be testing it on mail.python.org when 1) I dig out from the mail hell we've been in for the last few days, and 2) SF's CVS stops sucking <wink>.
-Barry
data:image/s3,"s3://crabby-images/d84b5/d84b5a5995488da5093d42196312f20a9bfdfdf6" alt=""
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1
baw> Take a look in cvs. There's a script called bin/discard
baw> which takes a list of file names and does proper discards on
baw> all those files. I'll be testing it on mail.python.org when
baw> 1) I dig out from the mail hell we've been in for the last
baw> few days, and 2) SF's CVS stops sucking <wink>.
I hope this is something where the choice of what to discard rests with the individual list owner/moderator.
Until now, with Mailman, content discrimination has been solely up to the individual list owner and his minions. IMHO it is good to make it easy for the individual list owners to do their job, but anything that requires content decisions to be made across all lists on a Mailman will be inappropriate for at least some, and probably many, mailing list operators. Consider a Mailman that carries many diverse lists including some that, for example, are directly involved with Nigerian affairs and Nigerian folk. Also, depending who operates Mailman and under what auspices and under what jurisdiction, an operator that exercises some discrimination based upon content may viewed as being responsible for content in general. For the sake of those operators who cannot take such responsibility, please continue to make any facilities for discriminating based upon content be tailored to the needs of individual lists and by the individual list owners.
Elsewhere recently there has been mention of Baysen spam filters or the like for Mailman. Again, to be generally useful, it would, IMHO, be necessary to allow these features to be tailored to the needs of individual lists and to continue to plaice all responsibility firmly with the individual list owner where that is the policy of the Mailman operator.
jam
-----BEGIN PGP SIGNATURE-----
iD8DBQE/WK/oUEvv1b/iXy8RAuNFAKCHAsjxKoi8KjvfVd4TQTyoYcNH9ACbBB5M 7n6h6vlje0h83mvozPghpPU= =NUsK -----END PGP SIGNATURE-----
data:image/s3,"s3://crabby-images/50535/5053512c679a1bec3b1143c853c1feacdabaee83" alt=""
On Fri, 2003-09-05 at 11:46, John A. Martin wrote:
I hope this is something where the choice of what to discard rests with the individual list owner/moderator.
The script is just a hack, and must be run by the site admin on the command line. It's just a way to mass discard a ton of crap. It works on file names so it's easy to focus it on a single list.
That's how my Spambayes patch to Mailman works. It hooks into the normal approval process. It'll likely need updating to the latest versions of Spambayes (which recently underwent an interface upheaval). I'd probably be inclined to put this on the feature list for Mailman 2.2.
-Barry
data:image/s3,"s3://crabby-images/d458d/d458d7f6bafff5188464d7d94163b73f52cfeb21" alt=""
On Friday 05 September 2003 18:16, Barry Warsaw wrote:
I'm using this patch (slightly adapted) on our servers since early april. It works almost seamlessly and its very effective but still lacks some UI controls, e.g. actually there's no way to train the filter TTW with a message which is not in the moderation queue. Also, if a message gets discarded or posted to the list, there's no way to recoved the pristine copy, the one that was inspected by the filter, and the only copy one can use is Decorate'ed, mangled and/or filled with unwanted Received headers. Hopefully I'm going to work again on this patch in the next few months.
-- Adde parvum parvo magnus acervus erit -- Ovidio
data:image/s3,"s3://crabby-images/5be3c/5be3cf95b0c37aef9bb6bd3d7d1934ee943e4e9e" alt=""
On Friday 05 September 2003 1:48 am, Nadim Shaikli wrote:
It only deletes the messages that are caught by my filters. In this case, it is:
# Lines that *start* with a '#' are comments. to: friend@public.com message-id: relay.comanche.denmark.eu from: list@listme.com from: .*@uplinkpro.com from: .*@lithesoft.com from: .*@paid4survey.net from: .*@freegift4u.com.* from: .*q-crystal.com.* subject: .*@Podtal.* subject: .*URGENT ASSISTANCE.* from: .*etoyshop.* from: .*bdavisa.* subject: .*new photos from my party.* Content-type: text/html Content-type: text/enriched Content-type: text/x-vcard Content-type: multipart/alternative Content-type: multipart/related Content-type: multipart/mixed Content-type: application/octet-stream Content-Type: text/html Content-Type: text/enriched Content-Type: text/x-vcard Content-Type: multipart/alternative Content-Type: multipart/related Content-Type: multipart/mixed Content-Type: application/octet-stream Content-Disposition: attachment from: .*@lehugo.com.br.* subject: .*Autoreply:.* Precedence: .*bulk.*
--
Democracy is two wolves and a lamb voting on what to have for lunch. Liberty is a well-armed lamb contesting the vote.
data:image/s3,"s3://crabby-images/5be3c/5be3cf95b0c37aef9bb6bd3d7d1934ee943e4e9e" alt=""
On Friday 05 September 2003 1:48 am, Nadim Shaikli wrote:
I forgot to answer this.
Yes, the html admin page complains that the mail is lost.
Hit the submit button and it cleans up after itself.
Doesn't appear to cause any problems.
--
Democracy is two wolves and a lamb voting on what to have for lunch. Liberty is a well-armed lamb contesting the vote.
participants (6)
-
Barry Warsaw
-
John A. Martin
-
Nadim Shaikli
-
Phil Barnett
-
Simone Piunno
-
Tokio Kikuchi