[Mailman-Users] Mail going to list archives but not to list!

Mark Sapiro msapiro at value.net
Mon Oct 16 00:19:07 CEST 2006


Allan Trick wrote:

>At 03:23 PM 10/15/2006, Mark Sapiro wrote:
>
>>Is OutgoingRunner running? If not, are there any messages in 
>>Mailman's 'qrunner' log indicating why or when it stopped?
>
>I think so.  See below.
>
>>Have you tried 'bin/mailmanctl restart'?
>
>Yes, to no avail.  When I do that, this is what is logged in the qrunner log:
>
>>Oct 15 16:24:25 2006 (3753) NewsRunner qrunner caught SIGINT.  Stopping.
>>Oct 15 16:24:25 2006 (1662) Master watcher caught SIGINT.  Restarting.
>>Oct 15 16:24:25 2006 (3752) VirginRunner qrunner caught SIGINT.  Stopping.
>>Oct 15 16:24:25 2006 (3754) ArchRunner qrunner caught SIGINT.  Stopping.
>>Oct 15 16:24:25 2006 (3753) NewsRunner qrunner exiting.
>>Oct 15 16:24:25 2006 (3756) CommandRunner qrunner caught SIGINT.  Stopping.
>>Oct 15 16:24:25 2006 (3752) VirginRunner qrunner exiting.
>>Oct 15 16:24:25 2006 (3756) CommandRunner qrunner exiting.
>>Oct 15 16:24:25 2006 (3757) BounceRunner qrunner caught SIGINT.  Stopping.
>>Oct 15 16:24:25 2006 (3757) BounceRunner qrunner exiting.
>>Oct 15 16:24:25 2006 (3751) RetryRunner qrunner caught SIGINT.  Stopping.
>>Oct 15 16:24:25 2006 (3751) RetryRunner qrunner exiting.
>>Oct 15 16:24:25 2006 (1662) Master qrunner detected subprocess exit
>>(pid: 3756, sig: None, sts: 2, class: CommandRunner, slice: 1/1) [restarting]
>>Oct 15 16:24:25 2006 (3755) IncomingRunner qrunner caught SIGINT.  Stopping.
>>Oct 15 16:24:25 2006 (3754) ArchRunner qrunner exiting.
>>Oct 15 16:24:25 2006 (3755) IncomingRunner qrunner exiting.
>>Oct 15 16:24:25 2006 (1662) Master qrunner detected subprocess exit
>>(pid: 3757, sig: None, sts: 2, class: BounceRunner, slice: 1/1) [restarting]
>>Oct 15 16:24:25 2006 (1662) Master qrunner detected subprocess exit
>>(pid: 3753, sig: None, sts: 2, class: NewsRunner, slice: 1/1) [restarting]
>>Oct 15 16:24:25 2006 (1662) Master qrunner detected subprocess exit
>>(pid: 3755, sig: None, sts: 2, class: IncomingRunner, slice: 1/1) [restarting]
>>Oct 15 16:24:25 2006 (1662) Master qrunner detected subprocess exit
>>(pid: 3751, sig: None, sts: 2, class: RetryRunner, slice: 1/1) [restarting]
>>Oct 15 16:24:25 2006 (1662) Master qrunner detected subprocess exit
>>(pid: 3754, sig: None, sts: 2, class: ArchRunner, slice: 1/1) [restarting]
>>Oct 15 16:24:25 2006 (1662) Master qrunner detected subprocess exit
>>(pid: 3752, sig: None, sts: 2, class: VirginRunner, slice: 1/1) [restarting]
>>Oct 15 16:24:25 2006 (8830) NewsRunner qrunner started.
>>Oct 15 16:24:25 2006 (8828) CommandRunner qrunner started.
>>Oct 15 16:24:25 2006 (8832) IncomingRunner qrunner started.
>>Oct 15 16:24:25 2006 (8833) RetryRunner qrunner started.
>>Oct 15 16:24:25 2006 (8834) ArchRunner qrunner started.
>>Oct 15 16:24:25 2006 (8829) BounceRunner qrunner started.
>>Oct 15 16:24:25 2006 (8835) VirginRunner qrunner started.


OutgoingRunner is missing. There are no 'caught SIGINT', 'exiting',
'[restarting]' or 'started' entries for it.
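
If you want to check this mechanically rather than by eye, here is a
minimal Python sketch. It assumes the 'started' lines look like the ones
quoted above, and that the installation runs the eight qrunners named in
EXPECTED (my assumption of the usual Mailman 2.1 set; adjust it to match
your own configuration). The function name missing_runners is only
illustrative.

```python
import re

# Assumed set of runners; change this to match the qrunners your site runs.
EXPECTED = {
    "ArchRunner", "BounceRunner", "CommandRunner", "IncomingRunner",
    "NewsRunner", "OutgoingRunner", "RetryRunner", "VirginRunner",
}

# Matches e.g. "Oct 15 16:24:25 2006 (8830) NewsRunner qrunner started."
STARTED = re.compile(r"\(\d+\) (\w+Runner) qrunner started\.")


def missing_runners(log_lines, expected=EXPECTED):
    """Return the expected runners that have no 'started' entry."""
    seen = set()
    for line in log_lines:
        match = STARTED.search(line)
        if match:
            seen.add(match.group(1))
    return sorted(set(expected) - seen)
```

Feed it the lines logged by the most recent restart only (for example
the tail of the qrunner log); older 'started' entries would otherwise
hide a runner that has since died.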

What is in the qrunner log prior to this? Presumably there are entries
from last Wednesday indicating its 'restart limit' was reached.

Does 'ps -fAw | grep Runner' (or however you spell it on your system)
show OutgoingRunner?
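
The same check can be scripted. This is only a sketch: it assumes each
qrunner process appears in the ps output with a '--runner=Name:slice:count'
argument, and that 'ps -fAw' is the right spelling on your system. The
helper names runners_in_ps and current_runners are hypothetical.

```python
import re
import subprocess

# Assumed form of the qrunner command-line argument, e.g. --runner=NewsRunner:0:1
RUNNER_ARG = re.compile(r"--runner=(\w+):\d+:\d+")


def runners_in_ps(ps_output):
    """Return the sorted, de-duplicated runner names found in ps output text."""
    return sorted(set(RUNNER_ARG.findall(ps_output)))


def current_runners():
    """Run ps (flags may differ on your system) and list the runners found."""
    result = subprocess.run(
        ["ps", "-fAw"], capture_output=True, text=True, check=True
    )
    return runners_in_ps(result.stdout)
```

If 'OutgoingRunner' is not in the list returned by current_runners(),
that runner is not running.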

Try

bin/mailmanctl stop

Then make sure all Mailman processes are stopped per
<http://www.python.org/cgi-bin/faqw-mm.py?req=show&file=faq04.068.htp>

then do

bin/mailmanctl start

If that doesn't get things going, check the qrunner and error logs.

>>Are there any recent entries in Mailman's 'error', 'smtp', and 
>>'smtp-failure' logs?
>
>No - no entries in smtp since last Wednesday (but Mailman still 
>archives attempts to send out, as if they went).


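Archiving is separate from sending. ArchRunner handles the archive
queue and OutgoingRunner handles delivery, so posts can be archived
even when nothing is being sent.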


>However, the error 
>log is a bit strange.  This is how it starts, exactly:
>
>>Jan 15 12:15:36 2004 mailmanctl(24486): Primary start-up and 
>>shutdown script for Mailman's qrunner daemon.
>>
>>This script starts, stops, and restarts the main Mailman queue runners, making
<snip>
>>               next time a message is written to them
>>Jan 15 12:15:36 2004 mailmanctl(24486):
>>Jan 15 12:15:36 2004 mailmanctl(24486): No command given.
>>Jan 15 12:17:25 2004 mailmanctl(24740): No child with pid: 22488
>>Jan 15 12:17:25 2004 mailmanctl(24740): [Errno 3] No such process
>>Jan 15 12:17:25 2004 mailmanctl(24740): Stale pid file removed.
>>Jan 15 12:25:00 2004 qrunner(25779): Run one or more qrunners, once 
>>or repeatedly.
>>
>>Each named runner class is run in round-robin fashion.  In other words, the
>>first named runner is run to consume all the files currently in its
>>directory.  When that qrunner is done, the next one is run to consume all the
>>files in /its/ directory, and so on.  The number of total iterations can be
>>given on the command line.
>>
>>Usage: ./qrunner [options]
>>
>>Options:
>
>. . . . <snip> . . . .
>
>Doesn't look like I expect it should.  Does this look like corruption 
>that may have something to do with my problem?


No, it has nothing to do with your problem. These entries are from Jan
2004 and they represent the 'help' output from bin/mailmanctl and
bin/qrunner. Possibly this was during initial installation of Mailman.
I think they represent these processes being run as a service without
a controlling terminal and sending this output to the error log.

-- 
Mark Sapiro <msapiro at value.net>       The highway is for gamblers,
San Francisco Bay Area, California    better use your sense - B. Dylan



