[ mailman-Bugs-670535 ] qrunner stops for no apparent reason

SourceForge.net noreply at sourceforge.net
Sat Apr 19 14:51:57 EDT 2003


Bugs item #670535, was opened at 2003-01-18 22:04
Message generated for change (Comment added) made by bwarsaw
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=100103&aid=670535&group_id=103

Category: mail delivery
Group: 2.1 (stable)
>Status: Pending
Resolution: None
Priority: 8
Submitted By: David Gibbs (midrangeman)
Assigned to: Nobody/Anonymous (nobody)
Summary: qrunner stops for no apparent reason

Initial Comment:
About once every day or so, qrunner will stop for no 
apparent reason.

The qrunner log file has the following ...

Jan 18 14:29:09 2003 (3443) IncomingRunner qrunner 
caught SIGTERM.  Stopping.
Jan 18 14:29:09 2003 (3443) IncomingRunner qrunner 
exiting.
Jan 18 14:29:09 2003 (3441) BounceRunner qrunner 
caught SIGTERM.  Stopping.
Jan 18 14:29:09 2003 (3441) BounceRunner qrunner 
exiting.
Jan 18 14:29:09 2003 (3445) OutgoingRunner qrunner 
caught SIGTERM.  Stopping.
Jan 18 14:29:09 2003 (3445) OutgoingRunner qrunner 
exiting.
Jan 18 14:29:09 2003 (3442) CommandRunner qrunner 
caught SIGTERM.  Stopping.
Jan 18 14:29:09 2003 (3442) CommandRunner qrunner 
exiting.
Jan 18 14:29:09 2003 (3446) VirginRunner qrunner 
caught SIGTERM.  Stopping.
Jan 18 14:29:09 2003 (3446) VirginRunner qrunner 
exiting.
Jan 18 14:29:09 2003 (3440) ArchRunner qrunner caught 
SIGTERM.  Stopping.
Jan 18 14:29:09 2003 (3440) ArchRunner qrunner exiting.
Jan 18 14:29:10 2003 (3444) NewsRunner qrunner 
caught SIGTERM.  Stopping.
Jan 18 14:29:12 2003 (3444) NewsRunner qrunner 
exiting.

No other log has any indication of what might be 
happening.  

Is there a way to increase the logging somewhere so the 
cause can be identified?


----------------------------------------------------------------------

>Comment By: Barry A. Warsaw (bwarsaw)
Date: 2003-04-19 16:51

Message:
Logged In: YES 
user_id=12800

David, have you been able to dig up more information about
this problem? 

I'm moving this to Pending as we have no clue why it's
happening for you and cannot reproduce it on any systems we
have available to us.

----------------------------------------------------------------------

Comment By: Thomas Wouters (twouters)
Date: 2003-03-21 19:54

Message:
Logged In: YES 
user_id=34209

No, having multiple versions of Python should not be causing
this. Nor should the SIGALRM handler being triggered cause
it, unless something is seriously broken in your setup --
but we've already been there.

The only way to see if a SIGTERM is actually being delivered
is running the processes under strace or gdb, but this
seriously disrupts regular operation. There is no way that i
know of to find out where a signal is coming from, once you
find out that it really is a signal. If it *isn't* a real
signal, I would start looking at libc bugs and other
platform bugs. You can try upgrading Python to 2.2.2 (the
latest bugfix release) but I would be very suprised if it
fixed your problem. RedHat does not have a great reputation
for stability, so be sure to check for any RedHat updates.



----------------------------------------------------------------------

Comment By: David Gibbs (midrangeman)
Date: 2003-03-17 12:44

Message:
Logged In: YES 
user_id=86339

Additional environment details:

Redhat Linux 8.0, uname = "Linux xxx.midrange.com 2.4.18-
26.8.0 #1 Mon Feb 24 10:21:42 EST 2003 i686 i686 i386 
GNU/Linux"

Python: 2.2.1

CPU: P4 2.4ghz, 512mb RAM

Dunno if this makes a difference, but I have the following 
directories ...

/usr/lib/python1.5  
/usr/lib/python2.1  
/usr/lib/python2.2

Any chance there is a conflict?


----------------------------------------------------------------------

Comment By: David Gibbs (midrangeman)
Date: 2003-01-24 13:31

Message:
Logged In: YES 
user_id=86339

I added some debug code to mailmanctl and found out that the 
sigalarm handler is firing just before the qrunners are terminating.

----------------------------------------------------------------------

Comment By: David Gibbs (midrangeman)
Date: 2003-01-22 16:31

Message:
Logged In: YES 
user_id=86339

After some further research, QRUNNER seems to stop after 
exactly 24 hours of operation.  That is, 24 hours after qrunner 
starts, it ends as if someone killed it with SIGTERM.  I know 
for a fact that nobody is actually doing this ... and no process 
on my system should be aware of the fact that qrunner is 
actually running.

I will not discount the possiblity that this is an environmental 
factor, but it seems to me that a daemon process should not 
be affected by environmental factors.  

----------------------------------------------------------------------

Comment By: Barry A. Warsaw (bwarsaw)
Date: 2003-01-19 17:46

Message:
Logged In: YES 
user_id=12800

I'm not sure what kind of logging would help.  Some process
somewhere is SIGTERMing the mailmanctl controller process. 
There's no way to know where a signal is coming from, so I'm
not sure what more you could do in mailmanctl.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=100103&aid=670535&group_id=103



More information about the Mailman-coders mailing list