[Mailman-Users] mailmain starting twice
Zbigniew Szalbot
zszalbot at gmail.com
Mon Feb 18 23:08:38 CET 2008
Hello again,
2008/2/18, Mark Sapiro <mark at msapiro.net>:
> Zbigniew Szalbot wrote:
> >
> >Feb 18 18:26:53 2008 (560) ArchRunner qrunner started.
> >Feb 18 18:26:53 2008 (563) IncomingRunner qrunner started.
> >Feb 18 18:26:53 2008 (567) RetryRunner qrunner started.
> >Feb 18 18:26:53 2008 (565) OutgoingRunner qrunner started.
> >Feb 18 18:26:53 2008 (564) NewsRunner qrunner started.
> >Feb 18 18:26:53 2008 (566) VirginRunner qrunner started.
> >Feb 18 18:26:53 2008 (562) CommandRunner qrunner started.
> >Feb 18 18:26:53 2008 (561) BounceRunner qrunner started.
> >Feb 18 18:27:06 2008 (718) CommandRunner qrunner started.
> >Feb 18 18:27:06 2008 (716) ArchRunner qrunner started.
> >Feb 18 18:27:06 2008 (717) BounceRunner qrunner started.
> >Feb 18 18:27:06 2008 (719) IncomingRunner qrunner started.
> >Feb 18 18:27:06 2008 (722) VirginRunner qrunner started.
> >Feb 18 18:27:06 2008 (721) OutgoingRunner qrunner started.
> >Feb 18 18:27:06 2008 (720) NewsRunner qrunner started.
> >Feb 18 18:27:07 2008 (723) RetryRunner qrunner started.
> >Feb 18 18:28:08 2008 (559) Master qrunner detected subprocess exit
> >(pid: 565, sig: None, sts: 1, class: OutgoingRunner, slice: 1/1) [restarting]
>
>
> The above is from the first OutgoingRunner (pid 565) dying on its own
> and being restarted by its master (pid 559).
>
> >Feb 18 18:28:09 2008 (1242) OutgoingRunner qrunner started.
>
> It restarts as pid 1242.
>
> This next bunch is from mailmanctl stop which signals the second master
> (pid 711) and all its children.
>
>
> >Feb 18 18:29:59 2008 (716) ArchRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:29:59 2008 (716) ArchRunner qrunner exiting.
> >Feb 18 18:29:59 2008 (717) BounceRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:29:59 2008 (717) BounceRunner qrunner exiting.
> >Feb 18 18:29:59 2008 (719) IncomingRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:29:59 2008 (719) IncomingRunner qrunner exiting.
> >Feb 18 18:29:59 2008 (718) CommandRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:29:59 2008 (718) CommandRunner qrunner exiting.
> >Feb 18 18:29:59 2008 (720) NewsRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:29:59 2008 (720) NewsRunner qrunner exiting.
> >Feb 18 18:29:59 2008 (721) OutgoingRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:29:59 2008 (721) OutgoingRunner qrunner exiting.
> >Feb 18 18:29:59 2008 (722) VirginRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:29:59 2008 (722) VirginRunner qrunner exiting.
> >Feb 18 18:29:59 2008 (723) RetryRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:29:59 2008 (723) RetryRunner qrunner exiting.
> >Feb 18 18:29:59 2008 (711) Master watcher caught SIGTERM. Exiting.
> >Feb 18 18:29:59 2008 (711) Master qrunner detected subprocess exit
> >(pid: 716, sig: None, sts: 15, class: ArchRunner, slice: 1/1)
> >Feb 18 18:30:00 2008 (711) Master qrunner detected subprocess exit
> >(pid: 718, sig: None, sts: 15, class: CommandRunner, slice: 1/1)
> >Feb 18 18:30:00 2008 (711) Master qrunner detected subprocess exit
> >(pid: 717, sig: None, sts: 15, class: BounceRunner, slice: 1/1)
> >Feb 18 18:30:00 2008 (711) Master qrunner detected subprocess exit
> >(pid: 719, sig: None, sts: 15, class: IncomingRunner, slice: 1/1)
> >Feb 18 18:30:00 2008 (711) Master qrunner detected subprocess exit
> >(pid: 720, sig: None, sts: 15, class: NewsRunner, slice: 1/1)
> >Feb 18 18:30:00 2008 (711) Master qrunner detected subprocess exit
> >(pid: 721, sig: None, sts: 15, class: OutgoingRunner, slice: 1/1)
> >Feb 18 18:30:00 2008 (711) Master qrunner detected subprocess exit
> >(pid: 722, sig: None, sts: 15, class: VirginRunner, slice: 1/1)
> >Feb 18 18:30:00 2008 (711) Master qrunner detected subprocess exit
> >(pid: 723, sig: None, sts: 15, class: RetryRunner, slice: 1/1)
>
>
> Then you SIGTERM'd the first master.
>
>
> >Feb 18 18:31:27 2008 (560) ArchRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:31:27 2008 (560) ArchRunner qrunner exiting.
> >Feb 18 18:31:27 2008 (561) BounceRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:31:27 2008 (561) BounceRunner qrunner exiting.
> >Feb 18 18:31:27 2008 (562) CommandRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:31:27 2008 (562) CommandRunner qrunner exiting.
> >Feb 18 18:31:27 2008 (563) IncomingRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:31:27 2008 (563) IncomingRunner qrunner exiting.
> >Feb 18 18:31:27 2008 (564) NewsRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:31:27 2008 (564) NewsRunner qrunner exiting.
> >Feb 18 18:31:27 2008 (566) VirginRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:31:27 2008 (566) VirginRunner qrunner exiting.
> >Feb 18 18:31:27 2008 (567) RetryRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:31:27 2008 (567) RetryRunner qrunner exiting.
> >Feb 18 18:31:27 2008 (1242) OutgoingRunner qrunner caught SIGTERM. Stopping.
> >Feb 18 18:31:27 2008 (1242) OutgoingRunner qrunner exiting.
> >Feb 18 18:31:27 2008 (559) Master watcher caught SIGTERM. Exiting.
> >Feb 18 18:31:27 2008 (559) Master qrunner detected subprocess exit
> >(pid: 562, sig: None, sts: 15, class: CommandRunner, slice: 1/1)
> >Feb 18 18:31:27 2008 (559) Master qrunner detected subprocess exit
> >(pid: 566, sig: None, sts: 15, class: VirginRunner, slice: 1/1)
> >Feb 18 18:31:27 2008 (559) Master qrunner detected subprocess exit
> >(pid: 563, sig: None, sts: 15, class: IncomingRunner, slice: 1/1)
> >Feb 18 18:31:27 2008 (559) Master qrunner detected subprocess exit
> >(pid: 561, sig: None, sts: 15, class: BounceRunner, slice: 1/1)
> >Feb 18 18:31:27 2008 (559) Master qrunner detected subprocess exit
> >(pid: 560, sig: None, sts: 15, class: ArchRunner, slice: 1/1)
> >Feb 18 18:31:27 2008 (559) Master qrunner detected subprocess exit
> >(pid: 564, sig: None, sts: 15, class: NewsRunner, slice: 1/1)
> >Feb 18 18:31:27 2008 (559) Master qrunner detected subprocess exit
> >(pid: 567, sig: None, sts: 15, class: RetryRunner, slice: 1/1)
> >Feb 18 18:31:27 2008 (559) Master qrunner detected subprocess exit
> >(pid: 1242, sig: None, sts: 15, class: OutgoingRunner, slice: 1/1)
>
>
> And finally restarted.
>
>
> >Feb 18 18:31:43 2008 (1710) IncomingRunner qrunner started.
> >Feb 18 18:31:43 2008 (1707) ArchRunner qrunner started.
> >Feb 18 18:31:43 2008 (1712) OutgoingRunner qrunner started.
> >Feb 18 18:31:43 2008 (1713) VirginRunner qrunner started.
> >Feb 18 18:31:43 2008 (1709) CommandRunner qrunner started.
> >Feb 18 18:31:43 2008 (1711) NewsRunner qrunner started.
> >Feb 18 18:31:43 2008 (1714) RetryRunner qrunner started.
> >Feb 18 18:31:43 2008 (1708) BounceRunner qrunner started.
>
>
>
> What I am curious about is what's in Mailman's error log relating to
> the first OutgoingRunner (pid 565) dying on its own. I would like to
> see the error and traceback from this.
I did edit mailman startup script as it contained the -s flag (I have
removed it now). Not sure if it was there by default or if I inserted
it at some point. Now for the error log:
Feb 18 19:23:45 2008 mailmanctl(188): No child with pid: 14396
Feb 18 19:23:45 2008 mailmanctl(188): [Errno 3] No such process
Feb 18 19:23:45 2008 mailmanctl(188): Stale pid file removed.
Feb 18 19:25:50 2008 mailmanctl(184): PID unreadable in:
/usr/local/mailman/data/master-qrunner.pid
Feb 18 19:25:50 2008 mailmanctl(184): [Errno 2] No such file or
directory: '/usr/local/mailman/data/master-qrunner.pid'
Feb 18 19:25:50 2008 mailmanctl(184): Is qrunner even running?
Feb 18 18:28:08 2008 qrunner(565): Traceback (most recent call last):
Feb 18 18:28:08 2008 qrunner(565): File
"/usr/local/mailman/bin/qrunner", line 278, in ?
Feb 18 18:28:08 2008 qrunner(565): main()
Feb 18 18:28:08 2008 qrunner(565): File
"/usr/local/mailman/bin/qrunner", line 238, in main
Feb 18 18:28:08 2008 qrunner(565): qrunner.run()
Feb 18 18:28:08 2008 qrunner(565): File
"/usr/local/mailman/Mailman/Queue/Runner.py", line 71, in run
Feb 18 18:28:08 2008 qrunner(565): filecnt = self._oneloop()
Feb 18 18:28:08 2008 qrunner(565): File
"/usr/local/mailman/Mailman/Queue/Runner.py", line 100, in _oneloop
Feb 18 18:28:08 2008 qrunner(565): msg, msgdata =
self._switchboard.dequeue(filebase)
Feb 18 18:28:08 2008 qrunner(565): File
"/usr/local/mailman/Mailman/Queue/Switchboard.py", line 150, in
dequeue
Feb 18 18:28:08 2008 qrunner(565): fp = open(filename)
Feb 18 18:28:08 2008 qrunner(565): IOError : [Errno 2] No such file
or directory: '/usr/local/mailman/qfiles/out/1203355686.956471+f4998fab7c49e7696e310da67e6e976129e32e0b.pck'
Feb 18 18:30:03 2008 mailmanctl(1633): No child with pid: 711
Feb 18 18:30:03 2008 mailmanctl(1633): [Errno 3] No such process
Feb 18 18:30:03 2008 mailmanctl(1633): Stale pid file removed.
Feb 18 18:30:11 2008 mailmanctl(1636): PID unreadable in:
/usr/local/mailman/data/master-qrunner.pid
Feb 18 18:30:11 2008 mailmanctl(1636): [Errno 2] No such file or
directory: '/usr/local/mailman/data/master-qrunner.pid'
Feb 18 18:30:11 2008 mailmanctl(1636): Is qrunner even running?
Feb 18 18:30:22 2008 mailmanctl(1648): PID unreadable in:
/usr/local/mailman/data/master-qrunner.pid
Feb 18 18:30:22 2008 mailmanctl(1648): [Errno 2] No such file or
directory: '/usr/local/mailman/data/master-qrunner.pid'
Feb 18 18:30:22 2008 mailmanctl(1648): Is qrunner even running?
Feb 18 18:31:02 2008 mailmanctl(1683): PID unreadable in:
/usr/local/mailman/data/master-qrunner.pid
Feb 18 18:31:02 2008 mailmanctl(1683): [Errno 2] No such file or
directory: '/usr/local/mailman/data/master-qrunner.pid'
Feb 18 18:31:02 2008 mailmanctl(1683): Is qrunner even running?
Thank you again!
--
Zbigniew Szalbot
More information about the Mailman-Users
mailing list