[Mailman-Users] Re: qrunner running away
Mark Symonds
mark at symonds.net
Sun Jun 22 22:17:32 CEST 2003
Hello,
The problem I posted previously (quoted below) has been resolved.
Scrutiny of the logs revealed that qrunner was complaining about
a missing file in /var/spool/mail/qfiles. This apparently caused
qrunner to loop endlessly, pegging the CPU at 100%.
There were only a few files in there, so we ran rm -fr on
/var/spool/mail/qfiles and voila, all is back to normal.
Yay! This was on a RH 7.1 system.
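For anyone hitting the same thing, a less drastic first step than
removing the whole directory might be to look for the incomplete
queue entry. Below is a minimal sketch, not anything shipped with
Mailman. It assumes each queued message is stored as a pair of files
sharing a base name, with the suffixes .msg and .db (an assumption;
check what your own qfiles directory actually contains). The helper
name find_unpaired is hypothetical.

```python
import os
from collections import defaultdict


def find_unpaired(qdir, suffixes=(".msg", ".db")):
    """Map each queue entry's base name to the suffixes it is missing.

    The suffix pair is an assumption about the queue layout; pass
    different suffixes if your installation uses other names.
    """
    expected = set(suffixes)
    seen = defaultdict(set)
    for name in os.listdir(qdir):
        base, ext = os.path.splitext(name)
        if ext in expected:
            seen[base].add(ext)
    # Keep only entries where at least one expected file is absent.
    return {
        base: sorted(expected - exts)
        for base, exts in seen.items()
        if exts != expected
    }
```

Calling find_unpaired("/var/spool/mail/qfiles") only lists candidates
and deletes nothing. Stop qrunner before moving any files aside.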
--
Mark Symonds
mark at symonds.net
----- Original Message -----
From: "Mark Symonds" <mark at symonds.net>
To: <mailman-users at python.org>
Sent: Saturday, June 21, 2003 3:29 AM
>
> Hello,
>
> Had mailman working wonderfully here until three days ago,
> when a larger list (just over 2500 users) ceased functioning
> and I noticed the load average was through the roof (usually
> it's very low):
>
> [root at tx exim]# w
> 1:24pm up 387 days, 2:26, 1 user, load average: 4.21, 4.29, 4.24
> USER TTY FROM LOGIN@ IDLE JCPU PCPU WHAT
> mark pts/0 ca1.symonds.net 1:19pm 0.00s 0.28s 0.01s w
> [root at tx exim]# ps auxw |grep python
> mailman 6751 23.9 2.4 66712 12372 ? R Jun19 534:48 /usr/bin/python -S /var/mailman/cron/qrunner
> mailman 21895 24.8 2.6 62840 13608 ? R Jun19 404:09 /usr/bin/python -S /var/mailman/cron/qrunner
> mailman 3114 25.1 9.2 61756 46716 ? R Jun19 258:03 /usr/bin/python -S /var/mailman/cron/qrunner
> mailman 16755 24.5 11.2 61592 56844 ? R 06:18 104:49 /usr/bin/python -S /var/mailman/cron/qrunner
> root 25531 0.0 0.1 1716 596 pts/0 S 13:24 0:00 grep python
> [root at tx exim]#
>
> ...read FAQ 4.19, killed the qrunner processes and removed the
> lockfiles. Still no joy:
>
> [root at tx Mailman]# w
> 4:44am up 387 days, 17:46, 1 user, load average: 1.08, 1.07, 1.26
> USER TTY FROM LOGIN@ IDLE JCPU PCPU WHAT
> mark pts/1 ca1.symonds.net 4:37am 0.00s 0.24s 0.02s w
> [root at tx Mailman]# ps auxw |grep python
> root 11804 96.9 11.3 59256 57492 ? R 03:59 43:54 python ./qrunner
>
> In the logs:
>
> [root at tx mailman]# tail qrunner
> Jun 21 06:08:03 2003 (15197) Could not acquire qrunner lock
> Jun 21 06:09:02 2003 (15209) Could not acquire qrunner lock
> Jun 21 06:10:01 2003 (15217) Could not acquire qrunner lock
> Jun 21 06:11:03 2003 (15240) Could not acquire qrunner lock
> Jun 21 06:12:02 2003 (15252) Could not acquire qrunner lock
> Jun 21 06:13:02 2003 (15265) Could not acquire qrunner lock
> Jun 21 06:14:02 2003 (15274) Could not acquire qrunner lock
> Jun 21 06:15:02 2003 (15294) Could not acquire qrunner lock
> Jun 21 06:16:02 2003 (15307) Could not acquire qrunner lock
> Jun 21 06:17:01 2003 (15319) Could not acquire qrunner lock
> [...ad infinitum...]
>
> Also during this time:
>
> * Trying to subscribe via the mailing list web page, upon
> clicking the "subscribe" button it hangs at the page
> until the browser times out.
>
> * Same thing when trying to auth via the admin web page.
>
> * This is the only broken list on that machine. The rest
> are functioning normally in all respects.
>
> * I thought perhaps someone had changed the domain preferences
> for the list? Checked it with config_list and it is correct.
> Also ran check_db and check_perms, both report all's OK.
>
> Exim 4, RedHat 7.0, Mailman 2.0.13.
>
> Ideas?
>
> Sincerely,
>
> --
> Mark Symonds
> mark at symonds.net
>
>
>