[Mailman-Users] qrunner refuses to pass mail to users
falko at tahoe.reservoir.com
Sun Aug 19 18:28:54 CEST 2007
On Sun, Aug 19, 2007 at 12:02:52AM -0700, Mark Sapiro wrote:
> falko at tahoe.reservoir.com
> >
> >I have the following setup:
> >
> >Postfix -> Dspam -> Procmail -> Mailman -> Users
> >
> >I've tried taking Dspam out of the equation, but that did not change a thing.
> >
> >When users send to a mailing list, the message appears to be delivered to mailman.
> >However, mailman does not do anything afterwards: i.e. it does not send the
> >messages to the users.
> >
> >qrunner is running. I traced the problem to the following:
> >
> >In logs/post I saw this:
> >
> >Aug 19 02:01:09 2007 (15427) post to list from falko at reservoir.com, size=1215, message-id=<20070819054804.GC14916 at reservoir.com>, 12 failures
> >Aug 19 02:01:09 2007 (15428) post to list from falko at reservoir.com, size=1215, message-id=<20070819054804.GC14916 at reservoir.com>, 12 failures
> >
> >The message is repeated more than 12 times.
>
>
> And what is in the smtp-failure log?
>
I saw the following:
Aug 19 12:12:03 2007 (4347) delivery to addr1 at reservoir.com failed with code -1: (111, 'Connection refused')
Aug 19 12:12:03 2007 (4347) delivery to addr2 at reservoir.com failed with code -1: (111, 'Connection refused')
Aug 19 12:12:03 2007 (4347) delivery to addr3 at reservoir.com failed with code -1: (111, 'Connection refused')
Ever since then, I see entries like this in logs/smtp:
Aug 19 06:01:19 2007 (15455) <20070819051624.GB13998 at tahoe.reservoir.com> smtp to list for 12 recips, completed in 0.001 seconds
Aug 19 06:01:19 2007 (15455) <20070819051624.GB13998 at tahoe.reservoir.com> smtp to list for 12 recips, completed in 0.001 seconds
Aug 19 06:01:19 2007 (15455) <20070819051624.GB13998 at tahoe.reservoir.com> smtp to list for 12 recips, completed in 0.001 seconds
Nothing is going through yet, though.
The errors in logs/error appear to have stopped at 8:00 AM.
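
A "Connection refused" (errno 111) for every recipient, together with "completed in 0.001 seconds", is what one would expect if nothing is listening on the SMTP host and port that Mailman delivers to. The following is a minimal sketch for checking that, not part of Mailman itself. It assumes delivery goes to localhost port 25 (the usual default; SMTPHOST and SMTPPORT in mm_cfg.py may say otherwise), and the function name smtp_port_open is made up for this illustration.

```python
import socket


def smtp_port_open(host="localhost", port=25, timeout=5.0):
    """Return True if a plain TCP connection to host:port succeeds.

    host and port are assumptions: adjust them to whatever SMTPHOST and
    SMTPPORT are set to on the machine being checked.
    """
    try:
        # create_connection raises OSError (for example ECONNREFUSED, 111)
        # when nothing accepts the connection.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    print("SMTP port reachable:", smtp_port_open())
```

If this prints False when run on the Mailman host, the refusals in smtp-failure most likely come from the MTA not listening where Mailman expects it, rather than from Mailman itself.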
>
> >In logs/error I see this:
> >
> >Aug 19 01:46:09 2007 qrunner(5838): Traceback (most recent call last):
> >Aug 19 01:46:09 2007 qrunner(5838): File "/usr/local/mailman/bin/qrunner", line 278, in ?
> >Aug 19 01:46:09 2007 qrunner(5838): main()
> >Aug 19 01:46:09 2007 qrunner(5838): File "/usr/local/mailman/bin/qrunner", line 238, in main
> >Aug 19 01:46:09 2007 qrunner(5838): qrunner.run()
> >Aug 19 01:46:09 2007 qrunner(5838): File "/usr/local/mailman/Mailman/Queue/Runner.py", line 71, in run
> >Aug 19 01:46:09 2007 qrunner(5838): filecnt = self._oneloop()
> >Aug 19 01:46:09 2007 qrunner(5838): File "/usr/local/mailman/Mailman/Queue/Runner.py", line 100, in _oneloop
> >Aug 19 01:46:09 2007 qrunner(5838): msg, msgdata = self._switchboard.dequeue(filebase)
> >Aug 19 01:46:09 2007 qrunner(5838): File "/usr/local/mailman/Mailman/Queue/Switchboard.py", line 150, in dequeue
> >Aug 19 01:46:09 2007 qrunner(5838): fp = open(filename)
> >Aug 19 01:46:09 2007 qrunner(5838): IOError : [Errno 2] No such file or directory: '/var/lib/mailman/qfiles/out/1187500584.9663761+5f32a14b80df78b4db0d2455318501cdaa1d6f0f.pck'
>
>
> See
> <http://www.python.org/cgi-bin/faqw-mm.py?req=show&file=faq04.068.htp>
>
>
> <snip>
>
> >Aug 19 01:48:05 2007 (15098) Uncaught runner exception: [Errno 13] Permission denied: '/var/lib/mailman/archives/private/sysadmin/index.html'
> >Aug 19 01:48:05 2007 (15098) Traceback (most recent call last):
> > File "/usr/local/mailman/Mailman/Queue/Runner.py", line 112, in _oneloop
> > self._onefile(msg, msgdata)
> > File "/usr/local/mailman/Mailman/Queue/Runner.py", line 170, in _onefile
> > keepqueued = self._dispose(mlist, msg, msgdata)
> > File "/usr/local/mailman/Mailman/Queue/ArchRunner.py", line 73, in _dispose
> > mlist.ArchiveMail(msg)
> > File "/usr/local/mailman/Mailman/Archiver/Archiver.py", line 217, in ArchiveMail
> > h.close()
> > File "/usr/local/mailman/Mailman/Archiver/pipermail.py", line 324, in close
> > self.write_TOC()
> > File "/usr/local/mailman/Mailman/Archiver/HyperArch.py", line 1094, in write_TOC
> > toc = open(os.path.join(self.basedir, 'index.html'), 'w')
> >IOError: [Errno 13] Permission denied: '/var/lib/mailman/archives/private/sysadmin/index.html'
> >Aug 19 01:48:05 2007 (15098) SHUNTING: 1187502485.4463329+7a7b88a8b7bb48e961dc70e6ecfb9012cb8d588b
>
>
> There is a bug in check_perms. It does not check for sufficient access
> to archives/private/ - permissions should be at least drwxrws--- and
> group mailman.
>
I have now set those permissions on all of the files (some of them did not have them until just now).
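
Since check_perms reportedly misses this case, the "at least drwxrws--- and group mailman" requirement can be checked by hand. The sketch below is only an illustration under stated assumptions: the helper names missing_bits and check_dir are hypothetical, and the group id to compare against has to be looked up separately (for example with grp.getgrnam("mailman").gr_gid, assuming the group is really called mailman on the system).

```python
import os
import stat

# drwxrws---: rwx for owner, rwx for group, plus the setgid bit.
REQUIRED = stat.S_IRWXU | stat.S_IRWXG | stat.S_ISGID


def missing_bits(mode):
    """Return the required permission bits that are absent from st_mode."""
    return REQUIRED & ~stat.S_IMODE(mode)


def check_dir(path, gid):
    """Return a list of problems found for one directory (empty if fine)."""
    st = os.stat(path)
    problems = []
    if not stat.S_ISDIR(st.st_mode):
        problems.append("not a directory")
    lacking = missing_bits(st.st_mode)
    if lacking:
        problems.append("mode %o lacks bits %o"
                        % (stat.S_IMODE(st.st_mode), lacking))
    if st.st_gid != gid:
        problems.append("group id is %d, expected %d" % (st.st_gid, gid))
    return problems

# Example use (path taken from the traceback above, group name assumed):
#   import grp
#   check_dir("/var/lib/mailman/archives/private",
#             grp.getgrnam("mailman").gr_gid)
```

Extra bits beyond drwxrws--- (such as world read) are not reported, because the requirement quoted above is a minimum.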
>
> <snip>
>
> >Aug 19 01:48:33 2007 (15173) Failed to unlink backup file: /var/lib/mailman/qfiles/out/1187500584.9663761+e5cf2faf725111861019d282cc9557ca9ba9db31.bak
>
>
> This too looks like multiple qrunners processing the same queue slice.
>
This was true. I killed all of them off and removed the -s option from the init script I was using.
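
The logs/post excerpt quoted earlier shows the same message-id logged by two different PIDs (15427 and 15428), which can hint at more than one process handling the same message. The following rough sketch looks for that pattern. It assumes the line layout seen in the excerpt above, and the function names are hypothetical. Different PIDs can also simply mean that the runner was restarted between attempts, so a hit is a clue rather than proof.

```python
import re
from collections import defaultdict

# Matches "(PID) post to ... message-id=<...>" as seen in logs/post.
POST_LINE = re.compile(r"\((\d+)\) post to .*?message-id=(<[^>]*>)")


def pids_by_message_id(lines):
    """Map each message-id to the set of PIDs that logged a post for it."""
    seen = defaultdict(set)
    for line in lines:
        match = POST_LINE.search(line)
        if match:
            seen[match.group(2)].add(int(match.group(1)))
    return seen


def handled_by_multiple(lines):
    """Return only the message-ids that were logged by more than one PID."""
    return {
        msgid: sorted(pids)
        for msgid, pids in pids_by_message_id(lines).items()
        if len(pids) > 1
    }

# Example use (log path assumed from the qfiles path in the traceback):
#   with open("/var/lib/mailman/logs/post") as fp:
#       print(handled_by_multiple(fp))
```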
>
> <snip>
>
> >I have run bin/check_perms multiple times, but it tells me that there are no problems.
>
>
> See above.
>
> --
> Mark Sapiro <msapiro at value.net> The highway is for gamblers,
> San Francisco Bay Area, California better use your sense - B. Dylan
>