[Mailman-Users] qrunner refuses to pass mail to users

falko at tahoe.reservoir.com falko at tahoe.reservoir.com
Sun Aug 19 18:28:54 CEST 2007


On Sun, Aug 19, 2007 at 12:02:52AM -0700, Mark Sapiro wrote:
> falko at tahoe.reservoir.com
> >
> >I have the following setup: 
> >
> >Postfix -> Dspam -> Procmail -> Mailman -> Users
> >
> >I've tried taking Dspam out of the equation, but that did not change a thing.
> >
> >When users send to a mailing list, the message appears to be deilvered to mailman. 
> >However, mailman does not do anything afterwards: i.e. its does not send the 
> >messages to the users. 
> >
> >qrunner is running. I traced the problem to the following: 
> >
> >In logs/post I saw this:
> >
> >Aug 19 02:01:09 2007 (15427) post to list from falko at reservoir.com, size=1215, message-id=<20070819054804.GC14916 at reservoir.com>, 12 failures
> >Aug 19 02:01:09 2007 (15428) post to list from falko at reservoir.com, size=1215, message-id=<20070819054804.GC14916 at reservoir.com>, 12 failures
> >
> >The message is repeated more than 12 times.
> 
> 
> And what is in the smtp-failure log?
> 

I saw the following: 

Aug 19 12:12:03 2007 (4347) delivery to addr1 at reservoir.com failed with code -1: (111, 'Connection refused')
Aug 19 12:12:03 2007 (4347) delivery to addr2 at reservoir.com failed with code -1: (111, 'Connection refused')
Aug 19 12:12:03 2007 (4347) delivery to addr3 at reservoir.com failed with code -1: (111, 'Connection refused')

Ever since then, I see things like in logs/smtp:

Aug 19 06:01:19 2007 (15455) <20070819051624.GB13998 at tahoe.reservoir.com> smtp to list for 12 recips, completed in 0.001
 seconds
Aug 19 06:01:19 2007 (15455) <20070819051624.GB13998 at tahoe.reservoir.com> smtp to list for 12 recips, completed in 0.001
 seconds
Aug 19 06:01:19 2007 (15455) <20070819051624.GB13998 at tahoe.reservoir.com> smtp to list for 12 recips, completed in 0.001
 seconds

Noting is still going through though. 

The errors in logs/error appear to have stopped at 8:00 AM.


> 
> >In logs/error I see this: 
> >
> >Aug 19 01:46:09 2007 qrunner(5838): Traceback (most recent call last):
> >Aug 19 01:46:09 2007 qrunner(5838):   File "/usr/local/mailman/bin/qrunner", line 278, in ?
> >Aug 19 01:46:09 2007 qrunner(5838):      main()
> >Aug 19 01:46:09 2007 qrunner(5838):   File "/usr/local/mailman/bin/qrunner", line 238, in main
> >Aug 19 01:46:09 2007 qrunner(5838):      qrunner.run()
> >Aug 19 01:46:09 2007 qrunner(5838):   File "/usr/local/mailman/Mailman/Queue/Runner.py", line 71, in run
> >Aug 19 01:46:09 2007 qrunner(5838):      filecnt = self._oneloop()
> >Aug 19 01:46:09 2007 qrunner(5838):   File "/usr/local/mailman/Mailman/Queue/Runner.py", line 100, in _oneloop
> >Aug 19 01:46:09 2007 qrunner(5838):      msg, msgdata = self._switchboard.dequeue(filebase)
> >Aug 19 01:46:09 2007 qrunner(5838):   File "/usr/local/mailman/Mailman/Queue/Switchboard.py", line 150, in dequeue
> >Aug 19 01:46:09 2007 qrunner(5838):      fp = open(filename)
> >Aug 19 01:46:09 2007 qrunner(5838): IOError :  [Errno 2] No such file or directory: '/var/lib/mailman/qfiles/out/1187500584.
> >9663761+5f32a14b80df78b4db0d2455318501cdaa1d6f0f.pck'
> 
> 
> See
> <http://www.python.org/cgi-bin/faqw-mm.py?req=show&file=faq04.068.htp>
> 
> 
> <snip>
> 
> >Aug 19 01:48:05 2007 (15098) Uncaught runner exception: [Errno 13] Permission denied: '/var/lib/mailman/archives/private/sys
> >admin/index.html'
> >Aug 19 01:48:05 2007 (15098) Traceback (most recent call last):
> >  File "/usr/local/mailman/Mailman/Queue/Runner.py", line 112, in _oneloop
> >    self._onefile(msg, msgdata)
> >  File "/usr/local/mailman/Mailman/Queue/Runner.py", line 170, in _onefile
> >    keepqueued = self._dispose(mlist, msg, msgdata)
> >  File "/usr/local/mailman/Mailman/Queue/ArchRunner.py", line 73, in _dispose
> >    mlist.ArchiveMail(msg)
> >  File "/usr/local/mailman/Mailman/Archiver/Archiver.py", line 217, in ArchiveMail
> >    h.close()
> >  File "/usr/local/mailman/Mailman/Archiver/pipermail.py", line 324, in close
> >    self.write_TOC()
> >  File "/usr/local/mailman/Mailman/Archiver/HyperArch.py", line 1094, in write_TOC
> >    toc = open(os.path.join(self.basedir, 'index.html'), 'w')
> >IOError: [Errno 13] Permission denied: '/var/lib/mailman/archives/private/sysadmin/index.html'
> >Aug 19 01:48:05 2007 (15098) SHUNTING: 1187502485.4463329+7a7b88a8b7bb48e961dc70e6ecfb9012cb8d588b
> 
> 
> There is a bug in check_perms. It does not check for sufficient access
> to archives/private/ - permissions should be at least drwxrws--- and
> group mailman.
> 

I set those perms to all files (some files did not have those perms just now.

> 
> <snip>
> 
> >Aug 19 01:48:33 2007 (15173) Failed to unlink backup file: /var/lib/mailman/qfiles/out/1187500584.9663761+e5cf2faf7251118610
> >19d282cc9557ca9ba9db31.bak
> 
> 
> This too looks like multiple qrunners processing the same queue slice.
> 

This was true. I killed all of them off. And removed the -s option from the init script I was using. 

> 
> <snip>
> 
> >I have run bin/check_perms multiple times, but it tells me that there are no problems.
> 
> 
> See above.
> 
> -- 
> Mark Sapiro <msapiro at value.net>       The highway is for gamblers,
> San Francisco Bay Area, California    better use your sense - B. Dylan
> 
> ------------------------------------------------------
> Mailman-Users mailing list
> Mailman-Users at python.org
> http://mail.python.org/mailman/listinfo/mailman-users
> Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
> Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
> Unsubscribe: http://mail.python.org/mailman/options/mailman-users/falko%40reservoir.com
> 
> Security Policy: http://www.python.org/cgi-bin/faqw-mm.py?req=show&amp;file=faq01.027.htp


More information about the Mailman-Users mailing list