[Mailman-Users] Mailman causing strange slowdown in SMTP
day7pettersens at gmail.com
Fri Aug 11 07:42:29 CEST 2006
Thanks for the response. The slow down has continued and nothing I've tried
has helped. I answer the questions put to me in context below. Any further
ideas would be greatly appreciated.
> >> I have a mailman installation running on our qmail server that is
> involved in a >>slowdown in SMTP for the designated SMTPHOST in mm_cfg.py.
> I first noticed >>that the messages for the list go out very slowly...about
> 15 seconds per >>message. I saw this in the mailman smtp log, and then I
> saw it in the the qmail >>smtp log as well, so I figured it was a problem
> with qmail. As I looked through the >>qmail logs, I saw that all messages
> that connected to smtp via 127.0.0.1 were >>slow, averaging more than 15
> seconds, while those that connected from >>somewhere else were all a
> fraction of a second. I discovered that if I changed >>SMTPHOST to our
> public IP in mm_cfg.py, any messages (from the list or >>elsewhere) using it
> would slow down as well, and any messages connecting via >>127.0.0.1<http://127.0.0.1/>returned to normal speed. If I shut down mailman using mailmanctl >>stop,
> speed was normal for both localhost or the public IP, whichever had been
> >>used as mailman's SMTPHOST at the time returning to mormal speed. That
> >>verified that mailman is directly involved in the slowdown in smtp. If I
> telnet to the >>smtp and manually send a message, the hangup point is after
> entering 'data' - ie:
> >>helo test.com
> >>250 ok
> >>mail from: test.com
> >>250 ok
> >>rcpt to: <a valid email>
> >>250 ok
> <-------after entering 'data' there is a 15 second + wait if mailman is running
> >>354 ok
> >>This mailman installation serves only one list, an announcement only
> > list,
> > >>of about 35,000 members, with over 10,000 disabled due to bounces that
> > send
> > >>once every 2 weeks or so. It invariably sends
> > >>several hundred to a thousand messages individually at the
> > +/-15sec./message
> > >>rate, then a big bulk mailing to most subscribers that's taking about
> > 3000
> > >>seconds for around 23,000, then a few score more of individual
> > messages
> > >>before it stops functioning.
> > >What exactly is in Mailman's smtp and smtp-failure logs (the
> > >'individual' messages are not the post).
> I figured out that the individual messages were messages left in
> qfiles/out. When message are left like that they take around 15 seconds
> each to clear out. So this is reflected in the smtp log. There are no
> unusual failures in smtp-failure. It does appear from going through the
> logs that the problem happens when there are messages in qfiles/out. When
> the list message goes out to the large list, qfiles/out begins to accumulate
> messages. Those messages are taking the 15 seconds to go out. Once
> qfiles/out clears up, the time appears to drop to normal, usually less than
> a second.
> >>Using ps aux, one can see that all the
> > >>qrunners are there and appear normal, but bin/mailmanctl must be used
> > to
> > >>stop and start things to get them going again.
> > >When things are not going, what is the contents of the queues
> > >(subdirectories of the qfiles/ directory)?
> So there are hundreds of messages backed up in qfiles/out, again each
> taking about 15 seconds , the vast majority not varying more than a second
> from that.
> >> So my questions are...does anyone have any idea what mailman could be
> > >>doing to interfere with smtp on its SMTPHOST and how to fix it? and
> > does
> > >>anyone have any idea why mailman stops working shortly after sending
> > out the
> > >>big list?
> > >What are your various mm_cfg.py settings if any for the VERP settings
> > >and for SMTP_MAX_RCPTS and SMTP_MAX_SESSIONS_PER_CONNECTION.
> > >You might try
> > >SMTP_MAX_RCPTS = 10
> > >SMTP_MAX_SESSIONS_PER_CONNECTION = 1
> > >To see if that makes a difference.
> I tried these settings and they don't make any difference. I have no VERP
> settings in mm_cfg.py
> >If you restart both Mailman and qmail, does the first subsequent
> > >Mailman SMTP session exhibit the 15 second delay?
> If I restart everything, invariably the first mesages is under 2 seconds
> >Have you seen
> > >< http://www.python.org/cgi-bin/faqw-mm.py?req=show&file=faq04.011.htp
> > >?
> I have looked at that. I see that my concurrency for qmail is at 400.
> Mostly the points there seem to be ways to get better performance on a
> normally functioning list. I would like to try some of them but my main
> problem is this unusual slowdown. Everything was working fine a few eeks
> ago. One strange thing about this is that the logs show the time
> consistently at about 50ms above 15 seconds. Once in awhile I see one that
> is about 50ms about 14 seconds. Here for ex. are the last twelve times-
> 15.058, 15.056, 15.051, 15.060, 15.055, 15.055, 15.051, 15.051, 15.054,
> 15.054, 15.055, 15.051, 14.055. This is very representive - almost like
> there's a fifteen second timer. But every 12-15 or so there is one that's
> 14.0** . Then, to make things interesting, every once in a while (after
> perhaps several hundred or more at 15.0**) there's one or more at a higher
> time - ranging up to about 120 seconds and then back to 15 sec.
> Thanks again...would appreciate any further ideas..
> -Craig Pettersen
> Mark Sapiro <msapiro at value.net > The highway is for gamblers,
> San Francisco Bay Area, California better use your sense - B. Dylan
More information about the Mailman-Users