
I've had the same problem. ArchRunner seems to have serious performance problems if you've got heavy list activity and are archiving to both pipermail and mbox. I "fixed" it by setting ARCHIVE_TO_MBOX=1. This will only archive to mbox, so your pipermail archives won't be updated. You should be able to generate pipermail archives by hand later with bin/arch list_name. It isn't an ideal solution, but at least your message will move out of the archive queue.
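Concretely, that's a one-line override (a sketch; it goes in mm_cfg.py):

    # Archive to the mbox only; pipermail pages can be rebuilt later
    # by hand with bin/arch <listname>.
    ARCHIVE_TO_MBOX = 1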
-- Andrew Clark Campus Network Programmer Office of Information Technology University of California, Santa Barbara andrew.clark@ucsb.edu (805) 893-5311

A quick fix for the Archiver problems, until I can debug them more, is to add the following in ArchRunner.py, just under the "class ArchRunner" line:
    class ArchRunner(Runner):
        QDIR = mm_cfg.ARCHQUEUE_DIR
        SLEEPTIME = mm_cfg.minutes(10)

        def _dispose(self, mlist, msg, msgdata):
This at least makes ArchRunner wake up only once every 10 minutes.
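For context, SLEEPTIME bounds how often a qrunner rescans its queue directory. A toy sketch of the loop shape (illustrative names, not the real Runner base class):

    import time

    class ToyRunner:
        SLEEPTIME = 600              # seconds, i.e. mm_cfg.minutes(10)

        def __init__(self, queue):
            self.queue = queue       # a list standing in for qfiles/archive

        def run(self):
            while 1:
                while self.queue:    # drain everything currently queued
                    self._dispose(self.queue.pop(0))
                time.sleep(self.SLEEPTIME)

        def _dispose(self, msg):
            pass                     # archive one message here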
-Barry

Barry A. Warsaw wrote:
The problem's affecting me badly right now too. So far it seems that the holdup is in the 'date.html' index file processing; everything else is finished, but the "bin/qrunner -r Arch -o" process has lost its little mind trying to write the date index. It's in a loop, chewing up 409600 bytes of memory (brk), and then opening, reading, and closing archidxentry.html, hundreds of times in a row.
It's been doing this for about 15 minutes now (yes, for one message). The process size is 203MB, of which 196MB is resident.
Clearly, there's some horrible misuse of objects in the date-processing stuff, but I lost the pdb session before it got into the date stuff, so I don't have a lot of visibility yet. I have a number of files queued up, though, so I can try again as soon as it ends.

On Thu, Oct 31, 2002 at 07:03:50PM -0800, Dan Mick wrote:
Ok, I believe this is because of my patch for i18n in archives. Do you feel like trying this patch? It should speed up things a lot...

Index: HyperArch.py
===================================================================
RCS file: /cvsroot/mailman/mailman/Mailman/Archiver/HyperArch.py,v
retrieving revision 2.22
diff -u -r2.22 HyperArch.py
--- HyperArch.py	19 Oct 2002 20:59:27 -0000	2.22
+++ HyperArch.py	1 Nov 2002 10:13:21 -0000
@@ -923,10 +923,14 @@
             'sequence': article.sequence,
             'author':   author
         }
-        print Utils.maketext(
-            'archidxentry.html', d, raw=1,
-            lang=self.maillist.preferred_language,
-            mlist=self.maillist)
+        print """<LI><A HREF="%(filename)s">%(subject)s
+        </A><A NAME="%(sequence)i"> </A>
+        <I>%(author)s
+        </I>""" % d
+        #print Utils.maketext(
+        #    'archidxentry.html', d, raw=1,
+        #    lang=self.maillist.preferred_language,
+        #    mlist=self.maillist)

--
Adde parvum parvo magnus acervus erit.
Simone Piunno, FerraraLUG - http://members.ferrara.linux.it/pioppo
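Part of the win here is not re-reading the archidxentry.html template for every index entry. An equivalent approach (a sketch, not the patch above) would read the template once and reuse it with the same %(name)s dictionary interpolation:

    # Hypothetical: read the template file a single time, then fill it
    # in per entry.  Utils.maketext also does i18n lookup, so this is
    # only the file-read part of what it does.
    template = open('archidxentry.html').read()

    def format_entry(d):
        return template % d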

Simone Piunno wrote:
That's definitely the cause of the many opens of archidxentry.html, but it doesn't explain the 400K-per-iteration growth. Barry, wasn't there some Python instrumentation trick for using the GC to find unreferenced objects and complain, or for invoking the GC manually in a loop to try to alleviate such problems? Was it in the gc module?
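For reference, the trick being half-remembered is in the gc module; a minimal sketch (standard library only, names illustrative):

    import gc

    # Keep otherwise-unreachable objects in gc.garbage instead of
    # freeing them, so they can be inspected after a forced collection.
    gc.set_debug(gc.DEBUG_SAVEALL)

    def report():
        n = gc.collect()                  # force a full collection
        print 'unreachable objects:', n
        for obj in gc.garbage[:10]:       # peek at the first few
            print type(obj), repr(obj)[:60]
        del gc.garbage[:]                 # reset for the next round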

Dan Mick wrote:
Yes, with that patch the opens of archidxentry.html go away, and now its only system-call activity is brk() after brk() after brk() (with an occasional flush of buffers to date.html). Something in the date-processing loop is leaking memory hand over fist.

Dan Mick wrote:
Adding this:

--- /usr/local/src/mailman/Mailman/Archiver/pipermail.py	Tue Oct 15 20:22:49 2002
+++ /export/home/mailman/Mailman/Archiver/pipermail.py	Fri Nov  1 12:48:47 2002
@@ -9,6 +9,7 @@
 import cPickle as pickle
 from cStringIO import StringIO
 from string import lowercase
+import gc
 
 __version__ = '0.09 (Mailman edition)'
 VERSION = __version__
@@ -425,6 +426,7 @@
         self._update_thread_index(archive, arcdir)
 
     def _update_simple_index(self, hdr, archive, arcdir):
+        gc.set_debug(gc.DEBUG_LEAK)
         self.message("  " + hdr)
         self.type = hdr
         hdr = hdr.lower()
@@ -442,6 +444,7 @@
             else:
                 count = count + 1
                 self.write_index_entry(article)
+            sys.stderr.write("gc, number of uncollectables: %d\n" % gc.collect())
             msgid = self.database.next(archive, hdr)
         # Finish up this index
         self.write_index_footer()

prints a bunch of "gc, number of uncollectables: 0", and doesn't stop the memory growth. Note that the time between each round of brk()s is high, too, as though there's a *lot* of userland work going on. That process can take 70% of a CPU by itself while running.
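When gc.collect() keeps reporting zero uncollectables while memory still grows, the growth is in live, referenced objects. One way to see what is accumulating (a sketch, not part of the patch above) is to diff live-object counts by type across one pass of the suspect loop:

    import gc

    def type_counts():
        counts = {}
        for obj in gc.get_objects():
            name = type(obj).__name__
            counts[name] = counts.get(name, 0) + 1
        return counts

    before = type_counts()
    # ... run one pass of the index-writing loop here ...
    after = type_counts()
    for name in after.keys():
        delta = after[name] - before.get(name, 0)
        if delta > 0:
            print name, '+%d' % delta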

Sigh. This isn't making life any easier trying to debug the "Archiver eats my machine" problem:
    self._open_index_file_as_stdout(arcdir, hdr)
Any tricks for using pdb with a program that wants to steal stdout?
(It's the first invocation of _update_simple_index, the one for the Date index, that takes all the memory and time; I suspect that's just because it's the first one, and builds a wad of in-core data structures that are then reused by the Subject and Author invocations that follow. But it's pathological, whatever it's doing. gc debugging shows a *wad* of objects stuck in generation 2, i.e., not leaked, and still referenced.)
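One cheap way to watch those generations directly is gc's statistics flag (a standard gc facility; output goes to stderr on each collection):

    import gc

    # Report per-generation object counts every time the collector
    # runs; objects that pile up in generation 2 show up here even
    # though they are still referenced.
    gc.set_debug(gc.DEBUG_STATS)
    gc.collect()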

Dan found what is probably the majority of the problem. In the i18n patch, Article instances in HyperArch.py grew an _mlist attribute, which means a unique MailList instance gets pickled and unpickled with every Article in the archive. Ugh.
I'm working on a fix for that and will see about the template instantiations, but I suspect it's the former that's contributing the bulk of the overhead.
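The size blow-up is easy to reproduce in miniature; a toy illustration (stand-in classes, not Mailman's) of pickling a name reference instead of the heavy object:

    import pickle

    class Heavy:                      # stands in for a MailList
        def __init__(self, name):
            self.name = name
            self.payload = ['x'] * 100000

    class FatArticle:                 # drags the whole list into its pickle
        def __init__(self, mlist):
            self._mlist = mlist

    class ThinArticle(FatArticle):    # pickles only the list's name
        def __getstate__(self):
            d = self.__dict__.copy()
            d['_listname'] = self._mlist.name
            del d['_mlist']
            return d

    m = Heavy('mylist')
    print len(pickle.dumps(FatArticle(m))), len(pickle.dumps(ThinArticle(m)))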
Thanks Dan, and everyone else. Keep an eye on cvs...
-Barry

Here's a patch that eliminates the MailList object from the article index pickle. It also undoes the incomplete hack in MailList for not pickling the lock object (that should have been a major clue).

I've done some moderate testing of this patch, and I think it's safe, and probably fixes the majority of the problem. Please give it a shot. It doesn't optimize the loading of the template files -- that comes next. But I'll still bet you find a huge speed-up.

One outstanding issue: depending on when you upgraded to 2.1b4 and when/if you regen'd your archives from scratch, your existing article indices may have pickled MailList instances in them. These should get cleaned up the next time you archive an article, but only for the month (or other division) that the article is being added to. To clean up your entire archive, you'll need to zap the existing one and re-run bin/arch. If that's not acceptable to people, I'll write a little fixer program to fix the *-article indices.

Oh, and you'll find that not including the MailList object in your article indices can reduce their size by 10x or more. Nice little added benefit, eh?

The braver of you can give this a shot, and after a little more testing, I'll check it into cvs.

-Barry

-------------------- snip snip --------------------

Index: Mailman/MailList.py
===================================================================
RCS file: /cvsroot/mailman/mailman/Mailman/MailList.py,v
retrieving revision 2.94
diff -u -r2.94 MailList.py
--- Mailman/MailList.py	25 Oct 2002 21:10:45 -0000	2.94
+++ Mailman/MailList.py	4 Nov 2002 17:30:22 -0000
@@ -113,23 +113,6 @@
         if func:
             func(self)
 
-    # Never pickle our lock object!  We need both of these because MailLists
-    # get pickled by the archiver and this can cause ArchRunner to unlock
-    # lists at the wrong time.  The real fix may be to not pickle MailList
-    # objects in the archiver, but that's too much work to verify at the
-    # moment.
-    LOCKATTR = '_MailList__lock'
-
-    def __getstate__(self):
-        d = self.__dict__.copy()
-        del d[self.LOCKATTR]
-        return d
-
-    def __setstate__(self, d):
-        if d.has_key(self.LOCKATTR):
-            del d[self.LOCKATTR]
-        self.__dict__ = d
-
     def __getattr__(self, name):
         # Because we're using delegation, we want to be sure that attribute
         # access to a delegated member function gets passed to the

Index: Mailman/Archiver/HyperArch.py
===================================================================
RCS file: /cvsroot/mailman/mailman/Mailman/Archiver/HyperArch.py,v
retrieving revision 2.22
diff -u -r2.22 HyperArch.py
--- Mailman/Archiver/HyperArch.py	19 Oct 2002 20:59:27 -0000	2.22
+++ Mailman/Archiver/HyperArch.py	4 Nov 2002 17:30:23 -0000
@@ -33,11 +33,14 @@
 import types
 
 import HyperDatabase
 import pipermail
+import weakref
 
 from Mailman import mm_cfg
 from Mailman import Utils
 from Mailman import LockFile
+from Mailman import MailList
 from Mailman import EncWord
+from Mailman import Errors
 from Mailman import i18n
 from Mailman.Logging.Syslog import syslog
 from Mailman.Mailbox import ArchiverMailbox
@@ -234,13 +237,45 @@
         if self.charset and self.charset in mm_cfg.VERBATIM_ENCODING:
             self.quote = unicode_quote
 
+    # Mapping of listnames to MailList instances as a weak value dictionary.
+    _listcache = weakref.WeakValueDictionary()
+
+    def _open_list(self, listname):
+        # Cache the open list so that any use of the list within this process
+        # uses the same object.  We use a WeakValueDictionary so that when the
+        # list is no longer necessary, its memory is freed.
+        mlist = self._listcache.get(listname)
+        if not mlist:
+            try:
+                mlist = MailList.MailList(listname, lock=0)
+            except Errors.MMListError, e:
+                syslog('error', 'error opening list: %s\n%s', listname, e)
+                return None
+            else:
+                self._listcache[listname] = mlist
+        return mlist
+
+    def __getstate__(self):
+        d = self.__dict__.copy()
+        # We definitely don't want to pickle the MailList instance, so just
+        # pickle a reference to it.
+        del d['_mlist']
+        d['__listname'] = self._mlist.internal_name()
+        # Delete a few other things we don't want in the pickle
+        for attr in ('prev', 'next', 'body'):
+            del d[attr]
+        d['body'] = []
+        return d
+
     def __setstate__(self, d):
         # For loading older Articles via pickle.  All this stuff was added
         # when Simone Piunni and Tokio Kikuchi i18n'ified Pipermail.  See SF
         # patch #594771.
         self.__dict__ = d
-        if not d.has_key('_mlist'):
-            self._mlist = None
+        listname = d.get('__listname')
+        if listname:
+            del d['__listname']
+            d['_mlist'] = self._open_list(listname)
         if not d.has_key('_lang'):
             if self._mlist is None:
                 self._lang = mm_cfg.DEFAULT_SERVER_LANGUAGE
@@ -471,18 +506,6 @@
             if line.strip() == '<!--endarticle-->':
                 break
             self.body.append(line)
-
-    def __getstate__(self):
-        d={}
-        for each in self.__dict__.keys():
-            if each == "quote":
-                continue
-            if each in ['maillist','prev','next','body']:
-                d[each] = None
-            else:
-                d[each] = self.__dict__[each]
-        d['body']=[]
-        return d

Index: Mailman/Archiver/HyperDatabase.py
===================================================================
RCS file: /cvsroot/mailman/mailman/Mailman/Archiver/HyperDatabase.py,v
retrieving revision 2.4
diff -u -r2.4 HyperDatabase.py
--- Mailman/Archiver/HyperDatabase.py	16 Mar 2002 06:57:36 -0000	2.4
+++ Mailman/Archiver/HyperDatabase.py	4 Nov 2002 17:30:23 -0000
@@ -19,8 +19,6 @@
 #
 import os
 import marshal
-import string
-import sys
 import time
 import errno
@@ -51,11 +49,11 @@
     objects.  The object itself is stored using marshal.  It would be much
     simpler, and probably faster, to store the actual objects in the
     DumbBTree and pickle it.
-
+
     TBD: Also needs a more sensible name, like IteratableDictionary or
     SortedDictionary.
     """
-
+
     def __init__(self, path):
         self.current_index = 0
         self.path = path
@@ -65,7 +63,7 @@
         self.dict = {}
         self.sorted = []
         self.load()
-
+
     def __repr__(self):
         return "DumbBTree(%s)" % self.path
@@ -74,7 +72,7 @@
         self.sorted = self.dict.keys()
         self.sorted.sort()
         self.__dirty = 0
-
+
     def lock(self):
         self.lockfile.lock()
@@ -83,28 +81,28 @@
             self.lockfile.unlock()
         except LockFile.NotLockedError:
             pass
-
+
     def __delitem__(self, item):
         # if first hasn't been called, we can skip the sort
         if self.current_index == 0:
             del self.dict[item]
             self.__dirty = 1
             return
-        try:
-            ci = self.sorted[self.current_index]
-        except IndexError:
-            ci = None
-        if ci == item:
-            try:
-                ci = self.sorted[self.current_index + 1]
-            except IndexError:
-                ci = None
-        del self.dict[item]
+        try:
+            ci = self.sorted[self.current_index]
+        except IndexError:
+            ci = None
+        if ci == item:
+            try:
+                ci = self.sorted[self.current_index + 1]
+            except IndexError:
+                ci = None
+        del self.dict[item]
         self.__sort(dirty=1)
-        if ci is not None:
-            self.current_index = self.sorted.index(ci)
-        else:
-            self.current_index = self.current_index + 1
+        if ci is not None:
+            self.current_index = self.sorted.index(ci)
+        else:
+            self.current_index = self.current_index + 1
 
     def clear(self):
         # bulk clearing much faster than deleting each item, esp. with the
@@ -118,22 +116,22 @@
         else:
             key = self.sorted[0]
             self.current_index = 1
-            return key, self.dict[key]
+            return key, self.dict[key]
 
     def last(self):
         if not self.sorted:
             raise KeyError
         else:
             key = self.sorted[-1]
-            self.current_index = len(self.sorted) - 1
+            self.current_index = len(self.sorted) - 1
             return key, self.dict[key]
-
+
     def next(self):
         try:
             key = self.sorted[self.current_index]
         except IndexError:
             raise KeyError
-        self.current_index = self.current_index + 1
+        self.current_index = self.current_index + 1
         return key, self.dict[key]
 
     def has_key(self, key):
@@ -154,10 +152,10 @@
             self.dict[item] = val
             self.__dirty = 1
             return
-        try:
-            current_item = self.sorted[self.current_index]
-        except IndexError:
-            current_item = item
+        try:
+            current_item = self.sorted[self.current_index]
+        except IndexError:
+            current_item = item
         self.dict[item] = val
         self.__sort(dirty=1)
         self.current_index = self.sorted.index(current_item)
@@ -198,48 +196,48 @@
 #
 class HyperDatabase(pipermail.Database):
     __super_addArticle = pipermail.Database.addArticle
-
+
     def __init__(self, basedir):
         self.__cache = {}
-        self.__currentOpenArchive = None   # The currently open indices
-        self.basedir = os.path.expanduser(basedir)
+        self.__currentOpenArchive = None   # The currently open indices
+        self.basedir = os.path.expanduser(basedir)
         # Recently added articles, indexed only by message ID
         self.changed={}
 
     def firstdate(self, archive):
-        self.__openIndices(archive)
-        date = 'None'
-        try:
-            datekey, msgid = self.dateIndex.first()
-            date = time.asctime(time.localtime(string.atof(datekey[0])))
-        except KeyError:
+        self.__openIndices(archive)
+        date = 'None'
+        try:
+            datekey, msgid = self.dateIndex.first()
+            date = time.asctime(time.localtime(float(datekey[0])))
+        except KeyError:
             pass
-        return date
+        return date
 
     def lastdate(self, archive):
-        self.__openIndices(archive)
-        date = 'None'
-        try:
-            datekey, msgid = self.dateIndex.last()
-            date = time.asctime(time.localtime(string.atof(datekey[0])))
-        except KeyError:
+        self.__openIndices(archive)
+        date = 'None'
+        try:
+            datekey, msgid = self.dateIndex.last()
+            date = time.asctime(time.localtime(float(datekey[0])))
+        except KeyError:
             pass
-        return date
+        return date
 
     def numArticles(self, archive):
-        self.__openIndices(archive)
-        return len(self.dateIndex)
+        self.__openIndices(archive)
+        return len(self.dateIndex)
 
     def addArticle(self, archive, article, subject=None, author=None,
                    date=None):
-        self.__openIndices(archive)
+        self.__openIndices(archive)
         self.__super_addArticle(archive, article, subject, author, date)
 
     def __openIndices(self, archive):
-        if self.__currentOpenArchive == archive:
+        if self.__currentOpenArchive == archive:
             return
-        self.__closeIndices()
-        arcdir = os.path.join(self.basedir, 'database')
+        self.__closeIndices()
+        arcdir = os.path.join(self.basedir, 'database')
         omask = os.umask(0)
         try:
             try:
@@ -248,38 +246,38 @@
                 if e.errno <> errno.EEXIST: raise
         finally:
             os.umask(omask)
-        for i in ('date', 'author', 'subject', 'article', 'thread'):
-            t = DumbBTree(os.path.join(arcdir, archive + '-' + i))
-            setattr(self, i + 'Index', t)
-        self.__currentOpenArchive = archive
+        for i in ('date', 'author', 'subject', 'article', 'thread'):
+            t = DumbBTree(os.path.join(arcdir, archive + '-' + i))
+            setattr(self, i + 'Index', t)
+        self.__currentOpenArchive = archive
 
     def __closeIndices(self):
-        for i in ('date', 'author', 'subject', 'thread', 'article'):
-            attr = i + 'Index'
-            if hasattr(self, attr):
-                index = getattr(self, attr)
-                if i == 'article':
-                    if not hasattr(self, 'archive_length'):
+        for i in ('date', 'author', 'subject', 'thread', 'article'):
+            attr = i + 'Index'
+            if hasattr(self, attr):
+                index = getattr(self, attr)
+                if i == 'article':
+                    if not hasattr(self, 'archive_length'):
                         self.archive_length = {}
                     l = len(index)
                     self.archive_length[self.__currentOpenArchive] = l
-                index.close()
-                delattr(self, attr)
-        self.__currentOpenArchive = None
-
+                index.close()
+                delattr(self, attr)
+        self.__currentOpenArchive = None
+
     def close(self):
-        self.__closeIndices()
-
-    def hasArticle(self, archive, msgid):
-        self.__openIndices(archive)
-        return self.articleIndex.has_key(msgid)
-
+        self.__closeIndices()
+
+    def hasArticle(self, archive, msgid):
+        self.__openIndices(archive)
+        return self.articleIndex.has_key(msgid)
+
     def setThreadKey(self, archive, key, msgid):
-        self.__openIndices(archive)
-        self.threadIndex[key]=msgid
-
+        self.__openIndices(archive)
+        self.threadIndex[key]=msgid
+
     def getArticle(self, archive, msgid):
-        self.__openIndices(archive)
+        self.__openIndices(archive)
         if not self.__cache.has_key(msgid):
             # get the pickled object out of the DumbBTree
            buf = self.articleIndex[msgid]
@@ -288,50 +286,50 @@
             article = self.__cache[msgid]
         return article
 
-    def first(self, archive, index):
-        self.__openIndices(archive)
-        index = getattr(self, index + 'Index')
-        try:
-            key, msgid = index.first()
-            return msgid
-        except KeyError:
+    def first(self, archive, index):
+        self.__openIndices(archive)
+        index = getattr(self, index + 'Index')
+        try:
+            key, msgid = index.first()
+            return msgid
+        except KeyError:
             return None
-
-    def next(self, archive, index):
-        self.__openIndices(archive)
-        index = getattr(self, index + 'Index')
-        try:
-            key, msgid = index.next()
-            return msgid
-        except KeyError:
+
+    def next(self, archive, index):
+        self.__openIndices(archive)
+        index = getattr(self, index + 'Index')
+        try:
+            key, msgid = index.next()
+            return msgid
+        except KeyError:
             return None
-
+
     def getOldestArticle(self, archive, subject):
-        self.__openIndices(archive)
-        subject=string.lower(subject)
-        try:
-            key, tempid=self.subjectIndex.set_location(subject)
-            self.subjectIndex.next()
-            [subject2, date]= string.split(key, '\0')
-            if subject!=subject2: return None
-            return tempid
-        except KeyError:
-            return None
+        self.__openIndices(archive)
+        subject = subject.lower()
+        try:
+            key, tempid=self.subjectIndex.set_location(subject)
+            self.subjectIndex.next()
+            [subject2, date]= key.split('\0')
+            if subject!=subject2: return None
+            return tempid
+        except KeyError:
+            return None
 
     def newArchive(self, archive):
         pass
-
+
     def clearIndex(self, archive, index):
-        self.__openIndices(archive)
+        self.__openIndices(archive)
         if hasattr(self.threadIndex, 'clear'):
             self.threadIndex.clear()
             return
-        finished=0
-        try:
-            key, msgid=self.threadIndex.first()
-        except KeyError: finished=1
-        while not finished:
-            del self.threadIndex[key]
-            try:
-                key, msgid=self.threadIndex.next()
-            except KeyError: finished=1
+        finished=0
+        try:
+            key, msgid=self.threadIndex.first()
+        except KeyError: finished=1
+        while not finished:
+            del self.threadIndex[key]
+            try:
+                key, msgid=self.threadIndex.next()
+            except KeyError: finished=1
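The weak-value cache in _open_list above is the standard idiom: an entry disappears as soon as the last strong reference to the cached object goes away. In miniature (illustrative class; CPython's immediate refcount-driven cleanup assumed):

    import weakref

    class Thing(object):
        pass

    cache = weakref.WeakValueDictionary()
    t = Thing()
    cache['a'] = t
    print cache.get('a')    # the Thing instance, while t keeps it alive
    del t
    print cache.get('a')    # None -- the entry vanished with the object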

"DM" == Dan Mick <dan.mick@sun.com> writes:
DM> Sigh. This isn't making life any easier trying to debug the
DM> "Archiver eats my machine" problem:
DM> self._open_index_file_as_stdout(arcdir, hdr)
DM> Any tricks for using pdb with a program that wants to steal
DM> stdout?
Rewrite the code so it doesn't steal stdout? ;)
Really, this code should be rewritten to use more modern Pythonic style, such as
    def _open_index_file(self, archdir, index_name):
        # ...
        # remove the sys.stdout hackery
and then any place that does "print something_useful" should get rewritten as "print >> self.__f, something_useful".
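A sketch of the shape being suggested (hypothetical class and attribute names, not existing Mailman code; the chevron form of print is Python 2):

    import os

    class IndexWriter:
        def _open_index_file(self, archdir, index_name):
            # keep the file on the instance instead of hijacking sys.stdout
            self.__f = open(os.path.join(archdir, index_name), 'w')

        def write_index_entry(self, text):
            # writes to our own file object, leaving sys.stdout alone
            # so pdb keeps working
            print >> self.__f, text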
I don't have time for that, but I'd accept a patch.
-Barry

From: "Dan Mick" <dan.mick@sun.com>
As I said before, I also thought it was a loop or something like that.
So I did not add this quick fix :) I have set the option to archive to mbox only, and at night I recreate the archives now. I hope there will be a working fix soon...
Danny Terweij
