listmembers being "(ignored)", lockfile problems, 2.0b5
Greetings.
This is a edited version of a post I sent to the developers this morning. No response, so I can only assume no one cares to respond.
I searched through my archives. I couldn't search through the list archives because the search engine on python.org appears to be broken(and yes, I followed the instructions, choosing only SIG archives.)
I found a bunch of posts related or similar to my problems, but not much in the way of resolutions or anthing that seemed to help me.
The setup we're running is Solaris, python 1.5.2, which I compiled myself. The list was working great, until people started loosing mail delivery. I thought the auto-bounce handler was removing people, but they're still listed as subscribed, and I never received a message from the bounce handler saying they would be removed.
Now, what I'm seeing in the logs:
in logs/smtp-failure:
smtp-failure:Aug 30 11:16:15 2000 (122) -1 <email addr> (ignore) (many, many times)
This user is in fact not getting email from either of the two lists he belongs to. The number in ()'s changes, though not with every entry in the logs.
and, in logs/post, I see constant entries about: Aug 30 04:53:08 2000 (23571) post to <listname> from <listname>-request@<hostname>, size=28501, 177 failures
(always the same number of failures and size; seems to be some sort of unrelated mail loop of some kind.)
and there are posts that go through:
Aug 30 04:44:42 2000 (23411) post to <listname> from
Qrunner seems to be working:
Aug 30 02:55:04 2000 (22304) qrunner begining Aug 30 02:55:09 2000 (22304) qrunner ended
(this is repeated every time qrunner is launched, which is wasteful in terms of space since it runs once a minute; I've got 56k of text for less than 12 hours of running)
Now, this mystery "177 failures" message shows up again: Aug 30 06:50:10 2000 (25068) All recipients refused: please run connect() first Aug 30 06:50:10 2000 (25068) smtp for 177 recips, completed in 2.620 seconds
In logs/bounce, I'm not sure I understand the meanings:
Aug 29 23:13:31 2000 (18077) <listname>: <someuser> - 27 more allowed over 426977 secs
then there will be:
Aug 29 23:13:33 2000 (18077) <listname>: <differentuser> - 0 more allowed over 395006 secs
The non-delivery problem affects digest and non-digest users. Archives are working fine. One list is 1,500 people; 500 normal, 1000 digest users; the other is about 100 people, mostly non-digest. We're using sendmail, and I have the threaded delivery turned on due to the large number of slow/bad mail servers(and experience has shown that splitting them up into chunks results in a dramatic improvement in delivery times.)
Some mail is getting by, and some users report that after re-subscribing, their problems have gone away, although I made some changes(like clearing out qfiles/ and also removing stale lockfiles) around the same time, so it's hard to tell.) However, several users are still not getting any mail no matter what I do(they do get the list welcome message when I re-subscribe them, however.)
Thanks all for any help/tips. I wouldn't mind applying some patches that have been introduced since 2.0b5 that will fix this stuff, if they're available.
Brett
At 04:44 PM 8/30/00 -0400, Brett Dikeman wrote:
Greetings.
This is a edited version of a post I sent to the developers this morning. No response, so I can only assume no one cares to respond.
Having seen your message three times now, I can safely say that it's not that we don't care to respond, it's more like no one has an immediate clue as to what your problem is, and is hoping someone else who does will respond.
At 7:26 PM -0400 8/30/00, Ron Jarrell wrote:
I can safely say that it's not that we don't care to respond, it's more like no one has an immediate clue as to what your problem is
Yah. it wasn't something I'd ever seen, and I've been in the middle of a project trying to beat 8.5 years of accumulated mail list archives (31,000 files of email, about half a gig of data) into usable shape. god, we've been running lists for over 8 years: the first list was simply an alias file exploder to 30 people -- and now, Mailman is the third MLM (listproc 6.0c, majordomo, mailman) the oldest lists have been on, and we're now on our, um, fifth or possibly 6th CPU. It really didn't lend itself to me dropping stuff to snoop...
And, to be honest, the message came across as a bit too much "I'm telling you to do this", which doesn't sit well with me when I'm helping out as a volunteer (although it's not nearly as bad as the words "I'm not on this list, so...." -- whenever I see that, I simply blow off the message, even if I KNOW the answer, because that person is treating us as hired help, and isn't even pretending that they're willing to pay-forward back to the list for the help they're asking for... communities are built of people who give as well as take, and I gave up on the vampires years ago... But I digress...).
but now that it's been cleared up, and that Wad O' Archives is under control, I'll see if I can sniff something... But everyone has to remember we're ALL here to help each other -- but we all ahve lives, jobs, and deadlines, too. And nobody HAS to do anything for anyone here. So it really, really helps if you ask nicely, and be patient -- and I realize that it's not fun to have a live, job, deadline, unhappy users and broken software to boot, but it doesn't help to honk off the folks you're asking help from, does it? (grin)
Now, I know I have the original email around here somewhere. I think....
-- Chuq Von Rospach - Plaidworks Consulting (mailto:chuqui@plaidworks.com) Apple Mail List Gnome (mailto:chuq@apple.com)
And they sit at the bar and put bread in my jar and say 'Man, what are you doing here?'"
participants (3)
-
Brett Dikeman
-
Chuq Von Rospach
-
Ron Jarrell