[Mailman-Users] importing large (1GB) mbox file, hitting a wall here..

Andrew White, PhD awhite at pdbti.org
Thu Dec 28 14:14:58 EST 2017

   I checked for that- it looks like my problem solving was incomplete. I
   found an error message when running arch where it was sticking on a bad
   record - I kept getting "got an unexpected keyword argument 'flags' "
   (even after using cleanarch on the mbox file), and I think that was the
   actual problem, not running out of memory. I ran it last night removing
   that record, and it worked without batching as long as I didn't include
   that batch of records only about .3% of the file..

   At 09:33 AM 12/28/2017, you wrote:

     On 12/27/2017 08:08 PM, Andrew White, PhD wrote:

     >    I then ran bin/arch --wipe dbt-l_pdbti.org. When I
     >    checked the archives, only about 11,000 messages were imported. I
     saw in
     >    the arch help file there can be memory issues, and so to run things
     >    chunks. So, I did this:
     >    bin/arch ---wipe q -s 0 e 10000 dbt-l_pdbti.org
     >    bin/arch -q -s 10001 e 20000 dbt-l_pdbti.org
     >    bin/arch  q -s 20001 e 30000 dbt-l_pdbti.org
     >    bin/arch  q -s 30001 e 40000 dbt-l_pdbti.org
     >    So when I do this, each piece works, but each piece overwrites the
     >    previous- in other words, rather than each chunk adding into the
     >    only the most recent command seems to affect the archives. At the
     end of
     >    these commands, only messages 30,000 to 35,000 are showing up in
     >    archives.

     Are you sure you are not including the --wipe option on the subsequent
     commands? The behavior you describe should not occur unless --wipe is
     specified on the subsequent commands.

     Mark Sapiro <mark at msapiro.net>        The highway is for gamblers,
     San Francisco Bay Area, California    better use your sense - B. Dylan
     Mailman-Users mailing list Mailman-Users at python.org
     Mailman FAQ: [2]http://wiki.list.org/x/AgA3
     Security Policy: [3]http://wiki.list.org/x/QIA9
     Searchable Archives:

   Andrew White, PhD
   Associate Director
   DBT-Linehan Board of Certification, Certified DBT Clinician*
   Licensed Clinical Psychologist
   Portland DBT Institute
   (503) 290.3281 (phone)
   (503) 231.8153 (fax)

   Please be aware that e-mail communication can be intercepted in
   transmission or misdirected. This e-mail message and any documents
   attached to it are confidential and may contain information that is
   protected from disclosure by various federal and state laws, including the
   HIPAA privacy rule (45 C.F.R., Part 164). This information is intended to
   be used solely by the entity or individual to whom this message is
   addressed. If you are not the intended recipient, be advised that any use,
   dissemination, forwarding, printing, or copying of this message without
   the sender's written permission is strictly prohibited and may be
   unlawful. Accordingly, if you have received this message in error, please
   notify the sender immediately with a copy to hipaa(at)pdbti.org and
   destroy this message. Please do not include personal identifying
   information such as your birth date, or personal medical information in
   any emails you send to us. No one can diagnose your condition from email
   or other written communications and is not a reliable mechanism for
   emergency communication.


   Visible links
   1. https://mail.python.org/mailman/listinfo/mailman-users
   2. http://wiki.list.org/x/AgA3
   3. http://wiki.list.org/x/QIA9
   4. http://www.mail-archive.com/mailman-users@python.org/
   5. https://mail.python.org/mailman/options/mailman-users/awhite@pdbti.org

More information about the Mailman-Users mailing list