Character encoding for mass subscribtion?

Hello.
I want to mass-subscribe some mail addresses in mailman. I have a list of (spanish) names and the mail addresses. The names usually contain non-ascii characters like a with akute (á), e with acute (é), n with tilde (ñ) and so on. Mailman doesn't import these names correctly. Why not???
I've tried it with entering the list in the web browser. Not working correctly. I've tried to upload a UTF8-encoded text file. Not working correctly. I've tried some other encodings. Not working.
However, I've tested it with a file containing only 1 mail address and "áéíóóúñÑ" as name. Finally the upload was working when I used 8859-15. Than I have converted my big address list to 8859-15, and now it didn't work anymore. Yes, I can reproduce that. Does mailman try to auto-recognize the character encoding?
How can I get this working??? I need some help.
Cheers
Tim

On 06/25/2013 05:46 AM, e.c. wrote:
The def _(s) was copied from Defaults.py and is a device for allowing the strings representing the language names to be seen as needing i18n, but not translating them too soon.
Putting something like
def _(s): return s add_language('sv', _('Swedish'), 'utf-8') del _
in mm_cfg.py works, but is way to complicated. All that is needed is the single line
add_language('sv', 'Swedish', 'utf-8')
That setting is historical and currently has no effect.
-- Mark Sapiro <mark@msapiro.net> The highway is for gamblers, San Francisco Bay Area, California better use your sense - B. Dylan

On 06/24/2013 04:34 PM, Urwald Urwald wrote:
However, I've tested it with a file containing only 1 mail address and "áéíóóúñÑ" as name. Finally the upload was working when I used 8859-15. Than I have converted my big address list to 8859-15, and now it didn't work anymore. Yes, I can reproduce that. Does mailman try to auto-recognize the character encoding?
Mailman assumes the mass subscribe list is encoded in the character set of the list's preferred language. What this is depends on the language and is defined by the add_language directives at the end of Defaults.py.
For example, a few entries are:
add_language('en', _('English (USA)'), 'us-ascii', 'ltr') add_language('es', _('Spanish (Spain)'), 'iso-8859-1', 'ltr') add_language('et', _('Estonian'), 'iso-8859-15', 'ltr') add_language('eu', _('Euskara'), 'iso-8859-15', 'ltr') add_language('gl', _('Galician'), 'utf-8', 'ltr')
It is fairly simple to change the default character set for English to utf-8 because us-ascii is a proper subset of utf-8, but that's the only such case. See the FAQ at <http://wiki.list.org/x/6oDz>.
Changing the character set for other languages involves a procedure similar to that outlined for Swedish in the link in e.c.'s reply.
-- Mark Sapiro <mark@msapiro.net> The highway is for gamblers, San Francisco Bay Area, California better use your sense - B. Dylan

Hello.
Thanks. This was the information I needed.
(In my case, thought the names are spanish names, the default list language is german, so the encoding is 8859-1.)
(I was just a little bit confused after seeing the confirmation of the mass subscribtion showing up with a wrong encoding. But proceeding to the membership management, there all the names are fine.)
Thanks a lot.
Tim
----- Ursprüngliche Message ----- Von: Mark Sapiro <mark@msapiro.net> An: mailman-users@python.org CC: Gesendet: 19:28 Dienstag, 25.Juni 2013 Betreff: Re: [Mailman-Users] Character encoding for mass subscribtion?
On 06/24/2013 04:34 PM, Urwald Urwald wrote:
However, I've tested it with a file containing only 1 mail address and "áéíóóúñÑ" as name. Finally the upload was working when I used 8859-15. Than I have converted my big address list to 8859-15, and now it didn't work anymore. Yes, I can reproduce that. Does mailman try to auto-recognize the character encoding?
Mailman assumes the mass subscribe list is encoded in the character set of the list's preferred language. What this is depends on the language and is defined by the add_language directives at the end of Defaults.py.
For example, a few entries are:
add_language('en', _('English (USA)'), 'us-ascii', 'ltr') add_language('es', _('Spanish (Spain)'), 'iso-8859-1', 'ltr') add_language('et', _('Estonian'), 'iso-8859-15', 'ltr') add_language('eu', _('Euskara'), 'iso-8859-15', 'ltr') add_language('gl', _('Galician'), 'utf-8', 'ltr')
It is fairly simple to change the default character set for English to utf-8 because us-ascii is a proper subset of utf-8, but that's the only such case. See the FAQ at <http://wiki.list.org/x/6oDz>.
Changing the character set for other languages involves a procedure similar to that outlined for Swedish in the link in e.c.'s reply.
-- Mark Sapiro <mark@msapiro.net> The highway is for gamblers, San Francisco Bay Area, California better use your sense - B. Dylan
Mailman-Users mailing list Mailman-Users@python.org http://mail.python.org/mailman/listinfo/mailman-users Mailman FAQ: http://wiki.list.org/x/AgA3 Security Policy: http://wiki.list.org/x/QIA9 Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/ Unsubscribe: http://mail.python.org/mailman/options/mailman-users/urwald%40rocketmail.com

On 06/25/2013 05:46 AM, e.c. wrote:
The def _(s) was copied from Defaults.py and is a device for allowing the strings representing the language names to be seen as needing i18n, but not translating them too soon.
Putting something like
def _(s): return s add_language('sv', _('Swedish'), 'utf-8') del _
in mm_cfg.py works, but is way to complicated. All that is needed is the single line
add_language('sv', 'Swedish', 'utf-8')
That setting is historical and currently has no effect.
-- Mark Sapiro <mark@msapiro.net> The highway is for gamblers, San Francisco Bay Area, California better use your sense - B. Dylan

On 06/24/2013 04:34 PM, Urwald Urwald wrote:
However, I've tested it with a file containing only 1 mail address and "áéíóóúñÑ" as name. Finally the upload was working when I used 8859-15. Than I have converted my big address list to 8859-15, and now it didn't work anymore. Yes, I can reproduce that. Does mailman try to auto-recognize the character encoding?
Mailman assumes the mass subscribe list is encoded in the character set of the list's preferred language. What this is depends on the language and is defined by the add_language directives at the end of Defaults.py.
For example, a few entries are:
add_language('en', _('English (USA)'), 'us-ascii', 'ltr') add_language('es', _('Spanish (Spain)'), 'iso-8859-1', 'ltr') add_language('et', _('Estonian'), 'iso-8859-15', 'ltr') add_language('eu', _('Euskara'), 'iso-8859-15', 'ltr') add_language('gl', _('Galician'), 'utf-8', 'ltr')
It is fairly simple to change the default character set for English to utf-8 because us-ascii is a proper subset of utf-8, but that's the only such case. See the FAQ at <http://wiki.list.org/x/6oDz>.
Changing the character set for other languages involves a procedure similar to that outlined for Swedish in the link in e.c.'s reply.
-- Mark Sapiro <mark@msapiro.net> The highway is for gamblers, San Francisco Bay Area, California better use your sense - B. Dylan

Hello.
Thanks. This was the information I needed.
(In my case, thought the names are spanish names, the default list language is german, so the encoding is 8859-1.)
(I was just a little bit confused after seeing the confirmation of the mass subscribtion showing up with a wrong encoding. But proceeding to the membership management, there all the names are fine.)
Thanks a lot.
Tim
----- Ursprüngliche Message ----- Von: Mark Sapiro <mark@msapiro.net> An: mailman-users@python.org CC: Gesendet: 19:28 Dienstag, 25.Juni 2013 Betreff: Re: [Mailman-Users] Character encoding for mass subscribtion?
On 06/24/2013 04:34 PM, Urwald Urwald wrote:
However, I've tested it with a file containing only 1 mail address and "áéíóóúñÑ" as name. Finally the upload was working when I used 8859-15. Than I have converted my big address list to 8859-15, and now it didn't work anymore. Yes, I can reproduce that. Does mailman try to auto-recognize the character encoding?
Mailman assumes the mass subscribe list is encoded in the character set of the list's preferred language. What this is depends on the language and is defined by the add_language directives at the end of Defaults.py.
For example, a few entries are:
add_language('en', _('English (USA)'), 'us-ascii', 'ltr') add_language('es', _('Spanish (Spain)'), 'iso-8859-1', 'ltr') add_language('et', _('Estonian'), 'iso-8859-15', 'ltr') add_language('eu', _('Euskara'), 'iso-8859-15', 'ltr') add_language('gl', _('Galician'), 'utf-8', 'ltr')
It is fairly simple to change the default character set for English to utf-8 because us-ascii is a proper subset of utf-8, but that's the only such case. See the FAQ at <http://wiki.list.org/x/6oDz>.
Changing the character set for other languages involves a procedure similar to that outlined for Swedish in the link in e.c.'s reply.
-- Mark Sapiro <mark@msapiro.net> The highway is for gamblers, San Francisco Bay Area, California better use your sense - B. Dylan
Mailman-Users mailing list Mailman-Users@python.org http://mail.python.org/mailman/listinfo/mailman-users Mailman FAQ: http://wiki.list.org/x/AgA3 Security Policy: http://wiki.list.org/x/QIA9 Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/ Unsubscribe: http://mail.python.org/mailman/options/mailman-users/urwald%40rocketmail.com
participants (3)
-
e.c.
-
Mark Sapiro
-
Urwald Urwald