[ python-Bugs-1251921 ] Fail codecs.lookup() on 'mbcs' and 'tactis'
SourceForge.net
noreply at sourceforge.net
Sat Aug 6 21:04:22 CEST 2005
Bugs item #1251921, was opened at 2005-08-04 09:11
Message generated for change (Comment added) made by liturgist
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=105470&aid=1251921&group_id=5470
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Unicode
Group: Python 2.4
Status: Open
Resolution: Remind
Priority: 5
Submitted By: liturgist (liturgist)
Assigned to: M.-A. Lemburg (lemburg)
Summary: Fail codecs.lookup() on 'mbcs' and 'tactis'
Initial Comment:
$ python
Python 2.4.1 (#1, May 16 2005, 15:19:29)
[GCC 4.0.0 20050512 (Red Hat 4.0.0-5)] on linux2
Type "help", "copyright", "credits" or "license" for
more information.
>>> import codecs
>>> codecs.lookup('ascii')
(<built-in function ascii_encode>, <built-in function
ascii_decode>, <class encodings.ascii.StreamReader at
0xb7f1302c>, <class encodings.ascii.StreamWriter at
0xb7f82fbc>)
>>> codecs.lookup('mbcs')
Traceback (most recent call last):
File "<stdin>", line 1, in ?
File "/usr/lib/python2.4/encodings/__init__.py", line
96, in search_function
globals(), locals(), _import_tail)
File "/usr/lib/python2.4/encodings/mbcs.py", line 14,
in ?
class Codec(codecs.Codec):
File "/usr/lib/python2.4/encodings/mbcs.py", line 18,
in Codec
encode = codecs.mbcs_encode
AttributeError: 'module' object has no attribute
'mbcs_encode'
>>> codecs.lookup('tactis')
Traceback (most recent call last):
File "<stdin>", line 1, in ?
LookupError: unknown encoding: tactis
----------------------------------------------------------------------
>Comment By: liturgist (liturgist)
Date: 2005-08-06 14:04
Message:
Logged In: YES
user_id=197677
Where is bug 854511 regarding Thai encodings? The URL has
some discussion of this bug on TISI and ISO encodings.
http://mail.python.org/pipermail/python-bugs-list/2003-December/021394.html
----------------------------------------------------------------------
Comment By: M.-A. Lemburg (lemburg)
Date: 2005-08-06 07:51
Message:
Logged In: YES
user_id=38388
Reopened: this is indeed a bug - the tactis codec is
referenced in the aliases table but was never added to the
encodings package.
Either the codec will have to be added, or the alias entry
removed.
----------------------------------------------------------------------
Comment By: Martin v. Löwis (loewis)
Date: 2005-08-06 06:56
Message:
Logged In: YES
user_id=21627
Yes, UTF-8 is a multi-byte character set, but no, UTF-8 and
mbcs are not synonyms. See
http://docs.python.org/lib/standard-encodings.html
for the documentation. encodings.aliases.aliases is no
registration - it is an alias databse. So if somebody refers
to "dbcs", this will be aliased to "mbcs". That does not
mean "mbcs" needs to be supported - it only means "dbcs" is
not supported on Linux, either.
tactis was never part of a Python release, so its lack is
not a bug.
Closing this report as invalid.
----------------------------------------------------------------------
Comment By: liturgist (liturgist)
Date: 2005-08-04 10:00
Message:
Logged In: YES
user_id=197677
If 'mbcs' is Windows-only, then why does it appear in the
list of registered codecs in encodings.aliases.aliases on
the Linux platform? Shouldn't it not be registered on other
platforms?
----------------------------------------------------------------------
Comment By: liturgist (liturgist)
Date: 2005-08-04 09:55
Message:
Logged In: YES
user_id=197677
UTF-8 is an MBCS encoding in that is is a "multiple byte
character set", right? :-) UTF-16 is an MSCS; multiple
short character set.
There appears to have been some discussion on 'tactis' and
'tactis260' earlier in bug 854511.
http://mail.python.org/pipermail/python-bugs-list/2003-December/021394.html
----------------------------------------------------------------------
Comment By: M.-A. Lemburg (lemburg)
Date: 2005-08-04 09:38
Message:
Logged In: YES
user_id=38388
mbcs is only available on Windows.
I'm not sure what happened to the tactis codec - it's
possible that it never got checked in.
Do you have a reference for the tactis encoding ?
----------------------------------------------------------------------
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=105470&aid=1251921&group_id=5470
More information about the Python-bugs-list
mailing list