Can Python be used for UTF/Double Bytes/Asian Characters?
David LeBlanc
whisper at oz.nospamnet
Fri Jun 1 17:30:46 EDT 2001
In article <5f28b650.0105310946.1298918b at posting.google.com>,
akiyomiya at hotmail.com says...
> Does Python have a module or some sort of class that can be used for
> UTF/Double Bytes/Asian Characters?
>
> I heard that Perl has a module that you can use for UTF/Double
> Bytes/Asian Characters. So I am wondering if Python works same way or
> better.
>
> We are modifying a search (Inktomi), so people can input UTF/Double
> Bytes/Asian Characters and search for some documents.
>
> Thank you in advance.
>
> AK
>
I don't know about DBCS or Asian characters, but Unicode is a standard
feature in Python 2.0 (and maybe also Python 2.0, but i'm only sure about
2.1). I don't know if the regular expression stuff has been made unicode
aware or not. If not, then you might want to look into Tcl, which does
have a complete Unicode implementation, including regular expressions.
>From reading stuff in the comp.lang.tcl newsgroup, Tcl seems quite well
regarded in the Asian language community.
Hopefully Python will suit you though.
Dave LeBlanc
More information about the Python-list
mailing list