Fwd: How to Split Chinese Character with backslash representation?

limodou limodou at gmail.com
Thu Oct 26 23:51:53 EDT 2006


---------- Forwarded message ----------
From: limodou <limodou at gmail.com>
Date: Oct 27, 2006 11:51 AM
Subject: Re: How to Split Chinese Character with backslash representation?
To: Wijaya Edward <ewijaya at i2r.a-star.edu.sg>


On 10/27/06, Wijaya Edward <ewijaya at i2r.a-star.edu.sg> wrote:
>
> Hi all,
>
> I was trying to split a string that
> represent chinese characters below:
>
>
> >>> str = '\xc5\xeb\xc7\xd5\xbc'
> >>> print str2,
> ???
> >>> fields2 = split(r'\\',str)
> >>> print fields2,
> ['\xc5\xeb\xc7\xd5\xbc']
>
> But why the split function here doesn't seem
> to do the job for obtaining the desired result:
>
> ['\xc5','\xeb','\xc7','\xd5','\xbc']
>
\xXX just internal representation of None ASCII, I guess above string
is encoded with local locale, maybe gbk. You can get the bytes list
through:

str = '\xc5\xeb\xc7\xd5\xbc'
list(str)

And string is just a list of characters.

--
I like python!
UliPad <<The Python Editor>>: http://wiki.woodpecker.org.cn/moin/UliPad
My Blog: http://www.donews.net/limodou


-- 
I like python!
UliPad <<The Python Editor>>: http://wiki.woodpecker.org.cn/moin/UliPad
My Blog: http://www.donews.net/limodou



More information about the Python-list mailing list