> Thomas Moore:
> > >>> u=u'\u9019\u662f\u4e2d\u6587\u5b57\u4e32'
> > >>> u.split()
> > [u'\u9019\u662f\u4e2d\u6587\u5b57\u4e32']
> >
> > I think u should get split.
>
> Where do you think "這是中文字串" should be split and why?

Isn't a unicode string character by character?

-Frank
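
For reference, here is a minimal sketch of the behaviour being discussed. It assumes the documented rule that split() with no arguments splits only on runs of whitespace. The `spaced` string is a made-up variant for illustration and does not come from the thread.

```python
# -*- coding: utf-8 -*-
# The string from the thread: "這是中文字串" (six characters, no whitespace).
u = u'\u9019\u662f\u4e2d\u6587\u5b57\u4e32'

# split() with no separator breaks only on whitespace, so a string
# without any whitespace comes back as a single-element list.
whole = u.split()

# A unicode string is still a sequence of characters: iterating over it
# (here via list()) yields one character at a time.
chars = list(u)

# Hypothetical variant with spaces inserted, to show split() doing its job.
spaced = u'\u9019\u662f \u4e2d\u6587 \u5b57\u4e32'
words = spaced.split()

print(len(whole), len(chars), len(words))
```

So the string is handled character by character, but split() itself has no notion of word boundaries in Chinese text. Getting individual characters is a matter of list(u), while real word segmentation would need a separate tool.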