[I18n-sig] Big5 Codecs

Frank J.S. Chen frank63@ms5.hinet.net
Wed, 1 Nov 2000 11:43:14 -0000

> But the Unicode Consortium's mapping table does not round-trip Big 5
> --- so where did you get the table?

What do you mean by "round-trip"? If a Big5 code point is undefined, it
still has a corresponding Unicode code point, but nothing in the Big5-encoded
string. This Python table was post-processed by me to fit with the

> EUDC are the End-User Defined Character region, the 3rd level of Big
> 5. Several groups, including HKUST, the Hong Kong government, and the
> Taiwan military define characters in the 3rd region. Other Big 5
> extensions, such as ETen, also use this block.
> EUDC is divided into three segments: 0xFA40 -- 0xFEFE, 0x8E40 --
> 0xA0FE, and 0x8140 -- 0x8DFE.

That's a problem!
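
To see which codes would be affected, here is a rough sketch that checks
whether a two-byte code falls in one of the three EUDC segments quoted
above. The names EUDC_SEGMENTS, is_valid_trail and eudc_segment are made up
for illustration and are not part of the codec, and the trail-byte check
assumes the usual Big5 trail-byte ranges (0x40-0x7E and 0xA1-0xFE).

```python
# EUDC segments as quoted above: (first code, last code), both inclusive.
EUDC_SEGMENTS = [
    (0xFA40, 0xFEFE),
    (0x8E40, 0xA0FE),
    (0x8140, 0x8DFE),
]


def is_valid_trail(byte):
    # Assumption: the usual Big5 trail-byte ranges 0x40-0x7E and 0xA1-0xFE.
    return 0x40 <= byte <= 0x7E or 0xA1 <= byte <= 0xFE


def eudc_segment(code):
    """Return the 1-based EUDC segment number for a 16-bit code, or None."""
    if not is_valid_trail(code & 0xFF):
        return None
    for index, (first, last) in enumerate(EUDC_SEGMENTS, start=1):
        if first <= code <= last:
            return index
    return None
```

A codec could use a check like this to decide how to treat such codes (for
example, raise an error or map them to a private-use area); which behaviour
is right is a separate question.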

Chen Chien-Hsun