There is a new Unicode implementation snapshot available at the secret URL. It contains quite a few small changes to the internal APIs, doc strings for all methods and some new methods (e.g. .title()) on the Unicode and the string objects. The code page mappings are now integer->integer, which should make them faster.
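To illustrate what an integer->integer code page mapping looks like, here is a rough sketch in present-day Python. It is not the code from the snapshot: the table name DECODING_MAP and both helper functions are made up for this example, and the table holds only three entries (they match the Windows-1252 assignments for those bytes).

```python
# Hypothetical excerpt of a code page table: byte value -> Unicode code point.
# Keys and values are plain integers, so a lookup needs no string handling.
DECODING_MAP = {
    0x41: 0x0041,  # LATIN CAPITAL LETTER A
    0x80: 0x20AC,  # EURO SIGN
    0xE9: 0x00E9,  # LATIN SMALL LETTER E WITH ACUTE
}


def charmap_decode(data, mapping):
    """Decode bytes by looking up each byte value in an int->int table."""
    return "".join(chr(mapping[byte]) for byte in data)


def charmap_encode(text, mapping):
    """Encode text by inverting the int->int table (code point -> byte)."""
    inverse = {code_point: byte for byte, code_point in mapping.items()}
    return bytes(inverse[ord(ch)] for ch in text)


decoded = charmap_decode(b"A\x80\xe9", DECODING_MAP)
round_trip = charmap_encode(decoded, DECODING_MAP)
```

A byte or character missing from the table raises KeyError here; a real codec would route that through its error handling instead.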
Some of the C codec APIs have changed, so you may need to adapt code that already uses these (Fredrik ?!).
Still missing is an MSVC project file... I haven't gotten around to building one yet. The code does compile on WinXX though, as Finn Bock told me in private mail.
Please try out the new stuff... Most interesting should be the code in Lib/codecs.py as it provides a very high level interface to all those builtin codecs.
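For a feel of that high-level interface, here is a small sketch using the codecs module as it exists in current Python (codecs.getreader() and codecs.getwriter()). The names in the snapshot may differ from these, so take it as an illustration of the idea and not as documentation of the snapshot.

```python
import codecs
import io

# A byte stream holding UTF-8 data; io.BytesIO stands in for a real file.
raw = io.BytesIO("Gr\u00fc\u00dfe\n".encode("utf-8"))

# Wrap the byte stream in a reader that decodes on the fly.
reader = codecs.getreader("utf-8")(raw)
text = reader.read()

# Wrap an output stream in a writer that encodes on the fly.
out = io.BytesIO()
writer = codecs.getwriter("latin-1")(out)
writer.write(text)
encoded = out.getvalue()
```

The calling code only ever sees text on one side and bytes on the other; the codec is picked by name and everything else stays the same.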
BTW: I would like to implement a .readline() method using only the .read() method as a basis. Does anyone have a good idea how this could be done without buffering? (Unicode has a slightly larger choice of line break characters than C; the .splitlines() method will deal with these.)
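One possible approach, as a rough sketch in present-day Python and not a proposal for the actual implementation: read one character at a time and let .splitlines() decide whether that character is a line break. The class name UnbufferedLineReader is made up, and the sketch assumes that .read(1) returns exactly one decoded character. Since nothing can be pushed back, a "\r\n" pair is handled by remembering the "\r" and dropping the "\n" on the next call, so such a line comes back ending in "\r" only.

```python
import io


class UnbufferedLineReader:
    """Hypothetical wrapper: readline() built on read(1), no read-ahead."""

    def __init__(self, stream):
        self.stream = stream
        self._skip_lf = False  # True if the previous line ended in "\r"

    def readline(self):
        chars = []
        while True:
            c = self.stream.read(1)
            if not c:
                break  # end of stream
            if self._skip_lf:
                self._skip_lf = False
                if c == "\n":
                    continue  # second half of a "\r\n" pair, drop it
            chars.append(c)
            # A lone line break character splits into one empty line.
            if c.splitlines() == [""]:
                self._skip_lf = (c == "\r")
                break
        return "".join(chars)


reader = UnbufferedLineReader(io.StringIO("a\r\nb\u2028c"))
lines = [reader.readline() for _ in range(4)]
```

The price is one .read() call per character and a small amount of state between calls, which may or may not count as "without buffering".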