Python plain text converter?

Gilles Lenfant glenfant at NOSPAM.bigfoot.com
Fri Feb 28 06:29:42 EST 2003


"dromedary" <camel at oasis.com> wrote in message news:
280220030041274458%camel at oasis.com...
> Does anyone know if Python can strip the nasty formatting codes from
> documents in, say, Word or QPS? I'd like to write a small batch
> processor that will grab a bunch of files and convert them to text only
> format for manipulation using regular expressions. Unfortunately I
> can't figure out how to get rid of the nasty codes (e.g., ¿). Any
> assistance appreciated.
>
> Jon

Rather than parsing the Word binary file yourself, drive Word itself as a
COM object using Mark Hammond's win32 extensions.

Start your search at http://starship.python.net/

There is a demo using Word.
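
A minimal sketch of that approach, in case it helps. It assumes Windows with
Word and the win32 extensions installed. The function names and the "*.doc"
pattern are my own, for illustration only; they are not taken from the demo.
The value 2 is Word's wdFormatText constant.

```python
import glob
import os

WD_FORMAT_TEXT = 2  # value of Word's wdFormatText constant (plain text)


def text_output_path(doc_path):
    """Return doc_path with its extension replaced by .txt (hypothetical helper)."""
    base, _ext = os.path.splitext(doc_path)
    return base + ".txt"


def convert_word_to_text(doc_path, word):
    """Open doc_path in the given Word.Application object and save it as text."""
    # Word resolves relative paths against its own working directory,
    # so absolute paths are passed in both directions.
    doc = word.Documents.Open(os.path.abspath(doc_path))
    try:
        # Positional arguments: FileName, then FileFormat.
        doc.SaveAs(os.path.abspath(text_output_path(doc_path)), WD_FORMAT_TEXT)
    finally:
        doc.Close(False)  # False = do not save changes to the original


def convert_folder(folder):
    """Convert every .doc file in folder. Needs Windows, Word and pywin32."""
    import win32com.client  # imported here so the helpers above load anywhere

    word = win32com.client.Dispatch("Word.Application")
    word.Visible = False
    try:
        for doc_path in glob.glob(os.path.join(folder, "*.doc")):
            convert_word_to_text(doc_path, word)
    finally:
        word.Quit()
```

Calling convert_folder on a directory should leave a .txt file next to each
.doc, ready for your regular expressions. I don't know whether QPS exposes a
COM interface, so this only covers the Word case.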

--Gilles
