Carl Scharenberg carl.scharenberg at
Thu Sep 2 09:29:18 EDT 2004

> This seems to be of somewhat better quality than the output of the typical 
> random-text generator.  Can anyone suggest something on CPAN useful for 
> such?

You can do this by analyzing a sample text at a higher level. Instead
of generating text from the frequency of single letters, you generate
using the frequencies of 2, 3, or 4-letter sequences. You analyze a
large text so you have a database of frequencies. When generating each
new character you look at the frequences of the letters given that the
3 previous letters are 'the'. The possibilities are a space, 'r'
(their), 'y' (they), and some others. Overall it will generate words
and even phrases that seem to almost make sense. It is neat stuff.


More information about the Python-list mailing list