It's some arbitrary text composed of 95% ASCII characters and 5% non-ASCII. On
this specific example, utf8 decodes at around 250 MB/s, latin1 at almost 1 GB/s
(on the same machine on which I ran the benchmarks).
--
Daniel Stutzbach, Ph.D.
President, Stutzbach Enterprises, LLC