<?xml version="1.0" encoding="windows-1252"?>
<node id="1009794" title="Re^2: How to generate random sequence of UTF-8 characters" created="2012-12-20 15:33:36" updated="2012-12-20 15:33:36">
<type id="11">
note</type>
<author id="925765">
ted.byers</author>
<data>
<field name="doctext">
&lt;p&gt;Thanks.  I will give it a try.&lt;/p&gt;&lt;p&gt;There is a point of misunderstanding, though, and that is I am aiming for a sample of random sequences.  Each sequence would be five to ten characters, but the sample would be comprised of a few million such sequences.  Thus, if my sample size is ten million strings, and each string is ten characters, and there are a million valid utf-8 characters, the each character would be in the sample an average of 100 times.  It is a statistical approach; each item in the sample has just a tiny portion of all possible values, but the whole sample includes all possible values multiple times.  I tend to be a bit thorough when testing code I am not familiar with (my code for computing eigensystems of general matrices was testing on 100 million randomly generated matrices - with not one failure BTW).&lt;/p&gt;&lt;p&gt;Thanks again.&lt;/p&gt;&lt;p&gt;Ted&lt;/p&gt;</field>
<field name="root_node">
1009778</field>
<field name="parent_node">
1009784</field>
</data>
</node>
