<?xml version="1.0" encoding="windows-1252"?>
<node id="886995" title="Re^3: statistics of a large text" created="2011-02-08 10:32:30" updated="2011-02-08 10:32:30">
<type id="11">
note</type>
<author id="26179">
tilly</author>
<data>
<field name="doctext">
No, there isn't.  Sorting does the actual work, and therefore takes the bulk of the time.  If it is too slow, then it is time for you to look into distributing (and parallelizing) the work with Hadoop.</field>
<field name="root_node">
884345</field>
<field name="parent_node">
886922</field>
</data>
</node>
