<?xml version="1.0" encoding="windows-1252"?>
<node id="995533" title="Re: Best way to Download and Process a XML file" created="2012-09-25 08:05:09" updated="2012-09-25 08:05:09">
<type id="11">
note</type>
<author id="694914">
dHarry</author>
<data>
<field name="doctext">
&lt;p&gt;Sanity check: 150GB XML file??? Maybe it's time to rethink the problem?!&lt;/p&gt;
&lt;p&gt;Assuming enough disk space and patience option 1 will work.&lt;/p&gt;
&lt;p&gt;Option 2 also has its drawbacks, e.g. "finally save it" sounds to me like keeping the file in memory... Or do you want to edit the file "in place"? Anyway, with XML files this big you probably don't want a pure Perl implementation. [XML::LibXML] jumps to mind. I have happy experience parsing big XML files (10s of GB) &lt;a href="http://xerces.apache.org/"&gt;Xerces&lt;/a&gt;. &lt;/p&gt;

&lt;p&gt;Cheers&lt;/p&gt;
&lt;p&gt;Harry&lt;/p&gt;</field>
<field name="root_node">
995446</field>
<field name="parent_node">
995446</field>
</data>
</node>
