<?xml version="1.0" encoding="windows-1252"?>
<node id="995447" title="Re: Best way to Download and Process a XML file" created="2012-09-24 18:30:43" updated="2012-09-24 18:30:43">
<type id="11">
note</type>
<author id="757127">
tobyink</author>
<data>
<field name="doctext">
&lt;p&gt;150 GB? Ouch.&lt;/p&gt;
&lt;p&gt;[mod://AnyEvent::HTTP] should allow you to issue an HTTP request, and process it a chunk at a time, while it arrives, without having to save it anywhere.&lt;/p&gt;
&lt;p&gt;And [mod://XML::Twig] can parse XML chunk by chunk.&lt;/p&gt;
&lt;p&gt;Pairing the two you ought to be able to do this without temporary files. Exactly how to do it, I can't help you. I have limited experience with AnyEvent::HTTP; and virtually none with XML::Twig.&lt;/p&gt;

&lt;!-- Node text goes above. Div tags should contain sig only --&gt;
&lt;div class="pmsig"&gt;&lt;div class="pmsig-757127"&gt;
&lt;small&gt;&lt;small&gt;
&lt;tt&gt;perl -E'sub Monkey::do{say$_,for@_,do{($monkey=&amp;#x5B;caller(0)]-&gt;&amp;#x5B;3])=~s{::}{ }and$monkey}}"Monkey say"-&gt;Monkey::do'
&lt;/tt&gt;&lt;/small&gt;&lt;/small&gt;
&lt;/div&gt;&lt;/div&gt;</field>
<field name="root_node">
995446</field>
<field name="parent_node">
995446</field>
</data>
</node>
