<?xml version="1.0" encoding="windows-1252"?>
<node id="1014974" title="Re: selecting columns from a tab-separated-values file" created="2013-01-23 13:08:09" updated="2013-01-23 13:08:09">
<type id="11">
note</type>
<author id="203787">
dhoss</author>
<data>
<field name="doctext">
&lt;p&gt;I've found great success using good ol' [metacpan://Text::CSV_XS] and reading in say, 10k lines at a time.&lt;/p&gt;
&lt;p&gt;&lt;b&gt;UPDATE: &lt;/b&gt; I can attest to the speed as I've been dealing with moving several terabytes of image data off of or around on AWS S3, which usually involves dumping an enormous number (40 million) of rows of csv data and having to run through it one way or another.&lt;/p&gt;
&lt;!-- Node text goes above. Div tags should contain sig only --&gt;
&lt;div class="pmsig"&gt;&lt;div class="pmsig-203787"&gt;
&lt;p&gt;Three thousand years of beautiful tradition, from Moses to Sandy Koufax,  &lt;b&gt;you're god damn right I'm living in the fucking past&lt;/b&gt;&lt;/p&gt;
&lt;/div&gt;&lt;/div&gt;</field>
<field name="root_node">
1014517</field>
<field name="parent_node">
1014517</field>
</data>
</node>
