
Re^2: This looks like whitespace in my CSV but doesn't seem to be

by Tux (Canon)
on Sep 30, 2012 at 18:00 UTC


in reply to Re: This looks like whitespace in my CSV but doesn't seem to be
in thread This looks like whitespace in my CSV but doesn't seem to be

Text::CSV_XS has a built-in way to strip leading and trailing whitespace:

my $csv = Text::CSV_XS->new ({ binary => 1, allow_whitespace => 1, auto_diag => 1 });

But that would not help in this case, for two reasons:

  • It does not strip whitespace inside quotation, but only surrounding sep_char:
    allow_whitespace When this option is set to true, whitespace (TAB's and SPACE's) surrounding the separation character is removed when parsing. If either TAB or SPACE is one of the three major characters "sep_char", "quote_char", or "escape_char" it will not be considered whitespace.
  • The whitespace stripped is only SPACE and TAB, not Unicode whitespace such as the non-breaking space
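One possible workaround for the second point is to post-process the parsed fields yourself. The following is only a sketch, not something Text::CSV_XS provides: strip_unicode_ws is a hypothetical helper name, the sample row is made up, and it assumes the data has already been decoded to Perl characters (e.g. read through an :encoding(UTF-8) layer).

```perl
use strict;
use warnings;

# Hypothetical helper (not part of Text::CSV_XS): returns a copy of the
# fields with leading and trailing Unicode whitespace removed.
# \p{White_Space} covers SPACE and TAB as well as U+00A0 (no-break
# space), U+2003 (em space), U+3000 (ideographic space), and so on.
sub strip_unicode_ws {
    my @fields = @_;
    for (@fields) {
        next unless defined;          # leave undef (NULL) fields alone
        s/\A\p{White_Space}+//;       # leading
        s/\p{White_Space}+\z//;       # trailing
    }
    return @fields;
}

# Made-up row, as it might come back from $csv->getline ($fh)
my @row   = ("\x{A0}foo\x{A0}", " bar\t", "\x{2003}b a z\x{3000}");
my @clean = strip_unicode_ws (@row);

print join ("|", @clean), "\n";
```

Whitespace inside a field is left untouched; only the edges are trimmed.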

Spreadsheet::Read however offers to strip leading and trailing whitespace from every field. I could extend that on request to allow it to strip Unicode whitespace too.

strip
    If set, "ReadData ()" will remove trailing- and/or leading-
    whitespace from every field.

      strip  leading  trailing
      -----  -------  --------
        0     n/a      n/a
        1     strip    n/a
        2     n/a      strip
        3     strip    strip
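A minimal sketch of how the strip option could be used, assuming Spreadsheet::Read and a CSV backend such as Text::CSV_XS are installed. The temporary file and its contents are made up for illustration:

```perl
use strict;
use warnings;
use File::Temp qw( tempfile );
use Spreadsheet::Read;

# Write a small sample CSV (made-up data) with padded fields.
my ($fh, $file) = tempfile (SUFFIX => ".csv", UNLINK => 1);
print $fh qq{ foo ,bar  \n  baz, quux\n};
close $fh;

# strip => 3 asks for both leading and trailing whitespace to be removed.
my $book = ReadData ($file, strip => 3);

# Sheet 1, cells addressed by their spreadsheet label.
print join ("|", map { $book->[1]{$_} } qw( A1 B1 A2 B2 )), "\n";
```

Using 1 or 2 instead of 3 would strip only the leading or only the trailing side, as in the table above.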

Enjoy, Have FUN! H.Merijn

Replies are listed 'Best First'.
Re^3: This looks like whitespace in my CSV but doesn't seem to be
by Marshall (Canon) on Oct 05, 2012 at 01:38 UTC
    This sounds just fine.

    I haven't worked with any CSV files with leading spaces. That idea appears to be uncommon. But your suggestion sounds good.
