Beefy Boxes and Bandwidth Generously Provided by pair Networks
No such thing as a small change

Re: To split with spaces

by sundialsvc4 (Abbot)
on Aug 05, 2013 at 12:31 UTC ( #1047901=note: print w/ replies, xml ) Need Help??

in reply to To split with spaces

Here are another couple of useful tips:

  • Use hexdump or a similar tool to examine the contents of the file byte-by-byte.   Don’t assume anything:   a “blank space” could be tabs, spaces, or even characters that are unprintable according to the internationalization (I18N) settings of whatever tool you may happen to be using.   When you are showing excerpts of such files to us, enclose them in <code> tags.   You can write a program to split according to any sort of bright-line rule.
  • Once you think you have a bright-line rule, write a script to prove it.   Take every assumption that you think holds true for the entire catalog of such files that you have, then write scripts that will survive only-if those assumptions are correct; otherwise they die in a meaningful way.   Run those scripts against a broad cross-section of the files.   Run them automatically against new files that come in.   (Sometimes you find that you are debugging, not only the programs that you wrote to consume the files, but the programs that other people wrote to produce them.)

Comment on Re: To split with spaces

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1047901]
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others rifling through the Monastery: (5)
As of 2016-02-14 22:18 GMT
Find Nodes?
    Voting Booth?

    How many photographs, souvenirs, artworks, trophies or other decorative objects are displayed in your home?

    Results (471 votes), past polls