Beefy Boxes and Bandwidth Generously Provided by pair Networks
Problems? Is your data what you think it is?

Re^2: Extracting information from file to Hash

by bart (Canon)
on Jan 18, 2012 at 22:07 UTC ( #948632=note: print w/ replies, xml ) Need Help??

in reply to Re: Extracting information from file to Hash
in thread Extracting information from file to Hash

It would be nice if the regular expression could be made less specific, but some features of your data format make that tricky (e.g. the fact that the value following "=>" can be a zero-length string).
However, I think it's pretty much guaranteed that there will be whitespace between the key/value pairs. Yet, you use /\s*/. Also, nowhere do I see specified that there even may be whitespace around the "=>" — you just made that up. As this file looks to be computer generated, I sincerely doubt that this will ever be the case. Finally: the only place do I see whitespace inside a column value, is in the final column of the line: the timestamp.

In short: I think this regex will do:

/^Collection=>(\S*) \s+ ImageCount=>(\S*) \s+ Status=>(\S*) \s+ Missing=>(\S*) \s+ Modified=>(.*\S) /x

And if you do this:

my %r = /^ (Collection)=>(\S*) \s+ (ImageCount)=>(\S*) \s+ (Status)=>(\S*) \s+ (Missing)=>(\S*) \s+ (Modified)=>(.*\S) /x;
you even get a nice hash record out of it, even though it is restricted to one match per line (otherwise, when using /g you'd get list context, with a different behavior as a result.

Comment on Re^2: Extracting information from file to Hash
Select or Download Code

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://948632]
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others romping around the Monastery: (13)
As of 2014-11-27 14:35 GMT
Find Nodes?
    Voting Booth?

    My preferred Perl binaries come from:

    Results (185 votes), past polls