Split isn't going to get your far on this. What you need is a good regex. Try this:
while ( my $line = <DATA>){
my( @a, @b );
my ( $junk, $useful ) = split /\;/, $line;
my @elements = $useful =~ m/
(\d+ # some numbers
(?: # followed by (don't capture)
\(.*?\) # other stuff (non-greedy) in parens
)? # (optionally)
) # end capture
/smxg;
for my $element ( @elements ){
if ( $element =~ /\(.*\)/ ){
push @b, $element;
}
else{
push @a, $element;
}
}
# do something with @a and @b
}
__DATA__
2302.1;2302, 2304(3-8, 92-99), 2305(2, 4-9)
231.1;2301, 2303, 2304(1, 2, 90, 91), 2305(1, 3), 2306, 2307, 2308
I'm sure you could do this more compactly, but I didn't take the time to optimize. The first RE does the split you're looking for, followed by a check for each item to see if it contains parens.
perl -e 'split//,q{john hurl, pest caretaker}and(map{print @_[$_]}(joi
+n(q{},map{sprintf(qq{%010u},$_)}(2**2*307*4993,5*101*641*5261,7*59*79
+*36997,13*17*71*45131,3**2*67*89*167*181))=~/\d{2}/g));'
-
Are you posting in the right place? Check out Where do I post X? to know for sure.
-
Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
<code> <a> <b> <big>
<blockquote> <br /> <dd>
<dl> <dt> <em> <font>
<h1> <h2> <h3> <h4>
<h5> <h6> <hr /> <i>
<li> <nbsp> <ol> <p>
<small> <strike> <strong>
<sub> <sup> <table>
<td> <th> <tr> <tt>
<u> <ul>
-
Snippets of code should be wrapped in
<code> tags not
<pre> tags. In fact, <pre>
tags should generally be avoided. If they must
be used, extreme care should be
taken to ensure that their contents do not
have long lines (<70 chars), in order to prevent
horizontal scrolling (and possible janitor
intervention).
-
Want more info? How to link
or How to display code and escape characters
are good places to start.