Beefy Boxes and Bandwidth Generously Provided by pair Networks
There's more than one way to do things
 
PerlMonks  

Re: How we can do regex of this?

by moritz (Cardinal)
on Dec 23, 2012 at 14:12 UTC ( #1010100=note: print w/ replies, xml ) Need Help??


in reply to How we can do regex of this?

A regex isn't the right tool for the job. Since the table has fixed width, use substr or unpack to extract the cells; perlpacktut has an example.

If the width of the columns isn't known in advance, use a regex to find the column width by considering the first line, and use unpack/substr for the rest of the table.

Update: some example code. Trimming of leading/trailing whitespaces left as an exercise to the reader

use 5.010; use strict; use warnings; my $head_line = <DATA>; my @c_idx; while ($head_line =~ /\S+/g) { push @c_idx, $-[0]; } my $sep = <DATA>; # ignored; while (<DATA>) { chomp; my @columns; for my $idx (0..$#c_idx) { if ($idx == $#c_idx) { push @columns, substr $_, $c_idx[$idx]; } else { push @columns, substr $_, $c_idx[$idx], $c_idx[$idx+1] - $ +c_idx[$idx]; } } use Data::Dumper; print Dumper \@columns; } __DATA__ Mod Sub-Module Model Serial ---- --------------------------- ------------------ ----------- 1 Distributed Forwarding Card WS-F6700-DFC3CXL SAL1434RLPY 3 7600 ES+ DFC XL 7600-ES+3CXL JAE14520N29 3 7600 ES+T 20x1GE SFP 76-ES+T-20GQ JAE145301XM 5 Policy Feature Card 3 7600-PFC3CXL JAE14330E6J 5 C7600 MSFC4 Daughterboard 7600-MSFC4 JAE14320QBE 6 Policy Feature Card 3 7600-PFC3CXL JAE14330EAO 6 C7600 MSFC4 Daughterboard 7600-MSFC4 JAE14320QA8 7 Distributed Forwarding Card WS-F6700-DFC3CXL SAL1433QHBR 8 Distributed Forwarding Card WS-F6700-DFC3CXL SAL1433QXF9


Comment on Re: How we can do regex of this?
Download Code
Re^2: How we can do regex of this?
by Anonymous Monk on Dec 23, 2012 at 18:46 UTC
    Somewhat simpler:
    use strict; use warnings; my @names = split(' ', <DATA>); my @lenghts = split(' ', <DATA>); my %sizes; @sizes{@names} = map { length } @lenghts; sub strip_space { my ($str) = @_; $str =~ s/^\s+//; unpack 'A*', $str; } my %data; while (defined(my $line = <DATA>)) { my $pos = 0; foreach my $name (@names) { push @{$data{$name}}, strip_space(substr($line, $pos, $sizes{$name})); $pos += $sizes{$name} + 1; } } use Data::Dumper; print Dumper \%data; __END__ Mod Sub-Module Model Serial ---- --------------------------- ------------------ ----------- 1 Distributed Forwarding Card WS-F6700-DFC3CXL SAL1434RLPY 3 7600 ES+ DFC XL 7600-ES+3CXL JAE14520N29 3 7600 ES+T 20x1GE SFP 76-ES+T-20GQ JAE145301XM 5 Policy Feature Card 3 7600-PFC3CXL JAE14330E6J 5 C7600 MSFC4 Daughterboard 7600-MSFC4 JAE14320QBE 6 Policy Feature Card 3 7600-PFC3CXL JAE14330EAO 6 C7600 MSFC4 Daughterboard 7600-MSFC4 JAE14320QA8 7 Distributed Forwarding Card WS-F6700-DFC3CXL SAL1433QHBR 8 Distributed Forwarding Card WS-F6700-DFC3CXL SAL1433QXF9

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1010100]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others making s'mores by the fire in the courtyard of the Monastery: (6)
As of 2014-09-01 19:12 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    My favorite cookbook is:










    Results (16 votes), past polls