Beefy Boxes and Bandwidth Generously Provided by pair Networks
go ahead... be a heretic
 
PerlMonks  

Re: split at '- - -'

by blue_cowdawg (Monsignor)
on May 17, 2013 at 13:39 UTC ( #1033974=note: print w/replies, xml ) Need Help??


in reply to split at '- - -'

Since I wasn't 100% sure of what you were asking for, I give you two! two solutions in one!

#!/usr/bin/perl -w use strict; use Data::Dumper; my $line="this---is---Sparta!"; # # Case 1 a line with *fields* seperated by dashes my @f=split('---',$line); #split the line along the dashes print Dumper(\@f); # Case 2 Records seperated by three dashes my $oldIFS=$/; #save the old IFS $/ = undef; # make it undefined $line=<DATA>; # slurp in file $/=$oldIFS; # restore IFS @f = split('---',$line); # split on dashes $_ =~ s/^\n// foreach @f; # remove leading EOL chomp (@f); # remove traiing EOL print Dumper(\@f); exit(0); # we're done, going home __END__ line1 --- line2 --- line3
when run gives you:
$VAR1 = [ 'this', 'is', 'Sparta!' ]; $VAR1 = [ 'line1', 'line2', 'line3' ];


Peter L. Berghold -- Unix Professional
Peter -at- Berghold -dot- Net; AOL IM redcowdawg Yahoo IM: blue_cowdawg

Replies are listed 'Best First'.
Re^2: split at '- - -'
by tobyink (Abbot) on May 17, 2013 at 13:54 UTC

    undef isn't the only useful thing you can assign to $/...

    use strict; use Data::Dumper; my @f; { local $/ = "\n---\n"; chomp(@f = <DATA>); } print Dumper(\@f); exit; __END__ line1 --- line2 --- line3 ---
    package Cow { use Moo; has name => (is => 'lazy', default => sub { 'Mooington' }) } say Cow->new->name
Re^2: split at '- - -'
by hdb (Prior) on May 17, 2013 at 14:00 UTC

    As a small variant, there could also be a hybrid case where you want to split on "---" if they are on a line by themselves but not if they occur within a line. Adapting Peter's code:

    #!/usr/bin/perl -w use strict; use Data::Dumper; my $line; { local $/ = undef; $line=<DATA>; } chomp $line; my @f = split('\n---\n',$line); print Dumper(\@f); __END__ line1 --- line2---line2a --- line3
Re^2: split at '- - -'
by Saved (Beadle) on May 17, 2013 at 14:05 UTC
    Thanx, looking good...

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1033974]
help
Chatterbox?
[ambrus]: Corion: from the docs, it definitely looks like it would be possible to write an AnyEvent driver for it using Prima::File and Prima::Timer.
[ambrus]: (I haven't found an idle event.)
[ambrus]: However, I also looked at the internals, and found two interesting things:
[marto]: stupid xpath question, I have a xpath copied from the browser, is the * wildcard supposed to work for IDs? e.g. '//*[@id="*"]/ div/div/header/p/a '?
[ambrus]: The main loop for unixish systems is in https://metacpan. org/source/ KARASIK/Prima-1. 49/unix/apc_app.c. It is a hand-rolled select loop, but also always spins at least once every 0.2 seconds for some reason.
[ambrus]: marto: no, not that way. if you just want to test for an existing id attribute, then write *[@id] , or you can match a regexp to the value of that attribute like *[@id=~/something/ ]
[marto]: ambrus, thanks I ended up going with '//*[starts- with(@id, "thread_")]...
[ambrus]: Sometimes it's easier to just match elements in ways other than xpath, like directly with perl code.

How do I use this? | Other CB clients
Other Users?
Others about the Monastery: (6)
As of 2016-12-08 10:50 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?
    On a regular basis, I'm most likely to spy upon:













    Results (140 votes). Check out past polls.