Re: Parsing issues

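The code below may give you some hints on how to process your data. It sets the input record separator $/ to "htmlpagemark", so each read from DATA returns one whole page rather than one line: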

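use strict;
use warnings;

my @differenthtml;

# Read DATA in records that end at each "htmlpagemark" instead of at each newline.
$/ = "htmlpagemark";
while (<DATA>) {
  chomp;                        # removes the trailing "htmlpagemark" (the current $/)
  next unless length($_) > 0;   # skips the empty record before the first marker
  push @differenthtml, $_;
}

# Print every collected page with its index.
for my $item (0 .. $#differenthtml) {
  print "===Item $item ==\n$differenthtml[$item]\n";
}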
__DATA__
htmlpagemark    http://finance.yahoo.com #/q/hp?s=%5EDJI&d=10&e=8&f=2012&g=d&a=0&b=2&c=1992&z=66&y=5214
#  <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01//EN" #"http://www.w3.org/TR/html4/strict.dtd">
# It also consists of a lot of more text before the next #"h t m l p a g e m a r k"
htmlpagemark    http://another/URL
More text
#sorry if this question reveals to be noobish and simple #but i have no idea how to solve this issue
+ the=    #contents of a read text file
END
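If you later need the URL and the page text separately, the same record-separator idea can be wrapped in a small function. This is only a sketch under assumptions: split_pages is a hypothetical name, the example.com URLs are made up, and it assumes the first whitespace-delimited token after each marker is the URL, which may not hold for your real file.

```perl
use strict;
use warnings;

# Hypothetical helper: split text on a marker string and return one
# hash reference per page, holding the URL and the remaining body.
sub split_pages {
    my ($text, $marker) = @_;
    my @pages;

    # Read the string through an in-memory filehandle so the same
    # record-separator approach works without a real file.
    open my $fh, '<', \$text or die "Cannot open in-memory handle: $!";

    # local restores the previous value of $/ when the sub returns.
    local $/ = $marker;
    while (my $record = <$fh>) {
        chomp $record;                  # strips the trailing marker
        next unless $record =~ /\S/;    # skip empty or whitespace-only records
        # Assumption: the first whitespace-delimited token is the URL.
        my ($url, $body) = $record =~ /^\s*(\S+)\s*(.*)\z/s;
        push @pages, { url => $url, body => $body };
    }
    close $fh;
    return @pages;
}

# Made-up sample input in the same shape as the data above.
my $sample = "htmlpagemark    http://example.com/a\nfirst body\n"
           . "htmlpagemark    http://example.com/b\nsecond body\n";

for my $page (split_pages($sample, 'htmlpagemark')) {
    print "URL:  $page->{url}\nBody: $page->{body}\n";
}
```

Using local here keeps the changed $/ from leaking into other reads in a larger script, which the global assignment in the short example above does not do.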

"By three methods we may learn wisdom: First, by reflection, which is noblest; Second, by imitation, which is easiest; and third by experience, which is the bitterest." -Confucius
