http://www.perlmonks.org?node_id=1004125


in reply to Parsing issues

The code below may give you some hints on how to process your data:
use strict; use warnings; my @differenthtml; $/="htmlpagemark"; while (<DATA>){ chomp; next if $_ eq "htmlpagemark"; next unless length($_) > 0; push @differenthtml, $_; } for my $item (0..$#differenthtml){ print "===Item $item ==\n$differenthtml[$item]\n" } __DATA__ htmlpagemark http://finance.yahoo.com #/q/hp?s=%5EDJI&d=10&e=8&f=20 +12&g=d&a=0&b=2&c=199 +2&z=66&y=5214 # <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01//EN" #"http://www.w3.o +rg/TR/html4/strict.dtd"> # It also consists of a lot of more text before the next #"h t m l p a + g e m a r k" htmlpagemark http://another/URL More text #sorry if this question reveals to be noobish and simple #but i have n +o idea how to solve this issue + the= #contents of a read text file END

             "By three methods we may learn wisdom: First, by reflection, which is noblest; Second, by imitation, which is easiest; and third by experience, which is the bitterest."           -Confucius