Beefy Boxes and Bandwidth Generously Provided by pair Networks
Just another Perl shrine
 
PerlMonks  

Re^2: Extracting HTML content between the h tags

by vagabonding electron (Hermit)
on Aug 05, 2012 at 14:34 UTC ( #985531=note: print w/ replies, xml ) Need Help??


in reply to Re: Extracting HTML content between the h tags
in thread Extracting HTML content between the h tags

flexvault

Thank you very much!
Since I have read a lot that one should not parse a HTML without a module I did not try this before either :-)
I will certainly check this approach out.
I think it could be difficult in case that the last hr-tag is missing (described in Re^3: Extracting HTML content between the h tags ).
Thanks again!


Comment on Re^2: Extracting HTML content between the h tags
Re^3: Extracting HTML content between the h tags
by flexvault (Parson) on Aug 05, 2012 at 15:14 UTC
    p>vagabonding electron,

    For the missing hr-tag, just test for $keyword after the 'while' loop:

    if ( $keyword ) ## Same as if ( $keyword ne "" ) { print "$keyword\n"; }

    Whenever I start a new project/gig, I try to think whether this is similar to something I've done before, and if it is, then I use that code or technique as the starting point. If it is totally new (very rare), I still have a bag of tricks ( subroutines ) that I copy ( use ... ) into the new work. Look at every thing you do today as something you may be able to use for the rest of your programming life.

    You're lucky to have Perl, since a lot of the code I did before Perl is worthless today, but knowledge and techniques can be applied to Perl!

    Good Luck!

    "Well done is better than well said." - Benjamin Franklin

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://985531]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others browsing the Monastery: (7)
As of 2014-07-25 10:47 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    My favorite superfluous repetitious redundant duplicative phrase is:









    Results (170 votes), past polls