PerlMonks  

Re: question about lookaheads and threatexpert/html parsing

by tangent (Parson)
on Mar 23, 2016 at 23:49 UTC ( [id://1158664] )


in reply to question about lookaheads and threatexpert/html parsing

This is how you might do it with HTML::TreeBuilder::XPath:
use strict;
use warnings;
use Data::Dumper;
use HTML::TreeBuilder::XPath;

my $html = q|
<ul><li>The following Host Names were requested from a host database:</li>
  <ul>
    <li>192.5.5.241</li>
    <li>192.5.5.242</li>
  </ul></ul>
|;

my $tree = HTML::TreeBuilder::XPath->new;
$tree->parse($html);
$tree->eof;

my @wanted;
my @nodes = $tree->findnodes('//ul');
for my $node ( @nodes ) {
    # Skip any <ul> that has no text in a direct <li> child
    my $text = $node->findvalue('li') or next;
    # Only the list introduced by the host-name sentence is of interest
    $text =~ m/^The following Host Name/ or next;
    # Collect the text of each <li> in the nested <ul>, then stop searching
    @wanted = $node->findvalues('ul/li');
    last;
}
print Dumper \@wanted;
Output:
$VAR1 = [
          '192.5.5.241',
          '192.5.5.242'
        ];
