Re^6: How to grab a portion of file with regex


by Anonymous Monk

on Mar 15, 2013 at 06:39 UTC

in reply to Re^5: How to grab a portion of file with regex
in thread How to grab a portion of file with regex

You make some valid points. The example in the question didn't seem to need the content of the div. I do agree that working with the DOM is a much better way to parse HTML.

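#!/usr/bin/perl --
use strict;
use warnings;
use XML::LibXML;

# recover => 2 lets the parser carry on through broken HTML without reporting errors
my $dom = XML::LibXML->new( recover => 2 )->load_html(
    location => q{junk.html},
);

# every div whose class attribute contains the string 'lastUnit'
for my $div ( $dom->findnodes( q{//div[ contains( @class, 'lastUnit' ) ]} ) ) {
    # print the node's path in the tree, then the node itself serialized
    print "\n", $div->nodePath, "\n\n$div\n";
}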
__END__
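
One caveat with the XPath above: contains( @class, 'lastUnit' ) is a substring test, so it would also pick up a class such as lastUnitWide. If that matters, a common workaround is to pad the normalized class list with spaces and look for the padded token. A minimal sketch, assuming XML::LibXML is installed; the inline markup and the class name lastUnitWide are made up for illustration and are not from the original question:

```perl
#!/usr/bin/perl --
use strict;
use warnings;
use XML::LibXML;

# Hypothetical sample markup, inlined so the sketch does not need junk.html
my $html = <<'HTML';
<html><body>
<div class="unit lastUnit">wanted</div>
<div class="lastUnitWide">substring match only</div>
</body></html>
HTML

my $dom = XML::LibXML->load_html( string => $html, recover => 2 );

# Substring test, same idea as the script above
my $loose = q{//div[ contains( @class, 'lastUnit' ) ]};

# Whole-token test: wrap the normalized class list in spaces, then
# search for the class name with a space on each side
my $strict = q{//div[ contains( concat( ' ', normalize-space( @class ), ' ' ), ' lastUnit ' ) ]};

for my $xpath ( $loose, $strict ) {
    my @nodes = $dom->findnodes($xpath);
    print scalar(@nodes), " match(es) for $xpath\n";
    print "  ", $_->textContent, "\n" for @nodes;
}
```

Whether the stricter form is worth the extra noise depends on the page; for a one-off scrape of a known file the plain contains() is probably fine.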
