Re^4: How to grab a portion of file with regex


laziness, impatience, and hubris
	PerlMonks

Re^4: How to grab a portion of file with regex

by Anonymous Monk

on Mar 15, 2013 at 04:00 UTC ( [id://1023613]=note: print w/replies, xml )

Need Help??

in reply to Re^3: How to grab a portion of file with regex
in thread How to grab a portion of file with regex

Well instead of trolling why not supply a working example to help ?? Its always the Anonymous Monk lacking courage to put a name to a comment

How is it trolling to point out the shortcomings of a "solution"? Maybe you should look up the definition of troll

What courage is required to point out a simple fact about HTML::Parser? Are you under the impression that HTML::Parser is a high level parser?

Your "solution" doesn't fetch the portion of page from class = lastUnit to class = line margin10 -- its incomplete -- it is lots easier/shorter/simpler to use m{\Q$start\E(.+?)\Q$end\E}i instead of that HTML::Parser low-levelness

Have you seen Re: How to grab a portion of file with regex (don't)? Its not unlike a minimum of three different tutorials/walkthroughs/step-by-step-instructions on extracting/xpathing the dom , some even compare/contrast with HTML::Parser

Comment on Re^4: How to grab a portion of file with regex Download Code

Replies are listed 'Best First'.
Re^5: How to grab a portion of file with regex by kielstirling (Scribe) on Mar 15, 2013 at 04:22 UTC
You make some valid points. The example in the question didn't seem to need the content of the div. I do agree that working with the DOM is a much better way to parse HTML.	[reply]
Re^6: How to grab a portion of file with regex by Anonymous Monk on Mar 15, 2013 at 06:39 UTC
You make some valid points. The example in the question didn't seem to need to content of the div. I do agree that working with the DOM is a much better way to parse HTML. < :) `#!/usr/bin/perl -- use strict; use warnings; use XML::LibXML; my $dom = XML::LibXML->new( qw/ recover 2 / )->load_html( location => q{junk.html} ); for ( $dom->findnodes(q{ //div[ contains( @class, 'lastUnit' ) ] }) ){ print "\n", $_->nodePath, "\n\n$_\n"; } __END__` [download]	[reply] [d/l]

In Section Seekers of Perl Wisdom

Domain Nodelet^?

www.com | www.net | www.org

Node Status^?

node history
Node Type: note [id://1023613]
help

Chatterbox^?

How do I use this? • Last hour • Other CB clients

Other Users^?

Others having an uproarious good time at the Monastery: (5)

As of 2024-04-19 15:26 GMT

Sections^?

Information^?

Find Nodes^?

Leftovers^?

Today I Learned

Voting Booth^?

No recent polls found