Re: about retrieving and parsing html without writing on disk


laziness, impatience, and hubris
	PerlMonks

by LanX (Saint)

on Apr 09, 2018 at 22:15 UTC ( [id://1212613]=note: print w/replies, xml )

Need Help??

hmm, I'm too busy to install the modules, but it's at least possible to open a variable for reading and writing.

open my $fh , "<", \$cache

so if you can operate with filehandles instead of files this should work.

HTML::Parser allows ->parse_file($fh) and even ->parse($string)

Maybe have a look at $string = $mech->content(...) from WWW::Mechanize

Cheers Rolf
_{(addicted to the Perl Programming Language and ☆☆☆☆ :)

Wikisyntax for the Monastery}

Comment on Re: about retrieving and parsing html without writing on disk Select or Download Code

Replies are listed 'Best First'.
Re^2: about retrieving and parsing html without writing on disk by rizzo (Curate) on Apr 10, 2018 at 00:30 UTC
Maybe have a look at $string = $mech->content(...) from WWW::Mechanize and maybe at HTTP::Response as well, because `$mech->get( $uri )` returns an object of that type.	[reply] [d/l]
Re^3: about retrieving and parsing html without writing on disk by Your Mother (Archbishop) on Apr 10, 2018 at 06:10 UTC
Good note for checking `$response->code` and such. Along those lines, for the OP, if you use WWW::Mechanize remember that it fails hard, dies, on any non-success responses, 400s and 500s, unless you set `autocheck => 0`. You also have access to the response object from the mech object with `$mech->response` so you don't necessarily need a new variable for it.	[reply] [d/l] [select]

Domain Nodelet^?

Node Status^?

node history
Node Type: note [id://1212613]
help

Chatterbox^?

How do I use this? • Last hour • Other CB clients

Other Users^?

Others having an uproarious good time at the Monastery: (3)

As of 2024-04-24 04:14 GMT

Sections^?

Information^?

Find Nodes^?

Leftovers^?

Voting Booth^?

No recent polls found