
Re: Getting the innerHTML from the DOM , not the “source”

by moritz (Cardinal)
on Apr 23, 2013 at 18:28 UTC

in reply to Getting the innerHTML from the DOM , not the “source”

You need something that executes client-side JavaScript. WWW::Mechanize::Firefox comes to mind; I don't know it well enough to tell you for sure whether it can do this, but it looks like a good option to investigate.
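As a starting point for that investigation, here is a minimal, untested sketch going by the module's documentation. It assumes Firefox is running with the MozRepl extension enabled; the URLs and the 'body' selector are placeholders for illustration, not anything from the original question.

```perl
use strict;
use warnings;
use WWW::Mechanize::Firefox;

# Assumption: Firefox is already running with the MozRepl extension active.
my $mech = WWW::Mechanize::Firefox->new();

# Hypothetical URLs, for illustration only.
my @urls = (
    'http://example.com/page1',
    'http://example.com/page2',
);

for my $url (@urls) {
    # Reuse the same $mech object for each page.
    $mech->get($url);

    # content() returns the page as the browser currently holds it,
    # i.e. after client-side JavaScript has modified the DOM.
    my $html = $mech->content;
    print length($html), " characters from $url\n";

    # innerHTML of individual nodes; 'body' is a placeholder selector.
    for my $node ($mech->selector('body')) {
        print $node->{innerHTML}, "\n";
    }
}
```

Whether this behaves well on a given site depends on how and when the page's scripts fill in the DOM, so treat it as something to experiment with rather than a known-good recipe.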

Re^2: Getting the innerHTML from the DOM , not the “source”
by Karels (Initiate) on Apr 25, 2013 at 16:45 UTC
    OK, so I am using WWW::Mechanize::Firefox and I am getting at the data I am after. From a processing POV I need to loop through several pages. I would like to update the contents of $mech with the contents of a new URL. Is there a way to do this? When I try re-issuing $mech->get with a new argument, the program appears to hang. I don't see anything obvious in the Cookbook, examples, etc.

    Actually, now I get a warning:

    Subroutine MozRepl::__load_plugins redefined at C:/Perl/site/lib/Module/Pluggable/ line 104.
    Any thoughts would be welcome.
