Beefy Boxes and Bandwidth Generously Provided by pair Networks httptech
Don't ask to ask, just ask
 
PerlMonks  

Re: Getting the innerHTML from the DOM , not the “source”

by moritz (Cardinal)
on Apr 23, 2013 at 18:28 UTC ( #1030211=note: print w/ replies, xml ) Need Help??


in reply to Getting the innerHTML from the DOM , not the “source”

You need something that processes client-side javascript. WWW::Mechanize::Firefox comes to mind; I don't know it well enough to tell you for sure if it can do it, but it looks like a good option to investigate.


Comment on Re: Getting the innerHTML from the DOM , not the “source”
Re^2: Getting the innerHTML from the DOM , not the “source”
by Karels (Initiate) on Apr 25, 2013 at 16:45 UTC
    OK, so I am using WWW::Mechanize::Firefox and I am getting at the data I am after. From a processing POV I need to loop through several pages. I would like to update the contents of $mech-> with the contents of a new url--it there a way to do this? When I try re-issuing $mech->get with a new argument the program appears to hang... I don't see anything obvious in the Cookbook, examples, etc.

    Actually, now I get a warning:

    Subroutine MozRepl::__load_plugins redefined at C:/Perl/site/lib/Modul +e/Pluggable/Fast.pm line 104.
    Any thoughts would be welcome.

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1030211]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others meditating upon the Monastery: (11)
As of 2014-04-19 15:29 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    April first is:







    Results (482 votes), past polls