Beefy Boxes and Bandwidth Generously Provided by pair Networks
Problems? Is your data what you think it is?
 
PerlMonks  

Re: Getting the innerHTML from the DOM , not the “source”

by moritz (Cardinal)
on Apr 23, 2013 at 18:28 UTC ( #1030211=note: print w/ replies, xml ) Need Help??


in reply to Getting the innerHTML from the DOM , not the “source”

You need something that processes client-side javascript. WWW::Mechanize::Firefox comes to mind; I don't know it well enough to tell you for sure if it can do it, but it looks like a good option to investigate.


Comment on Re: Getting the innerHTML from the DOM , not the “source”
Re^2: Getting the innerHTML from the DOM , not the “source”
by Karels (Initiate) on Apr 25, 2013 at 16:45 UTC
    OK, so I am using WWW::Mechanize::Firefox and I am getting at the data I am after. From a processing POV I need to loop through several pages. I would like to update the contents of $mech-> with the contents of a new url--it there a way to do this? When I try re-issuing $mech->get with a new argument the program appears to hang... I don't see anything obvious in the Cookbook, examples, etc.

    Actually, now I get a warning:

    Subroutine MozRepl::__load_plugins redefined at C:/Perl/site/lib/Modul +e/Pluggable/Fast.pm line 104.
    Any thoughts would be welcome.

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1030211]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others wandering the Monastery: (3)
As of 2015-07-05 03:23 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    The top three priorities of my open tasks are (in descending order of likelihood to be worked on) ...









    Results (60 votes), past polls