Beefy Boxes and Bandwidth Generously Provided by pair Networks
Clear questions and runnable code
get the best and fastest answer
 
PerlMonks  

Re^2: How to extract xpath from the webpage

by perladdict (Chaplain)
on Nov 04, 2009 at 05:50 UTC ( #804855=note: print w/ replies, xml ) Need Help??


in reply to Re: How to extract xpath from the webpage
in thread How to extract xpath from the webpage

Hi Corion, I am doing web page automation to find the links, text and image links by using selenium,which uses xpath to locate the links like "//td2/div/a/img" from the web page source. I am trying.
I am trying with Html::TreeBuilder::xpath, i don't know what are all the other modules i can import in my script.


Comment on Re^2: How to extract xpath from the webpage
Re^3: How to extract xpath from the webpage
by Corion (Pope) on Nov 04, 2009 at 08:10 UTC

    If Selenium supports XPath queries, you don't need any Perl XPath modules. If you want to access Selenium and its results, see WWW::Selenium. If you want to use HTML::TreeBuilder::XPath, I'm not sure where your actual problem in your code is. The "synopsis" section shows how to extract HTML fragments from a given HTML string. Maybe you want to fetch the images using LWP::UserAgent then?

    Personally, I automate websites with WWW::Mechanize::FireFox, which supports Javascript (and XPath).

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://804855]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others rifling through the Monastery: (12)
As of 2015-07-03 07:30 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    The top three priorities of my open tasks are (in descending order of likelihood to be worked on) ...









    Results (48 votes), past polls