Beefy Boxes and Bandwidth Generously Provided by pair Networks
There's more than one way to do things
 
PerlMonks  

Re: Retrieve select information from HTML

by Anonymous Monk
on Jul 19, 2013 at 01:13 UTC ( [id://1045234]=note: print w/replies, xml ) Need Help??


in reply to Retrieve select information from HTML

Any help would be appreciated. I did look at HTML::Miner and HTML::Tree but neither seemed to accomplish what I needed with my limited knowledge of HTML.

See HTML::Tree::Scanning and these walkthroughs

See htmltreexpather.pl , Parsing HTML / Re^4: Parsing HTML, A regex question , NASA's Astronomy Picture of the Day / Re: NASA's Astronomy Picture of the Day , Re: Extracting HTML content between the h tags, Re^2: Help With Online Table Scraper, Re^4: web::scraper using an xpath, .... HTML Parser suggestions

xpather.pl
Re: Get Node Value from irregular XML (xpather.pl)
Re: Having trouble with siblings
Re^2: XML parsing and Lists
Re: Counting number of child nodes based on element value (typos)
Re^3: Extracting specific childnodes (xpath whitespace)
Re^3: Extracting specific childnodes (play xmllint --shell )
Re: How do i get value of an element if the next elememnt has specific value in XML::LibXML using Xpath?
Re: How do i get value of an element if the next elememnt has specific value in XML::LibXML using Xpath?
Re: How to parse xml with namespase vale in XMl:LibXML? ( XPath error : Undefined namespace prefix )
Re^2: How to parse xml with namespase vale in XMl:LibXML? (xmllint --shell setns / xpathtester)

//div[@class='warrantyDetails']/div[@class='warrantDescription'] //div[@class='warrantDescription'] //div[@class='warrantServiceTagMain'] //div[@class='warrantServiceTagSub'] //div[@class='TopTwoWarrantySummaryDiv'] //div[@class='TopTwoWarrantyListItem'] //*[ name() = "meta" and @name = "supportservicetag" ] //*[ name() = "meta" and @name = "supportproductcode" ] //*[ name() = "meta" and @name = "supportproductselected" ] //*[ name() = "meta" and @name = "servicetag" ]

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://1045234]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others having an uproarious good time at the Monastery: (6)
As of 2024-04-18 21:01 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found