http://www.perlmonks.org?node_id=1045234


in reply to Retrieve select information from HTML

Any help would be appreciated. I did look at HTML::Miner and HTML::Tree but neither seemed to accomplish what I needed with my limited knowledge of HTML.

See HTML::Tree::Scanning and these walkthroughs

See htmltreexpather.pl , Parsing HTML / Re^4: Parsing HTML, A regex question , NASA's Astronomy Picture of the Day / Re: NASA's Astronomy Picture of the Day , Re: Extracting HTML content between the h tags, Re^2: Help With Online Table Scraper, Re^4: web::scraper using an xpath, .... HTML Parser suggestions

xpather.pl
Re: Get Node Value from irregular XML (xpather.pl)
Re: Having trouble with siblings
Re^2: XML parsing and Lists
Re: Counting number of child nodes based on element value (typos)
Re^3: Extracting specific childnodes (xpath whitespace)
Re^3: Extracting specific childnodes (play xmllint --shell )
Re: How do i get value of an element if the next elememnt has specific value in XML::LibXML using Xpath?
Re: How do i get value of an element if the next elememnt has specific value in XML::LibXML using Xpath?
Re: How to parse xml with namespase vale in XMl:LibXML? ( XPath error : Undefined namespace prefix )
Re^2: How to parse xml with namespase vale in XMl:LibXML? (xmllint --shell setns / xpathtester)

//div[@class='warrantyDetails']/div[@class='warrantDescription'] //div[@class='warrantDescription'] //div[@class='warrantServiceTagMain'] //div[@class='warrantServiceTagSub'] //div[@class='TopTwoWarrantySummaryDiv'] //div[@class='TopTwoWarrantyListItem'] //*[ name() = "meta" and @name = "supportservicetag" ] //*[ name() = "meta" and @name = "supportproductcode" ] //*[ name() = "meta" and @name = "supportproductselected" ] //*[ name() = "meta" and @name = "servicetag" ]