Beefy Boxes and Bandwidth Generously Provided by pair Networks
Keep It Simple, Stupid
 
PerlMonks  

Re^5: can't extract node with HTML::TreeBuilder::XPath

by tobyink (Canon)
on Aug 01, 2012 at 06:35 UTC ( [id://984707]=note: print w/replies, xml ) Need Help??


in reply to Re^4: can't extract node with HTML::TreeBuilder::XPath
in thread can't extract node with HTML::TreeBuilder::XPath

Or you could read the HTML5 specification which it almost perfectly complies with. That's the whole point of it - it doesn't need to document how it parses HTML, because it parses it per spec, and the same way as almost every modern browser.

perl -E'sub Monkey::do{say$_,for@_,do{($monkey=[caller(0)]->[3])=~s{::}{ }and$monkey}}"Monkey say"->Monkey::do'
  • Comment on Re^5: can't extract node with HTML::TreeBuilder::XPath

Replies are listed 'Best First'.
Re^6: can't extract node with HTML::TreeBuilder::XPath
by Anonymous Monk on Aug 01, 2012 at 07:15 UTC

    Or you could read the HTML5 specification which it almost perfectly complies with. That's the whole point of it - it doesn't need to document how it parses HTML, because it parses it per spec, and the same way as almost every modern browser.

    How could anyone know to read that? Because you mention it here on perlmonks? The only way to even get a hint that it compiles with some html5 spec is to read the source -- the only mention in the documentation is where "foobar" is not a real HTML element name (as found in the HTML5 spec) -- in short, nowhere in your module documentation do you actually tell anyone go read w3.... for the algorithm

      "The only way to even get a hint that it compiles with some html5 spec is to read the source"

      Or to read the name of the module. It's not like it's called "WWW::XYZ::ABC"; it's called "HTML::HTML5::Parser". Perhaps it might be a parser for HTML5?

      perl -E'sub Monkey::do{say$_,for@_,do{($monkey=[caller(0)]->[3])=~s{::}{ }and$monkey}}"Monkey say"->Monkey::do'

        Or to read the name of the module. It's not like it's called "WWW::XYZ::ABC"; it's called "HTML::HTML5::Parser". Perhaps it might be a parser for HTML5?

        How clever, name as substitute for documentation

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://984707]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others drinking their drinks and smoking their pipes about the Monastery: (4)
As of 2024-04-20 02:20 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found