Re: Encoding entities ONLY in the <body></body> of a webpage

by little (Curate)
on Jun 14, 2003 at 13:07 UTC


in reply to Encoding entities ONLY in the <body></body> of a webpage

You might look into HTML::Parser or HTML::TokeParser, and use one of them to pick out only the content of the body element and process that further.
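
A minimal sketch of that approach, using HTML::Parser (version 3 API) together with HTML::Entities. The names encode_body_text and $in_body are my own and not part of either module, and I have not run this against your pages, so treat it as a starting point. It copies the document through unchanged and entity-encodes only the text found between <body> and </body>; script and style content is left alone.

```perl
use strict;
use warnings;
use HTML::Parser ();
use HTML::Entities qw(encode_entities);

# Hypothetical helper: returns the HTML with only body text entity-encoded.
sub encode_body_text {
    my ($html) = @_;
    my $out     = '';
    my $in_body = 0;    # true while we are between <body> and </body>

    my $p = HTML::Parser->new(
        api_version => 3,

        # Tags are copied through verbatim; we only track the body element.
        start_h => [ sub {
            my ($tagname, $text) = @_;
            $in_body = 1 if $tagname eq 'body';
            $out .= $text;
        }, 'tagname, text' ],

        end_h => [ sub {
            my ($tagname, $text) = @_;
            $in_body = 0 if $tagname eq 'body';
            $out .= $text;
        }, 'tagname, text' ],

        # dtext is the text with existing entities decoded, so re-encoding
        # it does not double-encode. CDATA-like content (script, style)
        # and anything outside the body is copied as it was.
        text_h => [ sub {
            my ($text, $dtext, $is_cdata) = @_;
            $out .= ($in_body && !$is_cdata) ? encode_entities($dtext) : $text;
        }, 'text, dtext, is_cdata' ],

        # Comments, declarations and processing instructions: copy verbatim.
        default_h => [ sub { $out .= shift }, 'text' ],
    );

    $p->unbroken_text(1);    # deliver each text run as a single chunk
    $p->parse($html);
    $p->eof;
    return $out;
}
```

Because the text is decoded first, an &amp; that is already in the body comes out as &amp; again rather than &amp;amp;. Attribute values inside the body are passed through as they are.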

On the other hand, I'd like to point out that the title and the meta tags for keywords and description can, and usually will, contain entities or characters that should be replaced with entities in order to be displayed properly.
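
To illustrate the point about meta tags, here is a small sketch using encode_entities from HTML::Entities. The helper name meta_tag is made up for this example; it only shows that attribute values such as a description need the same treatment as body text.

```perl
use strict;
use warnings;
use HTML::Entities qw(encode_entities);

# Hypothetical helper: build a meta tag with entity-encoded attribute values.
sub meta_tag {
    my ($name, $content) = @_;
    return sprintf '<meta name="%s" content="%s">',
        encode_entities($name), encode_entities($content);
}

print meta_tag('description', 'Fish & "chips"'), "\n";
```

Without the encoding, the double quotes in the description would end the content attribute early.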

Have a nice day
All decision is left to your taste

