http://www.perlmonks.org?node_id=265888


in reply to Encoding entities ONLY in the <body></body> of a webpage

you might look into HTML::Parser and HTML::TokeParser, then utilize the one or the other to get only the content of the body element and process that further.

On the other hand I'd like to point to the fact that meta tags for keywors and description and title also can and mostly will contain entities or characters that should be replaced with entities to be properly displayed.

Have a nice day
All decision is left to your taste

  • Comment on Re: Encoding entities ONLY in the <body></body> of a webpage