Beefy Boxes and Bandwidth Generously Provided by pair Networks
P is for Practical
 
PerlMonks  

Re^3: TreeBuilder and encoding

by Khen1950fx (Canon)
on Jul 15, 2013 at 23:08 UTC ( #1044470=note: print w/ replies, xml ) Need Help??


in reply to Re^2: TreeBuilder and encoding
in thread TreeBuilder and encoding

I think that you are working it a little to hard. There is no "utf-8", but there is ":utf8". I always use ":encoding(UTF-8)", just to be safe.

Here's what I did: If you use the new_from_url method, then it will call LWP::UserAgent for you.
#!/usr/bin/perl use strict; use warnings; use HTML::TreeBuilder 5 -weak; my $url = 'http://buyingguide.winemag.com/catalog/peju-1998-reserve-cabern +et-sauvignon-napa-rutherford'; my $tree = HTML::TreeBuilder->new_from_url( $url ); $tree->parse_content( $url ); my $review_et = $tree->look_down('itemprop', 'reviewBody'); binmode STDOUT, ":encoding(UTF-8)"; print $review_et->as_text; $tree->delete;


Comment on Re^3: TreeBuilder and encoding
Select or Download Code

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1044470]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others surveying the Monastery: (14)
As of 2015-07-31 12:09 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    The top three priorities of my open tasks are (in descending order of likelihood to be worked on) ...









    Results (277 votes), past polls