PerlMonks  

Re: Perl HTML:: Parser

by 2teez (Priest)
on Apr 26, 2013 at 04:41 UTC (#1030761)


in reply to Perl HTML:: Parser

Hi Anonymous Monk,
Since you used the module HTML::FormatText, I assume you want your output as plain text, not the whole HTML page with all its tags rendered as text.
If that is what you want, you can do it like so:

use warnings;
use strict;
use HTML::TreeBuilder 5 -weak;
use HTML::FormatText;

my $tree   = HTML::TreeBuilder->new_from_url("http://www.google.com");
my $format = HTML::FormatText->new( leftmargin => 3, rightmargin => 50 );
print $format->format($tree);
Output:
Search Images Maps Play YouTube News Gmail Drive More Web History | Settings | Sign in Nigeria Advanced searchLanguage tools Google.com.ng offered in: Hausa Igbo Yorb Pidgin Advertising ProgramsBusiness SolutionsAbout GoogleGoogle.com 2013 - Privacy & Terms
NOTE:
  1. Of course, you might need the module LWP::UserAgent to fetch the HTML if you don't already have it stored in a file.
  2. Please note how the module HTML::TreeBuilder is used here; if you already have your HTML file, you might use a different method to build the tree.
  3. However, if you are on a Linux OS, you can also use lynx, like so: lynx -dump http://www.google.com
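As a rough sketch of notes 1 and 2, here is one way to do the same conversion when the HTML is already in hand instead of being fetched by URL. The sample markup and the helper name html_to_text are made up for illustration; new_from_content is the HTML::TreeBuilder constructor that parses a string.

```perl
use warnings;
use strict;
use HTML::TreeBuilder 5 -weak;
use HTML::FormatText;

# Hypothetical sample markup standing in for a page you already have.
my $html = <<'HTML';
<html><head><title>Sample</title></head>
<body><h1>Hello</h1><p>Plain <b>text</b> only.</p></body></html>
HTML

# html_to_text is an illustrative helper, not part of either module.
# It parses an HTML string and returns the formatted plain text.
sub html_to_text {
    my ($content) = @_;
    my $tree      = HTML::TreeBuilder->new_from_content($content);
    my $formatter = HTML::FormatText->new( leftmargin => 3, rightmargin => 50 );
    return $formatter->format($tree);
}

my $text = html_to_text($html);
print $text;
```

If the page is saved on disk, HTML::TreeBuilder->new_from_file($path) should serve the same purpose; if you fetch it yourself with LWP::UserAgent, you could pass the response's decoded_content to html_to_text.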
I hope this helps. graff was right to ask for more clarity about what you wanted done.

If you tell me, I'll forget.
If you show me, I'll remember.
if you involve me, I'll understand.
--- Author unknown to me

