Beefy Boxes and Bandwidth Generously Provided by pair Networks
The stupid question is the question not asked
 
PerlMonks  

[SOLVED] How to get text from site?

by milovidov (Novice)
on Feb 12, 2013 at 01:01 UTC ( #1018272=perlquestion: print w/replies, xml ) Need Help??
milovidov has asked for the wisdom of the Perl Monks concerning the following question:

Hello all!

Is exist way for getting text from any site? It means that need text from site as can see him visitor of site.

Thanks for your attention!

UPDATE

I found solution simular that offered by grondilu, but with Lynx browser, instead elinks. Because Lynx is a cross-platform solution

Replies are listed 'Best First'.
Re: How to get text from site?
by trizen (Hermit) on Feb 12, 2013 at 01:54 UTC
    HTML::Strip may be helpful.
    use strict; use warnings; use encoding qw(UTF-8); use HTML::Strip qw(); use LWP::Simple qw(get); my $url = 'http://example.com'; my $hs = HTML::Strip->new(); my $clean_text = $hs->parse(get($url)); print $clean_text;
Re: How to get text from site?
by grondilu (Friar) on Feb 12, 2013 at 12:25 UTC

    If you want the text to be at least roughly formatted, you can use the '-dump' option of a text-only browser:

    print qx{elinks -dump http://www.perl.org};

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: perlquestion [id://1018272]
Approved by ww
help
Chatterbox?
and all is quiet...

How do I use this? | Other CB clients
Other Users?
Others chilling in the Monastery: (7)
As of 2016-12-04 02:46 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?
    On a regular basis, I'm most likely to spy upon:













    Results (61 votes). Check out past polls.