Beefy Boxes and Bandwidth Generously Provided by pair Networks
more useful options

[SOLVED] How to get text from site?

by milovidov (Novice)
on Feb 12, 2013 at 01:01 UTC ( #1018272=perlquestion: print w/ replies, xml ) Need Help??
milovidov has asked for the wisdom of the Perl Monks concerning the following question:

Hello all!

Is exist way for getting text from any site? It means that need text from site as can see him visitor of site.

Thanks for your attention!


I found solution simular that offered by grondilu, but with Lynx browser, instead elinks. Because Lynx is a cross-platform solution

Comment on [SOLVED] How to get text from site?
Replies are listed 'Best First'.
Re: How to get text from site?
by trizen (Hermit) on Feb 12, 2013 at 01:54 UTC
    HTML::Strip may be helpful.
    use strict; use warnings; use encoding qw(UTF-8); use HTML::Strip qw(); use LWP::Simple qw(get); my $url = ''; my $hs = HTML::Strip->new(); my $clean_text = $hs->parse(get($url)); print $clean_text;
Re: How to get text from site?
by grondilu (Pilgrim) on Feb 12, 2013 at 12:25 UTC

    If you want the text to be at least roughly formatted, you can use the '-dump' option of a text-only browser:

    print qx{elinks -dump};

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: perlquestion [id://1018272]
Approved by ww
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others chilling in the Monastery: (13)
As of 2015-11-30 16:04 GMT
Find Nodes?
    Voting Booth?

    What would be the most significant thing to happen if a rope (or wire) tied the Earth and the Moon together?

    Results (777 votes), past polls