Beefy Boxes and Bandwidth Generously Provided by pair Networks
Perl Monk, Perl Meditation
 
PerlMonks  

Re: Re: Re: HTML <=> Text convertion

by Willard B. Trophy (Hermit)
on Dec 10, 2003 at 16:17 UTC ( #313761=note: print w/ replies, xml ) Need Help??


in reply to Re: Re: HTML <=> Text convertion
in thread HTML <=> Text convertion

Well, if it's your last resort, you are wasting a huge amount of your time and effort. Lazy Programmers -- and you do aspire to be one -- always use the quickest solution first.

I prefer w3m -dump over lynx for generating plain text from HTML. It handles tables properly. It runs CGI locally for testing HTML output.

If you are wanting text you can reformat easily, use the -cols option. It's your friend for stripping markup.

--
bowling trophy thieves, die!


Comment on Re: Re: Re: HTML <=> Text convertion
Re: Re: Re: Re: HTML <=> Text convertion
by TVSET (Chaplain) on Dec 11, 2003 at 21:44 UTC
    There was a reason I wanted to do it the "Perl-way". I am not the only root on the system, but I pretty much the only doing Perl there. Therefor, nothing Perl-related changes without my knowledge on that machine. Lynx/links/w3m though can be removed/upgraded without me noticing. Easy to fix, I know, but good enough reason for me to try to find something else as a solution. :)

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://313761]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others drinking their drinks and smoking their pipes about the Monastery: (7)
As of 2014-07-25 23:13 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    My favorite superfluous repetitious redundant duplicative phrase is:









    Results (175 votes), past polls