Beefy Boxes and Bandwidth Generously Provided by pair Networks
Think about Loose Coupling
 
PerlMonks  

Re: Parsing Cell Contents of Extracted HTML Tables

by epoptai (Curate)
on Jun 29, 2001 at 23:43 UTC ( #92785=note: print w/ replies, xml ) Need Help??


in reply to Parsing Cell Contents of Extracted HTML Tables

I can't solve your problem but can tell you that the 'decode' method only toggles the use of HTML::Entities. Look into the 'br_translate' method which translates <br> to \n to eliminate the strange concatenation.

Perhaps you could use the information extracted from the table to reparse the file for links and such.

--
Check out my Perlmonks Related Scripts like framechat, reputer, and xNN.


Comment on Re: Parsing Cell Contents of Extracted HTML Tables

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://92785]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others taking refuge in the Monastery: (10)
As of 2015-07-01 21:48 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    The top three priorities of my open tasks are (in descending order of likelihood to be worked on) ...









    Results (22 votes), past polls