http://www.perlmonks.org?node_id=92785


in reply to Parsing Cell Contents of Extracted HTML Tables

I can't solve your problem, but I can tell you that the 'decode' option only toggles the use of HTML::Entities. Look into the 'br_translate' option, which translates <br> to \n, to eliminate the strange concatenation.
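
Here is a rough sketch of what I mean, assuming a current HTML::TableExtract where 'br_translate' is passed to the constructor and tables are read through tables() and rows(). The sample HTML, the header names and the notes_cell helper are made up for illustration, so adapt them to your own page.

```perl
use strict;
use warnings;
use HTML::TableExtract;

# Made-up sample table: one cell contains a <br> between two lines of text.
my $html = <<'HTML';
<table>
  <tr><th>Name</th><th>Notes</th></tr>
  <tr><td>alpha</td><td>first line<br>second line</td></tr>
</table>
HTML

# Hypothetical helper: parse the sample and return the text of the
# "Notes" cell from the first data row, with br_translate on or off.
sub notes_cell {
    my ($br_translate) = @_;
    my $te = HTML::TableExtract->new(
        headers      => [ 'Name', 'Notes' ],
        br_translate => $br_translate,
    );
    $te->parse($html);
    $te->eof;
    my ($table) = $te->tables;   # first table matching the headers
    my ($row)   = $table->rows;  # first row after the header row
    return $row->[1];
}

# With br_translate on, the <br> should come back as a newline;
# with it off, the two lines should run together.
for my $flag (1, 0) {
    (my $shown = notes_cell($flag)) =~ s/\n/\\n/g;
    print "br_translate => $flag: $shown\n";
}
```

If the two lines still come out glued together with br_translate on, the break in your source is probably not a plain <br> tag, and it would be worth looking at the raw HTML of that cell.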

Perhaps you could use the information extracted from the table to reparse the file for links and such.

--
Check out my Perlmonks Related Scripts like framechat, reputer, and xNN.