Re: Parsing HTML files to recover data...

by Anonymous Monk
on Nov 22, 2006 at 06:10 UTC

in reply to Parsing HTML files to recover data...

I have had great success scraping data out of HTML files using XML::LibXML. It parses the HTML into a DOM tree and lets you run XPath searches for the data. While this may be overkill in terms of both the learning curve and CPU cycles, the code required to coax the data out of the files will be pretty simple. You may also end up with code that is easily changed to solve similar problems.
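
To give a rough idea of what that looks like, here is a minimal sketch. The sample markup, the "prices" table id, and the %price hash are made up for illustration and are not taken from the original question; load_html is the constructor in current XML::LibXML releases (older versions used a parser object with parse_html_string instead).

```perl
use strict;
use warnings;
use XML::LibXML;

# Hypothetical sample input; in practice you would read each file from disk
# (load_html also accepts location => $filename).
my $html = <<'HTML';
<html><body>
<table id="prices">
<tr><td>apple</td><td>1.20</td></tr>
<tr><td>pear</td><td>0.95</td></tr>
</table>
</body></html>
HTML

# recover => 1 asks libxml2 to carry on past the sloppy markup common in
# real-world pages; suppress_errors keeps the parser warnings quiet.
my $doc = XML::LibXML->load_html(
    string          => $html,
    recover         => 1,
    suppress_errors => 1,
);

# One XPath query selects the rows; relative queries pull out each cell.
my %price;
for my $row ($doc->findnodes('//table[@id="prices"]//tr')) {
    my $name  = $row->findvalue('./td[1]');
    my $value = $row->findvalue('./td[2]');
    $price{$name} = $value;
}

printf "%s => %s\n", $_, $price{$_} for sort keys %price;
```

Adapting this to a different page layout is mostly a matter of changing the XPath expressions, which is what makes the approach easy to reuse.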