PerlMonks  

Re: Scraping HTML: orthodoxy and reality

by John M. Dlugosz (Monsignor)
on Jul 10, 2003 at 18:21 UTC (#273086)


in reply to Scraping HTML: orthodoxy and reality

OK, you convinced me to use regexes instead of a parser for my program. That avoids the problem of re-serializing the parse tree so that the output resembles the original input (I can easily modify the matched lines in place), and I can live with the limitations of a regex-based "parser" by simply not writing goofy stuff in my HTML.
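
A minimal sketch of that in-place approach, under stated assumptions: the task (rewriting a link prefix), the pattern, and the name `rewrite_links` are all hypothetical and not taken from the program mentioned above, and the sketch is in Python purely for illustration. It only shows why a substitution leaves the surrounding formatting alone, which is the point about not having to rebuild the page from a parse tree.

```python
import re

# Hypothetical pattern: captures the href value of an <a> tag.
# Assumes double-quoted attribute values and no ">" inside the tag,
# i.e. exactly the "no goofy stuff" HTML the post agrees to write.
HREF_RE = re.compile(r'(<a\s+[^>]*?href=")([^"]*)(")', re.IGNORECASE)


def rewrite_links(html, old_prefix, new_prefix):
    """Swap old_prefix for new_prefix in href values; leave all other text untouched."""

    def fix(match):
        url = match.group(2)
        if url.startswith(old_prefix):
            url = new_prefix + url[len(old_prefix):]
        # Re-emit the text around the URL exactly as it was matched,
        # so spacing, case, and attribute order are preserved.
        return match.group(1) + url + match.group(3)

    return HREF_RE.sub(fix, html)
```

Calling `rewrite_links(page, "/old/", "/new/")` changes only the matched URLs; indentation, tag case, and attribute order elsewhere in `page` come back byte-for-byte identical. The trade-off is the one already conceded: input that breaks the pattern's assumptions (single-quoted or unquoted attributes, for example) is silently skipped.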

—John

