stabu has asked for the wisdom of the Perl Monks concerning the following question:
Hi
I'm using Perl on Win32, and every now and again I have to extract information from an HTML page. I use regexes to tell Perl precisely what I want pulled out, but this requires close study of the HTML source and much trial and error on the regexes themselves. What tools do you use for analysing HTML at such depth? I started with Word (ha!), then Notepad, then EditPlus, and now Vim. All give a good view of the HTML source, but each has its differences from Perl's regexes, so the trial-and-error factor is still very high. Does anybody have any suggestions?
Thanks in advance for answers.
Replies are listed 'Best First'.
Re: html analysis tool via regex
by davorg (Chancellor) on Oct 13, 2005 at 09:12 UTC
Re: html analysis tool via regex
by marto (Cardinal) on Oct 13, 2005 at 08:24 UTC
by pajout (Curate) on Oct 13, 2005 at 08:33 UTC
Re: html analysis tool via regex
by GrandFather (Saint) on Oct 13, 2005 at 09:22 UTC
Re: html analysis tool via regex
by saintmike (Vicar) on Oct 13, 2005 at 08:17 UTC
Re: html analysis tool via regex
by jbrugger (Parson) on Oct 13, 2005 at 08:19 UTC
Re: html analysis tool via regex
by stabu (Scribe) on Oct 13, 2005 at 10:01 UTC
by planetscape (Chancellor) on Oct 14, 2005 at 04:35 UTC