A long time ago, before RSS feeds, I wrote a program to parse various newspaper websites and did the regexes by hand. I had 24 different rules for 90+ papers. When I rewrote it, I got it down to 9 rules, mainly based on web page design, since I used a parsing engine.
You're going to save yourself a ton of work since if the data changes you're going to have to rewrite your regexes each time.
To disagree, one doesn't have to be disagreeable - Barry Goldwater