in reply to Re^3: Any spider framework?
in thread Any spider framework?
In the case of <a name="foo"> it simply won't match, as the regexp includes href.
To be reliable, a parser (actually just a lexer; it could be regex based) should extract whole tags, and you should then test each on its own. That would be much more reliable.