You might also look at HTML::TokeParser to get at the information you're after. I used it a while back to pull headlines from newspaper sites, and it worked well for me.
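
For what it's worth, here is a minimal sketch of the kind of thing I mean. It is not the script I actually used: the sample markup, the choice of the h2 tag, and the extract_headlines helper are all made up for illustration, so adjust them to whatever the page you're scraping really uses. The only module calls involved are new, get_tag and get_trimmed_text.

```perl
use strict;
use warnings;
use HTML::TokeParser;

# Hypothetical sample page; a real site will use different markup.
my $html = <<'HTML';
<html><body>
<h2 class="headline">First story</h2>
<p>Body text.</p>
<h2 class="headline">Second   story</h2>
</body></html>
HTML

# Illustrative helper (not part of HTML::TokeParser): returns the text
# found inside every <$tag>...</$tag> pair in the document.
sub extract_headlines {
    my ($html_ref, $tag) = @_;

    # Passing a scalar reference makes the parser read the string itself
    # rather than treating the argument as a file name.
    my $parser = HTML::TokeParser->new($html_ref)
        or die "Cannot create parser";

    my @headlines;
    # get_tag skips ahead to the next opening tag of the given name and
    # returns undef once the document is exhausted.
    while (my $token = $parser->get_tag($tag)) {
        # get_trimmed_text reads up to the closing tag, trims the ends
        # and collapses runs of whitespace into single spaces.
        push @headlines, $parser->get_trimmed_text("/$tag");
    }
    return @headlines;
}

print "$_\n" for extract_headlines(\$html, 'h2');
```

If the headlines on your page are marked by a class or id rather than by the tag name alone, the array ref returned by get_tag carries the attributes as a hash ref in its second element ($token->[1]), so you can filter on that inside the loop.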

