Here are a couple more (very specific) hints:
- Uncomment the print page line so you can see the content you are actually scraping (or just go to the appropriate URL and view source).
- Change this part of the regex, since it is apparently out of date: <td\sclass="coauthor"\salign="right"\sbgcolor="[^"]+">
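To illustrate the second hint, here is a minimal sketch of how you might check whether a regex fragment still matches the markup. It is written in Python only for illustration; the same pattern syntax works in Perl. The sample HTML, the looser pattern, and the helper name fragment_matches are all assumptions of mine, not the site's real markup or a finished fix, so compare against what the print page line actually shows you.

```python
import re

# Hypothetical sample of what the page might contain now. Replace it with
# the real markup taken from the printed page or from "view source".
sample_html = '<td class="coauthor" align="left">Jane Doe</td>'

# The fragment quoted above, which pins the exact attributes and their order.
old_fragment = r'<td\sclass="coauthor"\salign="right"\sbgcolor="[^"]+">'

# An assumed looser variant: it only requires class="coauthor" and accepts
# any other attributes around it.
loose_fragment = r'<td\b[^>]*\bclass="coauthor"[^>]*>'


def fragment_matches(fragment, html):
    """Return True if the regex fragment matches anywhere in html."""
    return re.search(fragment, html) is not None


for name, fragment in [("old", old_fragment), ("loose", loose_fragment)]:
    print(name, fragment_matches(fragment, sample_html))
```

Testing one fragment at a time like this tends to show quickly which piece of a long regex stopped matching after the site changed its HTML.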
Also, I don't mean to be a jerk, but it really is better for you if you work through this yourself. Instead of sending me messages, show what you are trying here in the thread; people will be more willing to help once they've seen that you are indeed making a noble effort. As the ancient saying goes: "Monks help those that help themselves!"