in reply to fix the problem of the web crawler
Three hints (and a suggestion):
- The URLs that the script is generating are correct.
- The regex doesn't seem to match anything, because the styling markup on the website has changed.
- Your script has lots of other potential problems, many of which you can find by adding use strict; and use warnings; at the top.
- Considering you posted nearly the same wall of script a year ago, it might be worth paying someone to clean it up and make it work properly.
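To illustrate the second and third hints, here is a minimal sketch of how a script can run under strict and warnings and report when a pattern stops matching. The HTML fragment and the pattern are made up for illustration; they are not taken from your script or from the real site.

```perl
use strict;
use warnings;

# Hypothetical HTML fragment; the real page markup is not shown in this thread.
my $html = '<a href="/pers/b/Blinn.html">James F. Blinn</a>';

# Hypothetical pattern; replace it with one that fits the site's current markup.
my $pattern = qr{<a href="[^"]*">([^<]+)</a>};

# In list context with /g, the match returns every captured group.
# An empty list means the pattern no longer matches the page.
my @names = $html =~ /$pattern/g;

if (@names) {
    print "matched: $_\n" for @names;
}
else {
    warn "pattern matched nothing; the page markup has probably changed\n";
}
```

Checking for an empty result like this makes a silent failure visible right away, instead of leaving you with an empty output file.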
Edit: Is this how the output (conf.txt) is supposed to look? (I accept all PayPal alternatives... Just kidding... Sort of... But seriously, if this is the expected output and you follow my hints, you'll figure it out.)
1=James F. Blakesley=Frederick H. Wolf
1=James F. Blakesley=Keith S. Murray
1=James F. Blakesley=Dagmar Murray
2=James F. Blinn=Turner Whitted
2=James F. Blinn=Pat Hanrahan
2=James F. Blinn=Tomas Porter
2=James F. Blinn=Flip Phillips
2=James F. Blinn=Martin E. Newell
2=James F. Blinn=Jeffrey M. Lane
2=James F. Blinn=Nick England
2=James F. Blinn=Loren C. Carpenter
2=James F. Blinn=Alvy Ray Smith
2=James F. Blinn=Donna J. Cox
2=James F. Blinn=Helga M. Leonardt Hendriks
2=James F. Blinn=Charles T. Loop
2=James F. Blinn=Rob Pike
2=James F. Blinn=Richard Ellison
3=James F. Blowey=John W. Barrett
3=James F. Blowey=Stephen Langdon
3=James F. Blowey=John R. King
4=James F. Bowring=Mary Jean Harrold
4=James F. Bowring=James M. Rehg
4=James F. Bowring=Alessandro Orso
4=James F. Bowring=James A. Jones
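If that is the expected format, a quick way to sanity-check the file is to read the records back and group them. This is only a sketch: it assumes each record is id=author=coauthor, which is my reading of the sample above, and it uses a few hard-coded records instead of opening conf.txt.

```perl
use strict;
use warnings;

# A few records copied from the sample output above.
my @records = (
    '1=James F. Blakesley=Frederick H. Wolf',
    '1=James F. Blakesley=Keith S. Murray',
    '2=James F. Blinn=Turner Whitted',
);

# Assumed layout: id=author=coauthor (inferred from the sample, not confirmed).
my %coauthors;
for my $record (@records) {
    my ($id, $author, $coauthor) = split /=/, $record, 3;
    next unless defined $coauthor;    # skip malformed records
    push @{ $coauthors{$author} }, $coauthor;
}

# Print each author followed by the coauthors collected for them.
for my $author (sort keys %coauthors) {
    printf "%s: %s\n", $author, join(', ', @{ $coauthors{$author} });
}
```

If an author you expect is missing from the grouped output, the problem is in the scraping step, not in how the file is written.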