Beefy Boxes and Bandwidth Generously Provided by pair Networks
laziness, impatience, and hubris
 
PerlMonks  

Re: [OT] Ethical and Legal Screen Scraping

by tlm (Prior)
on Jul 26, 2005 at 03:16 UTC ( #478061=note: print w/replies, xml ) Need Help??


in reply to [OT] Ethical and Legal Screen Scraping

For example, suppose I write a little tool using LWP::UserAgent or WWW::Mechanize (rather than LWP::RobotUA or WWW::Mechanize::Polite ?, say) that simply collects a number of web pages for me while I sleep. Is it illegal or unethical for such a scraper to ignore robots.txt?

Whether it is legal or not I won't get into, but AFAIC, I don't have any ethical objection to such a tool as long as it doesn't impose a greater load on the target server(s) than you would if you were to perform the same task manually.

the lowliest monk

  • Comment on Re: [OT] Ethical and Legal Screen Scraping

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://478061]
help
Chatterbox?
[Corion]: A pleasant daypart to everybody!

How do I use this? | Other CB clients
Other Users?
Others scrutinizing the Monastery: (7)
As of 2018-05-23 07:25 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?
    Notices?