PerlMonks  

Re: search a foreign directory

by suaveant (Parson)
on Apr 24, 2001 at 21:04 UTC (#75142)


in reply to search a foreign directory

Well, the easy way would be to use something like wget, which supports recursive downloads, and then search the downloaded copy locally.
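
Once a mirror of the site is on disk (for example from a recursive wget run), the local search could be sketched roughly like this. It uses only the core File::Find module; the function name `search_mirror` and the choice to look only at .html/.htm files are assumptions for illustration, not something the original question specifies.

```perl
use strict;
use warnings;
use File::Find;

# Hypothetical helper: return the paths of mirrored HTML files whose
# contents match $pattern (a compiled regex), sorted by path.
sub search_mirror {
    my ($dir, $pattern) = @_;
    my @hits;
    find(
        {
            no_chdir => 1,    # keep $_ as the full path of each file
            wanted   => sub {
                return unless -f $_ && /\.html?$/i;
                open my $fh, '<', $_ or return;    # skip unreadable files
                local $/;                          # slurp the whole file
                my $text = <$fh>;
                close $fh;
                push @hits, $_ if defined $text && $text =~ $pattern;
            },
        },
        $dir,
    );
    return sort @hits;
}
```

Something like `print "$_\n" for search_mirror('./mirror', qr/perl/i);` would then list the matching files, where `./mirror` stands in for wherever the download ended up.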

Or you could write a web spider of your own in Perl using LWP and search the pages each time (or make a local copy, as with wget). It would probably be a good idea to cache the pages locally for a while and search the cached copies, then rebuild each link so that the results point to the actual site.
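
A minimal sketch of such a spider, assuming LWP::UserAgent, HTML::LinkExtor and URI are installed. The names `crawl_site` and `lwp_fetch`, the page limit, and the idea of passing the fetcher in as a code ref (so a local cache could be slotted in front of it) are all assumptions for illustration, not a definitive design.

```perl
use strict;
use warnings;
use LWP::UserAgent;
use HTML::LinkExtor;
use URI;

# Default fetcher: returns the page body, or undef on failure.
sub lwp_fetch {
    my ($url) = @_;
    my $ua  = LWP::UserAgent->new(timeout => 10);
    my $res = $ua->get($url);
    return $res->is_success ? $res->decoded_content : undef;
}

# Hypothetical spider: crawl pages on the same host as $start, up to
# $max pages, and return a hash ref of url => page body.
# $fetch is a code ref (url -> body); it defaults to lwp_fetch.
sub crawl_site {
    my ($start, $max, $fetch) = @_;
    $fetch ||= \&lwp_fetch;
    my $host  = URI->new($start)->host;
    my @queue = ($start);
    my (%seen, %pages);

    while (@queue && keys(%pages) < $max) {
        my $url = shift @queue;
        next if $seen{$url}++;
        my $body = $fetch->($url);
        next unless defined $body;
        $pages{$url} = $body;

        # Collect <a href> links, made absolute against the current page.
        my $extor = HTML::LinkExtor->new(undef, $url);
        $extor->parse($body);
        $extor->eof;
        for my $link ($extor->links) {
            my ($tag, %attr) = @$link;
            next unless $tag eq 'a' && $attr{href};
            my $uri = URI->new("$attr{href}");
            next unless $uri->scheme && $uri->scheme =~ /^https?$/;
            next unless $uri->host eq $host;    # stay on the one site
            $uri->fragment(undef);              # drop #anchors
            push @queue, $uri->as_string;
        }
    }
    return \%pages;
}
```

Because the keys of the returned hash are the original URLs, search hits can link straight back to the actual site, while the bodies can be searched (or written to a cache) locally.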

Of course, Google and AltaVista have an option to search within a specific domain, so if the pages are indexed there you could just use them :)

Update: BTW, to use the domain searching in AltaVista and Google, go to their Advanced Search pages.
                - Ant


Re: search a foreign directory
by cLive ;-) (Parson) on Apr 24, 2001 at 23:29 UTC
    You don't have to go to the Advanced Search page. AltaVista supports (amongst others) these nice little shortcuts:

    +host:domain.com - only search for results on this domain
    +link:domain.com - only search for results that link to domain.com

    The latter is useful when you want to see who's linked to your site :)

    Just include them along with your search term to restrict the results. This means you can easily create a search of the site by auto-populating a search box with the +host:domain.com string and letting users add their term. Or use JavaScript to hide the +host: string and present an empty search box.
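
If the restriction is added on the server side instead of in the search box, building the query string might look something like this in Perl. The helper name `site_query` is hypothetical, and the sketch only assembles the string; where it gets submitted (and how it is URL-encoded) is left out because the search URL is not given here.

```perl
use strict;
use warnings;

# Hypothetical helper: prepend the +host: restriction to whatever
# the user typed, giving the string to submit as the search query.
sub site_query {
    my ($domain, $terms) = @_;
    $terms = '' unless defined $terms;
    $terms =~ s/^\s+|\s+$//g;    # trim stray whitespace
    return "+host:$domain" if $terms eq '';
    return "+host:$domain $terms";
}

print site_query('domain.com', 'web spider'), "\n";
# prints: +host:domain.com web spider
```

The same function would work for the +link: form by swapping the prefix.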

    cLive ;-)
