Beefy Boxes and Bandwidth Generously Provided by pair Networks
Clear questions and runnable code
get the best and fastest answer
 
PerlMonks  

Regex Searching the WWW

by Polyglot (Monk)
on Jun 24, 2009 at 11:01 UTC ( #774344=perlquestion: print w/ replies, xml ) Need Help??
Polyglot has asked for the wisdom of the Perl Monks concerning the following question:

A friend of mine recently wanted to do a search for every possible usage of the word "academic," as in "academic acceptance", "academic administration", "academic dean", etc. Does anyone here know of a good way to use regex on the WWW to acquire such a list, or simply to search online with a regex tool for any purpose? A good CPAN module, perhaps?

Blessings,

~Polyglot~

Comment on Regex Searching the WWW
Re: Regex Searching the WWW
by davorg (Chancellor) on Jun 24, 2009 at 11:10 UTC
      Google, as programmers may be aware, does have a Regex search--but it is limited to programming code. They have not, so far as I am aware, made available any other regex searching of internet content in general. For what my friend was wanting, I think there were about two mentions of "academic" in someone's code somewhere. Obviously, that wasn't quite the answer he wanted.

      Blessings,

      ~Polyglot~

Re: Regex Searching the WWW
by apl (Monsignor) on Jun 24, 2009 at 11:55 UTC
    Is there a resaon you need to do this programmatically, as opposed to simply using Google?
      Google provides oodles of hits, but does not feed back a nice list, which is what I would want in this instance. I suppose if I were willing to click on every link provided by google, and then copy and paste the terms I was looking for into my own list, perhaps a few days from now I might have something close to complete. But that's a few days' work, and I'm lazy. That's why I want to use a little regex to make life easier.

      Blessings,

      ~Polyglot~

Re: Regex Searching the WWW
by aixtal (Sexton) on Jun 24, 2009 at 19:16 UTC
    Check www.exalead.com. They do have a regexp search in the advanced search form.
Re: Regex Searching the WWW
by planetscape (Canon) on Jun 25, 2009 at 00:49 UTC

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: perlquestion [id://774344]
Front-paged by Arunbear
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others about the Monastery: (5)
As of 2014-09-23 10:04 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    How do you remember the number of days in each month?











    Results (216 votes), past polls