Beefy Boxes and Bandwidth Generously Provided by pair Networks
Do you know where your variables are?
 
PerlMonks  

Regex Searching the WWW

by Polyglot (Chaplain)
on Jun 24, 2009 at 11:01 UTC ( [id://774344]=perlquestion: print w/replies, xml ) Need Help??

Polyglot has asked for the wisdom of the Perl Monks concerning the following question:

A friend of mine recently wanted to do a search for every possible usage of the word "academic," as in "academic acceptance", "academic administration", "academic dean", etc. Does anyone here know of a good way to use regex on the WWW to acquire such a list, or simply to search online with a regex tool for any purpose? A good CPAN module, perhaps?

Blessings,

~Polyglot~

Replies are listed 'Best First'.
Re: Regex Searching the WWW
by davorg (Chancellor) on Jun 24, 2009 at 11:10 UTC
      Google, as programmers may be aware, does have a Regex search--but it is limited to programming code. They have not, so far as I am aware, made available any other regex searching of internet content in general. For what my friend was wanting, I think there were about two mentions of "academic" in someone's code somewhere. Obviously, that wasn't quite the answer he wanted.

      Blessings,

      ~Polyglot~

Re: Regex Searching the WWW
by apl (Monsignor) on Jun 24, 2009 at 11:55 UTC
    Is there a resaon you need to do this programmatically, as opposed to simply using Google?
      Google provides oodles of hits, but does not feed back a nice list, which is what I would want in this instance. I suppose if I were willing to click on every link provided by google, and then copy and paste the terms I was looking for into my own list, perhaps a few days from now I might have something close to complete. But that's a few days' work, and I'm lazy. That's why I want to use a little regex to make life easier.

      Blessings,

      ~Polyglot~

Re: Regex Searching the WWW
by planetscape (Chancellor) on Jun 25, 2009 at 00:49 UTC
Re: Regex Searching the WWW
by aixtal (Sexton) on Jun 24, 2009 at 19:16 UTC
    Check www.exalead.com. They do have a regexp search in the advanced search form.

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: perlquestion [id://774344]
Front-paged by Arunbear
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others sharing their wisdom with the Monastery: (3)
As of 2024-04-19 22:05 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found