in reply to Regex Searching the WWW
If you had a sufficient corpus of English (American? ;-)) texts, you could use something like Ted Pedersen's Ngram Statistics Package to find word bigrams (see also: Corpus linguistics). One place to start might be the UCL Survey of English Usage.
HTH,
planetscape
In Section
Seekers of Perl Wisdom