http://www.perlmonks.org?node_id=774558


in reply to Regex Searching the WWW

If you had a sufficient corpus of English (American? ;-)) texts, you could use something like Ted Pedersen's Ngram Statistics Package to find word bigrams (see also: Corpus linguistics). One place to start might be the UCL Survey of English Usage.

HTH,

planetscape