Collins Dictionaries were doing a lot of corpus linguistics using Perl when I left, back in 2002. They look after the Collins/Birmingham University Bank of English, which is a great big huge corpus. There are also a variety of monitor corpora, which are used to gauge changes in usage over time.
in reply to What are the monks doing with Perl and Linguistics?
Corpus data collection got a whole lot easier with the web ... ☺ -- Sitescooper is particularly handy for large-scale text collection (with permission, of course).
bowling trophy thieves, die!