http://www.perlmonks.org?node_id=11133775

karlgoethebier has asked for the wisdom of the Perl Monks concerning the following question:

I have no serious idea at the moment and have done nothing so far. The background is that Kurt Schumacher claimed that Goethe had a vocabulary of about 29,000 words, whereas Adenauer had a vocabulary of only about 500 words.

Update: Thanks to all for the kind and inspiring replies. I guess Lingua::Stem is the way to go. I'll open another thread about tokenizing.

«The Crux of the Biscuit is the Apostrophe»