I used the Lingua::Stem when I made concordances of some Shakespeare and Melville texts that I dowloaded from Project Gutenberg. I found that the stemming was quite conservative for my purposes, erring on the side of avoiding collisions.

My more challenging problem was the proper choice of stoplist words, which would not be indexed at all.

I will someday integrate stemming into my Style and Spelling Checker, I hope.

It should work perfectly the first time! - toma


In reply to Re: Natural Language Index Stemming by toma
in thread Natural Language Index Stemming by rob_au

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post; it's "PerlMonks-approved HTML":