Beefy Boxes and Bandwidth Generously Provided by pair Networks
Keep It Simple, Stupid

Re: Natural Language Index Stemming

by toma (Vicar)
on Jun 18, 2002 at 06:23 UTC ( #175290=note: print w/replies, xml ) Need Help??

in reply to Natural Language Index Stemming

I used the Lingua::Stem when I made concordances of some Shakespeare and Melville texts that I dowloaded from Project Gutenberg. I found that the stemming was quite conservative for my purposes, erring on the side of avoiding collisions.

My more challenging problem was the proper choice of stoplist words, which would not be indexed at all.

I will someday integrate stemming into my Style and Spelling Checker, I hope.

It should work perfectly the first time! - toma

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://175290]
NodeReaper says "Shhhh! Be vewy vewy quiet, I'm hunting wumpus"

How do I use this? | Other CB clients
Other Users?
Others cooling their heels in the Monastery: (7)
As of 2018-01-18 16:05 GMT
Find Nodes?
    Voting Booth?
    How did you see in the new year?

    Results (212 votes). Check out past polls.