PerlMonks  

Re: Natural Language Index Stemming

by cjf (Parson)
on Jun 18, 2002 at 04:58 UTC (#175274)


in reply to Natural Language Index Stemming

As for Lingua::Stem, I just tried out a few examples from Stemming Performance that you linked to:

    use strict;
    use Lingua::Stem;

    my $stemmer = Lingua::Stem->new();
    my @words   = qw/maintained maintenance environment experience/;
    my $stems   = $stemmer->stem(@words);
    print "$_ " for (@$stems);

The output was:

maintain mainten environ experi

So it appears to have failed to merge maintain with maintenance(?), but it correctly dealt with the environment/experience difference described on that page. This is the first time I've looked into the subject, so I could be a fair bit off the mark :).
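To make the merging question easier to eyeball, one could group the input words by the stem they produce. This is only a rough sketch: the sub names group_by_stem and report_stem_groups are made up for illustration, and it assumes stem() returns one stem per input word, in the same order, as it did in the run above.

```perl
use strict;
use warnings;

# Collect the words that share a stem.
# Takes two array refs of equal length: the original words and their stems.
# (Hypothetical helper, not part of Lingua::Stem.)
sub group_by_stem {
    my ($words, $stems) = @_;
    my %groups;
    for my $i (0 .. $#{$words}) {
        push @{ $groups{ $stems->[$i] } }, $words->[$i];
    }
    return \%groups;
}

# Stem the given words with Lingua::Stem (loaded at run time)
# and print one line per stem, followed by the words that map to it.
# (Hypothetical helper as well.)
sub report_stem_groups {
    my @words = @_;
    require Lingua::Stem;
    my $stemmer = Lingua::Stem->new();
    my $stems   = $stemmer->stem(@words);
    my $groups  = group_by_stem(\@words, $stems);
    for my $stem (sort keys %$groups) {
        print "$stem: @{ $groups->{$stem} }\n";
    }
    return $groups;
}
```

Calling report_stem_groups(qw/maintained maintenance environment experience/) should, going by the output quoted above, put each of the four words in its own group. Words that the stemmer does merge would show up together on one line.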

As for other (sort of) related modules, I've found TheDamian's Lingua::EN::Inflect to be useful (and fun) to use on occasion. I'm not sure how much that applies to your question though.

++ for an interesting thread, I look forward to hearing what your conclusions are.

Edited 18 June 2002 (footpad): Fixed broken </code> tag.

