PerlMonks  

Re: Comparing sets of phrases stored in a database?

by erix (Vicar)
on Sep 30, 2012 at 20:33 UTC (#996537)


in reply to Comparing sets of phrases stored in a database?

Interesting question.

PostgreSQL has some text tools that may already be adequate for comparing phrases stored in such a database:

The built-in full-text search (includes indexing, parsing, stemming, ranking). [1]

The extension pg_trgm (trigrams). Can be used to index, provides similarity functions. [2]

The extension fuzzystrmatch (with soundex, levenshtein etc.). [3]
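
To give a feel for what the trigram and Levenshtein functions measure, here is a rough pure-Python sketch. It is not the extensions' actual code: `trigrams`, `similarity` and `levenshtein` below are illustrative helpers written for this post. The word splitting assumes plain ASCII letters and digits, and the padding and shared-over-union ratio follow the description in the pg_trgm documentation. In the database itself you would just call the extension's functions from SQL.

```python
import re


def trigrams(text):
    """Set of trigrams for text, approximating the pg_trgm description."""
    grams = set()
    # Assumption: words are runs of ASCII letters/digits, compared case-insensitively.
    for word in re.findall(r"[a-z0-9]+", text.lower()):
        # Each word is padded with two leading spaces and one trailing space.
        padded = "  " + word + " "
        for i in range(len(padded) - 2):
            grams.add(padded[i:i + 3])
    return grams


def similarity(a, b):
    """Shared trigrams divided by all distinct trigrams (0.0 to 1.0)."""
    ta, tb = trigrams(a), trigrams(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def levenshtein(a, b):
    """Edit distance: minimum number of single-character edits turning a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution
        prev = cur
    return prev[-1]


if __name__ == "__main__":
    print(similarity("word", "two words"))
    print(levenshtein("kitten", "sitting"))
```

Trigram similarity tolerates reordered or extra words reasonably well, while edit distance is more suited to catching typos in short strings, so which one fits depends on how the phrases differ.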

[1] http://www.postgresql.org/docs/current/static/textsearch.html

[2] http://www.postgresql.org/docs/current/static/pgtrgm.html

[3] http://www.postgresql.org/docs/current/static/fuzzystrmatch.html