PerlMonks  

Re: NLP - natural language regex-collections?

by perlcapt (Pilgrim)
on Oct 17, 2004 at 00:54 UTC (#399851)


in reply to NLP - natural language regex-collections?

I have played with something similar while developing a ship command and navigation simulator (eventually to have voice recognition and generation I/O, but currently just text-based). The experience I'm drawing on is a CAI (Computer-Aided Instruction) system that was all the rage in the '70s: PLATO V.

The problem they solved was interpreting free-form text as logical relationships among key words: essentially a thesaurus that worked many-to-one. The logical statements the system could recognize were written in terms of the key words, and the free text was parsed down to those same key words.

This was amazingly effective, and uncanny for the users. The implementation is simple in Perl, using its text-parsing power and hashes. I'll dig around and see what Perl I have for this.
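
In the meantime, here is a minimal sketch of the many-to-one idea, not the PLATO implementation and not code from the simulator. The vocabulary, the action names, and the helpers `keywords` and `interpret` are all hypothetical, made up for illustration. One hash folds synonyms onto a key word, and a second hash holds the recognizable statements written in key words only.

```perl
use strict;
use warnings;

# Hypothetical many-to-one thesaurus: every synonym maps to one key word.
my %keyword_for = (
    turn    => 'TURN',      steer     => 'TURN',
    left    => 'PORT',      port      => 'PORT',
    right   => 'STARBOARD', starboard => 'STARBOARD',
    stop    => 'STOP',      halt      => 'STOP',
    engine  => 'ENGINE',    engines   => 'ENGINE',
);

# Hypothetical recognizable statements, written in key words only.
my %action_for = (
    'TURN PORT'      => 'rudder_port',
    'TURN STARBOARD' => 'rudder_starboard',
    'STOP ENGINE'    => 'engines_stop',
);

# Reduce free text to its key words; words not in the thesaurus are dropped.
sub keywords {
    my ($text) = @_;
    return grep { defined } map { $keyword_for{$_} } split /[^a-z]+/, lc $text;
}

# Look up the key-word sequence; returns undef if no statement matches.
sub interpret {
    my ($text) = @_;
    return $action_for{ join ' ', keywords($text) };
}

print interpret('Please steer to the left'), "\n";   # prints "rudder_port"
```

This version is sensitive to word order ("to the left, steer" reduces to `PORT TURN` and matches nothing). Sorting the key words before the lookup would be one way to relax that, at the cost of conflating statements that differ only in order.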

Update:

I just started looking at the Lingua:: modules. There is a lot there. It certainly is a good place to start. Anyone have any experience with these modules?


Re^2: NLP - natural language regex-collections?
by erix (Vicar) on Oct 17, 2004 at 07:21 UTC
    Thesaurus mapping many to one. That is indeed where I expect the best possibilities. A thesaurus that includes multiword phrases, up to sentences. I was thinking of a database that just stores all sentences it encounters, minus some pre-storage streamlining via stemming and problem domain jargon identification.

    But I know from experience that it is easier to talk about it than to implement useful code :)

    I must take a closer look at the Lingua:: stuff; it will take some time. It seems that most of it is word-based, not phrase- or sentence-based (as I was hoping).

    Thanks. I will let you know what I find.
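
To make the phrase-level thesaurus idea above concrete, here is a rough sketch under stated assumptions: the phrase table, the canonical tokens, and the `normalize` helper are hypothetical, and the "streamlining" is only lowercasing and punctuation stripping (no stemming and no jargon identification). Phrases are tried longest first so that a multiword phrase wins over a single word it contains.

```perl
use strict;
use warnings;

# Hypothetical phrase thesaurus: several phrases map to one canonical token.
my %canon_for = (
    'full speed ahead'    => 'SPEED_MAX',
    'flank speed'         => 'SPEED_MAX',
    'as fast as possible' => 'SPEED_MAX',
    'dead slow'           => 'SPEED_MIN',
    'slow'                => 'SPEED_MIN',
);

# Build one alternation, longest phrase first, so 'dead slow' beats 'slow'.
my $alternation = join '|',
    map  { quotemeta($_) }
    sort { length($b) <=> length($a) } keys %canon_for;
my $phrase_re = qr/\b($alternation)\b/;

# Crude streamlining, then replace each known phrase by its canonical token.
sub normalize {
    my ($sentence) = @_;
    my $text = lc $sentence;
    $text =~ s/[^a-z\s]+//g;        # drop punctuation and digits
    $text =~ s/\s+/ /g;             # collapse runs of whitespace
    $text =~ s/^ | $//g;            # trim leading and trailing space
    $text =~ s/$phrase_re/$canon_for{$1}/g;
    return $text;
}

# Stand-in for the sentence database: count each normalized sentence seen.
my %seen;
$seen{ normalize($_) }++ for 'Full speed ahead!', 'Flank speed', 'Dead slow, please';
```

After the loop, the first two sentences share one key in `%seen`, which is the many-to-one effect at phrase level. A real store would presumably be a database rather than an in-memory hash.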