I'm currently researching cross-lingual digital libraries and I use Perl, although I'm fairly new to the language. I have just finished writing a light stemmer, some n-gram code, and some n-gram comparison code, and basically I'm at that 'generating stats' stage. I'm looking for similarities and differences between documents, and after that I'll look at language, context, and so on. The idea is to make documents searchable in many different languages. I did a master's where I used Java and built a system that could take an English document and retrieve similar ones in French and German... it kinda worked ;)
I'm always interested in hearing what others are up to in that area. Maybe we can swap some tools and share some ideas!