PerlMonks
Re^3: Brainstorming session: detecting plagiarism
by planetscape (Chancellor) on Jun 09, 2005 at 05:48 UTC ( [id://464972]=note )
You might also want to check out Ted Pedersen's Ngram Statistics Package for the problem of improbable word pairs. Its output can easily be sorted to highlight the least likely occurrences. Of course, you would want to compare against a reference corpus (of written English, say) to get a fairly good idea of "normal" parameters. Good luck, and keep us posted, please!
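To make the idea a bit more concrete, here is a rough sketch of the kind of ranking I mean. It is written in Python and does not use the Ngram Statistics Package at all; the function names (tokenize, build_model, score_pairs) and the tiny sample texts are made up for illustration, and the add-one smoothed conditional probability is just one simple stand-in for the association measures a real package would offer. It counts word pairs in a reference text, scores each pair in the text under suspicion, and sorts so the least likely pairs come first.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def bigrams(tokens):
    """Return the list of adjacent word pairs."""
    return list(zip(tokens, tokens[1:]))


def build_model(reference_text):
    """Count single words and word pairs in the reference corpus."""
    tokens = tokenize(reference_text)
    return Counter(tokens), Counter(bigrams(tokens))


def score_pairs(text, unigram_counts, bigram_counts):
    """Rank the distinct word pairs of `text`, least likely first.

    Each pair (w1, w2) gets log P(w2 | w1) estimated from the reference
    counts with add-one smoothing. The count of w1 is used as the
    denominator, which is a simplification (hypothetical choice, not
    taken from any particular package).
    """
    vocab_size = len(unigram_counts) + 1  # one extra slot for unseen words
    scored = {}
    for w1, w2 in set(bigrams(tokenize(text))):
        p = (bigram_counts[(w1, w2)] + 1) / (unigram_counts[w1] + vocab_size)
        scored[(w1, w2)] = math.log(p)
    # Sort by score, then alphabetically so ties come out in a stable order.
    return sorted(scored.items(), key=lambda item: (item[1], item[0]))


# Toy data only; a real reference corpus would need to be far larger.
reference = "the cat sat on the mat the dog sat on the rug the cat chased the dog"
suspect = "the cat sat on the purple dog"

unigram_counts, bigram_counts = build_model(reference)
for pair, log_p in score_pairs(suspect, unigram_counts, bigram_counts):
    print(f"{log_p:8.3f}  {pair[0]} {pair[1]}")
```

With a reference corpus this small the scores mean very little, which is exactly why the choice of corpus matters: the "normal" parameters are only as good as the text they were counted from.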
planetscape