Beefy Boxes and Bandwidth Generously Provided by pair Networks
Perl: the Markov chain saw
 
PerlMonks  

Re^3: Brainstorming session: detecting plagiarism

by planetscape (Chancellor)
on Jun 09, 2005 at 05:48 UTC ( [id://464972]=note: print w/replies, xml ) Need Help??


in reply to Re^2: Brainstorming session: detecting plagiarism
in thread Brainstorming session: detecting plagiarism

You might also want to check out Ted Pedersen's Ngram Statistics Package, with regard to the problem of improbable word pairs. The output can be easily sorted to highlight least likely occurrences. Of course you would want to compare to a corpus (of written English, say), to get a fairly good idea of "normal" parameters.

Good luck, and keep us posted, please!

planetscape
  • Comment on Re^3: Brainstorming session: detecting plagiarism

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://464972]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others examining the Monastery: (5)
As of 2024-04-23 18:33 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found