You should get a copy of The General Inquirer: A Computer Approach to Content Analysis by Philip J. Stone, Dexter C. Dunphy, Marshall S. Smith, and Daniel M. Ogilvie (MIT Press, 1966; LCCN 66-22541). It describes software that does pretty much what you are talking about. You might also try googling the title as a starting point for finding more current projects. In general, a great deal of similar work has been done not in the computer science department but on the humanities side of campus. In fact, one journal you might want to read is Computers and the Humanities (or something similar; it has been a long time), since it dealt specifically with content analysis. Likewise, googling with the search terms +COMPUTER and +CONCORDANCE should turn up interesting material as well.
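In case the term is unfamiliar, a concordance is just a listing of every occurrence of a word together with the words around it. Here is a minimal Python sketch of the idea. It is not taken from the General Inquirer or any other project; the function name `concordance`, the `width` parameter, and the sample sentence are all made up for illustration.

```python
import re

def concordance(text, keyword, width=3):
    """Return keyword-in-context lines: up to `width` words on each side of every hit."""
    # Crude tokenizer (an assumption): runs of letters and apostrophes count as words.
    words = re.findall(r"[A-Za-z']+", text)
    target = keyword.lower()
    lines = []
    for i, word in enumerate(words):
        if word.lower() == target:
            left = " ".join(words[max(0, i - width):i])
            right = " ".join(words[i + 1:i + 1 + width])
            lines.append(f"{left} [{word}] {right}".strip())
    return lines

# Hypothetical sample text, just to show the output shape.
sample = "The pig sang badly. Nobody taught the pig to sing well."
for line in concordance(sample, "pig"):
    print(line)
```

Real concordance and content-analysis tools do a good deal more (stemming, category dictionaries, sorting by context), but this shows the basic shape of what those searches should turn up.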
"Never try to teach a pig to sing...it wastes your time and it annoys the pig."