Yes nimdoc, rather uncanny I agree.
Anyway I think we are a long way from Lingua::Lit::Condense
yet but it would great for students indeed.
Imagine,
condense prideandprejudice.txt
"Human bonding is complex"
condense oneflewover.txt
"Insanity is relative"
condense 1984.txt
"Government watches people"
cool!
Anyway, I searched ibiblio/gutenberg and turned found no lunch
nor on Google. Which confirms there is no free lunch.
I found a couple of excerpts though to cat together. They were formatted
pretty weirdly so a quick blast with
perl -pi -e 's/\W/ /g' lunch.txt
and we lost all the punctuation, which imho is best for this sort
of pseudo science (the loss of symbols to untrapped punctuation outweighs their syntactic value).
Anyway Burrows ....
I have picked the choice few sentences out of 15 or so, truncated some,
Old Pete men suck the Florida tan the same about you
taboos curses and subway is lurking in Washington Square a cell of rotten
he has scored he cruises the same dirty junky he hangs off
Thanks kid I am evidently his sharkskin suit
And the bodys string of inquiry ruling that anesthizes his enveloping presence
all over from shooting in my kid I ll wipe your release
- absolute filth imho, but who should concerned mothers sue? Mr Markov
or Mr Burrows?, or Me?.
Still searching for Joyce.
Andy
-
Are you posting in the right place? Check out Where do I post X? to know for sure.
-
Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
<code> <a> <b> <big>
<blockquote> <br /> <dd>
<dl> <dt> <em> <font>
<h1> <h2> <h3> <h4>
<h5> <h6> <hr /> <i>
<li> <nbsp> <ol> <p>
<small> <strike> <strong>
<sub> <sup> <table>
<td> <th> <tr> <tt>
<u> <ul>
-
Snippets of code should be wrapped in
<code> tags not
<pre> tags. In fact, <pre>
tags should generally be avoided. If they must
be used, extreme care should be
taken to ensure that their contents do not
have long lines (<70 chars), in order to prevent
horizontal scrolling (and possible janitor
intervention).
-
Want more info? How to link
or How to display code and escape characters
are good places to start.
|