in reply to Re: Re: Non-English posts on Perlmonks
in thread Non-English posts on Perlmonks

I don't know about you, but to me (and others I know), rot13 is blatantly obvious, and I doubt padding/punctuation would change that. I don't think making up a plausible language within an hour is possible.

Re: Re: Re: Re: Non-English posts on Perlmonks
by Anonymous Monk on Jul 13, 2003 at 21:45 UTC
    rot13 is blatantly obvious

    Okay, rot14 then ;-P

    Really though, languages follow many simple patterns. You could take a couple hundred posts from Perlmonks (or anywhere else) and analyze them for average word length and word order. Taking your post, with 'A' for an alphabetical character, 'N' for a digit and 'P' for punctuation (you could obviously get more specific here), you get:

    A AAAPA AAAA AAAAA AAAP AAA AA AA PAAA AAAAAA A AAAAPP AAANN AA AAAAAAAAA AAAAAAP ...
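
    For anyone who wants to reproduce that line, here is a minimal sketch in Python. The function name shape is made up for illustration, and treating every character that is not a letter, digit or whitespace as 'P' is my assumption about what "punctuation" should cover.

```python
def shape(text):
    """Map each character to A (letter), N (digit) or P (other non-space).

    Whitespace is kept as a single space so word boundaries survive.
    The A/N/P classes follow the post; lumping everything else into P
    is an assumption made for this sketch.
    """
    out = []
    for ch in text:
        if ch.isspace():
            out.append(" ")
        elif ch.isalpha():
            out.append("A")
        elif ch.isdigit():
            out.append("N")
        else:
            out.append("P")
    return "".join(out)


print(shape("I don't know about you, but to me (and others I know), rot13 is"))
```

    Run over a few hundred posts, the output of shape is the raw material for the word-length and punctuation statistics.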

    You then average out the structure of the words and create general rules like 'a one-letter word is seldom followed by another one-letter word' and 'this type of punctuation occurs every X letters.' You use these rules to define an acceptable level of variation and then use a random generator to pick values within that variation. You then account for certain letters occurring more often than others and assign them accordingly. Dead simple.
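
    A rough sketch of that idea in Python, under loud assumptions: the names learn, weighted and generate are hypothetical, the only "rule" learned is which word length tends to follow which, letters are drawn independently by frequency, and punctuation is left out entirely. It is one possible reading of the recipe, not a finished fake-language generator.

```python
import random
from collections import Counter, defaultdict


def learn(sample):
    """Collect word-length transitions, word lengths and letter counts."""
    # Keep only the letters of each word; drop words with no letters at all.
    words = ["".join(c for c in w.lower() if c.isalpha()) for w in sample.split()]
    words = [w for w in words if w]
    follows = defaultdict(Counter)  # previous word length -> counts of next length
    for prev, cur in zip(words, words[1:]):
        follows[len(prev)][len(cur)] += 1
    lengths = Counter(len(w) for w in words)
    letters = Counter("".join(words))
    return follows, lengths, letters


def weighted(counter, rng):
    """Pick one key from a Counter with probability proportional to its count."""
    keys = list(counter)
    return rng.choices(keys, weights=[counter[k] for k in keys])[0]


def generate(model, n_words, seed=0):
    """Emit n_words fake words that follow the learned statistics."""
    follows, lengths, letters = model
    rng = random.Random(seed)  # seeded so a run can be repeated
    out, prev = [], None
    for _ in range(n_words):
        # Use the length-follows-length rule when we have data for it,
        # otherwise fall back to the overall word-length distribution.
        options = follows.get(prev) or lengths
        length = weighted(options, rng)
        out.append("".join(weighted(letters, rng) for _ in range(length)))
        prev = length
    return " ".join(out)


model = learn("rot13 is blatantly obvious, and I doubt padding would change that")
print(generate(model, 8, seed=1))
```

    With a sample that small the output is just noise; the point is only that the statistics come from real text and the generator stays within them. An empty sample would make the random pick fail, so feed it actual posts.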

    Again though, it's hardly worth worrying about, but it is a neat (SIMPLE) academic exercise.