PerlMonks  

Re^2: RFC: Is the Bible encoded in DNA?

by wstryder (Novice)
on May 14, 2018 at 17:50 UTC


in reply to Re: RFC: Is the Bible encoded in DNA?
in thread RFC: Is the Bible encoded in DNA?

What I really need is a mathematician to help me out a bit in advance, before I get any results. My computer is now crunching through 500 million base pairs. If letters are generated at random, the chance of getting one particular English sentence 100 characters long is one in 26^100. That is astronomically small: there are only about 10^80 atoms in the entire universe. But with the way I'm searching the DNA, what is the probability of finding a sequence 100 characters in length? The math is beyond my abilities. So far I've got 42 characters for Lorem Ipsum and 45 for the Hebrew Bible, so the results do not in any way suggest a divine author for DNA. But what result would? What would be needed to convince people? 100 characters, 5000 characters, or more?
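One rough way to put numbers on this, as a sketch rather than an answer for the actual script: assume every decoded character is independent and uniform over an alphabet of size A, that N start positions are tried, and that E different encodings are tried per position. The expected number of chance matches of one fixed target of length L is then about N*E/A^L, and a match is unsurprising below the length where that count falls to 1. The function names and the choice E = 1 are illustrative assumptions, not taken from the script; a search with more freedom corresponds to a larger E.

```perl
use strict;
use warnings;

# ASSUMED null model: characters are independent and uniform over
# $alphabet symbols; $positions start points and $encodings candidate
# encodings are tried. Everything is done in logs to avoid underflow.

# log10 of the expected number of chance matches of a fixed target
# of length $len.
sub log10_expected_matches {
    my ($positions, $encodings, $alphabet, $len) = @_;
    return (log($positions) + log($encodings) - $len * log($alphabet)) / log(10);
}

# Target length at which the expected number of chance matches is 1.
sub break_even_length {
    my ($positions, $encodings, $alphabet) = @_;
    return (log($positions) + log($encodings)) / log($alphabet);
}

my $n = 500e6;    # 500 million base pairs, as in the post
printf "log10 E[matches] at L=100: %.1f\n", log10_expected_matches($n, 1, 26, 100);
printf "break-even length:         %.2f\n", break_even_length($n, 1, 26);
```

Under these assumptions, with N = 500 million, E = 1 and A = 26, the break-even length is roughly 6 characters, and every extra factor of 26 in E adds one more character.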

To everyone saying that with this method you can find the complete works of Shakespeare or the Communist Manifesto embedded in DNA, I say: give it a try. Use my script or, better yet, write your own. It's not possible, not with the encoding I'm using.


Replies are listed 'Best First'.
Re^3: RFC: Is the Bible encoded in DNA?
by wstryder (Novice) on May 14, 2018 at 19:01 UTC

    Nobody has spotted the obvious: the code for reading the file is painfully slow.

    A significant improvement would be:

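    # $eof is a "keep going" flag: true while reading should continue
    while (read($fh, my $char, 1) && $eof) {
        if ($char =~ m/[ACGT]/) {
            $i++;    # count only real bases, not newlines or other characters
            if ($i >= $start_genome && $i < $end_genome) {
                push(@genome, $char);
            }
            elsif ($i > $end_genome) {
                $eof = 0;    # past the end of this chunk: stop reading
                # do the searching here, instead of outside the loop!
            }
        }
    }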
      That's now fixed and the algorithm doesn't go back to the beginning of the file every time a chunk is read. Silly me.
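A further speed-up would be to stop calling read once per character. The following is a minimal sketch, not the script's actual code: read_bases is a hypothetical helper, the 1 MiB block size is arbitrary, and $skip is a 0-based count of bases to pass over (the loop above counts from 1). It applies the same filter as the loop above, so anything that is not an uppercase A, C, G or T is dropped, and FASTA header lines get no special treatment.

```perl
use strict;
use warnings;

# Hypothetical helper: return up to $want bases from $fh, after
# skipping the first $skip bases, reading the file in large blocks.
sub read_bases {
    my ($fh, $skip, $want) = @_;
    my $bases = '';
    my $seen  = 0;    # bases consumed so far, including skipped ones
    while (length($bases) < $want && read($fh, my $buf, 1 << 20)) {
        $buf =~ tr/ACGT//cd;    # delete every character that is not A, C, G or T
        my $len = length $buf;
        if ($seen + $len <= $skip) {    # block lies entirely before the chunk
            $seen += $len;
            next;
        }
        my $offset = $skip > $seen ? $skip - $seen : 0;
        $bases .= substr($buf, $offset, $want - length($bases));
        $seen += $len;
    }
    return $bases;
}

# Demo on an in-memory file, so the sketch runs without a genome file.
open my $demo, '<', \"ACG\nTTNGA\n" or die "cannot open in-memory file: $!";
print read_bases($demo, 2, 4), "\n";
close $demo;
```

Returning the chunk as one string instead of pushing single characters onto an array also keeps memory use down; split it only if the search really needs a list.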
Re^3: RFC: Is the Bible encoded in DNA?
by Anonymous Monk on May 14, 2018 at 20:12 UTC
    So far I've got 42 characters for Lorem Ipsum and 45 for the Hebrew Bible.
    Sorry, can't help you more right now. See the multiple comparisons problem; any good statistics handbook should explain it in depth.
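For a feel of why multiple comparisons matter here, a small sketch: if m independent tests are run and each has probability p of a false hit, the chance of at least one false hit is 1 - (1 - p)^m, and the Bonferroni correction tests each at alpha/m instead. Independence is an assumption, the function names are made up for illustration, and the numbers are examples, not results from the DNA search.

```perl
use strict;
use warnings;

# Probability of at least one false positive among $m independent
# tests, each with false-positive probability $p (ASSUMES independence).
sub family_wise_error_rate {
    my ($p, $m) = @_;
    return 1 - (1 - $p) ** $m;
}

# Bonferroni: per-test threshold that keeps the overall rate below $alpha.
sub bonferroni_threshold {
    my ($alpha, $m) = @_;
    return $alpha / $m;
}

printf "FWER for p=0.05, m=20: %.3f\n", family_wise_error_rate(0.05, 20);
printf "Bonferroni threshold:  %.4f\n", bonferroni_threshold(0.05, 20);
```

Every extra text, encoding, or start position that is tried counts as another comparison, so the bar for a "significant" match rises with the size of the search.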
