Subject: Re: "t" stands for the type true and for the boolean expression true ..
From: rpw3@rpw3.org (Rob Warnock)
Date: Thu, 27 Oct 2005 04:16:30 -0500
Newsgroups: comp.lang.lisp
Message-ID: <8t2dne8XEshzB_3enZ2dnUVZ_smdnZ2d@speakeasy.net>
Joe Marshall  <jmarshall@alum.mit.edu> wrote:
+---------------
| rpw3@rpw3.org (Rob Warnock) writes:
| >     > (let ((acgt (vector 'a 'c 'g 't)))
| >         (loop for i below 200 collect (aref acgt (random 4))))
| >     (G T T C C G G C C G A C T G A A T T C A T T G T A C A G G C A C
| >      G T G A G C C C G C G C A C G G G G C T G G G A T T T G G A T G
| >      T A T A T A A T G G A G T C G G A T A G C G A C G T C C G A C C
| >      A C A A A T C T C C G C G G C G A C C T T A G A G A C T A T A G
| >      G A C G G T G G A T T A T T C T A A T T T A C G A C C G C A G T
| >      G T T G C C A T C T G C A G C A C C A A T C C G A A A A A G G C
| >      G G G T A T C G)
| >     > 
| > Hey, this DNA stuff is fun!!   ;-}  ;-}
| 
| Keep this man away from the gene-splicing equipment.
+---------------

So what *is* the probability that a random 200 base-pair sequence
encodes something biologically active? (...or even a viable virus?!?)  ;-}

I know, I know, they actually come in base-pair *triplets* [that is,
DNA is really base-64], so a 200 base-pair sequence isn't even valid.
And it needs special "start" & "stop" triplets at the beginning and end. 
[And if it's a virus, it's probably RNA, not DNA.]

But that said, what's the probability that a random *well-formed*
207 base-pair sequence encodes something biologically active (or viable)?


-Rob

-----
Rob Warnock			<rpw3@rpw3.org>
627 26th Avenue			<URL:http://rpw3.org/>
San Mateo, CA 94403		(650)572-2607