Been meaning to comment on this report of a couple of weeks ago: We don't carry around anywhere near as much junk DNA as previously claimed. Not too many slacker letters in the genome going along for the ride.
"Our genome is simply alive with switches: millions of places that determine whether a gene is switched on or off," says Ewan Birney of EMBL-EBI, lead analysis coordinator for ENCODE. "The Human Genome Project showed that only 2% of the genome contains genes, the instructions to make proteins. With ENCODE, we can see that around 80% of the genome is actively doing something. We found that a much bigger part of the genome – a surprising amount, in fact – is involved in controlling when and where proteins are produced, than in simply manufacturing the building blocks."
This report has a number of important implications:
If we had cheaper genetic sequencing equipment 30 years ago we would have made very good use of the data because of the cost of computer memory, disk, and CPU. The amount of data we need to process thru to tease out what genes do is huge. We each have about 3 billion genetic letters. But when sequencing a genome it has to be sequenced many times to identify errors and to fit together overlapping pieces. So each genome's sequencing requires processing of tens of billions of genetic letters with lots comparisons and building up of data structures to gradually connect all the pieces together.
Once each genome's sequence is known then using it to compare against other genomes requires even more computer powers. The differences in those letters need to be compared with many attributes of each of us for tens or hundreds of millions of people in order to discover all their effects.
Share | | Randall Parker, 2012 September 24 09:17 PM Bodies Perfected Genes |
I question whether humans, as presently constituted, will ever be able to understand such a complex system as the human genome.
After all, our brain evolved merely to keep us alive long enough to reproduce and assure the survival of our descendants.
A brain with orders-of-magnitude-higher capacity and complexity would be required.
Strong AI will eventually be capable of understanding the entire genome, but by then we humans won't understand the strong AI.
We'll have to take its word for almost everything beyond a certain level of complexity.
Don't know about you, but to me that seems preferable to inescapable ignorance.
Yeah, I have to say, while I'm all about the genome sequencing, it seems that it's going to take decades to obtain a reasonably broad understanding of the genome necessary to safely manipulate outcomes. I'm sure there will be some surprising developments sooner, but it looks like a lot of the promise of genomics may only hold for humans a couple generations away. Oh well. Onwards!
Well, no. Ewan Birney has got it wrong. Parts of the genome that accumulate random mutations are obviously not doing much, otherwise mutations wouldn't accumulate in a random way. If they were doing something useful then the bits that were helpful would be conserved. The portion of our DNA where random mutations don't accumulate is much less than 80%. The biologist P.Z. Myers has a thing or two to say about ENCODE:
http://freethoughtblogs.com/pharyngula/2012/09/23/the-encode-delusion/