Scientists find a fingerprint of evolution across the human genome

April 8, 2008

The Human Genome Project revealed that only a small fraction of the 3 billion “letter” DNA code actually instructs cells to manufacture proteins, the workhorses of most life processes. This has raised the question of what the remaining part of the human genome does. How much of the rest performs other biological functions, and how much is merely residue of prior genetic events?

Scientists from Cold Spring Harbor Laboratory (CSHL) and the University of Chicago now report that one of the steps in turning genetic information into proteins leaves genetic fingerprints, even on regions of the DNA that are not involved in coding for the final protein. They estimate that such fingerprints affect at least a third of the genome, suggesting that while most DNA does not code for proteins, much of it is nonetheless biologically important – important enough, that is, to persist during evolution.

Conservation of genetic information

To gauge how critical a particular stretch of DNA is, biologists often look at the detailed sequence of “letters” it consists of, and compare it with a corresponding stretch in related creatures like mice. If the stretch serves no purpose, the thinking goes, the two sequences will differ because of numerous mutations since the two species last shared an ancestor. In contrast, it’s believed that the sequences of important genes will be similar, or “conserved,” in different species, because animals with mutations in these genes did not survive. Biologists therefore regard conserved sequences as a sign of biological importance.

To test for conservation, researchers need to find matching stretches in the two species. This is relatively easy for stretches that “code” for proteins, where scientists long ago learned the meaning of the sequence. For “noncoding” regions, however, the comparison is often ambiguous. Even within a gene, stretches of DNA that code for pieces of the target protein are usually interspersed with much larger noncoding stretches, called introns, that are removed from the RNA working copy of the DNA before the protein is made.

Signs of splicing

Previous researchers assumed that mutations in the middle of introns do not affect the final protein, so they simply accumulate. In the new work, however, the researchers found signs that evolution rejects some types of mutations even in these regions of the genome. Although the selection is weak, “introns are not neutral,” in their effect on survival, says CSHL professor Michael Zhang, a bioinformatics expert who headed the research team.

To look for selection, CSHL researcher Chaolin Zhang, a doctoral candidate at Stony Brook University, looked in the human genome for a subtle statistical imbalance in how often various “letters” appear. The researchers attribute this imbalance to special short stretches of DNA that mark regions to be removed. Unless these signal sequences are sprinkled throughout an intron, the data suggest, it may not be properly spliced out, with potentially fatal consequences. Other sequences must likewise be preserved in the regions to be retained.

The scientists found a preference for some “letters” across intron regions, and the opposite preference in coding regions. Together, these regions make up at least a third of the genome, which is thus under selective pressure during evolution. The result supports other recent studies that suggest that, although most DNA does not code for proteins, much of it is nonetheless biologically important.

In addition to demonstrating how splicing affects genetic evolution, the statistical analysis identified possible signaling sequences, some that were already known and others that are new. According to co-author Adrian Krainer, a CSHL professor and splicing expert, “the exciting thing will be to experimentally test whether these predicted elements are really true.”

“RNA landscape of evolution for optimal exon and intron discrimination” appears in the April 15, 2008 edition of the Proceedings of the National Academy of Sciences. The paper is available online at http://www.pnas.org_cgi_doi_10.1073_pnas.0801692105 .

Source: Cold Spring Harbor Laboratory


print this article email this article download pdf blog this article bookmark this article     Stumble it Digg this share on Facebook retweet share on Reddit add to delicious
Rate this story - 4.7 /5 (37 votes)

Rank Filter

Move the slider to adjust rank threshold, so that you can hide some of the comments.


Display comments: newest first

  • Koen - Apr 09, 2008
    • Rank: not rated yet
    Information theory describes the efficiency of codes (source codes and channel codes). Efficient source codes compress the number of symbols used for encoding information. Channel codes are used to make sure information is send/received with minimal (or no) loss of information.

    The 'biological' channel is simply the copying of DNA encoded information during cell division: DNA encoded information can 'mutate' which is the equivalent of information loss in received information.

    The redundancy in the DNA code could very well be a channel code in order to minimize the loss of information during cell division. About 2 third of all DNA seem to be useless/redundant information; such a redundancy is not unusual for channel codes.
  • Koen - Apr 09, 2008
    • Rank: not rated yet
    A correction to my previous comment: just skip the words 'cell division'; the copying of DNA/RNA encoded information also takes place in other processes such as the translation of exons into proteins. The 'intron' DNA could be a channel code from the perspective of information theory. The article does not describe how well exon information is copied (translated into proteins, or to other RNA exon copies) in case the intron DNA is absent. Does mutated exon information still result into the same protein? How well are RNA exons translated to proteins in absense of introns?
  • ligfiets - Apr 09, 2008
    • Rank: not rated yet
    But how important is the efficiency of genetic code? If i understand Dawkins correctly, the genetic code is not there for the organism, but the other way around. 'Redundant' code could be just code which is very good at letting other code do the hard work.
  • HenisDov - Apr 09, 2008
    • Rank: not rated yet
    Link to my comment:

    http://www.physfo...ic=14988&st=195&#entry331556

    Dov Henis
  • MongHTan,PhD - Apr 09, 2008
    • Rank: not rated yet
    But how important is the efficiency of genetic code? If i understand Dawkins correctly, the genetic code is not there for the organism, but the other way around. 'Redundant' code could be just code which is very good at letting other code do the hard work.


    Please see the fallacies of Dawkinsian pseudo-Genetics here:

    http://www.physfo...pic=6365&view=findpost&p=330762 (post on March 7 2008).

    Author "Decoding Scientism" (work in progress since July 2007).
  • MongHTan,PhD - Apr 09, 2008
    • Rank: not rated yet
    Sorry, wrong date: the following article was posted on April 7, 2008:

    http://www.physfo...pic=6365&view=findpost&p=330762.

    Thank you for your attention.

    Author "Decoding Scientism" (work in progress since July 2007).

April 8, 2008 all stories

Comments: 6

4.7 /5 (37 votes)
  • Stumble this up

  • Digg this

  • share this

  • hide
  • Related Stories

  • PLoS Genetics 2009 maize genome collection
    created Nov 19, 2009 | popularity not rated yet | comments 0
  • Technique finds gene regulatory sites without knowledge of regulators
    created Nov 19, 2009 | popularity not rated yet | comments 0
  • DNA 'barcode' for tropical trees
    created Nov 04, 2009 | popularity not rated yet | comments 0
  • Researchers develop genetic map for cowpea, accelerating development of new varieties
    created Oct 13, 2009 | popularity not rated yet | comments 0
  • Bug splatter on your car's windshield is a treasure trove of genomic biodiversity
    created Oct 08, 2009 | popularity not rated yet | comments 0


Other News

Got a pain? -- Have a cup of Brazilian mint

Biology / Plants & Animals

created 49 minutes ago | popularity 5 / 5 (1) | comments 0

For thousands of years it has been prescribed by traditional healers in Brazil to treat a range of ailments from headaches and stomach pain to fever and flu.


Opposites attract: Monkeys choose mating partners with different genes

Biology / Evolution

created 49 minutes ago | popularity not rated yet | comments 0

The world's largest species of monkey 'chooses' mates with genes that are different from their own to guarantee healthy and strong offspring, according to a new research study.


Ice Cold: Cooler Than Being Cool

Ice Cold: Cooler Than Being Cool

Biology / Cell & Microbiology

created 3 hours ago | popularity 4.8 / 5 (5) | comments 0

(PhysOrg.com) -- Water expands when it freezes. Anyone who has ever left a can of soda or bottle of water in the freezer too long has witnessed this first hand. So how do plants and animals survive severe ...


When is a stem cell really a stem cell?

When is a stem cell really a stem cell?

Biology / Biotechnology

created 2 hours ago | popularity 5 / 5 (2) | comments 0

(PhysOrg.com) -- Induced pluripotent stem (iPS) cells -- adult cells reprogrammed to look and function like versatile embryonic stem cells -- are of growing interest in medicine. They may provide a way to ...


Probing life's extremes in Yellowstone

Probing life's extremes in Yellowstone (w/ Podcast)

Biology / Cell & Microbiology

created 1hour ago | popularity not rated yet | comments 0

(PhysOrg.com) -- Idaho National Laboratory biologist Frank Roberto squats on a bare, gravelly patch of ground in Yellowstone National Park's rolling backcountry. At his feet, scalding water churns in a mustard-yellow ...