Standards for a New Genomic Era

October 21, 2009

(PhysOrg.com) -- A team of geneticists at Los Alamos National Laboratory, together with a consortium of international researchers, has recently proposed a set of standards designed to elucidate the quality of publicly available genetic sequencing information. The new standards could eventually allow genetic researchers to develop vaccines more efficiently or help public health or security personnel more quickly respond to potential public-health emergencies.

In a recent issue of Science, Los Alamos geneticist Patrick Chain and colleagues presented six labels for sequence data that are, or will become, available in public databases rather than the two labels used today. The six labels would roughly characterize the completeness and accuracy—and consequently, the potential reliability—of genetic sequencing data. This is of great importance since researchers use such data on a daily basis for cross-referencing unknown genetic material with the of known organisms.

Every with DNA has chromosomes containing the four molecular building blocks, or base pairs, represented by letters A, T, G, and C. One chromosome can contain millions of base pairs arranged like rungs on a ladder of DNA. The base pairs are arranged in sets of specific sequences that make up genes. These gene sequences can contain genetic instructions that help or harm an organism—for example by encoding enzymes that digest certain foods, or inducing cellular aberrations that give rise to certain diseases.

Genome researchers have catalogued from thousands of organisms and placed them in publicly available libraries. Researchers can use these libraries to crosscheck genetic data, for example when attempting to isolate an unknown public health threat, or to determine where a potentially helpful or harmful gene may be located on an organism's chromosome. For scientific fields such as biofuels research or environmental remediation, genetic data could help researchers determine whether microorganisms can efficiently break down plant matter to aid in ethanol production, or digest environmental contaminants like hydrocarbons.

However, because of the complexity of genetic data, genetic information in public libraries can range from very rough to very refined. In the past, genetic data has been classified either as "draft" or "finished," leaving a wide range of uncertainty about the potential accuracy of genetic data.

"In the past few years we've seen major advances in genetic sequencing technology, so we've seen an explosion in the amount of publicly available data," said Chain, who is lead author of the Science paper. "The amount of base-pair sequencing data generated each day is in the billions—orders of magnitude larger than what was generated a few years ago. Different sequencing technologies have different levels of accuracy. High degrees of uncertainty in a sequence can potentially lead a researcher down a wrong path that they could follow for a year or more. We now have a need for standards that will provide researchers with an unambiguous estimation of the quality of genetic sequence data."

Working with researchers from genome sequencing centers big and small—including the U.S. Department of Energy's Joint Genome Institute, the Sanger Institute, the Human Microbiome Project Jumpstart Consortium sequencing centers, Michigan State University, and the Ontario Institute for Cancer Research, among others—Chain and colleagues have proposed that sequence data be placed into one of six categories that augment the existing two categories. The six standards range from "standard draft sequence," representing minimum requirements for public submission, to a "finished sequence," the highest standard, which can be verified to contain only one sequencing error per 100,000 base pairs.

"My hope is all the major genome centers and advanced genomics groups use the gradations that fit their needs," said Chris Detter, LANL Genome Science Group Leader and Joint Genome Institute-LANL Center director. "Some centers may want all six, while some may only want three, but as long as they keep them intact, we are in good shape. Then, my hope is that the smaller genomics groups adopt the classes as written to help the rest of the scientific community know what they are generating and submitting."

Source: Los Alamos National Laboratory (news : web)


print this article email this article download pdf blog this article bookmark this article     Stumble it Digg this share on Facebook retweet share on Reddit add to delicious
Rate this story - 3 /5 (1 vote)


October 21, 2009 all stories

Comments: 0

3 /5 (1 vote)
  • Stumble this up

  • Digg this

  • share this

  • hide
  • Related Stories

  • Establishing standard definitions for genome sequences
    created Oct 08, 2009 | popularity not rated yet | comments 0
  • Horse genome sequence draft is issued
    created Feb 07, 2007 | popularity not rated yet | comments 0
  • Illinois pig part of swine genome project
    created Jan 14, 2006 | popularity not rated yet | comments 0
  • Genome Institute Reaches Milestone with a Mighty Microbe
    created May 08, 2007 | popularity not rated yet | comments 0
  • Human chromosome 3 is sequenced
    created Apr 27, 2006 | popularity not rated yet | comments 0



  • hide
  • Relevant PhysicsForums posts

  • Human Leukocyte Antigen (HLA) typing
    created 2 hours ago
  • Breeding program
    created Nov 20, 2009
  • How does a concentration gradient provide energy?
    created Nov 20, 2009
  • Eyesight and Neural Damage from Electronics
    created Nov 19, 2009
  • Quick question about the Golgi Apparatus?
    created Nov 19, 2009
  • The beginning of humans
    created Nov 18, 2009
  • More from Physics Forums - Biology

Other News

The Monarchs' annual migration ritual has yet to be scientifically explained

Tree-eating bugs threaten Monarch butterfly in Mexico

Biology / Ecology

created 3 hours ago | popularity not rated yet | comments 0

The mysterious Monarch butterfly, which migrates en masse annually between Canada and Mexico, is now facing a new peril: another insect thriving in Western Mexican forests.


Extinct goat Myotragus balearicus

Extinct goat was cold-blooded

Biology / Plants & Animals

created Nov 18, 2009 | popularity 4.9 / 5 (31) | comments 10

(PhysOrg.com) -- An extinct goat that lived on a barren Mediterranean island survived for millions of years by reducing in size and by becoming cold-blooded, which has never before been discovered in mammals.


Bigger not necessarily better, when it comes to brains

Bigger not necessarily better, when it comes to brains

Biology / Plants & Animals

created Nov 17, 2009 | popularity 4.5 / 5 (17) | comments 11

(PhysOrg.com) -- Tiny insects could be as intelligent as much bigger animals, despite only having a brain the size of a pinhead, say scientists at Queen Mary, University of London.


Right-handed chimpanzees provide clues to the origin of human language

Biology / Plants & Animals

created Nov 16, 2009 | popularity 3 / 5 (1) | comments 7

Most of the linguistic functions in humans are controlled by the left cerebral hemisphere. A study of captive chimpanzees at the Yerkes National Primate Research Center (Atlanta, Georgia), reported in the January 2010 issue ...


The creature was found at a depth of 161 metres

Japanese researchers film rare baby fish 'fossil'

Biology / Plants & Animals

created Nov 17, 2009 | popularity 4.7 / 5 (7) | comments 4

Japanese marine researchers said Tuesday they had found and successfully filmed a young coelacanth -- a rare type of fish known as "a living fossil" -- in deep water off Indonesia.