Future of biology rests in harnessing data avalanche

September 4th, 2008 Future of biology rests in harnessing data avalanche

(PhysOrg.com) -- Like most sciences, biology is inundated with data. However, a group of researchers warns in a Nature feature that the avalanche of biological information is at the point where the discipline may be unable to reach its full potential without improvements for curating data into on-line databases. The commentary appears in the September 4, issue of the journal and outlines specific remedies to harness the information overload.

By July 2008, data-extractors or curators had indexed over 18 million articles in PubMed and sequences of over 260,000 organisms into GenBank. Both are examples of databases where biological information is stored for public access. Data curation is very labor intensive.

“There is a lack of standardization or consistency in the way scientists report their findings in different journals,” remarked corresponding author Sue Rhee of the Carnegie Institution’s Department of Plant Biology and principal investigator of The Arabidopsis Information Resource (TAIR). “In some cases the researchers don’t even specify the species of a gene under study. That leaves biocurators, who have advance degrees in biology, and expertise with databases and scripting languages, to read the full text and transfer the essence of the information into specific fields in the database. They spend a lot of time just figuring out the basics. And that leaves a lot of room for error.”

Curation is not just a data organization tool. Such input has become essential to biological research. The authors note that eleven different databases had ¾ of a million visitors who viewed 20 million pages in just one month. And with inference programs that feed on the curated data, researchers can now tap into other work that relates to theirs and use that data in their own experiments—a huge advancement that is accelerating the pace of biology. “With this vast universe of information, the whole nature of experimentation is changing,” continued Rhee. “But the field is being held back with the curation backlog.”

The group of authors outlined a series of solutions to the problem. The first is to have authors input their data directly into databases upon acceptance in refereed journals. This step has already begun with Plant Physiology and TAIR. When a manuscript in accepted, researchers now fill in a web form about Arabidopsis genes. Second, the commentators urge the biological community to adopt standard reporting formats that are universally agreed upon. And third, curation needs to be elevated by academic institutions and funding agencies. There should also be incentives for researchers to curate their own data, such as increases in academic recognition, career advancement, and funding. They additionally suggest that “community annotation” could be modeled after large-scale astronomy projects like the Sloan Digital Sky Survey, or the Galaxy Zoo, where 80,000 astronomers and interested amateurs classified one million galaxies in less than three weeks.

“The effort and cost required to curate the data is small compared with the cost of carrying out the research in the first place, yet this additional step adds tremendously to the value of the research results to society,” commented Eva Huala, director of TAIR.

Wolf Frommer, acting director of Carnegie’s Department of Plant Biology noted that “advances in our understanding of biology will affect our food supply, our health-care system, the development of remedies for climate change, and many other aspects of daily life. Basic and applied research have to go hand in hand with curation of databases so that humanity can adapt to the quickly changing world as fast as possible.”

Provided by Carnegie Institution


print this article email this article download pdf blog this article bookmark this article     Digg this Stumble it share on Facebook share on Reddit add to delicious save to Yahoo! bookmarks
5/5 after 3 votes

Rank Filter

Move the slider to adjust rank threshold, so that you can hide some of the comments.


Display comments: newest first


September 4th, 2008 all stories
Biology /

Comments: 1
Rank: 5/5 after 3 votes

  • Stumble this up

  • Digg this

  • Share it:
  • share on Facebook
  • share on MySpace
  • share on Slashdot
  • rss-newsfeed
  • share on Google
  • share on Reddit
  • add to delicious
  • save to Yahoo! bookmarks
  • share on Windows Live
  • Add to Mixx!
Rating: 5/5 after 3 votes

  • Related Stories

  • Grid browser finds the meaning of life
    created May 20, 2009 | popularity not rated yet | comments 0
  • MicroRNA undermines tumor suppression
    created Mar 17, 2009 | popularity not rated yet | comments 0
  • Brain on a chip?
    created Mar 16, 2009 | popularity not rated yet | comments 0
  • Data mining promises to dig up new drugs
    created Feb 02, 2009 | popularity not rated yet | comments 0
  • Studies offer guide as protein interaction mapping comes of age
    created Jan 06, 2009 | popularity not rated yet | comments 0

Tags


  • Physicists Demonstrate Quantum Memory with Matter Qubits
    Physicists Demonstrate Quantum Memory with Matter Qubits
    Physics / General Physics
    created Jul 03, 2009 | popularity 4.4 / 5 (17) | comments 1
  • 'Holey' Nanosheets for Wastewater Dye Removal
    Nanotechnology / Nanomaterials
    created Jul 01, 2009 | popularity 5 / 5 (5) | comments 1
  • Jellyfish Robot Swims Like its Biological Counterpart
    Jellyfish Robot Swims Like its Biological Counterpart
    Electronics / Robotics
    created Jun 26, 2009 | popularity 4.4 / 5 (8) | comments 1
  • Could Maxwell's Demon Exist in Nanoscale Systems?
    Could Maxwell's Demon Exist in Nanoscale Systems?
    Physics / General Physics
    created Jun 24, 2009 | popularity 4.4 / 5 (18) | comments 29
  • Living Safely with Robots, Beyond Asimov's Laws
    Living Safely with Robots, Beyond Asimov's Laws
    Electronics / Robotics
    created Jun 22, 2009 | popularity 4.6 / 5 (54) | comments 40
  • Other News

    Hormone clue to root growth

    Biology / Plants & Animals

    created 46 minutes ago | popularity not rated yet | comments 0

    (PhysOrg.com) -- Plant roots provide the crops we eat with water, nutrients and anchorage. Understanding how roots grow and how hormones control that growth is crucial to improving crop yields, which will be necessary to ...


    Final rules out for government stem cell research

    Biology / Other

    created 1hour ago | popularity not rated yet | comments 0

    (AP) -- The government is creating a master list of which embryonic stem cells qualify for taxpayer-funded research, now that President Barack Obama has lifted restrictions on the field.


    Battle of the sexes benefits offspring, says research

    Battle of the sexes benefits offspring, says research

    Biology / Plants & Animals

    created 4 hours ago | popularity 5 / 5 (1) | comments 0

    Parents compensate for a lazy partner by working harder to bring up their offspring, but not enough to completely make up for the lack of parenting, says research by bird biologists at the University of Bath.


    Researchers uncover the potential for super-sized abalone

    Biology / Microbiology

    created 8 hours ago | popularity 4 / 5 (1) | comments 0

    (PhysOrg.com) -- Designer abalone could be an option in the foreseeable future, thanks to a project conducted at UQ's Heron Island Research Station and the St Lucia campus.


    Scientists 'rebuild' giant moa using ancient DNA

    Biology / Plants & Animals

    created Jul 01, 2009 | popularity 5 / 5 (12) | comments 12

    (PhysOrg.com) -- Scientists have performed the first DNA-based reconstruction of the giant extinct moa bird, using prehistoric feathers recovered from caves and rock shelters in New Zealand.