Software developed by Boston College lab delivers speed and accuracy to genome research

March 28, 2008

It took a global corps of scientists approximately $500 million and 13 years to identify the more than 35,000 genes of the human genome. Five years later, Boston College Biologist Gabor Marth and his research team have developed software that can analyze half a million DNA sequences in 10 minutes.

The Marth laboratory’s proprietary PyroBayes software is one of a new breed of computer programs able to accurately process the mountains of genome data flowing from the latest generation of gene decoding machines, which have placed a premium on computational speed and accuracy in data-crunching fields known as bioinformatics and high-throughput biology, said Marth, an associate professor of Biology.

“We’re on the edge of a real technological revolution that I think will help us understand the genetic causes of diseases in humans and how genetic materials determine traits in animals,” said Marth. “It is going to lead to less expensive technologies that will allow researchers to decode any individual.”

PyroBayes will aid researchers involved in the 1,000 Genomes Project, which announced last month a plan to sequence the genomes of 1,000 individuals from around the world. The NIH, which helps direct the project, has awarded Marth more than $1.3 million to develop software over the next four years.

The advances of the Marth lab were revealed in two articles published by the professor and his assistants in the February issue of Nature Methods, the premier journal of scientific research methodology.

In an article co-authored by Marth, post-doctoral researcher Chip Stewart, and graduate students Aaron Quinlan and Mike Strömberg, the group unveiled the lab’s PyroBayes base caller software, which examines data from one of the latest generation of DNA decoding machines – from Roche / 454 Life Sciences – faster and with far greater accuracy than other programs for pyrosequencing, a technology that utilizes the detection of pyrophosphate for decoding the sequence of DNA, the carrier of genetic information in living organisms.

A second Nature Methods article, written in collaboration with colleagues from the Washington University School of Medicine, reported that three other computer programs developed by the Marth lab made it possible to quickly and accurately examine the whole genome of a laboratory worm and identify key differences between the sample strain and an earlier strain – a comparative process known as re-sequencing, now being applied to the genomes of humans and other organisms. This second study used another next-generation DNA sequencing platform, the Illumina/Solexa machine.

Advances are driving re-sequencing costs down, but researchers must still prove the effectiveness of the new technology by working with smaller organisms, which made the worm study critical, Marth said. “This brings us closer to a major milestone in human individual re-sequencing – the decoding of the genome of human beings in routine fashion,” said Marth.

Of the few computer programs available for the new sequencing machines, the software package developed by the Marth lab is the only one capable of working with a variety of decoding machines and offers greater accuracy, allowing researchers to separate true genetic variations from data errors, said Marth. PyroBayes, a Linux-based package, is made available to fellow academic researchers at no cost.

As a member of its analysis group, the Marth lab participates in the data analysis of the 1000 Genomes Project, which was launched last month. The goal of the project is to sequence the genomes of at least 1,000 people from around the world to create the most detailed and medically useful picture to date of human genetic variation.

Ultimately, advances in bioinformatics will help push genetic science forward, shedding new light on human health and disease. Marth sees his lab’s role in providing critical tools that help researchers to organize data, interpret them, and visualize genome variations.

“We are excited to develop the software that will help these super-fast, high-throughput sequencing machines to realize their potential to produce invaluable data for research,” Marth said.

Source: Boston College


print this article email this article download pdf blog this article bookmark this article     Stumble it Digg this share on Facebook retweet share on Reddit add to delicious
Rate this story - 4.9 /5 (7 votes)


March 28, 2008 all stories

Comments: 0

4.9 /5 (7 votes)
  • Stumble this up

  • Digg this

  • share this

  • hide
  • Related Stories

  • Do 68 molecules hold the key to understanding disease?
    created Sep 04, 2008 | popularity not rated yet | comments 0
  • Ashwell receptor reduces mortality during sepsis
    created May 18, 2008 | popularity not rated yet | comments 0
  • New study supports a stem cell origin of cancer
    created Jan 09, 2007 | popularity not rated yet | comments 0


Other News

The Monarchs' annual migration ritual has yet to be scientifically explained

Tree-eating bugs threaten Monarch butterfly in Mexico

Biology / Ecology

created 13 hours ago | popularity 4 / 5 (1) | comments 0

The mysterious Monarch butterfly, which migrates en masse annually between Canada and Mexico, is now facing a new peril: another insect thriving in Western Mexican forests.


Bigger not necessarily better, when it comes to brains

Bigger not necessarily better, when it comes to brains

Biology / Plants & Animals

created Nov 17, 2009 | popularity 4.5 / 5 (18) | comments 11

(PhysOrg.com) -- Tiny insects could be as intelligent as much bigger animals, despite only having a brain the size of a pinhead, say scientists at Queen Mary, University of London.


Extinct goat Myotragus balearicus

Extinct goat was cold-blooded

Biology / Plants & Animals

created Nov 18, 2009 | popularity 4.9 / 5 (33) | comments 10

(PhysOrg.com) -- An extinct goat that lived on a barren Mediterranean island survived for millions of years by reducing in size and by becoming cold-blooded, which has never before been discovered in mammals.


Right-handed chimpanzees provide clues to the origin of human language

Biology / Plants & Animals

created Nov 16, 2009 | popularity 3 / 5 (1) | comments 7

Most of the linguistic functions in humans are controlled by the left cerebral hemisphere. A study of captive chimpanzees at the Yerkes National Primate Research Center (Atlanta, Georgia), reported in the January 2010 issue ...


The creature was found at a depth of 161 metres

Japanese researchers film rare baby fish 'fossil'

Biology / Plants & Animals

created Nov 17, 2009 | popularity 4.7 / 5 (7) | comments 4

Japanese marine researchers said Tuesday they had found and successfully filmed a young coelacanth -- a rare type of fish known as "a living fossil" -- in deep water off Indonesia.