Genome comparison tools found to be susceptible to slip-ups

May 26, 2010 by Hannah Hickey

(PhysOrg.com) -- You might call it comparing apples and oranges, but lining up different species' genomes is common practice in evolutionary research. Scientists can see how species have evolved, pinpoint which sections of DNA are similar between species, meaning they probably are crucial to the animals' survival, or sketch out evolutionary trees in places where the fossil record is spotty.

But the tools used to align genomes from different species have serious quality-control issues, according to a study published online this week in the journal .

"We discovered that there's a disturbingly low level of agreement between genome alignments produced by different tools," said corresponding author Martin Tompa, a UW professor of computer science and engineering and of genome sciences. "What this should suggest to biologists is that they should be very cautious about trusting these alignments in their entirety."

This is especially true when comparing distantly related species, and in regions of the genome that do not code for a protein, he said.

Aligning genomes, while simple in theory, is difficult in practice. Aligning more than two sequences becomes much harder with every additional sequence. At the scale of a mammal's entire genome, all of its , finding the optimal alignment of many genomes is far beyond the capabilities of any computer, Tompa said.

Various software tools instead use strategic shortcuts.

"At a high level the tools are very similar," Tompa said. "They make different decisions at the lower, more detailed levels, and those decisions seem to have widespread effect on the outcome."

The new paper compared the alignments from a previous study in which four research teams each took the same 1 percent of the human genome and aligned it to the genomes of 27 other vertebrate animals, ranging from mouse to elephant.

"This is a marvelous dataset," Tompa said. "It's a very large-scale multiple sequence alignment, done by four expert teams using four different tools, all of them working on the same input sequences."

However, the new study found that the resulting alignments were quite different. The authors also compared the coverage of each tool, meaning how much of the human DNA it was able to match to each other species, as well as what fraction of alignments were suspiciously close to a random match.

The best-performing tool was the newest one, Pecan, developed by the European Bioinformatics Institute.

"Our study pretty clearly points to Pecan as being the highest-quality alignment of the four tools we compared," Tompa said. It aligned as much of the human to other species as any of the other tools, and its matches were considerably more reliable, especially between more distantly related species.

The other tools in the study were Threaded Blockset Aligner (or TBA), Multiple Limited Area Global Alignment of Nucleotides (or MLAGAN) and Mavid. All four are free programs developed by academic institutions, Tompa said.

"I'm hoping that the designers of these tools will take a very close look at our paper and might be able to improve their tools as a result," he said. "I think we're all interested in having a better understanding of which methods work the best and how to make them better."

More information: http://www.nature. … bt.1637.html

Provided by University of Washington (news : web)

4.5 /5 (2 votes)  

Rank 4.5 /5 (2 votes)
Related Stories
Relevant PhysicsForums posts
  • Mitosis
    created5 hours ago
  • Stem cell question.
    created6 hours ago
  • Protease cleavage
    created12 hours ago
  • Pertubance in a model
    created19 hours ago
  • Cancer drugs and Alzheimer's, Oh my!
    createdFeb 09, 2012
  • Squishing cells
    createdFeb 09, 2012
  • More from Physics Forums - Biology

More news stories

The power of estrogen -- male snakes attract other males

A new study has shown that boosting the estrogen levels of male garter snakes causes them to secrete the same pheromones that females use to attract suitors, and turned the males into just about the sexiest ...

Biology / Plants & Animals

created 17 hours ago | popularity 4.8 / 5 (6) | comments 2 | with audio podcast

Grass to gas: Researchers' genome map speeds biofuel development

Researchers at the University of Georgia have taken a major step in the ongoing effort to find sources of cleaner, renewable energy by mapping the genomes of two originator cells of Miscanthus x giganteus, a large perenn ...

Biology / Biotechnology

created 14 hours ago | popularity 3.8 / 5 (5) | comments 0 | with audio podcast

Miami battling invasion of giant African snails

No one knows how they got there. But an invasion of African giant snails has southern Florida in a panic over potential crop damage, disease and general yuckiness surrounding the slimy gastropods.

Biology / Ecology

created 21 hours ago | popularity 4 / 5 (1) | comments 4

Experts reveal how plants don't get sunburn

(PhysOrg.com) -- Experts at the University of Glasgow have discovered how plants survive the harmful rays of the sun.

Biology / Cell & Microbiology

created 17 hours ago | popularity 4.8 / 5 (5) | comments 0 | with audio podcast

Protein libraries in a snap

(PhysOrg.com) -- A Rice University undergraduate will depart with not only a degree but also a possible patent for his invention of an efficient way to create protein libraries, an important component of biomolecular ...

Biology / Cell & Microbiology

created 21 hours ago | popularity 4.8 / 5 (4) | comments 1 | with audio podcast


Anonymous knocks CIA website offline (Update)

The website of the Central Intelligence Agency was inaccessible on Friday after the hacker group Anonymous claimed to have knocked it offline.

New error-correcting codes guarantee the fastest possible rate of data transmission

Error-correcting codes are one of the triumphs of the digital age. They’re a way of encoding information so that it can be transmitted across a communication channel — such as an optical fiber o ...

Humans may have helped the decline of African rainforests 3000 years ago

(PhysOrg.com) -- Large areas of rainforests in Central Africa mysteriously disappeared over three thousand years ago, to be replaced by savannas. The prevailing theory has been that the cause was a change ...

Google users warned of threat to smartphone wallets

Users of Google smartphone wallets were being warned on Friday that there is a way to crack pass codes intended to thwart thieves from going on illicit shopping sprees.

New power source discovered

(PhysOrg.com) -- Researchers at the Massachusetts Institute of Technology (MIT) and RMIT University have made a breakthrough in energy storage and power generation.

Small modular reactor design could be a 'SUPERSTAR'

(PhysOrg.com) -- Though most of today's nuclear reactors are cooled by water, we've long known that there are alternatives; in fact, the world's first nuclear-powered electricity in 1951 came from a reactor ...