Genome comparison tools found to be susceptible to slip-ups
May 26, 2010 by Hannah Hickey
(PhysOrg.com) -- You might call it comparing apples and oranges, but lining up different species' genomes is common practice in evolutionary research. Scientists can see how species have evolved, pinpoint which sections of DNA are similar between species, meaning they probably are crucial to the animals' survival, or sketch out evolutionary trees in places where the fossil record is spotty.
But the tools used to align genomes from different species have serious quality-control issues, according to a study published online this week in the journal Nature Biotechnology.
"We discovered that there's a disturbingly low level of agreement between genome alignments produced by different tools," said corresponding author Martin Tompa, a UW professor of computer science and engineering and of genome sciences. "What this should suggest to biologists is that they should be very cautious about trusting these alignments in their entirety."
This is especially true when comparing distantly related species, and in regions of the genome that do not code for a protein, he said.
Aligning genomes, while simple in theory, is difficult in practice. The problem becomes much harder with every sequence added beyond the first two. At the scale of a mammal's entire genome, meaning all of its genetic code, finding the optimal alignment of many genomes is far beyond the capabilities of any computer, Tompa said.
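To give a rough sense of that growth, here is a minimal sketch of the textbook dynamic-programming method for aligning two sequences (Needleman-Wunsch). It is not the method used by any tool in the study. The scoring values and the function names `global_alignment_score` and `table_cells` are illustrative assumptions. Extending the same exact method to several sequences needs a table with one dimension per sequence, so the number of cells is the product of the sequence lengths.

```python
def global_alignment_score(a, b, match=1, mismatch=-1, gap=-1):
    """Best global alignment score for two sequences (Needleman-Wunsch).

    The scoring values are illustrative assumptions, not taken from the study.
    """
    # prev[j] is the best score for aligning the first i-1 letters of a
    # with the first j letters of b; row 0 aligns b against gaps only.
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]  # first i letters of a aligned against gaps only
        for j in range(1, len(b) + 1):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            curr.append(max(prev[j - 1] + pair,   # align a[i-1] with b[j-1]
                            prev[j] + gap,        # gap in b
                            curr[j - 1] + gap))   # gap in a
        prev = curr
    return prev[-1]


def table_cells(lengths):
    """Cells in the full dynamic-programming table for the given lengths."""
    cells = 1
    for n in lengths:
        cells *= n + 1
    return cells


if __name__ == "__main__":
    print(global_alignment_score("ACGT", "ACT"))
    print(table_cells([1000, 1000]))        # two sequences
    print(table_cells([1000, 1000, 1000]))  # three sequences
```

Two sequences of 1,000 letters each need about a million table cells, and a third sequence of the same length pushes that to about a billion, which is why practical tools rely on shortcuts rather than an exact search.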
Various software tools instead use strategic shortcuts.
"At a high level the tools are very similar," Tompa said. "They make different decisions at the lower, more detailed levels, and those decisions seem to have widespread effect on the outcome."
The new paper compared the alignments from a previous study in which four research teams each took the same 1 percent of the human genome and aligned it to the genomes of 27 other vertebrate animals, ranging from mouse to elephant.
"This is a marvelous dataset," Tompa said. "It's a very large-scale multiple sequence alignment, done by four expert teams using four different tools, all of them working on the same input sequences."
However, the new study found that the resulting alignments were quite different. The authors also compared the coverage of each tool, meaning how much of the human DNA it was able to match to each other species, as well as what fraction of alignments were suspiciously close to a random match.
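As a toy illustration of these two ideas, the sketch below computes coverage and pairwise agreement for alignments stored as a mapping from each human position to a position in another species, or `None` where the tool found no match. The data layout, the function names and the exact definition of agreement are assumptions made for illustration; the study's own measures may differ.

```python
def coverage(alignment, human_length):
    """Fraction of human positions that the tool aligned to the other species."""
    aligned = sum(1 for pos in alignment.values() if pos is not None)
    return aligned / human_length


def agreement(alignment_a, alignment_b):
    """Among human positions aligned by both tools, the fraction that both
    tools map to the same position in the other species."""
    shared = [h for h, pos in alignment_a.items()
              if pos is not None and alignment_b.get(h) is not None]
    if not shared:
        return 0.0
    same = sum(1 for h in shared if alignment_a[h] == alignment_b[h])
    return same / len(shared)


if __name__ == "__main__":
    # Hypothetical output of two tools for four human positions.
    tool_a = {0: 10, 1: 11, 2: 12, 3: None}
    tool_b = {0: 10, 1: 11, 2: 40, 3: 41}
    print(coverage(tool_a, 4), coverage(tool_b, 4))
    print(agreement(tool_a, tool_b))
```

In this made-up example the second tool has higher coverage, but the two tools disagree on one of the three positions that both aligned, so higher coverage alone says nothing about which answer is right.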
The best-performing tool was the newest one, Pecan, developed by the European Bioinformatics Institute.
"Our study pretty clearly points to Pecan as being the highest-quality alignment of the four tools we compared," Tompa said. Pecan aligned as much of the human genome to other species as any of the other tools did, and its matches were considerably more reliable, especially between more distantly related species.
The other tools in the study were Threaded Blockset Aligner (or TBA), Multiple Limited Area Global Alignment of Nucleotides (or MLAGAN) and Mavid. All four are free programs developed by academic institutions, Tompa said.
"I'm hoping that the designers of these tools will take a very close look at our paper and might be able to improve their tools as a result," he said. "I think we're all interested in having a better understanding of which methods work the best and how to make them better."
More information: http://www.nature. … bt.1637.html