Petascale computing tools could provide deeper insight into genomic evolution

November 17, 2009

Microscale rearrangements of genes among 12 Drosophila species. Each gene is indicated by a colored line, showing how gene order is shuffled during the evolution of these species. Credit: Image courtesy of Stephen Schaeffer

Technological advances in high-throughput DNA sequencing have opened up the possibility of determining how living things are related by analyzing the ways in which their genes have been rearranged on chromosomes. However, inferring such evolutionary relationships from rearrangement events is computationally intensive on even the most advanced computing systems available today.
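
One general reason this kind of inference is expensive is that the number of candidate evolutionary trees grows very quickly with the number of species. The short sketch below counts unrooted binary tree shapes using the standard (2n-5)!! formula. The function name is hypothetical, and the article does not say how the project's tools search this space, so treat it only as an illustration of the scale involved.

```python
def unrooted_tree_count(n_species: int) -> int:
    """Count distinct unrooted binary trees on n labeled species.

    Uses the standard double-factorial formula (2n - 5)!!,
    i.e. the product 1 * 3 * 5 * ... * (2n - 5).
    """
    if n_species < 3:
        raise ValueError("need at least 3 species for an unrooted binary tree")
    count = 1
    # Multiply the odd numbers 3, 5, ..., 2n - 5 (empty when n_species == 3).
    for k in range(3, 2 * n_species - 4, 2):
        count *= k
    return count


if __name__ == "__main__":
    for n in (4, 8, 12, 20):
        print(n, "species:", unrooted_tree_count(n), "candidate trees")
```

For 12 species, the size of the Drosophila collection discussed later in the article, this already gives 654,729,075 candidate trees, before any rearrangement scoring is done on each one.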

Research recently funded by the American Recovery and Reinvestment Act of 2009 aims to develop computational tools that will utilize next-generation petascale computers to understand genomic evolution. The four-year $1 million project, supported by the National Science Foundation's PetaApps program, was awarded to a team of universities that includes the Georgia Institute of Technology, the University of South Carolina and The Pennsylvania State University.

"Genome sequences are now available for many organisms, but making biological sense of them requires high-performance computing methods and an evolutionary perspective, whether you are trying to understand how genes of new functions arise, why genes are organized as they are in chromosomes, or why these arrangements are subject to change," said lead investigator David A. Bader, a professor in the Computational Science and Engineering Division of Georgia Tech's College of Computing.

Even on today's fastest parallel computers, it could take centuries to analyze genome rearrangements for large, complex organisms. That is why the research team -- which also includes Jijun Tang, an associate professor in the Department of Computer Science and Engineering at the University of South Carolina, and Stephen Schaeffer, an associate professor of biology at Penn State -- is focusing on future generations of petascale machines, which will be able to process more than a thousand trillion, or 10^15, calculations per second. By comparison, a typical personal computer today processes on the order of a few billion calculations per second.

The researchers plan to develop new algorithms in an open-source software framework that will utilize the capabilities of parallel, petascale computing platforms to infer ancestral rearrangement events. The starting point for developing these new algorithms will be GRAPPA, an open-source code co-developed by Bader and first released in 2000, which reconstructs the evolutionary relatedness among species.
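
To give a feel for what comparing gene orders involves, here is a minimal sketch of breakpoint distance, one common measure of how far apart two gene orders are. It is not GRAPPA's code. It assumes each genome is a single linear chromosome written as a list of signed integers (the sign marks gene orientation), that both genomes contain the same genes exactly once, and that chromosome ends are ignored. All function names are hypothetical.

```python
def canonical(a: int, b: int) -> tuple:
    """Normalize an adjacency so that (a, b) and its reverse
    complement (-b, -a) are treated as the same pair."""
    return min((a, b), (-b, -a))


def adjacencies(order: list) -> set:
    """Return the set of neighboring gene pairs in a signed linear order."""
    return {canonical(a, b) for a, b in zip(order, order[1:])}


def breakpoint_distance(g1: list, g2: list) -> int:
    """Count adjacencies of g1 that are not preserved in g2."""
    if sorted(map(abs, g1)) != sorted(map(abs, g2)):
        raise ValueError("genomes must contain the same set of genes")
    return len(adjacencies(g1) - adjacencies(g2))


if __name__ == "__main__":
    ancestor = [1, 2, 3, 4, 5]
    # Genes 2 and 3 have been inverted as a block (order and signs flip).
    descendant = [1, -3, -2, 4, 5]
    print(breakpoint_distance(ancestor, descendant))
```

In this toy case the single inversion disrupts two adjacencies, (1, 2) and (3, 4), while the pair (2, 3) survives in reversed form. Reading the whole chromosome from the opposite strand changes nothing, which is why adjacencies are normalized before comparison.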

"GRAPPA is currently the most accurate method for determining genome rearrangement, but it has only been applied to small genomes with simple events because of the limitation of the algorithms and the lack of computational power," explained Bader, who is also executive director of high-performance computing at Georgia Tech.

On a dataset of a dozen bellflower genomes, the latest version of GRAPPA determined the flowers' evolutionary relatedness one billion times faster than the original implementation that did not utilize parallel processing or optimization.

The researchers will test the performance of their new algorithms by analyzing a collection of fruit fly genomes.

"Fruit flies -- formally known as Drosophila -- are an excellent model system for studying genome rearrangement because the genome sizes are relatively small for animals, the mechanism that alters gene order is reasonably well understood, and the evolutionary relationships among the 12 sequenced genomes are known," said Schaeffer.

The analysis of genome rearrangements in Drosophila will provide a relatively simple system for understanding the mechanisms that underlie gene order diversity, which can later be extended to more complex mammalian genomes, such as those of primates.

The researchers believe these new algorithms will make genome rearrangement analysis more reliable and efficient, while potentially revealing new evolutionary patterns. In addition, the algorithms will enable a better understanding of the mechanisms and rate of gene rearrangements in genomes, and the importance of the rearrangements in shaping the organization of genes within the genome.

"Ultimately this information can be used to identify microorganisms, develop better vaccines, and help researchers better understand the dynamics of microbial communities and biochemical pathways," added Bader.

Source: Georgia Institute of Technology

