Darwin's famous finches and Venter's marine microbes
March 13, 2007Although the Galápagos finches were to play a pivotal role in the inception of Darwin’s theory of evolution through natural selection, he had no inkling of their significance when he collected them during his voyage on the HMS Beagle.
Similarly, it is hard to predict the impact the vast amount of marine microbial DNA – collected during the Sorcerer II Global Ocean Sampling Expedition by J. Craig Venter, Ph.D., and his team – will have on our understanding of the natural world.
"If anything, this is just the beginning," says Gerard Manning, Ph.D., director of the Razavi Newman Center for Bioinformatics at the Salk Institute for Biological Studies. "We’re starting to explore this trove of sequences now, but it may be decades before we fully understand it all."
Just like the famous ornithologist John Gould who had to classify the Galápagos finches before they led Darwin on the right track, Manning and many others have been busy during the last couple of months wading through roughly 7.7 million sequenced snippets of sea-borne genomic DNA to impose order on the flood of data and to classify the identified proteins.
Their findings are detailed in series of papers, published in this week’s online edition of the journal Public Library of Science Biology.
The authors are plying the rapidly emerging trade of metagenomics (also known as environmental genomics) that seeks to examine genomic snapshots taken directly from the environment.
"Metagenomics allows us to sample the 99 percent of all bacteria that won’t grow in the lab," explains Manning. "GOS opens a huge window into biological and genomic diversity and, within this diversity, to better understand many of the fundamentals of biology." he adds.
Expanding the universe of protein families
But instead of whole genomes, metagenomics produces a whole grab bag of bits and pieces for which scientists have to develop new methods to extract meaning. In one of the papers, an array of scientists, spearheaded by first author Shibu Yooseph, Ph.D., and his colleagues at the Craig Venter Institute, compared every DNA fragment with every other available DNA fragment to produce clusters of related sequences. This exhaustive analysis predicted more than 6 million proteins in the GOS data – nearly twice the number of all proteins ever described before – and laid the groundwork for further studies.
Manning, a co-author on Yooseph’s paper, looked at the other side of the coin. He ran all the public sequences and GOS data against Pfam, a collection of signature profiles for all known protein families. Each of these profiles is an average of all known members of a certain protein family.
"Instead of starting with a human kinase to find a bacterial kinase, for example, you start with all of them together, which makes the search much more sensitive, but also very computationally expensive," Manning says. "We did almost 350 million comparisons, which is probably an order of magnitude or two more than anybody has ever done before."
Manning and co-author Yufeng Zhai, Ph.D., a bioinformatics programmer in the Razavi Newman Center for Bioinformatics at the Salk, could only accomplish this rather gargantuan task with the help of Time Logic, a company in Carlsbad, California. The company specializes in hardware that accelerates genomic searches. "We only have one of their accelerators, but Time Logic stepped up and lent us eight more," says Manning. The final computation took two weeks, but would have taken well over a century on a traditional computer.
The Salk scientists could assign over half of all GOS sequences to known protein families, and discovered that certain protein profiles are more popular in the ocean or on land. For example, gram-positive bacteria are best known for their hardy spores, but this ability has been entirely lost in their marine relatives.. Flagella, whip-like extensions propelling bacteria forward and pili, short extensions used to exchange genetic material between bacteria (also known as microbial sex), are also less frequent in marine environments.
"By comparing our findings with the Yooseph clusters, we also discovered hundreds of new gene families that hadn’t even been seen before," says Zhai and adds that by adding the diverse GOS data to known profiles, "we were able to make them more sensitive and diverse, and so increase their power to categorize novel sequences."
Diversity of microbial kinases
In a separate study, Manning, Zhai, and first author Natarajan Kannan, Ph.D., a postdoctoral researcher in the lab of HHMI investigator and UCSD professor Susan S. Taylor, Ph.D., traded the breadth of the ocean survey for the depth of a single protein domain. They zoomed in on kinases, extremely well studied enzymes, which control every aspect of eukaryotic cell biology and are important cancer drug targets. They control the activity of proteins and small molecules by attaching tiny phosphate groups to them. By contrast, much less has been known about their bacterial counterparts.
Again and again, the researchers combed the GOS data for bacterial kinases, each time rebuilding their domain profiles by including the new members found in the previous round. All in all, they dug up 45,000 protein kinase sequences that fell into 20 distinct families, of which the
eukaryotic protein kinases are just one. The additional 19 families spanned a huge range and included several that had never been described before.
"Prokaryotic protein-like kinases were considered to be some sort of niche players, but actually they are more prevalent and widespread than histidine kinases," explains Manning. Bacteria were thought to rely mostly on histidine kinases, which are structurally different from protein kinases, for all their signaling needs.
Even though the different kinase families had very little similarity in their sequence, it emerged that 10 key residues were conserved in almost all kinase families, fingering them as being at the core of what it means to be a kinase. Seven of those had been previously known to be important in human kinases, but the other three were unexpected finds.
The other surprising finding was just how innovative and plastic the different families were, even with these core residues, as one or another family had found ways to eliminate any but one of the 10 key residues. Using structural modeling, and patterns of sequence conservation, Kannan was able to show that loss of one key residue could be compensated by changes around other conserved regions of the protein, and that some of these changes in bacterial kinases are also seen in specific human kinases.
Says Manning, "By looking at all these very distant microbial relatives we can understand more even about human kinases and their relationship to cancer and other diseases. We go out into the ocean, we find all this diversity and analyzing what’s new and what’s not new reflects back on the things we thought we knew well."
Research done at the Salk Institute was supported by the Razavi-Newman Foundation.
Source: Salk Institute
-
Key enzyme discovered to be master regulator in protein-protein reactions
Mar 24, 2010 |
4.5 / 5 (2) |
0
-
New findings provide more complete picture of kidney cancer
Dec 29, 2011 |
5 / 5 (1) |
0
-
New components of antiviral pathway discovered
Dec 22, 2011 |
5 / 5 (1) |
0
-
Scientists find missing link in regulation of glucose
Dec 22, 2011 |
5 / 5 (2) |
0
-
Drugs used to overcome cancer may also combat antibiotic resistance: researchers
Dec 22, 2011 |
5 / 5 (1) |
0
-
Engineers build first sub-10-nm carbon nanotube transistor
Feb 01, 2012 |
4.9 / 5 (31) |
30
-
Something old, something new: Evolution and the structural divergence of duplicate genes
Jan 31, 2012 |
4.6 / 5 (7) |
1
-
The hidden nanoworld of ice crystals: Revealing the dynamic behavior of quasi-liquid layers
Jan 30, 2012 |
5 / 5 (3) |
1
-
Stock market network reveals investor clustering
Jan 27, 2012 |
3.9 / 5 (23) |
8
-
Of microchemistry and molecules: Electronic microfluidic device synthesizes biocompatible probes
Jan 26, 2012 |
5 / 5 (1) |
0
More news stories
The power of estrogen -- male snakes attract other males
A new study has shown that boosting the estrogen levels of male garter snakes causes them to secrete the same pheromones that females use to attract suitors, and turned the males into just about the sexiest ...
10 hours ago |
4.8 / 5 (5) |
0
|
Grass to gas: Researchers' genome map speeds biofuel development
Researchers at the University of Georgia have taken a major step in the ongoing effort to find sources of cleaner, renewable energy by mapping the genomes of two originator cells of Miscanthus x giganteus, a large perenn ...
7 hours ago |
3.7 / 5 (3) |
0
|
Experts reveal how plants don't get sunburn
(PhysOrg.com) -- Experts at the University of Glasgow have discovered how plants survive the harmful rays of the sun.
10 hours ago |
5 / 5 (2) |
0
|
Protein libraries in a snap
(PhysOrg.com) -- A Rice University undergraduate will depart with not only a degree but also a possible patent for his invention of an efficient way to create protein libraries, an important component of biomolecular ...
14 hours ago |
5 / 5 (3) |
0
|
Miami battling invasion of giant African snails
No one knows how they got there. But an invasion of African giant snails has southern Florida in a panic over potential crop damage, disease and general yuckiness surrounding the slimy gastropods.
14 hours ago |
4 / 5 (1) |
4
Google users warned of threat to smartphone wallets
Users of Google smartphone wallets were being warned on Friday that there is a way to crack pass codes intended to thwart thieves from going on illicit shopping sprees.
Anonymous knocks CIA website offline (Update)
The website of the Central Intelligence Agency was inaccessible on Friday after the hacker group Anonymous claimed to have knocked it offline.
Complex wiring of the nervous system may rely on a just a handful of genes and proteins
Researchers at the Salk Institute have discovered a startling feature of early brain development that helps to explain how complex neuron wiring patterns are programmed using just a handful of critical genes. ...
New error-correcting codes guarantee the fastest possible rate of data transmission
Error-correcting codes are one of the triumphs of the digital age. Theyre a way of encoding information so that it can be transmitted across a communication channel such as an optical fiber o ...
Putting the squeeze on planets outside our solar system
(PhysOrg.com) -- Using high-powered lasers, scientists at Lawrence Livermore National Laboratory and collaborators discovered that molten magnesium silicate undergoes a phase change in the liquid state, abruptly ...
NASA sees wide-eyed cyclone Jasmine
Cyclone Jasmine's eye has opened wider on NASA satellite imagery, as it moves through the Southern Pacific Ocean.