New Stanford tool enables wider analyses of genome 'deep sequencing'

May 2, 2010

Life is almost unbearably complex. Humans and mice, frogs and flies toggle genes on and off in dizzying combinations and sequences during their relentless march from embryo to death. Now scientists seeking to understand the machinations of the proteins behind the genomic wizard's screen have a powerful new tool at their disposal, courtesy of researchers at the Stanford University School of Medicine.

Until now, researchers have relied on outdated methods of analysis to identify those involved in controlling when and how individual genes are expressed. Most often, those methods - capable of probing only specific, limited regions of the arising from a type of experiment called - led to the exclusive scrutiny of regions called promoters nestled near the start of the gene.

In contrast, the new Stanford-developed, web-based algorithm allows scientists to plumb the unprecedented depths of the data provided by new "deep-sequencing" techniques to reveal a pantheon of control regions for nearly any gene. The effect is like expanding a researcher's field of vision from a pencil-thin trained mainly on the regions near coding sequences to a sweeping spotlight illuminating the contributions of distant genomic regions.

"It used to be that people thought only the regions near the gene were important in controlling its function - in part because they had no way of assessing the impact of regions further away," said Gill Bejerano, PhD, assistant professor of developmental biology and of computer science at the medical school and Stanford's School of Engineering.

As a result, said Bejerano, researchers often cherry-picked nearby regions for further analysis based on their proximity or interest. "But when you're being that conservative with current sequencing capabilities, you're typically throwing away at least half of the data you so laboriously worked to obtain," he said.

Typically that data exists in the form of DNA binding sites for regulatory proteins called that dictate the activity of genes. And, with the advent of new, deep-sequencing techniques, it's being generated at rates that are both unimaginable and unmanageable.

Bejerano is the senior author of the research, which will be published online May 2 in Nature Biotechnology. The researchers coined the name "GREAT" for their algorithm, an acronym for "Genomic Regions Enrichment of Annotations Tool," and the website will be available for anyone to use after May 2 at http://great.stanford.edu

There are hundreds of known transcription factors. Each controls the expression of numerous genes by binding to specific regions in the genome. This makes it difficult for scientists to know exactly how any one transcription factor is acting, particularly if it works over long stretches of DNA. Usually they'll figure out where in the DNA the protein is binding and then look for interesting genes nearby. Or, conversely, they'll find an interesting gene and look for nearby transcription-factor binding sites. But recent research has shown that sections of DNA far away can also play an important role.

It works a bit like this: Think of your kitchen. Notice all the black things. Those are your transcription-factor binding sites. But what do they do? You might figure out that sliding the lever on the toaster makes the toast pop up. And plugging it into the wall makes it get hot. But you're likely to overlook that vitally important black breaker switch on the wall behind you, or to dismiss it as inconsequential among all the other black items in the room that don't, in fact, control the toaster. That is, unless you use this new analysis.

In contrast, users of the GREAT algorithm, developed by graduate students Cory McLean and Aaron Wenger and software engineer Dave Bristor, will simply enter a list of all the binding sites they've found throughout the genome for their transcription factor of interest. No prescreening is necessary, and the list can be hundreds or thousands of items long. Some will be biologically meaningful, and some will be experimental flukes. The software program will then provide an analysis revealing not only which genes that transcription factor is likely to moderate, both near and far, but also in which developmental or molecular pathways it is likely to function.

"The analysis gets pushed back into the hands of the person who did the experiment," said Bejerano. "Now you will start to see the kinds of results that we had expected with this much data." He and his collaborators found that test runs with well-known transcription factors verified the factors' association with the expression of particular genes, but also identified new, previously unsuspected alliances between binding sites and genes separated on the DNA by up to 1 million nucleotides.

"We've been asking the right questions, but using the wrong interpretation tools to answer them," said Bejerano. "We don't expect that this tool will help three labs. We expect that it will help 3,000 labs. GREAT can look at thousands of binding sites and tell you things that your transcription factor is doing that have never been reported before."

Provided by Stanford University Medical Center (news : web)


Rank 5 /5 (5 votes)
Relevant PhysicsForums posts
  • Mitosis
    created4 hours ago
  • Stem cell question.
    created6 hours ago
  • Protease cleavage
    created12 hours ago
  • Pertubance in a model
    created18 hours ago
  • Cancer drugs and Alzheimer's, Oh my!
    createdFeb 09, 2012
  • Squishing cells
    createdFeb 09, 2012
  • More from Physics Forums - Biology

More news stories

The power of estrogen -- male snakes attract other males

A new study has shown that boosting the estrogen levels of male garter snakes causes them to secrete the same pheromones that females use to attract suitors, and turned the males into just about the sexiest ...

Biology / Plants & Animals

created 16 hours ago | popularity 4.8 / 5 (6) | comments 1 | with audio podcast

Grass to gas: Researchers' genome map speeds biofuel development

Researchers at the University of Georgia have taken a major step in the ongoing effort to find sources of cleaner, renewable energy by mapping the genomes of two originator cells of Miscanthus x giganteus, a large perenn ...

Biology / Biotechnology

created 13 hours ago | popularity 3.8 / 5 (5) | comments 0 | with audio podcast

Miami battling invasion of giant African snails

No one knows how they got there. But an invasion of African giant snails has southern Florida in a panic over potential crop damage, disease and general yuckiness surrounding the slimy gastropods.

Biology / Ecology

created 20 hours ago | popularity 4 / 5 (1) | comments 4

Experts reveal how plants don't get sunburn

(PhysOrg.com) -- Experts at the University of Glasgow have discovered how plants survive the harmful rays of the sun.

Biology / Cell & Microbiology

created 16 hours ago | popularity 4.8 / 5 (5) | comments 0 | with audio podcast

Protein libraries in a snap

(PhysOrg.com) -- A Rice University undergraduate will depart with not only a degree but also a possible patent for his invention of an efficient way to create protein libraries, an important component of biomolecular ...

Biology / Cell & Microbiology

created 20 hours ago | popularity 4.8 / 5 (4) | comments 1 | with audio podcast


Anonymous knocks CIA website offline (Update)

The website of the Central Intelligence Agency was inaccessible on Friday after the hacker group Anonymous claimed to have knocked it offline.

Google users warned of threat to smartphone wallets

Users of Google smartphone wallets were being warned on Friday that there is a way to crack pass codes intended to thwart thieves from going on illicit shopping sprees.

New error-correcting codes guarantee the fastest possible rate of data transmission

Error-correcting codes are one of the triumphs of the digital age. They’re a way of encoding information so that it can be transmitted across a communication channel — such as an optical fiber o ...

Humans may have helped the decline of African rainforests 3000 years ago

(PhysOrg.com) -- Large areas of rainforests in Central Africa mysteriously disappeared over three thousand years ago, to be replaced by savannas. The prevailing theory has been that the cause was a change ...

New power source discovered

(PhysOrg.com) -- Researchers at the Massachusetts Institute of Technology (MIT) and RMIT University have made a breakthrough in energy storage and power generation.

Small modular reactor design could be a 'SUPERSTAR'

(PhysOrg.com) -- Though most of today's nuclear reactors are cooled by water, we've long known that there are alternatives; in fact, the world's first nuclear-powered electricity in 1951 came from a reactor ...