Petacache: Use that Memory

March 7, 2006
Petacache: Use that Memory

SLAC's Computerraum

For decades, high energy experimental physicists have struggled with a fundamental problem: they simply have too much data to analyze quickly and in its entirety.

BaBar researchers routinely wait nine months for computers to sift through large datasets, searching for interesting events and setting these aside for later analysis. This “data skimming” alone constantly uses about 50 percent of BaBar's computing power. And that’s before a researcher can even start analyzing her or his data. Preparing data from CERN's Large Hadron Collider (LHC) will only take longer.

Recognizing this widespread limitation, a team at SLAC is developing the PetaCache project, a new way of thinking about data access and storage. With new computer software and more efficient types of memory, PetaCache may significantly increase the speed of data analysis.

"PetaCache may help scientists change the way they think about exploring new ideas," said PetaCache project manager Randal Melen. "It will allow a physicist with a sudden new idea, an 'I wonder if…' moment, to quickly begin exploring that new idea."

Before the early 1990s, researchers analyzed much of their data from magnetic tape, having their computers spool through miles of it to find interesting events. As disk drives got larger and cheaper, and with the rise of computer clusters, much more of the data could be kept on disk. Yet these disks still required mechanical movement, limiting the speed at which researchers could begin accessing data. Computer technology has made great strides in speeding up the movement of data—called bandwidth—but the time to get the first byte of data—called latency—has been much slower to improve. "PetaCache, then, is really about improving the latency of testing new ideas," said Melen.

To do this, PetaCache uses several types of memory, not disks. Although memory is much faster at getting this first byte of data, in the past it has been too expensive to buy in the quantities necessary to record and analyze the massive amounts of data taken at particle accelerators. Today, DRAM (Dynamic Random Access Memory) and flash memory are more affordable, and flash memory is expected to continue to drop in price as it is used more and more in consumer electronics such as digital cameras, iPod-like devices, and cell phones. If successful, the PetaCache project will allow researchers to use both DRAM and flash memory on a large scale.

The prototype PetaCache system comprises two racks of 64 server computers, each with 16 gigabytes of DRAM for a total of one terabyte of memory. This large yet fragmented amount of memory is linked together with SCALLA (Structured Cluster Architecture for Low Latency Access), a computer program developed by SCCS Software Developer Andy Hanushevsky. SCALLA moves data from data servers to batch systems running physics analysis software with the lowest possible latencies. This load-balancing, self-organizing software distributes data across many data servers efficiently, making the individual machines appear as one huge chunk of memory to SCALLA-aware physics applications.

"The software makes good use of common hardware, so you don't have to make huge expenditures for great computing power," said Hanushevsky.

Right now, SLAC’s prototype system has one terabyte (1,000 gigabytes) of DRAM memory. With their next machine, the PetaCache team hopes to mainly use less expensive flash memory which, according to SCCS Director Richard Mount, "holds future promise of cost-effective memory-based data-analysis systems."

This second-generation prototype will aim at a few tens of terabytes of flash memory, which would make the system useful to BaBar and LSST researchers. In the next decade, the PetaCache team hopes to expand the system to a petabyte (1,000 terabytes). This is around the scale of what is needed to be useful at the LHC.

"Over the next few years, this type of memory technology will become much more common, from BaBar to the LHC to banks and airline reservation systems," said Research Director Emeritus David Leith. "They all benefit from being able to work from memory."

Source: Stanford Linear Accelerator Center, by Kelen Tuttle

4.5 /5 (18 votes)  

Rank 4.5 /5 (18 votes)
Tags

Relevant PhysicsForums posts

More news stories

Explained: Sigma

It's a question that arises with virtually every major new finding in science or medicine: What makes a result reliable enough to be taken seriously? The answer has to do with statistical significance -- but ...

Physics / General Physics

created Feb 09, 2012 | popularity 5 / 5 (18) | comments 59

Quantum physicist explains $100K offer for proof scaled-up quantum computing is impossible

(PhysOrg.com) -- MIT researcher Scott Aaronson has certainly riled the physics community with his offer this past Friday, of $100,000 to anyone who can prove that scaled-up quantum computing is impossible. ...

Physics / Quantum Physics

created Feb 08, 2012 | popularity 4.2 / 5 (13) | comments 34 | with audio podcast weblog

Diamond light, brighter than the sun

It’s the size of five football pitches and generates light 10 billion times brighter than the sun. As the Diamond Light Source celebrates its tenth anniversary this year, Penny Bailey visits one of the ...

Physics / General Physics

created Feb 07, 2012 | popularity 4.3 / 5 (7) | comments 15 | with audio podcast

Physicists 'record' magnetic breakthrough

An international team of scientists has demonstrated a revolutionary new way of magnetic recording which will allow information to be processed hundreds of times faster than by current hard drive technology.

Physics / General Physics

created Feb 07, 2012 | popularity 4.5 / 5 (39) | comments 14 | with audio podcast

Hints of the Higgs - papers are submitted

Back in December 2011, the ATLAS and CMS experiments at CERN presented some exciting results that provided tantalising hints of the Higgs boson.

Physics / General Physics

created Feb 08, 2012 | popularity 4.7 / 5 (6) | comments 10


Walney offshore wind farm is world's biggest (for now)

(PhysOrg.com) -- The Walney wind farm on the Irish Sea--characterized by high tides, waves and windy weather--officially opened this week. The farm is treated in the press as a very big deal as the Walney ...

GPS court ruling leaves US phone tracking unclear

A US Supreme Court decision requiring a warrant to place a GPS device on the car of a criminal suspect leaves unresolved the bigger issue of police tracking using mobile phones, legal experts say.

Europeans protest controversial Internet pact

Tens of thousands of people marched in protests in more than a dozen European cities Saturday against a controversial anti-online piracy pact that critics say could curtail Internet freedom.

Anonymous briefly knocks CIA website offline (Update 2)

The website of the Central Intelligence Agency was briefly inaccessible on Friday after the hacker group Anonymous claimed to have knocked it offline.

Study finds that anti-diabetic medication can prevent the long-term effects of maternal obesity

In a study to be presented today at the Society for Maternal-Fetal Medicine's annual meeting, The Pregnancy Meeting, in Dallas, Texas, researchers will report findings that show that short therapy with the anti-diabetic medication ...

Europe stakes billion-dollar bet on new rocket

A pencil-slim rocket is scheduled to lift into space from South America on Monday, carrying a billion-dollar bet that Europe can grab a juicy slice of the market to place satellites in low orbit.