New tool enables powerful data analysis

January 8, 2009
New tool enables powerful data analysis

Enlarge

After using the algorithm to determine the filament structure of an aerogel -- a lightweight foam used in shielding electronic equipment in satellites -- the researchers were able to compute changes to its structural integrity by the simulated impact of a micrometeorite traveling at 10,000 miles per hour (red sphere on left). Image: Attila Gyulassy/UC Davis Copyright UC Regents

(PhysOrg.com) -- A powerful computing tool that allows scientists to extract features and patterns from enormously large and complex sets of raw data has been developed by scientists at University of California, Davis, and Lawrence Livermore National Laboratory. The tool - a set of problem-solving calculations known as an algorithm - is compact enough to run on computers with as little as two gigabytes of memory.

The team that developed this algorithm has already used it to probe a slew of phenomena represented by billions of data points, including analyzing and creating images of flame surfaces; searching for clusters and voids in a virtual universe experiment; and identifying and tracking pockets of fluid in a simulated mixing of two fluids.

"What we've developed is a workable system of handling any data in any dimension," said Attila Gyulassy, who led the five-year development effort while pursuing a PhD in computer science at UC Davis. "We expect this algorithm will become an integral part of a scientist's toolbox to answer questions about data."

A paper describing the new algorithm was published in the November-December issue of IEEE Transactions on Visualization and Computer Graphics.
Mixing of Fluids
Enlarge

This image of an early moment in the simulated mixing of two fluids was created by researchers using a powerful new algorithm they developed to extract features and patterns from massive data sets. In the image, blue and red spheres and the lines between them represent the branching of pockets of fluid. Image: Attila Gyulassi/UC Davis copyright UC Regents


Computers are widely used to perform simulations of real-world phenomena and to capture results of physical experiments and observations, storing this information as collections of numbers. But as the size of these data sets has burgeoned, hand-in-hand with computer capacity, analysis has grown increasingly difficult.

A mathematical tool to extract and visualize useful features from data sets has existed for nearly 40 years - in theory. Called the Morse-Smale complex, it partitions sets by similarity of features and encodes them into mathematical terms. But working with the Morse-Smale complex is not easy. "It's a powerful language. But a cost of that, is that using it meaningfully for practical applications is very difficult," Gyulassy said.

Gyulassy's algorithm divides data sets into parcels of cells, then analyzes each parcel separately using the Morse-Smale complex. Results of those computations are then merged together. As new parcels are created from merged parcels, they are analyzed and merged yet again. At each step, data that do not need to be stored in memory are discarded, drastically reducing the computing power required to run the calculations.

One of Gyulassy's tests of the algorithm was to use it to analyze and track the formation and movement of pockets of fluid in the simulated mixing of two fluids: one dense, one light. The complexity of this data set is so vast - it consists of more than one billion data points on a three-dimensional grid - it challenges even supercomputers, Gyulassy said. Yet the new algorithm with its streamlining features was able to perform the analysis on a laptop computer with just two gigabytes of memory. Although Gyulassy had to wait nearly 24 hours for the little machine to complete its calculations, at the end of this process he could pull up images in mere seconds to illustrate phenomena he was interested in, such as the branching of fluid pockets in the mixture.

Two main factors are driving the need for analysis of large data sets, said co-author Bernd Hamann: a surge in the use of powerful computers that can produce huge amounts of data, and an upswing in affordability and availability of sensing devices that researchers deploy in the field and lab to collect a profusion of data.

"Our data files are becoming larger and larger, while the scientist has less and less time to understand them," said Hamann, a professor of computer science and associate vice chancellor for research at UC Davis. "But what are the data good for if we don't have the means of applying mathematically sound and computationally efficient computer analysis tools to look for what is captured in them?"

Gyulassy is currently developing software that will allow others to put the algorithm to use. He expects the learning curve to be steep for this open-source product, "but if you just learn the minimal amount about what a Morse-Smale complex is," he said, "it will be pretty intuitive."

Source: University of California - Davis

4.4 /5 (18 votes)  

Filter


Move the slider to adjust rank threshold, so that you can hide some of the comments.


Display comments: newest first

googleplex
Jan 08, 2009

Rank: not rated yet
This sounds almost too good to be true. Anyone got a link to the paper?
earls
Jan 08, 2009

Rank: not rated yet
jrdewoodjr
Jan 08, 2009

Rank: not rated yet
i'm going to use this as a primary story element> very slick resonant flow curls!
kepler
Jan 11, 2009

Rank: not rated yet
http://ieeexplore...=4658183&isnumber=4658121
$29 for non-members
Rank 4.4 /5 (18 votes)
Relevant PhysicsForums posts

More news stories

New integrated building model may improve fish farming operations

Today's "locavore" movement with its emphasis on eating more locally-produced food is a natural fit for fruits and vegetables in nearly every region, but few entrepreneurs have dared to apply the concept to ...

Technology / Engineering

created 37 minutes ago | popularity not rated yet | comments 0

FBI file: Steve Jobs was considered for govt post

(AP) -- FBI background interviews of some people who knew Apple co-founder Steve Jobs reveal a man driven by power and alienating some of the people who worked with him.

Technology / Business

created 25 minutes ago | popularity 1 / 5 (1) | comments 0

LinkedIn's 4Q earnings strong, revenue doubles

(AP) -- LinkedIn reported a strong fourth quarter as the online professional-networking service added 14 million members. Its net income and revenue beat Wall Street's expectations.

Technology / Business

created 22 minutes ago | popularity not rated yet | comments 0

First Google hire leaving for online academy

The first person hired by Google's founders is leaving the Internet giant to devote himself to an innovative online education website called Khan Academy.

Technology / Internet

created 18 minutes ago | popularity not rated yet | comments 0

NY attorney general ends lawsuit against Intel

(AP) -- Intel Corp. is paying $6.5 million as part of a deal to terminate an antitrust lawsuit filed against the chip maker by the New York attorney general's office.

Technology / Business

created 16 minutes ago | popularity not rated yet | comments 0


'Dark plasmons' transmit energy

Microscopic channels of gold nanoparticles have the ability to transmit electromagnetic energy that starts as light and propagates via "dark plasmons," according to researchers at Rice University.

Cell biologists describes mechanism by which some people may be more susceptible to colon cancer

An international research team led by cell biologists at the University of California, Riverside has uncovered a new insight into colon cancer, the third leading cause of cancer-related deaths in the United ...

New method makes culture of complex tissue possible in any lab

Scientists at the University of California, San Diego have developed a new method for making scaffolds for culturing tissue in three-dimensional arrangements that mimic those in the body. This advance, published online in ...

Anyone can learn to be more inventive, cognitive researcher says

There will always be a wild and unpredictable quality to creativity and invention, says Anthony McCaffrey, a cognitive psychology researcher at the University of Massachusetts Amherst, because an "Aha moment" is rare and ...

NASA's TRMM satellite sees Cyclone Jasmine in 3-D

Data from NASA's TRMM satellite was used to create a 3-Dimensional look at Cyclone Jasmine, currently moving through the South Pacific Ocean.

FDA outlines path for lower-priced biotech drugs

(AP) -- The Food and Drug Administration is preparing to review the first lower-cost versions of biotech drugs, expensive medications which have never before faced generic competition.