E-science effort will try to tame data torrents
April 5, 2007
Research by design: Schematic diagram of geophysical data workflows. Credit: USC Information Sciences Institute
A growing number of scientific fields suffer from a stifling embarassment of riches: data pile up much faster than they can be analyzed. A team of researchers at the University of Southern California's Information Sciences Institute is now building a prototype of a system that will address the problem by automating scientific workflows.
ISI computer scientist Yolanda Gil leads the newly funded $13.8 million Windward project, aiming at (in its full title) "Scaleable Knowledge Discovery through Grid Workflows."
Gil says that in fields like climatology, high energy physics and seismic modeling “our ability to gather data is surpassing our ability to analyze it. Our data warehouses are becoming data graveyards.”
In a sense, Windward will bring to the analysis of scientific problems an approach similar to that of industrial engineering, where engineers create optimal workflows to bring together raw material and machinery in the most efficient fashion to create product.
The product in modern science is not a physical item like an automobile or computer, but rather a model, or an understanding. But efficient workflows to create it are equally critical — and, because the raw material is information, not matter, much more automatable.
Gil and ISI collaborator Ewa Deelman co-chaired an NSF workshop on the subject in May 2006.
"Significant scientific advances today are achieved through complex distributed scientific computations," their overview for this workshop noted. "These computations, often represented as workflows of executable jobs and their associated dataflow may be composed of thousands of steps that integrate diverse models and data sources.”
The workshop held out the possibility of computer science being able to channel this waterfall of data into orchestrated workflows, leading to recommendations for "basic work in computer science to create a science of workflows," and suggested that scientists proactively build workflow architecture into their research plans: "workflow representations that capture scientific analysis at all levels should become the norm when complex distributed scientific computations are carried out."
Windward is an effort by Gil, who is principal Investigator and Project Leader of the ISI Interactive Knowledge Capture research group, Deelman, and two fellow ISI project leaders, Paul Cohen and Carl Kesselman.
They believe they can accomplish this ambitious task by integrating two longtime ISI specialties, artificial intelligence and grid computing.
AI tries to give computers power to respond accurately and appropriately to changing and novel circumstances, bringing multiple concerns to bear on the problem of making the right choice from a number of alternatives.
Cohen will build on his work at the ISI Center for Research in Unexpected Events (CRUE), which has focused on AI systems for complex data analysis. Cohen has been working specifically in the area of AI analysis of scientific data for years, publishing papers on "Intelligent Assistance for Computational Scientists: Integrated Modeling, Experimentation, and Analysis" ten years ago with work on planning systems going even farther back.
He has also studied the history of science in certain field to try to see patterns in the process of discovery, work that underlies the approach.
In order for AI systems to automate processes and provide assistance to scientists in defining workflows of complex computations, they need to have the world carefully structured and described.
Gil has long been active in developing the semantic web, which creates a digital universe that AI can explore and understand, and which will be a building block of the Windward system.
Previous AI systems have been much, much smaller than the regional, national and even intercontinental data structures needed to do workflow science.
This is where grid computing, and Deelman and Kesselman come in. Since 1996, Kesselman has been perfecting the Globus software that allows multiple users in multiple locations secure and easy and transparent access not just to raw data, but also to resources (computers) to process the data.
Linking to grid computing software, Deelman and her collaborators have developed a workflow management system, called Pegasus, that maps large numbers of computations to distributed resources while optimizing the overall performance of the application.
Deelman will continue evolve Pegasus, which has already been successfully used in applications in the fields of astronomy, earthquake science, gravitational-wave physics, and others.
The AI and grid computing groups at ISI have been collaborating in the area of scientific workflows for several years now, with notable results in earthquake science in joint work with the Southern California Earthquake Center.
In the Windward project, they will develop new workflow techniques to represent complex algorithms and their subtle differences so that they can be automatically selected and configured to satisfy the stated application requirements.
They will also investigate mechanisms to support autonomous and robust execution of concurrent workflows over continuously changing data.
In addition, they will develop learning techniques to improve the performance of the workflow system by exploiting an episodic memory of prior workflow executions.
Source: University of Southern California
-
New application makes supercomputing simple
Dec 12, 2011 |
5 / 5 (1) |
1
-
Terapixel Project: Lots of Data, Expertise
Jul 19, 2010 |
4.9 / 5 (14) |
4
-
New software helps researchers find meaning in massive scientific data sets
May 24, 2010 |
4.1 / 5 (7) |
0
-
Putting services at the heart of tomorrow's software
Jun 26, 2006 |
2.8 / 5 (4) |
0
-
UK project discovers new knowledge about earthquakes
Nov 23, 2005 |
3.8 / 5 (5) |
0
-
Engineers build first sub-10-nm carbon nanotube transistor
Feb 01, 2012 |
4.9 / 5 (28) |
26
-
Something old, something new: Evolution and the structural divergence of duplicate genes
Jan 31, 2012 |
4.6 / 5 (7) |
1
-
The hidden nanoworld of ice crystals: Revealing the dynamic behavior of quasi-liquid layers
Jan 30, 2012 |
5 / 5 (3) |
1
-
Stock market network reveals investor clustering
Jan 27, 2012 |
4 / 5 (22) |
8
-
Of microchemistry and molecules: Electronic microfluidic device synthesizes biocompatible probes
Jan 26, 2012 |
5 / 5 (1) |
0
-
Synergistic relations between computer science and technology.
Feb 06, 2012
-
how do iphone gloves work?
Feb 05, 2012
-
iPhone battery over time
Jan 30, 2012
-
Best alternate Tablet to an iPad for writing math or physics equations?
Jan 26, 2012
-
Sending SMS to a website
Jan 20, 2012
-
Need help with my technical fest!
Jan 19, 2012
- More from Physics Forums - Computing & Technology
More news stories
Nicira promises virtual networks will transform networking
(PhysOrg.com) -- For the past four years, founders of the start-up company Nicira have been developing cutting-edge software that they predict will transform the networking technology underlying the Internet. ...
Navy to begin tests on electromagnetic railgun prototype launcher
The Office of Naval Research (ONR)'s Electromagnetic (EM) Railgun program will take an important step forward in the coming weeks when the first industry railgun prototype launcher is tested at a facility ...
12 hours ago |
4.5 / 5 (10) |
38
|
After Megaupload closure, BTJunkie shuts down
BTJunkie, a popular file-sharing indexing site, said Monday it was voluntarily shutting down, less than three weeks after the US closure of Megaupload in a crackdown on piracy of music, films and other materials.
12 hours ago |
5 / 5 (3) |
6
Bigger US role against companies' cyberthreats?
(AP) -- A developing Senate plan that would bolster the government's ability to regulate the computer security of companies that run critical industries is drawing strong opposition from businesses that say ...
15 hours ago |
5 / 5 (2) |
7
Solvay hails world's largest fuel cell of type in Flanders, one can power 1,400 homes
Chemicals giant Solvay hailed Monday the successful entry into service in Flanders of what it said was the largest fuel cell of its type in the world.
Technology / Energy & Green Tech
16 hours ago |
4.6 / 5 (5) |
5
Our Amorphophallus is smaller: New plant species from Madagascar smells like roadkill
The famed "corpse flower" plant known for its giant size, rotten-meat odor and phallic shape has a new, smaller relative: A University of Utah botanist discovered a new species of Amorphophallus that i ...
Invasive alien predator causes rapid declines of European ladybirds
A new study provides compelling evidence that the arrival of the invasive non-native harlequin ladybird to mainland Europe and subsequent spread has led to a rapid decline in historically-widespread species ...
New findings highlight the benefit of exercise ECGs just as they are being scrapped
In the UK, the exercise electrocardiogram (ECG) is the most common initial test for the evaluation of stable chest pain and has been used widely for almost half a century. However, recent NICE guidelines recommend that it ...
Counties with thriving small businesses have healthier residents, researchers find
Counties and parishes with a greater concentration of small, locally-owned businesses have healthier populations with lower rates of mortality, obesity and diabetes than do those that rely on large companies ...
Long-term study shows epilepsy surgery improves seizure control and quality of life
While epilepsy surgery is a safe and effective intervention for seizure control, medical therapy remains the more prominent treatment option for those with epilepsy. However, a new 26-year study reveals that following epilepsy ...
New DVT guidelines: No evidence to support 'economy class syndrome'
New evidence-based guidelines from the American College of Chest Physicians (ACCP) address the many risk factors for developing a deep vein thrombosis (DVT), or blood clot, as the result of long-distance travel. These risk ...