Brown mathematicians prove new way to build a better estimate
February 29, 2008
How do you sift through hundreds of billions of bits of information and make accurate inferences from such gargantuan sets of data? Brown University mathematician Charles “Chip” Lawrence and graduate student Luis Carvalho have arrived at a fresh answer with broad applications in science, technology and business.
In new work published in the Proceedings of the National Academy of Sciences, Lawrence and Carvalho describe a new class of statistical estimators and prove four theorems concerning their properties. Their work shows that these “centroid” estimators allow for better statistical predictions – and, as a result, better ways to extract information from the immense data sets used in computational biology, information technology, banking and finance, medicine and engineering.
“What’s exciting about this work – what makes it every scientist’s dream – is that it’s so fundamental,” Lawrence said. “These new estimators have applications in biology and beyond and they advance a statistical method that’s been around for decades.”
For more than 80 years, one of the most common methods of statistical prediction has been maximum likelihood estimation (MLE). This method is used to find the single most probable solution, or estimate, from a set of data.
But new technologies that capture enormous amounts of data – human genome sequencing, Internet transaction tracking, instruments that beam high-resolution images from outer space – have opened opportunities to predict discrete “high dimensional” or “high-D” unknowns. The huge number of combinations of these “high-D” unknowns produces enormous statistical uncertainty. Data has outgrown data analysis.
This discrepancy creates a paradox. Instead of producing more precise predictions about gene activity, shopping habits or the presence of faraway stars, these large data sets are producing less reliable predictions under current procedures. That’s because maximum likelihood estimators use data to identify the single most probable solution. But because any one solution swims in an increasingly immense sea of alternatives, it’s not likely to be representative.
Lawrence, a professor of applied mathematics and a faculty member in the Center for Computational Molecular Biology at Brown, first came upon this paradox and a potential way around it while working on predicting the structure of RNA molecules. If you want to predict the structure of these molecules – how the molecule will look when it folds onto itself – you’d have billions and billions of possible shapes to choose from.
“Using maximum likelihood estimation, the most likely outcome would be very, very, very unlikely,” Lawrence said, “so we knew we needed a better estimation method.”
Lawrence and Carvalho used statistical decision theory to understand the limitations of the old procedure when faced with new “high-D” problems. They also used it to find an estimation procedure that applies to a broad range of statistical problems. These “centroid” estimators identify not the single most probable solution, but the solution that is most representative of all the data in a set.
Lawrence and Carvalho went on to prove four theorems that illustrate the favorable properties of these estimators and show that they can be easily computed in many important applications.
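The difference between "most probable" and "most representative" can be seen in a toy case. The sketch below is illustrative only and is not taken from the paper: the probabilities are made up, the function names are hypothetical, and "most representative" is assumed to mean the candidate with the smallest expected Hamming distance (number of differing bits) to a solution drawn from the distribution.

```python
from itertools import product

# Hypothetical distribution over 4-bit solutions (illustrative numbers only).
posterior = {
    (0, 0, 0, 0): 0.300,
    (1, 1, 1, 0): 0.175,
    (1, 1, 0, 1): 0.175,
    (1, 0, 1, 1): 0.175,
    (0, 1, 1, 1): 0.175,
}


def hamming(a, b):
    """Number of positions at which two bit vectors differ."""
    return sum(x != y for x, y in zip(a, b))


def expected_hamming_loss(candidate, dist):
    """Average Hamming distance from candidate to a solution drawn from dist."""
    return sum(p * hamming(candidate, state) for state, p in dist.items())


def most_probable(dist):
    """The single most probable solution."""
    return max(dist, key=dist.get)


def centroid(dist, n_bits):
    """Brute force: the candidate with the smallest expected Hamming loss
    (assumed definition of 'most representative' for this sketch)."""
    candidates = product((0, 1), repeat=n_bits)
    return min(candidates, key=lambda c: expected_hamming_loss(c, dist))


def centroid_by_marginals(dist, n_bits):
    """Shortcut for unconstrained bit vectors: set each bit to 1 when its
    marginal probability exceeds one half."""
    marginals = [
        sum(p for state, p in dist.items() if state[i] == 1)
        for i in range(n_bits)
    ]
    return tuple(int(m > 0.5) for m in marginals)


if __name__ == "__main__":
    print("most probable:", most_probable(posterior))
    print("centroid (brute force):", centroid(posterior, 4))
    print("centroid (marginals):", centroid_by_marginals(posterior, 4))
```

In this made-up example the single most probable solution is all zeros, even though 70 percent of the probability sits on solutions that are mostly ones. The centroid comes out as all ones, which differs from a typical solution in fewer bits on average. The marginal shortcut hints at why such estimators can be cheap to compute, since it avoids enumerating every candidate.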
“This new procedure should benefit any field that needs to reliably make predictions of large-scale, high-D unknowns,” Lawrence said.
Source: Brown University