25 years of conventional evaluation of data analysis proves worthless in practice
September 3, 2008So-called 'intelligent' computer-based methods for classifying patient samples, for example, have been evaluated with the help of two methods that have completely dominated research for 25 years. Now Swedish researchers at Uppsala University are revealing that this methodology is worthless when it comes to practical problems. The article is published in the journal Pattern Recognition Letters.
Today there is rapidly growing interest in 'intelligent' computer-based methods that use various classes of measurement signals, from different patient samples, for instance, to create a model for classifying new observations. This type of method is the basis for many technical applications, such as recognition of human speech, images, and fingerprints, and is now also beginning to attract new fields such as health care.
"Especially in applications in which faulty classification decisions can lead to catastrophic consequences, such as choosing the wrong form of therapy for treating cancer, it is extremely important to be able to make a reliable estimate of the performance of the classification model," explains Mats Gustafsson, Professor of signal processing and medical bioinformatics at Uppsala University, who co-directed the new study together with Associate Professor Anders Isaksson.
To evaluate the performance of a classification model, one normally tests it on a number of trial examples that have never been involved in the design of the model. Unfortunately there are seldom tens of thousands of test examples available for this type of evaluation. In biomedicine, for instance, it is often expensive and difficult to collect the patient samples needed, especially if one wishes to analyze a rare disease. To solve this problem, many different methods have been proposed. Since the 1980s two methods have completely dominated research, namely, cross validation and resampling/bootstrapping.
"This has entailed that the performance assessment of virtually all new methods and applications reported in the scientific literature in the last 25 years has been carried out using one of these two methods," says Mats Gustafsson.
In the new study, the Uppsala researchers use both theory and convincing computer simulations to show that this methodology is worthless in practice when the total number of examples is small in relation to the natural variation that exists among different observations. What is considered a small number depends in turn on what problem is being studied-in other words, it is impossible to determine whether the number of examples is sufficient.
"Our main conclusion is that this methodology cannot be depended on at all, and that it therefore needs to be immediately replaces by Bayesian methods, for example, which can deliver reliable measures of the uncertainty that exists. Only then will multivariate analyses be in any position to be adopted in such critical applications as health care," says Mats Gustafsson.
Source: Uppsala University
-
Blunt nanostructures could make high-efficiency solar cells easier to fabricate
Jan 24, 2012 |
4.9 / 5 (7) |
9
-
Designing wildlife corridors in the digital age
Jan 20, 2012 |
5 / 5 (1) |
0
-
Sony's next-generation 55-inch 'Crystal LED Display' prototype presented at CES
Jan 11, 2012 |
5 / 5 (3) |
2
-
Entomologists devise new test for insecticide resistance
Jan 10, 2012 |
not rated yet |
0
-
Remembrance of things future: Long-term memory sets the stage for visual perception
Dec 28, 2011 |
3.6 / 5 (13) |
4
-
Fast photon control brings quantum photonic technologies closer
9 hours ago |
5 / 5 (4) |
0
-
Engineers build first sub-10-nm carbon nanotube transistor
Feb 01, 2012 |
4.9 / 5 (33) |
30
-
Something old, something new: Evolution and the structural divergence of duplicate genes
Jan 31, 2012 |
4.6 / 5 (7) |
1
-
The hidden nanoworld of ice crystals: Revealing the dynamic behavior of quasi-liquid layers
Jan 30, 2012 |
5 / 5 (5) |
1
-
Stock market network reveals investor clustering
Jan 27, 2012 |
3.9 / 5 (23) |
8
-
Digital scratch pad?
1 hour ago
-
Quantum computer faster than regular computer?
12 hours ago
-
Flushing RAM in Mathematica
17 hours ago
-
Synergistic relations between computer science and technology.
Feb 06, 2012
-
how do iphone gloves work?
Feb 05, 2012
-
iPhone battery over time
Jan 30, 2012
- More from Physics Forums - Computing & Technology
More news stories
Music service gives Myspace second wind
Faded online social network Myspace said Monday it was getting a second wind due to the popularity of a freshly launched online music player.
2 hours ago |
5 / 5 (1) |
0
US, EU clear Google's $12.5B Motorola Mobility bid (Update)
Google's $12.5 billion acquisition of cellphone maker Motorola Mobility have won approvals from U.S. and European antitrust regulators, moving Google a major step closer to completing the biggest deal in its ...
3 hours ago |
5 / 5 (1) |
0
Apple shares close over $500
Apple shares surged past $500 for the first time on Wall Street on Monday, powered by reports a new iPad may be unveiled next month.
1 hour ago |
not rated yet |
0
Computer programs that think like humans
Intelligence what does it really mean? In the 1800s, it meant that you were good at memorising things, and today intelligence is measured through IQ tests where the average score for humans is 100. ...
Technology / Computer Sciences
2 hours ago |
not rated yet |
1
EU executive defends contested online piracy pact
The European Commission on Monday defended a global online-piracy pact opposed by some EU states and still to be ratified by the European Parliament.
3 hours ago |
not rated yet |
0
First-of-its-kind stem cell study re-grows healthy heart muscle in heart attack patients
Results from a Cedars-Sinai Heart Institute clinical trial show that treating heart attack patients with an infusion of their own heart-derived cells helps damaged hearts re-grow healthy muscle.
Scientists discover reason for Mt. Hood's non-explosive nature
(PhysOrg.com) -- For a half-million years, Mount Hood has towered over the landscape, but unlike some of its cousins in Oregons Cascade Mountains and many other volcanoes around the Pacific Rim ...
Discovery paves way for salmonella vaccine
(Medical Xpress) -- An international research team led by a University of California, Davis, immunologist has taken an important step toward an effective vaccine against salmonella, a group of increasingly antibiotic-resistant ...
Time of year important in projections of climate change effects on ecosystems
(PhysOrg.com) -- Does it matter whether long periods of hot weather, such as last year's heat wave that gripped the U.S. Midwest, happen in June or July, August or September?
Smoking bans lead to less, not more, smoking at home: study
Smoking bans in public/workplaces don't drive smokers to light up more at home, suggests a study of four European countries with smoke free legislation, published online in Tobacco Control.
Ovarian cancer arises in fallopian tube of knockout mice
(Medical Xpress) -- The most deadly form of "ovarian" cancer arises in the fallopian tubes not the ovaries of knockout mice that lack two genes associated with the disease, said researchers led by Baylor College ...