25 years of conventional evaluation of data analysis proves worthless in practice

September 3, 2008

So-called 'intelligent' computer-based methods for classifying patient samples, for example, have been evaluated with the help of two methods that have completely dominated research for 25 years. Now Swedish researchers at Uppsala University are revealing that this methodology is worthless when it comes to practical problems. The article is published in the journal Pattern Recognition Letters.

Today there is rapidly growing interest in 'intelligent' computer-based methods that use various classes of measurement signals, from different patient samples, for instance, to create a model for classifying new observations. This type of method is the basis for many technical applications, such as recognition of human speech, images, and fingerprints, and is now also beginning to attract new fields such as health care.

"Especially in applications in which faulty classification decisions can lead to catastrophic consequences, such as choosing the wrong form of therapy for treating cancer, it is extremely important to be able to make a reliable estimate of the performance of the classification model," explains Mats Gustafsson, Professor of signal processing and medical bioinformatics at Uppsala University, who co-directed the new study together with Associate Professor Anders Isaksson.

To evaluate the performance of a classification model, one normally tests it on a number of trial examples that have never been involved in the design of the model. Unfortunately there are seldom tens of thousands of test examples available for this type of evaluation. In biomedicine, for instance, it is often expensive and difficult to collect the patient samples needed, especially if one wishes to analyze a rare disease. To solve this problem, many different methods have been proposed. Since the 1980s two methods have completely dominated research, namely, cross validation and resampling/bootstrapping.

"This has entailed that the performance assessment of virtually all new methods and applications reported in the scientific literature in the last 25 years has been carried out using one of these two methods," says Mats Gustafsson.

In the new study, the Uppsala researchers use both theory and convincing computer simulations to show that this methodology is worthless in practice when the total number of examples is small in relation to the natural variation that exists among different observations. What is considered a small number depends in turn on what problem is being studied-­in other words, it is impossible to determine whether the number of examples is sufficient.

"Our main conclusion is that this methodology cannot be depended on at all, and that it therefore needs to be immediately replaces by Bayesian methods, for example, which can deliver reliable measures of the uncertainty that exists. Only then will multivariate analyses be in any position to be adopted in such critical applications as health care," says Mats Gustafsson.

Source: Uppsala University


print this article email this article download pdf blog this article bookmark this article     Stumble it Digg this share on Facebook retweet share on Reddit add to delicious
Rate this story - 4.3 /5 (6 votes)


September 3, 2008 all stories

Comments: 0

4.3 /5 (6 votes)
  • Stumble this up

  • Digg this

  • share this

  • hide
  • Related Stories

  • New search technique for images and videos has broad applications
    created 2 hours ago | popularity not rated yet | comments 0
  • Making Climate Forecasts More Useful to Farmers
    created Nov 09, 2009 | popularity not rated yet | comments 0
  • Software cos. eye key patent case in Supreme Court
    created Nov 08, 2009 | popularity not rated yet | comments 0
  • New methods are changing old materials
    created Oct 28, 2009 | popularity not rated yet | comments 0
  • How white is a paper?
    created Oct 22, 2009 | popularity not rated yet | comments 0



  • hide
  • Relevant PhysicsForums posts

  • Controling/Reading a CDROM drive.
    created 6 hours ago
  • casio calculator that's similar to TI-89
    created Nov 08, 2009
  • Advice on what cell phone to get
    created Nov 08, 2009
  • Changing the language options on your phone.
    created Nov 03, 2009
  • More from Physics Forums - Computing & Technology

Other News

LinkedIn was launched in 2003 as an online community for people to advance career and job prospects

Twitter links to LinkedIn

Technology / Internet

created 50 minutes ago | popularity not rated yet | comments 0

Twitter on Tuesday linked to LinkedIn, letting people share updates and tweets between the hot microblogging service and the career-oriented online social networking website.


Members of the media are given a demonstration of the Kindle DX

Amazon delivers Kindle books to PCs

Technology / Software

created 30 minutes ago | popularity 3 / 5 (1) | comments 0

Amazon.com on Tuesday released free software that lets people read the online retail titan's electronic Kindle books on personal computers.


Adobe Systems announced on Tuesday it was cutting some 680 jobs worldwide

Adobe cutting 680 jobs

Technology / Business

created 20 minutes ago | popularity not rated yet | comments 0

Adobe Systems, known for its Photoshop editing program and Acrobat document software, announced on Tuesday it was cutting some 680 jobs worldwide, about nine percent of its workforce.


Inventing language

Inventing language

Technology / Engineering

created 40 minutes ago | popularity not rated yet | comments 0

(PhysOrg.com) -- Last Thursday, the day after the New York Yankees won their first World Series of the 21st century, MIT Institute Professor Barbara Liskov, the 2008 recipient of the Turing Award — frequently ...


New 'finFETS' promising for smaller transistors, more powerful chips

New 'finFETs' promising for smaller transistors, more powerful chips

Technology / Semiconductors

created 4 hours ago | popularity 5 / 5 (5) | comments 1

(PhysOrg.com) -- Purdue University researchers are making progress in developing a new type of transistor that uses a finlike structure instead of the conventional flat design, possibly enabling engineers ...