Computer vision may not be as good as thought

January 25, 2008 Computer vision may not be as good as thought

The human brain easily recognizes that these cars are all the same object, but the variations in the car's size, orientation and position are a challenge for computer-vision algorithms. Image / Nicolas Pinto

For years, scientists have been trying to teach computers how to see like humans, and recent research has seemed to show computers making progress in recognizing visual objects. A new MIT study, however, cautions that this apparent success may be misleading because the tests being used are inadvertently stacked in favor of computers.

Computer vision is important for applications ranging from “intelligent” cars to visual prosthetics for the blind. Recent computational models show apparently impressive progress, boasting 60-percent success rates in classifying natural photographic image sets. These include the widely used Caltech101 database, intended to test computer vision algorithms against the variety of images seen in the real world.

However, James DiCarlo, a neuroscientist in the McGovern Institute for Brain Research at MIT, graduate student Nicolas Pinto and David Cox of the Rowland Harvard Institute argue that these image sets have design flaws that enable computers to succeed where they would fail with more authentically varied images. For example, photographers tend to center objects in a frame and to prefer certain views and contexts. The visual system, by contrast, encounters objects in a much broader range of conditions.

“The ease with which we recognize visual objects belies the computational difficulty of this feat,” explains DiCarlo, senior author of the study in the online Jan. 25 PLoS Computational Biology. “The core challenge is image variation. Any given object can cast innumerable images onto the retina depending on its position, distance, orientation, lighting and background.”

The team exposed the flaws in current tests of computer object recognition by using a simple “toy” computer model inspired by the earliest steps in the brain's visual pathway. Artificial neurons with properties resembling those in the brain's primary visual cortex analyze each point in the image and capture low-level information about the position and orientation of line boundaries. The model lacks the more sophisticated analysis that happens in later stages of visual processing to extract information about higher-level features of the visual scene such as shapes, surfaces or spaces between objects.

The researchers intended this model as a straw man, expecting it to fail as a way to establish a baseline. When they tested it on the Caltech101 images, however, the model did surprisingly well, with performance similar or better than five state-of-the-art object-recognition systems.

How could that be? “We suspected that the supposedly natural images in current computer vision tests do not really engage the central problem of variability, and that our intuitions about what makes objects hard or easy to recognize are incorrect,” Pinto explains.

To test this idea, the authors designed a more carefully controlled test. Using just two categories-planes and cars-they introduced variations in position, size and orientation that better reflect the range of variation in the real world.

“With only two types of objects to distinguish, this test should have been easier for the 'toy' computer model, but it proved harder,” Cox says. The team's conclusion: “Our model did well on the Caltech101 image set not because it is a good model but because the 'natural' images fail to adequately capture real-world variability.”

As a result, the researchers argue for revamping the current standards and images used by the computer-vision community to compare models and measure progress. Before computers can approach the performance of the human brain, they say, scientists must better understand why the task of object recognition is so difficult and the brain's abilities are so impressive.

Source: Massachusetts Institute of Technology


print this article email this article download pdf blog this article bookmark this article     Stumble it Digg this share on Facebook retweet share on Reddit add to delicious
Rate this story - 4.3 /5 (21 votes)


January 25, 2008 all stories

Comments: 0

4.3 /5 (21 votes)
  • Stumble this up

  • Digg this

  • share this

  • hide
  • Related Stories

  • New search technique for images and videos has broad applications
    created Nov 10, 2009 | popularity not rated yet | comments 0
  • Sony Develops High Frame Rate Single Lens 3D Camera Technology
    created Oct 01, 2009 | popularity not rated yet | comments 0
  • Reconstruct Mars automatically in minutes
    created Sep 18, 2009 | popularity not rated yet | comments 0
  • The robot children
    created Sep 15, 2009 | popularity not rated yet | comments 0
  • Researcher uncovers secrets of Kells 'angels'
    created Sep 02, 2009 | popularity not rated yet | comments 0



  • hide
  • Relevant PhysicsForums posts

  • Control System
    created Nov 24, 2009
  • Base Isolation Systems in Skyscrapers?
    created Nov 23, 2009
  • Need to interview a Computer Hardware Engineer for school project
    created Nov 23, 2009
  • transient heat transfer
    created Nov 23, 2009
  • More from Physics Forums - General Engineering

Other News

Should I buy a PC or Mac?

Technology / Software

created 5 minutes ago | popularity not rated yet | comments 0

Q. Our 6-year-old PC computer is dying a slow death and we are considering moving to a new iMac but have a few concerns. First, of all, we have several Word documents on our disk drive now that we want to keep and add to ...


ORNL 'deep retrofits' can cut home energy bills in half

ORNL 'deep retrofits' can cut home energy bills in half

Technology / Energy

created 3 hours ago | popularity 5 / 5 (1) | comments 0

(PhysOrg.com) -- Oak Ridge National Laboratory has announced plans to conduct a series of deep energy retrofit research projects with the potential to improve the energy efficiency in selected homes by as ...


Time Inc., Conde Nast and Hearst are preparing to launch an online newsstand described as an "iTunes for magazines"

Magazine publishers creating 'iTunes for magazines': reports

Technology / Internet

created 2 hours ago | popularity not rated yet | comments 0

US magazine publishers Time Inc., Conde Nast and Hearst are preparing to launch an online newsstand described as an "iTunes for magazines," according to published reports.


Design chosen for British 1,000 mph car

Design chosen for British 1,000 mph car (w/ Video)

Technology / Engineering

created 11 hours ago | popularity 4 / 5 (4) | comments 5

(PhysOrg.com) -- A British team hoping to be the first to get a car to 1,000 mph (1,610 km/h) has made its final design selection. The six-tonne car, known as the Bloodhound, will be powered by a Eurofighter ...


The logo of NBC studios in Burbank, California

Comcast bid for NBC Universal could be sealed next week: source

Technology / Business

created 1hour ago | popularity not rated yet | comments 0

Comcast's bid to buy a controlling stake in NBC Universal from General Electric could be sealed next week if GE reaches an agreement with Vivendi, a source close to the matter said Wednesday.