Computer vision may not be as good as thought
January 25, 2008
The human brain easily recognizes that these cars are all the same object, but the variations in the car's size, orientation and position are a challenge for computer-vision algorithms. Image / Nicolas Pinto
For years, scientists have been trying to teach computers how to see like humans, and recent research has seemed to show computers making progress in recognizing visual objects. A new MIT study, however, cautions that this apparent success may be misleading because the tests being used are inadvertently stacked in favor of computers.
Computer vision is important for applications ranging from “intelligent” cars to visual prosthetics for the blind. Recent computational models show apparently impressive progress, boasting 60-percent success rates in classifying natural photographic image sets. These include the widely used Caltech101 database, intended to test computer vision algorithms against the variety of images seen in the real world.
However, James DiCarlo, a neuroscientist in the McGovern Institute for Brain Research at MIT, graduate student Nicolas Pinto and David Cox of the Rowland Harvard Institute argue that these image sets have design flaws that enable computers to succeed where they would fail with more authentically varied images. For example, photographers tend to center objects in a frame and to prefer certain views and contexts. The visual system, by contrast, encounters objects in a much broader range of conditions.
“The ease with which we recognize visual objects belies the computational difficulty of this feat,” explains DiCarlo, senior author of the study in the online Jan. 25 PLoS Computational Biology. “The core challenge is image variation. Any given object can cast innumerable images onto the retina depending on its position, distance, orientation, lighting and background.”
The team exposed the flaws in current tests of computer object recognition by using a simple “toy” computer model inspired by the earliest steps in the brain's visual pathway. Artificial neurons with properties resembling those in the brain's primary visual cortex analyze each point in the image and capture low-level information about the position and orientation of line boundaries. The model lacks the more sophisticated analysis that happens in later stages of visual processing to extract information about higher-level features of the visual scene such as shapes, surfaces or spaces between objects.
The researchers intended this model as a straw man, expecting it to fail as a way to establish a baseline. When they tested it on the Caltech101 images, however, the model did surprisingly well, with performance similar or better than five state-of-the-art object-recognition systems.
How could that be? “We suspected that the supposedly natural images in current computer vision tests do not really engage the central problem of variability, and that our intuitions about what makes objects hard or easy to recognize are incorrect,” Pinto explains.
To test this idea, the authors designed a more carefully controlled test. Using just two categories-planes and cars-they introduced variations in position, size and orientation that better reflect the range of variation in the real world.
“With only two types of objects to distinguish, this test should have been easier for the 'toy' computer model, but it proved harder,” Cox says. The team's conclusion: “Our model did well on the Caltech101 image set not because it is a good model but because the 'natural' images fail to adequately capture real-world variability.”
As a result, the researchers argue for revamping the current standards and images used by the computer-vision community to compare models and measure progress. Before computers can approach the performance of the human brain, they say, scientists must better understand why the task of object recognition is so difficult and the brain's abilities are so impressive.
Source: Massachusetts Institute of Technology
-
Scripps Research alumnus wins International Science and Engineering Visualization Challenge
Feb 02, 2012 |
5 / 5 (1) |
0
-
New insights into how the brain reconstructs the third dimension
Dec 07, 2011 |
5 / 5 (6) |
0
-
How our brains keep us focused
Dec 07, 2011 |
4 / 5 (2) |
0
-
Hasson brings real life into the lab to examine cognitive processing
Dec 06, 2011 |
1 / 5 (1) |
0
-
Harvard group takes complexity out of video face replacement (w/ video)
Dec 05, 2011 |
4.8 / 5 (5) |
3
-
Engineers build first sub-10-nm carbon nanotube transistor
Feb 01, 2012 |
4.9 / 5 (33) |
30
-
Something old, something new: Evolution and the structural divergence of duplicate genes
Jan 31, 2012 |
4.6 / 5 (7) |
1
-
The hidden nanoworld of ice crystals: Revealing the dynamic behavior of quasi-liquid layers
Jan 30, 2012 |
5 / 5 (4) |
1
-
Stock market network reveals investor clustering
Jan 27, 2012 |
3.9 / 5 (23) |
8
-
Of microchemistry and molecules: Electronic microfluidic device synthesizes biocompatible probes
Jan 26, 2012 |
5 / 5 (2) |
0
-
How to tilt a object
7 hours ago
-
How to calculate total compressibility in liquid porous solid system
13 hours ago
-
Need help reading 3-D
Feb 11, 2012
-
A way to send and receive wireless data
Feb 11, 2012
-
Calling function with no input argument
Feb 10, 2012
-
Force free body diagram problem on gym equipment
Feb 10, 2012
- More from Physics Forums - General Engineering
More news stories
Japan's Fukushima reactor may be reheating: operator
Temperature readings at one of the crippled Fukushima nuclear reactors have risen above Japan's stringent new safety standard but there was no immediate danger, its operator said Sunday.
Technology / Energy & Green Tech
24 minutes ago |
1 / 5 (1) |
0
Google might launch Drive for cloud storage soon
(PhysOrg.com) -- Google's next big move, according to the Wall Street Journal, is a cloud storage service called Drive. Hardly first to the plate, Google is simply catching up to introducing its cloud reposi ...
Iran blocks email, restricts net access: reports
Iran has further restricted access to the Internet and blocked popular email services for the past few days, in a move a top lawmaker said could "cost the regime dearly," media reports said on Sunday.
14 hours ago |
5 / 5 (2) |
5
Walney offshore wind farm is world's biggest (for now)
(PhysOrg.com) -- The Walney wind farm on the Irish Sea--characterized by high tides, waves and windy weather--officially opened this week. The farm is treated in the press as a very big deal as the Walney ...
Navy to begin tests on electromagnetic railgun prototype launcher
The Office of Naval Research (ONR)'s Electromagnetic (EM) Railgun program will take an important step forward in the coming weeks when the first industry railgun prototype launcher is tested at a facility ...
Feb 06, 2012 |
4.6 / 5 (21) |
95
|
Botox developer rues missing out on billions
Botox developer Alan Scott says he rues the day he handed over rights to the best-selling wrinkle-smoothing drug to a US company for just $4.5 million, saying he might have become a billionaire.
Australian women reject 'I love u' texts
Australian women may have embraced the digital era, but they prefer a face-to-face declaration of affection to an "I love u" text and find men addicted to their mobile phones a major turnoff.
Scientists discover molecular secrets of 2,000-year-old Chinese herbal remedy
For roughly two thousand years, Chinese herbalists have treated Malaria using a root extract, commonly known as Chang Shan, from a type of hydrangea that grows in Tibet and Nepal. More recent studies suggest that halofuginone, ...
New method to examine batteries -- MRI from the inside
There is an ever-increasing need for advanced batteries for portable electronics, such as phones, cameras, and music players, but also to power electric vehicles and to facilitate the distribution and storage of energy derived ...
A mitosis mystery solved: How chromosomes align perfectly in a dividing cell
Although the process of mitotic cell division has been studied intensely for more than 50 years, Whitehead Institute researchers have only now solved the mystery of how cells correctly align their chromosomes during symmetric ...
Lab study raises questions over nano-particle impact
Tests involving chickens have raised questions about the impact on health from engineered nano-particles, the ultra-fine grains commonly used in drugs and processed foods, scientists said on Sunday.