Teaching computers to recognise

June 1, 2009

(PhysOrg.com) -- Recognising objects and groups of objects is something we humans take for granted. For computers, this is far from straightforward. A European project has come up with novel solutions to this conundrum.

Imagine your friends have blindfolded you and taken you to a “secret location”. When they take off your blindfold, you immediately see a group of people around you and realise that they have thrown you a surprise birthday party. How did you know? Because everyone shouted “surprise”, and there were balloons, a birthday cake and booze.

The question may seem like a silly one, but the processes involved are far from straightforward. In fact, you had to collate an awful lot of visual, as well as other sensory data, cross-reference it with your memories, and make mental deductions.

“Vision is our most important sense and about half of the is involved with vision in one way or another,” explains Luc Van Gool of Belgium’s Leuven University (KUL) who also leads the Computer Vision Laboratory at the Swiss Federal Institute of Technology (ETH). “Enabling us to recognise the objects and places around us is a task it performs brilliantly.”

In fact, what we regard as the simple process of “recognition” would leave many computers stumped. Even something as apparently simple as recognising a birthday cake would normally require computers to be fed with information on what a cake generally looks like, the various shapes and sizes it comes in, the different forms and numbers of candles and other decorations you are likely to find adorning it, etc.

“The same object will look different depending on the viewpoint, the illumination, or the occlusions caused by other objects in front,” notes Van Gool.

Points of view

In brief, computers might be able to calculate pie to hundreds of decimal points and model complex , but they may find it impossible, without complex and painstaking programming, to recognise a human whose grown their hair or realise that Chihuahuas and Dobermans belong to the same species.

Van Gool is involved in a project, Cognitive-Level Annotation Using Latent Statistical Structure (CLASS), which is developing technologies to recognise visually specific objects, such as your car, or classes of object, such as a random car on the street.

“The recognition of an object as belonging to a particular group is a harder problem for a computer than the recognition of a specific object. The reason is that object classes show large variability among their members,” Van Gool points out.

The 3.5-year, EU-funded project managed to achieve technological improvements compared with previous efforts. It developed a system in which the description of the objects is based on the appearance of many separate, small patches. Such localised features give the necessary robustness to deal with the massive variations mentioned earlier. In addition, CLASS created special mechanisms - known as efficient approximate neighbourhood searches - for the comparison of an image or an object with huge numbers of reference images.

A picture speaks a thousand words

The specific object recognition technology developed by CLASS has already found a commercial application. Through a company known as kooaba, CLASS technology enables mobile phone subscribers who install the relevant software to take a photo with their handset of, say, a monument, a film poster, or an album cover and get relevant online information about it.

“It’s like the object itself becomes the link to further information,” observes Van Gool. He expects the application of this technology to expand rapidly. For instance, cities and museums may offer interactive guided tours or guide books through kooaba.

More information: http://class.inrialpes.fr/

Provided by ICT Results


   
Rate this story - not rated yet


June 1, 2009 all stories

Comments: 0

not rated yet

  • hide
  • Related Stories

  • Computer vision may not be as good as thought
    created Jan 25, 2008 | popularity not rated yet | comments 0
  • Robotic minds think alike?
    created Mar 27, 2008 | popularity not rated yet | comments 0
  • Scientist designs language development toy for autistic children
    created Feb 27, 2007 | popularity not rated yet | comments 0
  • Out of sight, out of mind? Not really
    created Aug 23, 2005 | popularity not rated yet | comments 0
  • Spacing, not size, matters in visual recognition, researchers find
    created Sep 25, 2008 | popularity not rated yet | comments 0



  • hide
  • Relevant PhysicsForums posts

  • Computer 5V or 0V output to Sensaphone Express II
    created Feb 04, 2010
  • Ti-89 ROM Image
    created Jan 29, 2010
  • TV ads
    created Jan 29, 2010
  • Apple introduces latest iNonsense
    created Jan 27, 2010
  • cheap scientific calculator that does matrix operations
    created Jan 27, 2010
  • Power consumption: Residential vs. Commercial
    created Jan 22, 2010
  • More from Physics Forums - Computing & Technology

Other News

Imec and Holst Centre achieve breakthrough in battery-less radios

Imec achieves breakthrough in battery-less radios

Technology / Semiconductors

created 42 minutes ago | popularity 5 / 5 (1) | comments 0

At today's International Solid State Circuit Conference, Imec and Holst Centre report a 2.4GHz/915MHz wake-up receiver which consumes only 51µW power. This record low power achievement opens the door to battery-less ...


'Revolutionary' water treatment units on their way to Afghanistan

Technology / Engineering

created 2 minutes ago | popularity not rated yet | comments 0

The United States Army has taken delivery of the first two units of a "revolutionary" waste-water treatment system that will clean putrid water within 24 hours and leave no toxic by-products, according to scientists at Sam ...


The power of 'random'

The power of 'random': 'Seemingly loopy' technique could dramatically improve communications networks

Technology / Computer Sciences

created 5 hours ago | popularity 5 / 5 (4) | comments 3 | with audio podcast

A radical new approach to the design of communications networks, called "network coding," promises to make Internet file sharing faster, streaming video more reliable, and cell-phone reception better -- among ...


GMail logo

Google adding status updates to Gmail

Technology / Internet

created 1hour ago | popularity not rated yet | comments 0

Google plans to make it make it easier for users of Gmail to view online status updates from friends in a swipe at Twitter and Facebook, The Wall Street Journal reported on Tuesday.


Android

Google developing a translator for smartphones

Technology / Software

created 6 hours ago | popularity 4.7 / 5 (6) | comments 1 | with audio podcast report

(PhysOrg.com) -- Google is developing a translator for its Android smartphones that aims to almost instantly translate from one spoken language to another during phone calls.