Researchers Give Computers Common Sense

October 17, 2007 Researchers Give Computers Common Sense

Enlarge

The computer scientists injected context into an automated image labeling system through a post-processing context check. The approach strives to maximize the contextual agreement among the labeled objects within each picture.

Using a little-known Google Labs widget, computer scientists from UC San Diego and UCLA have brought common sense to an automated image labeling system. The common sense comes as the ability to use context to help identify objects in photographs.

For example, if a conventional automated object identifier has labeled a person, a tennis racket, a tennis court and a lemon in a photo, the new post-processing context check will re-label the lemon as a tennis ball.

“We think our paper is the first to bring external semantic context to the problem of object recognition,” said computer science professor Serge Belongie from UC San Diego.

The researchers show that the Google Labs tool called Google Sets can be used to provide external contextual information to automated object identifiers. The paper will be presented on Thursday 18 October 2007 at ICCV 2007 – the 11th IEEE International Conference on Computer Vision in Rio de Janeiro, Belongie.

Google Sets generates lists of related items or objects from just a few examples. If you type in John, Paul and George, it will return the words Ringo, Beatles and John Lennon. If you type “neon” and “argon” it will give you the rest of the noble gasses.

“In some ways, Google Sets is a proxy for common sense. In our paper, we showed that you can use this common sense to provide contextual information that improves the accuracy of automated image labeling systems,” said Belongie.

The image labeling system is a three step process. First, an automated system splits the image up into different regions through the process of image segmentation. In the photo above, image segmentation separates the person, the court, the racket and the yellow sphere.

Next, an automated system provides a ranked list of probable labels for each of these image regions.

Finally, the system adds a dose of context by processing all the different possible combinations of labels within the image and maximizing the contextual agreement among the labeled objects within each picture.

It is during this step that Google Sets can be used as a source of context that helps the system turn a lemon into a tennis ball. In this case, these “semantic context constraints” helped the system disambiguate between visually similar objects.

In another example, the researchers show that an object originally labeled as a cow is (correctly) re-labeled as a boat when the other objects in the image – sky, tree, building and water – are considered during the post-processing context step. In this case, the semantic context constraints helped to correct an entirely wrong image label. The context information came from co-occurence object information from the training data rather than from Google Sets.

The computer scientists also highlight other advances they bring to automated object identification. First, instead of doing just one image segmentation, the researchers generated a collection of image segmentations and put together a shortlist of stable image segmentations. This increases the accuracy of the segmentation process and provides an implicit shape description for each of the image regions.

Second, the researchers ran their object categorization model on each of the segmentations, rather than on individual pixels. This dramatically reduced the computational demands on the object categorization model.

In addition to Google Sets, the researchers gleaned semantic context information from the co-occurrence of object labels in the training sets.

In the two sets of images that the researchers tested, the categorization results improved considerably with inclusion of context. For one image dataset, the average categorization accuracy increased more than 10 percent using the semantic context provided by Google Sets. In a second dataset, the average categorization accuracy improved by about 2 percent using the semantic context provided by Google Sets. The improvements were higher when the researchers gleaned context information from data on co-occurrence of object labels in the training data set for the object identifier.

Right now, the researchers are exploring ways to extend context beyond the presence of objects in the same image. For example, they want to make explicit use of absolute and relative geometric relationships between objects in an image – such as “above” or “inside” relationships. This would mean that if a person were sitting on top of an animal, the system would consider the animal to be more likely a horse than a dog.

Source: University of California, San Diego


print this article email this article download pdf blog this article bookmark this article     Stumble it Digg this share on Facebook retweet share on Reddit add to delicious
Rate this story - 4.5 /5 (19 votes)

Rank Filter

Move the slider to adjust rank threshold, so that you can hide some of the comments.


Display comments: newest first

  • Quantum_Conundrum - Oct 17, 2007
    • Rank: 3.3 / 5 (3)
    This wont be succesful approach. Real life images do not always follow cookie-cutter outlines for context. In real life situations, this software will create about as many errors as it prevents due to prejudices in the developers definition of context.
  • earls - Oct 17, 2007
    • Rank: 3.3 / 5 (3)
    Pattern Recognition = the future of AI. That's the only thing that seperates us from computers, the ability to rapidly analyze and identify patterns.
  • alexxx - Oct 18, 2007
    • Rank: 2.7 / 5 (3)
    Google Image Labeler: http://images.goo...labeler/
  • fleem - Oct 18, 2007
    • Rank: 4 / 5 (3)
    Yes yes yes this is all well and good. But it STILL does not answer the question of why that guy is playing tennis with a lemon.

October 17, 2007 all stories

Comments: 4

4.5 /5 (19 votes)
  • Stumble this up

  • Digg this

  • share this

  • hide
  • Related Stories

  • New technique that scrambles light may lead to sharper images, wider views
    created Apr 21, 2009 | popularity not rated yet | comments 0
  • 'Smart' surveillance system may tag suspicious or lost people
    created Dec 17, 2008 | popularity not rated yet | comments 0
  • In game of tennis, seeing isn't always believing
    created Oct 27, 2008 | popularity not rated yet | comments 0
  • IBM Research Develops Technology to Aid Human Memory
    created Jul 29, 2008 | popularity not rated yet | comments 0
  • Robotic minds think alike?
    created Mar 27, 2008 | popularity not rated yet | comments 0



  • hide
  • Relevant PhysicsForums posts

  • casio calculator that's similar to TI-89
    created 14 hours ago
  • Mathematica Question: Finding local maximums
    created 17 hours ago
  • Advice on what cell phone to get
    created 18 hours ago
  • Read multiple binary files to ascii
    created Nov 07, 2009
  • Engineering Translation software
    created Nov 06, 2009
  • Changing the language options on your phone.
    created Nov 03, 2009
  • More from Physics Forums - Computing & Technology

Other News

What computer science can teach economics

What computer science can teach economics

Technology / Computer Sciences

created 2 hours ago | popularity not rated yet | comments 0

(PhysOrg.com) -- Computer scientists have spent decades developing techniques for answering a single question: How long does a given calculation take to perform? Constantinos Daskalakis, an assistant professor ...


Framed for child porn -- by a PC virus

Framed for child porn -- by a PC virus

Technology / Internet

created 22 hours ago | popularity 5 / 5 (6) | comments 3

(AP) -- Of all the sinister things that Internet viruses do, this might be the worst: They can make you an unsuspecting collector of child pornography.


Eco-friendly building techniques don't have to significantly raise construction costs

Technology / Energy

created 3 hours ago | popularity 4.5 / 5 (2) | comments 1

Home builder Lance Schmidt hears it all the time: Green building costs more. But he and his colleagues are out to prove otherwise.


A system of space solar power system (SSPS)

Japan eyes solar station in space as new energy source

Technology / Energy

created Nov 08, 2009 | popularity 4.7 / 5 (16) | comments 23

It may sound like a sci-fi vision, but Japan's space agency is dead serious: by 2030 it wants to collect solar power in space and zap it down to Earth, using laser beams or microwaves.


Dartmouth professor finds that iconic Oswald photo was not faked

Professor finds that iconic Oswald photo was not faked (w/ Video)

Technology / Computer Sciences

created Nov 05, 2009 | popularity 3.8 / 5 (9) | comments 39

(PhysOrg.com) -- Dartmouth Computer Scientist Hany Farid has new evidence regarding a photograph of accused John F. Kennedy assassin Lee Harvey Oswald. Farid, a pioneer in the field of digital forensics, digitally ...