New search technique for images and videos has broad applications

November 10, 2009 By Daniel Strain New search technique for images and videos has broad applications

Enlarge

Using a single image as a template, computer software can find similar images in a large database of photos, as shown in these examples. Images courtesy of P. Milanfar.

(PhysOrg.com) -- Engineers at the University of California, Santa Cruz, have developed a powerful new approach to a fundamental problem in computer vision: how to program a computer to recognize or categorize what it "sees" in an image or video. Their software could change the way people search the Web for photos and videos, and it may have applications in many other areas as well, such as video surveillance and security systems.

Peyman Milanfar, a professor of electrical engineering in the Baskin School of Engineering at UCSC, and graduate student Hae Jong Seo were able to overcome a major drawback of existing methods for computer recognition of objects in images--the need for an extensive "training" phase using a large number of examples. With a single photograph or video clip as a template, their software can sift through thousands of images or videos to pull out the ones that look like the template.

"When you search Images, you type in a term and it gives you returns from pages that have that text in them. We want to be able to upload an image and use it as a model for finding similar images," Milanfar said.

Milanfar and Seo developed an algorithm that enables automated recognition of both objects in images and actions in videos. The software analyzes an image or short movie and characterizes the most important constituents of the object or action represented. It can then search for those constituents in image and video databases. The researchers presented their new methods at the IEEE International Conference on in September and in a recent paper published by the IEEE Transcripts on Pattern Analysis and Machine Intelligence.

"When it comes to recognizing things in the visual world, humans have some uncanny abilities which, at least until now, well exceed the limits of what could be done by computer," Milanfar said. "In particular, we have the capacity to recognize an object after having seen it only once."

Existing technology can search for and distinguish individual objects in a database of images only after running through a time-consuming training phase. "If you're looking for of bicycles, for instance, current algorithms have to be shown pictures of hundreds, if not thousands, of bicycles in order to be able to recognize a bicycle," Milanfar said.

With his new software, a single photo of a bicycle at night can be used as a template to locate pictures of bicycles in full sunlight, in the foreground or the background. It works under a wide range of image qualities and lighting discrepancies. The template image or the target image can be sharp or out-of-focus, clean or noisy. To Milanfar's software, a bicycle is a bicycle.

Similarly, a person riding a bicycle is a person riding a bicycle. Video of Lance Armstrong in the Tour de France can be used to find clips of men and women riding along an ordinary street.

But the potential applications for Milanfar's work go well beyond browsing for cyclists on YouTube. By using videos of aggressive behavior as templates, the technology could help surveillance systems learn to recognize potentially dangerous situations. If a man reached for a weapon on camera and that action matched a template of such behavior, surveillance software could alert a busy security guard.

A picture is a composite of thousands of pixels. Milanfar's software examines these pixels and their relation to one another. In other words, how similar is a central pixel to adjacent pixels in orientation, coloring, and shading? To find actions within videos, like a man riding a bicycle, Milanfar's software completes the same procedures but incorporates the manner in which those pixel relationships move over time.

The software analyzes the map of pixel relationships and determines the salient geometric features of the object or action. These components remain perceptually constant within an object regardless of image quality.

"The geometry of the bicycle is recognizable by the shape of the wheels and the way they are connected to the body, for example," Milanfar said. "We compute features from an image that are very stable. They are there even if we make the object bigger or smaller, change the background, or add noise."

Search engines can use this algorithm to detect similar patterns of pixel relationships in a whole database of photos. The calculates the statistical likelihood that a candidate image contains the queried object. If the template is a bicycle, the outcome consists of a series of photographs containing bicycles of all shapes and sizes, ranked in order of similarity.

"This has been an area of research that has entertained people for many years, but the big successes have been few and far between," Milanfar said. "Our work is showing state-of-the-art performance with an accuracy as good as or better than any algorithm out there."

Provided by University of California, Santa Cruz


print this article email this article download pdf blog this article bookmark this article     Stumble it Digg this share on Facebook retweet share on Reddit add to delicious
Rate this story - 5 /5 (9 votes)

Rank Filter

Move the slider to adjust rank threshold, so that you can hide some of the comments.


Display comments: newest first

  • GaryB - Nov 11, 2009
    • Rank: 5 / 5 (1)
    This sounds like (perhaps a generalization of?) the "self similar" feature in Shechtman and Irani's "Matching Local self-similarities across images and videos" in CVPR 2007.

    I don't see a discussion of how fast this can run? This may be a great feature detector, but is it intrinsically slow?
  • neoabraxas - Nov 12, 2009
    • Rank: not rated yet
    Arrgh! Another article without a link to the referenced paper! No physorg article should EVER get posted without a link to the paper being referenced. This is so annoying people.

    Otherwise, a very interesting line of research. I'm really curious how they did it.
  • saa - Nov 12, 2009
    • Rank: not rated yet
    I'm going to go right out and say what we're all thinking: this'll be a great tool for searching for the pornography we like.
  • Rennes6 - Nov 13, 2009
    • Rank: not rated yet
    Arrgh! Another article without a link to the referenced paper! No physorg article should EVER get posted without a link to the paper being referenced. This is so annoying people.


    The publication is here:
    http://users.soe....sion.pdf
  • victorm - Nov 13, 2009
    • Rank: not rated yet
    More information is here:
    http://users.soe....ion.html

November 10, 2009 all stories

Comments: 5

5 /5 (9 votes)
  • Stumble this up

  • Digg this

  • share this

  • hide
  • Related Stories

  • Research leads to improved human, object detection technology
    created Nov 03, 2009 | popularity not rated yet | comments 0
  • Human eye inspires advance in computer vision (w/Video)
    created Jun 18, 2009 | popularity not rated yet | comments 0
  • Seeing things: Researchers teach computers to recognize objects
    created Oct 13, 2009 | popularity not rated yet | comments 0
  • Extreme makeover: computer science edition
    created Nov 12, 2008 | popularity not rated yet | comments 0
  • Researchers develop new image-recognition software
    created May 21, 2008 | popularity not rated yet | comments 0



  • hide
  • Relevant PhysicsForums posts

  • Please help me with dsl problem!!!
    created Dec 04, 2009
  • iPhone or Blackberry?
    created Dec 04, 2009
  • Is there any TI-89 support forum?
    created Dec 03, 2009
  • How to solve complex number equations with a calculator?
    created Dec 03, 2009
  • New TI-89 problem
    created Dec 02, 2009
  • Buying a Wii - what do I want?
    created Nov 29, 2009
  • More from Physics Forums - Computing & Technology

Other News

Google QR codes to appear in a store window near you

Google QR codes to appear in a store window near you (w/ Video)

Technology / Internet

created 6 minutes ago | popularity not rated yet | comments 0

(PhysOrg.com) -- Google recently sent out 100,000 stickers to selected US businesses for use on their storefront windows. The stickers have the Google Maps logo and a QR code that can be scanned by smart phone ...


Television control for the remote

Technology / Telecom

created 1hour ago | popularity not rated yet | comments 0

(PhysOrg.com) -- A cheap way to deliver interactive communications to remote communities has been successfully tested in Brazil and Italy.


Facebook (and systems biologists) take note: Network analysis reveals true connections

Facebook (and Systems Biologists) Take Note: Network Analysis Reveals True Connections

Technology / Computer Sciences

created 14 hours ago | popularity 3.5 / 5 (6) | comments 3

(PhysOrg.com) -- Facebook figures out that you know Holly, although you haven't seen her in 10 years, because you have four mutual friends -- a good predictor of direct friendship. But sometimes Facebook gets ...


Google Chrome

Google Chrome extensions to be officially released

Technology / Internet

created 23 hours ago | popularity 4.1 / 5 (8) | comments 3

(PhysOrg.com) -- Google is expected to release its Extensions Gallery for general users of the new Chrome browser this week, possibly at the Add-On Conference on browser extensions to be held on December 11, ...


Rethinking artificial intelligence

Rethinking artificial intelligence: Researchers hope to produce 'co-processors' for the human mind

Technology / Computer Sciences

created 21 hours ago | popularity 4.6 / 5 (14) | comments 6

The field of artificial-intelligence research (AI), founded more than 50 years ago, seems to many researchers to have spent much of that time wandering in the wilderness, swapping hugely ambitious goals for ...