Researchers Teach Computers to Search for Photos Based on Their Contents

October 8th, 2008 Researchers Teach Computers to Search for Photos Based on Their Contents

Enlarge

ALIPR assigned the following keywords to this photo of Biscayne Bay in Miami, Florida: landscape, lake, mountain, ocean, building, grass, water, ice, glacier, historical, house, rock, man-made, train, and tree. Credit: Penn State

A pair of Penn State researchers has developed a statistical approach, called Automatic Linguistic Indexing of Pictures in Real-Time (ALIPR), that one day could make it easier to search the Internet for photographs. The public can participate in improving ALIPR's accuracy by visiting a designated Web site (http://www.alipr.com), uploading photographs, and evaluating whether the keywords that ALIPR uses to describe the photographs are appropriate.

ALIPR works by teaching computers to recognize the contents of photographs, such as buildings, people, or landscapes, rather than by searching for keywords in the surrounding text, as is done with most current image-retrieval systems. The team recently received a patent for an earlier version of the approach, called ALIP, and is in the process of obtaining another patent for the more sophisticated ALIPR. They hope that eventually ALIPR can be used in industry for automatic tagging or as part of Internet search engines.

"Our basic approach is to take a large number of photos -- we started with 60,000 photos -- and to manually tag them with a variety of keywords that describe their contents. For example, we might select 100 photos of national parks and tag them with the following keywords: national park, landscape, and tree," said Jia Li, an associate professor of statistics at Penn State. "We then would build a statistical model to teach the computer to recognize patterns in color and texture among these 100 photos and to assign our keywords to new photos that seem to contain national parks, landscapes, and/or trees. Eventually, we hope to reverse the process so that a person can use the keywords to search the Web for relevant images."

Li said that most current image-retrieval systems search for keywords in the text associated with the photo or in the name that was given to the photo. This technique, however, often misses appropriate photos and retrieves inappropriate photos. Li's new technique allows her to train computers to recognize the semantics of images based on pixel information alone.

Li, who developed ALIPR with her colleague James Wang, a Penn State associate professor of information sciences and technology, said that their approach appropriately assigns to photos at least one keyword among seven possible keywords about 90 percent of the time. But, she added, the accuracy rate really depends on the evaluator. "It depends on how specific the evaluator expects the approach to be," she said. "For example, ALIPR often distinguishes people from animals, but rarely distinguishes children from adults."

Although the team's goal is to improve ALIPR's accuracy, Li said she does not believe the approach ever will be 100-percent accurate. "There are so many images out there and so many variations on the images' contents that I don't think it will be possible for ALIPR to be 100-percent accurate," she said. "ALIPR works by recognizing patterns in color and texture. For example, if a cat in a photo is wearing a red coat, the red coat may lead ALIPR to tag the photo with words that are irrelevant to the cat. There is just too much variability out there." Li currently is pursuing some new ideas that may help her to achieve better recognition of image semantics.

Provided by Penn State University


print this article email this article download pdf blog this article bookmark this article     Digg this Stumble it share on Facebook share on Reddit add to delicious save to Yahoo! bookmarks
3.5/5 after 4 votes


October 8th, 2008 all stories
Technology / Computer Sciences

Comments: 0
Rank: 3.5/5 after 4 votes

  • Stumble this up

  • Digg this

  • Share it:
  • share on Facebook
  • share on MySpace
  • share on Slashdot
  • rss-newsfeed
  • share on Google
  • share on Reddit
  • add to delicious
  • save to Yahoo! bookmarks
  • share on Windows Live
  • Add to Mixx!
Rating: 3.5/5 after 4 votes

  • Related Stories

  • Online system rates images by aesthetic quality
    created May 05, 2009 | popularity not rated yet | comments 0
  • Researchers teach computers how to name images by 'thinking'
    created Nov 01, 2006 | popularity not rated yet | comments 0
  • New software advances photo search and management in online systems
    created Oct 15, 2007 | popularity not rated yet | comments 0


  • Physicists Demonstrate Quantum Memory with Matter Qubits
    Physicists Demonstrate Quantum Memory with Matter Qubits
    Physics / General Physics
    created Jul 03, 2009 | popularity 4.4 / 5 (17) | comments 1
  • 'Holey' Nanosheets for Wastewater Dye Removal
    Nanotechnology / Nanomaterials
    created Jul 01, 2009 | popularity 5 / 5 (5) | comments 1
  • Jellyfish Robot Swims Like its Biological Counterpart
    Jellyfish Robot Swims Like its Biological Counterpart
    Electronics / Robotics
    created Jun 26, 2009 | popularity 4.4 / 5 (8) | comments 1
  • Could Maxwell's Demon Exist in Nanoscale Systems?
    Could Maxwell's Demon Exist in Nanoscale Systems?
    Physics / General Physics
    created Jun 24, 2009 | popularity 4.4 / 5 (18) | comments 29
  • Living Safely with Robots, Beyond Asimov's Laws
    Living Safely with Robots, Beyond Asimov's Laws
    Electronics / Robotics
    created Jun 22, 2009 | popularity 4.6 / 5 (52) | comments 40
  • Other News

    DoCoMo invests $45.5M in US mobile video firm

    Technology / Business

    created 1hour ago | popularity not rated yet | comments 0

    (AP) -- NTT DoCoMo, Japan's largest mobile phone operator, said Monday it spent $45.5 million to take a 35 percent share in a U.S. company that makes multimedia technology for its mobile phones.


    HTC Touch

    Taiwan's HTC earnings edge down in Q2

    Technology / Business

    created 2 hours ago | popularity not rated yet | comments 0

    HTC Corp, Taiwan's leading smartphone maker, said Monday its net profit in the second quarter was down almost two percent from a year earlier.


    Samsung announces earnings estimate (AP)

    Samsung announces earnings estimate

    Technology / Business

    created 2 hours ago | popularity not rated yet | comments 0

    (AP) -- Samsung Electronics Co., the world's biggest manufacturer of memory chips, announced quarterly earnings estimates for the first time Monday, saying it hopes to reduce market confusion and speculation ...


    Andreessen making leap from entrepreneur to VC

    Technology / Business

    created 4 hours ago | popularity not rated yet | comments 0

    (AP) -- Having built and sold two technology startups for a combined $11.7 billion, Marc Andreessen is ready to take a stab at, well, finding the next Marc Andreessen.


    Japan demands 119 million dlrs in tax from Amazon: report

    Technology / Business

    created 23 hours ago | popularity 3.6 / 5 (5) | comments 1

    Japanese authorities told a sales affiliate of US retail giant Amazon.com to pay about 119 million dollars in tax for unreported income over a three-year period, a newspaper said Sunday.