Listen, watch, read -- computers search for meaning

October 30, 2009

(PhysOrg.com) -- European researchers have created the first semantic search platform that integrates text, video and audio. The system can 'watch' films, 'listen' to audio and 'read' text to find relevant responses to semantic search terms. At last, computers are able to look for meaning in our multimedia searches.

There is a phenomenal amount of content out there on the internet, but therein lies a problem. Sure, text content can be skimmed or glanced at, but audiovisual content has to be viewed in linear time. Searching inside a film or audio recording for relevant information is very complex.

But European researchers in the MESH project have developed an integrated platform which they say can, for the first time, combine semantic search - or search by the meaning of the words - with a host of associated tools to deliver more relevant information from a wide variety of sources that an individual user can access.

The platform can search annotated files from any type of media - photographs, videos, sound recordings, text, document scans - using a host of techniques including optical character recognition, automated speech recognition and automatic annotation of movies and photographs that track salient concepts.

Technology shift

This represents an emerging paradigm shift in search technology.

Here is why. Right now, text in computing is represented as a series of numbers, most commonly as defined by the Unicode standard. Each number signifies a particular character, and computers can scan these codes very quickly. But when you enter a search term, the machine has no idea what those letters signify. It simply looks for the pattern - it has no inkling of the concept behind the pattern.
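
As a rough illustration of that point, the short Python sketch below shows a search term reduced to its Unicode numbers and a plain keyword search that only compares character patterns. The sample headlines and the function name `keyword_search` are invented for this example and have nothing to do with the MESH platform itself.

```python
# Each character of a search term is stored as a number (its Unicode code point).
term = "flood"
code_points = [ord(ch) for ch in term]

def keyword_search(term, documents):
    """Return the documents that contain the term as a literal character pattern.

    The comparison is case-insensitive, but it knows nothing about meaning.
    """
    needle = term.lower()
    return [doc for doc in documents if needle in doc.lower()]

# Invented sample headlines, for illustration only.
documents = [
    "Flood waters rise in the valley",
    "Heavy rain leaves streets under water",
    "Forest fire spreads near the coast",
]

matches = keyword_search(term, documents)
print(code_points)
print(matches)
```

The second headline describes much the same event as the first, but it is not returned because the letters f-l-o-o-d never appear in it.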

But in semantic search, every bit of information is defined by potentially dozens of meaningful concepts. When a copywriter invoices for his or her work, for example, the date could be defined in terms of calendar, invoice, billing period, and so on. All these definitions for one piece of information are called ‘metadata’, or information about information.

Collections of agreed metadata terms for a particular field or task, like medicine or accounting, are called ontologies.

So the computer not only searches for the term, it searches for related metadata that defines types of information in specific ways. In reality, the computer still does not ‘understand’ a concept in its semantic search - it continues to look for patterns of letters. But because the concepts behind the search terms are included, it can return results based on concepts as well as text patterns.
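
The idea can be sketched in a few lines of Python. This is a toy model built on assumptions, not the MESH implementation: the miniature ontology, the annotated items and the function names (`concepts_for`, `semantic_search`) are all hypothetical. Each item carries concept metadata alongside its text, and the search matches on either one.

```python
# Hypothetical toy ontology: each concept maps to terms that express it.
ONTOLOGY = {
    "flood": {"flood", "inundation", "overflow"},
    "earthquake": {"earthquake", "tremor", "quake"},
}

# Hypothetical annotated media items: text plus concept metadata.
ITEMS = [
    {"id": "video-1", "text": "River overflow closes roads", "concepts": {"flood"}},
    {"id": "audio-1", "text": "Tremor felt across the city", "concepts": {"earthquake"}},
    {"id": "text-1", "text": "Flood warning issued", "concepts": set()},
]

def concepts_for(term):
    """Look up which ontology concepts a search term belongs to."""
    term = term.lower()
    return {concept for concept, terms in ONTOLOGY.items() if term in terms}

def semantic_search(term, items):
    """Return ids of items that match the term by text pattern or by concept metadata."""
    wanted = concepts_for(term)
    needle = term.lower()
    hits = []
    for item in items:
        text_match = needle in item["text"].lower()
        concept_match = bool(wanted & item["concepts"])
        if text_match or concept_match:
            hits.append(item["id"])
    return hits

print(semantic_search("flood", ITEMS))
print(semantic_search("quake", ITEMS))
```

Searching for "flood" returns the video as well as the text item, even though the word never appears in the video's description, because the video is annotated with the flood concept. Note that the program is still only comparing strings; the apparent grasp of meaning comes entirely from the metadata.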

Imminent domains

These technologies are becoming common in particular knowledge domains, and more are emerging every day, but most relate to the concepts behind text-based documents. The MESH platform sought to use semantic search for every type of media.

On the way, it created some cutting-edge technology. “Our automatic annotation for video, for example, is state of the art,” explains Pedro Concejero, coordinator of the MESH project.

“The annotation system is capable of identifying the general scene setting, such as whether a video is a studio shot or a shot recorded on location. With adequate training, it can also detect (within some error margins) the general topic of the video, such as a scene about an earthquake or a flood. It can also find a number of salient objects within the scene, such as persons or fire, but cannot yet identify consistently objects with great variations in shape or aspect.”

One of the major challenges of the project was a product of its own success: It annotated too much information!

“This is good - it is what we wanted the system to do - but the quantity of data was vast, too much to handle, so we had to find ways to cut down on the amount of metadata,” Concejero tells ICT Results.

Manual override

So the project developed a manual annotation tool that can, with a little training, be used by non-technical people. “It is a very powerful, very advanced professional program. There are other manual annotation tools available commercially, but we have developed a strong and user-friendly program that could probably compete very successfully with what is currently available.”

For the project, the platform was developed to search video news sources relating to civil unrest and street violence, and natural disasters like earthquakes, forest fires and floods.

“We had to focus the demonstrator because there is a lot of work involved in developing ontologies for specific news topics. You would need to develop a very detailed ontology for politics, or crime and so on. We have designed the system so that it can accept ontologies from elsewhere, but for the demonstrator we reserved our work to these two domains,” says Concejero.

The beginning of the end?

The technology will not be challenging the industry-leading search engines any time soon. This project does not necessarily mark the end of the type of keyword-based search that we use every day.

But it could well be the beginning of the end. In the meantime, the work of the MESH project will find a happy home in a number of stand-alone commercial applications, and work on developing new applications will, in one way or another, continue.

More information: MESH project

This is part one of a two-part special feature on the MESH project.

Provided by ICT Results



  • RayCherry - Oct 30, 2009
    Language/text-dependent concept identification still presents barriers that will limit its use to a specific culture (English-speaking).

    When the concepts themselves achieve independence from language/text, then the system will be able to catalogue and cross-reference multi-cultural concepts, providing a global 'map' of strongly and weakly linked concepts that underlie the languages used to express/communicate them.

    Eliminating the language barrier within the machines will help the users to do the same.
