Can networked human computation solve computer language comprehension?

January 26, 2009

Researchers at the University of Essex hope to answer this question by getting more volunteers to take part in their online game, Phrase Detectives.

Jon Chamberlain, from Essex's School of Computer Science and Electronic Engineering, explains: ‘Human language is not an unconnected series of words, phrases and sentences but a series of people, objects and ideas that refer to each other in different ways. The complexity of language makes it sound "natural" to a reader but it can be difficult to define the rules that allow us to understand it.

‘Consider the statement: "Mary is a teacher who is 25 years old. She lives in England." A human reader can easily ascertain facts about Mary's occupation, age and residence by, for example, knowing that the word "she" refers to the person "Mary". However, comprehending this type of language referencing is a challenge facing programmers when designing computer systems that try to understand text, such as search, translation and summarisation systems.’

This is where the work of those playing Phrase Detectives becomes important. The game, part of a larger project called AnaWiki, is an attempt to address the bottleneck in creating annotated linguistic resources. By initially investigating anaphoric references (as in the example above) the project aims to develop a resource larger than anything currently available.

Players (or detectives) register at: http://www.phrasedetectives.org and read through texts, making annotations to highlight relationships between words and phrases. They may be asked to 'name the culprit', so will be given a word or phrase and must look for it appearing earlier in the text. For example: 'Sherlink Holmes went to the shop. He got some tobacco for his pipe.' The word ‘he’ refers to 'Sherlink Holmes'.

Jon added: ‘Players of the game are helping to create a resource that is rich in linguistic information and improves future technology. This project aims to collect a significant amount of data and investigate the possibility of using mass collaboration to train computer systems.

‘The best way to understand a language is to have lots of examples where the meaning has been clarified. Unfortunately creating this type is resource is both time consuming and expensive but the new approach offered by Phrase Detective should address this resource shortage. The same methodology could also be used to create resources for machine translation, semantics and other linguistic phenomenon.’

So far, players have made over 40,000 annotations in four weeks. However, the researchers hope more will join as detectives and that people will add new text to the site for analysis.

Phrase Detectives can be defined as part of a genre of “games with a purpose” (GWAP) that collect data on images, texts and music. The crucial element of these games is that players receive points for agreeing with each other. They are motivated to collaborate with their partners in order to score maximum points. This ensures that players are attempting to provide good quality information, as this will result in the most agreement.

The Essex researchers believe Phrase Detectives is the first attempt to collect linguistic judgements using a fun, collaborative online game. They aim to make the tasks and the texts interesting so it feels more like a computer game than a linguistic task. The data collected can then be used to improve computer systems that try to understand text. For example, it could help search engines find information more relevant to your searches.

So, can networked human computation really solve complex language comprehension tasks on computers? Initial results from the beta version of the game look promising and more detailed analysis will completed in early 2009.

Source: University of Essex


print this article email this article download pdf blog this article bookmark this article     Stumble it Digg this share on Facebook retweet share on Reddit add to delicious
Rate this story - 4.3 /5 (3 votes)

Rank Filter

Move the slider to adjust rank threshold, so that you can hide some of the comments.


Display comments: newest first

  • gmurphy - Jan 26, 2009
    • Rank: not rated yet
    wow, this is really cool, a solid dataset for training text comprehension algorithms.

January 26, 2009 all stories

Comments: 1

4.3 /5 (3 votes)
  • Stumble this up

  • Digg this

  • share this

  • hide
  • Related Stories

  • Studying ancient man to learn to prevent disease
    created Sep 15, 2009 | popularity not rated yet | comments 0
  • Review: `Madden NFL 10' is franchise's best yet
    created Aug 13, 2009 | popularity not rated yet | comments 0
  • Future tech on show at 36th SIGGRAPH
    created Aug 03, 2009 | popularity not rated yet | comments 0
  • Going, going green
    created Jun 22, 2009 | popularity not rated yet | comments 0
  • Microsoft Gets Patent for Patently Offensive Audio Content
    created Oct 28, 2008 | popularity not rated yet | comments 0



  • hide
  • Relevant PhysicsForums posts

  • casio calculator that's similar to TI-89
    created Nov 08, 2009
  • Advice on what cell phone to get
    created Nov 08, 2009
  • Changing the language options on your phone.
    created Nov 03, 2009
  • HP strange RPN operation???
    created Nov 02, 2009
  • More from Physics Forums - Computing & Technology

Other News

Google digital book ambitions hinge on settlement (AP)

Google makes concessions on digital book deal (Update)

Technology / Internet

created 18 hours ago | popularity 5 / 5 (1) | comments 3

(AP) -- Google Inc. will loosen its control over millions of copyright-protected books that will be added to its digital library if a federal judge approves a revised legal settlement addressing the earlier ...


Aircraft that can see for themselves

Aircraft that can see for themselves (w/ Video)

Technology / Engineering

created 17 hours ago | popularity 4.6 / 5 (9) | comments 0

(PhysOrg.com) -- Australian researchers have made two important advances in the development of unmanned aircraft capable of seeing for themselves as they fly fast and low over dangerous terrain.


Road trains may be coming soon to Europe

Road trains may be coming soon to Europe (w/ Video)

Technology / Engineering

created Nov 13, 2009 | popularity 4.8 / 5 (11) | comments 17

(PhysOrg.com) -- Road trains linking vehicles together in a traveling convoy are planned for Europe. With only the lead vehicle being actively driven, the road trains would allow commuters to sleep, read a ...


A system of space solar power system (SSPS)

Japan eyes solar station in space as new energy source

Technology / Energy

created Nov 08, 2009 | popularity 4.8 / 5 (21) | comments 28

It may sound like a sci-fi vision, but Japan's space agency is dead serious: by 2030 it wants to collect solar power in space and zap it down to Earth, using laser beams or microwaves.


The collection and storage and retention of the household data makes it vulnerable to security breaches

New 'smart' electrical meters raise privacy issues

Technology / Energy

created Nov 06, 2009 | popularity 4.3 / 5 (11) | comments 12

The new "smart meters" utilities are installing in homes around the world to reduce energy use raise fresh privacy issues because of the wealth of information about consumer habits they reveal, experts said ...