Bringing down the language barrier... automatically

May 2nd, 2008

Progress being made by European researchers on automatic speech-to-speech translation technology could help the EU tackle one of the biggest remaining boundaries to internal trade, mobility and the free exchange of information – language.

With 23 official languages, European institutions spend more than a billion euros a year translating documents and interpreting speeches. Companies trading across the EU’s internal borders spend millions more just to understand their business partners.

The situation, unparalleled anywhere else in the world, makes Europe a natural market for automatic translation technology, and, logically, a leader in the development of systems that can help speakers of different languages communicate.

“There is an evident need for this sort of technology in Europe and elsewhere in the world… it saves time and costs over human translation,” explains Marcello Federico, a researcher at FBK-irst in Trento, Italy.

But no one has been able to develop an automatic translation system that comes anywhere close to the capabilities of a human translator or interpreter. Internet translations are a case in point, littered with punctuation errors, misplaced words and grammatical mistakes that can make them almost unintelligible.

Other systems can only translate certain predefined words and phrases, so-called ‘constrained speech’ that suffices for a tourist booking a hotel or checking flight times but is next to useless if you want to understand a news bulletin.

Federico led a team that sought to achieve something far more ambitious. Working in the EU-funded TC-STAR project they tackled what is perhaps the biggest human language technology challenge of all: taking speech in one language and outputting spoken words in another.

First in speech-to-speech translation

“For humans, translation is difficult. We have to master both the source language and the target language, and machine translation is significantly more difficult than that,” Federico notes. “To our knowledge, TC-STAR has been the first project in the world addressing unrestricted speech-to-speech translation.”

For such a system to be able to translate any speech regardless of topic and context, three technologies are used, all of which are still far from perfect. Automatic Speech Recognition (ASR) is used to transcribe spoken words to text. Spoken Language Translation (SLT) translates the source language to the target language. Text to Speech (TTS) synthesises the spoken output.

The TC-STAR research partners developed components to handle each of those tasks, creating a platform that has brought the state of the art of translation technology a step closer to matching the performance of human translators.

One of their key innovations was to combine the output of several ASR and SLT systems in order to make the transcription and translation phases considerably more accurate than comparable systems.

Based on the BLEU (Bilingual Evaluation Understudy) method, a way of comparing machine and human translations, evaluations of the quality of translations improved by between 40% and 60% over the course of the project, while up to 70% of words were translated correctly, even if they were not placed in the right position in a sentence.

From speeches to Chinese news bulletins

The 11 partners – including big telecom and entertainment companies, such as Nokia, Siemens, IBM and Sony – worked with recordings of speeches from the European Parliament, which they translated between English and Spanish. They also worked with radio news broadcasts, which they translated from Chinese to English.

Though the system still cannot match the accuracy of a human translator or interpreter, Federico is convinced that, with further research a commercially viable automatic speech-to-speech translator will be feasible within a few years, at least for some simpler language pairs.

In the meantime, components developed in the TC-STAR project have been made available under an open source license. The project has also led to at least one spin-off company and a follow-up initiative.

Called PerVoice, the spin-off is offering remote-automated transcription services for companies and public bodies.

“It saves them time and money to have minutes of meetings or town council sessions transcribed automatically,” Federico notes.

The follow-up project, JUMAS, focuses on developing a similar transcription system to record court trial proceedings.

Source: ICT Results


print this article email this article download pdf blog this article bookmark this article     Digg this Stumble it share on Facebook share on Reddit add to delicious save to Yahoo! bookmarks
4.6/5 after 14 votes

Rank Filter

Move the slider to adjust rank threshold, so that you can hide some of the comments.


Display comments: newest first

  • trippingsock - May 03, 2008
    • Rank: not rated yet
    It's still in science fiction realm, but will happen eventually. Can you imagine the impact of such device it would have on mankind. 'Babel tower' uniting all humans again. Yes I'm European and can see that language barrier is sometimes even greater than the borders between countries.

May 2nd, 2008 all stories
Technology / Software

Comments: 1
Rank: 4.6/5 after 14 votes

  • Stumble this up

  • Digg this

  • Share it:
  • share on Facebook
  • share on MySpace
  • share on Slashdot
  • rss-newsfeed
  • share on Google
  • share on Reddit
  • add to delicious
  • save to Yahoo! bookmarks
  • share on Windows Live
  • Add to Mixx!
Rating: 4.6/5 after 14 votes

  • Related Stories

  • Translate this: 'cognition-strength interfaces'
    created 1hour ago | popularity not rated yet | comments 0
  • Twitter brings new language to old institution, the Senate
    created Jun 17, 2009 | popularity not rated yet | comments 0
  • May You Stay Forever Young
    created May 27, 2009 | popularity not rated yet | comments 0
  • New tool improves productivity, quality when translating software
    created Feb 24, 2009 | popularity not rated yet | comments 0
  • Can networked human computation solve computer language comprehension?
    created Jan 26, 2009 | popularity not rated yet | comments 0

Tags


  • Physicists Demonstrate Quantum Memory with Matter Qubits
    Physicists Demonstrate Quantum Memory with Matter Qubits
    Physics / General Physics
    created Jul 03, 2009 | popularity 4.4 / 5 (17) | comments 1
  • 'Holey' Nanosheets for Wastewater Dye Removal
    Nanotechnology / Nanomaterials
    created Jul 01, 2009 | popularity 5 / 5 (5) | comments 1
  • Jellyfish Robot Swims Like its Biological Counterpart
    Jellyfish Robot Swims Like its Biological Counterpart
    Electronics / Robotics
    created Jun 26, 2009 | popularity 4.4 / 5 (8) | comments 1
  • Could Maxwell's Demon Exist in Nanoscale Systems?
    Could Maxwell's Demon Exist in Nanoscale Systems?
    Physics / General Physics
    created Jun 24, 2009 | popularity 4.4 / 5 (18) | comments 29
  • Living Safely with Robots, Beyond Asimov's Laws
    Living Safely with Robots, Beyond Asimov's Laws
    Electronics / Robotics
    created Jun 22, 2009 | popularity 4.6 / 5 (53) | comments 40
  • Other News

    Translate this: 'cognition-strength interfaces'

    Translate this: 'cognition-strength interfaces'

    Technology / Engineering

    created 1hour ago | popularity 5 / 5 (1) | comments 0

    (PhysOrg.com) -- A highly ambitious European project used basic cognitive function, eye-tracking and keystroke logging as the starting point for the study of human-computer interaction for translation. It ...


    DoCoMo invests $45.5M in US mobile video firm

    Technology / Business

    created 2 hours ago | popularity not rated yet | comments 0

    (AP) -- NTT DoCoMo, Japan's largest mobile phone operator, said Monday it spent $45.5 million to take a 35 percent share in a U.S. company that makes multimedia technology for its mobile phones.


    HTC Touch

    Taiwan's HTC earnings edge down in Q2

    Technology / Business

    created 4 hours ago | popularity not rated yet | comments 0

    HTC Corp, Taiwan's leading smartphone maker, said Monday its net profit in the second quarter was down almost two percent from a year earlier.


    Samsung announces earnings estimate (AP)

    Samsung announces earnings estimate

    Technology / Business

    created 4 hours ago | popularity not rated yet | comments 0

    (AP) -- Samsung Electronics Co., the world's biggest manufacturer of memory chips, announced quarterly earnings estimates for the first time Monday, saying it hopes to reduce market confusion and speculation ...


    Andreessen making leap from entrepreneur to VC

    Technology / Business

    created 6 hours ago | popularity not rated yet | comments 0

    (AP) -- Having built and sold two technology startups for a combined $11.7 billion, Marc Andreessen is ready to take a stab at, well, finding the next Marc Andreessen.