W3C looks to improve speech recognition technology for web transactions

December 10, 2005 A man surfs the internet in Valencia

W3C, the standards-setting body for the Internet (World Wide Web Consortium), has completed a draft for the important VoiceXML 3.0 - technology enabling voice identification verification. While normally associated with voice commands, it has the potential to greatly speed and improve the accuracy and positive proof of online transactions.

Some larger net businesses are even using it to confirm orders and verify identity. Many, however, have become increasingly worried about the reliability and security of these transactions with fraud and identity theft on the rise. Error rates have been around 1 to 2% - unacceptable for ironclad business transactions.

W3C does not actually make software but produces standards. They now have a working draft, said James Larson, co-chair of the W3C VBWG - Voice Browser Working Group.

The standard also addressed the issue of extending its Speech Synthesis Markup Language (SSML) functionality to certain languages including Mandarin Chinese, Japanese and Korean.

SSML is important because it allows software makers to control speech from pitch to volume to pronunciation. This insures the software will hear the right tones and pitches so critical in languages were a tiny change in pronunciation can affect the whole meaning of a word.

SSML is also used to tag areas of speech with different regional pronunciations. It is based on JSpeech Grammar Format (JSGF).

A technical description and how to use SSML version 1 on a web page can be found here: http://www.xml.com/pub/a/2004/10/20/ssml.html

Microsoft Agent website is another source for would be speech interface developers.
http://www.microsoft.com/MSAGENT/downloads/user.asp

Opera browsers can be programmed for speech recognition with some XHTML (Extended Hypertext Markup Language) extensions. http://my.opera.com/community/dev/voice/

Working with web-based speech applications can be frustrating. While the speech recognition software works well, poor quality microphones and PC speakers combined with slower Internet connections can put a damper on effectiveness. These issues will be difficult to address due to being largely beyond the control of the developer. New speech compression algorithms and simple responses like yes or no make the job much easier.

Trained systems – ones that are accustomed to the user’s voice – have been much more successful, but users typically do not have the patience to complete the training and the time factor makes it impractical.

Expect to find the first complex VoiceXML 3.0 technology mostly in telephone-connected and cell-phone activated systems – ones that have more controllable voice quality.

Hopefully, with the new W3C standards, companies can dedicate more to useful speech recognition and less to reinventing the wheel. Standards usually lead to software tool kits for programmers and these often end up in popular packages like Microsoft’s Frontpage and Adobe’s Macromedia Dreamweaver.

Amateur and professional web designers alike may soon find a compelling reason to upgrade to voice enabled web design suites.

Maybe one day you can toss that pesky keyboard and mouse and talk to your machine instead – a promise made since the late 1980s and not yet satisfactorily realized.

by Philip Dunn, Copyright 2005 PhysOrg.com


print this article email this article download pdf blog this article bookmark this article     Stumble it Digg this share on Facebook retweet share on Reddit add to delicious
Rate this story - 4.7 /5 (3 votes)


December 10, 2005 all stories

Comments: 0

4.7 /5 (3 votes)
  • Stumble this up

  • Digg this

  • share this

  • hide
  • Related Stories

  • Switching cell phones takes emotional toll
    created Nov 18, 2009 | popularity not rated yet | comments 0
  • Babies' language learning starts from the womb
    created Nov 05, 2009 | popularity not rated yet | comments 0
  • FCC girds itself to navigate swamp where Web, phone, TV meet
    created Oct 22, 2009 | popularity not rated yet | comments 0
  • Augmented reality to help astronauts make sense of space
    created Sep 22, 2009 | popularity not rated yet | comments 0
  • Monkeys get a groove on, but only to monkey music (w/ Audio)
    created Sep 01, 2009 | popularity not rated yet | comments 0


Other News

Design chosen for British 1,000 mph car

Design chosen for British 1,000 mph car (w/ Video)

Technology / Engineering

created 20 hours ago | popularity 3.9 / 5 (7) | comments 5

(PhysOrg.com) -- A British team hoping to be the first to get a car to 1,000 mph (1,610 km/h) has made its final design selection. The six-tonne car, known as the Bloodhound, will be powered by a Eurofighter ...


Time Inc., Conde Nast and Hearst are preparing to launch an online newsstand described as an "iTunes for magazines"

Magazine publishers creating 'iTunes for magazines': reports

Technology / Internet

created 10 hours ago | popularity not rated yet | comments 0

US magazine publishers Time Inc., Conde Nast and Hearst are preparing to launch an online newsstand described as an "iTunes for magazines," according to published reports.


Should I buy a PC or Mac?

Technology / Software

created 8 hours ago | popularity 4 / 5 (2) | comments 4

Q. Our 6-year-old PC computer is dying a slow death and we are considering moving to a new iMac but have a few concerns. First, of all, we have several Word documents on our disk drive now that we want to keep and add to ...


ORNL 'deep retrofits' can cut home energy bills in half

ORNL 'deep retrofits' can cut home energy bills in half

Technology / Energy

created 11 hours ago | popularity 3 / 5 (2) | comments 0

(PhysOrg.com) -- Oak Ridge National Laboratory has announced plans to conduct a series of deep energy retrofit research projects with the potential to improve the energy efficiency in selected homes by as ...


EU assembly adopts Internet, phone user rights

Technology / Telecom

created 16 hours ago | popularity 5 / 5 (1) | comments 1

(AP) -- The European Parliament has endorsed new telecom rules that would give phone and Internet users more rights and allow them to appeal to national courts if they are cut off for illegal file-sharing.