Speech-recognition technology is rapidly improving

July 23, 2009 By Troy Wolverton

Maybe I watched too much "Star Trek" when I was younger, but I love the idea of being able to command things in my house or in my car by talking to them.

I'd love to be like the Enterprise's Captain Picard and get a cup of tea by walking up to a wall panel and saying "Earl Grey, hot."

I'm not holding out for a voice-activated replicator. I'd settle for simpler stuff, such as being able to set the thermostat, turn on my oven, switch off the lights or record a ballgame by simply speaking my wishes out loud.

Unfortunately, my real-life experience with technology -- and I'm guessing yours, too -- has been nothing like "Star Trek." Instead, it's been largely frustrating and sometimes infuriating, what with voice-dialing cell phones that can't understand who you're asking them to call and phone trees that respond to simple voice commands but won't direct you to a live person who can handle more complex questions.

But if speech recognition advocates are to be believed, the science fiction world of widespread and well-functioning voice-controlled devices and appliances is finally just around the corner.

"I think we're seeing a real renaissance here," said Bill Meisel, a longtime follower of speech-recognition technology and editor of Speech Strategy News, an industry newsletter.

Meisel foresees a not-so-far-off world in which our mobile phones serve as a universal remote that allows us to control and program everything from TiVos to alarm clocks with our voice.

Todd Mozer, CEO of Sensory, a Sunnyvale, Calif.-based speech-recognition company, envisions a soon-to-come world filled with speech-controlled Internet devices, or SCIDs.

One example of such a device might be a clock radio that could not only tell you the current time when asked, but could also, using its Internet connection, tell you the weather in Boston.

Regardless of how the future unfolds, advocates such as Meisel and Mozer say it's nearer than most people might think. Voice recognition is not some pipe dream, but a maturing technology that has improved in recent years and already works well in certain circumstances.

Widespread adoption of speech recognition has been delayed by problems with accuracy, advocates acknowledge. But thanks to Moore's law, computers and microprocessors are getting faster at processing spoken words. They're also able to sort through and compare what's spoken with increasingly large databases of recognized words and ways of saying them.

Developers are also improving accuracy by narrowing the problem.

Some do that by limiting the number of words a device needs to understand. A Bluetooth headset, say, may recognize only a handful of commands. But it will understand those commands so well that it can respond to them, no matter a person's accent and despite having to listen for them over a talk-radio program.

Another way of narrowing the problem is to categorize the information sought, or focus on industry-specific vocabularies. Some speech-to-text programs, for example, are specifically designed for particular medical professions, such as radiology.

But it's more than just improved accuracy that has advocates excited about prospects for speech-recognition. The technology has become a big business, and major corporations, such as Apple, Microsoft and Google, are pushing it. Speech recognition technologies have become commonplace not only in cell phones and phone trees, but also in particular industries, such as transcribing medical records or even for use by workers in larger warehouses.

And two other trends could help move speech technologies into consumers' homes. One is the growing demand for more "natural" ways of interacting with technology products, such as by using gestures on a touch-screen or via motion-sensing controllers. Some analysts think speech works well in tandem with other natural interfaces.

The other trend is the growing number of home networks and Internet-connected devices. Having a network connection allows devices to download updates that would improve their speech recognition capabilities over time. It also can let devices tap into more powerful speech recognizers either on the local network or out on the Internet.

Indeed, getting speech-controlled devices into the mainstream is becoming less an issue of technology than consumer acceptance, advocates say. The more consumers are exposed to speech recognition services and have positive experiences, the more likely they'll seek speech-based interfaces in other areas, argues Meisel.

"People's attitudes have been colored by unpleasant experiences," said Meisel. "(Those) attitudes will change."

Here's hoping he's right. Dreams of "Star Trek" aside, I'd be happy to simply have my phone recognize when I want to dial my sister.

___

(c) 2009, San Jose Mercury News (San Jose, Calif.).
Visit MercuryNews.com, the World Wide Web site of the Mercury News, at http://www.mercurynews.com
Distributed by McClatchy-Tribune Information Services.


print this article email this article download pdf blog this article bookmark this article     Stumble it Digg this share on Facebook retweet share on Reddit add to delicious
Rate this story - 3.3 /5 (4 votes)

Rank Filter

Move the slider to adjust rank threshold, so that you can hide some of the comments.


Display comments: newest first

  • RayCherry - Aug 05, 2009
    • Rank: not rated yet
    Non English Devices? Microwaves that understand more langauges than most humans? Looking forward to that! ;-)

July 23, 2009 all stories

Comments: 1

3.3 /5 (4 votes)
  • Stumble this up

  • Digg this

  • share this

  • hide
  • Related Stories

  • Researchers produce 'neural fingerprint' of speech recognition
    created Nov 10, 2008 | popularity not rated yet | comments 0
  • IBM Research Unleashes Powerful Speech Software
    created Aug 18, 2008 | popularity not rated yet | comments 0
  • NEC Develops Speech-to-Speech Translation Software for Mobile Phones
    created Oct 24, 2005 | popularity not rated yet | comments 0
  • Carnegie Mellon engineering researchers to create speech recognition in silicon
    created Sep 13, 2004 | popularity not rated yet | comments 0
  • A computer can pick out speech even amid cacophony
    created Nov 26, 2008 | popularity not rated yet | comments 0



  • hide
  • Relevant PhysicsForums posts

  • Control System
    created Nov 24, 2009
  • Base Isolation Systems in Skyscrapers?
    created Nov 23, 2009
  • Need to interview a Computer Hardware Engineer for school project
    created Nov 23, 2009
  • transient heat transfer
    created Nov 23, 2009
  • Trying to adapt a fuel gage circuit
    created Nov 22, 2009
  • Pushing the piston.
    created Nov 22, 2009
  • More from Physics Forums - General Engineering

Other News

ORNL 'deep retrofits' can cut home energy bills in half

ORNL 'deep retrofits' can cut home energy bills in half

Technology / Energy

created 3 hours ago | popularity 5 / 5 (1) | comments 0

(PhysOrg.com) -- Oak Ridge National Laboratory has announced plans to conduct a series of deep energy retrofit research projects with the potential to improve the energy efficiency in selected homes by as ...


Time Inc., Conde Nast and Hearst are preparing to launch an online newsstand described as an "iTunes for magazines"

Magazine publishers creating 'iTunes for magazines': reports

Technology / Internet

created 1hour ago | popularity not rated yet | comments 0

US magazine publishers Time Inc., Conde Nast and Hearst are preparing to launch an online newsstand described as an "iTunes for magazines," according to published reports.


The logo of NBC studios in Burbank, California

Comcast bid for NBC Universal could be sealed next week: source

Technology / Business

created 1hour ago | popularity not rated yet | comments 0

Comcast's bid to buy a controlling stake in NBC Universal from General Electric could be sealed next week if GE reaches an agreement with Vivendi, a source close to the matter said Wednesday.


Design chosen for British 1,000 mph car

Design chosen for British 1,000 mph car (w/ Video)

Technology / Engineering

created 11 hours ago | popularity 4 / 5 (4) | comments 5

(PhysOrg.com) -- A British team hoping to be the first to get a car to 1,000 mph (1,610 km/h) has made its final design selection. The six-tonne car, known as the Bloodhound, will be powered by a Eurofighter ...


EU assembly adopts Internet, phone user rights

Technology / Telecom

created 7 hours ago | popularity not rated yet | comments 1

(AP) -- The European Parliament has endorsed new telecom rules that would give phone and Internet users more rights and allow them to appeal to national courts if they are cut off for illegal file-sharing.