Generating 'oohs' and 'aahs': Vocal Joystick uses voice to surf the Internet

October 9, 2007
Doctoral student Brandi House uses Vocal Joystick to control the movement of a robotic arm. The screen on the lower right shows how the software analyzes her vocalizations to create instructions for the arm's movement. Credit: University of Washington

The Internet offers wide appeal to people with disabilities. But many of those same people find it frustrating or impossible to use a handheld mouse. Software developed at the University of Washington provides an alternative using one of the oldest and most versatile modes of communication: the human voice.

"There are many people who have perfect use of their voice who don't have use of their hands and arms," said Jeffrey Bilmes, a UW associate professor of electrical engineering. "I think there are several reasons why Vocal Joystick might be a better approach, or at least a viable alternative, to brain-computer interfaces." The tool's latest developments will be presented this month in Tempe, Ariz., at the Assets Conference on Computers and Accessibility.

Vocal Joystick samples the user's voice 100 times a second and instantaneously turns each sound into movement on the screen. Vowel sounds set the direction: "ah," "ee," "aw," "oo" and other sounds each move the cursor in one of eight directions. Users can transition smoothly from one vowel to another, and louder sounds make the cursor move faster. The sounds "k" and "ch" simulate clicking and releasing the mouse buttons.
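
The mapping from sound to motion can be pictured with a short sketch. This is only an illustration, not the UW software: the vowel-to-direction assignments, the `cursor_step` and `move` helpers, and the maximum speed are all assumptions made for the example, since the article names the vowels but not the directions they control. It covers four of the eight directions and assumes the vowel and loudness have already been extracted from the audio.

```python
import math

FRAME_RATE_HZ = 100  # from the article: sounds are analyzed 100 times a second

# Hypothetical assignment of vowels to angles in degrees (0 = right, 90 = up).
# The article lists these vowels but does not say which direction each controls.
VOWEL_ANGLE_DEG = {"aw": 0.0, "ee": 90.0, "oo": 180.0, "ah": 270.0}


def cursor_step(vowel, loudness, max_speed_px_per_s=400.0):
    """Return the (dx, dy) cursor movement in pixels for one audio frame.

    loudness is a normalized value in [0, 1]; louder means faster.
    Unrecognized sounds produce no movement.
    """
    if vowel not in VOWEL_ANGLE_DEG:
        return (0.0, 0.0)
    loudness = min(max(loudness, 0.0), 1.0)  # clamp to the valid range
    step = max_speed_px_per_s * loudness / FRAME_RATE_HZ
    angle = math.radians(VOWEL_ANGLE_DEG[vowel])
    return (step * math.cos(angle), step * math.sin(angle))


def move(frames, start=(0.0, 0.0)):
    """Accumulate cursor movement over a sequence of (vowel, loudness) frames."""
    x, y = start
    for vowel, loudness in frames:
        dx, dy = cursor_step(vowel, loudness)
        x += dx
        y += dy
    return (x, y)
```

Under these assumptions, holding "aw" at half loudness for one second (100 frames) moves the cursor 200 pixels to the right, and doubling the loudness doubles the distance covered per frame.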

Versions of Vocal Joystick exist for browsing the Web, drawing on a screen, controlling a cursor and playing a video game. A version also exists for operating a robotic arm, and Bilmes believes the technology could be used to control an electronic wheelchair.

Existing substitutes for the handheld mouse include eye trackers, sip-and-puff devices, head-tracking systems and other tools. Each technology has drawbacks. Eye-tracking devices are expensive and require that the eye simultaneously take in information and control the cursor, which can cause confusion. Sip-and-puff joysticks held in the mouth must be spit out if the user wants to speak, and can be tiring. Head-tracking devices require neck movement and expensive hardware.

Vocal Joystick requires only a microphone, a computer with a standard sound card and a user who can produce vocal sounds.

"A lot of people ask: 'Why don't you just use speech recognition?'" Bilmes said. "It would be very slow to move a cursor using discrete commands like 'move right' or 'go faster.' The voice, however, is able to do continuous commands quickly and easily." Early tests suggest that an experienced user of Vocal Joystick would have as much control as someone using a handheld device.

In the laboratory, doctoral student Jonathan Malkin, who helped develop the tool, uses Vocal Joystick to play a game called Fish Tale. It takes two minutes to train the program for Malkin's voice. He then moves the fish character easily around the screen, raising his voice slightly to speed up and avoid being eaten by a predator fish.

The newest development, which will be presented at the October meeting in Tempe, uses Vocal Joystick to control a robotic arm. The pitch of the tone moves the arm up and down; other commands are unchanged. This is the first time that vocal commands have been used to control a three-dimensional object, Bilmes said.
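
One way to picture the pitch control is as a third axis driven by how far the voice sits above or below a resting tone. The sketch below is an assumption-laden illustration rather than the researchers' method: the `arm_vertical_velocity` function, the 150 Hz baseline, the logarithmic (octave-based) scaling and the gain are all hypothetical choices, since the article says only that pitch moves the arm up and down.

```python
import math


def arm_vertical_velocity(pitch_hz, baseline_hz=150.0, gain_cm_per_s_per_octave=10.0):
    """Return a vertical arm velocity in cm/s from the pitch of the voice.

    Hypothetical mapping: pitch above the baseline raises the arm, pitch
    below it lowers the arm, and speed grows with the distance in octaves.
    """
    if pitch_hz <= 0 or baseline_hz <= 0:
        return 0.0  # no voiced sound detected: hold the arm still
    octaves_from_baseline = math.log2(pitch_hz / baseline_hz)
    return gain_cm_per_s_per_octave * octaves_from_baseline
```

With these illustrative values, a tone one octave above the baseline (300 Hz) raises the arm at 10 cm/s, a tone one octave below (75 Hz) lowers it at the same rate, and a tone at the baseline leaves it where it is.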

One initial concern, he said, was whether people would feel self-conscious using the tool.

"But once you try it you immediately forget what you're saying," Bilmes said. "I usually go to the New York Times' Web site to test the system and then I get distracted and start reading the news. I forget that I'm using it."

To test the device, the group has been working with about eight spinal-cord injury patients at the UW Medical Center since March.

"It's a really exciting idea. I think it has tremendous potential," said Kurt Johnson, a professor of rehabilitation medicine who is helping with the tests.

Bilmes said he hopes people will become more adept at using the system over time. Future research will incorporate more advanced controls that use more aspects of the human voice, such as repeated vocalizations, vibrato, degree of nasality and trills.

"While people use their voices to communicate with just words and phrases," Bilmes said, "the human voice is an incredibly flexible instrument, and can do so much more."

Source: University of Washington
