Machine Translates Thoughts into Speech in Real Time
December 21, 2009 By Lisa Zyga
Model of the brain-machine interface for real-time synthetic speech production. The stroke-induced lesion (red X) disables speech output, but speech motor planning in the cerebral cortex remains intact. Signals collected from an electrode in the speech motor cortex are amplified and sent wirelessly across the scalp as FM radio signals. The Neuralynx System amplifies, converts, and sorts the signals. The neural decoder then translates the signals into speech commands for the speech synthesizer. Credit: Guenther, et al.
(PhysOrg.com) -- By implanting an electrode into the brain of a person with locked-in syndrome, scientists have demonstrated how to wirelessly transmit neural signals to a speech synthesizer. The "thought-to-speech" process takes about 50 milliseconds - the same amount of time for a non-paralyzed, neurologically intact person to speak their thoughts. The study marks the first successful demonstration of a permanently installed, wireless implant for real-time control of an external device.
The study is led by Frank Guenther of the Department of Cognitive and Neural Systems and the Sargent College of Health and Rehabilitation Sciences at Boston University, as well as the Division of Health Science and Technology at Harvard University-Massachusetts Institute of Technology. The research team includes collaborators from Neural Signals, Inc., in Duluth, Georgia; StatsANC LLC in Buenos Aires, Argentina; the Georgia Tech Research Institute in Marietta, Georgia; the Gwinnett Medical Center in Lawrenceville, Georgia; and Emory University Hospital in Atlanta, Georgia. The team published their results in a recent issue of PLoS ONE.
“The results of our study show that a brain-machine interface (BMI) user can control sound output directly, rather than having to use a (relatively slow) typing process,” Guenther told PhysOrg.com.
In their study, the researchers tested the technology on a 26-year-old male who had a brain stem stroke at age 16. The brain stem stroke caused a lesion between the volunteer’s motor neurons that carry out actions and the rest of the brain; while his consciousness and cognitive abilities are intact, he is paralyzed except for slow vertical movement of the eyes. The rare condition is called locked-in syndrome.
Five years ago, when the volunteer was 21 years old, the scientists implanted an electrode near the boundary between the speech-related premotor and primary motor cortex (specifically, the left ventral premotor cortex). Neurites began growing into the electrode and, in three or four months, the neurites produced signaling patterns on the electrode wires that have been maintained indefinitely.
Three years after implantation, the researchers began testing the brain-machine interface for real-time synthetic speech production. The system is “telemetric” - it requires no wires or connectors passing through the skin, eliminating the risk of infection. Instead, the electrode amplifies and converts neural signals into frequency modulated (FM) radio signals. These signals are wirelessly transmitted across the scalp to two coils, which are attached to the volunteer’s head using a water-soluble paste. The coils act as receiving antenna for the RF signals. The implanted electrode is powered by an induction power supply via a power coil, which is also attached to the head.
The signals are then routed to an electrophysiological recording system that digitizes and sorts them. The sorted spikes, which contain the relevant data, are sent to a neural decoder that runs on a desktop computer. The neural decoder’s output becomes the input to a speech synthesizer, also running on the computer. Finally, the speech synthesizer generates synthetic speech (in the current study, only three vowel sounds were tested). The entire process takes an average of 50 milliseconds.
As the scientists explained, there are no previous electrophysiological studies of neuronal firing in speech motor areas. In order to develop an accurate neural coding scheme, they had to rely on an established neurocomputational model of speech motor control. According to this model, neurons in the left ventral premotor cortex represent intended speech sounds in terms of “formant frequency trajectories.”
In an intact brain, these frequency trajectories are sent to the primary motor cortex where they are transformed into motor commands to the speech articulators. However, in the current study, the researchers had to interpret these frequency trajectories in order to translate them into speech. To do this, the scientists developed a two-dimensional formant frequency space, in which different vowel sounds can be plotted based on two formant frequencies (whose values are represented on the x and y axes).
“The study supported our hypothesis (based on the DIVA model, our neural network model of speech) that the premotor cortex represents intended speech as an ‘auditory trajectory,’ that is, as a set of key frequencies (formant frequencies) that vary with time in the acoustic signal we hear as speech,” Guenther said. “In other words, we could predict the intended sound directly from neural activity in the premotor cortex, rather than try to predict the positions of all the speech articulators individually and then try to reconstruct the intended sound (a much more difficult problem given the small number of neurons from which we recorded). This result provides our first insight into how neurons in the brain represent speech, something that has not been investigated before since there is no animal model for speech.”
To confirm that the neurons in the implanted area were able to carry speech information in the form of formant frequency trajectories, the researchers asked the volunteer to attempt to speak in synchrony with a vowel sequence that was presented auditorily. In later experiments, the volunteer received real-time auditory feedback from the speech synthesizer. During 25 sessions over a five-month period, the volunteer significantly improved the thought-to-speech accuracy. His average hit rate increased from 45% to 70% across sessions, reaching a high of 89% in the last session.
Although the current study focused only on producing a small set of vowels, the researchers think that consonant sounds could be achieved with improvements to the system. While this study used a single three-wire electrode, the use of additional electrodes at multiple recording sites, as well as improved decoding techniques, could lead to rapid, accurate control of a speech synthesizer that could generate a wide range of sounds.
“Our immediate plans involve the implementation of a new synthesizer that can produce consonants as well as vowels but remains simple enough for a BMI user to control,” Guenther said. “We are also working on hardware that will greatly increase the number of neurons that are recorded. We expect to tap into at least 10 times as many neurons in the next implant recipient, which should lead to a dramatic improvement in performance.”
Overall, the work marks a milestone in the development of a permanent neural prosthesis that requires no major external hardware beyond a wireless receiver and laptop computer. Previous brain-machine interfaces for communication applications are very slow, producing only about one word per minute. The new system has the potential to enable real-time conversation, and help minimize the social isolation that accompanies profound paralysis.
More information: Guenther FH, Brumberg JS, Wright EJ, Nieto-Castanon A, Tourville JA, et al. (2009) A Wireless Brain-Machine Interface for Real-Time Speech Synthesis. PLoS ONE 4(12): e8218. doi:10.1371/journal.pone.0008218
Copyright 2009 PhysOrg.com.
All rights reserved. This material may not be published, broadcast, rewritten or redistributed in whole or part without the express written permission of PhysOrg.com.
-
Zeroing in on the brain's speech 'receiver'
Jun 20, 2007 |
not rated yet |
0
-
Where the brain makes sense of speech
Dec 19, 2007 |
not rated yet |
0
-
Researchers shed light on the brain mechanism responsible for processing of speech
Aug 12, 2009 |
not rated yet |
0
-
Read my lips: Using multiple senses in speech perception (Video)
Feb 11, 2009 |
not rated yet |
0
-
Our faces, not just our ears 'hear' speech: study
Jan 20, 2009 |
not rated yet |
0
-
Engineers build first sub-10-nm carbon nanotube transistor
Feb 01, 2012 |
4.9 / 5 (30) |
30
-
Something old, something new: Evolution and the structural divergence of duplicate genes
Jan 31, 2012 |
4.6 / 5 (7) |
1
-
The hidden nanoworld of ice crystals: Revealing the dynamic behavior of quasi-liquid layers
Jan 30, 2012 |
5 / 5 (3) |
1
-
Stock market network reveals investor clustering
Jan 27, 2012 |
3.9 / 5 (23) |
8
-
Of microchemistry and molecules: Electronic microfluidic device synthesizes biocompatible probes
Jan 26, 2012 |
5 / 5 (1) |
0
-
Can I forget a language?
1 hour ago
-
Is Everyday Technology Killing Us?
Feb 08, 2012
-
Exercise and weight loss
Feb 08, 2012
-
Why do we have head aches? Our brains can't feel anything.
Feb 07, 2012
-
"The end of diseases" by David Agus, interview from Daily Show with Jon Stewart
Feb 04, 2012
-
Oncolytic adenovirus
Feb 04, 2012
- More from Physics Forums - Medical Sciences
More news stories
New understanding of DNA repair could eventually lead to cancer therapy
A research group in the Faculty of Medicine & Dentistry at the University of Alberta is hoping its latest discovery could one day be used to develop new therapies that target certain types of cancers.
55 minutes ago |
5 / 5 (3) |
0
|
Researchers develop new method for creating tissue engineering scaffolds
Researchers at Northwestern University have developed a new method for creating scaffolds for tissue engineering applications, providing an alternative that is more flexible and less time-intensive than current technology.
8 minutes ago |
not rated yet |
0
|
Drug halts organ damage in inflammatory genetic disorder
A new study shows that Kineret (anakinra), a medication approved for the treatment of rheumatoid arthritis, is effective in stopping the progression of organ damage in people with neonatal-onset multisystem inflammatory disease ...
Medicine & Health / Medications
19 minutes ago |
not rated yet |
0
Molecular profiling reveals differences between primary and recurrent ovarian cancers
There is a need to analyze tumor specimens at the time of ovarian cancer recurrence, according to a new study published in Molecular Cancer Therapeutics. Researchers used a diagnostic technology called molecular profiling to examine ...
8 minutes ago |
not rated yet |
0
|
Both maternal and paternal age linked to autism
Older maternal and paternal age are jointly associated with having a child with autism, according to a recently published study led by researchers at The University of Texas Health Science Center at Houston (UTHealth).
Medicine & Health / Psychology & Psychiatry
1 hour ago |
not rated yet |
0
|
Hovering not hard if you're top-heavy, researchers find
Top-heavy structures are more likely to maintain their balance while hovering in the air than are those that bear a lower center of gravity, researchers at New York University's Courant Institute of Mathematical Sciences ...
Grass to gas: Researchers' genome map speeds biofuel development
Researchers at the University of Georgia have taken a major step in the ongoing effort to find sources of cleaner, renewable energy by mapping the genomes of two originator cells of Miscanthus x giganteus, a large perenn ...
Zuckerberg's focus drives Facebook's ascent
When Mark Zuckerberg showed up to rent Judy Fusco's Los Altos, Calif., house in the fall of 2004, soon after he'd arrived in Silicon Valley, the landlord was immediately struck by his confidence.
Night, weekend delivery OK for babies with birth defects
Weekday delivery is no better than night or weekend delivery for infants with birth defects, according to a new study presented today at The Pregnancy Meeting, the Society for Maternal-Fetal Medicine's annual conference. ...
Sonic Cradle lands spot in TED exhibition
A Simon Fraser University graduate student project that melds music, meditation and modern technology has landed a rare spot as an exhibit at TEDActive 2012 in Palm Springs, California this month.
Cochlear implants may be safe, effective for organ transplant patients
Cochlear implants may be a safe, effective option for some organ transplant patients who've lost their hearing as an unfortunate consequence of their transplant-related drug regime, researchers report.
Dec 21, 2009
Rank: 5 / 5 (1)
Dec 21, 2009
Rank: 5 / 5 (1)
Dec 21, 2009
Rank: 1 / 5 (1)
Dec 21, 2009
Rank: 5 / 5 (1)
the mind is still a black box, theories are how the auditory and speech system may be obscured by plastic adaptation of the neuronal system of the subjects brain to the device that is implanted in it, however, as long as the theory helps the progress and development of the technology, it's utility warrants its explanatory legitimacy, particularly with regard to the mechansim of the action in the artificial speech device (as opposed to natural speech) .
Dec 21, 2009
Rank: 5 / 5 (1)
I was imagining a small normal wire in comparison to a nerve cell... Kind of like a skyscraper next to a human. Picks up the group yelling but not the individual.
Dec 22, 2009
Rank: 5 / 5 (2)
Dec 22, 2009
Rank: 5 / 5 (1)
Dec 22, 2009
Rank: not rated yet
I honestly hope it was the former.
*After* the primary motor cortex would have made learning to speak way easier for the patient. Weeks, instead of half a year! Learning to ski, skate, drive, karate, whistle, they're all simply reprogramming the motor cortex. That's what we do. We're actually better at it than any other species.
Dec 22, 2009
Rank: 5 / 5 (1)
Dec 23, 2009
Rank: 5 / 5 (1)
Dec 23, 2009
Rank: 5 / 5 (2)
No doubt if telepathy exists in advanced species it would be a natural development of this tech, implanted at first and eventually genetically produced.
Dec 24, 2009
Rank: 5 / 5 (2)
So will tele-robots that work at the command of a person wearing a thinking cap. It seems that mind reading devices are much closer to reality than anyone thought. The singularity is approaching and its picking up speed.
Dec 25, 2009
Rank: 4 / 5 (1)
However, I can already imagine some of the potential negative consequences if this was to morph into a new form of lie detection, and/or thought police tool as we push into a world of more control.
Dec 25, 2009
Rank: 5 / 5 (1)
Dec 25, 2009
Rank: not rated yet
Dec 26, 2009
Rank: not rated yet
Dec 27, 2009
Rank: 4 / 5 (1)
Dec 27, 2009
Rank: not rated yet
See? Like I said- we don't usually need to break laws but we feel it's our right to do so. We WANT to, just as any animal wants to find a way out of the cage. Instinct. I heard 2 lawyers talking yesterday about a simple case with an inevitable conclusion which nevertheless dragged on for years, causing needless hardship and expense, only because the defendent had some influence. Look up the story on the Guess Jeans magnate. Over principle he has ruined lives not to prove any point but because he is an unsettled paranoid. With millions to burn. Enough is enough. Only automated justice is truly blind.
Dec 27, 2009
Rank: not rated yet
Dec 27, 2009
Rank: not rated yet
Dec 28, 2009
Rank: not rated yet
Dec 29, 2009
Rank: not rated yet
Anyone remember the Gilligan Island episode where Gilligan eats something and can read the thoughts of others? This story reminds me of that. What happens when we say exactly what we think? "Hey, Ginger, nice t*ts!"
Dec 30, 2009
Rank: not rated yet
Dec 30, 2009
Rank: not rated yet
Jan 02, 2010
Rank: not rated yet
I personally think that given that people have the right to end all treatment - effectively choosing a slow a painful death - we should also give them the right to choose a quick and painless death. After all, murderers die by painless lethal injection, so why do we force innocents to suffer more?
I hasten to add that this should be a personal choice - but I know which one I'd make.
Jan 06, 2010
Rank: not rated yet
Than you.
Jan 06, 2010
Rank: not rated yet
Actually, thats not technically true. The impulses to the motor cortex are working fine (this is why putting the electrodes in between the pre-motor and the motor cortexes found signal) but the connections from the motor cortex to the actual MUSCLES was destroyed. This is why he was paralyzed (spinal cord damage).
Interestingly, the signals through your brain to cause movement (in a normal, healthy human) are almost identical to the signals sent while merely THINKING about moving, minus the primary motor cortex (which initiates the movements).