A computer can pick out speech even amid cacophony
November 26, 2008
Schematic diagram of SHoUT
(PhysOrg.com) -- Using a recent development in speech recognition, it is possible to search through television news programmes provided the recognition system has been trained beforehand. PhD candidate Marijn Huijbregts from the University of Twente (Netherlands) has, however, taken things even further: he has developed Spoken Document Retrieval for audio and video files that the speech recognition system has not yet been trained to deal with.
This version of speech recognition works well even if there is a great deal of unexpected background noise. Huijbregts received his doctorate from the Faculty of Electrical Engineering, Mathematics and Computer Science on 21 November.
Information can be retrieved from text very quickly using, for example, an index in a book or a search machine such as Google. However, it is much more difficult to search in audio and video files, as they do not have an easily searchable index. You can use speech recognition to simplify this process as most of the information in audio and video files comes from speech. By recording via speech recognition, you can transform speech into text. To do this, you need a Spoken Document Retrieval (SDR) system; this makes it possible to search directly in audio and video materials, just as if you were searching in ordinary text documents. In other words, a sort of Google for audio and video.
Evening news on television
The Human Media Interaction group at the University of Twente had previously developed an SDR system for an evening television news programme. Search terms could be used to look for specific topics, the system being specially trained using newspaper texts and 20 hours of news programmes. The SDR for the evening news programme worked well because, in that situation, it was more or less known what was going to be said and there was little background noise. If you tried applying this system, without any training, to other video files, it did not perform well. Huijbregts then wondered whether he could develop a SDR system for which almost no training data would be needed, but which could nevertheless deal with unknown audio and video files satisfactorily.
SHoUT
With unknown audio and video files, it is not clear beforehand what is going to happen: who is speaking, what is being said and what sort of background noises are present. Huijbregts therefore developed an SDR system that was robust enough to deal with these unknown situations. It is called SHoUT (this acronym corresponds to the Dutch version of ‘Speech Recognition Research at University of Twente’). SDR can be described as robust if it can deal with all audio and video files under all sorts of circumstances, such as background noise or if people are not speaking clearly.
SHoUT is divided up into three stages. Firstly, the system distinguishes between speech and other sounds. For example, background music is filtered out from speech. Secondly, the system identifies different speakers and gives them labels. Then finally the automatic speech recognition takes place: the system transforms speech into text. You can now search the text file for relevant topics using key words, just as Google searches through text files on Internet.
The first version of SHoUT is already available, but Huijbregts is developing it even further. SHoUT and other demonstrations of SDR systems can be found on the website of Huijbregts (http://wwwhome.cs. … l/~huijbreg/).
Provided by University of Twente, Netherlands
-
Playback: 130-year-old sounds revealed
Dec 15, 2011 |
5 / 5 (5) |
7
-
Apple seeks patents for display and noise-out systems
Dec 11, 2011 |
3 / 5 (15) |
19
-
New software tool provides unprecedented searches of sound, from musical riffs to gunshots
Nov 09, 2011 |
5 / 5 (2) |
0
-
Superior sound for telephones, mobile and related devices
May 26, 2011 |
5 / 5 (2) |
0
-
Gadgets: Ultra portable video camera trades fun for video/audio quality
Jun 04, 2010 |
2 / 5 (3) |
0
-
Fast photon control brings quantum photonic technologies closer
1 hour ago |
5 / 5 (2) |
0
-
Engineers build first sub-10-nm carbon nanotube transistor
Feb 01, 2012 |
4.9 / 5 (33) |
30
-
Something old, something new: Evolution and the structural divergence of duplicate genes
Jan 31, 2012 |
4.6 / 5 (7) |
1
-
The hidden nanoworld of ice crystals: Revealing the dynamic behavior of quasi-liquid layers
Jan 30, 2012 |
5 / 5 (5) |
1
-
Stock market network reveals investor clustering
Jan 27, 2012 |
3.9 / 5 (23) |
8
-
Quantum computer faster than regular computer?
4 hours ago
-
Flushing RAM in Mathematica
9 hours ago
-
Synergistic relations between computer science and technology.
Feb 06, 2012
-
how do iphone gloves work?
Feb 05, 2012
-
iPhone battery over time
Jan 30, 2012
-
Best alternate Tablet to an iPad for writing math or physics equations?
Jan 26, 2012
- More from Physics Forums - Computing & Technology
More news stories
Microsoft India retail site down after 'cyber attack'
Microsoft said Monday it was investigating an attack by hackers on its Indian retail website, reportedly carried out by a Chinese group called the "Evil Shadow Team."
5 hours ago |
not rated yet |
0
Chinese city seizes Apple iPads in name dispute
(AP) -- Authorities have seized Apple iPads from retailers in a city in northern China due to a dispute with a domestic company that says it owns the iPad name, an official said Monday. The Chinese company said it is asking ...
5 hours ago |
not rated yet |
0
AT&T customers surprised by 'unlimited data' limit
(AP) -- Mike Trang likes to use his iPhone 4 as a GPS device, helping him get around in his job. Now and then, his younger cousins get ahold of it, and play some YouTube videos and games.
7 hours ago |
5 / 5 (2) |
3
Independent group inspects Apple supplier
(AP) -- An independent group, the Fair Labor Association, has started auditing Apple Inc.'s Chinese supplier Foxconn after a request by Apple.
1 hour ago |
not rated yet |
0
Hacker claims porn site users compromised
A hacker claims to have compromised the personal information of more than 350,000 users after breaking into a disused website operated by pornography provider Brazzers.
6 hours ago |
5 / 5 (3) |
0
Slowing ocean current caused Earth to spin faster
(PhysOrg.com) -- Most people probably didn’t notice it, but back in 2009, the Earth spun around on its axis a tiny bit faster than usual, making for some slightly shorter days. It only happened for a ...
China's pollution related to E-cars may be more harmful than gasoline cars, researchers find
Electric cars have been heralded as environmentally friendly, but findings from University of Tennessee, Knoxville, researchers show that electric cars in China have an overall impact on pollution that could be more harmful ...
What we mean when we ask for the milk
New research into the different ways that English and Polish people use language in everyday family situations can help members of each community to understand each other better and avoid cultural misunderstandings.
Nerve sparing helps most prostate cancer patients to have same orgasms as before surgery
The vast majority of men who have a prostate cancer operation can retain their ability to orgasm if the surgery is carried out without removing the nerves that surround the prostate gland like a hammock, according to a study ...
Larger belly linked to memory problems in people with HIV
A larger waistline may be linked to an increased risk of decreased mental functioning in people infected with the AIDS virus HIV, according to research published in the February 14, 2012, print issue of Neurology, the me ...
Big fish reveal shelter secrets on reefcam
When it comes to choosing a place to hang out, big reef fish like coral trout, snappers and sweetlips have strong architectural preferences.
Nov 30, 2008
Rank: not rated yet