Microsoft Gets Patent for Patently Offensive Audio Content
October 28th, 2008 by Mary Anne Simpson
US Patent and Trademark Expo
(PhysOrg.com) -- Microsoft recently obtained a patent designed to create an Automatic Censorship of Audio Data For Broadcast . The invention is intended to act as a filter for live broadcasts where it is impracticable to delete or make inaudible certain undesired words or phrases. Additionally, other audio streams like music or games can utilize the automatic censor.
An input audio data stream comprising speech is processed by an automatic censoring filter in either real time mode or batch mode, producing censored speech that has been altered so the undesired words or phrases are either unintelligible or inaudible. The substituted speech is not bleeped out, but modified in the speakers own voice.
According to the patent abstract, the reason for the invention is that it is common for radio and and television broadcasts of live events to be delayed a few seconds before the audio data is transmitted to enable real-time (censors) content reviewers to evaluate whether the content contains objectionable language. Broadcast television and radio are heeding the demand by the public to essentially "clean up" the airways for the general public and children in particular. The degree of censorship depends on the targeted audience and the nature of the event. Specifically, certain words which are commonly referred to as profanity, obscenities and sexually explicit words are targeted by censors to be unacceptable for public broadcasting.
In certain situations where there are multiple individuals speaking as in a game-play scenario it is almost impossible for a human censor to catch all extraneous remarks or words. This is where the Automatic Censorship of Audio Data for Broadcast fills in the gaps. Instead of bleeping out objectionable language, the patent will essentially supply a substitute phrase or word or make the word inaudible to the listener. While the patent can employ the "bleeping" of objectionable words, it will attempt to overwrite the objectionable content with a more acceptable term or word. The acceptable word or phrase is created by using previously uttered phonemes of the speaker that can be combined to produce the more acceptable version. Thus, the censored content appears and sounds like the speaker´s own words.
The Automatic Censorship of Audio Data for Broadcast patent is not without its critics. On-line forums and bloggers are not uniformly impressed with the patent. According to one comment on Slashdot, "Come on, if you´re old enough to play the game, you´re old enough to either deal with it or tell them to stuff a sock in it." Other comments include the impossible task of coding various types of a new breed of curse words. To wit, another commentator congratulates Microsoft for expanding the lexicon of profanity to include a more creative style of profanity and obscenity in order to fool the program.
One thing is for sure, the Automatic Censorship of Audio Data for Broadcast evokes a discussion on the subject of censorship. Comments are welcome on this post without %$ included, if at all possible.


They probably got the idea from that Mac commercial where PC presses the buzzer when Mac tries to say 'Vista'. Or should I say V*&^$?
Now that's scary. Talk about putting words into somebody else's mouth.
What happens when someone finds the inserted words objectionable?
A modified version of this could make for interesting debates!
...or Used Car ads "Come down and see the, uh, mile of cars we have on our lot."
That's a good one. XD
It is always more efficient to squelch the source of noise rather than cleaning the many messes it makes everywhere it goes. What Microsoft is doing is the antithesis of this and they are pretty ambitious to try it. Let's _all_ hope it works! Anyone who doesn't want it to work is a sociopath who needs help.
Sadly, it's doable now on a basic level. Making it foolproof and seamless will be much harder. On video you would get a "Godzilla effect", the words won't match the mouth movements. I can imagine people would have fun with saying completely obscene things without using curse words.
As an aside I'd like to recommend the novel "Little Brother" by Mitch Kapor. A very smart and engrossing story that deals with issues of technology and freedom.
Ack! That's Cory Doctorow, not Mitch Kapor! Mitch's name was on the dust cover because he was recommending it. I should have gotten up to check. My bad.
I can already see it now. I'm watching a porn movie online and the girl goes "Oh Yes! Sexually please me!"
Unless a program like this can be turned off by the user at will I doubt if it's even in accordance with freedom of information.
Both the automated and manual systems are also vulnerable to the "slow obscenity" hole. If a live broadcast is on a 5 second delay, you just have to say your obscenity very slowly so that it takes 6 seconds to say, and now it won't be caught. Of course this isn't practical for common use, but I'm sure comedians could have fun with it.
I remember that movie :)
The auto-mocking of those who are paid big bucks to offend the virtuous would be a long overdue civility-improving feedback loop.
I didn't write this. That's my story and I'm sticking to it. Someone else wrote it when I was away from my PC.
Am I offended? YES.
I saw a speech by Osama Bin Laden that may have been created using this software.
My points here is. Can anything you see someone say and hear someone say actually be believed?
How do we know they said or did what we saw them say or do?