Researchers Find Way To Make Internet Video More Appealing

December 21st, 2004

Jay Leno’s comedy routines are helping to advance technologies for distance learning on the Internet.
Ohio State University engineers are using video recordings of Leno and other TV personalities to test software that transmits more information in an Internet video using less bandwidth.

One of the obstacles to distance learning on the Internet is the difficulty with viewing lectures, explained James Davis, professor of computer science and engineering at Ohio State. A high-resolution video of a speaker takes too long to download, but a low-resolution video makes fine details such as the speaker’s face and hands appear fuzzy.

“When we communicate, we say a lot with our face and hands,” Davis said. “Our voice, gestures, and facial expressions are all intertwined. If I’m watching a lecture and I’m trying to learn something, I need to be able to see the speaker’s face and hands.”

He and his students have created software that zeroes in on a speaker’s face and gesturing hands and sharpens the image in just those spots, while slightly lowering the resolution of the rest of the image. In that way, the final video communicates more information without increasing bandwidth.

In a recent issue of the journal Computer Vision and Image Understanding, Davis and former undergraduate student Robin Tan reported that the software worked successfully in initial tests. The engineers were able to enhance a speaker’s face and gesturing hands without increasing bandwidth, and -- not surprisingly -- users greatly preferred viewing the enhanced video stream.

Davis and Tan inserted their algorithms into a publicly available MPEG encoder -- the software that compresses video for efficient digital transmission on the Internet -- and an MPEG decoder that converts the digital signal back to video so a user can view it on their computer.

For the test, five volunteers watched three short video sequences: one taken from an educational lecture on public television; one from the opening monologue of “Saturday Night Live,” featuring actor Leslie Nielsen; and one taken from a Jay Leno monologue on “The Tonight Show.”

Using a typical computer monitor, they watched the enhanced video and the unenhanced video side-by-side.

The unenhanced version was typical of what most people see when they try to watch video over the Internet -- pixelated and blurry. The enhanced version featured clearer visuals of the speaker’s face and gesturing hands, with a slightly lower-quality background than in the unenhanced video. For both versions, the number of kilobytes of data per frame was kept constant, so they both required the same amount of bandwidth for transmission.

All five volunteers immediately picked the enhanced video as their favorite.

Davis was motivated to pursue this work several years ago, when he was a research assistant in the Media Laboratory at the Massachusetts Institute of Technology. There, he interacted with researchers that studied video of Leno and others in order to better understand how people use speech and gestures in communication.

He took note of patterns in gestures, such as Leno’s technique of opening his hands to the audience when he delivers a punch line. A “pounding” gesture is also common, when speakers are trying to make a point, he said.

The hard part was enabling the software to recognize these hand gestures. He and Tan wrote those algorithms at Ohio State, and combined them with other algorithms to track a speaker’s face and hands as they moved.

In the future, Davis plans to incorporate algorithms that can track the movements of multiple speakers at once. He also wants to selectively enhance background objects, such as whiteboards, when they are used in a lecture.

Source: Ohio State University


print this article email this article download pdf blog this article bookmark this article     Digg this Stumble it share on Facebook share on Reddit add to delicious save to Yahoo! bookmarks
not rated yet


December 21st, 2004 all stories
Other Sciences /

Comments: 0
Rank: not rated yet

  • Stumble this up

  • Digg this

  • Share it:
  • share on Facebook
  • share on MySpace
  • share on Slashdot
  • rss-newsfeed
  • share on Google
  • share on Reddit
  • add to delicious
  • save to Yahoo! bookmarks
  • share on Windows Live
  • Add to Mixx!
Rating: not rated yet

  • Related Stories

  • Gesture-driven computers
    created Feb 29, 2008 | popularity not rated yet | comments 0
  • RIM Introduces First CDMA-enabled BlackBerry Pearl
    created Oct 01, 2007 | popularity not rated yet | comments 0
  • RIM Introduced BlackBerry 8800 Smartphone
    created Feb 13, 2007 | popularity not rated yet | comments 0
  • Communication study: Actions used with words speak even louder
    created May 03, 2006 | popularity not rated yet | comments 0
  • Samsung Introduces 'Bluetooth Voice Recognition Phone'
    created Jun 16, 2005 | popularity not rated yet | comments 0

Tags


  • Physicists Demonstrate Quantum Memory with Matter Qubits
    Physicists Demonstrate Quantum Memory with Matter Qubits
    Physics / General Physics
    created Jul 03, 2009 | popularity 4.4 / 5 (17) | comments 1
  • 'Holey' Nanosheets for Wastewater Dye Removal
    Nanotechnology / Nanomaterials
    created Jul 01, 2009 | popularity 5 / 5 (5) | comments 1
  • Jellyfish Robot Swims Like its Biological Counterpart
    Jellyfish Robot Swims Like its Biological Counterpart
    Electronics / Robotics
    created Jun 26, 2009 | popularity 4.4 / 5 (8) | comments 1
  • Could Maxwell's Demon Exist in Nanoscale Systems?
    Could Maxwell's Demon Exist in Nanoscale Systems?
    Physics / General Physics
    created Jun 24, 2009 | popularity 4.4 / 5 (18) | comments 29
  • Living Safely with Robots, Beyond Asimov's Laws
    Living Safely with Robots, Beyond Asimov's Laws
    Electronics / Robotics
    created Jun 22, 2009 | popularity 4.6 / 5 (52) | comments 40
  • Other News

    Creation Museum president Ken A. Ham

    Paleontologists brought to tears, laughter by Creation Museum

    Other Sciences / Other

    created Jun 30, 2009 | popularity 4.3 / 5 (41) | comments 120

    For a group of paleontologists, a tour of the Creation Museum seemed like a great tongue-in-cheek way to cap off a serious conference.


    Mummified dinosaur skin yields up new secrets

    Mummified dinosaur skin yields up new secrets

    Other Sciences / Archaeology & Fossils

    created Jul 01, 2009 | popularity 4.7 / 5 (15) | comments 10

    (PhysOrg.com) -- Scientists from The University of Manchester have identified preserved organic molecules in the skin of a dinosaur that died around 66-million years ago.


    Liberal? Conservative? Stanford study says mental nudge can make voters flip-flop

    Liberal? Conservative? Stanford study says mental nudge can make voters flip-flop

    Other Sciences / Social Sciences

    created Jul 02, 2009 | popularity 3.8 / 5 (5) | comments 4

    (PhysOrg.com) -- No doubt you’ve worked hard for your success. But chances are you’ve also had some help and lucky breaks along the way.


    Probing Question: How do Ponzi Schemes work?

    Other Sciences / Economics

    created Jul 02, 2009 | popularity 5 / 5 (1) | comments 2

    Imagine the shock, the horror, and the sheer panic that would come with learning that the financial plan you’d sunk your life savings into was a sham, the financial experts you trusted were crooks, and all your money was ...


    Tourists enjoy a "Pineapple Tour" in Costa Rica

    Costa Rica tops happiness, 'green living' poll

    Other Sciences / Social Sciences

    created Jul 04, 2009 | popularity 2 / 5 (2) | comments 0

    Costa Rica is the happiest place on earth, and one of the most environmentally friendly, according to a new survey by a British non-governmental group.