MIT develops lecture search engine to aid students

November 8, 2007
MIT develops lecture search engine to aid students

The Lecture Server, as shown in this screenshot of MIT physics professor Walter Lewin, displays video and highlighted search terms.

Imagine you are taking an introductory biology course. You're studying for an exam and realize it would be helpful to revisit the professor's explanation of RNA interference. Fortunately for you, a digital recording of the lecture is online, but the 10-minute explanation you want is buried in a 90-minute lecture you don't have time to watch.

A new lecture search engine developed at MIT's Computer Science and Artificial Intelligence Laboratory (CSAIL) could help with this dilemma. Created by a team of researchers and students led by MIT associate professor Regina Barzilay and principal research scientist James Glass, the web-based technology allows users to search hundreds of MIT lectures for key topics.

"Our goal is to develop a speech and language technology that will help educators provide structure to these video recordings, so it's easier for students to access the material," said Glass, who is head of CSAIL's Spoken Language Systems Group.

More than 200 MIT lectures are currently available on the site (web.sls.csail.mit.edu/lectures/). So far, most of the users are international students who access the lectures through MIT's OpenCourseWare (OCW) initiative, which makes curriculum materials for most MIT courses available to anyone with Internet access. Although the lecture-browsing system is still in the early development stages, a recent announcement in OCW's newsletter has drawn increased traffic to the site.

Barzilay and Glass expect the system will be most useful for OCW users and for MIT students who want to review lecture material. MIT World, a web site that provides video of significant MIT events such as lectures by speakers from MIT and around the world, is also participating in the project.

Many MIT professors record their lectures and post them online, but it's difficult to search them for specific topics. Because there is no way to easily scan audio, as you can with printed text, "you end up watching the whole thing, and it's hard to keep focused," said Barzilay, the Douglas T. Ross Career Development Associate Professor of Software Development in the Department of Electrical Engineering and Computer Science.

On the prototype web site, users can search lectures for any term they want and then play the relevant sections.

The lecture transcripts are created by speech recognition software. One major challenge is that the lectures usually contain many technical terms that might not be in the computer program's vocabulary, so the researchers use textbooks, lecture notes and abstracts to identify key terms and feed them into the computer.

"These lectures can have a very specialized vocabulary," said Glass. "For example, in an algebra class, the professor might talk about Eigenvalues."

When properly adapted to a speaker and topic, the lecture-based speech recognizer gets about four out of five words correct, however most of the errors occur in words that are not critical to the lecture topic, i.e., not the key vocabulary terms that people would use to search.

Once the transcript is complete, a language processing program divides the text into sections by topic. Chunks of text, about 100 words each, are compared with each other using a mathematical formula that calculates the number of overlapping words between the text blocks. Each word is weighted so that repetition of key terms has more weight than less important words, and chunks with the most similar words are grouped into sections.

In the future, Barzilay and Glass hope to add a lecture summarization feature to the language processing system. They also want to get users more involved in the project, by incorporating a Wikipedia-like function that would let users correct errors in lecture transcripts and allow them to add lecture notes.

The researchers presented their project at the Interspeech 2007 conference in Antwerp, Belgium, in August. The project was originally funded by Microsoft through the iCampus program and is now funded by the National Science Foundation.

Source: MIT

4.7 /5 (3 votes)  

Rank 4.7 /5 (3 votes)
Tags

Relevant PhysicsForums posts
  • Flow From a Tank through a Pipe
    created6 hours ago
  • How to tilt a object
    created22 hours ago
  • How to calculate total compressibility in liquid porous solid system
    createdFeb 12, 2012
  • Need help reading 3-D
    createdFeb 11, 2012
  • A way to send and receive wireless data
    createdFeb 11, 2012
  • Calling function with no input argument
    createdFeb 10, 2012
  • More from Physics Forums - General Engineering

More news stories

Music service gives Myspace second wind

Faded online social network Myspace said Monday it was getting a second wind due to the popularity of a freshly launched online music player.

Technology / Internet

created 2 hours ago | popularity 5 / 5 (1) | comments 0

US, EU clear Google's $12.5B Motorola Mobility bid (Update)

Google's $12.5 billion acquisition of cellphone maker Motorola Mobility have won approvals from U.S. and European antitrust regulators, moving Google a major step closer to completing the biggest deal in its ...

Technology / Business

created 3 hours ago | popularity 5 / 5 (1) | comments 0

Apple shares close over $500

Apple shares surged past $500 for the first time on Wall Street on Monday, powered by reports a new iPad may be unveiled next month.

Technology / Business

created 1 hour ago | popularity not rated yet | comments 0

Computer programs that think like humans

Intelligence – what does it really mean? In the 1800s, it meant that you were good at memorising things, and today intelligence is measured through IQ tests where the average score for humans is 100. ...

Technology / Computer Sciences

created 2 hours ago | popularity not rated yet | comments 1

EU executive defends contested online piracy pact

The European Commission on Monday defended a global online-piracy pact opposed by some EU states and still to be ratified by the European Parliament.

Technology / Internet

created 3 hours ago | popularity not rated yet | comments 0


First-of-its-kind stem cell study re-grows healthy heart muscle in heart attack patients

Results from a Cedars-Sinai Heart Institute clinical trial show that treating heart attack patients with an infusion of their own heart-derived cells helps damaged hearts re-grow healthy muscle.

Scientists discover reason for Mt. Hood's non-explosive nature

(PhysOrg.com) -- For a half-million years, Mount Hood has towered over the landscape, but unlike some of its cousins in Oregon’s Cascade Mountains and many other volcanoes around the Pacific “Rim ...

Discovery paves way for salmonella vaccine

(Medical Xpress) -- An international research team led by a University of California, Davis, immunologist has taken an important step toward an effective vaccine against salmonella, a group of increasingly antibiotic-resistant ...

Time of year important in projections of climate change effects on ecosystems

(PhysOrg.com) -- Does it matter whether long periods of hot weather, such as last year's heat wave that gripped the U.S. Midwest, happen in June or July, August or September?

Smoking bans lead to less, not more, smoking at home: study

Smoking bans in public/workplaces don't drive smokers to light up more at home, suggests a study of four European countries with smoke free legislation, published online in Tobacco Control.

Ovarian cancer arises in fallopian tube of knockout mice

(Medical Xpress) -- The most deadly form of "ovarian" cancer arises in the fallopian tubes – not the ovaries – of knockout mice that lack two genes associated with the disease, said researchers led by Baylor College ...