January 6, 2007

Researchers Use Wikipedia To Make Computers Smarter

Researchers at the Technion-Israel Institute of Technology have found a way to give computers encyclopedic knowledge of the world to help them “think smarter,” making common sense and broad-based connections between topics just as the human mind does.

The new method will help computers filter e-mail spam, perform Web searches and even conduct electronic intelligence gathering at a much more sophisticated level than current programs, according to researchers Evgeniy Gabrilovich and Shaul Markovitch of the Technion Faculty of Computer Science. The findings will be presented next week in Hyderabad, India, during the Twentieth International Joint Conference on Artificial Intelligence.

The program devised by the Technion researchers helps computers map single words and larger fragments of text to a database of concepts built from the online encyclopedia Wikipedia, which has over one million articles in its English-language version. The Wikipedia-based concepts act as “background knowledge” to help computers figure out the meaning of the text entered into a Web search, for instance.
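The mapping from text to concepts can be pictured with a toy example. The sketch below is not the researchers' program: the three "articles" are invented stand-ins for encyclopedia entries, and the names `TOY_ARTICLES`, `build_index` and `concept_vector` are hypothetical. It assumes a simple weighting, in which a word's link to a concept is its share of the words in that concept's article.

```python
from collections import Counter, defaultdict

# Invented stand-ins for encyclopedia articles (illustration only).
TOY_ARTICLES = {
    "Vitamin": "vitamin b12 vitamin c nutrient supplement diet",
    "Computer mouse": "mouse pointing device computer cursor button",
    "Mouse (animal)": "mouse small rodent animal fur tail",
}


def build_index(articles):
    """Map each word to the concepts whose article mentions it.

    The weight is the word's relative frequency inside that article,
    a simplifying assumption made for this sketch.
    """
    index = defaultdict(dict)
    for concept, text in articles.items():
        counts = Counter(text.lower().split())
        total = sum(counts.values())
        for word, n in counts.items():
            index[word][concept] = n / total
    return index


def concept_vector(text, index):
    """Sum the concept weights of every known word in the text."""
    vector = Counter()
    for word in text.lower().split():
        for concept, weight in index.get(word, {}).items():
            vector[concept] += weight
    return dict(vector)


INDEX = build_index(TOY_ARTICLES)
print(concept_vector("b12", INDEX))
print(concept_vector("mouse", INDEX))
```

In this toy setup, "b12" lands on the single concept "Vitamin", while "mouse" is linked to two concepts at once, which is the kind of ambiguity that surrounding words would have to resolve.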

Giving computers this deeper knowledge has been a long-standing problem in artificial intelligence, according to Markovitch. “Humans use a significant amount of background knowledge” to understand text, “but we didn’t know how to have computers access such knowledge,” he said.

Most Web search and e-mail filter programs appear smart by calculating how often certain words appear in two texts, Markovitch explained. “But what is common to all these applications is that the programs that actually do this kind of thing don’t understand text. They treat text as a collection of words, but they don’t understand the meaning of words.”
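A minimal sketch of the word-counting approach Markovitch describes, assuming cosine similarity over raw word counts (one common choice, not necessarily what any particular product uses). The function names are hypothetical.

```python
import math
from collections import Counter


def bag_of_words(text):
    """Lowercase the text and count each whitespace-separated word."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def word_overlap_similarity(text1, text2):
    """Score two texts purely by the words they share."""
    return cosine(bag_of_words(text1), bag_of_words(text2))


print(word_overlap_similarity("vitamin supplements", "b12 tablets"))
```

The two phrases share no words, so the score is zero even though a human reader would see them as closely related.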

This shallow understanding is what makes an e-mail spam filter block all messages containing the word “vitamin,” but fail to block messages containing the word “B12.” “If the program never saw ‘B12’ before, it’s just a word without any meaning. But you would know it’s a vitamin,” Markovitch said.

“With our methodology, however, the computer will use its Wikipedia-based knowledge base to infer that ‘B12’ is strongly associated with the concept of vitamins, and will correctly identify the message as spam,” he added.
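The contrast between the two filters can be sketched in a few lines. Everything here is hypothetical: the association table, its strengths and the 0.5 threshold are invented for illustration, whereas the system described in the article would derive such associations from Wikipedia text.

```python
# Invented word-to-concept associations (illustration only).
ASSOCIATIONS = {
    "vitamin": {"Vitamin": 1.0},
    "b12": {"Vitamin": 0.8},
    "meeting": {"Business": 0.9},
}
BLOCKED_WORDS = {"vitamin"}
BLOCKED_CONCEPTS = {"Vitamin"}


def keyword_filter(message):
    """Flag a message only if it contains a blocked word verbatim."""
    return any(word in BLOCKED_WORDS for word in message.lower().split())


def concept_filter(message, threshold=0.5):
    """Flag a message if any word is strongly tied to a blocked concept."""
    for word in message.lower().split():
        for concept, strength in ASSOCIATIONS.get(word, {}).items():
            if concept in BLOCKED_CONCEPTS and strength >= threshold:
                return True
    return False


print(keyword_filter("buy b12 now"), concept_filter("buy b12 now"))
```

The keyword filter lets the "b12" message through, while the concept-aware filter flags it because the word is associated with a blocked concept.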

Or, computers could look at a chunk of text about Saddam Hussein and weapons of mass destruction and know that it is conceptually related to topics such as the Iraq war and U.S. Senate debates on intelligence—even if those terms do not appear anywhere in the original text.

The method also helps computers figure out ambiguous terms—deciding, for instance, whether the word “mouse” refers to the computer device or the fuzzy animal. This can be especially important in translated documents, Markovitch said.
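One simple way to picture this kind of disambiguation is to compare the words around "mouse" with a small vocabulary for each sense. This is a sketch under stated assumptions, not the researchers' method: the sense vocabularies and the `disambiguate` function are invented for illustration.

```python
# Invented context vocabularies for each sense (illustration only).
SENSES = {
    "Computer mouse": {"computer", "click", "cursor", "usb", "button", "device"},
    "Mouse (animal)": {"rodent", "animal", "fur", "tail", "cheese", "cat"},
}


def disambiguate(sentence, senses):
    """Pick the sense whose vocabulary overlaps most with the sentence.

    Returns None when no sense shares any word with the sentence.
    """
    words = set(sentence.lower().split())
    scores = {sense: len(words & vocab) for sense, vocab in senses.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


print(disambiguate("plug the usb mouse into the computer", SENSES))
print(disambiguate("the cat chased the mouse", SENSES))
```

When the sentence offers no helpful context, the function returns `None` rather than guessing.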

In the near future, the Technion researchers hope to improve their method by adding information from the Web page links inside Wikipedia articles. They are already pursuing a patent on their work, which they say will be of interest to the intelligence community and Web search engine companies, among others.

Source: American Technion Society

Citation: Researchers Use Wikipedia To Make Computers Smarter (2007, January 6) retrieved 25 April 2024 from https://phys.org/news/2007-01-wikipedia-smarter.html
