Researchers Use Wikipedia To Make Computers Smarter
January 6, 2007Using Wikipedia, Technion researchers have developed a way to give computers knowledge of the world to help them “think smarter,” making common sense and broad-based connections between topics just as the human mind does. The new method will help computers filter e-mail spam, perform Web searches and even conduct intelligence gathering at more sophisticated levels than current programs.
Researchers at the Technion-Israel Institute of Technology have found a way to give computers encyclopedic knowledge of the world to help them “think smarter,” making common sense and broad-based connections between topics just as the human mind does.
The new method will help computers filter e-mail spam, perform Web searches and even conduct electronic intelligence gathering at a much more sophisticated level than current programs, according to researchers Evgeniy Gabrilovich and Shaul Markovitch of the Technion Faculty of Computer Science. The findings will be presented next week in Hyderabad, India during the Twentieth International Joint Conference for Artificial Intelligence.
The program devised by the Technion researchers helps computers map single words and larger fragments of text to a database of concepts built from the online encyclopedia Wikipedia, which has over one million articles in its English-language version. The Wikipedia-based concepts act as “background knowledge” to help computers figure out the meaning of the text entered into a Web search, for instance.
Giving computers this deeper knowledge has been a long-standing problem in artificial intelligence, according to Markovitch. “Humans use a significant amount of background knowledge” to understand text, “but we didn’t know how to have computers access such knowledge,” he said.
Most Web search and e-mail filter programs appear smart by calculating how often certain words appear in two texts, Markovitch explained. “But what is common to all these applications is that the programs that actually do this kind of thing don’t understand text. They treat text as a collection of words, but they don’t understand the meaning of words.”
This shallow understanding is what makes an e-mail spam filter block all messages containing the word “vitamin,” but fail to block messages containing the word “B12.” “If the program never saw “B12” before, it’s just a word without any meaning. But you would know it’s a vitamin,” Markovitch said.
“With our methodology, however, the computer will use its Wikipedia-based knowledge base to infer that "B12" is strongly associated with the concept of vitamins, and will correctly identify the message as spam," he added.
Or, computers could look at a chunk of text about Saddam Hussein and weapons of mass destruction and know that it is conceptually related to topics such as the Iraq war and U.S. Senate debates on intelligence—even if those terms do not appear anywhere in the original text.
The method also helps computers figure out ambiguous terms—deciding, for instance, whether the word “mouse” refers to the computer device or the fuzzy animal. This can be especially important in translated documents, Markovitch said.
In the near future, the Technion researchers hope to improve their method by adding information from the Web page links inside Wikipedia articles. They are already pursuing a patent on their work, which they say will be of interest to the intelligence community and Web search engine companies, among others.
Source: American Technion Society
-
High-tech devices leave users vulnerable to spies
Jan 05, 2012 |
5 / 5 (1) |
0
-
Spammers propel India to junk-mail top spot
Jan 01, 2012 |
5 / 5 (2) |
0
-
'Anonymous' hackers target US security think tank
Dec 25, 2011 |
5 / 5 (11) |
96
-
Multimodal interaction: Humanizing the human-computer interface
Dec 14, 2011 |
not rated yet |
0
-
Mining the language of science
Nov 18, 2011 |
5 / 5 (9) |
10
-
Engineers build first sub-10-nm carbon nanotube transistor
Feb 01, 2012 |
4.9 / 5 (29) |
30
-
Something old, something new: Evolution and the structural divergence of duplicate genes
Jan 31, 2012 |
4.6 / 5 (7) |
1
-
The hidden nanoworld of ice crystals: Revealing the dynamic behavior of quasi-liquid layers
Jan 30, 2012 |
5 / 5 (3) |
1
-
Stock market network reveals investor clustering
Jan 27, 2012 |
3.9 / 5 (23) |
8
-
Of microchemistry and molecules: Electronic microfluidic device synthesizes biocompatible probes
Jan 26, 2012 |
5 / 5 (1) |
0
-
Synergistic relations between computer science and technology.
Feb 06, 2012
-
how do iphone gloves work?
Feb 05, 2012
-
iPhone battery over time
Jan 30, 2012
-
Best alternate Tablet to an iPad for writing math or physics equations?
Jan 26, 2012
-
Sending SMS to a website
Jan 20, 2012
-
Need help with my technical fest!
Jan 19, 2012
- More from Physics Forums - Computing & Technology
More news stories
Windows 8 preview set for February 29
Microsoft on Wednesday revealed plans to unveil a test version of its latest Windows computer operating software later this month.
7 hours ago |
3.8 / 5 (4) |
5
Solar start-ups set new efficiency records
(PhysOrg.com) -- Although Alta Devices and Semprius make different types of solar panels, both start-ups have been breaking records in the past few days. Santa Clara, Calif.-based Alta Devices announced that ...
Groupon fails to turn profit as revenue grows
Daily deals site Groupon on Wednesday issued its first earnings report as a publicly traded company, saying it failed to turn a profit despite revenue nearly tripling from a year earlier.
5 hours ago |
not rated yet |
0
Lawsuit seeks to block Google's privacy changes
(AP) -- A consumer watchdog group is suing the Federal Trade Commission in an attempt to prevent Google from making sweeping changes to its privacy policies next month.
5 hours ago |
not rated yet |
0
Romanian accused of hacking NASA-JPL computers
(AP) -- The Los Angeles U.S. attorney's office says a federal grand jury has indicted a Romanian citizen on charges he hacked into 25 climate-research computers at NASA's Jet Propulsion Laboratory in Pasadena.
5 hours ago |
not rated yet |
0
Astronomy team discovers nearby dwarf galaxy
(PhysOrg.com) -- A team led by UCLA research astronomer Michael Rich has used a unique telescope to discover a previously unknown companion to the nearby galaxy NGC 4449, which is some 12.5 million light years ...
Amasia: As next supercontinent forms, Arctic Ocean, Caribbean will vanish first
(PhysOrg.com) -- Geologists at Yale University have proposed a new theory to describe the formation of supercontinents, the epic process by which Earths major continental blocks combine into a single ...
Why are there so few fish in the Earth's oceans?
(PhysOrg.com) -- A Stony Brook University researcher has found that, contrary to popular belief, there are not plenty of fish in the sea.
Transparent iron? For the first time, an experiment shows that atomic nuclei can become transparent
At the high-brilliance synchrotron light source PETRA III, a team of DESY scientists headed by Dr. Ralf Röhlsberger has succeeded in making atomic nuclei transparent with the help of X-ray light. At the ...
Physicists build highly efficient 'no-waste' laser
A team of University of California, San Diego researchers has built the smallest room-temperature nanolaser to date, as well as an even more startling device: a highly efficient, "thresholdless" laser that ...
Scientists strengthen memory by stimulating key site in brain
Ever gone to the movies and forgotten where you parked the car? New UCLA research may one day help you improve your memory.