Researchers mine millions of metaphors through computer-based techniques
March 3, 2009 By Lisa M. Krieger
Metaphors cannot be taught, asserted the great philosopher Aristotle. "It is the one thing that cannot be learnt from others." But a computer scientist and literary historian say he's wrong.
In a project started at Stanford University, the researchers are teaching computers how to analyze texts from Plato to Pynchon, mining millions of these abstract phrases. (Metaphorically speaking.)
They're building a vast searchable database, making it possible to browse historic patterns of word usage - for instance, "rose" and "love" - from ancient Homeric epics to postmodern cyberpunk novels, and everything in between.
"As a tool, it provides a really powerful way of thinking about a lot of literature at once," said English literature professor Brad Pasanek, who collaborated on the project with longtime friend and computer scientist D. Sculley.
The work makes tangible what the German linguist Harald Weinrich called the "metaphoric field." "Pasanek's database is the first 'metaphoric field' that we can actually see and use," said Franco Moretti, a Stanford comparative literature professor. "It provides empirical proof for a daring, but never wholly solid concept."
This approach to studying literature was inconceivable back around 330 BCE, when Aristotle wrote that "the greatest thing by far is to be a master of the metaphor," language that compares seemingly unrelated subjects - a "winged thought," for instance.
But two new trends have created a field of computer-based literary analysis, part of the emerging discipline called "digital humanities," an intersection of computing and the study of languages, history, philosophy and religion.
Digitized libraries have put an ocean of books - including obscure ones - at readers' fingertips. Using new data mining techniques and "machine learning," researchers can search the millions of words contained in those books to study subtle shifts in how words were used. Analyzing such patterns offers insight into how language - and culture - evolved.
The idea was conceived when Pasanek was idly flipping through his worn copy of "Pride and Prejudice," its key phrases highlighted in bright colors.
Through the tangled tale of Elizabeth, Darcy and Wickham, "marking words that occurred again and again, I realized that you could flip through a novel and see these motifs appear in an explosion of color, then disappear."
The computer replaces the colored marker, he said. "It's possible to trace when and where something appears, what it means, and how it changes," he said.
Pasanek's near-obsessive collection of interesting metaphors began while he was at Stanford working on his Ph.D. First he kept a list on the back pages of the works of Shakespeare, Milton and the King James Bible. As his list grew, he moved to index cards.
"Metaphors are a fundamental figure of speech," he said. "They show how we think, and how what we think changes over time."
Recognizing he needed help, Pasanek turned to Stanford computer scientist Matt Jockers, who helped him create a digital database, which was initially posted in 2005. The list quickly grew to 1,000, then 3,000 entries. But the list's expansion created a special search challenge.
"The nature of metaphor is such that it does not lend itself to easy detection by the usual sorts of pattern matching algorithms," Jockers said. Finding a simile is a fairly straightforward task: one writes a program that looks for text strings of the type "like" and "as."
"Structurally speaking, the phrase 'my love is a red rose' is very much the same as 'my dog is a blue heeler,'" Jockers said. "The former is metaphor, the latter is not."
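The contrast Jockers draws can be illustrated with a short Python sketch. It is not the project's code: the two patterns and the function names below are hypothetical, chosen only to show that a marker search catches similes while a purely structural pattern cannot tell a metaphor from a literal statement.

```python
import re

# Hypothetical patterns for illustration only; not the researchers' code.
# A simile announces itself with a marker word such as "like" or "as".
SIMILE_MARKER = re.compile(r"\b(?:like|as)\b", re.IGNORECASE)

# A bare "my X is a Y" pattern captures structure but not meaning.
COPULA = re.compile(r"\bmy (\w+) is an? ([\w ]+)", re.IGNORECASE)


def has_simile_marker(text):
    """Return True if the text contains 'like' or 'as' as a whole word."""
    return SIMILE_MARKER.search(text) is not None


def copula_match(text):
    """Return (subject, complement) for the first 'my X is a Y' phrase, or None."""
    m = COPULA.search(text)
    if m is None:
        return None
    return (m.group(1), m.group(2).strip())


if __name__ == "__main__":
    for phrase in ("my love is like a red rose",
                   "my love is a red rose",
                   "my dog is a blue heeler"):
        print(phrase, has_simile_marker(phrase), copula_match(phrase))
```

Both "my love is a red rose" and "my dog is a blue heeler" match the structural pattern equally well, which is why a pattern of this kind cannot separate the metaphor from the literal sentence.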
Pasanek provided the computer with examples of metaphors and "trained" the machine to recognize them. They programmed "proximity searches" between words likely to be metaphoric. For example, a search for "mind" within 100 characters of "mint" finds the following couplet in William Cowper's poetry: "The mind and conduct mutually imprint /And stamp their image in each other's mint."
A similar technique, said Sculley, is used in spam-recognition software.
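A proximity search of the kind described above might look like the following minimal Python sketch. The function name `proximity_hits` and the way distance is measured (between the starting positions of the two words) are assumptions made for illustration; the article does not say how the researchers implemented their searches.

```python
import re


def proximity_hits(text, word_a, word_b, window=100):
    """Return (pos_a, pos_b) pairs where the two whole words start
    within `window` characters of each other.

    Measuring from word start to word start is an assumption; the
    article only says "within 100 characters".
    """
    starts_a = [m.start() for m in
                re.finditer(rf"\b{re.escape(word_a)}\b", text, re.IGNORECASE)]
    starts_b = [m.start() for m in
                re.finditer(rf"\b{re.escape(word_b)}\b", text, re.IGNORECASE)]
    return [(i, j) for i in starts_a for j in starts_b
            if abs(i - j) <= window]


if __name__ == "__main__":
    couplet = ("The mind and conduct mutually imprint /"
               "And stamp their image in each other's mint.")
    for i, j in proximity_hits(couplet, "mind", "mint"):
        print(couplet[i:i + 4], "...", couplet[j:j + 4])
```

Matching on whole words keeps "imprint" from being counted as an occurrence of "mint," so the Cowper couplet yields a single pairing of the two search terms.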
In one project, they tracked the evolving references to the young mind. In the fourth century B.C., it was referred to as a "tabula rasa," Latin for "blank slate." By the 17th century, John Locke called it a "white Paper, void of all Characters." In 18th century texts, it was compared to a "roasting jack," conjuring up an image of meat spinning on a rotisserie, cooked by flames. As tools changed -- slates, paper, rotisseries -- so did the references.
There are other metaphor databases, though Pasanek says his is the largest and geared toward the history of thought. However, the database (http://mind.textdriven.com) is still in its beta version, said Pasanek, who now teaches literature at the University of Virginia in Charlottesville. Under renovation, it suffers from what he calls "bug plagues." But with time, it will improve, and broaden to vast horizons.
"A metaphor has a career, and it tells a complete story," he said, "about how we think about ourselves and the world."
___
(c) 2009, San Jose Mercury News (San Jose, Calif.).
Visit MercuryNews.com, the World Wide Web site of the Mercury News, at http://www.mercurynews.com
Distributed by McClatchy-Tribune Information Services.