Cognitive scientists develop new take on old problem: why human language has so many words with multiple meanings

January 19, 2012 by Emily Finn
The advantage of ambiguity

Enlarge

Graphic: Christine Daniloff

Why did language evolve? While the answer might seem obvious -- as a way for individuals to exchange information -- linguists and other students of communication have debated this question for years. Many prominent linguists, including MIT’s Noam Chomsky, have argued that language is, in fact, poorly designed for communication. Such a use, they say, is merely a byproduct of a system that probably evolved for other reasons -- perhaps for structuring our own private thoughts.

As evidence, these linguists point to the existence of ambiguity: In a system optimized for conveying information between a speaker and a listener, they argue, each word would have just one meaning, eliminating any chance of confusion or misunderstanding. Now, a group of MIT cognitive scientists has turned this idea on its head. In a new theory, they claim that ambiguity actually makes language more efficient, by allowing for the reuse of short, efficient sounds that listeners can easily disambiguate with the help of context.

“Various people have said that ambiguity is a problem for communication,” says Ted Gibson, an MIT professor of cognitive science and senior author of a paper describing the research to appear in the journal Cognition. “But once we understand that context disambiguates, then ambiguity is not a problem — it’s something you can take advantage of, because you can reuse easy [words] in different contexts over and over again.”

Lead author of the paper is Steven Piantadosi PhD ’11; Harry Tily, a postdoc in the Department of Brain and Cognitive Sciences, is another co-author.

What do you ‘mean’?

For a somewhat ironic example of ambiguity, consider the word “mean.” It can mean, of course, to indicate or signify, but it can also refer to an intention or purpose (“I meant to go to the store”); something offensive or nasty; or the mathematical average of a set of numbers. Adding an ‘s’ introduces even more potential definitions: an instrument or method (“a means to an end”), or financial resources (“to live within one’s means”).

But virtually no speaker of English gets confused when he or she hears the word “mean.” That’s because the different senses of the word occur in such different contexts as to allow listeners to infer its meaning nearly automatically.

Given the disambiguating power of context, the researchers hypothesized that languages might harness ambiguity to reuse words — most likely, the easiest words for language processing systems. Building on observation and previous studies, they posited that words with fewer syllables, high frequency and the simplest pronunciations should have the most meanings.

To test this prediction, Piantadosi, Tily and Gibson carried out corpus studies of English, Dutch and German. (In linguistics, a corpus is a large body of samples of language as it is used naturally, which can be used to search for word frequencies or patterns.) By comparing certain properties of words to their numbers of meanings, the researchers confirmed their suspicion that shorter, more frequent words, as well as those that conform to the language’s typical sound patterns, are most likely to be ambiguous — trends that were statistically significant in all three languages.

To understand why ambiguity makes a language more efficient rather than less so, think about the competing desires of the speaker and the listener. The speaker is interested in conveying as much as possible with the fewest possible words, while the listener is aiming to get a complete and specific understanding of what the speaker is trying to say. But as the researchers write, it is “cognitively cheaper” to have the listener infer certain things from the context than to have the speaker spend time on longer and more complicated utterances. The result is a system that skews toward ambiguity, reusing the “easiest” words. Once context is considered, it’s clear that “ambiguity is actually something you would want in the communication system,” Piantadosi says.

Tom Wasow, a professor of linguistics and philosophy at Stanford University, calls the paper “important and insightful.”

“You would expect that since languages are constantly changing, they would evolve to get rid of ambiguity,” Wasow says. “But if you look at natural languages, they are massively ambiguous: Words have multiple meanings, there are multiple ways to parse strings of . … This paper presents a really rigorous argument as to why that kind of ambiguity is actually functional for communicative purposes, rather than dysfunctional.”

Implications for computer science

The researchers say the statistical nature of their paper reflects a trend in the field of linguistics, which is coming to rely more heavily on information theory and quantitative methods.

“The influence of computer science in linguistics right now is very high,” Gibson says, adding that natural language processing (NLP) is a major goal of those operating at the intersection of the two fields.

Piantadosi points out that ambiguity in natural language poses immense challenges for NLP developers. “Ambiguity is only good for us [as humans] because we have these really sophisticated cognitive mechanisms for disambiguating,” he says. “It’s really difficult to work out the details of what those are, or even some sort of approximation that you could get a computer to use.”

But, as Gibson says, computer scientists have long been aware of this problem. The new study provides a better theoretical and evolutionary explanation of why ambiguity exists, but the same message holds: “Basically, if you have any human in your input or output, you are stuck with needing context to disambiguate,” he says.

Provided by Massachusetts Institute of Technology (news : web)

This story is republished courtesy of MIT News (http://web.mit.edu/newsoffice/), a popular site that covers news about MIT research, innovation and teaching.

4.2 /5 (18 votes)  

Filter


Move the slider to adjust rank threshold, so that you can hide some of the comments.


Display comments: newest first

Squirrel
Jan 19, 2012

Rank: 4 / 5 (2)
The bug with this theory is that they looked at only languages which low homophony (English, Dutch and German) and not languages such as Chinese in which homophony is very dense (and word pronunciation has six different meanings!) It is not clear their findings apply to such languages. Second, if efficient use of pronunciations so important why is it that so many phonetically acceptable "nonword" pronunciations are not used?

The paper can be found in the list of Steven Piantadosi's publications
http://web.mit.ed...ado/www/
hb_
Jan 19, 2012

Rank: 5 / 5 (1)
@Squirrel

I do not agree. In Chinese, the word order is much more important than in English or German. You can claim that the context - or word order - has an even larger effect on changing the meaning of a word.

So, even if there is a smaller set of syllables in Chinese than in English, the importance of context to differentiate meanings from words having the same sounds is even greater. I.e. the argument that sound ambiguity is allowed to re-use efficient sound combinations is even more valid.

Also, if you would repeat the study with Chinese, and look not at single words, but at groups of two or three words, I am sure you would find that there is more ambiguity in pairs than in triplets. Shorter combinations rely more on context to get the meaning accross.
hb_
Jan 19, 2012

Rank: 5 / 5 (1)
@Squirrel

Another argument. It would surprise me greatly if inefficient tone combinations are used as frequently as efficient ones. A tone number 3 (falling-and-then-rising) followed by another number 3, is - to my knowledge - not present at all.

I think the second tone number 3 modifies the first tone number 3 - I have forgotten exactly how - since the pronunciation would otherwise be very slow. You would have to perform an additional rising/falling tone compared to simplifying the first number 3 to a single rising tone (correct me if I remember incorrectly)
Tachyon8491
Jan 19, 2012

Rank: not rated yet
Ambiguity in syntagmatic constructions vitally enriches intended meaning and, paradoxically, contextualises in very relevant ways. Words are illuminated "peaks of meaning" in a linguistic landscape where they stretch down into a shadowed substrate that interconnects them all... The encoding of quantized intentionality in acoustic patterns is as old as human consciousness - it contains primary dynamics which have much in common with music, the latter also a language which accurately reflects an underlying reality.
Tausch
Jan 20, 2012

Rank: not rated yet
Gentlemen:
Simplify.
Sound.
In acoustics ALL fundamentals has infinite multiples.
It's that simple.
Any multiple of a fundamental is NOT a fundamental.

Those are properties of physics, specifically, mechanic waves of acoustics.

That is where the 'properties' of linguistics and all human languages originate.

Now ask critical questions:

Is there anything 'ambiguous' about a fundamental?

Is there anything 'ambiguous' about multiples of fundamentals?

Are the properties of sound the underlying origins for what
we now label linguistically ambiguity?

What other life forms utilize sound?

....character limit to this thread...unwise all numberless questions lead you to one inescapable conclusion - the origin of all human languages is sound.

If you dispute the physics of sound you must replace the physics of sound with physics that are as unambiguous for sound as the origin for all human languages. Good luck.

Tausch
Jan 20, 2012

Rank: not rated yet
unwise = otherwise - see above:
Typo.
sashakr
Jan 20, 2012

Rank: not rated yet
How long are we going to hear the old funny story that the function of language is communication, and communication is exchange of information? Can one call himself a cognitive scientist if he seriously believes that this is the case? Or should we refer to such people as "voodoo scientists"? The function of language is co-ordination of consensual co-ordinations of behavior, it has nothing to do with so-called information transfer (after all, information is not a thing). I respect Chomsky as a very influential thinker, but errare humanum est, and Chomsky is only a human being.
Tausch
Jan 20, 2012

Rank: not rated yet
The origin of all (human) behaviors are physical in nature.
Naïve realism finest moment.

Why stop there?

As soon as naïve realism falls short, scientific realism is there to pick up the slack.

The function of language is co-ordination of consensual co-ordinations of behavior, it has nothing to do with so-called information transfer (after all, information is not a thing). - sas


Huh?
You have to have a physical origin for(human)behavior.
Called a planet. And the probability that the planet has a potential to harbor, at any time, what we label life.

It was a mistake to seek a innate, human, biological mechanism for language. Chomsky searches still for this.
Rank 4.2 /5 (18 votes)
Related Stories
Relevant PhysicsForums posts
  • Uniform Price Auction question. Why price is #Seller 1st highest of #Buyers
    createdFeb 16, 2012
  • Can I forget a language?
    createdFeb 10, 2012
  • The Biggest Lie Ever
    createdFeb 09, 2012
  • What are the limits of learning?
    createdFeb 06, 2012
  • Isn't that grammatically wrong?
    createdFeb 06, 2012
  • What does it mean when traders are indifferent?
    createdFeb 04, 2012
  • More from Physics Forums - Social Sciences

More news stories

Global influence of U.S. Constitution on the decline, study reveals

The U.S. Constitution's global influence is on the decline, finds a new study by David S. Law, JD, PhD, professor of law at Washington University in St. Louis.

Other Sciences / Economics & Business

created 17 hours ago | popularity 4.5 / 5 (2) | comments 8

Immigration chief seeks to reassure Silicon Valley

(AP) -- The Obama administration's top immigration official said Wednesday he wants to keep more foreign-born high-tech entrepreneurs in the U.S. But to make that happen, he said he needs those entrepreneurs to turn their ...

Other Sciences / Other

created 9 hours ago | popularity 5 / 5 (2) | comments 0

What is the value of a green card? Researcher calculates increase in income

Just what does it mean to get a green card? To some applicants, about $1,000 each month.

Other Sciences / Economics & Business

created 16 hours ago | popularity not rated yet | comments 2

Increasingly, children's books are where the wild things aren't: study

Was your favorite childhood book crawling with wild animals and set in places like jungles or deep forests? Or did it take place inside a house or in a city, with few if any untamed creatures in sight?

Other Sciences / Social Sciences

created 11 hours ago | popularity 4.8 / 5 (4) | comments 0

Ancient rock art found in Brazil

Researchers have discovered an extremely old anthropomorphic figure engraved in rock in Brazil, according to a report published Feb. 22 in the open access journal PLoS ONE.

Other Sciences / Archaeology & Fossils

created 9 hours ago | popularity 4.5 / 5 (4) | comments 0


Researchers build first physical 'metatronic' circuit

(PhysOrg.com) -- The technological world of the 21st century owes a tremendous amount to advances in electrical engineering, specifically, the ability to finely control the flow of electrical charges using ...

Spitzer finds solid buckyballs in space

(PhysOrg.com) -- Astronomers using data from NASA's Spitzer Space Telescope have, for the first time, discovered buckyballs in a solid form in space. Prior to this discovery, the microscopic carbon spheres ...

Faster than light neutrinos? More like faulty wiring

You can shelf your designs for a warp drive engine (for now) and put the DeLorean back in the garage; it turns out neutrinos may not have broken any cosmic speed limits after all.

Physicists surprised by disappearing and reappearing superconductivity in iron selenium chalcogenides

Superconductivity is a rare physical state in which matter is able to conduct electricity -- maintain a flow of electrons -- without any resistance. This phenomenon can only be found in certain materials at low temperatures, ...

CT colonography shown to be comparable to standard colonoscopy

Computerized tomographic (CT) colonography (CTC), also known as virtual colonoscopy, is comparable to standard colonoscopy in its ability to accurately detect cancer and precancerous polyps in people ages 65 and older, according ...

Stanford research team cracks animated NuCaptcha

(PhysOrg.com) -- The research team from Stanford University, led by Elie Bursztein, that previously had cracked regular CAPTCHAs and then audio CAPTCHAs, now has also successfully cracked the animated version called NuCapt ...