Informatics researchers throttle notion of search engine dominance

August 7, 2006

Search engines are not biased toward popular Web sites, and may even be egalitarian in the way they direct traffic, say Indiana University School of Informatics researchers. Their study, "Topical interests and the mitigation of search engine bias," in the Aug. 7-11 issue of the Proceedings of the National Academy of Sciences, challenges the view of a Web-dominating "Googlearchy" in which search engines like Google push all Web traffic to established, mainstream Web sites.

"Empirical data do not support the idea of a vicious cycle amplifying the rich-get-richer dynamic of the Web," said Filippo Menczer, associate professor of informatics and computer science. "Our study demonstrates that popular sites receive on average far less traffic than predicted by the Googlearchy theory and that the playing field is more even."

Menczer was joined in the study by IU post-doctoral fellow Santo Fortunato; Alessandro Flammini, assistant professor of informatics; and Alessandro Vespignani, professor of informatics.

The IU team pooled their expertise in Web mining, networks and complex systems to collect empirical data from various search engines. In one scenario, users browsed the Web using only random links. In another, users visited only pages returned by the search engines. The researchers also studied the way in which search engines have influenced the Web's evolution.

"A simple ranking mechanism provides an elegant model to understand the genesis of a broad class of complex systems, including social and technological networks such as the Internet and the World Wide Web," Fortunato said. "These networks possess a peculiar 'long-tail'TM structure in which a few nodes attract a great majority of connections."

The long tail structure of the Web is commonly explained through rich-get-richer models that require knowledge of the prestige of each node in the network. However, those who create and link Web pages may not know the prestige values of target pages.

In another study, "Scale-Free Network Growth by Ranking," (May 27 Physical Review Letters), the Menczer, Fortunato, and Flammini showed that for a search engine to give rise to a long tail network, it must simply sort nodes according to any prestige measure, even if the exact values are unknown. If new nodes are linked to old ones according to their ranking order, a long tail emerges.

"By sorting results, search engines give us a simple mechanism to interpret how the Web grows and how traffic is distributed among Web sites," said Menczer.

The ranking model can help understand the dynamics of other complex networks besides the Web. For example, in a social system, one may be able to tell which of two people is richer without knowing their bank account balance. Such a criterion might explain the frequency and robustness of the complex structure observed in many real networks.

Source: Indiana University


   
Rate this story - 2.1 /5 (15 votes)


August 7, 2006 all stories

Comments: 0

2.1 /5 (15 votes)

  • hide
  • Related Stories

  • Helping Haitians find family
    created Feb 08, 2010 | popularity not rated yet | comments 0
  • FlashFind - Lightning-Fast Search on Mobile Devices
    created Feb 05, 2010 | popularity not rated yet | comments 0
  • Health stories by experts more credible than blogs
    created Feb 05, 2010 | popularity not rated yet | comments 0
  • Many wired Chinese unfazed at possible Google exit
    created Feb 01, 2010 | popularity not rated yet | comments 0
  • Dying news media looks to Apple tablet for hope
    created Jan 27, 2010 | popularity not rated yet | comments 0



  • hide
  • Relevant PhysicsForums posts

  • how to welding thin SS foil (0.002")?
    created Feb 08, 2010
  • Civil Engineering is hazardous to your career prospects
    created Feb 06, 2010
  • hot water circulator, kitchen faucet, ? mixing
    created Feb 06, 2010
  • Static or dynamic pressures in duct
    created Feb 06, 2010
  • More from Physics Forums - General Engineering

Other News

The power of 'random'

The power of 'random': 'Seemingly loopy' technique could dramatically improve communications networks

Technology / Computer Sciences

created 16 hours ago | popularity 4.8 / 5 (8) | comments 5 | with audio podcast

A radical new approach to the design of communications networks, called "network coding," promises to make Internet file sharing faster, streaming video more reliable, and cell-phone reception better -- among ...


'Revolutionary' water treatment units on their way to Afghanistan

Technology / Engineering

created 10 hours ago | popularity 4.2 / 5 (5) | comments 4 | with audio podcast

The United States Army has taken delivery of the first two units of a "revolutionary" waste-water treatment system that will clean putrid water within 24 hours and leave no toxic by-products, according to scientists at Sam ...


Imec and Holst Centre achieve breakthrough in battery-less radios

Imec achieves breakthrough in battery-less radios

Technology / Semiconductors

created 11 hours ago | popularity 4.9 / 5 (9) | comments 0 | with audio podcast

At today's International Solid State Circuit Conference, Imec and Holst Centre report a 2.4GHz/915MHz wake-up receiver which consumes only 51µW power. This record low power achievement opens the door to battery-less ...


Android

Google developing a translator for smartphones

Technology / Software

created 17 hours ago | popularity 4.7 / 5 (7) | comments 3 | with audio podcast report

(PhysOrg.com) -- Google is developing a translator for its Android smartphones that aims to almost instantly translate from one spoken language to another during phone calls.


Handling emergencies online

Technology / Internet

created 6 hours ago | popularity not rated yet | comments 0

Online social networking sites could solve many problems plaguing information dissemination and communications when disaster strikes, according to a report from US researchers in a recent issue of the International Journal of ...