Scientists Construct Model of the World Wide Web

April 8th, 2008 By Lisa Zyga


Traffic statistics for the Web page http://reverent.org/true_art_or_fake_art.html, where the long periods of low activity and short bursts of high activity are similar to other sites that the researchers analyzed. Credit: Simkin and Roychowdhury.

Although the Internet contains well over 100 million Web sites, two electrical engineers think they know what the traffic patterns of the entire Web look like.

Mikhail Simkin and Vwani Roychowdhury, electrical engineers affiliated with both the University of California, Los Angeles and NetSeer.com, have constructed a model of the Web using the traffic statistics of just three Web pages: http://reverent.org/true_art_or_fake_art.html , http://reverent.org/sounds_like_faulkner.html , and http://ecclesiastes911.net/disumbrated_art.html . (Traffic patterns from a dozen other Web pages the researchers studied were very similar.) Using several years of data from these three pages, the researchers show how the Internet as a whole reaches a self-organized critical (SOC) state with long-lasting traffic.

“One of the main implications of our findings is that traffic and [the corresponding] fame is a prolonged phenomenon instead of a one-time fling, and recurs in a spasmodic fashion,” Roychowdhury told PhysOrg.com.

Most of the time, traffic to any single Web page is relatively low and steady, with visitors arriving from search engines, Web directories, online encyclopedias, and other constant sources. But these long periods of low traffic are interrupted by bursts of heavy traffic that follow a power law, usually the result of numerous blog entries linking to the page.

The researchers use a branching model to describe the probability and extent of these bursts. Basically, there’s a certain probability that a visitor will post a blog entry with a link to the Web page, and a certain number of visitors who will then reach the page via that blogger’s link. The product of these two variables determines whether a Web page reaches the critical value of 1, and hence whether a branch keeps growing or dead-ends.
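The branching picture described above can be sketched in a few lines of Python. This is only an illustrative toy, not the researchers' code: the function names, the cap on cascade size, and the assumption that every link brings exactly the same number of visitors are all choices made here for simplicity.

```python
import random


def branching_ratio(link_probability, visitors_per_link):
    """Expected number of new links spawned by one existing link."""
    return link_probability * visitors_per_link


def simulate_cascade(link_probability, visitors_per_link, rng, max_links=10_000):
    """Count the blog links in one cascade started by a single link.

    Assumption: each link sends exactly `visitors_per_link` visitors to the
    page, and each visitor independently posts a new link with probability
    `link_probability`. The run stops early once `max_links` is reached.
    """
    total_links = 1   # the seed link
    active_links = 1  # links whose visitors have not been processed yet
    while active_links > 0 and total_links < max_links:
        visitors = active_links * visitors_per_link
        new_links = sum(1 for _ in range(visitors) if rng.random() < link_probability)
        total_links += new_links
        active_links = new_links
    return min(total_links, max_links)


if __name__ == "__main__":
    rng = random.Random(2008)  # fixed seed so repeated runs give the same cascades
    print("branching ratio:", branching_ratio(0.01, 95))
    print("cascade sizes:", [simulate_cascade(0.01, 95, rng) for _ in range(10)])
```

Most simulated cascades in such a run stay small, while an occasional one grows much larger, which is the qualitative behavior the article attributes to bursts of traffic.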

“A system is in a critical state if a single movement in an individual constituent element leads, on the average, to the movement of precisely one other element in the system,” Roychowdhury explained.

If a system is in a super- or sub-critical state, movement of one element leads, on average, to the movement of either more or less than one other element, respectively. That means that a signal generated in a super-critical system should increase forever, while a signal in a sub-critical system eventually dies out.
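To make the three regimes concrete, here is a small sketch that tracks the expected size of each successive "generation" of a signal when every element triggers, on average, `ratio` others. The function names and the tolerance used to decide what counts as "exactly 1" are assumptions made for this illustration.

```python
def classify(ratio, tolerance=1e-9):
    """Label a branching ratio as sub-critical, critical, or super-critical."""
    if abs(ratio - 1.0) <= tolerance:
        return "critical"
    return "super-critical" if ratio > 1.0 else "sub-critical"


def expected_generation_sizes(ratio, generations):
    """Expected number of moving elements in generations 0..generations.

    Generation 0 is the single element that starts the signal; each later
    generation is, on average, `ratio` times the size of the previous one.
    """
    return [ratio ** k for k in range(generations + 1)]


if __name__ == "__main__":
    for ratio in (0.95, 1.0, 1.05):
        sizes = expected_generation_sizes(ratio, 100)
        print(f"{classify(ratio):>14}: generation 100 has {sizes[-1]:.3f} expected elements")
```

The expected size shrinks toward zero below 1, grows without bound above 1, and stays constant at exactly 1.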

“But in a critical system, something very interesting happens,” Roychowdhury said. “Almost all signal cascades will die out, but some of them can last for a long time and can cover a large area. Clearly, sub- and super-critical systems are not that interesting unless we want a system that is either not that responsive or a system that explodes at the slightest provocation. Critical systems, on the other hand, allow for a responsive system to exist without it being blown apart. Many physical systems naturally gravitate towards a critical state, and this phenomenon is termed SOC.”
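One way to see why critical cascades can occasionally become very large is to look at the distribution of total cascade sizes. The sketch below assumes that each element triggers a Poisson-distributed number of others, which is an assumption made here and not something stated in the article. Under that assumption the total size follows the Borel distribution, a standard result for branching processes; the function name is illustrative.

```python
import math


def cascade_size_probability(size, ratio):
    """Borel probability that a cascade contains exactly `size` elements.

    Assumes each element triggers a Poisson-distributed number of others
    with mean `ratio`, where 0 < ratio <= 1. Computed in log space so that
    large sizes do not overflow.
    """
    log_p = (
        -ratio * size
        + (size - 1) * math.log(ratio * size)
        - math.lgamma(size + 1)
    )
    return math.exp(log_p)


if __name__ == "__main__":
    for ratio in (0.5, 0.95, 1.0):
        row = ", ".join(
            f"{cascade_size_probability(s, ratio):.2e}" for s in (1, 10, 100, 1000)
        )
        print(f"ratio={ratio}: P(size = 1, 10, 100, 1000) = {row}")
```

Below the critical value the probabilities fall off exponentially with size, whereas at a ratio of exactly 1 they decay only as a power law (proportional to size to the power of -3/2), so rare but very large cascades remain possible.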

As the researchers explain, competition for viewers and links is a driving force of the Web, and this competition pushes the entire Web into an SOC state. Based on their data, the researchers determined the values of the two variables above that closely reproduce the traffic patterns of the “true art or fake art” page: a link probability of 0.01 and a referral number of 95 visitors per link, which together give a slightly sub-critical value of 0.95 for that particular Web page.
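The arithmetic behind that value can be written out explicitly. The symbols p (link probability), n (visitors per link), and m (their product) are labels introduced here for illustration. The second line is not a figure reported in the article; it is the standard geometric-series result for the expected total size of a sub-critical cascade, counting the link that starts it.

```latex
\begin{align*}
m &= p \, n = 0.01 \times 95 = 0.95 < 1, \\
\mathbb{E}[\text{links per cascade}] &= \sum_{k=0}^{\infty} m^{k}
  = \frac{1}{1 - m} = \frac{1}{0.05} = 20 .
\end{align*}
```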

But since some Web pages are more interesting than others, some pages will achieve the critical value of 1 or even surpass it.

“To explain how the Web evolves into the SOC state, we need to use the concept of Darwinian fitness, which is a scientific measure of digital fangs and claws that help the Web page to fight for links with its competitors,” Simkin said. “The success in this competition depends not only on the Web page's own fitness, but also on the average fitness of other pages currently discussed in the blogosphere, with which our Web page competes.”

If this average is low, Simkin explained, then the fittest pages are super-critical. This means that, with time, they increase their share of the blogosphere. But this in turn raises the average fitness. The process continues until the fittest pages become exactly critical.
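The feedback loop Simkin describes can be illustrated with a toy update rule. Everything in this sketch is an assumption made for illustration rather than the paper's actual equations: a page's branching ratio is taken to be its fitness divided by the share-weighted average fitness, and at each step a page's share of the blogosphere is multiplied by that ratio.

```python
def average_fitness(fitness, shares):
    """Share-weighted average fitness of the pages currently being discussed."""
    return sum(f * s for f, s in zip(fitness, shares))


def branching_ratios(fitness, shares):
    """Toy assumption: a page's branching ratio is its fitness over the average."""
    mean = average_fitness(fitness, shares)
    return [f / mean for f in fitness]


def evolve_shares(fitness, shares, steps):
    """Repeatedly rescale each page's share by its current branching ratio."""
    for _ in range(steps):
        ratios = branching_ratios(fitness, shares)
        shares = [s * r for s, r in zip(shares, ratios)]
    return shares


if __name__ == "__main__":
    fitness = [1.0, 2.0, 3.0]           # hypothetical fitness values
    shares = [1 / 3, 1 / 3, 1 / 3]      # equal starting shares
    print("initial top ratio:", max(branching_ratios(fitness, shares)))
    final = evolve_shares(fitness, shares, 200)
    print("final top ratio:  ", max(branching_ratios(fitness, final)))
```

In this toy version the fittest page starts out super-critical, gains share, pulls the average fitness up, and ends with a branching ratio that approaches 1, while less fit pages end up sub-critical.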

“One finding that is important for Webmasters is that our work disproves the so-called fifteen minutes of fame paradigm, according to which things can get popular soon after release and quickly become forgotten,” Simkin said. “One, of course, knows that this paradigm is manifestly wrong for immortal classics. However, our work shows it to be wrong not only for great creations, but for anything which is of any intrinsic (not created by advertisement) merit.”

The researchers found that the traffic to a Web page with fixed content can persist for at least several years.

“So one should not hurry to delete old Web pages,” Simkin said. “When there is enough – say a year – of access statistics, our model can be used to infer a page's fitness and predict the average volume and fluctuations of future traffic.”

Roychowdhury is a cofounder and Simkin a consultant for a start-up company called NetSeer.com that focuses on next-generation Internet advertising. The company utilizes similar physics-based modeling of the Web, though not the direct results of the present study.

The researchers add that the Web is just one of many complex systems that exhibit self-organized criticality, with other examples including evolutionary patterns, earthquakes, and citations in research papers. They suggest that their model could also be used to explain the spreading of cultural elements, like movies, books, and fashion styles.

More information: Simkin, M. V. and Roychowdhury, V. P. “A theory of web traffic.” Europhysics Letters, 82 (2008) 28006.

Copyright 2008 PhysOrg.com.
All rights reserved. This material may not be published, broadcast, rewritten or redistributed in whole or part without the express written permission of PhysOrg.com.



  • neuraxon77 - Apr 13, 2008
    • Rank: 5 / 5 (1)
    @nilbud I think you're missing the point. Blogs are just one example where this power law could be applied. More broadly I see this applying to any form of linking, and that includes those novel concepts with no links in sight whatsoever. As Steve Gillmor states: "Links are dead". In other words: novel concepts embedded in language semantics that lead to people searching for more on a subject...
