MySpace looks to USC to help servers keep up with users

June 27, 2008

How do you keep a potential half-billion social networkers current? A Viterbi School of Engineering expert tackles the growth problems of one of the world's largest computing systems.

Imagine a moment - a thousandth of a second - in the life of a MySpace computer. In that brief fraction of an eye blink, thousands of fingers on mice, spread across thousands of square miles, have clicked-in urgent requests for as many chunks of data: a photo needed in Minneapolis, video needed in Des Moines, a forum comment wanted by an observer in Detroit.

As the vast social networking system grows, each millisecond becomes more and more crowded with requests like these. Now, a University of Southern California specialist is working to make sure that the answers keep coming back quickly, even with tens of millions of new users.

"If MySpace were less successful," notes Shahram Ghandeharizadeh, a professor in the USC Viterbi School of Engineering's department of computer science, "there would be no problem. But at the current volume of transactions, getting to the data quickly becomes an issue. "

The key to speed and capacity is what is called DRAM. "Ideally," says Ghandeharizadeh, who is director of the USC Database Laboratory, "you want all the data requested to be in the quick-access cache memory of the servers, the DRAM, rather than having to retrieve it from the servers' disc memory, which is much slower."

But the total volume of data created by users is far more than the DRAM cache will hold. And as the user population grows at an accelerating pace, more and more requests arrive to query a larger and larger body of information. Even the innovative Berkeley DataBase system (BDB) that MySpace uses to keep interactions quick is coming under increasing strain.

BDB keeps MySpace information flowing by quadruple redundancy: each section of users is served not by one but by four overlapping servers that share DRAM space, making the system faster, more reliable, and more scalable.

"Now it works," says Felipe Cariņo, who heads MySpace Research, the company's in-house R&D facility, "but if you double it, it may not." And he says the population may in fact double as people in China and other areas learn to meet and greet each other on their own sites.

Ghandeharizadeh is working with Cariņo, a 1995 USC Marshall School Executive M.B.A., to find a way around the impending squeeze. Cariņo dubs the effort the "Gemini Project," after the famous twins: "Two heads, Viterbi and MySpace, coming together."

The collaborators have been exploring a new method for maintaining and replacing the data kept in DRAM.

Up to now, the method used has been simple: Data that has remained in the DRAM longest without being accessed is overwritten by new data.

Another method is potentially more effective: "Heuristic" replacement in which data is given simple but useful characterizations, which a program then uses to guide replacement. The program isn't static, but learns from system behavior and adjusts its criteria to improve performance.

The heuristic algorithm that MySpace Intrapreneurial Research Group is adapting to the MySpace database comes out of a recent PhD research thesis done by USC graduate student Shahin Shayandeh, who is a member of the team, along with three MySpace computer scientists.

The key element is taking file size into account as well as date. Large objects, like video files, kick out many small objects from the memory when they're loaded. While the general solution is based on how frequently objects are accessed, another rule is to not let very large objects into DRAM.

"When you have a gigantic video file that takes up the same space as hundreds or even thousands of text files, uploading it is a shock to the system," says Ghandeharizadeh.

Based on their research, Cariņo and Ghandeharizadeh are hopeful that the new algorithm will adapt well to MySpace demands and deliver the desired improvements in performance.

"Simulation studies show the heuristic method is a marvel. But seeing whether it delivers in a real ultra-large system, such as the one at MySpace, remains to be seen," says Ghandeharizadeh.

Ghandeharizadeh cherishes the opportunity to take the software to limits not others have reached. He specializes in ultra-large systems, and only three or four exist in the entire world, of which MySpace's is one. It includes, says Cariņo "10 data centers, thousands of servers, in 30 plus countries supporting local cultures and languages".

Cariņo notes that MySpace Research mission is clear and tightly focused "so that the final result must be a system or prototype that creates a new product or technology". MySpace Research will file patents and publish papers on the pioneering innovations, but the final deliverable is a new working system. Which Ghandeharizadeh hopes to deliver.

Source: University of Southern California


print this article email this article download pdf blog this article bookmark this article     Stumble it Digg this share on Facebook retweet share on Reddit add to delicious
Rate this story - not rated yet


June 27, 2008 all stories

Comments: 0

not rated yet
  • Stumble this up

  • Digg this

  • share this

  • hide
  • Related Stories

  • MySpace in talks to buy imeem
    created Nov 17, 2009 | popularity not rated yet | comments 0
  • All eyes on Murdoch as newspapers ponder digital future
    created Nov 15, 2009 | popularity not rated yet | comments 0
  • EA posts 2Q loss, cutting 17 pct. of work force
    created Nov 09, 2009 | popularity not rated yet | comments 0
  • Electronic Arts acquires Playfish for $275 million
    created Nov 09, 2009 | popularity not rated yet | comments 0
  • Social networking meets ambient intelligence (w/ Video)
    created Nov 04, 2009 | popularity not rated yet | comments 0



  • hide
  • Relevant PhysicsForums posts

  • Laser plasma emission
    created Nov 26, 2009
  • Achromat lens - magnifying LCD
    created Nov 25, 2009
  • Control System
    created Nov 24, 2009
  • Base Isolation Systems in Skyscrapers?
    created Nov 23, 2009
  • Need to interview a Computer Hardware Engineer for school project
    created Nov 23, 2009
  • transient heat transfer
    created Nov 23, 2009
  • More from Physics Forums - General Engineering

Other News

Government delays new ban on Internet gambling

Technology / Internet

created 9 hours ago | popularity 3 / 5 (2) | comments 0

(AP) -- The Treasury Department and the Federal Reserve are giving U.S. financial institutions an additional six months to comply with regulations designed to ban Internet gambling.


Fujitsu Develops Technology for Low-Temperature Full-Service Direct Formation of Graphene Transistors on Large-Scale Substrates

Fujitsu Develops Technology for Low-Temperature Full-Service Direct Formation of Graphene Transistors on Large-Scale Sub

Technology / Semiconductors

created 9 hours ago | popularity 5 / 5 (5) | comments 1

Fujitsu Laboratories today announced, as a world first, the development of a novel technology for forming graphene transistors directly on the entire surface of large-scale insulating substrates at low temperatures ...


Teachers begin using cell phones for class lessons

Technology / Hi Tech

created 5 hours ago | popularity 3 / 5 (1) | comments 0

(AP) -- Ariana Leonard's high school students shuffled in their seats, eagerly awaiting a cue from their Spanish teacher that the assignment would begin. "Take out your cell phones," she said in Spanish.


Signal fading on radio traffic reports

Technology / Other

created 7 hours ago | popularity not rated yet | comments 1

(AP) -- For more than 20 years, Mike Nolan was known to radio listeners as the "eye in the sky." He flew over Southern California freeways in his single-engine plane, reporting on the nation's worst traffic.


Semantic research sets world standards

Semantic research sets world standards

Technology / Computer Sciences

created 15 hours ago | popularity 3.3 / 5 (3) | comments 1

(PhysOrg.com) -- European researchers have created new tools for semantic technology development which are helping to set the next generation of official standards. The tools also unblock some key bottlenecks ...