New program color-codes text in Wikipedia entries to indicate trustworthiness

August 3, 2007

The online reference site Wikipedia enjoys immense popularity despite nagging doubts about the reliability of entries written by its all-volunteer team. A new program developed at the University of California, Santa Cruz, aims to help with the problem by color-coding an entry's individual phrases based on contributors' past performance.

The program analyzes Wikipedia's entire editing history--nearly two million pages and some 40 million edits for the English-language site alone--to estimate the trustworthiness of each page. It then shades the text in deepening hues of orange to signal dubious content. A 1,000-page demonstration version is already available on a web page operated by the program's creator, Luca de Alfaro, associate professor of computer engineering at UCSC.

Other sites already employ user ratings as a measure of reliability, but they typically depend on users' feedback about each other. This method makes the ratings vulnerable to grudges and subjectivity. The new program takes a radically different approach, using the longevity of the content itself to learn what information is useful and which contributors are the most reliable.

"The idea is very simple," de Alfaro said. "If your contribution lasts, you gain reputation. If your contribution is reverted [to the previous version], your reputation falls." De Alfaro will speak about his new program this Saturday, August 4, at the Wikimania conference in Taipei, Taiwan.

The program works from a user's history of edits to calculate his or her reputation score. The trustworthiness of newly inserted text is computed as a function of the reputation of its author. As subsequent contributors vet the text, their own reputations contribute to the text's trustworthiness score. So an entry created by an unknown author can quickly gain (or lose) trust after a few known users have reviewed the pages.

A benefit of calculating author reputation in this way is that de Alfaro can test how well his reliability scores work. He does so by comparing users' reliability scores with how long their subsequent edits last on the site. So far, the program flags as suspect more than 80 percent of edits that turn out to be poor. It's not overly accusatory, either: 60 to 70 percent of the edits it flags do end up being quickly corrected by the Wikipedia community.

The exhaustive analysis of Wikipedia's seven-year edit history takes de Alfaro's desktop PC about a week to complete. At present he is working from copies of the site that Wikipedia periodically distributes. Once the initial backlog of edits is calculated, however, de Alfaro said that updating reliability scores in real time should be fairly simple.

While the program prominently displays text trustworthiness, de Alfaro favors keeping hidden the reputation ratings of individual users. Displaying reputations could lead to competitiveness that would detract from Wikipedia's collaborative culture, he said, and could demoralize knowledgeable contributors whose scores remain low simply because they post infrequently and on few topics.

"We didn't want to modify the experience of a user going in to Wikipedia," de Alfaro said. "It is very relaxing right now and we didn't want to modify what has worked so well and is so welcoming to the new user."

Source: UC Santa Cruz


print this article email this article download pdf blog this article bookmark this article     Stumble it Digg this share on Facebook retweet share on Reddit add to delicious
Rate this story - 4.2 /5 (12 votes)


August 3, 2007 all stories

Comments: 0

4.2 /5 (12 votes)
  • Stumble this up

  • Digg this

  • share this

  • hide
  • Related Stories

  • New IBM Lotus Connections Software Brings Consumer Social Networking Features to the Office
    created Sep 22, 2009 | popularity not rated yet | comments 0
  • HumBio instructor gets props for YouTube raps
    created Mar 16, 2009 | popularity not rated yet | comments 0
  • Adobe Ships Photoshop Lightroom 1.0
    created Feb 20, 2007 | popularity not rated yet | comments 0
  • Vidcasting market set to grow
    created Oct 12, 2005 | popularity not rated yet | comments 0
  • Expanding drug treatment: Is US ready to step up?
    created 11 hours ago | popularity not rated yet | comments 0



  • hide
  • Relevant PhysicsForums posts

  • casio calculator that's similar to TI-89
    created 8 hours ago
  • Mathematica Question: Finding local maximums
    created 11 hours ago
  • Advice on what cell phone to get
    created 12 hours ago
  • Read multiple binary files to ascii
    created Nov 07, 2009
  • More from Physics Forums - Computing & Technology

Other News

A system of space solar power system (SSPS)

Japan eyes solar station in space as new energy source

Technology / Energy

created 23 hours ago | popularity 4.7 / 5 (14) | comments 20

It may sound like a sci-fi vision, but Japan's space agency is dead serious: by 2030 it wants to collect solar power in space and zap it down to Earth, using laser beams or microwaves.


Framed for child porn -- by a PC virus

Framed for child porn -- by a PC virus

Technology / Internet

created 16 hours ago | popularity 5 / 5 (5) | comments 2

(AP) -- Of all the sinister things that Internet viruses do, this might be the worst: They can make you an unsuspecting collector of child pornography.


Campaigners are stepping up efforts to curb online tracking

Advertisers face resistance to on-line tracking

Technology / Internet

created 23 hours ago | popularity 5 / 5 (4) | comments 0

Campaigners are stepping up efforts to curb online tracking of Internet use by firms that deliver adverts tailored to the specific interests of consumers, as polls reveal widespread unease with the practice.


Dartmouth professor finds that iconic Oswald photo was not faked

Professor finds that iconic Oswald photo was not faked (w/ Video)

Technology / Computer Sciences

created Nov 05, 2009 | popularity 3.8 / 5 (9) | comments 38

(PhysOrg.com) -- Dartmouth Computer Scientist Hany Farid has new evidence regarding a photograph of accused John F. Kennedy assassin Lee Harvey Oswald. Farid, a pioneer in the field of digital forensics, digitally ...


airpod

Car That Runs on Compressed Air Questioned by Critics (w/ Video)

Technology / Energy

created Nov 03, 2009 | popularity 3.8 / 5 (20) | comments 34

(PhysOrg.com) -- As electric cars begin breaking into the short-distance vehicle market, one French company thinks that it has an alternative to the electric vehicle: a car that runs on compressed air. Motor ...