Personal discrimination on the Web

May 21, 2009

How do you tell whether a website you are browsing is showing you a personal web page expressing the opinions of an individual or the marketing speak of a commercial site in disguise? Information engineers in India and Japan believe they have found an automatic way to discriminate between personal web pages and commercial pages designed to fool consumers.

Writing in a forthcoming issue of the International Journal of Business Intelligence and Data Mining, Takahiro Hayashi of Niigata University and colleagues explain that their approach extracts subjective expressions from the text of a web page. The system then scores them by degree of subjectivity and gives the reader an indication of whether the website content expresses personal opinions or marketing speak about a product or service.

The team evaluated the performance of their system using 1,200 web pages collected from four categories: products, tourist spots, restaurants, and movies. They found that their method is much more effective at finding personal pages than a general search engine in all four categories. Part of the reason is that general search engines tend not to rank personal pages highly.

Personal homepages, personal blogs, web forums and smaller customer opinion sites are regarded as personal pages and generally do not appear high in search engine results pages (SERPs). Finding genuine personal opinions is much harder than finding commercially biased sites, the researchers explain.

Their system relies on the fact that marketing copywriters and advertisers tend not to report negative comments about a product or service. In contrast, the personal opinions of users of the product or service will be littered with both positive and negative comments depending on their standpoint.

In Japanese, subjective expressions in written language include expressions with a negative meaning, sentence-final particles, interjections, and specific symbols such as face marks (kaomoji), which are equivalent to smileys in the West. There are, of course, equivalent expressions in other languages, say the researchers.

These various types of expressions can be extracted from a web page and fed into the researchers' algorithm, which determines a weighted and categorized ratio of negative to positive expressions. This ratio provides a basic, automatic indicator of whether a page is commercial or personal.
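To make the idea concrete, here is a minimal sketch of such a scoring step in Python. It is not the researchers' implementation: the article gives neither their lexicon nor their weights, and the paper targets Japanese. The English word lists, the category weights, the smoothing term and the threshold below are all assumptions chosen only for illustration.

```python
import re

# Hypothetical English patterns standing in for the categories the article
# lists. The paper's actual Japanese lexicon is not given in this article.
CATEGORY_PATTERNS = {
    "negative": re.compile(r"\b(?:bad|poor|terrible|disappointing|broken|worst)\b", re.I),
    "positive": re.compile(r"\b(?:good|great|excellent|love|best|perfect)\b", re.I),
    "interjection": re.compile(r"\b(?:wow|ugh|oops|hmm|yay)\b", re.I),
    "face_mark": re.compile(r"[:;]-?[()DP]|\(\^_\^\)"),
}

# Assumed weights for the categories treated as signs of a personal page.
CATEGORY_WEIGHTS = {"negative": 1.0, "interjection": 0.5, "face_mark": 0.5}


def count_expressions(text):
    """Count pattern matches in the text for each category."""
    return {name: len(pattern.findall(text))
            for name, pattern in CATEGORY_PATTERNS.items()}


def subjectivity_score(text):
    """Weighted count of personal-style expressions per positive expression."""
    counts = count_expressions(text)
    weighted = sum(weight * counts[name]
                   for name, weight in CATEGORY_WEIGHTS.items())
    # Adding 1 to the denominator avoids division by zero (an assumption).
    return weighted / (counts["positive"] + 1)


def looks_personal(text, threshold=0.5):
    """Label a page as personal when its score reaches the assumed threshold."""
    return subjectivity_score(text) >= threshold


ad = "The best phone ever. Great screen, excellent battery, perfect design."
review = "Great screen, but the battery is terrible and the case feels poor. Ugh :("

print(looks_personal(ad))      # all praise, no negative expressions
print(looks_personal(review))  # mixed praise and complaints
```

Under these assumed settings, the all-praise text scores zero because it contains no negative expressions, interjections or face marks, while the mixed review scores well above the threshold.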

The obvious extension of this approach is to apply such an algorithm to the results of a search for a product or service carried out by a general search engine, filtering out the commercial pages from the personal ones and allowing consumers to assess the wider opinions of the web community on that product.
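The filtering step could be sketched as follows in Python. This is an illustration under stated assumptions, not a description of any existing tool: `SearchResult`, `filter_personal` and `toy_score` are hypothetical names, the URLs are placeholders, and the toy scorer simply counts a few negative English words in place of the researchers' subjectivity measure.

```python
from typing import Callable, Iterable, List, NamedTuple


class SearchResult(NamedTuple):
    """A hypothetical search hit: the page URL and its extracted text."""
    url: str
    text: str


def filter_personal(results: Iterable[SearchResult],
                    score: Callable[[str], float],
                    threshold: float) -> List[SearchResult]:
    """Keep results scoring at or above the threshold, highest score first."""
    scored = [(score(result.text), result) for result in results]
    kept = [pair for pair in scored if pair[0] >= threshold]
    kept.sort(key=lambda pair: pair[0], reverse=True)
    return [result for _, result in kept]


def toy_score(text: str) -> float:
    """Stand-in scorer (an assumption): count a few negative English words."""
    negative_words = {"bad", "poor", "terrible"}
    words = (word.strip(".,!?").lower() for word in text.split())
    return float(sum(word in negative_words for word in words))


results = [
    SearchResult("https://shop.example/phone", "The best phone at a great price."),
    SearchResult("https://blog.example/review", "Nice screen but terrible battery and poor case."),
    SearchResult("https://forum.example/thread", "Bad update."),
]

personal = filter_personal(results, toy_score, threshold=1.0)
for result in personal:
    print(result.url)
```

Passing the scorer in as a parameter keeps the filter independent of how subjectivity is measured, so a more faithful scorer could be swapped in without changing the filtering logic.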

More information: "Discrimination of personal web pages by extracting subjective expressions" in Int. J. Business Intelligence and Data Mining, 2009, 4, 62-77

Source: Inderscience Publishers

