Researchers classify Web searches

April 10, 2008

Although millions of people use Web search engines, researchers show that – by using relatively simple methods – most queries submitted can be classified into one of three categories.

Jim Jansen, assistant professor in Penn State's College of Information Sciences and Technology, worked with IST undergraduate Danielle Booth and Amanda Spink, Queensland University of Technology, to find that Web search engine users are doing primarily informational, navigational or transactional searching.

Informational searching involves looking for a specific fact or topic, navigational searching seeks to locate a specific Web site and transactional searching looks for information related to buying a particular product or service.

The research was the first published work of its kind done using actual searching data, with the aim of real-time classification. Researchers analyzed more than 1.5 million queries from hundreds of thousands of search engines users. Findings showed that about 80 percent of queries are informational and about 10 percent each are for navigational and transactional purposes.

Jansen and his colleagues arrived at those results by selecting random samples of records and analyzing query length, the order of the query in the session and the search results. These fields helped the team develop an algorithm that classified the searches with a 74-percent accuracy rate.

"Other results have classified comparatively much smaller sets of queries, usually manually," Jansen said. "This research aimed to classify queries automatically.

"Our findings have broad implications for search engines and e-commerce if they can classify the user intent of queries in real time. This is why we wanted a computational undemanding algorithm," Jansen continued. "It proves the 80/20 rule that 80 percent of the cases can be achieved with these clear-cut methods."

The paper "Determining the informational, navigational and transactional intent of Web queries" will appear in the May 2008 issue of Information Processing & Management. The article is currently available online.

The Penn State researcher said he plans to continue this research using a more complex algorithm that will hopefully yield a 90-percent accuracy rate using similar searching criteria.

Source: Penn State


print this article email this article download pdf blog this article bookmark this article     Stumble it Digg this share on Facebook retweet share on Reddit add to delicious
Rate this story - 3.2 /5 (9 votes)

Rank Filter

Move the slider to adjust rank threshold, so that you can hide some of the comments.


Display comments: newest first

  • Valentiinro - Apr 10, 2008
    • Rank: not rated yet
    I am wondering where porn fits into this scheme. There is quite a large amount of that on the internet. Informational I suppose?
  • gopher65 - Apr 11, 2008
    • Rank: 5 / 5 (1)
    Definitely Informational.

April 10, 2008 all stories

Comments: 2

3.2 /5 (9 votes)
  • Stumble this up

  • Digg this

  • share this

  • hide
  • Related Stories

  • P2P comes to the aid of audiovisual search (w/ Video)
    created Nov 18, 2009 | popularity not rated yet | comments 0
  • Yahoo! vows to fight Microsoft on new front
    created Aug 25, 2009 | popularity not rated yet | comments 0
  • Sex, videos, friends, games hot with kids online: Norton
    created Aug 12, 2009 | popularity not rated yet | comments 0
  • Microsoft sees size as search answer in Yahoo deal
    created Aug 03, 2009 | popularity not rated yet | comments 0
  • Google News mistakes Jackson queries for attack
    created Jun 27, 2009 | popularity not rated yet | comments 0



  • hide
  • Relevant PhysicsForums posts

  • How to convert and edit video/audio for iPhone, iPhone 3G, iPhone 3GS
    created 1hour ago
  • kindle e-reader and scientific papers
    created Nov 24, 2009
  • Help with a camera choice
    created Nov 18, 2009
  • casio calculator that's similar to TI-89
    created Nov 08, 2009
  • More from Physics Forums - Computing & Technology

Other News

Design chosen for British 1,000 mph car

Design chosen for British 1,000 mph car (w/ Video)

Technology / Engineering

created 14 hours ago | popularity 4.2 / 5 (5) | comments 5

(PhysOrg.com) -- A British team hoping to be the first to get a car to 1,000 mph (1,610 km/h) has made its final design selection. The six-tonne car, known as the Bloodhound, will be powered by a Eurofighter ...


ORNL 'deep retrofits' can cut home energy bills in half

ORNL 'deep retrofits' can cut home energy bills in half

Technology / Energy

created 5 hours ago | popularity 3 / 5 (2) | comments 0

(PhysOrg.com) -- Oak Ridge National Laboratory has announced plans to conduct a series of deep energy retrofit research projects with the potential to improve the energy efficiency in selected homes by as ...


Web sites aim to survive with hyperlocal focus

Technology / Internet

created 1hour ago | popularity not rated yet | comments 0

Finding a financially viable way to provide local news is a challenge large metropolitan newspapers are confronting. But a Coral Gables, Fla., Web site is among a few locally with faith it can succeed.


Should I buy a PC or Mac?

Technology / Software

created 2 hours ago | popularity 5 / 5 (1) | comments 2

Q. Our 6-year-old PC computer is dying a slow death and we are considering moving to a new iMac but have a few concerns. First, of all, we have several Word documents on our disk drive now that we want to keep and add to ...


EU assembly adopts Internet, phone user rights

Technology / Telecom

created 10 hours ago | popularity 5 / 5 (1) | comments 1

(AP) -- The European Parliament has endorsed new telecom rules that would give phone and Internet users more rights and allow them to appeal to national courts if they are cut off for illegal file-sharing.