NIST tool helps Internet master top-level domains

May 16, 2008

At the request of a worldwide Internet organization, a computer scientist at the National Institute of Standards and Technology developed an algorithm that may guide applicants in proposing new “top-level domains”—the last part of an Internet address, such as .com, that people type in navigating the Web.

As new top-level domains are added to the familiar .com, .info and .net, the algorithm checks whether a newly proposed name is confusingly similar to existing ones by comparing their visual appearance. Visually distinct top-level domain names may help avoid confusion in navigating the ever-expanding Internet and combat fraud by reducing the potential for malicious look-alikes, such as .C0M, with a zero, in place of .COM.

Later this year, the Internet Corporation for Assigned Names and Numbers (ICANN) plans to launch the process for proposing a new round of “generic” top-level domains (gTLDs), strings such as .net, .gov and .org meant to indicate organizations or interests. In preparing for newly proposed gTLDs, ICANN engaged several algorithm developers, including NIST’s Paul E. Black, to “provide an open, objective, and predictable mechanism for assessing the degree of visual confusion” in gTLDs.

Black’s algorithm compares a proposed gTLD with other TLDs and generates a score based on their visual similarities. For example, the domain .C0M scores an 88 percent visual similarity with the familiar .COM. The resulting scores may help indicate whether the newly proposed domain name looks too much like existing ones.

To make its assessments, the algorithm rates the degree of similarity between pairs of alphanumeric characters. Some pairs, such as the numeral “1” and its dead ringer, the lowercase letter “l,” are assigned the highest scores for visual similarity, while other pairs, such as “h” and “n,” are given lower scores. The algorithm also takes other considerations into account, such as how certain pairs of letters, like “c” and “l,” can join to look like a third letter (“d”), as in the case of “close” and “dose.”
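
As a rough illustration of the pair-rating idea, the sketch below stores a similarity score for a few character pairs and looks them up regardless of order. The table name, the function name and every numeric value are assumptions made for this example; they are not taken from the NIST algorithm. Case is folded on the assumption that domain names are case-insensitive.

```python
# Hypothetical similarity scores between 0 and 1.
# The values are illustrative only, not the scores used by NIST.
PAIR_SIMILARITY = {
    frozenset("1l"): 0.9,  # numeral one vs. lowercase L: near-identical
    frozenset("0o"): 0.9,  # zero vs. letter O
    frozenset("hn"): 0.4,  # somewhat alike, but easier to tell apart
}


def char_similarity(a: str, b: str) -> float:
    """Return an illustrative visual similarity for two single characters."""
    a, b = a.lower(), b.lower()  # assumption: case is ignored
    if a == b:
        return 1.0
    # frozenset makes the lookup independent of argument order
    return PAIR_SIMILARITY.get(frozenset((a, b)), 0.0)
```

A fuller table would also need entries for multi-character look-alikes such as “cl” and “d,” which this single-character sketch does not cover.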

Employing these scores and considerations, the algorithm computes the “cost” of transforming one string of characters into another, such as “opel” into “apple.” Lower cost means higher visual similarity. The algorithm then adjusts for the relative lengths of the two strings (different lengths increase their distinctiveness) and converts the final cost into a percent similarity.
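
The cost calculation described above resembles a weighted edit distance, in which swapping look-alike characters is cheaper than swapping unrelated ones. The sketch below is one possible reading of that idea, not Black’s actual implementation: the function names, the cost values and the choice to normalize by the longer string’s length are all assumptions, so its percentages will not match the figures quoted in this article.

```python
def substitution_cost(a: str, b: str) -> float:
    """Illustrative cost: 0 for identical characters, low for look-alikes, 1 otherwise."""
    # Hypothetical costs; not the values used by the NIST algorithm.
    look_alikes = {frozenset("0o"): 0.2, frozenset("1l"): 0.2, frozenset("hn"): 0.7}
    if a == b:
        return 0.0
    return look_alikes.get(frozenset((a, b)), 1.0)


def visual_cost(s: str, t: str) -> float:
    """Weighted edit distance: the cheapest way to transform s into t."""
    s, t = s.lower(), t.lower()  # assumption: case is ignored
    prev = [float(j) for j in range(len(t) + 1)]  # cost of building t[:j] from ""
    for i, a in enumerate(s, start=1):
        curr = [float(i)]  # cost of deleting the first i characters of s
        for j, b in enumerate(t, start=1):
            curr.append(min(
                prev[j] + 1.0,                          # delete a
                curr[j - 1] + 1.0,                      # insert b
                prev[j - 1] + substitution_cost(a, b),  # substitute a with b
            ))
        prev = curr
    return prev[-1]


def percent_similarity(s: str, t: str) -> float:
    """Convert cost to a percentage, normalized by the longer string (an assumption)."""
    longest = max(len(s), len(t))
    if longest == 0:
        return 100.0
    return 100.0 * (1.0 - visual_cost(s, t) / longest)
```

Under these assumed costs, `percent_similarity("c0m", "com")` comes out well above `percent_similarity("opel", "apple")`, in line with the principle that lower cost means higher visual similarity.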

ICANN is considering future enhancements to the algorithm, such as having it check for visual confusion between existing domains and future planned Internet top-level domain names in scripts such as Cyrillic.

Source: National Institute of Standards and Technology

