Infovell's 'research engine' finds deep Web pages that Google, Yahoo miss
September 8, 2008 by Lisa Zyga
With Infovell, users search with key phrases up to 25,000 words long, rather than keywords. Image credit: Infovell.
According to a study by the University of California at Berkeley, traditional search engines such as Google and Yahoo index only about 0.2% of the Internet. The remaining 99.8%, known as the "deep Web," is a vast body of public and subscription-based information that traditional search engines can't access.
To dig into this "invisible" information, scientists have developed a new search engine called Infovell geared at helping researchers find often obscure data in the deep Web. As scientists working on the Human Genome Project, Infovellīs founders designed the new searching technology based on methods in genomics research. Instead of using keywords, Infovell accepts much longer search terms, and in any language.
"There are no īkeywordsī in genetics," explains Infovellīs Web site. "New unique and powerful techniques have been developed to extract knowledge from genes. Now, through Infovell, these techniques have, for the first time, been applied to language and other symbol systems, shattering long-held barriers in search and leapfrogging the capabilities of current search providers to deliver the Worldīs Research Engine."
While keywords may work fine for the general public looking for popular and accessible content, they donīt often meet the needs of researchers looking for specific data. As information in the deep web continues to grow, Infovell explains that a one-size-fits-all approach to searching will make academic searching even more challenging.
One reason is the nature of deep Web sites themselves. While many popular Web sites are specifically designed to be search-engine friendly, a lot of deep Web content is unstructured, making it difficult for keyword-based search engines to index. Further, the deep Web does not receive much traffic, meaning these pages donīt have many incoming links and therefore arenīt ranked highly by systems such as Googleīs PageRank. And for private sites, barriers such as registration and subscription requirements also make it difficult for search engines to access them.
Searching with keywords also presents a trade-off between being too general and getting millions of irrelevant results, or being too specific and not getting any results at all. After getting results, users then have to sift through many pages looking for what they need.
But with Infovell, users search with "KeyPhrases," from paragraphs to whole documents or even sets of documents up to 25,000 words. Because itīs born out of the world of genomics, Infovell is also language-independent. Users can search in English, Chinese, Arabic, or even mathematical symbols, chemical formulas, or musical notes. "The key requirement is that the information is in digital format, and it can be stored in a linear, sequential and segregated manner," according to Infovellīs site.
Infovellīs technology allows users to locate the most current and comprehensive documents and published articles from billions of pages, with topics including life sciences, medicine, patents, industry news, and other reference content.
Currently, some researchers use advanced search options provided by individual sites to try to get around keyword search engines. However, these search engines require users to learn special syntax, and only work for the site theyīre at. The advantage of Infovell is that it doesnīt require special training (and it doesnīt use Boolean operators, taxonomies or clustering); rather, it is easy to use and can search everything at once.
Although Infovell is not the first attempt at a search engine for crawling the deep Web, its developers hope that researchers will benefit from Infovellīs advantages more in the future, especially as the deep Web continues to grow.
Infovell is being demonstrated at DEMOfall08, a conference for emerging technologies taking place in San Diego on September 7-9. Users can sign up for a 30-day risk-free trial at Infovellīs Web site, and Infovell is initially available on a subscription basis. Later this year, Infovell will release a free beta version on a limited basis without some of the advanced features in the premium version.
More information: www.infovell.com
Via: www.networkworld.com
-
UK researchers rank best online advice for postnatal depression
Feb 07, 2012 |
not rated yet |
0
-
Yahoo! shakes up board to give firm new life
18 hours ago |
not rated yet |
0
-
Online dating research shows cupid's arrow is turning digital
Feb 06, 2012 |
not rated yet |
0
-
Italian professor launches challenge to Google
Feb 06, 2012 |
4.2 / 5 (5) |
0
-
EU probes new Google privacy policy
Feb 03, 2012 |
not rated yet |
0
-
Engineers build first sub-10-nm carbon nanotube transistor
Feb 01, 2012 |
4.9 / 5 (30) |
30
-
Something old, something new: Evolution and the structural divergence of duplicate genes
Jan 31, 2012 |
4.6 / 5 (7) |
1
-
The hidden nanoworld of ice crystals: Revealing the dynamic behavior of quasi-liquid layers
Jan 30, 2012 |
5 / 5 (3) |
1
-
Stock market network reveals investor clustering
Jan 27, 2012 |
3.9 / 5 (23) |
8
-
Of microchemistry and molecules: Electronic microfluidic device synthesizes biocompatible probes
Jan 26, 2012 |
5 / 5 (1) |
0
-
feed hold button on CNC lathe
18 hours ago
-
Mechanics of Solids ( Final exam question) please help!
20 hours ago
-
RFAC in Fortran
23 hours ago
-
dynamics 2/32
Feb 08, 2012
-
dynamics
Feb 08, 2012
-
Vibration Absorbtion Problem
Feb 08, 2012
- More from Physics Forums - General Engineering
More news stories
Soraa LED light may dim 50-watt halogen rivals
(PhysOrg.com) -- Soraa, a Fremont, California company founded in 2008, this week launched its first product, a light that uses LEDS (light emitting diodes). The "Soraa LED MR16 lamp" is the "perfect" replacement ...
First Google hire leaving for online academy
The first person hired by Google's founders is leaving the Internet giant to devote himself to an innovative online education website called Khan Academy.
6 hours ago |
5 / 5 (1) |
0
FBI file: Steve Jobs was considered for govt post
(AP) -- FBI background interviews of some people who knew Apple co-founder Steve Jobs reveal a man driven by power and alienating some of the people who worked with him.
6 hours ago |
3.4 / 5 (5) |
0
New integrated building model may improve fish farming operations
Today's "locavore" movement with its emphasis on eating more locally-produced food is a natural fit for fruits and vegetables in nearly every region, but few entrepreneurs have dared to apply the concept to ...
7 hours ago |
not rated yet |
0
NY attorney general ends lawsuit against Intel
(AP) -- Intel Corp. is paying $6.5 million as part of a deal to terminate an antitrust lawsuit filed against the chip maker by the New York attorney general's office.
6 hours ago |
not rated yet |
0
'Dark plasmons' transmit energy
Microscopic channels of gold nanoparticles have the ability to transmit electromagnetic energy that starts as light and propagates via "dark plasmons," according to researchers at Rice University.
FDA-approved drug rapidly clears amyloid from the brain, reverses Alzheimer's symptoms in mice
Neuroscientists at Case Western Reserve University School of Medicine have made a dramatic breakthrough in their efforts to find a cure for Alzheimer's disease. The researchers' findings, published in the journal Science, show t ...
Hydrogen from acidic water: Researchers develop potential low cost alternative to platinum for splitting water
A technique for creating a new molecule that structurally and chemically replicates the active part of the widely used industrial catalyst molybdenite has been developed by researchers with the Lawrence Berkeley ...
Ultraviolet protection molecule in plants yields its secrets
Lying around in the sun all day is hazardous not just for humans but also for plants, which have no means of escape. Ultraviolet (UV) radiation from the sun can damage proteins and DNA inside cells, leading ...
Anyone can learn to be more inventive, cognitive researcher says
There will always be a wild and unpredictable quality to creativity and invention, says Anthony McCaffrey, a cognitive psychology researcher at the University of Massachusetts Amherst, because an "Aha moment" is rare and ...
New method makes culture of complex tissue possible in any lab
Scientists at the University of California, San Diego have developed a new method for making scaffolds for culturing tissue in three-dimensional arrangements that mimic those in the body. This advance, published online in ...
Sep 08, 2008
Rank: 4 / 5 (3)
It's high time for a better search engine. If true to their word, Infovell people just got themselves a big $$$ generator.
Sep 08, 2008
Rank: 4 / 5 (2)
Sep 09, 2008
Rank: not rated yet
I'm not positive, but I believe the only way to keep the information published and available is to generate a "hard" HTML copy of the page.
Sep 09, 2008
Rank: 4 / 5 (1)
Sep 12, 2008
Rank: not rated yet
Sep 12, 2008
Rank: not rated yet