New metasearch engine leaves Google, Yahoo crawling
March 25, 2009
Weiyi Meng, a professor of computer science at Binghamton University, State University of New York, is hopeful that one day in the not-too-distant future, you'll be able to type a query into an online search engine and have it deliver not Web pages that may contain an answer, but just the answer itself. Credit: Jonathan Cohen
One day in the not-too-distant future, you'll be able to type a query into an online search engine and have it deliver not Web pages that may contain an answer, but just the answer itself, says Weiyi Meng, a professor of computer science at Binghamton University, State University of New York.
For instance, imagine typing in "Who starred in the film Casablanca?" The search engine would respond with "Humphrey Bogart and Ingrid Bergman."
Not impressed?
Try asking a more nuanced question, such as "What do Americans think of universal health care?" A search engine will create a report indicating trends in opinion based on what has been posted to the Web.
Search engines may eventually be used to conduct polling and even help sort fact from fiction, said Meng, who is helping to make such possibilities a reality, both through his research and as president of a company called Webscalers.
The way Meng sees it, big search engines such as Google and Yahoo are fundamentally flawed. The Web has two parts: the surface Web and the deep Web. The surface Web is made up of perhaps 60 billion pages. The deep Web, at some 900 billion pages, is about 15 times larger.
Google, which relies on a "crawler" to examine pages and catalog them for future searches, can search about 20 billion pages. Web crawlers follow links to reach pages and often miss content that isn't linked to any other page or is in some way "hidden."
Meng, along with researchers at the University of Illinois at Chicago and the University of Louisiana at Lafayette, has helped pioneer large-scale metasearch-engine technology that harnesses the power of small search engines to come up with results that are more accurate and more complete.
"Most of the pages on the deep Web aren't directly 'crawlable.' We want to connect to small search engines and reach the deep Web," he said. "That's the idea. Many people have the misconception that Google can search everything, and if it's not there it doesn't exist. But we should be able to retrieve many times more than what Google can search."
Not only can a metasearch engine probe deeper, it can also offer the latest information.
"In principle," Meng said, "small guys are much better able to maintain the freshness of their data. Google has a program to 'crawl' all over the world. Depending on when the crawler has last visited your server, there's a delay of days or weeks before a new page will show up in that search. We can get fresher results."
The concept is not new. In fact, the first metasearch engine was built in 1994.
"The big difference between our technology and the ones pursued by other people is that most of the other technologies do the metasearching on top of a small number of general-purpose search engines, such as Yahoo, Google or MSN," Meng explained. "We have a completely different perspective. We want to build large-scale metasearch engines on top of many small search engines."
The Web has millions of search engines at businesses, universities, newspapers and other organizations. Since 1997, and with continued funding from the National Science Foundation, Meng and his collaborators have found ways to run queries across multiple search engines and sort through the results.
Webscalers is based in the Start-Up Suite at Binghamton University's Innovative Technologies Complex, which is home to several young companies that have their roots in faculty inventions.
"If the Web keeps on growing, a company like Google may run out of resources to crawl all of those pages," said Vijay V. Raghavan, vice president of Webscalers and a faculty member at the University of Louisiana at Lafayette. "We won't have that problem. We will scale much better."
Webscalers' technology could be useful for large organizations with many divisions. For example, Webscalers has developed a prototype that would allow a search of all 64 campuses in the State University of New York system as well as SUNY's central administration.
"People can use it to find collaborators," Meng said. "It could also help prospective students find programs they're interested in."
The technology could be adapted to large companies or even the government, Meng said.
Challenges for large-scale metasearch engines include determining which search engines are the best for a given query, automating the interaction with search engines as well as organizing the search results.
Meng hopes to build a grand metasearch engine one day that would integrate all of the 1 million small search engines into a single system. "There are still a lot of significant challenges in creating a system of such magnitude," he said, "but I am optimistic that such a metasearch engine can be built."
Try out the concept online
Webscalers has already launched several metasearch products:
The first is a news metasearch engine called AllinOneNews. Available at http://www.allinonenews.com , it connects to 1,800 news sources in 200 countries. That's the largest metasearch engine in the world.
Webscalers also offers MySearchView, a system that allows any user to create his or her own metasearch engine just by checking off a few options at http://www.mysearchview.com .
Source: Binghamton University
-
Search engine branding to be examined by researcher
Jun 11, 2008 |
not rated yet |
0
-
Branding matters -- even when searching
Jun 28, 2007 |
not rated yet |
0
-
Search engine mashup
Jul 06, 2007 |
not rated yet |
0
-
Clicks on sponsored links lower than previously reported but show growth potential
Aug 22, 2007 |
not rated yet |
0
-
Search engines return similar results for e-commerce comparison shopping
Feb 02, 2006 |
not rated yet |
0
-
Engineers build first sub-10-nm carbon nanotube transistor
Feb 01, 2012 |
4.9 / 5 (30) |
30
-
Something old, something new: Evolution and the structural divergence of duplicate genes
Jan 31, 2012 |
4.6 / 5 (7) |
1
-
The hidden nanoworld of ice crystals: Revealing the dynamic behavior of quasi-liquid layers
Jan 30, 2012 |
5 / 5 (3) |
1
-
Stock market network reveals investor clustering
Jan 27, 2012 |
3.9 / 5 (23) |
8
-
Of microchemistry and molecules: Electronic microfluidic device synthesizes biocompatible probes
Jan 26, 2012 |
5 / 5 (1) |
0
-
Synergistic relations between computer science and technology.
Feb 06, 2012
-
how do iphone gloves work?
Feb 05, 2012
-
iPhone battery over time
Jan 30, 2012
-
Best alternate Tablet to an iPad for writing math or physics equations?
Jan 26, 2012
-
Sending SMS to a website
Jan 20, 2012
-
Need help with my technical fest!
Jan 19, 2012
- More from Physics Forums - Computing & Technology
More news stories
Zuckerberg's focus drives Facebook's ascent
When Mark Zuckerberg showed up to rent Judy Fusco's Los Altos, Calif., house in the fall of 2004, soon after he'd arrived in Silicon Valley, the landlord was immediately struck by his confidence.
1 hour ago |
1 / 5 (1) |
0
Review: Netflix and Hulu's new scripted originals
Within just over a week, Netflix and Hulu are both debuting their first stabs at original scripted programming.
1 minute ago |
not rated yet |
0
Tailor-made search tools for the Web
For companies, customer feedback is a matter of strategic importance. Smart apps for the semantic analysis of user opinions from the Web help businesses keep an eye on feedback. Users benefit as well: with ...
16 minutes ago |
not rated yet |
0
New error-correcting codes guarantee the fastest possible rate of data transmission
Error-correcting codes are one of the triumphs of the digital age. Theyre a way of encoding information so that it can be transmitted across a communication channel such as an optical fiber o ...
Technology / Computer Sciences
5 hours ago |
5 / 5 (3) |
2
|
Netflix light on flicks as viewers soak up TV shows
Like most fresh faces that arrive in Hollywood, Netflix wanted to be a movie star. But now it's learning what many in Tinseltown have known for decades: Movies are sexy, but the real money is in television.
2 hours ago |
not rated yet |
1
New understanding of DNA repair could eventually lead to cancer therapy
A research group in the Faculty of Medicine & Dentistry at the University of Alberta is hoping its latest discovery could one day be used to develop new therapies that target certain types of cancers.
Hovering not hard if you're top-heavy, researchers find
Top-heavy structures are more likely to maintain their balance while hovering in the air than are those that bear a lower center of gravity, researchers at New York University's Courant Institute of Mathematical Sciences ...
Grass to gas: Researchers' genome map speeds biofuel development
Researchers at the University of Georgia have taken a major step in the ongoing effort to find sources of cleaner, renewable energy by mapping the genomes of two originator cells of Miscanthus x giganteus, a large perenn ...
Both maternal and paternal age linked to autism
Older maternal and paternal age are jointly associated with having a child with autism, according to a recently published study led by researchers at The University of Texas Health Science Center at Houston (UTHealth).
Night, weekend delivery OK for babies with birth defects
Weekday delivery is no better than night or weekend delivery for infants with birth defects, according to a new study presented today at The Pregnancy Meeting, the Society for Maternal-Fetal Medicine's annual conference. ...
Sonic Cradle lands spot in TED exhibition
A Simon Fraser University graduate student project that melds music, meditation and modern technology has landed a rare spot as an exhibit at TEDActive 2012 in Palm Springs, California this month.
Mar 25, 2009
Rank: 4 / 5 (1)
This "metasearch" however, has little do to with a typical "Google Search." It seems to be more related to the "post-processing" of the results than actually finding them.
Airkin also made me aware in another article that it "seems to be limited to its own database (think Wiki) not like Google's large ones of the internet."
This is evidenced by "Meng hopes to build a grand metasearch engine one day that would integrate all of the 1 million small search engines into a single system."
It seems like a step back in my mind... Or at least, too infantile to be of any use (yet).
Another issue that should be consider is "Should you trust the result."
The converse of "one simple answer" is being painted as a negative: Google search returns many (too many?) results that have to be poured over to distill an answer... Though this is not really the case, as (generally) you'll get the answer you're looking for in the top 10 results.
However, with one absolute answer, "just because the computer said," how do you know it's the correct answer? Many results gives you the ability to compare and contrast and decide for yourself what's true.
I suppose (and understand) this is what Wolfram and Meng are attempting to accomplish... An authoritative response that falls within the human margin of error... But it just seems to me "humans are computers, and computers aren't human." Is there simply an natural disconnect between the two different "mediums" or will a singularity be reached in the future?
I wonder what the metasearch would have to say about that question. ;) "42."
Mar 25, 2009
Rank: not rated yet
Mar 26, 2009
Rank: not rated yet
Let it go already, we get it.
Mar 29, 2009
Rank: not rated yet
Mar 29, 2009
Rank: not rated yet
Really? Gosh, I've seen pages I post show up literally minutes later.
For this kind of stuff, I'll believe it when I see it, and I think seeing it is a long, long way off.
Apr 07, 2009
Rank: not rated yet
Yes, I know this isn't really a flaw. I just wanted to point out that all the search engines have their good and their bad sides and can develop simultaneously.