Grid browser finds the meaning of life
May 20, 2009
(PhysOrg.com) -- A web browser that can understand technical terms in life sciences and automatically find additional resources and services has been developed by European researchers. It could lead to a new generation of intelligent search engines.
The life sciences community has built numerous databases, covering for example gene sequences and information about diseases, that are available to researchers as ‘grid’ services.
“Grid computing is essentially about building virtual organisations that are independent of the physical location where they reside,” says Michael Schroeder of Technische Universität Dresden.
The problem is how to link those services to other scientific information found on the web. Schroeder is coordinator of the EU-funded Sealife project which has created a ‘semantic grid browser’ to make grid services for the life sciences much more accessible.
“We have the web on the one hand and then we have grid computing, with its many services, on the other,” he says. A semantic grid browser seamlessly integrates them.
“It tries to understand what it finds on web pages, interprets this content and then links it, on the fly, to services that might be useful to the user.”
A matter of semantics
The key to the Sealife browser is a ‘semantic hyperlink’ that shows up on the page to direct users to relevant services. The link is not put there by the website but by the browser itself.
How does it do that?
First, the browser needs to understand the content of the page and identify terms which could be linked to grid services. An example tested in the Sealife project is the naming of genes. Each human gene has an average of 5.5 names, Schroeder points out, but if it can be identified correctly, a link can be made to a wealth of information about that gene.
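The gene-naming step can be illustrated with a small sketch. This is not Sealife's algorithm, which the article does not describe in detail; it is a minimal dictionary lookup in Python that maps alternative names to one canonical symbol and builds a link for each match. The synonym table is a tiny illustrative sample, and the `example.org` URL pattern and function names are assumptions made for the example.

```python
import re

# Illustrative sample only: canonical gene symbol -> a few alternative names.
# A real system would load a curated synonym resource instead.
GENE_SYNONYMS = {
    "TP53": ["p53", "tumor protein p53", "LFS1"],
    "BRCA1": ["breast cancer 1", "RNF53"],
}


def build_index(synonyms):
    """Map every lowercased name (canonical or alternative) to its symbol."""
    index = {}
    for symbol, names in synonyms.items():
        index[symbol.lower()] = symbol
        for name in names:
            index[name.lower()] = symbol
    return index


def find_genes(text, index):
    """Return (matched text, canonical symbol) pairs in order of appearance."""
    # Longest names first, so multi-word names win over their substrings.
    names = sorted(index, key=len, reverse=True)
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(n) for n in names) + r")\b",
        re.IGNORECASE,
    )
    return [(m.group(0), index[m.group(0).lower()]) for m in pattern.finditer(text)]


def semantic_link(symbol):
    """Build a hypothetical service URL for a canonical gene symbol."""
    return f"https://example.org/gene/{symbol}"


index = build_index(GENE_SYNONYMS)
for matched, symbol in find_genes("Mutations in p53 and RNF53 were studied.", index):
    print(matched, "->", semantic_link(symbol))
```

A plain lookup like this cannot cope with ambiguous names, which is why the browser also needs the context handling described next.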
The browser must also be able to handle ambiguity. “If I see ‘Jaguar’ on a web page, what is it? Is it an animal? Is it a car? Is it the Mac operating system?” Sealife uses specialised algorithms to work out the context from other words on the page and correctly interpret the meaning.
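One simple way to picture context-based disambiguation is to score each possible sense by how many of its typical cue words appear near the ambiguous term. The sketch below does this for the ‘Jaguar’ example. It is a toy stand-in, not the specialised algorithms Sealife uses, and the cue-word lists are invented for illustration.

```python
import re

# Hypothetical cue words for each sense of "Jaguar" (assumed, not from Sealife).
SENSE_CONTEXT = {
    "animal": {"cat", "prey", "jungle", "species", "habitat"},
    "car": {"engine", "driver", "luxury", "sedan", "dealer"},
    "operating system": {"mac", "apple", "install", "software", "upgrade"},
}


def disambiguate(text, senses=SENSE_CONTEXT):
    """Pick the sense whose cue words overlap most with the words on the page.

    Returns None when no cue word appears at all.
    """
    words = set(re.findall(r"[a-z]+", text.lower()))
    scores = {sense: len(words & cues) for sense, cues in senses.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


print(disambiguate("The Jaguar stalks its prey in the jungle habitat."))
print(disambiguate("I will install Jaguar on my Mac."))
```

Counting overlaps ignores word order and how strongly each word signals a sense, so it serves only as an intuition for the kind of evidence a real disambiguator weighs.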
It is still not an exact science, though. The Sealife team entered their algorithm in an international competition with 50 others to identify names of genes. They won, with an 81% success rate, though Schroeder says they have now got that up to 87%.
Background knowledge
The second challenge is the background knowledge that allows the browser to make sense of the identified terms. Such knowledge is formally known as an ‘ontology’, a systematic hierarchy of concepts and their relation to one another. Biology, with its extensive taxonomies, is an ideal field for semantic grid browsing.
“All these efforts of building hierarchical classification systems have been at the core of biology for centuries,” says Schroeder. “Biologists are used to it and there are many efforts to make information exchangeable.”
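A hierarchy of concepts of this kind can be sketched as a mapping from each concept to its parent. The toy taxonomy and function names below are assumptions for illustration; real ontologies hold far more concepts and other kinds of relation besides "is a".

```python
# Toy ontology (assumed for illustration): each concept points to its parent.
PARENT = {
    "jaguar": "big cat",
    "big cat": "mammal",
    "mammal": "animal",
    "animal": None,
}


def ancestors(concept, parent=PARENT):
    """Walk up the hierarchy and list every broader concept, nearest first."""
    chain = []
    current = parent.get(concept)
    while current is not None:
        chain.append(current)
        current = parent.get(current)
    return chain


def is_a(concept, candidate, parent=PARENT):
    """True if `candidate` is a broader concept above `concept`."""
    return candidate in ancestors(concept, parent)


print(ancestors("jaguar"))
print(is_a("jaguar", "mammal"))
```

With such a structure, a term recognised on a page can be connected to services registered under any of its broader concepts.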
But outside the life sciences such systematic classification is not so well developed, and the Sealife project has created editors to build ontologies from published literature in any specific field of interest.
“We developed algorithms that grind through this data, identify the key concepts and then the ontology editor offers these concepts to you,” Schroeder explains. “If you agree, it then searches the web to find things that look like definitions. This whole process of building this background knowledge cannot be fully automated but you can ease the pain of doing this quite significantly.”
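The first step Schroeder describes, proposing key concepts for a human editor to accept or reject, can be approximated very roughly by counting frequent terms across a set of texts. The sketch below assumes plain word frequency and a small hand-written stopword list; the article does not say how the Sealife ontology editor actually selects concepts.

```python
import re
from collections import Counter

# Small assumed stopword list; a real system would use a much fuller one.
STOPWORDS = {"the", "a", "an", "of", "in", "and", "is", "to", "by", "are", "for", "with"}


def candidate_concepts(documents, top_n=3):
    """Return the most frequent non-stopword terms as candidate concepts."""
    counts = Counter()
    for doc in documents:
        for word in re.findall(r"[a-z]+", doc.lower()):
            if word not in STOPWORDS and len(word) > 2:
                counts[word] += 1
    return [term for term, _ in counts.most_common(top_n)]


docs = [
    "Malaria is caused by a parasite.",
    "The parasite spreads malaria through mosquitoes.",
    "Malaria treatment targets the parasite.",
]
# The candidates would then be offered to an editor for confirmation.
print(candidate_concepts(docs, top_n=2))
```

The human stays in the loop: the code only proposes terms, in line with the point that the process cannot be fully automated.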
Different varieties of the Sealife browser build on work by partners in Edinburgh, Manchester, London and Sophia-Antipolis, as well as in Dresden. They have been tested in three scenarios: evidence-based medicine, mining of scientific and patent literature, and molecular biology. In each case, the focus has been on infectious diseases.
Browser that understands everything?
So successful has the project been that TU Dresden has spun off a new company, Transinsight, to exploit work done in Sealife. The company has sold semantic browsers to such major customers as BASF and Unilever and runs the GoPubMed search engine, which is linked to the respected PubMed archive of biomedical literature.
But there is no reason why a semantic browser should be confined to specialised academic areas. Could we have a browser that understands everything? Schroeder thinks that is not as far-fetched as it may seem. “The vision is to include every domain,” he says. “For example, if we were able to extract and formalise the knowledge in Wikipedia we would have this general background knowledge that covers all areas.”
Many researchers look forward to a next-generation search engine that can understand what the user is looking for and return much more relevant results than today’s engines can. “This will involve integrating information,” says Schroeder, “because very often answers to questions are not provided in one document as a single statement that I can pick up by keywords.
“In the future, we will need background knowledge and this is at the core of Sealife. If we build semantics into search, and make it scalable, then you will have the next-generation search engine.”
More information: http://www.biotec. … .de/sealife/
Provided by ICT Results