Ad hoc encyclopaedia for the information age

April 14, 2008

Linking communities and information into a virtual digital library is the 21st century version of the Dictionaire Raisonneé. Better, they can be organised around specific topics, creating vast repositories and networks of experts around a single problem. Best of all, it can be done on demand.

In 1750, Denis Diderot convinced his publisher to support a vast enterprise, the publication of the Encyclopédie gathering all knowledge into one location.

Dozens of writers worked on thousands of articles for more than 15 years to produce the first summary of all human knowledge and, despite the labour and pains of its birth, its entire contents would barely fill one volume of a contemporary encyclopaedia.

Times have changed. And they keep on changing. The pace of discovery in the modern world is such that it is difficult for specialists to stay abreast of their own field let alone be aware of the knowledge in all other fields that may impact on their specialty.

The internet, though useful, makes us aware of our ignorance. It does not reliably fill the gap with relevant and timely information. As an information society, it is becoming increasingly difficult to see the trees for the wood.

“There’s a trend in digital libraries now towards combining heterogeneous data from a wide variety of sources. This includes textual, multimedia objects and, increasingly, sensor and experimental data, or raw data that needs to be processed,” explains Donatella Castelli, scientific coordinator of the Diligent project.

Raw data allows virtual digital library (VDL) users to formulate questions that may not have been considered before. But this quantity of data poses huge processing challenges requiring digital libraries to have enormous resources, resources that are not readily available for many institutions.

The virtual digital library

But not, perhaps, for too much longer. Diligent sought to create a test bed to prove the viability of VDL infrastructure on grid-enabled technology. It would behave a little like a wiki, a Hawaiian word that means quick. Like Wikipedia – the world’s most famous wiki – a VDL on grids could allow the creation of vast online data repositories from distributed computing sources.

But unlike wikis, Diligent created a system that combines digital libraries with grid computing to provide storage, content retrieval and access services and, most impressively, shared data processing capabilities.

Grids link many computers together to provide a framework for shared processing and storage capabilities. So a grid can take a big, processing-intense problem, like weather prediction, and split the problem between a handful, dozens or even thousands of computers. Each only handles a tiny bit or the problem, but combined they provide a huge amount of raw power.

The power of grids is well established, and all that raw data crunching gives physicists and molecular biologists goose bumps. It is the power behind the SETI@home project, which uses volunteers’ computers to analyse cosmic signals in the search for extraterrestrial life.

But grids have never been used for virtual digital libraries, a library that exists only by the combination of data across cyberspace. It is an exciting new use of the technology. But it is not a trivial problem.

“It was very, very difficult,” reveals Castelli. “There was a lot of new technology to learn [and] many of the tools we needed were only being defined as we worked on the project.”

A better mousetrap

It is like inventing a better mousetrap, but the tools to do the job are only being developed as you hop impatiently from foot-to-foot, waiting for them. Then the tools get changed and you need to go back and reinvent your mousetrap.

But the hard work paid off. Diligent created an infrastructure – a system called g-Cube – and two VDLs to validate how it all works; one among the ‘Earth Observation’ community, the other in the Cultural Heritage community. It was a resounding success, and now these research communities have VDLs on grids serving their own needs. These are very impressive results and strain the definition of test bed as Diligent literally pushed the available technology to the limit and still came up with a working infrastructure.

They even developed advanced interface tools to set up a VDL. “We have a wizard to set up VDLs and it is very easy to use,” notes Castelli.

Nonetheless, work remains to be done. “The system needs to be optimised to improve its quality of service. We need to develop a production infrastructure and deal with issues like real infrastructure policies. We’ve started a new project called D4Science, and we’ll be working with the Earth Observation and the Fishery and Aquaculture Resource Management Research communities”, says Castelli.

Diderot’s pride

Diligent has many fine achievements and prompted the interests of a wide range of groups that could usefully share resources. But the real power of the project is the enormous opportunities for fruitful collaboration that their tools will enable in the future.

Scientists, engineers, policy-makers, NGOs and other experts or stakeholders will be able to come together on an ad hoc basis to brainstorm and share relevant data around specific problems, such as disaster relief, fuel efficiency, or even apparently routine tasks like organising a conference.

Diderot, the patron of vast collaborations around a great, hugely ambitious goal, would be proud.

Source: ICT Results


print this article email this article download pdf blog this article bookmark this article     Stumble it Digg this share on Facebook retweet share on Reddit add to delicious
Rate this story - 4 /5 (7 votes)

Rank Filter

Move the slider to adjust rank threshold, so that you can hide some of the comments.


Display comments: newest first

  • drknowledge - Apr 21, 2008
    • Rank: not rated yet
    I wonder if they'd read or even seen the Didirot Encyclopedia. It doesn't "barely fit" in one modern encyclopedia volume, as the article claims. I own a modern copy. It's huge, with easily the information in 5 of 29 volumes of the Encyclopedia Britiannica, not counting the volume of illustrations. One can't help wonder what other "facts" are entirely misrepresented in this "scientific advertisement". I doubt Diderot would have been proud of their approach so far!

April 14, 2008 all stories

Comments: 1

4 /5 (7 votes)
  • Stumble this up

  • Digg this

  • share this

  • hide
  • Related Stories

  • 10 unusual gadgets and gifts for geeks
    created Nov 12, 2009 | popularity not rated yet | comments 0
  • Professor finds that iconic Oswald photo was not faked (w/ Video)
    created Nov 05, 2009 | popularity not rated yet | comments 0
  • Wimba providing classroom alternative in light of flu outbreaks
    created Nov 04, 2009 | popularity not rated yet | comments 0
  • SMOS, Proba-2: Two new ESA satellites successfully lofted into orbit (w/ Video)
    created Nov 02, 2009 | popularity not rated yet | comments 0
  • PCs shed pounds and CD drives, gain touch screens
    created Nov 01, 2009 | popularity not rated yet | comments 0



  • hide
  • Relevant PhysicsForums posts

  • A solution for playing JVC camcorder(mod files) problems
    created 5 hours ago
  • casio calculator that's similar to TI-89
    created Nov 08, 2009
  • Advice on what cell phone to get
    created Nov 08, 2009
  • Changing the language options on your phone.
    created Nov 03, 2009
  • HP strange RPN operation???
    created Nov 02, 2009
  • Databases in physics
    created Oct 31, 2009
  • More from Physics Forums - Computing & Technology

Other News

All eyes on Murdoch as newspapers ponder digital future

Technology / Internet

created 18 hours ago | popularity not rated yet | comments 1

Is Rupert Murdoch bluffing? Making a bold high-stakes gamble that will save the troubled newspaper industry? Or pursuing a pipe dream that can only end in failure?


Road trains may be coming soon to Europe

Road trains may be coming soon to Europe (w/ Video)

Technology / Engineering

created Nov 13, 2009 | popularity 4.8 / 5 (13) | comments 22

(PhysOrg.com) -- Road trains linking vehicles together in a traveling convoy are planned for Europe. With only the lead vehicle being actively driven, the road trains would allow commuters to sleep, read a ...


A system of space solar power system (SSPS)

Japan eyes solar station in space as new energy source

Technology / Energy

created Nov 08, 2009 | popularity 4.8 / 5 (23) | comments 31

It may sound like a sci-fi vision, but Japan's space agency is dead serious: by 2030 it wants to collect solar power in space and zap it down to Earth, using laser beams or microwaves.


Cars sit in traffic on a highway

Netherlands to levy 'green' road tax by the kilometre

Technology / Hi Tech

created Nov 13, 2009 | popularity 3 / 5 (4) | comments 8

The Dutch government said Friday it wants to introduce a "green" road tax by the kilometre from 2012 aimed at cutting carbon dioxide emissions by 10 percent and halving congestion.


Hydrogen milestone moves energy independence one step forward

Hydrogen milestone moves energy independence one step forward

Technology / Energy

created Nov 10, 2009 | popularity 3.9 / 5 (12) | comments 7

(PhysOrg.com) -- Big things often come in small packages. That's certainly the case with the potential created by recent successes in hydrogen research at Idaho National Laboratory.