From Terabytes to Petabytes: Computer Scientists Develop New Hybrid Database System
August 26, 2009(PhysOrg.com) -- As the amounts of data being stored by databases around the world enters the realm of the petabyte (the amount of data stored in a mile-high stack of CD-ROM disks), efficient data management is becoming more and more important. Now computer scientists at Yale University have developed a new database system by combining the best features of multiple approaches to create an open source hybrid system called HadoopDB.
Traditional approaches to managing data at this scale typically fall into one of two categories. The first includes parallel database management systems (DBMS), which are good at working with structured data that contain, for instance, tables with trillions of rows of data. The second includes the kind of approach taken by MapReduce, the software framework used by Google to search data contained on the Web, which gives the user more control over how the data is retrieved.
“In essence, HadoopDB is a hybrid of MapReduce and parallel DBMS technologies,” said Daniel Abadi, assistant professor of computer science at Yale and one of the system designers. “It’s designed to take the best features of both worlds. We get the performance of parallel database systems with the scalability and ease of use of MapReduce.”
HadoopDB was announced on Abadi’s blog last month. Yale graduate students and co-creators Azza Abouzeid and Kamil Bajda-Pawlikowski will present more in-depth details of the new system at the VLDB conference in Lyon, France on August 27. They will also present results of a detailed performance analysis they conducted with Abadi, Avi Silberschatz, chair of computer science at Yale, and Alexander Rasin of Brown University. The team will demonstrate the system performance on a range of representative queries at the conference, both on structured and unstructured data, and will outline HadoopDB’s characteristics along the run-time performance, loading time, fault tolerance and scalability dimensions.
With the huge amounts of data being collected and used in today’s databases - from consumer information used by retail chains to improve buying experiences and reduce customer churn to financial information being collected by banks to reduce risk and avoid another catastrophic financial collapse- being able to store and analyze such vast amounts of data will only continue to grow in importance, Abadi said.
HadoopDB reduces the time it takes to perform some typical tasks from days to hours, making more complicated analysis possible - the kind that could be used to find patterns in the stock market, earthquakes, consumer behavior and even outbreaks, Abadi said. “People have all this data, but they’re not using it in the most efficient or useful way.”
-
San Diego Supercomputer Center begins cloud computing research using the Google-IBM CluE cluster
Feb 18, 2009 |
not rated yet |
0
-
CA Offers New Database Performance Analysis Tool
Apr 17, 2007 |
not rated yet |
0
-
Computer scientists devise a 'P4P' system for efficient Internet usage
May 27, 2008 |
not rated yet |
0
-
New operations research paper tackles problems facing confidential databases
Nov 13, 2007 |
not rated yet |
0
-
Yale Professor wins Godel Prize for showing how computer algorithms solve problems
Aug 13, 2008 |
not rated yet |
0
-
Engineers build first sub-10-nm carbon nanotube transistor
Feb 01, 2012 |
4.9 / 5 (33) |
30
-
Something old, something new: Evolution and the structural divergence of duplicate genes
Jan 31, 2012 |
4.6 / 5 (7) |
1
-
The hidden nanoworld of ice crystals: Revealing the dynamic behavior of quasi-liquid layers
Jan 30, 2012 |
5 / 5 (5) |
1
-
Stock market network reveals investor clustering
Jan 27, 2012 |
3.9 / 5 (23) |
8
-
Of microchemistry and molecules: Electronic microfluidic device synthesizes biocompatible probes
Jan 26, 2012 |
5 / 5 (2) |
0
-
Quantum computer faster than regular computer?
2 hours ago
-
Flushing RAM in Mathematica
7 hours ago
-
Synergistic relations between computer science and technology.
Feb 06, 2012
-
how do iphone gloves work?
Feb 05, 2012
-
iPhone battery over time
Jan 30, 2012
-
Best alternate Tablet to an iPad for writing math or physics equations?
Jan 26, 2012
- More from Physics Forums - Computing & Technology
More news stories
Building a 'blind-friendly' Internet
Rakesh Babu demonstrates how a blind person uses the Internet.
35 minutes ago |
not rated yet |
0
Ethanol mandate not the best option
Many people are willing to pay a premium for ethanol, but not enough to justify the government mandate for the corn-based fuel, a Michigan State University economist argues.
Technology / Energy & Green Tech
21 minutes ago |
not rated yet |
0
Darpa to develop mobile millimeter-wave backhaul networks
Providing high-bandwidth communications for troops in remote forward operating locations is not only critical but also challenging because a reliable infrastructure optimized for remote geographic areas does ...
13 minutes ago |
not rated yet |
0
Thomas Edison inspires the oscar awards you don't see
Thomas Edison's invention of the first motion picture camera in 1891 inspired scientific and technological advances that he never could have imagined.
Technology / Hi Tech & Innovation
1 hour ago |
5 / 5 (1) |
0
Microsoft India retail site down after 'cyber attack'
Microsoft said Monday it was investigating an attack by hackers on its Indian retail website, reportedly carried out by a Chinese group called the "Evil Shadow Team."
2 hours ago |
not rated yet |
0
Manipulating genes with hidden TALENs
(PhysOrg.com) -- A better understanding of gene function in model plant and animal systems could be used to develop useful traits in livestock and crop plants, and might someday lead to developments in stem ...
Alien matter in the solar system: A galactic mismatch
This just in: The Solar System is different from the space just outside it.
Transforming galaxies
(PhysOrg.com) -- Many of the Universe's galaxies are like our own, displaying beautiful spiral arms wrapping around a bright nucleus. Examples in this stunning image, taken with the Wide Field Camera 3 on ...
'Smart' microcapsules in a single step
(PhysOrg.com) -- A new, single-step method of fabricating microcapsules, which have potential commercial applications in industries including medicine, agriculture and diagnostics, has been developed by researchers ...
Tenofovir, leading HIV medication, linked with risk of kidney damage
(Medical Xpress) -- Tenofovir, one of the most effective and commonly prescribed antiretroviral medications for HIV/AIDS, is associated with a significant risk of kidney damage and chronic kidney disease that increases over ...
A continent ablaze in auroral and manmade light
The North American continent is literally set ablaze in a confluence of Auroral and Manmade light captured in spectacular new videos snapped by the astronauts serving aboard the International Space Station ...