Sequential data access with Oracle and Hadoop: a performance comparison

Zbigniew Baranowski; Luca Canali; Eric Grancher

doi:10.1088/1742-6596/513/4/042001

Journal of Physics: Conference Series

The following article is Open access

Sequential data access with Oracle and Hadoop: a performance comparison

Zbigniew Baranowski¹, Luca Canali¹ and Eric Grancher¹

Published under licence by IOP Publishing Ltd
Journal of Physics: Conference Series, Volume 513, Issue 4 Citation Zbigniew Baranowski et al 2014 J. Phys.: Conf. Ser. 513 042001 DOI 10.1088/1742-6596/513/4/042001

Download Article PDF

Article metrics

1044 Total downloads

Author affiliations

¹ European Organization for Nuclear Research, IT Department, Database Group Geneva, Switzerland

Buy this article in print

Journal RSS

Sign up for new issue notifications

Abstract

The Hadoop framework has proven to be an effective and popular approach for dealing with "Big Data" and, thanks to its scaling ability and optimised storage access, Hadoop Distributed File System-based projects such as MapReduce or HBase are seen as candidates to replace traditional relational database management systems whenever scalable speed of data processing is a priority. But do these projects deliver in practice? Does migrating to Hadoop's "shared nothing" architecture really improve data access throughput? And, if so, at what cost? Authors answer these questions–addressing cost/performance as well as raw performance– based on a performance comparison between an Oracle-based relational database and Hadoop's distributed solutions like MapReduce or HBase for sequential data access. A key feature of our approach is the use of an unbiased data model as certain data models can significantly favour one of the technologies tested.

Export citation and abstract BibTeX RIS

Next article in issue

Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

Please wait… references are loading.

Sequential data access with Oracle and Hadoop: a performance comparison

Article metrics

Share this article

Author affiliations

Abstract