skip to main content
10.1145/1774088.1774229acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
research-article

Maintaining data reliability without availability in P2P storage systems

Published:22 March 2010Publication History

ABSTRACT

Peer-to-peer (P2P) storage is a promising technology to provide users with cheap and online persistence. However, due the instability of these infrastructures, P2P storage systems must introduce redundancy in order to guarantee a reliable storage service. Besides, they need data repair algorithms to maintain this redundancy in front of permanent node departures. To ensure that such repairs can always be run, existing P2P storage systems aim to maintain 100% data availability. Unfortunately, this solution seems to overkill in preventing data loses, introducing network and data overheads.

In this paper we propose a new data repair algorithm able to guarantee a high reliable storage service without 100% data availability. The main idea is to ensure that objects are kept stored instead of maintaining them available. We analytically prove that our approach reduces considerably the total amount of redundancy. Moreover, through simulation, we show how our approach significantly reduces the required number of repairs, decreasing both, the network and the storage overheads.

References

  1. B. G. Chun, F. Dabek, A. Haeberlen, E. Sit, H. Weatherspoon, M. F. Kaashoek, and J. Kubiatowicz. Efficient replica maintenance for distributed storage systems. machine availability estimation. In Symposium on Networked Systems Design and Implementation (NSDI), 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. L. Cox and B. Noble. Pastiche: Making backup cheap and easy. In Proceedings of 5th USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. F. Dabek, M. F. Kaashoek, D. Karger, R. Morris, and I. Stoica. Wide-area cooperative storage with cfs. In Proceedings of the 18th ACM Symposium on Operating Systems Principles (SOSP), 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Anwitaman Datta and Karl Aberer. Internet-scale storage systems under churn. a study of the steady-state using markov models. In Proceedings of the 6th International Conference on Peer-to-Peer Computing (P2P), 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. P. Druschela and A. Rowstron. Past: A large-scale, persistent peer-to-peer storage utility. In Proceedings of the 8th Workshop on Hot Topics in Operating Systems (HOTOS), 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. A. Duminuco, E. W. Biersack, and T. En-Najjary. Proactive replication in distributed storage systems using machine availability estimation. In Proceedings of the 3rd CoNEXT conference (CONEXT), 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Ranjita Bhagwan Kiran, Kiran Tati, Yu-chung Cheng, Stefan Savage, and Geoffrey M. Voelker. Total recall: System support for automated availability management. In Symposium on Networked Systems Design and Implementation (NSDI), 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M. Luby. Lt codes. In Proceedings of the 43rd Annual IEEE Symposium on Foundations of Computer Science, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. G. Luby, M. Mitzenmacher, M. A. Shokrollahi, D. A. Spielman, and V. Stemann. Practical loss-resilient codes. In Proceedings of the 29th Annual Aymposium on Theory of Computing (STOC), 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. P. Maymounkov. Online codes. Technical Report TR2002-833, New York University, 2002.Google ScholarGoogle Scholar
  11. Sean Rhea, Brighten Godfrey, Brad Karp, John Kubiatowicz, Sylvia Ratnasamy, Scott Shenker, Ion Stoica, and Harlan Yu. Opendht: a public dht service and its uses. SIGCOMM Comput. Commun. Rev., 35(4):73--84, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. A. Shokrollahi. Raptor codes. IEEE/ACM Transactions on Networking (TON), 14, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. E. Sit, A. Haeberlen, F. Dabek, B. Chun, H. Weatherspoon, R. Morris, M. F. Kaashoek, and J. Kubiatowicz. Proactive replication for data durability. In Proceedings of the 5th International Workshop on Peer-to-Peer Systems (IPTPS), 2006.Google ScholarGoogle Scholar
  14. G. Utard and A. Vernois. Data durability in peer to peer storage systems. In Proceedings of the 4th IEEE International Symposium on Cluster Computing and the Grid (CCGRID), 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Z. Zhang and Q. Lian. Reperasure: replication protocol using erasure-code in peer-to-peer storage network. In Proceedings of the 21st Symposium on Reliable Distributed Systems (SRDS), 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Maintaining data reliability without availability in P2P storage systems

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          SAC '10: Proceedings of the 2010 ACM Symposium on Applied Computing
          March 2010
          2712 pages
          ISBN:9781605586397
          DOI:10.1145/1774088

          Copyright © 2010 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 22 March 2010

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          SAC '10 Paper Acceptance Rate364of1,353submissions,27%Overall Acceptance Rate1,650of6,669submissions,25%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader