ABSTRACT
In cloud storage and big data processing systems, RAID especially disk arrays tolerating triple disk failures (3DFTs) is a popular choice to provide high reliability with low monetary cost. For 3DFTs, a key obstacle is the low partial stripe write performance, which is caused by large amount of parity modifications based on complex erasure coding layouts.
In order to solve this problem, in this paper, we propose an optimized partial stripe write (OPS) method, which reorganizes the distribution of write data blocks to share partial parities among data blocks, thereby improving overall I/O performance. The OPS method can effectively reduce the number of modified parities. To illustrate the effectiveness of our OPS method, we used Disksim to evaluate several different partial stripe write methods through simulation. The results show that OPS can reduce the average response time by up to 37.21% and decreases the number of write operations by up to 26.22% compared to the traditional partial strip writing method..
- J. Bonwick. Raid-z. http://blogs.sun.com/bonwick/entry/raidz, 2010.Google Scholar
- P. Chen, E. Lee, G. Gibson, R. Katz, and D. Patterson. RAID: High-performance, reliable secondary storage. ACM Computing Surveys, 26(2):145--185, June 1994. Google ScholarDigital Library
- D. Patterson, G. Gibson, and R. Katz. A case for Redundant Arrays of Inexpensive Disks (RAID). In Proc. of the 1988 ACM SIGMOD International Conference on Management of Data, Chicago, IL, June 1988. Google ScholarDigital Library
- E. Pinheiro, W. Weber, and L. Barroso. Failure trends in a large disk drive population. In Proc. of the 5th USENIX Conference on File and Storage Technologies, San Jose, CA, February 2007. Google ScholarDigital Library
- B. Schroeder and G. Gibson. Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you? In Proc. of the 5th USENIX Conference on File and Storage Technologies, San Jose, CA, February 2007. Google ScholarDigital Library
- I. Reed and G. Solomon. Polynomial codes over certain finite fields. Journal of the Society for Industrial and Applied Mathematics, pages 300--304, 1960.Google ScholarCross Ref
- Stephen B Wicker and Vijay K Bhargava. Reed-Solomon codes and their applications. John Wiley & Sons, 1999. Google ScholarCross Ref
- James S Plank. A tutorial on reed-solomon coding for fault-tolerance in raid-like systems. Software: Practice and Experience, 27(9):995--1012, 1997. Google ScholarDigital Library
- C. Huang and L. Xu. STAR: An efficient coding scheme for correcting triple storage node failures. In Proc. of the 4th USENIX Conference on File and Storage Technologies, San Francisco, CA, December 2005. Google ScholarDigital Library
- Y. Wang, G. Li, and X. Zhong. Triple-Star: A coding scheme with optimal encoding complexity for tolerating triple disk faliures in raid. International Journal of Innovative Computing, Information and Control, 8(3):1731--1472, 2012.Google Scholar
- Yongzhe Zhang, Chentao Wu, Jie Li, and Minyi Guo. Tip-code: A three independent parity code to tolerate triple disk failures with optimal update complextiy. In 2015 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), pages 136--147. IEEE, 2015. Google ScholarDigital Library
- Yanbing Jiang, Chentao Wu, Jie Li, and Minyi Guo. Eh-code: An extended mds code to improve single write performance of disk arrays for correcting triple disk failures. In International Conference on Algorithms and Architectures for Parallel Processing, pages 34--49. Springer, 2015.Google ScholarCross Ref
- Dan Tang, Xiaojing Wang, Sheng Cao, and Zheng Chen. A new class of highly fault tolerant erasure code for the disk array. In Power Electronics and Intelligent Transportation System,, 2008. PEITS'08. Workshop on, pages 578--581. IEEE, 2008. Google ScholarDigital Library
- Dimitris S Papailiopoulos and Alexandros G Dimakis. Locally repairable codes. IEEE Transactions on Information Theory, 60(10):5843--5855, 2014.Google ScholarCross Ref
- Pradeep Subedi and Xubin He. A comprehensive analysis of xor-based erasure codes tolerating 3 or more concurrent failures. In Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International, pages 1528--1537. IEEE, 2013. Google ScholarDigital Library
- Zhang Yanpo. Lrc(local reconstruction codes) erasure code based on reed-solomon with vandermonde matrix. https://github.com/baishancloud/lrc-erasure-code, 2015.Google Scholar
- Congjin Du, Chentao Wu, Jie Li, Minyi Guo, and Xubin He. Bps: A balanced partial stripe write scheme to improve the write performance of raid-6. In Cluster Computing (CLUSTER), 2015 IEEE International Conference on, pages 204--213. IEEE, 2015. Google ScholarDigital Library
- Guillermo A Alvarez, Walter A Burkhard, and Flaviu Cristian. Tolerating multiple failures in raid architectures with optimal storage and uniform declustering. In ACM SIGARCH Computer Architecture News, volume 25, pages 62--72. ACM, 1997. Google ScholarDigital Library
- E. David. Method for improving partial stripe write performance in disk array subsystems. US Patent No. 5333305, July 1994.Google Scholar
- Ao Ma, Rachel Traylor, Fred Douglis, Mark Chamness, Guanlin Lu, Darren Sawyer, Surendar Chandra, and Windsor Hsu. Raidshield: characterizing, monitoring, and proactively protecting against disk failures. ACM Transactions on Storage (TOS), 11(4):17, 2015. Google ScholarDigital Library
- A. Thomasian and M. Blaum. Higher reliability redundant disk arrays: Organization, operation, and coding. ACM Transactions on Storage, 5(4):Article 7, November 2009. Google ScholarDigital Library
- M. Blaum, J. Brady, J. Bruck, and J. Menon. EVENODD: An efficient scheme for tolerating double disk failures in RAID architectures. IEEE Transactions on Computers, 44(2):192--202, February 1995. Google ScholarDigital Library
- P. Corbett, B. English, A. Goel, T. Grcanac, S. Kleiman, J. Leong, and S. Sankar. Row-Diagonal Parity for double disk failure correction. In Proc. of the 3rd Usenix Conference on File and Storage Technologies, San Francisco, CA, March 2004. Google ScholarDigital Library
- Chao Jin, Hong Jiang, Dan Feng, and Lei Tian. P-code: A new raid-6 code with optimal properties. In Proceedings of the 23rd international conference on Supercomputing, pages 360--369. ACM, 2009. Google ScholarDigital Library
- C. Wu, S. Wan, X. He, and C. Xie. H-Code: A hybrid mds array code to optimize partial stripe write in raid-6. In Proc. of the IEEE IPDPS' 11, Anchorage, AK, May 2011. Google ScholarDigital Library
- L. Xu and J. Bruck. X-Code: MDS array codes with optimal encoding. IEEE Transactions on Information Theory, 45(1):272--276, January 1999. Google ScholarDigital Library
- Xiaoning Ding, Song Jiang, Feng Chen, Kei Davis, and Xiaodong Zhang. Diskseen: Exploiting disk layout and access history to enhance i/o prefetch. In USENIX Annual Technical Conference, volume 7, pages 261--274, 2007. Google ScholarDigital Library
- Sitaram Iyer and Peter Druschel. Anticipatory scheduling: A disk scheduling framework to overcome deceptive idleness in synchronous i/o. In ACM SIGOPS Operating Systems Review, volume 35, pages 117--130. ACM, 2001. Google ScholarDigital Library
- Yuehai Xu and Song Jiang. A scheduling framework that makes any disk schedulers non-work-conserving solely based on request characteristics. In FAST, pages 119--132, 2011. Google ScholarDigital Library
- Alexander Thomasian. Survey and analysis of disk scheduling methods. ACM SIGARCH Computer Architecture News, 39(2):8--25, 2011. Google ScholarDigital Library
- Jeanna Neefe Matthews, Drew Roselli, Adam M Costello, Randolph Y Wang, and Thomas E Anderson. Improving the performance of log-structured file systems with adaptive methods, volume 31. ACM, 1997.Google Scholar
- Mendel Rosenblum and John K Ousterhout. The design and implementation of a log-structured file system. ACM Transactions on Computer Systems (TOCS), 10(1):26--52, 1992. Google ScholarDigital Library
- Randolph Y Wang, Thomas E Anderson, and David A Patterson. Virtual log based file systems for a programmable disk. Operating systems review, 33:29--44, 1998. Google ScholarDigital Library
- Peter Scheuermann, Gerhard Weikum, and Peter Zabback. Data partitioning and load balancing in parallel disk systems. The VLDB Journal---The International Journal on Very Large Data Bases, 7(1):48--66, 1998. Google ScholarDigital Library
- Michael Stonebraker, Robert Devine, Marcel Kornacker, Witold Litwin, Avi Pfeffer, Adam Sah, and Carl Staelin. An economic paradigm for query processing and data migration in mariposa. In Proceedings of 3rd International Conference on Parallel and Distributed Information Systems, pages 58--67. IEEE, 1994. Google ScholarDigital Library
- J. Bucy, J. Schindler, S. Schlosser, and G. Ganger. The disksim simulation environment version 4.0 reference manual. Technical Report CMU-PDL-08-101, Carnegie Mellon University, May 2008.Google Scholar
- EMC Corporation. Emc clariion raid 6 technology: A detailed review. http://www.emc.com/collateral/hardware/white-papers/h2891-clariion-raid-6.paf, July 2007.Google Scholar
- Chih-Shing Tau and Tzone-I Wang. Efficient parity placement schemes for tolerating triple disk failures in raid architectures. In Advanced Information Networking and Applications, 2003. AINA 2003. 17th International Conference on, pages 132--137. IEEE, 2003. Google ScholarDigital Library
- Fiu-homes. http://iotta.snia.org/traces/391, 2011.Google Scholar
Index Terms
- OPS: an optimized partial stripe write scheme to improve performance of XOR-based disk arrays tolerating triple disk failures
Recommendations
HPDA: A hybrid parity-based disk array for enhanced performance and reliability
Flash-based Solid State Drive (SSD) has been productively shipped and deployed in large scale storage systems. However, a single flash-based SSD cannot satisfy the capacity, performance and reliability requirements of the modern storage systems that ...
Reducing Storage Overhead with Small Write Bottleneck Avoiding in Cloud RAID System
GRID '12: Proceedings of the 2012 ACM/IEEE 13th International Conference on Grid ComputingCloud storage systems commonly use replication of stored data sets to ensure high reliability and availability. However, the high storage overhead of replication becomes increasingly unacceptable with the explosive growth of data stored in cloud. Some ...
ACS: an alternate coding scheme to improve degrade read performance for SSD-based RAID5 systems
To guarantee high performance and reliability, storage systems require better devices and data redundancy schemes, e.g., SSD-based RAID5. However, failures in the large-scale storage systems are common. In order to serve requests on a failed node, the SSD-...
Comments