Abstract
The rapid growth of IoT triggered the exponential growth of generated data transferred to/from Cloud and emerging Edge servers. As a result, recent projections forecast that the DRAM subsystem will soon be responsible for more than 40% of the overall power consumption within most servers [1]. One of the reasons for the high energy consumed by the DRAM devices is the usage of pessimistic DRAM operating parameters, such as voltage, refresh rate and timing parameters, set by the vendors. Vendors use these parameters to handle possible failures induced by the charge leakage and cell-to-cell interference. Moreover, such failures prevent scaling further scaling the size of DRAM cells. This reality has led researchers to question if such pessimistic parameters can be relaxed and if the induced failures can be handled at the hardware or software level. In this chapter, we discuss the challenges related to the DRAM reliability and present a systematic study on exceeding the conservative operating DRAM margins to improve the energy efficiency of Edge servers. We demonstrate a machine learning-based technique that enables us to scale down the DRAM operating parameters and the hardware/software stack that handles all the induced failures.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
B. Giridhar, M. Cieslak, D. Duggal, R. Dreslin-ski, H. M. Chen, R. Patti, B. Hold, C. Chakrabarti, T. Mudge, D. Blaauw, Exploring dram organizations for energy-efficient and resilient exascale memories, in Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis (ACM, 2013), p. 23
K. K. Chang, A. Kashyap, H. Hassan, S. Ghose, K. Hsieh, D. Lee, T. Li, G. Pekhimenko, S. Khan, and O. Mutlu, Understanding latency variation in modern dram chips: Experimental characterization, analysis, and optimization, SIGMETRICS Perform. Eval. Rev., vol. 44, no. 1, pp. 323–336 2016. [Online]. Available: http://doi.acm.org/10.1145/2964791.2901453
S. Khan, D. Lee, Y. Kim, A. R. Alameldeen, C. Wilkerson, and O. Mutlu, The efficacy of error mitigation techniques for dram retention failures: a comparative experimental study,” SIGMETRICS Perform. Eval. Rev., vol. 42, no. 1, pp. 519–532, 2014. [Online]. Available: http://doi.acm.org/10.1145/2637364.2592000
S. Khan, C. Wilkerson, Z. Wang, A. R. Alameldeen, D. Lee, O. Mutlu, Detecting and mitigating data-dependent dram failures by exploiting current memory content, in Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, ser. MICRO-50’17 (ACM, New York, 2017), pp. 27–40 [Online]. Available: http://doi.acm.org/10.1145/3123939.3123945
B. Keeth, R.J. Baker, B. Johnson, F. Lin, DRAM Circuit Design: Fundamental and High-Speed Topics, 2nd edn. (Wiley-IEEE Press, 2007)
X.-C. Wu, T. Sherwood, F. T. Chong, Y. Li, Protecting page tables from rowhammer attacks using monotonic pointers in dram true-cells, in Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, ser. ASPLOS ’19. New York, NY, USA: Association for Computing Machinery, 2019, pp. 645–657. [Online]. Available: https://doi.org/10.1145/3297858.3304039
L.Mukhanov, D.S.Nikolopoulos, G.Karakonstantis, Dstress: Automatic synthesis of dram reliability stress viruses using genetic algorithms, in 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), 2020, pp. 298–312
L.-T. Wang, C.-W. Wu, X. Wen, VLSI Test Principles and Architectures: Design for Testability (Systems on Silicon) (Morgan Kaufmann Publishers Inc., San Francisco, 2006)
M. Jung, D. M. Mathew, C. C. Rheinlander, C. Weis, N. Wehn, A platform to analyze DDR3 dram’s power and retention time, IEEE Des. Test. 34(4), 52–59 (2017) [Online]. Available: https://doi.org/10.1109/MDAT.2017.2705144
K. K. Chang, A. G. Yauglikcci, S. Ghose, A. Agrawal, N. Chatterjee, A. Kashyap, D. Lee, M. O’Connor, H. Hassan, O. Mutlu, Understanding reduced-voltage operation in modern dram devices: experimental characterization, analysis, and mechanisms, in Proc. ACM Meas. Anal. Comput. Syst., vol. 1, no. 1, pp. 10:1–10:42, 2017. [Online]. Available: http://doi.acm.org/10.1145/3084447
H. David, C. Fallin, E. Gorbatov, U. R. Hanebutte, O. Mutlu, Memory power management via dynamic voltage/frequency scaling, in Proceedings of the 8th ACM International Conference on Autonomic Computing, ser. ICAC’11 (ACM, New York, 2011), pp. 31–40 [Online]. Available: http://doi.acm.org/10.1145/1998582.1998590
T. Hamamoto, S. Sugiura, S. Sawada, On the retention time distribution of dynamic random access memory (dram). IEEE Transactions on Electron Devices 45(6), 1300–1309 (1998)
J. Liu, B. Jaiyen, Y. Kim, C. Wilkerson, and O. Mutlu, “An experimental study of data retention behavior in modern dram devices: Implications for retention time profiling mechanisms,” in Proceedings of the 40th Annual International Symposium on Computer Architecture, ser. ISCA ’13. New York, NY, USA: ACM, 2013, pp. 60–71. [Online]. Available: http://doi.acm.org/10.1145/2485922.2485928
K. Kim and J. Lee, “A new investigation of data retention time in truly nanoscaled drams,” IEEE Electron Device Letters, vol. 30, no. 8, pp. 846–848, Aug 2009.
L.-T. Wang, C.-W. Wu, X. Wen, VLSI Test Principles and Architectures: Design for Testability (Systems on Silicon) (Morgan Kaufmann Publishers Inc., San Francisco, 2006)
P. Girard, N. Nicolici, X. Wen, Power-Aware Testing and Test Strategies for Low Power Devices, 1st edn. (Springer, Cham, 2009)
V. Sridharan, N. DeBardeleben, S. Blanchard, K.B. Ferreira, J. Stearley, J. Shalf, S. Gurumurthi, Memory errors in modern systems: the good, the bad, and the ugly. in Proceedings of the Twentieth International Conference on Architectural Support for Programming Languages and Operating Systems, ser. ASPLOS’15, 2015, pp. 297–310
P.J. Restle, J.W. Park, B.F. Lloyd, Dram variable retention time. in 1992 International Technical Digest on Electron Devices Meeting, December 1992, pp. 807–810
L. Mukhanov, K. Tovletoglou, H. Vandierendonck, D. S. Nikolopoulos, G. Karakonstantis, Workload-aware dram error prediction using machine learning. in 2019 IEEE International Symposium on Work- load Characterization (IISWC), 2019, pp. 106–118
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Mukhanov, L., Karakonstantis, G. (2022). Improving DRAM Energy-efficiency. In: Karakonstantis, G., Gillan, C.J. (eds) Computing at the EDGE. Springer, Cham. https://doi.org/10.1007/978-3-030-74536-3_5
Download citation
DOI: https://doi.org/10.1007/978-3-030-74536-3_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-74535-6
Online ISBN: 978-3-030-74536-3
eBook Packages: Computer ScienceComputer Science (R0)