ABSTRACT
In this paper we present TEAPOT, a full system GPU simulator, whose goal is to allow the evaluation of the GPUs that reside in mobile phones and tablets. To this extent, it has a cycle accurate GPU model for evaluating performance, power models for the GPU, the memory subsystem and for OLED screens, and image quality metrics. Unlike prior GPU simulators, TEAPOT supports the OpenGL ES 1.1/2.0 API, so that it can simulate all commercial graphical applications available for Android systems.
To illustrate potential uses of this simulating infrastructure, we perform two case studies. We first turn our attention to evaluating the impact of the OS when simulating graphical applications. We show that the overall GPU power/performance is greatly affected by common OS tasks, such as image composition, and argue that application level simulation is not sufficient to understand the overall GPU behavior. We then utilize the capabilities of TEAPOT to perform studies that trade image quality for energy. We demonstrate that by allowing for small distortions in the overall image quality, a significant amount of energy can be saved.
- J.-M. Arnau, J.-M. Parcerisa, and P. Xekalakis. Boosting Mobile GPU Performance with a Decoupled Access/Execute Fragment Processor. In Proc. of ISCA, pages 84--93, 2012. Google ScholarDigital Library
- A. Bakhoda, G. Yuan, W. Fung, H. Wong, and T. Aamodt. Analyzing CUDA Workloads Using a Detailed GPU Simulator. In Proc. of ISPASS, pages 163--174, 2009.Google ScholarCross Ref
- A. Carroll and G. Heiser. An Analysis of Power Consumption in a Smartphone. In Proc. of USENIX, pages 21--21, 2010. Google ScholarDigital Library
- S. Collange, M. Daumas, D. Defour, and D. Parello. Barra: A Parallel Functional Simulator for GPGPU. In Proc. of MASCOTS, pages 351--360, 2010. Google ScholarDigital Library
- M. Dong, Y.-S. K. Choi, and L. Zhong. Power Modeling of Graphical User Interfaces on OLED Displays. In Proc. of DAC, pages 652--657, 2009. Google ScholarDigital Library
- M. W. Eldridge. Designing Graphics Architectures Around Scalability and Communication. PhD thesis, 2001. Google ScholarDigital Library
- B. Juurlink, I. Antochi, D. Crisu, S. Cotofana, and S. Vassiliadis. GRAAL: A Framework for Low-Power 3D Graphics Accelerators. IEEE Computer Graphics and Applications, 28(4):63--73, 2008. Google ScholarDigital Library
- S. Li, J. H. Ahn, R. D. Strong, J. B. Brockman, D. M. Tullsen, and N. P. Jouppi. McPAT: An Integrated Power, Area, and Timing Modeling Framework for Multicore and Manycore Architectures. In Proc. of MICRO, pages 469--480, 2009. Google ScholarDigital Library
- S. Molnar, M. Cox, D. Ellsworth, and H. Fuchs. A Sorting Classification of Parallel Rendering. IEEE Comput. Graph. Appl., 14(4):23--32, July 1994. Google ScholarDigital Library
- V. Moya, C. Gonzalez, J. Roca, A. Fernandez, and R. Espasa. ATTILA: A Cycle-level Execution-Driven Simulator for Modern GPU Architectures. In Proc. of ISPASS, pages 231--241, 2006.Google Scholar
- K. Pulli, T. Aarnio, K. Roimela, and J. Vaarala. Designing Graphics Programming Interfaces for Mobile Devices. IEEE Comput. Graph. Appl., 25(6):66--75, Nov. 2005. Google ScholarDigital Library
- J. Rasmusson, J. Hasselgren, and T. Akenine-Möller. Exact and Error-Bounded Approximate Color Buffer Compression and Decompression. In Proc. of EUROGRAPHICS Conf. on Graphics Hardware, pages 41--48, 2007. Google ScholarDigital Library
- J. W. Sheaffer, D. Luebke, and K. Skadron. A Flexible Simulation Framework for Graphics Architectures. In Proc. of the EUROGRAPHICS Conf. on Graphics Hardware, pages 85--94, 2004. Google ScholarDigital Library
- A. Shye, B. Scholbrock, and G. Memik. Into the Wild: Studying Real User Activity Patterns to Guide Power Optimizations for Mobile Architectures. In Proc. of MICRO, pages 168--178, 2009. Google ScholarDigital Library
- J. Ström and T. Akenine-Möller. iPACKMAN: High-Quality, Low-Complexity Texture Compression for Mobile Phones. In Proc. of the EUROGRAPHICS Conf. on Graphics Hardware, pages 63--70, 2005. Google ScholarDigital Library
- J. Tang, S. Liu, Z. Gu, C. Liu, and J.-L. Gaudiot. Prefetching in Embedded Mobile Systems Can Be Energy-Efficient. IEEE Comput. Archit. Lett., 10(1):8--11, Jan. 2011. Google ScholarDigital Library
- Z. Wang, A. Bovik, H. Sheikh, and E. Simoncelli. Image Quality Assessment: from Error Visibility to Structural Similarity. IEEE Transactions on Image Processing, 13(4):600--612, 2004. Google ScholarDigital Library
- Android SDK. http://developer.android.com/sdk/index.html.Google Scholar
- Bringing High-End Graphics to Handheld Devices. http://www.nvidia.com/content/PDF/tegra_white_papers/Bringing_High-End_%Graphics_to_Handheld_Devices.pdf.Google Scholar
- Color Space Conversions. http://www.poynton.com/PDFs/coloureq.pdf.Google Scholar
- Composition with Snapdragon. https://developer.qualcomm.com/sites/default/files/composition-with-sna%pdragon.pdf.Google Scholar
- Gallium3D. http://en.wikipedia.org/wiki/Gallium3D/.Google Scholar
- GPUWattch. http://www.gpgpu-sim.org/gpuwattch/.Google Scholar
- Mali-400 MP: A Scalable GPU for Mobile Devices. http://www.highperformancegraphics.org/previous/www_2010/media/Hot3D/HP%G2010_Hot3D_ARM.pdf.Google Scholar
- Mean Squared Error. http://en.wikipedia.org/wiki/Mean_squared_error.Google Scholar
- Microsoft Direct3D. http://en.wikipedia.org/wiki/Microsoft_Direct3D.Google Scholar
- NVIDIA CUDA Programming Guide. http://docs.nvidia.com/cuda/cuda-c-programming-guide/.Google Scholar
- OpenCL. http://www.khronos.org/opencl/.Google Scholar
- OpenGL ES. http://www.khronos.org/opengles/.Google Scholar
- OpenGL Shading Language. http://en.wikipedia.org/wiki/GLSL.Google Scholar
- Qualcomm Adreno 320. http://www.anandtech.com/show/6112/qualcomms-quadcore- snapdragon-s4-apq8064adreno-320-performance-preview.Google Scholar
- Peak signal-to-noise ratio. http://en.wikipedia.org/wiki/Peak_signal-to-noise_ratio.Google Scholar
- PowerVR Technology Overview. http://www.imgtec.com/factsheets/SDK/PowerVR%20Technology%20Overview.1.%0.2e.External.pdf.Google Scholar
- QEMU. http://wiki.qemu.org/Main_Page.Google Scholar
- Tile-Based Deferred Rendering. http://en.wikipedia.org/wiki/Tiled_rendering.Google Scholar
- Tungsten Graphics Shader Infrastructure. http://people.freedesktop.org/ csimpson/gallium-docs/tgsi.html.Google Scholar
- Using hardware acceleration in the Android Emulator. http://developer.android.com/tools/devices/emulator.html#acceleration.Google Scholar
Index Terms
- TEAPOT: a toolset for evaluating performance, power and image quality on mobile graphics systems
Recommendations
Image processing acceleration for intelligent unmanned aerial vehicle on mobile GPU
In this paper, we present an algorithm for providing visually-guided unmanned aerial vehicle (UAV) control using visual information that is processed on a mobile graphic processing unit (GPU). Most real-time machine vision applications for UAVs exploit ...
Wideband Channelization for Software-Defined Radio via Mobile Graphics Processors
ICPADS '13: Proceedings of the 2013 International Conference on Parallel and Distributed SystemsWideband channelization is a computationally intensive task within software-defined radio (SDR). To support this task, the underlying hardware should provide high performance and allow flexible implementations. Traditional solutions use field-...
Texture-Directed Mobile GPU Power Management for Closed-Source Games
HPCC '14: Proceedings of the 2014 IEEE Intl Conf on High Performance Computing and Communications, 2014 IEEE 6th Intl Symp on Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf on Embedded Software and Syst (HPCC,CSS,ICESS)Power consumption and battery life constrain the developments of the mobile platforms. Among the mobile applications, games are the most demanding applications in terms of both computational cost and consumed energy. The power consumption spent on GPU ...
Comments