Direct GPU/FPGA communication Via PCI express

Bittner, Ray; Ruf, Erik; Forin, Alessandro

doi:10.1007/s10586-013-0280-9

Direct GPU/FPGA communication Via PCI express

Published: 08 June 2013

Volume 17, pages 339–348, (2014)
Cite this article

Cluster Computing Aims and scope Submit manuscript

Ray Bittner¹,
Erik Ruf¹ &
Alessandro Forin¹

2208 Accesses
33 Citations
Explore all metrics

Abstract

We describe a mechanism for connecting GPU and FPGA devices directly via the PCI Express bus, enabling the transfer of data between these heterogeneous computing units without the intermediate use of system memory. We evaluate the performance benefits of this approach over a range of transfer sizes, and demonstrate its utility in a computer vision application. We find that bypassing system memory yields improvements as high as 2.2× in data transfer speed, and 1.9× in application performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

VerCoLib: Fast and Versatile Communication for FPGAs via PCI Express

Article 16 July 2019

Oğuzhan Sezenlik, Sebastian Schüller & Joachim K. Anlauf

VerCoLib: Fast and Versatile Communication for FPGAs via PCI Express

Heterogeneous Computing Utilizing FPGAs

Article 31 May 2018

Marc Reichenbach, Philipp Holzinger, … Dietmar Fey

References

Bittner, R.: Speedy bus mastering PCI express. In: 22nd International Conference on Field Programmable Logic and Applications (2012)
Google Scholar
Goldhammer, A., Ayer, J. Jr.: Understanding performance of PCI express systems. Xilinx WP350 (Sept. 2008)
Khronos Group: OpenCL: the open standard for parallel programming of heterogeneous systems. Available at: http://www.khronos.org/opencl/
Khronos Group: OpenCL API registry. Available at: http://www.khronos.org/registry/cl
Microsoft Corporation: “DirectCompute”. Available at: http://blogs.msdn.com/b/chuckw/archive/2010/07/14//directxompute.aspx
nVidia Corporation: nVidia CUDA API reference manual, version 4.1. Available at: http://ww.nvidia.com/CUDA
nVidia Corporation: nVidia CUDA C programming guide, version 4.1. Available at: http://ww.nvidia.com/CUDA
PCI express base specification, PCI SIG: Available at http://www.pcisig.com/specifications/pciexpress
Whitted, T., Kajiya, J., Ruf, E., Bittner, R.: Embedded function composition. In: Proceedings of the Conference on High Performance Graphics (2009)
Google Scholar
PLDA Corporation: http://www.plda.com/prodetail.php?pid=175
Xilinx Corporation: PCI express. Available at: http://www.xilinx.com/technology/protocols/pciexpress.htm
nVidia GPUDirect: http://developer.nvidia.com/gpudirect
Oberg, J., Eguro, K., Bittner, R., Forin, A.: Random decision tree body part recognition using FPGAs. In: International Conference on Field Programmable Logic and Applications, August (2012)
Google Scholar
da Silva, B., Braeken, A., D’Hollander, E., Touhafi, A., Cornelis, J.G., Lemiere, J.: Performance and toolchain of a combined GPU/FPGA desktop. In: 21st International Symposium on Field Programmable Gate Arrays (FPGA’13), Monterey, CA, February (2013)
Google Scholar
Rossetti, D., et al.: GPU peer-to-peer techniques applied to a cluster interconnect. In: Proceeding of the Third Workshop on Communication Architecture for Scalable Systems (2013)
Google Scholar

Download references

Author information

Authors and Affiliations

Microsoft Research, Redmond, USA
Ray Bittner, Erik Ruf & Alessandro Forin

Authors

Ray Bittner
View author publications
You can also search for this author in PubMed Google Scholar
Erik Ruf
View author publications
You can also search for this author in PubMed Google Scholar
Alessandro Forin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Erik Ruf.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bittner, R., Ruf, E. & Forin, A. Direct GPU/FPGA communication Via PCI express. Cluster Comput 17, 339–348 (2014). https://doi.org/10.1007/s10586-013-0280-9

Download citation

Received: 15 February 2013
Accepted: 13 May 2013
Published: 08 June 2013
Issue Date: June 2014
DOI: https://doi.org/10.1007/s10586-013-0280-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Direct GPU/FPGA communication Via PCI express

Abstract

Access this article

Similar content being viewed by others

VerCoLib: Fast and Versatile Communication for FPGAs via PCI Express

VerCoLib: Fast and Versatile Communication for FPGAs via PCI Express

Heterogeneous Computing Utilizing FPGAs

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Direct GPU/FPGA communication Via PCI express

Abstract

Access this article

Similar content being viewed by others

VerCoLib: Fast and Versatile Communication for FPGAs via PCI Express

VerCoLib: Fast and Versatile Communication for FPGAs via PCI Express

Heterogeneous Computing Utilizing FPGAs

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation