skip to main content
10.1145/2684746.2689110acmconferencesArticle/Chapter ViewAbstractPublication PagesfpgaConference Proceedingsconference-collections
poster

FPGA Acceleration of Irregular Iterative Computations using Criticality-Aware Dataflow Optimizations (Abstract Only)

Authors Info & Claims
Published:22 February 2015Publication History

ABSTRACT

FPGA acceleration of large irregular dataflow graphs is often limited by the long tail distribution of parallelism on fine-grained overlay dataflow architectures. In this paper, we show how to overcome these limitations by exploiting criticality information along compute paths; both statically during graph pre-processing and dynamically at runtime. We statically reassociate the high-fanin dataflow chains by providing faster routes for late arriving inputs. We also perform a fanout decomposition and selective node replication in order to distribute serialization costs across multiple PEs. Additionally, we modify the dataflow firing rule in hardware to prefer critical nodes when multiple nodes are ready for dataflow evaluation. Effectively these transformations reduce the length of the tail in the parallelism profile for these large-scale graphs. Across a range of dataflow benchmarks extracted from Sparse LU factorization, we demonstrate up to 2.5× (mean 1.21×) improvement when using the static pre-processing alone, a 2.4× (mean 1.17×) improvement when using only dynamic optimizations and an overall 2.9× (mean 1.39×) improvement when both static and dynamic optimizations are enabled. These improvements are on top of 3--10× speedups over CPU implementations without our transformation enabled.

References

  1. T. A. Davis and E. Palamadai Natarajan, Algorithm 907: KLU, A Direct Sparse Solver for circuit simulation problems, ACM Transactions on Mathematical Software, Volume 37 Number 3, Sept. 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. J.B. Dennis and D.P. Misunas, A preliminary architecture for a basic data-flow processor, SIGARCH Computer Architecture News, Volume 3 Number 4, Dec. 1974. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Nachiket Kapre, SPICE2 -- A Spatial Parallel Architecture for Accelerating the SPICE Circuit Simulator, PhD thesis, California Institute of Technology, Pasadena, 2010.Google ScholarGoogle Scholar
  4. Nachiket Kapre and Andre DeHon, Parallelizing sparse Matrix Solve for SPICE circuit simulation using FPGAs, Proceedings of the International Conference on Field-Programmable Technology, Dec. 2010.Google ScholarGoogle Scholar
  5. Siddhartha and Nachiket Kapre, Breaking Sequential Dependencies in FPGA-based Sparse LU Factorization, Proceedings of the IEEE Symposium on Field Programmable Custom Computing Machines, Mar. 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. FPGA Acceleration of Irregular Iterative Computations using Criticality-Aware Dataflow Optimizations (Abstract Only)

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        FPGA '15: Proceedings of the 2015 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays
        February 2015
        292 pages
        ISBN:9781450333153
        DOI:10.1145/2684746

        Copyright © 2015 Owner/Author

        Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 22 February 2015

        Check for updates

        Qualifiers

        • poster

        Acceptance Rates

        FPGA '15 Paper Acceptance Rate20of102submissions,20%Overall Acceptance Rate125of627submissions,20%
      • Article Metrics

        • Downloads (Last 12 months)0
        • Downloads (Last 6 weeks)0

        Other Metrics