DOI: 10.1145/3490422.3510453
keynote

The Virtuous Cycles of Determinism: Programming Groq's Tensor Streaming Processor

Published: 11 February 2022

ABSTRACT

FPGAs and other 2D and 3D spatial computing fabrics share several common characteristics, e.g., a deterministic computing model with distributed memories, but also differ along important dimensions, e.g., granularity and communication infrastructure. This talk will position Groq's Tensor Streaming Processor (TSP) chip relative to other spatial computing fabrics, highlighting common aspects of programming models for such spatial computing engines as well as some of the unique characteristics of the TSP architecture and its programming model. One common characteristic this talk will focus on is determinism, specifically the use of static scheduling to know at compile time exactly how many cycles a program will take to execute, regardless of the specific values of the input data to the TSP chip. Groq's TSP architecture is based on a parallel collection of coarse-grain processing units, including matrix multiplication or convolution (MXM), vector operations (VXM), arbitrary vector transformations (SXM), and various numerical operations. Data originates from a memory read (MEM), is chained through a tensor pipeline of one or more computing blocks, and is terminated by a memory write (MEM). The chip can operate on in-flight tensors as they are produced and consumed, aggressively exploiting data-flow locality. A variety of deterministic computing models will be surveyed as we seek abstractions that support rapid multiplexing between multiple workloads that can be co-resident on TSP chips, with scheduling and coordination facilitated by determinism.
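
The compile-time cycle counting described in the abstract can be illustrated with a toy model. The sketch below is not Groq's compiler or API: the latency table, the `Slot` record, and the `schedule` and `total_cycles` helpers are all hypothetical, and it assumes every unit is fully pipelined and accepts one vector per cycle. It shows only that, once unit latencies are fixed, the cycle count of a MEM read, compute, MEM write chain follows from the program's structure and never from the tensor values.

```python
from dataclasses import dataclass

# Hypothetical per-unit latencies in cycles. These numbers are illustrative
# assumptions, not figures from any Groq documentation.
LATENCY = {"MEM_READ": 5, "MXM": 20, "VXM": 4, "SXM": 6, "MEM_WRITE": 5}


@dataclass(frozen=True)
class Slot:
    unit: str
    start: int  # cycle at which the first vector enters the unit
    end: int    # cycle at which the last vector leaves the unit


def schedule(chain, n_vectors):
    """Statically schedule a chained pipeline that streams n_vectors vectors.

    Assumes each unit is fully pipelined (one vector accepted per cycle) and
    that a unit starts as soon as the first vector leaves the previous one.
    """
    slots = []
    t = 0
    for unit in chain:
        latency = LATENCY[unit]
        slots.append(Slot(unit, t, t + latency + n_vectors - 1))
        t += latency  # first vector reaches the next unit after this latency
    return slots


def total_cycles(chain, n_vectors):
    """Cycle count of the whole chain; note that no tensor data is passed in."""
    return schedule(chain, n_vectors)[-1].end


chain = ["MEM_READ", "MXM", "VXM", "MEM_WRITE"]
for slot in schedule(chain, n_vectors=8):
    print(slot)
print("total cycles:", total_cycles(chain, n_vectors=8))
```

Because `total_cycles` takes only the chain and the tensor length, two runs with different input data get the same schedule, which is the property the abstract relies on for coordinating co-resident workloads.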


        • Published in

          FPGA '22: Proceedings of the 2022 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays
          February 2022
          211 pages
          ISBN:9781450391498
          DOI:10.1145/3490422

          Copyright © 2022 Owner/Author

          Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

          Publisher

          Association for Computing Machinery

          New York, NY, United States


          Acceptance Rates

          Overall Acceptance Rate: 125 of 627 submissions, 20%