Thread Weaving: Static Resource Scheduling for Multithreaded High-Level Synthesis

Authors: Hsuan Hsiao (University of Toronto), Jason Anderson (University of Toronto)

DAC '19: Proceedings of the 56th Annual Design Automation Conference 2019, June 2019, Article No.: 2, Pages 1–6, https://doi.org/10.1145/3316781.3317924

Published: 02 June 2019

ABSTRACT

In high-level synthesis (HLS), software multithreading constructs can be used to explicitly specify coarse-grained parallelism for multiple accelerators. While software threads on CPUs typically operate independently and in isolation from each other, HLS threads/accelerators are sub-components of one circuit. Since these components generally reside in the same clock domain, we can schedule their execution statically to avoid shared-resource contention among threads. We propose thread weaving, a technique that statically interleaves requests from different threads through scheduling constraints. With the guarantee of a contention-free schedule, we eliminate replication/arbitration of shared resources, reducing the area footprint of the circuit and improving its maximum operating frequency (Fmax).
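
As a rough illustration of what a statically interleaved, contention-free schedule can look like, the sketch below assigns each thread a fixed cycle slot on one shared single-port resource. This is not the paper's algorithm: the round-robin slot ownership, the function name `weave`, and the input format (per-thread lists of earliest request cycles) are all assumptions made for illustration only.

```python
from typing import Dict, List


def weave(earliest: Dict[str, List[int]]) -> Dict[str, List[int]]:
    """Illustrative static interleaving (hypothetical, not the paper's method).

    earliest maps a thread name to the strictly ascending cycles at which that
    thread would like to access one shared resource. Thread i (in sorted name
    order) may only use cycles c with c % n == i, where n is the thread count,
    so two threads can never be scheduled in the same cycle.
    """
    threads = sorted(earliest)
    n = len(threads)
    schedule: Dict[str, List[int]] = {}
    for slot, name in enumerate(threads):
        shift = 0  # delay accumulated so far; it pushes back later requests
        cycles: List[int] = []
        for c in earliest[name]:
            c += shift
            wait = (slot - c) % n  # cycles until this thread's next owned slot
            cycles.append(c + wait)
            shift += wait
        schedule[name] = cycles
    return schedule


# Two threads that both want the resource at cycles 0, 1 and 4.
print(weave({"t0": [0, 1, 4], "t1": [0, 1, 4]}))
# {'t0': [0, 2, 6], 't1': [1, 3, 7]}
```

Because every cycle belongs to exactly one thread in this sketch, no arbiter is needed in front of the resource, which mirrors the motivation given in the abstract. A real scheduler would fold such constraints into its scheduling formulation rather than fixing the slots up front.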

Index Terms
1. Hardware
  1. Electronic design automation
2. Software and its engineering
  1. Software notations and tools

Published in

DAC '19: Proceedings of the 56th Annual Design Automation Conference 2019
June 2019
1378 pages
ISBN: 9781450367257
DOI: 10.1145/3316781

Copyright © 2019 ACM
Publisher
Association for Computing Machinery
New York, NY, United States
Qualifiers
- research-article
- Research
- Refereed limited
Conference

Acceptance Rates
Overall Acceptance Rate: 1,770 of 5,499 submissions, 32%
