Skip to main content
Log in

Low-Latency Online Multiplier with Reduced Activities and Minimized Interconnect for Inner Product Arrays

  • Published:
Journal of Signal Processing Systems Aims and scope Submit manuscript

Abstract

Multiplication is indispensable and is one of the core operations in many modern applications including signal processing and neural networks. Conventional right-to-left (RL) multiplier extensively contributes to the power consumption, area utilization and critical path delay in such applications. This paper proposes a low latency multiplier based on online or left-to-right (LR) arithmetic which can increase throughput and reduce latency by digit-level pipelining. Online arithmetic enables overlapping successive operations regardless of data dependency because of the most significant digit first mode of operation. To produce most significant digit first, it uses redundant number system and we can have a carry-free addition, therefore, the delay of the arithmetic operation is independent of operand bit width. The operations are performed digit by digit serially from left to right which allows gradual increase in the slice activities making it suitable for implementation on reconfigurable devices. Serial nature of the online algorithm and gradual increment/decrement of active slices minimize the interconnects and signal activities resulting in overall reduction of area and power consumption. We present online multipliers with; both inputs in serial, and one in serial and one in parallel. Pipelined and non-pipelined designs of the proposed multipliers have been synthesized with GSCL 45nm technology on Synopsys Design Compiler. Thorough comparative analysis has been performed using widely used performance metrics. The results show that the proposed online multipliers outperform the RL multipliers.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14

Similar content being viewed by others

Data Availability

Not Applicable.

References

  1. Liu, W., Qian, L., Wang, C., Jiang, H., Han, J., & Lombardi, F. (2017). Design of approximate radix-4 booth multipliers for error-tolerant computing. IEEE Transactions on Computers, 66(8), 1435–1441.

    Article  MathSciNet  MATH  Google Scholar 

  2. Lin, J.-F., Chan, C.-Y., & Yu, S.-W. (2019). Novel low voltage and low power array multiplier design for iot applications. Electronics, 8(12), 1429.

    Article  Google Scholar 

  3. Ercegovac, M. D. (2017). On left-to-right arithmetic. In 2017 51st Asilomar Conference on Signals, Systems, and Computers (pp. 750–754). IEEE.

  4. Ercegovac, M. D. (2020). On reducing module activities in online arithmetic operations. In 2020 54th Asilomar Conference on Signals, Systems, and Computers (pp. 524–528). IEEE.

  5. Villalba, J., Lang, T., & Hormigo, J. (2011). Radix-2 multioperand and multiformat streaming online addition. IEEE Transactions on Computers, 61(6), 790–803.

    Article  MathSciNet  MATH  Google Scholar 

  6. Elshafei, A.-R. (2009). Hardware online multiplication-division: A design and performance study. PhD thesis, King Fahd University of Petroleum and Minerals

  7. Huang, Z., & Ercegovac, M. D. (2001). FPGA implementation of pipelined on-line scheme for 3-d vector normalization. In The 9th Annual IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM’01) (pp. 61–70). IEEE.

  8. Galli, R. (2001). Design and evaluation of on-line arithmetic modules and networks for signal processing applications on FPGAS. Master’s thesis, Oregon State University.

  9. Sinky, M. H., Tenca, A. F., Shantilal, A. C., & Lucchese, L. (2004). Design of a color image processing algorithm using online arithmetic modules. In Advanced Signal Processing Algorithms, Architectures, and Implementations XIV (Vol. 5559, pp. 79–90). International Society for Optics and Photonics.

  10. Zhao, Y., Wickerson, J., & Constantinides, G. A. (2016). An efficient implementation of online arithmetic. In 2016 International Conference on Field-Programmable Technology (FPT) (pp. 69–76). IEEE.

  11. Shi, K., Boland, D., & Constantinides, G. A. (2014). Efficient FPGA implementation of digit parallel online arithmetic operators. In 2014 International Conference on Field-Programmable Technology (FPT) (pp. 115–122). IEEE.

  12. Usman, M., Lee, J.-A., & Ercegovac, M. D. (2021). Multiplier with reduced activities and minimized interconnect for inner product arrays. In 2021 55th Asilomar Conference on Signals, Systems, and Computers (pp. 1–5). IEEE.

  13. Tangtrakul, A., Yeung, B., & Cook, T. A. (1996). Signed-digit online floating-point arithmetic for FPGAS. In High-Speed Computing, Digital Signal Processing, and Filtering Using Reconfigurable Logic (Vol. 2914, pp. 2–13). International Society for Optics and Photonics.

  14. Dormiani, P., Omoto, D., Adharapurapu, P., & Ercegovac, M. D. (2005). A design of online scheme for evaluation of multinomials. In Advanced Signal Processing Algorithms, Architectures, and Implementations XV (Vol. 5910, p. 59100). International Society for Optics and Photonics.

  15. Ercegovac, M. D., & Lang, T. (1987). On-the-fly conversion of redundant into conventional representations. IEEE Transactions on Computers, 7, 895–897.

    Article  Google Scholar 

  16. Ercegovac, M. D., & Lang, T. (2004). Digital Arithmetic. Morgan Kaufmann Publishers, San Francisco, CA, USA.

  17. Usman, M. (2022). Energy-efficient online arithmetic in domain-specific accelerators for deep learning applications. PhD thesis, Chosun University.

  18. Bewick, G. W. (1994). Fast multiplication: Algorithms and implementation. PhD thesis, Stanford University.

  19. Baugh, C. R., & Wooley, B. A. (1973). A two’s complement parallel array multiplication algorithm. IEEE Transactions on Computers, 100(12), 1045–1047.

    Article  MATH  Google Scholar 

  20. Horowitz, M., Indermaur, T., & Gonzalez, R. (1994). Low-power digital design. In Proceedings of 1994 IEEE Symposium on Low Power Electronics (pp. 8–11). IEEE.

Download references

Acknowledgements

This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF-2020R1I1A3063857). The EDA tool was supported by the IC Design Education Center (IDEC), Korea.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jeong-A. Lee.

Ethics declarations

Conflict of Interest

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (PDF 248 KB)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Usman, M., D. Ercegovac, M. & Lee, JA. Low-Latency Online Multiplier with Reduced Activities and Minimized Interconnect for Inner Product Arrays. J Sign Process Syst 95, 777–796 (2023). https://doi.org/10.1007/s11265-023-01856-w

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11265-023-01856-w

Keywords

Navigation