A massively parallel keypoint detection and description (MP-KDD) algorithm for high-speed vision chip

Shi, Cong; Yang, Jie; Liu, LiYuan; Wu, NanJian; Wang, ZhiHua

doi:10.1007/s11432-014-5174-9

A massively parallel keypoint detection and description (MP-KDD) algorithm for high-speed vision chip

面向高速视觉芯片的大规模并行特征点检测和描述算法

Research Paper
Published: 08 August 2014

Volume 57, pages 1–12, (2014)
Cite this article

Science China Information Sciences Aims and scope Submit manuscript

Cong Shi^1,2,
Jie Yang¹,
LiYuan Liu¹,
NanJian Wu¹ &
…
ZhiHua Wang^2,3

2 Citations
Explore all metrics

Abstract

This paper proposes a massively parallel keypoint detection and description (MP-KDD) algorithm for the vision chip with parallel array processors. The MP-KDD algorithm largely reduces the computational overhead by removing all floating-point and multiplication operations while preserving the currently popular SIFT and SURF algorithm essence. The MP-KDD algorithm can be directly and effectively mapped onto the pixel-parallel and row-parallel array processors of the vision chip. The vision chip architecture is also enhanced to realize direct memory access (DMA) and random access to array processors so that the MP-KDD algorithm can be executed more effectively. An FPGA-based vision chip prototype is implemented to test and evaluate our MP-KDD algorithm. Its image processing speed reaches 600–760 fps with high accuracy for complex vision applications, such as scene recognition.

摘要

本文提出了一种面向视觉芯片并行图像处理阵列的高效图像特征点提取和描述算法。该算法基于SIFT特征点检测及SURF特征点描述, 但简化避免了浮点运算以及乘除法操作, 极大地节约了硬件开销和处理时间。该算法可以直接、高效地映射到视觉芯片的像素级并行和行并行阵列处理器。本文实现了基于FPGA的视觉原型, 在其上成功测试了所提出的算法, 达到了600–700帧/秒的较高速度。

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Design Methodology for the Next Generation Real-Time Vision Processors

Image Feature Extraction Acceleration

An FPGA accelerator for PatchMatch multi-view stereo using OpenCL

Article 15 February 2018

References

Ishikawa M, Ogawa K, Komoro T, et al. A CMOS vision chip with SIMD processing element array for 1 ms image processing. In: Proceedings of IEEE Solid-State Circuits Conference Digest of Technical Papers (ISSCC), San Francisco, 1999. 206–207
Google Scholar
Shi C, Yang J, Han Y, et al. A 1000 fps vision chip based on a dynamically reconfigurable hybrid architecture comprising a PE array and self-organizing map neural network. In: Proceedings of IEEE Solid-State Circuits Conference Digest of Technical Papers (ISSCC), San Francisco, 2014. 128–129
Google Scholar
Komoro T, Ishii I, Ishikawa M, et al. A digital vision chip specialized for high-speed target tracking. IEEE Trans Electron Dev, 2003, 50: 191–199
Article Google Scholar
Dudek P, Hicks P J. A general-purpose processor-per-pixel analog SIMD vision chip. IEEE Trans Circuits Syst I-Regul Pap, 2005, 52: 13–20
Article Google Scholar
Ishii I, Yamamoto K, Kubozono M. Higher order autocorrelation vision chip. IEEE Trans Electron Dev, 2006, 53: 1797–1804
Article Google Scholar
Miao W, Lin Q Y, Zhang W C, et al. A programmable SIMD vision chip for real-time vision applications. IEEE J Solid-State Circuits, 2008, 43: 1470–1479
Article Google Scholar
Dubois J, Ginhac D, Paindavoine M, et al. A 10 000 fps CMOS sensor with massively parallel image processing. IEEE J Solid-State Circuits, 2008, 43: 706–717
Article Google Scholar
Yamashita H, Sodini C. A CMOS imager with a programmable bit-serial column-parallel SIMD/MIMD processor. IEEE Trans Electron Dev, 2009, 56: 2534–2545
Article Google Scholar
Lin Q Y, Miao W, Zhang W C. A 1000 frame/s programmable vision chip with variable resolution and row-pixel-mixed parallel image processors. Sensors, 2009, 9: 5933–5951
Article Google Scholar
Cheng C C, Lin C H, Li C T, et al. iVisual: an intelligent visual sensor SoC with 2790 fps image sensor and 205 GOPS/W vision processor. IEEE J Solid-State Circuits, 2009, 44: 127–135
Article Google Scholar
Zhang W C, Fu Q Y, Wu N J. A programmable vision chip based on multiple levels of parallel processors. IEEE J Solid-State Circuits, 2011, 46: 2132–2147
Article Google Scholar
Lopich A, Dudek P. A SIMD cellular processor array vision chip with asynchronous processing capabilities. IEEE Trans Circuits Syst I-Regul Pap, 2011, 58: 2420–2431
Article MathSciNet Google Scholar
Cottini N, Gottardi M, Massari N, et al. A 33 μW 64×64 pixel vision sensor embedding robust dynamic background subtraction for event detection and scene interpretation. IEEE J Solid-State Circuits, 2013, 48: 850–863
Article Google Scholar
Gauglitz S, Höllerer T, Turk M. Evaluation of interest point detectors and feature descriptors for visual tracking. Int J Comput Vis, 2011, 94: 335–360
Article MATH Google Scholar
Lowe D G. Distinctive image features from scale-invariant keypoints. Int J Comput Vis, 2004, 60: 91–110
Article Google Scholar
Bay H, Ess A, Tuytelaars T, et al. Speeded-up robust features (SURF). Comput Vis Image Underst, 2008, 110: 346–359
Article Google Scholar
Bonato V, Marques E, Constantinides G A. A parallel hardware architecture for scale and rotation invariant feature detection. IEEE Trans Circuits Syst Video Technol, 2008, 18: 1703–1712
Article Google Scholar
Kim K, Lee S, Kim J Y, et al. A configurable heterogeneous multicore architecture with cellular neural network for real-time object recognition. IEEE Trans Circuits Syst Video Technol, 2009, 19: 1612–1622
Article Google Scholar
Zhang W L, Liu L B, Yin S Y, et al. An efficient VLSI architecture of speeded-up robust feature extraction for high resolution and high frame rate video. Sci China Inf Sci, 2013, 56: 072402
MathSciNet Google Scholar
Zhou Y F, Cao Z X, Qin Q, et al. A 1000 FPS high speed CMOS image sensor with low noise global shutter pixels. Sci China Inf Sci, 2014, 57: 042405
Google Scholar
Csurka C, Dance C R, Fan L, et al. Visual categorization with bags of keypoints. In: Proceedings of ECCV International Workshop on Statistical Learning in Computer Vision, Prague, 2004. 1–7
Google Scholar

Download references

Author information

Authors and Affiliations

State Key Laboratory for Superlattices and Microstructures, Institute of Semiconductors, Chinese Academy of Sciences, Beijing, 100083, China
Cong Shi, Jie Yang, LiYuan Liu & NanJian Wu
Department of Electronic Engineering, Tsinghua University, Beijing, 100084, China
Cong Shi & ZhiHua Wang
Institute of Microelectronics, Tsinghua University, Beijing, 100084, China
ZhiHua Wang

Authors

Cong Shi
View author publications
You can also search for this author in PubMed Google Scholar
Jie Yang
View author publications
You can also search for this author in PubMed Google Scholar
LiYuan Liu
View author publications
You can also search for this author in PubMed Google Scholar
NanJian Wu
View author publications
You can also search for this author in PubMed Google Scholar
ZhiHua Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to NanJian Wu.

Electronic supplementary material

Supplementary material, approximately 55.8 KB.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Shi, C., Yang, J., Liu, L. et al. A massively parallel keypoint detection and description (MP-KDD) algorithm for high-speed vision chip. Sci. China Inf. Sci. 57, 1–12 (2014). https://doi.org/10.1007/s11432-014-5174-9

Download citation

Received: 26 March 2014
Accepted: 08 May 2014
Published: 08 August 2014
Issue Date: October 2014
DOI: https://doi.org/10.1007/s11432-014-5174-9

Keywords

关键词

102402

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A massively parallel keypoint detection and description (MP-KDD) algorithm for high-speed vision chip

Abstract

摘要

Access this article

Similar content being viewed by others

A Design Methodology for the Next Generation Real-Time Vision Processors

Image Feature Extraction Acceleration

An FPGA accelerator for PatchMatch multi-view stereo using OpenCL

References

Author information

Authors and Affiliations

Corresponding author

Electronic supplementary material

Supplementary material, approximately 55.8 KB.

Rights and permissions

About this article

Cite this article

Keywords

关键词

Navigation

A massively parallel keypoint detection and description (MP-KDD) algorithm for high-speed vision chip

Abstract

摘要

Access this article

Similar content being viewed by others

A Design Methodology for the Next Generation Real-Time Vision Processors

Image Feature Extraction Acceleration

An FPGA accelerator for PatchMatch multi-view stereo using OpenCL

References

Author information

Authors and Affiliations

Corresponding author

Electronic supplementary material

Supplementary material, approximately 55.8 KB.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

关键词

Search

Navigation