On the representation of rigid body transformations for accurate registration of free-form shapes

doi:10.1016/S0921-8890(02)00173-2

Robotics and Autonomous Systems

Volume 39, Issue 1, 30 April 2002, Pages 37-52

https://doi.org/10.1016/S0921-8890(02)00173-2 Get rights and content

Abstract

In this paper we consider representation issues of rigid body transformations based on geometric properties of reflected correspondence vectors. A sufficient and necessary representation of rigid body transformations is put forward followed by an accurate and robust algorithm for the registration of free-form surfaces. The algorithm makes full use of global rigid motion constraints derived from the representation of rigid body transformations and can effectively deal with occlusion, noise, and appearance and disappearance of points. A comparative study based on both synthetic data and real images show that the registration method is accurate and robust.

Introduction

Recent developments in electronics and optic devices have enabled depth information to be directly incorporated into image acquisition systems. The availability of range images is leading to a wide range of useful applications, from object recognition to autonomous navigation, to a full integration of computer vision and graphics. However, unavoidable limitations exist in range image acquisition and processing. First, due to physical constraints such as limited field of view of range cameras, the acquisition of object surfaces often requires a number of images to be taken from different viewpoints. Second, these different views need to be put together into a full 3D model. Thus, techniques for image registration and fusion are required so that quality models can be constructed. In this paper, we limit our attention to the registration of two overlapping range images of free-form surfaces which are represented as two sets of unorganised points described in two different co-ordinate frames. Such sets of points typically represent range image acquisition from a stationary camera with a moving object or, say, a sequence of images acquired from a moving robot platform at different time instants.

Many methods have been proposed to solve the registration problem based on techniques such as scatter matrix [11], iterative closest point (ICP) algorithm [3], [5], [27], extreme points [21], crest line [20], reverse calibration [4], interactive method [23], and geometric histogram [1], among many others. Among these methods, the ICP algorithm implements a natural and practical idea and has attracted much attention from the machine vision community since it was independently proposed in 1992 by several researchers. A version of the ICP algorithm was proposed in [3] to register model and scene data sets. This algorithm minimises the squared distance between the transformed points in one co-ordinate frame and the closest points in another. The advantage of this method is that it can be used for any type of object surfaces and it is generally accurate. The disadvantage is that the algorithm assumes that the description of the scene must be a subset of the model and requires a good initialisation of motion parameters.

In [5], another iterative algorithm was proposed with the same purpose. However, this algorithm instead minimises the squared distance from the transformed points in one co-ordinate frame to the tangent planes at the intersection points between the normal vectors at the transformed points and the surfaces in the second frame. It computes the variation of rigid body transformation at each iteration. The advantage of this algorithm is that it does not require point to point correspondences and is generally efficient [19]. The disadvantage is that it requires organised points and accurate estimation of the intersection points where the tangent planes lie. In [27], another version of ICP algorithm was proposed. In order to speed up the search for the closest points, the algorithm uses a K-D tree representation of image data. The advantage of this algorithm is that it can deal with occlusion, appearance and disappearance of points. The disadvantage is that it requires the experimenter to set a threshold for maximum distance for matched points.

A flowchart depicting the steps in the ICP algorithm is shown in Fig. 1. Since 1992, a number of methods have been proposed to improve every step of this algorithm. For instance, in order to improve step 1, the distances between bitangent points [7], bitangent curves [24], surface signatures [26], and spin images [9] were matched. Also, the correlation of spherical harmonic projections [6] were performed for correspondences from which a crude motion can be estimated. In order to improve step 2, colour information [10], normal vector [7], and a number n of invariants [18] associated with each point were incorporated so that distances were defined in 6 or (3+n)-dimensional space rather than just in 3D space.

Step 3 represents the crucial aspect of the algorithm and, in order to improve it, the K-D tree representation of image data [10] and the grid closest point transform [25] were used to speed up the search for closest points. In this step, once possible matching points are found, their correct evaluation (as true or false matches) will determine the overall robustness and accuracy of the registration. For this reason, most work so far on variants of the ICP method have focused on the evaluation of whether matching points represent plausible point correspondences or not. Unfortunately, such evaluation has proven difficult and various schemes have been proposed, highlighted as follows. In [10], a threshold is required for the maximum distance for matched points. In [17], [27], the orientation consistency and the normal vectors associated with the matched points were validated. In [15] and [12], [13], respectively, geometric properties of correspondence vectors and reflected correspondence vectors were used to cope with false matches. In [22], the boundary points are discarded since they are more likely to yield false matches.

Once more accurate sets of correspondences have been established (here more accurate means that points are determined by a method that yields better results than simply taking the point’s nearest neighbour as its correspondence), motion parameters can be estimated using algorithms such as based on the constraint least squares method, quaternion method [3], or dual quaternion method [27]. In practice, due to false matches and noise corrupting the data, the objective function usually has a large number of local minima. Thus, iterative algorithms such as the ICP are very likely to converge to a local minimum. In order to overcome this, a number of techniques can be used with good results, such as simulated annealing as proposed in [16].

A summary of the techniques highlighted above used to improve the standard ICP algorithm is depicted in Table 1. An overall analysis reveals that these techniques are mainly based on local invariants described in a single co-ordinate frame. In theory, local invariants can be extracted from image data but, unfortunately, these invariants are sensitive to noise, occlusion, appearance and disappearance of points. In contrast, invariants defined by points described in different co-ordinate systems are termed global invariants such as the ones used in this paper. Since the estimation of global invariants makes use of redundant data their estimation is more robust to noise, occlusion, appearance and disappearance of points. The method presented in this paper is based on geometric relationships described by two co-ordinate frames bridging the gap between correspondences before and after a rigid motion. This proposed method is consistent with the nature of the data which are normally available leading to accurate and robust registration results.

The standard ICP algorithm uses only a distance constraint to define the position of correspondences and this leads to a number of false matches at every iteration. Thus, the key factor for a successful application of the ICP method is the rejection of false matches. Since false matches are created by the motion, we argue that their elimination must consider the properties of the motion. Thus, in this paper, we first reconsider the representation of rigid body transformations aiming at developing new constraints to evaluate correspondences and eliminate false matches. While the main idea of this paper follows similar reasoning as described in [14] where the projected distance, angle, and projection information are used, in this paper the distance and the projected distance between a point and its reflected correspondence are used. Doing so is justified as such distances are geometrically intuitive and thus can further improve our understanding of rigid body motions which, in turn, lead to the specification of effective constraints for the elimination of false matches.

An earlier version of the extended algorithm presented here has been described in [13]. Also, while the algorithms presented in [12], [13], [15] employ global rigid motion constraints, this paper makes the further distinction between correspondences established by different mappings implied by the traditional ICP criterion. These mappings are one-to-one, many-to-one and one-to-many. To our knowledge, existing ICP-based registration algorithms treat equally all correspondences established by these different mappings. Even though it cannot be said for certain that the one-to-one mapping of established correspondences are plausible, it is certain, however, that the vast majority of the established correspondences by other mappings are not plausible. Thus, such distinction greatly facilitates accurate and robust estimation of the parameters of interest especially for data that are highly corrupted by outliers. This, in turn, reduces the complexity of the process of eliminating false matches.

Similar observation was made in [22] where either point of a correspondence lying on a triangular mesh boundary is considered to yield false matches and is, thus, discarded. However, this assumption does not imply for certain that it is a false match, so at least one correspondence established by many-to-one and one-to-many mappings implies a false match. This paper considers this situation in detail through a comparative study based on both synthetic data and real images. It is shown that the extended algorithm presented here is accurate and robust for the registration of free-form shapes with a small motion and that it is superior to the algorithms presented in [12], [13], [15].

The rest of this paper is structured as follows. Section 2 provides a representation of rigid body transformations derived from the geometric properties of reflected correspondence vectors. Such representation provides explicit constraints to rigid motion forming the basis for the proposed registration algorithm. Section 3 presents the extended registration algorithm, and Section 4 presents experimental results. Finally, Section 5 presents a detailed discussion on some relevant issues in range image registration and some conclusions are drawn.

Section snippets

The representation of rigid body transformations

Generally, the following equation can be used to represent a rigid body transformation: $p ′= Rp + t,$ where $R$ and $t$ are the rigid body rotation matrix and translation vector. The point pair $(p, p ′)$ is called a correspondence representing the same physical point in 3D space where $p$ is a point described in one co-ordinate frame before a transformation and $p ′$ is its correspondence described in another co-ordinate frame after a rigid transformation. The correspondence vector $CV$ of the point pair $(p, p ′)$ is

Analysis of the standard ICP method

From Eq. (1), it is known that when the motion parameters rotation matrix $R$ and the translation vector $t$ are known and the points $(p, p ′)$ are real correspondents, then the equality of that equation is exactly true. However, in practice, real data do not strictly satisfy this constraint and relaxing assumptions are thus required in order to achieve reasonably accurate and efficient registration. Relaxing the equality of Eq. (1) leads to a criterion used by the standard ICP algorithm to establish

Experimental results

In order to validate the algorithm and carry out a comparative study of performance, both the proposed extended RGICP algorithm and the GICP algorithm as described in [13] were implemented and applied to both synthetic data and real range images. Synthetic data are relevant because full knowledge is available of all points and their correspondences before and after a transformation to serve as parameters for error estimation. It is important to stress that, in this paper, we focus on the

Discussion

The experiments highlight a number of significant aspects on applying the proposed geometric constraints and algorithm to image registration. First and more important, the RGICP algorithm has been applied to the images without any knowledge about the distribution of points, occlusion, appearance and disappearance of points, and motion information. No feature extraction or pre-processing were required which makes the algorithm particularly useful for real applications where image acquisition

References (27)

A.P. Ashbrook, R.B. Fisher, C. Robertson, N. Werghi, Finding surface correspondences for object recognition and...
R. Bergevin et al.
Towards a general multi-view registration technique
IEEE Transactions on Pattern Analysis and Machine Intelligence
(1996)
P.J. Besl et al.
A method for registration of 3D shapes
IEEE Transactions on Pattern Analysis and Machine Intelligence
(1992)
G. Blais et al.
Registering multiview range data to create 3D computer objects
IEEE Transactions on Pattern Analysis and Machine Intelligence
(1995)
C.W. Chen et al.
Object modelling by registration of multiple range images
Image and Vision Computing
(1992)
S. Chen, R. Hsu, A.K. Jain, C.R. Mercer, Registration of 3D range views using spherical harmonics, Technical Report,...
J. Feldmar, N. Ayache, F. Betting, 3D–2D projective registration of free-form curves and surfaces, Technical Report,...
G.H. Golub, C.F. Van Loan, Matrix Computations, 3rd Edition, Johns Hopkins University Press, Baltimore, MD,...
D. Huber, O. Carmichael, M. Hebert, 3D map reconstruction from range data, in: Proceedings of IEEE International...
A. Johnson, S.B. Kang, Registration and integration of textured 3D data, Technical Report, CRL 96/4, DEC, Cambridge...

Z. Lin, H. Lee, T. Huang, Finding 3D point correspondences in motion estimation, in: Proceedings of Eighth...

Y. Liu, M.A. Rodrigues, Y. Wang, Developing rigid motion constraints for the registration of free-form shapes, in:...

Y. Liu, M.A. Rodrigues, Analysing the geometric properties of reflected correspondence vectors for the registration of...

Cited by (26)

Track gauge measurement based on model matching using UAV image
2023, Automation in Construction
This study investigates the use of unmanned aerial vehicles (UAVs) for track gauge measurement, as an alternative to traditional methods that rely on specialized equipment mounted on rail inspection vehicles or trains. Such equipment not only consumes valuable rail resources but also poses safety risks to inspectors. The proposed method involves utilizing image data collected by a UAV to generate a point cloud of the railway scene. A hybrid segmentation algorithm based on augmented Region-Growing and improved alpha-shape techniques is then employed to extract rails from chaotic scene data. To determine the track gauge, a pre-built rail Building Information Modeling (BIM) model is discretized into a point cloud template, which is then matched with the extracted rail features using a template-oriented model matching approach. Validation experiments on four groups of data demonstrate the superiority of this method in terms of safety, efficiency, speed, and accuracy.
Constraints for closest point finding
2008, Pattern Recognition Letters
Citation Excerpt :
It is impossible that some possible point matches are of high quality, while their neighbours are of significantly low quality. This unique characteristic sheds light on why the traditional CPC has been widely used in various applications and reminds that we should carefully treat these possible point matches when classifying them into either real or false ones (Liu and Rodrigues, 2002; Liu, 2004; Pulli, 1999; Rodrigues and Liu, 2002; Turk and Levoy, 1994; Zhang, 1994), since such a classification is unlikely always successful due to the gradual changing nature of the quality of these possible point matches. This means that a uniform treatment of these possible point matches is encouraged in the sense of estimating their probability as being real ones (Gold et al., 1998; Liu, 2005, 2006, 2007).
The traditional closest point criterion has been widely used for 3D free form shape matching, object recognition, internet search, computer graphics and medical imaging. However, the rationale of this criterion has not yet been well understood and exploited. In this paper, we apply vector operations and the triangle inequality to carefully analyse this criterion and reveal that this criterion can guarantee that the found point matches satisfy the orientation, rigidity and matching error constraints and thus, are of high relative quality. Such properties not only shed light on and deepen our understanding of this criterion about its generality and practicality and improve our awareness about whether the established point matches are consistent with each other, but also provide us with a possibility to develop novel algorithms for the reliability evaluation of existing point matches and an efficient establishment of more accurate point matches. The experimental results based on real images show that the possible point matches established through extracting and matching spin images often violate these constraints and these constraints can often be successfully applied to reject (probably part of) the unlikely point matches for more accurate free form shape matching results.
Automatic registration of overlapping 3D point clouds using closest points
2006, Image and Vision Computing
While the SoftAssign algorithm imposes a two-way constraint embedded into the deterministic annealing scheme and the EMICP algorithm imposes a one-way constraint, they represent the state of the art technique for the automatic registration of overlapping free form shapes. They both have a time complexity of O(n²). While the former has a space complexity also of O(n²), the latter has a space complexity of O(n). The heavy demand for computation and storage memory renders either the SoftAssign or EMICP algorithm to hardly operate on whole shapes with thousands of points. In this case, they often have to reduce the number of points to an order of 100s on the free form shapes to be registered. This paper proposes using closest points in conjunction with either the one-way or two-way constraint for the automatic registration of overlapping 3D point clouds and thus, combining the accuracy of both the SoftAssign and EMICP algorithms with the efficiency of the traditional ICP algorithm. A comparative study based on both synthetic data and real images has shown that the proposed algorithm does not significantly sacrifice accuracy and stability of either the SoftAssign or EMICP algorithm, but gains remarkable efficiency of the traditional ICP algorithm for the automatic registration of overlapping 3D point clouds. Since, the proposed algorithm is of general use and has an advantage of easy implementation, it is likely to become in the future a benchmark for the automatic registration of overlapping 3D point clouds.
Application of terrestrial laser scanning methodology in geometric tolerances analysis of tunnel structures
2006, Tunnelling and Underground Space Technology
Projecting registration error for accurate registration of overlapping range images
2006, Robotics and Autonomous Systems
In this paper, we propose a novel algorithm for the automatic registration of two overlapping range images. Since it is relatively difficult to compare the registration errors of different point matches, we project them onto a virtual image plane for more accurate comparison using the classical pin-hole perspective projection camera model. While the traditional ICP algorithm is more interested in the points in the second image close to the sphere centred at the transformed point, the novel algorithm is more interested in the points in the second image as collinear as possible to the transformed point. The novel algorithm then extracts useful information from both the registration error and projected error histograms for the elimination of false matches without any feature extraction, image segmentation or the requirement of motion estimation from outliers corrupted data and, thus, has an advantage of easy implementation. A comparative study based on real images captured under typical imaging conditions has shown that the novel algorithm produces good registration results.
Feature-based reverse modeling strategies
2006, CAD Computer Aided Design
We presented two integrated solution schemes, sectional feature based strategy and surface feature based strategy, for modeling industrial components from point cloud to surfaces without using triangulation. For the sectional feature based strategy, slicing, curve feature recognition and constrained fitting are introduced. This strategy emphasizes the advanced feature architecture patterns from 2D to 3D in reverse engineering. The surface feature based strategy relies on differential geometric attributes estimation and diverse feature extraction techniques. The methods and algorithms such as attributes estimation based on 4D Shepard surface, symmetry plane extraction, quadric surface recognition and optimization, extruded and rotational surface extraction, and blend feature extraction with probability and statistic theory are proposed. The reliable three-dimensional feature fabricated the valid substratum of B-rep model faultlessly. All the algorithms are implemented in RE-SOFT, a reverse engineering software developed by Zhejiang University. The proposed strategies can be used to capture the original design intention accurately and to complete the reverse modeling process conveniently. Typical industrial components are used to illustrate the validation of our feature-based strategies.

View all citing articles on Scopus

Marcos Aurélio Rodrigues received his B.E. in Mechanical Engineering from the Federal University of Santa Catarina, Brazil, in 1983. He was awarded an M.Sc. in Computer Science in 1989 and a Ph.D. in Computer Science in 1991, both from the University of Wales, Aberystwyth. He was a lecturer within the Department of Computer Science, the University of Hull, UK, from 1995 to 1999. In 2000 he was appointed as Reader in Computing at Sheffield Hallam University, England. He has published over 90 technical papers in international journals and conferences on the subjects of robotics, systems modelling, computer vision, and pattern recognition. His main current research interests include machine vision, machine learning, and pattern recognition.

Yonghuai Liu received his B.Sc. in Computer-Aided Mapping and Geographical Information Systems from Nanjing University, PR China, in July 1989. He was awarded an M.Sc. in Remote Sensing and Image Processing from the Chinese Academy of Sciences in July 1992, and a Ph.D. in Artificial Intelligence from Northwestern Polytechnical University, PR China, in April 1998. From July 1992 to August 1997, he worked as a lecturer at Northwestern Polytechnical University, PR China, teaching on the subjects of artificial intelligence, uncertain reasoning, and expert systems. During this period he published over 20 papers in journals and conferences and won several awards. He was also awarded a Ph.D. in Computer Science in 2001 from the University of Hull, England. He has published extensively on his current research interests which include 3D image reconstruction, 3D machine vision, and pattern recognition.

View full text

On the representation of rigid body transformations for accurate registration of free-form shapes

Abstract

Introduction

Section snippets

The representation of rigid body transformations

Analysis of the standard ICP method

Experimental results

Discussion

Towards a general multi-view registration technique

IEEE Transactions on Pattern Analysis and Machine Intelligence

A method for registration of 3D shapes

IEEE Transactions on Pattern Analysis and Machine Intelligence

Registering multiview range data to create 3D computer objects

IEEE Transactions on Pattern Analysis and Machine Intelligence

Object modelling by registration of multiple range images

Image and Vision Computing