A fast 3D reconstruction system with a low-cost camera accessory

Zhang, Yiwei; Gibson, Graham M.; Hay, Rebecca; Bowman, Richard W.; Padgett, Miles J.; Edgar, Matthew P.

doi:10.1038/srep10909

Download PDF

Article
Open access
Published: 09 June 2015

A fast 3D reconstruction system with a low-cost camera accessory

Yiwei Zhang¹,
Graham M. Gibson¹,
Rebecca Hay¹,
Richard W. Bowman²,
Miles J. Padgett¹ &
…
Matthew P. Edgar¹

Scientific Reports volume 5, Article number: 10909 (2015) Cite this article

40k Accesses
168 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Photometric stereo is a three dimensional (3D) imaging technique that uses multiple 2D images, obtained from a fixed camera perspective, with different illumination directions. Compared to other 3D imaging methods such as geometry modeling and 3D-scanning, it comes with a number of advantages, such as having a simple and efficient reconstruction routine. In this work, we describe a low-cost accessory to a commercial digital single-lens reflex (DSLR) camera system allowing fast reconstruction of 3D objects using photometric stereo. The accessory consists of four white LED lights fixed to the lens of a commercial DSLR camera and a USB programmable controller board to sequentially control the illumination. 3D images are derived for different objects with varying geometric complexity and results are presented, showing a typical height error of <3 mm for a 50 mm sized object.

A multifunctional display based on photo-responsive perovskite light-emitting diodes

Article Open access 10 April 2024

Chunxiong Bao, Zhongcheng Yuan, … Feng Gao

Interferometric imaging of amplitude and phase of spatial biphoton states

Article Open access 14 August 2023

Danilo Zia, Nazanin Dehghan, … Ebrahim Karimi

Flexible quasi-2D perovskite solar cells with high specific power and improved stability for energy-autonomous drones

Article 17 April 2024

Bekele Hailegnaw, Stepan Demchyshyn, … Martin Kaltenbrunner

Introduction

Three dimensional (3D) image reconstruction is a procedure of creating a mathematical representation of a 3D object. 3D imaging has applications in a wide range of disciplines, for instance, prototyping, object recognition, robot navigation, 3D movies and games¹.

There are various approaches for performing 3D imaging. On the smallest scale, interferometry is an extensively utilized technique for 3D surface reconstruction. The principle of interferometry is based on the phenomenon that two waves (e.g. light or radio waves) with the same or nearly the same frequency can be superimposed to form a resultant wave with a greater or lower amplitude². With information extracted from the combined waves, interferometry can be used to inspect optical surfaces and provide high precision mapping to a small fraction of a wavelength³.

Another reliable non-contact 3D imaging technique is structured illumination. Having a calibrated projector-camera pair, a light pattern is projected onto the scene and imaged by the camera (or cameras). If the surface in the scene is planar without any 3D surface variation, the pattern shown in the corresponding image will be the same (or similar) to that of the projected light pattern. However, if the surface in the scene is non-planar, the projected structured-light pattern in the corresponding image will be distorted due to surface geometry⁴. The structured illumination method uses the information from the distortion of the projected structured-light pattern to extract the 3D surface geometry. By using various structured illumination patterns (most simply sine wave), 3D surface profiles of objects can be measured with a height error within millimeters^5,6.

In addition to these aforementioned techniques, stereo vision (also known as stereoscopic vision or stereopsis) is another extensively used technique in 3D imaging, which reconstructs a 3D object by deducing the spatial shape and position of the object through parallax between the corresponding pixels from different images of the object as observed from multiple viewpoints⁷. The principle of traditional stereo vision techniques is triangulation, in which the unique contours of the object can be determined with the photos taken from two unparalleled cameras⁸. Traditional stereo vision approaches rely on the correspondence between photo elements from two cameras which sometimes can be difficult to determine.

It is also possible to extract 3D information from a single image, namely shape from shading⁹, by utilising assumptions of uniform surface reflectivity. More recently, there have been significant advances by using learning-based algorithms¹⁰, 3D morphable model algorithms¹¹ and coupled radius basis function network algorithms¹², that rely on large data training to recover 3D information from a single 2D image.

An alternative approach, which was first introduced by Woodham and known as photometric stereo¹³, allows depth and surface orientation to be estimated from multiple images of a static object taken from the same viewpoint, but under different illumination directions. This technique makes no assumption about the surface smoothness and can be calculated with reasonable computational cost.

There has been significant developments in the use of photometric stereo for 3D imaging in recent years. Okatani and Deguchi¹⁴ pointed out that, for general diffuse reflectance, there exists a set of the surface normals for which the relation between the surface normal and the orientation of the 3-vector formed by the image brightness triplet is guaranteed to be one-to-one. Basri, Jacobs and Kemelmacher¹⁵ demonstrated new methods of photometric stereo to recover surface normals assuming that all lights in a scene are distant from the object but otherwise unconstrained. Tan, et al.¹⁶ described a method to enhance the resolution of photometric stereo, which recovers the distribution of surface normals and the surface convexity of each pixel and then spatially arranging the normals among pixels based on the consistency and simplicity constraints on the surface structure. Kuparinen and Kyrki¹⁷ presented an optimal reconstruction of near planar textured surfaces using photometric stereo with the Wiener filtering from noisy and blurred observation, when the statistics of imaging errors are measurable. Shi, et al.¹⁸ proposed a self-calibrating method for photometric stereo, which uses color and intensity profiles from images taken under different and unknown lighting conditions to automatically recover both the camera’s radiometric response function and the unknown lighting directions and intensities. Hansen, et al.¹⁹ described an algorithm for selecting the optimal light sources used for photometric stereo reconstruction with both visible and near-infrared light sources, which does not require knowledge of the precise shadow boundary. Wu, et al.²⁰ demonstrated a new approach for photometric stereo, which uses advanced convex optimization techniques to handle shadows and specularities in the images for recovering surface normals from multiple lighting conditions. Sun, et al.²¹ combined a lighting calibration method, which uses a reference face model to estimate the lighting parameters from face images taken under unknown illumination, with the classical photometric stereo to reconstruct 3D faces rapidly. Chandraker, et al.²² presented a comprehensive theory, in which surface information can be determined from unknown, isotropic bidirectional reflectance distribution functions (BRDFs). In subsequent work²³, Chandraker demonstrated shape recovery from camera motions under BRDFs, based on shape from motion theories. Mecca, et al.²⁴ investigated the implementations of photometric stereo in the case of near field imaging applications and presented an efficient model based on quasi-linear partial differential equations for surface reconstruction.

Results

Hardware system and image processing

Our system consists of a commercial DSLR camera (Canon EOS 5D MarkII), four white LEDs (Luxeon Rebel) surrounding the camera lens, fixed at a distance of 330 mm via aluminium spokes, a controller board (Arduino Uno) to enable USB control of the illumination direction and a computer running our program (LabVIEW) to communicate with the controller board and obtain real-time 2D images captured by the camera (see Fig. 1).

For this investigation the background of each scene is set to be black, which helps objects to be segmented from it. The object position, the camera perspective and the light positions for all images are static and known. Once the object and camera are aligned, image acquisition is triggered through our software (see Fig. 2), initiating four LED’s to flash successively, synchronized with a short camera exposure (in total less than 1 second). This procedure provides four separate images of the object, each with shading determined by a different lighting vector. To minimize computer memory requirements, the images are subsequently resized and down-sampled to 360 × 360 pixels before 3D image reconstruction is performed.

The software reconstruction pipeline is shown in Fig. 3. The intensities from the four 2D input images are compared and the maximum intensity is obtained to provide the red, green and blue (RGB) intensity values for each pixel. This provides the reflection coefficient ρ^t map, which is later used for the 3D object texture. A threshold value is set to distinguish background and foreground, from which the object in the images can be separated. With the known coordinates, L_n, of four LED’s and the corresponding object intensity value on each pixel I_n, the object’s surface normal N is calculated by rearranging²⁵

where λ_i is the albedo (reflectivity) of each pixel on the object surface that can be estimated with a modular operation of the surface normal.

From the surface normal of each pixel, the gradients between adjacent pixels are used to obtain the surface geometry by integration, from a central starting point (set manually) to the outermost pixels of the object’s surface. The surface height at a pixel point can be approximated with the gradient of the surface and the height of its nearest-neighbor point. Since each pixel point corresponds to the measured gradient data and has more than one nearest-neighbor point, the gradient of the surface used is the mean value of the gradients at every two contiguous points and the surface height at each pixel point is denoted by the mean of the values calculated from all the nearest-neighbor points.

The program runs through the pixel points one at a time, each time the height at the pixel point in each iteration is set to the mean value of its estimates from all the nearest-neighbor points. The pixel points of an object can be divided into two types: internal pixel point and boundary pixel point. When a pixel is an internal point, the reconstructed height field’s Laplacian is set to the Laplacian calculated from the measured gradient data; when a pixel is a boundary point, the measured gradient data at the point is assumed to be accurate²⁶.

Quantitative and qualitative comparison

To test the accuracy and robustness of our 3D imaging system we imaged three different objects with varying geometric complexity: a hemisphere, an arc and a mannequin head. Both the hemisphere and the arc were constructed using a 3D printer and designed with a height of 50 mm, while the mannequin head was measured to have a height (from ear to nose tip) of 160 mm. The front of each object was located 900 mm from the camera lens. Surrounding the lens were located four white LED’s (positioned above, below, left and right) maintained at a distance of 330 mm from the center of the lens. For each of the objects placed in the scene, four images were acquired, corresponding to the different lighting conditions and cropped into a 360 × 360 pixel image (corresponding to 320 mm × 320 mm virtual size). These images were then used to reconstruct the 3D surface of the objects (see Fig. 4) using the aforementioned approach.

The 3D surface height map for each object was compared with a reference height map. For the arc and the hemisphere, the reference data was acquired from a stereolithography (STL) file used to create them, whereas the mannequin head reference data was obtained from a stereophotogrammetric camera system. For comparison, the measured data was scaled appropriately to match the reference data. We express the standard deviation of the differences between measured data and reference data using the root mean square error (RMSE) and the normalized root mean square error (NRMSE), representing the variation of the RMSE²⁷. The RMSE and NRMSE, correspondingly are defined as²⁸:

where n is the number of data pairs, d_i is the difference between measured values and reference values and (x_max − x_min) is the range of measured values. The value of NRMSE is often expressed as a percentage, where a lower value indicates less variance and hence higher accuracy. The results are shown in Table 1.

Table 1 Deviations between measured values and true values.

Full size table

We observed close agreement for the two objects with relatively low geometric complexity, the hemisphere and the arc, having RMS errors of 2.76 mm and 2.65 mm, respectively. While an RMS error of 15.60 mm was observed for the mannequin head. We note that the regions contributing most to the overall RMS error were locations of sharp edges or where the surface normal was in a direction perpendicular to the camera perspective. Furthermore, we tested the system with a real human subject, to validate its capabilities for potential applications, such as facial recognition, the result of which is shown in Fig. 5.

In addition to reconstructing 3D images with a cheap camera attachment and photometric stereo software, it is possible to simultaneously determine pairs of 2D images representing the slightly different perspectives of an object that would be formed by our eyes. By displaying these images on a 3D enabled TV, we are able to view the object in 3D from different angles despite the camera and object remaining stationary during image acquisition. We have demonstrated this system in application whilst obtaining 3D images of faces from visitors at various science exhibitions (such as the Glasgow Science Centre and the Royal Society), whereupon a short 2D movie is produced of the head rotating and subsequently sent to an email address. The results proved the capability of this system to reconstruct 3D models of faces under realistic workplace conditions.

Discussion

In this paper, we have presented a new computational system with a low-cost camera accessory, which consists of four white LED’s and a controller board, for enabling fast 3D image reconstruction based on photometric stereo principles. We have performed 3D reconstructions on a selection of objects with varying geometric complexity, finding good quantitative agreement with the known reference object with a wide viewing angle. We observed an increased error at regions of large gradients, where the surface normals were approaching a direction perpendicular to the camera perspective, indicating one limitation of this 3D imaging system. This inexpensive system has potential to be used at the airport security-check for collecting 3D information of passengers rapidly and it could also be applied to high schools for education purpose. Further improvement could be made to the system by optimising the reconstruction algorithm in order to provide better height estimates at regions with sharp edges.

Methods

Photometric Stereo

The appearance of an object in a photo results from the effects of illumination, object orientation, object shape and its reflectance. With a fixed object, the corresponding surface orientation can be determined by analyzing the object images under different illumination directions (see Fig. 6). Fundamentally, photometric stereo, which is simple and succinct for Lambertian surfaces, enables 3D reconstruction of an object by analyzing differences in the pixel intensities in images that have been acquired from at least three different illumination directions²⁹.

By definition, a surface, from which light is reflected in all directions and its brightness looks the same regardless of direction and position, is a Lambertian surface³⁰. For instance, a human face is an approximate Lambertian object³¹, for which this system may have applications in imaging, for example in facial recognition techniques. In general, the appearance of a diffuse object with a specular varying reflection may be modeled as³²:

where I_P is the pixel intensity at point p, k is a fixed value of a linear combination of k basis materials, ρ^t is a reflection coefficient that varies on the surface, f_i is any reflectance map as a function of the viewing direction v, n_P is the surface normal at that point and L_P is the incident illumination field.

Photometric stereo requires some control of the lighting environment, without position changes of neither object nor camera. Theoretically, three illumination directions are sufficient to obtain the surface normals, however to ensure that at least three intensity values are measured at any pixel in all acquired images, our 3D imaging system utilizes four illumination directions.

The data used to produce the content of this manuscript is available at: http://dx.doi.org/10.5525/gla.researchdata.168.

Additional Information

How to cite this article: Zhang, Y. et al. A fast 3D reconstruction system with a low-cost camera accessory. Sci. Rep. 5, 10909; doi: 10.1038/srep10909 (2015).

References

Quan, L. Image-Based Modeling (Springer, New York, 2010).
Book Google Scholar
Hill, M. et al. Verification of 2-D MEMS model using optical profiling techniques. Opt. Laser Eng. 36, 169–183 (2001).
Article Google Scholar
Hariharan, P. Basics of Interferometry (Academic Press, Boston, 2010).
Geng, J. Structured-light 3D surface imaging: a tutorial. Adv. Opt. Photon. 3, 128–160 (2011).
Article CAS Google Scholar
Scharstein, D. & Szeliski, R. High-accuracy stereo depth maps using structured light. In IEEE Comput. Vis. Pattern Recogn., Wisconsin, USA, I–195–I–202 (IEEE, 2003).
Gupta, M., Agrawal, A., Veeraraghavan, A. & Narasimhan, S. G. Structured light 3D scanning in the presence of global illumination. In IEEE Comput. Vis. Pattern Recogn., Providence, USA, 713–720 (IEEE, 2011).
Iocchi, L. & Konolige, K. A multiresolution stereo vision system for mobile robots. In New Trends in Robotics Research, Padova, Italy, 317–321 (AIIA, 1998).
Keppel, E. Approximating complex surfaces by triangulation of contour lines. IBM J. Res. and Dev. 19, 2–11 (1975).
Article MathSciNet Google Scholar
Horn, B. K. & Brooks, M. J. Shape from Shading (MIT Press, Cambridge, 1989).
Nandy, D. & Ben-Arie, J. Shape from recognition: a novel approach for 3-D face shape recovery. IEEE Trans. Image Process. 10, 206–217 (2001).
Article CAS ADS Google Scholar
Romdhani, S. & Vetter, T. Estimating 3D shape and texture using pixel intensity, edges, specular highlights, texture constraints and a prior. In IEEE Comput. Vis. Pattern Recogn., Providence, RI, 2, 986–993 (IEEE, 2005).
Google Scholar
Song, M., Tao, D., Huang, X., Chen, C. & Bu, J. Three-dimensional face reconstruction from a single image by a coupled RBF network. IEEE Trans. Image Process. 21, 2887–2897 (2012).
Article ADS MathSciNet Google Scholar
Woodham, R. J. Photometric method for determining surface orientation from multiple images. Opt. Eng. 19, 139–144 (1980).
Article ADS Google Scholar
Okatani, T. & Deguchi, K. On uniqueness of solutions of the three-light-source photometric stereo: Conditions on illumination configuration and surface reflectance. Comput. Vis. Image Und. 81, 211–226 (2001).
Article Google Scholar
Basri, R., Jacobs, D. & Kemelmacher, I. Photometric stereo with general, unknown lighting. Int. J. Comput. Vision 72, 239–257 (2007).
Article Google Scholar
Tan, P., Lin, S. & Quan, L. Subpixel photometric stereo. IEEE Trans. Pattern Anal. Mach. Intell. 30, 1460–1471 (2008).
Article Google Scholar
Kuparinen, T. & Kyrki, V. Optimal reconstruction of approximate planar surfaces using photometric stereo. IEEE Trans. Pattern Anal. Mach. Intell. 31, 2282–2289 (2009).
Article Google Scholar
Shi, B., Matsushita, Y., Wei, Y., Xu, C. & Tan, P. Self-calibrating photometric stereo. In IEEE Comput. Vis. Pattern Recogn., California, USA, 1118–1125 (IEEE, 2010).
Hansen, M. F., Atkinson, G. A., Smith, L. N. & Smith, M. L. 3D face reconstructions from photometric stereo using near infrared and visible light. Comput. Vis. Image Und. 114, 942–951 (2010).
Article Google Scholar
Wu, L. et al. Robust photometric stereo via low-rank matrix completion and recovery. In Proceedings of the 10th Asian Conference on Computer Vision, Queenstown, New Zealand, 703–717 (Springer-Verlag, 2011).
Sun, Y., Dong, J., Jian, M. & Qi, L. Fast 3D face reconstruction based on uncalibrated photometric stereo. Multimed. Tools Appl. 1–16; 10.1007/s11042-013-1791-3 (2013).
Chandraker, M., Bai, J. & Ramamoorthi, R. On differential photometric reconstruction for unknown, isotropic BRDFs. IEEE Trans. Pattern Anal. Mach. Intell. 35, 2941–2955 (2013).
Article Google Scholar
Chandraker, M. What camera motion reveals about shape with unknown BRDF. In IEEE Comput. Vis. Pattern Recogn., 2179–2186 (IEEE, 2014).
Mecca, R., Wetzler, A., Bruckstein, A. M. & Kimmel, R. Near field photometric stereo with point light sources. SIAM J. Imaging Sci. 7, 2732–2770 (2014).
Article MathSciNet Google Scholar
Hernández, C., Vogiatzis, G. & Cipolla, R. Multiview photometric stereo. IEEE Trans. Pattern Anal. Mach. Intell. 30, 548–554 (2008).
Article Google Scholar
Horn, B. Robot Vision (MIT Press, Cambridge, 1986).
Stone, R. J. Improved statistical procedure for the evaluation of solar radiation estimation models. Sol. Energy 51, 289–291 (1993).
Article ADS Google Scholar
Tong, K. & Granat, M. H. A practical gait analysis system using gyroscopes. Med. Eng. Phys. 21, 87–94 (1999).
Article CAS Google Scholar
Chantler, M. J. & Wu, J. Rotation invariant classification of 3D surface textures using photometric stereo and surface magnitude spectra. In British Machine Vision Conference, Bristol, UK, 1–10 (BMVA, 2000).
Acharya, T. & Ray, A. K. Image Processing (John Wiley & Sons, Inc., New Jersey, 2005).
Aggarwal, G. Recognizing Human Faces: Physical Modeling and Pattern Classification (ProQuest, 2008).
Hertzmann, A. & Seitz, S. M. Example-based photometric stereo: Shape reconstruction with general, varying BRDFs. IEEE Trans. Pattern Anal. Mach. Intell. 27, 1254–1264 (2005).
Article Google Scholar

Download references

Acknowledgements

This work was supported by the European Research Council (ERC) through TWISTS (grant number 192382), the Engineering and Physical Sciences Research Council (EPSRC) through COAM (grant number EP/I012451/1) and the University of Glasgow through the Kelvin Smith Scholarship.

Author information

Authors and Affiliations

SUPA, School of Physics and Astronomy, University of Glasgow, Glasgow, G12 8QQ, UK
Yiwei Zhang, Graham M. Gibson, Rebecca Hay, Miles J. Padgett & Matthew P. Edgar
Department of Physics, University of Cambridge, CB3 0HE, UK
Richard W. Bowman

Authors

Yiwei Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Graham M. Gibson
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca Hay
View author publications
You can also search for this author in PubMed Google Scholar
Richard W. Bowman
View author publications
You can also search for this author in PubMed Google Scholar
Miles J. Padgett
View author publications
You can also search for this author in PubMed Google Scholar
Matthew P. Edgar
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.Z. wrote the main manuscript text and G.G. prepared figures 1–2. R.B., R.H., M.P.E. and M.J.P. were involved in the development of the system. All authors reviewed the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Supplementary Movie S1

Supplementary Movie S2

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Zhang, Y., Gibson, G., Hay, R. et al. A fast 3D reconstruction system with a low-cost camera accessory. Sci Rep 5, 10909 (2015). https://doi.org/10.1038/srep10909

Download citation

Received: 31 December 2014
Accepted: 29 April 2015
Published: 09 June 2015
DOI: https://doi.org/10.1038/srep10909

This article is cited by

Real-time molecular imaging of near-surface tissue using Raman spectroscopy
- Wei Yang
- Florian Knorr
- Iwan W. Schie
Light: Science & Applications (2022)
Field surface roughness levelling of the lapping metal surface using specular white light
- Junaid Dar
- Dinuka Ravimal
- Sun-Kyu Lee
The International Journal of Advanced Manufacturing Technology (2022)
Optical MEMS devices for compact 3D surface imaging cameras
- Sung-Pyo Yang
- Yeong-Hyeon Seo
- Ki-Hun Jeong
Micro and Nano Systems Letters (2019)
Principles and prospects for single-pixel imaging
- Matthew P. Edgar
- Graham M. Gibson
- Miles J. Padgett
Nature Photonics (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.