skip to main content
10.1145/3460569.3460583acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmaiConference Proceedingsconference-collections
research-article

Height Estimation from Single Aerial Imagery with a Deep Boundary-Guided Network

Published:31 August 2021Publication History

ABSTRACT

Extracting 3D information from single aerial image plays an important role in computer vision and remote sensing. However, due to the structural complexity of ground objects and noise introduced during the generation stage of ground truth labels, it is challenging to automatically recover the regularized height map from only one orthogonal photography. In this paper, we propose a novel deep network for estimating accurate and regularized height map from a single aerial image. The network mainly contains two sub-networks, namely the height map derivation sub-network and the boundary guidance sub-network. They are sequentially connected together, so that the corresponding boundary map can be directly calculated after the height map is obtained. We also propose a loss function suitable for semantic boundary guidance, which is similar to SSIM loss function at the edges of the ground targets. Apart from pursuing accuracy of height regression, boundary regularity constraints derived from semantic labels are also employed to form a joint metric criterion. We perform a qualitative and quantitative evaluations on ISPRS remote sensing dataset, and the result indicate that our framework improve both accuracy and regularity of estimated depth map.

References

  1. Amirkolaee, H.A., Arefi, H.: Height estimation from single aerial images using a deep convolutional encoder-decoder network. ISPRS journal of photogrammetry and remote sensing 149, 50-66 (2019)Google ScholarGoogle Scholar
  2. Badrinarayanan, V., Handa, A., Cipolla, R.: Segnet: A deep convolutional encoderdecoder architecture for robust semantic pixel-wise labelling. arXiv preprint arXiv:1505.07293 (2015)Google ScholarGoogle Scholar
  3. Badrinarayanan, V., Kendall, A., Cipolla, R.: Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE transactions on pattern analysis and machine intelligence 39(12), 2481-2495 (2017)Google ScholarGoogle Scholar
  4. Carvalho, M., Le Saux, B., Trouv´e-Peloux, P., Champagnat, F., Almansa, A.: Multitask learning of height and semantics from aerial images. IEEE Geoscience and Remote Sensing Letters (2019)Google ScholarGoogle Scholar
  5. Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE transactions on pattern analysis and machine intelligence 40(4), 834-848 (2017)Google ScholarGoogle Scholar
  6. Chen, Y., Li, J., Xiao, H., Jin, X., Yan, S., Feng, J.: Dual path networks. In:Advances in Neural Information Processing Systems. pp. 4467-4475 (2017)Google ScholarGoogle Scholar
  7. Dai, J., Li, Y., He, K., Sun, J.: R-fcn: Object detection via region-based fully convolutional networks. In: Advances in neural information processing systems. pp. 379-387 (2016)Google ScholarGoogle Scholar
  8. Dubost, F., Bortsova, G., Adams, H., Ikram, A., Niessen, W.J., Vernooij, M., De Bruijne, M.: Gp-unet: Lesion detection from weak labels with a 3d regression network. In: International Conference on Medical Image Computing and Computer-Assisted Intervention. pp. 214-221. Springer (2017)Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Eigen, D., Fergus, R.: Predicting depth, surface normals and semantic labels with a common multi-scale convolutional architecture. In: Proceedings of the IEEE international conference on computer vision. pp. 2650-2658 (2015)Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Gamal, M., Siam, M., Abdel-Razek, M.: Shuffleseg: Real-time semantic segmentation network. arXiv preprint arXiv:1803.03816 (2018)Google ScholarGoogle Scholar
  11. Haarbrink, R., Eisenbeiss, H., : Accurate dsm production from unmanned helicopter systems. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci 37, 1259-1264 (2008)Google ScholarGoogle Scholar
  12. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 770-778 (2016)Google ScholarGoogle ScholarCross RefCross Ref
  13. Hore, A., Ziou, D.: Image quality metrics: Psnr vs. ssim. In: 2010 20th International Conference on Pattern Recognition. pp. 2366-2369. IEEE (2010)Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Kendall, A., Badrinarayanan, V., Cipolla, R.: Bayesian segnet: Model uncertainty in deep convolutional encoder-decoder architectures for scene understanding. arXiv preprint arXiv:1511.02680 (2015)Google ScholarGoogle Scholar
  15. Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems. pp. 1097-1105 (2012)12Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Li, X., Chen, H., Qi, X., Dou, Q., Fu, C.W., Heng, P.A.: H-denseunet: hybrid densely connected unet for liver and tumor segmentation from ct volumes. IEEE transactions on medical imaging 37(12), 2663-2674 (2018)Google ScholarGoogle Scholar
  17. Liu, C., Chen, L.C., Schroff, F., Adam, H., Hua, W., Yuille, A.L., Fei-Fei, L.: Auto-deeplab: Hierarchical neural architecture search for semantic image segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 82-92 (2019)Google ScholarGoogle ScholarCross RefCross Ref
  18. Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 3431-3440 (2015)Google ScholarGoogle ScholarCross RefCross Ref
  19. Noh, H., Hong, S., Han, B.: Learning deconvolution network for semantic segmentation. In: Proceedings of the IEEE international conference on computer vision. pp. 1520-1528 (2015)Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Noronha, S., Nevatia, R.: Detection and modeling of buildings from multiple aerial images. IEEE Transactions on pattern analysis and machine intelligence 23(5), 501-518 (2001)Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Ronneberger, O., Fischer, P., Brox, T.: U-net: Convolutional networks for biomedical image segmentation. In: International Conference on Medical image computing and computer-assisted intervention. pp. 234{241. Springer (2015)Google ScholarGoogle Scholar
  22. Ros, G., Sellart, L., Materzynska, J., Vazquez, D., Lopez, A.M.: The synthia dataset: A large collection of synthetic images for semantic segmentation of urban scenes. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 3234-3243 (2016)Google ScholarGoogle ScholarCross RefCross Ref
  23. Shin, H.C., Roth, H.R., Gao, M., Lu, L., Xu, Z., Nogues, I., Yao, J., Mollura, D., Summers, R.M.: Deep convolutional neural networks for computer-aided detection: Cnn architectures, dataset characteristics and transfer learning. IEEE transactions on medical imaging 35(5), 1285-1298 (2016)Google ScholarGoogle Scholar
  24. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)Google ScholarGoogle Scholar
  25. Verma, V., Kumar, R., Hsu, S.: 3d building detection and modeling from aerial lidar data. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’06). vol. 2, pp. 2213-2220. IEEE (2006)Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Vincent, O.R., Folorunso, O., : A descriptive algorithm for sobel image edge detection. In: Proceedings of Informing Science & IT Education Conference (InSITE). vol. 40, pp. 97-107. Informing Science Institute California (2009)Google ScholarGoogle ScholarCross RefCross Ref
  27. Wu, H., Cai, Z., Wang, Y.: Vison-based auxiliary navigation method using augmented reality for unmanned aerial vehicles. In: IEEE 10th International Conference on Industrial Informatics. pp. 520-525. IEEE (2012)Google ScholarGoogle ScholarCross RefCross Ref
  28. Xie, S., Girshick, R., Doll´ar, P., Tu, Z., He, K.: Aggregated residual transformations for deep neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 1492-1500 (2017)Google ScholarGoogle ScholarCross RefCross Ref
  29. Zhan, H., Garg, R., Saroj Weerasekera, C., Li, K., Agarwal, H., Reid, I.: Unsupervised learning of monocular depth estimation and visual odometry with deep feature reconstruction. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 340-349 (2018)Google ScholarGoogle ScholarCross RefCross Ref
  30. Zhang, X., Ye, Z., Zhu, J.H., LI, S.: Unmanned aerial vehicle flight simulation and training system based on virtual reality [j]. Acta Simulata Systematica Sinica 8(2002)Google ScholarGoogle Scholar

Index Terms

  1. Height Estimation from Single Aerial Imagery with a Deep Boundary-Guided Network
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          ICMAI '21: Proceedings of the 2021 6th International Conference on Mathematics and Artificial Intelligence
          March 2021
          142 pages
          ISBN:9781450389464
          DOI:10.1145/3460569

          Copyright © 2021 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 31 August 2021

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed limited
        • Article Metrics

          • Downloads (Last 12 months)11
          • Downloads (Last 6 weeks)0

          Other Metrics

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format .

        View HTML Format