skip to main content
10.1145/3569966.3570092acmotherconferencesArticle/Chapter ViewAbstractPublication PagescsseConference Proceedingsconference-collections
research-article

3D Model Retrieval Algorithm Based on Attention and Multi-view Fusion

Authors Info & Claims
Published:20 December 2022Publication History

ABSTRACT

With the rapid development of computer vision, 3D data is increasing rapidly. How to retrieve similar model from a large number of models has become a hot research topic. However, in order to meet people's demand, the retrieval accuracy need to be further improved. In terms of multi-view 3D model retrieval, how to effectively learn the information between views is the key to improving performance. In this paper, we propose a novel 3D model retrieval algorithm based on attention and multi-view fusion. Specifically, we mainly constructed two modules. First, dynamic attentive graph learning module is used to learn the intrinsic relationship between view blocks; Then we propose the Attention-NetVlad algorithm, which combines the channel attention algorithm and the NetVlad algorithm. It learns the information between feature channels to enhance the feature expression ability firstly, then uses the NetVlad algorithm to fuse multiple view features into a global feature according to the clustering information. Finally the global feature is used as the only feature of the model to retrieve according to Euclidean distance. In comparison with other state-of-the-art methods by utilizing ModelNet10 and ModelNet40 the proposed method has demonstrated significant improvement for retrieval mAP. Our experiments also demonstrate the effectiveness of the modules in the algorithm.

References

  1. Charles Ruizhongtai Qi, Hao Su, Kaichun Mo and Leonidas J. Guibas. 2017. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. In Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR’17).IEEE, Honolulu, HI, USA. 77-85.https://doi.org/10.1109/CVPR.2017.16Google ScholarGoogle ScholarCross RefCross Ref
  2. Ali Cheraghian and Lars Petersson.2019. 3DCapsule: Extending the Capsule Architecture to Classify 3D Point Clouds. In Proceedings of the Winter Conference on Applications of Computer (WACV’19). Waikoloa Village, HI, USA,1194-1202. https://doi.org/10.1109/WACV.2019.00132Google ScholarGoogle ScholarCross RefCross Ref
  3. Hongsen Liu, Yang Cong, Chenguang Yang and Yandong Tang. 2019. Efficient 3D object recognition via geometric information preservation.Pattern Recognit 92, (2019),135-145. https://doi.org/10.1016/j.patcog.2019.03.025Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Jianwen Jiang , Di Bao ,Ziqiang Chen , Xibin Zhao and Yue Gao. 2019. MLVCNN: Multi-loop-view convolutional neural network for 3D shape retrieval. In Proceedings of the Conference on Artificial Intelligence (AAAI2019). AAAI, Honolulu,Hawaii, USA,8513-8520. https://doi.org/10.1609/aaai.v33i01.33018513Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Heyu Zhou,An-An Liu, Weizhi Nie and Jie Nie. 2020. Multi-view saliency guided deep neural network for 3-d object retrieval and classification. IEEE Trans Multim 22,6 (2020) 1496–1506.https://doi.org/10.1109/TMM.2019.2943740Google ScholarGoogle ScholarCross RefCross Ref
  6. Zhirong Wu , Shuran Song , Aditya Khosla , Fisher Yu , Linguang Zhang, Xiaoou Tang and Jianxiong Xiao. 2015. 3d shapenets: A deep representation for volumetric shapes. In Proceedings of the Conference on Computer Vision and Pattern Recognition.(CVPR’15) IEEE.Boston, MA, USA ,1912–1920. https://doi.org/10.1109/CVPR.2015.7298801Google ScholarGoogle ScholarCross RefCross Ref
  7. Hang Su, Subhransu Maji, Evangelos Kalogerakis and Erik G. Learned-Miller. 2015 Multi-view convolutional neural networks for 3d shape recognition. In Proceedings of the International Conference on Computer Vision(ICCV’15). IEEE. Santiago, Chile, 945-953 https://doi.org/10.1109/ICCV.2015.114Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Cheng Wang, Ming Cheng, Ferdous Sohel, Mohammed Bennamoun andJonathan Li. 2019. Normalnet: A voxel-based cnn for 3d object classification and retrieval. Neurocomputing 323,JAN.5 (2019),139–147. https://doi.org/10.1016/j.neucom.2018.09.075Google ScholarGoogle ScholarCross RefCross Ref
  9. Takahiko Furuya, Ryutarou Ohbuchi. 2016. Deep aggregation of local 3d geometric features for 3d model retrieval.In Proceedings of the British Machine Vision Conference(BMVC’16).York, UKGoogle ScholarGoogle Scholar
  10. Sudhakar Kumawat and Shanmuganathan Raman. 2019. Lp-3dcnn: Unveiling local phase in 3d convolutional neural networks.In Proceedings of the Conference on Computer Vision and Pattern Recognition(CVPR’19). IEEE, Long Beach, CA, USA, 4898–4907Google ScholarGoogle Scholar
  11. Yifan Feng ,Zizhao Zhang ,,Xibin Zhao , Rongrong Ji and Yue Gao. 2018. Gvcnn: Group view convolutional neural networks for 3d shape recognition. In Proceedings of the Conference on Computer Vision and Pattern Recognition(CVPR’18).IEEE, Salt Lake City, UT, USA, 264–272Google ScholarGoogle Scholar
  12. Weizhi Nie, Shu Xiang and An-an Liu. 2018. Multi-scale cnns for 3d model retrieval. Multim Tools Appl 77,17 (2018), 22953–22963. https://doi.org/10.1007/s11042-018-5641-1Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Kai Sun , Jiangshe Zhang, Junmin Liu, Ruixuan Yu and Zengjie Song. 2021. DRCNN: Dynamic routing convolutional neural network for multi-view 3D object recognition. IEEE Trans. Image Process. 30 (2021), 868-877. https://doi.org/10.1109/TIP.2020.3039378Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Weizhi Nie, Yue Zhao, Dan Song and Yue Gao. 2021 Dan: deep-attention network for 3d shape recognition. IEEE Trans. Image Process, 30 (2021), 4371-4383. https://doi.org/10.1109/TIP.2021.3071687Google ScholarGoogle ScholarCross RefCross Ref
  15. Yue Zhao, Weizhi Nie, An-An Liu, Zan Gao and Yuting Su. 2021. Svhan: Sequential view based hierarchical attention network for 3d shape recognition. In Proceedings of the Conference on Multimedia (MM’2021). ACM, China, 2130-2138. https://doi.org/10.1145/3474085.3475371Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Petar Velickovic ,Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Lio and Yoshua Bengio. 2017. Graph Attention Networks.CoRR. abs/1710.10903. (2017) http://arxiv.org/abs/1710.10903Google ScholarGoogle Scholar
  17. Joan Bruna, Wojciech Zaremba , Arthur Szlam and Yann LeCun. 2014. Spectral Networks and Locally Connected Networks on Graphs. In Proceedings of the International Conference on Learning Representations Computer Science (ICLR’14). Banff, AB, Canada. http://arxiv.org/abs/1312.6203Google ScholarGoogle Scholar
  18. Chong Mou, Jian Zhang and Zhuoyuan Wu. 2021. Dynamic Attentive Graph Learning for Image Restoration. In Proceedings of the International Conference on Computer Vision (ICCV’21). IEEE, Montreal, QC, Canada, 4308—4317. https://doi.org/10.1109/ICCV48922.2021.00429Google ScholarGoogle ScholarCross RefCross Ref
  19. Kaiming He , Xiangyu Zhang , Shaoqing Ren and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR’16) IEEE. Las Vegas, NV, USA,770-778 https://doi.org/10.1109/CVPR.2016.90Google ScholarGoogle ScholarCross RefCross Ref
  20. Relja Arandjelovic , Petr Gronat , Akihiko Torii, Tomas Pajdla and Josef Sivic. 2016. NetVLAD: CNN Architecture for Weakly Supervised Place Recognition. In Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR’16). IEEE Las Vegas, NV, USA, 5297—5307. https://doi.org/10.1109/CVPR.2016.572Google ScholarGoogle ScholarCross RefCross Ref
  21. Qilong Wang, Banggu Wu , Pengfei Zhu , Peihua Li , Wangmeng Zuo and Qinghua. 2020. ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks. In Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR’20) IEEE Seattle, WA, USA 11531—11539Google ScholarGoogle ScholarCross RefCross Ref
  22. P.University. 2015. ModelNet40 Retrieved November 20, 2020 from http://modelnet.cs.princeton. edu/Google ScholarGoogle Scholar
  23. Michael M. Kazhdan ,Thomas A. Funkhouser and Szymon Rusinkiewicz.2003. Rotation Invariant Spherical Harmonic Representation of 3D Shape Descriptors. In Proceedings of the Eurographics Symposium on Geometry Processing.ACM, Aachen, Germany,156-164. https://doi.org/10.2312/SGP/SGP03/156-165Google ScholarGoogle ScholarCross RefCross Ref
  24. Ding-Yun Chen, Xiao-Pei Tian , Yu-Te Shen and Ming Ouhyoung. 2003. On visual similarity based 3d model retrieval. Comput Graph Forum 22, 3 (2003), 223-232. https://doi.org/10.1111/1467-8659.00669Google ScholarGoogle ScholarCross RefCross Ref
  25. Haoxuan You, Yifan Feng, Rongrong Ji and Yue Gao. 2018. PVNet: A Joint Convolutional Network of Point Cloud and Multi-View for 3D Shape Recognition. In Proceedings of the Multimedia Conference (MM’18). ACM, Seoul, Republic of Korea, 1310-1318 https://doi.org/10.1145/3240508.3240702Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Haoxuan You, Yifan Feng, Xibin Zhao, Changqing Zou , Rongrong Ji and Yue Gao. 2019 PVRNet: Point-View Relation Neural Network for 3D Shape Recognition . In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI’19). Honolulu, Hawaii,USA.9119-9126. https://doi.org/10.1609/aaai.v33i01.33019119Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Weizhi Nie, Qi Liang, Yixin Wang, dXing Wei and Yuting Su. 2021. MMFN: Multimodal Information Fusion Networks for 3D Model Classification and Retrieval. ACM Trans. Multim. Comput. Commun. Appl.16,4 (2021) 131:1-131:22. https://doi.org/10.1145/3410439Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Chao Ma, Yulan Guo , Jungang Yang and Wei An. 2019. Learning Multi-View Representation With LSTM for 3-D Shape Recognition and Retrieval. IEEE Trans. Multim.21,5 (2019), 1169—1182. https://doi.org/10.1109/TMM.2018.2875512Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Xinwei He, Yang Zhou, Zhichao Zhou, Song Bai and Xiang Bai. 2018. Triplet-Center Loss for Multi-View 3D Object Retrieval. In Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR’18). IEEE, Salt Lake City, UT, USA,1945-1954Google ScholarGoogle ScholarCross RefCross Ref
  30. Zhizhong Han, Mingyang Shang , Zhenbao Liu, Chi-Man Vong, Yu-Shen Liu, Matthias Zwicke, Junwei Han and C. L. Philip Chen. 2019. SeqViews2SeqLabels: Learning 3D Global Features via Aggregating Sequential Views by RNN With Attention. IEEE Trans. Image Process, 28,2(2019) 658-672. https://doi.org/10.1109/TIP.2018.2868426Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. 3D Model Retrieval Algorithm Based on Attention and Multi-view Fusion
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          CSSE '22: Proceedings of the 5th International Conference on Computer Science and Software Engineering
          October 2022
          753 pages
          ISBN:9781450397780
          DOI:10.1145/3569966

          Copyright © 2022 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 20 December 2022

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed limited

          Acceptance Rates

          Overall Acceptance Rate33of74submissions,45%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format .

        View HTML Format