research-article

Quadruplet Networks for Sketch-Based Image Retrieval

Authors:
Omar Seddati

UMONS, Mons, Belgium

UMONS, Mons, Belgium
View Profile

,
Stéphane Dupont

UMONS, Mons, Belgium

UMONS, Mons, Belgium
View Profile

,
Saïd Mahmoudi

UMONS, Mons, Belgium

UMONS, Mons, Belgium
View Profile

ICMR '17: Proceedings of the 2017 ACM on International Conference on Multimedia RetrievalJune 2017Pages 184–191https://doi.org/10.1145/3078971.3078985

Published:06 June 2017Publication History

ICMR '17: Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval

Pages 184–191

ABSTRACT

Freehand sketches are a simple and powerful tool for communication. They are easily recognized across cultures and suitable for various applications. In this paper, we use deep convolutional neural networks (ConvNets) to address sketch-based image retrieval (SBIR). We first train our ConvNets on sketch and image object recognition in a large scale benchmark for SBIR (the sketchy database). We then conduct a comprehensive study of ConvNets features for SBIR, using a kNN similarity search paradigm in the ConvNet feature space. In contrast to recent SBIR works, we propose a new architecture the quadruplet networks which enhance ConvNet features for SBIR. This new architecture enables ConvNets to extract more robust global and local features. We evaluate our approach on three large scale datasets. Our quadruplet networks outperform previous state-of-the-art on two of them by a significant margin and gives competitive results on the third. Our system achieves a recall of 42.16% (at k=1) for the sketchy database (more than 5% improvement), a Kendal score of 43.28Τ_b on the TU-Berlin SBIR benchmark (close to 6Τ_b improvement) and a mean average precision (MAP) of 32.16% on Flickr15k (a category level SBIR benchmark).

References

Artem Babenko, Anton Slesarev, Alexandr Chigorin, and Victor Lempitsky. 2014. Neural codes for image retrieval. In European conference on computer vision. Springer, 584--599.Google ScholarCross Ref
Artem Babenko, Anton Slesarev, Alexandr Chigorin, and Victor Lempitsky. 2014. Neural codes for image retrieval. In European conference on computer vision. Springer, 584--599.Google ScholarCross Ref
Serge Belongie, Jitendra Malik, and Jan Puzicha. 2002. Shape matching and object recognition using shape contexts. IEEE transactions on pattern analysis and machine intelligence 24, 4 (2002), 509--522. Google ScholarDigital Library
Sreyasee Das Bhattacharjee, Junsong Yuan, Weixiang Hong, and Xiang Ruan. 2016. Query Adaptive Instance Search using Object Sketches. In Proceedings of the 2016 ACM on Multimedia Conference. ACM, 1306--1315. Google ScholarDigital Library
Konstantinos Bozas and Ebroul Izquierdo. 2012. Large scale sketch based image retrieval using patch hashing. In International Symposium on Visual Computing. Springer, 210--219.Google ScholarCross Ref
Tu Bui and John Collomosse. 2015. Scalable sketch-based image retrieval using color gradient features. In Proceedings of the IEEE International Conference on Computer Vision Workshops. 1--8. Google ScholarDigital Library
Tu Bui, Leonardo Ribeiro, Moacir Ponti, and John Collomosse. 2016. Generali- sation and Sharing in Triplet Convnets for Sketch based Visual Search. arXiv preprint arXiv:1611.05301 (2016).Google Scholar
Xiaochun Cao, Hua Zhang, Si Liu, Xiaojie Guo, and Liang Lin. 2013. Sym-fish: A symmetry-aware flip invariant sketch histogram shape descriptor. In Proceedings of the IEEE International Conference on Computer Vision. 313--320. Google ScholarDigital Library
Ronan Collobert, Koray Kavukcuoglu, and Clément Farabet. 2011. Torch7: A matlab-like environment for machine learning. In BigLearn, NIPS Workshop.Google Scholar
Navneet Dalal and Bill Triggs. 2005. Histograms of oriented gradients for human detection. In Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, Vol. 1. IEEE, 886--893. Google ScholarDigital Library
Aishwarya Deore and BL Gunjal. 2016. Advanced Sketch Based Image Retrieval System Using Object Boundary Selection Algorithm. (2016).Google Scholar
Mathias Eitz, James Hays, and Marc Alexa. 2012. How do humans sketch objects? ACM Trans. Graph. 31, 4 (2012), 44--1. Google ScholarDigital Library
Mathias Eitz, Kristian Hildebrand, Tamy Boubekeur, and Marc Alexa. 2010. An evaluation of descriptors for large-scale image retrieval from sketched feature lines. Computers & Graphics 34, 5 (2010), 482--498. Google ScholarDigital Library
Mathias Eitz, Kristian Hildebrand, Tamy Boubekeur, and Marc Alexa. 2011. Sketch-based image retrieval: Benchmark and bag-of-features descriptors. IEEE transactions on visualization and computer graphics 17, 11 (2011), 1624--1636. Google ScholarDigital Library
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 770--778.Google ScholarCross Ref
Rui Hu and John Collomosse. 2013. A performance evaluation of gradient field hog descriptor for sketch based image retrieval. Computer Vision and Image Understanding 117, 7 (2013), 790--806. Google ScholarDigital Library
Cheng Jin, Zheming Wang, Tianhao Zhang, Qinen Zhu, and Yuejie Zhang. 2015. A Novel Visual-Region-Descriptor-based Approach to Sketch-based Image Re- trieval. In Proceedings of the 5th ACM on International Conference on Multimedia Retrieval. ACM, 267--274. Google ScholarDigital Library
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. Imagenet classifica- tion with deep convolutional neural networks. In Advances in neural information processing systems. 1097--1105. Google ScholarDigital Library
Joseph J LaViola Jr and Robert C Zeleznik. 2007. MathPad 2: a system for the creation and exploration of mathematical sketches. In ACM SIGGRAPH 2007 courses. ACM, 46. Google ScholarDigital Library
Qiang Li, Yahong Han, and Jianwu Dang. 2016. Sketch4Image: a novel framework for sketch-based image retrieval based on product quantization with coding residuals. Multimedia Tools and Applications 75, 5 (2016), 2419--2434. Google ScholarDigital Library
Yi Li, Yi-Zhe Song, and Shaogang Gong. 2013. Sketch Recognition by Ensemble Matching of Structured Features.. In BMVC, Vol. 1. 2.Google Scholar
David G Lowe. 2004. Distinctive image features from scale-invariant keypoints. International journal of computer vision 60, 2 (2004), 91--110. Google ScholarDigital Library
Jianwei Niu, Jun Ma, Jie Lu, Xuefeng Liu, and Zeyu Zhu. 2017. M-SBIR: An Improved Sketch-Based Image Retrieval Method Using Visual Word Mapping. In International Conference on Multimedia Modeling. Springer, 257--268.Google Scholar
Tom Y Ouyang and Randall Davis. 2011. ChemInk: a natural real-time recognition system for chemical drawings. In Proceedings of the 16th international conference on Intelligent user interfaces. ACM, 267--276. Google ScholarDigital Library
Yonggang Qi, Yi-Zhe Song, Tao Xiang, Honggang Zhang, Timothy Hospedales, Yi Li, and Jun Guo. 2015. Making better use of edges via perceptual grouping. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1856--1865.Google ScholarCross Ref
Yonggang Qi, Yi-Zhe Song, Honggang Zhang, and Jun Liu. 2016. Sketch-based image retrieval via Siamese convolutional neural network. In Image Processing (ICIP), 2016 IEEE International Conference on. IEEE, 2460--2464.Google ScholarCross Ref
Patsorn Sangkloy, Nathan Burnell, Cusuh Ham, and James Hays. 2016. The sketchy database: learning to retrieve badly drawn bunnies. ACM Transactions on Graphics (TOG) 35, 4 (2016), 119. Google ScholarDigital Library
Ravi Kiran Sarvadevabhatla and R Venkatesh Babu. 2015. Freehand sketch recognition using deep features. arXiv preprint arXiv:1502.00254 (2015).Google Scholar
Ravi Kiran Sarvadevabhatla, Jogendra Kundu, and others. 2016. Enabling My Robot To Play Pictionary: Recurrent Neural Networks For Sketch Recognition. In Proceedings of the 2016 ACM on Multimedia Conference. ACM, 247--251. Google ScholarDigital Library
Rosália G Schneider and Tinne Tuytelaars. 2014. Sketch classification and classification-driven analysis using fisher vectors. ACM Transactions on Graphics (TOG) 33, 6 (2014), 174. Google ScholarDigital Library
Florian Schroff, Dmitry Kalenichenko, and James Philbin. 2015. Facenet: A unified embedding for face recognition and clustering. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 815--823.Google ScholarCross Ref
Omar Seddati, Stephane Dupont, and Saïd Mahmoudi. 2015. Deepsketch: deep convolutional neural networks for sketch recognition and similarity search. In Content-Based Multimedia Indexing (CBMI), 2015 13th International Workshop on. IEEE, 1--6.Google ScholarCross Ref
Omar Seddati, Stephane Dupont, and Saïd Mahmoudi. 2016. DeepSketch 2: Deep convolutional neural networks for partial sketch recognition. In Content-Based Multimedia Indexing (CBMI), 2016 14th International Workshop on. IEEE, 1--6.Google ScholarCross Ref
Omar Seddati, Stéphane Dupont, and Saïd Mahmoudi. 2016. DeepSketch2Image: Deep Convolutional Neural Networks for Partial Sketch Recognition and Image Retrieval. In Proceedings of the 2016 ACM on Multimedia Conference. ACM, 739--741. Google ScholarDigital Library
Eli Shechtman and Michal Irani. 2007. Matching local self-similarities across images and videos. In Computer Vision and Pattern Recognition, 2007. CVPR'07. IEEE Conference on. IEEE, 1--8.Google ScholarCross Ref
Josef Sivic, Andrew Zisserman, and others. 2003. Video google: A text retrieval approach to object matching in videos.. In iccv, Vol. 2. 1470--1477. Google ScholarDigital Library
Ivan E Sutherland. 1964. Sketchpad a man-machine graphical communication system. Transactions of the Society for Computer Simulation 2, 5 (1964), R--3.Google Scholar
B Szántó, P Pozsegovics, Z Vámossy, and Sz Sergyan. 2011. Sketch4match Content-based image retrieval system using sketches. In Applied Machine Intelligence and Informatics (SAMI), 2011 IEEE 9th International Symposium on. IEEE, 183--188.Google ScholarCross Ref
Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. 2015. Going deeper with convolutions. In Proceedings of the IEEE Conference on Com- puter Vision and Pattern Recognition. 1--9.Google ScholarCross Ref
Ji Wan, Dayong Wang, Steven Chu Hong Hoi, Pengcheng Wu, Jianke Zhu, Yongdong Zhang, and Jintao Li. 2014. Deep learning for content-based image retrieval: A comprehensive study. In Proceedings of the 22nd ACM international conference on Multimedia. ACM, 157--166. Google ScholarDigital Library
Yuxin Wang, Miao Yu, Qi Jia, and He Guo. 2011. Query by sketch: An asymmetric sketch-vs-image retrieval system. In Image and Signal Processing (CISP), 2011 4th International Congress on, Vol. 3. IEEE, 1368--1372.Google ScholarCross Ref
Yongxin Yang and Timothy M Hospedales. 2015. Deep neural networks for sketch recognition. arXiv preprint arXiv: 1501.07873 (2015).Google Scholar
Kemal Tugrul Yesilbek, Cansu Sen, Serike Cakmak, and T Metin Sezgin. 2015. SVM-based sketch recognition: which hyperparameter interval to try?. In Proceedings of the workshop on Sketch-Based Interfaces and Modeling. Eurographics Association, 117--121. Google ScholarDigital Library
Qian Yu, Yongxin Yang, Yi-Zhe Song, Tao Xiang, and Timothy Hospedales. 2015. Sketch-a-net that beats humans. arXiv preprint arXiv:1501.07873 (2015).Google Scholar
Hua Zhang, Si Liu, Changqing Zhang, Wenqi Ren, Rui Wang, and Xiaochun Cao. 2016. Sketchnet: Sketch classification with web images. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1105--1113.Google ScholarCross Ref
Rong Zhou, Liuli Chen, and Liqing Zhang. 2012. Sketch-based image retrieval on a large scale database. In Proceedings of the 20th ACM international conference on Multimedia. ACM, 973--976. Google ScholarDigital Library

Index Terms

Quadruplet Networks for Sketch-Based Image Retrieval
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision representations
        Image representations
  2. Machine learning
    1. Machine learning approaches
      1. Neural networks

Recommendations

Sketch-based Image Retrieval using Generative Adversarial Networks
MM '17: Proceedings of the 25th ACM international conference on Multimedia

For sketch-based image retrieval (SBIR), we propose a generative adversarial network trained on a large number of sketches and their corresponding real images. To imitate human search process, we attempt to match candidate images with theimaginary image ...
Read More
DeepSketch 3

Freehand sketches are a simple and powerful tool for communication. They are easily recognized across cultures and suitable for various applications. In this paper, we use deep convolutional neural networks (ConvNets), state-of-the-art in the field of ...
Read More
Sketch-based image retrieval using keyshapes

Although sketch based image retrieval (SBIR) is still a young research area, there are many applications capable of exploiting this retrieval paradigm, such as web searching and pattern detection. Moreover, nowadays drawing a simple sketch query turns ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
ICMR '17: Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval
June 2017
524 pages
ISBN:9781450347013
DOI:10.1145/3078971
General Chairs:
Bogdan Ionescu
University Politehnica of Bucharest, Romania
,
Nicu Sebe
University of Trento, Italy
,
Program Chairs:
Jiashi Feng
National University of Singapore, Singapore
,
Martha Larson
Radboud University & Delft University of Technology, The Netherlands
,
Rainer Lienhart
University of Augsburg, Germany
,
Cees Snoek
University of Amsterdam & Qualcomm Research Netherlands, The Netherlands
Copyright © 2017 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 6 June 2017
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
convnets
metric learning
quadruplet networks
sketch recognition
sketch-based image retrieval
triplet networks
Qualifiers
- research-article
Conference

Acceptance Rates
ICMR '17 Paper Acceptance Rate33of95submissions,35%Overall Acceptance Rate254of830submissions,31%
More
Upcoming Conference
ICMR '24

Sponsor:

sigmm

International Conference on Multimedia Retrieval

June 10 - 14, 2024

Phuket , Thailand
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 21
  Total Citations
  View Citations
- 439
  Total Downloads
- Downloads (Last 12 months)14
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Quadruplet Networks for Sketch-Based Image Retrieval

ICMR '17: Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval

ABSTRACT

References

Cited By

Index Terms

Recommendations

Sketch-based Image Retrieval using Generative Adversarial Networks

DeepSketch 3

Sketch-based image retrieval using keyshapes