Abstract
From the emerging of Deep Convolution Neural Network (DCNN), the Visual Information Retrieval would have good prospects for visual features automatically extracted at high semantic levels. However, the deep features could not be robust to some challenges as one-to-many and many-to-one relationships between face identifiers and facial attributes in querying on the face identifier level. To solve these issues at the large-scale level, we proposed a face retrieval system by using the “divide and conquer” method: query by attributes instead of querying by the identifier. We used Facial Attributes in the Fast-Filter stage, after that, our proposed system would retrieve the Face Identifier from the retrieved candidates. DCNN is very useful in the facial attribute learning because of the same network architecture for multiple-attribute groups. We built the attribute learning model following the bottom-up and top-down process. The bottom-up process uses DCNNs with the corresponding face parts and the top-down process is based on our proposed Facial Attribute Ontology (FAO). FAO supports multi-task learning in DCNN, re-usability for other retrieval tasks, flexibility in intelligent queries. We experimented our proposed method on the LFWA and CelebA dataset; our system achieved the average precision at 85.68%, this result is higher than some state-of-the-art methods. In more details, we also outperformed at 25 on 40 attribute detectors. Moreover, we speeded up the retrieval process based on the multi-attribute space and the indexing method named Hierarchical K-means++. At last, on retrieval experiments, we gathered 0.79 and 0.82 MAP-score average for one attribute query in LFWA and CelebA respectively.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Truong, Q.T., Ly, N.Q.: Building the facial expressions recognition system based on RGB-D images in high performance. In: Nguyen, N.T., Trawiński, B., Fujita, H., Hong, T.-P. (eds.) ACIIDS 2016. LNCS (LNAI), vol. 9622, pp. 377–387. Springer, Heidelberg (2016). https://doi.org/10.1007/978-3-662-49390-8_37
Vo, A., Ly, N.Q.: Facial expression recognition using pyramid local phase quantization descriptor. In: Nguyen, V.-H., Le, A.-C., Huynh, V.-N. (eds.) Knowledge and Systems Engineering. AISC, vol. 326, pp. 105–115. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-11680-8_9
Le, D.D.M., Nguyen, D.T., Ly, N.Q.: Facial expressions recognition based on joint shape-texture cues in video. J. Sci. 53, 111–125 (2014)
Buu, A.L., Ly, N.Q., et al.: An individualized system for face ageing and facial expressions based on 3D practical faces data. In: ICCAIS, pp. 398–403. IEEE (2012)
Le, D.C., Ly, N.Q.: Building a 3D face reconstruction system based on 2D frontal facial image and 3D morphable model in high performance. In: IEEE-RIVF (2015)
Le Cun, Y., et al.: Handwritten digit recognition: applications of neural network chips and automatic learning. IEEE Commun. Mag. 27(11), 41–46 (1989)
Kumar, N., Belhumeur, P., Nayar, S.: FaceTracer: a search engine for large collections of images with faces. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008. LNCS, vol. 5305, pp. 340–353. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-88693-8_25
Zhang, N., et al.: PANDA: pose aligned networks for deep attribute modeling. In: CVPR (2014)
Liu, Z., et al.: Deep learning face attributes in the wild. In: ICCV (2015)
Huang, G.B., et al.: Labeled faces in the wild: a database for studying face recognition in unconstrained environments. Technical report 07-49, Amherst, October 2007
Arai, K., Barakbah, A.R.: Hierarchical K-means: an algorithm for centroids initialization for K-means. Rep. Fac. Sci. Eng. 36(1), 25–31 (2007)
Arthur, D., Vassilvitskii, S.: K-means++: the advantages of careful seeding. In: Proceedings of the 18th Annual ACM-SIAM Symposium on Discrete Algorithms (2007)
Wu, Z., et al.: Scalable face image retrieval with identity-based quantization and multireference reranking. IEEE Trans. PAMI 33(10), 1991–2001 (2011)
Chen, B.-C., Chen, C.-S., Hsu, W.H.: Cross-age reference coding for age-invariant face recognition and retrieval. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8694, pp. 768–783. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10599-4_49
Chen, B.-C., et al.: Scalable face image retrieval using attribute-enhanced sparse codewords. IEEE Trans. Multimed. 15(5), 1163–1173 (2013)
Park, U., Jain, A.K.: Face matching and retrieval using soft biometrics. IEEE Trans. Inf. Forensics Secur. 5(3), 406–415 (2010)
Han, H., Otto, C., Jain, A.K.: Age estimation from face images: human vs. machine performance. In: International Conference on Biometrics (ICB). IEEE (2013)
Moghaddam, B., Yang, M.-H.: Learning gender with support faces. IEEE Trans. Pattern Anal. Mach. Intell. 24(5), 707–711 (2002)
Schroff, F., Kalenichenko, D., Philbin, J.: FaceNet: a unified embedding for face recognition and clustering. In: 2015 CVPR (2015). https://doi.org/10.1109/cvpr.2015.7298682
Sze, V., et al.: Efficient processing of deep neural networks: a tutorial and survey. arXiv preprint arXiv:1703.09039 (2017). https://doi.org/10.1109/jproc.2017.2761740
Zhong, Y., Sullivan, J., Li, H.: Face attribute prediction using off-the-shelf CNN features. In: 2016 International Conference on Biometrics, June 2016
Zhong, Y., Sullivan, J., Li, H.: Leveraging mid-level deep representations for predicting face attributes in the wild. In: ICIP (2016). https://doi.org/10.1109/icip.2016.7532958
Zhong, Y., et al.: Transferring from face recognition to face attribute prediction through adaptive selection of off-the-shelf CNN representations. In: 23th ICPR (2016)
Zhong, Y.: Human face identification and face attribute prediction: from Gabor filtering to deep learning. Doctoral thesis, Stockholm, Swedish (2016)
Bottou, L.: Large-scale machine learning with stochastic gradient descent. In: Lechevallier, Y., Saporta, G. (eds.) Proceedings of COMPSTAT’2010, pp. 177–186. Physica-Verlag HD, Heidelberg (2010). https://doi.org/10.1007/978-3-7908-2604-3_16
Palatucci, M., Pomerleau, D. Hinton, G.E., Mitchell, T.M.: Zero-shot learning with semantic output codes. In: Advances in Neural Information Processing Systems (2009)
Liu, J., Kuipers, B., Savarese, S.: Recognizing human actions by attributes. In: IEEE Conference on Computer Vision and Pattern Recognition (2011)
Zhou, X.S., Huang, T.S.: Relevance feedback in image retrieval: a comprehensive review. Multimed. Syst. 8(6), 536–544 (2003)
Chen, B.C., et al.: Semi-supervised face image retrieval using sparse coding with identity constraint. In: Proceedings of the 19th ACM International Conference on Multimedia (2011)
Manning, C.D., et al.: Introduction to Information Retrieval, vol. 151. Cambridge University Press, Cambridge (2008)
Sidorov, G., Gelbukh, A., et al.: Soft similarity and soft cosine measure: similarity of features in vector space model. Computación y Sistemas 18(3), 491–504 (2014)
Lienhart, R., Maydt, J.: An extended set of Haar-like features for rapid object detection. In: Proceedings of the International Conference on Image Processing. IEEE (2002)
Huang, G.B., Mattar, M., et al.: Learning to align from scratch. In: NIPS (2012)
Bentley, J.L.: Multidimensional binary search trees used for associative searching. Commun. ACM 18(9), 509–517 (1975). https://doi.org/10.1145/361002.361007
Kumar, N., et al.: Describable visual attributes for face verification and image search. IEEE Trans. Pattern Anal. Mach. Intell. 33(10), 1962–1977 (2011)
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Nguyen, H.M., Ly, N.Q., Phung, T.T.T. (2018). Large-Scale Face Image Retrieval System at Attribute Level Based on Facial Attribute Ontology and Deep Neuron Network. In: Nguyen, N., Hoang, D., Hong, TP., Pham, H., Trawiński, B. (eds) Intelligent Information and Database Systems. ACIIDS 2018. Lecture Notes in Computer Science(), vol 10752. Springer, Cham. https://doi.org/10.1007/978-3-319-75420-8_51
Download citation
DOI: https://doi.org/10.1007/978-3-319-75420-8_51
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-75419-2
Online ISBN: 978-3-319-75420-8
eBook Packages: Computer ScienceComputer Science (R0)