Abstract
Due to the wide application prospect and market value of emotion recognition, it has become an important research topic in today’s society. Among them, facial expression recognition (FER) plays an important role in expressing human emotional information. Generally, the FER classification process includes face pre-processing (face detection, alignment, etc.), which adds extra workload. To this end, detection and classification are carried out simultaneously in this paper. We first manually annotated the RAF-DB dataset. We then designed an end-to-end FER network with better performance and applied it to facial expressions called FER-YOLO. FER-YOLO is built on the basis of YOLOv3. We combine the squeeze-and-excitation (SE) module with the backbone network and assign a certain weight to each feature channel so that FER-YOLO can focus on learning prominent facial features. We also discussed the performance changes caused by the lightweight enhanced feature extraction networks. Experimental results show that the proposed FER-YOLO network is 3.03% mAP higher than YOLOv3 on the RAF-DB dataset.
Southwest Jiaotong University.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Poria, S., Cambria, E., Bajpai, R., Hussain, A.: A review of affective computing: from unimodal analysis to multimodal fusion. Inf. Fusion 37, 98–125 (2017)
Spezialetti, M., Placidi, G., Rossi, S.: Emotion recognition for human-robot interaction: recent advances and future perspectives. Front. Robot. AI 7, 145–155 (2020)
Joesph, C., Rajeswari, A., Premalatha, B., Balapriya, C.: Implementation of physiological signal based emotion recognition algorithm. In: IEEE 36th International Conference on Data Engineering (ICDE) 2020, Dallas, TX, USA, pp. 2075–2079 (2020). https://doi.org/10.1109/ICDE48307.2020.9153878
Cosentino, S., Randria, E.I.S., Lin, J.-Y., Pellegrini, T., Sessa, S., Takanishi, A.: Group emotion recognition strategies for entertainment robots. In: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2018, Madrid, Spain, pp. 813–818 (2018). https://doi.org/10.1109/IROS.2018.8593503
Li, G., Wang, Y.: Research on Leamer’s emotion recognition for intelligent education system. In: IEEE 3rd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), Chongqing, China, pp. 754–758 (2018). https://doi.org/10.1109/IAEAC.2018.8577590
Rusli, N., Sidek, S.N., Yusof, H.M., Ishak, N.I., Khalid, M., Dzulkarnain, A.A.A.: Implementation of wavelet analysis on thermal images for affective states recognition of children with autism spectrum disorder. IEEE Access 8, 120818–120834 (2020)
Zou, J., Cao, X., Zhang, S., Ge, B.: A facial expression recognition based on improved convolutional neural network. In: IEEE International Conference of Intelligent Applied Systems on Engineering (ICIASE), Fuzhou, China, pp. 301–304 (2019). https://doi.org/10.1109/ICIASE45644.2019.9074074
Singh, S., Nasoz, F.: Facial expression recognition with convolutional neural networks. In: 10th Annual Computing and Communication Workshop and Conference (CCWC), Las Vegas, NV, USA, pp. 0324–0328 (2020). https://doi.org/10.1109/CCWC47524.2020.9031283
Ma, H., Celik, T., Li, H.-C.: Lightweight attention convolutional neural network through network slimming for robust facial expression recognition. Signal Image Video Process. 1863–1711 (2021)
Mohan, K., Seal, A., Krejcar, O., Yazidi, A.: FER-net: facial expression recognition using deep neural net. Neural Comput. Appl. 33(15), 9125–9136 (2021). https://doi.org/10.1007/s00521-020-05676-y
Xie, S., Hu, H.: Facial expression recognition using hierarchical features with deep comprehensive multipatches aggregation convolutional neural networks. IEEE Trans. Multimedia 21(1), 211–220 (2019)
Ma, H., Celik, T.: FER-Net: facial expression recognition using densely connected convolutional network. Electron. Lett. 55(4), 184–186 (2019)
Redmon, J., Divvala, S., Girshick, R., Farhadi, A.: You only look once: unified, real-time object detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, pp. 779–788 (2016). https://doi.org/10.1109/CVPR.2016.91
Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, pp. 6517–6525 (2017). https://doi.org/10.1109/CVPR.2017.690
Joseph, R., Ali, F.: YOLOv3: An Incremental Improvement. arXiv:1804.02767 (2018)
Bochkovskiy, A., Wang, C.-Y., Liao, H.-Y.M.: YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv:2004.10934 (2020)
Liu, W., et al.: SSD: single shot MultiBox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
Lin, T.-Y., Goyal, P., Girshick, R., He, K., Dollár, P.: Focal loss for dense object detection. IEEE Trans. Pattern Anal. Mach. Intell. 422, 318–327 (2020). https://doi.org/10.1109/TPAMI.2018.2858826
Hu, J., Shen, L., Albanie, S., Sun, G., Wu, E.: Squeeze-and-excitation networks. IEEE Trans. Pattern Anal. Mach. Intell. 428, 2011–2023 (2020)
Mamalet, F., Garcia, C.: Simplifying ConvNets for fast learning. In: International Conference on Artificial Neural Networks (ICANN 2012), Lausanne, Switzerland, pp. 58–65 (2012). https://doi.org/10.1007/978-3-642-33266-1_8
Li, S., Deng, W., Du, J.: Reliable crowdsourcing and deep locality-preserving learning for expression recognition in the wild. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, pp. 2584–2593 (2017). https://doi.org/10.1109/CVPR.2017.277
Acknowledgements
This work was supported by Sichuan Provincial Science and Technology Projects (2019JDJQ0023).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Ma, H., Celik, T., Li, H. (2021). FER-YOLO: Detection and Classification Based on Facial Expressions. In: Peng, Y., Hu, SM., Gabbouj, M., Zhou, K., Elad, M., Xu, K. (eds) Image and Graphics. ICIG 2021. Lecture Notes in Computer Science(), vol 12888. Springer, Cham. https://doi.org/10.1007/978-3-030-87355-4_3
Download citation
DOI: https://doi.org/10.1007/978-3-030-87355-4_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-87354-7
Online ISBN: 978-3-030-87355-4
eBook Packages: Computer ScienceComputer Science (R0)