Object Detection in Heritage Archives Using a Human-in-Loop Concept

Kasturi, Surya; Shenfield, Alex; Roast, Chris; Page, Danny Le; Broome, Alice

doi:10.1007/978-3-031-47508-5_14

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 1453)

Included in the following conference series: UK Workshop on Computational Intelligence

Abstract

Object detection has become common in computer vision and is considered essential for numerous applications. The field has undergone significant development, and its methods can be broadly classified into two categories: traditional machine learning methods that employ diverse computer vision techniques, and deep learning methods. This paper proposes a methodology that incorporates the human-in-loop feedback concept to enhance the object detection capabilities of pre-trained deep learning models. These deep learning models were developed using a custom humanities and social science dataset obtained from the British Online Archives collections database.

Author information

Authors and Affiliations

Sheffield Hallam University, Sheffield, UK
Surya Kasturi, Alex Shenfield & Chris Roast
British Online Archives, Leeds, UK
Danny Le Page & Alice Broome

Corresponding author

Correspondence to Surya Kasturi.

Editor information

Editors and Affiliations

Aston University, Birmingham, UK
Nitin Naik
Cardiff Metropolitan University, Cardiff, UK
Paul Jenkins
Aston University, Birmingham, UK
Paul Grace
Northumbria University, Newcastle upon Tyne, UK
Longzhi Yang
Devi Ahilya University, Indore, India
Shaligram Prajapat

About this paper

Cite this paper

Kasturi, S., Shenfield, A., Roast, C., Page, D.L., Broome, A. (2024). Object Detection in Heritage Archives Using a Human-in-Loop Concept. In: Naik, N., Jenkins, P., Grace, P., Yang, L., Prajapat, S. (eds) Advances in Computational Intelligence Systems. UKCI 2023. Advances in Intelligent Systems and Computing, vol 1453. Springer, Cham. https://doi.org/10.1007/978-3-031-47508-5_14

DOI: https://doi.org/10.1007/978-3-031-47508-5_14
Published: 01 February 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-47507-8
Online ISBN: 978-3-031-47508-5
eBook Packages: Intelligent Technologies and Robotics (R0)
