Skip to main content
Log in

Next-LSTM: a novel LSTM-based image captioning technique

  • ORIGINAL ARTICLE
  • Published:
International Journal of System Assurance Engineering and Management Aims and scope Submit manuscript

Abstract

Recently, image captioning has evolved into an immensely popular area in the field of Computer Vision. Research in this area is active and various Machine learning-based image captioning models have been proposed in the literature. It strives to generate natural language sentences in order to describe the salient parts of a given image. The main challenge with the existing approaches is effectively extracting image features to generate adequate image captions. Further, there is a need to improve the generalizability of the results on large and diverse datasets. In the current paper, a novel method, namely Next-LSTM is proposed for image captioning. It first extracts the image features using ResNeXt. It is a powerful convolution neural network based model that is adopted for the first time in the image captioning domain. Later, it applies a Long-short term memory network on the extracted features to generate accurate captions for the images. The proposed framework is then evaluated on the benchmark Flickr-8k dataset on Accuracy and BLEU Score. The performance of the proposed framework is also compared to the state-of-the-art approaches, and it outperforms the existing approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

Download references

Funding

The authors did not receive funding from any organization for the submitted work.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Priya Singh.

Ethics declarations

Conflict of interest

The authors confirm that there are no known conflicts of interest associated with this publication and there has been no financial gains for this work that could have influenced its outcome.

Human and/or animals participants

None of the authors conducted any experiments with human participants or animals for this paper.

Informed consent

None of the authors conducted any investigations involving human subjects or animals for this research work.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Singh, P., Kumar, C. & Kumar, A. Next-LSTM: a novel LSTM-based image captioning technique. Int J Syst Assur Eng Manag 14, 1492–1503 (2023). https://doi.org/10.1007/s13198-023-01956-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13198-023-01956-7

Keywords

Navigation