Drishti—Artificial Vision

Rathore, Sneh; Sharma, Sahil; Singh, Lisha

doi:10.1007/978-981-13-6772-4_50

Sneh Rathore³⁷,
Sahil Sharma³⁷ &
Lisha Singh³⁷

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 553))

1559 Accesses

Abstract

Drishti is a computer vision and deep learning-based application developed using Python programming language for the sole purpose of envisioning the real-time environment by generating the natural language description of the real-time captured scenes. The primary objective of this project is to enable a visually impaired person to know about his or her environment in real time. In this, digital image processing is used to generate the annotations about the surroundings. To express the features, Python has been selected as an interacting language. For the ease of a user, GUI has been provided for their usage. Though the GUI has been operated and guided by Python script, there is no need for a person to know the language, for general usage.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Pan P, Xu Z, Yang Y, Wu F, Zhuang Y (2015) Hierarchical recurrent neural encoder for video representation with application to captioning. CoRR, abs/1511.03476
Google Scholar
Chen X, Zitnick CL (2014) Learning a recurrent visual representation for image caption generation. CoRR, abs/1411.5654
Google Scholar
Farhadi A, Hejrati M, Sadeghi MA, Young P, Rashtchian C, Hockenmaier J, Forsyth D (2010) Every picture tells a story: generating sentences from images. In: Proceedings of the 11th European conference on computer vision: part IV, ECCV’10. Springer-Verlag, Berlin, Heidelberg, pp 15–29
Chapter Google Scholar
He K, Zhang X, Ren S, Sun J (2015) Deep residual learning for image recognition. CoRR, abs/1512.03385
Google Scholar
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Bartlett PL, Pereira FCN, Burges CJC, Bottou L, Weinberger KQ (eds) NIPS, pp 1106–1114
Google Scholar
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9:1735–1780
Article Google Scholar
Venugopalan S, Rohrbach M, Donahue J, Mooney R, Darrell T, Saenko K (2015) Sequence to sequence—video to text. CoRR, abs/1505.00487
Google Scholar
Venugopalan S, Xu H, Donahue J, Rohrbach M, Mooney R, Saenko K (2014) Translating videos to natural language using deep recurrent neural networks. CoRR, abs/1412.4729
Google Scholar
Karpathy A, Johnson J, Fei-Fei L (2015) Visualizing and understanding recurrent networks. CoRR, abs/1506.02078
Google Scholar
Ng JY, Hausknecht MJ, Vijayanarasimhan S, Vinyals O, Monga R, Toderici G (2015) Beyond short snippets: deep networks for video classification. CoRR, abs/1503.08909
Google Scholar

Download references

Acknowledgements

Statement of Consent: It is to confirm that the image (Fig. 3b) in this paper entitled “Drishti—Artificial Vision,” includes the two authors of this paper, namely Sneh Rathore and Sahil Sharma. We hereby confirm our identity and provide the consent to publish the image (Fig. 3b) in this paper.

Author information

Authors and Affiliations

Department of Information Technology, HMR Institute of Technology, New Delhi, India
Sneh Rathore, Sahil Sharma & Lisha Singh

Authors

Sneh Rathore
View author publications
You can also search for this author in PubMed Google Scholar
Sahil Sharma
View author publications
You can also search for this author in PubMed Google Scholar
Lisha Singh
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sneh Rathore .

Editor information

Editors and Affiliations

Department of Electrical Engineering, Indian Institute of Technology Delhi , New Delhi, Delhi, India
Sukumar Mishra
Department of Electrical Engineering , National Institute of Technology, Hamirpur, Himachal Pradesh, India
Yog Raj Sood
JSS Academy of Technical Education, Noida, Uttar Pradesh, India
Anuradha Tomar

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rathore, S., Sharma, S., Singh, L. (2019). Drishti—Artificial Vision. In: Mishra, S., Sood, Y., Tomar, A. (eds) Applications of Computing, Automation and Wireless Systems in Electrical Engineering. Lecture Notes in Electrical Engineering, vol 553. Springer, Singapore. https://doi.org/10.1007/978-981-13-6772-4_50

Download citation

DOI: https://doi.org/10.1007/978-981-13-6772-4_50
Published: 01 June 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-6771-7
Online ISBN: 978-981-13-6772-4
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics