Abstract
Drishti is a computer vision and deep learning-based application developed using Python programming language for the sole purpose of envisioning the real-time environment by generating the natural language description of the real-time captured scenes. The primary objective of this project is to enable a visually impaired person to know about his or her environment in real time. In this, digital image processing is used to generate the annotations about the surroundings. To express the features, Python has been selected as an interacting language. For the ease of a user, GUI has been provided for their usage. Though the GUI has been operated and guided by Python script, there is no need for a person to know the language, for general usage.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Pan P, Xu Z, Yang Y, Wu F, Zhuang Y (2015) Hierarchical recurrent neural encoder for video representation with application to captioning. CoRR, abs/1511.03476
Chen X, Zitnick CL (2014) Learning a recurrent visual representation for image caption generation. CoRR, abs/1411.5654
Farhadi A, Hejrati M, Sadeghi MA, Young P, Rashtchian C, Hockenmaier J, Forsyth D (2010) Every picture tells a story: generating sentences from images. In: Proceedings of the 11th European conference on computer vision: part IV, ECCV’10. Springer-Verlag, Berlin, Heidelberg, pp 15–29
He K, Zhang X, Ren S, Sun J (2015) Deep residual learning for image recognition. CoRR, abs/1512.03385
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Bartlett PL, Pereira FCN, Burges CJC, Bottou L, Weinberger KQ (eds) NIPS, pp 1106–1114
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9:1735–1780
Venugopalan S, Rohrbach M, Donahue J, Mooney R, Darrell T, Saenko K (2015) Sequence to sequence—video to text. CoRR, abs/1505.00487
Venugopalan S, Xu H, Donahue J, Rohrbach M, Mooney R, Saenko K (2014) Translating videos to natural language using deep recurrent neural networks. CoRR, abs/1412.4729
Karpathy A, Johnson J, Fei-Fei L (2015) Visualizing and understanding recurrent networks. CoRR, abs/1506.02078
Ng JY, Hausknecht MJ, Vijayanarasimhan S, Vinyals O, Monga R, Toderici G (2015) Beyond short snippets: deep networks for video classification. CoRR, abs/1503.08909
Acknowledgements
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Rathore, S., Sharma, S., Singh, L. (2019). Drishti—Artificial Vision. In: Mishra, S., Sood, Y., Tomar, A. (eds) Applications of Computing, Automation and Wireless Systems in Electrical Engineering. Lecture Notes in Electrical Engineering, vol 553. Springer, Singapore. https://doi.org/10.1007/978-981-13-6772-4_50
Download citation
DOI: https://doi.org/10.1007/978-981-13-6772-4_50
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-6771-7
Online ISBN: 978-981-13-6772-4
eBook Packages: EngineeringEngineering (R0)