Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 553)

Abstract

Drishti is a computer vision and deep learning-based application, developed in Python, that describes a user's surroundings by generating natural language descriptions of scenes captured in real time. The primary objective of the project is to enable a visually impaired person to learn about his or her environment in real time. Digital image processing is used to generate annotations about the surroundings, and Python was selected as the implementation language. A GUI is provided for ease of use. Although the GUI is driven by a Python script, a person does not need to know the language for general usage.
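
The preview gives no implementation details, so the following is only a minimal sketch of the capture, describe, announce loop that the abstract implies. The names `describe_stream`, `captioner`, and `announce` are hypothetical and do not come from the chapter. The choice to suppress repeated captions is also an assumption, made so that a listener is not read the same sentence for every frame.

```python
from typing import Callable, Iterable, List, Optional, TypeVar

Frame = TypeVar("Frame")


def describe_stream(
    frames: Iterable[Frame],
    captioner: Callable[[Frame], str],
    announce: Callable[[str], None],
) -> List[str]:
    """Caption each frame and announce a description only when it changes.

    `captioner` stands in for a trained image-captioning model and
    `announce` for a text-to-speech or GUI output; both are assumptions.
    """
    announced: List[str] = []
    last: Optional[str] = None
    for frame in frames:
        caption = captioner(frame).strip()
        # Skip empty captions and consecutive repeats so the user is not
        # flooded with the same sentence while the scene stays the same.
        if caption and caption != last:
            announce(caption)
            announced.append(caption)
            last = caption
    return announced


if __name__ == "__main__":
    # Stand-in data: strings play the role of camera frames, and a lookup
    # table plays the role of the captioning model.
    fake_frames = ["f1", "f2", "f3", "f4"]
    fake_model = {
        "f1": "a person walking",
        "f2": "a person walking",
        "f3": "a parked car",
        "f4": "a person walking",
    }
    describe_stream(fake_frames, fake_model.__getitem__, print)
```

In a real system, `frames` would presumably come from a camera and `announce` would hand the sentence to a speech or GUI component. Keeping both as plain callables lets the loop be exercised without any hardware or trained model.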

Acknowledgements

Statement of Consent: This is to confirm that the image (Fig. 3b) in this paper, entitled “Drishti—Artificial Vision,” includes two of the authors of this paper, namely Sneh Rathore and Sahil Sharma. We hereby confirm our identity and give our consent to publish the image (Fig. 3b) in this paper.

Author information

Corresponding author

Correspondence to Sneh Rathore.

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Rathore, S., Sharma, S., Singh, L. (2019). Drishti—Artificial Vision. In: Mishra, S., Sood, Y., Tomar, A. (eds) Applications of Computing, Automation and Wireless Systems in Electrical Engineering. Lecture Notes in Electrical Engineering, vol 553. Springer, Singapore. https://doi.org/10.1007/978-981-13-6772-4_50

  • DOI: https://doi.org/10.1007/978-981-13-6772-4_50

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-6771-7

  • Online ISBN: 978-981-13-6772-4

  • eBook Packages: Engineering, Engineering (R0)
