skip to main content
research-article

Video search and indexing with reinforcement agent for interactive multimedia services

Published:22 February 2013Publication History
Skip Abstract Section

Abstract

In this study, we present a video search and indexing system based on the state support vector (SVM) network, video graph, and reinforcement agent for recognizing and organizing video events. In order to enhance the recognition performance of the state SVM network, two innovative techniques are presented: state transition correction and transition quality estimation. The classification results are also merged into the video indexing graph, which facilitates the search speed. A reinforcement algorithm with an efficient scheduling scheme significantly reduces both the power consumption and time. The experimental results show the proposed state SVM network was able to achieve a precision rate as high as 83.83% and the query results of the indexing graph reached 80% accuracy. The experiments also demonstrate the performance and feasibility of our system.

References

  1. Benini, L., Hodgson, R., and Siegel, P. 1998. System-level power estimation and optimization. In Proceedings of the International Symposium on Low Power Electronics and Design. 173--178. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Benini, L., Paleologo, G., Bogliolo, A., and de Micheli, G. 1999. Policy optimization for dynamic power management. IEEE Trans. Comput.-Aided Desi. 18, 6, 813--833. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Chen, B.-W., Wang, J.-C., and Wang, J.-F. 2009a. A novel video summarization based on mining the story-structure and semantic relations among concept entities. IEEE Trans. Multimed. 11, 2, 295--312. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Chen, C.-Y., Wang, T.-C., Wang, J.-F., and Shieh, L.-P. 2009b. SVM-based state transition framework for dynamical human behavior identification. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE Computer Society, 1933--1936. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Chung, E., Benini, L., and de Micheli, G. 1999a. Dynamic power management using adaptive learning trees. In Proceedings of the International Conference on Computer-Aided Design. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Chung, E., Benini, L., and de Micheli, G. 1999b. Dynamic power management for non-stationary service requests. In Proceedings of the Symposium on Design, Automation, and Test in Europe. 77--81. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Costa Ribeiro, C. H. 2002. A tutorial on reinforcement learning techniques. Department of Theory of Computation Technological Institute of Aeronautics São Jos e dos Campos, Brazil.Google ScholarGoogle Scholar
  8. Ganapathiraju, A., Hamaker, J. E., and Picone, J. 2004. Applications of support vector machines to speech recognition. IEEE Trans. Sig. Proc. 52, 8, 2348--2355. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Golding, R., Bosch, P., and Wilkes, J. 1996. Idleness is not sloth: HP Laboratories Tech. Rep. HPL-96--140.Google ScholarGoogle Scholar
  10. Huang, X., Acero, A., and Hon, H.-W. 2001. Spoken Language Processing: A Guide to Theory, Algorithm, and System Development. Prentice-Hall, New York, NY. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Huang, C.-L., Shih, H.-C., and Chao, C.-Y. 2006. Semantic analysis of soccer video using dynamic Bayesian network. IEEE Trans. Multimed. 8, 4, 749--760. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Irani, S., Shukla, S., and Gupta, R. 2002. Competitive analysis of dynamic power management strategies for systems with multiple power saving state. In Proceedings of the Design Automation and Test Conference Europe (DATE02). Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Li, Y., Tian, Y., Duan, L.-Y., Yang, J., Huang, T., and Gao, W. 2010. Sequence multi-labeling: A unified video annotation scheme with spatial and temporal context. IEEE Trans. Multimed. 12, 8, 814--828. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Lu, Y., Chung, E., Simunic, T., Benini, L., and de Micheli, G. 2000. Quantitative comparison of PM Algorithms. Design, Automation and Test in Europe, pp. 20--26. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Lu, Y., Simunic, T., and de Micheli, G. 1999a. Software controlled power management. In Proceedings of the 7th International Workshop on Hardware/Software Codesign. 157--161. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Paul, A., Jiang, Y. C., and Wang, J. F. 2010. Computation aware scheme for visual signal processing, J. Softw.Google ScholarGoogle Scholar
  17. Ren, Z., Krogh, B. H., and Marculescu, R. 2005. Hierarchical adaptive dynamic power management. IEEE Trans. Comput. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Shih, E., Bahl, V., and Sinclair, M. 2003. Reducing energy consumption of Wireless, Mobile Devices Using a Secondary Low-Power Channel, MIT Laboratory for Computer Science, March.Google ScholarGoogle Scholar
  19. Shukla, S. and Gupta, R. 2001. A model checking approach to evaluating system level power management for embedded systems. In Proceedings of IEEE Workshop on High Level Design Validation and Test (HLDVT 01). IEEE Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Simunic, T. 2000. Dynamic management of power consumption HP Labs, Tech. Rep.Google ScholarGoogle Scholar
  21. Simunic, T., Bennin, L., and de Micheli, G. 1999. Event-driven power management. In Proceedings of the International Symposium on System Synthesis, pp. 18--23. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Simunic, T., Benini, L., and de Micheli, G. 2000. Power management of laptop hard disk. Design, Automation, and Test in Europe, p. 736.Google ScholarGoogle Scholar
  23. Srinivasan, P. and Shi, J. 2007. Bottom-up recognition and parsing of the human body. In Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition. IEEE Computer Society, 1--8. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Sutton, R. S. and Barto, A. G. 1998. Reinforcement learning -- An introduction, MITPress, Cambridge, MA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Takahashi, M., Fujii, M., and Yagi, N. 2008. Automatic pitch type recognition from baseball broadcast videos. In Proceedings of the 10th IEEE International Symposium on Multimedia. IEEE Computer Society, 15--22. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Tang, J., Hua, X.-S., Qi, G.-J., Song, Y., and Wu, X. 2008. Video annotation based on kernel linear neighborhood propagation. IEEE Trans. Multimed. 10, 4, 620--628. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Xie, L., Xu, P., Chang, S.-F., Divakaran, A., and Sun, H. 2004. Structure analysis of soccer video with domain knowledge and hidden Markov models. Patt. Recog. Lett. 25, 7, 767--775. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Xu, G., Ma, Y.-F., Hong-Jiang, Z., and Yang, S.-Q. 2005. An HMM-based framework for video semantic analysis. IEEE Trans. Circu. Syst. Video Tech. 15, 11, 1422--1433. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Zhong, L. and Jha, N. K. 2004. Dynamic power optimization of interactive systems, In Proceedings of the IEEE International Conference on VLSI Design. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Zhong, L. and Jha, N. K. 2006. Dynamic power optimization targeting user delays in interactive systemes. IEEE Trans. Mobile Computing. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Video search and indexing with reinforcement agent for interactive multimedia services

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image ACM Transactions on Embedded Computing Systems
          ACM Transactions on Embedded Computing Systems  Volume 12, Issue 2
          Special issue on embedded systems for interactive multimedia services (ES-IMS)
          February 2013
          209 pages
          ISSN:1539-9087
          EISSN:1558-3465
          DOI:10.1145/2423636
          Issue’s Table of Contents

          Copyright © 2013 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 22 February 2013
          • Revised: 1 March 2011
          • Accepted: 1 March 2011
          • Received: 1 November 2010
          Published in tecs Volume 12, Issue 2

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader