ABSTRACT
The ability to automatically capture and index multimedia information for later perusal and review is critical to the success of future multimedia services. In this paper, we describe how to automatically generate indexes of real-time streams without requiring deep content analysis. Our techniques involve segmenting continuous audio and video into natural units and relating these units to discrete events from the multimedia application, such as user interactions, control events, and data content. In addition, we describe how to search within multimedia streams using query-based retrieval as well as visual and auditory retrieval modes. This multimodal retrieval allows for quick browsing and visual comprehension of multimedia streams. Finally, we show how our techniques apply to multimedia conference recording.
Index Terms
- Towards intelligent recognition of multimedia episodes in real-time applications