Abstract
In a digital environment, it is feasible to integrate multimedia materials into a library collection with ease. However, it seems likely that nontextual surrogates for multimedia objects, e.g., videos, could effectively augment textual representations of those objects. In this study, five video surrogates were evaluated in relation to their usefulness and usability in accomplishing specific tasks. The surrogates (storyboards with text or audio keywords, slide shows with text or audio keywords, fast forward) were created for each of seven video segments. Ten participants, all of whom watch videos at least monthly and search for videos at least occasionally, viewed the surrogates for seven video segments and provided comments about the strengths and weaknesses of each. In addition, they performed a series of tasks (gist determination, object recognition, action recognition, and visual gist determination) with three surrogates selected from those available. No surrogate was universally judged “best,” but the fast forward surrogate garnered the most support, particularly from experienced video users. The participants expressed their understanding of video gist as composed of three components: topicality, the story of the video, and the visual gist of the video. They identified several real-world tasks for which they regularly use video collections. The viewing compaction rates used in these surrogates supported adequate performance, but users expressed a desire for more control over surrogate speed and sequencing. Further development of these surrogates is warranted by these results, as well as the development of mechanisms for surrogate display.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Borko, H., Bernier, C.: Abstracting Concepts and Methods. Academic Press, New York(1975).
Christel, M., Smith, M., Taylor, C. R., Winkler, D.: Evolving Video Skims into Useful Multimedia Abstractions. In: Proceedings of CHI’ 98: Human Factors in Computing Systems (Los Angeles, April 1998). ACM Press, New York (1998), 171–178.
Christel, M., Winkler, D., Taylor, C. R.: Improving Access to a Digital Video Library. Paper presented at the Human-Computer Interaction: INTERACT97, the 6th IFIP Conference on Human-Computer Interaction, Sydney, Australia, July 14–18, 1997.
Cohen, J.: A Coefficient of Agreement for Nominal Scales. Educational and Psychological Measurement 20 (1960), 37–46.
Cooper, W. S.: A Definition of Relevance for Information Retrieval. Information Storage & Retrieval 7 (1971), 19–37.
Ding, W.: Designing Multimodal Surrogates for Video Browsing and Retrieval. Unpublished doctoral dissertation, University of Maryland (1999).
Ding, W., Marchionini, G., Soergel, D.: Multimodal Surrogates for Video Browsing. In: Proceedings of Digital Libraries’ 99: the Fourth Annual ACM Conference on Digital Libraries (Berkeley, CA, August 1999). ACM Press, New York (1999), 85–93.
Ding, W., Marchionini, G., Tse, T.: Previewing Video Data: Browsing Key Frames at High Rates Using a Video Slide Show Interface. In: Proceedings of the International Symposium on Research, Development and Practice in Digital Libraries (Tsukuba, Japan, 1997), 151–158.
Elliot, E.: Watch, Grab, Arrange, See: Thinking with Motion Images via Streams and Collages. MSVS thesis document., MIT Media Lab, Cambridge MA (1993).
England, P., Allen, R. B., Sullivan, M., Bianchi, M., Heybey, A., Dailianas, A.: Ibrowse: The Bellcore Video Library Toolkit. In: Proceedings of the SPIE Photonics West’ 96: Electronic Imaging Science and Technology’ 96: Storage and Retrieval for Image and Video Database IV (San Jose CA, January 1996).
Goodrum, A.: Evaluation of Text-Based and Image-Based Representations for Moving Image Documents. Unpublished doctoral dissertation, University of North Texas (1997).
Goodrum, A. A.: Multidimensional Scaling of Video Surrogates. Journal of the American Society for Information Science 52 (2001), 174–182.
Heilprin, L.: Paramorphism versus Homomorphism in Information Science. In: Heilprin, L. (ed.): Toward Foundations of Information Science. Knowledge Industry Pub., White Plains NY (1985), 115–136.
Komlodi, A., Marchionini, G.: Key Frame Preview Techniques for Video Browsing. In: Proceedings of the ACM Digital Libraries Conference’ 98 (Pittsburgh, PA, June 24–26, 1998). ACM Press, New York (1998).
Li, F., Gupta, A., Sanocki, E., He, L., Rui, Y.: Browsing Digital Video. In: CHI 2000 Conference Proceedings: Human Factors in Computing Systems (The Hague, Netherlands, April 3–6, 2000). ACM Press, New York (2000), 169–176.
Marchionini, G., Geisler, G., Brunk, B.: AgileViews: A Human-Centered Framework for Interfaces to Information Spaces. In: ASIS 2000: Proceedings of the 63rd ASIS Annual Meeting, Volume 37 (Chicago, November 12–16, 2000). Information Today, Medford, NJ (2000), 271–280.
Maybury, M.: Intelligent Multimedia Information Retrieval. MIT Press, Cambridge MA (1997).
Mills, M., Cohen, J., Wong, Y.: A Magnifier Tool for Video Data. In: Proceedings of CHI’ 92: Human Factors in Computing Systems (Monterey, CA, May 3–7, 1992). ACM Press, New York (1992), 93–98.
Nielsen, J., Levy, J.: Measuring Usability: Preference vs. Performance. Communications of the ACM 37 (April 1994), 66–75.
O'Connor, B.: Access to Moving Image Documents: Background Concepts and Proposals for Surrogates for Film and Video Works. Journal of Documentation 41 (1985), 209–220.
Robertson, G., Czerwinski, M., Larson, K., Robbins, D., Thiel, D., van Dantzich, M.: Data Mountain: Using Spatial Memory for Document Management. In: Proceedings of the 11th Annual ACM Symposium on User Interface Software and Technology (San Francisco, November 1998), 153–162.
Rorvig, M. E.: A Method for Automatically Abstracting Visual Documents. Journal of the American Society for Information Science 44 (1993), 40–56.
Saracevic, T.: The Concept of “Relevance” in Information Science: A Historical Review. In: Saracevic, T. (ed.): Introduction to Information Science. Bowker, New York (1970), 111–151.
Slaughter, L., Shneiderman, B., Marchionini, G.: Comprehension and Object Recognition Capabilities for Presentations of Simultaneous Video Key Frame Surrogates. In: Research and Advanced Technology for Digital Libraries: Proceedings of the First European Conference (EDSL’ 97, Pisa, Italy, 1997), 41–54.
Smeaton, A.: Content-based Access to Digital Video: The Físchlár System and the TREC Video Track. Paper presented at MMCBIR 2001-Multimedia Content-based Indexing and Retrieval (INRIA, Rocquencourt, France, September 2001). http://www.cdvp.dcu.ie/Papers/MMCBIR2001.pdf. Last accessed January 26, 2002.
Smith, M., Kanade, T.: Video Skimming and Characterization through the Combination of Image and Language Understanding. In: Proceedings of the 1998 IEEE Workshop on Content-based Access of Image and Video Databases (Bombay, India, January 1998). IEEE, Los Alamitos CA (1998), 61–70.
Tonomura, Y., Akutsu, A., Otsuji, K., Sadakata, T.: VideoMAP and VideoSpaceIcon: Tools for Anatomizing Video Content. In: Proceedings of INTERCHI’ 93: Human Factors in Computing Systems (Amsterdam, April 1993), 131–136.
Tse, T., Marchionini, G., Ding, W., Slaughter, L., Komlodi, A.: Dynamic Key Frame Presentation Techniques for Augmenting Video Browsing. In: Proceedings of AVI’ 98: Advanced Visual Interfaces (L’Aquila, Italy, May 1998), 185–194.
Turner, J.: Determining the Subject Content of Still and Moving Image Documents for Storage and Retrieval: An Experimental Investigation. Unpublished doctoral dissertation, University of Toronto (1984).
van Dijk, T. A., Kintsch, W.: Strategies of Discourse Comprehension. Academic Press, New York (1983).
van Dijk, T. A., Kintsch, W.: Toward a Model of Text Comprehension and Production. Psychological Review 85 (1978), 363–394.
Wactlar, H., Christel, M., Gong, Y., Hauptmann, A.: Lessons Learned from Building a Terabyte Digital Video Library. Computer 32 (2, 1999), 66–73.
Yeo, B.-L., Yeung, M.: Retrieving and Visualizing Video. Communication of the ACM 40 (December 1997), 43–52.
Zhang, H. J., Low, C. Y., Smoliar, S. W.: Video Parsing and Browsing Using Compressed Data. Multimedia Tools and Applications 1 (1995), 89–111.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wildemuth, B.M. et al. (2002). Alternative Surrogates for Video Objects in a Digital Library: Users’ Perspectives on Their Relative Usability. In: Agosti, M., Thanos, C. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2002. Lecture Notes in Computer Science, vol 2458. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45747-X_36
Download citation
DOI: https://doi.org/10.1007/3-540-45747-X_36
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44178-6
Online ISBN: 978-3-540-45747-3
eBook Packages: Springer Book Archive