skip to main content
10.1145/2155555.2155565acmconferencesArticle/Chapter ViewAbstractPublication PagesmmsysConference Proceedingsconference-collections
research-article

Multi-video summary and skim generation of sensor-rich videos in geo-space

Published:22 February 2012Publication History

ABSTRACT

User-generated videos have become increasingly popular in recent years. Due to advances in camera technology it is now very easy and convenient to record videos with mobile devices, such as smartphones. Here we consider an application where users collect and share a large set of videos that are related to a geographic area, say a city. Such a repository can be a great source of information for prospective tourists when they plan to visit a city and would like to get a preview of its main areas. The challenge that we address is how to automatically create a preview video summary from a large set of source videos.

The main features of our technique are that it is fully automatic and leverages meta-data sensor information which is acquired in conjunction with videos. The meta-data is collected from GPS and compass sensors and is used to describe the viewable scenes of the videos. Our method then proceeds in three steps through the analysis of the sensor data. First, we generate a single video summary. Shot boundaries are detected based on different motion types of camera movements and key frames are extracted related to motion patterns. Second, we build video skims for popular places (i.e., hotspots) aiming to provide maximal coverage of hotspot areas with minimal redundancy (per-spot multi-video summary). Finally, the individual hotspot skims are linked together to generate a pleasant video tour that visits all the popular places (multi-spot multi-video summary).

References

  1. Geovid. http://geovid.org/.Google ScholarGoogle Scholar
  2. K. Aizawa, K. Ishijima, and M. Shiina. Summarizing Wearable Video. In International Conference on Image Processing, volume 3, pages 398--401 vol.3, 2001.Google ScholarGoogle Scholar
  3. K. Aizawa, D. Tancharoen, S. Kawasaki, and T. Yamasaki. Efficient Retrieval of Life Log based on Context and Content. In ACM Workshop on Continuous Archival and Retrieval of Personal Experiences, CARPE, pages 22--31, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Y. Arase, X. Xie, T. Hara, and S. Nishio. Mining People's Trips from Large Scale Geo-tagged Photos. In ACM Multimedia, MM '10, pages 133--142, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. Arslan Ay, R. Zimmermann, and S. H. Kim. Viewable Scene Modeling for Geospatial Video Search. In ACM Multimedia, MM '08, pages 309--318, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Z. Botev, D. Kroese, and T. Taimre. Generalized Cross-entropy Methods with Applications to Rare-event Simulation and Optimization. Simulation, 83:785--806, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. Buchin, A. Driemel, M. van Kreveld, and V. Sacristán. An Algorithmic Framework for Segmenting Trajectories based on Spatio-temporal Criteria. In SIGSPATIAL International Conference on Advances in Geographic Information Systems, GIS '10, pages 202--211, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. Ekin, A. Tekalp, and R. Mehrotra. Automatic Soccer Video Analysis and Summarization. IEEE Transactions on Image Processing, 12(7):796--807, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. C. Graham. Vision and Visual Perception. 1965.Google ScholarGoogle Scholar
  10. J. Kim, H. S. Chang, K. Kang, M. Kim, J. Kim, and H. Kim. Summarization of News Video and its Description for Content-based Access. International Journal of Imaging Systems and Technology, 13(5):267--274, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  11. M. S. Lew, N. Sebe, C. Djeraba, and R. Jain. Content-based Multimedia Information Retrieval: State of the Art and Challenges. ACM Transactions on Multimedia Computing, Communications and Applications, 2:1--19, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Y. Li and B. Merialdo. Multi-video Summarization based on AV-MMR. In Int'l Workshop on Content-Based Multimedia Indexing, CBMI '10, pages 1--6, 2010.Google ScholarGoogle Scholar
  13. Y. Li and B. Merialdo. Multi-video summarization based on OB-MMR. In Int'l Workshop on Content-Based Multimedia Indexing, CBMI '11, pages 163--168, 2011.Google ScholarGoogle Scholar
  14. X. Lu, C. Wang, J.-M. Yang, Y. Pang, and L. Zhang. Photo2Trip: Generating Travel Routes from Geo-tagged Photos for Trip Planning. In ACM Multimedia, MM '10, pages 143--152, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Y. Ma, L. Lu, H. Zhang, and M. Li. A User Attention Model for Video Summarization. In ACM Multimedia, MM '02, pages 533--542, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Microsoft. Image Composite Editor Panorama Stitcher. http://research.microsoft.com/en-us/um/redmond/groups/ivm/ICE/.Google ScholarGoogle Scholar
  17. A. G. Money and H. Agius. Video Summarisation: A Conceptual Framework and Survey of the State of the Art. Journal of Visual Communication and Image Represention, 19:121--143, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. I. Otsuka, K. Nakane, A. Divakaran, K. Hatanaka, and M. Ogawa. A Highlight Scene Detection and Video Summarization System using Audio Feature for a Personal Video Recorder. IEEE Transactions on Consumer Electronics, 51(1):112--116, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. Shao, D. Jiang, M. Wang, H. Chen, and L. Yao. Multi-video Summarization using Complex Graph Clustering and Mining. Computer Science and Information Systems, 7(1):85--98, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  20. F. Shipman, A. Girgensohn, and L. Wilcox. Creating Navigable Multi-level Video Summaries. In IEEE International Conference on Multimedia and Expo, ICME '03, pages 753--756, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. D. Tjondronegoro, Y.-P. P. Chen, and B. Pham. Highlights for more Complete Sports Video Summarization. IEEE Multimedia, 11(4):22--37, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. B. T. Truong and S. Venkatesh. Video Abstraction: A Systematic Review and Classification. ACM Transactions on Multimedia Computing, Communications and Applications, 3, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. F. Wang and B. Merialdo. Multi-document Video Summarization. In IEEE International Conference on Multimedia and Expo, ICME '09, pages 1326--1329, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. C. Xu, J. Wang, K. Wan, Y. Li, and L. Duan. Live Sports Event Detection based on Broadcast Video and Web-casting Text. In ACM Multimedia, MM '06, pages 221--230, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Multi-video summary and skim generation of sensor-rich videos in geo-space

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        MMSys '12: Proceedings of the 3rd Multimedia Systems Conference
        February 2012
        247 pages
        ISBN:9781450311311
        DOI:10.1145/2155555
        • General Chair:
        • Mark Claypool,
        • Program Chair:
        • Carsten Griwodz

        Copyright © 2012 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 22 February 2012

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate176of530submissions,33%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader