research-article

Multi-video summary and skim generation of sensor-rich videos in geo-space

Authors:
Ying Zhang

National University of Singapore, Singapore

National University of Singapore, Singapore
View Profile

,
Guanfeng Wang

National University of Singapore, Singapore

National University of Singapore, Singapore
View Profile

,
Beomjoo Seo

National University of Singapore, Singapore

National University of Singapore, Singapore
View Profile

,
Roger Zimmermann

National University of Singapore, Singapore

National University of Singapore, Singapore
View Profile

MMSys '12: Proceedings of the 3rd Multimedia Systems ConferenceFebruary 2012Pages 53–64https://doi.org/10.1145/2155555.2155565

Published:22 February 2012Publication History

MMSys '12: Proceedings of the 3rd Multimedia Systems Conference

Pages 53–64

ABSTRACT

User-generated videos have become increasingly popular in recent years. Due to advances in camera technology it is now very easy and convenient to record videos with mobile devices, such as smartphones. Here we consider an application where users collect and share a large set of videos that are related to a geographic area, say a city. Such a repository can be a great source of information for prospective tourists when they plan to visit a city and would like to get a preview of its main areas. The challenge that we address is how to automatically create a preview video summary from a large set of source videos.

The main features of our technique are that it is fully automatic and leverages meta-data sensor information which is acquired in conjunction with videos. The meta-data is collected from GPS and compass sensors and is used to describe the viewable scenes of the videos. Our method then proceeds in three steps through the analysis of the sensor data. First, we generate a single video summary. Shot boundaries are detected based on different motion types of camera movements and key frames are extracted related to motion patterns. Second, we build video skims for popular places (i.e., hotspots) aiming to provide maximal coverage of hotspot areas with minimal redundancy (per-spot multi-video summary). Finally, the individual hotspot skims are linked together to generate a pleasant video tour that visits all the popular places (multi-spot multi-video summary).

References

Geovid. http://geovid.org/.Google Scholar
K. Aizawa, K. Ishijima, and M. Shiina. Summarizing Wearable Video. In International Conference on Image Processing, volume 3, pages 398--401 vol.3, 2001.Google Scholar
K. Aizawa, D. Tancharoen, S. Kawasaki, and T. Yamasaki. Efficient Retrieval of Life Log based on Context and Content. In ACM Workshop on Continuous Archival and Retrieval of Personal Experiences, CARPE, pages 22--31, 2004. Google ScholarDigital Library
Y. Arase, X. Xie, T. Hara, and S. Nishio. Mining People's Trips from Large Scale Geo-tagged Photos. In ACM Multimedia, MM '10, pages 133--142, 2010. Google ScholarDigital Library
S. Arslan Ay, R. Zimmermann, and S. H. Kim. Viewable Scene Modeling for Geospatial Video Search. In ACM Multimedia, MM '08, pages 309--318, 2008. Google ScholarDigital Library
Z. Botev, D. Kroese, and T. Taimre. Generalized Cross-entropy Methods with Applications to Rare-event Simulation and Optimization. Simulation, 83:785--806, 2007. Google ScholarDigital Library
M. Buchin, A. Driemel, M. van Kreveld, and V. Sacristán. An Algorithmic Framework for Segmenting Trajectories based on Spatio-temporal Criteria. In SIGSPATIAL International Conference on Advances in Geographic Information Systems, GIS '10, pages 202--211, 2010. Google ScholarDigital Library
A. Ekin, A. Tekalp, and R. Mehrotra. Automatic Soccer Video Analysis and Summarization. IEEE Transactions on Image Processing, 12(7):796--807, 2003. Google ScholarDigital Library
C. Graham. Vision and Visual Perception. 1965.Google Scholar
J. Kim, H. S. Chang, K. Kang, M. Kim, J. Kim, and H. Kim. Summarization of News Video and its Description for Content-based Access. International Journal of Imaging Systems and Technology, 13(5):267--274, 2003.Google ScholarCross Ref
M. S. Lew, N. Sebe, C. Djeraba, and R. Jain. Content-based Multimedia Information Retrieval: State of the Art and Challenges. ACM Transactions on Multimedia Computing, Communications and Applications, 2:1--19, 2006. Google ScholarDigital Library
Y. Li and B. Merialdo. Multi-video Summarization based on AV-MMR. In Int'l Workshop on Content-Based Multimedia Indexing, CBMI '10, pages 1--6, 2010.Google Scholar
Y. Li and B. Merialdo. Multi-video summarization based on OB-MMR. In Int'l Workshop on Content-Based Multimedia Indexing, CBMI '11, pages 163--168, 2011.Google Scholar
X. Lu, C. Wang, J.-M. Yang, Y. Pang, and L. Zhang. Photo2Trip: Generating Travel Routes from Geo-tagged Photos for Trip Planning. In ACM Multimedia, MM '10, pages 143--152, 2010. Google ScholarDigital Library
Y. Ma, L. Lu, H. Zhang, and M. Li. A User Attention Model for Video Summarization. In ACM Multimedia, MM '02, pages 533--542, 2002. Google ScholarDigital Library
Microsoft. Image Composite Editor Panorama Stitcher. http://research.microsoft.com/en-us/um/redmond/groups/ivm/ICE/.Google Scholar
A. G. Money and H. Agius. Video Summarisation: A Conceptual Framework and Survey of the State of the Art. Journal of Visual Communication and Image Represention, 19:121--143, 2008. Google ScholarDigital Library
I. Otsuka, K. Nakane, A. Divakaran, K. Hatanaka, and M. Ogawa. A Highlight Scene Detection and Video Summarization System using Audio Feature for a Personal Video Recorder. IEEE Transactions on Consumer Electronics, 51(1):112--116, 2005. Google ScholarDigital Library
J. Shao, D. Jiang, M. Wang, H. Chen, and L. Yao. Multi-video Summarization using Complex Graph Clustering and Mining. Computer Science and Information Systems, 7(1):85--98, 2010.Google ScholarCross Ref
F. Shipman, A. Girgensohn, and L. Wilcox. Creating Navigable Multi-level Video Summaries. In IEEE International Conference on Multimedia and Expo, ICME '03, pages 753--756, 2003. Google ScholarDigital Library
D. Tjondronegoro, Y.-P. P. Chen, and B. Pham. Highlights for more Complete Sports Video Summarization. IEEE Multimedia, 11(4):22--37, 2004. Google ScholarDigital Library
B. T. Truong and S. Venkatesh. Video Abstraction: A Systematic Review and Classification. ACM Transactions on Multimedia Computing, Communications and Applications, 3, 2007. Google ScholarDigital Library
F. Wang and B. Merialdo. Multi-document Video Summarization. In IEEE International Conference on Multimedia and Expo, ICME '09, pages 1326--1329, 2009. Google ScholarDigital Library
C. Xu, J. Wang, K. Wan, Y. Li, and L. Duan. Live Sports Event Detection based on Broadcast Video and Web-casting Text. In ACM Multimedia, MM '06, pages 221--230, 2006. Google ScholarDigital Library

Index Terms

Multi-video summary and skim generation of sensor-rich videos in geo-space
1. Human-centered computing
  1. Collaborative and social computing
    1. Collaborative and social computing systems and tools
2. Information systems
  1. World Wide Web

Recommendations

Points of Interest Detection from Multiple Sensor-Rich Videos in Geo-Space
MM '14: Proceedings of the 22nd ACM international conference on Multimedia

Recently, the popularity of user generated videos has highlighted efficient video indexing and browsing as an urgent problem. Points of interest (POI) detection is a technique to address this issue by establishing the implicit relationship among ...
Read More
Brief and high-interest video summary generation: evaluating the AT&T labs rushes summarizations
TVS '08: Proceedings of the 2nd ACM TRECVid Video Summarization Workshop

Video summarization is essential for the user to understand the main theme of video sequences in a short period, especially when the volume of the video is huge and the content is highly redundant. In this paper, we present a video summarization system, ...
Read More
DVS: a dynamic multi-video summarization system of sensor-rich videos in geo-space
MM '12: Proceedings of the 20th ACM international conference on Multimedia

It is now very easy to produce user generated videos (UGV) due to progress in camera and recording technologies on mobile devices, such as smartphones. Additionally, the ubiquitous, built-in sensors in digital devices can greatly enrich these videos ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
MMSys '12: Proceedings of the 3rd Multimedia Systems Conference
February 2012
247 pages
ISBN:9781450311311
DOI:10.1145/2155555
General Chair:
Mark Claypool
Worcester Polytechnical Institute, MA
,
Program Chair:
Carsten Griwodz
Simula Research Laboratory and University of Oslo, Norway
Copyright © 2012 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 22 February 2012
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
geo-tagging
key frame extraction
sensor data mining
video skim
video summarization
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate176of530submissions,33%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 20
  Total Citations
  View Citations
- 387
  Total Downloads
- Downloads (Last 12 months)2
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Multi-video summary and skim generation of sensor-rich videos in geo-space

MMSys '12: Proceedings of the 3rd Multimedia Systems Conference

ABSTRACT

References

Cited By

Index Terms

Recommendations

Points of Interest Detection from Multiple Sensor-Rich Videos in Geo-Space

Brief and high-interest video summary generation: evaluating the AT&T labs rushes summarizations

DVS: a dynamic multi-video summarization system of sensor-rich videos in geo-space

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Multi-video summary and skim generation of sensor-rich videos in geo-space

MMSys '12: Proceedings of the 3rd Multimedia Systems Conference

ABSTRACT

References

Cited By

Index Terms

Recommendations

Points of Interest Detection from Multiple Sensor-Rich Videos in Geo-Space

Brief and high-interest video summary generation: evaluating the AT&T labs rushes summarizations

DVS: a dynamic multi-video summarization system of sensor-rich videos in geo-space

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media