Online Appearance Manifold Learning for Video Classification and Clustering

Yang, Li; Wang, Xiaokun

doi:10.1007/978-3-319-42108-7_43

Li Yang²² &
Xiaokun Wang²²

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9787))

Included in the following conference series:

International Conference on Computational Science and Its Applications

1531 Accesses

Abstract

Video classification and clustering are key techniques in multimedia applications such as video segmentation and recognition. This paper investigates the application of incremental manifold learning algorithms to directly learn nonlinear relationships among video frames. Video frame classification and clustering are performed to the projected data in an intrinsic latent space. This approach has avoided partitioning video frames into arbitrary groups. It works even when the input video frames are under-sampled or unevenly distributed. Experiments show that video classification and clustering give better results in the latent space than in the original high dimensional space.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Available at http://www.ee.surrey.ac.uk/personal/a.hilton/research.html.
2.
Available at http://amp.ece.cmu.edu/projects/FaceAuthentication/Default.htm.
3.
Available at http://www.cs.toronto.edu/~roweis/data.html.
4.
Silhouette value is a measure of how similar a data point is to data points in its own cluster versus data points in other clusters. Let \(a_i\) denote the average distance of the i-th point to all other points in the same cluster. Let \(b_i\) denote the minimum of average distances of the i-th point to all points in other clusters, that is, the average of distances to all points in the next closest cluster. The i-th point’s silhouette value is defined as \(s_i = (b_i - a_i)/\max (a_i, b_i)\). Silhouette values range from \({-}1\) to +1.

References

Law, M.H.C., Jain, A.K.: Incremental nonlinear dimensionality reduction by manifold learning. IEEE Trans. Pattern Anal. Mach. Intell. 28(3), 377–391 (2006)
Article Google Scholar
Lee, K.C., Kriegman, D.: Online learning of probabilistic appearance manifolds for video-based recognition and tracking. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Washington, DC, vol. 1, pp. 852–859 (2005)
Google Scholar
Roach, M., Mason, J., Xu, L., Stentiford, F.: Recent trends in video analysis : a taxonomy of video classification problems. In: Proceedings of the International Conference on Internet and Multimedia Systems and Applications, Honolulu, HI, pp. 864–871, August 2003
Google Scholar
Seung, H.S., Lee, D.D.: The manifold ways of perception. Science 290(5500), 2268–2269 (2000)
Article Google Scholar
Tenenbaum, J.B., de Silva, V., Langford, J.C.: A global geometric framework for nonlinear dimensionality reduction. Science 290, 2319–2323 (2000)
Article Google Scholar
Yang, L.: Building k-connected neighborhood graphs for isometric data embedding. IEEE Trans. Pattern Anal. Mach. Intell. 28(5), 827–831 (2006)
Article Google Scholar
Yeo, B.L., Liu, B.: Rapid scene analysis on compressed videos. IEEE Trans. Circ. Syst. Video Technol. 5(6), 533–544 (1995)
Article Google Scholar
Yeung, M., Yeo, B.L.: Time-constrained clustering for segmentation of video into story units. In: Proceedings of the International Conference on Pattern Recognition, Vienna, Austria, pp. 375–380, August 1996
Google Scholar
Yeung, M., Yeo, B.L., Liu, B.: Extracting story units from long programs for video browsing and navigation. In: Proceedings of the International Conference on Multimedia Computing and Systems, Hiroshima, Japan, pp. 296–305, June 1996
Google Scholar
Zhang, H.J., Low, C.Y., Smoliar, S.W.: Video parsing and browsing using compressed data. Multimedia Tools Appl. 1(1), 89–111 (1995)
Article Google Scholar
Zhao, D., Yang, L.: Incremental isometric embedding of high dimensional data using connected neighborhood graphs. IEEE Trans. Pattern Anal. Mach. Intell. 31(1), 86–98 (2009)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Western Michigan University, Kalamazoo, MI, USA
Li Yang & Xiaokun Wang

Authors

Li Yang
View author publications
You can also search for this author in PubMed Google Scholar
Xiaokun Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Li Yang .

Editor information

Editors and Affiliations

University of Perugia , Perugia, Italy
Osvaldo Gervasi
University of Basilicata , Potenza, Italy
Beniamino Murgante
Covenant University , Ota, Nigeria
Sanjay Misra
University of Minho , Braga, Portugal
Ana Maria A.C. Rocha
Polytechnic University , Bari, Italy
Carmelo M. Torre
Monash University , Clayton, Victoria, Australia
David Taniar
Kyushu Sangyo University , Fukuoka, Japan
Bernady O. Apduhan
Saint Petersburg State University , Saint Petersburg, Russia
Elena Stankova
Beijing University of Posts & Telecommunication , Beijing, China
Shangguang Wang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yang, L., Wang, X. (2016). Online Appearance Manifold Learning for Video Classification and Clustering. In: Gervasi, O., et al. Computational Science and Its Applications – ICCSA 2016. ICCSA 2016. Lecture Notes in Computer Science(), vol 9787. Springer, Cham. https://doi.org/10.1007/978-3-319-42108-7_43

Download citation

DOI: https://doi.org/10.1007/978-3-319-42108-7_43
Published: 12 July 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42107-0
Online ISBN: 978-3-319-42108-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics