Skip to main content

A Novel Framework Based on Trace Norm Minimization for Audio Event Detection

  • Conference paper
Book cover Neural Information Processing (ICONIP 2011)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7063))

Included in the following conference series:

Abstract

In this paper, a novel framework based on trace norm minimization for audio event detection is proposed. In the framework, both the feature extraction and pattern classifier are made by solving corresponding convex optimization problem with trace norm regularization or under trace norm constraint. For feature extraction, robust principle component analysis (robust PCA) via minimizing a combination of the nuclear norm and the ℓ1-norm is used to extract matrix representation features which is robust to outliers and gross corruption for audio segments. These matrix representation features are fed to a linear classifier where the weight matrix and bias are learned by solving similar trace norm regularized problems. Experiments on real data sets indicate that this novel framework is effective and noise robust.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Lu, L.: Content analysis for audio classification and segmentation. IEEE Trans. Speech and Audio Processing 10, 504–516 (2002)

    Article  Google Scholar 

  2. Cui, R., Lu, L., Zhung, H.J., Cai, L.H.: Highlight sound effects detection in audio stream. In: Proceedings of IEEE International Conference on Multimedia and Expo, pp. 37–40 (2003)

    Google Scholar 

  3. Pradeep, K.A., Namunu, C.M., Mohan, S.K.: Audio based event detection for multimedia surveillance. In: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (2006)

    Google Scholar 

  4. Zhuang, X., Zhou, X., Huang, T.S., Hasegawa-Johnson, M.: Feature analysis and selection for acoustic event detection. In: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 17–20 (2008)

    Google Scholar 

  5. Fazel, M., Hindi, H., Boyd, S.P.: A rank minimization heuristic with application to minimum order system approximation. In: Proceedings of the American Control Conference, pp. 4734–4739 (2001)

    Google Scholar 

  6. Srebro, N., Rennie, J.D.M., Jaakkola, T.S.: Maximum-margin matrix factorization. In: Proceedings of Advances in Neural Information Processing Systems, pp. 1329–1336 (2005)

    Google Scholar 

  7. Argyriou, A., Evgeniou, T., Pontil, M.: Convex multi-task feature learning. Machine Learning 73(3), 243–272 (2008)

    Article  Google Scholar 

  8. Wright, J., Ganesh, A., Rao, S., Peng, Y., Ma, Y.: Robust principal component analysis: Exact recovery of corrupted low-rank matrices via convex optimization. In: Proceedings of Advances in Neural Information Processing Systems (2009)

    Google Scholar 

  9. Lin, Z., Chen, M., Wu, L., Ma, Y.: The augmented lagrange multiplier method for exact recovery of corrupted low-rank matrices. UIUC Technical Report (2009)

    Google Scholar 

  10. Tomioka, R., Aihara, K.: Classifying matrices with a spectral regularization. In: 24th International Conference on Machine Learning, pp. 895–902 (2007)

    Google Scholar 

  11. Toh, K., Yun, S.: An accelerated proximal gradient algorithm for nuclear norm regularized least squares problems. Pacific J. Optim. 6, 615–640 (2010)

    MathSciNet  MATH  Google Scholar 

  12. Ji, S., Ye, J.: An accelerated gradient method for trace norm minimization. In: 26th International Conference on Machine Learning, pp. 457–464 (2009)

    Google Scholar 

  13. Liu, Y.J., Sun, D., Toh, K.C.: An implementable proximal point algorithmic framework for nuclear norm minimization. Mathematical Programming, 1–38 (2009)

    Google Scholar 

  14. Jolliffe, I.T.: Principal Component Analysis. Springer Series in Statistics. Springer, Berlin (1986)

    Book  MATH  Google Scholar 

  15. Davis, S., Mermelstein, P.: Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Transactions on Acoustics, Speech and Signal Processing 28(4), 357–366 (1980)

    Article  Google Scholar 

  16. Youku, http://www.youku.com

  17. Bickel, P., Ritov, Y., Tsybakov, A.: Simultaneous analysis of Lasso and Dantzig selector. Annals of Statistics 37(4), 1705–1732 (2009)

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Shi, Z., Han, J., Zheng, T. (2011). A Novel Framework Based on Trace Norm Minimization for Audio Event Detection. In: Lu, BL., Zhang, L., Kwok, J. (eds) Neural Information Processing. ICONIP 2011. Lecture Notes in Computer Science, vol 7063. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24958-7_75

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-24958-7_75

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-24957-0

  • Online ISBN: 978-3-642-24958-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics