skip to main content
research-article

Randomized algorithms for estimating the trace of an implicit symmetric positive semi-definite matrix

Authors Info & Claims
Published:11 April 2011Publication History
Skip Abstract Section

Abstract

We analyze the convergence of randomized trace estimators. Starting at 1989, several algorithms have been proposed for estimating the trace of a matrix by 1/MΣi=1M ziT Azi, where the zi are random vectors; different estimators use different distributions for the zis, all of which lead to E(1/MΣi=1M ziT Azi) = trace(A). These algorithms are useful in applications in which there is no explicit representation of A but rather an efficient method compute zTAz given z. Existing results only analyze the variance of the different estimators. In contrast, we analyze the number of samples M required to guarantee that with probability at least 1-δ, the relative error in the estimate is at most ϵ. We argue that such bounds are much more useful in applications than the variance. We found that these bounds rank the estimators differently than the variance; this suggests that minimum-variance estimators may not be the best.

We also make two additional contributions to this area. The first is a specialized bound for projection matrices, whose trace (rank) needs to be computed in electronic structure calculations. The second is a new estimator that uses less randomness than all the existing estimators.

References

  1. Achlioptas, D. 2001. Database-friendly random projections. In PODS '01: Proceedings of the 20th ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems. ACM, New York, 274--281. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Ailon, N., and Chazelle, B. 2006. Approximate nearest neighbors and the fast Johnson-Lindenstrauss transform. In Proceedings of the 38th Annual ACM Symposium on Theory of Computing (STOC'06). ACM, New York, 557--563. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Avron, H., Maymounkov, P., and Toledo, S. 2010. Blendenpik: Supercharging LAPACK's least-squares solver. SIAM J. Sci. Comput. 32, 3, 1217--1236.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Bai, Z., Fahey, M., and Golub, G. 1996. Some large scale matrix computation problems. J. Comput. Appl. Math 74, 71--89. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Bai, Z., Fahey, M., Golub, G., Menon, M., and Richter, E. 1998. Computing partial eigenvalue sum in electronic structure calculations. Tech. rep. SCCM-98-03, Stanford University.Google ScholarGoogle Scholar
  6. Bekas, C., Kokiopoulou, E., and Saad, Y. 2007. An estimator for the diagonal of a matrix. Appl. Numer. Math. 57, 11-12, 1214--1229. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Box, G. E. P., Hunter, W. G., and Hunter, J. S. 1978. Statistics for Experimenters: An Introduction to Design, Data Analysis, and Model Building. Wiley & Sons.Google ScholarGoogle Scholar
  8. D'Elia, M., Haber, H., and Horesh, L. 2011. Design of proper orthogonal decomposition bases by means of stochastic optimization. To be submitted.Google ScholarGoogle Scholar
  9. Drabold, D. A., and Sankey, O. F. 1993. Maximum entropy approach for linear scaling in the electronic structure problem. Phys. Rev. Lett. 70, 3631--3634.Google ScholarGoogle ScholarCross RefCross Ref
  10. Feller, W. 1971. An Introduction to Probability Theory and Its Applications, Vol. 2, 3rd. Ed. Wiley.Google ScholarGoogle Scholar
  11. Gudmundsson, T., Kenney, C. S., and Laub, A. J. 1995. Small-sample statistical estimates for matrix norms. SIAM J. Matrix Anal. Appl. 16, 3, 776--792. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Hutchinson, M. F. 1989. A stochastic estimator of the trace of the influence matrix for Laplacian smoothing splines. Comm. Stat. Simulat. Comput. 18, 1059--1076.Google ScholarGoogle ScholarCross RefCross Ref
  13. Iitaka, T., and Ebisuzaki, T. 2004. Random phase vector for calculating the trace of a large matrix. Phys. Rev. E 69, 057701--1--057701--4.Google ScholarGoogle ScholarCross RefCross Ref
  14. Janssen, A. J. E. M., van Leeuwaarden, J. S. H., and Zwart, B. 2008. Gaussian expansions and bounds for the Poisson distribution applied to the Erlang B formula. Adv. App. Prob. 40, 1, 122--143.Google ScholarGoogle ScholarCross RefCross Ref
  15. Kenney, C. S., Laub, A. J., and Reese, M. S. 1998. Statistical condition estimation for linear systems. SIAM J. Sci. Comput. 19, 2, 566--583. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Li, P., Hastie, T., and Church, K. 2007. Nonlinear estimators and tail bounds for dimension reduction in l<sub>1</sub> using Cauchy random projections. In Learning Theory, Lecture Notes in Computer Science Series, Vol. 4539. Springer, Berlin, Chap. 37, 514--529. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Silver, R. N., and R&amp;#246;der, H. 1997. Calculation of densities of states and spectral functions by Chebychev recursion and maximum entropy. Phys. Rev. E 56, 4822--4829.Google ScholarGoogle ScholarCross RefCross Ref
  18. Tsourakakis, C. E. 2008. Fast counting of triangles in large real networks without counting: Algorithms and laws. In Proceedings of the IEEE International Conference on Data Mining (ICDM'08), 608--617. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Wallace, D. L. 1959. Bounds on normal approximations to student's and the chi-square distributions. Ann. Math. Stat. 30, 4, 1121--1130.Google ScholarGoogle ScholarCross RefCross Ref
  20. Wang, L. W. 1994. Calculating the density of states and optical-absorption spectra of large quantum systems by the plane-wave moments method. Phys. Rev. B 49, 10154--10158.Google ScholarGoogle ScholarCross RefCross Ref
  21. Wheeler, J. C., and Blumstein, C. 1972. Modified moments for harmonic solids. Phys. Rev. B 6, 4380--4382.Google ScholarGoogle ScholarCross RefCross Ref
  22. Wong, M. N., Hickernell, F. J., and Liu, K. I. 2004. Computing the trace of a function of a sparse matrix via Hadamard-like sampling. Tech rep. 377(7/04), Hong Kong Baptist University.Google ScholarGoogle Scholar

Index Terms

  1. Randomized algorithms for estimating the trace of an implicit symmetric positive semi-definite matrix

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image Journal of the ACM
          Journal of the ACM  Volume 58, Issue 2
          April 2011
          102 pages
          ISSN:0004-5411
          EISSN:1557-735X
          DOI:10.1145/1944345
          Issue’s Table of Contents

          Copyright © 2011 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 11 April 2011
          • Accepted: 1 October 2010
          • Revised: 1 September 2010
          • Received: 1 April 2010
          Published in jacm Volume 58, Issue 2

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader