skip to main content
article
Free Access

The P2 algorithm for dynamic calculation of quantiles and histograms without storing observations

Published:01 October 1985Publication History
Skip Abstract Section

Abstract

A heuristic algorithm is proposed for dynamic calculation of the median and other quantiles. The estimates are produced dynamically as the observations are generated. The observations are not stored; therefore, the algorithm has a very small and fixed storage requirement regardless of the number of observations. This makes it ideal for implementing in a quantile chip that can be used in industrial controllers and recorders. The algorithm is further extended to histogram plotting. The accuracy of the algorithm is analyzed.

References

  1. 1 Buchholz, W. File organization and addressing. IBM Syst. 1. 2, (196% 66-111.Google ScholarGoogle Scholar
  2. 2 Knott, G.D. Hashing functions. Cornput. 1. 18. 3 (1975). 265-276.Google ScholarGoogle Scholar
  3. 3 Knuth. D.E. The Art of Computer Programming. Vol. 3, Sorting and Searching. Addison-Wesley, Reading, Mass., 1973. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. 4 Pearson, ES. and Hartley, H.O. Biometrikn, Tables for Stotisticinns. Cambridge University Press, Cambridge, 1972.Google ScholarGoogle Scholar
  5. 5 Peterson, W.W. Addressing for random access storage. IBM I. Res. Develop. 1. 2 (Apr. 1957). 130-146.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. 6 Van der Pool, J.A. Optimum storage allocation for initial loading of a file. IBM J. Res. Develop. 16 (Nov. 1972), 579-586.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. 7 Vitter, J.S. Implementations for coalesced hashing. Commun. ACM 25, 12 (Dec. 1962). 911-926. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. The P2 algorithm for dynamic calculation of quantiles and histograms without storing observations

              Recommendations

              Reviews

              Leonard Francis Zettel

              .abstract A heuristic algorithm is proposed for dynamic calculation of the median and other quantiles. . . . [T]he algorithm has a very small and fixed storage requirement regardless of the number of observations. . . . The algorithm is further extended to histogram plotting. . . . — From the Authors' Abstract The algorithm looks interesting and is presented in sufficient detail so that one could implement it. It requires the storage of ten numbers for the calculation of a quantile; thus, it would probably save storage on large samples of observations, since conventional algorithms require the equivalent to access the data sorted from lowest to highest. The analysis that accompanies the algorithm is best described as a lick, a promise, and a bit of arm waving. There is no formal analysis of execution behavior. Nothing is proven about the statistical behavior of the estimators that the algorithm produces. Results of Monte Carlo runs of 50 samples are given for various cases. Similar work presented in the Journal of the American Statistical Association would typically have sample sizes of 10,000.

              Access critical reviews of Computing literature here

              Become a reviewer for Computing Reviews.

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in

              Full Access

              • Published in

                cover image Communications of the ACM
                Communications of the ACM  Volume 28, Issue 10
                Oct. 1985
                64 pages
                ISSN:0001-0782
                EISSN:1557-7317
                DOI:10.1145/4372
                Issue’s Table of Contents

                Copyright © 1985 ACM

                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 1 October 1985

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • article

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader