skip to main content
10.1145/281035.281048acmconferencesArticle/Chapter ViewAbstractPublication PagesmetricsConference Proceedingsconference-collections
Article
Free Access

Searching for the sorting record: experiences in tuning NOW-Sort

Published:01 August 1998Publication History
First page image

References

  1. 1.A. Acharya, M. Uysal, R. Bennett, A. Mendelson, M. Beynon, J. K. Hollingsworth, J. Saltz, and A. Sussman. Tuning the performance of I/0 intensive parallel applications. In Proceedings of the Fourth Workshop on Input#Output in Parallel and Distributed Systems, pages 15-27, Philadelphia, May 1996. ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. 2.R.C. Agarwal. A Super Scalar Sort Algorithm for RISC Processors. In 1996 ACM SIGMOD Conference, pages 240-246, June 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. 3.T. E. Anderson, D. E. Culler, D. A. Patterson, and The NOW Team. A Case for NOW (Networks of Workstations). IEEE Micro, February 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. 4.T. E. Anderson and E. Lazowska. Quartz: A Tool for Tuning Parallel Program Performance. In Proceedings of the 1989 ACM SIGMETRICS and PERFORMANCE Conference on Measurement and Modeling of Computer Systems, pages 115-125, May 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. 5.A.C. Arpaci-Dusseau, R. H. Arpaci-Dusseau, D. E. Culler, J. M. Hellerstein, and D. A. Patterson. High-Performance Sorting on Networks of Workstations. In SIGMOD '97, May 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. 6.R.H. Arpaci-Dusseau, A. C. Arpaci-Dusseau, D. E. Culler, J. M. Hellerstein, and D. A. Patterson. The Architectural Costs of Streaming I/O: A Comparison of Workstations, Clusters, and SMPs. In HPCA '98, February 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. 7.N. Boden, D. Cohen, R. E. Felderman, A. Kulawik, and C. Seitz. Myrinet: A Gigabit-per-second Local Area Network. IEEE Micro, February 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. 8.J. Chapin, S. Herrod, M. Rosenblum, and A. Gupta. Memory System Performance of UNIX on CC-NUMA Multiprocessors. In 1995 ACM SIGMETRICS/Performance ConJerence, pages 1- 13, May 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. 9.B. Cmelik and D. Keppel. Shade: A Fast Instruction-Set Simulator For Execution Profiling. In Proceedings of the 1994 ACM SIGMETRICS Conference, pages 128-37, May 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. 10.D. Culler, A. Dusseau, S. Goldstein, A. Krishnamurthy, S. Lumetta, T. von Eicken, and K. Yelick. Parallel Programming in Split-C. In Supercomputing '93, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. 11.D. E. Culler, L. T. Liu, R. P. Martin, and C. O. Yoshikawa. LogP Performance Assessment of Fast Network Interfaces. IEEE Micro, 2/I 996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. 12.DEC. DECChip 21064 RISC Microprocessor Preliminary Data Sheet. Technical report, Digital Equipment Corporation, 1992.Google ScholarGoogle Scholar
  13. 13.A. et. al. A Measure of Transaction Processing Power. Datarnation, 31(7):112-118, 1985. Also in Readings in Database Systems, M.H. Stonebraker ed., Morgan Kaufmann, San Mateo, 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. 14.D. E Ghormley, D. Petrou, S. H. Rodrigues, A. M. Vahdat, and T. E. Anderson. A Global Layer Unix for a Network of Workstations. To appear in Software Practice and Experience, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. 15.J.K. Hollingsworth. Finding Bottlenecks in Large-scale Parallel Programs. PhD thesis, University of Wisconsin, Aug. 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. 16.J. K. Hollingsworth and E. L. Miller. Using Content-Derived Names for Configuration Management. In 1997ACMSymposium on Software Reusibility, Boston, May 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. 17.S. Kleiman, J. Voll, J. Eykholt, A. Shivalingiah, D. Williams, M. Smith, S. Barton, and G. Skinner. Symmetric Multiprocessing in Solaris 2.0. In Proceedings of COMPCON Spring '92, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. 18.A. R. Lebeck and D. A. Wood. Cache Profiling and the SPEC Benchmarks: A Case Study. IEEE COMPUTER, pages 15-26, October 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. 19.A. M. Mainwaring. Active Message Application Programming Interface and Communication Subsystem Organization. Master's thesis, University of California, Berkeley, 1995.Google ScholarGoogle Scholar
  20. 20.M. Martonosi, D. W. Clark, and M. Mesarina. The SHRIMP Hardware Performance Monitor: Design and Applications. In Proceedings of 1996 SIGMETRICS Symposium on Parallel and Distributed Tools (SPDT), February 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. 21.M. Martonosi, A. Gupta, and T. E. Anderson. MemSpy: Analyzing Memory System Bottlenecks in Programs. In Proceedings of the 1992 ACM SIGMETRICS and PERFORMANCE Conference on Measurementand Modeling of Computer Systems, pages 1-12, May 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. 22.T. Mathisen. Pentium Secrets. Byte, pages 19 I- 192, July 1994.Google ScholarGoogle Scholar
  23. 23.R. V. Meter. Observing the Effects of Multi-Zone Disks. In Proceedings of the 1997 USENIX Conference, Jan. 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. 24.B.P. Miller, M. D. Callaghan, J. M. Cargille, J. K. Hollingsworth, R. B. Irvin, K. L. Karavanic, K. Kunchithapadam, and T. Newhall. The Paradyn Parallel Performance Measurement Tools. IEEE Computer, 28(11 ), 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. 25.C. Nyberg, T. Barclay, Z. Cvetanovic, J. Gray, and D. Lomet. AlphaSort: A RISC Machine Sort. In 1994 ACM SIGMOD Conference, May 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. 26.S. Perl and W. E. Weihl. Performance Assertion Checking. In Proceedings of the 14th ACM Symposium on Operating Systems Principles, pages 134-45, December 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. 27.S. E. Perl and R. L. Sites. Studies of Windows NT Performance Using Dynamic Execution Traces. in OSDI 2, pages 169-184, October 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. 28.D. A. Reed, C. L. Elford, T. Madhyastha, W. H. Scullin, R. A. Aydt, and E. Smirni. I/O, Performance Analysis, and Performance Data Immersion. In Proceedings of MASCOTS '96, pages 5-16, February 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. 29.M. Rosenblum, E. Bugnion, S. Devine, and S. A. Herrod. Using the SimOS Machine Simulator to Study Complex Computer Systems. ACM Transactions on Modelling and Computer Simulation (TOMACS), January 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. 30.R. H. Saavedra-Barrera. CPU Performance Evaluation and Execution #me Prediction Using Narrow Spectrum Benchmarking. PhD thesis, U.C. Berkeley, Computer Science Division, February 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. 31.M. Stonebraker. Operating System Support for Database Management. Communications of the ACM, 24(7):412-418, July 1981. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. 32.A. Sweeney, D. Doucette, W. Hu, C. Anderson, M. Nishimoto, and G. Peck. Scalability in the XFS File System. In Proceedings of the USENIX 1996 Annual Technical Conference, Jan. 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. 33.M. Tremblay, D. Greenley, and K. Normoyle. The Design of the Microarchitecture of UltraSPARC-I. Proceedings of the IEEE, 83( 12): 1653-63, December 1995.Google ScholarGoogle ScholarCross RefCross Ref
  34. 34.T. von Eicken, D. E. Culler, S. C. Goldstein, and K. E. Schauser. Active Messages: a Mechanism for Integrated Communication and Computation. In Proceedings of the 19th Annual Symposium on Computer Architecture, Gold Coast, Australia, May 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Searching for the sorting record: experiences in tuning NOW-Sort

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          SPDT '98: Proceedings of the SIGMETRICS symposium on Parallel and distributed tools
          August 1998
          162 pages
          ISBN:1581130015
          DOI:10.1145/281035

          Copyright © 1998 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 1 August 1998

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • Article

          Acceptance Rates

          SPDT '98 Paper Acceptance Rate14of41submissions,34%Overall Acceptance Rate14of41submissions,34%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader