skip to main content
10.1145/1383529.1383545acmconferencesArticle/Chapter ViewAbstractPublication PageshpdcConference Proceedingsconference-collections
research-article

Efficient scheduling of scientific workflows in a high performance computing cluster

Published:23 June 2008Publication History

ABSTRACT

The scientific computing community, especially academia is clearly in need of technology to handle and organize the 1-100+ Terabyte datasets coming from computer simulations and scientific instrumentation. In this paper we briefly describe GrayWulf, an exemplar cluster for data intensive applications using SQL Server and HPC Clusters. One of the key software components of GrayWulf is Trident, a scientific workflow workbench that performs automatic scheduling of workflows across the cluster. We examine the challenges of scheduling workflows on GrayWulf, algorithms to improve performance, and present early results from applying Trident to schedule data loading workflows on GrayWulf for an actual e-Science project

References

  1. Project Neptune http://www.neptune.washington.edu/.Google ScholarGoogle Scholar
  2. Swiss Experiment, http://www.swiss-experiment.ch.Google ScholarGoogle Scholar
  3. Life Under Your Feet http://lifeunderyourfeet.org.Google ScholarGoogle Scholar
  4. Bell, J. Gray and A. Szalay, Petascale Computational Systems. IEEE Computer 39(1): 110--112 (2006). Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Microsoft Windows Workflow Foundation (WinWF) http://en.wikipedia.org/wiki/Windows_Workflow_Foundation.Google ScholarGoogle Scholar
  6. Technical Computing Group of Microsoft Research http://www.microsoft.com/science.Google ScholarGoogle Scholar
  7. Microsoft Silverlight http://silverlight.net/.Google ScholarGoogle Scholar
  8. Pan-STARRS http://pan-starrs.ifa.hawaii.edu/public/.Google ScholarGoogle Scholar

Index Terms

  1. Efficient scheduling of scientific workflows in a high performance computing cluster

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          CLADE '08: Proceedings of the 6th international workshop on Challenges of large applications in distributed environments
          June 2008
          74 pages
          ISBN:9781605581569
          DOI:10.1145/1383529

          Copyright © 2008 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 23 June 2008

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Upcoming Conference

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader