Abstract
RSD (Resource and Service Description) is a scheme for specifying resources and services in complex heterogeneous computing systems and metacomputing environments. At the system administrator level, RSD is used to specify the available system components, such as the number of nodes, their interconnection topology, CPU speeds, and available software packages. At the user level, a GUI provides a comfortable, high-level interface for specifying system requests. A textual editor can be used for defining repetitive and recursive structures. This gives service providers the necessary flexibility for fine-grained specification of system topologies, interconnection networks, and system- and software-dependent properties. All these representations are mapped onto a single, coherent internal object-oriented resource representation.
Dynamic aspects (such as network performance, availability of compute nodes, and compute node loads) are traced at runtime and included in the resource description to allow for optimal process mapping and dynamic task load balancing at the metacomputer level. This is done in a self-organizing way, with human system operators becoming involved only when new hardware or software components are installed.
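As an illustration only, the idea of a single object-oriented representation that holds both static properties and runtime measurements might be sketched as follows. This is not RSD's actual syntax or data model: the class names `Node`, `Link`, and `Resource`, their fields, and the `least_loaded` helper are all hypothetical and chosen just to make the abstract's description concrete.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One compute node: static properties plus runtime-updated state (hypothetical)."""
    name: str
    cpu_mhz: int                      # static: CPU speed
    software: frozenset = frozenset() # static: installed software packages
    load: float = 0.0                 # dynamic: updated from runtime monitoring
    available: bool = True            # dynamic: a node may drop out at runtime


@dataclass
class Link:
    """An interconnection between two named nodes (hypothetical)."""
    a: str
    b: str
    bandwidth_mbps: float             # dynamic: measured network performance


@dataclass
class Resource:
    """A system described as nodes plus their interconnection topology."""
    nodes: dict = field(default_factory=dict)
    links: list = field(default_factory=list)

    def add_node(self, node):
        self.nodes[node.name] = node

    def connect(self, a, b, bandwidth_mbps):
        self.links.append(Link(a, b, bandwidth_mbps))

    def least_loaded(self, required_software=frozenset()):
        """Return the available node with the lowest load that offers
        all the required software, or None if no node qualifies."""
        candidates = [n for n in self.nodes.values()
                      if n.available and required_software <= n.software]
        return min(candidates, key=lambda n: n.load, default=None)
```

In this sketch, a monitoring component would overwrite `load`, `available`, and `bandwidth_mbps` as measurements arrive, so that a mapping decision such as `least_loaded` always sees the current state without operator intervention.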
Copyright information
© 1998 Springer Science+Business Media New York
About this chapter
Cite this chapter
Brune, M., Gehring, J., Keller, A., Reinefeld, A. (1998). RSD — Resource and Service Description. In: Schaeffer, J. (eds) High Performance Computing Systems and Applications. The Springer International Series in Engineering and Computer Science, vol 478. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-5611-4_18
DOI: https://doi.org/10.1007/978-1-4615-5611-4_18
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-7567-8
Online ISBN: 978-1-4615-5611-4
eBook Packages: Springer Book Archive