Abstract
Next-generation problem-solving environments (PSEs) promise significant advances over those now available. They will span scientific disciplines and incorporate collaboration capabilities. They will host feature-detection and other agents, allow data mining and pedigree tracking, and provide access from a wide range of devices. Fundamental changes in PSE architecture are required to realize these and other PSE goals. This paper focuses specifically on issues related to data management and recommends an approach based on open, metadata-driven repositories with loosely defined, dynamic schemas. Benefits of this approach are discussed, and the redesign of the Extensible Computational Chemistry Environment's (Ecce) data storage architecture to use such a repository is described, based on the distributed authoring and versioning (DAV) standard. The suitability of DAV for scientific data, the mapping of the Ecce schema to DAV, and promising initial results are presented.
References
S. Gallopoulos, E. Houstis and J.R. Rice, Problem-solving environments for computational science, IEEE Computational Science & Engineering (Summer 1994) 11-23.
J.R. Rice and R.F. Boisvert, From scientific software libraries to problem-solving environments, IEEE Computational Science & Engineering (Fall 1996) 44-53
Molecular Science Software Suite, http://www.emsl.pnl.gov:2080/ mscf/about/descr_ms3.html
D.A. Dixon, T.H. Dunning, M. Dupuis, D.F. Feller, D.K. Gracio, R.J. Harrison, J.A. Nichols and K.L. Schuchardt, Computational chemistry in the environmental molecular sciences laboratory, in: High-Performance Computing (Plenum, 1999).
D.R. Jones, T.L. Keller, K.L. Schuchardt, H.L. Taylor and D.K. Gracio, Extensible computational chemistry environment data centered framework for scientific research, in: Domain-Specific Application Frameworks: Manufacturing, Networking, Distributed Systems, and Software Development, Vol. 3, No. 0-471-332801 (1999) ch. 24.
M. Atkinson, F. Bancilhon, D. DeWitt, K. Dittrich, D. Maier and S. Zdonik, The object-oriented database system manifesto, in: Proceedings of the 1st International Conference on Deductive and Object-Oriented Databases, Kyoto, Japan (December 1989) pp. 223-240.
M. Stonebraker, L.A. Rowe, B.G. Lindsay, J. Gray, M.J. Carey, M.L. Brodie, P.A. Bernstein and D. Beech, Third-generation database system manifesto - the committee for advanced DBMS function, Computer Standards and Interfaces 13 (1991) 41–54; also SIGMOD Record 19(3) (September 1990) 31-44.
External Review Committee Report on the Extensible Computational Chemistry Environment (January 1996).
M.J. Carey and D.J. DeWitt, Of objects and databases: A decade of turmoil, in: Proceedings of the 22nd VLDB Conference, Mumbai [Bombay], India (September 1996) pp. 3-14
R.A. Whiteside, E.J. Friedman-Hill and R.J. Detry, PRE: A framework for enterprise integration, http://daytona.ca.sandia.gov/pre/ s-docs/Information/HICCS.html
RFC 2616 Hypertext Transfer Protocol - HTTP/1.1, http://andew2. andew.cmu.edu/rfc/rfc2616.html
XML Specification, http://www.w3.org/TR/REC-xml
RFC 2518 HTTP Extensions for Distributed Authoring - WEBDAV, http://andrew2.andrew.cmu.edu/rfc/rfc2518.html
R.T. Fielding, E.J. Whitehead, Jr., K.M. Anderson, G.A. Bolcer, P. Oreizy and R.N. Taylor, Web-based development of complex information products, Communications of the ACM 41(8) (August 1998) 84–92.
S. Reddy, D. Lowry, S. Reddy, R. Henderson, J. Davis and A. Babich, DAV searching & locating, http://www.webdav.org/dasl/protocol/draftdasl-protocol-00.html
WebDAV Ordered Collections Protocol, http://www.ics.uci.edu/pub/ ietf/webdav/collection/draft-ietf-webdav-ordering-protocol-02.txt
Goals for Web Versioning, http://www.webdav.org/deltav/goals/draftietf-webdav-version-goals-01.txt
P.M. Rust, H.S. Rzepa, M. Write and S. Zara, A universal approach to web-based chemistry using XML and CML, Chemical Communications (2000) pp. 1471-1472.
Math Markup Language, http://www.w3.org/TR/REC-MathML/
Extensible Scientific Interchange Language, http://www.cacr.Caltech. edu/SDA/xsil/
Slide, http://jakarta.apache.org/slide/index.html
Xythos, http://www.xythos.com/
Oracle, http://www.oracle.com
Tamino, http://www.softwareag.com/tamino/
WebDAV, http://www.webdav.org
DBM Comparisons, http://www.rz.uni-hohenheim.de/anw/prg/perl/ nmanual/lib/AnyDBM_File.html
DAV Explorer, http://www.ics.uci.edu/~webdav/
WebDAV mod_dav, http://www.webdav.org/mod_dav/
Document Object Model (DOM) Level 2 Core Specification, http:// www.w3.org/TR/DOM-Level-2-Core/
Simple API for XML, http://www.megginson.com/SAX
GridFTP: Protocol Extensions to FTP for the Grid. W. Allcock, J. Bester, J. Breshnahan, A. Chervenak, L. Liming and S. Tuecke, Internet Draft. March 2001, http://www.gridforum.org
Unified Modeling Language, http://www.omg.org/uml
Protein Data Bank Format, http://www.rcsb.org/pdb/docs/format/ pdbguide2.2/guide2.2_frame.html
Global GridForum, http://www.gridforum.org
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Schuchardt, K., Myers, J. & Stephan, E. A Web-Based Data Architecture for Problem-Solving Environments: Application of Distributed Authoring and Versioning to the Extensible Computational Chemistry Environment. Cluster Computing 5, 287–296 (2002). https://doi.org/10.1023/A:1015625205311
Issue Date:
DOI: https://doi.org/10.1023/A:1015625205311