Recovering High-Level Structure of Software Systems Using a Minimum Description Length Principle

Lutz, Rudi

doi:10.1007/3-540-45750-X_8

Conference paper
First Online: 01 January 2002

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2464)

Abstract

In [12] a system was described for finding good hierarchical decompositions of complex systems represented as collections of nodes and links, using a genetic algorithm, with an information theoretic fitness function (representing complexity) derived from a minimum description length principle. This paper describes the application of this approach to the problem of reverse engineering the high-level structure of software systems.
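
To make the idea of a description-length fitness concrete, the following is a minimal sketch in Python. It is not the encoding used in the paper or in [12], which this page does not reproduce: the function name `description_length`, the flat (single-level) partition, and the bit costs are all assumptions chosen for illustration. The sketch charges bits for naming each node's module and for naming the endpoints of each link, so that partitions which keep most links inside small modules get a shorter description.

```python
import math
from itertools import combinations


def description_length(edges, modules):
    """Toy two-part code length, in bits, of a graph under a flat partition.

    Hypothetical encoding for illustration only (not the paper's):
    edges   -- iterable of (u, v) node pairs
    modules -- list of sets of nodes; each node is in exactly one set
    """
    n = sum(len(m) for m in modules)   # total number of nodes
    k = len(modules)                   # number of modules
    owner = {node: i for i, m in enumerate(modules) for node in m}

    # Model cost: state which module every node belongs to.
    bits = n * math.log2(k)

    for u, v in edges:
        bits += 1.0  # one flag bit: internal or cross-module link
        if owner[u] == owner[v]:
            size = len(modules[owner[u]])
            # Internal link: name the module, then two nodes inside it.
            bits += math.log2(k) + 2 * math.log2(size)
        else:
            # Cross-module link: name both endpoints among all nodes.
            bits += 2 * math.log2(n)
    return bits


# Two 4-node cliques joined by a single link.
edges = (list(combinations(range(0, 4), 2))
         + list(combinations(range(4, 8), 2))
         + [(3, 4)])
good = [{0, 1, 2, 3}, {4, 5, 6, 7}]   # modules follow the cliques
bad = [{0, 1, 4, 5}, {2, 3, 6, 7}]    # modules cut across the cliques
```

Under these assumed costs the clique-aligned partition `good` has a shorter description than `bad`, which is the kind of ordering a search procedure such as a genetic algorithm could exploit by treating the bit count as a fitness to minimise.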

References

[1] Briand, L.C., Morasca, S., and Basili, V.R. (1996) Property-based software engineering measurement: Refining the additivity properties. IEEE Transactions on Software Engineering, 22(1):68–86.
[2] Collins, R. and Jefferson, D. (1991) Selection in massively parallel genetic algorithms. Proceedings of the Fourth International Conference on Genetic Algorithms, ICGA-91, Belew, R.K. and Booker, L.B. (eds.), Morgan Kaufmann.
[3] Doval, D., Mancoridis, S., and Mitchell, B.S. (1999) Automatic Clustering of Software Systems using a Genetic Algorithm. IEEE Proceedings of the 1999 International Conference on Software Tools and Engineering Practice (STEP’99).
[4] Glover, F. (1989) Tabu Search-Part I. ORSA Journal on Computing, Vol. 1, No. 3, pp. 190–206.
[5] Goldberg, D.E. (1989) Genetic Algorithms in Search, Optimization, and Machine Learning. Addison-Wesley.
[6] Harman, M., Hierons, R., and Proctor, M. (2002) A New Representation and Crossover Operator for Search-Based Optimization of Software Modularization. Submitted to GECCO-2002.
[7] Holland, J.H. (1975) Adaptation in Natural and Artificial Systems. Now published by MIT Press.
[8] Hutchens, D., and Basili, R. (1985) System Structure Analysis: Clustering with Data Bindings. IEEE Transactions on Software Engineering, SE-11(8):749–757.
[9] Kirkpatrick, S., Gelatt Jr., C.D., Vecchi, M.P. (1983) Optimization by Simulated Annealing. Science, 220, 4598, 671–680.
[10] Koza, J.R. (1992) Genetic Programming: On the Programming of Computers by Means of Natural Selection. MIT Press.
[11] Li, M. and Vitanyi, P. (1997) An Introduction to Kolmogorov Complexity Theory and Its Applications. Springer-Verlag.
[12] Lutz, R. (2001) Evolving Good Hierarchical Decompositions of Complex Systems. Journal of Systems Architecture, 47, pp. 613–634.
[13] Mancoridis, S., Mitchell, B.S., Rorres, C., Chen, Y., Gansner, E.R. (1998) Using automatic clustering to produce high-level system organizations of source code. In International Workshop on Program Comprehension (IWPC’98), IEEE Computer Society Press, Los Alamitos, California, USA, pp. 45–53.
[14] McIlhagga, M., Husbands, P., and Ives, R. (1996) A comparison of simulated annealing, dispatching rules and a coevolutionary distributed genetic algorithm as optimization techniques for various integrated manufacturing planning problems. In Proceedings of PPSN IV, Volume I. LNCS 1141, pp. 604–613, Springer-Verlag.
[15] Mitchell, M. (1996) An Introduction to Genetic Algorithms. MIT Press.
[16] Mitchell, T.M. (1997) Machine Learning. McGraw-Hill.
[17] Rissanen, J. (1978) Modelling by the shortest data description. Automatica-J.IFAC, 14, pp. 465–471.
[18] Shannon, C.E. (1948) The mathematical theory of communications. Bell System Technical Journal 27:379–423, 623–656.
[19] Thornton, C.J. and du Boulay, B. (1992) Artificial Intelligence Through Search. Intellect, Oxford, England.
[20] Wiggerts, T. (1997) Using clustering algorithms in legacy systems remodularisation. In Proc. Working Conference on Reverse Engineering (WCRE’97).
[21] Wood, J.A. (1998) Improving Software Designs via the Minimum Description Length Principle. Ph.D. Thesis, University of Sussex (available from http://cogslib.cogs.susx.ac.uk)

Author information

Authors and Affiliations

School of Cognitive and Computing Sciences, University of Sussex, Sussex
Rudi Lutz

Editor information

Editors and Affiliations

Department of Computer Science and Information Systems, University of Limerick, Ireland
Michael O’Neill, Richard F. E. Sutcliffe, Conor Ryan, Malachy Eaton & Niall J. L. Griffith

Cite this paper

Lutz, R. (2002). Recovering High-Level Structure of Software Systems Using a Minimum Description Length Principle. In: O’Neill, M., Sutcliffe, R.F.E., Ryan, C., Eaton, M., Griffith, N.J.L. (eds) Artificial Intelligence and Cognitive Science. AICS 2002. Lecture Notes in Computer Science, vol 2464. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45750-X_8

DOI: https://doi.org/10.1007/3-540-45750-X_8
Published: 27 August 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44184-7
Online ISBN: 978-3-540-45750-3
eBook Packages: Springer Book Archive
