Reference Hub1
Text Mining in Program Code

Text Mining in Program Code

Alexander Dreweke, Ingrid Fischer, Tobias Werth, Marc Wörlein
Copyright: © 2009 |Pages: 20
ISBN13: 9781599049908|ISBN10: 1599049902|EISBN13: 9781599049915
DOI: 10.4018/978-1-59904-990-8.ch035
Cite Chapter Cite Chapter

MLA

Dreweke, Alexander, et al. "Text Mining in Program Code." Handbook of Research on Text and Web Mining Technologies, edited by Min Song and Yi-Fang Brook Wu, IGI Global, 2009, pp. 626-645. https://doi.org/10.4018/978-1-59904-990-8.ch035

APA

Dreweke, A., Fischer, I., Werth, T., & Wörlein, M. (2009). Text Mining in Program Code. In M. Song & Y. Brook Wu (Eds.), Handbook of Research on Text and Web Mining Technologies (pp. 626-645). IGI Global. https://doi.org/10.4018/978-1-59904-990-8.ch035

Chicago

Dreweke, Alexander, et al. "Text Mining in Program Code." In Handbook of Research on Text and Web Mining Technologies, edited by Min Song and Yi-Fang Brook Wu, 626-645. Hershey, PA: IGI Global, 2009. https://doi.org/10.4018/978-1-59904-990-8.ch035

Export Reference

Mendeley
Favorite

Abstract

Searching for frequent pieces in a database with some sort of text is a well-known problem. A special sort of text is program code as e.g. C++ or machine code for embedded systems. Filtering out duplicates in large software projects leads to more understandable programs and helps avoiding mistakes when reengineering the program. On embedded systems the size of the machine code is an important issue. To ensure small programs, duplicates must be avoided. Several different approaches for finding code duplicates based on the text representation of the code or on graphs representing the data and control flow of the program and graph mining algorithms.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.