An object-oriented approach to language compositions for software language engineering

doi:10.1016/j.jss.2013.04.087

Journal of Systems and Software

Volume 86, Issue 9, September 2013, Pages 2451-2464

https://doi.org/10.1016/j.jss.2013.04.087 Get rights and content

Highlights

•
Language composition has not obtained enough attention, is still not well-understood, and associated terminology is confusing.
•
OO techniques and concepts are powerful enough to implement all types of language compositions.
•
Several small examples of different types of language composition are shown using easy to understand DSLs.

Abstract

In this paper, it is shown that inheritance, a core concept from object-oriented programming, is a possible solution for realizing composition of computer languages. Language composability is a property of language descriptions, which can be further classified into informal (language syntax and semantics are hard-coded in compiler/interpreter) and formal language descriptions (syntax and semantics are formally specified with one of several formal methods for language definition). However, language composition is much easier to achieve with declarative formal language descriptions into which the notion of inheritance is introduced. Multiple attribute grammar inheritance, as implemented in the language implementation system LISA, can assist in realizing all of the different types of language compositions identified in Erdweg et al. (2012). Different examples are given throughout the paper using an easy to understand domain-specific language that describes simple robot movement.

Introduction

Software Language Engineering (SLE) (Kleppe, 2008) is a young engineering discipline with the aim of establishing a systematic and rigorous approach to the development, use, and maintenance of computer languages, which comprises specification, modeling and programming languages. Although in this paper emphasis is given to grammar-based domain-specific languages (DSLs) (Hudak, 1996, van Deursen et al., 2000, Mernik et al., 2005, Fowler, 2010, Kosar et al., 2010, Mernik, 2013, Kolomvatsos et al., 2012), SLE is equally focused on metamodel-based DSLs (Gray et al., 2007, Sprinkle et al., 2009) and general-purpose languages (GPLs) (e.g., Java, Gosling et al., 1996). A special focus of SLE, which is also a topic of this paper, is that a formal description is used to design and implement a language (e.g., generating a compiler), as well as to generate various language-based tools (e.g., editor, debugger) (Henriques et al., 2005). Any language description, formal or informal, should be amenable for refinement and composition. Unfortunately, this is usually not the case, making DSLs harder to adopt to frequent changes (Mernik and Žumer, 2005). To be able to design and implement DSLs more easily, modular, extensible, and reusable language descriptions are needed. It is even possible for some of the descriptions to be inferred from DSL programs (Hrnčič et al., 2011, Hrnčič et al., 2012). A language engineer may want to include new language features incrementally as the computer language evolves. Moreover, a language engineer may like to build a computer language simply by reusing different language description modules (language components, language fragments), such as modules for expressions, declarations, as well as to reuse and extend previous language descriptions. Thus, language description composition is a high level goal that still needs much work in the area of SLE.

In the recent paper (Erdweg et al., 2012) it has been pointed out that language composition has not obtained enough attention, is still not well-understood, and associated terminology is confusing. All of these points suggest that research in this area is not yet mature. Language composability has been identified in Erdweg et al. (2012) not as a property of languages themselves, but as a property of language description (e.g., how language specifications, formal or informal, can be composed together). To enable language composition, a language description has to be reused as is; that is, any changes to a language description are not allowed, but language descriptions can be extended or additional glue code can be written. This is similar to the Open/Closed Principle in Object-Oriented Design (Meyer, 1997). The following types of language composition have been distinguished in Erdweg et al. (2012): language extension (which subsumes also language restriction), language unification, self-extension, and extension composition.

In the case of general software development, the use of object-oriented techniques and concepts (Abadi and Cardelli, 1996), such as encapsulation and inheritance, greatly improves incremental software development and reusability. Object-oriented techniques and concepts already have been integrated into language descriptions (Mernik et al., 2000, Hedin and Magnusson, 2003, Mernik and Žumer, 2005) to enable new features to be implemented. The main objective of this paper is to show that object-oriented techniques and concepts are powerful enough to implement all types of language compositions identified in Erdweg et al. (2012). For practical purposes, different types of language compositions are explained using a simple and easy to understand Robot DSL (Mernik and Žumer, 2005). In this case study, a compiler generator tool called LISA (Henriques et al., 2005, Mernik and Žumer, 2005) will be used. In addition to the Robot DSL, more complex examples can be found in Mernik and Žumer (2005) and Fister et al., 2012, Fister et al., 2013. The usefulness of object-oriented design/architecture/programming has been shown in many other engineering disciplines (e.g., Lejeune et al., 2012, Loenzo et al., 2010, Murthy et al., 2011). In this paper, the usefulness of object-oriented approach is shown as applied to SLE. As previously stated, in this paper the emphasis is on DSLs. Firstly, it is easier to show various language compositions on small DSLs rather than on GPLs, which are usually much larger than DSLs with respect to the size of syntax and semantic specifications (Črepinšek et al., 2010). Secondly, DSLs are an emerging popular area of research in the field of software engineering. For example, DSLs are one of the most important parts in Generative Programming (Czarnecki and Eisenecker, 2000), Product Lines (Clements, 2002), Software Factories (Greenfield and Short, 2004), and Model-Driven Engineering (Schmidt, 2006, Gray et al., 2007, Sprinkle et al., 2009). The common denominator of the aforementioned development methodologies is the involvement of the end-user in the development of software. DSLs provide end-users with the ability to program/model their solutions. Moreover, with the current achievements in DSL research, the vision of language-oriented software development (Ward, 1995, Dmitriev, 2004) is a step closer to realization. The idea of language-oriented software development is to first develop various DSLs and then use them together to develop an application. To achieve the vision of language-oriented software development, it is necessary to improve the ability to compose languages.

The paper is organized as follows: Section 2 briefly describes LISA – Language Implementation System using Attribute Grammars, which is used as a language description system for language composition. Various types of language compositions are explained on the simple Robot DSL, which is also introduced in this section. In Section 3, different types of language compositions are mentioned briefly and concrete examples are presented and discussed. Related work is described in Section 4, followed by the conclusions in Section 5.

Section snippets

LISA and Robot DSL

The challenge in formal language description is to support modularity and abstraction in a manner that allows incremental changes to be made as easily as possible. To achieve this goal, inheritance (Taivalsaari, 1996, Redondo and Ortin, 2013) can be very effective since it is a language mechanism that allows new definitions to be based on existing ones. A new specification can inherit the properties of its ancestors, and may introduce new properties that extend or modify the inherited

Types of language composition

The following types of language composition have been distinguished in Erdweg et al. (2012): language extension (which subsumes also language restriction), language unification, self-extension, and extension composition. In the following section, each type of language composition is described briefly and explained using a simple DSL.

Related work

A new and more suitable taxonomy for language composition has been proposed only recently in Erdweg et al. (2012), where the following types of language composition have been identified: language extension, language unification, self-extension and extension composition. Section 3 demonstrated how different language compositions can be realized using LISA (see Table 1). Moreover, it was also reported in Erdweg et al. (2012) that existing terminology is confusing. The rationale for such a claim

Conclusion

To achieve the vision of language-oriented software development (Ward, 1995, Dmitriev, 2004), there is a need for language descriptions, whether informal or formal, to be composed. Language composition is an active research area in the field of Software Language Engineering. To promote this research even further, new terminology and a taxonomy for language composition has been recently proposed in Erdweg et al. (2012). The following types of language compositions have been identified in Erdweg

Marjan Mernik received his M.Sc., and Ph.D. degrees in Computer Science from the University of Maribor in 1994 and 1998 respectively. He is currently a professor at the University of Maribor, Faculty of Electrical Engineering and Computer Science. He is also a visiting professor at the University of Alabama in Birmingham, Department of Computer and Information Sciences, and at the University of Novi Sad, Faculty of Technical Sciences. His research interests include programming languages,

References (86)

A. Bertolino et al.
Is my model right? Let me ask the expert
The Journal of Systems and Software
(2011)
M. Bravenboer et al.
Stratego/XT 0.17. A language and toolset for program transformation
Science of Computer Programming
(2008)
M. Bravenboer et al.
Preventing injection attacks with syntax embeddings
Science of Computer Programming
(2010)
J.R. Cordy et al.
TXL: a rapid prototyping system for programming language dialects
Computer Languages
(1991)
T. Dinkelaker et al.
Incremental concrete syntax for embedded languages with support for separate compilation
Science of Computer Programming
(2013)
G. Hedin et al.
JastAdd: an aspect-oriented compiler construction system
Science of Computer Programming
(2003)
D. Hrnčič et al.
A memetic grammar inference algorithm for language learning
Applied Soft Computing
(2012)
K. Kolomvatsos et al.
Debugging applications created by a domain specific language: the IPAC case
The Journal of Systems and Software
(2012)
T. Kosar et al.
A preliminary study on various implementation approaches of domain-specific language
Information and Software Technology
(2008)
A. Lejeune et al.
Object-oriented design to automate a high order non-linear solver based on asymptotic numerical method
Advances in Engineering Software
(2012)

R.A.G. Loenzo et al.

An object-oriented architecture for sensorless cutting force feedback for CNC milling process monitoring and control

Advances in Engineering Software

(2010)

M. Mernik et al.

Incremental programming language development

Computer Languages, Systems and Structures

(2005)

A.R.C. Murthy et al.

Object-oriented programming paradigm for damage tolerant evaluation of engineering structural components

Advances in Engineering Software

(2011)

J.M. Redondo et al.

Efficient support of dynamic inheritance for class- and prototype-based languages

The Journal of Systems and Software

(2013)

X. Wu et al.

Component-based LR parsing

Computer Languages, Systems and Structures

(2010)

M. Abadi et al.

A Theory of Objects

(1996)

A. Afroozeh et al.

Island grammar-based parsing using GLL and Tom

A.V. Aho et al.

Compilers: Principles, Techniques, and Tools

(2007)

M. Aksit et al.

Grammar Inheritance

(1991)

E. Avdičaušević et al.

AspectCOOL: an experiment in design and implementation of aspect-oriented language

ACM SIGPLAN Notices

(2001)

J. Bachrach et al.

Java Syntactic Extender (JSE)

ACM SIGPLAN Notices

(2001)

E. Balland et al.

Tom: piggybacking rewriting on java

C. Brabrand et al.

Growing Languages with Metamorphic Syntax Macros

ACM SIGPLAN Notices

(2002)

M.G.J. van den Brand et al.

Disambiguation Filters for Scannerless Generalized LR Parsers

M. Bravenboer et al.

Concrete syntax for objects: domain-specific language embedding and assimilation without restrictions

B.R. Bryant et al.

Challenges and directions in formalizing the semantics of modeling languages

Computer Science and Information Systems

(2011)

L. Cardelli et al.

Extensible syntax with lexical scoping

(1994)

J. Cervelle et al.

A simple implementation of grammar libraries

Computer Science and Information Systems

(2008)

S. Chiba

A metaobject protocol for C++

ACM SIGPLAN Notices

(1995)

P. Clements

L. Northrop Software Product Lines: Practices and Patterns

(2002)

K. Czarnecki et al.

Generative Programming: Methods, Tools and Applications

(2000)

M. Črepinšek et al.

On automata and language based grammar metrics

Computer Science and Information Systems

(2010)

A. van Deursen et al.

Domain-specific languages: an annotated bibliography

ACM SIGPLAN Notices

(2000)

S. Dmitriev

Language Oriented Programming: The Next Programming Paradigm

(2004)

M. Emerson et al.

Techniques for metamodel composition

S. Erdweg et al.

SugarJ: library-based syntactic language extensibility.

S. Erdweg et al.

Language composition untangled

I. Fister et al.

Implementation of EasyTime formal semantics using a LISA compiler generator

Computer Science and Information Systems

(2012)

I. Fister et al.

EasyTime++: a case study on incremental domain-specific language development

Information Technology and Control

(2013)

M. Fowler

Domain Specific Languages

(2010)

J. Gosling et al.

The Java language specification

(1996)

J. Gray et al.

Domain-specific modeling

J. Greenfield et al.

Software Factories: Assembling Applications with Patterns, Models, Frameworks, and Tools

(2004)

Cited by (49)

Composition operators for modeling languages: A literature review
2023, Journal of Computer Languages
Efficiently engineering modeling languages demands their reuse through composition. Research in language engineering has produced many different operators to reuse and compose languages and language parts. Unfortunately, these operate on different dimensions of languages, produce diverse results, and are distributed across various technological spaces and publications, which hampers understanding the state of language composition for researchers and practitioners. To mitigate this, we report the results of a literature review on modeling language composition operators. In this review, we identify operators, their properties, and supported language dimensions, and relate them to categories of language composition. Through this, our survey draws a new, detailed map of modeling language composition operators that can guide researchers in software language engineering in identifying uncharted territory and practitioners in employing the most suitable composition operators.
On the granularity of linguistic reuse
2023, Journal of Systems and Software
Programming languages are complex software systems integrated across an ecosystem of different applications such as language compilers or interpreters but also an integrated development environment comprehensive of syntax highlighting, code completion, error recovery, and a debugger. The complexity of language ecosystems can be faced using language workbenches—i.e., tools that tackle the development of programming languages, domain specific languages and their ecosystems in a modular way.
As with any other software system, one of the priorities that developers struggle to achieve when developing programming languages is reusability. After all, the capacity to easily reuse and adapt existing components to new scenarios can dramatically improve development times. Therefore, as programming languages offer features to reuse existing code, language workbenches should offer tools to reuse existing language assets. However, reusability can be achieved in many different ways.
In this work, we identify six forms of linguistic reusability, ordered by level of granularity: (i) sub-languages composition, (ii) language features composition, (iii) syntax and semantics assets composition, (iv) semantic assets composition, (v) actions composition, and. (vi) action extension. We use these mechanisms to extend the taxonomy of language composition proposed by Erdweg et al. To show a concrete application of this taxonomy, we evaluate the capabilities provided by the Neverlang language workbench with regards to our taxonomy and extend it by adding explicit support for any granularity level that was originally not supported. This is done by instantiating two levels of reusability as actual operators—desugaring, and delegation. We evaluate these operators against the clone-and-own approach, which was the only form of reuse at that level of granularity prior to the introduction of explicit operators. We show that with the clone-and-own approach the design quality of the source code is negatively affected. We conclude that language workbenches can benefit from the introduction of mechanisms to explicitly support reuse at all granularity levels.
Automatic compiler/interpreter generation from programs for Domain-Specific Languages: Code bloat problem and performance improvement
2022, Journal of Computer Languages
Using advanced AI approaches, the development of Domain-Specific Languages (DSLs) can be facilitated for domain experts who are not proficient in programming language development. In this paper, we first addressed the aforementioned problem using Semantic Inference. However, this approach is very time-consuming. Namely, a lot of code bloat is present in the generated language specifications, which increases the time required to evaluate a solution. To improve this, we introduced a multi-threaded approach, which accelerates the evaluation process by over 9.5 times, while the number of fitness evaluations using the improved Long Term Memory Assistance (LTMA) was reduced by up to 7.3%. Finally, a reduction in the number of input samples (fitness cases) was proposed, which reduces CPU consumption further.
Inferring Absolutely Non-Circular Attribute Grammars with a Memetic Algorithm
2021, Applied Soft Computing
When valid syntactical structures are additionally constrained with context-sensitive information the Grammar Inference needs to be extended to the Semantic Inference. In this paper, it is shown that a complete compiler/interpreter for small Domain-Specific Languages (DSLs) can be generated automatically solely from given programs and their associated meanings using Semantic Inference. In this work a wider class of Attribute Grammars has been learned, while only S-attributed and L-attributed Grammars have previously been inferred successfully. Inferring Absolutely Non-Circular Attribute Grammars (ANC-AG) with complex dependencies among attributes has been achieved by integrating a Memetic Algorithm (MA) into the $L I S A . S I$ tool. The results show that the proposed Memetic Algorithm is at least four times faster on the selected benchmark than the previous method.
Systematic composition of independent language features
2019, Journal of Systems and Software
Systematic reuse is crucial to efficiently engineer and deploy software languages to software experts and domain experts alike. But “software languages are software too”, and hence their engineering, customization, and reuse are subject to similar challenges. To this effect, we propose an approach for composing independent, grammar-based language syntax modules in a structured way that realizes a separation of concerns among the participants in the life cycle of the languages. We present a refined concept of systematic and controlled syntactic variability of extensible software language product lines through identification of syntax variation points and derivation of variants from independently developed features. This facilitates reuse of software languages and reduces the efforts of engineering and customizing languages for specific domains. We realized our concept with the MontiCore language workbench and assessed it through a case study on architecture description languages. Ultimately, systematic and controlled software language reuse reduces the effort of software language engineering and fosters the applicability of software languages.
Software language engineering in the large: towards composing and deriving languages
2018, Computer Languages, Systems and Structures
Suitable software languages are crucial to tackling the ever-increasing complexity of software engineering processes and software products. They model, specify, and test products, describe processes and interactions with services and serve many other purposes. Meanwhile, engineering suitable modeling languages with useful tooling also has become a challenging endeavor - and far too often, new languages are developed from scratch. We shed light on the advances of modeling language engineering that facilitate reuse, modularity, compositionality, and derivation of new languages based on language components. To this end, we discuss ways to design, combine, and derive modeling languages in all their relevant aspects. We illustrate the application of advanced language engineering throughout the paper, which culminates in the example of deriving complete domain-specific transformations language from existing language components.

View all citing articles on Scopus

View full text

An object-oriented approach to language compositions for software language engineering

Highlights

Abstract

Introduction

Section snippets

LISA and Robot DSL

Types of language composition

Related work

Conclusion

The Journal of Systems and Software

Science of Computer Programming

Science of Computer Programming

Computer Languages

Science of Computer Programming

Science of Computer Programming

Applied Soft Computing

The Journal of Systems and Software

Information and Software Technology

Advances in Engineering Software

Advances in Engineering Software

Computer Languages, Systems and Structures

Advances in Engineering Software

The Journal of Systems and Software

Computer Languages, Systems and Structures

A Theory of Objects

Island grammar-based parsing using GLL and Tom

Compilers: Principles, Techniques, and Tools

Grammar Inheritance

AspectCOOL: an experiment in design and implementation of aspect-oriented language

ACM SIGPLAN Notices

Java Syntactic Extender (JSE)

ACM SIGPLAN Notices

Tom: piggybacking rewriting on java

Growing Languages with Metamorphic Syntax Macros

ACM SIGPLAN Notices

Disambiguation Filters for Scannerless Generalized LR Parsers

Concrete syntax for objects: domain-specific language embedding and assimilation without restrictions

Challenges and directions in formalizing the semantics of modeling languages

Computer Science and Information Systems

Extensible syntax with lexical scoping

A simple implementation of grammar libraries

Computer Science and Information Systems

A metaobject protocol for C++

ACM SIGPLAN Notices

L. Northrop Software Product Lines: Practices and Patterns

Generative Programming: Methods, Tools and Applications

On automata and language based grammar metrics

Computer Science and Information Systems

Domain-specific languages: an annotated bibliography

ACM SIGPLAN Notices

Language Oriented Programming: The Next Programming Paradigm

Techniques for metamodel composition

SugarJ: library-based syntactic language extensibility.

Language composition untangled

Implementation of EasyTime formal semantics using a LISA compiler generator

Computer Science and Information Systems

EasyTime++: a case study on incremental domain-specific language development

Information Technology and Control

Domain Specific Languages

The Java language specification

Domain-specific modeling

Software Factories: Assembling Applications with Patterns, Models, Frameworks, and Tools