Big Data is about extracting valuable information from data in order to use it in intelligent ways such as to revolutionize decision-making in businesses, science and society. Big Data may open up radical new ways and unprecedented opportunities of attacking software engineering problems. Already now forums, forges, blogs, Q&A sites, and social networks, provide a wealth of data that may be analysed to uncover new requirements, provide evidence on usage and development trends of application frameworks, or to perform empirical studies involving real-world software developers. In addition, real-time data collected from mobile and cloud applications may be analysed to detect user trends, preferences, and optimization opportunities.
BIGDSE 2016 features contributions and discussions that explore opportunities that Big Data technology offers to software engineering, both in research and practice ("big data for software engineering"). BIGDSE also looks at the software engineering challenges imposed by building Big Data software systems ("software engineering for big data").
Proceeding Downloads
Decisions as a service for application centric real time analytics
The need for application-level intelligence cannot be easily satisfied with existing architectures or methodologies that separate methods and tools for application developers and data scientists. We aim, therefore, to develop a framework (an ...
Streaming software analytics
In this paper we present a novel software analytics infrastructure supporting for a combination of three requirements to serve software practitioners in utilising data-driven decision making: (1) Real-time insight: streaming software analytics unify ...
DataLab: a version data management and analytics system
One challenge in big data analytics is the lack of tools to manage the complex interactions among code, data and parameters, especially in the common situation where all these factors can change a lot. We present our preliminary experience with DataLab, ...
Exploring a framework for identity and attribute linking across heterogeneous data systems
Online-activity-generated digital traces provide opportunities for novel services and unique insights as demonstrated in, for example, research on mining software repositories. The inability to link these traces within and among systems, such as Twitter,...
Providing big data applications with fault-tolerant data migration across heterogeneous NoSQL databases
The recent growing interest on highly-available data-intensive applications sparked the need for flexible and portable storage technologies, e.g., NoSQL databases. Unfortunately, the lack of standard interfaces and architectures for NoSQLs makes it ...
Data model evolution using object-NoSQL mappers: folklore or state-of-the-art?
In big data software engineering, the schema flexibility of NoSQL document stores is a major selling point: When the document store itself does not actively manage a schema, the data model is maintained within the application. Just like object-...
Towards a model-driven design tool for big data architectures
Big Data technologies are rapidly becoming a key enabler for modern industries. However, the entry costs inherent to "going Big" are considerable, ranging from learning curve, renting/buying infrastructure, etc. A key component of these costs is the ...
Toward big data value engineering for innovation
This article articulates the requirements for an effective big data value engineering method. It then presents a value discovery method, called Eco-ARCH (Eco-ARCHitecture), tightly integrated with the BDD (Big Data Design) method for addressing these ...
A reference architecture for big data systems in the national security domain
Acquirers, system builders, and other stakeholders of big data systems need to define requirements, develop and evaluate solutions, and integrate systems together. A reference architecture enables these software engineering activities by standardizing ...
A big data framework for cloud monitoring
Elasticity is a key component of modern cloud environments and monitoring is an essential part of this process. Monitoring demonstrates several challenges including gathering metrics from a variety of layers (infrastructure, platform, application), the ...
The "BigSE" project: lessons learned from validating industrial text mining
As businesses become increasingly reliant on big data analytics, it becomes increasingly important to test the choices made within the data miners. This paper reports lessons learned from the BigSE Lab, an industrial/university collaboration that ...
Predicting and fixing vulnerabilities before they occur: a big data approach
The number and variety of cyber-attacks is rapidly increasing, and the rate of new software vulnerabilities is also rising dramatically. The cybersecurity community typically reacts to attacks after they occur. Being reactive is costly and can be fatal, ...
Understanding quality requirements in the context of big data systems
While the domain of big data is anticipated to affect many aspects of human endeavour, there are numerous challenges in building big data applications among which is how to address big data characteristics in quality requirements. In this paper, we ...
Recommendations
1st international workshop on big data software engineering (BIGDSE 2015)
ICSE '15: Proceedings of the 37th International Conference on Software Engineering - Volume 2Big Data is about extracting valuable information from data in order to use it in intelligent ways such as to revolutionize decision-making in businesses, science and society. BIGDSE 2015 discusses the link between Big Data and software engineering and ...