Abstract
Smart city deployments are growing all over the world. From big cities to small villages, information that supports better and more efficient urban management is collected from multiple sources (sensors). This information has to be stored, queried, analyzed, and displayed, with the aim of improving citizens' quality of life and making the environment more sustainable. In this context, it is important to choose the right database engine. NoSQL databases are now generally accepted by the database community as a way to support application niches. They are known for their scalability, simplicity, and key-indexed data storage, which allow easy data distribution and balancing over several nodes.
This paper tests Cassandra, one of the most scalable NoSQL engines and therefore a candidate for use in our application scenario. The paper focuses on horizontal scalability, which means that adding more nodes makes it possible to respond to more requests with the same or better performance, i.e., more nodes mean reduced execution time. However, adding more computational resources does not always result in better performance. This work assesses how the workload (e.g., data volume, simultaneous users) influences scalability performance. An overview of the Cassandra database engine is presented, and the engine is then tested and evaluated using the Yahoo Cloud Serving Benchmark (YCSB).
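The notion of horizontal scalability described above can be made concrete with a small calculation. The following is a minimal sketch, not the paper's evaluation method: it assumes execution times have already been measured for the same workload at several cluster sizes, and it derives speedup and scaling efficiency relative to the smallest cluster. The function name `scaling_report` and the sample timings are hypothetical and chosen only for illustration.

```python
from typing import Dict


def scaling_report(exec_time_s: Dict[int, float]) -> Dict[int, Dict[str, float]]:
    """Compute speedup and efficiency per cluster size.

    exec_time_s maps node count -> execution time (seconds) for one workload.
    Values are reported relative to the smallest cluster size measured.
    """
    base_nodes = min(exec_time_s)
    base_time = exec_time_s[base_nodes]
    report = {}
    for nodes in sorted(exec_time_s):
        speedup = base_time / exec_time_s[nodes]  # > 1 means faster than baseline
        ideal = nodes / base_nodes                # speedup under perfect linear scaling
        report[nodes] = {"speedup": speedup, "efficiency": speedup / ideal}
    return report


# Hypothetical timings (assumption, not data from the paper).
sample = {1: 120.0, 2: 70.0, 4: 50.0}
for nodes, row in scaling_report(sample).items():
    print(f"{nodes} node(s): speedup={row['speedup']:.2f}, "
          f"efficiency={row['efficiency']:.2f}")
```

An efficiency close to 1 indicates near-linear scaling, while a value that drops as nodes are added reflects the case mentioned above in which extra resources do not translate into proportionally better performance.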
Acknowledgements
This article is a result of the CityAction project CENTRO-01-0247-FEDER-017711, supported by Centro Portugal Regional Operational Program (CENTRO 2020), under the Portugal 2020 Partnership Agreement, through the European Regional Development Fund (ERDF), and also financed by national funds through FCT - Fundação para a Ciência e Tecnologia, I.P., under the project UID/Multi/04016/2016. Furthermore, we would like to thank the Instituto Politécnico de Viseu for their support.
Special thanks to Maryam Abbasi and Pedro Martins for their persistence and availability in making this paper possible.
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Abbasi, M. et al. (2019). NoSQL Scalability Performance Evaluation over Cassandra. In: Rocha, Á., Serrhini, M. (eds) Information Systems and Technologies to Support Learning. EMENA-ISTL 2018. Smart Innovation, Systems and Technologies, vol 111. Springer, Cham. https://doi.org/10.1007/978-3-030-03577-8_56
DOI: https://doi.org/10.1007/978-3-030-03577-8_56
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-03576-1
Online ISBN: 978-3-030-03577-8
eBook Packages: Intelligent Technologies and Robotics (R0)