Skip to main content

NoSQL Scalability Performance Evaluation over Cassandra

  • Conference paper
  • First Online:
Information Systems and Technologies to Support Learning (EMENA-ISTL 2018)

Abstract

The implementation of Smart-Cities is growing all over the world. From big cities to small villages, information able to provide a better and efficient urban management is collected from multiple sources (sensors). Such information has to be stored, queried, analyzed and displayed, aiming to contribute to a better quality of life for citizens and also a more sustainable environment. In this context it is important to choose the right database engine for this scenario. NoSQL databases are now generally accepted by the database community to support application niches. They are known for their scalability, simplicity, and key-indexed data storage, thus, allowing an easy data distribution and balancing over several nodes.

In this paper a NoSQL engine is tested, Cassandra, which is one of the most scalable, amongst most NoSQL engines and therefore, a candidate for use in our application scenario. The paper focuses on horizontal scalability, which means that, by adding more nodes, it is possible to respond to more requests with the same or better performance, i.e., more nodes mean reduced execution time. Although, adding more computational resources, does not always result in better performance. This work assesses how each workload (e.g., data volume, simultaneous users) influence scalability performance. An overview of the Cassandra database engine is presented in the paper. Following, it will be tested and evaluated using the benchmark Yahoo Cloud Serving Benchmark (YCSB).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Beernaert, L., Gomes, P., Matos, M., Vilaça, R., Oliveira, R.: Evaluating Cassandra as a manager of large file sets. In: Proceedings of the 3rd International Workshop on Cloud Data and Platforms, pp. 25–30. ACM (2013)

    Google Scholar 

  2. Carpenter, J., Hewitt, E.: Cassandra: The Definitive Guide: Distributed Data at Web Scale. O’Reilly Media, Inc. (2016)

    Google Scholar 

  3. Cassandra, A.: The apache software foundation. The Apache Cassandra project (2013)

    Google Scholar 

  4. Cooper, B.F., Silberstein, A., Tam, E., Ramakrishnan, R., Sears, R.: Benchmarking cloud serving systems with YCSB. In: Proceedings of the 1st ACM Symposium on Cloud Computing, pp. 143–154. ACM (2010)

    Google Scholar 

  5. Dean, J., Ghemawat, S.: MapReduce: a flexible data processing tool. Commun. ACM 53(1), 72–77 (2010)

    Article  Google Scholar 

  6. Dede, E., Sendir, B., Kuzlu, P., Hartog, J., Govindaraju, M.: An evaluation of Cassandra for Hadoop. In: 2013 IEEE Sixth International Conference on Cloud Computing (CLOUD), pp. 494–501. IEEE (2013)

    Google Scholar 

  7. Feng, C., Zou, Y., Xu, Z.: Ccindex for Cassandra: a novel scheme for multi-dimensional range queries in Cassandra. In: 2011 Seventh International Conference on Semantics Knowledge and Grid (SKG), pp. 130–136. IEEE (2011)

    Google Scholar 

  8. Fukuda, S., Kawashima, R., Saito, S., Matsuo, H.: Improving response time for Cassandra with query scheduling. In: 2013 First International Symposium on Computing and Networking (CANDAR), pp. 128–133. IEEE (2013)

    Google Scholar 

  9. Garefalakis, P., Papadopoulos, P., Manousakis, I., Magoutis, K.: Strengthening consistency in the Cassandra distributed key-value store. In: IFIP International Conference on Distributed Applications and Interoperable Systems, pp. 193–198. Springer (2013)

    Google Scholar 

  10. Konstantinou, I., Angelou, E., Boumpouka, C., Tsoumakos, D., Koziris, N.: On the elasticity of NoSQL databases over cloud management platforms. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management, pp. 2385–2388. ACM (2011)

    Google Scholar 

  11. Pirzadeh, P., Tatemura, J., Po, O., Hacıgümüş, H.: Performance evaluation of range queries in key value stores. J. Grid Comput. 10(1), 109–132 (2012)

    Article  Google Scholar 

  12. Talia, D.: Clouds for scalable big data analytics. Computer 46(5), 98–101 (2013)

    Article  Google Scholar 

  13. Welsh, M., Culler, D., Brewer, E.: SEDA: an architecture for well-conditioned, scalable internet services. In: ACM SIGOPS Operating Systems Review, vol. 35, pp. 230–243. ACM (2001)

    Google Scholar 

Download references

Acknowledgements

“This article is a result of the CityAction project CENTRO-01-0247-FEDER-017711, supported by Centro Portugal Regional Operational Program (CENTRO 2020), under the Portugal 2020 Partnership Agreement, through the European Regional Development Fund (ERDF), and also financed by national funds through FCT - Fundação para a Ciência e Tecnologia, I.P., under the project UID/Multi/04016/2016. Furthermore, we would like to thank the Instituto Politécnico de Viseu for their support.”

Special thanks to Maryam Abbasi and Pedro Martins for their persistence and availability for making this paper possible.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Maryam Abbasi , Filipe Sá , Daniel Albuquerque , Cristina Wanzeller , Filipe Caldeira , Paulo Tomé , Pedro Furtado or Pedro Martins .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Abbasi, M. et al. (2019). NoSQL Scalability Performance Evaluation over Cassandra. In: Rocha, Á., Serrhini, M. (eds) Information Systems and Technologies to Support Learning. EMENA-ISTL 2018. Smart Innovation, Systems and Technologies, vol 111. Springer, Cham. https://doi.org/10.1007/978-3-030-03577-8_56

Download citation

Publish with us

Policies and ethics