Skip to main content

Design of Distributed Parallel Computing Using by MapReduce/MPI Technology

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7979))

Abstract

This paper describes a constructive approach of distributed parallel computing using by hybrid union of MAPREDUCE and MPI technologies for solving oil extracting problems. We extend a common architecture of MAPREDUCE model by organizing decomposition of computational domain at different stages of MAPREDUCE process. We describes Model Driven Architecture (MDA) models for developing formal views of high-performance computing technologies using MAPREDUCE. We made computing experiments and show on specific HPC infrastructure. All implementations of programs is realize on Java platform. This approach will possible one of the ways to do cloud computing on high performance heterogeneous systems.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Gelenbe, E., Lichnewsky, A., Staphylopatis, A.: Experience with the parallel solution of partial-differential equations on a distributed computing system. IEEE Transactions on Computers 31(12), 1157–1164 (1982)

    Article  MATH  Google Scholar 

  2. Gropp, W., Lusk, E., Doss, N., et al.: A high-performance, portable implementation of the MPI message passing interface standard. Parallel Computing 22(6), 789–828 (1996)

    Article  MATH  Google Scholar 

  3. Sunderam, V.S., Geist, G.A., Dongarra, J., et al.: The PVM concurrent computing system - evolution, experiences, and trends. Parallel Computing 20(4), 531–545 (1994)

    Article  MATH  Google Scholar 

  4. Malyshkin, V.: Assembling of Parallel Programs for Large Scale Numerical Modeling, p. 1021. IGI Global, Chicago (2010)

    Google Scholar 

  5. Becker, J.C., Dagum, L.: Particle simulation on heterogeneous distributed supercomputers. Concurrency-Practice and Experience 5(4), 367–377 (1993)

    Article  Google Scholar 

  6. Fougère, D., Gorodnichev, M., Malyshkin, N., Malyshkin, V., Merkulov, A., Roux, B.: NumGrid middleware: MPI support for computational grids. In: Malyshkin, V.E. (ed.) PaCT 2005. LNCS, vol. 3606, pp. 313–320. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  7. Diaz, J., Munoz-Caro, C., Nino, A.A.: Survey of Parallel Programming Models and Tools in the Multi and Many-Core Era. IEEE Transactions on Parallel and Distributed Systems 23(8), 1369–1386 (2012)

    Article  Google Scholar 

  8. Wang, J., Liu, Z.: Parallel Data Mining Optimal Algorithm of Virtual Cluster. In: International Conference on Fuzzy Systems and Knowledge, vol. 5, pp. 358–362 (2008)

    Google Scholar 

  9. Pandey, S., Buyya, R.: Scheduling Workflow Applications Based on Multi-source Parallel Data Retrieval in Distributed Computing Networks. Computer J. 55(11), 1288–1308 (2012)

    Article  Google Scholar 

  10. Liu, H., Orban, D.: GridBatch: Cloud Computing for Large-Scale Data-Intensive Batch Applications. In: CCGRID 2008, vol. 1, pp. 295–305 (2008)

    Google Scholar 

  11. Valilai, O.F., Houshmand, M.: A collaborative and integrated platform to support distributed manufacturing system using a service-oriented approach based on cloud computing paradigm. Robotics and Computer-Integrated Manufacturing 29(1), 110–127 (2013)

    Article  Google Scholar 

  12. Dean, J., Ghemawat, S.: Mapreduce: Simplified data processing on large clusters. Communications of the ACM 51(1), 107–113 (2008)

    Article  Google Scholar 

  13. Fagg, G.E., Dongarra, J.: FT-MPI: Fault Tolerant MPI, Supporting Dynamic Applications in a Dynamic World. In: Dongarra, J., Kacsuk, P., Podhorszki, N. (eds.) PVM/MPI 2000. LNCS, vol. 1908, pp. 346–353. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  14. He, Q., Wang, Q., Zhuang, F., Tan, Q., Shi, Z.: Parallel CLARANS Clustering Based on MapReduce. Energy Procedia 13, 3269–3279 (2011)

    Article  Google Scholar 

  15. Cohen, J.: Graph twiddling in a MapReduce world. Computing in Science and Engineering 11, 29–41 (2009)

    Article  Google Scholar 

  16. Baker, M.: MpiJava: A Java interface to MPI. University of Portsmouth, Portsmouth (2010)

    Google Scholar 

  17. Aziz, H., Sattar, A.: Mathematical modeling of reservoir systems. Nedra, Moscow (1982)

    Google Scholar 

  18. Demmel, J., Veselic, K.: Jacobis method is more accurate than QR. SIAM Journal on Matrix Analysis and Applications 13(4), 1204–1245 (1992)

    Article  MathSciNet  MATH  Google Scholar 

  19. Lam, C.: Hadoop In Action. Manning Publications Co., Stamford (2010)

    Google Scholar 

  20. Apache Software Foundation, HDFS (2011), http://hadoop.apache.org/common/docs/current/hdfs_design.html

  21. Srirama, S.N., Jakovits, P., Vainikko, E.: Adapting scientific computing problems to clouds using MapReduce. Future Generation Computer Systems 28(2), 184–192 (2012)

    Article  Google Scholar 

  22. Frankel, D.: Model Driven Architecture. Applying MDA to Enterprise Computing. Wiley Publishing, Indiana (2003)

    Google Scholar 

  23. Lugato, D.: Model-driven engineering for high-performance computing applications. In: The 19th IASTED International Conference on Modelling and Simulation, pp. 18–33. IEEE Press, New York (2008)

    Google Scholar 

  24. Klusik, U., Loogen, R., Priebe, S., Rubio, F.: Implementation skeletons in Eden: Low-effort parallel programming. In: Mohnen, M., Koopman, P. (eds.) IFL 2000. LNCS, vol. 2011, pp. 71–88. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  25. Srirama, S.N., Batrashev, O., Jakovits, P., et al.: Scalability of parallel scientific applications on the cloud. Scientific Programming 19(2-3), 91–105 (2011)

    Google Scholar 

  26. Plimpton, S., Devine, K.: MapReduce in MPI for Large-scale graph algorithms. Parallel Computing 37(9), 610–632 (2011)

    Article  Google Scholar 

  27. Ekanayake, J., Li, H., Zhang, B., Gunarathne, T., Bae, S.-H., Qiu, J., Fox, G.: Twister: a runtime for iterative MapReduce. In: HPDC 2010, pp. 810–818. ACM, New York (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Akhmed-Zaki, D., Danaev, N., Matkerim, B., Bektemessov, A. (2013). Design of Distributed Parallel Computing Using by MapReduce/MPI Technology. In: Malyshkin, V. (eds) Parallel Computing Technologies. PaCT 2013. Lecture Notes in Computer Science, vol 7979. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39958-9_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-39958-9_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-39957-2

  • Online ISBN: 978-3-642-39958-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics