Skip to main content

Applying MapReduce Framework to Peer-to-Peer Computing Applications

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7654))

Abstract

MapReduce is a programming framework for processing large amount of data in distribution. MapReduce implementations, such as Hadoop MapReduce, basically operate on dedicated clusters of workstations to achieve high performance. However, the dedicated clusters can be unrealistic for users who infrequently have a demand of solving large distributed problems. This paper presents an approach of applying the MapReduce framework on peer-to-peer (P2P) networks for distributed applications. This approach aims at exploiting leisure resources including storage, bandwidth and processing power on peers to perform MapReduce operations. The paper also introduces a prototyping implementation of a MapReduce P2P system, where the main functions of peers contain contributing computing resources, forming computing groups and executing the MapReduce operations. The performance evaluation of the system has been compared with the Hadoop cluster using the prevailing word count problem.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Dean, J., Ghemawat, S.: MapReduce: Simplified Data Processing on Large Clusters. Commun. ACM 51, 107–113 (2008)

    Article  Google Scholar 

  2. Cohen, B.: Incentives Build Robustness in BitTorrent. In: Proc. 1st Workshop on Economics of Peer-to-Peer Systems (2003)

    Google Scholar 

  3. Heckmann, O., Bock, A., Mauthe, A., Steinmetz, R.: The eDonkey File-Sharing Network. In: Proc. GI Jahrestagung (2), pp. 224–228 (2004)

    Google Scholar 

  4. Berkovsky, S., Kuflik, T., Ricci, F.: P2P Case Retrieval with an Unspecified Ontology. In: Muñoz-Ávila, H., Ricci, F. (eds.) ICCBR 2005. LNCS (LNAI), vol. 3620, pp. 91–105. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  5. Faroo, http://www.faroo.com/ (last access in January 2011)

  6. Yacy, http://www.yacy.de/ (last access in January 2011)

  7. Schlosser, M., Sintek, M., Decker, S., Nejdl, W.: A Scalable and Ontology-Based P2P Infrastructure for Semantic Web Services. In: Proc. 2nd International Conference on Peer-to-Peer Computing, P2P 2002, p. 104. IEEE Computer Society, Washington, DC (2002)

    Chapter  Google Scholar 

  8. Tatarinov, I., Ives, Z., Madhavan, J., Halevy, A., Suciu, D., Dalvi, N., Dong, X., Kadiyska, Y., Miklau, G., Mork, P.: The Piazza Peer Data Management Project. SIGMOD Rec. 32(3), 47–52 (2003)

    Article  Google Scholar 

  9. Tran, H.M., Schönwälder, J.: Heuristic Search using a Feedback Scheme in Unstructured Peer-to-Peer Networks. In: Proc. 5th International Workshop on Databases, Information Systems and Peer-to-Peer Computing. Springer (2007)

    Google Scholar 

  10. Ratnasamy, S., Francis, P., Handley, M., Karp, R., Schenker, S.: A Scalable Content Addressable Network. In: Proc. Conference on Applications, Technologies, Architectures, and Protocols for Computer Communications, SIGCOMM 2001, pp. 161–172. ACM Press, New York (2001)

    Chapter  Google Scholar 

  11. Stoica, I., Morris, R., Karger, D., Kaashoek, M.F., Balakrishnan, H.: Chord: A Scalable Peer-to-Peer Lookup Service for Internet Applications. In: Proc. Conference on Applications, Technologies, Architectures, and Protocols for Computer Communications, SIGCOMM 2001, pp. 149–160. ACM Press, New York (2001)

    Chapter  Google Scholar 

  12. Maymounkov, P., Mazières, D.: Kademlia: A Peer-to-Peer Information System Based on the XOR Metric. In: Druschel, P., Kaashoek, M.F., Rowstron, A. (eds.) IPTPS 2002. LNCS, vol. 2429, pp. 53–65. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  13. Gnutella Protocol Specification version 0.4 (2001), http://rfc-gnutella.sourceforge.net/developer/stable/index.html (last access in March 2012)

  14. Clarke, I., Sandberg, O., Wiley, B., Hong, T.W.: Freenet: A Distributed Anonymous Information Storage and Retrieval System. In: Federrath, H. (ed.) Anonymity 2000. LNCS, vol. 2009, pp. 46–66. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  15. Yang, B., Garcia-Molina, H.: Designing a Super-Peer Network. In: Proc. 19th International Conference on Data Engineering, ICDE 2003, p. 49. IEEE Computer Society, Los Alamitos (2003)

    Google Scholar 

  16. Marozzo, F., Talia, D., Trunfio, P.: A Framework for Managing MapReduce Applications in Dynamic Distributed Environments. In: Proc. 19th International Euromicro Conference on Parallel, Distributed and Network-Based Processing, pp. 149–158. IEEE Computer Society, Los Alamitos (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Dang, H.T., Tran, H.M., Vu, P.N., Nguyen, A.T. (2012). Applying MapReduce Framework to Peer-to-Peer Computing Applications. In: Nguyen, NT., Hoang, K., Jȩdrzejowicz, P. (eds) Computational Collective Intelligence. Technologies and Applications. ICCCI 2012. Lecture Notes in Computer Science(), vol 7654. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34707-8_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-34707-8_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-34706-1

  • Online ISBN: 978-3-642-34707-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics