Skip to main content

Information Harvest from Social Network Data (Facebook 100 million URLS)

  • Chapter
  • First Online:
IAENG Transactions on Engineering Technologies

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 247))

  • 1622 Accesses

Abstract

Online social networks serves as an arena for its members to get in touch with each other, mutually share their information, ideas among themselves. In online social networks the members usually proclaim a profile, which consists of work and education, arts and entertainment and some basic information like gender, e-mail, etc., Such profile facilitates in spotting people, know about their interest, and interact with them in need. The intention of this research is to devise an algorithm to extract information such as name, email address, gender and interest of facebook users from a URL and to predict the gender if unspecified. The Dataset used in this work is a list of 100 million Facebook URLs. This research work paves a way to identify the email communities in Facebook. The outcome of this research reveals the fact that most of the email domains of the facebook user’s fall into yahoo, hotmail, Gmail and msn. The other domains are with least number of users. The users with Yahoo id are higher when compared to other email domains. It also discloses that majority of the interest of facebook members is towards sports. It is followed by music, technology, travelling, God and Temple run, PC gaming.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Facebook statistics [Online] Available: http://www.facebook.com/press/info.php?statistics

  2. Facebook 100 million user profile [Online] Available: http://www.skullsecurity.org

  3. Mislove A, Viswanath B, Gummadi KP, Druschel P (2010) You are who you know: inferring user profiles in online social networks. In: Proceedings of WSDM, 2010

    Google Scholar 

  4. Chun H, Kwak H, Eom Y-H, Ahn Y–Y, Moon S, Jeong H (2008) Online social networks: sheer volume vs social interaction. In: Proceedings of IMC, 2008

    Google Scholar 

  5. Polakis I, Kontaxis G, Markatos E (2010) Using social networks to harvest e-mail addresses. In: Proceedings of WPES’2010

    Google Scholar 

  6. Gatterbauer W, Bohunsky P, Herzog M, Krupl B, Pollak B (2007) Towards domain independent information extraction from web tables. In: Proceeding of the international world wide web conference committee (IW3C2), May 8–12 2007, ACM, Banff, Alberta, Canada, pp 71–80

    Google Scholar 

  7. RAG Gultom, RF Sari, B Budiardjo (2011) Proposing the new algorithm and technique development for integration web table extraction and building a Mashup. J Comput Sci 7(2):129–142, ISSN 1549–3636

    Google Scholar 

  8. Zhai Y, Liu B (2005) Web data extraction based on partial tree alignment. In: Proceedings of WWW 2005, May 10–14 2005, Chiba, Japan. ACM 1-59593-046-9/05/0005

    Google Scholar 

  9. Zheleva E, Getoor L (2009) To join or not to join: the illusion of privacy in social networks with mixed public and private user profiles. In: Proceedings of WWW 2009

    Google Scholar 

  10. He J, Chu WW, Liu Z (2006) Inferring privacy information from social networks. In: Proceedings of ISI, pp 154–165

    Google Scholar 

  11. Heatherly R, Kantarcioglu M,Thuraisingha B, Lindamood J (2009) Reventing private information inference attacks on social networks. Technical report UTDCS-03-09, University of Texas at Dallas

    Google Scholar 

  12. Tang C, Ross K, Saxena N, Chen R (2011) What’s in a name: a study of names, gender inference, and gender behavior in facebook. DASFAA workshops 2011, pp 344–356

    Google Scholar 

  13. Nancy P, Geetha Ramani R (2012) Knowledge discovery (email harvesting, gender identification and prediction) in social network data (facebook 100 million URLs), Lecture notes in engineering and computer science. In: Proceedings of the world congress on engineering and computer science 2012, 24–26 October, 2012, San Francisco, USA, pp 449–454

    Google Scholar 

  14. Popular baby names [Online] Available: http://www.ssa.gov.OACT/babynames

  15. Facebook name list [Online] Available: http://sites.google.com/site/facebooknamelist/

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to P. Nancy .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Nancy, P., Geetha Ramani, R. (2014). Information Harvest from Social Network Data (Facebook 100 million URLS). In: Kim, H., Ao, SI., Amouzegar, M., Rieger, B. (eds) IAENG Transactions on Engineering Technologies. Lecture Notes in Electrical Engineering, vol 247. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-6818-5_36

Download citation

  • DOI: https://doi.org/10.1007/978-94-007-6818-5_36

  • Published:

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-94-007-6817-8

  • Online ISBN: 978-94-007-6818-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics