ABSTRACT
The first task any individual faces after joining an online social network (OSN) is locating friends that are present on that particular site. Most OSNs offer some variation of a tool that imports email contact lists to facilitate the task of finding one's friends. However, given that OSNs attempt to reconnect individuals with past acquaintances, one might not have access to the email address for a long lost friend. Furthermore, people tend to utilize a number of aliases online, meaning that an email address cannot always be used to reliably find a friend. Thus, new members must still manually search for friends based on a number of biographical attributes, such as gender, age, hometown, etc. It is not clear, however, what attributes are useful for conducting the search. Even after the search has been performed, the person performing the search might be left with a number of candidate profiles. In this paper, we develop a system for searching and matching individuals in OSNs. We evaluate the efficacy of our person matching techniques by measuring the overlap between social networks, and comparing our results to those published by compete.com. We then look at several interesting properties of overlapping profiles in both networks.
- Y.-Y. Ahn, S. Han, H. Kwak, S. Moon, and H. Jeong. Analysis of topological characteristics of huge online social networking services. In WWW'07: Proceedings of the 16th international conference on World Wide Web, pages 835--844, New York, NY, USA, 2007. ACM. Google ScholarDigital Library
- L. Bilge, T. Strufe, D. Balzarotti, and E. Kirda. All your contacts are belong to us: Automated identity theft attacks on social networks. In 18th International World Wide Web Conference (WWW2009), April 2009. Google ScholarDigital Library
- Compete. http://blog.compete.com/2007/11/12/connecting-the-social-graph-member-overlap-at-opensocial-and-facebook.Google Scholar
- A. K. Elmagarmid, P. G. Ipeirotis, and V. S. Verykios. Duplicate record detection: A survey. IEEE Transactions On Knowledge and Data Engineering, 19, January 2007. Google ScholarDigital Library
- E. Hargittai. Whose space? differences among users and non-users of social networks, journal of computer mediated communication, vol 3, issue 1, 2007.Google Scholar
- R. Holzer, B. Malin, and L. Sweeney. Email Alias Detection Using Social Network Analysis. In Proc. LinkKDD, 2005. Google ScholarDigital Library
- JBoost. http://jboost.sourceforge.net/.Google Scholar
- D. Liben-Nowell and et al. Geographic routing in social networks. In Proceedings of the National Academy of Sciences (PNAS), pages 11623--11628, 2005.Google Scholar
- Mashable. http://mashable.com/2007/07/17/social-network-aggregators/.Google Scholar
- M. McNett, D. Gupta, A. Vahdat, and G. M. Voelker. Usher: An Extensible Framework for Managing Clusters of Virtual Machines. In Proceedings of Large Installation System Administration Conference, 2007. Google ScholarDigital Library
- A. Mislove, M. Marcon, K. P. Gummadi, P. Druschel, and B. Bhattacharjee. Measurement and analysis of online social networks. In IMC'07: Proceedings of the 7th ACM SIGCOMM conference on Internet measurement, pages 29--42, New York, NY, USA, 2007. ACM. Google ScholarDigital Library
- Rapleaf. http://www.nexdot.net/blog/2007/11/12/social-network-overlap-and-why-opensocial-could-be-useful/.Google Scholar
- F. P. Room. http://www.facebook.com/press/info.php?statistics.Google Scholar
- Tesseract. http://code.google.com/p/tesseract-ocr/.Google Scholar
- VentureBeat. http://venturebeat.com/2009/01/07/facebooks-traffic-growth-leaving-rivals-in-the-dust/.Google Scholar
- S. Webb, J. Caverlee, and C. Pu. Social Honeypots: Making Friends With A Spammer Near You. In Proceedings of the Conference on Email and Anti-Spam, 2008.Google Scholar
Index Terms
- I seek you: searching and matching individuals in social networks
Recommendations
Measurement and analysis of online social networks
IMC '07: Proceedings of the 7th ACM SIGCOMM conference on Internet measurementOnline social networking sites like Orkut, YouTube, and Flickr are among the most popular sites on the Internet. Users of these sites form a social network, which provides a powerful means of sharing, organizing, and finding content and contacts. The ...
Unified domain-specific language for collecting and processing data of social media
Data provided by social media becomes an increasingly important analysis material for social scientists, market analysts, and other stakeholders. Diversity of interests leads to the emergence of a variety of crawling techniques and programming ...
Design and Implementation of Facebook Crawler Based on Interaction Simulation
TRUSTCOM '12: Proceedings of the 2012 IEEE 11th International Conference on Trust, Security and Privacy in Computing and CommunicationsThe extensive use of Online Social Networks (OSNs) has attracted such wide attention of academia that OSNs have become a hot topic. Based on the interaction simulation, we have designed and implemented a Facebook crawler, which can obtain the complete ...
Comments