ABSTRACT
We describe a method that predicts the location of user-generated content using textual features alone. Unlike previous methods for geotagging text documents, our proposed method is not sensitive to how we discretize space. We also discover that spatial resolution has an impact on the prediction accuracy, which allows us to trade-off the spatial resolution of the predicted location against our confidence about its accuracy. Our method can be used to estimate the error in document's predicted location, enabling us to filter out poor quality predictions. We evaluate the proposed method extensively on user-generated content collected from two different social media sites, Flickr and Twitter. Our evaluation examines its performance on the geotagging task and with respect to different parameters. We achieve state-of-the-art results for all three tasks: location prediction, error estimation and result ranking and also provide a theoretical explanation of the effect of spatial resolution factor on geotagging accuracy. Our findings provide valuable insights into the design of geotagging systems and their quality control.
- K. H. Brodersen, C. S. Ong, K. E. Stephan, and J. M. Buhmann. The balanced accuracy and its posterior distribution. In Pattern Recognition (ICPR), 2010 20th International Conference on, pages 3121--3124. IEEE, 2010. Google ScholarDigital Library
- D. J. Crandall, L. Backstrom, D. Huttenlocher, and J. Kleinberg. Mapping the world's photos. In WWW '09: Proceedings of the 18th international conference on World wide web, pages 761--770, New York, NY, USA, 2009. ACM. Google ScholarDigital Library
- B. Han, P. Cook, and T. Baldwin. Text-based twitter user geolocation prediction. J. Artif. Intell. Res.(JAIR), 49: 451--500, 2014. Google ScholarDigital Library
- C. Hauff, B. Thomee, and M. Trevisiol. Working notes for the placing task at mediaeval 2013. In MediaEval 2013 Workshop, Barcelona, Spain, 2013.Google Scholar
- S. Kinsella, V. Murdock, and N. O'Hare. I'm eating a sandwich in glasgow: modeling locations with tweets. In Proceedings of the 3rd international workshop on Search and mining user-generated contents, pages 61--68. ACM, 2011. Google ScholarDigital Library
- T. Rattenbury and M. Naaman. Methods for extracting place semantics from Flickr tags. ACM Transactions on the Web (TWEB), 3(1): 1, 2009. Google ScholarDigital Library
- O. Van Laere, S. Schockaert, and B. Dhoedt. Georeferencing flickr resources based on textual meta-data. Information Sciences, 238: 52--74, 2013. Google ScholarDigital Library
- M. P. Wand and M. C. Jones. Kernel smoothing, volume 60. Crc Press, 1994.Google Scholar
Index Terms
- Placing user-generated content on the map with confidence
Recommendations
Placing User-Generated Photo Metadata on a Map
SMAP '11: Proceedings of the 2011 Sixth International Workshop on Semantic Media Adaptation and PersonalizationIn this paper we analyze large user photo collections from Flickr in order to select the most appropriate tags to describe a geographical area. We cluster photos based on their latitude and longitude and divide large areas into smaller clusters, which ...
Placing images on the world map: a microblog-based enrichment approach
SIGIR '12: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrievalEstimating the geographic location of images is a task which has received increasing attention recently. Large numbers of images uploaded to platforms such as Flickr do not contain GPS-based latitude/longitude coordinates. Obtaining such geographic ...
On the "localness" of user-generated content
CSCW '10: Proceedings of the 2010 ACM conference on Computer supported cooperative workThe "localness" of participation in repositories of user-generated content (UGC) with geospatial components has been cited as one of UGC's greatest benefits. However, the degree of localness in major UGC repositories such as Flickr and Wikipedia has ...
Comments