Skip to main content

Leveraging VGI for Gazetteer Enrichment: A Case Study for Geoparsing Twitter Messages

  • Conference paper
  • First Online:
Web and Wireless Geographical Information Systems (W2GIS 2015)

Abstract

With the advent of Volunteered Geographical Information (VGI), the amount of user-contributed spatial data grows around the world each day. Such spatial data may contain valuable information which may help other research fields, such as the Digital Gazetteers used in Geographic Information Retrieval (GIR), for instance. The Digital Gazetteers have a powerful role in the geoparsing process. They need to keep themselves up-to-date and as complete as possible to enable geoparsers to perform lookup and then resolve toponym recognition precisely over digital texts. In this context, this paper proposes a method of gazetteer enrichment leveraging VGI data sources. Indeed VGI environments are not originally developed to work as gazetteers, however, they often contain more detailed and up-to-date information than gazetteers. Our method is applied in a geoparser environment by adapting its heuristics set besides enriching the corresponding gazetteer. A case study was performed by geoparsing Twitter messages focused solely on the microtexts in order to evaluate the performance of the enriched system. The results obtained were compared with previous results of a case study that used the same dataset and both the gazetteer and the geoparser without improvements.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Beard, K.: A semantic web based gazetteer model for VGI. In: Proceedings of the 1st ACM SIGSPATIAL International Workshop on Crowdsourced and Volunteered Geographic Information, GEOCROWD 2012, pp. 54–61. ACM, New York (2012). http://doi.acm.org/10.1145/2442952.2442962

  2. Bishr, M., Kuhn, W.: Geospatial information bottom-up: a matter of trust and semantics. In: Fabrikant, S., Wachowicz, M. (eds.) Lecture Notes in Geoinformation and Cartography, pp. 365–387. The European Information Society (2007)

    Google Scholar 

  3. Campelo, C.E.C., Baptista, C.d.S.: Geographic scope modeling for web documents. In: Proceedings of the 2nd International Workshop on Geographic Information Retrieval, pp. 11–18. GIR 2008, ACM, New York, NY, USA (2008). http://doi.acm.org/10.1145/1460007.1460010

  4. Campelo, C.E.C., de Souza Baptista, C.: A model for geographic knowledge extraction on web documents. In: Heuser, C.A., Pernul, G. (eds.) ER 2009. LNCS, vol. 5833, pp. 317–326. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  5. Falcão, A.G.R., Baptista, C.d.S., Menezes, L.C.d.: Crowd4City: utilizando sensores humanos como fonte de dados em cidades inteligentes (in portuguese). In: Proceedings of the 8th Brazilian Symposium on Information Systems, pp. 144–149. São Paulo, Brazil (2012)

    Google Scholar 

  6. Furche, T., Grasso, G., Orsi, G., Schallhart, C., Wang, C.: Automatically learning gazetteers from the deep web. In: Proceedings of the 21st International Conference Companion on World Wide Web, pp. 341–344. WWW 2012 Companion, ACM, New York, NY, USA (2012). http://doi.acm.org/10.1145/2187980.2188044

  7. Gelernter, J., Ganesh, G., Krishnakumar, H., Zhang, W.: Automatic gazetteer enrichment with user-geocoded data. In: Proceedings of the Second ACM SIGSPATIAL International Workshop on Crowdsourced and Volunteered Geographic Information, pp. 87–94. GEOCROWD 2013, ACM, New York, NY, USA (2013). http://doi.acm.org/10.1145/2534732.2534736

  8. Goodchild, M.F.: Citizens as voluntary sensors: spatial data infrastructure in the world of Web 2.0. International Journal of Spatial Data Infrastructures Research 2, 24–32 (2007)

    Google Scholar 

  9. Haklay, M., Weber, P.: OpenStreetMap: User-Generated Street Maps. IEEE Pervasive Computing 7(4), 12–18 (2008)

    Article  Google Scholar 

  10. Jilani, M., Corcoran, P., Bertolotto, M.: Automated highway tag assessment of open-streetmap road networks. In: Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pp. 4–7. Dallas, Texas, USA (2014)

    Google Scholar 

  11. Keßler, C., Janowicz, K., Bishr, M.: An agenda for the next generation gazetteer: geographic information contribution and retrieval. In: Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pp. 91–100. GIS 2009, ACM, New York, NY, USA (2009). http://doi.acm.org/10.1145/1653771.1653787

  12. Koukoletsos, T., Haklay, M., Ellul, C.: Assessing Data Completeness of VGI through an Automated Matching Procedure for Linear Data: Assessing Data Completeness of VGI. Transactions in GIS 16(4), 477–498 (2012). http://discovery.ucl.ac.uk/1354847/

    Article  Google Scholar 

  13. Lamprianidis, G., Skoutas, D., Papatheodorou, G., Pfoser, D.: Extraction, integration and analysis of crowdsourced points of interest from multiple web sources. In: Proceedings of the 3rd ACM SIGSPATIAL International Workshop on Crowdsourced and Volunteered Geographic Information, pp. 16–23. GeoCrowd 2014, ACM, New York, NY, USA (2014). http://doi.acm.org/10.1145/2676440.2676445

  14. Moura, T.H.V.M., Davis Jr., C.A.: Integration of linked data sources for gazetteer expansion. In: Proceedings of 8th ACM SIGSPATIAL Workshop on Geographic Information Retrieval (GIR 2014) (2014)

    Google Scholar 

  15. Oliveira, M.G.d., Baptista, C.d.S., Campelo, C.E.C., Acioli Filho, J.A.M., Falcão, A.G.R.: Automated production of volunteered geographic information from social media. In: Proceedings of XV Brazilian Symposium on GeoInformatics, pp. 118–129 (2014)

    Google Scholar 

  16. Peng, X., Chen, R., Cheng, C., Yan, X.: A folksonomy-ontology-based digital gazetteer service. In: 2010 18th International Conference on Geoinformatics, pp. 1–6 (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Maxwell Guimarães de Oliveira .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

de Oliveira, M.G., Campelo, C.E.C., de Souza Baptista, C., Bertolotto, M. (2015). Leveraging VGI for Gazetteer Enrichment: A Case Study for Geoparsing Twitter Messages. In: Gensel, J., Tomko, M. (eds) Web and Wireless Geographical Information Systems. W2GIS 2015. Lecture Notes in Computer Science(), vol 9080. Springer, Cham. https://doi.org/10.1007/978-3-319-18251-3_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-18251-3_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-18250-6

  • Online ISBN: 978-3-319-18251-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics