skip to main content
10.1145/1463434.1463502acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
poster

Computing information gain for spatial data support

Published:05 November 2008Publication History

ABSTRACT

Widespread use of GPS devices and explosion of remotely sensed geospatial images along with cheap storage devices has resulted in vast amounts of data. More recently, with the advent of wireless technology, a large number of sensor networks have been deployed to monitor many human, biological and natural processes. This poses a challenge in many data rich application domains. The problem now is how best to choose the datasets to solve specific problems. Some of the datasets may be redundant and their inclusion in analysis may not only be time consuming, but may lead to erroneous conclusions. We propose the concept of data support as the basis for efficient, cost-effective and intelligent use of geospatial data in order to reduce uncertainty in the analysis and consequently in the results. Data support is defined as the process of determining the information utility of a data source to help decide which one to include or exclude to improve cost-effectiveness in existing data analysis. In this article we use mutual information as the basis of computing data support. The concept of mutual information is defined in information theory as a measure to compute information gain or loss between two disjoint datasets. We use this to compute the optimal datasets in specific applications. The effectiveness of the approach is demonstrated using an application in the hydrological analysis domain.

References

  1. Bishop, C. M. (2006). Pattern Recognition and Machine Learning. Springer Science+Business Media, LLC. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Chow, Ven Te. (1964). Handbook of applied hydrology. McGraw-Hill.Google ScholarGoogle Scholar
  3. Cover, T. M., Thomas, J. A. (2005). Elements of Information Theory, Second Edition. John Wiley & Sons, Inc., Hoboken, New Jersey. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Deng, K., Bourke, C., Scott, S., Sunderman, J., and Zheng, Y. (2007). Bandit-Based Algorithms for Budget Learning, in Proceedings of the Seventh IEEE International Conference on Data Mining (ICDM'2007), Omaha, NE, October 28--31, pp. 463--468. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Fu, L., Samal, A., Soh, L.-K. (2008). Techniques for Computing Fitness of Use (FoU) for Time Series Datasets with Applications in the Geospatial Domain. In Geoinformatica, Vol. 12, Issue 1, pp. 91--115. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Hunter, G. (1998). Managing Uncertainty in GIS, NCGIA Core Curriculum in GIScience, http://www.ncgia.ucsb.edu/giscc/units/u187/u187_f.html, posted February 03, 1998.Google ScholarGoogle Scholar
  7. Milton, J. S., and Arnold, J. C. (2002). Introduction to Probability and Statistics: Principles and Applications for Engineering and the Computing Sciences, 4th Edition. The McGraw-Hill Companies, Inc., New York, New York. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Computing information gain for spatial data support

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        GIS '08: Proceedings of the 16th ACM SIGSPATIAL international conference on Advances in geographic information systems
        November 2008
        559 pages
        ISBN:9781605583235
        DOI:10.1145/1463434

        Copyright © 2008 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 5 November 2008

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • poster

        Acceptance Rates

        Overall Acceptance Rate220of1,116submissions,20%
      • Article Metrics

        • Downloads (Last 12 months)0
        • Downloads (Last 6 weeks)0

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader