ABSTRACT
Widespread use of GPS devices and explosion of remotely sensed geospatial images along with cheap storage devices has resulted in vast amounts of data. More recently, with the advent of wireless technology, a large number of sensor networks have been deployed to monitor many human, biological and natural processes. This poses a challenge in many data rich application domains. The problem now is how best to choose the datasets to solve specific problems. Some of the datasets may be redundant and their inclusion in analysis may not only be time consuming, but may lead to erroneous conclusions. We propose the concept of data support as the basis for efficient, cost-effective and intelligent use of geospatial data in order to reduce uncertainty in the analysis and consequently in the results. Data support is defined as the process of determining the information utility of a data source to help decide which one to include or exclude to improve cost-effectiveness in existing data analysis. In this article we use mutual information as the basis of computing data support. The concept of mutual information is defined in information theory as a measure to compute information gain or loss between two disjoint datasets. We use this to compute the optimal datasets in specific applications. The effectiveness of the approach is demonstrated using an application in the hydrological analysis domain.
- Bishop, C. M. (2006). Pattern Recognition and Machine Learning. Springer Science+Business Media, LLC. Google ScholarDigital Library
- Chow, Ven Te. (1964). Handbook of applied hydrology. McGraw-Hill.Google Scholar
- Cover, T. M., Thomas, J. A. (2005). Elements of Information Theory, Second Edition. John Wiley & Sons, Inc., Hoboken, New Jersey. Google ScholarDigital Library
- Deng, K., Bourke, C., Scott, S., Sunderman, J., and Zheng, Y. (2007). Bandit-Based Algorithms for Budget Learning, in Proceedings of the Seventh IEEE International Conference on Data Mining (ICDM'2007), Omaha, NE, October 28--31, pp. 463--468. Google ScholarDigital Library
- Fu, L., Samal, A., Soh, L.-K. (2008). Techniques for Computing Fitness of Use (FoU) for Time Series Datasets with Applications in the Geospatial Domain. In Geoinformatica, Vol. 12, Issue 1, pp. 91--115. Google ScholarDigital Library
- Hunter, G. (1998). Managing Uncertainty in GIS, NCGIA Core Curriculum in GIScience, http://www.ncgia.ucsb.edu/giscc/units/u187/u187_f.html, posted February 03, 1998.Google Scholar
- Milton, J. S., and Arnold, J. C. (2002). Introduction to Probability and Statistics: Principles and Applications for Engineering and the Computing Sciences, 4th Edition. The McGraw-Hill Companies, Inc., New York, New York. Google ScholarDigital Library
Index Terms
- Computing information gain for spatial data support
Recommendations
Using spatial data support for reducing uncertainty in geospatial applications
Widespread use of GPS devices and ubiquity of remotely sensed geospatial images along with cheap storage devices have resulted in vast amounts of digital data. More recently, with the advent of wireless technology, a large number of sensor networks have ...
Research on the fusion method of spatial data and multimedia information of multimedia sensor networks in cloud computing environment
In the process of fusing spatial data and multimedia information of multimedia sensor networks in cloud computing environment, prone to transmit the repeated information many times, the energy consumption of multimedia sensor networks is increased, ...
The Extraction of Snow Cover Information Based on MODIS Data and Spatial Modeler Tool
ETTANDGRS '08: Proceedings of the 2008 International Workshop on Education Technology and Training & 2008 International Workshop on Geoscience and Remote Sensing - Volume 01The snow cover, which is an important component of the climatic system and a main indicator factor of the global change, will affect temperature, precipitation and other climatic factors. Remote sensing technologies introduce a potential scientific ...
Comments