Review

A Review of Practical AI for Remote Sensing in Earth Sciences

1 Center for Spatial Information Science and Systems, College of Science, George Mason University, 4400 University Drive, MSN 6E1, Fairfax, VA 22030, USA
2 Department of Civil and Environmental Engineering, University of Washington, Seattle, WA 98195, USA
* Author to whom correspondence should be addressed.
Remote Sens. 2023, 15(16), 4112; https://doi.org/10.3390/rs15164112
Submission received: 7 July 2023 / Revised: 14 August 2023 / Accepted: 15 August 2023 / Published: 21 August 2023

Abstract

Integrating Artificial Intelligence (AI) techniques with remote sensing holds great potential for revolutionizing data analysis and applications in many domains of Earth sciences. This review paper synthesizes the existing literature on AI applications in remote sensing, consolidating and analyzing AI methodologies, outcomes, and limitations. The primary objectives are to identify research gaps, assess the effectiveness of AI approaches in practice, and highlight emerging trends and challenges. We explore diverse applications of AI in remote sensing, including image classification, land cover mapping, object detection, change detection, hyperspectral and radar data analysis, and data fusion. We present an overview of the remote sensing technologies, methods employed, and relevant use cases. We further explore challenges associated with practical AI in remote sensing, such as data quality and availability, model uncertainty and interpretability, and integration with domain expertise as well as potential solutions, advancements, and future directions. We provide a comprehensive overview for researchers, practitioners, and decision makers, informing future research and applications at the exciting intersection of AI and remote sensing.

Graphical Abstract

1. Introduction

Remote sensing is a technology that enables data collection without direct contact with the subject, utilizing sensors to measure or detect various types of energy, such as electromagnetic radiation and acoustic signals, emitted, reflected, or scattered by the object under investigation [1]. Multiple sensors and platforms have been developed for remote sensing. As sensors continue to advance, the amount of remote sensing data generated has reached staggering proportions. For example, according to NASA’s Earth Science Data Systems (ESDS), the Earthdata Cloud held more than 59 petabytes (PB) of data as of September 2021. ESDS estimates that this amount is expected to increase to more than 148 PB in 2023, 205 PB in 2024, and 250 PB in 2025 [2]. To manage this massive volume of remote sensing data effectively, preprocessing techniques, including noise reduction, sensor calibration, and data compression, are used to minimize data size, while computer systems with ample memory and parallel processing capabilities facilitate the handling of these large datasets [3].
With the increasing data quality and volume from remote sensing platforms, there is a need for computational platforms and effective tools to handle and extract valuable information from remote sensing datasets. AI tools can assist in managing large volumes of observations, modeling, analysis, and environmental forecasting, and have proven effective for key tasks such as noise reduction [4], data fusion [5], object detection [6,7], and many other important applications. As AI technologies develop, acquiring and storing remote sensing data becomes increasingly important. Obtaining this large volume of data entails using various sensors on different platforms, such as Unmanned Aerial Vehicles (UAVs) [8], unmanned ground vehicles (UGVs), aircraft, and satellites. These platforms carry instruments including Global Positioning System (GPS) receivers, Inertial Measurement Units (IMUs), LiDAR, and cameras, which together georeference the observations and capture diverse types of energy, such as electromagnetic radiation and acoustic signals, emitted, reflected, or scattered by the objects of interest. In remote sensing, fusing data from multiple sensors, such as LiDAR, multispectral or hyperspectral imaging, and radar, facilitates comprehensive and detailed analysis of the Earth’s surface, atmosphere, and environment [9]. In advanced applications, AI-powered onboard and ground processing systems take center stage, autonomously handling critical tasks like calibration, filtering, filling, and scaling [10,11]. These algorithms identify intricate patterns and detect anomalies, minimizing subjectivity and bias in the analysis process and empowering researchers to assimilate, analyze, and interpret vast amounts of remote sensing data with unprecedented speed and accuracy.
A number of challenges related to AI approaches may limit their practical applications. For example, training AI algorithms, especially deep learning models, requires significant computational resources, making them challenging to develop on resource-constrained shared devices. Many neural network-based models are often considered black-box models, and understanding the reasons behind AI predictions is difficult but critical for gaining trust and ensuring effective decision making [12]. Creating labeled datasets for training AI models in remote sensing can be labor-intensive and time consuming, especially for fine-grained or multi-class tasks [13], and transferring AI models trained on one dataset to perform well on different datasets can also require additional resources. Incorporating domain-specific knowledge and expertise into AI models is essential to ensure the representation of relevant features and relationships [14,15].
To successfully deploy an operational AI model, there are a few critical steps to consider. First, real-world applications usually need AI models to scale efficiently to process large-scale remote sensing data in real time, with minimal turnaround. Practical AI systems require collaborative platforms for AI developers, domain experts, and remote sensing practitioners working together to share knowledge, data, and best practices, with public-facing applications displaying user-friendly tools and interfaces that enable non-experts to leverage AI capabilities for remote sensing applications effectively. Uncertainty estimates are also needed for decision-making processes, especially in accuracy-critical applications like precision agriculture and environmental monitoring [16]. When integrated with social media and sensitive data, AI systems need to address privacy concerns, ethical considerations, and compliance with local and international regulations.
This review paper aims to comprehensively evaluate and synthesize the existing literature on practical AI in remote sensing and the need for its development. We aim to provide valuable insights that inform future research and applications. Key contributions of this paper include the following:
  • Overview of successful examples of practical AI in research and real-world applications;
  • Discussion of research challenges and reality gaps in the practical integration of AI with remote sensing;
  • Emerging trends and advancements in practical AI techniques for remote sensing;
  • Common challenges practical AI faces in remote sensing, such as data quality, availability of training data, interpretability, and the requirement for domain expertise;
  • Potential practical AI solutions and ongoing or future real-world applications.
We adopted a structured approach to organize this paper. We begin with a background section that provides crucial context on AI and remote sensing, emphasizing key techniques. Subsequently, we explore various applications of AI in remote sensing, presenting an overview of the methods employed and relevant use cases. Additionally, we discuss the challenges related to AI integration in remote sensing. Finally, we summarize futuristic AI applications that can potentially transform various fields beyond what we currently imagine.

2. Basics of AI and Remote Sensing

This section comprehensively explores the fundamental concepts of remote sensing and discusses key AI techniques in this field. A systematic literature review was conducted to achieve a comprehensive understanding, encompassing reputable sources such as peer-reviewed publications, conference papers, and technical reports. The selected literature was critically analyzed, and key insights and findings were synthesized to provide comprehensive coverage of a broad spectrum of AI techniques in remote sensing.

2.1. Brief Recap of Remote Sensing Technologies

Understanding the fundamental principles of remote sensing is important for comprehending its diverse techniques and applications and integrating them with AI techniques. Remote sensing systems are built to take advantage of the various parts of the electromagnetic spectrum (Figure 1) and atmospheric windows to observe different targets. Passive sensors detect natural energy emitted or reflected by the Earth, such as optical sensors that capture sunlight reflection (Figure 2a), whereas active sensors emit energy and measure the reflected or backscattered signals (Figure 2b). This wide range of sensors enables remote sensing data to be acquired via satellites for global coverage, aircraft for higher spatial resolution, and drones for small-scale data collection [17].
Once remote sensing data is acquired, interpreting images and digital data becomes crucial in extracting meaningful information. Digital image processing techniques include filtering, image fusion, feature extraction, and classification algorithms, enabling the extraction of valuable insights [18]. The following paragraphs describe the main remote sensing techniques, while AI methods that assist with data processing are presented in Section 2.2.

2.1.1. Optical Remote Sensing

This technique focuses on gathering and interpreting optical data, primarily within the visible and near-infrared sections of the electromagnetic spectrum (Figure 1) [19]. As sunlight interacts with the Earth’s surface, materials on the surface absorb and reflect specific wavelengths of light. This interaction creates unique spectral signatures that are characteristic of different surface features [20]. The sensors, available in handheld, airborne, and spaceborne modes, contain detectors that record light intensity across different wavelengths (Figure 3). The recorded data are transmitted to ground stations or processing centers, where they are processed and transformed into images or spectral data.
In the context of optical remote sensing image (RSI) object detection, the primary objective is to ascertain whether a given aerial or satellite image contains pertinent objects and precisely determine their locations [21]. To ensure image quality, several processing steps are undertaken. Preprocessing involves noise removal and contrast enhancement to improve clarity and interpretability, followed by feature extraction, where relevant characteristics are identified and extracted from the images for further analysis. The ultimate objective is to classify objects within the images, enabling effective interpretation and understanding of the image information. Finally, an accuracy assessment is performed to verify the reliability and precision of the results.
In optical remote sensing, three primary modes are commonly used: handheld, airborne, and spaceborne. Handheld sensors capture spectral signatures of ground objects, facilitating ground-truthing and small-scale data collection. Airborne sensors mounted on airplanes or drones offer higher spatial resolution and efficient coverage of larger areas, making them useful for tasks such as land cover/land-use mapping [22], crop health assessment, and identification of ecological hotspots. Spaceborne sensors on satellites provide extensive coverage and repeated observations over time, enabling the mapping of large areas, monitoring changes in land use, tracking migratory patterns, and observing atmospheric conditions. The wealth of data collected by spaceborne sensors contributes significantly to various applications, including environmental monitoring, urban planning, disaster management [23], and climate studies.
Vegetation indices, like the Normalized Difference Vegetation Index (NDVI) [24], are derived from optical remote sensing data by analyzing reflectance and absorption [25]. They serve as early detectors of nutrient deficiencies by studying light reflection changes [26]. For instance, higher near-infrared reflection often means nitrogen shortage, whereas less red light reflection could indicate phosphorus deficiency [27,28]. Monitoring these indices over time offers predictive insights into vegetation growth dynamics, including crop trends. AI analysis of historical data uncovers vegetation responses to changing conditions and can inform fertilizer and pesticide use by farmers, resulting in resource savings, higher yields, and reduced chemical reliance. AI-based methods have also proved valuable for deriving snow-covered areas from sensors with radiometric information limited to visible and near-infrared bands [29,30], allowing for applications in environmental monitoring at meter-scale spatial resolution.
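As a concrete illustration, the sketch below computes NDVI from red and near-infrared reflectance arrays with NumPy; the band values are synthetic stand-ins rather than output from any particular sensor.

```python
import numpy as np

def ndvi(nir: np.ndarray, red: np.ndarray) -> np.ndarray:
    """Normalized Difference Vegetation Index: (NIR - Red) / (NIR + Red)."""
    nir = nir.astype(np.float64)
    red = red.astype(np.float64)
    denom = nir + red
    out = np.zeros_like(nir)
    # Guard against division by zero (e.g., shadow pixels where both bands ~ 0).
    np.divide(nir - red, denom, out=out, where=denom != 0)
    return out

# Synthetic reflectance values in [0, 1]:
nir_band = np.array([[0.6, 0.5], [0.2, 0.1]])
red_band = np.array([[0.1, 0.2], [0.2, 0.1]])
print(ndvi(nir_band, red_band))  # healthy vegetation -> values near +1
```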

2.1.2. Radar Remote Sensing

This technique operates in the microwave region of the electromagnetic spectrum (Figure 1), involving the transmission and reception of microwave waves [31]. A radar antenna emits pulses of microwave radiation toward the Earth or space and captures the echoes reflected by the targets; these echoes carry information about the targets’ characteristics, including distance, direction, shape, size, roughness, and dielectric properties (Figure 4) [32]. By analyzing the time and intensity of the echo signals, radar remote sensing can generate images or maps of the targets with varying resolutions and perspectives. It is widely used in mapping land surfaces, monitoring weather patterns, studying ocean currents, and detecting objects such as buildings and vehicles [33].
Synthetic Aperture Radar (SAR) produces high-resolution surface images and is particularly valuable for large-scale forest cover mapping because it can penetrate clouds and foliage, enabling accurate mapping even in challenging weather or limited visibility conditions [34]. The dual-polarization technology employed by SAR allows for differentiation between different forest canopy types and the underlying vegetation. When the radar signal encounters the forest canopy, it scatters, with a portion of the signal returning to the radar instrument. This returned signal carries crucial information about forest structure and biomass. By incorporating dual-polarization radar, the accuracy and comprehensiveness of forest mapping are enhanced, providing detailed insights into both the forest structure and the underlying vegetation, and enabling SAR to generate high-resolution data that can detect changes in forest cover with exceptional precision [35].

2.1.3. LiDAR

LiDAR operates by emitting pulsed lasers that reach a target, and the time it takes for the reflected light to return to the sensor is precisely measured to calculate the distance between the sensor and the object (Figure 5) [36]. For airborne surveys, the distance traveled is then converted to elevation, and multiple returns allow for mapping forests and tree heights [37,38]. LiDAR systems incorporate a GPS receiver, which identifies the location of the emitted light energy, and an Inertial Measurement Unit (IMU), which provides the aircraft’s orientation in the sky.
LiDAR systems record the reflected light as a waveform or distribution in two different ways. In a discrete return LiDAR system [39], the waveform curve is analyzed to identify individual peaks, with an individual ground point recorded at each peak location. A full waveform LiDAR system records the complete distribution of the returned light energy; although its data processing is more complex, it can capture a larger amount of information than a discrete return system. Whether collected as discrete points or entire waveforms, LiDAR data are often delivered as a LiDAR point cloud, which represents a three-dimensional collection of points in space.

2.1.4. Thermal Remote Sensing

This technique is a passive remote sensing method that measures the radiant flux emitted by ground objects within specific wavelength ranges, typically 3–5 μm and 8–14 μm [40,41]. Thermal cameras, radiometers, and other sensors are utilized to capture energy within the thermal infrared (TIR) range. The thermal detector can be either cryogenic or uncooled and converts the data into electrical signals, which are then processed to generate thermal images or temperature data of the target object or surface. By analyzing these thermal images and data, valuable information about the object’s emissivity, reflectivity, and temperature can be obtained. Factors that can impact the accuracy of TIR remote sensing data include atmospheric conditions, changes in solar illumination, and variations in target emissivity and radiance. To address these uncertainties, TIR data often undergo calibration or correction processes to ensure precise temperature measurement and analysis. Thermal remote sensing can be employed in environmental monitoring and wildfire detection [42,43]. As an example, Figure 6 shows a temperature map derived from ECOSTRESS data collected during the historic Pacific Northwest heatwave in 2021.

2.1.5. Multispectral and Hyperspectral Imaging

Multispectral cameras can detect a broader range of wavelengths beyond the visible spectrum, including infrared and ultraviolet (Figure 1). Multispectral imaging relies on spectral signatures rather than spatial shape to detect and discriminate among different materials in a scene [44]. The camera captures a sequence of images using different filters that target specific wavelengths or bands of light in parallel, forming a comprehensive dataset containing information from various spectral channels. The images then undergo a series of processing steps, including normalization, calibration, alignment, registration, noise reduction, and enhancement. Hyperspectral imaging (HSI) [45] is a more advanced technique that collects information across the electromagnetic spectrum with very high spectral resolution from ground objects using hundreds of narrow bands [46]. HSI data contain numerous narrow spectral bands, creating a dataset known as a hyperspectral image cube containing spatial dimensions (x, y coordinates) and spectral bands (wavelengths) and enabling detailed analysis of reflected or emitted light at specific spectral intervals. However, the high-dimensional and noisy nature of the data poses analysis challenges, requiring the application of algorithms that facilitate denoising, classification, detection, and other tasks. It should be noted that there is no absolute threshold on the number of bands that distinguishes multispectral from hyperspectral remote sensing [47].
Ultimately, data from multiple sensors can be combined to gain a deeper understanding of the system investigated [48,49]. Table 1 provides an overview of each remote sensing technique, with its advantages, limitations, and applications.

2.2. Key AI Techniques in Remote Sensing

2.2.1. Conventional Machine Learning in Remote Sensing

The remote sensing community has extensively utilized conventional machine learning methods for various tasks such as classification, object detection, and geophysical parameter estimation. These methods have proven effective in handling multi-temporal and multi-sensor remote sensing data, providing valuable information for environmental monitoring [14,50,51,52,53].
Ensemble decision-tree-derived classifiers are well-known algorithms for classification tasks with remote sensing data [54,55,56]. These algorithms include bagging [57], boosting [58,59], and random forest (RF) techniques [60]. The RF approach has been used in a variety of applications, ranging from land cover classification [61,62,63,64,65,66] to data fusion [7,67] and classification tasks using hyperspectral data [68,69]. Random forest builds on bagging, creating an ensemble of decision trees by randomly selecting samples and features from the training data. By combining multiple decision trees, RF classifiers can provide robust predictions while offering variable importance (VI) measurements and are often used in remote sensing applications [70]. This feature selection method allows RF to effectively rank and eliminate irrelevant features, reducing dimensionality and identifying the most significant remote sensing and geographic data that offer new insights into the Earth system [49,71]. The selective feature choice in RF is particularly beneficial as it prevents overfitting, enhances generalization, and reduces computational load and redundancy. Despite these advantages, accurately selecting discriminatory variables from high-dimensional remote sensing data remains challenging [72], and the selection of training data may influence the results [73].
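As a minimal sketch of this workflow, the example below trains a scikit-learn random forest on synthetic pixel samples (hypothetical spectral bands and class labels rather than a real dataset) and prints the per-band variable importance scores discussed above.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Hypothetical pixel samples: rows are pixels, columns are spectral bands.
rng = np.random.default_rng(0)
X = rng.random((1000, 6))            # e.g., six surface reflectance bands
y = rng.integers(0, 4, size=1000)    # e.g., four land cover classes

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

rf = RandomForestClassifier(n_estimators=200, random_state=0)
rf.fit(X_train, y_train)

print("accuracy:", rf.score(X_test, y_test))
# Variable importance (VI): per-band contribution to the ensemble's splits.
print("band importances:", rf.feature_importances_)
```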
Similar to RF, boosting approaches such as the Extreme Gradient Boosting (XGBoost) method also utilize decision trees as base learners but take the process further by combining the strengths of individual trees in a boosting technique [74]. This iterative process sequentially creates decision trees, with each subsequent tree focused on correcting the errors of its predecessors. This approach helps XGBoost achieve low bias and variance, ultimately improving classification. An advantage of XGBoost in remote sensing data classification is its ability to handle cases where different classes (e.g., algal bloom species) exhibit similar spectral signatures but may have varying concentrations or distributions [75]. To ensure optimal accuracy and prevent overfitting, XGBoost employs hyper-parameter tuning techniques.
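A corresponding boosted-tree setup with the xgboost package is sketched below, again on synthetic stand-in data; the hyper-parameter values are illustrative starting points rather than tuned settings.

```python
import numpy as np
from xgboost import XGBClassifier
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.random((1000, 6))                 # hypothetical spectral features
y = rng.integers(0, 4, size=1000)         # hypothetical class labels
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# Boosting: trees are added sequentially, each correcting its predecessors.
model = XGBClassifier(
    n_estimators=300,      # number of boosting rounds
    max_depth=6,           # depth of each base tree
    learning_rate=0.1,     # shrinkage on each tree's contribution
    subsample=0.8,         # row subsampling per tree to curb overfitting
)
model.fit(X_tr, y_tr)
print("accuracy:", model.score(X_te, y_te))
```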
Another conventional technique is the Support Vector Machine (SVM), which categorizes data by discovering high-dimensional hyperplanes that effectively separate distinct classes, leading to improved data generalization and better image classification [76]. SVMs handle challenges like non-linearity and dimensionality via the kernel trick, which maps input data into higher-dimensional spaces, and rely on a subset of training data, referred to as support vectors, to establish decision boundaries. By leveraging kernel functions, SVMs transform input data, enabling the identification of hyperplanes in expanded dimensions and effectively accommodating scenarios in which original feature separability is limited [77]. Notably, SVMs incorporate a flexible soft margin approach, allowing for a degree of misclassification tolerance [78].
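The sketch below assembles such a classifier in scikit-learn, with an RBF kernel providing the implicit higher-dimensional mapping and the C parameter controlling the soft margin; the features and labels are synthetic placeholders.

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import make_pipeline

rng = np.random.default_rng(1)
X = rng.random((500, 6))                   # hypothetical spectral features
y = (X[:, 0] + X[:, 1] > 1).astype(int)    # synthetic two-class labels

# The RBF kernel implicitly maps pixels into a higher-dimensional space;
# C sets the soft-margin tolerance for misclassified training samples.
clf = make_pipeline(StandardScaler(), SVC(kernel="rbf", C=1.0, gamma="scale"))
clf.fit(X, y)
print("support vectors per class:", clf.named_steps["svc"].n_support_)
```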

2.2.2. Deep Learning in Remote Sensing

Deep learning, a subfield of machine learning, has emerged as a valuable tool in remote sensing, offering solutions to unprecedented challenges and creating new opportunities in remote sensing applications [53,79,80,81]. Deep learning utilizes hierarchical artificial neural networks to identify patterns within data and extract valuable features from large and complex datasets [82]. During training, the network adjusts weights and biases through a process known as backpropagation, enhancing its ability to recognize patterns and relationships as it processes more data. Deep learning networks gradually transform the data into representations suitable for specific tasks such as image preprocessing, object recognition, and pixel-based classification [83]. This section lists and briefly introduces some common deep learning algorithms.
1. Deep Convolutional Neural Networks (DCNNs)
Deep Convolutional Neural Networks, DCNNs, utilize a multi-layer architecture effective for image recognition and classification tasks [84,85]. The architecture of DCNNs consists of multiple layers, in which the initial layers, known as convolutional layers, play a fundamental role in detecting low-level features within the input image (Figure 7). They achieve this by applying convolutional filters, also called kernels, to the image. These filters effectively act as feature detectors, focusing on edges, corners, and other basic patterns that characterize the image, helping identify simple shapes and textures in the scene. A non-linear activation function, Rectified Linear Unit, ReLU [86], is applied after each convolutional operation to introduce non-linearity and enable the learning of more intricate patterns. Following the convolutional layers, pooling layers are utilized to reduce the spatial dimensions of the data while retaining the essential information. Pooling achieves this downsampling by aggregating information from neighboring pixels and introducing the ability to detect certain features regardless of their spatial position within the image. The convolution and pooling process is typically repeated multiple times to allow the network to learn higher-level features and representations progressively. As the network goes deeper into its layers, it can capture increasingly abstract and sophisticated features essential for recognizing complex objects or patterns. The last fully connected layer of the DCNN generates probabilities associated with the different classes of objects, with the softmax activation function ensuring that the class probabilities sum up to one. This final classification step enables the network to recognize and categorize objects present in the remote sensing image accurately [87].
The convolution is calculated using the following equation:
y[i, j] = Σ_m Σ_n x[i + m, j + n] w[m, n] + b
where y[i,j] is the output feature map at position (i, j), x is the input image, w is the filter, b is the bias term, and m and n are the indices of the filter.
An activation function can be defined as
ReLU(x) = max(0, x)
setting all negative values to zero and leaving positive values unchanged.
ReLU is a simple activation function that is computationally efficient and helps alleviate the vanishing gradient problem, which can occur during backpropagation in DCNNs. It is worth noting that ReLU is not without its limitations. One issue is the “dying ReLU” problem, where neurons can become “stuck” during training and become inactive, resulting in zero activations that prevent learning. To address this, variants like Leaky ReLU [88] and Parametric ReLU [89] have been introduced. While Figure 7 illustrates a basic DCNN architecture as an example, recent years have seen the evolution of more specialized architectures for specific applications. Notably, U-Net [90] and SegNet [91] are tailored for semantic segmentation tasks in images. U-Net features a contracting path with repeated 3 × 3 convolutions, ReLU activations, and 2 × 2 max pooling for feature extraction, followed by an expansive path for upsampling and generating detailed segmentation masks. On a similar note, SegNet focuses on pixel-wise image labeling. It comprises an encoder network akin to VGG16’s convolutional layers, a decoder network for low-to-full resolution feature mapping, and a pixel-wise classification layer. Earlier in this timeline, AlexNet [92] had ushered in a new era for DCNNs with its multi-layered architecture, employing convolution, max pooling, and Local Response Normalization (LRN) to process image features. VGG increased depth with its 3 × 3 convolutional kernels, leading to the VGG16 and VGG19 models known for their accuracy. The Inception network, designed by Google, utilizes diverse kernel sizes for capturing features at varying scales, whereas DeepLab [93] harnesses DCNNs, atrous convolution, and CRFs for precise semantic segmentation, achieving high accuracy and efficiency.
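To make these equations concrete, the following NumPy sketch implements the convolution sum above in valid mode (no padding, stride 1) followed by ReLU; the toy image and edge-style kernel are illustrative stand-ins, not taken from any dataset.

```python
import numpy as np

def conv2d_valid(x: np.ndarray, w: np.ndarray, b: float = 0.0) -> np.ndarray:
    """y[i, j] = sum_m sum_n x[i+m, j+n] * w[m, n] + b  (no padding, stride 1)."""
    H, W = x.shape
    kH, kW = w.shape
    y = np.empty((H - kH + 1, W - kW + 1))
    for i in range(y.shape[0]):
        for j in range(y.shape[1]):
            y[i, j] = np.sum(x[i:i + kH, j:j + kW] * w) + b
    return y

def relu(x: np.ndarray) -> np.ndarray:
    return np.maximum(0.0, x)

image = np.arange(25, dtype=float).reshape(5, 5)   # toy single-band image
edge_filter = np.array([[-1.0, 0.0, 1.0]] * 3)     # crude vertical-edge kernel
feature_map = relu(conv2d_valid(image, edge_filter))
print(feature_map.shape)  # (3, 3)
```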
2. Deep Residual Networks (ResNets)
In remote sensing, the need for deep neural networks arises from the complexity of high-dimensional and noisy data caused by the similar spectral characteristics of objects. However, neural networks are trained using a back-propagation process that relies on gradient descent, which decreases the loss function and finds the weights that minimize it. If there are too many layers, repeated multiplications will eventually reduce the gradient until it “disappears”, and performance will plateau or deteriorate with each additional layer [94]. ResNets were introduced as a solution to this “degradation problem” in deep learning models [95,96].
ResNets introduce residual blocks built around “skip connections” (also called “shortcut connections”). These skip connections allow for the stacking of multiple identity mappings, which are essentially convolutional layers that initially do nothing. By bypassing and reusing the activations of the previous layer, the skip connections provide a shortcut for the gradients to flow more directly during backpropagation. This helps to speed up the initial training phase by effectively compressing the network into fewer layers.
The core component of residual learning is the residual block with its skip connection, defined as
y = F(x) + x
where F is the residual mapping (a sequence of convolutional layers), x is the input to the block, and y is the output. Residual blocks allow much deeper neural networks to be trained by bypassing one or more layers in between. A “shortcut projection”, which is a 1 × 1 convolutional layer denoted as P(x), can be incorporated within the skip connection, allowing for dimension adjustment and alignment of the feature maps. Shortcut projection is represented as
y = F(x) + P(x)
where P represents the 1 × 1 convolutional layer used for dimension adjustment. By ensuring that the information passed between layers is well-aligned and optimized, shortcut projection contributes to faster training convergence and more effective model learning. The initial training enables the model to establish a baseline data representation. Once this initial training is complete, all layers are expanded, and the remaining parts of the network, known as the residual parts, are allowed to explore more of the feature space of the input image. Through these techniques, ResNets address the vanishing gradient problem and facilitate the training of much deeper models, which can effectively capture and represent the complex and subtle patterns present in remote sensing imagery.
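The PyTorch sketch below expresses both equations as a reusable block, using a 1 × 1 convolution for P(x) whenever the input and output shapes differ; the channel widths and stride are arbitrary illustrative choices.

```python
import torch
import torch.nn as nn

class ResidualBlock(nn.Module):
    """y = F(x) + x, or y = F(x) + P(x) when dimensions change (P: 1x1 conv)."""
    def __init__(self, in_ch: int, out_ch: int, stride: int = 1):
        super().__init__()
        self.F = nn.Sequential(                      # residual mapping F(x)
            nn.Conv2d(in_ch, out_ch, 3, stride=stride, padding=1, bias=False),
            nn.BatchNorm2d(out_ch),
            nn.ReLU(inplace=True),
            nn.Conv2d(out_ch, out_ch, 3, padding=1, bias=False),
            nn.BatchNorm2d(out_ch),
        )
        if stride != 1 or in_ch != out_ch:
            # Shortcut projection P(x): 1x1 conv aligning shape with F(x).
            self.P = nn.Sequential(
                nn.Conv2d(in_ch, out_ch, 1, stride=stride, bias=False),
                nn.BatchNorm2d(out_ch),
            )
        else:
            self.P = nn.Identity()                   # plain skip connection
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.relu(self.F(x) + self.P(x))

block = ResidualBlock(64, 128, stride=2)
print(block(torch.randn(1, 64, 32, 32)).shape)  # torch.Size([1, 128, 16, 16])
```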
3. You Only Look Once (YOLO)
Algorithms for real-time object detection and segmentation in remote sensing images represent a significant advancement, with applications in the identification and classification of multiple objects within large datasets of images or video frames. The algorithm named YOLO (You Only Look Once) has gained popularity for its ability to process the entire image in a single pass using a Single Shot Detector and a CNN [97], initially leveraging the Darknet framework [98]. Within YOLO, bounding boxes indicating the location, class, and confidence score of each detected object within the image are generated [99] (Figure 8). The confidence score produced by YOLO reflects both the likelihood of an object being present in the bounding box and the accuracy of the box itself and is used in the final detection process. Overlapping bounding boxes can still occur. To refine the results and ensure only the most accurate detections are retained, YOLO incorporates Non-Maximum Suppression (NMS), a technique that eliminates redundant bounding boxes by keeping only the one with the highest confidence score. YOLOv2 [100] improved the speed and the range of object types detected, and YOLOv3 enabled the prediction of objects of different sizes [101,102,103].
YOLO has further evolved through multiple versions, currently eight, with different updates, including changes in backbone architectures, the addition and later removal of anchors, and the use of the PyTorch and PaddlePaddle frameworks, with the overall goal of balancing speed and accuracy for real-time object detection [104,105].
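The following self-contained NumPy sketch implements the greedy, IoU-based NMS step described above; the boxes, scores, and threshold are illustrative.

```python
import numpy as np

def nms(boxes: np.ndarray, scores: np.ndarray, iou_thresh: float = 0.5):
    """Keep the highest-scoring box, drop boxes overlapping it above iou_thresh."""
    x1, y1, x2, y2 = boxes.T
    areas = (x2 - x1) * (y2 - y1)
    order = scores.argsort()[::-1]      # indices by descending confidence
    keep = []
    while order.size > 0:
        i = order[0]
        keep.append(i)
        # Intersection of box i with the remaining boxes.
        xx1 = np.maximum(x1[i], x1[order[1:]])
        yy1 = np.maximum(y1[i], y1[order[1:]])
        xx2 = np.minimum(x2[i], x2[order[1:]])
        yy2 = np.minimum(y2[i], y2[order[1:]])
        inter = np.clip(xx2 - xx1, 0, None) * np.clip(yy2 - yy1, 0, None)
        iou = inter / (areas[i] + areas[order[1:]] - inter)
        order = order[1:][iou <= iou_thresh]  # survivors for the next round
    return keep

boxes = np.array([[0, 0, 10, 10], [1, 1, 11, 11], [50, 50, 60, 60]], float)
scores = np.array([0.9, 0.8, 0.7])
print(nms(boxes, scores))  # [0, 2]: the second box is suppressed
```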
4. Faster Region-Based CNN (R-CNN)
Faster R-CNN is a two-step approach for object detection in remote sensing [106] based on two key modules: the Region Proposal Network (RPN) and the Fast R-CNN detector. The Fast R-CNN module is an upgrade of the previous R-CNN approach, allowing simultaneous processing of the entire image and region proposals in a single forward propagation pass and replacing the slower SVM-based classification with a softmax layer, increasing processing speed while also improving detection accuracy [107]. The RPN uses predefined bounding boxes of various scales and aspect ratios to determine areas of interest for the detector. The RPN operates by sliding a small network over the convolutional feature map, producing object proposals with corresponding objectness scores that undergo further processing through fully connected layers for box regression and box classification. This allows the model to refine the positions of the proposed bounding boxes and classify them accurately.
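As a hands-on starting point, recent torchvision versions ship a ready-made Faster R-CNN; the sketch below runs inference with generic COCO-pretrained weights on a random tensor standing in for an aerial image tile. A real remote sensing application would fine-tune the detector on domain-specific imagery.

```python
import torch
from torchvision.models.detection import fasterrcnn_resnet50_fpn

# Off-the-shelf Faster R-CNN (RPN + Fast R-CNN head) from torchvision.
model = fasterrcnn_resnet50_fpn(weights="DEFAULT")
model.eval()

image = torch.rand(3, 512, 512)          # stand-in for an aerial image tile
with torch.no_grad():
    out = model([image])[0]              # dict with boxes, labels, and scores
print(out["boxes"].shape, out["scores"][:5])
```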
5. Self-Attention Methods
In remote sensing, approaches such as Recurrent Neural Networks (RNNs) face challenges related to capturing complex contextual dependencies when analyzing longer sequences of images. RNNs are well-suited for sequential data analysis, yet they encounter difficulties in effectively capturing the nuanced relationships between distant elements within extended sequences. This limitation can lead to a loss of important contextual information and hinder their performance on tasks involving long-range dependencies. To overcome this limitation, attention mechanisms have been designed to allow access to all elements in a sequence at each time step, facilitating a comprehensive understanding of dependencies and improving the handling of longer sequences.
The transformer architecture [108], originally developed for natural language processing, has played a key role in advancing attention mechanisms by introducing self-attention as a standalone mechanism. The model involves transforming feature maps into sequences of embeddings, which capture essential information from the input data. This capability is particularly valuable in modeling spatial and spectral dependencies in remote sensing imagery. By incorporating attention mechanisms, transformers can effectively learn and leverage the contextual and spatial relationships present in remote sensing data, making them highly suited for complex and high-dimensional data analysis [109].
The general formula for attention is
SelfAttention(X) = softmax(QKᵀ/√d_k)V
where X is the input; Q = XW_Q is the query matrix obtained by linearly transforming the input embeddings; K = XW_K is the key matrix obtained by linearly transforming the input embeddings; V = XW_V is the value matrix obtained by linearly transforming the input embeddings; d_k is the dimension of the key and query vectors; and W_Q, W_K, and W_V are learnable weight matrices for the linear transformations.
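A direct NumPy transcription of this formula is shown below; the random “token” embeddings stand in for spectral-band or image-patch embeddings.

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """SelfAttention(X) = softmax(Q K^T / sqrt(d_k)) V with Q=XWq, K=XWk, V=XWv."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                   # pairwise compatibility
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)    # row-wise softmax
    return weights @ V

rng = np.random.default_rng(0)
X = rng.normal(size=(5, 8))     # 5 tokens (e.g., band embeddings), dimension 8
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
print(self_attention(X, Wq, Wk, Wv).shape)  # (5, 8)
```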
BERT (Bidirectional Encoder Representations from Transformers) is an example of a transformer-based model that has shown remarkable success in language representation learning; it captures bidirectional contextual information by considering both the left and right context in all layers [110]. When applying BERT to remote sensing data, a specific approach can be followed, as described in [111] for hyperspectral imagery. The hyperspectral images (HSIs) are flattened and directly inputted into the BERT model for feature extraction, allowing the model to learn global dependencies among spectral bands. The addition of a multi-head self-attention (MHSA) mechanism accommodates diverse pixel relationships regardless of spatial distance, enabling the model to effectively capture long-range dependencies and complex relationships within the hyperspectral data.
6. Long Short-Term Memory (LSTM)
Long Short-Term Memory (LSTM) [112] is a type of recurrent neural network (RNN) commonly used for sequence modeling and time series analysis [113]. The LSTM design aims to address the vanishing gradient problem in traditional RNNs, which can make it challenging to capture long-term dependencies in sequences [114]. LSTMs receive an input sequence, such as a sequence of sensor readings or any other sequential data, with each element in the sequence representing a feature vector. At each time step, the LSTM network activates a series of gates: the input gate, forget gate, and output gate, which control the level of information allowed to enter, exit, or be retained, using memory cell states and hidden states. The input gate takes the current input and the previous hidden state as inputs, and a sigmoid activation function produces a value between 0 and 1 for each element in the feature vector. A selection process is then applied, where values near 1 retain information in the cell and values near 0 eliminate it. A similar process occurs in the forget gate, which decides which elements of the memory cell should be erased or forgotten. The memory cell is then updated based on the outputs of the input and forget gates, allowing the LSTM to retain important information and discard irrelevant or redundant information. The output gate takes the current input and the hidden state from the previous time step and, similarly, determines which elements of the cell should be outputted. The hidden state is updated based on the output from the output gate and the updated memory cell, and the LSTM network can output a prediction based on the updated hidden state. This prediction can be used for tasks such as sequence classification, sequence generation, or time series forecasting.
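A minimal PyTorch sketch of this idea for remote sensing time series follows; the per-pixel “monthly observations over six bands” input is a hypothetical setup, and the model is untrained.

```python
import torch
import torch.nn as nn

class SequenceClassifier(nn.Module):
    """LSTM over a per-pixel time series (e.g., monthly NDVI), then a class head."""
    def __init__(self, n_features: int, n_classes: int, hidden: int = 64):
        super().__init__()
        self.lstm = nn.LSTM(n_features, hidden, batch_first=True)
        self.head = nn.Linear(hidden, n_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        _, (h_n, _) = self.lstm(x)     # h_n: final hidden state per layer
        return self.head(h_n[-1])      # classify from the last time step's state

model = SequenceClassifier(n_features=6, n_classes=4)
series = torch.randn(32, 12, 6)        # batch of 32 pixels x 12 months x 6 bands
print(model(series).shape)             # torch.Size([32, 4])
```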

2.2.3. Other AI Methods in Remote Sensing

There is a growing interest in utilizing generative adversarial networks, GANs [115], in remote sensing applications [116,117]. GANs are neural networks excelling in handling complex, high-dimensional data, even with limited or no annotated training data [118].
GANs consist of two networks, a generator and a discriminator, trained in competition. The generator produces fake images (forgeries) using random noise, which the discriminator evaluates alongside real images (Figure 9). Both networks train simultaneously and compete against each other. The generator learns from the discriminator’s feedback, incorporating synthetic and real images through backpropagation, leveraging the discriminator’s error signal. This iterative cycle enhances the generator’s ability to produce higher-quality, more realistic images. The generator becomes proficient at deceiving the discriminator by refining the forgeries through successive iterations and feeding them back to the discriminator, completing the GAN training process [119].
Adversarial training is as follows:
min_G max_D  E_x~Pdata[log D(x)] + E_z~Pz[log(1 − D(G(z)))]
where G is the generator network, D is the discriminator network, z is random noise, x is a real sample, Pdata is true data distribution, and Pz is the prior distribution of the random noise vector. The generator and discriminator compete to outperform each other in a min-max game.
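As an illustration of this min-max game, the toy PyTorch sketch below computes one discriminator loss and one generator loss under the objective above; the small fully connected G and D operate on flat vectors as stand-ins for image-scale networks, and optimizer steps are omitted for brevity.

```python
import torch
import torch.nn as nn

# Toy generator G and discriminator D over flat vectors (stand-ins for images).
G = nn.Sequential(nn.Linear(16, 64), nn.ReLU(), nn.Linear(64, 32))
D = nn.Sequential(nn.Linear(32, 64), nn.ReLU(), nn.Linear(64, 1), nn.Sigmoid())

bce = nn.BCELoss()
real = torch.randn(8, 32)              # batch of "real" samples
z = torch.randn(8, 16)                 # random noise vectors
ones, zeros = torch.ones(8, 1), torch.zeros(8, 1)

# Discriminator step: maximize log D(x) + log(1 - D(G(z))).
d_loss = bce(D(real), ones) + bce(D(G(z).detach()), zeros)

# Generator step: minimize log(1 - D(G(z))); in practice, maximize log D(G(z)).
g_loss = bce(D(G(z)), ones)
print(float(d_loss), float(g_loss))
```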
GANs have various applications in remote sensing, including image-to-image translation tasks like dehazing and removal of thin clouds. For this purpose, the CycleGAN [79] and its variants can be used to accomplish cloud-removal tasks [120]. CycleGAN can be trained on datasets containing cloudy and cloud-free image pairs, with the goal of learning the mapping between the two sets of images. With the trained CycleGAN, clouds can be removed from new sets of images. CycleGAN consists of two generators and two discriminators, with each generator handling the forward or backward translation between the image domains, while each discriminator distinguishes between real and synthetic images. During training, the generators aim to maximize the probability of the discriminators making mistakes, while the discriminators strive to correctly separate real from generated images. Challenges in applying this method to cloud removal include a high percentage of cloud cover in the image or complex cloud shapes not seen in the training datasets.
To enhance the resolution of low-resolution satellite images, the SRGAN (Super-Resolution Generative Adversarial Network) model can be utilized [121]. Built on a ResNet, the generator learns to map low-resolution images to high-resolution counterparts. The discriminator’s task is to differentiate between generated and real high-resolution images. During training, the generator seeks to deceive the discriminator, while the discriminator aims to classify the images correctly [7].
For image-to-image translation and other tasks such as image sharpening and classification, the Pix2Pix GAN model can also be used [122]. Other GAN-based algorithms serve related purposes: HRPGAN [123] for super-resolution, MARTA GANs [124] for data augmentation, PSGAN for pan-sharpening [125], and ES-CCGAN [126] and CLOUD-GAN [127], based on CycleGAN, for dehazing and cloud removal [118].
Deep Reinforcement Learning (DRL) offers advantages in remote sensing, such as learning from unlabeled data and improving decision-making processes [128]. DRL combines reinforcement learning (RL) techniques with deep neural networks to create a powerful framework for solving complex problems. RL involves an agent interacting with an environment to maximize cumulative rewards, while deep neural networks approximate optimal policies. The agent observes the environment’s state, takes an action, and receives a reward based on the action. The agent updates its policy using the reward signal and transitions to a new state, aiming to maximize cumulative reward over time. Deep neural networks serve as function approximators, capturing complex relationships between states and actions and generalizing to new situations [129].
An example of DRL in remote sensing is unsupervised band selection in hyperspectral image classification [130], specifically using a deep Q-network (DQN) [131]. The problem is formulated as a Markov decision process (MDP) [132], in which the currently selected bands represent the state and adding the next band is the action. The DQN learns a band-selection policy by maximizing a reward signal based on the classification accuracy achieved with the selected bands. Training involves normalized spectral signatures and reward signals, updating DQN weights with batches of these data. The learned policy is evaluated on unseen datasets to assess generalization and accuracy, demonstrating its superiority over other methods. Adjustments to DQN parameters, such as layer count, neuron count, and learning rate, can further enhance accuracy and consistency. This model is suitable for remote sensing image processing applications that analyze large amounts of data, overcoming challenges related to limited labeled samples and redundant spectral information.
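To make the formulation tangible, here is a minimal, hypothetical sketch of the state/action setup: the state is a binary band mask, a Q-network scores candidate bands, and a greedy loop selects bands. The reward-driven training of the Q-network (from classification accuracy) is omitted, and the band count and layer sizes are arbitrary.

```python
import torch
import torch.nn as nn

N_BANDS = 100   # hypothetical number of hyperspectral bands

# Q-network: the state is a binary mask of currently selected bands;
# each action is the index of the next band to add (the MDP formulation).
q_net = nn.Sequential(
    nn.Linear(N_BANDS, 128), nn.ReLU(),
    nn.Linear(128, N_BANDS),             # one Q-value per candidate band
)

state = torch.zeros(1, N_BANDS)          # start with no bands selected
with torch.no_grad():
    for _ in range(10):                  # greedily pick 10 bands with the policy
        q = q_net(state)
        q[state.bool()] = -float("inf")  # never re-select a chosen band
        action = int(q.argmax())
        state[0, action] = 1.0
print("selected bands:", state.nonzero()[:, 1].tolist())
```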
Each technique offers unique benefits and is suited for specific tasks in remote sensing, enabling researchers and practitioners to choose the most appropriate approach based on their data and objectives. Table 2 provides an overview of the key AI techniques in remote sensing, highlighting their advantages, limitations, and applications.

3. Current Practical Applications of AI in Remote Sensing

3.1. Land Cover Mapping

AI techniques have been widely used in mapping tasks for assigning labels to individual image pixels and allowing for categorization based on different spectral and spatial features, providing valuable information about the distribution and characteristics of land cover types in a specific area [133,134,135] (Figure 10). As a practical example, the Environmental Systems Research Institute (Esri) has recently released a high-resolution (10 m) annual global land cover map (2017–2022), created using a fully convolutional neural network with a U-Net architecture, developed with Impact Observatory [136]. To train this model, a massive training dataset of over five billion labeled image pixels was utilized, generously provided by the National Geographic Society. The map-making process involved utilizing the comprehensive coverage and high spatial resolution of the European Space Agency’s (ESA) Sentinel-2 satellite imagery.
Creating the map entailed running the AI model on an extensive collection of approximately 400,000 Earth observations for Land Use/Land Cover (LULC) classification [137], comprising around 500 terabytes of cached imagery. The model incorporated six Sentinel-2 surface reflectance bands and generated ten land cover classes, including water, trees, grass, crops, and built areas. To achieve a comprehensive depiction of land cover, the final map was created by compositing the outputs of the model applied to multiple dates of imagery throughout the year. The computation required approximately 1.2 million core hours to handle the immense computational load, with Microsoft Azure Batch expediting the processing by running up to 6400 cores simultaneously.

3.2. Earth Surface Object Detection

SpaceKnow’s GEMSTONE (Global Economy Monitoring System Delivering Transparency and Online Expertise) project aims to develop advanced ML algorithms that utilize satellite data for monitoring global economic activity [138]. These algorithms combine spectral unmixing and deep neural networks (DNNs) to detect [139] raw materials and manufactured structures, enabling comprehensive monitoring. Spectral unmixing involves analyzing the spectral properties of satellite imagery to identify and differentiate specific materials of interest, whereas DNNs classify and distinguish these detected materials, ensuring accurate and high-quality results. These algorithms are deployed in carefully selected locations, and the analysis outputs are aggregated into specific economic indices. Users can access this information via a user-friendly dashboard or an API (Application Programming Interface), allowing seamless integration into their organizations’ workflows. The effectiveness of these algorithms has been demonstrated via case studies such as the Nagoya Port Analysis, in which various elements, such as oil tanks, were detected and tracked [140] over time, providing valuable insights into the port’s activity. A road algorithm successfully monitored the expansion of the road network in Zayed City, Abu Dhabi, showcasing its potential for large-scale monitoring of urbanization and road development.

3.3. Multisource Data Fusion and Integration

Integrating information from various remote sensing techniques can provide a comprehensive understanding of objects or phenomena. This process involves collecting data from diverse sources, ensuring accurate data registration and co-registration, integrating correlated measurements, and estimating desired object attributes or identities [141]. For instance, the European Space Agency (ESA) utilizes AI and satellite data to survey water pipe networks, detect leaks, and identify new water sources. Access to clean drinking water and reducing water pipe leaks are significant concerns in regions dealing with water scarcity, both in developing and developed nations. To address this, ESA has developed a service catering to the needs of governments, water utilities, charities, non-profits, and NGOs operating in these areas. The service merges neural networks with multispectral and synthetic aperture radar satellite data, particularly ESA Sentinel-1 and Sentinel-2 data. Neural networks recognize water’s spectral and backscatter signatures, indicating moisture. This enables comprehensive surveys to locate underground water sources and identify pipe network leaks. As a result, a detailed map of Earth’s sub-surface water has been created, with a spatial resolution of 10 square meters [142]. This map encompasses over 1.5 trillion satellite tiles [142] and stores vast amounts of data. ESA has also launched a free underground water mapping service called SpaceWater.AI, with the support of Esri, Nvidia, and Amazon Web Services. Pilot users, such as the United Nations High Commissioner for Refugees (UNHCR) and WaterAid, are already benefiting from this service. The accuracy of identifying underground water sources reaches up to 98% [142], although it may vary based on geographic and environmental conditions.
Additionally, ESA has developed the Total Ecosystem Management of the InterTidal Habitat (TEMITH) project [143], led by the University of Southampton, to monitor the Solent’s intertidal habitat on England’s south coast using Earth Observation (EO) data. This project focuses on two pressures: algal mats and sediment disturbance. Gathering and preparing the data involve multiple steps. Satellite data from various sources, together with in situ datasets, are used to select collection dates and locations. For feature detection, two sensors are used: Copernicus Sentinel-2 (10 m resolution) and the high-resolution MAXAR constellation (0.31 to 0.61 m). Imagery is captured within a 4-week timeframe, extending to 8 weeks if needed, preferably during low tide and cloud-free conditions. Sediment disturbance detection uses mapped polygon datasets for model training, supplemented by drone imagery, aerial photography, and high-resolution satellite imagery for additional labeling. The labeling process considers scarring morphology and context, selecting high-confidence polygons for model training. Similarly, mapped polygon datasets for algal mats, seagrass, and salt marsh detection come from diverse sources, including the Environment Agency, the Hampshire and Isle of Wight Wildlife Trust, Natural England, and the Channel Coastal Observatory. Dataset selection is based on suitability and compatibility with available satellite imagery, aiming for a match within two weeks of data collection. Prioritizing Sentinel-2 imagery, known for cloud-free, low-tide acquisitions, enhances feature visibility. The project trains three ResU-Net models and six U-Net CNN models. These models identify indicators like nutrient enrichment, seagrass presence, and salt marsh presence, targeting sediment disturbance and algal mats.

3.4. Three-Dimensional and Invisible Object Extraction

Remote sensing data are the primary source for extracting valuable information about the 3D structures and spectral characteristics of objects [144]. Two key types of data used in remote sensing are LiDAR data and hyperspectral data. LiDAR data provide detailed information on object heights and shapes within a surveyed area, whereas hyperspectral data capture the electromagnetic spectrum reflected or emitted by objects, allowing for the identification and analysis of different materials based on their unique spectral characteristics. However, both data types face challenges, such as spectral redundancy, low spatial resolution for hyperspectral data, and the presence of high- and low-frequency information in LiDAR data.
Startups like Enview have introduced a Web-based AI service specifically designed for LiDAR data analysis. By utilizing CNNs, Enview enables the automated identification of physical objects within 3D point clouds, including power lines, pipelines, buildings, trees, and vehicles. This technology is particularly beneficial for companies in the electricity and natural gas distribution sector, streamlining object identification through the segmentation and classification of LiDAR data. Enview’s AI technology has already delivered significant cost savings by automating power line inspection [145].
In the realm of HSI, Metaspectral, a Vancouver-based company, has developed an AI platform that combines HSI and edge computing to revolutionize various industries. The platform incorporates data compression techniques and deep neural networks and supports various neural architectures. By reducing data streams without compromising information, the platform enables real time, pixel-by-pixel analysis of hyperspectral data. Metaspectral’s AI platform finds applications in space exploration, recycling, and agriculture. The Canadian Space Agency utilizes this technology to measure greenhouse gas levels on Earth. In recycling, the system accurately classifies plastics by analyzing their chemical structures, enhancing the recycling process. In agriculture, the early detection of diseases is made possible by identifying specific spectral signatures associated with plant diseases, allowing for timely interventions. Additionally, the platform aids in climate change mitigation efforts by detecting and analyzing wildfire risks through hyperspectral analysis, facilitating proactive measures like controlled burns [146,147].

4. Existing Challenges

This section discusses the challenges and limitations of AI in remote sensing [14], along with potential solutions and advancements for overcoming them.

4.1. Data Availability

AI training data are often sourced from satellites, aerial sensors, or ground-based instruments. However, these valuable data are not always readily accessible to researchers, scientists, or organizations. Some datasets may be restricted due to proprietary rights or controlled by government agencies, limiting their availability for broader use. Additionally, certain remote sensing datasets have limited temporal coverage, making it challenging to assess interannual and decadal variability [148,149]. Consequently, the limited access to remote sensing data can impede the development and application of AI in this field.
To effectively train AI models, a significant amount of labeled data is required to teach algorithms to recognize and interpret specific features and patterns in remote sensing data. However, creating labeled datasets can be a time-consuming task that demands expertise [150]. The availability of accurately labeled data is essential to achieve reliable results when training AI models. Real-time or frequent updates of remote sensing data are crucial for monitoring and analyzing dynamic environmental conditions and changes [151]. However, the availability of such timely data can be limited, especially in certain regions or for specific types of data. This limitation can undermine the effectiveness of AI applications in remote sensing, as models trained on outdated or infrequent data may fail to represent current conditions accurately. Overcoming the challenge of data availability in remote sensing requires collaborative efforts to improve data sharing and access [152]. Initiatives that promote open data policies, data-sharing platforms, and partnerships between organizations can facilitate greater availability of remote sensing data. Collaborating with space agencies, government organizations, and private entities can also expand access to the necessary data for training and implementing AI models in remote sensing applications [153].

4.2. Training Optimization

Achieving optimal performance of AI models in remote sensing demands careful consideration and a solid grasp of mathematics. Selecting suitable loss functions is important in guiding models toward improved accuracy. For instance, cross-entropy loss is commonly employed for land cover classification, whereas mean squared error (MSE) loss is preferable for regression tasks [154]. Imbalanced datasets can pose a significant challenge during model optimization when certain classes are rare or underrepresented. Under these conditions, the model may exhibit bias towards the majority class, resulting in poor performance for the minority classes [155]. Optimizing complex models in remote sensing comes with its own set of challenges. Deep learning models like CNNs or RNNs possess numerous parameters and demand substantial computational resources for training [156]. Algorithms such as stochastic gradient descent (SGD) and its variants, such as Adam or RMSprop, are commonly employed for parameter updates [157]. Fine-tuning the learning rate, selecting appropriate batch sizes, and determining convergence criteria are critical steps in optimizing complex models. Additionally, hardware limitations can introduce challenges in training time and computational efficiency.
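The sketch below illustrates these choices in PyTorch with toy tensors: cross-entropy for a classification task, per-class weights for imbalance, mean squared error for regression, and Adam for parameter updates; all shapes and values are placeholders.

```python
import torch
import torch.nn as nn

logits = torch.randn(8, 10, requires_grad=True)   # scores for 10 land cover classes
targets = torch.randint(0, 10, (8,))

# Classification (e.g., land cover): cross-entropy over class logits.
ce_loss = nn.CrossEntropyLoss()(logits, targets)

# Imbalanced classes: per-class weights rebalance the loss.
weights = torch.ones(10)
weights[3] = 5.0                                  # upweight a hypothetical rare class
weighted_ce = nn.CrossEntropyLoss(weight=weights)(logits, targets)

# Regression (e.g., a continuous geophysical variable): mean squared error.
preds, truth = torch.randn(8, 1, requires_grad=True), torch.randn(8, 1)
mse_loss = nn.MSELoss()(preds, truth)

# Adam applies adaptive per-parameter updates; the learning rate is a key knob.
optimizer = torch.optim.Adam([logits, preds], lr=1e-3)
(ce_loss + mse_loss).backward()
optimizer.step()
```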

4.3. Data Quality

The accuracy, reliability, and completeness of training data directly influence the model’s performance and generalization capability [158]. Obtaining accurate and reliable ground truth labels can be difficult due to limited ground-based observations, subjective interpretations, or human errors [159]. For instance, mislabeling land cover classes or confusion between similar classes can greatly affect the training and performance of models in land cover classification. Different sources, sensors, or acquisition times result in variations in spatial resolution, spectral characteristics, or temporal patterns [160]. These inconsistencies can introduce biases and complicate the training process. In time series analysis, inconsistent temporal sampling intervals or missing observations can hinder the model’s ability to capture temporal patterns accurately [161].
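As a small illustration of the temporal-sampling issue, the pandas sketch below flags irregular acquisition intervals and resamples a series onto a regular grid before it is fed to a model; the dates, NDVI values, and the nominal 8-day revisit cycle are invented for illustration.

```python
import pandas as pd
import numpy as np

# Hypothetical NDVI time series with an irregular revisit interval.
dates = pd.to_datetime(["2022-01-01", "2022-01-09", "2022-01-17",
                        "2022-02-10", "2022-02-18"])
ndvi = pd.Series([0.31, 0.33, np.nan, 0.42, 0.45], index=dates)

# Flag gaps larger than the nominal 8-day revisit cycle.
gaps = ndvi.index.to_series().diff() > pd.Timedelta(days=8)
print("Irregular intervals:\n", gaps[gaps])

# Resample onto a regular 8-day grid and interpolate missing values,
# so downstream models see consistent temporal sampling.
regular = ndvi.resample("8D").mean().interpolate(method="time")
print(regular)
```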

4.4. Uncertainty

Uncertainty in remote sensing data arises from various sources, including atmospheric conditions (e.g., clouds, haze, or aerosols), sensor limitations, data acquisition techniques, and natural variability, any of which can yield incomplete or distorted observations [162]. Sensor characteristics and calibration also contribute to uncertainty [163]. AI models trained on static datasets may need adjustments to adapt to these dynamic variations and may not generalize well to different locations or periods. The temporal and spatial variability of natural phenomena further contributes to uncertainty in remote sensing-based AI models [164].
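One widely used way to expose predictive uncertainty, offered here as an illustration rather than a method endorsed by the cited works, is Monte Carlo dropout: dropout is kept active at inference, and the spread of repeated stochastic predictions serves as an uncertainty estimate. A minimal PyTorch sketch, with a toy architecture and input sizes chosen purely for illustration:

```python
import torch
import torch.nn as nn

# Toy classifier with dropout; architecture and sizes are illustrative only.
model = nn.Sequential(nn.Linear(10, 64), nn.ReLU(),
                      nn.Dropout(p=0.3), nn.Linear(64, 4))

def mc_dropout_predict(model, x, n_samples=50):
    """Keep dropout active at inference and average stochastic passes."""
    model.train()  # train mode enables the dropout layers
    with torch.no_grad():
        probs = torch.stack([torch.softmax(model(x), dim=-1)
                             for _ in range(n_samples)])
    return probs.mean(0), probs.std(0)  # mean prediction, per-class spread

x = torch.randn(1, 10)  # one pixel's spectral feature vector (synthetic)
mean, std = mc_dropout_predict(model, x)
print("prediction:", mean, "\nuncertainty:", std)
```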

4.5. Model Interpretability

Interpretability ensures the trustworthiness and validation of AI model outputs [165] and becomes especially important in sensitive applications like environmental monitoring [166] or disaster response, where transparency and accountability are crucial. However, AI models, particularly complex deep learning models, often function as black boxes, making it difficult to understand or explain their internal mechanisms and decision-making processes [167]. Efforts are being made to address the interpretability of AI models in remote sensing [168]. Techniques such as model explainability, feature importance analysis, or visualization methods can help shed light on the reasoning behind the model’s predictions [169].
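As one concrete example of feature importance analysis, the scikit-learn sketch below applies permutation importance to a random forest trained on synthetic "spectral band" features; the data and band semantics are fabricated for illustration only. Permuting one band at a time and measuring the drop in accuracy reveals how much the model relies on it.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance

# Synthetic stand-in for per-pixel spectral features and land cover labels.
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 6))                   # e.g., six spectral bands
y = (X[:, 3] + 0.5 * X[:, 1] > 0).astype(int)   # label driven by bands 4 and 2

clf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)

# Shuffling a band breaks its relationship to the labels; the resulting
# accuracy drop is that band's importance score.
result = permutation_importance(clf, X, y, n_repeats=10, random_state=0)
for band, score in enumerate(result.importances_mean, start=1):
    print(f"band {band}: importance {score:.3f}")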

4.6. Diversity

Evaluating and validating AI models on diverse and independent datasets are critical steps to assess their generalization ability. To ensure consistent and reliable performance in real-world applications, it is essential to test the models across different geographic regions, seasons, sensor types, and environmental conditions. However, one of the main challenges lies in the availability of diverse and representative training data [6]. Currently, various techniques are employed to address the data availability challenges. Data augmentation generates additional training examples by applying transformations, such as rotation, scaling, or noise addition, to the existing data [170]. This technique exposes the model to broader variations, enhancing its ability to generalize to unseen data. Another common approach is transfer learning, where pre-trained models trained on large-scale datasets like ImageNet serve as a starting point [171]. By fine-tuning these pre-trained models on a smaller remote sensing dataset, the models can leverage their acquired knowledge and adapt it to the specific task. Ensemble methods also contribute to diversity and generalization [172] by combining multiple individual models, each trained with different algorithms or variations of the training data.
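A minimal PyTorch/torchvision sketch of the first two strategies, with placeholder class counts and hyperparameters: augmentation transforms expand the training distribution, and an ImageNet-pretrained ResNet-18 backbone is frozen while a new classification head is fine-tuned on the remote sensing task.

```python
import torch
import torch.nn as nn
from torchvision import models, transforms

# Data augmentation: rotation, scaling, and noise expand the training set.
augment = transforms.Compose([
    transforms.RandomRotation(degrees=90),
    transforms.RandomResizedCrop(224, scale=(0.8, 1.0)),
    transforms.ToTensor(),
    transforms.Lambda(lambda t: t + 0.01 * torch.randn_like(t)),  # noise
])

# Transfer learning: start from ImageNet weights, replace the classifier
# head, and fine-tune on a (smaller) remote sensing dataset.
model = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)
for p in model.parameters():
    p.requires_grad = False                 # freeze the pre-trained backbone
model.fc = nn.Linear(model.fc.in_features, 10)  # 10 classes is a placeholder

optimizer = torch.optim.Adam(model.fc.parameters(), lr=1e-4)
```

An ensemble, the third strategy, would simply train several such models on different augmentations or data subsets and average their predictions.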
While progress has been made in these areas, there are still unresolved aspects that researchers are actively working on. Ensuring that the training dataset is representative of the target population or the real-world distribution of data presents a significant challenge, and collecting a representative dataset that covers all possible variations, particularly in remote sensing, where data can be scarce or costly to obtain, is demanding [173]. Developing effective techniques to adapt pre-trained models to remote sensing-specific features and variations remains an ongoing research area.
Remote sensing applications often involve detecting and analyzing rare or complex events [174], such as natural disasters or occurrences of rare species. AI models trained on standard datasets may never have encountered such events during training, posing challenges in generalizing to these scenarios. Research efforts are focused on developing techniques to handle these rare events and improve the generalization capabilities of AI models. For example, IBM and NASA have collaboratively introduced the largest geospatial AI foundation model, named watsonx.ai, in partnership with Hugging Face. This model utilizes NASA’s satellite data, specifically Harmonized Landsat Sentinel-2 (HLS) data, to revolutionize Earth observation and advance climate science. This joint initiative aims to democratize AI access, particularly in addressing evolving environmental conditions. The geospatial model is accessible on Hugging Face’s open-source platform, underscoring the partners’ commitment to open AI and science. It stands out as the first open-source AI foundation model developed in collaboration with NASA. This partnership emphasizes the potential of open-source technologies in deepening our understanding of Earth’s climate and environment. The watsonx.ai model excels in tasks such as flood and burn scar mapping, demonstrating a 15 percent improvement over existing techniques. IBM’s expertise in AI and NASA’s Earth-satellite data contribute to the model’s accuracy and effectiveness. The collaboration resonates with NASA’s Open Source Science Initiative and IBM’s broader efforts in AI advancement. Moreover, this geospatial model holds potential beyond its current applications: it could be adapted for tasks such as deforestation tracking, crop yield prediction, and greenhouse gas monitoring. IBM’s Environmental Intelligence Suite will soon feature a commercial version of the model [175]. Another common issue is the perpetuation of biases and inequities when AI models are trained on biased or unrepresentative data [56,176].

4.7. Integrity and Security

Biases or inaccuracies in the training data can result in biased or unreliable AI predictions, which can have consequences in real-world applications [177]. To maintain integrity, it is essential to prioritize transparency, fairness, and accountability throughout the AI model development and training processes [178]. By adhering to these principles, the integrity of the AI system can be upheld, instilling trust in its outcomes and promoting ethical practices. As discussed above, maintaining integrity in remote sensing involves multiple aspects, including data quality, provenance, and the prevention of tampering or manipulation [179]. Protecting data integrity entails safeguarding the data from unauthorized modifications, tampering, or cyberattacks. Remote sensing data can be vulnerable to malicious actions, such as data breaches or unauthorized access [180]. One concern is the potential compromise of personal privacy through detailed imagery capturing identifiable features or activities. To address this, robust encryption protocols [181] and secure communication channels should be implemented while transmitting remote sensing data [182]. Additionally, secure storage systems, including servers or cloud platforms equipped with access controls and encryption mechanisms, are essential for protecting the data from unauthorized access. Privacy regulations, such as the GDPR, impose strict data handling, storage, and sharing requirements [183].
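As a minimal sketch of one tamper-evidence layer, the Python standard library snippet below attaches HMAC-SHA256 tags to data granules so a consumer can verify they were not modified after signing. The key handling is deliberately simplified; a production system would manage keys in a vault and combine such tags with encryption in transit (e.g., TLS) and access controls.

```python
import hashlib
import hmac

SECRET_KEY = b"replace-with-a-managed-key"  # placeholder; use a key vault

def sign_granule(data: bytes) -> str:
    """Compute an HMAC-SHA256 tag so tampering is detectable downstream."""
    return hmac.new(SECRET_KEY, data, hashlib.sha256).hexdigest()

def verify_granule(data: bytes, tag: str) -> bool:
    """Constant-time comparison guards against timing attacks."""
    return hmac.compare_digest(sign_granule(data), tag)

granule = b"...raw remote sensing bytes..."
tag = sign_granule(granule)
assert verify_granule(granule, tag)
assert not verify_granule(granule + b"tampered", tag)
```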

5. Ongoing and Future Practical AI Applications in Remote Sensing

This section explores ongoing and potential ideas that can advance practical AI applications. Work on some of these ideas is already in progress, and others may inspire future applications with transformative impacts on environmental management.

5.1. Wildfire Detection and Management

The application of AI in wildfire management is increasing steadily [184], using advanced algorithms and remote sensing technologies to enable early detection and rapid response. AI systems analyze data from satellites, drones [185], and sensors to track wildfires in real time and predict fire behavior accurately by considering historical fire data, weather patterns, and topographical information. This data-driven approach enhances firefighting efficiency and reduces the impact of wildfires on communities and ecosystems.
AI’s benefit lies in its capacity to handle large-scale data analysis [186] and pattern recognition, identifying hidden correlations in historical fire data, weather, and other relevant factors. AI-powered drones equipped with thermal imaging cameras can swiftly detect fires, leading to quicker response times and reduced costs. The Prometheus system developed by ESA uses AI and satellite data to predict wildfire behavior. Successful AI integration in wildfire management relies on a network of sensors collecting real-time data on fire occurrence, weather, and environment, fed into AI algorithms for analysis. Advanced ML techniques, like deep learning and neural networks, train AI models on vast datasets to enhance accuracy. To harness AI’s potential, investments in infrastructure, communication networks, and technology are necessary. Though initial costs may be significant, benefits include reduced damages, improved response times, and enhanced firefighter safety. As AI systems become more sophisticated, their seamless integration into wildfire management practices will drive automation and efficiency.
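A hedged sketch of the data-driven fire-risk idea using scikit-learn: the weather and terrain features, the risk rule, and the synthetic labels below are invented for illustration and stand in for real historical fire records.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Synthetic stand-in for historical fire records: each row holds weather
# and terrain features; the label marks whether a fire ignited.
rng = np.random.default_rng(42)
n = 2000
X = np.column_stack([
    rng.uniform(0, 45, n),    # air temperature (deg C)
    rng.uniform(5, 100, n),   # relative humidity (%)
    rng.uniform(0, 60, n),    # wind speed (km/h)
    rng.uniform(0, 40, n),    # slope (degrees)
])
risk = (X[:, 0] / 45 + (100 - X[:, 1]) / 100 + X[:, 2] / 60) / 3
y = (risk + rng.normal(0, 0.1, n) > 0.6).astype(int)

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
clf = RandomForestClassifier(n_estimators=200, random_state=0)
clf.fit(X_train, y_train)
print("holdout accuracy:", clf.score(X_test, y_test))
```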

5.2. Illegal Logging and Deforestation Monitoring

By analyzing satellite and drone imagery, AI can detect changes in forest cover, logging patterns, and illegal encroachments. This information can be used to track deforestation and identify areas that need protection. Combined with satellite imagery, AI can detect changes in forest cover and flag illegal logging in near real time. The implementation involves effectively utilizing technologies like the Google Earth Engine (GEE) [187] and employing advanced AI algorithms. Satellite imagery data on changes in forest cover are collected from different remote sensing sources and then subjected to data cleaning and organization during the pre-processing stage of an AI model. The algorithms are then applied to analyze the data and identify patterns of illegal logging activity in a particular geographical area, which supports decision making and ultimately leads to concrete actions against deforestation and holding illegal loggers accountable. As AI technology advances, we anticipate developing even more innovative and efficient applications for protecting our forests [188,189]. A notable example of this approach is Global Forest Watch (GFW), which utilizes satellite imagery and advanced algorithms to monitor deforestation globally, alerting governments, NGOs, and stakeholders.
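A minimal sketch of such a workflow using the Google Earth Engine Python API: the Hansen Global Forest Change asset ID/version and the area of interest below are assumptions that should be checked against the current Earth Engine catalog.

```python
import ee

ee.Initialize()  # requires an authenticated Earth Engine account

# Hansen Global Forest Change (asset ID/version is an assumption; check
# the Earth Engine catalog for the current release).
gfc = ee.Image("UMD/hansen/global_forest_change_2022_v1_10")

# Area of interest: an illustrative rectangle in the Amazon basin.
aoi = ee.Geometry.Rectangle([-62.5, -9.5, -62.0, -9.0])

# 'lossyear' encodes the year of forest loss (1 = 2001, 22 = 2022).
recent_loss = gfc.select("lossyear").gte(20)  # loss during 2020-2022

# Sum loss pixels (scaled by pixel area) to estimate hectares lost.
loss_ha = (recent_loss.multiply(ee.Image.pixelArea()).divide(1e4)
           .reduceRegion(reducer=ee.Reducer.sum(), geometry=aoi,
                         scale=30, maxPixels=1e9))
print(loss_ha.getInfo())
```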

5.3. Coastal and Marine Ecosystem Monitoring

To protect coastal and marine ecosystems, AI can detect changes in coral reefs [190], identify marine pollution, track marine species, and support the sustainable management of coastal resources (Figure 11). One noteworthy trend in marine research involves using image recognition algorithms to analyze photographs or videos of marine environments. These algorithms can identify organisms or objects of interest, making them valuable tools for monitoring changes in animal populations and pinpointing areas where human activities are causing ecological damage. ML algorithms can also analyze underwater sounds [191]. Understanding underwater soundscapes can be complex, but specific sounds can be recognized and distinguished from background noise with ML. This capability allows researchers and managers to monitor changes in ecosystem dynamics and gain valuable insights into the evolution of marine ecosystems [192]. In marine research [193], computer vision techniques can be used to analyze high-definition (HD) digital camera photo sequences captured by fixed underwater stations, Autonomous Underwater Vehicles (AUVs), and Remotely Operated Vehicles (ROVs) across various oceanic regions. This technology facilitates the identification of areas with potential fish activity in their natural habitat, providing details such as the number of fish, species composition, and abundance in different locations.

5.4. Biodiversity Conservation and Habitat Monitoring

Advanced image analysis techniques, such as object detection and classification, can offer valuable insights to identify and monitor habitats, track species populations, and assess ecological connectivity, thereby enhancing the accuracy and efficiency of biodiversity monitoring [194]. AI helps improve the conservation and sustainable use of biodiversity and ecosystems [195]. GEE, which integrates AI for geospatial data analysis, can be used to process large amounts of satellite imagery and other remote sensing data [187]. Imagine deploying AI-powered cameras that can automatically recognize and count species in remote areas and generate real-time data on population trends and distribution. This information becomes invaluable in guiding conservation efforts and assessing the progress of restoration projects. Another trend is the use of natural language processing (NLP) to analyze the extensive scientific literature, news articles, and social media posts [196] related to biodiversity and environmental issues. By extracting relevant information, identifying patterns, and detecting trends, NLP algorithms enable researchers and policymakers to stay updated on the latest developments in the field.

5.5. Airborne Disease Monitoring and Forecasting

AI and remote sensing promise a proactive, data-driven approach to public health in which outbreaks are detected early, responses are rapid, and interventions are targeted [197]. By monitoring various indicators, such as air quality [198], weather patterns, and population density, AI can identify potential hotspots and areas at risk. Remote sensing technologies equipped with AI-enabled sensors can provide real-time surveillance of disease-prone areas [199]. Drones, for example, can collect data on air quality [200], temperature, and humidity, whereas satellites can capture high-resolution imagery. AI models trained on historical data, combined with remote sensing inputs, can generate accurate disease forecasts. By analyzing factors such as environmental conditions, population movement, and social interactions, these models can predict the future spread of airborne diseases, informing public health agencies to prepare resources, implement preventive measures, and allocate healthcare facilities in advance, minimizing the impact of outbreaks. AI can also be used to detect and diagnose diseases early [201].
AI and remote sensing can aid in risk assessment by analyzing various factors that contribute to disease transmission, including air pollution levels, urbanization patterns, and human mobility. By understanding the risk factors associated with specific areas or populations, public health authorities can develop targeted strategies for prevention, allocate resources efficiently, and prioritize interventions where they are most needed. AI-powered systems can also play a role in raising public awareness and educating communities about airborne diseases. Through real-time data visualization, interactive maps, and user-friendly interfaces, individuals can access information about disease prevalence, preventive measures, and local resources.

5.6. Precision Forestry

The combination of AI, LiDAR [202], and hyperspectral imagery provides detailed information on forest structure, biomass, and species composition, promoting sustainable and efficient forestry management [203]. Advanced thermal imaging techniques detect subtle temperature changes in trees as early indicators of pest infestation or disease outbreaks, often before visible symptoms appear. Additionally, non-invasive acoustic sensors provide continuous monitoring and real-time insights into tree health and growth dynamics. By detecting anomalies such as wind-induced stress or structural weaknesses, these sensors assist forest managers in promptly dealing with potential issues [204].
Additionally, short-range remote sensing technology captures data that aid in visualizing various artifacts on tree trunks, providing valuable insights into their current and future health status [205]. For detecting tassels in UAV-acquired RGB imagery, the YOLOv5-tassel algorithm has shown significant potential in precision agriculture [206]. Incorporating AI algorithms significantly increases the probability of identifying these artifacts. This technological integration enables accurate measurement of tree characteristics and quality, whether the trees are standing or lying, facilitating an understanding of tree health and informed decision making in forestry management practices.
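For orientation, the sketch below shows the generic YOLOv5 inference workflow via torch.hub; the fine-tuned tassel weights path and the UAV tile filename are hypothetical, and the published YOLOv5-tassel checkpoint is not assumed to be available.

```python
import torch

# Load a YOLOv5 model via torch.hub; 'yolov5s' is the small generic
# checkpoint. A tassel detector would instead load fine-tuned weights
# (the path below is hypothetical).
model = torch.hub.load("ultralytics/yolov5", "yolov5s", pretrained=True)
# model = torch.hub.load("ultralytics/yolov5", "custom",
#                        path="yolov5_tassel_finetuned.pt")  # hypothetical

results = model("uav_ortho_tile.jpg")  # path to a UAV RGB tile (assumed)
detections = results.pandas().xyxy[0]  # one row per detected object
print(detections[["xmin", "ymin", "xmax", "ymax", "confidence", "name"]])
```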

5.7. Urban Heat Island Mitigation

By identifying heat patterns, vegetation cover [207], and surface materials, AI can help urban planners optimize green infrastructure, develop heat mitigation strategies, and improve urban liveability. By integrating AI with satellite remote sensing and urban sensor network data, an integrated framework can provide accurate, spatiotemporally granular predictions of the urban heat island (UHI) phenomenon [208]. This predictive capability is valuable for forecasting UHI (Figure 12) at specific times, facilitating the development of mitigation strategies, and formulating relevant policies to counteract its effects [209].
AI algorithms can analyze the various factors, including land use type, urban morphology, and anthropogenic heat emissions, that contribute to the formation of heat islands. Leveraging this knowledge, geospatial and AI-based models can predict the impacts of different urban design and mitigation strategies on local temperatures, enabling urban planners and decision makers to make informed choices and implement tailored strategies to combat urban heat islands based on each city’s unique characteristics [210].

5.8. Precision Water Management

Integrating weather patterns and soil conditions with AI systems can yield accurate irrigation recommendations, predict crop water stress, and facilitate resource allocation, enhancing water use efficiency and conservation. In water management applications, particularly in extracting water bodies from remote sensing images, neural network architectures can be employed for semantic segmentation [211,212,213]. Furthermore, AI algorithms offer promising opportunities to develop digital image classification methods, specifically for assessing water usage in irrigation. These methods utilize multi-temporal image data from remote sensing systems such as Landsat and Sentinel-2 to generate comprehensive crop maps encompassing various growing seasons. These emerging technologies enable cost-effective and accurate mapping of irrigated crops, facilitating effective water resource management [214]. Moreover, Adaptive Intelligent Dynamic Water Resource Planning (AIDWRP) could be employed to sustain urban water environments [215]. The utilization of Big Data and ML technologies also holds the potential to impact many facets of environmental and water management [216].
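Alongside the neural segmentation networks cited above, a classical index such as NDWI remains a useful baseline for water body extraction. The sketch below computes McFeeters’ NDWI from Sentinel-2 green and NIR bands with rasterio; the band file paths and the threshold are assumptions to tune per scene.

```python
import numpy as np
import rasterio

# Paths to Sentinel-2 green (B03) and NIR (B08) band files are assumptions;
# adjust to your own scene layout.
with rasterio.open("S2_B03_green.tif") as g, \
     rasterio.open("S2_B08_nir.tif") as n:
    green = g.read(1).astype("float32")
    nir = n.read(1).astype("float32")

# NDWI (McFeeters): (green - NIR) / (green + NIR); positive values
# generally indicate open water.
ndwi = (green - nir) / np.maximum(green + nir, 1e-6)
water_mask = ndwi > 0.0   # a simple global threshold; tune per scene

print(f"water fraction: {water_mask.mean():.2%}")
```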

5.9. Disaster Resilience Planning

By assessing the exposure and susceptibility of critical infrastructure and communities, AI-powered remote sensing can support the development of effective disaster response plans, early warning systems, and resilient urban designs [217]. It can guide individuals during disasters, offering real-time evacuation information, shelter locations, and critical details about the affected areas [218]. AI-enhanced remote sensing services and products can enhance disaster preparedness awareness, assisting emergency agencies in evacuations and resource deliveries. Researchers at the Urban Resilience.AI Lab use big data to build AI models crucial for mitigation, preparedness, and recovery (Urban Resilience.AI Lab). Predictive analytics anticipate evacuations using seismic and weather data, while combining satellite imagery, seismometer readings, and social media helps verify disasters for faster responses (AI for Disaster Response, AIRD). AI can evaluate damage, allocate resources, and prioritize recovery efforts using satellite imagery [219]. Additionally, AI can assess pre-disaster vulnerability, utilizing remote sensing data to identify high-risk areas [220]. These advancements enhance disaster readiness, minimizing impacts on communities [221].

6. Conclusions

The integration of AI techniques in remote sensing has emerged as a powerful paradigm with tremendous potential for practical applications. This convergence creates exciting opportunities to advance our understanding of Earth’s dynamics, support decision-making processes, and foster sustainable development. This review paper provides a comprehensive overview of the current state of AI in remote sensing, emphasizing its significance and impact. This paper covers the fundamentals of remote sensing technologies, including optical remote sensing, radar remote sensing, LiDAR, thermal remote sensing, and multispectral/HSI. It delves into key AI techniques used in remote sensing, such as conventional ML and deep learning, including DCNNs, ResNets, YOLO, Faster R-CNN, and self-attention methods. Various practical applications of AI in remote sensing are discussed in this paper, including image classification and land cover mapping, object detection and change detection, data fusion and integration, and hyperspectral/LiDAR data analysis. These applications showcase the effectiveness of AI in enhancing data analysis, improving accuracy, and automating processes. The paper also identifies several challenges: data availability, training optimization, data quality, security of sensitive remote sensing data, uncertainty in real-world scenarios, integrity, and diversity. Addressing these challenges requires further research and innovative solutions to ensure practical implementation. This paper outlines ongoing and potential applications, such as wildfire detection and management, illegal logging and deforestation monitoring, coastal and marine ecosystem monitoring, biodiversity conservation and habitat monitoring, airborne disease monitoring and forecasting, precision forestry, urban heat island mitigation, precision water management, and disaster resilience planning. Beyond these applications, there are even more possibilities, including precision agriculture optimization, renewable energy site selection, disaster management, early warning systems, and urban planning and infrastructure development. These envisioned applications highlight the transformative benefits of AI in addressing critical challenges and improving decision making in diverse fields, showcasing its potential to solve environmental and societal issues.

Author Contributions

Conceptualization, Z.S. and N.C.; methodology, Z.S., B.J. and N.C.; formal analysis, B.J., Z.S. and N.C.; investigation, B.J., Z.S. and N.C.; resources, Z.S. and N.C.; data curation, B.J. and G.P.A.; writing—original draft preparation, B.J. and Z.S.; writing—review and editing, B.J., Z.S., N.C. and G.P.A.; visualization, B.J., Z.S. and N.C.; supervision, Z.S. and N.C.; project administration, Z.S.; funding acquisition, Z.S. and N.C. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by the NASA ACCESS program (#80NSSC21M0028). Drs. Cristea and Sun were supported by the National Science Foundation award EAR-1947875 and EAR-1947893, and OAC-2117834. N. Cristea was additionally supported by the University of Washington eScience Institute.

Data Availability Statement

Not applicable.

Acknowledgments

Thanks to Lakshmi Chetana Gomaram Bikshapathireddy for the help in organizing the references.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Campbell, J.B.; Wynne, R.H. Introduction to Remote Sensing; Guilford Press: New York, NY, USA, 2011. [Google Scholar]
  2. Earthdata Cloud Evolution. Earthdata. 30 March 2022. Available online: https://www.earthdata.nasa.gov/eosdis/cloud-evolution (accessed on 4 July 2023).
  3. Jensen, J.R. Remote Sensing of the Environment: An Earth Resource Perspective 2/e; Pearson Education: Bangalore, India, 2009. [Google Scholar]
  4. Mohan, E.; Rajesh, A.; Sunitha, G.; Konduru, R.M.; Avanija, J.; Babu, L.G. A deep neural network learning-based speckle noise removal technique for enhancing the quality of synthetic-aperture radar images. Concurr. Comput. Pract. Exp. 2021, 33, e6239. [Google Scholar] [CrossRef]
  5. Zhang, L.; Zhang, L. Artificial Intelligence for Remote Sensing Data Analysis: A review of challenges and opportunities. IEEE Geosci. Remote Sens. Mag. 2022, 10, 270–294. [Google Scholar] [CrossRef]
  6. Li, J.; Li, Y.; He, L.; Chen, J.; Plaza, A. Spatio-temporal fusion for remote sensing data: An overview and new benchmark. Sci. China Inf. Sci. 2020, 63, 1–17. [Google Scholar] [CrossRef]
  7. Xu, S.; Cheng, J.; Zhang, Q. A Random Forest-Based Data Fusion Method for Obtaining All-Weather Land Surface Temperature with High Spatial Resolution. Remote Sens. 2021, 13, 2211. [Google Scholar] [CrossRef]
  8. Kinaneva, D.; Hristov, G.; Raychev, J.; Zahariev, P. Early Forest Fire Detection Using Drones and Artificial Intelligence. In Proceedings of the 2019 42nd International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), Opatija, Croatia, 20–24 May 2019; pp. 1060–1065. [Google Scholar]
  9. Ghamisi, P.; Rasti, B.; Yokoya, N.; Wang, Q.M.; Hofle, B.; Bruzzone, L.; Bovolo, F.; Chi, M.M.; Anders, K.; Gloaguen, R.; et al. Multisource and multitemporal data fusion in remote sensing a comprehensive review of the state of the art. IEEE Geosci. Remote Sens. Mag. 2019, 7, 6–39. [Google Scholar] [CrossRef]
  10. Mo, Y.; Xu, Y.; Liu, Y.; Xin, Y.; Zhu, S. Comparison of gap-filling methods for producing all-weather daily remotely sensed near-surface air temperature. Remote Sens. Environ. 2023, 296, 113732. [Google Scholar] [CrossRef]
  11. Peng, J.; Loew, A.; Merlin, O.; Verhoest, N.E.C. A review of spatial downscaling of satellite remotely sensed soil moisture. Rev. Geophys. 2017, 55, 341–366. [Google Scholar] [CrossRef]
  12. Hong, D.; He, W.; Yokoya, N.; Yao, J.; Gao, L.; Zhang, L.; Chanussot, J.; Zhu, X. Interpretable Hyperspectral Artificial Intelligence: When nonconvex modeling meets hyperspectral remote sensing. IEEE Geosci. Remote Sens. Mag. 2021, 9, 52–87. [Google Scholar] [CrossRef]
  13. Chen, H.; Qi, Z.; Shi, Z. Remote sensing image change detection with transformers. IEEE Trans. Geosci. Remote Sens. 2021, 60, 1–14. [Google Scholar] [CrossRef]
  14. Sun, Z.; Sandoval, L.; Crystal-Ornelas, R.; Mousavi, S.M.; Wang, J.; Lin, C.; Cristea, N.; Tong, D.; Carande, W.H.; Ma, X.; et al. A review of earth artificial intelligence. Comput. Geosci. 2022, 159, 105034. [Google Scholar]
  15. Le Moigne, J. Artificial Intelligence and Machine Learning for Earth Science. In Proceedings of the 2021 International Space University (ISU) Alumni Conference, Online, 30 July 2021. [Google Scholar]
  16. Sayer, A.M.; Govaerts, Y.; Kolmonen, P.; Lipponen, A.; Luffarelli, M.; Mielonen, T.; Patadia, F.; Popp, T.; Povey, A.C.; Stebel, K.; et al. A review and framework for the evaluation of pixel-level uncertainty estimates in satellite aerosol remote sensing. Atmospheric Meas. Tech. 2020, 13, 373–404. [Google Scholar] [CrossRef]
  17. Lillesand, T.; Kiefer, R.W.; Chipman, J. Remote Sensing and Image Interpretation, 5th ed.; John Wiley & Sons: Hobokan, NJ, USA, 2004; ISBN 0471152277. [Google Scholar]
  18. Gupta, R.P. Remote Sensing Geology; Springer: Berlin/Heidelberg, Germany, 2017; ISBN 9783662558744. [Google Scholar]
  19. Prasad, S.; Bruce, L.M.; Chanussot, J. Optical Remote Sensing—Advances in Signal Processing and Exploitation Techniques; Springer: Berlin/Heidelberg, Germany, 2011. [Google Scholar]
  20. Aggarwal, S. Principles of remote sensing. Satell. Remote Sens. GIS Appl. Agric. Meteorol. 2004, 23, 23–28. [Google Scholar]
  21. Cheng, G.; Han, J. A survey on object detection in optical remote sensing images. ISPRS J. Photogramm. Remote Sens. 2016, 117, 11–28. [Google Scholar] [CrossRef]
  22. Yang, H.; Nguyen, T.-N.; Chuang, T.-W. An Integrative Explainable Artificial Intelligence Approach to Analyze Fine-Scale Land-Cover and Land-Use Factors Associated with Spatial Distributions of Place of Residence of Reported Dengue Cases. Trop. Med. Infect. Dis. 2023, 8, 238. [Google Scholar] [CrossRef]
  23. Kamarulzaman, A.M.M.; Jaafar, W.S.W.M.; Said, M.N.M.; Saad, S.N.M.; Mohan, M. UAV Implementations in Urban Planning and Related Sectors of Rapidly Developing Nations: A Review and Future Perspectives for Malaysia. Remote Sens. 2023, 15, 2845. [Google Scholar] [CrossRef]
  24. Pettorelli, N. The Normalized Difference Vegetation Index; Oxford University Press: Cary, NC, USA, 2013. [Google Scholar]
  25. Sun, Z.; Peng, C.; Deng, M.; Chen, A.; Yue, P.; Fang, H.; Di, L. Automation of Customized and Near-Real-Time Vegetation Condition Index Generation Through Cyberinfrastructure-Based Geoprocessing Workflows. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2014, 7, 4512–4522. [Google Scholar] [CrossRef]
  26. Gholizadeh, A.; Kopačková, V. Detecting vegetation stress as a soil contamination proxy: A review of optical proximal and remote sensing techniques. Int. J. Environ. Sci. Technol. 2019, 16, 2511–2524. [Google Scholar] [CrossRef]
  27. Stone, M.L.; Solie, J.B.; Raun, W.R.; Whitney, R.W.; Taylor, S.L.; Ringer, J.D. Use of Spectral Radiance for Correcting In-season Fertilizer Nitrogen Deficiencies in Winter Wheat. Trans. ASAE 1996, 39, 1623–1631. [Google Scholar] [CrossRef]
  28. Osborne, S.L.; Schepers, J.S.; Francis, D.D.; Schlemmer, M.R. Detection of Phosphorus and Nitrogen Deficiencies in Corn Using Spectral Radiance Measurements. Agron. J. 2002, 94, 1215–1221. [Google Scholar] [CrossRef]
  29. Cannistra, A.F.; Shean, D.E.; Cristea, N.C. High-resolution CubeSat imagery and machine learning for detailed snow-covered area. Remote Sens. Environ. 2021, 258, 112399. [Google Scholar] [CrossRef]
  30. John, A.; Cannistra, A.F.; Yang, K.; Tan, A.; Shean, D.; Lambers, J.H.R.; Cristea, N. High-Resolution Snow-Covered Area Mapping in Forested Mountain Ecosystems Using PlanetScope Imagery. Remote Sens. 2022, 14, 3409. [Google Scholar] [CrossRef]
  31. Richards, J.A. Remote Sensing with Imaging Radar; Springer: Berlin/Heidelberg, Germany, 2009; Volume 1. [Google Scholar] [CrossRef]
  32. Dinh, H.T.M.; Hanssen, R.; Rocca, F. Radar interferometry: 20 years of development in time series techniques and future perspectives. Remote Sens. 2020, 12, 1364. [Google Scholar]
  33. Oguchi, T.; Hayakawa, Y.S.; Wasklewicz, T. Remote Data in Fluvial Geomorphology: Characteristics and Applications. In Treatise on Geomorphology; Elsevier: Amsterdam, The Netherlands, 2022; pp. 1116–1142. [Google Scholar]
  34. Moreira, A.; Prats-Iraola, P.; Younis, M.; Krieger, G.; Hajnsek, I.; Papathanassiou, K.P. A tutorial on synthetic aperture radar. IEEE Geosci. Remote Sens. Mag. 2013, 1, 6–43. [Google Scholar] [CrossRef]
  35. Devaney, J.; Barrett, B.; Barrett, F.; Redmond, J.; O’halloran, J. Forest Cover Estimation in Ireland Using Radar Remote Sensing: A Comparative Analysis of Forest Cover Assessment Methodologies. PLoS ONE 2015, 10, e0133583. [Google Scholar] [CrossRef]
  36. Dubayah, R.O.; Drake, J.B. Lidar remote sensing for forestry. J. For. 2000, 98, 44–46. [Google Scholar]
  37. Dassot, M.; Constant, T.; Fournier, M. The use of terrestrial LiDAR technology in forest science: Application fields, benefits and challenges. Ann. For. Sci. 2011, 68, 959–974. [Google Scholar] [CrossRef]
  38. Deems, J.S.; Painter, T.H.; Finnegan, D.C. Lidar measurement of snow depth: A review. J. Glaciol. 2013, 59, 467–479. [Google Scholar] [CrossRef]
  39. Disney, M.; Kalogirou, V.; Lewis, P.; Prieto-Blanco, A.; Hancock, S.; Pfeifer, M. Simulating the impact of discrete-return lidar system and survey characteristics over young conifer and broadleaf forests. Remote Sens. Environ. 2010, 114, 1546–1560. [Google Scholar] [CrossRef]
  40. Prakash, A. Thermal remote sensing: Concepts, issues and applications. Int. Arch. Photogramm. Remote Sens. 2000, 33, 239–243. [Google Scholar]
  41. Bakker, W.H.; Feringa, W.; Gieske, A.S.M.; Gorte, B.G.H.; Grabmaier, K.A.; Hecker, C.A.; Horn, J.A.; Huurneman, G.C.; Janssen, L.L.F.; Kerle, N.; et al. Thermal Remote Sensing; Humboldt.Edu: Arcata, CA, USA, 2009. [Google Scholar]
  42. Allison, R.S.; Johnston, J.M.; Craig, G.; Jennings, S. Airborne Optical and Thermal Remote Sensing for Wildfire Detection and Monitoring. Sensors 2016, 16, 1310. [Google Scholar] [CrossRef]
  43. Voogt, J.A.; Oke, T.R. Thermal remote sensing of urban climates. Remote Sens. Environ. 2003, 86, 370–384. [Google Scholar] [CrossRef]
  44. Shaw, G.A.; Burke, H.K. Spectral imaging for remote sensing. Linc. Lab. J. 2003, 14, 3–28. [Google Scholar]
  45. Manolakis, D.G.; Lockwood, R.B.; Cooley, T.W. Hyperspectral Imaging Remote Sensing: Physics, Sensors, and Algorithms; Cambridge University Press: Cambridge, UK, 2016. [Google Scholar]
  46. Sun, W.; Du, Q. Hyperspectral band selection: A review. IEEE Geosci. Remote Sens. Mag. 2019, 7, 118–139. [Google Scholar] [CrossRef]
  47. Dong, P.; Chen, Q. LiDAR Remote Sensing and Applications; CRC Press: Boca Raton, FL, USA, 2017. [Google Scholar]
  48. Weiss, M.; Jacob, F.; Duveiller, G. Remote sensing for agricultural applications: A meta-review. Remote Sens. Environ. 2020, 236, 111402. [Google Scholar] [CrossRef]
  49. Ghosh, A.; Fassnacht, F.E.; Joshi, P.K.; Koch, B. A framework for mapping tree species combining hyperspectral and LiDAR data: Role of selected classifiers and sensor across three spatial scales. Int. J. Appl. Earth Obs. Geoinf. 2014, 26, 49–63. [Google Scholar] [CrossRef]
  50. Lary, D.J.; Alavi, A.H.; Gandomi, A.H.; Walker, A.L. Machine learning in geosciences and remote sensing. Geosci. Front. 2016, 7, 3–10. [Google Scholar] [CrossRef]
  51. Sarker, I.H. Deep Learning: A Comprehensive Overview on Techniques, Taxonomy, Applications and Research Directions. SN Comput. Sci. 2021, 2, 420. [Google Scholar] [CrossRef]
  52. Sun, Z.; Cristea, N.; Tong, D.; Tullis, J.; Chester, Z.; Magill, A. A review of cyberinfrastructure for machine learning and big data in the geosciences. Recent Adv. Geoinformatics Data Sci. 2023, 558, 161. [Google Scholar] [CrossRef]
  53. Sun, Z.; Cristea, N.; Rivas, P. (Eds.) Artificial Intelligence in Earth Science: Best Practices and Fundamental Challenges; Elsevier-Health Sciences Division: Amsterdam, The Netherlands, 2023. [Google Scholar]
  54. Saini, R.; Ghosh, S. Ensemble classifiers in remote sensing: A review. In Proceedings of the 2017 International Conference on Computing, Communication and Automation (ICCCA), Greater Noida, India, 5–6 May 2017; pp. 1148–1152. [Google Scholar] [CrossRef]
  55. Miao, X.; Heaton, J.S.; Zheng, S.; Charlet, D.A.; Liu, H. Applying tree-based ensemble algorithms to the classification of ecological zones using multi-temporal multi-source remote-sensing data. Int. J. Remote Sens. 2011, 33, 1823–1849. [Google Scholar] [CrossRef]
  56. Zhang, Y.; Liu, J.; Shen, W. A Review of Ensemble Learning Algorithms Used in Remote Sensing Applications. Appl. Sci. 2022, 12, 8654. [Google Scholar] [CrossRef]
  57. Breiman, L. Bagging predictors. Mach. Learn. 1996, 24, 123–140. [Google Scholar] [CrossRef]
  58. Schapire, R.E. A Brief Introduction to Boosting. Psu.Edu. 1999. Available online: https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=fa329f834e834108ccdc536db85ce368fee227ce (accessed on 4 August 2023).
  59. Freund, Y.; Schapire, R.E. A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting. J. Comput. Syst. Sci. 1997, 55, 119–139. [Google Scholar] [CrossRef]
  60. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  61. Pal, M. Random forest classifier for remote sensing classification. Int. J. Remote Sens. 2005, 26, 217–222. [Google Scholar] [CrossRef]
  62. Gislason, P.O.; Benediktsson, J.A.; Sveinsson, J.R. Random Forests for land cover classification. Pattern Recognit. Lett. 2006, 27, 294–300. [Google Scholar] [CrossRef]
  63. Rodriguez-Galiano, V.F.; Ghimire, B.; Rogan, J.; Chica-Olmo, M.; Rigol-Sanchez, J.P. An assessment of the effectiveness of a random forest classifier for land-cover classification. ISPRS J. Photogramm. Remote Sens. 2012, 67, 93–104. [Google Scholar] [CrossRef]
  64. Mascaro, J.; Asner, G.P.; Knapp, D.E.; Kennedy-Bowdoin, T.; Martin, R.E.; Anderson, C.; Higgins, M.; Chadwick, K.D. A Tale of Two “Forests”: Random Forest Machine Learning Aids Tropical Forest Carbon Mapping. PLoS ONE 2014, 9, e85993. [Google Scholar] [CrossRef] [PubMed]
  65. Phan, T.N.; Kuch, V.; Lehnert, L.W. Land Cover Classification using Google Earth Engine and Random Forest Classifier—The Role of Image Composition. Remote Sens. 2020, 12, 2411. [Google Scholar] [CrossRef]
  66. Yang, K.; John, A.; Shean, D.; Lundquist, J.D.; Sun, Z.; Yao, F.; Todoran, S.; Cristea, N. High-resolution mapping of snow cover in montane meadows and forests using Planet imagery and machine learning. Front. Water 2023, 5, 1128758. [Google Scholar] [CrossRef]
  67. Rittger, K.; Krock, M.; Kleiber, W.; Bair, E.H.; Brodzik, M.J.; Stephenson, T.R.; Rajagopalan, B.; Bormann, K.J.; Painter, T.H. Multi-sensor fusion using random forests for daily fractional snow cover at 30 m. Remote Sens. Environ. 2021, 264, 112608. [Google Scholar] [CrossRef]
  68. Ham, J.; Chen, Y.; Crawford, M.M.; Ghosh, J. Investigation of the random forest framework for classification of hyperspectral data. IEEE Trans. Geosci. Remote Sens. 2005, 43, 492–501. [Google Scholar] [CrossRef]
  69. Sabat-Tomala, A.; Raczko, E.; Zagajewski, B. Comparison of Support Vector Machine and Random Forest Algorithms for Invasive and Expansive Species Classification Using Airborne Hyperspectral Data. Remote Sens. 2020, 12, 516. [Google Scholar] [CrossRef]
  70. Belgiu, M.; Drăguţ, L. Random forest in remote sensing: A review of applications and future directions. ISPRS J. Photogramm. Remote Sens. 2016, 114, 24–31. [Google Scholar] [CrossRef]
  71. Behnamian, A.; Banks, S.; White, L.; Millard, K.; Pouliot, D.; Pasher, J.; Duffe, J. Dimensionality Reduction in The Presence of Highly Correlated Variables for Random Forests: Wetland Case Study. In Proceedings of the IGARSS 2019–2019 IEEE International Geoscience and Remote Sensing Symposium, Yokohama, Japan, 28 July–2 August 2019; pp. 9839–9842. [Google Scholar] [CrossRef]
  72. Georganos, S.; Grippa, T.; Vanhuysse, S.; Lennert, M.; Shimoni, M.; Kalogirou, S.; Wolff, E. Less is more: Optimizing classification performance through feature selection in a very-high-resolution remote sensing object-based urban application. GIScience Remote Sens. 2017, 55, 221–242. [Google Scholar] [CrossRef]
  73. Millard, K.; Richardson, M. On the Importance of Training Data Sample Selection in Random Forest Image Classification: A Case Study in Peatland Ecosystem Mapping. Remote Sens. 2015, 7, 8489–8515. [Google Scholar] [CrossRef]
  74. Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. arXiv 2016, arXiv:1603.02754. [Google Scholar]
  75. Ghatkar, J.G.; Singh, R.K.; Shanmugam, P. Classification of algal bloom species from remote sensing data using an extreme gradient boosted decision tree model. Int. J. Remote Sens. 2019, 40, 9412–9438. [Google Scholar] [CrossRef]
  76. Mountrakis, G.; Im, J.; Ogole, C. Support vector machines in remote sensing: A review. ISPRS J. Photogramm. Remote Sens. 2011, 66, 247–259. [Google Scholar] [CrossRef]
  77. Kavzoglu, T.; Colkesen, I. A kernel functions analysis for support vector machines for land cover classification. Int. J. Appl. Earth Obs. Geoinf. 2009, 11, 352–359. [Google Scholar] [CrossRef]
  78. Sheykhmousa, M.; Mahdianpari, M.; Ghanbari, H.; Mohammadimanesh, F.; Ghamisi, P.; Homayouni, S. Support Vector Machine Versus Random Forest for Remote Sensing Image Classification: A Meta-Analysis and Systematic Review. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2020, 13, 6308–6325. [Google Scholar] [CrossRef]
  79. Zhu, J.-Y.; Park, T.; Isola, P.; Efros, A.A. Unpaired image-to-image translation using cycle-consistent adversarial networks. arXiv 2017, arXiv:1703.10593. [Google Scholar]
  80. Ma, L.; Liu, Y.; Zhang, X.; Ye, Y.; Yin, G.; Johnson, B.A. Deep learning in remote sensing applications: A meta-analysis and review. ISPRS J. Photogramm. Remote Sens. 2019, 152, 166–177. [Google Scholar] [CrossRef]
  81. Yuan, Q.; Shen, H.; Li, T.; Li, Z.; Li, S.; Jiang, Y.; Xu, H.; Tan, W.; Yang, Q.; Wang, J.; et al. Deep learning in environmental remote sensing: Achievements and challenges. Remote Sens. Environ. 2020, 241, 111716. [Google Scholar] [CrossRef]
  82. Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016. [Google Scholar]
  83. Zhang, L.; Zhang, L.; Du, B. Deep Learning for Remote Sensing Data: A Technical Tutorial on the State of the Art. IEEE Geosci. Remote Sens. Mag. 2016, 4, 22–40. [Google Scholar] [CrossRef]
  84. Traore, B.B.; Kamsu-Foguem, B.; Tangara, F. Deep convolution neural network for image recognition. Ecol. Inform. 2018, 48, 257–268. [Google Scholar] [CrossRef]
  85. Chen, L.; Li, S.; Bai, Q.; Yang, J.; Jiang, S.; Miao, Y. Review of Image Classification Algorithms Based on Convolutional Neural Networks. Remote Sens. 2021, 13, 4712. [Google Scholar] [CrossRef]
  86. Agarap, A.F. Deep learning using rectified linear units (relu). arXiv 2018, arXiv:1803.08375. [Google Scholar]
  87. Aloysius, N.; Geetha, M. A review on deep convolutional neural networks. In Proceedings of the 2017 International Conference on Communication and Signal Processing (ICCSP), Chennai, India, 6–8 April 2017; pp. 0588–0592. [Google Scholar]
  88. Dubey, A.K.; Jain, V. Comparative Study of Convolution Neural Network’s ReLu and Leaky-ReLu Activation Functions. In Applications of Computing, Automation and Wireless Systems in Electrical Engineering; Springer: Singapore, 2019; pp. 873–880. [Google Scholar]
  89. Zhang, Y.-D.; Pan, C.; Sun, J.; Tang, C. Multiple sclerosis identification by convolutional neural network with dropout and parametric ReLU. J. Comput. Sci. 2018, 28, 1–10. [Google Scholar] [CrossRef]
  90. Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional networks for biomedical image segmentation. arXiv 2015, arXiv:1505.04597. [Google Scholar]
  91. Badrinarayanan, V.; Kendall, A.; Cipolla, R. Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 2481–2495. [Google Scholar] [CrossRef] [PubMed]
  92. Alom, M.Z.; Taha, T.M.; Yakopcic, C.; Westberg, S.; Sidike, P.; Nasrin, M.S.; Van Esesn, B.C.; Awwal, A.A.S.; Asari, V.K. The history began from alexnet: A comprehensive survey on deep learning approaches. arXiv 2018, arXiv:1803.01164. [Google Scholar]
  93. Chen, L.-C.; Papandreou, G.; Kokkinos, I.; Murphy, K.; Yuille, A.L. DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs. arXiv 2016, arXiv:1606.00915. [Google Scholar] [CrossRef] [PubMed]
  94. Zhao, Y.; Zhang, X.; Feng, W.; Xu, J. Deep Learning Classification by ResNet-18 Based on the Real Spectral Dataset from Multispectral Remote Sensing Images. Remote Sens. 2022, 14, 4883. [Google Scholar] [CrossRef]
  95. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
  96. Li, H.; Wu, X.-J.; Durrani, T.S. Infrared and visible image fusion with ResNet and zero-phase component analysis. Infrared Phys. Technol. 2019, 102, 103039. [Google Scholar] [CrossRef]
  97. Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You Only Look Once: Unified, Real-Time Object Detection. arXiv 2015, arXiv:1506.02640. [Google Scholar]
  98. Redmon, J. Darknet: Open Source Neural Networks in C. Pjreddie.Com. 2013. Available online: https://pjreddie.com/darknet/ (accessed on 4 August 2023).
  99. Wu, Z.; Chen, X.; Gao, Y.; Li, Y. Rapid Target Detection in High Resolution Remote Sensing Images Using Yolo Model. ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2018, 42, 1915–1920. [Google Scholar] [CrossRef]
  100. Redmon, J.; Farhadi, A. YOLO9000: Better, Faster, Stronger. arXiv 2016, arXiv:1612.08242. [Google Scholar]
  101. Xu, D.; Wu, Y. Improved YOLO-V3 with DenseNet for Multi-Scale Remote Sensing Target Detection. Sensors 2020, 20, 4276. [Google Scholar] [CrossRef]
  102. Yang, F. An improved YOLO v3 algorithm for remote Sensing image target detection. J. Phys. Conf. Ser. 2021, 2132, 012028. [Google Scholar] [CrossRef]
  103. Redmon, J.; Farhadi, A. Yolov3: An incremental improvement. arXiv 2018, arXiv:1804.02767. [Google Scholar]
  104. Jiang, P.; Ergu, D.; Liu, F.; Cai, Y.; Ma, B. A Review of Yolo Algorithm Developments. Procedia Comput. Sci. 2022, 199, 1066–1073. [Google Scholar] [CrossRef]
  105. Terven, J.; Cordova-Esparza, D. A Comprehensive Review of YOLO: From YOLOv1 and Beyond. arXiv 2023, arXiv:2304.00501. [Google Scholar]
  106. Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards real-time object detection with region proposal networks. arXiv 2015, arXiv:1506.01497. [Google Scholar] [CrossRef]
  107. Girshick, R. Fast r-cnn. In Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 13–16 December 2015; pp. 1440–1448. [Google Scholar]
  108. Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.; Polosukhin, I. Attention Is All You Need. arXiv 2017, arXiv:1706.03762. [Google Scholar]
  109. Aleissaee, A.A.; Kumar, A.; Anwer, R.M.; Khan, S.; Cholakkal, H.; Xia, G.-S.; Khan, F.S. Transformers in Remote Sensing: A Survey. Remote Sens. 2023, 15, 1860. [Google Scholar] [CrossRef]
  110. Devlin, J.; Chang, M.W.; Lee, K.; Toutanova, K. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv 2018, arXiv:1810.04805. [Google Scholar]
  111. He, J.; Zhao, L.; Yang, H.; Zhang, M.; Li, W. HSI-BERT: Hyperspectral Image Classification Using the Bidirectional Encoder Representation from Transformers. IEEE Trans. Geosci. Remote Sens. A Publ. IEEE Geosci. Remote Sens. Soc. 2020, 58, 165–178. [Google Scholar] [CrossRef]
  112. Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef]
  113. Sun, Z.; Di, L.; Fang, H. Using long short-term memory recurrent neural network in land cover classification on Landsat and Cropland data layer time series. Int. J. Remote Sens. 2018, 40, 593–614. [Google Scholar] [CrossRef]
  114. Graves, A.; Graves, A. Long short-term memory. In Supervised Sequence Labelling with Recurrent Neural Networks; Springer: Berlin/Heidelberg, Germany, 2012; pp. 37–45. [Google Scholar]
  115. Goodfellow, I.J.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative Adversarial Nets. Neurips.Cc. 2014. Available online: https://proceedings.neurips.cc/paper_files/paper/2014/file/5ca3e9b122f61f8f06494c97b1afccf3-Paper.pdf (accessed on 7 August 2023).
  116. Dash, A.; Ye, J.; Wang, G. A Review of Generative Adversarial Networks (GANs) and Its Applications in a Wide Variety of Disciplines—From Medical to Remote Sensing. arXiv 2021, arXiv:2110.01442. [Google Scholar]
  117. Jozdani, S.; Chen, D.; Pouliot, D.; Johnson, B.A. A review and meta-analysis of Generative Adversarial Networks and their applications in remote sensing. Int. J. Appl. Earth Obs. Geoinf. 2022, 108, 102734. [Google Scholar] [CrossRef]
  118. Creswell, A.; White, T.; Dumoulin, V.; Arulkumaran, K.; Sengupta, B.; Bharath, A.A. Generative Adversarial Networks: An Overview. IEEE Signal Process. Mag. 2018, 35, 53–65. [Google Scholar] [CrossRef]
  119. Xu, C.; Zhao, B. Satellite Image Spoofing: Creating Remote Sensing Dataset with Generative Adversarial Networks (Short Paper); Schloss Dagstuhl—Leibniz-Zentrum fuer Informatik GmbH.: Wadern/Saarbruecken, Germany, 2018. [Google Scholar]
  120. Zi, Y.; Xie, F.; Song, X.; Jiang, Z.; Zhang, H. Thin Cloud Removal for Remote Sensing Images Using a Physical-Model-Based CycleGAN With Unpaired Data. IEEE Geosci. Remote Sens. Lett. 2021, 19, 1–5. [Google Scholar] [CrossRef]
  121. Ledig, C.; Theis, L.; Huszár, F.; Caballero, J.; Cunningham, A.; Acosta, A.; Aitken, A.P.; Tejani, A.; Totz, J.; Wang, Z.; et al. Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 105–114. [Google Scholar]
  122. Isola, P.; Zhu, J.-Y.; Zhou, T.; Efros, A.A. Image-to-Image Translation with Conditional Adversarial Networks. arXiv 2017, arXiv:1611.07004. [Google Scholar]
  123. Sun, H.; Wang, P.; Chang, Y.; Qi, L.; Wang, H.; Xiao, D.; Zhong, C.; Wu, X.; Li, W.; Sun, B. HRPGAN: A GAN-based Model to Generate High-resolution Remote Sensing Images. IOP Conf. Series Earth Environ. Sci. 2020, 428, 012060. [Google Scholar] [CrossRef]
  124. Lin, D.; Fu, K.; Wang, Y.; Xu, G.; Sun, X. MARTA GANs: Unsupervised Representation Learning for Remote Sensing Image Classification. IEEE Geosci. Remote Sens. Lett. 2017, 14, 2092–2096. [Google Scholar] [CrossRef]
  125. Liu, X.; Wang, Y.; Liu, Q. Psgan: A Generative Adversarial Network for Remote Sensing Image Pan-Sharpening. In Proceedings of the 2018 25th IEEE International Conference on Image Processing (ICIP), Athens, Greece, 7–10 October 2018; pp. 873–877. [Google Scholar]
  126. Hu, A.; Xie, Z.; Xu, Y.; Xie, M.; Wu, L.; Qiu, Q. Unsupervised Haze Removal for High-Resolution Optical Remote-Sensing Images Based on Improved Generative Adversarial Networks. Remote Sens. 2020, 12, 4162. [Google Scholar] [CrossRef]
  127. Singh, P.; Komodakis, N. Cloud-Gan: Cloud Removal for Sentinel-2 Imagery Using a Cyclic Consistent Generative Adversarial Networks. In Proceedings of the IGARSS 2018—2018 IEEE International Geoscience and Remote Sensing Symposium, Valencia, Spain, 22–27 July 2018; pp. 1772–1775. [Google Scholar] [CrossRef]
  128. Arulkumaran, K.; Deisenroth, M.P.; Brundage, M.; Bharath, A.A. Deep Reinforcement Learning: A Brief Survey. IEEE Signal Process. Mag. 2017, 34, 26–38. [Google Scholar] [CrossRef]
  129. Li, Y. Deep Reinforcement Learning: An Overview. arXiv 2017, arXiv:1701.07274. [Google Scholar]
  130. Mou, L.; Saha, S.; Hua, Y.; Bovolo, F.; Bruzzone, L.; Zhu, X.X. Deep Reinforcement Learning for Band Selection in Hyperspectral Image Classification. IEEE Trans. Geosci. Remote. Sens. 2021, 60, 1–14. [Google Scholar] [CrossRef]
  131. Fu, K.; Li, Y.; Sun, H.; Yang, X.; Xu, G.; Li, Y.; Sun, X. A Ship Rotation Detection Model in Remote Sensing Images Based on Feature Fusion Pyramid Network and Deep Reinforcement Learning. Remote Sens. 2018, 10, 1922. [Google Scholar] [CrossRef]
  132. Filar, J.; Vrieze, K. Competitive Markov Decision Processes; Springer Science & Business Media: New York, NY, USA, 2012. [Google Scholar] [CrossRef]
  133. Maxwell, A.E.; Warner, T.A.; Fang, F. Implementation of machine-learning classification in remote sensing: An applied review. Int. J. Remote Sens. 2018, 39, 2784–2817. [Google Scholar] [CrossRef]
  134. Li, Y.; Zhang, H.; Xue, X.; Jiang, Y.; Shen, Q. Deep learning for remote sensing image classification: A survey. WIREs Data Min. Knowl. Discov. 2018, 8, e1264. [Google Scholar] [CrossRef]
  135. Song, J.; Gao, S.; Zhu, Y.; Ma, C. A survey of remote sensing image classification based on CNNs. Big Earth Data 2019, 3, 232–254. [Google Scholar] [CrossRef]
  136. Methodology & Accuracy Summary 10m Global Land Use Land Cover Maps. Impactobservatory.Com. 2022. Available online: https://www.impactobservatory.com/static/lulc_methodology_accuracy-ee742a0a389a85a0d4e7295941504ac2.pdf (accessed on 29 June 2023).
  137. AI Enables Rapid Creation of Global Land Cover Map. Esri. 7 September 2021. Available online: https://www.esri.com/about/newsroom/arcuser/ai-enables-rapid-creation-of-global-land-cover-map/ (accessed on 5 July 2023).
  138. SpaceKnow. GEMSTONE CASE STUDY: Global Economic Monitoring Using Satellite Data and AI/ML Technology. Medium. 25 April 2022. Available online: https://spaceknow.medium.com/gemstone-case-study-global-economic-monitoring-using-satellite-data-and-ai-ml-technology-6526c336bf18 (accessed on 29 June 2023).
  139. Qi, W. Object detection in high resolution optical image based on deep learning technique. Nat. Hazards Res. 2022, 2, 384–392. [Google Scholar] [CrossRef]
  140. Yang, W.; Song, H.; Du, L.; Dai, S.; Xu, Y. A Change Detection Method for Remote Sensing Images Based on Coupled Dictionary and Deep Learning. Comput. Intell. Neurosci. 2022, 2022, 3404858. [Google Scholar] [CrossRef] [PubMed]
  141. Schmitt, M.; Zhu, X.X. Data Fusion and Remote Sensing: An ever-growing relationship. IEEE Geosci. Remote Sens. Mag. 2016, 4, 6–23. [Google Scholar] [CrossRef]
  142. Floodly AI. Esa.Int. 15 January 2021. Available online: https://business.esa.int/projects/floodly-ai (accessed on 29 June 2023).
  143. Paganini, M.; Wyniawskyj, N.; Talon, P.; White, S.; Watson, G.; Petit, D. Total Ecosystem Management of the InterTidal Habitat (TEMITH). Esa.Int. 12 September 2020. Available online: https://eo4society.esa.int/wp-content/uploads/2021/06/TEMITH-DMU-TEC-ESR01-11-E_Summary_Report.pdf (accessed on 5 July 2023).
  144. Zhong, H.; Lin, W.; Liu, H.; Ma, N.; Liu, K.; Cao, R.; Wang, T.; Ren, Z. Identification of tree species based on the fusion of UAV hyperspectral image and LiDAR data in a coniferous and broad-leaved mixed forest in Northeast China. Front. Plant Sci. 2022, 13, 964769. [Google Scholar] [CrossRef]
  145. Woodie, A. AI Opens Door to Expanded Use of LIDAR Data. Datanami. 17 September 2020. Available online: https://www.datanami.com/2020/09/17/ai-opens-door-to-expanded-use-of-lidar-data/ (accessed on 5 July 2023).
  146. Technology. Metaspectral. 20 September 2022. Available online: https://metaspectral.com/technology/ (accessed on 29 June 2023).
  147. Redins, L. Metaspectral’s AI Platform Uses Hyperspectral Imaging, Edge Computing to Transform Space, Recycling and Other Industries. 26 January 2023. Available online: https://www.edgeir.com/metaspectrals-ai-platform-uses-hyperspectral-imaging-edge-computing-to-transform-space-recycling-and-other-industries-20230125 (accessed on 5 July 2023).
  148. Skulovich, O.; Gentine, P. A Long-term Consistent Artificial Intelligence and Remote Sensing-based Soil Moisture Dataset. Sci. Data 2023, 10, 154. [Google Scholar] [CrossRef] [PubMed]
  149. Esen, B.; Wentworth, J. Remote Sensing and Machine Learning. Parliament.Uk. 19 June 2020. Available online: https://post.parliament.uk/research-briefings/post-pn-0628/ (accessed on 5 July 2023).
  150. Holland, S.; Hosny, A.; Newman, S.; Joseph, J.; Chmielinski, K. The dataset nutrition label. Data Prot. Priv. 2020, 12, 1. [Google Scholar]
  151. Verbesselt, J.; Zeileis, A.; Herold, M. Near real-time disturbance detection using satellite image time series. Remote Sens. Environ. 2012, 123, 98–108. [Google Scholar] [CrossRef]
  152. Dörnhöfer, K.; Oppelt, N. Remote sensing for lake research and monitoring—Recent advances. Ecol. Indic. 2016, 64, 105–122. [Google Scholar] [CrossRef]
  153. Engel-Cox, J.A.; Hoff, R.M.; Haymet, A. Recommendations on the Use of Satellite Remote-Sensing Data for Urban Air Quality. J. Air Waste Manag. Assoc. 2004, 54, 1360–1371. [Google Scholar] [CrossRef]
  154. Wang, Q.; Ma, Y.; Zhao, K.; Tian, Y. A Comprehensive Survey of Loss Functions in Machine Learning. Ann. Data Sci. 2020, 9, 187–212. [Google Scholar] [CrossRef]
  155. Kotsiantis, S.B.; Pintelas, P.E. Mixture of expert agents for handling imbalanced data sets. Ann. Math. Comput. Teleinform. 2003, 1, 46–55. [Google Scholar]
  156. Alzubaidi, L.; Zhang, J.; Humaidi, A.J.; Al-Dujaili, A.; Duan, Y.; Al-Shamma, O.; Santamaría, J.; Fadhel, M.A.; Al-Amidie, M.; Farhan, L. Review of deep learning: Concepts, CNN architectures, challenges, applications, future directions. J. Big Data 2021, 8, 1–74. [Google Scholar] [CrossRef] [PubMed]
  157. Soydaner, D. A comparison of optimization algorithms for deep learning. Int. J. Pattern Recognit. Artif. Intell. 2020, 34, 2052013. [Google Scholar] [CrossRef]
  158. Sheng, V.S.; Provost, F.; Ipeirotis, P.G. Get another label? improving data quality and data mining using multiple, noisy labelers. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Las Vegas, NV, USA, 24–27 August 2008; pp. 614–622. [Google Scholar]
  159. Shan, J.; Aparajithan, S. Urban DEM generation from raw LiDAR data. Photogramm. Eng. Remote Sens. 2005, 71, 217–226. [Google Scholar] [CrossRef]
  160. Gao, F.; Hilker, T.; Zhu, X.; Anderson, M.; Masek, J.; Wang, P.; Yang, Y. Fusing Landsat and MODIS Data for Vegetation Monitoring. IEEE Geosci. Remote Sens. Mag. 2015, 3, 47–60. [Google Scholar] [CrossRef]
  161. Petitjean, F.; Inglada, J.; Gancarski, P. Satellite Image Time Series Analysis Under Time Warping. IEEE Trans. Geosci. Remote Sens. 2012, 50, 3081–3095. [Google Scholar] [CrossRef]
  162. Griffith, D.A.; Chun, Y. Spatial Autocorrelation and Uncertainty Associated with Remotely-Sensed Data. Remote Sens. 2016, 8, 535. [Google Scholar] [CrossRef]
  163. Miura, T.; Huete, A.; Yoshioka, H. Evaluation of sensor calibration uncertainties on vegetation indices for MODIS. IEEE Trans. Geosci. Remote Sens. 2000, 38, 1399–1409. [Google Scholar] [CrossRef]
  164. Güntner, A.; Stuck, J.; Werth, S.; Döll, P.; Verzano, K.; Merz, B. A global analysis of temporal and spatial variations in continental water storage. Water Resour. Res. 2007, 43, W05416. [Google Scholar] [CrossRef]
  165. Alvarez-Vanhard, E.; Corpetti, T.; Houet, T. UAV & satellite synergies for optical remote sensing applications: A literature review. Sci. Remote Sens. 2021, 3, 100019. [Google Scholar] [CrossRef]
  166. Himeur, Y.; Rimal, B.; Tiwary, A.; Amira, A. Using artificial intelligence and data fusion for environmental monitoring: A review and future perspectives. Inf. Fusion 2022, 86–87, 44–75. [Google Scholar] [CrossRef]
  167. von Eschenbach, W.J. Transparency and the black box problem: Why we do not trust AI. Philos. Technol. 2021, 34, 1607–1622. [Google Scholar] [CrossRef]
  168. Kakogeorgiou, I.; Karantzalos, K. Evaluating explainable artificial intelligence methods for multi-label deep learning classification tasks in remote sensing. Int. J. Appl. Earth Obs. Geoinf. 2021, 103, 102520. [Google Scholar] [CrossRef]
  169. Belle, V.; Papantonis, I. Principles and Practice of Explainable Machine Learning. Front. Big Data 2021, 4, 688969. [Google Scholar] [CrossRef] [PubMed]
  170. Shorten, C.; Khoshgoftaar, T.M. A survey on Image Data Augmentation for Deep Learning. J. Big Data 2019, 6, 60. [Google Scholar] [CrossRef]
  171. Torrey, L.; Shavlik, J. Transfer learning. In Handbook of Research on Machine Learning Applications and Trends: Algorithms, Methods, and Techniques; IGI Global: Hershey, PA, USA, 2010; pp. 242–264. [Google Scholar]
  172. Ganaie, M.A.; Hu, M.; Malik, A.K.; Tanveer, M.; Suganthan, P.N. Ensemble deep learning: A review. Eng. Appl. Artif. Intell. 2022, 115, 105151. [Google Scholar] [CrossRef]
  173. Chi, M.; Plaza, A.; Benediktsson, J.A.; Sun, Z.; Shen, J.; Zhu, Y. Big Data for Remote Sensing: Challenges and Opportunities. Proc. IEEE 2016, 104, 2207–2219. [Google Scholar] [CrossRef]
  174. Xie, M.; Jean, N.; Burke, M.; Lobell, D.; Ermon, S. Transfer Learning from Deep Features for Remote Sensing and Poverty Mapping. In Proceedings of the AAAI Conference on Artificial Intelligence, Phoenix, AZ, USA, 12–17 February 2016; Volume 30. [Google Scholar] [CrossRef]
  175. Benchaita, S.; Mccarthy, B.H. IBM and NASA Open Source Largest Geospatial AI Foundation Model on Hugging Face. IBM Newsroom. 3 August 2023. Available online: https://newsroom.ibm.com/2023-08-03-IBM-and-NASA-Open-Source-Largest-Geospatial-AI-Foundation-Model-on-Hugging-Face (accessed on 10 August 2023).
  176. Wang, J.; Lan, C.; Liu, C.; Ouyang, Y.; Qin, T.; Lu, W.; Chen, Y.; Zeng, W.; Yu, P. Generalizing to Unseen Domains: A Survey on Domain Generalization. IEEE Trans. Knowl. Data Eng. 2022, 35, 8052–8072. [Google Scholar] [CrossRef]
  177. Mehrabi, N.; Morstatter, F.; Saxena, N.; Lerman, K.; Galstyan, A. A Survey on Bias and Fairness in Machine Learning. ACM Comput. Surv. 2021, 54, 1–35. [Google Scholar] [CrossRef]
  178. Roselli, D.; Matthews, J.; Talagala, N. Managing bias in AI. In Proceedings of the 2019 World Wide Web Conference, New York, NY, USA, 13–17 May 2019; pp. 539–544. [Google Scholar]
  179. Raji, I.D.; Smart, A.; White, R.N.; Mitchell, M.; Gebru, T.; Hutchinson, B.; Smith-Loud, J.; Theron, D.; Barnes, P. Closing the AI accountability gap: Defining an end-to-end framework for internal algorithmic auditing. In Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency, Barcelona, Spain, 27–30 January 2020; pp. 33–44. [Google Scholar]
  180. Alkhelaiwi, M.; Boulila, W.; Ahmad, J.; Koubaa, A.; Driss, M. An Efficient Approach Based on Privacy-Preserving Deep Learning for Satellite Image Classification. Remote Sens. 2021, 13, 2221. [Google Scholar] [CrossRef]
  181. Zhang, X.; Zhu, G.; Ma, S. Remote-sensing image encryption in hybrid domains. Opt. Commun. 2012, 285, 1736–1743. [Google Scholar] [CrossRef]
  182. Potkonjak, M.; Meguerdichian, S.; Wong, J.L. Trusted sensors and remote sensing. In Proceedings of the SENSORS, 2010 IEEE, Waikoloa, HI, USA, 1–4 November 2010; pp. 1104–1107. [Google Scholar] [CrossRef]
183. Colomina, I.; Molina, P. Unmanned aerial systems for photogrammetry and remote sensing: A review. ISPRS J. Photogramm. Remote Sens. 2014, 92, 79–97. [Google Scholar]
  184. Jain, P.; Coogan, S.C.P.; Subramanian, S.G.; Crowley, M.; Taylor, S.W.; Flannigan, M.D. A review of machine learning applications in wildfire science and management. Environ. Rev. 2020, 28, 478–505. [Google Scholar] [CrossRef]
  185. Bouguettaya, A.; Zarzour, H.; Taberkit, A.M.; Kechida, A. A review on early wildfire detection from unmanned aerial vehicles using deep learning-based computer vision algorithms. Signal Process. 2022, 190, 108309. [Google Scholar] [CrossRef]
  186. Nguyen, G.; Dlugolinsky, S.; Bobák, M.; Tran, V.; García, L.; Heredia, I.; Malík, P.; Hluchý, L. Machine Learning and Deep Learning frameworks and libraries for large-scale data mining: A survey. Artif. Intell. Rev. 2019, 52, 77–124. [Google Scholar] [CrossRef]
  187. Amani, M.; Ghorbanian, A.; Ahmadi, S.A.; Kakooei, M.; Moghimi, A.; Mirmazloumi, S.M.; Moghaddam, S.H.A.; Mahdavi, S.; Ghahremanloo, M.; Parsian, S.; et al. Google Earth Engine Cloud Computing Platform for Remote Sensing Big Data Applications: A Comprehensive Review. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2020, 13, 5326–5350. [Google Scholar] [CrossRef]
188. Garbini, S. How Geospatial AI Can Help You Comply with EU’s Deforestation Law. Picterra. 25 April 2023. Available online: https://picterra.ch/blog/how-geospatial-ai-can-help-you-comply-with-eus-deforestation-law/ (accessed on 6 July 2023).
  189. Mujetahid, A.; Nursaputra, M.; Soma, A.S. Monitoring Illegal Logging Using Google Earth Engine in Sulawesi Selatan Tropical Forest, Indonesia. Forests 2023, 14, 652. [Google Scholar] [CrossRef]
  190. González-Rivero, M.; Beijbom, O.; Rodriguez-Ramirez, A.; Bryant, D.E.; Ganase, A.; Gonzalez-Marrero, Y.; Herrera-Reveles, A.; Kennedy, E.V.; Kim, C.J.; Lopez-Marcano, S.; et al. Monitoring of Coral Reefs Using Artificial Intelligence: A Feasible and Cost-Effective Approach. Remote Sens. 2020, 12, 489. [Google Scholar] [CrossRef]
  191. Lou, R.; Lv, Z.; Dang, S.; Su, T.; Li, X. Application of machine learning in ocean data. Multimedia Syst. 2021, 29, 1815–1824. [Google Scholar] [CrossRef]
  192. Ditria, E.M.; Buelow, C.A.; Gonzalez-Rivero, M.; Connolly, R.M. Artificial intelligence and automated monitoring for assisting conservation of marine ecosystems: A perspective. Front. Mar. Sci. 2022, 9, 918104. [Google Scholar] [CrossRef]
  193. Shafiq, S.I. Artificial intelligence and big data science for oceanographic research in Bangladesh: Preparing for the future. J. Data Acquis. Process. 2023, 38, 418. [Google Scholar]
  194. Weeks, P.J.D.; Gaston, K.J. Image analysis, neural networks, and the taxonomic impediment to biodiversity studies. Biodivers. Conserv. 1997, 6, 263–274. [Google Scholar] [CrossRef]
  195. Silvestro, D.; Goria, S.; Sterner, T.; Antonelli, A. Improving biodiversity protection through artificial intelligence. Nat. Sustain. 2022, 5, 415–424. [Google Scholar] [CrossRef]
  196. Toivonen, T.; Heikinheimo, V.; Fink, C.; Hausmann, A.; Hiippala, T.; Järv, O.; Tenkanen, H.; Di Minin, E. Social media data for conservation science: A methodological overview. Biol. Conserv. 2019, 233, 298–315. [Google Scholar] [CrossRef]
  197. Tong, D.Q.; Gill, T.E.; Sprigg, W.A.; Van Pelt, R.S.; Baklanov, A.A.; Barker, B.M.; Bell, J.E.; Castillo, J.; Gassó, S.; Gaston, C.J.; et al. Health and Safety Effects of Airborne Soil Dust in the Americas and Beyond. Rev. Geophys. 2023, 61, e2021RG000763. [Google Scholar] [CrossRef]
198. Alnuaim, A.; Sun, Z.; Islam, D. AI for improving ozone forecasting. In Artificial Intelligence in Earth Science; Elsevier: Amsterdam, The Netherlands, 2023; pp. 247–269. [Google Scholar]
199. Bragazzi, N.L.; Dai, H.; Damiani, G.; Behzadifar, M.; Martini, M.; Wu, J. How Big Data and Artificial Intelligence Can Help Better Manage the COVID-19 Pandemic. Int. J. Environ. Res. Public Health 2020, 17, 3176. [Google Scholar] [CrossRef]
  200. Alnaim, A.; Sun, Z.; Tong, D. Evaluating Machine Learning and Remote Sensing in Monitoring NO2 Emission of Power Plants. Remote Sens. 2022, 14, 729. [Google Scholar] [CrossRef]
  201. Vaishya, R.; Javaid, M.; Khan, I.H.; Haleem, A. Artificial Intelligence (AI) applications for COVID-19 pandemic. Diabetes Metab. Syndr. Clin. Res. Rev. 2020, 14, 337–339. [Google Scholar] [CrossRef]
  202. Lim, K.; Treitz, P.; Wulder, M.; St-Onge, B.; Flood, M. LiDAR remote sensing of forest structure. Prog. Phys. Geogr. Earth Environ. 2003, 27, 88–106. [Google Scholar] [CrossRef]
  203. Liu, L.; Zhang, Q.; Guo, Y.; Chen, E.; Li, Z.; Li, Y.; Wang, B.; Ri, A. Mapping the Distribution and Dynamics of Coniferous Forests in Large Areas from 1985 to 2020 Combining Deep Learning and Google Earth Engine. Remote Sens. 2023, 15, 1235. [Google Scholar] [CrossRef]
  204. Sharma, M.K.; Mujawar, R.; Mujawar, A.; Dhayalini, K. Precision Forestry: Integration of Robotics and Sensing Technologies for Tree Measurement and Monitoring. Eur. Chem. Bull. 2023, 12, 4747–4764. [Google Scholar]
  205. Stereńczak, K. Precision Forestry. IDEAS NCBR—Intelligent Algorithms for Digital Economy. 13 April 2023. Available online: https://ideas-ncbr.pl/en/research/precision-forestry/ (accessed on 5 July 2023).
  206. Liu, W.; Quijano, K.; Crawford, M.M. YOLOv5-Tassel: Detecting Tassels in RGB UAV Imagery With Improved YOLOv5 Based on Transfer Learning. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2022, 15, 8085–8094. [Google Scholar] [CrossRef]
207. Jayasinghe, A.; Ranaweera, N.; Abenayake, C.; Bandara, N.; De Silva, C. Modelling vegetation land fragmentation in urban areas of Western Province, Sri Lanka using an Artificial Intelligence-based simulation technique. PLoS ONE 2023, 18, e0275457. [Google Scholar]
  208. Kolokotroni, M.; Zhang, Y.; Watkins, R. The London Heat Island and building cooling design. Sol. Energy 2007, 81, 102–110. [Google Scholar] [CrossRef]
  209. Lyu, F.; Wang, S.; Han, S.Y.; Catlett, C.; Wang, S. An integrated cyberGIS and machine learning framework for fine-scale prediction of Urban Heat Island using satellite remote sensing and urban sensor network data. Urban Inform. 2022, 1, 1–15. [Google Scholar] [CrossRef]
  210. Rahman, A.; Roy, S.S.; Talukdar, S.; Shahfahad (Eds.) Advancements in Urban Environmental Studies: Application of Geospatial Technology and Artificial Intelligence in Urban Studies; Springer International Publishing: Cham, Switzerland, 2023. [Google Scholar]
211. Alnaim, A.; Sun, Z. Using Geoweaver to Make Snow Mapping Workflow FAIR. In Proceedings of the 2022 IEEE 18th International Conference on e-Science (e-Science), Salt Lake City, UT, USA, 11–14 October 2022; pp. 409–410. [Google Scholar]
  212. Yang, K.; John, A.; Sun, Z.; Cristea, N. Machine learning for snow cover mapping. In Artificial Intelligence in Earth Science; Elsevier: Amsterdam, The Netherlands, 2023; pp. 17–39. [Google Scholar]
  213. An, S.; Rui, X. A High-Precision Water Body Extraction Method Based on Improved Lightweight U-Net. Remote. Sens. 2022, 14, 4127. [Google Scholar] [CrossRef]
  214. Al-Bakri, J.T.; D’Urso, G.; Calera, A.; Abdalhaq, E.; Altarawneh, M.; Margane, A. Remote Sensing for Agricultural Water Management in Jordan. Remote Sens. 2022, 15, 235. [Google Scholar] [CrossRef]
  215. Xiang, X.; Li, Q.; Khan, S.; Khalaf, O.I. Urban water resource management for sustainable environment planning using artificial intelligence techniques. Environ. Impact Assess. Rev. 2020, 86, 106515. [Google Scholar] [CrossRef]
  216. Sun, A.Y.; Scanlon, B.R. How can Big Data and machine learning benefit environment and water management: A survey of methods, applications, and future directions. Environ. Res. Lett. 2019, 14, 073001. [Google Scholar] [CrossRef]
  217. Sun, W.; Bocchini, P.; Davison, B.D. Applications of artificial intelligence for disaster management. Nat. Hazards 2020, 103, 2631–2689. [Google Scholar] [CrossRef]
  218. Chapman, A. Leveraging Big Data and AI for Disaster Resilience and Recovery; Texas A&M University College of Engineering: College Station, TX, USA, 2023; Available online: https://engineering.tamu.edu/news/2023/06/leveraging-big-data-and-ai-for-disaster-resilience-and-recovery.html (accessed on 10 August 2023).
  219. Imran, M.; Castillo, C.; Lucas, J.; Meier, P.; Vieweg, S. AIDR: Artificial Intelligence for Disaster Response. In Proceedings of the 23rd International Conference on World Wide Web, Seoul, Republic of Korea, 7–11 April 2014; ACM: New York, NY, USA; pp. 159–162. [Google Scholar]
  220. Gevaert, C.M.; Carman, M.; Rosman, B.; Georgiadou, Y.; Soden, R. Fairness and accountability of AI in disaster risk management: Opportunities and challenges. Patterns 2021, 2, 100363. [Google Scholar] [CrossRef] [PubMed]
  221. Cao, L. AI and Data Science for Smart Emergency, Crisis and Disaster Resilience. Int. J. Data Sci. Anal. 2023, 15, 231–246. [Google Scholar] [CrossRef]
Figure 1. Simplified representation of the electromagnetic spectrum (adapted from https://crisp.nus.edu.sg/~research/tutorial/em.htm, accessed on 30 July 2023).
Figure 2. (a) Passive remote sensing: the sensor receives information. (b) Active remote sensing: the sensor emits and receives information.
Figure 3. The basic mechanism of optical remote sensing: sensors record information received as a function of wavelength and atmospheric conditions.
Figure 4. Radar sensor: converts microwave signals into electrical signals.
Figure 5. LiDAR sensor: detects objects at a distance D based on the speed of light, c, and the time between the light being emitted and being detected. Multiple returns assist in mapping objects with complex shapes. The yellow wave indicates multiple reflected returned rays, while the red-to-black gradient ray and the adjacent black wave represent the laser pulse.
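The ranging principle in Figure 5 reduces to a single relation: the emitted pulse travels to the target and back, so the one-way distance is half the round-trip travel time multiplied by the speed of light. As a worked form (D and c taken from the caption; Δt denotes the emit-to-detect interval):

```latex
% One-way range from a round-trip time-of-flight measurement
D = \frac{c \, \Delta t}{2}, \qquad c \approx 3 \times 10^{8}\ \mathrm{m/s}
% Example: \Delta t = 2\ \mu\mathrm{s} \;\Rightarrow\;
% D = (3 \times 10^{8} \cdot 2 \times 10^{-6}) / 2 = 300\ \mathrm{m}
```

Each of the multiple returns from one pulse yields its own Δt, and therefore its own D, which is how a single pulse can map both a canopy top and the ground beneath it.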
Figure 6. Land surface temperature sensed by ECOSTRESS during the 2021 Pacific Northwest heatwave. Image Courtesy: NASA, https://earthobservatory.nasa.gov/images/148506/exceptional-heat-hits-pacific-northwest, accessed on 3 August 2023.
Figure 7. Illustration of a basic DCNN architecture.
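To make the convolution → pooling → fully connected pattern of Figure 7 concrete, the following is a minimal sketch assuming PyTorch, 4-band 32 × 32 input patches, and 10 land-cover classes; all of these choices are illustrative, not values from the figure.

```python
# Minimal DCNN sketch: two conv/pool stages followed by a classifier head.
import torch
import torch.nn as nn

class SmallDCNN(nn.Module):
    def __init__(self, in_bands: int = 4, n_classes: int = 10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(in_bands, 16, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),  # 32x32 -> 16x16
            nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),  # 16x16 -> 8x8
        )
        self.classifier = nn.Linear(32 * 8 * 8, n_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = self.features(x)
        x = torch.flatten(x, 1)
        return self.classifier(x)  # raw logits; softmax is applied in the loss

model = SmallDCNN()
patch = torch.randn(1, 4, 32, 32)  # one 4-band 32x32 image patch
print(model(patch).shape)          # torch.Size([1, 10])
```

The head returns raw logits; a softmax (or a softmax-based cross-entropy loss) converts them to class probabilities, matching the fully connected + softmax stage described in Table 2.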
Figure 8. YOLO workflow: the output shows identified objects from the original image. Darknet has been replaced in later versions of YOLO by other frameworks.
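Since later YOLO versions moved from Darknet to other frameworks, a typical invocation today goes through the Ultralytics PyTorch package. The snippet below is a hedged usage sketch: the weight file and image path are placeholders, and the 0.25 confidence threshold is an arbitrary illustrative value.

```python
# Hedged sketch of single-image inference with Ultralytics YOLO (v8-era API).
from ultralytics import YOLO

model = YOLO("yolov8n.pt")                       # small pretrained model
results = model("aerial_scene.jpg", conf=0.25)   # placeholder image path
for r in results:
    for box in r.boxes:                          # NMS-filtered bounding boxes
        print(r.names[int(box.cls)], float(box.conf), box.xyxy.tolist())
```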
Figure 9. Simplified GAN architecture.
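The generator-versus-discriminator loop in Figure 9 can be sketched in a few lines. This is a toy, assumption-laden example (PyTorch, 64-dimensional noise vectors, patches flattened to 784 values, random tensors standing in for real imagery), not a production GAN.

```python
# Minimal adversarial training step, assuming PyTorch; dimensions are toy choices.
import torch
import torch.nn as nn

G = nn.Sequential(nn.Linear(64, 256), nn.ReLU(),
                  nn.Linear(256, 784), nn.Tanh())      # generator
D = nn.Sequential(nn.Linear(784, 256), nn.LeakyReLU(0.2),
                  nn.Linear(256, 1))                   # discriminator (logits)

opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)
opt_d = torch.optim.Adam(D.parameters(), lr=2e-4)
bce = nn.BCEWithLogitsLoss()

real = torch.rand(32, 784) * 2 - 1   # placeholder "real" patches in [-1, 1]
z = torch.randn(32, 64)              # noise input to the generator

# Discriminator step: push real patches toward label 1, generated toward 0.
fake = G(z).detach()                 # detach so this step does not update G
loss_d = bce(D(real), torch.ones(32, 1)) + bce(D(fake), torch.zeros(32, 1))
opt_d.zero_grad(); loss_d.backward(); opt_d.step()

# Generator step: update G so that D labels its output as real (label 1).
loss_g = bce(D(G(z)), torch.ones(32, 1))
opt_g.zero_grad(); loss_g.backward(); opt_g.step()
```

Alternating these two steps is the adversarial game itself; the training instability and mode collapse noted in Table 2 arise precisely from this alternating optimization.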
Figure 10. Esri Land Cover Explorer screenshot.
Figure 11. AI with remote sensors for coastal and marine ecosystem monitoring.
Figure 12. Urban heat island illustration.
Table 1. Summary of various types of remote sensing techniques.
Technique: Optical remote sensing
Advantages:
- captures reflected solar radiation and emitted thermal radiation for analysis within the visible and near-infrared spectrum bands
- provides various sensor types for the collection of handheld, airborne, and spaceborne data
- offers extensive coverage and repeated observations over time with spaceborne sensors
Limitations:
- atmospheric conditions can impact data accuracy; sun angles and shadows impose further limits
- night-time data are not available; acquisition is a single snapshot
- clouds limit visibility and can hinder data collection; optical sensors cannot penetrate clouds
- cost and availability of high-resolution data
Sample applications:
- land-use mapping and crop health assessment
- monitoring vegetation
- monitoring climate change

Technique: Radar remote sensing
Advantages:
- operates in the microwave region, providing valuable data on the distance, direction, shape, size, roughness, and dielectric properties of targets
- enables accurate mapping even in challenging weather or limited-visibility conditions
- utilizes dual-polarization technology for enhanced forest cover mapping
Limitations:
- data processing can be complex, particularly for synthetic aperture radar (SAR) systems
- lack of spectral information and limited penetration through some materials
- high sensitivity to surface roughness
Sample applications:
- mapping land surfaces and monitoring weather patterns
- studying ocean currents
- detecting buildings, vehicles, and changes in forest cover

Technique: LiDAR
Advantages:
- provides precise distance and elevation measurements of ground objects
- high-resolution 3D data
- penetration of vegetation
- day and night operation
- multiple returns from a single laser pulse and reduced atmospheric interference
Limitations:
- data processing complexity, especially for full-waveform LiDAR systems
- accuracy dependent on elevation and angle
- high cost and limited availability
- limited penetration through thick, dense vegetation
Sample applications:
- creating accurate and detailed 3D maps of trees, buildings, pipelines, etc.

Technique: Thermal remote sensing
Advantages:
- measures radiant flux emitted by ground objects within specific wavelength ranges
- provides information on the emissivity, reflectivity, and temperature of target objects
Limitations:
- atmospheric conditions, changes in solar illumination, and target variations can impact data accuracy
Sample applications:
- agricultural and environmental monitoring (e.g., fire detection, urban heat islands)

Technique: Multispectral and hyperspectral imaging
Advantages:
- captures a broad range of wavelengths, including infrared and ultraviolet, for comprehensive data collection
- HSI provides valuable insights into material composition, structure, and condition
Limitations:
- high-dimensional and noisy HSI data pose analysis challenges
- limited spectral resolution in multispectral imaging
Sample applications:
- recognition of vegetation patterns such as greenness, vitality, and biomass (a minimal NDVI sketch follows this table)
- studying material properties (e.g., physical and chemical alterations)
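As a small illustration of the vegetation-pattern applications in the last row, the following sketch computes the Normalized Difference Vegetation Index (NDVI) from a near-infrared and a red band; the 2 × 2 reflectance arrays are made-up stand-ins for real multispectral bands.

```python
# Minimal NDVI computation with NumPy on toy reflectance arrays.
import numpy as np

def ndvi(nir: np.ndarray, red: np.ndarray) -> np.ndarray:
    """Normalized Difference Vegetation Index: (NIR - Red) / (NIR + Red)."""
    nir = nir.astype(np.float64)
    red = red.astype(np.float64)
    denom = nir + red
    # Guard against division by zero over pixels where both bands are ~0.
    return np.where(denom == 0, 0.0, (nir - red) / denom)

nir = np.array([[0.45, 0.50], [0.30, 0.05]])  # placeholder NIR reflectance
red = np.array([[0.10, 0.08], [0.20, 0.05]])  # placeholder red reflectance
print(ndvi(nir, red))  # values near +1 indicate dense, healthy vegetation
```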
Table 2. AI models comparison table.
Technique: RF
Advantages:
- effectively handles multi-temporal and multi-sensor remote sensing data
- provides variable importance measurements for feature selection
- enhances generalization and reduces computational load and redundancy
- RF feature selection prioritizes informative variables by evaluating their interrelationships and discriminating ability in high-dimensional remote sensing data, leading to more accurate classification results
Limitations:
- can be sensitive to the choice of hyperparameters
- does not guarantee that the selected features will be the best for all tasks
Applications:
- classification of remote sensing data (see the sketch after this table)
- object detection in remote sensing

Technique: XGBoost
Advantages:
- handles cases where different classes exhibit similar spectral signatures
- effectively differentiates classes with subtle spectral differences, enhancing classification performance
- hyperparameter tuning techniques help ensure optimal accuracy and prevent overfitting
Limitations:
- hyperparameter sensitivity
- prone to overfitting
- slower than RF
Applications:
- classification of remote sensing data with high accuracy and robustness

Technique: DCNNs
Advantages:
- efficiently handle intricate patterns and features in remote sensing images
- learn hierarchical representations of features through convolution and pooling layers
- enable accurate recognition of objects through fully connected layers with softmax activation
Limitations:
- training DCNNs can be computationally expensive, especially for large-scale datasets
- may suffer from vanishing gradients or overfitting if not properly regularized
Applications:
- remote sensing image recognition and classification
- object detection tasks in remote sensing using RPNs

Technique: ResNets
Advantages:
- alleviate the degradation problem in deep learning models, allowing the training of much deeper networks
- handle complex, high-dimensional, and noisy remote sensing data
Limitations:
- implementing very deep networks may still require significant computational resources
Applications:
- image recognition and object detection

Technique: YOLO
Advantages:
- efficiently identifies and classifies multiple objects in large datasets of images or video frames
- processes the entire image and region proposals simultaneously
- utilizes NMS to remove overlapping bounding boxes and improve precision
Limitations:
- may struggle with the detection of small objects in low-resolution images
- requires careful anchor box design for accurate bounding box predictions
Applications:
- real-time object detection and segmentation in remote sensing images

Technique: Self-attention methods
Advantages:
- capture long-range dependencies in sequences and handle spatial and spectral dependencies in remote sensing data
- provide access to all elements in a sequence, enabling a comprehensive understanding of dependencies
Limitations:
- transformer models can be memory-intensive due to their self-attention mechanism
- the number of attention heads and layers must be tuned properly for optimal performance
Applications:
- sequence modeling and image classification of remote sensing data
- time series analysis of remote sensing data, capturing diverse pixel relationships regardless of spatial distance

Technique: LSTM
Advantages:
- effectively captures long-term dependencies in sequences
- overcomes the vanishing gradient problem with gate mechanisms
Limitations:
- training LSTMs can be time-consuming, particularly for longer sequences
- can struggle to capture very long-term dependencies in sequences
- may require careful tuning of hyperparameters to prevent overfitting
Applications:
- sequence modeling and time series analysis of remote sensing data

Technique: GANs
Advantages:
- capable of handling complex, high-dimensional data distributions with limited or no annotated training data
- as a data augmentation method, enhance the performance of data-reliant deep learning models
Limitations:
- training GANs can be challenging and unstable, requiring careful hyperparameter tuning
- generating high-quality, realistic images may be difficult in some cases
- may suffer from mode collapse, where the generator produces limited variations in images
Applications:
- image-to-image translation tasks, such as converting cloud-covered satellite images into cloud-free versions with CycleGAN
- enhancing the resolution of low-resolution satellite images with SRGAN and similar approaches
- data augmentation and pan-sharpening

Technique: DRL
Advantages:
- learns from unlabeled data to improve decision-making processes
- combines reinforcement learning (RL) with deep neural networks to solve complex problems
- handles redundant spectral information
Limitations:
- requires careful design and tuning of reward functions to ensure the desired behavior
- training deep neural networks in DRL can be computationally expensive and time-consuming
- the exploration vs. exploitation trade-off in RL can affect the learning process and can be sample-dependent
Applications:
- improving unsupervised band selection in hyperspectral image classification using DRL with DQN
- image processing applications that analyze large amounts of data
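To ground the RF row above, here is a minimal scikit-learn sketch of pixel-wise classification from band values; the data are synthetic (a toy "NIR greater than red means vegetation" rule), so only the workflow, not the labels, should be taken literally.

```python
# Random forest classifying pixels from spectral band values (synthetic data).
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.random((500, 6))             # 500 pixels x 6 spectral bands
y = (X[:, 3] > X[:, 2]).astype(int)  # toy rule: "NIR > red" => vegetation

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.25, random_state=0)
clf = RandomForestClassifier(n_estimators=200, random_state=0).fit(X_tr, y_tr)

print("accuracy:", clf.score(X_te, y_te))
print("band importances:", clf.feature_importances_)
```

The `feature_importances_` vector is the variable importance measurement cited as an RF advantage: in a real pipeline it indicates which bands drive the classification.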
