Next Article in Journal
Statistical Physics Approach to Liquid Crystals: Dynamics of Mobile Potts Model Leading to Smectic Phase, Phase Transition by Wang–Landau Method
Next Article in Special Issue
Canine Olfactory Detection of a Non-Systemic Phytobacterial Citrus Pathogen of International Quarantine Significance
Previous Article in Journal
A Comparison of Variational Bounds for the Information Bottleneck Functional
Previous Article in Special Issue
Mutual Information as a Performance Measure for Binary Predictors Characterized by Both ROC Curve and PROC Curve Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Analysis of HIV/AIDS Epidemic and Socioeconomic Factors in Sub-Saharan Africa

College of Mathematics and System Science, Xinjiang University, Urumqi 830046, China
*
Author to whom correspondence should be addressed.
Entropy 2020, 22(11), 1230; https://doi.org/10.3390/e22111230
Submission received: 21 September 2020 / Revised: 21 October 2020 / Accepted: 26 October 2020 / Published: 29 October 2020
(This article belongs to the Special Issue Applications of Information Theory to Epidemiology)

Abstract

:
Sub-Saharan Africa has been the epicenter of the outbreak since the spread of acquired immunodeficiency syndrome (AIDS) began to be prevalent. This article proposes several regression models to investigate the relationships between the HIV/AIDS epidemic and socioeconomic factors (the gross domestic product per capita, and population density) in ten countries of Sub-Saharan Africa, for 2011–2016. The maximum likelihood method was used to estimate the unknown parameters of these models along with the Newton–Raphson procedure and Fisher scoring algorithm. Comparing these regression models, there exist significant spatiotemporal non-stationarity and auto-correlations between the HIV/AIDS epidemic and two socioeconomic factors. Based on the empirical results, we suggest that the geographically and temporally weighted Poisson autoregressive (GTWPAR) model is more suitable than other models, and has the better fitting results.

1. Introduction

Acquired immunodeficiency syndrome (AIDS) is a malignant infectious disease with a high fatality rate caused by human immunodeficiency virus (HIV). The HIV/AIDS epidemic has been one of the greatest global public health and social development problems since 1981, particularly in Sub-Saharan Africa. As of 31 December 2016, over 30 million people had died from the disease [1]. More than 70% of the 35 million people are infected with the HIV/AIDS disease in Sub-Saharan Africa. Thus, the HIV/AIDS epidemic of Sub-Saharan Africa has attracted extensive attention from researchers around the world [2,3,4].
In earlier studies, Janet et al. [5] and Hallman et al. [6] demonstrated the relationship between the disease and socioeconomic status. Chris et al. [7] indicated socioeconomic factors to explain this disease outperformed cultural ones in South Africa. Mathematical models always play an important role in evaluating the trends of the HIV/AIDS epidemic [8]. For example, regression models have been widely used in the study of the relationship between this disease and influencing factors. Shiboski et al. [9] considered a generalized linear model to obtain the statistical analysis of the HIV/AIDS disease. A mixed-effects linear regression model was used to analyze the correlation between national population and antenatal care [10]. Laurence et al. [11] applied a spatial regression model to show that the epidemic had substantial geographic variance across Sub-Saharan Africa.
This paper proposes several regressive models to investigate the relationships between the HIV/AIDS epidemic, the gross domestic product (GDP) per capita and the population density in ten countries of Sub-Saharan Africa. The Poisson regression model is introduced in Section 2.1. Section 2.2 and Section 2.3 describe two spatial models, respectively. A spatiotemporal autoregressive model is proposed in Section 2.4. The maximum likelihood method is used to obtain the iterative formulas of coefficient estimations in Section 3. The main results are shown in Section 4, followed by discussion in Section 5.

2. Methodologies

2.1. Poisson Regression Model

Regression models are a set of statistical processes for estimating the relationships between response and explanatory variables. The classical model is a linear regression. Nelder and Wedderburn [12] extended the linear model to a generalized linear regression for solving the discrete data problem. This kind of models are very important in ecology, medicine and economics [13,14,15]. Suppose that Y = ( Y 1 , Y 2 , , Y n ) is the response variable, where Y i ( i = 1 , , n ) are independent. The density function is
f ( y i ; θ i , ϕ i ) = exp y i θ i b ( θ i ) a ( ϕ i ) + c ( y i , ϕ i ) ,
where a ( · ) , b ( · ) , c ( · , · ) are known functions, and θ i , ϕ i are unknown parameters for i = 1 , 2 , , n . Denote μ i = E ( Y i ) , and g ( μ i ) = ln ( μ i ) is a link function. Let X i j be explanatory variables for the ith observation in the jth variable. Then, the Poisson regression (PR) model is given by
g ( μ i ) η i = j = 1 p β j X i j ,
where i = 1 , 2 , , n , and β j ( j = 1 , 2 , , p ) are unknown parameters.

2.2. Geographically Weighted Poisson Regression Model

With in-depth study, regression models have been frequently applied in epidemiology and health geography for trying to investigate the persistent geographical variations in disease [16]. Based on the generalized linear regression, Brunsdon et al. [17] proposed the geographically weighted regression model to analyze the spatial non-stationary processes of discrete data. The disease maps arising from this process are considered through the establishment of the geographically weighted Poisson regression (GWPR) model [18,19,20] below
g ( μ i ) η i = j = 1 p β j ( u i , v i ) X i j ,
where ( u i , v i ) ( i = 1 , 2 , , n ) are the geographical locations, and β j ( u i , v i ) ( j = 1 , 2 , , p ) are unknown parameters at the position ( u i , v i ) .

2.3. Geographically Weighted Poisson Autoregressive Model

Another issue deserving of special attention is whether there exists an interaction between different regions in terms of spatial data. Previous studies [21,22,23,24] showed that spatial data has not only spatial non-stationarity but also correlation. Zhang [25] proposed the geographically weighted Poisson autoregressive (GWPAR) model as follows:
g ( μ i ) η i = ρ k = 1 n c i k η k + j = 1 p β j ( u i , v i ) X i j ,
where ρ is a scalar autoregressive parameter, and c i k ( i , k = 1 , 2 , , n ) is the adjacency relation between the ith and kth locations. Let c i be the number of regions adjacent to the ith position. If the kth position is next to the ith’s, then c i k = 1 / c i . Otherwise, c i k = 0 .

2.4. Geographically and Temporally Weighted Poisson Autoregressive Model

Recently, many spatiotemporal models have been proposed to describe the spatiotemporal variations in the relationships of response and explanatory variables [26,27]. Concerning the modeling of spatiotemporal data, there are two important properties: non-stationarity and auto-correlation. The non-stationarity indicates that there exists more than one linear relation between response and explanatory variables. It can be used to identify where interesting relationships are likely to occur or where detailed investigation is necessary in the study areas [28]. Spatiotemporal auto-correlation is an important factor to determine the temporal correlations of observations [29]. These two problems always appeared together [30]. A geographically and temporally weighted autoregressive (GTWPAR) model can be applied to account for non-stationary and auto-correlated effects simultaneously.
Let Y be the response variable, and Y i k ( i = 1 , 2 , , n k , k = 1 , 2 , , T ) be the independent variables of Y in the ith position and the kth time. The density function can be defined as follows:
f ( y i k ; θ i k , ϕ i k ) = exp y i k θ i k b ( θ i k ) a ( ϕ i k ) + c ( y i k , ϕ i k ) ,
where the parameters are similar to Section 2.1. Denote μ i k = E ( Y i k ) , and g ( μ i k ) = ln ( μ i k ) . Let X i j k ( j = 1 , 2 , , p ) be the jth explanatory variable. The GTWPAR model is expressed by
g ( μ i k ) η i k = ρ m = 1 T l = 1 n k c l m ( i k ) η l m + j = 1 p k = 1 T β j k ( u i k , v i k , t k ) X i j k ,
where { β j k ( u i k , v i k , t k ) } is a set of unknown parameters at the ith position in the kth time, and c l m ( i k ) is the adjacent relation between the location ( u i k , v i k , t k ) and ( u l m , v l m , t m ) . Following the work of [31], the spatiotemporal distance between the locations ( u i k , v i k , t k ) and ( u l m , v l m , t m ) can be defined as
d l m ( i k ) = λ [ ( u i k u l m ) 2 + ( v i k v l m ) 2 ] + μ ( t k t m ) 2 ,
where μ and λ are used to balance spatiotemporal distances. Suppose that
c l m ( i k ) = 1 / c i k , 0 < d l m ( i k ) < d , 0 , otherwise ,
where d is a constant and satisfies min { d l m ( i k ) } < d < max { d l m ( i k ) } .
Next, we rewrite the model (4) in a matrix form
η = ρ C η + B X ,
where η = ( η 11 , , η n 1 1 , η 12 , , η n 2 2 , , η 1 T , , η n T T ) , C = ( c l m ( i k ) ) , X = ( X i j k ) and B = ( β j k ( u i k , v i k , t k ) ) . For convenience, define η K as the Kth element of η ; C I K and X I K are the Ith row and the Kth column of the matrices C and X, respectively. The detailed expressions of C, X and B are given in Appendix A.1.
Remark 1.
For the GTWPAR model (4), if ρ = 0 and β j k ( u i k , v i k , t k ) is independent of the spatiotemporal effect, the model is a PR model. If ρ = 0 and β j k ( u i k , v i k , t k ) is dependent on spatial effect but independent of temporal effect, the model becomes GWPR model. If ρ 0 and β j k ( u i k , v i k , t k ) is independent of temporal effect, it is the GWPAR model. Thus, PR, GWPR and GWPAR models are the special cases of the GTWPAR model.

3. Coefficient Estimation

In this section, we only provide the estimation method of the GTWPAR model since the PR, GWPR and GWPAR models are its special cases (Remark 1). Let ( u i k , v i k , t k ) ( i = 1 , 2 , , n k , k = 1 , 2 , , T ) be any point in the studied spatiotemporal region. We fix a point ( u 00 , v 00 , t 0 ) and assume that β j k ( u i k , v i k , t k ) β j 0 ( u 00 , v 00 , t 0 ) ( j = 1 , 2 , , p ) . Then, the model (4) can be rewritten by
η i k = g ( μ i k ) = ρ m = 1 T l = 1 n k c l m ( i k ) η l m + j = 1 p k = 1 T β j 0 ( u 00 , v 00 , t 0 ) X i j k .
Denote β ( u 00 , v 00 , t 0 ) = ( β 10 , , β p 0 ) , X = d i a g ( X i . ) and X i · = ( X i 1 , , X i p ) . The corresponding matrix form can be represented as η = ρ C η + β ( u 00 , v 00 , t 0 ) X .

3.1. Estimation of Parameter Vector β

For the fixed point ( u 00 , v 00 , t 0 ) , we define a spatiotemporal distance d i k ( 0 ) from this point to ( u i k , v i k , t k ) as d i k ( 0 ) = λ [ ( u 00 u i k ) 2 + ( v 00 v i k ) 2 ] + μ ( t 0 t k ) 2 . The Gauss kernel function of these two points can be written by
w i k ( u 00 , v 00 , t 0 ) = 1 2 π exp 1 2 d i k ( 0 ) h S T 2 = 1 2 π exp 1 2 λ [ ( u 00 u i k ) 2 + ( v 00 v i k ) 2 ] + μ ( t 0 t k ) 2 h S T 2 = 1 2 π exp 1 2 ( u 00 u i k ) 2 + ( v 00 v i k ) 2 h S 2 + ( t 0 t k ) 2 τ h S 2 ,
where h S T and h S are the space-time bandwidth and space bandwidth, respectively. Meanwhile, we have h S T 2 = λ h S 2 , and τ = λ / μ is a spatiotemporal factor. Without loss of generality, let λ = 1 . Then, the weighted maximum likelihood of Y i k ( i = 1 , 2 , , n k , k = 1 , 2 , , T ) at the point ( u 00 , v 00 , t 0 ) is
L ( β 10 , β 20 , , β p 0 ) = k = 1 T i = 1 n k f ( y i k ; θ i k , ϕ i k ) w i k ( u 00 , v 00 , t 0 ) ,
where f ( y i k ; θ i k , ϕ i k ) is the density function. The log-likelihood can be obtained as follows:
L 1 ( β ( u 00 , v 00 , t 0 ) ) = k = 1 T i = 1 n k y i k θ i k b ( θ i k ) a ( ϕ i k ) + c ( y i k , ϕ i k ) w i k ( u 00 , v 00 , t 0 ) .
Note that c ( y i k , ϕ i k ) = ln ( y i k ! ) , b ( θ i k ) = μ i k = exp ( θ i k ) , and a ( ϕ i k ) = ϕ i k = 1 . Thus, E ( Y i k ) = b ( θ i k ) = exp ( θ i k ) = μ i k , V a r ( Y i k ) = b ( θ i k ) a ( ϕ i k ) = exp ( θ i k ) = μ i k . Differentiating L 1 with respect to β ( u 00 , v 00 , t 0 ) yields
L 1 β r 0 = k = 1 T i = 1 n k y i k μ i k a i k ϕ θ i k β r 0 w i k ( u 00 , v 00 , t 0 ) = 0 ,
where β r 0 = β r ( u 00 , v 00 , t 0 ) ( r = 1 , 2 , , p ) , and
θ i k β r 0 = μ i k θ i k 1 μ i k g ( μ i k ) g ( μ i k ) β r 0 = 1 b ( θ i k ) 1 g ( μ i k ) η i k β r 0 .
For convenience, let N = k = 1 N n k and W = ( w i k ( u 00 , v 00 , t 0 ) ) N × N . Denote A = ( I N ρ C ) 1 , Y = ( Y 11 , , Y n 1 1 , , Y 1 T , , Y n T T ) , μ = ( μ 11 , , μ n 1 1 , , μ 1 T , , μ n T T ) , θ = ( θ 11 , , θ n 1 1 , , θ 1 T , , θ n T T ) , ϕ = ( ϕ 11 , , ϕ n 1 1 , , ϕ 1 T , , ϕ n T T ) . Suppose that Y K , μ K , θ K and ϕ K are the Kth elements of Y, μ , θ and ϕ , respectively. Then, we take the derivative of the model (5) with respect to β r 0 , and obtain
η l β r 0 = h = 1 N A l h X h r = A l · X · r , l = 1 , 2 , , N .
The calculation process is given in Appendix A.2. Thus, the Equation (6) can be rewritten as
L 1 β r 0 = 1 ϕ l = 1 N T l A l · X · r ( Y l μ l ) g ( μ l ) W l ( u 00 , v 00 , t 0 ) = 0 .
However, there is not a close-form solution for β ( u 00 , v 00 , t 0 ) . The Newton–Raphson procedure and Fisher scoring algorithm are used to get the estimation of β . The iterative formula is expressed as
β ^ ( m + 1 ) ( u 00 , v 00 , t 0 ) = β ^ ( m ) ( u 00 , v 00 , t 0 ) + I 1 ( β ^ ( m ) ( u 00 , v 00 , t 0 ) ) S ( β ^ ( m ) ( u 00 , v 00 , t 0 ) ) = ( ( A ( m ) X ) T ( m ) W ( u 00 , v 00 , t 0 ) ( A ( m ) X ) ) 1 × ( A ( m ) X ) T ( m ) W ( u 00 , v 00 , t 0 ) Z ( m ) ,
where the Fisher information matrix I ( β ) = E ( I ( β ) ) , and
S ( β ^ ( m ) ( u 00 , v 00 , t 0 ) ) = L 1 β 10 , L 1 β 20 , , L 1 β p 0
is the scalar vector. The detail process is provided in Appendix A.2. For the fixed point ( u i k , v i k , t k ) ( i = 1 , 2 , , n k ; k = 1 , 2 , , T ) , β ^ j k ( u i k , v i k , t k ) can be obtained by (7).
Remark 2.
The estimations β ^ ( u i k , v i k , t k ) ( i , l = 1 , 2 , , n k , k , m = 1 , 2 , , T ) are related to the temporal and spatial effects in the GTWPAR model. If m k , w i k ( u l m , v l m , t m ) = 0 and c l m ( i k ) = 0 , then β ^ ( u i k , v i k , t k ) = β ^ ( u i , v i ) correspond to the parameter estimations of the GWPAR model. If w i k ( u l m , v l m , t m ) = 0 ( m k ) and C = 0 , they are the estimations of the GWPR model. If W = 0 and C = 0 , then β ^ ( u i k , v i k , t k ) = β ^ are the global estimation values of the PR model.

3.2. Estimation of Parameter ρ

Based on the density function, the log-likelihood function of ρ is
L 2 ( ρ ) = k = 1 T i = 1 n k y i k θ i k b ( θ i k ) a ( ϕ i k ) + c ( y i k , ϕ i k ) .
Differentiating L 2 ( ρ ) with respect to ρ , we have
L 2 ρ = k = 1 T i = 1 n k y i k μ i k a i k ϕ θ i k ρ = 0 ,
where d θ i k d ρ = 1 b ( θ i k ) g ( μ i k ) d η i k d ρ . Then, we take the derivative of the model (5) with respect to ρ as follows:
d η l d ρ = d g ( μ l ) d ρ = h = 1 N A l · C · h η h .
The detail calculation is given in Appendix A.3. Then, the Equation (8) can be rewritten in the following nonlinear form
d L 2 d ρ = l = 1 N ( Y l μ l ) h = 1 N A l · C · h η h a l ϕ V ( μ l ) g ( μ l ) = 0 .
According to the Newton–Raphson procedure and Fisher scoring algorithm, the iterative formula of ρ ^ ( m + 1 ) is
ρ ^ ( m + 1 ) = ρ ^ ( m ) + I 1 ( ρ ^ ( m ) ) S ( ρ ^ ( m ) ) = ρ ^ ( m ) + ( ( A ( m ) C η ( m ) ) T ( m ) ( A ( m ) C η ( m ) ) ) 1 × ( A ( m ) C η ( m ) ) T ( m ) ( Z ( m ) η ( m ) ) ,
where the scalar vector S ( ρ ^ ( m ) ) = 1 ϕ ( A C η ) T ( Z η ) and the Fisher information matrix I ( ρ ) = 1 ϕ ( A C η ) T ( A C η ) . The calculation process of the scalar vector S ( ρ ^ ( m ) ) and the information matrix I is given in Appendix A.3.

4. Main Results

In this section, we apply the PR, GWPR, GWPAR and GTWPAR models to analyze the relationships between the HIV/AIDS epidemic, the GDP per capita and population density in ten countries of Sub-Saharan Africa from 2011 to 2016. The ten countries are Angola, Botswana, Lesotho, Malawi, Mozambique, Namibia, South Africa, Swaziland, Zimbabwe and Zambia. The parameters of these four models are estimated by the Newton–Raphson procedure and Fisher scoring algorithm. The coefficient of determination R 2 , the corrected Akaike information criterion (AICc), the deviation (D) and mean-square error (MSE) are used to compare the performances of the four models [18].

4.1. The HIV/AIDS Epidemic Models

The data of HIV/AIDS incidence, GDP per capita and population density were derived from http://data.cnki.net/InternationalData/Report. Readers should note that authorization is required to access the database on this website. Figure 1 describes the HIV/AIDS incidence in ten countries from 2011 to 2016. It shows that the incidence varies significantly in different regions. Angola has a minimum incidence of less than 5%, while Botswana and Swaziland have higher incidences of more than 20% every years. Therefore, it may be necessary to consider the temporal and spatial factors in analyzing the HIV/AIDS epidemic.
The distributions of HIV/AIDS cases, GDP per capita and population density are displayed in Figure 2. The Pearson correlation coefficients between these cases and GDP per capita and population density are 0.2739 and 0.1179 , respectively. Meanwhile, the two socioeconomic factors have different effects on the HIV/AIDS cases at the spatiotemporal locations. These reflect a spatiotemporal non-stationarity between the cases and two factors in ten countries from 2011 to 2016. Table 1 lists the p-values of the first-order autocorrelation of HIV/AIDS cases in the different years of the same region or the different regions of the same year. Each region has a significant spatial autocorrelation (p-value < 0.01) each year. Lesotho and South Africa had temporal autocorrelation during 2011 to 2016. Thus, the spatial and temporal autocorrelation should not be ignored.
Next, we standardized the two socioeconomic factors. The multiplex collinear test [32] was performed by the condition number k = λ max / λ min = 1.804 ( 15 ) ( λ is the eigenvalue of explanatory variable matrix). If k > 15 , then the data have collinearity. Otherwise, there is no collinearity. Thus, there is no collinearity between the two factors. Let μ i k , r i k and P i k be the annual HIV/AIDS cases (Unit: 1/1000 people), incidence (Unit: 1/100) and total population (unit: 100,000 people) in the kth year of the ith region, respectively. Denote g ( μ i k ) = η i k = ln μ i k = ln r i k + ln P i k ( μ i k = r i k P i k , i = 1 , 2 , , 10 and k = 1 , 2 , , 6 ). Let X i 1 k and X i 2 k be the GDP per capita and population density in the ith region at the kth year, respectively. The PR model is written by
g ( μ i k ) = β 0 + β 1 X i 1 k + β 2 X i 2 k , i = 1 , 2 , , 10 , k = 1 , 2 , , 6 ,
where β j ( j = 0 , 1 , 2 ) are unknown constants. The GWPR model is introduced as
g ( μ i k ) = β 0 ( u i k , v i k ) + β 1 ( u i k , v i k ) X i 1 k + β 2 ( u i k , v i k ) X i 2 k , i = 1 , 2 , , 10 ,
where k is a fixed constant taken from { 1 , 2 , , 6 } , and β j ( u i k , v i k ) are unknown spatial parameters for the ith country ( u i k , v i k ) in the kth year. Let ρ be a scalar autoregressive parameter, and c i l be a constant that represents an adjacency relation. The GWPAR model is
g ( μ i k ) = ρ l = 1 n c i l η i k + β 0 ( u i k , v i k ) + β 1 ( u i k , v i k ) X i 1 k + β 2 ( u i k , v i k ) X i 2 k ,
where n = 10 , k is a fixed constant, and β j ( u i k , v i k ) are defined as above. Let c l m ( i k ) be a spatiotemporal adjacency relation, and β j k ( u i k , v i k , t k ) ( k = 1 , 2 , , 6 ) be unknown spatiotemporal parameters in the ith country ( u i k , v i k ) in the kth year. The GTWPAR model is established as follows:
g ( μ i k ) = ρ m = 1 T l = 1 n k c l m ( i k ) η l m + β 0 k ( u i k , v i k , t k ) + β 1 k ( u i k , v i k , t k ) X i 1 k + β 2 k ( u i k , v i k , t k ) X i 2 k ,
where T = 6 ; n k = 10 for every k years; and ρ is defined as above.
Algorithms I, II, III and IV of PR, GWPR, GWPAR and GTWPAR models are provided in Appendix A.4, respectively.

4.2. Statistical Analysis

For the PR model, we get the estimated values of unknown parameters by Algorithm I. Then, the best space bandwidth is chosen by the cross-validation method. Following Huang et al. [28], the range [0.09, 2.49] of the space bandwidth is selected according to the minimum and maximum distance of the geographical positions. In the GWPR model, the best space bandwidth is h = 0.62 , 0.59 , 0.62 , 0.61 , 0.60 , 0.60 , and the estimations of coefficient functions are given by Algorithm II. The optimal space bandwidth of the GWPAR model is selected as h = 1.2895 , 1.1316 , 1.1316 , 1.0526 , 1.0526 , 1.0526 . Based on Algorithm III, we can get the estimations of coefficient functions and the scalar autoregressive parameter ρ ^ = 0.267 , 0.269 , 0.263 , 0.264 , 0.264 , 0.264 . For the GTWPAR model, we chose h s = 1.1316 , 0.9737 , 0.8947 , 0.9211 , 0.7842 , 0.8789 and τ = 0.1 , where τ ( > 0 ) is a balanced parameter. The coefficient estimations and scalar autoregressive parameter ρ ^ = 0.126 can be obtained by Algorithm IV. The quantile and mean values of coefficient estimations and response variables are shown in Table 2. We note that the GWPR, GWPAR and GTWPAR models can reflect the non-stationarity property of the influencing factors; the PR model cannot. Moreover, the GTWPAR model has a better performance than other models by comparing the true and fitted values.
The average estimated coefficients are visualized in Figure 3. For the PR model, the GDP per capita and population density had the same effect on the HIV/AIDS epidemic for ten countries in six years. However, there exist significant spatial non-stationarity and auto-correlation for different countries under the GWPR, GWPAR and GTWPAR models. Figure 4 shows the spatial distribution of the average MSE of their response variables. The lighter the color, the smaller the average error is. Thus, the GWPAR and GTWPAR models have the better fitting results.
These four indicators can effectively compare the performances of the proposed models (Table 3). The calculation formulas of R 2 , AICc, D and MSE are given in Appendix A.5. The coefficient of determination R 2 gradually increases from 12.91% of the PR model to 99.57% of the GTWPAR model. The MSE, AICc and D values of the GTWPAR model are smaller than those of other models. Therefore, the GTWPAR model is more suitable to investigate the spatiotemporal HIV/AIDS epidemic.
Based on the GTWPAR model, the mean values and 95% confidence intervals of the coefficient estimations are shown in Figure 5. The mean estimations are represented by the dot, and the 95% confidence intervals are given by the upper and lower lines. Note that the GDP per capita in Botswana, Namibia and South Africa has a positive effect on the HIV/AIDS cases. Six other countries (except Lesotho) had the opposite results. The population density for five countries had a positive effect on the HIV/AIDS cases—Angola, Botswana, Namibia, South Africa and Zambia. The population density of other five countries had the negative effect. Moreover, the impact of the GDP per capita on HIV/AIDS epidemic had a strong spatiotemporal non-stationarity in Lesotho, Malawi and Zimbabwe, while the population density had a strong spatiotemporal non-stationarity in Angola.

5. Conclusions

In this paper, we propose four regression models, including the PR, GWPR, GWPAR and GTWPAR, to investigate the non-stationary and auto-correlation properties. The relationships between the HIV/AIDS epidemic, GDP per capita and population density were analyzed in ten countries of Sub-Saharan Africa from 2011 to 2016. The unknown parameters of these models can be estimated by the Newton–Raphson procedure and Fisher scoring algorithm.
The PR model is a classical generalized model, which considers the global relationships between the response and explanatory variables. The GWPR and GWPAR models have been introduced to determine the spatial non-stationarity or auto-correlation. The GTWPAR model proposed by this article can be used to investigate not only spatiotemporal non-stationary but also auto-correlation. Thus, the PR, GWPR and GWPAR models are several special cases of the GTWPAR model (see Remark 1 and Remark 2). The performances of these models were evaluated by analyzing the correlations between the HIV/AIDS epidemic and two socioeconomic factors. The parameter estimations of the models can be obtained by Algorithms I, II, III and IV in Appendix A.4.
The results show that the impacts of GDP per capita and population density on HIV /AIDS cases had significant spatiotemporal non-stationarity and auto-correlation. The GWPR, GWPAR and GTWPAR models can reflect the strong spatial or spatiotemporal non-stationarity. The auto-correlation can be reflected in the GWPAR and GTWPAR models. Compared with other models, the GTWPAR model is more effective in terms of four comparison indicators. Thus, we suggest that the GTWPAR model can be used to analyze the spatiotemporal characteristics of the HIV/AIDS epidemic and the influences of the GDP per capita and population density.
Further work also exists in our study. For example, we observed that the effects of the GDP per capita for Lesotho, Malawi and Zimbabwe and the population density for Angola on HIV/AIDS had strong spatiotemporal non-stationarity. These may be the result of local environmental or political factors. Whether the fitting results of these regions will perform better if explanatory variables such as local unique environmental or political factors are added needs to be further investigated.

Author Contributions

Conceptualization, Z.L., H.Z. and X.H.; methodology, S.S., H.Z. and X.H.; software, S.S.; supervision, Z.L. and H.J.; visualization, Z.L.; writing—original draft, S.S.; writing—review & editing, S.S., Z.L. and H.J. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Natural Science Foundation of Xinjiang Uygur Autonomous Region (XJEDU2017M001, 2018Q011), and the National Natural Science Foundation of China (U1703237, 11661076, 12061070).

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
HIV/AIDSHuman immunodeficiency virus/Acquired immunodeficiency syndrome
PRPoisson regression model
GWPRGeographically weighted Poisson regression model
GWPARGeographically weighted Poisson autoregressive model
GTWPARGeographically and temporally weighted Poisson autoregressive model
GDPGross domestic product

Appendix A. Detailed Processes

Appendix A.1. The Expressions of C, X and B

In model η = ρ C η + B X , the expressions of C, X and B are
C = c 11 ( 11 ) c 11 ( n 1 1 ) c 11 ( 1 T ) c 11 ( n T T ) c ( n 1 1 ) ( 11 ) c ( n 1 1 ) ( n 1 1 ) c ( n 1 1 ) ( 1 T ) c ( n 1 1 ) ( n T T ) c 1 T ( 11 ) c 1 T ( n 1 1 ) c 1 T ( 1 T ) c 1 T ( n T T ) c ( n T T ) ( 11 ) c ( n T T ) ( n 1 1 ) c ( n T T ) ( 1 T ) c ( n T T ) ( n T T ) , X = X 111 X 121 X 1 p 1 X n 1 11 X n 1 21 X n 1 p 1 X 11 T X 12 T X 1 p T X n T 1 T X n T 2 T X n T p T ,
where C I K , X I K are respectively the Ith row and the Kth column of the matrices C and X. Moreover,
B = ( β 11 ( u 11 , v 11 , t 1 ) , , β p 1 ( u 11 , v 11 , t 1 ) , , β 11 ( u n 1 1 , v n 1 1 , t 1 ) , , β p 1 ( u n 1 1 , v n 1 1 , t 1 ) , , β 1 T ( u 1 T , v 1 T , t T ) , , β p T ( u 1 T , v 1 T , t T ) , , β 1 T ( u n T 1 , v n T 1 , t 1 ) , , β p T ( u n T 1 , v n T 1 , t 1 ) ) .

Appendix A.2. Formula and Information Matrix of β(u,v,t)

(1) For the matrix form of model (5), we obtain
( 1 ρ C 11 ) η 1 ρ C 12 η 2 ρ C 1 N η N = j = 1 p β j 0 X 1 j , ρ C 21 η 1 + ( 1 ρ C 22 ) η 2 ρ C 2 N η N = j = 1 p β j 0 X 2 j , ρ C N 1 η 1 ρ C N 2 η 2 + ( 1 ρ C N N ) η N = j = 1 p β j 0 X N j .
Differentiating the above equations with β r 0 yields
( 1 ρ C 11 ) η 1 β r 0 ρ C 12 η 2 β r 0 ρ C 1 N η N β r 0 = X 1 r , ρ C 21 η 1 β r 0 + ( 1 ρ C 22 ) η 2 β r 0 ρ C 2 N η N β r 0 = X 2 r , ρ C N 1 η 1 β r 0 ρ C N 2 η 2 β r 0 + ( 1 ρ C N N ) η N β r 0 = X N r .
Then,
1 ρ C 11 ρ C 12 ρ C 1 N ρ C 21 1 ρ C 22 ρ C 2 N ρ C N 1 ρ C N 2 1 ρ C N N η 1 β r 0 η 2 β r 0 η N β r 0 = X 1 r X 2 r X N r .
Denote η β r 0 = ( η 1 β r 0 , η 2 β r 0 , , η N β r 0 ) and
A = 1 ρ C 11 ρ C 12 ρ C 1 N ρ C 21 1 ρ C 22 ρ C 2 N ρ C N 1 ρ C N 2 1 ρ C N N 1 , X · r = ( X 1 r , X 2 r , , X N r ) , A l · = ( A l 1 , A l 2 , , A l N ) .
Thus, η β r 0 = A X · r , that is
η l β r 0 = h = 1 N A l h X h r = A l · X · r , l = 1 , 2 , , N .
(2) The element I r b of I ( β ) satisfies
I r b ( β ) = 2 L 1 β b 0 β r 0 = β b 0 l = 1 N Y l μ l a l ϕ A l · X · r V ( μ l ) g ( μ l ) W l ( u 00 , v 00 , t 0 ) = l = 1 N Y l μ l a l ϕ β b 0 A l · X · r V ( μ l ) g ( μ l ) W l ( u 00 , v 00 , t 0 ) l = 1 N A l · X · r V ( μ l ) g ( μ l ) β b 0 Y l μ l a l ϕ W l ( u 00 , v 00 , t 0 ) = l = 1 N Y l μ l a l ϕ β b 0 A l · X · r V ( μ l ) g ( μ l ) W l ( u 00 , v 00 , t 0 ) + l = 1 N A l · X · r A l · X · b a l ϕ V ( μ l ) ( g ( μ l ) ) 2 W l ( u 00 , v 00 , t 0 ) .
The Fisher information matrix is
I ( β ) = E ( I ( β ) ) = E ( ( I r b ( β ) ) p × p ) = l = 1 N A l · X · r A l · X · b a l ϕ V ( μ l ) ( g ( μ l ) ) 2 W l ( u 00 , v 00 , t 0 ) = 1 ϕ l = 1 N T l A l · X · r A l · X · b W l ( u 00 , v 00 , t 0 ) .

Appendix A.3. Formula and Information Matrix of ρ

(1) Differentiating (4) with ρ yields
( 1 ρ C 11 ) d η 1 d ρ ρ C 12 d η 2 d ρ ρ C 1 N d η N d ρ = C 11 η 1 + C 12 η 2 + + C 1 N η N , ρ C 21 d η 1 d ρ + ( 1 ρ C 22 ) d η 2 d ρ ρ C 2 N d η N d ρ = C 21 η 1 + C 22 η 2 + + C 2 N η N , ρ C N 1 d η 1 d ρ ρ C N 2 d η 2 d ρ + ( 1 ρ C N N ) d η N d ρ = C N 1 η 1 + C N 2 η 2 + + C N N η N .
That is to say
1 ρ C 11 ρ C 12 ρ C 1 N ρ C 21 1 ρ C 22 ρ C 2 N ρ C N 1 ρ C N 2 1 ρ C N N d η 1 d ρ d η 2 d ρ d η N d ρ = C 11 C 12 C 1 N C 21 C 22 C 2 N C N 1 C N 2 C N N η 1 η 2 η N .
Let d η d ρ = ( d η 1 d ρ , , d η N d ρ ) . Then, d η d ρ = A C η , where
d η l d ρ = d g ( μ l ) d ρ = h = 1 N A l · C · h η h , l = 1 , 2 , , N .
(2) The scalar vector of ρ is
S ( ρ ) = d L 2 d ρ = 1 ϕ l = 1 N T l h = 1 N A l · C · h η h ( Y l μ l ) g ( μ l ) = 1 ϕ ( A C η ) T ( Z η ) .
The information matrix is
I ( ρ ) = 2 L 2 ρ 2 = d d ρ l = 1 N Y l μ l a l ϕ h = 1 N A l · C · h η h V ( μ l ) g ( μ l ) = l = 1 N Y l μ l a l ϕ d d ρ h = 1 N A l · C · h η h V ( μ l ) g ( μ l ) l = 1 N h = 1 N A l · C · h η h V ( μ l ) g ( μ l ) d d ρ Y l μ l a l ϕ ,
where
d μ l d ρ = d μ l d g ( μ l ) d g ( μ l ) d ρ = 1 g ( μ l ) h = 1 N A l · C · h η h .
Thus,
I ( ρ ) = l = 1 N Y l μ l a l ϕ d d ρ h = 1 N A l · C · h η h V ( μ l ) g ( μ l ) + l = 1 N ( h = 1 N A l · C · h η h ) 2 a l ϕ V ( μ l ) ( g ( μ l ) ) 2 .
The Fisher information matrix is
I ( ρ ) = E ( I ( ρ ) ) = l = 1 N ( h = 1 N A l · C · h η h ) 2 a l ϕ V ( μ l ) ( g ( μ l ) ) 2 = 1 ϕ l = 1 N T l h = 1 N A l · C · h η h 2 = 1 ϕ ( A C η ) T ( A C η ) .

Appendix A.4. Algorithms of Coefficient Estimation

We provide four algorithms to estimate the unknown coefficients of PR, GWPR, GWPAR and GTWPAR models in Section 4.
Algorithm I: Estimate the unknown parameters in the PR model. Take the initial values g ( μ i k ( 0 ) ) = η i k ( 0 ) = ln μ i k ( 0 ) , y i k = μ i k ( 0 ) , Z i k ( 0 ) = η i k ( 0 ) + g ( μ i k ( 0 ) ) ( y i k μ i k ( 0 ) ) , and
w i k ( 0 ) = 1 a i k V ( μ i k ( 0 ) ) ( g ( μ i k ( 0 ) ) ) 2 , i = 1 , 2 , , 10 , k = 1 , 2 , , 6 .
The iterative formula of β ^ ( m + 1 ) is
β ^ ( m + 1 ) = X W ( m ) X 1 X W ( m ) Z ( m ) .
Repeat the above step until convergence yields. The estimated value β ^ = β ^ ( m ) can be obtained.
Algorithm II: Estimate the unknown coefficients in the GWPR model. (Note that k is a fixed constant taken from { 1 , 2 , , 6 } and the following steps should be repeated six times independently). Take the initial values η i k ( 0 ) = g ( μ i k ( 0 ) ) , y i k = μ i k ( 0 ) , Z i k ( 0 ) = η i k ( 0 ) + g ( μ i k ( 0 ) ) ( y i k μ i k ( 0 ) ) , and
t i k ( 0 ) = 1 a i k V ( μ i k ( 0 ) ) ( g ( μ i k ( 0 ) ) ) 2 , i = 1 , 2 , , 10 .
Let Z ( 0 ) = ( Z 1 k ( 0 ) , Z 2 k ( 0 ) , , Z 10 k ( 0 ) ) and T ( 0 ) = d i a g ( T 1 k ( 0 ) , T 2 k ( 0 ) , , T 10 k ( 0 ) ) . The iterative formula of β ^ ( m + 1 ) at the location ( u 0 k , v 0 k ) is
β ^ ( m + 1 ) ( u 0 k , v 0 k ) = ( A ( u 0 k , v 0 k ) T ( m ) W ( u 0 k , v 0 k ) A ( u 0 k , v 0 k ) ) 1 A ( u 0 k , v 0 k ) T ( m ) W ( u 0 k , v 0 k ) Z ( m ) ,
where W ( u 0 k , v 0 k ) = diag ( w 1 ( u 0 k , v 0 k ) , w 2 ( u 0 k , v 0 k ) , , w 10 ( u 0 k , v 0 k ) ) and
w i ( u 0 k , v 0 k ) = 1 2 π exp 1 2 d i k ( 0 ) h 2 .
Repeat the above step until convergence. When ( u 0 k , v 0 k ) takes all the locations ( u i k , v i k ) , we will get the estimated value β ^ = β ^ ( m ) in a fixed the kth year.
Algorithm III: Estimate unknown coefficients in the GWPAR model. (Note that k is a fixed constant and the following steps should be repeated six times as in Algorithm II). Take the initial value β 1 ( 0 ) , β 2 ( 0 ) from Algorithm II, and ρ ( 0 ) is the absolute value of spatial Moran’s I = 0.4480 . The initial values η ( 0 ) = ( I ρ ( 0 ) C ) 1 X β ( 0 ) , μ i k ( 0 ) = g 1 ( η i k ( 0 ) ) , and
Z i k ( 0 ) = g ( μ i k ( 0 ) ) + g ( μ i k ( 0 ) ) ( y i k μ i k ( 0 ) ) , t i k ( 0 ) = 1 a i k V ( μ i k ( 0 ) ) ( g ( μ i k ( 0 ) ) ) 2 , i = 1 , 2 , , 10 .
The ( m + 1 ) th iterative estimation β ^ ( m + 1 ) ( u 0 k , v 0 k ) and ρ ^ ( m + 1 ) is
β ^ ( m + 1 ) ( u 0 k , v 0 k ) = ( ( A ( m ) X ) T ( m ) W ( A ( m ) X ) ) 1 ( A ( m ) X ) T ( m ) W Z ( m ) , ρ ^ ( m + 1 ) = ρ ( m ) + ( ( A ( m ) C η ( m ) ) T ( m ) ( A ( m ) C η ( m ) ) ) 1 ( A ( m ) C η ( m ) ) T ( m ) ( Z ( m ) η ( m ) ) .
If ( u 0 k , v 0 k ) takes all the locations ( u i k , v i k ) , the estimate β ^ ( m + 1 ) can be given. When all estimated values arrive to converge, we will get β ^ = β ^ ( m ) ( u i k , v i k ) and ρ ^ = ρ ^ ( m ) in a fixed the kth year.
Algorithm IV: Estimate the unknown coefficients in the GTWPAR model. Take the initial values β 1 k ( 0 ) ( u i k , v i k , t k ) , β 2 k ( 0 ) ( u i k , v i k , t k ) , i = 1 , 2 , , 10 , k = 1 , 2 , , 6 from Algorithm III, and ρ ( 0 ) is the absolute value of spatiotemporal Moran’s I = 0.2143 . The initial value vector η ( 0 ) = ( I ρ ( 0 ) C ) 1 X β ( 0 ) , μ i k ( 0 ) = g 1 ( η i k ( 0 ) ) , Z ( 0 ) = ( Z 1 ( 0 ) , Z 2 ( 0 ) , , Z 60 ( 0 ) ) , T ( 0 ) = diag ( T 1 ( 0 ) , T 2 ( 0 ) , , T 60 ( 0 ) ) and
Z i k ( 0 ) = g ( μ i k ( 0 ) ) + g ( μ i k ( 0 ) ) ( y i k μ i k ( 0 ) ) , T i k ( 0 ) = 1 a i k V ( μ i k ( 0 ) ) ( g ( μ i k ( 0 ) ) ) 2 .
The ( m + 1 ) th iterative estimations β ^ ( u 00 , v 00 , t 0 ) and ρ ^ are
β ^ ( m + 1 ) ( u 00 , v 00 , t 0 ) = ( ( A ( m ) X ) T ( 0 ) W ( A ( m ) X ) ) 1 × ( A ( m ) X ) T ( m ) W Z ( m ) , ρ ^ ( m + 1 ) = ρ ( m ) + ( ( A ( m ) C η ( m ) ) T ( m ) ( A ( m ) C η ( m ) ) ) 1 × ( A ( m ) C η ( m ) ) T ( m ) ( Z ( m ) η ( m ) ) .
where W = { w i k ( u 00 , v 00 , t 0 ) } and
w i k ( u 00 , v 00 , t 0 ) = 1 2 π exp 1 2 ( u 00 u i k ) 2 + ( v 00 v i k ) 2 h S 2 + ( t 0 t k ) 2 τ h S 2 .
A detailed definition is given in Section 3.1. If ( u 00 , v 00 , t 0 ) takes all the locations ( u i k , v i k , t k ) and all estimations converge, we will get β ^ = β ^ ( m ) ( u i k , v i k , t k ) and ρ ^ = ρ ^ ( m ) .
It is worth noting that we use the parameter estimates of the previous model as the initial values of the next model to reduce the number of iterations and improve the operational efficiency. For example, the estimations of the GWPR model are selected as the initial values of the GWPAR model.

Appendix A.5. The Model Comparison Indicators (See Table 3)

(1) The coefficient of determination is defined by
R 2 = 1 ( η η ^ ) 2 ( η η ¯ ) 2 ,
where η is a set of vectors { η i k }, η ^ and η ¯ are the parameter estimate and the mean value of η , respectively.
(2) Deviation can be defined as
D = ( y l n ( μ ^ y ) + ( y μ ^ ) ) ,
where y is a set of response variables, and μ ^ is the estimation of μ =E(y).
(3) The corrected Akaike information criterion is
AICc = D + 2 P + 2 P ( P + 1 ) N P 1 ,
where D, P and N are the deviation, the number of parameters and the number of samples, respectively.
(4) Mean-square error is given by
MSE = 1 N P ( η ^ η ) 2 ,
where the parameter settings are the same as above.

References

  1. Hlongwa, P. Current ethical issues in HIV/AIDS research and HIV/AIDS care. Oral Dis. 2016, 22, 61–65. [Google Scholar] [CrossRef] [PubMed]
  2. Buve, A.; Bishikwabo-Nsarhaza, K.; Mutangadura, G. The spread and effect of HIV-1 infection in sub-Saharan Africa. Lancet 2002, 359, 2011–2017. [Google Scholar] [CrossRef]
  3. Vandormael, A.; Oliveira, T.D.; Tanser, F.; Till, B.; Herbeck, J.T. High percentage of undiagnosed HIV cases within a hyperendemic South African community: A population-based study. J. Epidemiol Community Health 2018, 72, 168–172. [Google Scholar] [CrossRef] [PubMed]
  4. Abbas, U.L.; Robert, G.; Anuj, M.; Gregory, H.; Mellors, J.W. Antiretroviral therapy and pre-exposure prophylaxis: Combined impact on HIV transmission and drug resistance in South Africa. J. Infect. Dis. 2013, 208, 224–234. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Wojcicki, J.M. Socioeconomic status as a risk factor for HIV infection in women in East, Central and Southern Africa: A systematic review. J. Biosoc. Sci. 2005, 37, 1–9. [Google Scholar] [CrossRef] [PubMed]
  6. Hallman, K. Gendered socioeconomic conditions and HIV risk behaviours among young people in South Africa. Afr. J. AIDS Res. 2005, 4, 37–50. [Google Scholar] [CrossRef]
  7. Chris, K.; Sizwe, Z.; Robert, C.; Sipho, D. Why have socioeconomic explanations been favoured over cultural ones in explaining the extensive spread of hiv in south africa? S. Afr. J. HIV Med. 2012, 13, 14–16. [Google Scholar]
  8. Stover, J. Projecting the demographic consequences of adult hiv prevalence trends: The spectrum projection package. Sex. Transm. Infect. 2004, 80, 14–18. [Google Scholar] [CrossRef] [Green Version]
  9. Shiboski, S.C.; Jewell, N.P. Statistical analysis of the time dependence of HIV infectivity based on partner study data. J. Am. Stat. Assoc. 1992, 87, 360–372. [Google Scholar] [CrossRef]
  10. Marsh, K.; Mahy, M.; Salomon, J.A.; Hogan, D.R. Assessing and adjusting for differences between hiv prevalence estimates derived from national population-based surveys and antenatal care surveillance, with applications for spectrum 2013. AIDS 2014, 28, S497–S505. [Google Scholar] [CrossRef] [Green Version]
  11. Palk, L.; Blower, S. Geographic variation in sexual behavior can explain geospatial heterogeneity in the severity of the HIV epidemic in Malawi. BMC Med. 2018, 16, 22–31. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Nelder, J.A.; Wedderburn, R.W.M. Generalized linear models. J. R. Stat. Soc. 1972, 135, 370–384. [Google Scholar]
  13. Myers, R.H.; Montgomery, D.C. A tutorial on generalized linear models. J. Qual. Technol. 1997, 29, 274–275. [Google Scholar] [CrossRef]
  14. Madden, L.V.; Boudreau, M.A. Effect of strawberry density on the spread of anthracnose caused by colletotrichum acutatum. Phytopathology 1997, 87, 828–838. [Google Scholar] [CrossRef]
  15. Guisan, A.; Edwards, T.C.; Hastie, T. Generalized linear and generalized additive models in studies of species distributions: Setting the scene. Ecol. Model. 2002, 157, 89–100. [Google Scholar] [CrossRef] [Green Version]
  16. Marshall, R.J. A review of methods for the statistical analysis of spatial patterns of disease. J. R. Stat. Soc. 1991, 154, 421–441. [Google Scholar] [CrossRef]
  17. Brunsdon, C.; Fotheringham, A.S.; Charlton, M.E. Geographically weighted regression: A method for exploring spatial nonstationarity. Geogr. Anal. 1996, 28, 281–298. [Google Scholar] [CrossRef]
  18. Nakaya, T.; Fotheringham, A.S.; Brunsdon, C.; Charlton, M.E. Geographically weighted Poisson regression for disease association mapping. Stat. Med. 2005, 24, 2695–2717. [Google Scholar] [CrossRef] [Green Version]
  19. Hadayeghi, A.; Shalaby, A.S.; Persaud, B.N. Development of planning level transportation safety tools using geographically weighted Poisson regression. Accid. Anal. Prev. 2010, 42, 676–688. [Google Scholar] [CrossRef]
  20. Li, Y.; Jiao, Y.; Browder, J.A. Modeling spatially-varying ecological relationships using geographically weighted generalized linear model: A simulation study based on longline seabird bycatch. Fish. Res. 2016, 181, 14–24. [Google Scholar] [CrossRef] [Green Version]
  21. Anselin, L. Spatial econometrics: Methods and models. J. Am. Stat. Assoc. 1990, 85, 160–161. [Google Scholar]
  22. Lichstein, J.W.; Simons, T.R.; Franzreb, S.K.E. Spatial Autocorrelation and Autoregressive Models in Ecology. Ecol. Monogr. 2002, 72, 445–463. [Google Scholar] [CrossRef]
  23. Lee, L.F. Asymptotic distributions of quasi-maximum likelihood estimators for spatial autoregressive models. Econometrica 2004, 72, 1899–1925. [Google Scholar] [CrossRef]
  24. Brunsdon, C.; Fotheringham, A.S.; Charlton, M.E. Some notes on parametric significance tests for geographically weighted regression. J. Reg. Sci. 2010, 39, 497–524. [Google Scholar] [CrossRef]
  25. Zhang, Y.H. Generalized Space Variable Coefficient Autoregressive Model and Its Application in the Impact of Macroscopic-Factors on the HIV/AIDS Incidence. Master’s Dissertation, Xinjiang University, Urumqi, China, 2017. [Google Scholar]
  26. Yan, N.; Mei, C.L. A two-step local smoothing approach for exploring spatio-temporal patterns with application to the analysis of precipitation in the mainland of China during 1986–2005. Environ. Ecol. Stat. 2014, 21, 373–390. [Google Scholar] [CrossRef]
  27. Fotheringham, A.S.; Crespo, R.; Yao, J. Geographical and temporal weighted regression (GTWR). Geogr. Anal. 2015, 47, 431–452. [Google Scholar] [CrossRef] [Green Version]
  28. Huang, B.; Wu, B.; Michael, B. Geographically and temporally weighted regression for modeling spatio-temporal variation in house prices. Int. J. Geogr. Inf. Sci. 2010, 24, 383–401. [Google Scholar] [CrossRef]
  29. Pace, R.K.; Barry, R.; Clapp, J.M.; Rodriquez, M. Spatiotemporal autoregressive models of neighborhood effects. J. R. Estate Financ. Econ. 1998, 17, 15–33. [Google Scholar]
  30. Wu, B.; Li, R.; Huang, B. A geographically and temporally weighted autoregressive model with application to housing prices. Int. J. Geogr. Inf. Sci. 2014, 28, 1186–1204. [Google Scholar] [CrossRef]
  31. Yan, N.; Mei, C.L.; Wang, N. A unified bootstrap test for local patterns of spatio-temporal association. Environ. Plan. A 2015, 47, 227–242. [Google Scholar]
  32. Wu, X.Z. Multivariate analysis. In Complex Data Statistical Methods Application Based on R; Wu, X.Z., Ed.; China Renmin University Press: Beijing, China, 2013; pp. 116–144. [Google Scholar]
Figure 1. Spatiotemporal HIV/AIDS incidence of ten countries, 2011–2016.
Figure 1. Spatiotemporal HIV/AIDS incidence of ten countries, 2011–2016.
Entropy 22 01230 g001
Figure 2. Distributions of HIV/AIDS cases, GDP per capita and population density of ten countries, 2011–2016.
Figure 2. Distributions of HIV/AIDS cases, GDP per capita and population density of ten countries, 2011–2016.
Entropy 22 01230 g002
Figure 3. The spatial distribution of the average coefficient estimations in four models.
Figure 3. The spatial distribution of the average coefficient estimations in four models.
Entropy 22 01230 g003
Figure 4. The average MSE of response variables.
Figure 4. The average MSE of response variables.
Entropy 22 01230 g004
Figure 5. The mean values and 95% confidence intervals of coefficient estimations.
Figure 5. The mean values and 95% confidence intervals of coefficient estimations.
Entropy 22 01230 g005
Table 1. p-values of the spatial and temporal autocorrelation analysis.
Table 1. p-values of the spatial and temporal autocorrelation analysis.
Time201120122013201420152016
p-value0.00240.00280.00210.00190.00170.0022
RegionsAngolaBotswanaLesothoMalawiMozambiqueNamibia
p-value0.90940.88070.00920.53000.12890.8267
South AfricaSwazilandZimbabweZambia
0.00450.14910.56880.2231
Table 2. The quantile and mean values of coefficient estimations and response variables.
Table 2. The quantile and mean values of coefficient estimations and response variables.
ModelCoefficientMin1st QuMedian3rd QuMaxMean
True η 5.2935.6446.4557.2888.8716.559
PR β ^ 1 0.5810.5810.5810.5810.5810.581
β ^ 2 0.3850.3850.3850.3850.3850.385
η ^ 6.3446.5277.1857.4478.1627.078
GWPR β ^ 1 −0.519−0.2490.0090.6411.7740.288
β ^ 2 −0.672−0.1390.0682.0443.8030.747
η ^ 5.6186.4227.0227.2028.8136.932
GWPAR β ^ 1 −6.286−1.258−0.9100.2403.036−0.556
β ^ 2 −2.089−0.6690.0192.2819.1520.912
η ^ 5.1765.6256.4357.2198.9406.539
GTWPAR β ^ 1 −5.642−0.956−0.6770.1862.992−0.443
β ^ 2 −1.865−0.4640.2152.1349.1040.988
η ^ 5.2005.5896.3877.2158.8036.489
Table 3. The comparison of the four models.
Table 3. The comparison of the four models.
ModelR 2 AICcDMSE
PR0.129142,504.4042,624.071.6488
GWPR0.61396495.126613.020.4326
GWPAR0.9940155.46236.020.0067
GTWPAR0.9957115.25190.050.0048
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Sun, S.; Li, Z.; Zhang, H.; Jiang, H.; Hu, X. Analysis of HIV/AIDS Epidemic and Socioeconomic Factors in Sub-Saharan Africa. Entropy 2020, 22, 1230. https://doi.org/10.3390/e22111230

AMA Style

Sun S, Li Z, Zhang H, Jiang H, Hu X. Analysis of HIV/AIDS Epidemic and Socioeconomic Factors in Sub-Saharan Africa. Entropy. 2020; 22(11):1230. https://doi.org/10.3390/e22111230

Chicago/Turabian Style

Sun, Shuman, Zhiming Li, Huiguo Zhang, Haijun Jiang, and Xijian Hu. 2020. "Analysis of HIV/AIDS Epidemic and Socioeconomic Factors in Sub-Saharan Africa" Entropy 22, no. 11: 1230. https://doi.org/10.3390/e22111230

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop