Data documenting the potential distribution of Aedes aegypti in the center of Veracruz, Mexico

The data presented in this article are related to the research article entitled “Establishment of Aedes aegypti (L.) in mountainous regions in Mexico: Increasing number of population at risk of mosquito-borne disease and future climate conditions” (M. Equihua, S. Ibáñez-Bernal, G. Benítez, I. Estrada-Contreras, C.A. Sandoval-Ruiz, F.S. Mendoza-Palmero, 2016) [1]. This article provides the presence records, in shapefile format, used to generate maps of the potential distribution of Aedes aegypti under different climate change scenarios, as well as each of the resulting maps in raster format. In addition, it provides tables with values of the potential distribution of the vector and the average probability of presence, including data on mosquito incidence along the altitudinal range.


Specifications
Type of data: Maps, tables and figures
How data was acquired: A dataset sampled for the state of Veracruz: 100 records of Aedes aegypti from previous surveys, 167 records provided by the Health Authority for Region V, state of Veracruz, and seven records from our sampling data. Potential distribution maps of Aedes aegypti were obtained using the packages "maxlike" ver. 0.1-5, "raster" ver. 2.3-12, "rgdal" ver. 0.9-1, "sp" ver. 1.0-16 and "tcltk2" ver. 1.2-10, in the software R ver. 3.1.2. In addition, a geographic information system was used to analyze the maps obtained.
Data format: Shapefile (.shp) and Excel (.xlsx)
Data source location: Veracruz, Mexico
Data accessibility: Data are available in this article

Value of the data
Presence records over a gradient that includes the current boundary conditions are useful for assessing the ongoing expansion of the Aedes aegypti distribution.
The potential distribution coverage of the mosquito is useful in planning future strategies to face the human risks produced by the expansion of Aedes aegypti.
The potential distribution of Aedes aegypti could be used for comparison with the output of other algorithms used in ecological niche modeling.

Data
The dataset of this article provides the occurrence records used to generate the potential distribution maps of Aedes aegypti, together with the series of maps we produced from them. These maps are presented and discussed in Equihua et al. [1]. The first map (Map 1) shows the spatial distribution of the records used to generate the Aedes aegypti potential distribution models (shared in shapefile format). The following five maps show the potential distribution obtained under the different climate change scenarios we explored: Map 2 is the current potential distribution, Map 3 is the RCP 4.5 to 2030 scenario, Map 4 is the RCP 8.5 to 2030 scenario, Map 5 is the RCP 4.5 to 2080 scenario and Map 6 is the RCP 8.5 to 2080 scenario. They are shared in raster GeoTIFF format. Tables 1-3 show information about the area, probability of occurrence and potential altitudinal presence in the different altitudinal ranges where the potential presence of the mosquito is projected (they are shared in .xlsx format).

Experimental design, materials and methods
We developed ecological niche models of Ae. aegypti for the state of Veracruz with a total of 274 verified records: seven records from our sampling data, 100 records from previous surveys and 167 records provided by the Health Authority for Region V, state of Veracruz. We verified all of them for geographic accuracy by on-screen visual inspection using a Geographic Information System image.
To develop potential distribution models of Ae. aegypti we used bioclimate variables for current conditions [2] and for projected future conditions [3]. The bioclimate variables used were Bio5: maximum temperature of the warmest month, Bio6: minimum temperature of the coldest month, Bio13: precipitation of the wettest month and Bio14: precipitation of the driest month.
The results of a correlation analysis of the 19 bioclimate variables indicate that the four selected variables are highly correlated with two principal components that account for almost 92% of the variability in the data. For each projection into future conditions, we used two Representative Concentration Pathways (RCP): RCP 4.5 and RCP 8.5, which refer to the possible range of radiative forcing values in the year 2100 relative to pre-industrial values, expressed in W/m² [4].
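The variable-screening idea described above can be illustrated with a minimal sketch, assuming a principal component analysis on the correlation matrix. The data here are synthetic stand-ins (19 columns driven by two hypothetical latent gradients), not the bioclimate layers of this article, and the function name `pca_summary` is illustrative only.

```python
import numpy as np

def pca_summary(X):
    """PCA on the correlation matrix of X (rows = grid cells, columns = variables).

    Returns the share of variance explained by each component and the
    variable-component correlations (loadings), sorted by decreasing variance.
    """
    corr = np.corrcoef(X, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(corr)      # eigh returns ascending eigenvalues
    order = np.argsort(eigvals)[::-1]
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    explained = eigvals / eigvals.sum()
    # For a correlation-matrix PCA, loading = eigenvector * sqrt(eigenvalue)
    loadings = eigvecs * np.sqrt(np.clip(eigvals, 0.0, None))
    return explained, loadings

# Synthetic example (assumption): 19 variables built from two latent gradients
rng = np.random.default_rng(0)
latent = rng.normal(size=(500, 2))
weights = rng.normal(size=(2, 19))
X = latent @ weights + 0.3 * rng.normal(size=(500, 19))

explained, loadings = pca_summary(X)
print("Variance explained by the first two components:", explained[:2].sum())
print("Loadings of variable 0 on components 1 and 2:", loadings[0, :2])
```

Variables whose loadings on the leading components are large in absolute value are the ones that track those components most closely, which is the kind of evidence the paragraph above refers to.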
We standardized all bioclimate variables (current and future) against their corresponding current layer, i.e., from each value of a variable we subtracted the mean of the current data subset and then divided the result by the standard deviation of that same subset. We used the MaxLike software package [5] to generate the potential distribution maps, working in R ver. 3.1.2 with the packages "maxlike" ver. 0.1-5, "raster" ver. 2.3-12, "rgdal" ver. 0.9-1, "sp" ver. 1.0-16 and "tcltk2" ver. 1.2-10.
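The standardization step can be sketched as follows. This is a minimal illustration, not the code used for the article (which was run in R): the array values are made up, the function name `standardize_with_current` is hypothetical, and the use of the sample standard deviation (`ddof=1`) is an assumption.

```python
import numpy as np

def standardize_with_current(layer, current):
    """Z-score `layer` with the mean and SD of the `current` layer.

    NaN cells are treated as no-data and ignored when computing the statistics.
    ddof=1 (sample standard deviation) is an assumption of this sketch.
    """
    mu = np.nanmean(current)
    sigma = np.nanstd(current, ddof=1)
    return (layer - mu) / sigma

# Made-up Bio5-like values in degrees Celsius; NaN marks a no-data cell
current_bio5 = np.array([[28.0, 30.0], [32.0, np.nan]])
future_bio5 = current_bio5 + 2.0   # hypothetical uniform warming of 2 degrees

z_current = standardize_with_current(current_bio5, current_bio5)
z_future = standardize_with_current(future_bio5, current_bio5)
print(z_current)
print(z_future)
```

Because the future layer is scaled with the current mean and standard deviation, a shift in climate shows up as a shift in the standardized values rather than being removed by the rescaling.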
Then, we randomly selected 65% of the records for training and the remaining 35% for cross-validation, repeating this process 1000 times with the current conditions dataset. The resulting models were deemed adequate, according to Estrada-Contreras et al. [6], if they satisfied the following criteria: a) convergence occurred, b) they had no missing data, and c) the proportion of omission errors was less than or equal to 10%. The model coefficients were then used to project the species' future niche. The resulting models were ranked by how well they matched the relative occurrence area (ROA) [7] values. We chose 10 models around the statistical median whose average probability of presence, obtained with the validation records, was closest to 1, since theoretically the average of this value should be 1. We then produced a consensus map by averaging these 10 maps (using the same set of models for current and future conditions). The minimum value of probability of presence was considered indicative of the likely presence of Ae. aegypti, and was obtained by extracting values from the potential distribution map for current conditions at the coordinates of all the records used to generate the models (training and validation). To further evaluate the current presence model we used partial ROC [8], randomly selecting 35% of the records used to generate the models.
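A few of the building blocks of this procedure can be sketched in code. This is an illustrative outline under stated assumptions, not the workflow used for the article: model fitting with maxlike is not reproduced, all function names are hypothetical, the maps are toy arrays, and the threshold passed to `omission_rate` is an assumed input because the text does not specify how omission was thresholded during screening.

```python
import numpy as np

def split_records(n_records, train_fraction, rng):
    """Return (train_idx, test_idx) for one random partition of the records."""
    idx = rng.permutation(n_records)
    n_train = int(round(train_fraction * n_records))
    return idx[:n_train], idx[n_train:]

def omission_rate(pred_at_test, threshold):
    """Share of held-out presences predicted below the presence threshold."""
    return float(np.mean(np.asarray(pred_at_test) < threshold))

def consensus_map(maps):
    """Cell-wise mean of a list of probability maps with identical shape."""
    return np.nanmean(np.stack(maps), axis=0)

def minimum_presence_value(prob_map, rows, cols):
    """Smallest predicted probability at the record locations (row, col)."""
    return float(np.nanmin(prob_map[rows, cols]))

rng = np.random.default_rng(42)

# One of the repeated 65/35 partitions of the 274 records
train_idx, test_idx = split_records(274, 0.65, rng)
print(len(train_idx), len(test_idx))

# Toy predictions at four held-out records, with an assumed threshold of 0.1
print(omission_rate([0.5, 0.05, 0.3, 0.2], threshold=0.1))

# Toy consensus of two 2x2 maps and the minimum value at two record cells
maps = [np.full((2, 2), 0.2), np.full((2, 2), 0.4)]
consensus = consensus_map(maps)
presence_cutoff = minimum_presence_value(consensus, rows=[0, 1], cols=[1, 0])
print(presence_cutoff)
```

In a full run, the partition would be redrawn for each of the 1000 repetitions, and only models meeting the three adequacy criteria would be passed on to ranking and averaging.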
Although the ecological niche models were generated for the entire state of Veracruz, the analyses of elevation increase and of changes in the probability of occurrence were conducted only within the rectangle whose diagonal vertices are at 97°35'55.78''W, 20°28'20.67''N and 95°49'31.07''W, 18°39'41.6''N, which covers an area of 28,167.58 km². To identify whether the projected conditions in the analysis area have combinations of environmental variables similar to those of today, the "Mobility-Oriented Parity" (MOP) tool [9] was used.
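For readers who want to clip the shared rasters or records to this rectangle, the corner coordinates can be converted to decimal degrees as in the sketch below. The helper names `dms_to_decimal` and `in_bbox` are hypothetical, and the test point is an arbitrary illustrative location, not one of the records.

```python
def dms_to_decimal(degrees, minutes, seconds, hemisphere):
    """Convert degrees/minutes/seconds to signed decimal degrees.

    West and south hemispheres are returned as negative values.
    """
    value = degrees + minutes / 60.0 + seconds / 3600.0
    return -value if hemisphere in ("W", "S") else value

# Diagonal vertices of the analysis rectangle, as given in the text
west = dms_to_decimal(97, 35, 55.78, "W")
north = dms_to_decimal(20, 28, 20.67, "N")
east = dms_to_decimal(95, 49, 31.07, "W")
south = dms_to_decimal(18, 39, 41.6, "N")

bbox = (west, south, east, north)  # (xmin, ymin, xmax, ymax)

def in_bbox(lon, lat, box):
    """True if the point (lon, lat) falls inside the bounding box."""
    xmin, ymin, xmax, ymax = box
    return xmin <= lon <= xmax and ymin <= lat <= ymax

print(bbox)
print(in_bbox(-96.9, 19.5, bbox))   # arbitrary point inside the rectangle
print(in_bbox(-94.5, 18.0, bbox))   # arbitrary point outside the rectangle
```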

Conflict of Interest
There is no conflict of interest.