Data supporting the reconstruction study of missing wind speed logs using wavelet techniques for getting maximum likelihood

The data to construct the missing wind-speed value in the weather station record at “Collado de Yuste”, between the years 2002 to 2012, was calculated using wind speed data recorded in two other nearby weather stations, those in “Solana del Zapatero” and “Calar Alto”. The three mentioned stations are located in the mountain range of the province of Almeria, Autonomous Community of Andalusia, Spain. After calculating the degree of association using the correlation coefficient [1] and Wavelet Transform Scalogram [2], the data was successfully constructed. This paper refers to another study: Wind missing data arrangement using wavelet based techniques for getting maximum likelihood [3].


Subject
Renewable Energy, Sustainability and the Environment Energy Engineering and Power Technology Specific subject area Wind Power, Wavelet Transform to analyze wind speed records Type of data Table  Images How data were acquired The average wind speed data were acquired from three weather stations located at different points in the Sierra de Almeria, Andalusia, Spain.

Data format
Raw Calculated Parameters for data collection Average wind speed data was recorded using the anemometer at three weather stations.

Description of data collection
In three different meteorological stations in the highlands of Almeria, were recorded with its respective anemometers, the wind speed every 600 s (10 min). "Calar Alto" data was stored between January 1st, 2002 and December 31st, 2009. At "Solana del Zapatero" and "Collado de Yuste" stations data was stored between January 1st, 2002 and December 31st, 2012. All these records are in Original_Values.xls. The reconstructed record of "Collado de Yuste" is in the file Reconstructed_Values.xls. Both located in http://data.mendeley.com .

Data source location
Country: Almeria/SpainThe data were obtained from weather stations located in the province of Almeria, called

Value of the Data
• This data can be used to understand how to reconstruct missing data in wind speed records verifying their naturalness with respect to the original values. • This data contains essential detail concerning the non-stationary behavior of wind. Therefore, this research will contribute towards an improvement in our understanding of the wind energy field, which by 2023 will represent 12.4% of renewable energies [4] . • This data is a useful contribution to the prediction of wind behaviour and wind energy potential. • This data can be used for the preliminary assessment of wind farm projects' viability.

Data
The data set is stored on Mendeley's data website ( https://data.mendeley.com ) and is organized in two Excel sheet files: Original_Values.xls (file with average wind speed record in the three locations without applying any recovery technique) and reconstructed_wind.xls (file with reconstructed average wind speed record in the location of "Collado de Yuste"). The information set in Original_Values.xls contains data records of the average wind speed collected by the weather stations of "Calar Alto" at 2151 masl (located in the tab "Calar Alto"), "Solana del Zapatero" at 1116.1 masl (located in the tab "Solana del Zapatero") and "Collado de Yuste" at 1866 masl (located in the tab "Collado de Yuste") in the Sierra de Almeria, Andalusia, Spain. Shown in Fig. 1 .   To fill in missing CY data, we employ interpolation using data from nearby stations. For this purpose, is used the equation 1: The meaning of each element of Eq. (1) is explained in Table 1 .
The initial average hourly values during each of the eleven years in Table 2 and in the 12 months of the 11 years of measurement in Table 3 . After reconstructing the missing values in the CY register, Tables 3 and 4 are modified with the new values shown in Tables 4 and 5 respectively. The scalogram generated with the original CY record and with the reconstructed CY values, the FFT (Fast Fourier Transform) of the original CY information and the FFT including the reconstructed values, giving us a significant graphic contrast, we can see it graphically in [3] .

Experimental design, materials, and methods
The initial record of the "Collado de Yuste" weather station shows 62.108 lost data in wind speed average measurement. Through mathematical calculation based on interpolating data from the other two nearby stations, explained in [3] the record was reconstructed, lowering the loss rate to 447 data. Raw records from the three weather stations show missing data. The method used to reconstruct missing wind speed data from nearby records from other stations could contribute to improve the study for wind generation in Spain and anywhere in the world.