Introduction

Interest in forecasting the energy production of PV power plants has increased in recent years from the concern about climate change [1]. Rough estimation of the average energy produced by the PV power plants can be provided through estimation of global irradiance. Goh and Tan [2] developed statistical forecasting of solar data using time series model. Since then number of research work is available on forecasting solar irradiance [3,4,5,6,7,8,9,10]. Theoretically, prediction of global irradiance can be applied to forecast the energy production of PV power plant. The environmental factors influencing the performance of PV module are analyzed in the literature [11,12,13,14]. The effect of temperature on the performance of PV modules is analyzed in [15,16,17]. But, considerably lesser number of literature is present in predicting the actual output of PV power plant.

Some literature is available for estimating field performance of standalone PV array system [18,19,20,21,22,23,24]. Also some power efficiency models [25,26,27,28,29] can predict the average performance of a PV system under variable climatic conditions.

Ayompe et al. [29] used different models for PV-cell temperature and models for PV-cell efficiency to predict the accuracy of output power from PV module. The four- and five-parameter models were also investigated [30,31,32,33,34] and compared [35] based on the equivalent circuit of a one-diode model. Considering solar radiation and module temperature, Jones and Underwood [28] proposed a power output model of PV module. Neural networks model is more complicated which uses various inputs such as the solar radiation, ambient temperature, and module temperature [30, 31]. The design of neural network model is based on trial and error processes and requires past experience for successful implementation.

Good match between the utility load and the solar resource profile leads to a cost-effective system [29]. Most of the above-mentioned models require either detailed data [26, 30,31,32,33,34,35,36,37] and are complicated to use [35, 36] or restricted in economic performance evaluation [17,18,19,20,21, 41]. These limitations of the mentioned literature are barrier in easy manipulation of the system performance.

Another problem is that different types of PV technologies respond dissimilarly in the same climatic condition [38]. Santana-Rodŕiguez et al. [39] concluded that the performance of modules made of amorphous silicon technology is better than other technologies in Mexico City. Ronak et al. [40] observed that performance of amorphous silicon is well under Malaysia’s tropical hot and humid climate. Adiyabat et al. [41] showed that a PV module with a high-temperature coefficient, such as crystalline silicon, is advantageous for use in the Gobi Desert area.

Numbers of energy prediction models are available to predict the energy generation of different types of PV modules [42]. Dolara et al. [43] investigated the PV power output prediction using three mathematical models and considered both poly-crystalline and mono-crystalline PV module. Dolara et al. [44] and Leva et al. [45] investigated the P V power output prediction using neural network and hybrid model. Some of the renowned models used for prediction, design and economic analysis of PV power plant. Hybrid Optimization Model for Electric Renewables (HOMER), RetScreen, System Advisor Model (SAM), PVSyst, PVSol, PVWATT are the most popular PV production model. The first three models mentioned can be used for any type of renewable energy resources. Comparing with actual PV plant generation the mentioned models are showing less correlation. Reason behind the poor performance of HOMER and RetScreen models are use of average of 22-year satellite-measured meteorological data up to the year 2002 from National Aeronautics and Space Administration (NASA) renewable energy resource website (Surface Meteorology and Solar Energy). Taking average of 22-year data is not representative of the current weather condition. Typical Meteorological Year (TMY) is the most representative of weather condition for an entire year of a particular location. So TMY data should be used instead of average data. Another drawback is ambient temperature used in all the above-mentioned models is the whole day average temperature whereas only daytime temperature is needed for PV plant energy prediction. SAM model is using TMY file for meteorological parameter; however, its power output prediction is not very close for Indian condition. PVSyst and PVSol both have the ambient temperature issue and PVSol is limited for PV plant using up to 5000 numbers of PV panels only. PVSol is good for rooftop PV plant design where as PVSyst is better in Grid interactive plant. But the economic analysis of PVSyst is not good as per the literature. PVWATT is the latest one and easy model used for energy prediction but that is also not showing the expected accuracy in prediction of energy generation for Indian PV power plants. Therefore, a simple model with adequate precision is desirable for prediction of output from PV power plant based on different PV technology in different climatic zones of India.

The objective of this paper is to present a simple but accurate method for prediction of the energy production from PV plants based on different type of PV technology. This method is applicable to any type of climatic location of India to choose the best suitable technology for that specific location. Establishment of energy demand security needed a bridge between energy schemes and location adjuvant PV technology. Application of the finding of the paper through energy supply planning is helpful to make the energy generation economic and environment friendly. ‘Potential map of the location for PV power plants based on different technology and solar resource availability’ is effective for selection of PV technology for a particular location. Considering these aspects, the technology explained in this paper suggests the development of a location adjuvant map for solar PV plants which is adjuvant in energy supply planning. The scope of this study is to relate the location and solar energy to study the potential of the location for different types of solar PV plants. Prospective practical users of this method are people from energy industry, energy planners, engineers, city planners, and climate concern citizens.

The research is presented here in four stages. In the next section material and methods used for the research are described, such as collection of data for 300 grid points of India from NREL, manufactures data for different types of PV technologies, etc., followed by which the classification map of India based on global horizontal insolation (GHI) and daytime temperature is presented (till now all available resource maps of India are only based on insolation though temperature is another determining parameter for solar PV technology). In the subsequent section mathematical method for different PV technologies is described to predict the best suitable PV technology for a particular location. Finally, the validation of the method with the real-time energy output of 15 operating PV power plants of different PV technologies under the scheme of National Thermal Power Corporation (NTPC) Vidyut Vyapar Nigam Ltd. (NVVN) for the one whole year of 2014 is presented.

Methodology developed

Methodology used in this work consists of five different steps. They are concisely discussed as follows:

  1. (a)

    Division of Indian map in 1° × 1° grids.

  2. (b)

    Develop a solar resource map based on global horizontal radiation and daytime average temperature using hourly data of NREL-TMY file. The proposed map can help in selection of location for PV power plant considering two major parameters that affect the output of PV panel.

  3. (c)

    Classify India zone wise on basis of annual average GHI and day-time temperature.

  4. (d)

    Selection of commercially used PV technology.

  5. (e)

    Development of a simple mathematical model with adequate precision for prediction of energy generation from PV power Plant situated at different climatic zones of India.

  6. (f)

    Collection of location detail, secondary data of energy generation and capacity utilization factor (CUF) of operating PV power plants under the NVVN scheme for the whole year of 2014.

  7. (g)

    Validation of the developed mathematic model by comparing the predicted energy output with actual PV power plant generated output data under NVNN scheme.

  8. (h)

    Prediction of the energy generation and CUF of different types of commercially used PV technologies in classified zones of India.

Figure 1a, b depicts the general sketch of the developed methodology and developed energy prediction model, respectively.

Fig. 1
figure 1

a General sketch of the developed methodology. b Developed energy prediction model

Division of Indian map in grids

Mainland area of 3, 287, 263 km2 makes India one of the largest countries in the world. India is equally divided into two halves through the Tropic of Cancer. Pakistan and Burma are situated in the west, China and Nepal in the north to northeastern part and Bhutan in the northeastern part of India. The total length of the land boundary and the shoreline of the country are 15,200 km and 7517 km, respectively. The boundary of India is wide between 8°4′ and 37°6′(N) North Latitude and 68°7′ and 97°25′(E) East Longitudes with a measure of 3214 and 2933 km, respectively. Map of India is divided into 1° × 1° grids. This grid division of India is required for secondary data collection of GHI and temperature from TMY file of NREL. Andaman and Nicobar Islands in the Bay of Bengal and Lakshadweep Islands in Arabian Sea are also part of the Republic of India. But these two islands are not considered in the study as 1° × 1° grids are large enough for their area. So a separate study for these two locations is needed. Required data for 300 grid points are collected for classification of India on the basis of GHI and day-time temperature. Grid map of India used for data collection is shown in Fig. 2.

Fig. 2
figure 2

Grid map of India

Map of India is divided into 1° grids and magnified grids are also shown in Fig. 2.

Secondary data collection for climatic zone division of India

GHI and dry bulb temperature for 300 grid points of India are collected from TMY files of NREL website. It is a 10 × 10 km resolution data. In TMY, the month that is most representative of the location is selected for consideration. The month for which average radiation is most closely equal to the monthly average over the whole measurement period is representative of the TMY data for that month. This process is then repeated for each month in the year. The months are added together to give a full year of hourly samples. From hourly data of each day average GHI and average daytime temperature of each day is calculated for each grid points. Average of 365 days data is considered to calculate the yearly average GHI and average daytime temperature for each grid points. Hourly data of TMY file is analyzed to get the annual average GHI and only day-time annual average temperature of those grid points. Day-time average temperature is considered to reduce the error in temperature correction. Consideration of overall day temperature instead of only day-time temperature causes overestimation of generation from PV module.

A TMY data set provides designers and other users with a reasonably sized annual dataset that holds hourly meteorological values that typify conditions at a specific location over a longer period of time, such as 30 years. TMY data have natural diurnal and seasonal variations and represent a year of typical climatic conditions for a location. The TMY data set is composed of 12 typical meteorological months (January through December) that are concatenated essentially without modification to form a single year with a serially complete data record for primary measurements. This method is an empirical approach that selects individual months from different years occurring in the period of record. For example, in the case of the National Solar Radiation Data Base (NSRDB) that contains 30 years of data, all 30 Januarys are examined, and the one judged most typical by the TMY algorithm is selected to be included in the TMY. The other months of the year are treated in a like manner, and then the 12 selected typical months are concatenated to form a complete year. Adjacent months in the TMY may be selected from different years,

Final selection of a month includes consideration of the monthly mean and median and the persistence of weather patterns. The process may be considered a series of steps.

Step 1:

For each month of the calendar year, five candidate months with cumulative distribution functions (CDFs) for the daily indices that are closest to the long-term (30 years for the NSRDB) CDFs are selected. The CDF gives the proportion of values that are less than or equal to a specified value of an index.

Step 2:

The five candidate months are ranked with respect to closeness of the month to the long-term mean and median.

Step 3:

The persistence of mean dry bulb temperature and daily global horizontal radiation are evaluated by determining the frequency and length of runs of consecutive days with values above and below fixed long-term percentiles.

The persistence criteria excludes the month with the longest run, the month with the most runs, and the month with zero runs. The persistence data are used to select from the five candidate months the month to be used in the TMY. The highest ranked candidate month from Step 2 that meets the persistence criteria is used in the TMY.

Step 4:

The 12 selected months were concatenated to make a complete year and discontinuities at the month interfaces were smoothed for 6 h each side using curve fitting techniques.

Only for 10 grid points the analyzed annual average GHI and day-time temperature are shown in Table 1.

Table 1 Annual average GHI and day-time temperature of some grid points of India

Selection of PV technology

Selection of appropriate PV technology is the key for success of any PV power project. In laboratory scale, many new PV technologies are available such as organic solar cell, dye sensitized solar cells, etc., but they are not used commercially for PV power plant. Hence, mono crystalline silicon (Mono C-Si), polycrystalline silicon (Poly C-Si), amorphous silicon (a-Si), CIGS, CdTe PV technologies from reputed manufacturers are considered which are widely used for commercial PV power plant projects. Specifications of the selected PV technologies are collected from their datasheets. Required specifications of the selected PV technologies are enlisted in Table 2.

Table 2 Technical specifications of selected PV modules

Mathematical model to predict the energy generation

Output of PV module is dependent on the meteorological parameters mainly on insolation and ambient temperature. Energy outputs for fixed lilted modules are evaluated zone wise for different technology PV modules. Mathematical modeling is represented below.

Solar radiation on tilted PV array

Total solar radiation received by PV array which is tilted at a certain angle is known as solar radiation on tilted PV array. In NREL website, the long-term published data of solar radiation and other meteorological parameters are available as hourly average values on horizontal surface for different grid points of India. The solar radiation on a tilted PV array is the main input parameter for the design of the PV system. According to the Liu and Jordan formula using the hourly beam and diffuse radiation on a horizontal surface, the total radiation on a tilted PV array (at angle β) for a given latitude φ can be evaluated:

$$ I_{\text{t}} = I_{\text{b}} r_{\text{b}} + I_{\text{d}} r_{\text{d}} + \rho r_{\text{r}} (I_{\text{b}} + I_{\text{d}} ) $$
(1)

\( I_{\text{b}} \) is the hourly beam radiation on a horizontal surface; \( I_{\text{d}} \) hourly diffuse radiation on a horizontal surface; \( r_{\text{b}} \), \( r_{\text{d}} \) and \( r_{\text{r}} \) are known as conversion factors for beam, diffuse and reflected components, respectively; \( \rho \) is the reflection coefficient of the ground (0.2 and 0.6 for non-snow-covered and snow-covered ground, respectively).

$$ r_{\text{b}} = \frac{{{\text{Cos}}\theta_{i} }}{{{\text{Cos}}\theta_{z} }} $$
(2)
$$ r_{\text{d}} = \frac{{1 + {\text{Cos}}\beta }}{2} $$
(3)
$$ r_{\text{r}} = \frac{{1 - {\text{Cos}}\beta }}{2} $$
(4)
$$ {\text{Cos}}\theta_{\text{i}} = ({\text{Cos}}\phi {\text{Cos}}\beta + {\text{Sin}}\phi {\text{Sin}}\beta {\text{Cos}}\gamma ){\text{Cos}}\delta {\text{Cos}}\omega + {\text{Cos}}\delta {\text{Sin}}\omega {\text{Sin}}\beta {\text{Sin}}\gamma + ({\text{Sin}}\phi {\text{Cos}}\beta - {\text{Cos}}\phi {\text{Sin}}\beta {\text{Cos}}\gamma ){\text{Sin}}\delta $$
(5)
$$ {\text{Cos}}\theta_{\text{z}} = {\text{Cos}}\phi {\text{Cos}}\delta {\text{Cos}}\omega + {\text{Sin}}\phi {\text{Sin}}\delta $$
(6)

\( \theta_{\text{i}} \) is angle of incidence, \( \theta_{\text{z}} \) zenith angle; \( \phi \) the latitude angle; \( \beta \) the tilt angle; \( \gamma \) the azimuth angle; \( \delta \) the solar declination angle; \( \omega \) is the hour angle.

$$ {\text{Insolation}}\,{\text{on}}\,{\text{tilted}}\,{\text{surface}},\quad I_{\text{T}} = \int\limits_{{S_{\text{R}} }}^{{S_{\text{S}} }} {I_{\text{t}} } $$
(7)

\( S_{\text{R}} \) is the sunrise time; \( S_{\text{S}} \) is the sunset time.

Daily PV array output is the summation of the hourly energy output.

Equivalent full sun-shine hours on tilted surface

No. of hours for which the intensity of solar radiation incident at a place is kept constant at its peak value of 1 kW/m2 is known as equivalent hours of full sunshine (\( h_{\text{et}} \)). When insolation on tilted unit area surface is expressed by \( I_{\text{T}} \) kWh/m2/day, then it can be expressed as constant peak value of solar radiation of 1 kW/m2 incident on a receiving surface for \( I_{T} \) h, then \( h_{\text{et}} \) will be equal to \( I_{\text{T}} \) h/day. The expression is given by Eq. (7)

$$ h_{\text{et}} = \frac{{I_{\text{T}} }}{{{{ 1\,{\text{kW}}} \mathord{\left/ {\vphantom {{ 1\,{\text{kW}}} {{\text{m}}^{ 2} }}} \right. \kern-0pt} {{\text{m}}^{ 2} }}}} $$
(8)

\( h_{\text{et}} \) is the equivalent full sun-shine hours on tilted surface; \( I_{\text{T}} \) is Insolation on tilted surface in kWh/m2/day.

PV panel output after temperature correction

Power output of PV panel after consideration of reduction in power output from PV panel due to deviation of its temperature from 25 °C is known as PV panel output after temperature correction. The output of PV module is dependent on solar radiation and the operating temperature of PV module. Rise in the PV operating temperature with respect to STC condition, reduces the energy output from PV module. Operating temperature of PV module can be calculated by means of ambient temperature, incident solar irradiance on PV module at given location and Nominal Operating Cell Temperature (NOCT) of the particular technology [46,47,48,49].

$$ P_{\text{otc}} = P_{\text{m}} \left[ {1 - \left\{ {T_{\text{a}} + \left( {\frac{{({\text{NOCT}} - 20)}}{800} \times I_{\text{t}} } \right) - T_{\text{STC}} } \right\} \times \gamma_{\text{P}} } \right] $$
(9)

\( P_{\text{otc}} \) is the panel output after temperature correction; \( P_{\text{m}} \) the power output at STC; \( T_{\text{a}} \) the ambient temperature in  °C; \( {\text{NOCT}} \) the nominal operating cell temperature; \( I_{\text{t}} \) the irradiance in W/m2; \( T_{\text{STC}} \) the temperature of the PV module at STC in °C; \( \gamma_{\text{P}} \) is the temperature coefficient of power for PV panel in %/°C.

Generated energy from PV plant feed to the grid

Total generated energy from PV array that is fed to the electricity grid is known as generated energy from PV plant feed to the grid.

$$ E_{\text{PVP}} = P_{\text{otc}} \times h_{\text{et}} \times (1 - D_{\text{f}} )(1 - H_{\text{f}} )(1 - W_{\text{l}} )(1 - m_{\text{l}} ) \times \eta_{\text{inv}} $$
(10)

\( E_{\text{PVP}} \) is the energy output from PV plant; \( D_{\text{f}} \) the dust factor in %; \( H_{\text{f}} \) the humidity factor in %; \( W_{\text{l}} \) the wiring losses in %; \( m_{\text{l}} \) the mismatch losses in %, \( \eta_{\text{inv}} \) is the inverter efficiency in %.

Performance ratio

Performance ratio is defined as the ratio of the array yield to the reference yield. Array yield is defined as the total energy generated by the PV array for a defined period divided by the rated output power of the installed PV array. Reference yield is defined as the ratio of total in plane solar insolation to the reference irradiance. Array yield and the reference yield both are time-dependent parameters. The performance ratio (PR) is a system performance index that indicates the overall effect of losses on the array’s rated output due to array temperature, incomplete utilization of the irradiation, and system component inefficiencies or failures (IEC 61724).

$$ {\text{PR}} = \frac{{E_{\text{PVP}} }}{{A_{\text{a}} \times I_{\text{T}} \times \eta_{\text{Panel}} }} \times 100 $$
(11)

\( {\text{PR}} \) is the performance ratio in %; \( A_{\text{a}} \) the active area of panel; \( \eta_{Panel} \) is the solar panel efficiency in %.

Energy yield factor

Energy yield factor (YF) of the PV system defined as the portion of the daily net energy output of the entire PV plant which is supplied by the array per kW of installed PV array.

$$ {\text{YF}} = \frac{{E_{\text{PVP}} }}{{P_{\text{m}} }} $$
(12)

\( {\text{YF}} \) is the yield factor.

Capacity utilization factor

Practical output of a power plant over a period of time divided by its potential output of power plant if had operated at full nameplate capacity for the entire time is known as net capacity utilization factor (CUF) of that power plant. To calculate the CUF, the sum of energy production in a particular period considered and divides by the amount of energy the plant actually capable of produce at its full capacity.

Collected data of solar PV projects in India

Under the scheme of Jawaharlal Nehru National Solar Mission (JNNSM) numbers of Solar PV plants are already installed in India and many other are going to be installed to fulfill the target of JNNSM. For successful execution of JNNSM, NTPC Vidyut Vyapar Nigam Ltd (NVVN) [50] has given the responsibility to prepare the guidelines for selection of developers for commissioning grid connected solar power projects in India. Location and monthly field measured generation data of fifteen existing PV plants of different technologies for one whole year of 2014 in India are collected from the project installer [50]. Some of the PV plants details are tabulated in Table 3.

Table 3 Details of some PV plants in India

Results and discussion

Results and discussion part is sub-divided into three parts. (1) Classification of India in different climatic zones considering on GHI and day-time temperature. (2) Prediction of energy output and CUF from different PV technologies in different climatic zones of India using mathematical method. (3) Validation of the mathematical method with practical field output data of 15 PV plants of different technologies.

Classification of India in different climatic zones

To know the energy output from different photovoltaic technologies in different zones of India, first step is to classify India in different climatic zones on the basis of annual average GHI and day-time temperature, the major two factors affecting the output of PV technology. Indian map is divided into grids by its longitude and latitude. For each degree apart one point is selected as grid point. Thus, 300 grid points have been selected. Secondary data of TMY file from NREL website [30] are collected for those 300 grid points to classify India in 15 climatic zones depending on annual average GHI and day-time average temperature. For 300 grid points daytime and whole day annual average temperature is also calculated to know the difference between these two temperatures. This temperature difference causes error in most of the prediction techniques as whole day average temperature is considered in most of the PV output predicting softwares, such as RetScreen and Homer. Figure 3 shows the daytime and whole day average temperature for the selected grid points.

Fig. 3
figure 3

Day-time and whole day annual average temperature for 300 grid points of India. Maximum 5.16 °C temperature difference is observed between these two temperatures

Classification of India in different climatic zones depending on GHI and day-time average temperature is presented in Fig. 4.

Fig. 4
figure 4

Classification of India on the basis of annual average GHI and day-time temperature

Energy output (kWh) of different PV technologies in different climatic zones of India

Annual average energy output and CUF of 1 MWp power plant of different technologies in different climatic zones is calculated using 2.4.4 and 2.4.7. All the related data like temperature coefficient of power and NOCT value for different PV technologies are used from the data sheets of manufacturers to calculate the energy output.

It is seen that in all climatic zones energy generation and CUF is maximum for 1 MW a-Si PV power plant (Tables 5, 6).

Table 4 Name of the climatic zones
Table 5 Annual average energy output range and CUF at different climatic zones of India for different PV technologies

Validation of the mathematical method

Depending on the availability of actual field data 15 PV power plants are selected. It is found that all the 15 selected PV power plants of different technologies are situated in the same climatic zone, i.e., Zone 14 (very high GHI high temperature). Among the selected 15 PV power plants, plant 1–9 is based on Multi C-Si, 10 is of a-Si, 11–13 is based on CdTe technology and 14–15 is based on CIS technology. For validation of the mathematical method actual field output of PV plants are compared with the predicted value. 15 different PV power plants are of different capacity, so exported energy of the PV power plants are normalized to 1 MW power plant. Validation table is presented in Table 6.

Table 6 Validation table of mathematical model

Table 6 shows that actual generation and CUF of all the selected PV plants are in the range of predicted value got through mathematical model. Percentage errors in prediction of energy generation and CUF for all the fifteen PV plants are also calculated. Results indicate that maximum percentage error in prediction of annual energy generation is 10.29% where as minimum is with a value of 0.08% for plant 2. So, mathematical model is in good correlation with the actual field output and CUF of PV power plants.

Conclusion

The mathematical model used for technical feasibility study of PV power plant is validated collecting the actual field output data of 15 PV power plants of different technologies. Findings of the proposed work are tabulated below:

  • For multi-C-Si predicted energy generation and CUF is 1,602,413–1,943,816 kWh/MW and 18.29–22.19%, whereas actual field generation and CUF are 1,690,376.2–1,955,562 kWh/MW and 19.41–22.19%, respectively. So actual field output is in the range of prediction.

  • For a-Si actual field energy generation and CUF are 1,853,285.2 kWh/MW and 21.54%, respectively. These values are also within the prediction range.

  • Actual energy generation and CUF of CdTe-based PV plants are 1,807,359–2,024,909.2 kWh/MW and 20.82–23.80%, respectively, which is also within the range of prediction.

  • CIS-based PV power plants export annually 1795320–1,864,970 kWh/MW energy to the grid which is within the prediction range.

So, technical feasibility model is in good correlation with the actual field output of PV power plants. This validation would be more trustable if more numbers of a-Si, CdTe and CIGS-based PV power plants data can be compared with predicted data. Due to unavailability of more practical data and as PV plants are not installed in all climatic zones of India till now, further validation in all climatic zones is not possible now. The ministry has future plans for installation of PV power plants all over India; therefore, much better validation of the work will be possible.