1 Introduction

Global economic expansion is increasing the power market demand, causing a negative influence on the environment. The wholesale price of electricity has become an important aspect of the energy sector. Electricity is mainly traded in auctions known as power exchanges or pools, where electricity-generating companies provide energy together with pricing rates that can be bid on by essential consumers. Solar photovoltaic (PV) emerges as an alternative energy capable of meeting a greater percentage of global energy needs. Germany has developed by 20% for electricity generation and Japan is generating more PV power in the world, and 6.5% of global PV generation comes from this country. PV has become more cost-effective and the development of inorganic PV material aids in the efficient production of next-generation solar cells [1, 2]. As a result, operation and maintenance costs (O&M) have a critical impact on electrical and power module profitability margins, because energy market participants must forecast solar power in the medium and long term [3, 4].

The electricity price forecasting has become an important aspect of the energy sector [5], and Machine Learning (ML) Artificial intelligence (AI) algorithms approaches are widely implemented to recognise different patterns [6,7,8]. Despite the significant progress made using AI for PV generation, different challenges must be addressed to be resolved by future research focused on promising techniques based on AI, e.g., Explainable Artificial Intelligence (XAI) and novel hybrid techniques that reduce the weaknesses of current ML techniques to afford new PV operational scenarios required to reach the competitiveness in the energy sector [9, 10]. Different techniques for PV generation must work in a dynamic parameter environment, implying that agents cannot have complete awareness of the parameters and environment [11, 12]. Some research in the literature reviewed the issue of software failures, that may lead to inaccurate approximations [13,14,15]. The use of partial quantitative measurements and inconsistent information on the solar production environment in AI models can allow agents to operate in a variety of situations and restrictions, both at the users and the service provider levels [16]. The reviewed research literature demonstrates that several models have been employed for solar generating, with the benefits and drawbacks explored [1, 17, 18], demonstrating that there is a need to establish models and findings that are generally applicable to a wide variety of scenario. It is also required that the results are accurate, being the lack of modelling details a key problem across a wide part of the reviewed literature [19]. The accuracy of estimating PV system performance is constrained through the use of configuration models [20, 21], e.g., the Multiple Linear Regression (MLR) model is less effective than Artificial Neural Networks (ANN) in detecting hidden layers [22]. The performance of models has been affected by the dynamic parameters used for checking the efficiency of PV generation, showing low performance that affects the validity of this research study [23]. There is growing interest in identifying AI and ML to integrate real-time responses from other power channels interfacing with the power system, and this is considered to play a significant role in the future [24]. The adoption of more multi-agent frameworks, where agents can function in a simulated, stochastic environment while also relaxing assumptions about the preferences and expectations of participating entities, could be a promising approach for studies [25].

The fundamental objective of this study is to develop a new framework that can stimulate human-machine collaboration to estimate daily power prices in the wholesale market [26, 27]. The main novelties of this study are as follows:

  • Analysis of different frameworks: The comparison of the power production using different PV cells is understood with the help of the review of different research papers. This section proposes a novel analysis of dynamic parameters used for checking the efficiency of PV generation that have a relevant influence on the performance of the models. The objective is to demonstrate that poor model performance affects the validity of research studies.

  • Analysis of various models with different parameters in PV generation: Several models are used in the field of electricity forecasting, but these models have their advances and limitations. The objective of this study is to improve the traditional technologies in terms of accuracy and precision of prediction.

  • Analysis of the state of the art: Several researchers have proposed different models for the same purposes. Recent research work done in electricity forecasting and the PV generation domain has been reviewed and analysed to select the best model and find novel approaches [28,29,30]. One of the most relevant novelties of the paper is an analysis of the state of the art performed in three different phases, that has not been found in the current literature:

    • Review-based analysis: The most recent studies in chronological order have been studied to analyze the main areas of the global energy industry under current investigation. The objective is to detect main trends and novel research lines.

    • Method-based analysis: The volume and variety of new studies and methods have grown exponentially in recent years, compared to traditional technologies. This phase analyzes new optimized and hybrid models to detect the most relevant and fastest-growing techniques.

    • Result-based analysis: The research outcomes achieved by the researchers according to the models they used, have been analysed in this phase.

This paper is structured as follows: Sect. 2 develops an analysis of the current state of the art based on different reviews, methodologies and results. Section 3 presents the overall discussion of the research, where all the applied methodologies and algorithms are summarized to extract relevant data about the state of the art. Finally, the main conclusions are provided in Sect. 4.

2 Related Work

2.1 Review Based Analysis

A new deep learning model is used for the enhancement of visual ability in PV generation [31]. The improvement of accuracy is the primary objective of the mainly with real-time data analysis [32, 33]. Figure 1 shows the basic process of the PV generation process. The radiant energy is absorbed by the PV panel or grid of PV panels [34]. This radiant energy is then converted into DC power and then it is passed to the power electronic converters [32, 35]. These converters convert the DC power into the grid frequency power [36].

Fig. 1
figure 1

General diagram of PV generation

Camargo and Schmidt [37] presented a survey to collect data from installations in Chile. The proposed hypothesis of the research focused on four data sets from PV installation. The multi-annual time series was simulated in the research, using forecasting and neighbour methods. It was demonstrated that deseasonalized aggregated data used to have a better correlation. The older MERRA-2 reanalysis dataset is the most critical limitation and the global re-analysis of the data set is one of the main advantages of this research. Guo et al. [38] suggested that PV plan installation is the most reliable and stable plan worldwide, and for this case study, solar power is used to limit the plan for PV generation. A prediction model algorithm is used in this research, analysing a short-term PV model. Boosting and bagging approaches were applied together with meteorological parameters and historical data as the main dataset. It is indicated that the cutting-edge plan of AI has predicted a new PV model. The single model was applied to act as a limitation in this context.

The primary objective of the study presented by Ogawa, and Mori [39] is to identify PV forecasting with an efficient method as a forecasting plan and a Multi-layer Perceptron (MLP) neural network. Two approaches based on ANNs and a statistical approach were implemented. The used dataset was distributed, and the error was normalised as a result of the MLP-based method. Under this PV plan, there is an issue or limitation such as Economic Load Dispatching (ELD) and Unit Commitment (UC). A Deep Neural Network (DNN) based method has been proposed by Dorokhova et al. [40], determining that PV generation has reduced the energy crisis and carbon emission plan positively. The control method, model predictive control and a reinforcement learning control data-driven approach were applied to a 2-year dataset. The electric vehicle charging problem was reduced by using reinforcement learning control, being confirmed that maximisation of PV consumption is highly required.

The primary objective of the research developed by Beltrán et al. [41] is to promote a novel framework to encourage the collaboration of human machines to forecast the price of electricity daily in the wholesale market. Model agnostic, time series analysis and time decomposition, STL, Holt-Winters, Bates and Granger method, Aiolfi and Timmermann method were applied to develop different approaches: loop approach, non-linear approach, and robust approach. Data stories depend on a model-agnostic method. The proposed framework was limited in grasping the bounds of point forecasts by marking the point intervals although it provided satisfactory results by increasing the computational power. Long Short-Term Memory (LSTM) framework was proposed to control the fault of ML algorithms, that are applied only to a huge volume of information [42]. Two-stage hybrid method, wrapper method, ML, deep learning, statistical methods and other forecasting approaches were applied. The robustness of LSTM was comparatively better than normal LSTM, demonstrating that a single framework is unable to predict PVPG accurately due to the bounds of the stand-alone process. Operators can solve time series forecasting issues due to the proper utilization of LSTM. PV generation is an effective solution that enables people to overcome the current energy challenges [36, 43]. Representing a relevant solar irradiation predictor that combines the benefits of ML and the optimization of temporal and spatial parameters is a survey objective of Rodríguez et al. [31]. ML and statistical methods, single model and AI approach were the main models. The accuracy of the results of feed-forward neural networks (FFNN) was better than the persistence model. The major limitation is the high training requirements for FFNN and spatial-temporal model but it is demonstrated that the solar irradiation predictor was able to accurately measure certain changes.

Dynamic power flow, regression, and two core-casting methods were chosen together with the DL approach with time sequence and time-series data. Distribution losses have been found while using secondary and primary LSTM systems. The major drawback of this proposed mechanism is the absence of voltage collapse mitigation and limit violation according to Yin et al. [44]. Visser et al. [45] proposed the performance analysis of 12 different models for forecasting the day-ahead power production with an agreement to the market conditions. The objective of the research was to examine the effect of multiple PV systems with a variation in the inter-system distance on the performance of forecasted models. The method of clean data has been used to analyse 12 different models with their pre and post-processing phases. The direct and indirect approach has been chosen to analyze the PV model, regression and ML-based models. The dataset is formed by PV power output from 152 PV systems located in the Netherlands. The result of the research showed that density and spatial factors presented a positive effect on the performance of the models. The limitation of the research was related to the testing conditions as many cases are of a single site or single aggression of several systems. The advantage of the research was related about the managers of the power grid will be able to minimise the grid imbalances with the application of the models.

The research developed by Gowid and Massoud [46] designed a tool to identify robust and practical PV Maximum Power Point (MPP) with the use of reliable experimental data. They performed a comparative study of input scenarios and developed a novel tool, being necessary to develop a correlation between the current at the maximum power point \( {I}_{MP}\) and the maximum power voltage \( {V}_{MP}\) with PV electrical, thermal, and meteorological variables. The MPP approach was implemented to conduct the research, and the results displayed a decrease of 74.3% in MSE of \( {V}_{MP}\) and a reduction by 95% in the MSE of \( {I}_{MP}\). The parameter combination scenarios were restricted to maximize the accuracy in the research, although this tool will help to maximise the power output. A novel research was developed by Tian et al. [47] to analyse the feasibility and necessity of connection of PV power generation introduction into rail transit power supply. The authors applied the indirect forecasting method for forecasting the load of PV power generation. The LSTM neural network approach has been adopted in the research and the data of the research has been collected from a PV power station every 10 min for 207 days. The LSTM neural network results presented elevated levels of fitting and high prediction accuracy. There is no research limitation in the paper and the benefit of the research is that the capability of the LSTM neural network is well known. The application of ML and data-driven techniques at the level of monitoring, controlling, optimisation and fault detection of power generation systems was applied by Sun and You [48]. The main objective is the analysis of ML and data-driven techniques in terms of flexibility and profitability. ML and data-driven methods were used together with FT, Bayesian nonparametric and Markov chain Monte Carlo (MCMC) approaches. The results showed that big data ML-based methods of regression seemed to be more powerful in characterising nonlinear multivariable systems, although this control method had a limitation of dealing with high uncertainty and nonlinearity. The research will benefit in showing the ML-based data in control, visibility, profitability, and safety during power generation. These factors are essential to ensure a proper data analytics process, and Fig. 2 illustrates how much these factors can be achieved through the ML techniques applied in this section, showing a basic comparison of ML usage in the aspects of visibility, manoeuvrability, flexibility, profitability at intervals of significance of these features.

Fig. 2
figure 2

Comparison of various ML factors in terms of usage and accuracy

PV forecasting is another relevant research topic. The research developed by Behera et al. [49] analysed short-term PV power forecasting by the Empirical Mode Decomposition (EMD) technique. The objective of the research is to construct a 3-stage approach and analyse the PV power forecasting with the constructed approach. Statistical methods were used to analyse the historical solar irradiance data and the 3-stage approach of the EMD technique was implemented. The result of the analysis of the collected data showed that EMD techniques have performed better than the conventional technique with the address to short-term PV power forecasting. High reliability, optimal placement, acceptable power quality and low-cost operation are a few factors that are the limitations of the application of solar power. The background of the research is based on the implementation of the ML network for yielding promising solutions in pattern recognition and low estimation bias. According to Khodayar et al. [50], the objective set for the study is to discriminative deep models to estimate future solar energy as per the historical measurements. The statistical method of generative variation based on autoencoders (AE) was used to estimate the disaggregated BTM signals. The Pecan Street dataset was used to carry out the research, and the numerical results of the real-world Event Detection (ED) dataset showed significant improvements in ED accuracy to sparse coding approaches. There are problems of energy disaggregation in PV penetration in the sparse coding approaches but the application of multiple deep ANN can increase the prediction accuracy.

The development of Floating Solar PV (FSPV) systems is in the nascent stages of long-term performance and its feasibility was not effectively addressed by Goswami et al. [51]. The analytical method used for the mathematical modelling of the PV cell to develop a computational method was based on evolutionary algorithms for the assessment of FSPV. The numerical approach was used in the research for the determination of parameters although there was no dataset applicable. The proposed model applied on 100 FSPV modules showed superiority in the estimation of its parameters and the efficiency of the model decreased under low irradiance. The research findings will help in the sound judgments of the scientists with the implementation of the FSPV systems. The discovery of the inorganic PV material helps in the effective ways of generating new-age solar cells [52]. Predicting the efficiency of inorganic PV materials with relevant ML is the primary objective of the survey. The use of ML method was used along with density functional calculations in material discoveries. The ML approach was issued for the study to recognise the patterns between materials using three datasets as the input, training, and production datasets. The results showed that the ML method was an effective method for fast atomic level prediction of PV materials [52]. The author did not point out any disadvantages related to the study. The application of ML helps in the accuracy of prediction regarding the PV materials of different crystal structures. The use of solar energy is used by countries to decrease ecological hazards. The Root Mean Squared Error (RMSE), mean absolute error and R2 model were used to explore the relationship between numerous input parameters and solar PV using ML models [9]. ML and predictive approaches have been used for the research, using experimental datasets. The results showed that the proposed ML approach was accurate at predicting the power of different solar PV panels. The effect of dust and wind on solar PV is still incomplete in terms of experiments, but the Support Vector Machine (SVM) and the Gaussian Process Regression (GPR) models enhanced the forecasting of solar PV power.

The lack of weather information affects the theoretical power calculations and focuses on the whole power station rather than the arrays, according to Liang et al. [53]. A data-driven method was used along with the Extreme Learning Machine (ELM) model to solve the problem through the status evaluation method of arrays. Six arrays were selected on a random basis from the large-scale power station in China and a statistical approach was implemented for the evaluation of the data. The results demonstrated that the status assessment accuracy was 90%, confirming its effectiveness. The main contribution of this study is the identification of the proposed method to reflect the status of PV arrays, although the evaluation of the degradation of the PV arrays was not possible through the current method. The primary objective of the analysis carried out by Kalogirou and Sencan [54] is to analyse the international PV market and forecast the market demands and developments in 2010. The basic methods used in this research calculated manufacturing costs, identified production efficiencies, and then forecasted the PV world market. A profitable approach and single crystal silicon module were implemented in this research. The dataset for this research is formed by datasets of the PV world market, PV production efficiencies and manufacturing PV costs dataset. Due to the practical limit of advanced technology, forecasted data on the development and market demand of PV may not be accurate. The focus of the research is to examine the possibility of utilising AI in forecasting the production of PV energy using Auto-Regressive Moving Averages and regression methods. Energy production management, moving averages, data-driven modelling, and classical approaches have been implemented by Maycock [55] to forecast the production of electricity. The initial Production Dataset (PD and SI) was inspected in this research. The simulation result showed that MFNN can be effectively used by operators to predict the energy production by the unit of PV production. Lack of better precision determines an enhanced complexity of the ANN model resulting in decreasing the lesser accuracy value in predicted data.

Hossain et al. [56] presented a forecasting algorithm with an LSTM ANN to predict PV power generation. This work applied a statistical analytical research method with three years of (2016–2018) based simulation. The implementation of synthetic irradiance forecast can improve by up to 33% in accuracy compared to using the hourly categorical type of sky forecast. The superiority of the LSTM ANN with the proposed features was tested by exploring other machine learning algorithms, such as the recurrent neural network, the Generalised Regression Neural Network (GRNN) and the ELM. Almomani et al. [57]. presented a method for modelling PV arrays based on AI techniques, specifically Genetic Algorithm (GA) and Cuckoo Optimisation Algorithm (COA). The adopted models using GA and COA were implemented in a simulation platform using MATLAB environment for two-diode and two-diode and single-diode models. The obtained models were tested and validated with experimental data taken from the PV power plant at Mutah University. The results showed that for both optimisation algorithms, the two-diode model was more accurate than the single-diode model. The results also revealed that, at different values of temperature and solar irradiance, the COA handled the optimisation problem better with low iterations and better fitness value compared to the GA.

Solar energy is extremely dependent on climate and geography, and fluctuates irregularly, making the integration of PV into power networks problematic. Ahmed et al. [28] reviewed and evaluated contemporary forecasting techniques. Input correlation analysis revealed that solar irradiance is most correlated with PV production, and for this reason, meteorological classification and cloud movement studies are crucial. Normalisation, wavelet transforms, and augmentation by a generative adversarial network were recommended for network training and forecasting. The authors discussed the established performance evaluation metrics Mean Absolute Error (MAE), RMSE and mean absolute percentage error, with suggestions for including economic utility metrics of economic utility.

Different optimization and control techniques were widely proposed to address four issues including intermittent power supply, low conversion efficiency, the nonlinearity of PV system, and high fabrication costs [10]. Institutive method, ML methods (supervised and unsupervised learning methods), ANN forecasting method, simulated Annealing, SVM, and Harmony Search approach were applied to a large dataset of solar radiation. SVM was the best fit for reducing the RMSE and MAE values. The calculated RMSE value was 12.41% and the MAE value was 6.95% after using the SVM technique. Weather prediction in a geographic region may not be accurate because it shows a minimum error. The main objective of this research developed by Das et al. [58] is to inspect the progress of solar technology development and explain the innovative path. The Wille method and hierarchical clustering method were applied together with silhouette validation, and paragraph vectors doc2vec and word2vec, together with ML approach. The experimental dataset demonstrated the impacts of changing parameters on the system of storage and limitations in installing and purchasing solar power. Solar power technology may reach competitiveness in the global marketplace following this research.

Kaliappan et al. [59] used ANNs to predict the reduction in energy consumption of buildings that would result from installing a new photovoltaic system. This research studies the efficiency of the Elman Neural Network (EN) method, FFNN, and GRNN. The findings of this paper showed that forecasters using ANNs improve accuracy when employing the previous methods (EN-FFNN-GRN). A review of the AI-based Maximum Power Point Tracking (MPPT) in the solar power system was carried out by Yap et al. [60]. Since conventional MPPT techniques are unable to track the global maximum power point under Partial Shading Conditions (PSC), it is necessary to introduce artificial intelligence techniques to enhance this method [34, 61]. A model-based approach has been used to review the specific MPPT open-circuit voltage. Irradiance data has been employed to improve the accuracy of the MPPT prediction. The results demonstrated that all AI-based MPPT techniques showed faster convergence speed, lower steady-state oscillation and higher efficiency compared to conventional MPPT techniques, although the computational cost was also higher. This work provided a detailed comparison of popular highways MPPT techniques for solar power systems. Zhang et al. [62] applied Deep Convolutional Neural Networks (DCNN) with high-resolution weather forecast data to analyse the cloud movement pattern and its effect on solar power generation forecasting for solar farms. In this research work, the error rate is significantly reduced in comparison with other methods, e.g., the persistent model, the Support Vector Regression (SVR) model and convolutional neural networks. Therefore, it is exposed that solar forecasting DL outperforms sophisticated physical models.

Ai-Habahbeh et al. [63] used Auto Regressive Moving Average (ARMA) and Autoregressive Integrated Moving Average (ARIMA) methods to achieve a large improvement in PV performance. The parameters of the CNN participation model also optimised the performance. The study aimed to build a suitable model and test it using a simulation method. The result of the study showed a strong influence of temperature and the amount of sunlight, being the main advantage of this approach the reduction and elimination of waiting time. In addition, energy generation was discussed as a vital outcome that is essential for climate change. Digital twinning is a novel technique that provides a virtual representation of a real-world physical system or process (a physical twin) that functions as an indistinguishable digital counterpart for practical purposes such as simulation. The research work developed by Mazhar et al. [64] provided a systematic review of the integration of Big Data, ML and AI techniques with digital twinning. This paper focused on the role of big data and AI-ML in the creation of Digital Twins (DT) or DT-based systems for various industrial applications. The final section highlights the research potential of the IA-ML for digital twinning, mentioning different challenges and current opportunities.

Different ways for the operation of smart amplification building systems were presented by Hussin et al. [65]. The authors maximised the use of solar energy and improved the transformation efficiency by using two fundamental MPPT methods, e.g., incremental conductance method and disturbance observation method. According to the obtained results, the MPPT algorithm was used to control the voltage level and restrain the current of the photovoltaic cells at the minimum value, and the actual conversion power was limited affecting the output power of the PV cells. It is concluded that solar energy is cost-effective for commercial purposes due to the reduction of the overall costs of electricity generation and consumption. Gligor et al. [66] applied a Multi-Layer Feedforward Neural Network (MLFFNN) to forecast PV production. Different configurations of the MLFFNN were compared to forecast one-day production based on a one-day versus a ten-day regression window. The accurate results determined that this particular type of ANN was suitable for the forecasting of energy production generated by PV modules, obtaining better precision in the prediction with this regression window.

The aim of Kihlström and Elbe [67] was to acquire conclusions about PV technology in the market through market interventions over 40 years. A systematic literature review of peer-reviewed studies on PV technology and market interventions from 1979 to 2019 was provided. Solar PV technology has faced several financial and structural market barriers, including stable governmental market interventions. This article predicts that PV can be an “attractive energy alternative” in the future and a core technology that can develop the specific segment in a solar PV system. The aim of the study performed by Tived [13] is to obtain a review of AI in the solar PV value chain, its current application and future perspectives. Systematic search and statistical combination of quantitative study have been used with a hybrid approach using SVR, Particle Swarm Optimization (PSO) and SVM. The data was presented as a time series, applying different evolutionary algorithms, e.g., PSO and GA. The potential of AI systems and applications promoted the objectives of the European Green Deal, being illustrated through numerous scenarios that direct and indirect adverse have a relevant impact on the environment. The authors also concluded that environmental policies and regulations should be focused on creating a better regulatory vision. Gailhofer et al. [30] discussed specific funding needs for research and development since it is improbable that AI applications can be developed to support government activities related to monitoring, planning or infrastructure without public funding.

The work of Kalaiarasi et al. [68] applied MPPT to track the maximum available power of solar PV systems. A comparative study of various AI-based MPPT techniques, e.g., Fuzzy, ANN and Adaptive Neuro Fuzzy Inference System (ANFIS), was carried out. The output of the PV module interfaces with the resistive load through the Z-source inverter that boosts the input voltage and provides an AC output voltage. This configuration is implemented to find out the experimental remark regarding the PV array. Under the context of the method, modelling PV has been used by Ghannam et al. [69]. A novel approach of AI combined with cell dataset information was implemented. Optimisation algorithms have been developed through the diode model, being the boundary limits of the segment as the main limitations while maximum valid stimulation is the main advantage. Climate perturbations are increasingly unpredictable and powerful, being required revisions of the global horizontal irradiation and direct normal irradiation calculation models. Othman et al. [70] used DL as a tool to forecast the behaviour of the production of any geographical site. Different databases from NASA and the Tunisian National Institute of Meteorology were deployed for the climatic parameters of the study region of El Akarit, Gabes, Tunisia to achieve this objective. The use of DL algorithms validated previously made estimates of the energy potential in the studied region. Different techniques were applied and compared, obtaining an overall accuracy rate of around 75%.

A real grid-connected Seawater Desalination Plant (SWDP) in Egypt is implemented using MATLAB/SIMULINK by Ebrahim et al. [71]. The developed power plant consists of a PV array, a DC/AC converter, the load and the grid. Three MPPT controllers were explored in this study to improve the dynamic performance of the proposed grid-powered SWDP to address the low conversion efficiency of the PV system. Incremental conductance along with three artificial optimisation techniques (PSO, grey wolf optimisation and Harris hawk optimisation) were developed for the dynamic performance evaluation of the presented PV-powered SWDP. The results obtained from the three methods were promising in extracting maximum power with minimum error from the PV system and improving the performance of the SWDP The objective of Khan et al. [72] is the revision of the AI-based nonlinear integral back-stepping control approach for maximum power extraction of stand-alone PV system using buck-boost converter. The results from the simulation showed that the integral back-stepping technique outperforms the conventional Perturb and Observe (P&O) MPPT technique in all scenarios under varying load and environmental conditions with faults and uncertainty in terms of fast reaction time and minimum tracking error. Kahdka et al. [73] proposed an ML-based decision-making model scheme to optimise the cleaning interventions of PV modules. This paper analyses different PV panel cleaning practices for different types of PV cells and different cooling methods, identifying the main parameters on which to base cleaning optimisation. The researchers had to face challenges in data selection and data processing that may generate limitations. This study also provided a brief understanding of the current cleaning system of PV solar panels and a future perspective of the cleaning system.

Understanding the underlying inner workings of an AI-based forecasting model can provide insight into the field of application. This knowledge enables us to improve solar PV forecasting models and highlight relevant parameters. XAI is an emerging field of research in the smart grid domain and it helps to understand why the AI system made a forecasting decision. The work of Kuzlu et al. [74] presented several use cases of solar PV forecasting using XAI tools, such as Local Interpretable Model-Agnostic Explanations (LIME), Shapely Additive exPlanations (SHAP) and ELI5, that can contribute to the adoption of XAI tools for smart grid applications. An understanding of automation detection is provided by Sarp et al. [75] using explainable XAI in calculating the input and output of solar PV power. The data is tested and trained with an explanatory approach, that uses the calculation of the RMSE metric as an evaluation method. The main conclusion is that the XAI model helps understand the current advantages of PV with its use. The research paper developed by Esen et al. [76] analysed the influence of timestamps, forecast horizons, input correlation analysis, data pre- and post-processing, meteorological classification, grid optimisation, uncertainty quantification and performance evaluations in PV production analysis. The study focused on the introduction of solar and radiation simulators for PV research. The prototype solar LED system used in this research was examined using numerical formulas, in addition, emphasis is placed on PV tests based on a statistical approach. The tests were carried out with results below 2%, being considered as Class A. This research helps to understand that the use of solar simulators requiring few LEDs may occur in the future.

The efficiency of solar cells decreases with the increased temperature. The study objective was developed by Benghanem and Almohammedi in one of the hottest places in the world to reduce the temperature of the solar cells and improve their efficiency [77]. In this research, the Thermoelectric Module (TEM) was used for the analysis of performance. The results of the study showed that the efficiency of the solar panels dropped by 0,5% with a per degree rise in Celsius. The best performance of PV/TEM is only applicable in hot areas. The study helped in the suggestion of the hybrid PV/TEM for future use. Organic PV (OPV) cells are based on the concept of polymer-fullerene bulk hetero junction and are interesting due to their low cost and flexibility [78]. The objective of the study was to optimise the tandem structure composed of OPV cells based on the blend materials of P3HT through a transfer matrix simulation approach, various software and simulations. Optical Thickness data of OPV cells have been collected for the software simulation and the two wavelengths showed a good distribution of 500 nm and 727 nm. It is concluded that the model helps in the application of the OPV cells. To make a simulation on strained GaAsxP1-x and tensile train GaNyAsxP1-x-y in the quantum zones that are well active to get it inserted in the solar cells. Chenini et al. [29] used the band anticrossing model to explain the dependencies of lower E- and lower E + sub-band energies of the nitrogen concentration. The donor-acceptor approach has been used in this study for preparing low bandgap polymers concluding that the introduction of arsenic into the host material leads to the reduction of the bandgap energy. The limitation of the study was based on semiconductors although this study will help in increasing the absorption and redshift in the corresponding wavelength.

OPV cells are considered third-generation solar cells featuring new materials, such as organic polymer and tandem solar cells. The aim of the study developed by Benghanem and Almohammedi [77] is to analyse various materials, devices and fabrication techniques for OPV cells. Several roll-to-roll techniques have been used for the design of OPV cells and the slot die coating approach was used for coating the materials. The result of the study showed an increment of 8% efficiency for manufactured PV panels with special conditions and techniques. The limitation of the study is the lack of accuracy to demonstrate this efficiency.

2.2 Summary Review-Based Analysis

The summary review-based analysis is shown in Table 1. The comparison of energy production using different PV cells is understood with the help of the review conducted in this paper. DL has been demonstrated to be very useful for recognising the PV power generation pattern, and the Physic Constrained-LSTM model boosts in superior prediction performance of solar PV cells in temperature prediction accuracy. Conventional techniques e.g., PSO, ARMA or ARIMA, are being outperformed but advanced techniques based on AI due to their suitability and high accuracy in different scenarios and case studies presented by several authors. It is important to highlight the applicability of ELM and k-means due to high accuracies around approximately 90%, being the ML Matern 5/2 GPR method one the most optimal methods in performance for solar PV power prediction. XAI has proven to be one of the most relevant techniques and the number of research papers that have applied these techniques has grown in the last years. XAI has been proposed as a relevant solution for PV forecasting due to high performance, scalability or reduction of errors, among others.

Table 1 Summary of various studies based on review-based analysis

2.3 Method Based Analysis

The method-based analysis compares different studies and methods, their performance, advantages and disadvantages, as it is observed in Table 2. The purpose of this section is to demonstrate the main technologies of AI techniques utilised in power systems to improve traditional methods. In most of the research studies, multi-objective optimisation methods, multiple linear regressions and ANN methods have been used obtaining results with high accuracy in different case studies, e.g., rate of output of hydropower. This analysis verifies the computational efficiency of the proposed methods. The results revealed that among several AI techniques, ANN has emerged as one of the most effective methodologies for PV solar forecasting, surpassing the capabilities of GA or fuzzy logic. These research works reveal the operating ways of AI and other ML methods to forecast and improve the output power in the PV operation. Different techniques have shown limitations, e.g., reduction in estimation capabilities, that decrease the reliability of the overall analysis. It has been demonstrated the influence of the sampling rate in the obtention of reliable results required suitable datasets to overpass 80% accuracy. Moreover, it also shows that the PV system can be used at an inclination of 22° and only for plants at higher plants, as is demonstrated in a specific scenario.

Table 2 Summary of various studies based on method analysis

Figure 3 exhibits the comparison of accuracy achieved by the various authors in the studies of Table 2 based on either AI or ML techniques. It is possible to observe that the results are not ordered over time and the variability of the results depends on the authors and the possible case studies, increasing the complexity of the efficiency of the overall analysis. It is confirmed that all the considered studies present higher accuracies than 75%, even reaching 97%.

Fig. 3
figure 3

Accuracy comparison of different studies between 2017–2022

Figure 4 shows the usage of AI, ML and deep learning model methods used by researchers in various years of the studied research works. This visualization provides a deeper insight into the use of technology based on the involvement of these models in power generation industries for forecasting and analysis purposes.

Fig. 4
figure 4

Technology Usage of DL, ML and AI-based methods

Figure 5 compares MPPT, ELM and ANN models according to different indicators, such as rate of output, RMSE, MSE and precision. Overall results show similarities between the techniques although ANN is superior in all the parameters compared to the other architectures, demonstrating the reliability of this ML technique. Different studies demonstrated that periods for assigning weights for ELM are notably lower than standard ANN training methods, causing variations in the achieved results in precision and RMSE.

Fig. 5
figure 5

Performance models for MPPT, ELM and ANN

2.4 Result Based Analysis

Table 3 summarizes the various research based on methods, their performance, the outcome and the significance of the work. Moreover, the PV system is an electric power system that supplies solar power through the grid and there are several methods have been used to conduct a PV system. Those are the MPPT, ANN model, ELM model, and SVM model, among others. Those models optimise the output of shaded PV arrays or un-shaded PV arrays under “static and dynamic weather conditions” [99]. PV system conducts sustainable energy sources in present history. PV penetration information is based on the long-term efficacy of different algorithms and shows better performance in forecasting accuracy. Moreover, these articles help to understand the future challenges of PV penetration.

Table 3 Summary of research based on result analysis

Various models have been selected due to their applicability and novel trends in the current state of the art, e.g., ANN, SVM, ELM, MPPT, OPV, PSO-ELM3, MLP, MLA, and FCL. For this analysis, hybrid techniques and combinations of different methodologies are discarded. These techniques are shown in Fig. 6 according to the usage and implementation in the research papers considered for this study. ELM and MPPT are most widely used by researchers for forecasting the performance and accuracy of the models, due to easy implementation and reduced training periods, being PSO is the least implemented due to reduced accuracy compared to other architectures. The combination of PSO with other advanced techniques, e.g., SVM and wavelet transform, is widely applied in the current state of the art to increase the reliability of the analysis and increase the overall accuracy.

Fig. 6
figure 6

Models used in studies

Figure 7 shows the accuracy of the models measured based on weather conditions. The considered researchers tested their models on static weather data and dynamic weather data, and based on both conditions, the forecasting accuracy was measured in each case. The MPPT, ELM, ANN, and ELM had higher accuracy but there is a slight difference for the different weather conditions. SVM achieved the best overall accuracy for forecasting and results with dynamic weather caused by its high suitability due to the implementation of different kernel functions, and high reliability in dimensional spaces and differentiated classes.

Fig. 7
figure 7

Model performance on weather conditions

The average accuracy of MPPT, ELM, ANN and SVM is illustrated in Fig. 8. SVM and ANN performed with higher accuracy than MPPT and ELM, even reaching 98%. It is possible to confirm the conclusion obtained from Fig. 6, being that ANN and SVM provide better results due to advanced training periods.

Fig. 8
figure 8

Model accuracy comparison

3 Overall Discussion

As it has been seen throughout this review, different AI techniques have been implemented for PV systems. Specifically, this work distinguishes five main fields: price prediction, operation, forecasting, costs and ML. After comparing the different methods used, it can be seen that the most commonly used are ANN, SVM, ELM, MPPT, OPV, PSO, MLP, MLA and FCL with different performances and accuracies [116]. Among all of them, SVM stands out with the best performance and accuracy across the different applications.

Different approaches for price prediction, e.g., loop, non-linear, and robust approaches, have been widely defined by several methods including time series analysis, time decomposition, STL, Holt-Winters, Bates and Granger, Aiolfi and Timmermann. LSTM is a relevant tool with high reliability for large data volumes. Different techniques have been proposed to combine LSTM with ML, DL or statistical tools, to enhance the performance of forecasting compared to basic LSTM. Another relevant methodology is FNN, due to high accuracy in the results although the main limitation is based on training requirements. This study also demonstrates that deep learning is very useful for recognising the pattern of generating PV energy. This review highlights the need for the use of AI techniques in the field of PV systems, as they improve the accuracy of previous methods by allowing the analysis of significantly larger amounts of data. In addition, ML is a breakthrough in analytical techniques as it can be applied to a range of cases in a generalised way. Operators can potentially overcome time series forecasting challenges through adept utilization of LSTM, offering an effective solution for PV generation amid current energy challenges. Different authors have combined solar irradiation predictors combining ML, statistical methods, single models, and AI approaches. In terms of analysis of costs, MPPT methods, e.g., incremental conductance method and disturbance observation method, were used to control the voltage level and restrain the current of the photovoltaic cells at the minimum value, and the actual conversion power was limited affecting the output power of the PV cells. It is concluded that solar energy is cost-effective for commercial purposes due to the reduction of the overall costs of electricity generation and consumption.

AI and ML are being widely used in electricity forecasting and PV generation domains. The primary objective of analyzed studies is to identify PV forecasting using MLP-based methods containing ANNs and statistical approaches. Global economic development continuously enhances the market demand for electricity resulting in generating a grievous environmental impact. Proposing a Ps-LSTM framework to control the fault of ML algorithms, which are applied only to a huge volume of information solar energy production forecasting by using a deep learning approach. The survey literature has a strong concern over the finding of a data-driven approach in the technological and digitalised era. Under the context of the method, different methods of artificial and ML have been used [64]. Classification, optimizations, regressions and data structure exploration have been used as a method of artificial intelligence to reveal the impact of AI on the development of power electronic systems. XAI has been integrated into various studies over recent years. The analysis proposed in this study has presented different trends in the current state of the art, showing the main implementation conditions, challenges and constraints of different XAI techniques employed. It is demonstrated that XAI proposes potential improvements for the practical use of ML techniques, but obstacles such as standardization, security concerns, and unreliable confidence levels need careful consideration. Additionally, the analyzed papers explore potential applications and future research directions concerning XAI and energy, including smart grid applications, optimal energy management, energy consumer applications, and power system monitoring. The results indicate that SHAP and LIME represent the most widely utilized XAI techniques. Moreover, traditional ML algorithms are prevalent in XAI applications, whereas DL models are seldom integrated. Another relevant technique is the Physic Constrained-LSTM model, which helps in the superior performance of the prediction of the solar PV cells in the accuracy of forecasting the temperature. It is demonstrated that IME, XAI, and SHAP tools are widely used to acquire insight into solar PV power generation forecasting utilising explainable AI tools.

4 Conclusions and Future Research

Solar photovoltaic emerges as an alternative energy capable of meeting a greater percentage of global energy needs due to novel technical advances, reduced costs and high accuracy. The photovoltaic system is an electric power system that supplies solar power through the grid, being requires novel techniques for data analytics, forecasting and control. This paper presented a systematic review of several artificial intelligence and machine learning algorithms to present the main challenges and limitations of the current state of the art. Several researchers are still working in this domain to improve the accuracy and precision of the forecasting models to enhance the competitiveness of photovoltaic solar energy. This study presents a review of recent advancements in the technologies, techniques and methods widely implemented following three different methodologies: review-based, method-based and result-based.

The comparison of the power production using the different PV cells is understood with the help of the review of the research papers in the assignment. The Physic Constrained-Long Short-Term Memory model helps in the superior performance of the prediction of the solar PV cells in the accuracy of forecasting the temperature. It is also seen that extreme learning machines and k-means have shown accuracies of around 90%. Artificial intelligence methods are demonstrating their high strength and reliability compared to conventional modelling models, with reduced computational costs and providing reliable solutions. In most of the research studies, multi-objective optimisation methods, multiple linear regressions and artificial neural network methods have been used to observe the rate of output and verify the computational efficiency of the proposed method. There are several methods have been used to conduct a photovoltaic system, e.g., Maximum Power Point Tracking, Artificial Neural Network model, Extreme Learning Machine, and Support Vector Machine, among others models. Different basic approaches, e.g., Particle Swarm Optimization or several optimization techniques, presented low accuracy compared with other Machine Learning techniques, being less implemented in the current state of the art and being necessary the combination with other advanced techniques. It is important to highlight that the Support Vector Machine is one of the most applied techniques providing high reliability and suitability in different real case studies. It is also concluded that Machine Learning-based methods have recently become relevant in the analyzed literature caused of the increased complexity of operational scenarios for solar plants together with high computational power and data availability. In future research, it is proposed the analysis of novel hybrid methods to be compared with basic Machine Learning techniques, together with novel working scenarios with large solar plants.