Introduction

The six Formosa Satellite Mission 3/Constellation Observing System for Meteorology, Ionosphere, and Climate (FORMOSAT-3/COSMIC) microsatellites carry global positioning system (GPS) radio occultation (RO) receivers to measure the phase and amplitude of GPS signals (Anthes et al. 2000, 2008). Using these data, the vertical profiles of bending angle and atmospheric refractivity are obtained (Kuo et al. 2004), and atmospheric soundings can be retrieved (Kursinski et al. 1997). The present orbital configuration is providing global coverage of approximately 1,400–2,300 soundings per day.

The impact of GPS RO observations on numerical weather prediction has been intensively studied over the last decade. For example, Liu et al. (2001) studied the impact of global positioning system/meteorology (GPS/MET) bending-angle profiles; Zou et al. (2004) examined the impact of RO observations from the Challenging Minisatellite Payload (CHAMP) mission; Cucurull et al. (2006, 2008) assessed the impact of CHAMP and COSMIC GPS RO data; and Kuo et al. (2008) evaluated the impact of COSMIC GPS RO soundings. These studies show the positive and potential impact of assimilating GPS RO observations into numerical models. Several operational centers have also demonstrated the positive impact of COSMIC data on numerical weather prediction (NWP) products. For example, the European Center for Medium-Range Weather Forecasts (ECMWF) NWP system obtains a significant reduction of stratospheric temperature forecast errors, especially in the southern hemisphere, by assimilating COSMIC data (Healy and Thepaut 2006). The National Oceanic and Atmospheric Administration/National Weather Service (NOAA/NWS) has seen a similarly significant improvement in the prediction of 500-hPa pressure surface height, called herein geopotential height, with COSMIC data assimilation (Cucurull et al. 2007, 2008).

During a Mei-yu (plum rain) season, which usually occurs from mid-May to mid-June, quasi-stationary fronts frequently form over southeastern China and move to the Taiwan area, resulting in extended periods of precipitation. The Southwest Monsoon Experiment/Terrain-influenced Monsoon Rainfall Experiment (SoWMEX/TiMREX) is an international collaborative field experiment that collected special observations over the western plains and mountain slope region of southern Taiwan during the 2008 Mei-yu season. The program attempts to examine physical processes leading to the development of heavy orographic precipitation (http://sowmex.cwb.gov.tw). A pilot experiment of this project was conducted during the 2007 Mei-yu season. During a heavy rain event that occurred in central and southern Taiwan from 5 to 10 June 2007 (Fig. 1), dropwindsonde observations were undertaken over the Taiwan Strait and the northern South China Sea. Satellite and radar observations show a deep and widespread low-level, moisture-laden southwesterly flow during this time period (see Fig. 1b, c, for example). As documented in previous studies (Lin et al. 2001; Chen et al. 2006; Chien et al. 2006, 2008), this upstream flow is often associated with organized mesoscale convective systems (MCS) forming over the strait, propagating eastward, and producing heavy rain over the sloping mountainous area. In this study, we perform data assimilation experiments to assess the ability of a mesoscale model to predict these precipitating systems.

Fig. 1
figure 1

Observations from gauges, radar, and satellite. a Twelve-hour accumulated rainfall in Taiwan during May and June 2007. The rainfall amount (mm) shown here is an average of all the 390 rain gauge stations over Taiwan. b, c Radar reflectivity and an infrared image, respectively, at 0000 UTC 8 June 2007

In the last decade, dropwindsonde observations have been used intensively for hurricane studies. Two major programs that collected such soundings took place over the Atlantic and the western North Pacific Oceans, the former led by the National Oceanographic and Atmospheric Administration’s (NOAA) Hurricane Research Division (Aberson and Franklin 1999), and the latter by the Dropwindsonde Observations for Typhoon Surveillance near the Taiwan Region team (DOTSTAR; Wu et al. 2005). After assimilating dropwindsonde data, the National Centers for Environmental Prediction (NCEP) reported an improvement of up to 30% for 12–60-h track forecasts in its operational numerical model (Burpee et al. 1996). Aberson and Franklin (1999) showed that dropwindsonde observations brought significant improvement to the Geophysical Fluid Dynamics Laboratory (GFDL) track and intensity forecasts. Wu et al. (2007) evaluated the impact of dropwindsonde data on tropical cyclone track forecasts in five different models, including the NCEP Global Forecast System (GFS), the Navy Operational Global Atmospheric Prediction System (NOGAPS), the Japanese Meteorological Agency (JMA) Global Spectral Model (GSM), the Weather Research and Forecasting (WRF) model, and the GFDL hurricane model. They concluded that all models, but the GFDL model, produce remarkably better track forecasts after the assimilation of dropwindsonde data.

In this paper, we consider dropwindsonde observations during the pilot field observation of the SoWMEX together with the FORMOSAT-3/COSMIC GPS RO data to examine their impact on regional weather predictions during a 11-day period from 5 to 15 June 2007. The experiment designs are introduced in Sect. “Experiment designs”. The verification results for the different experiments are analyzed in Sects. “Verification of HTRUV” and “Verification of rain”. A summary is given in Sect. “Summary and conclusions”.

Experiment designs

The WRF model and its 3D-VAR component, WRF-Var, are used for the regional forecasting of heavy rainfall events in Taiwan during the 2007 Mei-yu season. The model includes two domains with 45- and 15-km horizontal spacing (see Fig. 2a) and 31 levels in the vertical. We use the WRF Single-Moment 5-class microphysics scheme, the Kain-Fritsch cumulus parameterization scheme, and the Yonsei University (YSU) PBL scheme (Skamarock et al. 2005) in model simulations. Four experiments are conducted for this study (Table 1): the control experiment (CON) uses the conventional observation data in the WRF-Var data assimilation; the GRO and DWS experiments are the same as the CON experiment, except that the data from FORMOSAT-3/COSMIC GPS RO and dropwindsonde observations, respectively, are added to the data assimilation; and the ALL experiment includes all the aforementioned data in the assimilation. Table 2 shows the amount of available data that are located inside domain 1 at a 12-h interval during this 11-day period. A 6-h time window (between −3 and +3 h) was used for the collection of the GPS RO data. Figure 2 presents an example of the locations of observations from each observing platform at 0000 UTC 6 June 2007. While the GPS RO data were distributed over a widespread area, the dropwindsonde data were mostly collected over the ocean to the southwest of Taiwan. The GPS RO data, obtained from the Taiwan Analysis Center for COSMIC (TACC) in Taiwan (http://tacc.cwb.gov.tw), contain vertical profiles of refractivity at 100-m vertical resolution. During the assimilation process, the model refractivity was calculated and then converted to the observation space with horizontal and vertical interpolation. The innovations of refractivity were computed in the observation space. With these innovations, the increments of the model analysis variables were obtained through a three-dimensional variational data assimilation approach.

Fig. 2
figure 2

The locations of observation data within domain 1 from a traditional soundings plus dropwindsonde and b synoptic surface stations plus GPS RO at 0000 UTC 6 June 2007. There were 155 soundings (circles), 13 dropwindsondes (black dots), 877 surface data points (crosses), and 10 GPS RO observations (black dots). The inset in a denotes the location of domain 2

Table 1 The experiment designs
Table 2 The data sets used in the WRF-Var data assimilation for the 22 runs during the 11-day period from 0000 UTC 5 June to 1200 UTC 15 June 2007

Each of the four experiments contains 22 runs of 72-h WRF simulations. These runs were initialized twice daily from 0000 UTC 5 June to 1200 UTC 15 June 2007. The initial data (first guess) for the first run at 0000 UTC 5 June 2007 were obtained from the NCEP GFS, while those for the other 21 runs were from the 12-h forecast of their previous WRF runs (i.e., 12-h update cycling). Because the GPS data were provided continuously in time, the 12-h cycling may not be good enough to take advantage of all available observations. Thus, another GRO experiment, but with 6-h cycling, was performed for comparison. It is found that the forecasts of the 6-h cycling GRO overall are slightly worse than those of the 12-h cycling GRO at the earlier forecast time, but they become slightly better at the longer integration time (not shown).

Verification of HTRUV

The verification was performed by comparing the geopotential height (H), temperature (T), relative humidity (RH), and east–west and north–south wind components (U and V) with those of the NCEP GFS analyses (with mesoscale data assimilation of traditional surface and upper-air observations). These five meteorological variables together are hereafter referred to as HTRUV. The root-mean square error (RMSE), mean error (ME), and skill score (SS) are calculated by using data from all the grid points in domain 2 for all the 22 runs during the 11-day period. These are defined as follows:

$$ {\text{RMSE}} = \sqrt {{\frac{{\sum\nolimits_{n = 1}^{N} {\left( {F_{n} - O_{n} } \right)^{2} } }}{N - 1}}} $$
(1)
$$ {\text{ME}} = {\frac{{\sum\nolimits_{n = 1}^{N} {\left( {F_{n} - O_{n} } \right)} }}{N}} $$
(2)
$$ {\text{SS}} = - {\frac{{{\text{RMSE}}_{\text{GPS}} - {\text{RMSE}}_{\text{CON}} }}{{{\text{RMSE}}_{\text{CON}} }}} \times 100\% $$
(3)

where F and O are the forecast and observation fields, respectively. N is the number of the sample size. Skill score is defined as the percentage of improvement for a particular experiment (e.g., GRO or DWS) over the reference forecasts (CON). The calculation is done on five pressure levels (1,000, 850, 700, 500, and 300 hPa), but for brevity, only results at 850, 500, and 300 hPa are shown in the figures.

RMSE

Figure 3 shows the RMSE of HTRUV by comparing the simulation results from each of the four experiments against the reference analyses. The average was calculated by taking account of simulation data over all the grid points in domain 2 from all the 22 runs. The RMSEs of geopotential height (H; Fig. 3a) increase with time, from about 5 to 15 m at 850 hPa, and up to 20 m at 300 hPa. Comparisons show that GRO performs better (with smaller errors) than CON at 850 hPa after 36 h, and that the RMSE differences between them increase over time. A similar trend is observed at higher levels (500 and 300 hPa). The RMSEs of DWS at 850 hPa are smaller than those of CON at 12 and 24 h, but they become larger after 24 h. At 500 and 300 hPa, most of the RMSEs of DWS are slightly lower than those of CON. Compared with GRO, DWS tends to have smaller RMSEs at the earlier time, but they become larger at the later time. With both the GPS RO and dropwindsonde data assimilated, ALL produces the smallest RMSEs, especially at the longer integration time.

Fig. 3
figure 3

Root-mean square error (RMSE) for domain 2. Row a is H (m), row b is T (°C), row c is RH (%), row d is U (m/s), and row e is V (m/s). Columns from left to right represent verification results at 850, 500, and 300 hPa. The abscissa denotes the time elapsed in the simulation (h). Color bars in red, green, blue, and magenta represent RMSEs for the CON, GRO, DWS, and ALL experiments, respectively

The RMSEs of temperature (T; Fig. 3b) are about 0.7–1.1°C at 850 hPa and become smaller at 500 and 300 hPa. At 850 and 300 hPa, GRO has smaller RMSEs than CON only at the longer integration time (after ~48 h). However, GRO produces smaller RMSEs at 500 hPa than CON at all times. DWS also has lower-temperature RMSEs than CON at 500 hPa, but produces slightly larger RMSEs most of the time at the other two levels. ALL shows the lowest-temperature RMSEs among all experiments after 48 h.

For relative humidity (RH; Fig. 3c), the RMSEs are about 15% at 850 hPa, and increase to about 30% at 300 hPa for all experiments. The RMSEs of GRO are all larger than those of CON at 850 hPa. At 500 hPa, however, GRO has smaller RMSEs than CON at all times. At 300 hPa, GRO has slightly larger RMSEs than CON at all times, except at 72 h. As for DWS, the RMSEs are larger than those of CON at 850 hPa, but they become smaller at 500 and 300 hPa. With both the GPS RO and dropwindsonde data assimilated, ALL has even smaller RMSEs than either GRO or DWS at 500 hPa for many time periods, but it still does not show any obvious improvement at 850 and 300 hPa.

The RMSEs of the east–west wind component (U; Fig. 3d) increase over time from about 3 to 6 m/sec at all levels. Error comparisons show that GRO does not perform well at the earlier time, but it improves after 24 h with smaller RMSEs. The RMSEs of DWS at the low level (850 hPa) are mostly larger than those of CON, but they are in general smaller at the mid and high levels (500 and 300 hPa). For several time periods, they are even the smallest among the four experiments. ALL performs very well in forecasting U at many time periods. The RMSE distribution of the north–south wind component (V; Fig. 3e) among different experiments is quite similar to that of U, except that the V RMSEs are slightly larger.

From the above, it is clear that GPS RO data assimilation provides a positive impact on WRF simulations for many, though not all, meteorological fields; but the impact occurs mainly at the longer integration time. The reason is that in GRO there are only about 10–20 GPS RO observations inside domain 1 for each assimilation period, and it usually takes time for the influence to propagate into domain 2, where the verification is done. The dropwindsonde data also have a positive impact on the model simulations but primarily at the earlier forecast time. Because the observations were all taken in domain 2, their impact can be seen early in the forecast. When both the GPS RO and dropwindsonde data are assimilated in the model, the forecast error is further reduced.

Mean error

The mean errors (MEs) of CON show that the model underpredicts geopotential height, a problem which worsens as the forecast time increases (Fig. 4a). However, with the assimilation of the GPS data, the underestimation problem is reduced, except at the earlier time. The dropwindsonde data help to reduce the errors as well, although the differences are minimal. With both the GPS and dropwindsonde data assimilated, ALL overall predicts even larger geopotential height at 850 and 500 hPa, but it produces about the same H at 300 hPa as GRO.

Fig. 4
figure 4

Mean error (ME). See Fig. 3 for additional explanations

The temperature MEs (Fig. 4b) show a reverse pattern from the low to the high levels. They are positive at 850 hPa near zero at 500 hPa and negative at 300 hPa. This suggests that the model has a warm bias at the low level and a cold bias at the high level. As for cross-experiment comparisons, there is not much difference among the four experiments at 850 and 500 hPa. At 300 hPa, ALL appears to have the largest cold bias.

The RH MEs (Fig. 4c) reveal that the model underpredicts moisture at 850 and 300 hPa. The dry bias is worse at the high level (about −20%) than at the low level (about −10%). At 500 hPa, the RH bias is very small. Compared with the relatively large RMSEs (15–30%) shown in Fig. 3c, these small MEs suggest that the forecasts of RH at 500 hPa are quite diverse. As for the relative performance among different experiments, the distribution is similar to that of RMSE.

The MEs of U (Fig. 4d) are mostly positive, but they are in general small at 850 hPa. However, their RMSEs are not small (3–6 m/s), which implies that the distribution of the errors is quite wide-ranging. At 500 and 300 hPa, the U MEs show a positive bias. Another interesting feature is that the MEs of the four experiments exhibit a different pattern from that of the RMSEs. For example, GRO has the highest positive MEs, but its RMSEs are the lowest for most of the time periods. In other words, although the assimilation of the GPS data can reduce the RMSEs, it results in an increase in positive bias. This suggests that the occurrences of underprediction of U are reduced, and that those of overprediction do not change much through the use of the GPS data. The V-component winds (Fig. 4e) show a similar pattern among different experiments to that of U, but the MEs are mostly negative at 500 and 300 hPa.

Skill score

Figure 5 shows the RMSE-based skill score (SS) of GRO against CON. Basically, the SSs more clearly demonstrate the improvement or worsening of the particular experiment over CON. At 300 hPa (Fig. 5a), GRO shows positive SSs for geopotential height at all time periods, and the SS is the largest (~9%) at 24 h. The other variables do not see significant improvement as for geopotential height, but they still exhibit a trend of increasing SS at the longer integration time. At 72 h, the SSs are positive for all variables.

Fig. 5
figure 5

Skill score (%) for GRO against CON. The pressure levels are at a 300, b 500, and c 850 hPa. The abscissa denotes time elapsed in the simulation (h). Color curves in green, blue, red, magenta, and cyan represent results for H, T, RH, U, and V, respectively

At 500 and 850 hPa (Fig. 5b, c) GRO still shows, for almost all the variables, a tendency to improve forecast accuracy at the longer time. Geopotential height consistently has the largest SSs after 48 h of integration at these two levels. Combining the result at 300 hPa, it is clear that the GPS data provide useful information for height fields such that the forecast of geopotential height can be greatly improved, especially after longer integration. The 500-hPa temperature forecasts of GRO also improve over CON, but the SSs are relatively small (<4%). At 850 hPa, their SSs are close to zero. Relative humidity has positive SSs at 500 hPa, but negative SSs at 850 hPa. There are several possible reasons for this. First, the moisture analysis contains considerable uncertainties, and does not serve as a good source of verification for the prediction. Second, there is a negative refractivity bias in the GPS RO soundings (Rocken et al. 1997), which affects the accuracy of moisture analysis. Third, over regions with significant moisture variability, such as near a Mei-yu front, a more sophisticated nonlocal observation operator should be used (e.g., Sokolovskiy et al. 2006) for the assimilation of GPS RO data.

Compared with GRO, DWS has lower SSs (<5%) and it does not show a trend of improvement over time (Fig. 6). Instead, the SSs are in general higher at the earlier time and they decrease as the integration time increases. The assimilation of dropwindsonde data helps to improve the simulation for most of the variables at 300 and 500 hPa, but it barely shows forecast improvement at 850 hPa.

Fig. 6
figure 6

Skill score (%) for DWS against CON. For additional explanations see Fig. 5

With both the GPS RO and the dropwindsonde data assimilated in the model, ALL (Fig. 7) shows even higher SSs than either GRO or DWS. The most notable is the forecast of geopotential height. Its SSs are relatively high; for example, they are up to 15% at 850 hPa in the 60- and 72-h forecasts. The tendency of SSs for each variable is very similar to that of GRO, except with some minor differences. This is because the dropwindsonde data are available only in six out of the 22 runs. Their overall impact is relatively small compared with that of the GPS data.

Fig. 7
figure 7

Skill score (%) for ALL against CON. For additional explanations see Fig. 5

Statistical analysis

In order to examine statistically the impact of the GPS RO and dropwindsonde data, we have performed a statistical analysis by counting the number of times positive/negative RMSE differences (or SSs) occurred between CON and a particular experiment. The differences were computed for all HTRUV fields at five levels (1,000, 850, 700, 500, and 300 hPa), and at six forecast times (12, 24, 36, 48, 60, and 72 h). A positive (negative) value indicates that a particular experiment performs better (worse) than CON. In addition, we used the resampling technique of Wilks (1995) to test whether the difference exceeded a significance level of 75%. Table 3 shows that the GPS RO data produce the biggest impact on the forecast of geopotential height among the five variables. Although temperature, relative humidity, and U-component wind exhibit many positive differences, they are typically not as significant as geopotential height. The dropwindsonde data (DWS) bring a nearly equal effect on the forecast of each field. With both datasets assimilated (ALL), the number of occurrences of positive impact and that of high significance both increase.

Table 3 RMSE differences of the HTRUV fields between CON and each of the three experiments (GRO, DWS, and ALL)

We further stratified the numbers according to the vertical levels and the forecast time periods. From Table 4, it is evident that the GPS RO data help the forecast the most at 500 hPa, and the least at 850 hPa. The dropwindsonde data also produce a very significant impact at high levels such as 500 and 300 hPa, but worsen the forecast at 850 hPa. As stratified by the six forecast times, the numbers in Table 5 clearly indicate that the positive impact of the GPS RO data increases over time, while that of the dropwindsonde data decreases as the forecast time increases.

Table 4 RMSE differences for the five vertical levels between CON and each of the three experiments (GRO, DWS, and ALL)
Table 5 RMSE differences for the six forecast times between CON and each of the three experiments (GRO, DWS, and ALL)

Verification of rain

In addition to the aforementioned meteorological fields, we also performed verification of 12-h precipitation forecasts against the observations from 390 rain gauges in Taiwan. The equitable threat score (ETS) and bias are computed for verification. They are defined as follows:

$$ {\text{ETS}} = {\frac{H - R}{F + O - H - R}} $$
(4)
$$ {\text{Bias}} = {\frac{F}{O}} $$
(5)

where H is the number of hits, and F and O are the numbers of samples in which the precipitation amounts are greater than the specified rain threshold in forecast and observation, respectively. The random forecast is R = FO/N, where N is the number of points being verified. Eight rain thresholds are used in this study, including 0.3, 2.5, 5, 10, 15 25, 35, and 50 mm (see Chien and Jou 2004 for more details).

Figure 8 presents the ETS and bias of the 12-h rainfall forecasts of the four experiments. For brevity, only the results between 12 and 60 h in the simulation are shown. The ETS and bias of the rainfall forecast at 0–12 and 60–72 h are not discussed here because the former is still in model spin-up stage, and the latter is out of the time window of model peak performance for rainfall prediction from a mesoscale model. At 12–24 h (Fig. 8a), the ETSs of CON (#1 curve) are slightly below 0.3 for rain thresholds smaller than 35 mm, and drop to below 0.2 at the 50-mm threshold. The corresponding biases are very good, around 1 for all thresholds. Compared with CON, there is nearly no improvement in terms of ETS for either GRO (#2 curve) or DWS (#3 curve). DWS even produces much lower ETSs at thresholds between 10 and 35 mm. ALL (#4 curve) is the worst among all the experiments at small thresholds, but it becomes better at large thresholds.

Fig. 8
figure 8

The ETS (left) and bias (right) of 12-h precipitation forecasts from CON (black, #1), GRO (red, #2), DWS (blue, #3), and ALL (green, #4). The abscissa is the rainfall thresholds (mm). Time periods of the 12-h rainfall verified include a 12–24 h, b 24–36 h, c 36–48 h, and d 48–60 h

At 24–36 h (Fig. 8b), the ETSs show that both GRO and DWS predict better rainfall than CON at small thresholds (≤5 mm), but do worse than CON at large thresholds (≥35 mm). In a similar manner, ALL has the highest ETSs at those small thresholds and the lowest ETSs at those large thresholds. Biases of all the experiments are still very good for this forecast time period.

The ETSs of the rainfall forecast at 36–48 h (Fig. 8c) indicate that GRO produces the highest ETS at almost all the rainfall thresholds. This is because the model starts to overpredict rainfall (with large biases) during this time period, while GRO appears to have a less severe overprediction problem. On the contrary, DWS has the highest biases, and its ETSs are the worst among the four experiments at thresholds larger than 10 mm. The ETSs of ALL are nearly the same as CON, except at small thresholds, where the ETSs of ALL are slightly higher.

At 48–60 h (Fig. 8d), GRO still has the highest ETSs, except at large thresholds where the ETSs are low and the biases are high. The biases of DWS are not too bad (close to 1) and its ETSs become better, but ALL overall does not perform well during this time period.

Summary and conclusions

This paper presents an impact study of the GPS radio occultation and dropwindsonde data on WRF forecasts during a 11-day period of the 2007 Mei-yu season. Many mesoscale convective systems developed along the Mei-yu front and produced heavy rainfall over Taiwan during this time period. Data assimilation with 12-h update cycling was performed using the WRF-Var system. The verification was performed for five basic meteorological fields (geopotential height, temperature, relative humidity, and U- and V-component winds) and rain on the 15-km grid.

The verification results show that the forecast of geopotential height has an underprediction problem, with RMSEs increasing over time from about 5 to 20 m. Temperature predictions show a warm bias at the low level and a cold bias at the high level. The error range is about 1°C at 850 hPa and becomes smaller at higher levels. The model overall tends to underpredict relative humidity, with a larger dry bias (~20%) at 300 hPa, a smaller bias (~10%) at 850 hPa, and nearly no bias at 500 hPa. As for winds, the U-component wind has a positive bias and the V-component wind in general has a negative bias. Their RMSEs range from 3 to 6 m/s.

The assimilation of the GPS RO data can help to improve the WRF forecast at the longer forecast time (e.g., >36 h). The forecast of geopotential height, among the five meteorological variables, reflects most notably the impact of the GPS data. All the variables at all levels more or less show a positive impact, save for relative humidity, which has negative SSs at 850 hPa. The impact of assimilating the dropwindsonde data is, in general, positive at high levels like 300 and 500 hPa, and it decreases over time. Because there are usually very few GPS RO observations located in the fine domain, it takes time for the influence to propagate from the coarse domain into the region where the verification is done. In other words, the large-scale simulation is first improved due to the GPS RO observations, and the change can then have a positive impact on the mesoscale at a later time. The dropwindsonde observations, however, are taken inside the fine domain such that their impact can be detected early in the simulation. This reflects the fact that GPS RO is primarily a large-scale observation (both in terms of its measurement characteristics and data distribution), while the dropsonde is primarily a mesoscale observation. With both the GPS RO and dropwindsonde data assimilated, the model improves even further. It is also found that a 6-h update cycling strategy can further improve the forecast by assimilating more GPS RO data into the model, but the difference is not very significant.

The GPS and dropwindsonde data start to improve the rainfall forecast only after 24 h. At 36–60 h, the GPS RO data significantly improve the rainfall prediction because the overprediction problem of the model is reduced. Compared with the verification of the five meteorological fields, this suggests that when the model better simulates basic fields with the GPS RO data the rainfall is better predicted because the dynamics and physics processes are better reproduced. An exception to this, however, is the simulation of low-level (850 hPa) moisture, because with the GPS RO data assimilation, the moisture simulation is not improved. Since low-level moisture is very important in determining the accuracy of precipitation, these inconsistent results lead us toward a possible explanation that the moisture analysis we have used does not serve as a good source of verification for the prediction. This is often the case, as moisture analysis usually contains considerable uncertainties at low levels. Another possibility is that there is a negative refractivity bias in the GPS RO soundings, which affects the accuracy of moisture analysis. In addition, over regions with significant moisture variability, such as near a Mei-yu front, a more sophisticated nonlocal observation operator or direct assimilation of bending angle could be considered. The dropwindsonde data also provide a positive impact on rainfall forecasts, but it is not as significant as that of the GPS RO data. It should be noted, however, that the above results are based on the current data availability of dropwindsonde observations in the field program. If dropwindsonde data were available at more frequent model initial times, their impact would probably be greater. However, since such observations are usually distributed in a limited area, the impact should be minimal compared to that of the GPS RO data when a large-scale weather system is dominant.