Citizen scientists’ engagement in flood risk-related data collection: a case study in Bui River Basin, Vietnam

Tran, Huan N.; Rutten, Martine; Prajapati, Rajaram; Tran, Ha T.; Duwal, Sudeep; Nguyen, Dung T.; Davids, Jeffrey C.; Miegel, Konrad

doi:10.1007/s10661-024-12419-2

Citizen scientists’ engagement in flood risk-related data collection: a case study in Bui River Basin, Vietnam

Research
Open access
Published: 17 February 2024

Volume 196, article number 280, (2024)
Cite this article

Download PDF

You have full access to this open access article

Environmental Monitoring and Assessment Aims and scope Submit manuscript

Citizen scientists’ engagement in flood risk-related data collection: a case study in Bui River Basin, Vietnam

Download PDF

977 Accesses
2 Altmetric
Explore all metrics

Abstract

Time constraints, financial limitations, and inadequate tools restrict the flood data collection in undeveloped countries, especially in the Asian and African regions. Engaging citizens in data collection and contribution has the potential to overcome these challenges. This research demonstrates the applicability of citizen science for gathering flood risk-related data on residential flooding, land use information, and flood damage to paddy fields for the Bui River Basin in Vietnam. Locals living in or around flood-affected areas participated in data collection campaigns as citizen scientists using self-investigation or investigation with a data collection app, a web form, and paper forms. We developed a community-based rainfall monitoring network in the study area using low-cost rain gauges to draw locals’ attention to the citizen science program. Fifty-nine participants contributed 594 completed questionnaires and measurements for four investigated subjects in the first year of implementation. Five citizen scientists were active participants and contributed more than 50 completed questionnaires or measurements, while nearly 50% of citizen scientists participated only one time. We compared the flood risk-related data obtained from citizen scientists with other independent data sources and found that the agreement between the two datasets on flooding points, land use classification, and the flood damage rate to paddy fields was acceptable (overall agreement above 73%). Rainfall monitoring activities encouraged the participants to proactively update data on flood events and land use situations during the data collection campaign. The study’s outcomes demonstrate that citizen science can help to fill the gap in flood data in data-scarce areas.

Disaster preparedness of local governments in Panay Island, Philippines

Article 26 October 2020

Early Warning Systems and Their Role in Disaster Risk Reduction

Road construction and its socio-economic and health impact: a case study of Atonsu lake road

Article Open access 06 July 2023

Introduction

Globally, flooding impacts 100 million people each year and causes great economic losses (Jongman et al., 2015). To mitigate flood-related impacts, flood risk assessment is a pivotal task because it quantifies potential hazards and vulnerabilities associated with flood events to determine appropriate measures (de Moel et al., 2015). This task requires enormous amounts of data on flood hazards, land use information, and flood vulnerability (Apel et al., 2009). Unfortunately, flood data collection is hindered in undeveloped countries, particularly in Asian and African regions, due to time constraints, financial limitations, and inadequate tools (Glas et al., 2020; Huizinga et al., 2017; Sy et al., 2019). Typically, flood risk-related data are obtained from ground observations, hydrological and hydraulic modeling, remote sensing, and field surveys (Sy et al., 2019).

Traditional approaches, such as modeling and remote sensing in flood monitoring and flood risk assessment, have significantly advanced our understanding of these complex phenomena (Trinh & Molkenthin, 2021). These methods commonly document flooding and estimate flood impact for large areas under different scenarios or periods (Ferri et al., 2020). Modeling, which demands detailed data, provides accurate results (Apel et al., 2009). Advancements in information and communication technology (ICT), like cloud computing platforms, enable the rapid execution and analysis of analysis-ready satellite images. This facilitates the real-time or near-real-time display of flooding maps, providing valuable support to flood control operators for enhanced and efficient management (DeVriesa et al., 2020; Liu et al., 2018). However, modeling may be subject to uncertainties due to assumptions and limitations in input data and a lack of hydrological and hydraulic process understanding (Merz et al., 2010a, 2010b). Additionally, remote sensing can encounter challenges related to cloud cover, spatial resolution, and insufficient validation datasets (Schnebele & Cervone, 2013).

To address the mentioned limitations, various initiatives have been undertaken to incorporate citizen science in gathering data from past flood events (Sy et al., 2020), monitoring current flood situations (Ferri et al., 2020), and enhancing flood modeling (Azizi et al., 2023). Citizen science involves public participation in scientific research (Buytaert et al., 2014; Shirk et al., 2012) and is further facilitated by ICT that simplifies the collection of massive amounts of information and data (Buytaert et al., 2014). Data collected from communities through citizen science can be cost-effective (Buytaert et al., 2014), more spatially distributed, and relatively accurate (de Bruijn et al., 2019; Zeng et al., 2020). Therefore, citizen science is a promising approach for providing supplementary data to assess and manage flood risk (Scaini et al., 2021), allowing for on-the-ground observations and local insights to enhance the accuracy and completeness of flood-related data. In addition, citizen science projects may raise locals’ awareness of flood disaster prevention and build community resilience, thereby serving as a nonstructural measure in flood risk management (Ferri et al., 2020; Pandeya et al., 2021).

Public involvement in the collection of flood risk-related data on flood hazard, land use, and flood vulnerability has been discussed for the last two decades (Peters-Guarin, 2008; See, 2019; Sy, 2019). Citizen science has been widely applied in flood hazard assessment to determine the flooding extent (Sy et al., 2020), depth (Fohringer et al., 2015), flow velocity (Le Coz et al., 2016), and duration (Sy, 2019). Moreover, citizen scientists have contributed land use information through field data collection campaigns (Assumpcao et al., 2019) and online crowdsourcing platforms (Sparks et al., 2015). Finally, with regard to flood vulnerability, citizens have shared information about flood damage and their perspective on disaster management through field surveys conducted by researchers (Perera et al., 2015; Peters-Guarin, 2008). However, previous citizen science projects have rarely examined the power of locals’ data collection and contribution for these three mentioned data types in one citizen science program. In addition, some researchers have used citizen scientists to collect data without data validation or comparison with other sources (Assumpcao et al., 2019; Le Coz et al., 2016), which is necessary to understand the variations and limitations of this approach. Furthermore, many citizen science projects have collected data only once (Assumpcao et al., 2019; Perera et al., 2015) and have not utilized citizen scientists to monitor different floods, detect land use change (Tsiakos et al., 2019), and update flood damage (Merz et al., 2010a, 2010b).

The support of low-cost monitoring equipment and ICT paves the way for citizen science-based hydrological monitoring networks (Buytaert et al., 2014; Davids et al., 2019). For example, low-cost rain gauges or water level sensors installed in residential areas or public areas enable citizens to proactively and regularly monitor rainfalls (Davids et al., 2019; Fehri et al., 2020) and water levels (Pandeya et al., 2021; Weeser et al., 2018). These data can be transmitted wirelessly to a server or web-based platform (Pandeya et al., 2021) to provide up-to-date information for authorities and citizens. Data collection apps can gather the date and geolocation of measurements or surveys and take photos to enhance users’ understanding of the investigated objects (Davids et al., 2019). Furthermore, communication technologies can help scientists communicate with participants easily through social networks (Sy, 2019) to motivate them and retain their participation.

Vietnam is one of the Asian-Pacific countries that is most affected by natural disasters, particularly flooding (World Bank Group and Asian Development Bank, 2020). Floods are responsible for 97% of the total disaster loss (World Bank Group and Asian Development Bank, 2020). According to the World Resources Institute’s AQUEDUCT Global Flood Analyzer, as of 2010, river floods with a 10-year return period affected 2.4 million people and caused the gross domestic product damage of 6.0 billion USD (World Resources Institute, 2018). In addition, agricultural activities are mainly located in low-lying deltas and coastal areas, which attract more than 40% of the nation’s workforce (Ministry of Natural Resources and Environment, 2020). They are significantly affected by flooding and climate change (World Bank Group and Asian Development Bank, 2020). Although floods cause great losses, flood risk-related data to estimate potential flood damage remain inadequate in Vietnam (Chinh et al., 2016). Furthermore, the lack of locals’ involvement in flood risk planning and management (Dang et al., 2011; Pham, 2011) coupled with challenges in local-to-central collaboration (Garschagen, 2016) have hindered the effectiveness of flood mitigation measures. Therefore, it is necessary to find an integrated approach to gather missing flood data and enhance communication between locals and authorities to manage floods.

Our research aims to utilize the citizen science approach to collect flood risk-related data for the Bui River Basin in Vietnam, where citizen science-based studies are very limited. We recruited and trained participants living in or around flood-affected areas to self-investigate or investigate flooding in residential areas, land use information, and flood damage to paddy fields for 1 year. We compared the data obtained by citizens with those collected by the research team or the local authority to evaluate the quality of the citizen science data. In addition, we utilized a community-based rainfall monitoring network to engage participants in updating flood data during a data collection campaign.

Study area

The Bui River Basin is located in northern Vietnam and is drained by two main rivers, the Tich and Bui Rivers, which flow through Hoa Binh and Hanoi provinces (

Figure 1A, B). The study area, which spans 266.5 km², is bounded by the upstream Bui River Basin originating from Lam Son hydrologic station area, Luong Son District, Hoa Binh Province, and the Xuan Mai urban area (

Figure 1C). The study area is characterized by semi-mountainous and semi-plain landforms (Doan & Bui, 2016) with elevations ranging from 0 m in the eastern region to 800 m in the northern and southern regions. The annual rainfall is approximately 1700 mm (Kieu et al., 2019), of which nearly 80% occurs in the rainy season from May to October.

The Xuan Mai urban area is known for the “Flooded Villages” of Hanoi located near the Tich-Bui Rivers conjunction area, 30 km from downtown Hanoi. According to Tran et al. (2022), the area has experienced more frequent and intense flooding over the last 15 years. The Tri Thuy station has recorded numerous high floods, including notable events in 2008, 2017, and 2018 (Tran et al., 2022). The 2018 historic flood, which had a 50-year return period (Tran et al., 2022), had an extreme effect on people, property, and agricultural production which is one of the main economic activities in the Xuan Mai urban area (Doan & Bui, 2016). Flooding in this area is caused by several factors, including intense rainfall leading to fluvial floods that overflow onto low-lying areas (Nguyen et al., 2019), mountain floods that flow directly onto low-lying areas (Le et al., 2022), and rapid land use change (Doan & Bui, 2016). Furthermore, the right bank area of the Tich–Bui Rivers serves as a flood retention area to reduce flood damage in downtown Hanoi (Hanoi People's Committee, 2009). This task poses challenges to agricultural production (Tran et al., 2021b). For example, in 2018, paddy-cultivated areas were inundated more than four times (Phan et al., 2019).

The Thuy Xuan Tien commune and Xuan Mai town in Chuong My District, Hanoi City, along the two sides of the Tich and Bui Rivers were chosen as pilot areas to investigate the applicability of citizen science for collecting flood risk-related data (Fig. 1C). The pilot area has Xuan Mai meteorologic station and Tri Thuy hydrologic station that have been measured daily since the 1960s. Furthermore, several academic institutes, such as schools and universities, are based in the area, creating a conducive setting for citizen science initiative implementation. School and university students belong to the most efficient forces in citizen science programs regarding the acquisition of knowledge and the use of data collection applications (Davids et al., 2019; Prajapati et al., 2021).

Materials and methods

To develop a community-based flood data collection approach, we implemented a citizen science program from September 2021 to August 2022. We followed the general approach suggested by Bonney et al. (2009), which consists of three main components: determining collected flood risk-related data, engaging citizen scientists, and comparing citizen science data (Fig. 2). These components are elaborated upon in the subsequent subsections. Our research developed a community-based rainfall monitoring network to promote a citizen science program and encourage participants to update flood risk-related data proactively. The reliability of rainfall collected by citizen scientists is beyond the scope of the current work and will need to be discussed in future research.

Collected flood risk-related data

Flood risk assessment requires the collection of data on flood hazards, exposure, and flood vulnerability (Apel et al., 2009). For flood hazards, information on flood probability and intensity, such as flood extent, depth, and velocity data, is mainly addressed (Trinh & Molkenthin, 2021). For exposure, the land use map, building dataset, and population distribution are often used (de Moel et al., 2015). For flood vulnerability, flood damage functions that indicate the relationship between flood direct and indirect damage to objects (buildings, crops, people, etc.) are often considered (Merz et al., 2010a, 2010b). This study used a citizen science approach to collect data on the flooding depth in residential areas, the direct impact of floods on paddy fields (flooding depth and yield reduction) in the last 10 years, and current land use data in a pilot area. Information on the land use in the field was gathered and categorized into seven different classes: forest, shrubland, agriculture rice, agriculture non-rice, built low, built high, and water body. In addition, rainfall was measured for the whole study area using low-cost rain gauges as proposed by Davids et al. (2019). The low-cost rain gauge is described in Supplementary Material S1.

The questionnaire was designed to collect necessary data and included two parts. The first part covered the biodata questions of the respondents, and the second part covered the collected flood risk-related data. The questions about flood hazard and flood vulnerability data were based on the approach described by Glas et al. (2018). The question of flooding depth in the residential area was given using the reference height level of human body parts and houses to create easy-to-understand questions for citizen scientists (Peters-Guarin, 2008; Sy et al., 2020). The questions about exposure and rainfall data collection were adopted from Davids et al.’s work (2018, 2019), in which taking photographs of investigated subjects was obligatory. The questionnaire was designed on the Open Data Kit (ODK) Collect app and KoBo Toolbox web form for Android-based mobile devices and non-Android-based mobile devices, respectively, and paper forms (only applied for hazard and vulnerability data). The questions used to collect data in this research on the web form can be found in this link (https://bit.ly/3E5oNvX).

Citizen scientists’ engagement

Preliminary site visits

Preliminary site visits were conducted in the Bui River Basin to understand the flood situation, choose the pilot area, and create reference datasets for comparison with citizen science data. During our site visits, ODK Collect was used to document flood marks left in residential areas and land use in the pilot area. The flood depth at 89 flood mark locations of the 2018, the biggest flood in the last 10 years, was measured using a tapeline (refer to Supplementary Material S2, Fig. S2). The measured flood depths depend on the clarity and accuracy of flood marks on buildings and other objects. Therefore, it is not guaranteed that these recorded flooding depths accurately represent the maximum water level of an actual event. Nonetheless, they do provide valuable information for reconstituting flood events. Land use data were collected at 14 sites where we could determine a typical land use class from seven classes within a 20-m radius. The land use class at sites was classified by the first author and controlled by the fourth author to ensure a consistent reference database (Saralioglu & Gungor, 2019). In addition, we gathered a 2018 flood report mentioning flood-affected areas and damage data on the agricultural production of individual farmers from the Chuong My District People’s Committee.

Citizen scientist recruiting and training

Citizens living in or around the pilot area who were over 12 years old, regardless of educational background, were the target group. Citizen scientists were recruited through personal relationships, social media, outreach at educational institutes, and field visits (Davids et al., 2019). Outreach was held for secondary school, high school, and university students, and the outreach program content was modified to match the participants’ backgrounds. The recruitment campaign occurred during the COVID pandemic, so seven outreach events were organized on site (n = 3), virtually (n = 1), or through hybrid meetings (n = 3). Citizen scientists interested in rainfall monitoring were equipped with low-cost rain gauges that were installed in their households. The citizen scientists were trained to conduct surveys or self-report data on their preferred questionnaire forms through in-class, virtual, or on-site training. To consolidate the training process, tutorial videos for the installation of data collection applications and the surveying procedures were published on the YouTube channel (https://bit.ly/44lfjHL; Vietnamese language only), and an annotated and added-picture demonstration was available on digital forms to guide the participants.

Data collection

The citizen scientists were categorized into two groups. The first comprised participants who self-reported or interviewed their family members on flood risk-related data or measured rainfall. These participants were called “self-investigators.” The self-investigators were asked to provide flood risk-related data at a feasible time within 2 weeks after the training session. After 2 weeks, the research team contacted the self-investigators again to thank them for their support, collect the completed paper forms, or invite them to provide data again. In addition, they were asked to monitor rainfall at their houses often during rainy days and less frequently on days without rain.

The second group comprised participants who participated in surveys to gather flood risk-related data from the locals. These participants were called “investigators.” The investigators were asked to conduct surveys using digital forms (ODK Collect, web form) after participating in training sessions. Each investigator was led to a specific area to conduct household surveys on flooding in residential areas and flood damage to paddy fields. Both investigators and self-investigators sampled land use in the field wherever they wanted during their daily life activities or data collection campaigns. To compare the results between citizen scientists and authors in land use classification, eight investigators participated in a field experiment for 1 day in April 2022. They were led to the 14 sites mentioned in the “Preliminary site visits” section to sample and classify land use.

Data quality control and dissemination

To enhance the citizen science data quality, completed questionnaires and measurements of flood risk-related data and rainfall data were reviewed manually in 2-week to 4-week intervals. Common errors included incorrect rainfall units, blurred images, and inconsistent data between answers or information and images. Feedback on errors was promptly provided to the citizen scientists to prevent implausible data. Mislocated coordinates caused GPS signal errors, and non-GPS-generated paper forms were processed based on the address of the survey areas, Google Maps, and survey photos (Beza et al., 2018; Ribeiro et al., 2020). All discrepancies were corrected, and edits and notes were documented for future analysis. Data collected through ODK Collect are publicly available on the S4W data collection platform on the website (https://data.smartphones4water.org/, retrieved on June 26, 2023) with the land use data category under development.

Citizen science data analysis

To evaluate the reliability of citizen science data, we compared these data to the reference datasets created by authors or gathered by the local authority. Based on achievable reference datasets from preliminary site visits, the data of flooded and non-flooded points, and the paddy field flood damage rate in 2018, and land use samples gathered during the field experiment in April 2022 obtained from citizen scientists were compared. The comparison involved overall agreement (OA) and individual agreement levels, which were determined using a confusion matrix (Congalton, 1991). Rainfall comparison was excluded from the research.

For the flood hazard data, we compared flooded and non-flooded points gathered by citizen scientists with the flooding map for the 2018 flood. In addition, flood depth differences between citizen scientists and the flooding map at flooded points were tested. Following Ribeiro et al.’s approach (Ribeiro et al., 2020), a flooding map was created using a 1:2000 topographic map and the 89 flood depth points. The flood elevation of these points was determined by combining the flooding depth and elevation value. A local combination method was used to determine a typical flood elevation surface based on flood level points for each subdomain of 1 km × 1 km for a pilot area (Mason et al., 2021). A digital elevation model and flood elevation surface with 10 m × 10 m resolution for the whole area were created by performing the multilevel B-spline interpolation method in QGIS between elevation points and flood surface levels of distinct subdomains (Ribeiro et al., 2020). Pixels with elevation values lower than the flood surface level were flooded. The flooding map was validated using local authority reports, internet news, and permanent water bodies (Giordan et al., 2018).

For flood vulnerability, we compared the flood damage to paddy fields gathered by citizen scientists with official flood damage data from the local authority for the 2018 flood. The local authority only investigated damage information from households with damaged areas from 30 to 70% and more than 70% because this information is used for compensation claims. The questions about the paddy field damage rate in our research were classified in more detail with 20% damage intervals (i.e., 20–40%, 40–60%). Therefore, the paddy field damage rate obtained from citizen scientists used a median value (e.g., 30%, 50%) to compare with official data. The damage rate was reclassified to < 30%, from 30% to less than 70%, and ≥ 70%, corresponding to low, medium, and high levels, respectively. The flood damage collected by citizen scientists was acceptable when the damage rates matched the damage rate level.

Results

Citizen science data

Participant demographics

The participant demographics in this research are illustrated in Table 1 (for details on the participants, see Supplementary Material S3). There were 59 citizen scientists divided into two common genders. Most participants were 12–34 years old, accounting for 87%. In addition, 57% of participants were educated at the college level or lower, followed by 36% at the bachelor’s level. Forty-five percent of the participants were recruited through personal relationships, whereas only 2% joined the citizen science program through social media. The number of self-investigators was three times higher than the number of investigators.

Table 1 Demographic characteristics of citizen scientists

Full size table

Received data

Fifty-nine participants contributed 594 flood risk-related data and rainfall measurements (hereafter referred to as data) for 1 year (Fig. 3). The allocation of the data number per participant decreased with the expanded data number per participant. Twenty-seven people, approximately 50% of participants, provided data only once. Only five participants, or 8.5% of citizen scientists, contributed more than 50 data per person. The citizen scientists who contributed more than 50 data during project periods were considered “active participants.” This active group contributed nearly 50% of the data over 1 year, increasing the total data number from 307 to 594.

The 594 data classified into four data types are shown in Table 2. The rainfall and exposure data accounted for 59% and 23% of the total collected data, respectively, which was significantly greater than the data for flood hazard and flood vulnerability (10% and 8%, respectively). The lists of measurements and surveys from citizen scientists are provided in Supplementary Material S4.

Table 2 The list of data categorized into data types

Full size table

Data quality assessment

Flood hazard

There were 62 hazard data obtained from citizen scientists (Supplementary Material S5), of which 56 points lay inside the pilot area (Fig. 4). Of the 56 points, 25 had never been flooded and 31 were flooded in the past. Flood event chains such as 2013, 2017, 2018, 2019, and 2021 were frequently mentioned. Although the research focused on collecting flood events in the last 10 years, after 2013, the citizen science survey sites (ID) 47, 30, and 39 in Fig. 4 mentioned flood events in 2003 and 2008. In addition, the citizen science survey sites (ID: 7, 9, 10, 17, 39) included videos, photos, and additional information (failure of drainage systems, surveying locations compared with affected locations, etc.), which were used for the interpretation of citizen science data. The 2018 flood was the biggest in the last 10 years and was mentioned by 20 respondents. Twelve of 20 flooded points provided detailed information about the 2018 floods, such as flooding depth and duration, which were considered sufficient and consistent for this research. Therefore, we used 12 flooded points and 25 non-flooded points obtained from citizen scientists for comparison with a flooding map for 2018, as described in the next paragraph.

The 2018 flooding map was built using a topographic map and 89 flood depth points (Fig. 4). Once again, it is not guaranteed that this map represents the maximum flooding depth because it depends on the accuracy of measurements in residential areas (refer to the “Preliminary site visits” subsection). The map was used to compare with the flooding depth points obtained from citizen scientists. The 2018 flooding map was verified using statistical data. As the statistical data did not provide any information about the spatial distribution, the total estimated flooding area was compared with the statistically determined flooding area. The flooded area of the flooding maps was 447 ha, 2% higher than that of the statistical data, making it authoritative for comparison with citizen science data on flood hazard. The comparison results showed high overall agreement of 86% between citizen scientists’ flood survey points and the 2018 flooding map (Table 3). Non-flooded points gathered from citizen scientists were more reliable than flooded points; the agreement of these two classes was 96% and 67%, respectively. In addition, the flood depth of 8 flooded points gathered by citizen scientists was 0.34 m higher on average than the depth extracted from the flood map (Supplementary Material S5).

Table 3 Confusion matrix of flood hazard data of 2018 collected by citizen scientists and flooding map

Full size table

Exposure data

The land use classification agreement level between citizen scientists and authors was assessed using a confusion matrix (Table 4). Eight citizen scientists were brought to 14 sites prepared by the authors to classify land use samples. One hundred land use samples, approximately 90% of the total expected samples (8 citizen scientists × 14 sites = 132 samples), were submitted by citizen scientists via digital data collection forms during the field experiment in the spring of April 2022 (Supplementary Material S6). The map of land use sample sites is shown in Fig. S3. The overall agreement was 0.82, which showed significant agreement in land use classification between citizen scientists and authors. Citizen scientists correctly classified high built-up, paddy rice, and water body areas without confusion. They also had almost perfect agreement in classifying forest and low built-up lands, with over 81% agreement for both classes. Non-rice and shrubland classes were the most confusing for participants, with only 47% of non-rice areas correctly classified by citizen scientists and 64% agreement for shrubland.

Table 4 Confusion matrix of land use classification by citizen scientists

Full size table

Flood vulnerability

Citizen scientists collected 46 flood vulnerability data with five households outside the pilot area, 11 not affected or having no names of households, and 30 households matching the list of flood-affected households on paddy fields recorded by the local authority after the 2018 flood (Supplementary Material S7). Therefore, the paddy damage data of these 30 households were compared to data from the local authority. A comparison of flood-affected households’ paddy damage rate between citizen scientists and the local authority is shown in Fig. 5 and Table 5. The overall agreement was 73%, demonstrating that citizen scientists have the potential to investigate or self-investigate data on flood vulnerability. The paddy damage rates collected from citizen scientists were lower than those collected from the local authority. For example, although all compared households had a damage rate greater than or equal to 30% according to the local authority, four households identified by citizen scientists had less than 30% damage or no damage (household IDs: 2, 13, 24, and 28, Fig. 5). Some of this disagreement might be affected by the time of data collection or by respondents’ memory or emotions.

Table 5 Confusion matrix of paddy damage rate data collected by citizen scientists

Full size table

Monthly citizen science data collection

The monthly data gathered by participants from September 2021 to August 2022 are shown in Fig. 6 and Supplementary Material (S3, S4, S5, S6, and S7). In the first four months, the citizen science program was affected by the COVID pandemic, so this research obtained little data. After rainfall monitoring activities were implemented in January 2022, monthly data increased significantly in the first four months of 2022. April was the month with the largest quantity of data, with 181 data when a field experiment for land use collection took place. Monthly data gradually decreased in the last 3 months. During the data collection campaign, there was one moderate flood in October 2021 and an abnormally heavy storm in May 2022, which flooded residential areas and damaged paddies in the Bui River Basin, respectively. Citizen scientists living in flood-affected areas updated flood information in 2021 (Fig. 7A) and provided paddy damage after being harvested 1 month after the storm in 2022 (Fig. 7B). One citizen scientist gathered land use information in one area in April and August 2022, where a paddy-cultivated area was abandoned during flood reason in low-lying land (Fig. 7C and D).