RiceAtlas, a spatial database of global rice calendars and production

Knowing where, when, and how much rice is planted and harvested is crucial information for understanding the effects of policy, trade, and global and technological change on food security. We developed RiceAtlas, a spatial database on the seasonal distribution of the world’s rice production. It consists of data on rice planting and harvesting dates by growing season and estimates of monthly production for all rice-producing countries. Sources used for planting and harvesting dates include global and regional databases, national publications, online reports, and expert knowledge. Monthly production data were estimated based on annual or seasonal production statistics, and planting and harvesting dates. RiceAtlas has 2,725 spatial units. Compared with available global crop calendars, RiceAtlas is nearly ten times more spatially detailed and has nearly seven times more spatial units, with at least two seasons of calendar data, making RiceAtlas the most comprehensive and detailed spatial database on rice calendar and production.


Background & Summary
Rice is the world's most important food crop. It is harvested from over 163 million ha in more than 100 countries (http://www.fao.org/faostat/en/#home). It is grown in diverse cropping systems and environments-from single crop systems in temperate and tropical regions in both rainfed and irrigated conditions, to intensive monoculture in irrigated areas in the tropics where rice is grown two or three times per year.
Although information on the distribution of global rice production by region and country can be derived from readily available statistics (e.g., http://www.fao.org/faostat/en/#home, http://apps.fas.usda. gov/psdonline/), information on its distribution within a year is often lacking. Rice area and production statistics are available only at the national level for some countries; if these are available at the subnational level, the statistics are often by year and not by season. Linking information on production and area to the crop calendar can help analyze spatio-temporal variation in rice production. This can contribute to an improved ability to answer questions about food security. For example, this information, together with data on climate shocks and rice stocks, can be used to better assess seasonal and geographic variation in rice supply to mitigate shortfalls in rice availability at certain times of the year. Furthermore, information on where and when rice is planted is needed to quantify the potential risk of abiotic and biotic stresses during the rice-growing seasons, and to model the effects of global climate change and technological change on rice yield and production. In summary, a globally complete and spatially explicit rice crop calendar linked to area and production estimates is a valuable global public good.
Several rice crop calendars exist (http://www.fao.org/agriculture/seed/cropcalendar/welcome.do, refs 1-3). Some are limited to a few countries or have regional coverage, whereas others are global. Regional resources include crop calendars for Latin America and the Caribbean 1 and for Africa (http://www.fao.org/agriculture/seed/cropcalendar/welcome.do), which have information on the planting and harvesting periods of rice and other major crops by agro-ecological zone. The database on rice for Africa includes 26 countries, and that for Latin America includes 24. The calendar for Latin America is outdated, whereas that for Africa does not include many rice-producing countries. Global calendars have been developed recently 2,3 . The calendars by Sacks et al. 2 lack detail at the subnational level, especially in developing countries. The MIRCA2000 (ref. 3) calendar is monthly, gridded, and available for irrigated and rainfed rice, but does not adequately cover rice areas that are cultivated more than once a year. In both global calendars, some areas with rice grown in two seasons have data for only one season. Also, they include only a maximum of two seasons-the main and second season-inadequately covering some of the world's most important rice areas with three distinct cropping seasons, such as in Bangladesh, where aman rice (main rainy season) is harvested in November-December; boro (dry season) in April-May; and aus in July-August. Parts of Vietnam also have three cropping seasons-winter-spring, spring-summer, and summer-autumn-and this is also the case in parts of China (early and late seasons for double-cropped rice areas; and middle for single-cropped rice areas) and India.
Because of the need to develop a spatially explicit global database of rice calendars that includes detailed information on rice areas with more than one rice crop in a year, we compiled the most detailed available datasets of rice planting and harvesting dates by growing season in all rice-producing countries, and linked the database to subnational production data. 'RiceAtlas' provides a spatial and seasonal distribution of the world's rice production. RiceAtlas contributes to the GEOGLAM (Group on Earth Observations Global Agricultural Monitoring) 4 initiative and regional partnerships, such as the Asian Rice Crop Estimation and Monitoring initiative (Asia-RiCE), by providing information for agricultural monitoring requirements, satellite data acquisition plans, and global crop outlook.

Rice calendar
The rice calendar in RiceAtlas is based on various published sources such as global and regional datasets, international and national publications, online sources, and unpublished data sources such as expert knowledge. Collaborators from various countries contributed new datasets, which were used to revise or validate the initial database that was compiled from existing sources ( Table 1, Supplementary  Table 1).
We collected data on the start, peak, and end dates of sowing or transplanting, and the start, peak, and end dates of harvesting of rice for all seasons in all rice-growing countries. In cases where peak planting and harvesting dates were not available, we estimated those to be at the midpoint between the start and end dates. If only the peak planting dates were available, we assumed the start and end dates of planting to be 15 days before and 15 days after the peak date, respectively. The same procedure was used to estimate the start and end dates of harvesting if only peak harvesting dates were available. Planting in a region is not done on a single date but the length of the planting window varies between regions. In the absence of information, we set the planting window to 30 days. This can be revised when better information becomes available. Where available, additional data such as crop establishment method and seedling age for transplanted rice were recorded. To describe RiceAtlas and compare its calendar with existing regional (http://www.fao.org/agriculture/ seed/cropcalendar/welcome.do; ref 1) and global (Sacks et al. 2 and MIRCA2000 3 ) datasets, we used the following metrics: 1. Coverage. This refers to the number of rice growing countries with data. 2. Spatial detail. This refers to the number of spatial units for which data is available. To avoid double counting, we merged adjacent spatial units that have the same calendar. In the case of MIRCA2000, rice calendars are available for irrigated and rainfed rice. If both were available for one spatial unit, only the irrigated calendar was considered in the count of the spatial units. 3. Seasonal detail. This is the number of spatial units with calendars for two or more seasons. This is a metric for the temporal completeness of the crop calendar. 4. Resolution. We used measures similar to those used by Deichmann 5

Rice production
We compiled rice production and area (henceforth referred to as production data or production statistics) from various sources such as national statistics agencies, agriculture departments, and the FAO     Table 1). Because available rice production data from different countries do not refer to the same years, we used the average of the last three years of available data, and adjusted these, such that the national production totals match the 2010-2012 average production from FAO (http://www.fao. org/faostat/en/#home). A three-year average retains the recent production level and accounts for interannual fluctuations in production.

Spatial and seasonal analysis of rice production
Rice production data were linked with the crop calendar data through their locations using administrative boundaries in the GADM database of the Global Administrative Areas (version 2.8; http://www.gadm. org). There were two cases where the compiled data needed adjustment: 1. Different levels of spatial resolution. For example, production statistics were available at one level (e.g., first level subdivisions) and the crop calendar was available at the a more detailed level (e.g., second level subdivisions) (35 countries); 2. Mismatch in seasonal information, for example, the crop calendar reported double cropping, but the production data referred to only a single cropping season (33 countries).
In cases where the calendar data were spatially more detailed than the production data, we disaggregated the latter using expert knowledge where available, or by assuming equal production over the entire area or season. Conversely, if production data were more detailed than the calendar data, we disaggregated the latter and assumed the same calendar for all disaggregated spatial units. If the production statistics explicitly referred to only a single cropping season but the rice calendar had more than one season, rice production was attributed to the main season only. Because of the differences in the level of detail of available data for both rice calendar and production, the spatial detail of RiceAtlas varies across continents and countries (Figures 1 and 2, Table 2).
The production data were distributed proportionally to the corresponding months based on sowing or transplanting, and harvesting dates (start, peak, and end) per season, with the peak planting or harvesting months having greater weight and progressively lesser weights for months away from the peak. For example, if the planting window was three months and the peak planting was the middle month, the first and last months were given equal weights of 0.25 each whereas the peak planting month was given 0.5. If production data for a region was available annually but there were two or more known crop seasons in a year in that area, production data were first disaggregated by season (equally, if seasonal data were not available) then distributed to months based on months of harvest. Area data, on the other hand, were distributed based on the months of both planting and harvesting to estimate the monthly area with a standing rice crop. Global monthly rice-growing areas, as well as regional and within-year distribution of rice production at harvest time, show the production peak and lean periods by region (Figures 3 and 4).

Code availability
R code used to compute for the total planted area or production by month for each country is available from Data Citation 1.

Data Records
RiceAtlas is a spatial database of rice calendars and production in 115 countries, with its attributes given in Table 3. RiceAtlas version 1.0 can be downloaded from the IRRI Dataverse Repository (Data Citation 1).
RiceAtlas will be improved by including more detailed data for selected countries when these become available and accessible. New versions will be periodically uploaded to the Repository.

Technical Validation
Expert knowledge was used to validate and correct the crop calendars. More than 50 persons contributed data and/or validated the crop calendars. Digital and analogue country-level maps and tabulations were provided to rice experts to verify the data for countries they were familiar with. Maps and tables were provided to members of the Temperate Rice Research Consortium during their review and planning meeting at IRRI headquarters on 8 − 9 November 2013. The data from Africa were reviewed by 27 rice scientists during the AfricaRice Science Week on 9 − 13 February 2015 in Cotonou, Benin. Their  comments were addressed and their revisions were included in the current version. All authors validated the crop calendars for their respective regions of expertise. RiceAtlas v1.0 has 2,725 spatial units in total. It has 2,209 unique contiguous spatial units for the rice calendar and is nearly 10 times more spatially detailed than previously published global rice calendars (Table 4). It has 904 spatial units with data for at least two rice growing seasons. This is almost seven times more than those of published global rice calendars and nearly 10 times more for Asia. RiceAtlas, therefore, greatly improves coverage of intensively cultivated rice areas in the region. In addition, based on comparison among rice calendars in terms of overall spatial resolution and rice area resolution, RiceAtlas is the most comprehensive and detailed global rice calendar currently available.
RiceAtlas will be updated as more detailed data become available and accessible. To date, the effort has concentrated on collecting and validating data for Asia and Africa. Arrangements have been made to update data for Latin America through the International Center for Tropical Agriculture (CIAT) and its network of partners in the region.  The spatial detail of RiceAtlas varies across continents and countries, and for some applications a higher spatial resolution could be desirable. To create a more homogenous and higher spatial resolution database, the available data could be used to build predictive models to downscale the rice calendar data to a higher spatial resolution (e.g., 1 km 2 grid cells). Such models could use climate data and satellite images. For example, Moderate Resolution Imaging Spectroradiometer (MODIS) images have been used to detect key phenological stages of the rice crop including start of season 6 . The use of time series of satellite images can also allow for the detection of annual variation, changes in cropping intensity and shifts in planting dates.