The Nipponbare genome and the next-generation of rice genomics research in Japan

The map-based genome sequence of the japonica rice cultivar Nipponbare remains to date as the only monocot genome that has been sequenced to a high-quality level. It has become the reference sequence for understanding the diversity among thousands of rice cultivars and its wild relatives as well as the major cereal crops that comprised the food source for the entire human race. This review focuses on the accomplishments in rice genomics in Japan encompassing the last 10 years which have led into deeper understanding of the genome, characterization of many agronomic traits, comprehensive analysis of the transcriptome, and the map-based cloning of many genes associated with agronomic traits.


Introduction
The elucidation of the rice genome sequence is a major milestone in science as it paves the way for understanding the biology of a major cereal crop that feeds more than half of the world's population (International Rice Genome Sequencing Project 2005). Although roughly a hundred plant genome sequences have already been published to date, the map-based sequence of Oryza sativa ssp. japonica cv. Nipponbare remains as the only monocot genome that has been sequenced to a highquality level. It has therefore become a reference for sequencing of other cereal crops with much larger genome sizes such as maize (Schnable et al. 2009), sorghum (Paterson et al. 2009), soybean (Schmutz et al. 2010), barley (International Barley Genome Sequencing Consortium 2012), and wheat (International Wheat Genome Sequencing Consortium 2014). More importantly, the rice genome sequence has become the most powerful tool in agriculture enhancing the ability of breeders to develop new cultivars with highly desirable traits such as high yield, resistance to biotic/abiotic stress, good eating quality, and cultivars that could adapt to an ever changing cultivation environment brought about by global warming. It is expected that subsequent sequencing of a wide array of rice germplasm throughout the world will be the platform for propelling the next green revolution to increase productivity under more sustainable conditions.
Although 90 % of rice is consumed mainly in Asia, it is also a major food source in many African and South American countries. Rice is a main staple in the Japanese diet with the current average per capita consumption of about 60 kg per year. It has been cultivated both as a staple and economic crop for more than 2000 years across the country and has been integrated in many aspects of the culture as well. Thousands of cultivars have been developed as a result of crossbreeding and selection conducted by farmers and breeders to suit the specific local conditions. Therefore, the complete rice genome sequence based on the cultivar Nipponbare led to the large-scale characterization of other japonica cultivars including the widely cultivated and elite cultivar Koshihikari (Yamamoto et al. 2010) known for good eating quality.
This review will focus on the accomplishments in rice genomics in Japan encompassing the last 10 years since the completion of the rice genome sequence. There is no doubt however that a great deal of accomplishments has been achieved not only by the 10 participating countries in the international sequencing consortium but also by many rice researchers worldwide who have continuously engaged in understanding the rice biology based on the map-based Nipponbare genome sequence. In Japan, succeeding efforts in genome analysis from 2005 onwards have led to fine tuning of the genome assembly, deeper understanding of the structure of specific regions of the genome, characterization of many important traits across various cultivars, comprehensive profiling of the transcriptome, and the isolation and map-based cloning of many genes associated with agronomic traits.

Enhancing the genome assembly and annotation
There have been continuous efforts to refine the genome assembly and enhance the annotation of the genes since the publication of the high-quality map-based sequence of the japonica cultivar Nipponbare. These efforts focused on gap-filling of the 12 chromosomes and characterization of the complex regions of the genome such as the centromeres, telomeres and nucleolarorganizing regions. Among the 12 chromosomes, the complex and highly repetitive centromere-specific DNA sequences were first reported in Cen4 (Zhang et al. 2004), Cen8 (Nagaki et al. 2004;Wu et al. 2004), and subsequently Cen3 (Yan et al. 2006) which also complemented the previous extensive works on rice centromeres (Jiang et al. 1996;Cheng et al. 2002). We have continued to improve the quality of the Nipponbare genome pseudomolecules even after the completion of the IRGSP sequencing initiative. Using BAC sequence analysis, genome annotation, and FISH analysis, we characterized the nearly completed and high-quality genomic sequence of Cen5 in chromosome 5 and revealed some striking differences among the centromeres in terms of the copy number and distribution pattern of the centromere-specific satellite repeat CentO as well as the distribution and expression of transcription units within the pericentromeric and centromeric regions ). In the case of the telomeres, Fibre-FISH analysis revealed the presence of arrays of 730-1500 conserved copies of telomere-specific 5'-TTTAGGG-3' repeat sequence at the end regions of chromosomes 1S, 2S, 2L, 6L, 7S, 7L and 8S of Nipponbare (Mizuno et al. 2006). Gene annotation from the 500 kb subtelomere sequences clearly indicated that the rice chromosomal ends were gene-rich with high transcriptional expression. In addition, the subtelomere regions on these chromosome ends hardly contained TrsA, a subtelomeric repeat sequence of rice. On the other hand, clusters of TrsA have been observed in chromosomes 5L, 6S, 8L, 9L and 12L (Mizuno et al. 2008a). Sequence comparison of these 14 telomere-ends and telomereflanking regions also revealed the occurrence of deletions, insertions, or chromosome-specific substitutions of single nucleotides within the telomere specific repeats at the junction between the telomere and subtelomere, suggesting the telomeric variants in rice have arisen from the rapid expansion of a single mutation rather than from the gradual accumulation of random mutations (Mizuno et al. 2008b). More recently, the 14 telomere-ends from 12 chromosomes were successfully constructed from a fosmid library leading to the identification of telomere sequences and structure in rice (Mizuno et al. 2014). These additional sequenced regions of the genome have been incorporated into the genome assembly as we update the pseudomolecules on a regular basis. The most recent physical map of the genome covers almost 97 % of the entire genome with 62 remaining physical gaps (Fig. 1).
The latest genome assembly was constructed as a joint effort of the Rice Annotation Project Database (RAP-DB) of the National Institute of Agrobiological Sciences (NIAS) and the Michigan State University (MSU) Rice Genome Annotation Project to update and validate the reference IRGSP Nipponbare genome sequence and provide a unified set of pseudomolecules to the rice research community ). The genome assembly was revised using the rice optical map (Zhou et al. 2007) to validate the minimal tiling path. The nextgeneration sequencing (NGS) data obtained by resequencing two individual Nipponbare plants using the Illumina Genome Analyzer II/IIx and Roche 454 GS FLX were used to check sequencing errors in the revised assembly. This resulted in the identification of 4886 sequencing errors and five insertions/deletions in the 321 Mb of the assembled genome corresponding to an error rate of 0.15 per 10,000 nucleotides in the original IRGSP assembly. The revised and unified genome assembly, Os-Nipponbare-Reference-IRGSP-1.0 (IRGSP-1.0), is now used to provide a common platform for genome annotation in the RAP-DB (http://rapdb.dna.affrc.go.jp, Rice Annotation Project 2008) and the MSU rice annotation database (http://rice.plantbiology.msu.edu/ cgi-bin/gbrowse/rice/).
In line with the revision of the genome assembly, the RAP-DB has been enhanced further with the mapping of 154,579 transcript sequences from the genus Oryza and other monocot species (Sakai et al. 2013). In addition, literature-based manually curated data, transcriptome data, and NGS data of major rice cultivars were also incorporated into the database. The current release of RAP-DB consists of 37,869 loci including 1626 loci that correspond to literature-based manually curated annotation data, commonly used gene names, and gene symbols. Transcription data derived from Illumina RNA-seq analysis of various tissues under normal and stress conditions (Mizuno et al. 2010;Oono et al. 2011;Kawahara et al. 2012Kawahara et al. , 2016 have been added to enhance the utility of the database in understanding transcriptional regulatory networks. Links to gene families in rice, Sorghum bicolor, Zea mays and Arabidopsis thaliana are provided to facilitate analysis of how genes are conserved and evolved among plant species. An additional feature to RAP-DB is the Short-Read Assembly Browser (S-RAB) that provides a viewer for Illumina reads of the japonica cultivar Koshihikari and indica cultivar Guangluai-4 mapped to the Nipponbare genome, showing the alignments, single nucleotide polymorphisms (SNPs), and gene functional annotations. The RAP-DB is updated on a regular basis so that it can provide researchers with the latest information on characterization of rice genome structure and function. Recent advances in DNA sequencing technologies resulted in generation of massive genome sequencing data in a considerable number of rice cultivars and species. To facilitate efficient visualization of rapidly emerging large-scale sequencing data, a novel web-based browser, Tasuke (http://tasuke.dna.affrc.go.jp/), with various functions to show the variation and read depth of multiple genomes, as well as annotations and SNP data of hundreds of cultivars aligned to a reference genome at various scales, has been developed for efficient utilization of emerging NGS data of rice cultivars (Kumagai et al. 2013).

Deeper perspectives on rice genetic resources
The reference Nipponbare genome facilitated the sequencing initiatives of rice germplasm aimed at understanding the genetic diversity which led to the domestication of rice as grown today. Foremost among these initiatives are the Oryza Map Alignment Project (OMAP) to clarify the diversity in the twelve wild rice genomes (Wing et al. 2007), and the international effort of resequencing a core collection of 3000 rice accessions from 89 countries to provide a foundation for large-scale discovery of novel alleles for important rice phenotypes (The 3,000 Rice Genomes Project 2014; Huang et al. 2012). To facilitate comprehensive understanding of the genome diversity in rice, we have also sequenced the African rice O. glaberrima known to be more resilient to water shortage as well as fungal or insect diseases than O. sativa (Sakai et al. 2011). The high-quality assembly and annotation of the O. glaberrima genome have also been reported, providing evidence for its independent domestication (Wang et al. 2014).
In contrast, successive efforts in Japan focus on understanding cultivars important to Japanese agriculture particularly those widely grown throughout Japan. To date,  whole genome sequences from 16 varieties obtained by next-generation sequencers have been submitted in public databases by NIAS and other Japanese research organizations (Additional file 1: Table S2), and projects for sequencing other varieties and landraces are in progress.
The cultivar Koshihikari developed in 1953 is the most widely grown and favored cultivar in Japan occupying almost 80 % of total rice production including its relative cultivars. Many breeding efforts focus on further improvement of quality depending on the region where it is grown. The genome sequence of Koshihikari is therefore indispensable in breeding and designing rice to meet the demands of Japanese consumers. With the reference Nipponbare sequence, the next development was the sequencing of the Koshihikari genome with the Illumina sequencing technology (Yamamoto et al. 2010). A total of 67,051 SNPs between Koshihikari and Nipponbare, some of which derived from originating landraces and distributed through Koshihikari relatives, and 18 pedigree haplotype blocks which were artificially selected during breeding.
The Nipponbare pseudomolecule sequence was used as the template in the construction of a complete BACbased physical map of the O. sativa ssp. indica cv. Kasalath . We also sequenced the centromere region of chromosome 8 in Kasalath ). Comparative analysis with Nipponbare Cen8 revealed both collinearity and diversity in each orthologous centromere. Subsequently, deep sequencing (>154fold coverage) via the Roche GS-FLX Titanium or GS-FLX+ and Illumina GAIIx or HiSeq 2000 platforms and de novo assembly generated the 330.55 Mb Kasalath pseudomolecule sequence representing 91.1 % of the genome with 35,139 expressed loci annotated by RNA-Seq analysis (Sakai et al. 2014). Comparison of the Kasalath pseudomolecule with Nipponbare revealed 2,787,250 SNPs and 7393 large indel sites (>100 bp). On the other hand, comparison with the indica cultivar 93-11 showed 2,216,251 SNPs and 3780 large indels (Sakai et al. 2014). In particular, at least 14.78 Mb of indel sequences and 40.75 Mb of unmapped sequences were identified in the Kasalath genome in comparison with the Nipponbare genome suggesting that~6.3 % of the total transcript loci in rice genome is presumably involved with gain or loss of genes.
Genotyping of the NIAS Genebank (https://www.gene.affrc.go.jp/index_en.php) rice accessions with 179 RFLP markers led to development of a rice diversity research set of germplasms (RDRS) in indica, aus and japonica accessions available for the detailed genetic studies and rice improvement (Kojima et al. 2005). Based on a result from screening 234 accessions of rice collected in Asia, the Americas, Africa, Europe and Oceania with 169 SSR (simple sequence repeats) markers, moreover, current O. sativa cultivars and landraces can be classified in more detail into five genetically differentiated groups: indica, aus, aromatic, temperate japonica, and tropical japonica because of its deep genetic structure evolved during domestication and adaptation and its autogamous breeding system (Garris et al. 2005).
A series of comparative genomic studies among various species in the genus Oryza focused on a number of domestication or adaptation related genes such as the sh4 gene region responsible for the reduction of grain shattering, the semi-dwarf1 (sd-1) gene (Wu et al. 2008;Asano et al. 2011), the major heading-date related genes such as Hd1, Hd3a, Hd6, RFT1 and Ghd7 (Fujino et al. 2010;Yamane et al. 2009;Ebana et al. 2011). Analysis of expression levels revealed clear association of the functional and nonfunctional alleles with early and late flowering, suggesting that Hd1 is a major determinant of variation in flowering time of cultivated rice ). Sequencing of BAC clones covering the chromosomal region of Hd3a and RFT1 genes across the AA~GG genomes revealed that at least 89 % of the amino acid sequences encoded by Hd3a which promotes the transition to flowering under the short-day condition were conserved across the different Oryza species (Komiya et al. 2008). In comparison with Hd1, the Hd3a gene obviously showed much less genetic diversity with 95~100 % sequence identity among the accessions of O. sativa and O. rufipogon (Fig. 2a, b). Similarly, the RFT1 gene which has been associated with late flowering of rice under long-day condition in a functional Hd1 background (Ogiso-Tanaka et al. 2013) also showed low genetic diversity (Fig. 2c). Extremely high gene collinearity was also found in the surrounding region (~300kbp) of Hd3a and RFT1 genes across the Oryza species despite the size differences caused mainly by transposable element insertions (Fig. 2d). Unlike Hd3a, the RFT1 gene has only been found in the Oryza species that have the AA (including O. sativa) or BB genomes (O. punctata). These genomes diverged from a common ancestor onlỹ 2 Mya. This result suggests that the RFT1 gene may have originated from Hd3a by a recent duplication although a possible deletion of RFT1 within the other species could not be ruled out. The Nipponbare reference sequence could contribute insights into the molecular mechanisms underlying genomic evolution and selection in rice which would benefit breeding programs to modify and control flowering time through efficient utilization of different genes or gene alleles.

Elucidating the molecular function of rice genes
The rice genome sequence of Nipponbare has been pivotal in the development of a system for discovering the biological functions of approximately 32,000 genes identified in rice. This has been addressed early on with the development of resources for functional genomics such as the Tos17 insertion mutant panel (Hirochika 2001) and the rice full-length cDNA collection (Kikuchi et al. 2003). The Tos17 mutant collection consisting of approximately 50,000 mutant lines characterized into flanking sequences have been used in full characterization of many genes characterized in Japan and nearly 50 Tos17 mutant lines have been successfully used in tagging specific genes (See reference list at https://tos.nias.affrc.go.jp/ doc/references.html).
Similarly, the information from approximately 28,000 fully-sequenced cDNA clones and the KOME Database (hhttps://dbarchive.biosciencedbc.jp/en/kome/desc.html) have been very useful for functional characterization of many genes. Both these resources are still widely used up to the present by researchers around the world particularly in systematically assigning functions to many predicted genes in the genome, in addition to other resources such as T-DNA insertion lines (Jung and An 2013), Ac/Ds tagging lines (Guiderdoni and Gantet 2012), nDART/aDart lines (Takagi et al. 2007). Analysis of the flanking sequences of these insertional mutants resulted in accumulation of nearly 448,000 gene-tagging sequence resources for characterization of gene functions (Wei et al. 2013).
The rice full-length cDNAs also serve as the main resources for large-scale gene expression profiling. Using these sequences as well as predicted gene information for plausible probes, we have designed 44 K Agilent oligonucleotide microarray to analyze the field transcriptome of field-grown rice (Fig. 3). A wide range of gene expression profiles based on organs and tissues at various developmental stages identified organ/tissue specific genes as well as growth stage-specific genes. Continuous transcriptome profiling of leaf from transplanting until harvesting stage uncovered two major drastic changes in the leaf transcriptional program (Sato et al. 2011). The rice transcriptome is well documented in two databases, namely, RiceXPro (http://ricexpro.dna.affrc.go.jp/) providing an overview of the transcriptional changes throughout the growth of the rice plant in the field, and RiceFREND (http://ricefrend.dna.affrc.go.jp/) for coexpression analysis of these genes. These resources are now widely used for deciphering gene functions and analysis of rice gene networks. Combining the massive field transcriptomic data and meteorological information with statistical model construction, we have also succeeded, to some extent, in predicting fluctuation of gene expression, or transcriptome dynamics (Nagano et al. 2012). These predictions may give insights for improving crop production, disease resistance and resilience to global stress. Information from transcription profiles can also predict the best cultivation conditions for a given variety in a given location.
With current advances in next-generation sequencing, we have embarked on analysis of gene expression using RNA-seq to measure the presence and quantity of RNA transcripts under specific growth conditions. As a result, unannotated salinity stress-inducible transcripts have been identified using the RNA-seq profile of seedlings treated with NaCl (Mizuno et al. 2010). Through RNAseq analysis, previously unannotated salinity response genes, some of which might function in phosphate and cadmium stress tolerance were discovered. Simultaneous measurement of rice and rice blast fungus transcripts in infected plants using RNA-seq provided infectionresponsive expression profiles for both rice and fungal transcripts. These profiles might indicate genes that may interact at different times and different tissues during infection . The same strategy has been used to characterize the transcriptome profile of japonica cultivar Nipponbare under phosphate starvation (Oono et al. 2011, as well as the diversity among the transcriptomes of rice cultivars including Nipponbare with low tolerance to phosphate starvation stress, japonica cultivars IAC 25 and Vary Lava 701 with relatively higher tolerance, and indica cultivar Kasalath known to be highly tolerant to phosphate stress . Recently, we have also found that cadmium stress controls the expression of genes in drought stress signal pathways in rice based on genome wide transcriptome analysis .

Isolation and utilization of agronomically important genes
The genome sequence coupled with genomics tools and resources such as mutant lines, genetic populations and germplasm collection paved the way for marker-assisted selection, mapping of QTLs and map-based cloning of specific genes having agronomic properties. As a result, many QTLs have been detected as Mendelian factors in rice, including those responsible for increased yield, resistance to various insect pests and diseases, resistance to abiotic stress such as drought, salinity and submergence, good eating quality etc.

Heading date
Many QTLs involved in heading date, a key determinant of rice adaptation to different cultivation areas and cropping seasons, have been characterized. These include Hd1, Hd2, Hd3a, Hd3b, Hd4, Hd5, Hd6, Hd8, and Hd9 . Subsequent studies focused on more detailed characterization of these QTLs based on the genome sequence. The Hd1 contains a CCT domain with~60 % identity to Ghd7, a long day dependent negative regulator of heading date. Hd6 encodes casein kinase 2 alpha, and Hd3a is similar to an Arabidopsis FT-like protein Kojima et al. 2002). Another major QTL, Early heading date 1 (Ehd1), encodes a B-type Fig. 3 Characterizing the field transcriptome of rice. Microarray analysis was used to characterize gene expression of rice cells and tissues at various stages of development from transplanting to harvesting in the field response regulator that is suppressed under long-day conditions for which Ghd7 is responsible (Doi et al. 2004). In more recent studies, it has been found that Ehd3, encoding a PHD finger-containing protein, is a critical promoter of rice flowering ). Hd17, a homolog of Arabidopsis EARLY FLOWERING 3 (ELF3), is involved in the photoperiodic flowering pathway by regulating the transcription level of a flowering repressor, Grain number, plant height and heading date 7 (Ghd7) gene (Matsubara et al. 2012). The QTL Hd16 is a gene for casein kinase I which is involved in the control of rice flowering time by modulating the day-length response (Hori et al. 2013).

Disease resistance
Developing cultivars with broad spectrum of resistance to diseases is a priority for rice breeding in Japan. Among the genes widely characterized for disease resistance, the rice WRKY45 gene has been found to play a crucial role in resistance to bacterial and fungal blast induced by benzothiadiazole (BTH), a so-called plant activator that protects plants from diseases by activating plant innate immune system (Shimono et al. 2007). The Pib gene is the first cloned gene for resistance to blast induced by Magnaporthe oryzae (Wang et al. 1999) and it was reported much later that Pi21 encodes a proline-rich protein with a heavy metal-binding domain and putative protein-protein interaction motifs (Fukuoka et al. 2009).

Domestication
The Nipponbare genome sequence has been instrumental in deciphering the evolutionary processes in the domestication of rice (Yang et al. 2012). Several genes that play key roles in selection and domestication have been analyzed. The seed shattering qSH1 gene has been found to encode a BEL1-type homeobox gene and a SNP in the 5' regulatory region caused a loss of seed shattering owing to the absence of abscission layer formation (Konishi et al. 2006). More recently, Kala4, the gene responsible for the black color of rice grains (also referred to as purple rice) has been identified based on an extensive analysis of the genes associated with grain color in about 50 rice cultivars, tracing the origin to tropical japonica (Oikawa et al. 2015).

Abiotic tolerance
Map-based cloning of qLTG3-1 which controls lowtemperature germination in rice provides useful insights on cultivation in temperate as well as high altitude rice growing areas (Fujino et al. 2008). The molecular mechanism of deepwater response has been clarified through the identification of the genes SNORKEL1 and SNORKEL2, which trigger deepwater response by encoding ethylene response factors involved in ethylene signaling (Hattori et al. 2009). With the molecular cloning of Sdr4, a seed dormancy QTL in rice, the role of the gene as an intermediate regulator of dormancy in the seed maturation program has been clarified (Sugimoto et al. 2010). The DEEPER ROOTING 1 (Dro1) has been recently discovered through screening and genetic analysis of the NIAS rice collection which have a great potential for improvement of rice yield under drought conditions by controlling the root system architecture in rice (Uga et al. 2013).

Yield
The major components that determine yield in rice have been widely characterized using the sequence information. The qSW5 gene which corresponds to the QTL for seed width on chromosome 5 has been cloned and a deletion in the gene was found to be associated with larger grain size (Shomura et al. 2008). Furthermore, it has also been shown that this variant was selected during rice domestication for increased yields. Characterization of genes associated with productivity has also made significant progress. A loss-of-function mutation of rice DENSE PANICLE 1 causes semi-dwarfness and slightly increased number of spikelets (Taguchi-Shiobara et al. 2011). A natural variant of NARROW LEAF 1 (NAL1) gene selected in high-yield rice breeding programs increased the photosynthesis rate Fujita et al. 2013). The THOUSAND-GRAIN WEIGHT 6 (TGW6) gene limits endosperm cell number and grain length. Defective alleles lead to increase in grain size and yield . Also researchers at Nagoya U. revealed the basic molecular strategy for construction good plant type for ideal yield performance, most of which related to metabolism or biosynthesis of plant hormones (Ueguchi-Tanaka et al. 2005;Ikeda et al. 2013) A list of genes that have been characterized mainly or in collaboration with Japanese researchers in 2005-2014 is summarized in Additional file 2: Table S1. Significant contributions have been made in elucidating gene functions, identifying QTLs, characterizing molecular mechanisms, and establishing the DNA marker-assisted selection (MAS) as a precise and effective breeding strategy to produce novel varieties. There is no doubt that in the last 10 years since the completion of the rice genome, worldwide rice research has made significant output as evidenced in the number of rice related publications. An overview of the trend in rice research in the last 45 years is shown in Fig. 4 (data provided by Oryzabase, Kurata and Yamazaki 2006). Since 2005, the number of publications doubled in just a matter of 5 years (2010) and by the end of 2014, there are almost 2000 publications on rice alone (including 67 from Japan). In total, Japanese researcher contributed about 70-100 per year in the last 10 years since the completion of the rice genome sequence in 2004.

Conclusion
In Japan, rice genomics has been a part of major research programs of the Ministry of Agriculture, Forestry and Fisheries (MAFF) that address various issues in sustainable food production and agriculture. Foremost among these issues are the rapid aging of farm workers and depopulation of farming communities which would eventually affect agricultural production in the not so distant future. In breeding programs, various efforts are being initiated to integrate rice genomics technology to the development of novel varieties to reinvigorate Japanese agriculture. The new basic plan for food, agriculture and the rural areas which serve as the guideline for advancing the reform of measures and efforts by the entire nation so as to enable Japan's agriculture and rural areas to accurately respond to structural and other changes in the economy and society (http://www.maff. go.jp/e/basic_law/basiclaw_agri/basiclaw_agri.html). Rice being an integral part of Japanese agriculture is very much a part of these programs. Several ongoing MAFFfunded research projects focus on using the genome information of various crops for the development of various technologies to boost next-generation agriculture. As a part of that major project, the development of rice genomics resources and informatics tools are expected to contribute in such areas through attempts of various sectors to develop rice cultivars that address the specific needs of various rice producing regions in Japan considering the environmental changes in rice cultivation due to climate change.
As the rice research community embarks on various efforts in rice genomics, more rice genome sequence and transcriptome profiles will be generated in the very near future. Elucidating the molecular mechanisms controlling many biological processes will be supplemented by information to be obtained from other technologies such as proteomics, metabolomics (Okazaki and Saito 2016), epigenomics (Chen and Zhou 2013), and phenomics . Integration of all these data via advanced bioinformatics will elucidate the gene cascade or network in the whole rice plant that will serve as the platform on how to utilize and improve crop function. And Japanese researchers will continue to be a part of various initiatives for these advancements will most likely revolutionize rice breeding to circumvent future concerns of sustainable agriculture and food security.

Additional files
Additional file 1: Table S2. Rice varieties and landraces with wholegenome short-read sequences submitted by Japanese research organizations in the NCBI database. (XLSX 20 kb) Additional file 2: Table S1. Rice genes reported by Japanese researchers in various scientific journals in 2005-2014. Literatures were searched and obtained from PubMed with 'rice' and 'Oryza' as keywords in either the title or abstract, and further selected by natural language processing and manual curation. The data can be accessed from Oryzabase (http:// shigen.nig.ac.jp/rice/oryzabase/download/reference). (XLSX 215 kb) Abbreviations IRGSP, International Rice Genome Sequencing Project; NGS, next-generation sequencing technology; RAP-DB, Rice Genome Annotation Project Database