Global genomic diversity and conservation priorities for domestic animals are associated with the economies of their regions of origin

Domestic animals play a key role in human survival and the development of civilization. However, the genetic resources of domestic animals are facing an alarming rate of erosion due to socioeconomic changes, economic globalization and financial constraints. In this study, through genome-wide SNP analysis, we estimated the heterozygosity, inbreeding coefficient, effective population size, and runs of homozygosity to identify the breeds facing the risk of extinction for sheep and cattle across the world. In particular, we quantified the contribution of 97 sheep breeds and 53 cattle breeds to genomic diversity (within-breed, between-breed and total) and prioritized the breeds for conservation. Additionally, we compared the average values of genomic diversity between breeds from regions (or countries) in different economic categories (underdeveloped, developing and developed), and found that breeds in developed regions exhibit significantly higher levels of total genomic diversity than those in underdeveloped and developing regions. Altogether, our results suggested that conservation priority should be given to breeds in developed regions to secure the future genomic diversity hotspots of domestic animal resources.

Farm animals contribute in many ways to human survival and economic development, particularly as societies evolve from subsistence agriculture into market-based economies 1 . However, in the past two decades, farm animal genetic resources have faced an alarming rate of extinction and genetic erosion, as approximately 200 uniquely adapted breeds have become extinct and up to 30% of global livestock breeds (approximately 1,200-1,500) are currently endangered 2 . This loss of diversity is attributed to multiple causes, including regional economic forces such as the decline in the economic viability of traditional livestock production systems 3 , destruction of the native habitats (e.g., grazing lands) of livestock breeds 4 , rapid dissemination of uniform and highly productive breeds at the expense of native genetic resources 5 and the extension of markets and economic globalization 6 . In this context, an integrated analysis of the genomic diversity of farm animals and the associated economies can aid an understanding of economic impact on the changes in farm animal genomic diversity in recent years. Additionally, it will contribute to the development of governmental interventions and the implementation of conservation priorities that are necessary to ensure that ongoing agricultural development will be compatible with the conservation and sustainable use of local livestock diversity 7 .
Here, we ranked the conservation priorities of 97 native sheep breeds and 53 native cattle breeds from around the world based on genome-wide single-nucleotide polymorphism (SNP) data for the first time. Firstly, we calculated heterozygosity, inbreeding coefficient, and runs of homozygosity (ROH) for each sheep and cattle breed to detect the small and inbred populations that are facing the risk of extinction. Moreover, we implemented the first association analysis between economic indexes and genomic diversity by studying a total of 2,941 animals from 150 native breeds of two globally distributed livestock species: sheep (1,910 animals, 97 breeds; Supplementary Table 1) and cattle (1,031 animals, 53 breeds; Supplementary Table 2). Specifically, the estimations of genomic diversity contributions for sheep and cattle were based on SNP array data genotyped from the Ovine SNP50 (or Infinium HD) and Bovine SNP50 BeadChips, respectively. Through the molecular coancestry 8 and allelic richness approaches 9 , we calculated the breed contribution to genomic diversity at three levels: within-population diversity (ΔGD WS ), between-population diversity (ΔGD BS ), and total genomic diversity (ΔGD T ) for each breed after its removal (Supplementary Tables 1 and 2). For region-wide breeds, the estimation of genomic diversity contributions was implemented by excluding all of the breeds within the region from the total breeds (Supplementary  Tables 3 and 4). We then compared the ΔGD WS , ΔGD BS and ΔGD T among the regions of origin (or countries) in terms of the annual gross domestic product (GDP) per capita (GDPPC). We also took into account the increase in the GDP per capita (ΔGDPPC) for the regions during the past two decades in a dual-regression analysis to correlate the contribution of the genomic diversity of the breeds and the past economic development in a region. To facilitate comparisons among the regions with different economies, we grouped the regions into three categories (i.e., underdeveloped, developing and developed) according to the average GDPPC (A-GDPPC) during the years 1993-2013. Based on all the above analyses, we were finally able to determine the contribution of each breed to the maximum amount of genomic diversity and establish the core sets of breeds with higher priority for conservation.

Results
Genetic diversity pattern and inbreeding coefficient. Heterozygosity is a measure of genetic variation within a population. We estimated the observed (H O ) and expected (H E ) heterozygosity for each sheep and cattle breed. For sheep breeds, the highest H O value was observed in the RAA (0.394) breed from Spain, and the lowest was in the SPS breed (0.252) from Yunnan province of China. The highest and lowest expected (H E ) heterozygosity were also observed in the RAA breed (0.385) and SPS breeds (0.258), respectively (Supplementary Table 1 Effective population size. The results for effective population size (N e ) for sheep and cattle were calculated based on the linkage disequilibrium method and presented in the Supplementary Tables 1 and 2. In sheep, N e ranged from 3,074 (BYK) down to 3 (WNS), while in cattle the value ranged from 1,429 (ZBO) down to 1 (BALI). The high degree of inbreeding may be the reason for the extremely low N e value of BALI, as evidenced by the high F IS and F ROH value observed in this breed (Supplementary Table 2). To glean further details, we divided the N e values into three categories: N e ≤ 100, 100 < N e < 200, and N e ≥ 200, and the number of breeds within each of the three categories were summarized in Table 1 (also see Figs 1a and 2a, Supplementary Figs 1a and 2a and  Supplementary Tables 1 and 2).

Runs of homozygosity (ROH).
In this study, we calculated the ROH statistics for each breed of sheep and cattle, and subsequently the inbreeding coefficient based on ROH (F ROH ) was estimated. The F ROH values ranged from 0.0004 (WZS) to 0.2016 (WIL) in sheep breeds (Supplementary Table 1) and from 0.0001 (ZEB) to 0.6842 (BALI) in cattle breeds (Supplementary Table 2). Regarding the number of ROH (N ROH ) above 500 kb, the sheep breed of WIL (N ROH = 780) showed the highest N ROH value and WZS (N ROH = 2) had the lowest N ROH value (Supplementary Table 1). For the cattle breeds, ZEB (N ROH = 1) and BALI (N ROH = 1,127) breeds showed the lowest and highest N ROH value, respectively (Supplementary Table 2). Finally, we also calculated the average length of ROH above 500 kb for each breed (L ROH ). For sheep and cattle breeds, L ROH values ranged from 26.9 Mb (HLS) to 9.0 Mb (SOA), and 21.8 Mb (BOR) to 5.1 Mb (ZEB), respectively (Supplementary Tables 1 and 2). To obtain more details, F ROH , N ROH and L ROH values were also divided into three categories: F ROH categories (F ROH ≥ 0.10, 0.05 < F ROH < 0. 10   Genetic structure. The relationship between breeds was investigated using the NJ tree based on the Reynolds distance estimated from allele frequency differences, and branch lengths were highly variable. The 97 sheep breeds were clearly divided into African, Asian (Southwestern Asian, Southeastern Asian, and China), and European sheep breeds ( Supplementary Fig. 3a). However, sheep from the Americas were clustered with European-derived sheep breeds ( Supplementary Fig. 3a). The 53 cattle breeds were clearly divided into European and African cattle breeds, although the American and Southeastern Asian cattle breeds exhibited an admixture with cattle breeds from Europe and Africa ( Supplementary Fig. 3b).  Table 1). In cattle, these values ranged from −0.781 (BALI) to 0.257 (CANC), −0.198 (CANC) to 0.670 (BALI) and −0.176 (LAG) to 0.172 (HFD), respectively (Supplementary Table 2). As representative of the main cattle breeds reared around the world, six cattle breeds (HFD, HO, AN, ANR, NRC, and BEFM; 11.3%) showed a ΔGD T > 0.1 and 11 (20.8%) breeds showed a ΔGD T < −0.1 in this study (Supplementary Table 2 and Supplementary Fig. 4f). However, none of the native sheep breeds presented extreme values of ΔGD T > 0.1 or <−0.1 (Supplementary Table 1 and Supplementary Fig. 4c). This observation implied a more diverse genetic constitution of sheep compared with cattle. Another possible reason is the differences in number of sheep and cattle breeds analyzed, as the contribution of a single breed to genomic diversity is expected to be lower in a larger sample.  Table 3). For cattle, ΔGD WS ranged from 2.271 (France) to −1.421 (Indonesia), ΔGD BS ranged 1.134 (Indonesia) to −1.225 (France), and ΔGD T ranged from 1.047 (France) to −0.293 (Pakistan) (Supplementary Table 4). Further comparisons revealed significant differences in ΔGD WS and ΔGD T between breeds from the three economic categories. For instance, both species showed that the breeds from developed regions had the highest average value of ΔGD T , followed by those from developing regions and underdeveloped regions (Fig. 3). In addition, the breeds in developing and developed regions exhibited significantly (P < 0.05) higher ΔGD WS values than those in underdeveloped regions, whereas the ΔGD BS estimates for breeds from different regions within the three economic categories were not significantly different from each other (Fig. 3a). We also examined the impact of economic growth, which is typically measured as the rate of change in the average annual GDP per capita (ΔGDPPC), on the gain or loss of genomic diversity. Our results showed that ΔGD WS and ΔGD T presented significant associations with ΔGDPPC during the past two decades (sheep: R 2 = 0.087, P = 0.073 (ΔGD WS ), R 2 = 0.278, P < 0.01 (ΔGD T ); cattle: R 2 = 0.235, P < 0.05 (ΔGD WS ); R 2 = 0.318, P < 0.01 (ΔGD T ); Fig. 4a,c,d,f). In both species, ΔGD WS and ΔGD T displayed a rapid increase for ΔGDPPC < 20,000 and a slow increase thereafter (Fig. 4).

Discussion
Heterozygosity is an important factor for estimating the genetic variability in domestic animals. Heterozygosity represents the genetic potential and adaptation abilities to the natural environment 10 . In this study, we found that the livestock in developing and developed regions have higher heterozygosity than those in underdeveloped regions or some island populations. For example, sheep breeds from Southwestern Europe and Northern China obviously harbor higher observed (H O ) and expected (H E ) heterozygosity than those from Africa and southwestern China, which belong to underdeveloped regions (Supplementary Table 1). Similarly, cattle breeds from Europe and America possess higher H O and H E values than those from Africa and Asia (Supplementary Table 2). Besides economic factors, another reasonable explanation is that breeds with high heterozygosity are from regions near the domestication center or trade routes [11][12][13][14][15] . Notably, we found that the island breeds always show low heterozygosity. For example, BALI breed, which is distributed on the Bali island of Indonesia Table 2). These results suggested that these island breeds have suffered population decline and their genetic variation has been reduced.
Effective population size (N e ) has been regarded as a popular parameter to investigate small populations and endangered species. According to the 50/500 rule of thumb 16 , N e values under 50 means that the population is likely to face a serious genetic threat after 5 or more generations. This is because small populations cannot easily avoid inbreeding depression. N e values above 500 means that the population can maintain an evolutionary potential for longer periods of time. However, Frankham et al. 17 have proposed a new criterion that N e values under 100 and above 1,000 are suitable to control inbreeding and maintain high genetic diversity, respectively 17  introduction of breeds and intensive breeding [18][19][20][21][22] . Overall, we can see that most sheep and cattle breeds do not show a long-term stable status, and thus, we need to pay more attention and carry out appropriate conservation strategies on these native breeds with small effective population size.
ROH is a key parameter to detect the level of matings between related individuals in a closed or small population, and the number and length of ROH could indicate the extent of recent inbreeding and artificial selection 23,24 . Because of its utility, ROH has been used to investigate genome-wide autozygosity regions associated with some important traits and diseases in domestic animals and humans 25,26 . Inbreeding is among the major causes that lead to an increase in homozygosity, which in turn, reduce fitness and accumulate more recessive alleles that may lead to expression of unfavorable phenotypes 27,28 . According to the inbreeding coefficient based on runs of homozygosity (F ROH ), we have identified 10 sheep breeds (WIL, BOR, WNS, BCS, ZTS, BGE, DSH, BHM, EFB and YXZ) and three cattle breeds (BALI, SH and JER) with an F ROH value above 0.10. In addition, these breeds also showed a large number of runs of homozygosity (N ROH ≥ 200) (Supplementary Tables 1 and 2). Together these results suggest that these breeds were founded as small populations or the populations recently experienced severe inbreeding. Previous studies have provided evidence that domestic animals have suffered serious inbreeding and the effective population size is declining under the pressure of artificial selection 29,30 . In our study, most of the sheep and cattle breeds had a ROH length above 10 Mb, suggesting they share recent ancestors 3 generations ago 31,32 . Also, there was a negative correlation between F ROH and N e , especially in the sheep breeds. Sheep breeds from North China had a lower F ROH and a higher N e than other regions, while in cattle breeds the lowest F ROH and the highest N e appeared in Africa and Europe. Interestingly, although some sheep and cattle breeds from the developed regions show high F ROH, N ROH and L ROH and low N e values, they make contributions to genetic diversity. For example, 6 sheep breeds (BOR, DSH, ISF, SOA, WIL and GAL) have a N e value under 200, but they exhibit high ΔGD BS and positive ΔGD T values (Supplementary Table 1). Also, three cattle breeds (SH, GNS and JER) present the same pattern as the sheep breeds. Therefore, these breeds should be subjected to conservation measures to avoid the erosion of their genetic diversity. In addition, the breeds from developing and underdeveloped regions with low N e values, such as sheep breeds BGE, GAR, ZTS and SPS, and cattle breeds LAG, ND2, BOR, ZMA and BALI (Supplementary Tables 1 and 2), are strongly differentiated from the continental breeds ( Supplementary Fig. 3), implying that they may have some special economic traits to be used in breeding programs.
Population structure is a useful information for humans to design effective strategies to improve the conservation of farm animal genetic resources. In this study, we performed a comprehensive analysis of population structure of sheep and cattle breeds worldwide. The 97 sheep breeds can be clearly divided into six geographic regions (Europe, Southwestern Asia, Southeastern Asia, China, America and Africa) (Supplementary Fig. 3a). Previous research on 74 diverse sheep breeds collected globally documented a high level of admixture within the European sheep, which showed high co-ancestry with most other breeds 33 . Compared to the previous study, our data set incorporated sheep breeds from China, and identified the breeds with large distinctness among the sheep breeds. Based on the results of the NJ tree, the breeds from Southeastern Asia, Africa, Southwestern China and several breeds (WIL, BRL, DSH, EFB, ISF and GAL) from Europe have longer branches ( Supplementary Fig. 3a), suggesting these breeds are isolated breeds and possess unique genetic characteristics. The investigation of genetic structure of cattle reveals that cattle have two domestication centers: Fertile Crescent and Indus valley 15 . The African taurine has large divergence from European taurine and Asian indicine because of their large portion of wild African auroch ancestry 34 . This is consistent with our results that African cattle show apparent divergence from European cattle (Supplementary Fig. 3b). In addition, 7 African (LAG, BAO, ND2, ZMA, NDAM, SOM and ZEB) and 3 Southeastern Asian breeds (SAHW, GIR and BALI) make a large contribution to the between-breed  Table 2). These results reflect the unique genetic resources carried by these breeds.
From the results of loss or gain of genetic diversity, coupled with the evidence of the above statistics (i.e., heterozygosity, N e , ROH and population structure), our most prominent finding was that breeds from developed regions exhibit higher total genomic diversity than those in underdeveloped and developing regions, and thus, should be given a higher conservation priority (Fig. 3). Since the sample of this study does not cover the global distributions of sheep and cattle, future studies based on more extensive and representative samples are needed to verify this finding. For the potential factors affecting conservation priority, whether the economic situation measured based on the annual GDP per capita plays a direct role in livestock conservation is unclear. One hypothesis is that a better economy induces more effective conservation measures, including higher subsidies for raising local livestock breeds and more financial investment in conservation programs. For example, the European Union has long recognized the importance of conserving animal genetic resources (AnGR) and, in 1992, initiated a policy of economic incentives for farmers keeping native breeds, under EC regulations 2078/92 and 1257/99 35 Table 1 and Supplementary Fig. 4a). This observation could be due to factors such as long-term genetic isolation, genetic drift, or inbreeding, and thus, effective conservation measures preventing genetic drift in small-sized and isolated breeds and promoting reasonable gene flow with outside breeds should be implemented for the native breeds in underdeveloped regions. A second hypothesis is that economic growth is one causative factor for the change in the livestock production system, possibly through effects on breeding programs. Rapid economic growth could have led to an increased need for more productive and synthetic livestock breeds 36 and therefore contributed to the increased ΔGD T and ΔGD WS and decreased ΔGD BS values observed in livestock (Fig. 4). Native breeds in developed regions exhibited high values of both ΔGD WS and ΔGD T (Fig. 3), which could also be due to the improvement of breeding programs by avoiding indiscriminate crossbreeding 37 . However, breeds in developing regions such as Xinjiang of China showed high values of ΔGD WS but negative values of ΔGD T (Supplementary Table 1 and Supplementary Fig. 4a,c), which could be explained by the development scheme of extensive and intensive crossbreeding of sheep with exotic breeds adopted in past decades 38 . Apart from the potential impact of economic situation on livestock conservation, several other factors may also account for the conservation status of domestic animals. For instance, the development of the breed concept, which originated 200-250 years ago in Europe, has probably affected the number of breeds differentiated and selected within the region. Also, transboundary exchanges of breeds may have played a fundamental role in shaping the genetic diversity of breeds, especially in American and Australian countries.
Another important finding regarding genetic diversity was that breeds in developing regions have been facing the erosion of between-breed genomic diversity (ΔGD BS ). Both species are transboundary animals, and many highly productive commercial breeds have been transferred worldwide to upgrade local breeds [39][40][41][42] . With the rapid economic development and increase in the human population in developing regions, the share of trade in genetic material from developed to developing countries increased from 20% in 1995 to 30% in 2005 43,44 . Long-term intensive crossbreeding programs using imported exotic breeds have narrowed the diversity of local genetic materials and led to the loss of between-breed genomic diversity in developing regions 45 . In northern China, the estimates of both ΔGD BS and ΔGD T were negative, suggesting that northern Chinese sheep have suffered severe genetic erosion induced by exotic breeds in the past, which has resulted in the loss of both total and within-breed genomic diversity ( Supplementary Fig. 4b,c). Similar scenarios were also observed in the breeds of Iran, Turkey and Pakistan ( Supplementary Fig. 4a,c). However, livestock in underdeveloped regions (including remote and isolated island areas) are commonly used to service the local community 46,47 , being subjected to little or no influence from exotic breeds, and these populations therefore showed high ΔGD BS values ( Supplementary  Fig. 4b,e). For example, the diverse and isolated topography of the Yunnan-Kweichow Plateau of China has contributed to the genetically isolated populations of local sheep breeds, which show high and positive ΔGD BS values, but negative ΔGD T values ( Supplementary Fig. 4b,c). Similar results were observed in cattle from mainland Africa and Madagascar ( Supplementary Fig. 4e,f). In these scenarios, breeds with high levels of within-breed genomic diversity (ΔGD WS ) should be ranked with high conservation priorities to preserve closed populations that will be capable of coping with future challenging environments or diversified food production. However, breeds with large divergence should also give protection measures to prevent their extinction due to their unique genetic resources and special adaptation to local environment.
As the ovine and bovine SNP BeadChips have generally been developed on European breeds, it is expected that ascertainment bias would impact the estimates of genetic diversity metrics. However, our results should not be significantly affected by ascertainment bias for the following reasons. On the one hand, Herráez et al. 48 have demonstrated that the effect of ascertainment bias could be countered if the SNPs with high LD were removed. Based on this finding, we recalculated the heterozygosity and ΔGD T (loss or gain of genetic diversity) by using the LD-pruned data, and obtained similar results with those not controlling for LD ( Supplementary Fig. 5). On the other hand, Kijas et al. 33 calculated genetic diversity of sheep by using two types of SNP chip data which were genotyped by Roche 454 and Illumina GA sequencing, respectively. The Roche 454 SNPs were developed primarily using animals of European origin, while the Illumina GA SNPs were designed using a larger number of animals selected from multiple regions. The results showed that ascertainment bias was unlikely to heavily influence genetic diversity between breeds and regions. From all the evidence presented above, we believe that ascertainment bias should have little influence on our results.
In conclusion, we show that native sheep and cattle breeds from developed regions in this study make a greater contribution to total genomic diversity than those from developing and underdeveloped regions; thus, a higher conservation priority should be given to these breeds. In contrast, most of the breeds in underdeveloped and SCIEntIfIC RePoRts | (2018) 8:11677 | DOI:10.1038/s41598-018-30061-0 developing regions make a small or no contribution at all to total genomic diversity and therefore receive relatively lower rankings in terms of conservation priority. Nevertheless, we noticed that breeds from underdeveloped countries tend to contribute more to between-breed diversity (Fig. 3). This is also a conservation objective, as many of these small, isolated and peripheral populations may have valuable variation (e.g., private alleles) that should be conserved. Also, it should be noted that the lack of samples in some distribution areas of sheep and cattle limit the generality of the results. The mechanisms underlying these observations are not well understood, but the regional annual GDP per capita and its growth rate, which are two important indicators of economic development, could be the missing link and help to explain the differences in the genomic diversity contributions of livestock breeds from different regions. In addition to genomic diversity, other factors, such as sociocultural value, local adaptation and the special uses of some livestock breeds, should also be considered in the development of conservation programs. With an awareness that the conservation recommendations made from the analysis of the contribution of breeds to diversity depend on the conservation interests and scopes, our results could help local authorities and animal breeders to ensure the effective and sustainable utilization of animal genetic resources in the future.

Methods
The methods were carried out in accordance with the approved guidelines of the Good Experimental Practices adopted by the Institute of Zoology, Chinese Academy of Sciences. All experimental procedures and animal collections were conducted under a permit (No. IOZ13015) approved by the Committee for Animal Experiments of the Institute of Zoology, Chinese Academy of Sciences, China.
Ovine sample collection and DNA extraction. Blood or tissue samples from 783 animals representing 40 native Chinese sheep breeds were collected in this study (Supplementary Table 1). In all cases, particular efforts were made, based on both pedigree information and the knowledge of local herdsmen, to ensure that the animals were unrelated and typical of their breeds. Whole blood samples were collected into evacuated tubes containing EDTA and ear marginal tissues were collected and stored in 2-ml microcentrifuge tubes containing 75% ethanol. Genomic DNA was extracted from the tissue samples using the standard phenol/chloroform protocol 49 or from blood using the General AllGen Kit (Tiangen Biotech, Beijing, China) following the manufacturer's instructions.
Ovine SNP datasets genotyping. Of  sheephapmap.org). Details regarding SNP discovery, the design of the ovine array and genotyping procedures for the BeadChips can be found at the following address: http://www.sheephapmap.org/hapmap.php, and in Kijas et al. 33 for the SNP50 BeadChip and Anderson et al. 50 for the HD SNP BeadChip.
Published Ovine SNP dataset. An Ovine SNP50 BeadChip dataset of 74 worldwide-distributed breeds/ populations (Dataset III) was retrieved from a previous study 33 . A set of 57 native breeds (Supplementary Table 1) that had been sampled and genotyped within the Sheep HapMap project were selected out of the 74 breeds (for information on the breeds and their geographic origins, see Kijas et al. 33 ).
Ovine SNP datasets quality control. All of the ovine SNP BeadChip datasets (Datasets I, II and III) were merged to apply quality control procedures using Plink v.1.9 software 51 . The SNP quality control measures have been detailed elsewhere, in Kijas et al. 52 , Miller et al. 53 , Kijas et al. 33 and Lv et al. 54 . Briefly, quality control was performed according the following criteria: (1) individuals with a call rate >95%, (2) SNPs with a minor allele frequency (MAF) >0.05, (3) SNPs with a >95% genotyping rate, and (4) SNPs with physical locations on autosomes. After removing the SNPs and individuals that failed to meet any of the above criteria, 39,641 SNPs and 1,910 individuals were retained in our working dataset.
Bovine SNP datasets. Bovine SNP50 BeadChip datasets were retrieved from previous publications 10,34 , which included 53 native cattle breeds across its main distributions in the world (Supplementary Table 2). SNPs and individuals that failed to meet any of the following criteria were removed: (1) individuals with a call rate >95%, (2) MAF > 0.05, (3) SNPs with a >95% genotyping rate, and (4) SNPs with physical locations on autosomes (see also Decker et al. 34 ). After filtering, there were 37,827 SNPs and 1,031 animals available for further analyses. Runs of homozygosity (ROH). Runs of homozygosity (ROHs) were detected using the Plink v.1.9 software 51 for each individual. The following settings were generally suitable for ROH identification in the domesticated animals 58 : (1) required minimum density (-homozyg-density 1000) and (2) number of heterozygotes allowed in a window (-homozyg-window-het 1). In this study, we detected ROH with the minimum length of 500 kb (-homozyg-window-kb 500) and a minimum SNP number of 50 SNPs (-homozyg-window-snp 50). The number (N ROH ) and average length (L ROH ) of ROH for each breed were estimated and the N ROH and L ROH were divided into three categories: N ROH ≤ 100, 100 < N ROH < 200, and N ROH ≥ 200; L ROH ≥ 20 Mb, 15 Mb < L ROH < 20 Mb, and L ROH ≤ 15 Mb. In addition, the inbreeding coefficient based on ROH (F ROH ) was calculated for each animal according the following formula 26,59 : where the numerator represents the sum of ROH per animal above 500 kb length and L is the total length of genome. Total lengths of the cattle and sheep genomes were considered as 2,510,611 kb 60  Analysis of conservation priorities based on the molecular coancestry approach. The method of molecular coancestry 8 has proven to be efficient for defining conservation priorities for domestic animals. Unlike prioritization based on Weitzman diversity measures 66,67 , this method considers within-breed genetic variation, which is of great importance for the management of livestock 68 . The method assumes that there is a metapopulation including n populations with N i breeding individuals. In the metapopulation, s i denotes the average self-coancestry of the N i individuals of population i; D ii is the average distance between individuals of the i th population; and D ij is Nei's minimum distance between populations i and j 69  This equation can also be expressed based on genetic diversity 70 : In equation (3),  f and  s are the mean coancestry and self-coancestry of the populations, respectively; ∼ D is Nei's minimum distance between individuals within populations; and D is the average genetic distance over the entire metapopulation. In the derived expression, the total genetic diversity ( = − f GD 1 T ) was divided into three partitions: the genetic diversity within individuals ( = −  s GD 1 WI ), the genetic diversity between individuals ( = −   s f GD BI ) and the genetic diversity between populations ( ). The sum of the first two components (GD WI and GD BI ) gives the genetic diversity within populations ( = −  f GD 1 WS ). As a result, the total genetic diversity GD T is assumed to be the sum of the genetic diversity within populations (GD WS ) and the genetic diversity between populations (GD BS ), i.e., GD T = GD WS + GD BS .
The loss or gain of genetic diversity (ΔGD WS, ΔGD BS and ΔGD T ) was estimated using Metapop v.2.0 software 71 when each of the populations was removed from the dataset. The parameter of λ = 1, which indicates equal weight for within-and between-breed diversity, was applied in the Metapop calculations 71 (Supplementary  Tables 1 and 2). It is noteworthy that the initial version of Metapop used loss (−) vs gain (+) of diversity as indicative of high vs low contribution of diversity, respectively. However, in the latest version of Metapop (used in this paper), the sign was changed to go to a more intuitive view: + means greater contribution and − means lower contribution.
Analysis of conservation priorities based on the allelic richness approach. Allelic richness is an important measure of genetic diversity for setting conservation priorities in livestock. The rarefaction method developed by Hurlbert is advantageous for estimating allelic richness 72,73 . With this method, El Mousadik and Petit denoted allelic richness as a i , representing the number of different alleles in a sample of genes taken at random. The calculation formula is as follows 74 : where P ik is the probability that allele k does not exist in a random sample of g genes; N i represents the total number of genes in the sample of a given subpopulation i and N ik represents the number of copies of the kth allele from that subpopulation. The within-subpopulation component of allelic diversity was calculated according to the following formula 75 : The average distance between all subpopulations is where d A,ij is the average allelic distance between subpopulations i and j, and it is equal to The total allelic diversity is estimated with the following equation: The three estimators (A S , D A and A T ) were calculated using Metapop v.2.0 software as described above (Supplementary Tables 1 and 2).
The results obtained by the molecular coancestry and allelic richness methods have some differences. However, we compared the estimated values based on both methods using paired t-tests, and found that they did not show significant differences for the three estimators (for sheep and cattle, ΔGD WS ; t = 0.01, df = 96, P = 0.999 and t = 0.031, df = 52, P = 0.976; ΔGD BS : t = 0.2, df = 96, P = 0.842 and t = 0.286 df = 52, P = 0.776; and ΔGD T : t = 0.301 df = 96, P = 0.764 and t = 0.429, df = 52, P = 0.670, respectively). In order to obtain more accurate results, the average values of the estimates obtained using the two approaches were employed in the following statistical analysis. The average values of ΔGD WS, ΔGD BS and ΔGD T in sheep and cattle were divided into four fractions: <−0.1, −0.1-0, 0-0.1 and >0.1, and then plotted around the world using the software ArcGIS v.10.1 (ESRI Inc, Redlands, CA, USA) ( Supplementary Fig. 4).
Analysis of the effect of regional economics on the conservation priority of different breeds.
We further investigated the associations between the changes (loss or gain) in genetic diversity for each breed (or group of breeds) and the economies of the countries (or regions) from which the breeds originated. The average GDP per capita of the countries (or regions) during the years 1993 to 2013 was collected from the World Bank database (http://data.worldbank.org.cn/indicator). Because the 40 native Chinese sheep populations originated from 13 provinces, which differ greatly in their economies and employ different breeding and conservation strategies for domestic animals 76 , the GDP per capita for the provinces, rather than the whole country of China, was collected for the period from 1993 to 2013 (from the China Statistics Bureau; available at http://data. stats.gov.cn; Supplementary Table 5) and included in the analysis. The countries (or regions) were classified into three different categories based on their average GDP per capita (A-GDPPC) during these years: underdeveloped countries/regions (A-GDPPC ≤ $1,500); developing countries/regions ($1,500 ≤ A-GDPPC ≤ $15,000); and developed countries/regions (A-GDPPC ≥ $15,000). For sheep, the breeds came from 38 countries or regions: 9 were underdeveloped; 16 were developing; and 13 were developed. For cattle, the breeds came from 21 countries or regions: 10 were underdeveloped; 3 were developing; and 8 were developed (see Supplementary Tables 3 and  4). Subsequently, we calculated the loss or gain of diversity for each country/region when all of the breeds from the country/region were removed using the two approaches described above (Supplementary Tables 3 and 4). Differences in the loss or gain of genetic diversity averaged over sheep and cattle breeds in the three different economic categories were compared using the boxplots statistics in the boxplot package of R software v.3.4.3 77 , and the P value was calculated using the Kolmogorov-Smirnov test 78 .
To evaluate the effect of economic development on the loss or gain of genetic diversity for the breeds, we performed regression analysis between the loss or gain of genetic diversity (within subpopulation, ΔGD WS ; between subpopulations, ΔGD BS ; and total genetic diversity, ΔGD T ) and the increased average GDP per capita (ΔGDPPC) for 1993 to 2013 from each country (or region) using the dual-regression approach. The coefficient of determination (R 2 ) and its significance (p-value) were calculated using SPSS v.22.0 software 79 .
Data availability. The data sets analyzed in the present study are available from the corresponding author on reasonable request.