Introduction

Polyploidy, that is the occurrence of more than two copies of the basic set of chromosomes in a somatic cell, is very common in plants. While it has long been accepted that as many as 50% of the plant taxa are polyploid or derived from ancient polyploidy (Stebbins, 1971), recent studies on sequence conservation in Arabidopsis thaliana indicate that even the simplest genomes exhibit traces of past polyploid history (Blanc et al, 2000; Vision et al, 2000). Rounds of polyploidization, by addition of several genomes, followed by ‘diploidization’ and stabilization of meiotic features leading to pairs of homologous chromosomes have been a major component of genome evolution in plants. Chromosome pairing at meiosis is well determined in diploids, where each chromosome consistently finds its homologous counterpart to give rise to disomic segregation. Polyploids display a wider array of situations. Whether meiosis yields bivalent (by pair) or multivalent (involving more than two) chromosome associations at metaphase I of meiosis affects the pattern of chromosomal segregation to daughter cells, as well as the opportunities for chromosomal interchanges by chiasma formation. When only bivalents are formed, the way the chromosomes pair can vary between full predetermination, producing disomic inheritance and random association, leading to polysomic inheritance. The process that governs chromosome pairing at meiosis is not well understood. It involves specific genetic factors, such as the Ph1 gene in wheat (Riley and Chapman, 1958; Martinez-Perez et al, 2001), as well as structural features (Jenkins and Chatterjee, 1995). Modern sugarcane cultivars constitute a particularly complex case of polyploidy, as described by Grivet and Arruda (2001). They originate from recombination among hybrid derivatives between two highly polyploid species, namely Saccharum officinarum (x=10, 2n=8x=80) and S. spontaneum (x=8, 2n=5–16x=40–128). They usually display between 100 and 130 chromosomes (70–80% appear derived from S. officinarum), while 10–20% appear derived from S. spontaneum and about 10% are recombinants of both origins (D'Hont et al, 1996). The genetic maps available so far suggest that the parental genomes are collinear and probably differ by only a small number of rearrangements (Grivet et al, 1996; Ming et al, 1998).

Despite the high ploidy level, both parental species mainly form bivalents at meiosis (Bremer, 1929; Sreenivasan, 1975). In interspecific cultivars, meiotic metaphase spreads reveal swarms of chromosomes that are difficult to interpret, but show mostly bivalents. The most detailed studies of pollen mother cells in a range of cultivars (Price, 1963; Burner, 1994) report less than one multivalent (tri- or quadrivalent) and three to four univalents per cell on average. New insights into meiosis were brought about by molecular marker segregations, which allow the identification of chromosome-pairing affinities, thanks to linkages in the repulsion phase. This approach, although based on partial genome coverage, suggested polysomic inheritance in studies conducted using progeny from S. spontaneum (Al-Janabi et al, 1993; Da Silva et al, 1993), on the basis of a general lack of linkage in repulsion. Studies that used an S. officinarum parent detected some instances of repulsion, indicative of preferential pairing between a subset of chromosomes (Al Janabi et al, 1994; Ming et al, 1998). The first segregation studies conducted in a cultivar, namely R570, were based on RFLP among 77 individuals derived from self-pollination (Grivet et al, 1996). They led to the identification of 99 unsaturated linkage groups that covered approximately one-third of the whole genetic map. These linkage groups were gathered into 10 tentative homology classes (HC) on the basis of probes in common, enabling the identification of sets of homologous chromosomes. Several instances of repulsion were detected, essentially between S. spontaneum-derived chromosomes. A recent study using 295 individuals of the same self-progeny monitored for close to a thousand segregating amplified fragment length polymorphism (AFLP) bands (Hoarau et al, 2001) ensured half coverage of the genetic map. Out of 120 linkage groups and 52 unlinked markers, the repulsions highlighted 12 couples of chromosomes that are strongly associated at meiosis. At least two of these couples apparently display systematic pairing. The chromosomes involved appeared to be of mixed origin, and several instances of pairing between regions derived from the two ancestral species were detected. These mapping studies provided instructive features, but had several drawbacks for an accurate description of chromosome pairing. They relied on markers scored as dominant with no insight into allele dosage, which limited the accuracy of genotype determination. They also generally failed to cover all homologues in a given HC, for only chromosomes bearing single dose alleles were detected. This allowed only incomplete survey among the interacting chromosomes.

We describe a particular approach based on RFLP analysis using a particular probe chosen, because it enables us to access all chromosomes of a given HC and to characterize their segregation taking account of their dosage. This provides a unique description of chromosome assortment in what is probably the most complex crop genome studied to date.

Materials and methods

The plant materials used were chosen from those used and described by Hoarau et al (2001). They consisted of 282 individuals derived from the selfing of cultivar ‘R570’ (2n=ca. 115), an important commercial variety.

RFLP analysis was performed with standard procedures, as described in Grivet et al (1996). The individual plant DNA was cut separately with three restriction enzymes and was subjected to Southern analysis using BNL12.06 as a probe. Those bands that segregated in a 3:1 (presence:absence) fashion within the progeny were scored; they are single-dose restriction fragments (SDRF) (Wu et al, 1992) in the parental cultivar. The similarity in segregation of two SDRF revealed with two different restriction enzymes is indicative of an allele that can be resolved by both enzymes. The use of three restriction enzymes ensured some redundancy in the detection of several alleles, which was useful for data quality control. A total of 11 alleles, numbered from 1 to 11, could be identified. They correspond to the 11 alleles detected by Grivet et al (1996); their resolution by the various restriction enzymes is given in Table 1. These alleles belong to 11 linkage groups (Figure 1), of which 10 are homologous (or homoeologous when inherited from S. officinarum and S. spontaneum), for they share markers derived from the same series of RFLP probes. The 11th one comprises the BNL12.06 marker, but its other constitutive markers belong to another set of homologous linkage groups. The possibility that additional alleles exist other than the 11 scored could be discarded for two main reasons. Firstly, all the segregating bands that corresponded to multiple-dose restriction fragments could be accounted for by various combinations of the 11 alleles. Secondly, among the eight probes that constitute HC I, none revealed any unlinked marker that would suggest the existence of other, poorly marked, chromosomes of this HC (Grivet et al, 1996). Therefore, it is considered that all the chromosomes of HC I are covered.

Table 1 Resolution of the 11 BNL12.06 alleles by restriction enzymes HindIII, DraI and SstI
Figure 1
figure 1

Distribution of the 11 simplex alleles corresponding to the probe BNL12.06 into eleven linkage groups (Lg) according to the map of Grivet et al (1996). On the basis of the loci (corresponding to probes) in common, 10 linkage groups appear homo(eo)logous and are classified into HC I, whereas the 11th one is part of another HC (VIII) made of eight Lg. The BNL12.06 alleles are indicated in bold characters and the nomenclature of the alleles used in the text is indicated. The RFLP probe designation is abbreviated: B stands for BNL, C for SSCIR, D for CDSR, S for SG and U for UMC. The species origin of the alleles on the Lg is indicated when known: a black dot represents S. officinarum and a white losange S. spontaneum. The correspondence of the Lg to those described by Hoarau et al (2001) on the basis of AFLPs is indicated when known. Both RFLP and AFLP marker specificity enable assigning a tentative species origin to some of the Lg: o for S. officinarum, s for S. spontaneum and i for interspecific.

The analysis of chromosome assortment took account of the massive predominance of bivalents observed in all Saccharum forms, including cultivars. Given that the SDRF are present as a single dose in the parent, the dosages of the SDRF in the gametes can be zero or one, excluding the cases of higher dosages that are possible, although rare, when multivalents are formed. The dosage in progeny individuals derived from self-pollination can thus be either zero (no band), one or two. The parental banding pattern served as a reference to determine these dosages (Figure 2). When the internal distribution of the banding intensities within a progeny individual departed from this parental pattern, the more intense bands could be considered as representing two doses, while the less intense bands could be considered as representing a single dose. Some ambiguity persisted when the intensity distribution among the bands present with a given enzyme was similar to that in the parent; in this case, another restriction enzyme that revealed some of the same alleles together with others could indicate the actual dosage (if the intensity distribution departed from the parental one). Altogether, only three individuals were left with some ambiguity; one had six alleles present in either one or two copies, the other two had seven. This analysis thus enabled the description of most individuals by the number of doses (0, 1 or 2) of all 11 alleles present.

Figure 2
figure 2

Mode of interpretation of allele dosage in the progeny on the basis of banding intensity patterns. RFLP patterns typically display five to 10 bands with variable intensities. Taking the bands two by two (bands x and y in the example shown), the comparison of the relative intensities with the parental cultivar R570 enables determination of the doses when they are null (progeny 3, 4 or 6) or uneven (progeny 1 and 2). Thanks to the large number of two by two comparisons, it is generally possible to determine the dosage for all alleles for all progeny individuals.

Allele specificity, that is, whether the chromosome segments monitored here are derived from S. officinarum or S. spontaneum, was assessed on the basis of the apparent species origin of the RFLP markers, as determined by Grivet et al (1996), or of the AFLP markers, as determined by Hoarau et al (2001), when the latter could be connected to the RFLP map, directly or after bridging allowed by the study of Rossi et al (2003). The occurrence of several markers close to one another on the genetic map, which displayed a similar species origin, was considered conclusive for the origin of the segment. In the six cases where correspondence with the AFLP groups was established, the specificity analysis conducted by Hoarau et al (2001) could be used and the species origin of a large portion, if not the whole chromosome, could be considered. This resulted in the tentative assignment of an S. officinarum origin to five chromosomes (I-4, I-5, I-8, I-10 of group I and VIII-14 of group VIII, using the nomenclature as described by Grivet et al (1996), an S. spontaneum origin to one (I-6 of group I) and a mixed origin to two (I-7 and I-9 of group I, respectively, double and simple interspecific recombinants), the rest being of unclear origin.

The analysis of chromosome affinities first considered only bivalent pairing. The estimation of the frequency of pairing between two chromosomes bearing two alleles was performed using a maximum likelihood method. Assume pij is the frequency of pairing (and 1−pij the frequency of nonpairing) between the two chromosomes that bear alleles Ai and Aj. When these two chromosomes pair together (frequency pij), each gamete receives one dose of either allele Ai or allele Aj; when there is no pairing between these two chromosomes (frequency 1−pij), but each of them pair to another one, each gamete can receive either allele Ai, allele Aj, both of them or neither of them in a similar frequency of 1/4. The frequency distribution of the gametic genotypes can thus be described as a function of pij and can be used to derive the expected frequencies of the various zygotic genotypes in the progeny, assuming random combination between the male and female gametes. This is developed in Table 2. The frequency of the nine possible zygotic combinations in the whole progeny provides nine equations involving pij as the sole variable. Knowing the expected and observed frequencies of the different genotypes, a likelihood equation, L(pij), can be used to determine the most likely value for pij, as well as the standard deviation of this estimation (Fisher and Balmukand, 1928).

Table 2 Expected frequency of the nine possible genotypes for a couple of simplex alleles, Ai and Aj, in a self progeny

For each pair of alleles, the most likely pairing frequency, p, corresponds to the value maximizing the likelihood formula; for practical reasons, it is the logarithm of this formula that is used to determine the most likely pairing frequency, so that (log(L(p)))′=0. The result of this exercise is a full table of pairing frequency estimates between the chromosomes that bear the alleles scored.

The patterns expected in case of simple behaviors are given in Table 3, taking an example with 10 alleles. In case of polysomy (Table 3a), it is expected that all pairing frequencies will be 11.1% (1/9), each chromosome having an equal probability of pairing with any of the other nine. In case of disomy (Table 3b), each chromosome will display pairing frequencies of 100% with a specific counterpart, and 0 with all the other eight. The possible existence of univalents or trivalents (more complex patterns with higher range multivalents were not considered) is considered in Table 3c:

  • a chromosome systematically left as a univalent at meiosis (allele 1 in Table 3c) would yield pij values of 0 in all combinations; the sum of the pij for the corresponding allele would also be 0, in contrast with a value of 100% for a bivalent;

  • a set of three chromosomes systematically involved in a trivalent (alleles 2, 3 and 4 in Table 3c) would yield mutual pij values close to 33.3% (1/3) and values close to 0 in all other combinations; the sum for a given allele would be close to 66.7%, in contrast with a value of 100% for a bivalent.

Table 3 Pairing frequencies (%) among 10 chromosomes expected from various meiotic assortment patterns

In order to analyze the differentiation among the homologous chromosomes, the pair-wise pairing frequencies pij were used as a measure of similarity. The complements to 1 (ie 1−pij) were taken as a distance and used to run a factor analysis (StatSoft, 1997). The distribution of the chromosomes along the three major axes was used to summarize the mutual affinities. It also served as a basis to derive and test quantitative features, such as the number of univalents, bivalents or trivalents.

Results

Marker segregations

The raw data consist of a matrix of scores of 0, 1 or 2 representing the dosages of the 11 alleles for 236 progeny individuals and the dosages of allele 1 to allele 10 for an additional 46 individuals. The respective segregations of the 11 alleles are given in Table 4. The frequency of transmission of the various alleles ranges from 46 to 54%, with the exception of allele 8 that appears with a frequency of 61%. In several cases (alleles 4, 6, 8 and 10), the distribution of the doses departs from that expected in case of identical allele frequencies in the female and male gametes and subsequent random assortment. Such patterns can be explained by differential male and female gametic selection, as is often found in aneuploids (Khush, 1973). The most striking deviation is again found for allele 8. Beyond the individual particularities, the overall pattern of segregation shows an average allele transmission rate equal to the expected 50%.

Table 4 Dosage segregation of the 11 BNL12.06 RFLP alleles in a self progeny of cultivar ‘R570’ constituted of 236 individuals

Pairing frequency estimates

The pair-wise allele comparisons yielded the most likely pairing frequencies as given in Table 5. The precision of these values is relatively low, since the standard deviation for each estimate is close to 6%, while many of these estimates are lower than 10%. However, the total range of values spans more than six standard deviations, meaning that many differences among the pairing frequencies are highly significant. The pij values are comprised between 0 and 39.1%. Null values highlight couples of chromosomes that never pair together. Values higher than the 10% (1/10) expected from random assortment identify couples that tend to pair preferentially. Although this preferential (more frequent than random) pairing is observed often, there is no case of systematic pairing, which would be identified by a value of 100%. The total of the values for a given allele should be 100% if it were involved in a bivalent, irrespective of the specificity of the association. Lower values indicate either pairing with heterologous chromosomes, no pairing (univalent) or pairing in a trivalent. The total ranges from 9.3 to 98.1%, suggesting that some chromosomes are more often involved in pairing with the others. Allele 11 is actually an outlier with its value of 9.3%, the second lowest value being 39.0%. This distinctiveness is congruent with its linkage, with a range of probes involved in a separate HC. On this basis, it is possible to conclude that it is borne by a chromosome that is not part of the HC under analysis, and to exclude it from subsequent data interpretation. This led to the second line of totals in Table 5.

Table 5 Maximum likelihood estimates for the pairing frequencies pij (percent) between chromosomes of HC I as revealed from allele segregation patterns at locus BNL12.06

The totals for one allele range from 39.0 to 95.2%. The mean of the total for the 10 alleles is 75.1% (Table 5); this figure, compared to 100%, characterizes a global deviation from a typical 5 × 2 bivalent meiosis caused by particular cases such as univalents or trivalents.

Specific chromosomal affinities

The differentiation within the population of chromosomes is illustrated by the results of the factor analysis. The first three axes have eigenvalues of 23, 18 and 14%, respectively. The distribution of the chromosomes along these axes is summarized in Figure 3. One chromosome lies apart from the other ones along axis 3. It is I-9, which has a clearly interspecific origin; it is characterized by generally low pairing frequencies with all its counterparts, as illustrated by its total pairing frequency of 39%. Then a group of three chromosomes is isolated. They are chromosomes I-8, I-4 and I-10, which probably have a pure S. officinarum origin. They have a rather low total pairing frequency, with an average of 65%, of which the largest part (42%) represents pairing among the three chromosomes. The other six chromosomes, whose putative origins are S. officinarum (I-5), S. spontaneum (I-6), interspecific (I-7) or unclear (I-1, I-2 and I-3), share a rather high average total pairing frequency (86%) with 74% pairing with each other. Among the six, I-1, I-2 and I-3 display a peculiar pattern: I-1 displays a strong preferential pairing with I-2 (27.4%) and I-3 (39.1%), while there is no pairing (0%) between I-2 and I-3.

Figure 3
figure 3

Distribution of 10 alleles representing 10 interacting chromosomes in planes (1, 2) and (1, 3) of a factor analysis that summarizes the matrix of distances (1−pij), where pij is the frequency of pairing estimated from 282 individuals, as given in Table 5. The species origin of the chromosome segments is indicated.

Chromosome assortment pattern

The simplest typical meiotic picture to represent this distribution is: I-9 tending to form a univalent; I-4, I-8 and I-10 tending to form a trivalent or one bivalent and one univalent; I-1 tending to form a bivalent with I-2 or I-3; and the rest (I-2 or I-3, I-5, I-6 and I-7) forming two bivalents. The translation of various assortment hypotheses into distributions of total allele doses by individual is given in Table 6. The observed distribution is compared to various distributions, corresponding to various meiotic figures that are compatible with the general cytological observations, including: five bivalents (configuration m0); four bivalents and two univalents (m1); three bivalents, one trivalent and one univalent (m2); three bivalents and four univalents (m3). The observed distribution is very close to the m1 and m2 cases (which are equivalent in terms of dosage distribution), although slightly more deviated towards a distribution expected if more univalents occur (m3 case), as illustrated in the last two columns of Table 6.

Table 6 Distribution of the total number of doses found among 282 progeny individuals and comparison with the distribution expected for various meiotic pairing hypotheses (mi)

Discussion

The analysis performed here attempted to describe the mode of chromosome pairing within a given homology class in a sugarcane cultivar. The analytical frame used is complete, since it rests on the survey of (most likely) all alleles at a particular locus, which segregate within a large progeny and whose respective dosages could be determined. The 10 homologues display pairing frequencies ranging from 0 to 40%, and seem differentiated into several subsets:

  • one chromosome (I-9) seems often left as a univalent; it is noteworthy that it has a clear interspecific origin, although there is no proof that this is the cause of its isolation at meiosis;

  • three chromosomes (I-4, I-8, I-10) seem involved in preferential mutual pairing; whether they assemble into a trivalent or a bivalent plus a univalent cannot be determined from the present data; all probably have a pure S. officinarum origin;

  • within a subset of six chromosomes, one chromosome (I-1, of unclear origin) seems to preferentially form a bivalent with one of two particular chromosomes (I-2 and I-3, of unclear origin), while the four chromosomes left form two bivalents; this subset includes chromosomes or chromosomal regions originating from the two ancestral species.

The occurrence of univalents or possibly trivalents does not induce a deficit in transmission. Indeed, the chromosomes involved in these figures exhibit the same transmission rate close to 50% as the others. However, these irregularities induce some variation in the total number of chromosomes transmitted. As a consequence, the distribution of the total dose number per individual is wide but centered around the parental number of 10. There is no trend towards chromosome loss within this particular HC, but it is noteworthy that about 65% of the individuals show either more or fewer chromosomes than the parent.

Besides the variation in pairing frequencies observed here, it is known that R570 contains at least two couples of chromosomes that form systematic pairs (Hoarau et al, 2001). Altogether, this remarkable range of affinities among chromosomes somewhat contradicts the traditional view that polyploids tend to classify into allopolyploids with strict bivalent chromosome formation and thus disomic inheritance vs autopolyploids with all homologous chromosomes having equal opportunities to pair at meiosis, leading to polysomic inheritance (Jackson and Jackson, 1996; Sybenga, 1996; Jenczewski and Alix, 2004). These configurations are two extreme cases of polyploids (Wu et al, 2001). Intermediate polyploid models displaying a combination of both allopolyploid and autopolyploid pairing behavior have been suggested by Stebbins (1950) and reported for several polyploid species (Jackson and Jackson, 1996; Allendorf and Danzmann, 1997; Fjellstrom et al, 2001). The locus under study provides additional evidence towards the existence of intermediate polyploid models. The sugarcane cultivar R570 has an artificial origin, for it is derived from man-made interspecific hybrids a few generations ago. However, similar genotypes of spontaneous origin exist within the ‘Indian and Chinese sugarcanes’, once classified as S. barberi and S. sinense, respectively, which are now known as derived from a limited number of natural hybridization events between S. officinarum and S. spontaneum (D'Hont et al, 2002).

The present study throws light on two other aspects of meiosis in a polyploid genome. First, this system involves locus duplication (if we consider allele 11) and thus a potential bridge between two HC. Such bridges between distinct HC have been found in low frequency in the genome of R570 (Grivet et al, 1996). They may be the result of genome heterogeneity between the two ancestral species, since these have distinct basic chromosome numbers (D'Hont et al, 1998). They may also be representative of genome structural rearrangements that may occur in polyploids, be they derived from translocations or from ectopic movements associated with transposable element activity (Soltis and Soltis, 1999, 2000). Here, the attachment to two distinct HC resulted in separate pairing dynamics at meiosis. Secondly, the pairing pattern among homologous chromosomes clearly shows that homolog juxtaposition is determined by some features of the individual chromosomes. The mutual affinities appear partly, although incompletely determined by the species origin of the chromosomes. With the progress of genetic mapping and allelic resolution along the chromosomes, it will be possible to determine the species origin of the various portions of the chromosomes. When a full picture of these mosaics is available, it will be possible to determine to what extent particular regions of the chromosomes, for example, centromeric or telomeric regions, influence the overall pairing affinities. A generalization of such studies for several HC will give access to a range of situations that will make it possible to test subsequent hypotheses. Therefore, the complex genome of sugarcane will offer material of great interest for basic questions related to the fine mechanisms that govern meiosis.