Introduction

Brassica is the most economically important genus in the Brassicaceae family. Due to substantial progress in breeding and cultivation practice, two major members of the Brassica genus commonly known as rapeseed (Brassica napus L.) and mustards have become one of the worldwide most principal source of vegetable oil (Rani et al. 2013). Especially in several European countries with cool-temperate climates oilseed rape (B. napus) with ‘double-low’ seed quality (Canola) dominates field crop production. Poland and Great Britain have been ranked interchangeably in sixth place in the production of winter oilseed rape (Brassica napus L.) in the world, after China, Canada, India, Germany and France (Ratajczak et al. 2017). In Poland, winter oilseed rape is the most important oil–protein plant. It provides edible vegetable oil for human consumption and proteins for the feed industry, and has recently gained increasing interest as a source of biodiesel (Huang et al. 2016). Traditional breeding, as well as, modern biotechnological tools has led to the improvement of various quantitative and qualitative traits in Brassica. At the same time, modern crop breeding has resulted in the reduction of genetic variability in most of the crop species (Tanksley and Nelson 1996). From our previous work it was found that one of the most effective methods for increasing the variability of traits that impacts the quality of rapeseed products is interspecific hybridization (Niemann et al. 2012). In order to combine desirable genes found in two or more different varieties, planned hybridization between carefully selected parents is practiced within or among species. The plant breeder usually keeps such ideal plants in mind that combines a majority of desirable characteristics. These may be general and/or specific traits that could contribute to the improved adaptation of plants to the environment and could enhance yield with better quality seed. Generally, in Brassica napus breeding, traits related to commercial success are of highest importance (Friedt and Snowdon 2010). For this reason rapeseed breeding strategies are oriented on developnig new cultivars with high oil, protein as well as low erucic acid, glucosinolate and fiber content (Möllers 2004; Abbadi and Leckband 2011; Zou et al. 2016; Körber et al. 2016). For this purpose knowledge of genetic diversity is indispensible in the development of commercial cultivars (Ahmad et al. 2013; Liu et al. 2016).

The content of oil, protein and anti-nutritive substances like glucosinolates and fiber in rapeseeds decide on their utility value (Bell 1984; Vageeshbabu and Chopra 1997; Liu et al. 2012). Brassica napus is planted for oil production and, therefore, an maximization of oil content in the seeds is a major goal in the breeding process (Zhao et al. 2007; Würschum et al. 2012). In general, the oil content in the Brassica seeds varies from 30 to 45% depending on the species, the variety and climatic conditions and is mainly driven by the composition of fatty acid components of erucic acid (C22:1), stearic acid (C18:0), oleic acid (C18:1), linoleic acid (C18:2), and linolenic acid (C18:3) (Velasco and Becker 1998; Möllers 2004). In addition, the oil content of Brassica oilseed meal contains about 40% protein with a well-balanced amino acid composition (Khan et al. 2008), but lower than would be desired. Moreover, plant breeders have strongly reduced the levels of the unhealthy and uneatable glucosinolates in the seeds (< 30 uM/g) to be able to use the protein-rich seed cake as an animal feed supplement (Halkier and Gershenzon 2006). Besides, the presence of high fibre fraction in oilseed rape meal, which is mostly located in seed hull, is the major obstacle for its use in animal feeding. The fibre fraction is poorly digestible and essentially dilutes the available energy and protein. Consequently, oilseed rape meal has less metabolizable energy and reduces the value of the meal relatively to soybean meal. Fibre contents are related to plant cell walls components, namely cellulose, hemicellulose and lignin. The proportion of hemicellulose, cellulose and lignin can be quantified as neutral detergent fibre (NDF), acid detergent fibre (ADF) and acid detergent lignin (ADL). The NDF fraction consists (predominantly) of hemicellulose, cellulose and lignin content, while the ADF fraction comprises cellulose and lignin. ADL represents the non-digestible lignin fraction. To improve the quality of oilseed rape meal, breeding research has been undertaken to select genetic material with lower fibre content.

It is known that all of these seed quality traits are influenced by genetic and environmental factors, as well as agricultural technology (Feng et al. 2012; Ratajczak et al. 2017), so their precise estimation requires phenotyping in replicated multi-environmental field trials. It is important to determine and quantify the extent to which factors like the environment and genotype × environment interaction contribute to variations in each Brassica quality parameter.

The objectives of this research were (1) to determine the relative contributions of G, Y, L and GYL to the variation in seed quality traits of twenty-five Brassica genotypes tested across three locations, (2) to analyze which factor has the most significant effect on tested traits, and (3) to find the best Brassica genotypes based on quality traits.

Materials and methods

Plant material for field trials consisted of 25 winter rapeseed genotypes i.e.: B. napus cv. Californium (open pollinated variety), twenty-three interspecific cross-derived Brassica lines and male sterile line of F8 generation of B. napus (MS8). MS8 line was selected from resynthesized oilseed rape (B. rapa ssp. chinensis × B. oleracea var. gemmifera) using in vitro cultures of isolated embryos in Department of Genetics and Plant Breeding, Poznan University of Life Sciences (PULS) (Table 1). Seeds of B. napus cv. Californium and MS8 line originated from the resources of the Department of Genetics and Plant Breeding of PULS, while seeds of paternal genotypes i.e. Brassica rapa ssp. pekinensis, Brassica rapa ssp. trilocularis, Brassica carinata and Brassica juncea came from the Warwick HRI Genetic Resources Unit. In order to obtain interspecific cross-derived Brassica lines, interspecific hybridizations were carried out in the glasshouse and performed with the application of in vitro culture of isolated embryos according to the method described by Niemann et al. (2012). All interspecific cross-derived lines were sister-pollinated (five plants were enclosed in one paper bag during flowering) for four generations in order to stabilize the fertility. F5–F7 generation of tested lines and control genotypes were selected from the rapeseed breeding program of Department of the Genetics and Plant Breeding of PULS. During seasons 2010/2011, 2011/2012 and 2012/2013 twenty-five Brassica genotypes were grown in three different locations i.e. in the PULS experimental station Dłoń located 100 km south of Poznań and in Złotniki located 15 km west from Poznań as well as in the experimental fields in Poznań-Sołacz. These locations have different types of soil and weather conditions (Tables 2, 3). The field trials at all locations were arranged in a randomized complete block design with three replicates. Each genotype was grown in a 3 row plot of 9.0 m2 with a 0.30 row distance and a sawing density of 60 seeds/m2.

Table 1 Code and origin of the genotypes tested
Table 2 Environments used in the study and their main characteristics
Table 3 Meteorological conditions in Dłoń, Poznań-Sołacz and Złotniki during the vegetation season of winter oilseed rape in 2011, 2012 and 2013

Quality analysis

The seed samples for the analysis of fiber, protein, oil and glucosinolates content were collected from five self-pollinated plants from each tested line. The fiber content in the Brassica lines and control seeds was expressed as the average value of the neutral detergent fiber (NDF) and the acid detergent fiber (ADF) as a subfraction of the NDF. The seed quality traits content was determined by near-infrared reflectance spectroscopy (NIRS) (Velasco et al. 1999). These analyses were performed at the Małyszyn Laboratory that belongs to Plant Breeding Strzelce in triplicate. For NIRS analysis minimum 3 g of intact seeds were used. Every seed probe was scanned on the NIRS Systems monochromator model 6500 (NIR Systems, Inc. Silver Springers, MD, USA).

Statistical analysis

Firstly, the normality of the distributions of the studied traits (neutral detergent fiber—NDF, acid detergent fiber—ADF, as a subfraction of the NDF, protein, oil content and glucosinolates) were tested using Shapiro–Wilk’s normality test (Shapiro and Wilk 1965). Multivariate analysis of variance (MANOVA) was performed on the basis of following model using a procedure MANOVA in GenStat 17th edition: Y = XT + E, where: Y is (n × p)—dimensional matrix of observations, n is number of all observations, p is number of traits (p = 5), X is (n × k)—dimensional matrix of design, k is number of genotypes (k = 25), T is (k × p)—dimensional matrix of unknown effects, E—is (n × p)—dimensional matrix of residuals. Nextly, three-way analysis of variance (ANOVA) was carried out to determine the effects of genotypes, years and locations and the interactions: genotypes × years, genotypes × locations, years × locations and genotypes × years × locations—on the variability of studied traits. The minimal and maximal values of traits as well as arithmetical means and standard deviations were calculated. Moreover, the Fisher’s least significant differences (LSDs) were also estimated at the significance level α = 0.05. The relationships between observed traits were assessed on the basis of Pearson’s correlation coefficients using FCORRELATION procedure in GenStat 17th edition for each environment independent. Results were also analysed using multivariate methods. The canonical variate analysis was applied in order to present multitrait assessment of similarity of tested genotypes in a lower number of dimensions with the least possible loss of information (Rencher 1992). This makes it possible to illustrate variation in genotypes in terms of all observed traits in the graphic form. Mahalanobis’ distance was suggested as a measure of “polytrait” genotypes similarity (Seidler-Łożykowska and Bocianowski 2012), whose significance was verified by means of critical value Dα called “the least significant distance” (Mahalanobis 1936). Mahalanobis’ distances were calculated for genotypes in each environment independent as well as for all environments data jointly. All the analyses were conducted using the GenStat v. 17 statistical software package (Payne et al. 2012).

Results

All studied traits have a normal distribution as well as a multivariate normality. Results of MANOVA indicate that the genotypes (Wilk’s λ = 0.0448; F24,3148 = 113.51; P < 0.0001), years (Wilk’s λ = 0.7618; F2,3148 = 91.63; P < 0.0001) and locations (Wilk’s λ = 0.1305; F2,3148 = 1112; P < 0.0001) were significant different for all five traits. Moreover, interactions: genotype × years (Wilk’s λ = 0.7047; F48,3148 = 4.75; P < 0.0001), genotype × location (Wilk’s λ = 0.1480; F48,3148 = 30.52; P < 0.0001), years × locations (Wilk’s λ = 0.6796; F4,3148 = 64.39; P < 0.0001) and genotype × years × locations (Wilk’s λ = 0.2344; F96,3148 = 11.03; P < 0.0001) were significant. Analysis of variance indicated that the main effects of genotypes, years and locations as well as all interactions were significat for all the traits of study (except year × location interaction for glucosinolates) (Table 4). However, the obtained data showed that the most significant effect on oil, protein and fiber content in the Brassica seeds had environment i.e. location, while genotype had the biggest impact on glucosinolates content.

Table 4 Mean squares from three-way analysis of variance for five observed traits

Minimal, maximal and mean values as well as standard deviations for NDF, ADF, protein, oil and glucosinolates content were presented in Tables 5, 6, 7, 8 and 9, respectively.

Table 5 Neutral detergent fiber content (%) in twenty-five Brassica genotypes tested in nine environments
Table 6 Acid detergent fiber content (%) in twenty-five Brassica genotypes tested in nine environments
Table 7 Protein content (%) in twenty-five Brassica genotypes tested in nine environments
Table 8 Oil content (%) in twenty-five Brassica genotypes tested in nine environments
Table 9 Glucosinolate content (umol/g) in twenty-five Brassica genotypes tested in nine environments

Generally, all tested cross-derived Brassica lines displayed a great increase in the range of variability for the studied quality traits when compared to the control genotypes. Moreover, the highest mean values of fiber, oil and glucosinolates were observed in Dłoń, although protein content in tested lines in total was slightly higher in Złotniki, where mainly the lowest values of all studied traits were noticed (Tables 5, 6, 7, 8, 9). Both, the highest and the lowest NDF mean values were observed in lines harvested in Dłoń (29.89 and 18.61% respectively), however the differences occured between tested genotypes i.e. in MS8 × B. rapa ssp. trilocularis and MS8 × B. rapa ssp. pekinensis adequately (Table 5). Similarly, the highest ADF mean values were observed in the lines harvested in Dłoń from MS8 × B. rapa ssp. pekinensis seeds (26.68%) but the lowest content was noticed in Złotniki from MS8 × B. juncea—17.1% (Table 6). The highest protein mean value was observed in Złotniki (23.80%), while the lowest in Poznań-Sołacz (16.65%) in MS8 × B. rapa ssp. pekinensis lines in 2012 (lines number 3 and 11, respectively) (Table 7). The highest oil mean value was observed in Dłoń in line number 16 (46.96%), ranged from 44.52 to 49.89%, but the minimum value was noticed in Złotniki (35.35%) in line number 17 (Table 8). The minimal glucosinolates content was observed in Złotniki in MS8 × B. rapa lines and ranged from 6.1 to 69.37 umol/g seeds (Table 9).

NDF was positive correlated with ADF in all nine environments, except in Dłoń 2012 (Table 10). Protein content was negative correlated with NDF (except Dłoń 2012 and Złotniki 2012) and ADF (except Dłoń 2012). Oil content was positive correlated with NDF and ADF in six environments; and negative correlated with protein content in eight environment (only in Złotniki 2012 these two traits were not correlated). Glucosinolates content and NDF were negative correlated in all nine environments. Glucosinolates content was positive correlated with protein only in Sołacz (all 3 years) and negative correlated with ADF in Dłoń 2011 and Sołacz—all 3 years of study. Glucosinolates and oil content were negative correlate only in two environments—Sołacz 2013 and Złotniki 2012 (Table 10).

Table 10 Correlation coefficients between observed traits calculated on the basis of genotypes’ means in environments independent

Particular traits have a different degree of significance and a different extent of contribution to the overall multi-trait variability. The multi-trait variability analysis includes identification of the most important traits in the objects’ multi-trait variability. A statistical tool which helps solve this problem is canonical variable analysis. The results of the analysis for analyzed objects are presented in Table 11. The multidimensional analysis of the tested traits compared 25 genotypes in respect of five traits simultaneously in all nine environments data observations (Fig. 1). The first and second canonical varieties elucidated 87.98% and 5.81%, respectively, of multivariate variability of genotypes (Fig. 1). Figure 1 shows the variability of analyzed traits in the configuration of the first two canonical variables. On the graph the coordinates of the point for particular genotypes are values of the first and second canonical variables, respectively. The most significant linear relation with the first canonical variable was observed for: NDF, ADF, oil content (positive correlation), protein and glucosinolates (negative correlations) (Table 11). The largest differences with regard to all five traits (measured in Mahalanobis distances) were recorded for genotypes 6 and 11 (the Mahalanobis distance between them was 7.62). The largest similarities were recorded for 15 and 25 (1.50) (Table 12).

Table 11 Correlation coefficients between the first two principal components and the observed traits
Fig. 1
figure 1

Distribution of 25 Brassica genotypes in the space of two first canonical variables (Genotypes code see in Table 1)

Table 12 Mahalanobis distances between studied genotypes estimated on the basis of data form all environments

Mahalanobis distances between all pairs of genotypes across nine environments were presented in Table 13. The smallest average value of Mahalanobis distance from nine environmental we observed for genotypes 16 and 24 (1.61). However, the largest for lines 6 and 11 (11.71). The most stable pair (the smallest range of Mahalanobis distance) was 16 and 24 (1.96). However, the least stable pair (range equal to 23.93) was 6 and 11 (Table 13).

Table 13 Mahalanobis distances between genotypes across environments

For Mahalanobis distances values in particular environments we observed large range in Sołacz 2011 with the largest mean value (Fig. 2). Generally, in 2011 Mahalanobis distance values were larger than in 2012 and 2013. In 2013 these values were the smallest (Fig. 2).

Fig. 2
figure 2

Box-and-whisker diagram of the values of Mahalanobis distances, classified by the environments

Discussion

Understanding the genetic basis of seed quality traits and environmental factors that influence their expression is important for efficient rapeseed breeding (Jiang et al. 2014). To utilize genetic diversity present in interspecific cross-derived Brassica lines originated from Department of Genetics and Plant Breeding collection of PULS and improve this genotypes itself, it was crucial to evaluate the extent of genetic diversity and to determine which of the tested factors i.e. genotype, year or location has the most significant effect on studied quality traits. Our results provide evidence for large phenotypic variability within the studied Brassica lines for oil, protein, glucosinolates and fiber (ADF and NDF) levels in seeds. The results of the present study are supported by the findings of Marjenovic-Jeromela et al. (2011) and Abideen et al. (2013) who recorded significant variability in oil content and other quality traits in oilseed brassicas. Moreover, our results showed that all tested quality traits i.e. fiber, protein, oil and glucosinolate were significantly affected by environments. For example, a higher oil, fiber and glucosinolates content were observed in Dłoń than in Złotniki and Poznań-Sołacz. On the other hand protein content was in analyzed Brassica lines slightly higher in Złotniki than in other locations. Our results are in agreement with that of Si et al. (2003) who reported that environment had a much larger impact than genotype on protein content in canola seed, and its genetic variance and genotype × year × location interaction were significant. Similarly, Wu et al. (2005) claimed that since genotype × environment (GE) interaction effects can cause genotype variation in seed quality traits across environments, the estimation of genetic main effect and GE interaction effect can demonstrate the degree of environmental influence on the genotypes and be useful for increasing selection efficiency for quality improvement.

In our study analysis of variance indicated that all interactions were also significant for all the traits of study (except year × location interaction for glucosinolates). Because of the fact that quality parameters are affected by genetic and environment, agronomic conditions, such as soil state, can affect seed quality parameters. In our experiment soil types and weather conditions differs among all locations. The results of field trials conducted in three tested locations (Dłoń, Złotniki, Poznań-Sołacz) confirmed the impact of soils and environmental conditions, especially water regime on particular seeds compound levels. Both, soils rich in fine clay particles, called ‘heavy soils’, which was available in Dłoń and luvisols, like those in Złotniki contains minerals and organic fractions sufficient to support cropping. This types of soils are used to be potentially very fertile when treated in the right way. On the other hand, in Poznań-Sołacz there are light, sandy soils. Moreover, the highest cumulative precipitation between three tested locations over a 3 year period were observed similarly like better soils in Dłoń and Złotniki. Thoroughly these conditions could be responsible for a higher oil, fiber and glucosinolates contents, observed in Dłoń than in Złotniki and Poznań-Sołacz.

It has been shown that canola protein content is affected by soil nitrogen level; higher nitrogen levels can lead to higher protein and glucosinolate contents and therefore lower oil contents (Aminpanah 2013). Similarly, high sulfur levels in soil, due to fertilizer, are known to increase the glucosinolate and the protein contents of canola seeds.

Furthermore, in canola, it is known that oil and protein contents follow an inverese relationship—the higher the oil and the lower the protein content. Seed protein content like other seed quality traits of rapeseed is a quantitative trait governed by additive gene action, so while many of the QTL for increased oil concentration are linked in coupling phase with QTL for decreased protein concentration (Gül et al. 2003), there are also many independent QTL for oil and protein concentration (Zhao et al. 2006). Therefore, in most cases, protein concentration will decrease when oil concentration is increased; however, there may also be cases where protein concentration is nearly maintained when oil concentration is increased. This literature data are confirmed by our results, since higher oil contents were observed in Dłoń compared to Złotniki, lower protein contents were expected in Dłoń when compared to Złotniki respectively. Besides, detailed analysis of these quality traits showed that in eight environments oil content was negative correlated with protein content and only in Złotniki 2012 these two traits were not correlated. Our results were consistent with those obtained by Khan et al. (2008) and Marjenovic-Jeromela et al. (2011), who reported significant and negative correlation between this two traits. However, this results were contrary to data obtained by Aytaç and Kinaci (2009), because they found significant and positive correlation between protein and oil content in Brassica oilseed.

Significant and positive association has been reported between protein content and glucosinolates (Ahmad et al. 2013). Our results exhibited that glucosinolates content was positive correlated with protein only in Sołacz (all 3 years). The differences between the results of the present study and those in previous studies could be due to the differences in the material and environmental conditions in which the studies were conducted.

According to Dimov et al. (2012) negative correlations between protein content of the meal and fibre content found in all Brassica DH populations is expected because selection for low fibre content will lead to an increase of protein content in the meal. Our results proved that protein content was exactly negative correlated almost in all analyzed environments with NDF (except Dłoń 2012 and Złotniki 2012) and ADF (except Dłoń 2012).

In the present study, it was found that particular traits have a different degree of significance and a different extent of contribution to overall multi-trait variability. The multidimentional analysis showed that the first and second canonical varieties elucidated 87.98% and 5.81% respectively of multivariate variability of genotypes. NDF, ADF and oil content showed a significant positive correlation with the first canonical variable. Quite opposite observations were made for protein and glucosinolate content, which were negative correlated. Our results are in contrary to Ahmad et al. (2013), who reported that all tested quality traits were positively correlated with oil content and with one another at both genotypic and phenotypic levels.

In our experiment the largest differences with regard to five tested traits occured between lines 6 and 11, which exhibited different origin (B.napus × B. rapa ssp. pekinensis and B.napus × B. rapa ssp. trilocularis cv. Yellow Sarson respectively. However, the largest similarities were recorded for lines 15 and 25 (B.napus MS8 line × B. rapa ssp. pekinensis and B. napus line MS8 respectively).

Moreover, Brassica lines numbered as 14, 16, 21 and 22 showed superiority for oil content, whereas genotype 11 exhibited superiority for minimum glucosinolate content. These genotypes may further be utilized in breeding programmes for evolving rapeseed varieties having high oil content and with superior seeds quality.

Taking into account the effect of genotype and environment on all tested seed quality traits in interspecific cross-derived Brassica lines, it was demonstrated that genotype had significant impact on glucosinolate content while oil, protein and fiber contents were influenced mostly by environment.

As an extension of our research hereafter genome-wide association studies should be used for the investigation of quantitative trait loci (QTL) linked to seed quality traits in winter oilseed rape (Brassica napus L.) genotypes. Genome-wide association analyses would enable identification of highly promising candidate genes and markers for breeding towards improved quality traits in rapeseed.