Abstract

This study presents soil system budgets of N, P and K in six contrasting cropping systems during 10 years of a long-term experiment in southeast Norway. The experiment included systems with arable cash-cropping and with mixed arable-dairy cropping (cash- and fodder crops), with organic and conventional management represented in both groups. All major nutrient inputs and outputs were measured or estimated. State of the art conventional cash-cropping appeared to be balanced in terms of N, whereas conventional mixed cropping had an N surplus. By contrast, less up to date conventional arable cash-cropping and all the organic systems showed indications of soil organic N depletion (negative N budgets). All the organic systems showed that mining of the soil P and K content occurs, whereas the conventional systems all had P and K surpluses. The results corresponded well with measured differences between systems in terms of ignition loss, P-AL, K-AL and K-HNO3 measured in 2009. This study shows that a fertile soil may be exposed to substantial mining of N, P and K over many years before it is detectable by traditional analyses, and that field nutrient budgeting is a feasible, but data-demanding, approach to detect such misbalances at an early stage.

1. Introduction

In 1989, a large cropping system experiment, facilitated for measurements of runoff and leaching, was established at Apelsvoll in southeast Norway. Over the years, this experiment has provided data for many studies covering a range of different topics, including yields and yield quality (e.g., [1]), nutrient leaching and runoff losses (e.g., [2]), economic aspects (e.g., [3]), soil microbiology (e.g., [4]), soil physical and chemical properties (e.g., [5]), and the relation between food production and N losses [6].

Some major adjustments of the experimental design were made in 2000 [6]. In this overview, a synthesis of the results obtained after these changes are given for the major nutrient flows of N, P, and K, with focus on changes in topsoil nutrient pools, as affected by misbalances between nutrient inputs and outputs at field level.

Numerous long-term experiments have shown that crop rotation and management affect soil fertility (e.g., [713]). However, considerable time is needed before identifiable changes in soil fertility emerge [14]. Nutrient budgets have been used widely in a range of farming systems to assess long-term sustainability (e.g., [15]), thus, supplementing soil measurements. In a discussion of uncertainties in nutrient budgets, Oenema et al. [16] distinguished between farm gate soil surface and soil system budgets. The latter accounts for nutrient inputs and outputs, recycling of nutrients within the system, nutrient loss pathways, and changes in soil nutrient pools. Soil system budgets were considered to possess the highest uncertainty of the three budgeting approaches, since nutrient losses via leaching, runoff, volatilization, and denitrification are classified as the most uncertain nutrient flows [17]. De Vries et al. [18], when estimating uncertainties in the soil system N budget of The Netherlands, reported that leaching to ground water and leaching to surface water had the highest relative uncertainty (coefficient of variation). Acquiring quality data on nutrient drainage and runoff may, therefore, considerably reduce the uncertainties of the soil system budget approach.

The present study is aimed at comparing the effects of management (i.e., organic versus nonorganic) and type of production (i.e., arable cash cropping versus mixed dairy farming) on their long-term sustainability in terms of plant nutrition, by a combination of soil system nutrient budgeting and soil measurements. Results on drainage discharge and water-borne nutrient losses will be presented in more detail elsewhere [19].

2. Material and Methods

2.1. Experimental Site and Treatments

In 1989, a 3.2 ha large experiment with pipe-drained plots was established at Apelsvoll Research Centre in central southeast Norway (60°42′ N, 10°51′ E, altitude 250 m). The climate of the region is humid continental with a mean annual precipitation of 600 mm and a mean annual temperature of 3.6°, and 12.0°C in the growing season (May to September). On the experimental area, which slopes 2–8% northeast, deforestation was performed in 1935, whereafter it was used as pasture until 1975. During the following years, up to the establishment of the experiment in 1988, the field was cropped with a rotation including 10% root crops, 40% cereals, and 50% ley, using an average of 10 tonnes cattle slurry ha−1 yr−1 plus regular amounts of inorganic fertilizer. The first year after draining the experimental site (1989), the area was cropped with barley (Hordeum distichum L.). The major soil group of the experimental area is classified as Endostagnic Cambisol [20], with dominantly loam and silty sand textures. More detailed soil characteristics have been presented by Riley and Eltun [21], partly shown in Table 1.

The experimental site comprises 12 blocks (30 × 60 m), separated by 7.5 m grass border zones (Figure 1). In each block, surface runoff is collected at the lower end and led to a sedimentation tank, and the blocks are separately drained with PVC pipes at a depth of 1 m with 7.5 m spacing. Surface runoff from the sedimentation tank and drainage water is transported in sealed plastic pipes to measuring stations equipped for discharge measurements (tipping buckets) and volume proportional sampling.

Using a randomised complete block design, six cropping systems, each with 2 replicates, were established on the twelve blocks in 1989. The first ten years (1989–1999) the experiment comprised three arable systems (conventional and integrated arable cropping without farmyard manure and organic cropping with some farmyard manure) and three mixed dairy systems (conventional, integrated, and organic production of both arable and forage crops, all with farmyard manure). Each block comprised eight 7.5 × 30 m plots, on which all of the arable and/or fodder crops in the rotation were grown each year.

Some major adjustments of the experimental design were made in 2000. The number of rotation plots was reduced from eight to four by merging pairs of neighbouring plots, thus, reducing rotation length from eight to four years, but still with each crop present every year. A new organic mixed dairy system was introduced instead of the integrated mixed dairy system, and some smaller changes were made in the management of the other systems. The six adjusted cropping systems are described briefly below (see Table 2 for details).

CA1
Conventional arable cropping, managed as was common for the region in 1985 (tillage and fertilization as in 1985, but for practical reasons, present-day inputs of seeds and chemical plant protection). The year 1985 was selected, since the North-Sea Agreement (1987) used this year as the base for its planned 50% reduction in nutrient leaching to the North Sea within a 10-year period. Before this date, less attention was paid to nonpoint source losses of nutrients attributed to farming activities, and this cropping system is, thus, used as a reference.

CA2
Conventional arable cropping, using currently available knowledge in order to minimize the ratio of N lost by leaching and runoff to production. This optimisation involves the use of catch crops, split application of fertilizer, and reduced noninversion tillage.

OA
Organic arable cropping without cattle slurry, but with 25% of the area used for green manure (grass clover ley).

CM
Conventional mixed dairy farming, optimised similarly to CA2, but with spring ploughing, 50% of the area as grass clover ley and the use of slurry (amounts calculated from the theoretical number of cows sustained, see Section 2.3 for details).

OM1
Organic mixed dairy farming with 50% of the area as grass clover ley and slurry use (amounts calculated as in CM).

OM2
Organic mixed dairy farming with 75% of the area as grass clover ley and slurry use (amounts calculated as in CM).

The results in 2000 were partly influenced by the previous management. Therefore, this paper deals with the results from the decade May 2001–April 2011.

2.2. Measurements

Within each plot, dry matter (DM) yields of grass, grain, straw (when removed), and potato tubers were measured in quadruplicate (subplot size 1.5 × 6 m). Straw was removed from all the cereal plots of CA1 and from the plots with barley undersown with grass clover ley in OA, CM, OM1, and OM2. Crude protein contents of the cereals were measured by near infrared reflectometry (INFRA 250, Technicon, USA). Potato tuber size distribution and quality parameters were determined according to standard procedures. The proportion of legumes in the grass clover ley was determined visually before harvest.

From 2006 onwards, plant samples of all harvested crops (0.2 g DM) were digested in a mixture of sulfuric acid and hydrogen peroxide and analysed for N and P colorimetrically with an autoanalyser (Skalar 5100, Actlabs, Canada) and for K by flame photometry (Corning 400, Sherwood Scientific Ltd., UK).

Cattle slurry was sampled 1-2 weeks before application and analysed for total-N using the Kjeldahl method. Ammonium-N and nitrate-N were extracted with 2 M KCl and determined colorimetrically with an autoanalyser (Traacs, Bran and Luebbe, Germany).

The water samples (drainage water and surface runoff) were analysed on a monthly basis for total N, ammonium-N, nitrate-N, total P, phosphate P, and total K and determined spectrophotometrically (DR2800 spectrophotometer, Hach Lange, Germany). Potassium was first included in May 2009.

Soil samples have been taken every 3–5 years since 1989. Due to differences in sampling depths, sampling locations, and parameters analysed, only selected samples are comparable. In this study results are shown for topsoil samples (0–25 cm depth) taken in 1996, 1999, 2003, and 2009. The samples taken in 1999 and 2009 were analysed for ignition loss (at 550°C), whereas samples taken in 1996, 2003, and 2009 were used to quantify plant available P and K and acid soluble K. Plant available P (P-AL) and K (K-AL) was extracted by a mixture of acetic acid and ammonium lactate, according to Egnér et al. [22], whereas acid soluble K (K-HNO3) was extracted by boiling in 1 M HNO3. The P and K concentrations in the extracts were analysed by inductively coupled plasma (ICP) techniques (SPECTRO GENESIS, Analytical Instruments GmbH, Germany).

2.3. Calculations and Estimates

Dry and wet atmospheric N deposition was set to 2.7 and 7.2 kg N ha−1 yr−1, respectively, and wet atmospheric K deposition was set to 2.1 kg ha−1 yr−1, based on measurements at the nearest monitoring station, Hurdal [23], about 50 km south of Apelsvoll. Dry depositions of K and wet and dry P depositions were assumed to be negligible.

Symbiotic N fixation was estimated in accordance with Korsaeth and Eltun [2]. Nonsymbiotic N fixation was considered to be negligible. Nutrient inputs with seeds were estimated using measured N, P, and K contents of harvested grain and potatoes from the CON-A system. Literature values were selected for legumes and grasses.

The amounts of N in cattle slurry applied were calculated as the sum of N in harvested forage (grass clover ley) and feed concentrates (purchased and/or produced on the farm), minus the estimated N losses occurring from harvest until slurry application (forage losses, gaseous losses from the cow shed, and during slurry storage) and N exports via milk and livestock, as described in detail by Korsaeth [6]. The number of cows which each farming system could sustain was calculated from the average total available feed in the system during the previous three years (sliding mean) and the feed requirement for milk production, maintenance, activity and replacement. On the conventional farm, it was assumed that purchased cereal-based feed concentrates corresponded to 25% of the total fodder units available (feed concentrates plus forage grass). The organic systems were assumed to be completely self-sufficient, using on-farm produced barley as feed concentrates, for at most 20% of the total feed requirement. Barley not used as concentrates was assumed to be sold.

The yearly average cereal yields of OA were reduced by 25% to correct for the area used for green manure production. In 2007, the mixture of oat and peas in OA was replaced in the rotation by faba bean (Vicia faba L.). These beans were totally damaged by chocolate spot (Ascochyta blight disease) and not harvested. All OA data from 2007 were, thus, excluded from further analyses.

Nutrient concentrations (N, P, and K) in harvested products (crops and straw) for the years 2001–2005 were set equal to the averages (separately for each cropping system) for the years 2006–2010. Removal of N, P, and K was calculated as the measured dry weight of products removed from the field multiplied by the estimated nutrient concentrations. Amounts of N, P and K in harvested grass were reduced by 10% to correct for likely losses under practical harvest.

Gaseous N-emissions (N2O-N, -N, and NH3-N) were estimated from the IPCC framework [24], which comprises estimates for both direct and indirect emissions. Nitrogen sources included in the direct estimates used here were mineral N fertilizer, applied N in slurry, and N in above-ground and below-ground crop residues. Net N mineralization associated with possible loss of SOM resulting from contrasting management was not considered. The volatilization of NH3-N (and -N) is in the IPCC framework [24] related to the input of mineral fertilizer and organic N additions, not including crop residues. This implies that the mulched grass clover in OA would have zero emissions of NH3-N using the IPCC-approach, which is very unrealistic (e.g., [25]). Hence, the volatilization of NH3-N from this crop was calculated by means of a separate method [6].

Nutrient runoff occurring during each agrohydrological year, lasting from 1 May to 30 April, was attributed to the cropping season within that period. Calculations of N, P, and K transported via surface runoff and drainage water were based on measured nutrient concentrations and volumes of surface and drainage water. Organic N was calculated as the difference between total N and the sum of ammonium-N and nitrate-N. Potassium runoff occurring during the agrohydrological years 2001–2008 were set equal to the measured average K runoff for the agrohydrological years 2009/10 and 2010/11.

Nutrient soil system budgets were calculated separately for each system by considering all major flows of N, P, and K, respectively, with the above-ground crops representing the upper boundary and the drain pipes the lower boundary. A positive soil system budget, that is, where the inputs exceeded the outputs, was taken as an indication of nutrient accumulation, whereas a negative budget was taken as an indication of soil mining of the nutrient in question.

2.4. Statistics

Analyses of variance (ANOVA) were performed on yields and nutrient concentrations, using a split-plot model with cropping system as main plot and year as subplot. Grass-clover ley yields were analysed as the sum of two cuts, whereas differences in nutrient concentrations were analysed for each cut separately. Paired comparisons (LSD) were performed [26]. Comparisons of soil chemical properties measured on different occasions were conducted using the paired Student’s t-test. In all tests, significance was assumed at P levels < 0.05. Mean data are presented with their standard errors (s.e.).

3. Results

3.1. Yields

There were significant yield differences between cereal crops within each group of cropping systems (Table 3). The conventional arable systems (CA1 and CA2) gave the largest overall cereal yields; the conventional mixed dairy system (CM) was intermediate, whereas the organic systems gave the lowest yields. The organic arable system had the lowest (area corrected) yields overall, achieving only 40 and 44%, respectively, of the barley and wheat harvested in the arable conventional systems. The mixture of oats and peas in OA compared slightly more favourably, but still with only 47% of the yield level obtained with monocropped oats in CA1 and CA2 (Table 3).

The total fresh weight yield of potatoes was  Mg ha−1, and there was no significant difference in yield (Table 3), size distribution, or selected quality parameters between the two cropping systems with potatoes in the rotation (CA1 and CA2, data not shown). The DM content was kg DM kg fresh weight−1, and about 93% (weight basis) of the potato tubers were saleable.

The conventional mixed dairy system (CM) had significantly larger grass clover ley yields than the organic systems, both for the 1st and the 2nd ley years (Table 4). There was no significant yield difference between the organic systems. Their total production in the two first ley years was 86% of that obtained conventionally.

The annual DM yield of the systems averaged over crops appeared to follow the same pattern between years within each production group (Figure 2).

3.2. Nutrient Concentrations of the Harvested Crops

The only differences in nutrient concentrations among the cash crops were for barley N and K (Table 3). The barley N concentration was highest in CA1, followed by CA2 and OM2 whereas OA had the lowest N concentration. The differences were smaller for K, with highest concentration in barley from OM2, and lowest in barley from CA2 and CM.

The N concentration in herbage (grass clover ley) differed significantly between systems at the 2nd cut in both ley year 1 and 2 (ley year 3 was not comparable between systems), with lower concentration in CM than in the two organic systems, which had similar concentrations (Table 4). For the concentrations of K, the tendency was opposite, at least at the first cut. Organically cropped grass clover ley had significantly lower K concentration than that of CM at the first cut of ley year 2.

3.3. Soil System Nutrient Budgets
3.3.1. Nitrogen

The N input was in the range of 60–112% of the N output (Table 5). The arable system CA1 and all the organic systems had negative soil system N budgets, indicating depletion of the soil organic N content. Over the 10 years, the reductions amounted to 280, 319, 225, and 114 kg N ha−1 for CA1, OA, OM1, and OM2, respectively. By contrast, the budget of CA2 appeared to be balanced, whereas CM had an N surplus amounting to 198 kg N ha−1.

The annual soil system N budgets (and those for P and K) were rather consistent over years for the arable systems, whereas the annual budgets of the mixed dairy systems appeared to be more positive in 2001 and 2002 than during the rest of the decade (Figure 2).

The amount of N in harvested cereals and potatoes of CA1 and CA2 corresponded to 83 and 84% of that applied, respectively (Table 5). The proportion was somewhat lower in the mixed dairy system CM (73%), whereas in the organic systems OM1 and OM2 the N removal exceeded that applied by 54 and 86%, respectively.

When comparing the sum of N lost via drainage and runoff with that applied (in fertilizer and/or cattle slurry), CA1 had the largest quotient within the arable cropping system group, and, correspondingly, OM1 had the largest quotient within the group of mixed dairy systems (Table 5).

The arable systems had the largest losses of N (via drainage and runoff) per unit of harvested N (loss-to-harvest ratio), with OA having the overall largest loss-to-harvest ratio (Table 5). The water based N losses from this system corresponded to 67% of the N in harvested products. By contrast, CA2 lost only amounts corresponding to 27% of the harvested N. The differences were much smaller within the group of mixed dairy systems, with N losses ranging from 13–24% of the harvested N.

3.3.2. Phosphorus

The P input was in the range of 8–156% of the P output (Table 6). The P budgets differed markedly between organic and conventionally managed systems. All three organic systems showed depletions of the soil P content amounting to 82, 100, and 98 kg P ha−1 for OA, OM1, and OM2, respectively. The conventional systems all had a calculated P surplus, particularly CA1 and CA2, which appeared to accumulate P in the same order of magnitude as the P reductions in the organic systems.

The P removal at harvest amounted to 70 and 72% of that applied in CA1 and CA2, respectively (Table 6). This percentage was somewhat larger for CM (82%), and very high for the organic systems OM1 (217%) and OM2 (182%), indicating soil P mining in these systems.

As the system differences in terms of P losses in drainage and runoff were not statistically significant, quotients between these losses and applied or harvested P were not calculated.

3.3.3. Potassium

The K input was in the range of 23–200% of the K output (Table 7). The pattern of the K budgets was similar to that for P; all the organic systems had calculated K deficits, whereas all the conventional systems appeared to accumulate K.

The amount of harvested K corresponded to 53% of that applied in CA1 and CA2 (Table 7). This proportion was somewhat higher in the mixed dairy system CM (66%). In the organic systems OM1 and OM2 the K removal at harvest was only slightly larger than the amount of K applied.

The sum of K lost via drainage and runoff was 7% and 5% of that applied in CA1 and CA2, respectively (Table 7). The corresponding percentages for the mixed dairy systems were less, ranging from 2–4%.

The loss-to-harvest ratios for K followed a similar pattern as for N, but at a lower level (Table 7). The arable systems had the largest ratios, with the highest value calculated for OA. The differences were much smaller within the group of mixed dairy systems, with N losses ranging from 2–4% of the harvested K.

3.4. Changes in Topsoil Nutrient Content
3.4.1. Nitrogen

The ignition loss did not differ significantly between 1999 and 2009, although the measurements appeared to be at a lower level in 2009 (Table 8, Figure 3(a)). The tendency of reduction was particularly strong for CA1 ( ) and OM2 ( ). Although the measured changes in ignition loss over time did not support the calculated N budgets substantially, the differences between systems in terms of ignition loss measured in 2009 reflected the N budgets very well (Figure 4(a)). A ranking of the systems in terms of measured ignition loss levels in 2009 was almost identical with a ranking based on the calculated N budgets.

3.4.2. Phosphorus

P-AL changed significantly in four of the six systems during the period 1996–2009 (Table 8, Figure 3(b)). The measurements in 2009 showed the same pattern of differences between the systems as did those in 2003, but with a greater magnitude. The measured differences in P-AL reflected the calculated P budgets well, although the measured declines in OM1 and OM2 were not significant (Table 8). There was a strong linear relationship between P-AL measured in 2009 and the calculated P budgets of the systems (Figure 4(b)).

3.4.3. Potassium

K-AL followed a pattern similar to that of P-AL over the period 1996–2009, with increasing differences between systems over time (Figure 3(c)). The only significant differences between the measurements in 1996 and 2009 were found in CA2 and CM, which both showed increased levels of K-AL in 2009 (Table 8). The calculated K-deficits in the organic systems, and the K-surplus of CA1, could, thus, not be supported by the measured differences in K-AL over time. The differences between systems in terms of K-AL measured in 2009 corresponded, however, very well with the calculated K budgets (Figure 4(c)).

There was a significant increase in the topsoil content of K-HNO3 from 1996 to 2009 in all systems (Table 8, Figure 3(d)). The rate of change decreased in the order CA2>CM> CA1>OM1>OM2>OA, a ranking which corresponded well with a ranking of the calculated K budgets. The relation between the soil measurements in 2009 and the calculated K budgets was even stronger for K-HNO3 than it was for K-AL (Figure 4(d)).

4. Discussion

4.1. Yields

Both cereal yield levels and relative yield differences between systems followed the same pattern for the whole period 2001–2010 as for the years 2001–2004, which were discussed by Korsaeth [6]. Briefly, the yield differences were larger amongst the arable systems than amongst the mixed dairy systems. The low yields in OA may be explained by P and K deficits, as indicated by the negative P and K budgets and the significant reduction in plant available topsoil P, and due to the lack of chemical plant protection. More foliar diseases were generally observed on cereals in the organic systems than in the other systems (data not shown).

The lack of significant differences in measured potato parameters between the two cropping systems (CA1 and CA2) is partly in accordance with the findings of Riley and Ekeberg [27], who compared spring and autumn ploughing at different depths (10, 20, and 30 cm) with tine harrowing only in spring, on the same soil type at a nearby location. They found the same potato fresh weight yields in all treatments, but the tuber dry matter concentration was significantly lower in potatoes grown without ploughing.

Among the mixed dairy systems, the organically grown cereal yields were also lower than those grown conventionally. However, due to much smaller differences in nutrient regime, the differences between mixed dairy systems were less than those between arable systems. The most likely reasons for inferior yields in the organic mixed dairy systems relative to the conventional mixed dairy system were, as for the arable systems, suboptimal nutrition and a lack of plant protection.

The yield pattern of grass clover ley was also unchanged in 2005–10 compared with the first four years of the decade [6]. The high yields of the organic leys may be explained partly by their N fixation, which was estimated to be much greater in the organic leys than in the conventional leys. This was a result of the significantly higher proportions of clover in the organic leys and less suppression of N fixation by the use of inorganic N fertilizer. The reduced grass clover yields in the 3rd ley year of OM2 may be due to the reduced proportion of clover compared with the first two ley years.

The long period of active nutrient uptake by grass and clover may also partly explain the relatively high organic ley yields. A longer uptake period increases the utilization of less readily available nutrients (e.g., nutrients in organic form), since the mineralization of such nutrients occurs throughout the cropping season. Smaller yield differences between organic and conventional cropping for grass clover ley than for cereals have been reported previously for this experiment [28].

In a review of a number of Swedish field studies, Bergström et al. [29] reported that crop yields in organic rotations were reduced by 20 to 80%, compared with the same crops in conventional rotations. These authors explained this in terms of higher N deficiency, more weed competition, and greater infestation of crop diseases in the organic systems.

4.2. Soil System Nutrient Budgets and Nutrient Concentrations of Topsoil and Crops
4.2.1. Nitrogen

The large calculated deficits found for the arable systems CA1 and OA in the first part of the decade [6] were sustained. The suggested net soil N depletion corresponded to a relative decay rate of the topsoil (0–25 cm) N content of 0.4% yr−1. This corresponds well with Riley and Bakkegard [30], who compared soil samples taken in 1991 and in 2001 from 291 arable fields located throughout southeast Norway. They found that the percentage relative decline rate of SOM was approximately one tenth of the initial percentage of organic matter in soil over the decade. In the present study, there was a strong tendency ( ) towards reduced ignition loss for CA1 in 2009 compared with 1999; but this was not the case for OA.

Comparing conventional and organic cropping systems in a pipe-drained plot experiment in Sweden, Torstensson et al. [31] also reported an N deficit (−18 kg N ha−1 yr−1, not including denitrification, N in seeds and atmospheric depositions) for a conventional arable rotation (CON, barley-oat-spring wheat-barley-oat-potato) comparable with CA1. They tested additionally an arable organic system with green manure as the only N source (OGM, oat-green manure-spring wheat-oat-green manure-potato), comparable with OA and found a positive soil system N budget (13 kg N ha−1 yr−1) in contrast to the present findings. In the experiment of Torstensson et al. [31], the proportion of green manure was, however, larger than in our case (33% versus 25%), which resulted in 37 kg N ha−1 yr−1 more N fixation and 20 kg N ha−1 yr−1 less harvested N, compared with OA.

The only arable system which appeared to have a balanced N budget was CA2. This may mainly be explained by its comparative low leaching and runoff N losses, which represent the main difference between CA1 and CA2 with regard to N flows. The findings indicate that reduced tillage counteracted soil N mining, which is a commonly reported result (e.g., [32]). Another factor which may have contributed to prevent soil N mining in CA2 is that straw was not removed. Straw incorporation has a well-known positive effect on the soil organic N content (e.g., [33]).

The conventional mixed dairy system CM had a calculated N surplus over the decade, which indicates that the system probably increased its soil organic matter content. Conservation of or an increase in soil N has also been reported for other rotations containing pasture or ley receiving organic N on relatively N-rich (>2.0 g kg−1) soils [32, 33]. The opposite was found in the organic system OM1, with the same crop rotation and tillage as CM, but with a calculated N deficit of 23 kg N ha−1 yr−1, indicating that the relatively high production has been maintained at the cost of the soil organic N pool. Similarly, Steinshamn et al. [34] reported an annual N deficit of 16 kg N ha−1, not including N leaching and denitrification, at the field level in an organic crop rotation with 50% grassland (barley, forage rape + Italian ryegrass, oats + peas, 3-year grassland).

The slightly negative N budget of OM2 (−16 kg N ha−1) shows that a high proportion of grass clover ley in the rotation does not guarantee a balanced N budget. By comparison, Syväsalo et al. [35] reported an even larger N deficit (−31 kg N ha−1 yr−1, not including ammonia emissions, deposition, or N in seeds) in an organic grass clover ley receiving 130 kg N ha−1 in cattle slurry. An extra year of grass clover ley instead of wheat in the OM2 rotation resulted in the largest calculated amounts of available cattle slurry for this system (82 kg N ha−1), but the N fixation was apparently much less effective in the additional ley year than in the previous two ley years. The estimated N fixation of the third ley year was only 43 kg N ha−1 yr−1, compared with 82 and 88 kg in the 1st and 2nd ley year, respectively.

The annual nutrient budgets of the mixed dairy systems showed more positive figures for 2001 and 2002 compared with the following eight years. This was a result of too high yield expectations when calculating the initial amount of slurry available for these systems in 2001. The number of cows which each farming system could sustain, and, thus, the amount of slurry available for the crops, was calculated from the average total available feed in the system during the previous three years (sliding mean). The initialization problem was, thus, gradually levelled out.

The calculated changes in soil N, that is, misbalanced soil system N budgets, were in general poorly supported by the measured changes in topsoil ignition loss since 1999, which were all nonsignificant. Considering the large differences in the measured N flows between the systems, it is very unlikely that the SOM level of 1999 would have been sustained in all systems over the following decade. One explanation of the mismatch could be an over- or underestimation of the estimated gaseous N losses, which were the most uncertain N flows in the calculated budgets. If these losses were largely overestimated, the calculated deficits of CA1, OA, OM1, and OM2 would have been reduced, but the calculated surplus of OM would have increased, and vice versa. Another explanation could be that some organic matter has been transported from topsoil to subsoil. Such a translocation of organic matter may have taken place, but it seems unlikely that this process has diverged significantly between systems with more or less the same crop rotation.

The relative differences between the systems in 2009 matched the calculated soil system N budgets much better than the differences found between sampling times, indicating that the 1999 data may include some random variation. The relation between ignition losses in 2009 and the N budgets indicated that ignition loss would equilibrate at 67 g kg−1 with a balanced N budget. This corresponds to an SOM content of 47 g kg−1, calculated with a pedotransfer function developed solely for this site (SOM = 0.81 × ignition loss (%)−0.038 × clay (%)−0.70), [21]).

The significant differences between systems in terms of N concentrations of the harvested crops were few and reflected poorly the differences in fertilization regime and soil system N budgets. Greenwood et al. [36] developed a model linking N concentration in plant DM to growth rate and to plant mass per unit area. They found that subcritical values of N concentration during growth affected the growth rate. In order to define whether any of the crops in the present study were N limited, measurements of N concentration during crop growth and not at harvest would, thus, have been required.

4.2.2. Phosphorus

Conventional arable cropping appeared to give relatively large surpluses of P. In the past, more emphasis, in Norway, was placed on adjusting N fertilizer rates to crop requirements than to adjusting P rates. In 2007–2008 there was a change in the fertilizer recommendations for P in Norway, with a reduction of approx. 25% for cereals and grasses and 30% for potatoes. This prompted the fertilizer market leader in Norway (market share > 90%) to increase the N : P-ratio of their most used compound fertilizers, from 5.3 to 7.3 for cereals and from 2.2 to 3.0 for potatoes, thus reducing the amounts of P for a given amount of N.

The organic arable system did not receive any P, and the production was thus entirely dependent on P supply from the soil. The negative soil system budget of 8 kg P ha−1 yr−1 was the same as that found in a stockless organic farm (red clover-winter wheat-spring beans-spring cereal) in the UK [15]. The results show that in stockless organic farming systems, some form of external P addition becomes unavoidable sooner or later (depending on the size of the initial P pool and the ability of the soil to deliver plant available P). Berry et al. [15] showed that a system comparable with OA was almost in balance in terms of P when it received rock phosphate. From a resource economics point of view, it is questionable; however, whether the use of untreated rock phosphate is a good strategy, considering its low plant availability [37]. An alternative could be to use organic waste, such as biogas residue from household waste, which has been shown to be a valuable and inexpensive source of plant nutrients [38].

The conventional mixed dairy system had a field surplus of almost 4 kg P ha−1 yr−1, suggesting an unnecessary use of a limited resource. This appears to be no exception. In a comparable farming system in northern Sweden, Bengtsson et al. [39] reported a P surplus of 5 kg P ha−1 yr−1. The problem seems to be even worse on conventional dairy farms, that is, those with no or a very low proportion of arable crops. The P surplus on such farms is assumed to vary from 10 to 72 kg ha−1 in Europe (Pfimlin et al. 2006, cited by [40]).

The organic mixed dairy systems produced at the cost of their indigenous soil P pool, with a total deficit of about 10 kg P ha−1 yr−1. This was not surprising, considering that there was no P input to these systems, except for that in seeds. Even when some feed is purchased, P deficits are commonly reported. Berry et al. [15] reported soil surface budgets (i.e., maximum root depth as lower boundary) of −3 kg P ha−1 yr−1 in a mixed dairy system in the UK (ley cropped in 3 out of 5 years, farm number 3). Steinshamn et al. [34] found a deficit between inputs and produce (losses not considered) in an organic dairy farming system in Norway of 6.3 kg P ha−1 yr−1.

All the significant changes in topsoil P-AL from 1996 to 2009 were in the same direction as the corresponding calculated soil system P budgets. The budget calculations were also supported by the very strong relation between the topsoil level of AL-extractable P in 2009 and the calculated soil system budget. The results clearly show the effect of both overfertilization and suboptimal P fertilization on the plant availability of P in soil.

In the present study, P losses to subsoil, that is, below root depth, were considered to be negligible. On a comparable soil in the long-term fertilizer trials at Møystad in S.E. Norway, Riley [41] found no effect of P treatment (no P addition, P in mineral fertilizer, or P in animal manure) below 40 cm depth. In contrast, Verloop et al. [40], studying intensive dairy farming systems, found that some topsoil P was transported to subsoil and that the P accumulation in the deeper layers practically equalled the P depletion in the upper topsoil. Their experiment was, however, run on a light sandy soil, characterized by a 0.3 m anthropogenic topsoil overlaying a layer of yellow sand hardly penetrable by roots.

With an assumed balanced P budget, P-AL appeared to equilibrate at 54 mg P kg−1, a level which is normally considered adequate for optimum growth. In the long-term trials at Møystad, Ekeberg and Riley [8] also found a strong relationship between topsoil P-AL and P balance (P applied via fertilizer and/or farmyard manure minus P removed by harvest) for the period 1922–1983. They reported that P-AL equilibrated at 25–30 mg P kg−1 when the application of P equalled the removal of P by crops. This equilibrium point rose to about 40 mg P kg−1 in the period 1983–2003 (H. Riley, personal communication). Experiences from an intensive dairy farm in The Netherlands, where P-equilibrium fertilization (i.e., balancing P inputs via fertilizer and manure with P in crop products) is performed, has shown that the soil available P-status differs between crop rotations.

The P content of the crops did not differ between the systems, and the herbage concentrations of P were in the range of 0.2–0.3%, a level which is regarded as adequate [42]. In comparison, Mathews et al. [43] considered 0.2–0.34% P to be the critical concentration for cool-season grasses, that is, a concentration level below which a 10% yield drop is expected. It, thus, appears that the grass clover growth was not P limited in the organic mixed dairy systems, in spite of the continuous soil P depletion of these systems.

4.2.3. Potassium

The potassium soil system budgets showed that the conventional arable systems had unnecessarily high levels of K fertilization. As for P, the change in the compound fertilizers (from 2009) also altered the amount of K relative to N, with a N : K-ratio increasing from 2.1 to 2.2 for cereal fertilizer and from 0.65 to 0.67 for potato fertilizer. Although this change has some importance for practical farming, it did not influence the results presented here.

In contrast to the present findings, Torstensson et al. [31] reported a small K deficit (−3 kg K ha−1 yr−1) in a 6 year conventional rotation with five years of spring cereals and one year with potatoes, comparable with CA1. Additionally, they found a large deficit (−28 kg K ha−1 yr−1) in a similar crop rotation but with ryegrass grown as a catch crop after each main crop, comparable with CA2. The contrasting findings of Torstensson et al. [31] may largely be explained by the K leaching, which were 5–7 times larger in their experiment than in the present study. The literature appears inconclusive, however, when it comes to K-budgets for conventional arable cropping systems. This was well illustrated by Heming [44], who studied a large number of fields in southern England, reporting K budgets (K applied in fertilizer minus K in crop) ranging from −40 to +70 kg K ha−1 yr−1.

The organic arable system OA had a calculated K-deficit. Stockless organic systems without any form of K application are bound to result in negative K budgets, as has been commonly reported (e.g., [15, 31, 45]). Interestingly, the calculated deficit in OA was of the same magnitude as the calculated deficits of the two organic mixed dairy systems (OM1 and OM2). The relatively large inputs of K in applied slurry in the mixed dairy systems, thus, appeared to be more than outweighed by large K export via harvested material. Reviewing a range of cropping systems in northern Europe, Öborn et al. [46] summarized that negative farm gate and soil surface K budgets are especially common in organic farming. This was supported by Øgaard and Hansen [47], who found negative K budgets in grassland fields on 23 out of 26 organic farms in Norway. In a study of three long-term field experiments with mixed cropping systems (six year rotations with 2/6 or 3/6 ley) on sandy loam soils over the 18 year period 1997–2004, Andrist-Rangel et al. [48] reported negative K budgets (input minus crop offtake) for organic systems in the range of −22 to −75 kg K ha−1 yr−1. They also found, however, that a conventional mixed cropping system had negative field budgets during the same period, ranging from −21 to −60 K ha−1 yr−1, in contrast to the conventional mixed cropping system of the present study (CM), which had a calculated surplus of 44 kg K ha−1 yr−1. On the other hand, Bengtsson et al. [39] reported a K surplus of 39 kg K ha−1 yr−1 in a comparable conventional system in northern Sweden.

The only significant changes in measured topsoil K-AL were the increase in CA2 and CM, in correspondence with their large calculated field K surpluses. The relative differences between systems in terms of calculated K budgets were well reflected in the relative differences between changes of K-HNO3 and there was a clear relation between K budgets and measured levels of both K-AL and K-HNO3 in 2009. The relatively large, negative soil system K budgets for the organic systems were, however, not reflected by the soil measurements. Could it be that the soil system K budgets were largely underestimated (i.e., underestimated inputs and/or overestimated outputs)?

Errors in K mass budgets are not an uncommon phenomenon (e.g., [46]). It seems unlikely, however, that such errors alone could explain the lack of fit between the calculated K budgets and the soil K measurements. For example, the unfertilized system OA would need a budget correction of about 10 kg K ha−1 yr−1, in order to achieve a balanced soil system K budget, which would reflect the K-AL data (i.e., no significant difference between 1996 and 1999). Such a change would be of the same magnitude as the entire (current) calculated K off-take by harvest, and it would correspond to more than 330% of the total K input or 300% of the K measured in drainage and runoff. Moreover, the calculated depletions of K in the organic systems are well supported by relevant comparisons in the literature, as discussed above.

Another, and more likely, explanation of the poor relation between soil system K budgets and changes in soil K measurements over time, could be that the weathering of primary minerals has released K in amounts compensating for the calculated K depletion. Annual weathering rates for different Norwegian and Swedish soils have been estimated to range from 3 to 82 kg K ha−1 [49]. Øgaard and Hansen [47], looking at K uptake and requirement in organic grassland farming in Norway, found that K uptake from reserve K, that is, K located in the interlayers of the sheet silicates in the soil, was positively related to the acid soluble K content (i.e., K-HNO3 minus K-AL) of the soil. In the present study, acid soluble K was in the range of 20–25 mg K 100 g−1 (1996 values), which corresponds to a potential uptake of reserve K between 30 and 110 kg K ha−1 yr−1, based on the findings of Øgaard and Hansen [47].

The considerations above relate to the topsoil. Additionally, there may have been substantial K uptake from the subsoil, particularly in systems with negative soil system budgets. Experiments using a K/Rb isotope dilution method on loess-parabrown soils in N. Germany have showed that the subsoil (>30 cm depth) supplied between 9 and 70% of the total K uptake in spring wheat [50].

The data give no reason to assume that the availability of K was limiting for yield formation. In the future, separate analyses of grasses and legumes will nevertheless be performed, to enable a better assessment of the critical nutrient levels of the forage crops.

4.3. Implications for Future Cropping Systems

Agronomic practices affect the balance between utilization and removal of plant available nutrients in soil. Where the reserves of potentially available nutrients (e.g., organic N, organic/fixed P, and fixed K) are large, low input farming systems may maintain productivity at the cost of a gradual decline of the soil nutrient pools. It should be emphasised, however, that soil P mining is to be considered as an equally serious depletion of a limited resource as the mining of rock P for producing mineral fertilizer. In future agricultural systems, one main challenge is to close the P cycle. Today, a large share of harvested P ends up in persistent chemical compounds due to the widespread use of coagulants (e.g., aluminium and iron) to remove P from sewage [51]. One step in the right direction is to use biogas residue from household waste, as mentioned above, as a nutrient source. Such a practice has from 2011 been integrated into the management of the organic OA system of the current study, in order to improve its nutrient balance.

Nitrogen may be fixed biologically in sufficient amounts in clover ley dominated systems. Systems which have an N input based on green manure appear, however, to be very area-demanding, and the practice of leaving large amounts of N-rich material unharvested leads to severe risks for N losses to the environment [6]. The present study shows that a good alternative is to use moderate amounts of fertilizer, combined with steps to reduce the risk of losses, such as the use of spring tillage, catch crops, and split application of fertilizer. Such a system (i.e., CA2) has previously been shown to have the lowest ratio of N loss to food production [6].

Potassium is often given relatively little attention when studying agricultural systems, probably due to the absence of direct, negative environmental impacts associated with K losses. Moreover, many soils show a remarkable capacity for replacing removed plant available K through weathering and, thus, counteracting negative effects (i.e., plant K deficiency) when soil K mining occurs, as shown in this study. In the long term, however, all systems require a balanced supply of K, which implies that most organic farmers today need to consider means to enhance the K input to their farms. This study highlights also the need for balancing K, along with N and P, in conventional systems, as there is a risk for an oversupply of K in these systems. Oversupply means that the indirect environmental impacts, that is, those associated with the manufacturing and transport of K fertilizer, are unnecessarily increased.

To perform a thorough evaluation of agricultural systems in terms of environmental impact, such as their global warming potential, all processes governing the manufacturing and transport of input factors (not only those for fertilizer) should be considered. This points toward the need for life cycle assessment (LCA) studies, which use a holistic approach for the evaluation of a system’s environmental impacts (e.g., [52]). An LCA study of the systems included in the present paper is currently in preparation.

5. Conclusions

Differences in cereal yields between organically and conventionally managed systems are larger in arable rotations than those in mixed cropping systems including ley and animal husbandry, mainly due to improved nutrition through N fixation in the ley and the availability of animal manure in the latter group.

The yield differences for grass clover leys are smaller between the two management types (i.e., organic versus conventional) than those of cereals, due to lower disease pressure in grasses and better nutrition, resulting from N fixation and better utilization of mineralised nutrients (i.e., long period of active uptake).

Arable cropping may result in soil N mining, even when fertilized with normal to high amounts of N, due to high potential for losses and poor utilization of mineralised N. The use of 25% of the production area for green manure production is insufficient as an N source alone to balance the N losses and the off-take by harvest, on a fertile soil with high yield potential. Arable cropping, which comprises the use of reduced tillage, catch crops, and moderate amounts of fertilizer appears, however, to balance the N flows at field level. The use of P and K fertilizer in arable crop production may be used to balance the respective nutrient flows but it should be used with care, as over supply is a great risk.

Mixed dairy systems, producing both cereals and fodder crops, risk an undersupply of both N, P, and K, if there is no import of these nutrients to the system in the form of purchased fodder and/or other nutrient sources. The fixation of N by the legumes in the forage crops appears to be insufficient as the only N source. The use of mineral fertilizer may very well be used to balance the flows of N, P, and K in mixed dairy systems, but this leads to a risk of oversupply.

A relatively fertile soil may be exposed to substantial mining of N, P, and K over many years without causing detectible nutrient deficits in plants cropped on this soil. A long-term over- or undersupply will, sooner or later, result in a significant change in the content of plant available nutrients in the soil, but this change may be masked by the release of nutrients from nonavailable compounds.

Field nutrient budgets appear to be a good approach to the evaluation of whether a system is managed in a balanced way or not with regard to important nutrients such as N, P, and K, long before eventual unbalances become detectible by traditional analyses. However, the approach requires comprehensive datasets, which are usually unavailable under practical conditions.

Acknowledgments

H. Riley is gratefully acknowledged for critically reading the paper and T. Gaardløs for his skilled technical assistance. The project was funded jointly by the Norwegian Institute for Agricultural and Environmental Research, the Research Council of Norway, and Yara International ASA.