Brain size does not predict learning strategies in a serial reversal learning test

Reversal learning assays are commonly used across a wide range of taxa to investigate associative learning and behavioural flexibility. In serial reversal learning, the reward contingency in a binary discrimination is reversed multiple times. Performance during serial reversal learning varies greatly at the interspecific level, as some animals adopt a rule-based strategy that enables them to switch quickly between reward contingencies. A larger relative brain size, generating enhanced learning ability and increased behavioural flexibility, has been proposed to be an important factor underlying this variation. Here, we experimentally tested this hypothesis at the intraspecific level. We used guppies (Poecilia reticulata) artificially selected for small and large relative brain size, with matching differences in neuron number, in a serial reversal learning assay. We tested 96 individuals over 10 serial reversals and found that learning performance and memory were predicted by brain size, whereas differences in efficient learning strategies were not. We conclude that variation in brain size and neuron number is important for variation in learning performance and memory, but these differences are not great enough to cause the larger differences in efficient learning strategies observed at higher taxonomic levels.


INTRODUCTION
Cognitive ability varies greatly at all taxonomic levels (Shettleworth, 2010;Thornton and Lukas, 2012). More advanced cognitive abilities may enable an animal to use previous experience to develop efficient learning strategies (Hunter and Kamil, 1971;Mackintosh, 1974Mackintosh, , 1988Rumbaugh et al., 1996;Wilson et al., 1985). Here, we define an efficient learning strategy as the ability to generalize obtained information from earlier successful responses across situations by a learnt rule. By developing and adopting efficient learning strategies, an individual is able to switch faster between contingencies and solve novel problems than if restricted to, for instance, a fixed stimulus-response action pattern (Bitterman, 1965;Gonzalez et al., 1967). Differences in this aspect of cognition are well studied across a wide array of species. For example, macaws (Diopsittaca nobilis) outperform caiques (Pionites melanocephala) in both colour association and reversal learning tasks (van Horik and Emery, 2018), pinyon jays (Gymnorhinus cyanocephalus) solve spatial and visual serial discrimination problems faster than nutcrackers (Nucifraga columbiana) and scrub jays (Aphelocoma californica) (Bond et al., 2007), and bumblebees (Bombus spp.) outperform honeybees (Apis spp.) in odour discrimination problems (Sherry and Strang, 2015). The capacity to generalize information across situations and adopt an efficient learning strategy clearly differs between species. However, the proximate mechanisms causing this divergence remain unclear. Natural selection acts on the individual and the cause of individual variation and its potential consequences in cognitive evolution have been largely overlooked (Thornton and Lukas, 2012). To understand what causes the abovestated divergence in efficient learning strategies, it is important to also examine the proximate predictors of learning performance at the intraspecific level.
Reversal learning assays have been used in taxa ranging from insects to humans (Ashton et al., 2018;Bitterman, 1965;Bond et al., 2007;Buechel et al., 2018;Day et al., 1999;Izquierdo et al., 2016;Liu et al., 2016;Sherry and Strang, 2015). There are several strengths of reversal learning assays. First, reversal learning assays test several aspects of learning ability. Second, the neurological mechanisms underlying performance are comparable across multiple species. Third, the experimental protocol is largely standardized across different species. Initially, animals are trained in a binary discrimination task (e.g. visual, olfactory or spatial cues). After either a fixed number of trials or a pre-decided learning criterion, the reward contingency is reversed. In order to be rewarded, the animal thus has to inhibit the response towards the originally rewarded stimulus (A+) and switch to the previously unrewarded stimulus (B−). In this case A+ B− becomes A− B+. The switch in reward contingency is thought to be more cognitively demanding and involves different cognitive processes and brain regions from those for the initial discrimination task (Frank et al., 1972;Izquierdo et al., 2016;López et al., 2000;Watanabe, 2012;Buechel et al., 2018). The initial discrimination tests for associative learning ability, while the ability to reverse and learn the new reward contingency tests behavioural flexibility (Bitterman, 1965;Bond et al., 2007;Izquierdo et al., 2016;Shettleworth, 2010). In serial reversal learning, the reward contingency is reversed multiple times, which is considered to be even more demanding and specifically tests for differences in the formation of efficient learning strategies (Bitterman, 1965;Bond et al., 2007;Shettleworth, 2010). During serial reversals, some animals progressively improve their performance and make fewer errors as they continuously relearn the new reward contingency (Bitterman, 1965;Bond et al., 2007;Sherry and Strang, 2015). The ability to improve requires that the individual generalizes information based on earlier experience and adopts an efficient learning strategy to maximize the rewarded responses (Bitterman, 1965;Bond et al., 2007;Shettleworth, 2010).
In reversal learning assays, a striking variation in performance, post-reversal recovery rate and degree of efficient learning strategies is found, both across and within species (Ashton et al., 2018;Bitterman, 1965;Bond et al., 2007;Buechel et al., 2018;Day et al., 1999;Elias, 1970;Lucon-Xiccato and Bisazza, 2014;Sherry and Strang, 2015;van Horik and Emery, 2018). In serial reversal assays, some species perform very well in serial reversal learning, whereas others fail to both relearn and progressively improve over serial reversals (Bitterman, 1965;Lucon-Xiccato and Bisazza, 2014;Patrick et al., 1967;Warren, 1960). One key component that has been put forward to predict performance in serial reversal learning is relative brain size (Bitterman, 1965;Buechel et al., 2018;van Horik and Emery, 2018;Elias, 1970). This idea is partially supported at the intraspecific level, where artificially selected large-brained guppies and mice outperformed their small-brained conspecifics Elias, 1970). However, serial reversal learning may be considered cognitively more demanding than reversal learning as, in order to progressively improve over serial reversals, at least one additional cognitive process is required, i.e. the ability to generalize information by a learnt rule. To date, it is still unknown whether intraspecific variation in relative brain size also causes variation in performance and efficient learning strategies during serial reversal learning.
To test the hypothesis that relative brain size predicts the ability to adopt an efficient learning strategy, we used female guppies, Poecilia reticulata W. Peters 1859, artificially selected for small and large relative brain size and with known differences in neuron numbers (Marhounová et al., 2019), in a serial reversal learning assay. Previous experiments with brain size-selected lines have shown that large-brained guppies outperform small-brained guppies in a number of cognitively demanding tasks (Herczeg et al., 2019). These include associative numerical learning (Kotrschal et al., 2013a), spatial cognition (Kotrschal et al., 2015a) and reversal learning . The apparent enhanced cognitive abilities of large-brained guppies have also been shown to be advantageous in ecologically relevant situations such as mate choice assessment (Bloch et al., 2018;Corral-Lopez et al., 2017), predator inspection behaviour (van der Bijl et al., 2015) and survival under predator threat (Kotrschal et al., 2015b). We quantified individual performance in binary colour discrimination over 10 serial reversals. As the ability to generalize previously gained information and adopt an efficient learning strategy is cognitively demanding, we hypothesized that a larger relative brain size will generate cognitive advantages facilitating performance in this test. We thus expected large-brained females to make fewer errors and adopt an efficient learning strategy that would result in a faster reversal learning rate over serial reversals.

MATERIALS AND METHODS Brain size-selected guppies
The experiment was performed in accordance with ethical applications approved by the Stockholm Animal Research Ethical Permit Board (Dnr: N173/13 and 223/15). We used n=96, 7-8 month old female offspring from 7th generation guppies artificially selected for relative brain size, i.e. brain mass relative to body length (Kotrschal et al., 2013a), with associated differences in neuron number (Marhounová et al., 2019). In the 5th generation, the artificial selection had resulted in 15.4% difference in relative brain size and 11.9% difference in neuron number (Marhounová et al., 2019). Briefly, descendants of wild-caught guppies from high-predation areas in the Quare River, Republic of Trinidad and Tobago, were used to set up three independent breeding stocks (replicates). From each of these, one up-and one down-selected line were created, resulting in six brain size selection lines; 75 breeding pairs per replicate formed the parental strains. For details on the artificial selection, see Kotrschal and colleagues (2013a). Only females were used in this experiment as males have been difficult to motivate with a food reward (Fuss and Witte, 2019;Kotrschal et al., 2013b). Fish were kept with constant aeration, in 25±1°C water temperature, on a 12 h:12 h dark:light cycle. Fish were fed 6 days per week with flake food and live Artemia nauplii.

Experimental apparatus
We used the experimental set-up described by Buechel and colleagues (2018) and first used by Lucon-Xiccato and Bisazza (2014). The 7 l experimental tanks were divided into a home compartment and a conditioning chamber (Fig. 1). To avoid unnecessary handling stress, individuals were held in the Fig. 1. Schematic diagram of the serial reversal learning set-up. The tank consisted of a home compartment (A) and a conditioning chamber (B). These were separated by a transparent sliding door (C) and an opaque sliding door (D). All training took place in the conditioning chamber. A white plate (E) with 20 holes (10 mm in diameter, 5 mm deep) was placed at the bottom in the conditioning chamber. Animals were trained to discriminate between a red plastic disc (F) and a yellow plastic disc (G) and find an Artemia underneath the rewarded stimulus. experimental tanks for the duration of the experiment. The fish were physically isolated but visual contact was allowed between adjacent home compartments to avoid negative isolation effects on learning (Bouton, 2007;Petrazzini et al., 2012). However, visual contact between adjacent conditioning chambers was prevented to avoid social learning effects (Brown and Laland, 2002;Laland and Williams, 1997;Reader et al., 2003). All training took place in the conditioning chamber. The home compartment and the conditioning chamber were divided by an opaque and a transparent sliding door. The opaque doors prevented the females from perceiving any visual cues from the arrangement by the experimenter. The transparent doors allowed the females to habituate to and assess the arrangement before entering the conditioning chamber. The trial started with the opening of the opaque door; 10 s later, we opened the transparent door. To prevent observer bias, the experiment was conducted blind and experimental tanks were only identified by running numbers.

Pre-training
First, the females were trained to dislodge a black plastic disc (14 mm in diameter) placed on a white plate with 20 equispaced holes (10 mm in diameter, 5 mm deep) to access one frozen adult Artemia underneath. During the first trials, the disc only partially covered the well. Over successive trials, we increased well coverage until it was completely covered. All females could easily dislodge the disc after 30 trials. This set-up is ecologically relevant as it makes use of guppies' natural behaviour to forage underneath leaves and other plant parts (Houde, 1997).

Colour discrimination learning
We trained 96 females (48 each of small and large brained) to discriminate between one red and one yellow disc and to associate one of the stimuli with the food reward. Half of all small-and largebrained females were trained to associate the red stimulus with a food reward, whereas the other half of all small-and large-brained females were trained to associate the yellow stimulus with the food reward. We chose stimuli colour in consideration of mate and food choice preferences of female guppies. Female guppies are known to prefer orange colours as they signal high quality in both mates and food (Houde, 1997;Rodd et al., 2002). The disc with the rewarded conditioned stimulus (CS+) was moveable, whereas the unrewarded conditioned stimulus (CS−) was unmovable because of a glued-on knob fixed in the well. In order to control for olfactory cues, food was placed underneath both the rewarded and the unrewarded disc. As mentioned above, the CS+ colour was counterbalanced and randomly distributed across the brain size-selected lines and replicates to control for any innate preference and colour bias. The females were given six trials per day over five consecutive days (i.e. 30 trials), with a 2 day pause prior to each reversal. To prevent side bias, the position (left-or right-hand side) of the CS+ was randomly chosen for each trial, with no more than two consecutive trials in the same position. For each trial, we scored the first push on either of the discs as either correct or incorrect. If a female did not push any of the discs within 120 s, that trial was counted as a no-choice trial. The time limit was chosen based on our experience in training guppies in this set-up  and for logistic reasons such that relevant information was collected while permitting a relatively large sample size. For incorrect and no-choice trials, we gave each female 15 min to make a correct choice before we moved the rewarded disc 5 mm to the side to allow easy access to the food. This ensured that all females received the same number of positively reinforced trials throughout the experiment. During training, all females were tested in a randomized order, with trials typically running between 08:15 h and 17:00 h.

Serial reversal learning assay
Following the initial colour discrimination training, the reward contingency was reversed, i.e. CS+ became CS− and vice versa, for 10 serial reversals. The trial procedure and duration were identical to the procedure described in the colour discrimination learning. We established a fixed number of trials (30) per reversal with reference to the previously established fast colour discrimination ability in female guppies Lucon-Xiccato and Bisazza, 2014).

Data analysis
We performed all statistical analyses in R statistical software (version 3.5.1, http://www.R-project.org/), with the glmer function in lme4 packages, version 1.1-18-1 used for mixed modelling (http://lme4.r-forge.r-project.org). In order to determine how relative brain size might explain variation in learning ability, we used generalized linear mixed models (GLMMs) with binomial error distributions and logit link functions (0=incorrect response, 1=correct response). We analysed the initial colour discrimination task and the serial reversal tasks separately, as the initial colour discrimination task tests associative learning ability, whereas serial reversal learning also tests behavioural flexibility and efficient learning strategies. Continuous variables were always centred at midpoint prior to analyses. Brain size was structured as a two-level categorical factor (small and large). Initially, all models included brain size nested in replicate as a random effect, but replicate returned a zero variance that caused singular fit. To control for a potential effect of replicate we thus included it as a fixed effect in all models, but dropped it from the models as it was not significant (P>0.3) and inclusion of replicate did not improve model fit (decreased Akaike information criterion, AIC). Non-significant interactions (P>0.1) were excluded until the lowest AIC was met. Statistical significance was obtained by using the ANOVA function, specifying Type III Wald χ 2 tests, in the car package (Fox and Weisberg, 2019).
The full model testing initial colour discrimination included the fixed effects rewarded colour, the interaction between brain size and trial, as well as random slope for fish ID [glmer syntax final model: success∼brain size+trial+rewarded colour+(trial|fish ID)].
The full model testing the hypothesis that reversal learning rate is predicted by relative brain size initially included trial number as a fixed effect. Trial caused scaling problems and was therefore dropped from the model and replaced by session number (one to five). Each session included six trials for each reversal. The full model included the fixed effects rewarded colour, and the interactions between brain size and session, and brain size and reversal as well as random slope for fish ID [glmer syntax final model: success∼brain size×session+reversal+rewarded colour+(reversal|fish ID)].
The last trial of each reversal was followed by a 2 day pause prior to the next reversal, as described above, which constitutes an opportunity to measure explicit long-term memory (Bailey et al., 1996). By comparing performance on the first trial of each reversal, we could thus test for differences in long-term memory. The full model testing the performance in the first trial per reversal included the fixed effects rewarded colour, the interaction between brain size and reversal, as well as random intercept for fish ID [glmer syntax final model: success∼brain size+reversal+rewarded colour+(1|fish ID)]. To test whether differences in performance in the first trial of each reversal were caused by differences in memory retrieval, we fitted an additional GLMM that compared the performance on the fifth day of each reversal between small-and large-brained females. The full model included the fixed effects rewarded colour, brain size and reversal, as well as random intercept for fish ID [glmer syntax full model: success∼brain size+reversal+rewarded colour+(1|fish ID)].

Colour discrimination learning
In the colour discrimination part of the experiment, animals learnt to associate a colour stimulus with a food reward (trial: χ 2 1 =63.22, P<0.001). However, matching previous results, neither learning rate (slope of the learning curve) nor performance was predicted by brain size. During the last day of colour discrimination training, smalland large-brained females reached similar performance levels, with a raw data mean±s.e. of 93±3.7% correct responses for smallbrained females versus 98±2.0% correct responses for large-brained females. We found no main effect of brain size (χ 2 1 =0.20, P=0.65). Females trained on the red stimulus learnt to associate the stimulus colour with the reward at a faster rate than females trained on the yellow stimulus (χ 2 1 =19.35, P<0.001); note, that rewarded stimulus colour was counterbalanced between brain size and replicates (see Materials and Methods).

Reversal learning rate and performance across serial reversals
The brain size×reversal interaction was not significant (χ 2 1 =0.18, P=0.68; Fig. 2), and was therefore excluded (as mentioned in Materials and Methods). This means that there was no evidence for differences in the ability to progressively improve and thereby adopt an efficient learning strategy between small-and large-brained females. For both brain size-selected lines, learning rate and performance during each reversal was negatively correlated with increasing number of reversals; error rate increased over serial reversals (χ 2 1 =17.91, P<0.001). Performance on the last day of the final reversal was slightly above chance with a raw data mean±s.e. of 62.9±0.08% for small-brained versus 68.3±0.07% for largebrained females. Across reversals, the interaction between brain size and session number was significant (χ 2 1 =6.44, P=0.01), suggesting a steeper average learning curve in large-brained compared with small-brained females within a week-long reversal. It was easier to switch from the rewarded red to the rewarded yellow stimulus than vice versa across reversals (χ 2 1 =301.96, P<0.001). The model also revealed a significant main effect for session, but not for brain size (session: χ 2 1 =791.99, P<0.001; brain size: χ 2 1 =0.83, P=0.36).

Explicit long-term memory
On the first trial in nine out of the 10 reversals, large-brained females made significantly more errors and therefore performed at a lower level than small-brained females (χ 2 1 =10.99, P<0.001; Fig. 3), suggesting an enhanced explicit long-term memory in large-brained females. Importantly, the performance on the last day of training for each reversal did not differ between the brain size-selected lines (χ 2 1 =0.14, P=0.71), which indicates that the memory of what was learnt in the previous reversal was better stored in large-brained compared with small-brained females. The model for the last day of each reversal also revealed significant main effects for reversal and colour (reversal: χ 2 1 =57.64, P<0.001; colour: χ 2 1 =68.40, P<0001). For both brain size-selected lines, performance on the first trial of each reversal increased with increasing reversals (χ 2 1 =42.66, P<0.001). Note that an increase in performance in the first trial of each reversal was an effect of increasingly lower performance levels at the end of the previous reversal. Memory retrieval was affected by rewarded stimulus colour (χ 2 1 =14.73, P<0.001), suggesting that across reversals, the yellow stimulus was better stored in the longterm memory.

DISCUSSION
We investigated the performance of small-and large-brained female guppies in a serial reversal learning assay. We found that largebrained females learnt the new reward contingency at a faster rate and also stored and retrieved information more efficiently in their explicit long-term memory as compared with their small-brained conspecifics. However, none of the brain size-selected lines progressively improved their performance over serial reversals. Rewarded colour was a strong predictor of performance and learning rate for both brain size-selected lines. . We found no evidence for the hypothesis that relative brain size predicts the ability to progressively improve performance over serial reversals and thereby develop an efficient learning strategy (brain size×reversal; χ 2 1 =0.18, P=0.68). Raw mean data (based on 26,763 observations in total) for 48 small-brained and 48 large-brained female guppies across 10 serial reversals. The logistic regression slope estimates for smallbrained (grey line) and large-brained (black line) females and 95% confidence interval (shading) are predictions obtained from a GLMM with binominal error distribution.
Differences in relative brain size, with associated differences in neuron number, affected variation in two distinct aspects of cognition in our experiment, as relative brain size predicted associative learning rate over serial reversals and predicted explicit long-term memory storage. However, learning rate during the initial discrimination did not differ significantly between the brain size-selected lines, something that has been observed in earlier assays on these selection lines . Performance on the last day of training across serial reversals did not differ between the brain size-selected lines, whereas performance on the first trial of each reversal did. Large-brained females made more errors on the first trial across serial reversals, yet they reached the same performance level as small-brained females on the last day of each reversal. From this, we conclude that associative learning rate was faster in large-brained than in small-brained females in the more cognitively demanding reversal learning context; the significant interaction between brain size and session number corroborates this argument. Previous experimental studies have also found positive relationships between learning rate in demanding contexts and relative brain size, both in the brain size-selected guppies  and in mice artificially selected for increased brain mass (Elias, 1970). The lower performance in the first trial of each reversal is most likely explained by differences in memory capacity. What is learnt towards the end of each reversal seems to be better stored in the explicit long-term memory of large-brained females. Phylogenetic comparative studies have revealed a correlation between whole-brain size and different aspects of memory storage, for instance between bird species (Garamszegi and Eens, 2004). Taken together, our results are mainly consistent with previous findings at the interspecific level in that relative brain size is advantageous in particularly demanding cognitive contexts (Benson-Amram et al., 2016;MacLean et al., 2014;Boire et al., 2002;Reader and Laland, 2002), whereas more fundamental aspects of cognition are probably less related to relative brain size . However, we found no support for the proposal that relative brain size predicts the ability to learn from previous experience at the within-species level, as we found no difference in the ability to adopt an efficient learning strategy. The learning rate in both small-brained and large-brained females was instead strongly impaired over an increasing number of reversals, which almost led to extinction in the last reversals. Female guppies of the brain size-selected lines showed neither signs of stability in what was learnt during early reversals nor any flexibility to make new associations when contingencies were reversed repeatedly. We conclude that female guppies only rely on associative learning processes and apparently lack the ability to take advantage of previously successful discrimination in order to adopt an efficient learning strategy.
There are at least two possible explanations for why learning rate was impaired by serial reversals in both of the brain size-selected lines. First, ecological requirements create species-specific challenges that generate divergence in cognitive abilities (Shettleworth, 2010). Continuous changes in feeding opportunities or social complexity are ultimately thought to generate a flexible behaviour repertoire (Bond et al., 2007;Day et al., 1999;Sherry and Strang, 2015). In accordance with the existence of trade-offs between cognitive abilities, as proposed by Tellos-Ramos and colleagues (2019), the ecology of the guppy might favour spatial memory rather than the advanced learning abilities investigated here. Second, cognitive processes involved in the ability to adopt an efficient learning strategy are controlled by a specific region in the telencephalon (Frank et al., 1972;Izquierdo et al., 2016;López et al., 2000;Watanabe, 2012). Primates have among the largest relative telencephalon sizes in the animal kingdom (Finlay and Darlington, 1995), and telencephalon neuron number and density in parrots and many songbirds are equivalent to those of primates (Olkowitcz et al., 2016). Interestingly, species from these taxa are also known to typically perform well in serial reversal assays (Cauchoix et al., 2017;Boogert et al., 2011;Gosette, 1968;Gosette et al., 1966;van Horik and Emery, 2018). Brain region size typically varies between but not within species (Finlay and Darlington, 1995;Gould et al., 2013;Healy and Krebs, 1992;Lucas et al., 2004). Neither neuron density nor relative (to the rest of the brain) telencephalon volume differ between the brain size-selected lines We found that relative brain size predicted long-term memory, as large-brained females made more errors on the first trial of each reversal (brain size; χ 2 1 =10.99, P<0.001). Raw mean data (based on 948 observations in total) and the logistic regression slope estimates for 48 small-brained and 48 large-brained female guppies in a serial reversal learning assay. The dotted horizontal line represents the 50% performance level. The logistic regression slope estimates for small-brained (grey line) and large-brained (black line) females and 95% confidence interval (shading) are predictions obtained from a GLMM with binominal error distribution. Marhounová et al., 2019), which might explain why we found quantitative (i.e. a slightly higher learning rate) but not qualitative (i.e. negative effect of increasing reversals in both brain size-selected lines) differences in learning ability between the brain size-selected lines. We speculate that more advanced learning abilities are explained by the evolution of increased telencephalon size and/or neuron number, rather than by relative brain size and total number of neurons. Alternatively, individual variation in relative brain size and neuron number is too small for the detection of differences in advanced learning abilities in these selection lines.
The positive effect on discrimination learning rate with increasing number of reversals is generally substantially smaller in fish (Engelhardt et al., 1973;Fuss and Witte, 2019;Levin and Vergara, 1987;Lucon-Xiccato and Bisazza, 2014;Mackintosh and Cauty, 1971;Squier, 1969;Wodinsky and Bitterman, 1957) than in other vertebrates (Bitterman, 1965;Bond et al., 2007;Mackintosh and Cauty, 1971). Although there are examples of a positive effect on discrimination learning rate across reversals (see for example Engelhardt et al., 1973;Mackintosh and Cauty, 1971;Wodinsky and Bitterman, 1957), the majority of studies in fish have failed to show any progressive improvement over serial reversals (Behrend et al., 1965;Behrend and Bitterman, 1967;Bitterman et al., 1958;Bitterman, 1965;Engelhardt et al., 1973;Fuss and Witte, 2019;Gonzalez et al., 1967;Levin et al., 1984;Lucon-Xiccato and Bisazza, 2014;Patrick et al., 1967;Portavella and Vargas, 2005;Warren, 1960). Our results are thus consistent with many previous serial reversal learning assays in other species of fish. Another aspect of the results found in fish in this context is that, even when they show improvement over increasing reversals, they rarely improve their performance over serial reversals beyond the level of performance in the initial discrimination. It can thus be questioned whether the fish tested really understand 'the principle of reversal' (sensu Shettleworth, 2010).
A surprising finding during this experiment was the strong effect of the rewarded stimulus. During the initial colour discrimination part, female guppies from both brain size-selected lines made more errors with the yellow compared with the red stimulus. Red is well known to be a signal of strong biological significance for female guppies as, under natural conditions, it predicts feeding opportunities and male quality (Houde, 1997). It is thus not surprising that we found a bias for red, something that has been shown before in a learning context . Interestingly, across serial reversals, more errors were made with the red compared with the yellow stimulus. This suggests a shift in this pre-existing bias, induced by an increased number of reversals. One possible explanation for this shift might be that as red is an important stimulus for female guppies, not being rewarded for pushing the red disc might be perceived as a strong negative experience. Over time, this might decrease the willingness to push the red disc. A shift in the pre-existing bias for important stimuli has been shown to change after both positive and negative experience in butterflies (Westerman et al., 2012) and spiders (Hebets, 2003;Hebets and Vink, 2007;Svensson et al., 2010) in mating contexts. However, whether the shift from red to yellow bias in this experiment is an experimental artefact or whether it has an adaptive value should be investigated further.
Although the relative brain size was not measured directly in the fish used in this experiment, all of the previous assays of relative brain size in both earlier and the current generation of selection have shown substantial differences (typically 12-15% differences in later generations) in relative brain size (Kotrschal et al., 2013a(Kotrschal et al., ,b, 2017), brain volume Marhounová et al., 2019) and neuron numbers (Marhounová et al., 2019) between these selection lines. We therefore assume that these differences remain in the fish used in the current study.
To conclude, it is increasingly clear that in demanding contexts, a relatively larger brain size provides individuals with an enhanced cognitive ability. However, differences in learning ability and behavioural flexibility between the brain size-selected lines were not large enough to enable detection of the differences found at higher taxonomic levels (Bitterman, 1965;Bond et al., 2007). We speculate that the mechanisms causing variation in the ability to adopt an efficient learning strategy require larger variation in brain size, neuron number or region structure size than the differences in relative brain size observed in these selection lines. Additionally, based on both the results of the present study and those of previous experimental studies Elias, 1970), we hypothesize that while a relatively larger brain increases performance in species-specific tasks, it does not provide any additional learning ability. We thus propose that brain size-driven cognitive divergence within species is mostly quantitative in nature.