The Hebb repetition effect in simple and complex memory span

Oberauer, Klaus; Jones, Timothy; Lewandowsky, Stephan

doi:10.3758/s13421-015-0512-8

The Hebb repetition effect in simple and complex memory span

Published: 25 February 2015

Volume 43, pages 852–865, (2015)
Cite this article

Download PDF

Memory & Cognition Aims and scope Submit manuscript

The Hebb repetition effect in simple and complex memory span

Download PDF

Klaus Oberauer¹,
Timothy Jones² &
Stephan Lewandowsky^2,3

4881 Accesses
20 Citations
1 Altmetric
Explore all metrics

Abstract

The Hebb repetition effect refers to the finding that immediate serial recall is improved over trials for memory lists that are surreptitiously repeated across trials, relative to new lists. We show in four experiments that the Hebb repetition effect is also observed with a complex-span task, in which encoding or retrieval of list items alternates with an unrelated processing task. The interruption of encoding or retrieval by the processing task did not reduce the size of the Hebb effect, demonstrating that incidental long-term learning forms integrated representations of lists, excluding the interleaved processing events. Contrary to the assumption that complex-span performance relies more on long-term memory than standard immediate serial recall (simple span), the Hebb effect was not larger in complex-span than in simple-span performance. The Hebb effect in complex span was also not modulated by the opportunity for refreshing list items, questioning a role of refreshing for the acquisition of the long-term memory representations underlying the effect.

The Hebb repetition effect in complex span tasks: Evidence for a shared learning mechanism with simple span tasks

Article Open access 06 December 2021

Claudia Araya, Klaus Oberauer & Satoru Saito

The long-term consequences of retrieval demands during working memory

Article 27 August 2020

Vanessa M. Loaiza, Charlotte Doherty & Paul Howlett

Secondary task engagement drives the McCabe effect in long-term memory

Article 08 August 2023

Kelly Cotton, Joshua Sandry & Timothy J. Ricker

Half a century ago, Donald Hebb (1961) asked participants in an experiment to remember lists of random digits for immediate recall in the order of presentation. Unbeknown to the participants, Hebb presented the same list on every third trial, interspersed with new random lists in the intervening two trials. Across 24 trials, immediate serial recall improved for the repeated but not the random lists. Although participants were not asked to remember the lists beyond the time of immediate test, more long-lasting memory traces accrued with the repetitions, which gradually improved people’s ability to remember lists matching those traces.

Tests of immediate serial recall are routinely used to investigate short-term or working memory, also known as primary memory (from here on we will use the term working memory). Most tests of serial recall involve the simple-span procedure, in which people must recall a list of items immediately upon presentation in forward order. Because of its limited capacity, working memory is commonly assumed to hold only the current list, perhaps with a few traces of the immediately preceding one, but it is not thought to be suited to acquire a representation of the commonalities of lists spanning four trials. Therefore, the pervasive Hebb effect documents the contribution of some longer-lasting form of memory, referred to as long-term memory or secondary memory, to tests of immediate serial recall. The Hebb effect implies that lists maintained for immediate recall leave long-term memory traces, and that these traces are used in immediate recall (Burgess & Hitch, 2005, 2006; Page & Norris, 2009).

Here we investigate whether the Hebb repetition effect is also observed with two variants of the complex-span paradigm. The typical complex-span task differs from the simple-span task by the addition of a distractor task that is to be carried out in between pairs of list items during encoding; here we also investigate a less common variant in which distractors are interspersed between items at retrieval. The distractor task usually requires processing without any explicit memory demand, for instance reading sentences (reading span, Daneman & Carpenter, 1980), solving arithmetic problems (operation span, Turner & Engle, 1989), or carrying out a series of choice response tasks (Barrouillet, Bernardin, Portrat, Vergauwe, & Camos, 2007). Complex-span tasks have become popular in particular because their psychometric properties render them suitable for measuring working-memory capacity (Oberauer, Süß, Schulze, Wilhelm, & Wittmann, 2000; Wilhelm, Hildebrandt, & Oberauer, 2013), and they are good predictors of fluid intelligence (Conway et al., 2005; Conway, Kane, & Engle, 2003; Engle, Tuholski, Laughlin, & Conway, 1999). Although behavioral phenomena from complex span tests bear many similarities with those from simple-span tests, the two paradigms also differ in some regards. For instance, whereas the majority of errors in simple-span tests are order errors, item errors are more prevalent in complex span tests (Oberauer, Lewandowsky, Farrell, Jarrold, & Greaves, 2012; Unsworth & Engle, 2007b). In correlational studies, when multiple simple span and complex-span tasks are used, the two types of tasks load on separate factors (Gathercole, Pickering, Ambridge, & Wearing, 2004; Kane et al., 2004). Those observed differences between the two types of span task render it plausible that simple-span and complex-span performance may also differ with regard to the Hebb effect.

As we discuss next, there are additional reasons to believe that an examination of the Hebb effect in complex span should be theoretically rewarding. Compared to the Hebb effect in simple span, there are equally plausible theoretical reasons to believe that the Hebb effect in complex-span should be greater, or that it might not be present at all. The goal of the present work is to investigate the empirical merits of these competing theoretical expectations.

The Hebb effect could be diminished or even abolished in complex span because the distractor task interrupts encoding of the list, thereby disrupting the formation of an integrated list representation. The Hebb effect in simple span appears to depend on an integrated representation of the list, as demonstrated by an experiment by Cumming, Page, and Norris (2003): After a learning phase with the standard repetition of one list every third trial, they introduced transfer lists matching the previously repeated list in every second list item, whereas the intervening list positions were filled with new items. There was no transfer from the learned list to recall of the repeated items on these transfer lists. Hitch, Fastame, and Flude (2005) investigated Hebb learning with training lists in which only every second item was repeated, rather than the complete list as in the standard Hebb paradigm. There was no evidence of learning in this condition. On the basis of their results, Cumming et al. (2003) as well as Hitch et al. (2005) argued that Hebb learning consists of the formation of a unified (chunked) representation of the list, or at least of segments of the list (for computational models implementing this idea see Burgess & Hitch, 2006; Page & Norris, 2009). The acquisition and use of such chunks is disrupted if repetition is limited to sub-components of learned chunks.

On this hypothesis, one might expect the Hebb effect to be at least diminished – if not absent altogether– in complex span: It is known that people find it difficult to exclude representations used for distractor-task processing from working memory, and those attempts are often not entirely successful (Oberauer, Farrell, Jarrold, Pasiecznik, & Greaves, 2012). If distractor materials are selected at random, as in our experiments reported below, then distractor representations arguably play a similar role to the not-repeated items in the training lists of Hitch et al. (2005), or in the transfer lists of Cumming et al. (2003): When representations of not-repeated distractors are interspersed with representations of repeated list items, formation or application of chunks could be disrupted, thereby diminishing the Hebb effect or abolishing it altogether.

Moreover, Hebb learning is incidental – after all, participants are not asked to remember a list any further after they have finished recalling it immediately after presentation. It is generally assumed that incidental learning does not depend on the person’s intention to learn but on the kind and degree of processing of the material (Craik & Lockhart, 1972; Hyde & Jenkins, 1969). Thus, any material that is attended to and processed to some extent becomes a candidate for incidental learning. It follows that incidental learning of events during a complex-span trial is unlikely to be limited to list items at the exclusion of distractors: People must process the distractors just like list items, and indeed most complex-span experiments enforce an accuracy criterion on the distractor task to prevent people from focusing on the list alone (Conway et al., 2005). One would therefore expect Hebb learning to apply non-selectively to all events that a person attends to and processes during a complex-span trial, in which case distractor-task representations should be as much part of the long-term memory trace as the memoranda. When the distractors have no systematic relationship to the memoranda and differ across repetition trials as well – as was the case in all experiments presented below – such a composite trace would be largely useless for improving recall of a repeated list in the current Hebb paradigm. These reasons justify the prediction that the Hebb effect should be diminished or even abolished in the complex-span paradigm.

On the other side of the argument are considerations that cite the presumed greater involvement of secondary or long-term memory in complex span compared to simple span. Unsworth and Engle (2006) have argued that in simple-span tests, up to four list items can be held in working memory, whereas in complex span the distractor task pushes previous list items out of working memory. In consequence, at the point of recall after a complex-span list only the last one or two items can be recalled from working memory, whereas the remainder must be retrieved from long-term memory. If this assumption is correct, the Hebb effect might be expected to be larger in complex span than in simple span: The Hebb effect reflects a gradually strengthened long-term memory representation of the repeated list, and the impact of that representation on immediate-recall performance should be larger the more that recall depends on long-term memory.

The assumption that long-term memory is more involved in complex span than in simple span received support from an observation first reported by McCabe (2008): When tested with a final free recall test for the words on all memory lists encountered in the experiment, participants were found to recall more words from complex-span than from simple-span lists. This effect has been replicated several times (Loaiza & McCabe, 2012; Loaiza, McCabe, Youngblood, Rose, & Myerson, 2011). According to McCabe, the effect arises because in complex span people cannot hold the entire list in working memory. They are thus forced to temporarily outsource parts of the list to long-term memory, and to bring them back into working memory through “covert retrieval” during the distractor-task phases. Because covert retrieval serves as retrieval practice, stronger long-term memory traces are established in complex-span than in simple-span, which does not require covert retrieval during list presentation. If covert retrieval during complex span contributes to the Hebb effect, then the strength of the Hebb effect in complex span should depend on the opportunity for covert retrieval. This opportunity can be varied through the “cognitive load” imposed by the distractor task. Cognitive load, as defined by Barrouillet et al. (2007), refers to the proportion of time available for the distractor task during which attention is actually occupied by the distractor task. When distractors are presented at a leisurely pace (e.g., one arithmetic step every 2 s), then cognitive load is said to be low because processing only takes up a fraction of the available time. When distractors are presented at a fast pace (e.g., one arithmetic step every 500 ms), cognitive load is high because the entire available time is required for processing. According to Barrouillet and colleagues, any remaining time in between distractors that is not taken up by processing can be used to attend to the representations of the memoranda, thereby refreshing them. The concept of refreshing as used by Barrouillet and colleagues (c.f. Raye, Johnson, Mitchell, Greene, & Johnson, 2007) is very similar to the concept of covert retrieval, as McCabe (2008) recognized. The two processes might not be the same, but for both it is assumed that they can be carried out only when attention is not occupied by a distractor task. It follows that varying cognitive load arguably also varies the opportunity for covert retrieval during a complex-span task. If covert retrieval plays a role in building the long-term memory representations underlying the Hebb effect, then the Hebb effect in complex span should be larger at low than at high cognitive load.

To summarize, theoretical considerations and existing evidence provide equally strong reasons for predicting that the Hebb effect in complex span should be larger than in simple span, or that it should be smaller or even non-existent. Through the following experiments we tested these contrasting predictions. Experiments 1 and 2 established that there is a Hebb effect with complex span, suggesting that distractors did not disrupt the formation of integrated list representations. Experiment 3 generalizes this finding to a version of complex span in which distractor processing is interspersed between recall rather than encoding of memoranda, thereby showing that the effect is resilient to disruptions at test. Finally, Experiment 4 directly compared the Hebb effect for simple and complex span. In addition, Experiment 4 varied the opportunity for covert retrieval (or refreshing) in a complex span paradigm, thereby testing the assumption of McCabe (2008) about the role of covert retrieval in that paradigm. We found that the size of the Hebb effect was unaffected by cognitive load. We conclude that the Hebb effect is a highly robust attribute of list learning that is unaffected by disruptive distractors at encoding or test.

Experiments 1 and 2

Participants performed a complex-span task, which required them to remember lists of consonants and to make size judgments on words displayed after each consonant. The same list was used on every third trial; we refer to those trials as the repetition trials. Each set of three consecutive trials, including one repetition trial and two non-repetition trials, will be called a cycle. We expected a Hebb repetition effect, that is, better immediate recall for repetition trials than new trials, especially at later cycles. Experiments 1 and 2 differed in only two regards: Experiment 1 involved memory lists of seven consonants; for Experiment 2 we increased list length to eight to create more room for improvement through learning. To compensate for the longer duration of trials, in Experiment 2 we reduced the number of trials from 27 to 24.

Method

Participants

Participants were 32 (Experiment 1) and 27 (Experiment 2) members of the University of Western Australia community. They took part in a single 1-hour session in exchange for AUD$10 or course credit.

Materials

For each new trial a memory list was constructed by sampling the required number of consonants without replacement from the set of all consonants except Q and Y. The list for the first repeated trial was constructed in the same way and then held constant for all repetitions. The repeated list was used in every third trial, beginning with trial three.

Materials for the distractor task consisted of 264 English nouns referring to concrete objects. They were selected from a larger set of nouns referring to objects varying across a broad range of size, from “ladybird” to “sun.” The participants’ task was to judge for each word whether the object was smaller or larger than a soccer ball. To make the task unambiguous, we selected only the words referring to the 25 % largest and the 25 % smallest objects in the original set. Each word from the experimental set was used three times throughout the experiment. Words for each size judgment were drawn at random on every trial, including for the repetition trials. Thus, repetition trials had a constant memory list but variable distractor-task stimuli. The random selection of distractors maximizes the chance of distractor representations disrupting the formation of an integrated list representation, thereby creating a condition for which there are good theoretical reasons to expect the Hebb effect to disappear.

Procedure

Each trial started with a fixation cross, followed after 3 s by the first letter displayed centrally in red for 1.5 s. The letter was immediately replaced by the first distractor word, displayed centrally in black. Participants judged whether the word referred to an object larger or smaller than a soccer ball by pressing the “/” (slash) key or the Z key, respectively, on the computer keyboard. Once a response was made, or after the maximum time of 2 s elapsed, the distractor disappeared and was replaced by the next word. Each letter was followed by four size judgments. The fourth size judgment was immediately followed by the next to-be-remembered letter, and so on until presentation of the list was completed. The very last size judgment was followed by a red question mark, prompting participants to commence recall by entering the first letter on the keyboard. The entered letter was displayed for 0.3 s, and was then replaced by the question mark again to prompt recall of the second letter, and so on until participants had given as many responses as there were letters in the list. Omissions were not allowed. The next trial commenced 2.5 s after the last recall response.

Results

We first report memory accuracy to test for the classic Hebb repetition effect. Next we ask whether repetition of the memory list had an impact on speed and accuracy of the distractor task. We analyzed all data with a Bayesian linear regression model, using the BayesFactor package (Morey & Rouder, 2012; Rouder, Morey, Speckman, & Province, 2012) for R (R Development Core Team, 2012). The lmBF function in the BayesFactor package estimates linear models and returns the Bayes factor (BF) of the model relative to a null model that predicts the data by the intercept alone. Two alternative models M₁ and M₂ can be compared to each other by dividing their BFs (relative to the null model). The ratio of the BFs of M₁ vs. null and M₂ vs. null is the BF of M₁ vs. M₂.

For each analysis we investigated two predictors, cycle and repetition. Cycle refers to the ordinal number of the eight sets of three consecutive trials, each including one repeated and two non-repeated lists. Cycle was entered as a continuous variable, centered on zero. For each analysis we estimated four models: M _c, with only a main effect of cycle; M _r, with only a main effect of repetition versus new trials; M _add, with additive effects of cycle and repetition; and M _full, with both additive effects and their interaction. Each of these models included subjects as a random effect, and therefore we also estimated M _b as a baseline model with only the intercept and the random effect of subjects. We assessed the strength of evidence for the main effect of cycle by BF(M _c)/BF(M _b), and the main effect of repetition by BF(M _r)/BF(M _b). Evidence for the interaction was assessed by BF(M _full)/BF(M _add). BFs larger than 1 reflect evidence in favor of the model in the numerator; Bayes factors smaller than 1 reflect evidence in favor of the model in the denominator. The strength of evidence for the model in the denominator can be gauged by the reciprocal of the BF. For instance, if BF(M_full)/BF(M_add) = 0.5, then the BF in favor of the additive model is 2. BFs <3 are usually regarded as evidence “barely worth mentioning;” BF between 3 and 10 as “substantial evidence,” BF between 10 and 100 as “strong evidence,” and BF >100 as “decisive” (Kass & Raftery, 1995).

Memory accuracy

Memory performance was scored as the proportion of letters reported in their correct list positions. Figure 1 shows proportion correct by cycle and repetition (new vs. repeated). Table 1 summarizes the BFs reflecting the strength of evidence for the main effects and the interaction. The evidence for a main effect of cycle was substantial in Experiment 1 but weak in Experiment 2. There was compelling evidence for the main effect of repetition in both experiments. The interaction was supported only weakly in both cases.

Table 1 Bayes Factors for the linear models for Experiments 1–3, and the three span conditions of Experiment 4

Full size table

The Hebb effect was primarily reflected in the main effect of repetition. Its size can be estimated by sampling from the posterior distribution, using the posterior function in the BayesFactor package (Morey & Rouder, 2012). The sample provides information about the mean and the 95 % credible interval of the effect, which are given in Table 2. The 95 % credible interval is the range in which the true effect size lies with a posterior probability of .95. Based on the findings from the first two experiments, we can say that the Hebb effect increases memory performance in complex span by 6–16 percentage points over 8–9 list repetitions.

Table 2 Means and 95 % credible intervals for the Hebb Effect

Full size table

Size-judgment performance

Failures to respond to a size-judgment trial were scored as errors. Response times (RTs) of correct trials only were analyzed. We estimated Bayesian linear models with the same predictors as for memory accuracy. The resulting BFs are reported in Table 1; the data are plotted in Fig. 2. In both experiments accuracy improved and RTs declined over cycles. The BFs for the main effects of repetition show that list repetition had a beneficial effect on RTs in Experiment 1, and on both accuracies and RTs in Experiment 2. Evidence for the interaction was non-existent in Experiment 1 and modest at best in Experiment 2.

Discussion

Experiments 1 and 2 established that there is a Hebb effect with complex span. Memory was better for repeated than for new lists. The beneficial effect of repetition emerged fairly rapidly – by the third cycle it was already strong – and this explains why there was only weak evidence for the interaction of repetition and the linear effect of cycle that would be expected from more gradual learning.

The repetition benefit extended to the distractor task: Size judgments were made faster, and in Experiment 2 also more accurately, in the context of repeated lists. This is a novel finding that we did not predict. Several post-hoc explanations could be offered. From the perspective of a resource theory, it could be argued that encoding and maintaining repeated lists consumes a smaller share of a limited resource, leaving more of that resource for concurrent processing. Other explanations could start from the assumption that people notice the list repetition and find the repeated lists easier to encode and maintain. In previous experiments with the Hebb paradigm, the majority of participants became aware of the list repetitions at some point during the experiment (McKelvie, 1987; Sechler & Watkins, 1991). McCabe (2010) has shown that merely anticipating an easier memory task leads people to respond faster to a concurrent processing task in a complex-span paradigm. When people perceive the memory task to be harder, they apparently devote more of the time in between items to further processing of the memoranda – this could involve consolidation, refreshing, covert retrieval, or elaboration – and therefore delay responding to the distractors.

Experiment 3

Before moving on to a direct comparison of the Hebb effect in simple and complex span we need to examine one possible explanation for the results of the first two experiments. It has been claimed that the Hebb effect arises primarily from learning of the output sequence, as opposed to the presented list (Cunningham, Healy, & Williams, 1984). Cunningham et al. (1984) found a Hebb effect only for lists that were initially recalled, not for lists repeatedly encoded but not recalled, suggesting that Hebb learning occurs only during recall. A later study with better control of learning times observed a robust Hebb effect also for not-recalled lists, but the effect was slightly larger for recalled lists (Oberauer & Meyer, 2009), implying that learning occurred both during encoding and recall. To the extent that the Hebb effect arises from learning during recall, our finding of a Hebb effect in Experiments 1 and 2 would be unsurprising, because in the standard complex-span paradigm that we used in those experiments, the output sequence consisted of uninterrupted recall of all list items, just like in simple span.

To test the possibility that the Hebb effect in Experiments 1 and 2 relied on learning during uninterrupted list recall, in Experiment 3 we used a variant of complex span in which the distractor episodes interrupt the output sequence instead (Lewandowsky, Duncan, & Brown, 2004): Recall of each item was preceded by a brief series of distractor operations. If Hebb learning occurred primarily during output, and if distractors disrupt the formation of associations between list items that support the Hebb effect, then we might expect the Hebb effect to disappear in Experiment 3.