Introduction

Efficiently searching for a target object in a cluttered environment (e.g., looking for your old friend at a train station) is a fundamental skill of the human visual system (Ma et al., 2011; Najemnik & Geisler, 2005; Wolfe, 2003; Zhang et al., 2021). To achieve optimal search performance, it is necessary not to frequently re-visit the locations that have been previously visited (Gilchrist & Harvey, 2000; Najemnik & Geisler, 2005). Posner and Cohen (1984) demonstrated that shortly after attention has moved away from a location, manual responses and eye movements to stimuli appearing at the previously attended loci were delayed compared with those at new locations. They proposed that such an inhibition effect encourages the orienting of attention towards novel items to maximize sampling of the visual environment. Posner et al. (1985) later named this inhibition effect the inhibition of return (IOR).

Klein (1988) proposed the foraging facilitator hypothesis, which suggests that IOR facilitates performance in visual search tasks that require the serial deployment of attention by attaching inhibitory tags to inspected spatial locations and discouraging attention from being oriented back towards the already tagged locations if the search needs to continue. To test this hypothesis, Klein (1988) developed the probe-following-search paradigm, which includes a visual search task (to generate inhibitory tags) and the following probe-detection task (to measure inhibitory tags) in each trial. In visual search involving serial attentional deployment, if inhibitory tagging takes place at the inspected locations in the search display, detection of the probe stimulus appearing at any of these locations should be slower due to attention being discouraged from returning to these locations. Therefore, the hypothesis suggests longer reaction times (RTs) to the probes presented at the tagged distractor locations (i.e., the on-probe condition) than to the probes appeared at blank locations (i.e., the off-probe condition). The RT difference between these two conditions, known as the on-probe cost, provides a measure of IOR in visual search tasks. Furthermore, to control for potential influence from expectancy or forward masking on probe detection, an easy parallel search task (followed by the same probe-detection task) was employed as a control condition. In parallel search (pop-out search), the target is salient and captures attention, and is detected without sequential inspection or attentional shifts, thus not involving any inhibitory tagging. Therefore, the on-probe cost in parallel search would be smaller than that in serial search. Klein (1988) showed the predicted pattern and supported the foraging facilitator hypothesis.

However, the foraging facilitator hypothesis was challenged by two studies (Klein & Taylor, 1994; Wolfe & Pokorny, 1990), which reported failures to replicate the findings of Klein (1988). After the search task was completed in these studies, all search items were removed from the display before the detection probe appeared. Despite the finding of longer RTs to on-probes than to off-probes, there was no difference in the on-probe cost between parallel search and serial search, showing a forward masking effect instead of inhibitory tagging. This dispute remained unresolved until researchers found that an enlarged IOR in serial search could only be observed when the search array was visible at the time of the probe onset to enable marking previously attended locations (Müller & von Mühlenen, 2000; Takeda & Yagi, 2000). This conclusion is consistent with the foraging facilitator hypothesis because landmarks usually remain in a natural foraging scene. The sudden disappearance of some landmarks may indicate highly salient environmental changes requiring attentional focus instead of attentional inhibition.

According to the foraging facilitator hypothesis, IOR should be weaker in efficient search tasks than in inefficient search tasks (Klein, 1988; Wang & Klein, 2010). However, little research was conducted to investigate how IOR changes when search efficiency varies. One way to manipulate search efficiency is to employ serial search tasks of various search difficulties. However, this approach requires using different stimuli across tasks to attain various expected search efficiencies, rendering the results confounded by differences in the physical stimuli. Therefore, a better approach is to adopt a training procedure capable of gradually increasing search efficiency while keeping the stimuli identical. It is also important to note that, in Klein’s (1988) study, the stimuli differed between the serial and parallel search tasks. Therefore, it is desirable to keep the physical stimuli as similar as possible between the serial and parallel search tasks in the training procedure to achieve a better control condition.

The current study combined the probe-following-search paradigm (Klein, 1988) and a training procedure. IOR was quantified as the difference between the on-probe cost in serial search and that in parallel search. In particular, the focus was on the IOR in the target-absent condition because it is a more reliable measure. Inefficient (serial) search is executed with a serial and self-terminating strategy, checking search items one by one until the target is detected (target-present condition) or when a quitting signal reaches a threshold after all the search items with high probabilities of being the search target are exhausted (target-absent condition) (Chun & Wolfe, 1996; Wolfe, 2021). In the target-present trials, the search will be terminated on average after half of the search items have been checked, meaning that there are great chances that the probe is presented at a location previously occupied by a search item yet never searched, leading to an unreliable estimation of IOR. In contrast, IOR measurement in the target-absent trials is more reliable because all probable search items will be attended to in the search. The search-terminating threshold is adaptive according to the search requirements and the observer’s tolerance to search errors (e.g., Wolfe, 2021). Even though the search may not exhaust all items when the search target is absent, most items will be checked when a high search accuracy is reached. Therefore, in the current study, the on-probe cost difference between the serial and parallel search tasks in the target-absent trials was taken as the IOR measure.

According to the foraging facilitator hypothesis, inhibitory tagging is applied to the inspected locations to discourage attention from returning to these locations (until the search target is found), producing the IOR effect and subsequently facilitating the search performance. When visual search gradually becomes more efficient (easier) in a training procedure, visual attention may remain qualitatively serial, but less attentional engagement is involved over time. This will lead to weaker inhibitory tagging, and in turn, reduce the IOR effect. Therefore, the foraging facilitator hypothesis (inhibitory tagging) will be supported if the data show a decrease in IOR over training days as search efficiency increases.

It is important to note that the above prediction is based on the specific manipulation employed. In the current training protocol, the level of inhibitory tagging is determined by search difficulty (measured by search efficiency), so that a more efficient search is predicted to generate weaker inhibitory tagging (thus weaker IOR). This shows a negative relationship between search efficiency and IOR. In the extreme case of pop-out search (super-efficient), no sequential attentional shift is involved, thus the IOR effect should be absent (Klein, 1988). However, given that IOR is a foraging facilitator, an alternative prediction suggests that an increased IOR will facilitate search performance and as a result increase search efficiency. Contrary to the current hypothesis, this would predict a positive relationship between search efficiency and IOR. Note that testing this alternative prediction requires directly manipulating the strength of IOR. Currently, it is unclear how to manipulate IOR (either via volitional control or other variables), and in addition, the current study did not manipulate IOR directly, rendering this alternative prediction inapplicable to the current paradigm. Therefore, it is hypothesized that the IOR and search efficiency would show a negative relationship in the current study.

Experiment 1

Method

Participants

Twenty naïve volunteers (18 females and two males, Mage = 20.50 years, SEage = 0.51, all right-handed) took part in Experiment 1 after providing informed consent. Four participants were excluded from the analysis due to the absence of IOR in the target-absent condition on the first training day. All participants had normal or corrected-to-normal vision and had no previous experience with similar tasks. All procedures were approved by the Academic Committee of Department of Psychology at Soochow University.

The sample size was determined with a power analysis using MorePower (Campbell & Thompson, 2012), with α = .05, β = .2, and an effect size of η2 = 0.14 for the interaction effect in a two-way (search target presence × training days) repeated-measures analysis of variance (RM-ANOVA) on IOR. The result suggested a sample size of 16.

Apparatus

The participants were tested individually in a dimly lit and sound-attenuated room. All stimuli were presented against a gray background on an 18-inch CRT monitor (ViewSonic P220f) with a resolution of 1,024 × 768 and a refresh rate of 100 Hz (Zhang et al., 2018). The participants sat at a viewing distance of 80 cm with their chins resting on a chin rest. Stimulus presentation and response recording were controlled by Psychtoolbox-3 (Kleiner et al., 2007) in MATLAB 2011a, running on a Xeon E5-2630 PC equipped with an NVIDIA GTX750 graphics card. Responses were collected with a standard computer keyboard.

Design

The current study used a within-participants design with four factors, namely training days (days 1 to 7), search tasks (serial vs. parallel), search target presence (present vs. absent), and probe types (on-probe, off-probe, and no probe). A probe was presented at a location previously occupied by a search item in the on-probe condition and at a previously empty location in the off-probe condition. The probe was absent in the no-probe condition.

Procedure

The participants were trained for 7 consecutive days with the probe-following-search task. Each participant completed two visual search sessions on each training day, one parallel search and the other serial search. The completion order of the sessions was counterbalanced across participants but remained identical across training days for each participant. Each session consisted of six 100-trial blocks separated by short breaks, with the two sessions being separated by a 10-min break. At the beginning of each session, participants performed 10–20 practice trials. In sum, there were 1,200 trials in total on each training day. In each session (parallel or serial), there were 300 trials in each target-presence condition (target-present or target-absent). In the following probe task, out of these 300 trials, 120 were on-probe trials, 120 were off-probe trials, and 60 were no-probe trials. These large trial numbers were adopted following previous studies (Li et al., 2018; Li et al., 2017; Lin et al., 2020) because it was proposed that a trial count of 100 or more per condition is desirable for establishing reliable mean values (Vanselst & Jolicoeur, 1994).

Figure 1 illustrates the trial sequence and stimuli. Each trial began with a black fixation cross (0.4° × 0.4°, 0.08° line thickness), which was presented centrally and remained visible throughout the trial. After 1,500 ms, an array of eight search items appeared in two random quadrants out of the four quadrants of the display, with four items in each displayed quadrant. This display layout was adopted to produce substantial IORs because it has been found with a cueing paradigm that, when visual targets appeared after a long delay (800 ms) from cue onsets, a gradient of RTs was found throughout the visual field, showing a gradual change from IOR in the cued quadrant to facilitation in the diagonal quadrant (Bennett & Pratt, 2001). Each item was a black outlined square (0.45° × 0.45°, 0.08° line thickness) with a gap (0.12°) on one side. Each quadrant extended 6.1° horizontally and vertically from the fixation cross. Within each quadrant containing search items, the items were placed randomly in an imaginary grid of 3 × 3, excluding the location next to the fixation cross, with a further ± 0.525° random jitter for presentation locations within the grid. This presentation protocol resulted in a minimum distance of 1.05° between search items and a minimum distance of 2.65° between the items and the fixation (both measured center-to-center).

Fig. 1
figure 1

Stimuli in the current study. A Schematic diagram of stimuli and procedure used in Experiment 1. A visual search task was immediately followed by a probe-detection task in each trial. The dot-probe could be on a search item (on-probe), away from the search items (off-probe), or absent (no probe). B Stimuli used in the target-present and target-absent conditions in the serial and parallel search tasks in Experiment 1. They were displayed in two randomly chosen quadrants out of four (four items per quadrant). C Search layout in Experiment 2. Note that the search items and tasks were the same as those in Experiment 1, except that all the items were displayed in one cluster covering the central area of the display. In both experiments, the background was gray (shown as white in this figure for illustration purposes)

In the serial search session, the target was a black square with a gap on the top or the bottom, whereas the distractors were black squares with a gap on the left or the right. In the parallel search session, all items were squares showing gaps on the left or the right, with the targets in blue and the distractors in black. Search targets were present in half of the trials and absent in the other half. Participants were asked to press one of two assigned keys with the right index and middle fingers quickly and accurately to indicate the presence/absence of targets (key mapping counterbalanced across participants) within a response window of 6,000 ms.

In 80% of the trials, a probe stimulus was superimposed on the search display 60 ms after a manual response was given to the search display or 6,000 ms after the onset of the search display if no response was detected. The proportion of the trials containing the probe was higher than that in Klein's (1988) study (50%) to obtain more data on the probe detection performance. The probe, a red dot of 0.15° in diameter, appeared either at a location occupied by one of the search items (on-probe; 40% of the trials) or at an empty location in a quadrant containing search items (off-probe; 40% of the trials) with equal probabilities. Participants were required to press the space bar with the left index finger as quickly and accurately as they could once detecting the probe. In the on-probe and off-probe trials, the probe stimulus and the search display remained on the screen for 2,000 ms or until a manual response was made (whichever the sooner). In the remaining 20% of the trials in which the probe was absent (no probe), the search display remained in view for 600 ms.

Data analysis

For the search task, trials with incorrect search responses (2.24% of total trials) were excluded from the analysis. This very low error rate was not further analyzed, but suggested that the search-terminating threshold in the target-absent trials allowed most search items to be inspected, supporting the argument that the IOR estimation would be much more accurate in the target-absent trials than in the target-present trials. Trials with correct search responses but RTs shorter than 100 ms or more than 2.5 standard deviations (SDs) away from mean RTs (in each condition, each training day, and each individual) were also rejected (2.25% of total trials).

For the probe-detection task, data from the probe-present trials were analyzed except for those without a detection response (0.06% of total trials) and those preceded by incorrect search responses (2.17% of total trials; Müller & von Mühlenen, 2000). Furthermore, trials in which probes appeared at the exact search target locations (2.41% of total trials) were removed because attention might still be dwelling at the target locations when the probes showed up thus diminishing IOR (Wang et al., 2010). Lastly, trials with detection RTs shorter than 100 ms or more than 2.5 SDs away from mean RTs (in each condition, each training day, and each individual) were excluded as well (2.24% of total trials).

Search efficiency was defined as the number of items scanned per second and quantified by dividing the search display size (8 items) by the search RT difference between the serial and parallel search conditions. IOR was quantified by subtracting the on-probe cost in parallel search from that in serial search (Klein, 1988), i.e., (RT on-serial – RT off-serial) – (RT on-parallel – RT off-parallel). Search efficiency and IOR effects were then submitted to a two-way (search target presence × training days) RM-ANOVA. Greenhouse-Geisser corrected p values (pc) were used in case of sphericity assumption violations.

The current study employed a training protocol to gradually increase search efficiency and predicted a decrease of IOR over the training days. Therefore, it was crucial to ensure that the search task was difficult enough to engage the inhibitory tagging mechanism at the start of the training. As such, the participants who did not show the IOR effect in the target-absent condition on the first training day were excluded from the data analysis. The application of this criterion removed four participants. Nevertheless, the analysis including these four participants showed a similar pattern of results (see Online Supplementary Material (OSM)).

Results

Search performance

Mean RTs in the visual search task are shown in Fig. 2 (Panel A). Figure 3 illustrates the search efficiencies in the target-absent (Panel A) and target-present (Panel B) conditions across the training days. The RM-ANOVA on search efficiencies showed a significant main effect of search target presence, F(1, 15) = 83.44, p < .0001, partial ƞ2 = 0.85, with the search efficiency in the target-present trials (18.55 ± 1.69 items/s for M ± SE) double the search efficiency in the target-absent trials (9.38 ± 0.91 items/s), consistent with previous findings (e.g., Klein, 1988; Müller & von Mühlenen, 2000; Takeda & Yagi, 2000). The main effect of training days was also significant, F(6, 90) = 34.77 , pc < .0001, partial ƞ2 = 0.70, showing a monotonic increase in search efficiency over the training days (from 7.01 ± 0.56 items/s on the first training day to 19.32 ± 2.16 items/s on the last training day), mainly from a significant linear contrast, F(1, 15) = 38.68, p < .0001, partial ƞ2 = 0.72.

Fig. 2
figure 2

Mean search and probe detection reaction times (RTs) in Experiments 1 (top row) and 2 (bottom row). Panel A shows the search RTs as a function of training days. Panels B and C respectively show the probe detection RTs in the search target-absent and search target-present conditions against training days. Error bars indicate ±1 SEM

Fig. 3
figure 3

Inhibition of return (IOR) (ms) and search efficiency (items/s) as a function of training days in the search target-absent (A) and search target-present (B) conditions in Experiments 1 (top row) and 2 (bottom row). Error bars indicate ±1 SEM. Note that the results in the search target-absent condition are more reliable because, in the search target-present condition, there are great chances that the probe was presented at a location that was previously occupied by a search item but was never searched, leading to an unreliable estimation of IOR

The search target presence × training days interaction was also significant, F(6, 90) = 9.66, pc = .004, partial ƞ2 = 0.39, suggesting differential search efficiency improvements over time between the target-present and -absent conditions. To statistically assess this, a linear regression was performed on search efficiency over the training days for each search target presence condition (present and absent) in each participant. The regression slope would represent the speed of search efficiency improvement over time. A paired-samples t-test was then carried out to compare the regression slopes of the target-present condition with those of the target-absent condition. The result indicated that the slopes were significantly steeper in the target-present condition (2.78 ± 0.52 items/s/day) than in the target-absent condition (1.39 ± 0.19 items/s/day), t(15) = 3.36, p = .004, Cohen’s d = 0.84.

Inhibition of return (IOR) performance

Mean RTs in the probe-detection task are shown in Fig. 2 (Panels B and C). The IOR effects in the search target-absent (Panel A) and search target-present (Panel B) conditions are illustrated in Fig. 3 as a function of training days. The IOR performance was analyzed with the same 2 (search target present vs. absent) × 7 (training days) RM-ANOVA. The main effect of training days was significant, F(6, 90) = 3.16, pc = .02, partial ƞ2 = 0.17. The result did not show a significant main effect of search target presence, F(1, 15) = 1.13, p = .30, partial ƞ2 = 0.07. Furthermore, the interaction reached significance, F(6, 90) = 5.50, p < .0001, partial ƞ2 = 0.27. To further assess this interaction, separate one-way RM-ANOVAs were conducted to investigate the training effects in the target-absent and -present conditions respectively. The results did not reveal a significant difference in IOR across the training days in the target-present condition, F(6, 90) = 2.01, p = .07, partial ƞ2 = 0.12. However, a significant training effect was found in the target-absent condition, F(6, 90) = 8.93, p < .0001, partial ƞ2 = 0.37. This result was a combination of a significant linear trend (F(1, 15) = 12.46, p = .003, partial ƞ2 = 0.45) and a significant quadratic trend (F(1, 15) = 16.58, p = .001, partial ƞ2 = 0.53).

The significant quadratic trend suggests the existence of an exponential decline in IOR in the search target-absent trials across the training days. To further explore this, a linear mixed models (LMM) analysis (West et al., 2014) was carried out (using the fitlme function in MATLAB) on IOR with e to the power of training days arranged in descending order (e7, e6, e5, …e1) for the fixed effect. By-participants random intercepts and by-participants random slopes for training days were included as random effects. Two models, respectively with and without the fixed effect of training days, were compared (using the compare function in MATLAB) to determine the contribution of training days. The result was significant, χ2(1) = 14.77, p = .00013, confirming an exponential decrease of IOR across the training days.

Following the finding of the exponential decrease of IOR over training, further tests were run to establish the learning curve in IOR. With 7 days in total, running all pairwise comparisons would greatly inflate the probability of Type I errors. Therefore, following the observation of Fig. 3A, which suggested a turning point on the second day, the trend was further analyzed with two one-way RM-ANOVAs to respectively check the performance change in the first 2 days and the performance from day 2 to day 7. The first RM-ANOVA showed a significant IOR change from day 1 (35.46 ± 6.40 ms) to day 2 (5.39 ± 3.24 ms), F(1, 15) = 31.53, p < .0001, partial ƞ2 = 0.68. The second RM-ANOVA found that IOR stopped changing after day 2, F(5, 75) = 0.23, p = .95, partial ƞ2 = 0.02. This was also confirmed by a Bayesian RM-ANOVA, which showed strong evidence for the absence of any difference after day 2, BF10 = 0.05. Then, a one-sample t-test was carried out to evaluate the existence of IOR from day 2. Given that IOR did not change across days 2–7, this t-test assessed the average IOR across these days against zero. The result showed that the average IOR (7.25 ± 3.28 ms) was significant, t(15) = 2.21, p = .04, Cohen’s d = 0.55, but only anecdotally supported by a Bayesian t-test, BF10 = 1.68.

Discussion

The results show that IOR decreased when visual search efficiency improved during the training in the search target-absent condition. This follows our prediction and agrees with the foraging facilitator hypothesis because less attentional engagement was required when visual search became easier (due to the training) so that inhibitory tagging was less generated. Despite the decreasing trend over the training days, IOR stopped changing and remained stable after the second training day. This suggests that, although IOR plays an important role in foraging behavior, search performance improvement over training does not always relate to the change in IOR. This is probably due to the involvement of other factors such as top-down goals, bottom-up salience, and prior search history (see Wolfe, 2021, for a review).

Another possibility is that the discrepancy in the changes of search efficiency and IOR in fact suggests the absence of any relationship between the two in this experiment. One may argue that the decrease of IOR after the first training day was due to familiarization of the probe task (or some other procedural factors changed by the training), whereas the increase of search efficiency was driven by some other variables. However, this explanation is unlikely because of two reasons. First, as the foraging facilitator hypothesis suggests, IOR is a crucial mechanism in inefficient search. An abundant amount of studies observed the IOR effect in inefficient search (e.g., Müller & von Mühlenen, 2000; Thomas et al., 2006; Thomas & Lleras, 2009), demonstrating that IOR does discourage attention from returning to inspected locations, thereby facilitating search. It is hard to imagine that in the current study the IOR decrease during the first day had nothing to do with the search efficiency improvement. Second, the current research data do not agree with the notion that the IOR decrease in the current data was only driven by procedural training. The target-present data (top-right corner of Fig. 3) showed relatively strong and varied IOR across the training days, against the possibility of a sole explanation of procedural learning in the first training day. This is also consistent with Xu et al. (2016), who investigated the change in IOR across training days utilizing a cue-target paradigm. Despite their task being easier to learn than the current task, and that their paradigm employed more training with 384 trials per condition (240 trials in the current experiment), the IOR effect in Xu et al. (2016; Fig. 3) kept decreasing over the first 4 days. As such, it is more likely that the IOR decrease in the current experiment was due to the reduced attentional engagement that resulted from the improved search efficiency.

It is also worth noting that, despite the training, it is likely that visual search remained serial to the last training day. Firstly, the search RTs in the pre-designated serial search conditions were relatively long given the limited set size of eight in the current study. These RTs were close to or longer than the RTs in previous work on serial search (Takeda & Yagi, 2000; Wang et al., 2010). Secondly, these RTs were much longer than the RTs in the parallel search condition in the current study. Thirdly, different from the parallel search, there was always a significant RT difference between the target-present and target-absent conditions in the serial search trials across all the training days. Despite the possibility, it is unlikely that all these three observations were caused by varying intercepts of the search slope function.

In Experiment 1, the search items were displayed as two clusters in two quadrants. This differed from the stimulus layouts used in previous studies in which the search items were distributed evenly over the display (e.g., Klein, 1988; Müller & von Mühlenen, 2000). It could be argued that the current results were specific to the spatial layout used. Therefore, Experiment 2 used the traditional layout to confirm the findings of Experiment 1.

Experiment 2

Method

Participants and procedure

A new set of 14 volunteers (12 females and two males, aged 20.50 ± 0.55 years, all right-handed) with normal or corrected-to-normal vision took part in Experiment 2 for 6 consecutive days. One participant was removed due to the absence of IOR in the target-absent condition on the first day (see OSM for results including all participants). The procedure was identical to that of Experiment 1 with three exceptions. Firstly, as illustrated in Fig. 1C, the search items were presented randomly at eight of 16 possible locations defined by an imaginary 4 × 4 grid covering the center (7° × 7°) of the screen, with a further ± 0.6° random vertical and horizontal displacement. Secondly, the fixation cross was not presented during the search display and the probe display. Thirdly, a Cedrus RB-X40 response box was used to achieve a better RT resolution.

Data analysis

For the search task, trials with incorrect responses to the search targets (1.38% of total trials) and RT outliers (detected with the same criteria as in Experiment 1; 2.46% of total trials) were removed from the analysis. For the probe-detection task, only the probe-present trials were analyzed. Trials without probe-detection responses (0.04% of total trials) were excluded from the analysis, as were trials preceded by incorrect search responses (1.31% of total trials) and trials in which the probe and the search target occupied the same location (2.36% of total trials). Finally, outliers were determined with the same criteria as in Experiment 1, leading to the removal of 1.81% of the data. The statistical analysis protocol was identical to that in Experiment 1.

Results

Search performance

Figures 2 and 3 (bottom rows) illustrate the search performance. A two-way RM-ANOVA on search efficiencies was carried out with search target presence (present vs. absent) and training days (days 1–6) as factors. The main effect of search target presence was significant, F(1, 12) = 198.98, p < .0001, partial ƞ2 = 0.94, with higher search efficiency found in the target-present trials (28.35 ± 2.56 items/s) than in the target-absent trials (13.33 ± 1.65 items/s). The main effect of training days reached significance as well, F(5, 60) = 42.40 , pc < .0001, partial ƞ2 = 0.78, showing a steady increase in search efficiency over the training days and a significant linear contrast, F(1, 12) = 74.63, p < .0001, partial ƞ2 = 0.86. The search efficiency improved from 10.07 ± 1.02 items/s on the first training day to 30.08 ± 2.92 items/s on the last training day. There was also a significant search target presence × training days interaction, F(5, 60) = 10.44, pc < .0001, partial ƞ2 = 0.47. Linear regression slopes over the training days were obtained for all participants in each target-presence condition. A paired-samples t-test between the target-present and -absent conditions revealed that these two slopes differed significantly, t(12) = 6.11, p < .0001, Cohen’s d = 1.69, with a steeper slope in the target-present (5.05 ± 0.50 items/s/day) condition than in the target-absent (2.64 ± 0.48 items/s/day) condition.

IOR effects

The IOR effect as a function of training days in the search target-absent and search target-present trials are shown in Fig. 3 (bottom rows). A RM-ANOVA on IOR with factors of search target presence (present vs. absent) and training days (days 1–6) found a significant main effect of training days, F(5, 60) = 4.51 , p = .001, partial ƞ2 = 0.27, but no significant main effect of search target presence, F(1, 12) = 2.15, p = .17, partial ƞ2 = 0.15.

The search target presence × training days interaction was significant, F(5, 60) = 3.47, pc =.03, partial ƞ2 = 0.22. To follow up the interaction, two one-way RM-ANOVAs were employed to investigate the training effects in the target-absent and -present conditions respectively. Despite there being no difference across the training days in the target-present condition, F(5, 60) = .31, p = .90, partial ƞ2 = 0.03, the IOR effect significantly decreased across the training days in the target-absent condition, F(5, 60) = 9.24, p < .0001, partial ƞ2 = 0.44. This training effect received significant contributions from both linear (F(1, 12) = 17.92, p = .001, partial ƞ2 = 0.60) and quadratic contrasts (F(1, 12) = 9.43, p = .01, partial ƞ2 = 0.44). The LMM analysis on IOR confirmed a significant exponential change in IOR over days, χ2(1) = 16.42, p < .0001.

Similar to Experiment 1, further analyses were carried out. First, two one-way RM-ANOVAs checked the IOR decrease from day 1 to day 2, and the IOR decreases after day 2. The results showed a significant decrease of IOR from day 1 (16.67 ± 2.94 ms) to day 2 (0.59 ± 3.18 ms), F(1, 12) = 25.70, p < .001, partial ƞ2 = 0.68, but no significant change after that, F(4, 48) = 1.03, p = .40, partial ƞ2 = 0.08. The absence of IOR change after day 2 was also moderately supported by a Bayesian RM-ANOVA, BF10 = 0.25. The one-sample t-test on the average IOR effect (−0.11 ± 1.23 ms) from days 2 to 6 showed the absence of IOR from day 2, t(12) = .09, p = .93, Cohen’s d = 0.03, which was also moderately supported by Bayesian analysis, BF10 = 0.28.

Discussion

Experiment 2 replicated the results of Experiment 1 with a traditional search layout. This suggests that the current findings were not specific to a particular display layout. The two experiments together illustrated that IOR decreased from day 1 to day 2 in the search target-absent condition while search efficiency kept improving over all the training days. This is consistent with the notion that when search efficiency improves, attentional engagement becomes less strong, thus weaker inhibitory tags are generated.

As in Experiment 1, it is likely that visual search remained serial throughout all the training days because the search RTs were always longer in the difficult inefficient search than in the parallel search, and the inefficient search RTs were always longer in the search target-absent condition than in the target-present condition.

A key difference between the findings of the two experiments is that, from the second training day, IOR ceased to exist in Experiment 2 while it persisted at a low level throughout the remaining days in Experiment 1. This is very likely due to the usage of the traditional layout in Experiment 2, while the precise reason remains unclear. Nevertheless, the current findings might suggest that search efficiency improvement sometimes is unrelated to the change in IOR. In certain situations, search efficiency improvement may not even require the existence of IOR after sufficient training. Given that inhibitory tagging is considered important in facilitating foraging behavior, this suggestion is intriguing.

General discussion

The foraging facilitator hypothesis theorizes inhibitory tagging at previously inspected locations as a mechanism to optimize searching performance by prioritizing novel locations during visual search. The more challenging a visual search task is, the more serial attentional deployment is involved, yielding stronger inhibitory tagging (i.e., stronger IOR). Consistent with the foraging facilitator hypothesis, previous studies found that the probe detection RTs for previously attended locations were longer than those for new locations (i.e., the presence of the on-probe cost/IOR) and that IOR was stronger in the serial than parallel search (Klein, 1988; Müller & von Mühlenen, 2000; Takeda & Yagi, 2000; Wang et al., 2010). The current study went beyond the comparison between these two search types and depicted how IOR (the measure of inhibitory tagging) changed when search efficiency (the indicator of search difficulty) gradually improved. Without changing the physical stimuli, task difficulty was gradually reduced with the long-term training protocols. Both experiments observed IOR on the first training day, replicating previous findings (e.g., Klein, 1988; Müller & von Mühlenen, 2000; Takeda & Yagi, 2000). More importantly, it was consistently found that, while search efficiency kept improving across the training days, the IOR magnitude decreased until the second training day but ceased to change on the following days. The absence of IOR changes from the second training day was not expected. This is an intriguing finding showing that search performance improvement does not always lead to the change in IOR. Fully understanding this effect requires further research. Nevertheless, the negative relationshipFootnote 1 between search efficiency and IOR observed in both experiments supports the foraging facilitator hypothesis that IOR facilitates search performance via stronger inhibitory tagging in more difficult visual search.

One may argue that the current training protocol could have affected IOR through procedural training in the probe task, independent from the intended search training, so that the search efficiency data and the IOR data were not related. From the theoretical perspective, although the two tasks could potentially be separately trained, the performance in the probe task (i.e., IOR) is closely related to the performance in the search task (i.e., search efficiency) because the probe task is designed to disclose the attentional deployment during visual search at the time of probe onset. Furthermore, despite the fact that the probe detection RTs decreased over the training days (Fig. 2), it is unlikely that IOR quickly diminished solely due to procedural learning in the probe task, given that the search target-present condition in Experiment 1 (Fig. 3, top-right corner) showed IOR (and variability in IOR) over all the training days. Therefore, we believe that the relationship pattern between search efficiency and IOR found in the current study is intriguing and worth further research. To address the training effect on IOR, future studies could reduce or remove the probe task trials between the starting and ending days of the training.

It is worth noting that the current research did not use the slope of the set-size function in the serial search condition to assess search efficiency, as was usually reported in the visual search literature (e.g., Klein, 1988), because only a single set size (eight items) was used in the current study. Instead, search efficiency was quantified by dividing the difference in RTs between the serial search and the parallel search conditions by the number of search items. In the current approach, parallel search was used as a control condition to approximate the search efficiency measure in serial search. However, this approach might not be as accurate as the slope method. Furthermore, the current approach also raised the possibility that the gradual decrease in the serial search RTs was partially caused by the reduction of the intercept instead of the slope.Footnote 2 Further studies are needed to separate the slope decrease and the intercept decrease, and to corroborate the approach adopted in the current study.

Unexpectedly, a discrepancy between search efficiency and IOR was found. Search efficiency continued improving over the training days in both experiments, whereas IOR only decreased from day 1 to day 2 and remained unchanged since day 2. This discrepancy might suggest that, unlike search efficiency which has more capacity for improvement, IOR changes could be more limited and more easily reach the floor. Indeed, target-distractor similarity in perception is reduced via training due to the heightened perceptual saliency of the target (Qu et al., 2017). With a higher perceptual saliency, the target tends to pop out more along with the training, leading to a shorter dwell time for each item (Becker, 2011), reduced inhibitory tagging (i.e., reduced IOR), and enhanced search efficiency. However, IOR is not the only factor contributing to search efficiency improvement. Visual search is guided by many factors such as top-down goals, bottom-up salience, and prior scene/search history (see Wolfe, 2021, for a review) in a winner-take-all fashion via a priority map where the contribution weights of these factors are flexibly adjusted (Fecteau & Munoz, 2006; Zelinsky & Bisley, 2015). In the current study, apparently IOR was not the main contributor to the performance improvement from the second training day. Visual search involves many other cognitive processes, such as sensory processing, attentional orienting, target discrimination, response preparation, and selection (Nakayama & Martini, 2011), all of which could be enhanced during training (Clark et al., 2015). In the current study, it is likely that some or all of these processes kept improving after IOR reached the floor, and interestingly, maintained the linear trend of search efficiency improvement.

The data also showed a difference in the IOR magnitudes between the two experiments. IOR was stronger in Experiment 1 than Experiment 2 (not statistically assessed because it was not part of the research questions). In particular, while IOR was present even after hitting the floor (day 2) in Experiment 1, IOR was absent since day 2 in Experiment 2. This agrees with our reasoning in the design phase, i.e., that the four-quadrant display layout should be able to generate stronger IOR than the traditional display layout. More importantly, Experiment 2 yielded higher search efficiency and weaker IOR than Experiment 1, again consistent with our hypothesis. This between-experiment difference might be due to the fast decay of the inhibitory tags. The weaker the inhibitory tags, the shorter the dwell time of the inhibitory tags. If this is plausible, in Experiment 2, the inhibitory tags might have diminished when the experiment measured IOR with the probe. This hypothesis requires further research. Nevertheless, the results from Experiment 2 clearly showed that, in contrast to common belief, serial search does not necessarily generate inhibitory tags. It is speculated that the inhibitory tagging process is only active when the serial search task’s difficulty is above a threshold, with some variability across different tasks. When the search is not highly demanding, inhibitory tagging may not be fully engaged or may decay fast.

The absence of IOR from day 2 in Experiment 2 may also be partially attributed to the probability of target presence in the post-search probe-detection task. In the current study, the probe was present in 80% of trials, rather than 50% as in Klein (1988). Given that detecting the probe means to attend to previously searched locations, the high probability of probe appearance in the current study could encourage attention to return to the previously inspected locations, thus counteracting the IOR effect. This suggestion is supported by the finding that IOR was eliminated when the target appeared more frequently at a previously inspected location (Farrell et al., 2010). This counteracting effect might be less of an issue on the first training day because the search task was demanding and yielding a strong IOR. However, after the first training day, the participants became more proficient with the task, therefore could possibly afford to divert more resource from the search task to the probe-detection task and allow more attentional revisiting of the searched locations due to the higher prevalence of probes appearing at the inspected locations.

Notwithstanding the discussion of various possibilities above (which will require further investigation), the current results made an important addition to the literature on the foraging facilitator hypothesis by suggesting that inhibitory tagging (as a result of attentional engagement) is an adaptive mechanism of the visual system. Specifically, the magnitude of IOR was reduced, and sometimes fully diminished, when search efficiency improved (weakening attentional engagement) during the training. Under high search difficulty, when inhibitory tags could significantly improve search efficiency, they were employed; under decreased search difficulty, when there was little advantage in employing them, they did not fully engage or decayed quickly. Overall, we have shown how IOR varied during long-term training that improved search efficiency. The current findings support the foraging facilitator hypothesis that IOR improves search efficiency via stronger inhibitory tagging in more difficult search.