Ensemble perception without attention depends upon attentional control settings

Chen, Zhimin; Zhuang, Ran; Wang, Xiaolin; Ren, Yanju; Abrams, Richard A.

doi:10.3758/s13414-020-02067-2

Ensemble perception without attention depends upon attentional control settings

Published: 27 May 2020

Volume 83, pages 1240–1250, (2021)
Cite this article

Download PDF

Attention, Perception, & Psychophysics Aims and scope Submit manuscript

Ensemble perception without attention depends upon attentional control settings

Download PDF

Zhimin Chen¹,
Ran Zhuang¹,
Xiaolin Wang¹,
Yanju Ren ORCID: orcid.org/0000-0002-8776-8711¹ &
…
Richard A. Abrams²

1761 Accesses
6 Citations
Explore all metrics

Abstract

People are able to rapidly extract summary statistical information about common patterns, or ensembles, that may exist in a scene, such as repeated textures or colors. Here we examined the extent to which such an ensemble perception can occur in the absence of focal visual attention using a method that has some advantages over methods previously used to study the issue. In particular, we assessed the extent to which ensembles can be processed without attention by measuring the indirect effect of a to-be-ignored ensemble on judgments of an attended ensemble. The results show that ensembles outside the focus of attention do influence judgments of attended ensembles when the to-be-ignored ensemble contains summary statistics that match a sought-for target category. Thus, an attentional control setting for specific summary statistical information permits the processing of ensembles outside of focal attention, facilitating the rapid perception of visual scenes.

Comparing explicit and implicit ensemble perception: 3 stimulus variables and 3 presentation modes

Article Open access 11 October 2023

Emotional judgments of scenes are influenced by unintentional averaging

Article Open access 11 June 2020

Global and local interference effects in ensemble encoding are best explained by interactions between summary representations of the mean and the range

Article Open access 27 January 2021

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

When we first view a scene, the visual system rapidly extracts information about common patterns that may exist in the scene such as repeated colors or textures. The representation of such patterns is referred to as “summary statistical information” or “ensemble perception,” and is thought to play a critical role in the perception of natural scenes (Brady, Shafer-Skelton, & Alvarez, 2017). In particular, the ability to rapidly extract summary statistical information from a scene may reduce the burden of processing that would be otherwise needed by limited-capacity attentional and cognitive mechanisms. Nevertheless, despite evidence of rapid extraction of statistical information (e.g., Ariely, 2001; Chong & Treisman, 2003; Haberman & Whitney, 2007; also see review by Whitney & Yamanashi, 2018), it is still unclear whether ensemble perception can occur in the absence of visual attention. Resolving the issue has important implications for the understanding of perception more generally, and is the focus of the present paper.

Several recent studies have examined the extent to which ensemble perception could be carried out without attention, and they have yielded mixed results. On the one hand are studies that suggest that ensemble information can be processed without attention, or with only minimal attention. For example, Alvarez and Oliva (2008) asked participants to track a set of moving objects while ignoring a set of moving distractors. Although the to-be-ignored distractors were well outside the focus of attention, participants were still able to extract accurate summary statistics specifying their center of mass. Similarly, participants in Alvarez and Oliva (2009) were able to detect changes in an unattended background pattern more effectively when the change produced a different ensemble structure (compared to equivalent local changes that did not alter the summary statistics), suggesting reduced attentional demands for ensemble perception. In another study, Bronfman, Brezis, Jacobson, and Usher (2014) found that participants could report the diversity of colors contained in objects outside of focal attention with no cost to the performance of their primary task, which required focal attention elsewhere in the display, showing that color diversity, even outside focal attention, could be perceived automatically (see also Ward, Bear, & Scholl, 2016). Several other studies have also reached similar conclusions involving summary statistical information of other global visual attributes, such as circle size (Chong & Treisman, 2005) and gabor patch orientation (Alvarez & Oliva, 2009).

On the other hand, some studies have shown that ensemble perception does incur an attentional cost. Jackson-Nielsen, Cohen, and Pitts (2017) found that participants had no information about color diversity, size diversity, or the mean size of elements outside the focus of attention. Huang (2015) had participants make judgments about either individual visual features or summary statistics of stimuli presented sometimes in unexpected locations. He found that judgments of summary statistics benefited just as much from a spatial precue (which permitted focal attention) as did judgments of an individual feature. These findings suggest that ensemble perception is indeed attention-demanding, and cannot be accomplished in the absence of attention.

One reason for the conflicting results may be that many of the studies that have provided evidence for attention-free ensemble perception have employed dual-task paradigms in which participants perform a primary task with high attentional demands and are then probed on a secondary task about summary statistical information for unattended elements in the display (e.g., Alvarez & Oliva, 2008, 2009; Bronfman et al., 2014; Ward et al., 2016). Such paradigms leave open the possibility that participants may have allocated some attentional resources to the secondary (ensemble) task, rendering these experiments imperfect tests of the attentional demands of ensemble processing. However, studies that have revealed attentional costs of ensemble perception have used different methods. For example, Jackson-Nielsen et al. (2017) employed an inattentional blindness paradigm in which participants performed a focal task for several trials and then received an unexpected query regarding unattended elements after one of the trials. In that study, the participants did not have the motivation to allocate any attention beyond what was required of the focal task, and indeed there was no evidence of ensemble perception for unattended parts of the display. In the study by Huang (2015), focal attention was contrasted with divided attention across trials – with no incentive for participants to divide their attention on the focal attention trials. The results also showed that ensemble perception relies on focal attention.

The studies by Jackson-Nielsen et al. (2017) and Huang (2015) suggest that attention may be necessary for ensemble perception; however, those studies also suffer from a potential weakness. In particular, correct responses regarding the ensemble summary statistics in those experiments required the participants to successfully remember details of the ensemble in order to correctly respond. Thus, any failures of memory for the ensemble might be incorrectly assumed to reflect the absence of ensemble perception itself (Chen & Wyble, 2016; Jiang, Shupe, Swallow, & Tan, 2016; but see Ward and Scholl, 2015a, b). As a result, it is still unclear whether ensemble perception does or does not require attention.

In order to address this question, here we adopted a method that permits the assessment of ensemble perception without any memory requirement and without any motivation for the participant to attend to irrelevant portions of the display. The method is based on that used by Gronau and Izoutcheev (2017), who studied a similar question regarding the extent to which attention is required for the perception of scenes. The method also bears some similarity to methods used by others in which the processing of an unattended stimulus is assessed by examining its (indirect) effect on the processing of an attended stimulus (e.g., Du & Abrams, 2008, 2012; Eriksen & Eriksen, 1974; Gaspelin, Ruthruff, & Jung, 2014; Theeuwes, 1992, 1994, 2010). Here we use the method to examine ensemble perception. In the critical experiment reported below, participants were required to attend to one region of a display that contained the relevant stimulus and ignore another region that contained a distractor. Our goal was to determine the extent to which ensemble information about the distractor (outside of attention) was processed – but the distractors were not probed by explicit report. Instead, the processing of the unattended distractor was inferred on the basis of interference or facilitation caused by the distractor on evaluation of the relevant, attended stimulus. Thus, the method indirectly assesses the extent to which ensemble information is processed outside the focus of attention without any motivation for participants to attend to the critical (distractor) stimulus and without any requirement that features of the unattended stimulus be retained in memory.

Overview of experiments

We report two experiments below. In the first experiment, participants were briefly exposed to two ensemble stimuli (consisting of clusters of lines) and were asked to determine the presence of an ensemble that matched a pre-specified target category (vertical, horizontal, or oblique line orientations). Our interest was to determine whether performance would be facilitated when the two stimuli were from the same category. If such facilitation does occur, that would show that shared ensemble category membership can affect ensemble judgments when both ensemble stimuli are attended. To anticipate the results, such facilitation did occur. Then, in Experiment 2 our goal was to seek evidence for the same facilitation when one of the ensembles is outside the focus of attention. Such an effect there would indicate that ensemble perception can take place in the absence of attention.

Experiment 1

In this experiment two ensemble stimuli were briefly presented. The stimuli consisted of clusters of lines that were mostly vertical (the “vertical” category), mostly horizontal (“horizontal”), or mostly oblique (“oblique”; similar to stimuli used by others, e.g., Huang, 2015). Participants were to decide whether either ensemble was from a pre-specified target category. Our interest was to determine whether the decision was influenced by the extent to which both ensembles did or did not share the same category. Such a result would serve as an important prerequisite for the test conducted in Experiment 2, in which some ensembles were presented outside the focus of attention.

Method

Participants

The sample size here and in Experiment 2 was based on the study by Gronau and Izoutcheev (2017), who used a similar paradigm. The sample of 18 participants in their Experiment 1 yielded a medium effect size (partial eta squared = 0.66) when stimuli were fully attended. In order to enhance the power of the present experiment, 24 undergraduate students (13 females, 11 males, age 19–22 years) with normal or corrected-to-normal vision participated. They were paid 15 RMB (equivalent to about $2.14) for their participation.

Apparatus and procedure

Stimuli were presented on a 17-in. CRT with an 85-Hz refresh rate viewed from a distance of 57 cm, on a gray background. The sequence of events on each trial is shown in Fig. 1. At the beginning of each trial, participants attended to a red fixation cross (.8° × .8°) presented at the center of the screen. After 600 ms, two ensemble stimuli were presented – one above and one below fixation – for 47 ms. The ensembles were followed by a 129-ms pseudonoise pattern mask and then a 1,082-ms blank screen. Participants were to press one key on the computer keyboard as quickly as possible if either ensemble was a member of the pre-specified target category, and another key if neither ensemble was a member. Trials without responses by the end of the blank interval were considered errors.

The ensembles were selected from one of three categories: vertical, horizontal, or oblique, with one category designated in advance as the target category. Each stimulus subtended 9.3° by 9.3° and consisted of 16 black line segments (1.2° × .3°) arranged in a 4 × 4 grid in which a randomly selected 12 lines matched the category designation and the other four lines had orientations selected randomly from the other categories. Ensembles were centered approximately 5° above and below fixation. Depending on the particular line orientations, rows within an ensemble were between 1.2° and 2° apart with the space between ensembles at least 2°.

For both the target-present and target-absent trials, the two ensembles were on some trials from the same category, whereas on other trials, the categories differed. Figure 2 shows examples of the different trial types when the target category was horizontal. On same-category target-present trials, both ensembles were from the same (pre-specified target) category (horizontal in the example). On different-category target-present trials, one ensemble was from the target category and the other was from one of the other categories. Finally, on target-absent trials, the two ensembles could be either from the same category or a different category, but never included ensembles from the target category.

Design

The experiment contained 180 target-present trials and 240 target-absent trials. For the target-present trials, one-third (60 trials) contained two ensembles from the target category (e.g., both horizontal when horizontal is the target category), and two-thirds contained one ensemble from the target category and one ensemble from one of the other two categories (60 trials for each of the possible non-target categories; e.g., one horizontal and one vertical, or one horizontal and one oblique). For the target-absent trials, one-half of them (120 trials) contained two ensembles from the same category (60 for each of the two non-target categories; e.g., both vertical or both oblique), while the other half (120 trials) contained one ensemble from each of the two non-target categories (e.g., one vertical and one oblique). Each of the three ensemble categories served as the target category for one-third of the participants. When the target category was present, it was equally likely to appear above or below fixation. When one or two oblique ensembles were in the display, all oblique lines had the same orientation. Trials were presented in a random order. At the beginning of the session, participants completed a practice block of 42 trials (with trial types in the same proportions as in the formal testing) that was not included in the analysis. Two prospective participants were replaced because they were unable to achieve 80% accuracy in the practice block.

Results

Trials with errors and those with reaction times more than three standard deviations above or below each participant’s mean in each experimental condition were excluded from analysis. Mean reaction times are shown in Fig. 3. We conducted a target-presence (present or absent) by category relation (same-category or different-category) ANOVA. Reaction times were faster for target-present than target-absent judgments, F(1, 23) =49.19, p < .001, η_p² = .68. Reaction times were also faster when both ensembles were from the same category, F(1, 23) = 104.74, p < .001, η_p² = .82. The effect of category relation was greater for target-present than for target-absent trials, F(1, 23) =9.25, p = .006, η_p² = .29. Importantly, follow-up contrasts showed that the category relation effect was significant not only for target-present trials, F(1, 23) = 131.27, p < .001, η_p² = .85, but also for target-absent trials, F(1, 23) = 39.05, p < .001, η_p² = .63.

Accuracy rates are shown in Table 1. Participants were more accurate when the two ensembles were from the same category, F(1, 23) = 40.87, p < .001, η_p² = .64, matching the effect in reaction times and ruling out a speed-accuracy tradeoff. There was no overall difference between target-present and target-absent trials, F(1, 23) = .01, p = .90, η_p² = .001, but the effect of category relation was greater for the target-present trials, as revealed by an interaction between the two factors, F(1, 23) = 24.22, p < .001, η_p² = .51. Follow-up contrasts showed that the category relation effect was significant not only for target-present trials, F(1, 23) = 49.53, p < .001, η_p² = .68, but also for target-absent trials, F(1, 23) = 4.76, p = .040, η_p² = .17.

Table 1 Mean accuracy rates (proportion correct) from Experiment 1. Standard errors are shown in parentheses

Full size table

Discussion

In this experiment, participants attended to two ensembles of lines, searching for the presence of a pre-specified category. When the target was present in both ensembles (target-present, same-category trials) participants were faster than when the target was present in only one ensemble (target-present, different-category). This occurred presumably in part because assessment of the orientation of either ensemble would lead to a “present” response. More importantly, there was also a same-category advantage on the target-absent trials despite the fact that target-absent trials always required both stimulus ensembles to be inspected prior to an “absent” response. This result shows that when an entire scene is attended, ensemble perception is influenced by the ensemble category relations present in the scene. While the source of that result could lead to insight into ensemble perception, doing so was not our objective.^{Footnote 1} Most importantly, it serves as an important pre-requisite for Experiment 2, in which we examined the possibility that ensemble category membership can influence ensemble perception outside the focus of attention.

Experiment 2

Experiment 1 revealed a same-category advantage: when both stimuli in the display were from the same ensemble category, the stimuli were processed more quickly. Because that experiment required participants to indicate if a target category was present anywhere in the display, presumably both elements were attended there. Here we repeated the experiment with one important difference: we cued one of the ensemble stimulus locations in advance, and asked subjects to report only whether the stimulus in the cued location matched the pre-specified target category. As a result, the other ensemble stimulus was a distractor – outside the scope of spatial attention. If ensemble statistics are perceived automatically and without attention, then the ensemble category of the unattended distractor stimulus would be expected to influence responses here and reveal a same-category advantage for judgments, as in Experiment 1. Alternatively, if attention is required for ensemble perception then there should be no effect of the category relation between the attended and unattended ensembles on judgments of the attended ensemble. Importantly, the task measures the processing of the distractor ensemble when there is no motivation for participants to split their attention between the two stimuli – the distractor is completely irrelevant to the task. Additionally, there is no requirement that participants remember anything about the distractor in order for us to determine that ensemble information about the distractor was processed.