Studies of visuospatial attention in two-dimensional (2-D) space have shown that when attention is oriented to a peripheral cue, if the stimulus onset asynchrony between the cue and target is less than 300 ms, processing of a target appearing at a precued peripheral location is facilitated (as compared with the response to a target appearing at a novel location). A brief period of facilitation is followed by a long-lasting inhibitory effect in which there is a delayed response to stimuli presented at the previously cued location. This long-lasting inhibitory effect is called “inhibition of return” (IOR; Posner & Cohen, 1984; see Klein, 2000).

Studies of visuospatial attention have primarily focused on 2-D space (Corbetta, Kincade, Lewis, Snyder, & Sapir, 2005; Corbetta, Kincade, Ollinger, McAvoy, & Shulman, 2000; Corbetta, Patel, & Shulman, 2008; Corbetta & Shulman, 2002; Lupiáñez et al., 2004; Mayer, Seidenberg, Dorflinger, & Rao, 2004; Spence, Lloyd, McGlone, Nicholls, & Driver, 2000; Zhang, Zhou, & Zhang, 2012); however, though these 2-D spatial representations show us how attentional orienting responds to the directions of objects, they tell us little about how it is affected by objects’ distances. Humans live in a three-dimensional (3-D) world, and the third dimension, depth, is a neglected topic in cognitive research, despite the fact that it is crucially important for understanding how we guide our actions. Previous studies have shown that an essential component of primate visual function is the ability to extract and represent 3-D shape and depth from information in 2-D retinal images (Hayman, Verriotis, Jovalekic, Fenton, & Jeffery, 2011; Janssen, Vogels, & Orban, 2000; Sereno, Trinath, Augath, & Logothetis, 2002). Therefore, attentional orienting/reorienting in 3-D space is particularly important for guiding our behavior. Although the ability to extract structures and obtain information about the spatial position of objects from 3-D space has been well-documented, the question of how attentional orienting/reorienting among these reconstructed objects is performed remains controversial.

Previous studies have documented evidence of the mechanisms underlying the facilitatory effect of attentional orienting in 3-D space (Andersen & Kramer, 1993; Chen, Weidner, Vossel, Weiss, & Fink, 2012; de Gonzaga Gawryszewski, Riggio, Rizzolatti, & Umiltà, 1987; Downing & Pinker, 1985; Maringelli, McCarthy, Steed, Slater, & Umiltà, 2001). Behaviorally, they showed that fewer benefits exist for shifting attention from near to far locations than for shifting from far to near locations. It was thus proposed that attentional resources are allocated in depth in a viewer-centered fashion, with the maximum processing resources being allocated to stimuli near the observer’s body (de Gonzaga Gawryszewski et al., 1987; Downing & Pinker, 1985; Maringelli et al., 2001). Furthermore, to survive everyday life while navigating through the 3-D world around us, one of the most important behaviors from a biological/ecological perspective involves the rapid redirection of our attention to potentially threatening or rewarding stimuli that unexpectedly invade spaces closer to us. The depth of a potentially threatening/rewarding stimulus relative to the observer is crucial: Stimuli that unexpectedly approach observers instantaneously demand a shift of attention toward them in order to secure survival. In contrast, stimuli far from observers may be less attention-demanding (de Gonzaga Gawryszewski et al., 1987; Downing & Pinker, 1985; Graziano & Cooke, 2006; Previc, 1998). Similarly, in a recent study Chen et al. (2012) constructed a virtual 3-D world by presenting targets either close to or far from participants, in an adapted version of the Posner spatial-cueing paradigm, to investigate the mechanisms underlying visuospatial orienting/reorienting in depth during the earlier, facilitatory phase. They found that attentional reorienting to objects unexpectedly presented closer to observers in the unattended hemispace was faster than reorienting to unexpected objects farther away.

The biphasic hypothesis that attentional orienting/reorienting consists of an early facilitation followed by a later inhibition has been criticized in several studies (Berlucchi, Chelazzi, & Tassinari, 2000; Cheal & Chastain, 2002; Chica & Lupiáñez, 2009; Chica, Lupiáñez, & Bartolomeo, 2006; Kalogeropopulou, Woodruff, & Vivas, 2015; Klein, 2000; Maruff, Yucel, Danckert, Stuart, & Currie, 1999; Wright & Richard, 1996). These researchers suggested that inhibition and facilitation might be two simultaneous, independent processes that overlap in time (Berlucchi et al., 2000; Kalogeropopulou et al., 2015). In addition, when peripheral cues are spatially predictive of target appearance at their same spatial location, an initial facilitation at the cued location is behaviorally measured, whereas no IOR is behaviorally observed (Chica & Lupiáñez, 2009; Chica et al., 2006). Therefore, typically, when peripheral cues are spatially nonpredictive of target appearance within the context of the spatial cueing/IOR tasks, it is widely accepted that the description of a facilitation and an inhibition is done in accordance with the biphasic hypothesis (Chica et al., 2006; Lupiáñez et al., 1997). Currently, in 3-D space, the facilitatory phase has been well established, yet it remains unclear how visuospatial attention orients/reorients in depth during the later inhibitory phase (IOR). To the best of our knowledge, only two relevant studies have investigated the spatial IOR effect in 3-D space. Theeuwes and Pratt (2003) used a variant to the classic Posner (1980) cueing task presented in 3-D environment and found that responses were equally slowed to targets at two different depth locations, so long as the targets shared the same cued location in the horizontal and vertical dimensions. Theeuwes and Pratt suggested that if a location in the horizontal (xy) plane is inhibited, the depth (z) plane in front of and behind the cued location is inhibited, as well. In other words, IOR appears to spread across depth, and thus is not depth-specific. A subsequent study (Bourke, Partridge & Pollux, 2006) largely followed that prior work, in that stimuli were presented at different horizontal locations and two different apparent depths through the use of stereoscopic disparity (±15 min of arc), but Bourke et al. increased the horizontal separation of stimuli that had virtually overlapped in the original study. Bourke et al. suggested that a small spatial IOR effect could be elicited by increasing the horizontal separation of stimuli that had virtually overlapped in the near and far depth planes. However, in that study they could not preclude the interpretation that the smaller IOR sizes were due to the two stimuli that shared the same position but different depths also having different horizontal distances, thus causing attentional orienting/reorienting in the horizontal plane to confound the results.

With regard to attentional reorienting theory, Klein (2000) suggested that IOR effects are observed only when attention disengages from the cued location. In spatial-cueing/IOR tasks, if there is no central cue (second cue), attention would not tend to disengage from the cued location. Therefore, the second cue location is rather important in spatial-cueing/IOR tasks. However, in the two prior studies, the second cue location (the fixation location) made the trajectory over which attention oriented/reoriented in depth nonstraight (e.g., in the uncued conditions, the left location in the near depth plane was cued, then the central fixation was cued to attract attention away from the previously cued location, and the target was presented at the left location, but in the far depth plane). Previous evidence has suggested that attention reorients mostly along a straight trajectory in the 2-D world. In comparison to a straight trajectory, the distance is larger when attention is reoriented along a nonstraight trajectory to the “novelty” stimuli, thus resulting in a slowing down of the speed toward the “novelty” stimuli, and a reduction in the amount of IOR (Bennett & Pratt, 2001). Furthermore, attentional reorienting to fixation results in a significant reduction in the inhibitory effect (Pratt, O’Donnell, & Morgan, 2000). Therefore, the null inhibitory effect in depth might have been caused by the paradigm in which the second cue was always the central fixation.

Whereas prior studies have focused largely on whether an IOR effect occurs when attention is oriented/reoriented in depth (Bourke et al., 2006; Theeuwes & Pratt, 2003), in the present study we not only investigated whether the IOR effect is depth-specific, but also explored whether IOR effects were the same or different when attention was oriented/reoriented along different trajectories in 3-D space. As we know, IOR is considered to prevent attention or eye movement toward a previously examined location in order to improve the efficiency of visual search (Klein, 2000; Tipper et al., 1994). Given the characteristics of IOR in visual searches, together with the fact that we live in a real, 3-D world, it is necessary to investigate the spatial distribution of IOR in different hemispaces and at different depths in 3-D space. Chen et al. (2012) constructed a virtual 3-D world in an adapted version of the Posner spatial-cueing paradigm, investigating the facilitatory mechanisms underlying attention orienting/reorienting at different trajectories in 3-D space. They found that when attention was oriented/reoriented to the hemispace the cue pointed to in the valid depth condition, reaction times (RTs) were longer—similarly to when attention was oriented/reoriented in the uncued depth plane and to the spatial location in the opposed hemispace. However, although the same spatial distance was involved, attentional reorienting to objects unexpectedly appearing closer to the observer in the unattended hemispace was faster than reorienting to unexpected objects that were farther away. Furthermore, when attention was oriented/reoriented in the uncued depth plane but at the position in the same hemispace as the cue, RTs were shorter. These results suggest that the facilitatory effects are different when attention is oriented/reoriented along different trajectories in 3-D space. Therefore, in the present study, it was important not only to characterize the direction-specific mechanisms through which visuospatial attention is directed toward stimuli that require different trajectories in the later, inhibition phase, but also to determine whether the IOR effects are the same or different when attention is oriented/reoriented in different trajectories in 3-D space.

In Experiment 1, the Posner exogenous cueing paradigm was adapted to a virtual 3-D world, presenting targets either near to or far from participants and manipulating the validity of the cues to construct different attentional trajectories. For example, one cue would require attention to orient/reorient in the horizontal plane (x-axis), and another would require orienting/reorienting in the depth plane (z-axis), in order to avoid the problems experienced in the previous two studies; a third cue would require orienting/reorienting in both the horizontal and depth planes (x- and z-axes). If IOR is depth-specific, attention could effectively orient/reorient in the depth plane (z-axis), resulting in RTs to targets at cued locations being longer than the RTs to targets at uncued locations. In contrast, if IOR is depth-blind, attention could not effectively orient/reorient in the depth plane (z-axis), resulting in no difference between the RTs to targets at cued and uncued depth locations. In addition, the ecological importance of the 3-D world would influence the attentional orienting/reorienting, resulting in an interaction between attentional trajectory and target depth. Although the same set of 3-D objects (placeholders and targets) were used for both the near and far depth planes in Experiment 1, the retinal sizes of the objects were different in the near and far depth planes, a situation that could confound our results. To rule out this possibility, we conducted Experiment 2, a 2-D control experiment in which a group of new participants performed the same attentional orienting/reorienting task described in Experiment 1. However, the task was performed under 2-D conditions in which the retinal sizes and retinal positions of the objects were identical to those in Experiment 1, though the depth dimension was removed. Hence, if Experiment 2 were to exhibit the same pattern of results as Experiment 1, the differential retinal images of objects, rather than their reorienting in depth, could have caused the results. In contrast, if the patterns of results differed between the two experiments, the results could most likely be attributed to the depth reorientation per se.

Method

Participants

Twenty-five participants (16 women, nine men; age: 22 ± 1.8 years) participated in Experiment 1, and 25 participants (18 women, seven men; age: 21 ± 2.5 years) participated in Experiment 2. All participants were right-handed and had normal color vision and visual acuity, and none had a history of neurological or psychiatric disorders. All were paid for their participations after the experiment. This study was approved by the Academic Committee of the Department of Psychology, Soochow University, China.

Apparatus, stimuli, and experimental setup

In Experiment 1, all stimuli were presented on a 27-in. ASUS 3-D monitor driven by an Nvidia GeForce FX 5200 graphics card. A pair of Nvidia 3-D shutter glasses synchronized with the monitor provided two separate stereoscopic displays to each of the two eyes, with a resolution of 800 (horizontal) × 600 (vertical) pixels at a refresh rate of 60 Hz. All 3-D objects were presented on a black background that was custom-made using computer scripts (Presentation software package; Neurobehavioral Systems, Albany, CA).

The default virtual display in each trial consisted of three white placeholders in the near depth (sagittal) plane, three white placeholders in the mid depth plane, and three white placeholders in the far depth plane; thus, nine spatial locations in the virtual 3-D space were marked (see Fig. 1). The 3-D objects in the far depth plane appeared behind the monitor screen, whereas the 3-D objects in the near depth plane popped out of the monitor screen. Objects in the mid plane appeared to be on the screen surface, equal distances from the near and far depth planes. The different target distances were simulated by adjusting the binocular disparity. The binocular disparity between the near and far depth planes was ±52.40 min of arc, relative to the fusion plane at which the central location in mid plane was presented (zero disparity). Participants reported that they could clearly perceive both the near and far depth planes when fixating the central location in the mid plane. The horizontal spatial distances (along the x-axis) between the left and right placeholders in the near and far depth planes were matched in visual angle (16.85°); that is, the retinal distances between the spatial positions were matched for the near and far depth planes. The 3-D objects in both depth planes were of the same perceived size, resulting in slightly different retinal sizes of the near and far stimuli: 1.98° for the near and 1.52° for the far objects. To prevent the far objects from being occluded by nearer objects, the stimuli in the near and far depth planes were slightly shifted so that the vertical retinal distance between the near and far positions was 2.36° of visual angle. For half of the participants involved in the present study, the positions in the near depth plane were lower, and for the other half of participants, the near positions were higher. The target was a blue sphere, and the cue and the central cue consisted of transient flashes of one of the peripheral placeholders and one of the central placeholders, respectively.

Fig. 1
figure 1

Top view of exemplary trials of the experimental paradigm. The default visual scene consisted of nine white placeholders (three in the near depth plane, three in the midsagittal depth plane, and three in the far depth plane). The spatial distances among the three placeholders in the near, mid, and far depth planes had equal visual angles, and the same 3-D objects were presented in all three depth planes. All participants reported seeing the same-sized objects in the three depth planes because of the size–distance constancy effect (Boring, 1964). To prevent far objects from being occluded by near objects, the vertical distance between positions in the near and far depth planes was slightly shifted by 2.36°. The black placeholders represented the cues. The paths of overt attentional reorienting in 3-D space were cued depth, different hemispace (CDDH); uncued depth, same hemispace (UDSH); and uncued depth, different hemispace (UDDH)

In Experiment 2, participants sat in front of a monitor screen with no goggles, and 2-D stimuli were presented on the monitor screen with an eye-to-monitor distance of 75 cm. The retinal sizes of the 2-D stimuli matched those of the 3-D stimuli in Experiment 1.

Experimental procedures and design

Experiment 1 in essence was a detection task. At the beginning of each trial, one peripheral location in either the near or the far depth plane was cued for 300 ms, followed by a 200-ms interstimulus interval. Subsequently, one location in the mid plane or the central location in either the near or the far depth plane was cued for 300 ms, to attract attention away from the previously cued location. After another period of 150 or 250 ms (these two randomly arranged intervals were used to prevent temporal orienting), the target was presented for 250 ms at one of the peripheral locations in either the cued or the uncued depth plane, with equal probabilities. The three types of attentional trajectories were as follows: attention oriented/reoriented along the depth plane in which the cue was located, but in the opposite hemispace [i.e., cued depth, different hemispaces (CDDH)]; attention oriented/reoriented along the uncued depth plane, but in the same hemispace as the cue [i.e., uncued depth, same hemispace (UDSH)]; and attention oriented/reoriented not only along the uncued depth plane, but also in the opposite hemispace [i.e., uncued depth, different hemispaces (UDDH)]. The cue validity was crossed with the attentional trajectory, and the cues were valid in 50 % the trials. Participants were instructed to fixate on the central fixation location in the midsagittal plane (zero disparity) throughout the experiment and to respond as quickly and accurately as possible to the target by pressing a key on the response box with their thumb. Prior to the formal test, each participant performed a practice session in which the composition of types of trials was the same as in the formal experiment.

The experimental, therefore, had a 2 (Target Depth: near vs. far) × 3 (Attentional Trajectory: CDDH vs. UDSH vs. UDDH) × 2 (Cue Validity: valid vs. invalid) within-subjects design, resulting in 12 experimental conditions in total. The experiment consisted of 624 trials, including 576 experimental trials (there were 48 trials in each condition) and 48 catch trials (to prevent participants from responding before the target onset, during which only the cues, and not the targets, were presented).

In Experiment 2, the experimental paradigm and task were the same as were described in Experiment 1, except that one independent variable was changed: The targets’ retinal sizes replaced the target depth in Experiment 1. Therefore, this experimental had a 2 (target retinal size: small vs. large) × 3 (attentional trajectory: CDDH vs. UDSH vs. UDDH) × 2 (cue validity: valid vs. invalid) within-subjects design.

Results

Experiment 1

Incorrect responses (miss trials) and RTs beyond three standard deviations were discarded. Errors were below 2 % across all of the conditions and were not analyzed further.

Table 1 provides a summary the mean RTs and standard errors under all conditions in Experiment 1. Correct RTs in the 12 experimental conditions were entered into a 2 (Target Depth: near vs. far) × 3 (Attentional Trajectory: CDDH vs. UDSH vs. UDDH) × 2 (Cue Validity: valid vs. invalid) repeated measures analysis of variance (ANOVA). The results showed that the main effect of attentional trajectory was significant, F(2, 48) = 16.63, p < .001, η 2 = .41. Post-hoc comparisons (Bonferroni-corrected for multiple comparisons) showed that RTs were longer in the CDDH condition (365 ms) than in the UDDH condition (357 ms), MD = 7.81, p < .005; longer in the UDSH condition (372 ms) than in the UDDH condition (357 ms), MD = 14.46, p < .001; and longer in the UDSH condition (372 ms) than in the CDDH condition (365 ms), MD = 6.65, p < .005. The main effect of cue validity was also significant, F(1, 24) = 72.65, p < .001, η 2 = .75, suggesting that the RTs to the targets at the cued location (375 ms) were longer than RTs to the targets at the uncued location (354 ms); however, the main effect of target depth was not significant, F(1, 24) = 2.33, p > .05. The interaction between target depth and cue validity was significant, F(1, 24) = 4.38, p < .05, η 2 = .11, as were the interaction between attentional trajectory and cue validity, F(2, 48) = 8.46, p = .001, η 2 = .26, and the three-way interaction between target depth, attentional trajectory, and cue validity, F(2, 48) = 23.93, p < .001, η 2 = .50. However, the interaction between target depth and attentional trajectory was not significant, F(2, 48) = 2.96, p > .05. To further test the potential differential interaction between cue validity and attentional trajectory in the near and far depth planes, a 2 (Cue Validity: cued vs. uncued) × 3 (Attentional Trajectory: CDDH vs. UDSH vs. UDDH) repeated measures ANOVA was conducted for targets in the near and far depth planes (see Fig. 2).

Table 1 Mean reaction times (RT) and standard errors (SE) of the means in the 3-D experiment and the 2-D control experiment (presented in the format RT ± SE)
Fig. 2
figure 2

Mean RTs, shown as a function of the experimental conditions in Experiment 1. The error bars represent SEMs

For targets in the far depth plane, the main effect of attentional trajectory was significant, F(2, 48) = 17.19, p < .001, η 2 = .42. Post-hoc comparisons (Bonferroni-corrected for multiple comparisons) showed that RTs were longer in the CDDH condition (365 ms) than in the UDDH condition (357 ms), MD = 8.52, p < .001; longer in the UDSH condition (375 ms) than in the UDDH condition (357 ms), MD = 18.58, p < .001; and longer in the UDSH condition (375 ms) than in the CDDH condition (365 ms), MD = 10.06, p = .005. The main effect of cue validity was significant, F(1, 24) = 54.75, p < .001, η 2 = .70, suggesting that RTs to the targets at the cued location (375 ms) were longer than RTs to the targets at the uncued location (357 ms). Furthermore, the interaction between attentional trajectory and cue validity was also significant, F(2, 48) = 22.00, p < .001, η 2 = .48. Additional test involving simple effects suggested that RTs were significantly longer to targets at cued locations than to those at uncued locations in the CDDH condition, t(24) = 9.17, p < .001, and the UDDH condition, t(24) = 7.05, p < .001. However, there was no significant difference between the RTs to targets at cued and uncued locations in the UDSH condition, t(24) = 1.81, p > .05. These results indicated that the typical IOR effects appeared only when attention was oriented/reoriented in the CDDH and UDDH conditions, whereas no IOR effect emerged when attention was oriented/reoriented in the UDSH condition. Comparing the IOR sizes in the CDDH and UDDH conditions, we did not find any significant difference, t(24) = 1.04, p > .05.

For targets in the near depth plane, the main effect of attentional trajectory was significant, F(2, 48) = 6.15, p < .005, η 2 = .20. Post-hoc comparisons (Bonferroni-corrected for multiple comparisons) showed that RTs were longer in the CDDH condition (365 ms) than in the UDDH condition (357 ms), MD = 7.10, p < .05, and longer in the UDSH condition (368 ms) than in the UDDH condition (357 ms), MD = 10.34, p = .005. However, we found no difference between the RTs in the UDSH condition (368 ms) and the CDDH condition (364 ms), MD = 3.24, p > .05. The main effect of cue validity was significant, F(1, 24) = 67.19, p < .001, η 2 = .74, suggesting that RTs to the targets at the cued location (374 ms) were longer than those to targets at the uncued location (352 ms). The interaction between attentional trajectory and cue validity was also significant, F(2, 48) = 8.16, p = .001, η 2 = .25. Additional tests involving simple effects suggested that RTs were significantly longer to targets at cued locations than to those at uncued locations in the CDDH condition, t(24) = 4.25, p < .001; in the UDSH condition, t(24) = 7.38, p < .001; and in the UDDH condition, t(24) = 7.41, p < .001, thus indicating that typical IOR effects appeared regardless of the different trajectories in which attention was oriented/reoriented. According to the IOR effects from different attentional trajectories, however, we found that the IOR size was significantly smaller in the CDDH condition (14 ms) than in the UDSH condition (26 ms), t(24) = 3.20, p < .005, or in the UDDH condition (28 ms), t(24) = 3.40, p < .005. However, there was no significant difference between the IOR sizes in the UDSH and the UDDH condition, ts < 1.

To summarize, these results indicate that attention oriented/reoriented in the UDSH condition differed between targets in the near and the far depth planes. Typical IOR effects were detected only when targets appeared in the near depth plane, whereas there was no IOR effect when targets appeared in the far depth plane. In addition, these results demonstrated that the IOR effects were the same when attention oriented/reoriented along different trajectories when targets were detected in the near depth plane, even though the IOR size was smaller in the CDDH than in the UDSH and UDDH conditions.

Experiment 2

Incorrect responses and RTs beyond three standard deviations were discarded. Also, errors were below 2 % across all conditions and were not analyzed further.

Table 1 presents a summary of the mean RTs and standard errors under all conditions in Experiment 2. Correct RTs were entered into a 2 (Target Retinal Size: small vs. large) × 3 (Attentional Trajectory: CDDH vs. UDSH vs. UDDH) × 2 (Cue Validity: valid vs. invalid) repeated measures ANOVA. The results showed that the main effect of attentional trajectory was significant, F(2, 48) = 17.03, p < .001, η 2 = .42. Post-hoc comparisons (Bonferroni-corrected for multiple comparisons) showed that RTs were longer in the CDDH condition (363 ms) than in the UDDH condition (355 ms), MD = 8.46, p < .001; longer in the UDSH condition (370 ms) than in the UDDH condition (355 ms), MD = 15.46, p < .001; and longer in the UDSH condition (370 ms) than in the CDDH condition (363 ms), MD = 7.00, p < .05. The main effect of cue validity was significant, F(1, 24) = 131.79, p < .001, η 2 = .85, thereby suggesting that RTs to targets at the cued location (375 ms) were longer than RTs to targets at the uncued location (351 ms). However, the main effect of target retinal size was not significant, F(1, 24) = 1.17, p > .05. Moreover, all interactions were not significant, Fs < 1 (see Fig. 3).

Fig. 3
figure 3

Mean RTs, shown as a function of the experimental conditions in Experiment 2. The error bars represent SEMs

According to these results, when we removed the depth dimension but retained the same object retinal sizes as in Experiment 1, all interactive effects disappeared, and typical IOR effects were presented regardless of the different attentional orienting/reorienting trajectories. These results clearly demonstrated that the differences when attention oriented/reoriented along a UDSH trajectory in Experiment 1 were caused by attentional orienting/reorienting in depth between the near and far targets, rather than by the different retinal target sizes.Footnote 1

General discussion

By incorporating the Posner exogenous cueing paradigm into a virtual 3-D world, we aimed to investigate whether spatial IOR effects are depth-specific and whether the spatial IOR effects are the same or different when attention is oriented/reoriented along different trajectories in 3-D space. The results showed typical spatial IOR effects when attention was oriented/reoriented along the CDDH and UDDH trajectories, regardless of whether the targets appeared in the near or the far depth plane. However, the IOR effects differed between targets appearing in the near and far depth planes when attention was oriented/reoriented along the UDSH trajectory; typical spatial IOR effects were detected only when targets appeared in the near depth plane, and there was no spatial IOR effect when targets appeared in the far depth plane.

A prior study had shown that the spatial IOR effect is not depth-specific in 3-D space (Theeuwes & Pratt, 2003), a finding that is in agreement with the viewpoint that attention does not operate in depth, or is “depth-blind” (Ghirardelli & Folk, 1996; Iavecchia & Folk, 1995). However, our results showed that spatial IOR was depth-specific when targets appeared in the near depth plane, whereas spatial IOR was not depth-specific when targets appeared in the far depth plane. Furthermore, the results suggested that no effect was observed when attention was oriented/reoriented along the UDSH trajectory, an outcome that was not due to a faster response to the cued location, but rather to a slower response to the uncued location.Footnote 2 These results also demonstrated that attentional reorienting to objects that unexpectedly appeared near observers in the UDSH condition was faster (355 ms) than reorienting to unexpected objects far away (373 ms), t(24) = 5.23, p < .001. Therefore, we see that the former condition enlarged the IOR size, whereas the latter condition reduced the IOR size or removed the IOR effect completely. This finding was consistent with results indicating that responses to targets in the near plane are faster than responses to targets in the far plane (Theeuwes & Pratt, 2003), and it is thus worth noting that asymmetrical effects have also been reported in previous studies (Andersen & Kramer, 1993; Arnott & Shedden, 2000; Atchley, Kramer, Andersen, & Theeuwes, 1997). Recently, one study (Chen et al., 2012) showed that although the trajectories covered the same spatial distance, attentional reorienting to objects unexpectedly appearing near the observer and in the unattended hemispace occurred more quickly than did reorienting to unexpected far-away objects. Therefore, from an ecological perspective, while navigating the 3-D world, one of the most important behaviors involves the rapid redirection of our attention to potentially threatening or rewarding stimuli that unexpectedly invade the space near us, resulting in a reduction in RT. In contrast, reorienting to stimuli far from us—although it is also important in everyday life—is usually not so urgent, so that the RT is longer (de Gonzaga Gawryszewski et al., 1987; Graziano & Cooke, 2006).

The critical difference between the present study and the prior study (Theeuwes & Pratt, 2003) relies on the question of whether the spatial IOR effect is depth-specific. First, in the present study, we had attention orient/reorient along a straight trajectory in the depth plane (UDSH condition), in which the second cue location was no longer the central fixation location. A previous study had suggested that reorienting attention to a fixated location resulted in a significant reduction in the inhibitory effect (Pratt et al., 2000). Therefore, as compared with a straight trajectory, the distance when attention reoriented along a nonstraight trajectory to the “novelty” stimuli was larger, resulting in a slowing down of the speed toward the “novelty” stimuli and a reduction in the amount of IOR (Bennett & Pratt, 2001). In addition, attentional momentum suggested that every trial included three movements of attention—from fixation to the cue, from the cue back to the fixation point, and from fixation to the target—and that orienting attention to the target is fastest when the third movement of attention has the same direction and amplitude as the second movement (Pratt, Adam, & McAuliffe, 1998; Pratt, Spalek, & Bradshaw, 1999). Therefore, the slowest detection responses occurred at the cued location, and the fastest responses occurred at the uncued location directly opposite the cued location (Bennett & Pratt, 2001). In prior studies (Bourke et al., 2006; Theeuwes & Pratt, 2003), the second cue was the central fixation, resulting in a trajectory of attentional orienting/reorienting in the depth plane (i.e., the UDSH condition in the present study) that was not straight; instead, the third movement of attention exhibited different directions and amplitudes from the second movement when attention was oriented/reoriented along the depth plane. Therefore, the slower responses at the uncued location either reduced the size of IOR or inhibited the IOR effect completely. Second, in the present study, the binocular disparity between the near and far depth planes was ±52.40 min of arc, relative to the fusion plane at which the fixation cross was presented (i.e., zero disparity). These values are near the maximum disparity separation possible without loss of fusion, since a pilot experiment showed that trained observers had no trouble fusing the two images with a binocular disparity of ±52.40 min of arc. In the prior studies (Bourke et al., 2006; Theeuwes & Pratt, 2003), the binocular disparity between the near and far depth planes instead was ±15 min of arc relative to the fusion plane at which the fixation cross was presented. According to the gradient model of visual attention, the closer to the cued location, the stronger inhibition is; therefore, the perceived distances between the near and far depth planes were larger in the present study than in previous studies, resulting in the inhibition tending to spread across locations in same hemispace but different depths in the previous studies.

Although our results indicated that the IOR effects were different when attention was oriented/reoriented along different trajectories in 3-D space, the different spatial-cueing effects especially appeared between targets in the near and far depth planes when attention was oriented/reoriented in UDSH condition. One might argue that the pattern of our results might have arisen from a confound of the effects of depth and hemispace. That is, in the condition in which IOR was not found when targets were in the far depth plane, targets are presented at uncued depth, but also this was the only condition of the three in which the target appeared in the same hemispace (in the other two conditions—CDDH and UDSH—the target appeared in a different hemispace). In order to rule out this explanation, we suggest that, first, if that were true, it would not explain the significant effect in the same hemispace in the near target condition. Second, a previous study (Bennett & Pratt, 2001) has shown that inhibition spreads beyond the cued location to affect the whole hemispace. However, this is likely only appropriate for 2-D space, in which the inhibition spreads along the y-axis. In the present study, the two locations at different depths in the same hemispace were at the same position on the y-axis, but at different positions on the z-axis. Therefore, that objection would not apply to the present study. Third, if inhibition could spread along the z-axis, according to the gradient model of visual attention, the closer to the cued location, the stronger inhibition is. The binocular disparity was ±15 min of arc in previous studies (Bourke et al., 2006; Theeuwes & Pratt, 2003), whereas the binocular disparity was ±52.40 min of arc in the present study. Therefore, the perceived distances were larger in the present study than in previous studies, so inhibition only tended to spread to locations in the same hemispace but different depths in the previous studies.

Apart from the differences in how attention was oriented/reoriented in the UDSH condition, consistent with the study by Theeuwes and Pratt (2003), we identified compatible IOR sizes between the UDDH condition (26 ms) and the CDDH condition (23 ms) when targets appeared in the far depth plane. According to our results, we speculated that although how attention was oriented/reoriented in the UDDH condition involved the depth dimension (z-axis), the IOR size in this condition might mostly be attributed to attention orienting/reorienting in the horizontal plane (x-axis). Bourke et al. (2006) built upon prior work (Theeuwes & Pratt, 2003) by increasing the horizontal separation of stimuli that had virtually overlapped, demonstrating that a small amount of spatial IOR operates within the depth plane (z-axis). In this case, the small amount of spatial IOR was obtained just when the near and far objects inhibited different positions in the horizontal plane (x-axis). Therefore, when the targets appeared in the far depth plane in the present study, although the attentional trajectories varied between the UDDH and CDDH conditions, the IOR sizes were similar.

In addition, attention oriented/reoriented in the CDDH condition underwent some reduction in IOR size relative to the other two conditions (i.e., UDSH and UDDH) when targets appeared in the near depth plane. This reduction effect was perhaps surprising, yet it was not without an interpretation. When targets appeared at the uncued location in the near depth plane, both the UDSH and UDDH conditions involved one critical factor: When stimuli unexpectedly approach to us from a farther location, they instantaneously demand a shift in our attention toward them in order to secure survival (de Gonzaga Gawryszewski et al., 1987; Downing & Pinker, 1985; Graziano & Cooke, 2006; Previc, 1998). However, attentional orienting/reorienting in the CDDH condition did not demand a shift in our attention when stimuli were approaching from far away. Therefore, due to the different ecological importances of these attentional trajectories, the characterization of the direction-specific mechanisms of attentional orienting/reorienting varied.

Our results show that the depth-based spatial IOR is asymmetrical (e.g., in the UDSH condition), and we indicated that the ecological importance of the 3-D world would influence attentional orienting/reorienting. That is, behaviorally, fewer benefits exist for attentional reorienting from near to far locations than for attentional reorienting from far to near locations (Chen et al., 2012; de Gonzaga Gawryszewski et al., 1987; Downing & Pinker, 1985; Maringelli et al., 2001), resulting in the former conditions reducing or removing the IOR effect, but the latter conditions enlarging the IOR effect. Neurally, some evidence has shown that the parietal–occipital junction (POJ) demonstrates a near-space preference in visuospatial processing in 3-D space, with the highest activation for near viewing and the lowest for far viewing (Cardin & Smith, 2011; Quinlan & Culham, 2007). Electrophysiological and tract tracting studies in monkeys have suggested a role of the visual area 6 (V6) complex as a potential candidate region in which information from the ventral and dorsal streams is integrated (Breveglieri, Galletti, Gamberini, Passarelli, & Fattori, 2006; Breveglieri, Kutz, Fattori, Gamberini, & Galletti, 2002; Galletti, Fattori, Kutz, & Battaglini, 1997). In the human brain, the POJ is a putative homologue of the monkey V6 complex (Galletti, Gamberini, Kutz, Baldinotti, & Fattori, 2005; Galletti, Kutz, Gamberini, Breveglieri, & Fattori, 2003; Pitzalis et al., 2013). Therefore, stimuli that unexpectedly approach observers instantaneously demand a shift of attention toward them in order to secure survival. In contrast, stimuli far from observers may be less attention-demanding.

To summarize, by adapting the Posner spatial-cueing paradigm in a constituted virtual 3-D environment, we found that IOR effects were different along different trajectories in 3-D space and that there were asymmetrical IOR effects when attention was oriented/reoriented in the depth plane (i.e., the UDSH condition, in the present study). IOR was depth-specific when targets appeared in the near depth plane, but depth blindness occurred when targets appeared in the far depth plane. We suggest that IOR is not entirely “depth-blind,” and that the ecological importance of the 3-D world influences the effects on IOR of different directional shifts of attention in 3-D space.