Imagine you are skimming this article quickly on your way to journal club. Despite directing your attention to reading, you readily make your way down the hall, perhaps detouring around a recent renovation project, and arrive safely at the seminar room ready to discuss. In addition to revealing your savvy time management skills, you would have demonstrated the fluent spatial memory abilities that underlie many of our daily activities. Humans have the capacity to quickly plan and execute routes through complex environments by accessing spatial memory representations. Given the wealth of navigationally relevant information that must be represented (object locations, environmental structure), successful encoding and subsequent retrieval depend heavily on how the memory representation is organized. As a result, the organizational principles that guide these representations have received considerable attention in the spatial memory literature. In the present article, we consider one of the most consistently observed and supported principles: the establishment of a principal reference orientation (for a review, see McNamara & Valiquette, 2004; Shelton & Yamamoto, 2009).

Principal Reference Theory (PRT) has emerged as a theoretical framework to explain orientation-dependent memory performance. When people are asked to imagine different perspectives of a learned space, they tend to be fastest and/or most accurate for one orientation relative to all others (e.g., Greenauer & Waller, 2008, 2010; Hintzman, O’Dell & Arndt, 1981; Kelly, Avraamides & Loomis, 2007; Kelly & McNamara, 2008; Marchette & Shelton, 2010; May, 2007; Mou & McNamara, 2002; Mou, Liu & McNamara, 2009; Mou, Zhao & McNamara, 2007; Roskos-Ewoldsen, McNamara, Shelton & Carr, 1998; Shelton & Marchette, 2010; Shelton & McNamara 1997, 2001a, 2004a, b; Valiquette & McNamara, 2007; Yamamoto & Shelton 2005, 2007, 2009). According to PRT, representing space involves selecting a principle orientation as the reference for encoding locations (e.g., Shelton & McNamara, 2001a; Werner & Schmidt, 1999). Over the last decade, mounting evidence has suggested that reference orientation selection is based on a hierarchy of cues. Egocentric experience appears to provide a default organization for memory; in the absence of other organizational cues, an egocentrically experienced orientation will serve as the reference (Greenauer & Waller, 2008; Shelton & McNamara, 1997; 2001a, b). Salient organizational cues are a common feature in natural environments, however, and these cues can adjudicate when a space has been experienced from more than one orientation (McNamara, Rump & Werner, 2003; Shelton & McNamara, 2001a; Werner & Schmidt, 1999). For example, the walls of a room provide a strong environmental frame, and an orientation aligned with the walls will be selected over an orientation misaligned with the walls (Shelton & McNamara, 2001a, Exp.3). In addition, strong organizational cues can even drive selection of a reference orientation that was never experienced during learning, including strong columnar organization of locations (e.g. Mou & McNamara, 2002), an axis of bilateral symmetry (e.g., Mou et al., 2007), or aligning all of the object axes with a single orientation in the space (Marchette & Shelton, 2010). Taken together, this body of evidence suggests that the selection of a reference orientation involves the opportunistic use of available reference frames, with interactions between egocentric experience and environmental cues.

Although these studies have provided a strong foundation for thinking about the organizational principles of spatial memory, the conditions of these experimental settings differ dramatically from the conditions involved in our daily interactions with space on many critical features. First, studies have frequently used room-sized (e.g., Greenauer & Waller, 2010; Mou et al., 2007; Shelton & McNamara, 2001a) or table-top (e.g., Roskos-Ewoldsen et al., 1998; Shelton & McNamara, 2001b, 2004b) displays in order to tightly control exposure to specific orientations and limit movement in the space. In such situations, the whole display can be viewed simultaneously with small movements (i.e., vista scale), whereas many spaces in which we navigate daily have discrete locations that are not simultaneously in view (i.e., environmental scale). It is possible that principles governing the organization within a vista-scale space differ from those that govern integration across subsequent local vistas in an environmental-scale space (e.g., Acredolo, 1981; Montello, 1993; Presson, DeLange & Hazelrigg, 1989). A small number of studies have examined memory for large-scale real environments (McNamara et al., 2003) or virtual spaces (e.g., Shelton & McNamara, 2004a) and have shown the characteristic preferred orientation, despite the need to traverse multiple path legs. As such, differences in scale do not appear to be a major factor affecting these broad organizational principles on its own.

More critically, however, most studies differ from familiar spaces at a more fundamental experiential level: Participants learn a layout in a limited number of exposures to the environment—typically, one—and with a limited set of learning goals. Although learning may be “broken up” into mini-sessions between which participants close their eyes, the entire experience with the space takes place in the context of a single experimental session. In addition, the entire encoding experience with the environment lasts around 20–30 min with retrieval following shortly after (e.g., Greenauer & Waller, 2008, 2010; Marchette & Shelton, 2010; McNamara et al., 2003; Mou et al., 2009; Mou & McNamara, 2002; Mou et al., 2007; Shelton & McNamara, 2001a, b; Valiquette & McNamara, 2007; Yamamoto & Shelton 2005, 2007, 2009). Most familiar spaces are learned over extended time periods--days, weeks, months, years—with navigational goals varying with different exposures. In the present article, we consider how this variable exposure might affect organizational strategies/principles.

It is possible that the experimental cases offer insufficient experience to extend the representation to a more generalized representation. Studies showing a single preferred orientation after exposure to multiple views argue against the strongest version of this limitation argument (e.g., Kelly & McNamara, 2010; Shelton & McNamara, 1997; 2001a; Yamamoto & Shelton, 2005); however, these studies still use highly constrained encoding in terms of both time and navigational goals. As such, it is possible that the observed orientation dependence reflects either a temporary strategy that works well for short-term retention or a “starter” representation on which to build. For the latter, it is possible that over many different exposures with variable navigational goals, one can establish a richer, more flexible, and essentially orientation-independent representation.Footnote 1

A small number of studies have examined large-scale familiar spaces learned under more natural conditions (e.g., Anooshian & Young, 1981; Bryant, 1982, 1984; Buttenfield, 1986; Hirtle & Jonides, 1985; Kitchin & Fortheringham, 1997; Montello, 1991; Rovine & Weisman, 1989; Sholl, 1987; Tlauka, 2006; Werner & Schmidt, 1999), but these have provided only limited information with respect to reference frame selection. For example, Sholl demonstrated that participants were faster and more accurate at locating local landmarks when facing a more frequently experienced direction. In a converging line of evidence, Tlauka found that landmark locations in a large-scale environment were more readily recalled from some imagined locations than from others. Montello (1991) showed that pedestrians were more accurate at locating local landmarks when traveling on a street aligned with the grid of a city than when traveling a misaligned street. Similarly, Werner and Schmidt found that alignment with strong environmental structure appears to facilitate retrieval. Overall, these foundational observations have established that the accuracy of spatial recall depends on the participant’s actual/imagined position when they make judgments about the space.

These various forms of retrieval facilitation might reflect preferred orientations, but they are also consistent with other potential organizations for spatial memory, and the methods were not optimized to distinguish between these possibilities. For example, one possible alternative is that accuracy of spatial memory depends primarily on the familiarity of the specific imagined position. This account would make very different predictions about spatial recall than those of a single preferred orientation that was selected on the basis of familiarity; however, the benefits seen in previous research could be consistent with such a story, since positions aligned with local environmental structure are likely to be more familiar by design (e.g., Lynch, 1960). In addition, these theoretical differences stress the importance of distinguishing between benefits for a particular orientation of the space and benefits for particular positions within the space. A reference orientation should have a global influence over retrieval of any location within a space—even in areas in which the cues that led to its selection are not strong or present. If the direction of a major road drives the selection of a preferred orientation for a city, then recall at that orientation should be more accurate even in a nearby park where the road is not visible.

The present study was designed to directly test whether memory representations of familiar spaces learned via normal daily experience reveal the same orientation dependence observed under the more controlled laboratory learning conditions. To address this question, one must have an environment that has clear boundaries, covers a limited enough area to be a coherent space, and includes locations that can be used to interrogate the accuracy of retrieval from many different orientations. In addition, one must have a captive population that uses said environment for daily navigation. Fortunately, college campuses afford these conditions. A campus is a defined set of buildings, quads, and so on, and students re-experience the same campus environment on a daily basis, in a variety of different contexts (classes, meetings, meals, social activities). Finally, their memories for campus have been built up over timescales of months to years. With these constraints in mind, we used undergraduate students on the Homewood campus of Johns Hopkins University, and asked them to retrieve information about commonly known campus locations from memory.

Method

Participants

Fifty-eight students (29 female) at the Johns Hopkins University participated in return for extra credit in psychology courses.

Materials and design

Judgments of relative direction

Memory was tested with judgments of relative direction (JRDs), in which participants point to targets from imagined headings in space (e.g., Roskos-Ewoldsen et al., 1998; Shelton & McNamara, 1997). To form these JRDs, we needed to identify buildings on the Homewood campus that would be familiar to the diverse population enrolled in psychology courses. Extensive pilot work identified 17 buildings that were consistently familiar to students, 13 of which fell into the three major quads that form the academic and residential core of the campus. These 13 buildings were combined into pairs defining imagined headings (Fig. 1), and all 13 also served as targets for JRDs. The remaining four buildings were well known but peripheral to the cluster of core buildings, either embedded in the neighborhood across North Charles Street or at peripheral campus locations. These four buildings were never used as part of an imagined heading, but were included as targets for JRDs. These targets afforded an analysis of pointing within, between, and beyond quads, but no differences were subsequently revealed; this issue of target location is not discussed further.

Fig. 1
figure 1

A map of the Johns Hopkins Homewood campus, using a West-as-up convention common at Hopkins. Buildings used as stimuli in this experiment are outlined in dark pen. The gray arrows indicate the assignment of orientations. Image courtesy of Johns Hopkins University Office of Undergraduate Admissions

Using the veridical orientations of all possible building-pair headings, calculated from Google Earth coordinates, we selected those that were roughly aligned with one of eight orientations around the campus (0°–315° in 45° increments). Unlike laboratory displays, the Homewood campus was not designed to provide exact headings at fixed intervals, so we adopted a +/−11.25° tolerance for assigning orientation (e.g., a 7° heading would be classified as a 0° heading). We also considered the possibility that the Homewood campus might contain spatial hierarchies, which could influence recall of campus locations (e.g., Hirtle & Jonides, 1985; Friedman, 2009; Friedman & Montello, 2006), so we selected building pairs so that half of the pairs at each orientation were buildings located within the same quad (within-quad headings), and half of the pairs were buildings from two different quads (across-quad headings). For each building-pair heading, all remaining buildings were possible targets. To avoid implicit biases based on our own knowledge and use of the Homewood campus, we randomly selected 16 test trials at each heading type (two) by orientation (eight), yielding 256 trials.

Computerized trials were presented with a pointing wheel consisting of a black circle and a pointer line that originated in the center of the circle and tracked the mouse radially. The name of the building at the to-be-imagined location appeared in the center of the circle (e.g., Ames Hall), and the name of the to-be-faced building appeared at the top of the circle (e.g., Krieger Hall). The name of the target building appeared in a box at the top of the screen (e.g., library), and the participant had up to 15 s to move the pointer line to indicate the direction to the target object from that heading. If the participant responded before the deadline, the trial ended and pointing judgment direction and response latency were recorded. If participants exceeded 15 s, the computer emitted an aversive beep and recorded a miss. Missed trials were not repeated. There was a 1.2 s intertrial interval, and participants received a 12 s rest every 16 trials.

Procedures

The use of buildings made imagined locations somewhat ambiguous (e.g., participants could imagine themselves at different points in or around a building). To mitigate this ambiguity, we instructed participants to treat each building as a single point (e.g., the center of the building), and to always use this point when imagining or locating the building. Participants were trained to perform JRDs with 16 practice trials using imagined headings that did not correspond to the eight orientations of interest. Following practice, participants were given an opportunity to ask questions before the test trials began. The 256 test trials were then presented in random order.

Campus experience questionnaire

Once participants completed the JRDs, they filled out a Campus Experience Questionnaire (CEQ). The CEQ had two components: (a) assessing the participants’ familiarity and experience with the Homewood campus, and (b) gauging factors that might relate to their everyday use of the campus (e.g., breadth of exploration in the broader community). In the first section of the CEQ, participants indicated their year of study at Johns Hopkins (e.g., fourth year) and listed all of the residence halls or apartment complexes they had lived in during their study. They then used a 5-point scale to rate their familiarity with the buildings on the Homewood campus, including those used in JRDs (from 1[totally unfamiliar] to 5[familiar]), and listed how many classes they had in each building (for academic buildings), along with how many times a week they visited the building in each of their years at Hopkins. The CEQ was included to allow us to confirm a basic knowledge of the campus and acquire information that might predict reference frame selection.

Results

First, we compared absolute angular error to reaction time at an individual level to determine whether participants were using a speed/accuracy trade-off during performance of judgments of relative direction. In general, we did not observe any tendency for participants to tradeoff speed for accuracy (or vice-versa: mean correlation coefficient,–0.02); however, two individuals did show evidence of a nonlinear speed/accuracy trade-off (correlation coefficients of –.5 and –.52). On closer inspection, it became clear that these participants had a large number of trials completed in less than 1 s, and that these trials tended to be particularly inaccurate. We ran all analyses with and without these two participants and did not observe any differences.

Next, we verified that participants did not time out on trials in a systematic manner that could influence the validity of our results. Average response latency was 7.7 ± 3.4 s, half of the 15 s response deadline, suggesting that the deadline did not serve to limit participants’ capacity to reason accurately. Participants missed 11 of 256 trials on average; these missed trials appeared uniformly distributed across our factors of interest (orientation of the imagined heading and heading type) and were consistent with the natural consequences of completing massed trials of JRDs (occasional distraction or boredom). In all of the subsequent analyses, we focus on angular error rather than response latency data; however, the patterns in response latency were consistent with the overall story but showed greater variability.

The present dataset relied on the opportunistic use of campus buildings familiar to the average undergraduate at Johns Hopkins; as a result, we could not easily counterbalance pointing direction to the target across all orientations. True pointing direction is known to have an impact on absolute angular error, with pointing to the front being more accurate than to the sides or back (e.g., Shelton & McNamara, 2001a). To take pointing direction to target into account, we performed all analyses on mean absolute angular error with pointing direction covaried out at the individual level.(Covarying out pointing direction had an impact on certain orientations, as was expected by the unbalanced pointing directions across headings, but the overall result remained the same regardless of whether we removed the covariate.)

We calculated mean absolute angular error for each individual at each orientation by heading type combination. Absolute angular error was submitted to a mixed-model ANOVA with gender and year of study at Hopkins as a between-participants factor and orientation of the imagined heading (0° to 315° in 45° steps) and heading type (within- or across- quad heading) as a within-participants factor. Within-participants effects and contrasts were Geisser–Greenhouse corrected for non-sphericity; all contrasts were corrected for multiple comparisons using Scheffe’s procedure.

The primary question was whether we could see a preferred orientation in a familiar space learned under more natural conditions. The main effect of orientation was significant, F(7, 350) = 5.383, p = .001, η p 2 = 0.097, indicating orientation dependence. Given the unconstrained learning, however, it was not clear what orientation was likely to be most accurate. Therefore, we asked whether any of the eight orientations was more accurate than the others, and, if so, whether there was uniform increased error for all remaining orientations. As is shown in Fig. 2, 0° was more accurate than all other orientations combined, F(1, 57) = 41.16, p < .05. This contrast was not significant for any other orientation, with the exception of 225°, which was less accurate than all other orientations, F(1, 57) = 20.81, p < .05. To rule out the possibility that particularly inaccurate performance at a single orientation could explain the observed benefit for 0°, we repeated the contrast of 0° versus all other orientations with 225° excluded, F(1, 57) = 35.59, p < .05.

Fig. 2
figure 2

Average absolute angular error in judgments of relative direction as a function of heading type and orientation of the imagined heading. Error bars reflect ± 1 SEM

In addition to the benefit for 0°, there was also a “see-saw” pattern among non-0° orientations: those orientations opposite of and orthogonal to 0° (90°, 180°, and 270°) were more accurately retrieved than were the remaining orientations (45°, 135°, 225°, 315°), F(1, 57) = 36.737, p < .05.

The aforementioned pattern has typically been identified with strong environmental structure benefitting certain non-preferred orientations (e.g., Shelton & McNamara, 2001a). However, it can also arise from different participants showing different preferred orientations. To allow for this possibility, we inspected the data at the individual level. First, we observed that all participants showed a reasonably distinct preferred orientation, with the nearest second best being opposite of or orthogonal to the best. Most participants showed performance consistent with the 0° preferred orientation (31 participants). Critically, individual participants still tended to show the see-saw pattern, suggesting that it was not simply an artifact of different preferences.

With respect to hierarchies, we also observed a significant effect of heading type, F(1, 50) = 9.898, p = .003, η p 2 = 0.165, with participants pointing more accurately from within-quad imagined headings than across-quad headings (38.21° vs. 41.88°), and a significant orientation × heading type interaction, F(7, 350) = 2.370, p = .038, η p 2 = 0.045. One expectation for this interaction was that the presence or scale of the alignment effect (0° vs. else) could be modulated by the type of heading. Inspection of the data did not support this interpretation; the two heading types showed similar benefits (8.11° and 6.65° for within- and across-quad headings, respectively), and the contrast–contrast interaction (0° vs. else × within- vs. across-quad) was not significant, F(1, 57) = 0.467, p > .05. Instead, this interaction appears to be due to differential performance on headings orthogonal to the preferred orientation for within- versus across-quad headings, with a stronger benefit for 90° and 270° for within-quad versus across-quad headings. This was supported by a contrast–contrast interaction comparing these two advantaged headings to the non-orthogonal headings, (90°& 270° vs. 45°, 135°, 225°, and 315°) by across- versus within-quad headings, F(1, 57) = 13.372, p < .05. This could support a difference in the use of the structure available locally (within-quad) versus distally (across-quad), as predicted by hierarchies.

No other effects or interactions were significant. The CEQ provided confirmation of participants’ knowledge; however, likely because of the high consistency in the preferred orientation at 0°, we did not observe any factors that appeared to predict principal orientation selection, and none of the effects were sensitive to the number of years that students had been on campus.

Discussion

We set out to determine whether the orientation dependence of human spatial memory observed in tightly controlled laboratory experiments generalizes to spaces learned under the more natural and highly variable encoding conditions of everyday navigational learning. Our results unambiguously support the prevalence of orientation dependence in spatial memory representations, with our participants organizing campus location along a preferred orientation. This result extends decades of previous research on spatial reference frames from the lab to the real world by showing that similar principles underlie the representations established during a single visit to the lab and those built up through everyday navigation. In addition, this finding has the potential to unite many existing observations about how people recall familiar spaces (e.g., Montello, 1991; Sholl, 1987; Tlauka, 2006; Werner & Schmidt, 1999) by suggesting that the benefits at recall for alignment with familiar orientations or local structures may reflect the presence of a reference orientation. Our result suggests that these patterns of orientation dependence may reflect a relatively general reference used for all locations within the space, because each of our orientations consisted of trials from multiple different imagined headings. If these previously observed benefits also reflected preferred orientations, then the cues that produced those benefits might also be those that drive selection of a reference for real-world spaces.

The selection of a reference orientation has been a pressing issue in spatial cognition. The consistency in our results was remarkable: Most participants showed a preference for the 0°/North-as-up orientation. Looking back at Fig. 1, it is clear that the majority of the buildings on campus are located about rectangles that are roughly aligned with this axis. As such, the organizational properties of the participants’ memory representations appear to be reflecting obvious physical properties of the structure of the campus (i.e., orientation and alignment of buildings). This apparent use of the structural arrangement of campus as a guiding principle for reference frame selection parallels the findings from a variety of paradigms showing how the organization of objects and object properties can drive the principal reference (e.g., Kelly & McNamara, 2008; Marchette & Shelton, 2010; Mou & McNamara, 2002), as well as the benefit for alignment with environmental structure in recalling familiar spaces (Montello, 1991; Werner & Schmidt, 1999). Despite the prevalence of North-as-up, we did observe individuals with preferences for other orientations, most often South (180°) or West (90°). An axis facing South might have been selected, since it shares many of the cues that might support a North-facing reference orientation. Similarly, a West reference is orthogonal to this major axis. Moreover, it is consistent with a particular quirk of Johns Hopkins University: Maps of the Homewood campus are typically oriented with West up (although this convention is thankfully changing). Although we did not find a clear case for other idiosyncratic preferences, this variability in the selection of a preferred orientation did raise some additional questions. For example, a small subset of our participants revealed seemingly random preferred orientations that were misaligned with any obvious structural or functional organization of the campus. Although these may reflect properties that we are as of yet unaware, they may also reflect poor selection. We took a cursory look at this intriguing possibility by comparing the overall performance of participants who selected aligned or misaligned orientations. Even at their best orientation, those selecting the aligned orientations performed more accurately than the subset that selected the apparently misaligned orientations. This introduces an interesting question about the role of individual differences in reference frame selection and how this selection relates to success of learning and navigating.

Selection of the reference orientation reflects one potential use of the spatial structure. In addition, we observed elements of a spatial hierarchy influencing memory access. Although we had insufficient headings to fully explore preferred orientations in different quads, our data suggest that the selection of North-as-up was not specific to any particular subset of trials. However, we did observe differences in facilitation among non-preferred orientations for within- versus between-quad trials. In particular, orientations orthogonal to 0° were facilitated more when headings comprised buildings from the same quad, suggesting that within-quad headings allowed participants to make more use of structure to facilitate retrieval. Previous observations of retrieval facilitation (i.e., the see-saw pattern among non-preferred orientations) have depended on the presence of salient structure. Such a structure may be more readily available when imaging two buildings that are part of the same coherent unit as is the case for buildings that form a quad. (After all, the name “quadrangle” comes from the fact that the buildings tend to form a quadrilateral shape.) Despite the various examples of the see-saw pattern in the literature and now for familiar large-scale space, this retrieval facilitation is an area that is currently underexplored and may represent a rich avenue for future investigations.

Finally, we have attributed the observed preference for North-as-up to the environmental structure because campus buildings are roughly distributed in rectangles with a North–South axis of elongation that is parallel to the major landmark of North Charles Street (Fig. 1). However, there are other features of the environment that might also be driving this selection. First, North-as-up is a map convention with which most, if not all, of our participants are likely familiar. Although it is not the convention for maps at Hopkins, this North-as-up organization may be given priority when learning a new space, because of extensive experience. Second, there is a functional role for the North–South axis on campus: It also serves as the major route of travel for students going from residence halls (as well as much of the off-campus housing) to academic buildings. As such, the reference frame selection could also be driven by salient direction of travel in the space. Data from the CEQ could leverage this aspect, but we had insufficient representation of participants who had extensive experience with primary directions of travel other than the North–South axis: Much of the available housing is on the north end of campus and beyond. Although the coincidence of these various cues on the Homewood campus and our inability to directly manipulate participants’ experiences prevent us from disentangling which cues gave rise to the preferred orientation in the present study, this work highlights the need to consider a much broader range of perceptual, conceptual, and experiential cues that might contribute to the formation of spatial memories.

Taken together, these results argue that representing space requires establishing a reference orientation, a requirement that endures over repeated exposures and extensive experience. These results do not suggest that the choice of reference orientation is inflexible with experience. Indeed, it is clear that the preferred orientation can shift with the experience of new viewpoints (Shelton & McNamara, 2001a) or when updating a representation with new information (Kelly & McNamara, 2010). The key is that these changes/additions do not mitigate the need to represent space with respect to some orientation. Although our results jibe with well-established principles, the implications may be counterintuitive when we reflect on our own experience of recalling a familiar environment. Subjectively, it feels as though familiar locations from our everyday travels can be easily imagined from many different vantage points; however, it appears that this flexibility does not reflect orientation independence but rather stems from deft manipulation of a representation coded with respect to a single reference orientation.