Event Abstract

Uncovering the temporal dynamics of scene understanding using Event-Related Potentials

  • 1 Wright State University, Department of Psychology, United States

Recognizing images of large-scale real-world environments is an essential skill in many professional contexts, ranging from defense and intelligence to driving and urban planning. However, relatively little is known about the temporal dynamics of scene recognition, and particularly its underlying neural temporal dynamics. Uncovering how scene processing unfolds in the brain over time (especially focusing on the earliest time windows in which the brain detects diagnostic scene information) is important not only for providing insights into the nature of scene understanding, but also for applied purposes. Prominent examples of such applications are the identification of specific real-time neural markers for the extraction of diagnostic scene information, as well as providing implicit measures for tracking down and evaluating image analyst training. In the present work, we describe a series of studies aimed at identifying and characterizing the functional significance of early electrophysiological markers of scene understanding. Our starting point for the current research is a recent surge in behavioral and neuroimaging studies that highlight the importance of diagnostic global scene properties (GSPs) for fast scene categorization (for a review, see Groen et al., 2017). Diagnostic GSPs, originally described by Greene and Oliva (2009), are mid-level image representations that convey information about the structure (“is it an open or a closed space?”) and function (“how easily can I navigate in this environment?”) of visual scenes. In our first study, we sought to establish EEG as a useful measure of scene diagnosticity by recording Event-Related Potentials (ERPs) from participants while they viewed naturalistic images of real-world scenes varying along two GSPs: spatial expanse (open/closed) and naturalness (manmade/natural). We found that these two GSPs modulated the amplitude of early sensory-evoked ERP components, particularly the posterior P2 (Harel et al., 2016), suggesting that information about the structure and function of visual scenes is encoded in the brain by 220ms post-stimulus onset. Given that open and closed scenes can be thought as two ends on a navigability continuum, we next reasoned that these ERP markers might contain information about the number of pathways that afford movement in the local environment. To test this idea, we recorded ERPs from participants in a second study while they viewed computer-generated room scenes used in a previous fMRI study of navigability (Bonner & Epstein, 2017). By simply changing the number of doors we were able to systematically control the number of movement paths in the scene, while keeping the overall size and shape of the environment constant. We found that rooms with no doors evoked a higher P2 response than rooms with three doors, analogous to the higher P2 amplitude to closed relative to open scenes. P2 amplitude to rooms with one or two doors was higher than three-door rooms but lower than the response to no-door rooms. This parametric navigability effect persisted following the P2 peak (~220ms), lasting up until 650ms post-stimulus onset. These results suggest that the perceived ease of navigation in a scene is represented in both early and late stages of scene perception, complementing fMRI research showing that the occipital place area automatically encodes the structure of navigable spaces. To further investigate the extent to which GSPs are indeed processed automatically (or rather influenced by their task-relevance), we conducted a third study in which we manipulated the task context under which the GSPs were perceived. Participants saw the same scenes as in the first study but now they were asked to actively categorize them, based either on their naturalness or spatial expense. We replicated previous results, and found that task context had very little impact on the early ERP responses to the naturalness and spatial expanse of the scenes: P2 amplitude could be used to distinguish between open and closed scenes even when participants were not actively attending to that dimension, and that also held for the distinction between manmade and natural scenes. These findings suggest that the extraction of global scene information is very little influenced by task-relevance, and supports the notion these early ERP markers of scene perception reflect automatic extraction of diagnostic scene information. In summary, the series of studies described here suggest that diagnostic global scene properties are processed robustly and automatically within the first 250 milliseconds of processing and can be used to index ecological behaviors such as potential navigability of a given visual environment. Potential implications and applications of these findings will be further discussed.

References

Bonner, M. F., & Epstein, R. A. (2017). Coding of navigational affordances in the human visual system. Proceedings of the National Academy of Sciences, 114(18), 4793-4798.

Greene, M. R., & Oliva, A. (2009). Recognition of natural scenes from global properties: Seeing the forest without representing the trees. Cognitive psychology, 58(2), 137-176.

Groen, I. I., Silson, E. H., & Baker, C. I. (2017). Contributions of low-and high-level properties to neural processing of visual scenes in the human brain. Phil. Trans. R. Soc. B, 372(1714), 20160102.

Harel, A., Groen, I. I., Kravitz, D. J., Deouell, L. Y., & Baker, C. I. (2016). The Temporal Dynamics of Scene Processing: A Multifaceted EEG Investigation. eNeuro, 3(5), ENEURO-0139.

Keywords: scene perception, scene understanding, Scene Recognition, scene categorization, EEG, EEG/ERP, Visual Perception

Conference: 2nd International Neuroergonomics Conference, Philadelphia, PA, United States, 27 Jun - 29 Jun, 2018.

Presentation Type: Oral Presentation

Topic: Neuroergonomics

Citation: Harel A (2019). Uncovering the temporal dynamics of scene understanding using Event-Related Potentials. Conference Abstract: 2nd International Neuroergonomics Conference. doi: 10.3389/conf.fnhum.2018.227.00148

Copyright: The abstracts in this collection have not been subject to any Frontiers peer review or checks, and are not endorsed by Frontiers. They are made available through the Frontiers publishing platform as a service to conference organizers and presenters.

The copyright in the individual abstracts is owned by the author of each abstract or his/her employer unless otherwise stated.

Each abstract, as well as the collection of abstracts, are published under a Creative Commons CC-BY 4.0 (attribution) licence (https://creativecommons.org/licenses/by/4.0/) and may thus be reproduced, translated, adapted and be the subject of derivative works provided the authors and Frontiers are attributed.

For Frontiers’ terms and conditions please see https://www.frontiersin.org/legal/terms-and-conditions.

Received: 02 Apr 2018; Published Online: 27 Sep 2019.

* Correspondence: Dr. Assaf Harel, Wright State University, Department of Psychology, Dayton, United States, assaf.harel@wright.edu