A smoothness constraint on the development of object recognition

doi:10.1016/j.cognition.2016.04.013

Cognition

Volume 153, August 2016, Pages 140-145

https://doi.org/10.1016/j.cognition.2016.04.013 Get rights and content

Highlights

•
Is temporal smoothness important for the development of object recognition?
•
Newborn chicks were used as an animal model in a controlled-rearing study.
•
Chicks developed object recognition abilities in temporally smooth environments.
•
The development of object recognition was impaired in non-smooth environments.
•
Temporal smoothness facilitates the development of object recognition.

Abstract

Understanding how the brain learns to recognize objects is one of the ultimate goals in the cognitive sciences. To date, however, we have not yet characterized the environmental factors that cause object recognition to emerge in the newborn brain. Here, I present the results of a high-throughput controlled-rearing experiment that examined whether the development of object recognition requires experience with temporally smooth visual objects. When newborn chicks (Gallus gallus) were raised with virtual objects that moved smoothly over time, the chicks developed accurate color recognition, shape recognition, and color-shape binding abilities. In contrast, when newborn chicks were raised with virtual objects that moved non-smoothly over time, the chicks’ object recognition abilities were severely impaired. These results provide evidence for a “smoothness constraint” on newborn object recognition. Experience with temporally smooth objects facilitates the development of object recognition.

Introduction

Object recognition is one of the most important functions of the vertebrate visual system. To date, however, the development of object recognition is poorly understood. What environmental factors cause object recognition to emerge in the newborn brain? Does this ability emerge automatically, or do newborns require a specific type of visual input in order to develop accurate object recognition abilities? These types of questions are difficult to address with humans because human infants cannot be raised in strictly controlled environments from birth. In contrast, questions that concern the role of experience in development can be addressed directly with controlled-rearing studies of newborn animals. Here, I describe a high-throughput controlled-rearing experiment that examined whether the development of object recognition requires experience with temporally smooth visual objects.

Researchers have long theorized that biological visual systems leverage the temporal smoothness of natural visual environments to recognize objects (e.g., DiCarlo et al., 2012, Feldman and Tremoulet, 2006, Foldiak, 1991, Gibson, 1979, Stone, 1996, Wallis and Rolls, 1997, Wiskott and Sejnowski, 2002). In particular, when an object moves smoothly across the visual field, the object projects a series of gradually changing images on the retina. The visual system might take advantage of this natural tendency for temporally contiguous retinal images to belong to the same object by associating patterns of neuronal activity produced by successive retinal images of an object. When provided with temporally smooth visual input, this temporal association process should create object representations that are selective for object identity and tolerant to identity-preserving image transformations (e.g., changes in viewpoint).

A wealth of studies provide evidence that mature visual systems use temporal association mechanisms to create object representations. For example, when human adults are presented with sequential views of an object, the views come to be associated with one another in a manner that aids recognition (Cox et al., 2005, Liu, 2007, Stone, 1998, Vuong and Tarr, 2004, Wallis et al., 2009, Wallis and Bülthoff, 2001). Temporal association effects have also been found on the neurophysiological level in adult monkeys (Li and DiCarlo, 2008, Li and DiCarlo, 2010, Meyer and Olson, 2011, Miyashita, 1988). In the present study, I examined whether newborn visual systems create more accurate object representations when presented with temporally smooth objects compared to temporally non-smooth objects—as predicted by temporal association models (Wallis, 1998, Wallis and Bülthoff, 2001). Specifically, I examined the first visual object representation created by newborn subjects, before their visual systems had been shaped by any prior visual object experience.

This experiment required controlling all of the subjects’ visual experiences from the onset of vision and measuring their object recognition abilities across a range of test trials. To meet these requirements, I used a high-throughput controlled-rearing method (Wood, 2013). The method involves raising newborn chicks in strictly controlled environments and recording their behavior in response to pre-programmed animations (Fig. 1A). We use the term “high-throughput” to describe the method because the controlled-rearing chambers record all of the subjects’ behavior (24/7).

I used domestic chicks as an animal model because they are an ideal model system for studying the development of vision (Wood & Wood, 2015a). First, chicks can be raised in strictly controlled environments immediately after hatching, which makes it possible to control all of their visual object experiences. Second, chicks imprint to objects seen in the first days of life. This imprinting behavior can be used to test chicks’ object recognition abilities without training (Bateson, 2000, Horn, 2004). Third, birds and mammals process sensory input using homologous neural circuits with similar connectivity patterns (reviewed by Jarvis et al., 2005, Karten, 2013). Since birds and mammals use homologous neural mechanisms to process visual input, controlled-rearing studies of newborn chicks can inform our understanding of the development of both avian and mammalian vision. Finally, chicks develop visual recognition abilities rapidly (Vallortigara, 2012). For example, newborn chicks can begin recognizing objects (Wood, 2013, Wood, 2015), faces (Wood & Wood, 2015b), and actions (Goldman & Wood, 2015) at the onset of vision. Newborn chicks can also build integrated object representations with bound color-shape units (Wood, 2014).

In the first week of life (input phase), newborn chicks were raised in environments that contained no objects other than a single virtual object (Fig. 1A). For one group of chicks, the virtual object moved smoothly over time (Temporally Smooth Condition), whereas for another group of chicks, the virtual object moved non-smoothly over time (Temporally Non-Smooth Condition). In the second week of life (test phase), I used an automated two-alternative forced-choice procedure to test the chicks’ color recognition, shape recognition, and color-shape binding abilities.

Section snippets

Subjects

Twenty-two domestic chicks of unknown sex were tested. No subjects were excluded from the analyses. The eggs were obtained from a local distributor and incubated in darkness in an OVA-Easy incubator (Brinsea Products Inc., Titusville, FL). After hatching, the chicks were moved from the incubation room to the controlled-rearing chambers in complete darkness. Each chick was raised singly within its own chamber. Ten chicks were raised with a temporally smooth object and 12 chicks were raised with

Recognition performance

The results are shown in Fig. 3. For each test trial type, I computed the percent of time each chick spent with the imprinted object compared to the unfamiliar object. A repeated measures ANOVA with Test Trial Type as a within-subjects factor and Condition (Temporally Smooth vs. Temporally Non-Smooth) as a between-subjects factor revealed a significant main effect of Test Trial Type (F(6, 120) = 17.08, p < 0.001) and Condition (F(1, 20) = 10.99, p = .003). The interaction was not significant (F(6, 120) =

Discussion

I used a high-throughput controlled-rearing method to examine whether newborn chicks need visual experience with temporally smooth objects to develop object recognition abilities. The chicks raised with the temporally smooth objects and the chicks raised with the temporally non-smooth objects were exposed to the same individual images, and the objects were equally predictive in terms of the transitional probabilities between images; nevertheless, there were significant differences in

Acknowledgements

This research was funded by National Science Foundation CAREER Grant BCS-1351892. I thank Brian W. Wood for assistance with the supplementary movies and Samantha M.W. Wood for helpful comments on the manuscript.

References (35)

H. Bulf et al.
Visual statistical learning in the newborn infant
Cognition
(2011)
J.J. DiCarlo et al.
How does the brain solve visual object recognition?
Neuron
(2012)
J. Feldman et al.
Individuation of visual objects over time
Cognition
(2006)
H.J. Karten
Neocortical evolution: Neuronal circuits arise independently of lamination
Current Biology
(2013)
N.Z. Kirkham et al.
Visual statistical learning in infancy: Evidence for a domain general learning mechanism
Cognition
(2002)
N. Li et al.
Unsupervised natural visual experience rapidly reshapes size-invariant object representation in inferior temporal cortex
Neuron
(2010)
J. Stone
Object recognition using spatio-temporal signatures
Vision Research
(1998)
Q.C. Vuong et al.
Rotation direction affects object recognition
Vision Research
(2004)
G. Wallis et al.
Invariant face and object recognition in the visual system
Progress in Neurobiology
(1997)
J. Wattam-Bell
Development of motion-specific cortical responses in infancy
Vision Research
(1991)

J. Wattam-Bell

The development of maximum displacement limits for discrimination of motion direction in infancy

Vision Research

(1992)

D.D. Cox et al.

‘Breaking’ position-invariant object recognition

Nature Neuroscience

(2005)

P. Foldiak

Learning invariance from transformation sequences

Neural Computation

(1991)

J.J. Gibson

The ecological approach to visual perception

(1979)

J.G. Goldman et al.

An automated controlled-rearing method for studying the origins of movement recognition in newly hatched chicks

Animal Cognition

(2015)

G. Horn

Pathways of the past: The imprint of memory

Nature Reviews Neuroscience

(2004)

Cited by (28)

A predictive model of indoor PM<inf>2.5</inf> considering occupancy level in a hospital outpatient hall
2022, Science of the Total Environment
Citation Excerpt :
Deep-learning models can express more complex transformation relationships by combining simple composite functions. Object recognition is an important function of the vertebrate visual system (Wood, 2016). It aims to identify specific objects by extracting contours, textures, and shapes from complex image content.
The hospital outpatient hall is more complex and sensitive than other indoor places because of its high density, flow of patients, and risk of infection. The prediction of indoor pollutants, such as PM_2.5, is a critical health risk factor and an important topic in the study of indoor air quality. Numerous black-box models have been built to predict PM_2.5, which are prone to overfitting and low precision in long sequence time prediction due to their limited weighting calculation and factors considered In this study, subject-object weighting incorporates a long sequence time-series model that considers occupancy (SO-LSTS) to predict PM_2.5 concentrations in a hospital outpatient hall. First, the occupancy level was obtained using image recognition technology. Second, both the subjective (improved AHP) and objective (entropy weight) information were coupled by a distance function and then integrated into the LSTS model. Finally, the model performance was compared to six traditional models and the impact on the output length and hyper-parameter confirmation was assessed. The results demonstrate that the occupancy factor can improve the model performance by 54 %, and the model accuracy is improved by 89 % compared to the traditional Informer method. Our study considers real-time environmental and occupancy levels, which can compensate for the difficulty of interpreting the black-box model and identifying an accurate and resource-efficient proactive control model for hospital environmental management compared to conventional approaches.
Regularizing disentangled representations with anatomical temporal consistency
2022, Biomedical Image Synthesis and Simulation: Methods and Applications
Deep neural networks have shown to be promising approaches for medical image analysis. However, their training is most effective when they learn robust data representations using large-scale annotated datasets, which are tedious to acquire in clinical practice. As medical annotations are often limited, there has been an increasing interest in making data representations robust in case of data lack. In particular, a spate of research focuses on constraining the learned representations to be interpretable and able to separate out, or disentangle, the data explanatory factors. This chapter discusses recent disentanglement frameworks, with a special focus on the image segmentation task. We build on a recent approach for disentanglement of cardiac medical images into disjoint patient anatomy and imaging modality dependent representations. We incorporate into the model a purposely designed architecture (which we term “temporal transformer”) which, from a given image and a time gap, can estimate anatomical representations of an image at a future time-point within the cardiac cycle of cine MRI. The transformer's role is to introduce a self-supervised objective to encourage the emergence of temporally coherent data representations. We show that such a regularization improves the quality of disentangled representations, ultimately increasing semi-supervised segmentation performance when annotations are scarce. Finally, we show that predicting future representations can be potentially used for image synthesis tasks.
One-shot learning of view-invariant object representations in newborn chicks
2020, Cognition
Citation Excerpt :
Unlike humans and commonly-used animal models in psychology (e.g., rats, pigeons, monkeys), newborn chicks are precocial, require no parental care, and can be raised in strictly controlled environments from the onset of vision (Fig. 1A). With chicks it is therefore possible to study how specific visual inputs shape the development of object recognition (Wood, 2016; Wood & Wood, 2016). Newborn chicks can also be observed and tested continuously (24/7) for long periods of time, using automated image-based tracking software.
Can newborn brains perform one-shot learning? To address this question, we reared newborn chicks in strictly controlled environments containing a single view of a single object, then tested their object recognition performance across 24 uniformly-spaced viewpoints. We found that chicks can build view-invariant object representations from a single view of an object: a case of one-shot learning in newborn brains. Chicks can also build the same view-invariant object representation from different views of an object, showing that newborn brains converge on common object representations from different sets of sensory inputs. Finally, by rearing chicks with larger numbers of object views, we found that chicks develop enhanced recognition for familiar views. These results illuminate the earliest stages of object recognition, revealing (1) powerful one-shot learning that builds invariant object representations from the first views of an object and (2) view-based learning that enriches object representations, producing enhanced recognition for familiar views.
Object permanence in newborn chicks is robust against opposing evidence
2024, arXiv
The Development of Object Recognition Requires Experience with the Surface Features of Objects
2024, Animals
Are Vision Transformers More Data Hungry Than Newborn Visual Systems?
2023, arXiv

View all citing articles on Scopus

View full text

Brief articleA smoothness constraint on the development of object recognition

Highlights

Abstract

Introduction

Section snippets

Subjects

Recognition performance

Discussion

Acknowledgements

Cognition

Neuron

Cognition

Current Biology

Cognition

Neuron

Vision Research

Vision Research

Progress in Neurobiology

Vision Research

Vision Research

‘Breaking’ position-invariant object recognition

Nature Neuroscience

Learning invariance from transformation sequences

Neural Computation

The ecological approach to visual perception

An automated controlled-rearing method for studying the origins of movement recognition in newly hatched chicks

Animal Cognition

Pathways of the past: The imprint of memory

Nature Reviews Neuroscience

Brief article
A smoothness constraint on the development of object recognition