A novel approach to investigate recursion and iteration in visual hierarchical processing

Martins, Maurício Dias; Martins, Isabel Pavão; Fitch, W. Tecumseh

doi:10.3758/s13428-015-0657-1

Published: 20 October 2015

Behavior Research Methods, Volume 48, pages 1421–1442 (2016)

Abstract

We describe a new method to explore recursive cognition in the visual domain. We define recursion as the ability to represent multiple hierarchical levels using the same rule, entailing the ability to generate new levels beyond those previously encountered. With this definition recursion can be distinguished from general hierarchical embedding. To investigate this recursion/hierarchy distinction in the visual domain, we developed two novel methods: The Visual Recursion Task (VRT), in which an inferred rule is used to represent new hierarchical levels, and the Embedded Iteration Task (EIT), in which additional elements are added to an existing hierarchical level. We found that adult humans can represent recursion in the visuo-spatial domain, and that this ability is distinct from both general intelligence and the ability to represent iterative processes embedded within hierarchical structures. Compared with embedded iteration, visual recursion correlated positively with other recursive planning tasks (Tower of Hanoi), but not with specific visuo-spatial resources (spatial short-term memory and working memory). We conclude that humans are able to use recursive representations to process complex visuo-spatial hierarchies and that our visual recursion task taps into specific cognitive resources. This method opens exciting opportunities to explore the relationship between visual recursion and language.

The capacity to understand and generate complex hierarchies is one of the most fascinating features of human cognition. In many domains, including language, music, problem-solving, action-sequencing, and spatial navigation, humans organize basic elements into higher-order groupings and structures (Badre, 2008; Chomsky, 1957; Hauser, Chomsky, & Fitch, 2002; Nardini, Jones, Bedford, & Braddick, 2008; Unterrainer & Owen, 2006; Wohlschlager, Gattis, & Bekkering, 2003). This ability to encode the relationship between basic elements (words, people, etc.) and the broader structures in which these are embedded (sentences, corporations, etc.), affords flexibility to human behavior. For example, in action sequencing, and unlike pure serial associative behavior, hierarchical representations allow the omission or modification of certain steps, without impairing the overall goal.

Here, we define hierarchies as non-cyclical tree-like organizations, where higher levels incorporate multiple lower levels in structural representations (Fitch & Martins, 2014), i.e., in which elements are embedded within other elements. This embedding can refer to the grouping of constituents within a higher order set, such as the grouping of individuals within a family (family = {ind1; ind2; ind3}), or it can refer to the establishment of asymmetrical dominance-subordination relationships between constituents, such as in social hierarchies (ind1 dominant over ind2, ind2 dominant over ind3, etc.).

Within the context of hierarchical processing, recursion is an interesting concept that has fascinated scholars in fields as diverse as mathematics, computer science, linguistics, and visual arts. Recursion is interesting because it allows the generation of structures that are both simple and complex at the same time. Recursive structures are complex because they can contain infinite hierarchical levels, and yet simple because this infinity can be achieved and represented using finite rules.

Recursion is a term that has been used to characterize the process of embedding a constituent inside another constituent of the same kind (Fitch, 2010; Hulst, 2010; Pinker & Jackendoff, 2005). Recursive processes can generate hierarchical structures that display similar properties across different levels of embedding. This feature, called self-similarity, is a signature of recursive structures. An example of a recursive linguistic structure is the compound noun “[[student] committee]”, where we find a noun phrase embedded inside another noun phrase. In contrast, a sentence containing a noun and a verb, such as “[[trees] grow]”, is hierarchical, but not recursive, because a constituent of one type (noun) is nested within a constituent of a different type (verb).

We can also find examples of recursive procedures generating visual hierarchies. For instance, fractals are structures that display self-similarity (Mandelbrot, 1977), that is, they appear similar when viewed at different scales (as in the famous Mandelbrot set). Fractals can be produced by simple rules that generate complex hierarchical structures when applied iteratively to their own output (Fig. 1).

Recently, recursion has become an important topic in cognitive science because the development of the human ability to represent recursion has been considered an important step in the evolution of language (Fitch, Hauser, & Chomsky, 2005; Hauser et al., 2002). In addition, recursion has been proposed to have evolved primarily within the linguistic domain, being accessible to other modalities (e.g., visual domain) only through language (Fitch et al., 2005; Hauser et al., 2002).^{Footnote 1} Other authors have also proposed that recursion might have evolved only in humans, and that recursive thinking is at the core of human cognitive exceptionality (Corballis, 2014). Testing these hypotheses has been difficult due to both theoretical and methodological limitations.

An empirically useful definition of recursion

Despite considerable agreement about the importance of recursion, many different definitions of recursion are in use (Chomsky, 2010; Corballis, 2007; Gentner, Fenn, Margoliash, & Nusbaum, 2006; Hofstadter, 1980; Kilpatrick, 1985; Odifreddi, 1999; Penrose, 1989) which has hindered consistent interpretation of empirical results (Fitch, 2010). On the one hand, it has proven to be particularly difficult to establish clear distinctions between recursion and similar processes such as hierarchical embedding and iteration (Hulst, 2010). On the other hand, it has not been clear which level of analysis (process, structure, or representation) is relevant for empirical research (Lobina, 2011, 2014; Martins, 2012).

Regarding the first theoretical difficulty, here we adopt a framework (Fitch, 2010; Martins, 2012) in which “iteration” refers to the process of repeating an operation a certain number of times. An iterative process may or may not generate hierarchical structures or create dependency relationships between different elements. For example, putting one marble at a time into a bag is an iterative process, but neither hierarchical nor recursive. In contrast, “hierarchical” structures always involve the embedding of elements within other elements. If the hierarchical embedding occurs between constituents of the same category (e.g., such as a noun phrase inside a noun phrase) we classify it as recursive, otherwise as non-recursive. Iteration, hierarchical embedding, and recursion are not mutually exclusive processes: in fact, recursion typically involves both hierarchy and iteration. Nevertheless, it is possible to segregate the cognitive abilities necessary to represent the kind of information that each of these processes encode (Fig. 2).
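The classification described above can be illustrated with a minimal sketch. The `Constituent` class and the `is_recursive` helper are hypothetical names introduced here for illustration only; they are not part of the published method, and the category labels are plain strings chosen to mirror the examples in the text.

```python
from dataclasses import dataclass, field
from typing import FrozenSet, List


@dataclass
class Constituent:
    """A node in a hierarchy: a category label plus embedded constituents."""
    category: str
    children: List["Constituent"] = field(default_factory=list)


def is_recursive(node: Constituent, ancestors: FrozenSet[str] = frozenset()) -> bool:
    """True if a constituent is embedded inside a constituent of the same category."""
    if node.category in ancestors:
        return True
    enclosing = ancestors | {node.category}
    return any(is_recursive(child, enclosing) for child in node.children)


# Iteration without hierarchy: marbles added to a bag one at a time (a flat list).
bag = []
for _ in range(3):
    bag.append("marble")

# Hierarchical and recursive: a noun phrase embedded inside a noun phrase.
student_committee = Constituent("NP", [Constituent("NP", [Constituent("N")]),
                                       Constituent("N")])

# Hierarchical but not recursive: a noun nested in a constituent of another type.
trees_grow = Constituent("V", [Constituent("N")])

print(is_recursive(student_committee), is_recursive(trees_grow))
```

In this toy model, hierarchy corresponds to nesting of `Constituent` objects, iteration to the repeated `append`, and recursion to nesting in which a category reappears inside itself.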

The second theoretical difficulty is to define the level of analysis useful for empirical enquiries. Recursion can be defined either as a “procedure that calls itself” or as the property of “constituents that contain constituents of the same kind” (Fitch, 2010; Pinker & Jackendoff, 2005). Frequently, we find an isomorphism between procedure and structure, i.e., recursive processes often generate recursive structures. However, this isomorphism does not always occur (Lobina, 2011; Luuk & Luuk, 2010; Martins, 2012). In this manuscript we explicitly focus on a third level of analysis, which is the level of representation. We focus on detecting what kind of information individuals can represent, rather than on how this information is implemented algorithmically.

Encoding iteration requires the ability to represent the repetition of a certain process, for instance the repeated addition of elements to a structure. Encoding hierarchical embedding requires the ability to represent dependency or grouping relationships between constituents at multiple levels. Encoding recursive embedding requires the ability to represent similarities across hierarchical levels (self-similarity). Specifically, that the way contiguous levels relate to each other within a hierarchy is similar across different levels. Recursion enables the generation of new hierarchical levels beyond those previously experienced, maintaining consistency with existing levels at a higher level of abstraction. It is important to retain the notion that a certain hierarchy can be represented both recursively and non-recursively. For instance, in Fig. 3, a certain visual hierarchy can be generated using either process (a) or process (b). The second mode of representation is recursive, and allows the generation of an infinite number of new hierarchical levels, using one simple rule. This capacity to generalize common hierarchical principles across levels and to generate new levels beyond the given is a specific behavioral signature of recursive cognition.

Finally, although there is evidence suggesting humans can represent recursion in language, the question of whether we can represent this concept in other domains (for example, in vision) has not been addressed empirically. This omission has been caused by a lack of methods to test for the ability to represent recursion in non-linguistic domains. Here we address this methodological limitation by presenting a novel method that can be used to test recursion in vision. In particular, in this paper we evaluate our novel method in a variety of conditions to ensure that it taps into a specific cognitive construct (recursion) which is not completely explained by other, more general processes (such as intelligence, iterative reasoning, working memory, entropy analysis, and low-frequency spatial heuristics).

Hierarchical processing in the visual domain

The processing of hierarchies in the visual domain has been explored in the context of attention to local versus global information (Fink et al., 1996; Fitch, 2010). In particular, it is interesting that while the proper processing of hierarchies involves the integration of global and local information, there are several conditions in which individuals are biased to focus on the local information only. For instance, while attending to a big square composed of small circles, young children tend to identify the small circles faster and more easily than the big square (Harrison & Stiles, 2009; Poirel, Mellet, Houdé, & Pineau, 2008). This local-oriented strategy to process hierarchical stimuli is similar to that seen in non-human primates (Fagot & Tomonaga, 1999; Spinozzi, De Lillo, & Truppa, 2003). Conversely, in human adults a global bias develops, in which global aspects of hierarchical structures are processed first, and where the contents of global information interfere with the processing of local information (Bouvet, Rousset, Valdois, & Donnadieu, 2011; Hopkins & Washburn, 2002). This global search strategy can be reversed if adults are asked to process novel or unfamiliar structures (Hasselmo & Stern, 2006).

Recent research within our laboratory suggests that visual fractals might also be processed using different strategies, depending on whether recursive or non-recursive representations are primed (Martins, Fischmeister, et al., 2014; Martins, Laaha, Freiberger, Choi, & Fitch, 2014). Not only are specific neural systems active during recursive representations (Martins, Fischmeister, et al., 2014), but there also seems to be a change in visual processing strategies that correlates with ontogenetic development, and with the amount of exposure to examples of fractals (Martins, Laaha, et al., 2014). How these strategies relate to local or global biases is an exciting topic of ongoing research.

Another issue of great interest here concerns the availability of representation modes that allow compression of information. More abstract and global-oriented strategies to represent visuo-spatial information seem to be more efficient because they allow the compression, or reduction, of the information required to be kept online (Alvarez, 2011). In computer science, fractal strategies have also been shown to be efficient in the representation of complex hierarchies, precisely by compressing the amount of information (Koike & Yoshihara, 1993). This discussion motivates the prediction that recursive modes of representation are more abstract and lead to better compression of information.

Current study

In the current study, we introduce and explore a new paradigm, focusing specifically on recursion capabilities in the visual domain using fractal images. Because fractals exhibit hierarchical self-similarity, new hierarchical levels can be predicted by generalizing production rules and projecting them to further levels. Our goals are (1) to create and validate a new task that (2) allows us to distinguish between iterative, hierarchical, and recursive processes, and (3) lets us learn about the representation of recursion.

We present a series of experiments designed to validate empirically this novel task, forming the basis for further research.

In Experiment 1 we show that humans use recursion in the visual domain; in Experiment 2 we demonstrate that our Visual Recursion Task (VRT) taps into specific cognitive resources when contrasted with general intelligence, spatial working memory, and a control Embedded Iteration Task (EIT); in Experiment 3 we replicate the first two experiments introducing a number of important controls; and in Experiment 4 we compare our new recursive task with another task that invites recursive strategies – the Tower of Hanoi (Goel & Grafman, 1995) – confirming and expanding the evidence that VRT taps into cognitive resources specific for recursion.

Experiment 1: Response paradigm and esthetic biases

In Experiment 1 we tested whether adult humans are able to make inferences about recursive embedding in the visuo-spatial domain. This hypothesis would be supported by above-chance accuracy in our VRT.

In this task, participants are exposed to the first three steps of a process generating a visual fractal, and then asked to discriminate, from two possible alternatives, which is the correct continuation (see details below).

Since we were interested in exploring how participants would approach visual recursion, we gave minimal instructions and did not restrict response time. We assessed the strategies that participants reported after completing the task, and tested whether certain cognitive strategies led to better performance. We also evaluated the effects of the particular response paradigm (binary forced-choice) and subjective esthetic preferences on individuals’ accuracy by (1) adding a second response task (1-alternative forced-choice: correct/incorrect), and (2) testing whether an esthetic preference for self-similar fractals could account for participants’ choices, regardless of their ability to represent recursion. If participants were simply following an esthetic preference for well-formed fractals, this would argue against our assumption that a cognitive strategy was employed rather than simple visual heuristics.

Methods

Participants

We tested 20 volunteers (undergraduates and PhD students; 14 females and six males) aged between 20 and 44 years (M = 28.1, SD = 6) recruited at the University of Vienna. All participants were tested using the same experimental apparatus, and all reported normal or corrected-to-normal visual acuity. All participants gave their prior written consent, and were not paid for taking part. The research conformed to institutional guidelines and Austrian national legislation regarding ethics.

Stimuli and procedure

Stimulus generation

We based the VRT on the well-established properties of fractal geometry (Mandelbrot, 1977). Visual fractals can be generated from single constituents such as lines, squares, or triangles (the initiators) by applying a simple transformation rule (the generator) a given number of times (iterations). The structures generated by iterating this process are hierarchical and self-similar (see Fig. 4 for a schematic overview).

We produced four successive iterations of 60 different types of fractals, generated using Python code running in Nodebox (version 1.9.5, http://nodebox.net), a visual interface. For each of these 60 fractals, we produced (1) a correct fourth continuation of the first three iterative steps, and (2) an incorrect continuation as a Foil. This incorrect fourth iteration was produced by applying a different generator to the third stage, and had the same number and size of constituents as the correct fourth iteration.
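The relationship between a correct fourth iteration and its foil can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' Nodebox code: the square-based geometry, the names `corner_generator`, `edge_generator`, and `iterate`, and the convention that the initiator counts as iteration 1 are all hypothetical choices made here.

```python
from typing import Callable, List, Tuple

Square = Tuple[float, float, float]  # centre x, centre y, side length (assumed encoding)
Generator = Callable[[Square], List[Square]]


def corner_generator(sq: Square) -> List[Square]:
    """Rule used throughout the sequence: four half-size squares at the corners."""
    x, y, s = sq
    h = s / 2
    return [(x + dx * h, y + dy * h, s / 2) for dx in (-1, 1) for dy in (-1, 1)]


def edge_generator(sq: Square) -> List[Square]:
    """Different rule for the foil: four half-size squares at the edge midpoints."""
    x, y, s = sq
    h = s / 2
    offsets = ((-1, 0), (1, 0), (0, -1), (0, 1))
    return [(x + dx * h, y + dy * h, s / 2) for dx, dy in offsets]


def iterate(levels: List[List[Square]], generator: Generator) -> List[List[Square]]:
    """Return the hierarchy extended by one level derived from the newest level."""
    new_level = [child for sq in levels[-1] for child in generator(sq)]
    return levels + [new_level]


initiator = [[(0.0, 0.0, 1.0)]]                      # iteration 1 (assumed convention)
third = iterate(iterate(initiator, corner_generator), corner_generator)
correct_fourth = iterate(third, corner_generator)    # same rule continued
foil_fourth = iterate(third, edge_generator)         # different rule at the last step

print(len(correct_fourth[-1]), len(foil_fourth[-1]))
```

Because both generators emit the same number of children at the same scale, the foil matches the correct image in number and size of constituents and differs only in how the new level relates to the previous one.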

The fractals produced for this task can be divided into four broad categories (see Fig. 5 for examples): (1) Polygons (n = 32), (2) trees (n = 9), (3) curves (n = 11), and (4) Koch snowflakes (n = 8). Peano curves and Koch snowflakes were produced using Lindenmayer systems (Lindenmayer, 1968). In these systems, the recursive process substitutes each constituent with a set of new constituents without preserving the initiator across iterations. The other two categories of fractals were produced with custom Nodebox scripts.
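The substitution behaviour of a Lindenmayer system can be sketched with a few lines of string rewriting. The rule below is the standard textbook rule for the Koch curve (with 60° turns); it is given as an illustration and is not claimed to be the rule set used for the published stimuli. The function name `lsystem` is hypothetical.

```python
from typing import Dict


def lsystem(axiom: str, rules: Dict[str, str], iterations: int) -> str:
    """Rewrite every symbol in parallel, `iterations` times."""
    state = axiom
    for _ in range(iterations):
        # Symbols without a rule (here "+" and "-") are copied unchanged.
        state = "".join(rules.get(symbol, symbol) for symbol in state)
    return state


# "F" = draw a segment, "+" = turn left 60 degrees, "-" = turn right 60 degrees.
KOCH_RULES = {"F": "F+F--F+F"}
SNOWFLAKE_AXIOM = "F--F--F"  # an equilateral triangle as initiator

for n in range(3):
    segments = lsystem(SNOWFLAKE_AXIOM, KOCH_RULES, n).count("F")
    print(n, segments)
```

Every segment is replaced by four shorter ones at each step, so no segment of the initiator survives unchanged, which is the sense in which these systems do not preserve the initiator across iterations.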

Visual Recursion Task (VRT) 2-choice

The three iterations and two test images were arranged on a panel (Fig. 6). Each panel depicted five images, presented simultaneously, arranged in two rows: The first three iterations of each fractal (“sequence” images) were shown in the top row and two alternatives for the fourth iteration (“correct” vs. “incorrect” fourth iteration, henceforth “choice” images) were shown in the bottom row. The position of the choice images (left or right) was randomized. The sequence of panels was presented on a computer screen in a randomized order, which was different for each participant, using custom Python software (version 2.6, www.python.org).
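The two randomizations described here (panel order per participant, and left/right position of the correct image) could be implemented along the following lines. This is a sketch only: the function name `build_session`, the dictionary keys, and the use of a per-participant seed are assumptions, not details taken from the authors' presentation software.

```python
import random
from typing import Dict, Iterable, List, Optional


def build_session(fractal_ids: Iterable[int], seed: Optional[int] = None) -> List[Dict]:
    """Return one participant's trial list with shuffled order and random sides."""
    rng = random.Random(seed)          # a separate generator per participant
    order = list(fractal_ids)
    rng.shuffle(order)                 # panel order differs between participants
    return [
        {"fractal": fid, "correct_side": rng.choice(("left", "right"))}
        for fid in order
    ]


trials = build_session(range(20), seed=1)
print(trials[0])
```

Using an explicit seed is one possible design choice that makes a given participant's sequence reproducible for later analysis.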

Participants were instructed in English to select the image they considered correct from the two “choice” images in the bottom row and to “try to understand the right strategy and to choose correctly as often as you can.” No further explanation on what “correct” meant was provided.

Participants responded by pressing one of two buttons on a button box (ioLab Systems), corresponding to the position of the correct image (left or right). Auditory and visual feedback was given for all trials. After an incorrect choice, the screen turned red for 1.5 s and a negative feedback sound (frequency 98.0 Hz and duration 1.5 s) was played. After a correct choice, the screen turned white for 1 s and a positive feedback sound (frequency 348.7 Hz, duration 1 s) was played. The sounds were played through Sennheiser HC 520 headphones. There was a 2-s inter-trial interval. There was no time limit per trial (timeout) because we did not want to constrain participants’ strategies, and because we were interested in knowing how they would naturally approach the tasks when given minimal instructions.

Before the VRT began, participants were given a short training session of five trials. The training stimuli were similar to the VRT panels, except that the sequence of images was generated according to a simple non-hierarchical iterative rule (see Fig. 7).

Visual Recursion Task (VRT) 1-choice

In order to evaluate possible performance effects associated with a binary forced choice paradigm, we designed a VRT 1-choice task. This task was identical in all aspects to the basic VRT 2-choice, except that only one image was presented in the center of the second row of each panel, corresponding to either the correct or incorrect fourth iteration (Fig. 8). Participants were instructed to choose whether the image in the lower row was correct (right button) or incorrect (left button). The same number (n = 10) of correct and incorrect fourth iterations was presented.

Before the beginning of the task, the same five training stimuli were presented as in VRT 2-choice, but with only one “choice” image. Feedback and inter-stimuli intervals were the same as in the VRT 2-choice task.

Esthetic preference task

This task was designed to assess the effects of possible preference biases in VRT 2-choice. Here, only the “choice” images (“correct” and “incorrect” fourth iteration) were presented on the screen (Fig. 9) with no previous “sequence” images. Participants were asked to simply select the image they preferred. No auditory or visual feedback was given.

Procedure

All participants began the experiment with the preference task. Participants then performed both recursion tasks in one of two possible orders: ten participants completed VRT 1-choice before VRT 2-choice (“1–2” condition), and ten participants performed VRT 2-choice before VRT 1-choice (“2–1” condition). Participants were randomly assigned to one of the two orders.

The same pool of 60 fractals was used in all tasks, with 20 fractals randomly assigned to each of the three tasks. The distribution of fractal classes was balanced for all tasks and each fractal appeared only once in each experimental session.
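One way to obtain such a balanced random assignment is to shuffle the fractals within each category and deal them out to the three tasks in rotation. The text does not specify the actual procedure, so the sketch below is only one possible implementation; the names `CATEGORY_SIZES`, `TASKS`, and `assign_fractals` are hypothetical, and the category sizes are those reported for the stimulus set.

```python
import random
from collections import Counter
from typing import Dict, List, Optional, Tuple

CATEGORY_SIZES = {"polygon": 32, "tree": 9, "curve": 11, "snowflake": 8}
TASKS = ("preference", "vrt_1_choice", "vrt_2_choice")


def assign_fractals(seed: Optional[int] = None) -> Dict[str, List[Tuple[str, int]]]:
    """Deal shuffled fractals to tasks in rotation, category by category."""
    rng = random.Random(seed)
    tasks = list(TASKS)
    rng.shuffle(tasks)                 # which task receives the odd extras varies
    assignment = {task: [] for task in tasks}
    position = 0                       # running counter carried across categories
    for category, size in CATEGORY_SIZES.items():
        items = [(category, index) for index in range(size)]
        rng.shuffle(items)
        for item in items:
            assignment[tasks[position % len(tasks)]].append(item)
            position += 1
    return assignment


session = assign_fractals(seed=7)
for task, items in session.items():
    print(task, len(items), dict(Counter(category for category, _ in items)))
```

Carrying the counter across categories gives every task exactly 20 fractals, while the number of fractals from any one category differs between tasks by at most one.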

Participants’ choices and reaction times (RTs; in milliseconds) were recorded for all stimuli and for all tasks. Performance was calculated as the percentage of correct answers. In the preference task, we recorded as “correct” answers the occurrences where the preferred image corresponded to the well-formed fractal, i.e., to the correct fourth iteration. At the end of each task, participants were asked to assess the kind of strategy they had used on a five-point scale. The scale of possible strategies was: 1 – “mostly intuitive”; 2 – “more intuitive than analytic”; 3 – “mixed”; 4 – “more analytic than intuitive”; 5 – “mostly analytic.” Intuitive answers were described to the participant as being based on a gut feeling and analytic answers as being derived by looking carefully at the details and making explicit inferences.

Analysis

The proportion of correct responses and RTs were compared between (1) VRT 2-choice and VRT 1-choice and (2) VRT 2-choice and preference task. We used a semiparametric regression technique called Generalized Estimating Equations (GEE), a technique useful when analyzing binomial data with within-subjects effects (Hanley, 2003). When applied to binary data, this technique is similar to a logistic regression and in comparison with generalized mixed models is more robust to deviations from error distribution assumptions, and model misspecifications (Ghisletta & Spini, 2004; Hubbard et al., 2010). We also used this model to assess accuracy differences between stimuli categories, and RT differences between tasks (using gamma with a log link function). To assess whether performance was above chance at the group level, for each task, we tested whether GEE models’ intercepts were significantly different from zero.
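The logic of testing the intercept against zero can be made concrete with a small sketch. In an intercept-only logistic model the intercept is the log-odds of a correct response, and a log-odds of zero corresponds to a probability of .5, which is chance level in a task with two response options. The helper names below are hypothetical, and the sketch deliberately ignores the within-subject correlation structure that the GEE models account for; it shows only the link function, not the fitted models.

```python
import math


def log_odds(p: float) -> float:
    """Logit link: probability of a correct response -> log-odds."""
    return math.log(p / (1 - p))


def inv_logit(b: float) -> float:
    """Inverse link: model intercept (log-odds) -> probability."""
    return 1 / (1 + math.exp(-b))


chance_intercept = log_odds(0.5)      # chance level maps to an intercept of zero
above_chance = log_odds(0.84)         # any accuracy above .5 gives a positive value

print(chance_intercept, inv_logit(0.0), above_chance > 0)
```

An intercept reliably greater than zero therefore indicates group-level accuracy above 50 %.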

Furthermore, we assessed performance correlations between these tasks. For percentages of correct responses and RTs we tested if the data were normally distributed using the Kolmogorov-Smirnov (K-S) test. If variables were continuous and normally distributed we used Pearson’s bivariate correlations, otherwise we used non-parametric Spearman correlations.

All statistical analyses were performed using SPSS 19 (IBM).

Results

Performance

On average, participants scored 84 % (SD = 12) correct in VRT 2-choice and 70 % (SD = 14) correct in VRT 1-choice (Fig. 10). In the preference task, the “correct” image was preferred in 58 % (SD = 11) of the trials. To assess whether average response was above chance, we ran a GEE model for each task, with “trial” (1–20) as the within-subjects variable. All intercepts differed significantly from zero (all p < .05), meaning accuracy was above chance in all tasks, at the group level. To assess whether there were differences between tasks, while controlling for task order, we ran a binary logistic GEE model. We found a significant effect of task (generalized chi-square = 16.5, p < .001), but no effect of task order (p = .15) and no interaction between the two factors (p = .2). Pairwise comparisons with a Bonferroni p-value adjustment showed that performance was significantly lower in VRT 1-choice than in VRT 2-choice (p < .001, odds ratio = 0.8); and higher in VRT 2-choice than in the preference task (p < .001, odds ratio = 0.7).

Analyzed by participant, the percentage of correct responses in VRT 2-choice was correlated with performance in VRT 1-choice (r = .57, p = .009), but not with the preference task (r = .27, p = .24). This correlation between VRT 1-choice and VRT 2-choice was significant in the group of participants that started the procedure with VRT 2-choice (n = 10; r = .797, p = .006), but not in the group that started with VRT 1-choice (n = 10; r = .260, p = .469).

Reaction time

On average, RT was 12.5 s (SD = 1) in VRT 1-choice, 12.2 s (SD = 7) in VRT 2-choice, and 5.3 s (SD = 3) in the preference task (Fig. 11). To assess whether there were differences between tasks, while controlling for task order, we ran a gamma log link GEE model. There was an effect of task (generalized chi-square score = 11.4, p = .003), but not of task-order (p = .5), and no interaction between the two factors (p = .7). Specifically, we found a difference between VRT 2-choice and preference task (mean difference = 7 s, p < .001) but not between VRT 2-choice and VRT 1-choice (p = .8).

Strategy

At the end of each task, we asked our participants about the strategy they used. In general, participants reported a more intuitive strategy for the preference task (M = 2.45, SD = .9) and a more analytic strategy in both VRT 1-choice (M = 4.2, SD = .8) and VRT 2-choice (M = 4.0, SD = 1.2). Interestingly, participants who reported a more analytic strategy in VRT 2-choice also had longer RTs (Spearman’s ρ = .485, p = .03) and a higher percentage of correct answers (Spearman’s ρ = .585, p = .007) than those participants who reported intuitive strategies. This suggests that an analytic rather than an intuitive strategy was optimal for the VRT.

Esthetic preferences

Another important issue was whether the decision between the choice images in the 2-choice condition was influenced by esthetic preferences. Given that the same 120 images were part of the pool of possible choices in VRT 2-choice and preference task, we assessed the frequency with which each image was chosen in both tasks (i.e., for each image, we counted the number of times it was chosen in VRT 2-choice and preference task). We found that these frequencies were not correlated (r = .027; p = .838), meaning that the images chosen more frequently in VRT 2-choice were not the images more frequently chosen in the preference task, suggesting that esthetic preferences could not account for above-chance performance in the recursion task.

Discussion

Our results suggest that human adults can quickly learn how to use recursive information in the visual domain without being explicitly trained or instructed about the concept of recursion. Moreover, a self-reported analytic strategy was associated with longer RTs, and significantly correlated with better performance. Although response feedback was provided during the task, participants were required to respond to a wide variety of stimuli, with different visual and structural features. Structural recursion was the common element among these stimuli and most likely this abstract regularity was transferred across trials. We propose that the ability to represent structural self-similarity in the visual domain was a necessary condition for good performance in this experiment, regardless of how this information was represented.

Given that VRT performance could be influenced by the response paradigm used as well as by esthetic biases in favor of (or against) self-similar fractals, we included three tasks: two recursive tasks (VRT 2-choice and VRT 1-choice) and a preference task. Our findings rule out an effect of esthetic preferences on performance in VRT, suggesting that subjects do not use preferences as decision heuristics, and demonstrate that both versions of the recursive task were similar to each other: (1) Percentages of correct responses in VRT 2-choice and VRT 1-choice were correlated. (2) RTs and self-reported strategy were similar in these tasks but differed significantly from the preference task. (3) Images preferred in the preference task were not the images more frequently chosen as “correct” in the VRT 2-choice condition.

However, there was a significant performance difference between VRT 1-choice and VRT 2-choice, depending on task order: Performance in the two tasks only correlated when VRT 1-choice was performed after VRT 2-choice (reaching a correlation coefficient as high as 0.8). It seems that when VRT 2-choice was performed first in the presence of correct and incorrect information, participants learned to attend more closely to the relevant image details, thereby increasing their accuracy in VRT 1-choice afterwards. This might imply that the ability to process recursion is influenced by the ability to orient attention to the relevant features of the stimuli, and that poor performance in such a task is not necessarily due to an inability to process recursion, but may arise from incorrectly focussed visual attention. This interpretation is consistent with findings in developmental data, in which young children fail in recursion due to inefficient visual strategies (Martins, Laaha, et al., 2014). Crucially, after being primed to attend to the relevant features of the stimuli, participants were well able to perform in VRT 1-choice (mean accuracy 76 %), showing that the comparison between two choice images was not strictly necessary to discriminate between correct and incorrect continuations of the recursive process. This argues against a heuristic response strategy purely based on the comparison between choice images.

Experiment 2: Recursive versus non-recursive iteration

Experiment 1 suggested that human adults are able to represent visual recursion successfully. However, it remains an open question whether the VRT measures something specific to recursion, or instead taps into a more general ability to extract visual regularities. In Experiment 2, we attempted to gain more specific insight into the cognitive processes underlying VRT. We devised an Embedded Iteration Task (EIT) as a control task, which shared the “hierarchicality” and iteration features of VRT, but lacked recursive embedding. We compared participants’ accuracy in both VRT and EIT with a standardized measure of rule-based visual cognition (Matrix Reasoning from WASI®, see below). Here we wanted to test whether visual recursion, as measured by our task, can be dissociated from other visuo-spatial hierarchical tasks (EIT) and general visual intelligence capacity (WASI). For this purpose we used correlation and regression analyses. Although exploratory, we tested the general hypothesis that VRT would not be highly correlated with general intelligence, and that VRT and EIT would correlate with different cognitive abilities. These findings would provide support for the existence of variance within the performance of VRT that is explained by specific resources recruited in the instantiation of recursive representations.

To produce EIT images, an iterative process embedded additional elements within a pre-existing hierarchical structure, without producing new hierarchical levels (Fig. 12). To empirically validate the distinction between recursion and iteration we first assessed the behavioral response profile for both tasks. Furthermore, we tested whether different cognitive abilities (fluid intelligence and working memory) predicted accuracy in solving the two tasks.
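The structural difference between the two tasks can be sketched with a toy hierarchy stored as a list of levels. This is an illustration under simplifying assumptions: the dotted string labels, the branching factor of two, and the function names `recursive_step` and `embedded_iteration_step` are hypothetical and do not reproduce the actual stimuli.

```python
from typing import List

Hierarchy = List[List[str]]  # one list of element labels per hierarchical level


def recursive_step(levels: Hierarchy) -> Hierarchy:
    """VRT-like step: derive a new, deeper level from the newest one."""
    new_level = [f"{parent}.{k}" for parent in levels[-1] for k in range(2)]
    return levels + [new_level]


def embedded_iteration_step(levels: Hierarchy) -> Hierarchy:
    """EIT-like step: add one element under each parent at the existing deepest level."""
    *upper, deepest = levels
    added = []
    for parent in upper[-1]:
        siblings = sum(label.startswith(parent + ".") for label in deepest)
        added.append(f"{parent}.{siblings}")   # next free index under this parent
    return upper + [deepest + added]


start = [["a"], ["a.0", "a.1"], ["a.0.0", "a.0.1", "a.1.0", "a.1.1"]]

deeper = recursive_step(start)            # one more level
wider = embedded_iteration_step(start)    # same depth, more elements

print(len(deeper), len(wider), len(wider[-1]))
```

Both steps are iterative and both operate on a hierarchy, but only the recursive step increases the number of levels; the embedded iteration step leaves the depth unchanged.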