Multiplex Networks of Cortical and Hippocampal Neurons Revealed at Different Timescales

Recent studies have emphasized the importance of multiplex networks – interdependent networks with shared nodes and different types of connections – in systems primarily outside of neuroscience. Though the multiplex properties of networks are frequently not considered, most networks are actually multiplex networks and the multiplex specific features of networks can greatly affect network behavior (e.g. fault tolerance). Thus, the study of networks of neurons could potentially be greatly enhanced using a multiplex perspective. Given the wide range of temporally dependent rhythms and phenomena present in neural systems, we chose to examine multiplex networks of individual neurons with time scale dependent connections. To study these networks, we used transfer entropy – an information theoretic quantity that can be used to measure linear and nonlinear interactions – to systematically measure the connectivity between individual neurons at different time scales in cortical and hippocampal slice cultures. We recorded the spiking activity of almost 12,000 neurons across 60 tissue samples using a 512-electrode array with 60 micrometer inter-electrode spacing and 50 microsecond temporal resolution. To the best of our knowledge, this preparation and recording method represents a superior combination of number of recorded neurons and temporal and spatial recording resolutions to any currently available in vivo system. We found that highly connected neurons (“hubs”) were localized to certain time scales, which, we hypothesize, increases the fault tolerance of the network. Conversely, a large proportion of non-hub neurons were not localized to certain time scales. In addition, we found that long and short time scale connectivity was uncorrelated. Finally, we found that long time scale networks were significantly less modular and more disassortative than short time scale networks in both tissue types. As far as we are aware, this analysis represents the first systematic study of temporally dependent multiplex networks among individual neurons.

was uncorrelated. Finally, we found that long time scale networks were significantly less modular and more disassortative than short time scale networks in both tissue types. As far as we are aware, this analysis represents the first systematic study of temporally dependent multiplex networks among individual neurons.

Introduction
Understanding how large groups of neurons process and represent information in neural systems is a fundamental problem of neuroscience. One popular avenue to investigate the behaviors of large populations of neural sources is to analyze their connectivity [1][2][3]. Traditionally, these analyses have focused on individual networks that contain only one type of connection. However, recent work has shown the importance of interdependent networks [4][5][6][7][8][9][10][11][12][13][14][15]. These ''multiplex networks'' consist of multiple interdependent networks that share common nodes and possess different types of connections. In applications outside neuroscience, these previous studies frequently focused on the resilience properties of multiplex networks and on the properties of random multiplex networks.
In neuroscience applications, though rarely studied explicitly (see [13] as an exception), the multiplex properties of networks have often been examined in the context of comparing different types of connectivity. Neural connectivity has traditionally been conceptualized in three ways [16,17]: All three types of connectivity have been widely studied in the literature (see [1][2][3][18][19][20][21][22][23] for reviews). These types of connectivity form multiplex networks because they represent different connection types linking shared nodes. We are aware of only one study that explicitly researched multiplex networks of this type in neural systems [13], and that study was conducted on the level of brain region connectivity. Other studies have implicitly examined these multiplex networks on the level of brain region connectivity [22][23][24][25][26][27] and at the cellular level [28][29][30][31]. Typically, these studies have focused on the ability of one type of connectivity to predict another and what features, if any, of one type of connectivity are not represented in another type of connectivity.
While the investigation of multiplex networks in terms of physical, functional, and effective connectivity is certainly of great interest, we felt it would be productive to examine multiplex networks in the brain from a different point of view. The brain exhibits a large repertoire of neural phenomena over a wide range of time scales (e.g. EEG rhythms, action potentials, local field potentials, hemodynamic response, etc.). It has been argued that isolating phenomena at specific time scales (e.g. oscillations at different frequencies) and understanding their interactions are important to understanding how the brain integrates information [32][33][34][35][36][37][38][39]. Based on the existence of these phenomena, we chose to examine multiplex networks of individual neurons with time scale dependent connections. The neural phenomena listed above are constituted within or associated with the electrical activities of neural sources, and the time scales associated with physical connectivity between neurons are significantly shorter (less than ,20 ms) than the time scales associated with many of these neural phenomena. Thus, to measure the interactions between neurons at the time scales associated with these neural phenomena, we chose to measure the effective connectivity between neurons at different time scales.
Previous studies have discussed the temporal multiplex properties of signals in the brain (e.g. multiple coding modalities) (see [34,36,37] for reviews). In terms of network connectivity, other studies have implicitly examined multiplex networks with time scale dependent functional connections ( [40][41][42][43][44] for example) and effective connections ( [27,45] for example) at the level of brain regions. However, we are aware of only two other studies -both of which were conducted in vitro -that examined networks with time scale dependent connectivity at the cellular level [46][47][48]. Though these works implicitly examined multiplex networks, both studies treated networks at different time scales as distinct with essentially independent nodes and only one type of connection. In other words, these studies did not examine the uniquely multiplex properties of the networks under study, as we have done in this analysis.
In this work, we chose to use Transfer Entropy (TE) [49] to measure the time scale dependent effective connections between neurons. TE has been widely used in neural systems [45,47,[50][51][52][53][54][55][56][57][58][59][60][61][62] and neural models [27,63,64]. TE measures how the state of one unit was changed or affected by the state of another unit. In our analysis, we used TE to determine the effective connectivity between individual neurons over isolated time scales ranging from sub-millisecond to seconds. Next, using graph theoretic methods [3,41,57,58,[65][66][67][68][69][70][71][72], we analyzed the topology of the resulting networks. ''Network topology'' generally refers to the way in which the nodes (neurons in our case) in a network are connected. We used various network topology analysis tools to study the time scale dependent networks. In order to obtain the highest resolution electrophysiological recordings of as many individual neurons as possible, we used a state-of-the-art 512-electrode array to simultaneously record the spontaneous spiking activity of hundreds of neurons in organotypic cultures. To the best of our knowledge, this preparation and recording method represents a superior combination of number of recorded neurons and temporal and spatial recording resolutions to any currently available in vivo system.
We found several interesting results related to the topology of the measured effective connectivity networks. First, we found that long time scale connections are independent of short time scale connections. In other words, we found that the existence of a short time scale connection between two neurons was uncorrelated with the existence of a long time scale connection between those neurons. Second, we found that highly connected neurons (so called ''hubs'') tended to be isolated to specific time scales. Conversely, we found that non-highly connected neurons (''nonhubs'') did not tend to be isolated to a specific time scale. These results represent the first evaluation of truly multiplex network properties at the neuron level and the hub result may have significant implications for fault tolerance in neural networks [4,11]. Third, we found network topology to be time scale dependent. For instance, the physical distance between connected nodes was shown to increase as the time scale lengthened and the networks were shown to become more disassortative and less modular as the time scale lengthened. Fourth, in some respects and at some time scales, hippocampal and cortical networks were shown to be significantly different, while in other respects and at different time scales they were very similar. For instance, cortical networks where shown to be significantly more assortative than hippocampal networks for short time scales, but the two tissue types were similarly disassortative for longer time scales. These results represent the first systematic examination of time scale dependent multiplex networks of individual neurons and they indicate that these time scale dependent networks potentially differ by function and brain region.
The results in this paper were presented in earlier versions at two conferences [73,74].

Electrophysiological Properties
We performed 25 cortical and 35 hippocampal recordings from mouse corticohippocampal organotypic cultures using a 512-electrode array with a sampling rate of 20 kHz. These recordings were pre-processed and spike sorted to yield spike times for each neuron (see Materials and Methods). The spiking activity of the neurons was dominated by bursts of activity ( Fig. 1 A and S1 Fig). The neurons in the cortical recordings had a mean firing rate of 2.10 Hz, while the neurons in the hippocampal recordings had a mean firing rate of 1.21 Hz (Fig. 1  B). On average, we found 309.4 spike sorted neurons in each cortical recording (7735 total neurons) and 120.4 spike sorted neurons in each hippocampal recording (4214 total neurons) ( Fig. 1 C). The cortical recordings yielded significantly more neurons, perhaps because the hippocampal tissues were physically smaller in size.

Transfer Entropy Analysis and Connectivity Properties
After obtaining the spike times for the neurons, we used transfer entropy (TE) to measure the effective connectivity between individual neurons over 10 isolated time scales ranging from sub-millisecond to seconds ( Fig. 1 E, see Materials and Methods). Since this study was relatively exploratory, we chose these 10 time scales to logarithmically span many time scales relevant for neural phenomena. For instance, the analysis was able to capture interactions on the time scale visible to fMRI (,1 second), as well as interactions on the time scale of individual action potentials (,1 millisecond). This allowed us to compare the network structure on the fMRI time scale to the network structure on the individual action potential time scale. We would like to emphasize that other time scales could be chosen for other applications. The number of viable data sets or recordings. Data sets were deemed viable if they produced sub-networks with at least 50 neurons and a given average degree (k). We used sub-networks with k53 throughout the analysis. (E) Time resolutions for the 10 discrete time scales used in this analysis in comparison to the approximate time scales for various neurological phenomena and measurement methods. Note that some measurement methods (e.g. MEG and EEG (forms of electrophysiology), as well as fMRI) are not capable of recording the activity of individual neurons, unlike calcium imaging or cellular electrophysiology (as was used in this study). The analysis time scales were chosen to logarithmically span many neurological time scales and they allowed us to compare network structure on this wide range of time scales. Note that the analysis time scales overlap to ensure that all phenomena are adequately measured.
To evaluate which connections were significant, we compared the TE value found in the data to the TE values obtained from 5000 jittered spike time series. The jittering procedure consisted of randomly altering the time of each spike by a small amount proportional to the time scale being investigated. Thus, the jittering procedure preserved the overall firing rate of each neuron and the long time scale changes in neuron firing rates (i.e. bursts), but removed the precise spike timing between neurons. The percentage of the TE values from jittered data that were larger than the TE value obtained from the data was taken as the p-value. Only connections with p,0.001 were used in the analysis. We then treated the significant connections as binary edges in the networks and the neurons as nodes.
Before proceeding to examine the full effective connectivity networks, we addressed three issues involving individual neurons and connections. We first measured the correlation between the existence of connections at different time scales (Fig. 2 A). We might expect that neurons which communicate would do so at many time scales, which would lead to correlations between connectivity at different time scales. Alternatively, we might expect that connectivity would be segregated based on time scale, which would lead to anti-correlations between connectivity at different time scales. In fact, we found that connectivity was most strongly correlated at adjacent time scales and weakly correlated at distant time scales. High correlation at adjacent time scales is expected because the time scales overlapped ( Fig. 1 E), but the weak correlation at distant time scales implies that short time scale connectivity was independent of long time scale connectivity.
Next, we examined the role chains of short time scale connections played in the existence of longer time scale connections. It might be suspected that long time scale connectivity was simply due to chains of short time scale connections. If this were true, then the network would not contain time scale dependent multiplex connections. Rather, it would contain only one type of connection. To investigate this concern, for each pair of neurons, we measured the correlation between the existence of a chain of significant connections and no direct connection at a short time scale with the existence of a direct significant connection at a longer time scale (Fig. 2 B). We found that chains of short time scale connections were correlated with longer time scale direct connections, but only weakly. Indeed, any effective connectivity analysis will be susceptible to false positive errors of this type (i.e. confusing A R C R B for A R B) [75]. However, using our multiple time scale analysis, we are able to evaluate this phenomenon and determine that if these false positive errors were occurring, they were occurring at a relatively low rate.
For the remaining network analyses, each of the effective connectivity networks was sub-sampled 500 times into smaller networks with 50 neurons and the resulting values for the sub-networks were averaged to obtain the final results for each data set (recording). Only a subset of the strongest connections corresponding to an average degree or number of connections per neuron (k) of 3 was retained in the sub-networks. This procedure was utilized to avoid biases based on network size and number of connections [76] (see Materials and Methods). Also, this method significantly reduced differences between hippocampal and cortical networks in terms of the number of neurons and the number of connections per neuron. Some of the data sets did not produce a sufficient number of significant interactions between neurons at one or more time scales. These data sets were excluded from further analysis (Fig. 1 D).
To better understand the role firing rate played in our analysis, we next examined the correlation between neuron firing rate and degree. We found that firing rate and neuron degree were correlated in the sub-networks, especially for shorter time scales, with hippocampal networks showing higher correlation for middle and longer time scales (Fig. 3 A). Furthermore, we found that the correlation decreased with time scale, with the most significant changes in cortical networks (Fig. 3 B). Generally speaking, we found neurons with a wide range of firing rates possessed connections at all time scales, though the distribution was skewed towards high firing rate neurons having higher degree at short time scales (Fig. 3 C). To insure this correlation was not an artifact of the analysis, we examined the connectivity in a simple null model network. This model possessed 400 independent neurons with firing rates that spanned the firing rates seen in the real data. Each model neuron spiked randomly (Poisson) and was recorded for one hour. The null model produced less connectivity than expected by chance from multiple comparisons for all time scales and showed fewer neurons with 1 or more connections in comparison to the real data ( Fig. 3 C).
From these results, we concluded that the correlation between neuron degree and firing rate is not an artifact of false-positive connections for high firing rate neurons. That said, for any type of analysis, it is true that decreased quantities of data will weaken the statistics and make an effect harder to detect. In our case, this general rule implies that lower firing rate neurons will have weaker statistics and it will be more difficult to detect effective connections involving low firing rate neurons. Therefore, it is possible that weak connections were missed (falsenegatives) due to low firing rates. It is important to note that this issue is closely related to, but distinct from, issues surrounding structural and effective connectivity. If one neuron will influence the activity of another neuron (i.e. they are structurally connected), but the driving neuron never fires, the connection will not be detected (i.e. there will be no effective connection). Since effective connectivity is not simply the capacity to effect change, but also requires actually effecting change, it is correct to find that a neuron that never fires is not effectively connected to other neurons. If one were attempting to assess structural connectivity with effective connectivity, this scenario would be a concern. However, because we were only interested in the effective connectivity itself, this issue was not a concern for our analysis.

Neuronal Hubs
We next analyzed the proportion of the neurons in the network that were hubs (highly connected neurons). Given the fact that these neurons connect to many other neurons, we might expect they play a special role in the processing of information in the networks [72,77]. The threshold for the number of Firing rate and degree were correlated. (A) Neuron firing rate and degree were correlated, especially for short time scales. Hippocampal networks showed higher correlations than cortical networks for middle and long time scales. Box plots: minimum, 25 th percentile, median, 75 th percentile, maximum data set (recording). Differences between hippocampal and cortical networks were assessed with a multiple comparisons corrected Mann-Whitney Test (one dot: p,0.05, two dots: p,0.01, three dots: p,0.001). (B) The correlation between firing rate and degree generally decreased with time scale. Multiple comparison corrected Mann-Whitney Test p-values between different time scales for the same tissue type. (C) Density plots of neuron degrees and firing rates in sub-networks. Note that the real data contain many high degree neurons and the stronger correlation between degree and firing rate for short time scales (top row) in comparison to longer time scales. Also, note that the null model data contained very few non-zero degree neurons. This lack of connectivity in the null model implies that high degrees for high firing rate neurons are not the result of false-positive connections. Vertical line is the approximate degree threshold for hub classification. connections required for a neuron to be classified as a hub was set using the likelihood (10 24 ) to find a highly connected neuron in a random network with the same number of neurons and connections (see Materials and Methods). In a sub-network of 50 neurons with average degree of 3, this threshold corresponded to 11 connections (Fig. 3 C). A small proportion of the neurons were found to be hubs in both types of tissue and at all time scales for most data sets (Fig. 4 A). The proportion of hubs expected by chance in a random network was set to 10 24 , so we found roughly two orders of magnitude more hubs than would be expected by chance in a randomly connected network. Though it is not surprising to find more high degree neurons than would be expected in a random network, interestingly we did find that the proportion of neurons that were classified as hubs generally increased as time scale lengthened. To assess this phenomenon, we compared the distributions of hub percentages between different time scales for the same tissue (Fig. 4 B). Generally, we found that long time scales possessed a significantly larger percentage of hubs.

Shared Hubs
In addition to analyzing the proportion of neurons that were found to be hubs, we also compared the identity of the neurons that were found to be hubs at different time scales in the same data set (Fig. 5). We compared the status of each neuron as a participant with few connections in a network (non-hub) or as a participant with many connections in a network (hub) across different time scales (Fig. 5 A) by calculating the amount of hub and non-hub sharing (see Materials and Methods). In order to understand the importance of the hub and non-hub sharing results, for each data set, we subtracted from the original sharing value the mean sharing from 500 trials of a null model. The null model consisted of the same neurons as the original data, including their status as hubs, non-hubs, or unconnected neurons. However, for each trial in the null model, the identities of the neurons were randomized. Thus, positive sharing values implied more sharing than expected by chance and negative sharing values implied less sharing than expected by chance. This allowed us to evaluate which results were due to time scale and neuron specific behavior in the networks (i.e. uniquely multiplex network properties) and which results were due simply to the fact that the vast majority of connected neurons were non-hubs. Surprisingly, we found that hubs tended to be shared across nearby time scales, while non-hubs tended to be shared widely across all time scales (Fig. 5 B). In other words, the neurons that were found to be hubs at one time scale were not usually found to be hubs at other distant time scales. Conversely, neurons that were found to be participants in networks, but which had few connections (non-hubs), tended to also participate in networks at other time scales. In addition, we calculated the sharing values for each pair of time scales and both tissue types (Fig. 5 C-F). We found that hub sharing was actually below the level expected by the null model for many distant time scales (Fig. 5 C), though the differences between the sharing values from the data and from the null model were not significant for these time scale pairs (Fig. 5 D). This lack of significance may be due to the large number of comparisons for which we had to correct, as well as the small number of hubs. In fact, when we performed the same analysis with a lower degree threshold for hub classification -thereby increasing the number of neurons that were classified as hubs -some of the negative sharing results for hub neurons were found to be significant for cortical networks (S2 and S3 D and E Figs). It should be noted that, because adjacent time scales overlap ( Fig. 1 E), we might expect elevated hub and non-hub sharing across adjacent time scales. The fact that we find this result implies that networks at adjacent time scales are similar and supports the validity of the analysis method. Also, elevated sharing in nearby, though not adjacent, time scales cannot be explained merely by the overlap in adjacent time scales (

Connection Distance
Next, we examined the relationship between physical distance and time scale dependent connectivity (Fig. 6). We compared the average distance between effectively connected neurons at each time scale to the average distance of all possible connections (Fig. 6 A). We found that cortical connections were significantly longer for several time scales. Furthermore, we found that the average connection length was typically shorter than the average length of all possible connections (i.e. mean connection distances less than 1), indicating that the actual network is smaller than the network of all possible connections. When we compared these distances across time scales for the same tissue ( Fig. 6 E), we found that the average connection distances were significantly shorter for the first two time scales in both types of tissue. This result seems reasonable given the assumption that, when given more time, information is more likely to spread to distant neurons. We also compared the average distance between hub neurons at each time scale to the average distance of all possible connections (Fig. 6 B) and we did not find significant differences between tissue types. We found that the distance between hubs generally increased with time scale (Fig. 6 F). Finally, we found that the distance between hubs was significantly smaller than the average connection distance in both tissue types and for all time scales (Fig. 6 C and D). We classified each neuron as a hub, non-hub, or unconnected neuron at each time scale. A neuron was considered to be a shared hub or shared non-hub for two time scales if its status as a hub or non-hub was consistent across those time scales. Hubs were defined using a degree threshold set by the likelihood to have a given number of connections in a random network (0.05 in this illustrative diagram and 10 24 in the full analysis). (B) We calculated the amount of hub and non-hub sharing (see Materials and Methods) for each pair of time scales and grouped the results into neighboring (4 or less) and distant (greater than 4) time scales. We found that hubs were only shared at a significant level for neighboring time scales, while non-hubs were broadly shared across all time scales (multiple comparisons correct Mann-Whitney Test (1, 2, and 3 dots: p,0.05, 0.01, and 0.001 respectively), error bar: standard error of the mean). For each data set, we subtracted the mean sharing values for 500 trials with neuron identities randomized and neuron hub, non-hub, or unconnected status held constant. This null model approximates the amount of sharing expected based only on the number of hubs, non-hubs, and unconnected neurons in the data set, as well as the effect of ignoring the multiplex properties of the networks and considering the time scales to be truly independent networks. We also calculated the mean sharing value of (C) hubs and (E) non-hubs across each pair of time scales for cortical and hippocampal networks. In (  The mean physical connection distance was calculated as the ratio of the mean physical distance between effectively connected neurons in the network to the mean physical distance between all possible pairs of neurons in the sub-network. Cortical networks were found to be significantly larger for several time scales. (B) The mean physical distance between hubs was calculated as the ratio of the mean physical distance between hubs (connected or not connected) to the mean physical distance between all possible pairs of neurons in the sub-network. No significant differences between hippocampal and cortical networks were observed. (C and D) The hubs were significantly more closely spaced than the average connected pair. Box plots: minimum, 25 th percentile, median, 75 th percentile, maximum data set (recording). Differences between hippocampal and cortical networks were assessed with a multiple comparisons corrected Mann-Whitney Test (one dot: p,0.05, two dots: p,0.01, three dots: p,0.001). (E and F) Multiple comparisons corrected Mann-Whitney Test p-values between different time scales for the same tissue type for connection distances (E) and hub distances (F). Note that cortical and hippocampal network connections tend to be significantly longer in time scales 3 to 10 in comparison to time scales 1 and 2. Also, note that hub distances generally increase with time scale. doi:10.1371/journal.pone.0115764.g006

Topology Results
To investigate the time scale dependent topology of the networks, we applied two network topology measures to the cortical and hippocampal effective connectivity networks using programs from the Brain Connectivity Toolbox [78].
First, we measured the modularity of the networks (Fig. 7). Modularity measures the degree to which the network can be separated into non-overlapping groups. Most interestingly, we found that the modularity tended to decrease with time scale (Fig. 7 A) and these changes were generally significant (Fig. 7 B). As we might expect, we also found fewer and larger modules as the time scale increased (Fig. 7 A). For all modularity results, we found no significant differences between tissue types at individual time scales, though the time scale dependent behavior of the cortical networks tended to be more significant across different time scales in comparison to the hippocampal networks. The general result that, at short time scales, the neurons formed networks with many well defined and small modules, while at longer time scales the neurons formed networks with few large and poorly defined modules supports the hypothesis that short time scale interactions are part of distinct modules while longer time scale interactions connect those modules.
Second, we measured the assortativity of the networks (Fig. 8). We used the form of assortativity that measures the correlation between the number of outgoing connections of neurons at the start of connections and the number of incoming connections of neurons at the end of connections. If this correlation is positive, the network is said to be ''assortative,'' but if the correlation is negative, the network is said to be ''disassortative.'' Thus, in assortative networks, neurons with many outgoing connections tend to connect to neurons with many incoming connections. In disassortative networks, neurons with many outgoing connections tend to connect to neurons with few incoming connections. We found that the networks were generally disassortative. Also, we found significant differences between the tissue types only at the shortest time scale, where the cortical networks were found to be more assortative (Fig. 8 A). Both tissues showed significant decreases in assortativity (increases in disassortativity) as the time scale lengthened (Fig. 8 B). This implies that, at long time scales, neurons with many outgoing connections connected to neurons with few incoming connections, while neurons with few outgoing connections connected to neurons with many incoming connections.

Multiplex Networks
Our analysis represents the first systematic examination of time scale dependent multiplex networks of individual neurons and it indicates that these time scale dependent networks potentially differ by function and brain region. Previous in vitro studies have shown changes in connectivity with time scale [46][47][48], but those works considered each time scale as an isolated network. By examining the correlation between connections at different time scales, as well as the hub and non-hub sharing properties of the networks, we examined the connectivity of individual neurons across time scales. Therefore, for the first time, we evaluated uniquely multiplex network properties in networks of individual neurons.
The result that hub neurons tended to be isolated at specific time scales, while non-hubs were not is especially compelling given recent research on fault tolerance in multiplex networks [4,11,14,15]. Our result indicates that if a neuron that operates as a hub at one time scale were removed from the network, the impact would be reduced to networks at other time scales because it would be less likely that the removed neuron operated as a hub at other time scales. Therefore, we hypothesize that this arrangement of connectivity increases fault tolerance in comparison to a system in which the hub functionality is concentrated in a few neurons across all time scales. In a future experiment, the role of hub neurons in network fault tolerance could be studied if it were possible to identify and selectively remove hub (high degree) and non-hub (low degree) neurons. We predict that if a hub neuron which operates as a hub at only one time scale is selectively removed from the network, the effect on the behavior of the network at another time scale would be reduced in comparison to the removal of a neuron that operates as a hub at that time scale.
In addition to possible implications for fault tolerance, the low correlation between connections and the lack of sharing of hubs across distant time scales that we found also has implications for recent studies of correlations between node degree and connectivity in other types of networks (see [79] and the applications

Hubs
In this paper we have presented compelling evidence that hub neurons are time scale specific and non-hub neurons are not time scale specific. Hub neurons are of special interest because they are connected to many other neurons and, therefore, possibly play a unique role in network operation. Neural hubs and their importance have been studied in brain regions [69,77,80,81], but we know of only three previous analyses that examined hubs on the scale of individual neurons [72,82,83]. We feel our results compliment these previous results in that, while we were unable to identify hub neurons by type, we were able to examine network behavior at a wide range of time scales.
Bonifazi et. al. found hubs to be GABAergic interneurons in developing hippocampal acute slices [72,82] (see [84] for a recent discussion of GABAergic hub neurons). In comparison to our analysis, the work by Bonifazi et. al. was different in that it employed cross-correlation to identify functional connections, the recordings were performed and analyzed at only one time scale (,50 ms, calcium imaging, see Fig. 1 E), and acute hippocampal slices from developing rats and mice were studied instead of organotypic slice cultures from mice. Though cross-correlation was used by Bonifazi et. al. to identify hubs, they were also able to show that altering the activity of the hub neurons affected overall network behavior. In the future, it will be interesting to see if the effective hub neurons we found at the ,50 ms time scale in hippocampal networks are also GABAergic interneurons, as well as whether the hubs we found at other time scales and in the cortex are also GABAergic interneurons.
Quilichini et. al. found that hippocamposeptal neurons play a special, hub-like, role in controlling gamma oscillations at the onset of ictal-like events in acute slices [83]. Unlike our analysis or the work by Bonifazi et. al., the researchers in this study used patch-clamp to record from individual neurons and compared their behavior to the behavior of network-wide large-scale gamma-frequency oscillations. Because the researchers in this study defined ''hub'' in terms of initiation and control over gamma oscillations, we were unable to directly determine if the hub neurons we found in the gamma frequency time scales (TS 3-5) were hippocamposeptal neurons.
Our result that hub neurons function as hubs at specific time scales indicates that hub neurons may be intimately related with time scale specific phenomena in the brain. It has been hypothesized that interactions at different time scales (e.g. oscillations at different frequencies) are crucial for information integration in the brain [32][33][34][35][36][37], so the hub neurons identified in our work may play a unique role in neural networks. In future experiments, we would like to investigate the role the hub neurons play in a wide range of network behavior. For instance, do hub neurons at a specific time scale initiate or maintain network bursts? What role do hub neurons play in sensory coding? Do hub neurons at a specific time scale encode certain features of the stimulus? If the hub neurons do encode information about a stimulus, do they do so synergistically or redundantly? Finally, it may be possible to better connect the role of hubs in network behavior if it were possible to identify the type(s) of neuron(s) that operate as hubs in the network.

Time Scales
Importantly, the results of our analysis indicate that individual neurons interact over time scales ranging from sub-millisecond (gap junction, synapse) to seconds (hemodynamic response). Only the short time scale interactions (up to TS 4, see Regarding our specific results, we found that several features of the effective connectivity networks were time scale dependent. For instance, we found changes in physical distance between effectively connected neurons, changes in network topology measures, and time scale dependencies among hub/non-hub neurons and connectivity. Previous studies have examined temporal multiplexing in brain signals [32][33][34][35][36][37][38][39], but we know of only two studies to date that have focused on different time scales in effective connectivity networks at the single neuron level [46][47][48]. In a previous study of the same data sets [46,48], we analyzed time scale dependent functional connectivity instead of time scale dependent effective connectivity, as was analyzed herein. In that work, we applied wavelet transforms to the cross-correlations between the neurons and grouped the resulting significant connections in frequency ranges. That analysis found time scale dependent connectivity properties, but it treated each separate time scale as an independent network. In the present analysis, we examined time scale dependent connectivity properties, but we also examined the hub properties of neurons and connectivity correlation across various time scales. Thus, though the previous study implicitly involved multiplex networks, only the present analysis examined truly multiplex network properties. Also, it should be noted that the definitions of ''time scale'' applied in our present study and this previous work were significantly different (wavelet transform versus delay) and that the functional and effective connectivity of these networks were significantly different.
Matsuda et. al. used TE to examine the effective connectivity in dissociated cultures [47]. Similarly to our analysis, Matsuda et. al. used different bin sizes to probe different time scales. However, their binning structure did not isolate interactions at a certain time scale by including a minimum delay and they only focused on time scales shorter than 100 ms. Also, they used only four data sets and did not attempt to determine which TE results were statistically significant. Our analysis demonstrates that effective connectivity networks among individual neurons vary with time scale in interesting and potentially complex ways. Some of our results seem quite reasonable, such as the increasing physical distance between effectively connected neurons with increased time scale and the decrease in modularity with time scale. In the future, more work must be done to understand how the changes in effective network connectivity relate to different neural phenomena at different time scales.

Network Topology
A great deal of research has focused on whether brain networks are scale-free or small-world (see [41,65] for examples using fMRI, [46,48,70,72,85] for examples involving individual neurons, and [3] for a review). We did not directly assess whether the networks studied in this paper possessed scale-free degree distributions. However, we did observe significantly more high degree neurons than would be expected in a random network. So, while we cannot determine if the networks we examined were scale-free, small-world, hierarchical, etc., we can determine they were most likely not random and that they possessed heavy tailed degree distributions.
By applying several topology measures to the effective connectivity networks, we were able to produce a picture of the networks that change with time scale. For instance, we found that the networks become significantly less modular as time scale increased. In other words, at long time scales the networks contained a few large and poorly defined groups (modules), while at shorter time scales, the networks contained many small and well differentiated groups. This result supports the general hypothesis that the brain consists of hierarchical modules [86,87], though our results also imply that as the module size increases, so too does the time scale at which the module processes information. Furthermore, this result may have significant implications for the time scales involved in population coding [88]. In the future, we hope to investigate the relationship between the time scale dependent modules and sensory coding. Do all the neurons in one module code a certain feature of the stimulus, and, if so, is it time scale specific?
Next, we found that the networks became more disassortative as the time scale increased. This implies that, at short time scales, the connections were more evenly distributed with regards to directionality. However, at long time scales, high outdegree neurons tended to connect to low in-degree neurons, while low out-degree neurons tended to connect to high in-degree neurons. This result may have implications for information flow, the role of hub neurons at long versus short time scales, and the organizational mechanisms in these networks [89].
Beyond the topological qualities of the observed networks mentioned above, it would also be interesting to investigate possible mechanisms that could lead to such topologies through the development of the networks. Numerous mechanisms for network formation have been introduced [20,90,91]. In the future, the topological results presented in this analysis could contribute to the delineation of the mechanisms that are biologically realizable in neural networks.

Transfer Entropy and Other Measures of Connectivity
TE has previously been used in many neuroscience applications [27,45,47,[50][51][52][53][54][55][56][57][58][59][61][62][63][64]. However, no one, to the best of our knowledge, has used TE with different binning and delays to systematically study interactions at multiple isolated time scales ranging from sub-millisecond to seconds. The precise values of bin size, delay, and state structure we chose (Fig. 1 E) are not vital to the analysis. We simply chose those values to span a wide range of time scales in logarithmically nearly-equal delay windows. If so desired, a researcher could change the time scales to better suit their research topic. Overall, the results of this analysis demonstrate that TE can be used successfully as a tool to examine effective interactions between single neurons over many time scales. Importantly, the recently proposed BRAIN Initiative seeks to record all neural activity across complete neural systems [92]. The analysis presented herein represents a novel method to analyze the data produced by this important initiative.
By comparing neuron degree and firing rate (Fig. 3), we found that neuron degree and firing rate were correlated. The lack of connectivity in a null model implied that there were likely few false-positive connections in the analysis based on firing rate, but there may have been false-negative results for low firing rate neurons due to poor statistics. We feel it is also important to highlight the role that the distinction between structural and effective connectivity plays in this issue. In short, even if neurons are structurally connected, if they do not fire or fire very infrequently, then they are not effectively connected. This perceived inability to detect structural connectivity with effective connectivity is actually a misalignment between the definitions of these two types of connectivity. This point is especially important for studies that seek to relate structural connectivity with effective and functional connectivity [22][23][24][25][26][27][28][29][30][31].
In addition to TE, researchers have also used Granger Causality [93][94][95] in similar neural systems (see [96][97][98][99][100][101] for examples and [102] for a discussion of Granger Causality and TE graphs in neuroscience). Also, researchers have used Dynamic Causal Modeling to measure effective connectivity in neural systems (see [103][104][105] as examples). We chose to use TE because, in its basic form, it is model independent and can capture non-linear interactions, unlike Granger Causality and Dynamic Causal Modeling. Furthermore, our analysis utilized binary spike data, which is better suited for TE than Granger Causality or Dynamic Causal Modeling. Finally, the ability of TE to render interactions in terms of bits -a general unit of information -allows for straightforward comparisons between networks and graphs.
Besides TE, Granger Causality, and Dynamic Causal Modeling, researchers have also used cross-correlation (CC) and mutual information to infer functional connectivity [35,50,72,106]. Both CC and mutual information are measures of functional connectivity because they measure statistical dependencies between the activities of neural sources without taking into account the effect or causal influence of one neural source on another.
We chose to analyze TE networks instead of CC networks for five primary reasons. First, unlike CC, TE measures interactions in terms of bits which are a general unit that can be easily compared across different connections and systems. Second, TE is a measure of effective connectivity, whereas CC is a measure of functional connectivity. Therefore, TE is able to capture how the activity of the transmitting neuron affects the activity of the receiving neuron, while CC is only able to measure how their activities are correlated. Third, TE is able to measure non-linear interactions, while CC is not. Fourth, though it was not a direct goal in this analysis, TE has been shown to be superior at inferring physical connectivity [50,63]. Fifth, TE's information theoretic nature means that it will easily allow the analysis to be extended to include multivariate information measures, which is a future goal of our research.

Limitations of this Study
Perhaps the most noticeable potential limitation of this analysis is the fact that it was performed using organotypic cultures [107,108]. Although organotypic cultures have been widely used in research [109,110], these cultures have been shown to possess several differences in comparison to the in vivo system using both mice and rats. Such differences in vitro include additional synaptic connectivity [111,112], decreased ease of LTP induction [113], changes in protein expression [114], increased excitability [112,115], and changes in cellular organization in mice [116].
Despite these issues, the overall structure and electrical activity of corticohippocampal organotypic cultures have been shown to essentially match the in vivo system [111,113,117]. Furthermore, it has been shown that interneurons in organotypic cultures are physiologically and morphologically identical to interneurons in vivo [118], cortical layer structure and cell migration are preserved in postnatal organotypic cultures (as were used in this analysis) in rats [119], and that intracortical connection structure is preserved in organotypic cultures when sub-cortical regions are preserved in culturing (as was done in this analysis) [120][121][122].
Based on these previous studies, we concluded that organotypic cultures represent a useful model system for intact in vivo neural systems. Therefore, we believe our results are highly relevant for the field given the strength of the preparation used, the power of the analysis, and the novelty of the results themselves. Furthermore, at this time, it would not have been technologically possible to achieve the same level of spatial and temporal recording resolution and the same number of recorded neurons in vivo. While some in vivo recording methods are capable of recording from hundreds of neurons, these methods demand tradeoffs in terms of temporal or spatial resolution. For instance, in vivo calcium imaging allows for the simultaneous recording of up to approximately 1000 neurons, but the temporal resolution for these recordings is significantly less (tens of ms) than we achieved in our recordings (50 ms) [123][124][125]. Recording methods with lower temporal resolution would have been unable to capture the short time scale interactions that we observed. Furthermore, in vivo electrophysiological recording methods that employ planar arrays or shank electrodes are capable of recording hundreds of neurons with high temporal resolution, but these recording methods possess limited spatial resolution in comparison to our array (interelectrode spacing of 60 mm) due to larger inter-electrode spacing in arrays (e.g. 400 mm in Utah arrays (Blackrock Microsystems)) and larger spacing between shanks (e.g. 250 mm in [126]). Recording methods with larger inter-electrode spacing would have been less likely to detect short time scale interactions, since those interactions were found to occur primarily between closely spaced neurons ( Fig. 6 A). Therefore, our use of organotypic cultures and a high density, high temporal resolution multi-electrode array permitted a dramatic improvement in the quality of the data, which improved the strength of the analysis.
Our recording method possessed several distinct features that were advantageous especially during the developmental stages of this method. Still, other recording methods could be used with this TE analysis method to investigate other phenomena. For instance, the use of in vivo calcium imagining, while lacking the temporal resolution to capture short time scale connections, would more easily facilitate the gathering of additional information about the neurons involved in the networks (e.g. cell type, cell layer, etc.) and would more easily allow for direct cell stimulation or inhibition via optogenetic techniques [127]. Furthermore, in vivo studies could investigate the relationship between time scale dependent connectivity and phenomena that can only be studied in vivo, such as behavior and sensory coding. We feel these types of analyses could produce novel insights into time scale dependent networks in the brain and we plan to pursue them in the future.

Ethics Statement
All neural tissue samples from animals were prepared according to guidelines from the National Institutes of Health and all animal procedures were approved by the Indiana University Animal Care and Use Committee (Protocol: 12-015) as well as the Animal Care and Use Committee at the University of California, Santa Cruz (Protocol: Litka1105).
Gathering the raw data, pre-processing, and spike sorting The raw spiking data utilized in this analysis are fully described elsewhere [48]. Briefly, cortico-hippocampal organotypic cultures were produced using postnatal day 6 Black 6 mouse pups (wild-type C57BL/6 from Charles River) following the protocol described in [107]. The mice were anesthetized in an ice bath prior to decapitation and brain removal. Each culture was recorded after 2 to 4 weeks. After culturing, spontaneous activity was recorded from each slice using a custommade 512-electrode array system [128]. The array contained 5 mm diameter flat electrodes arranged in a hexagonal lattice with an inter-electrode distance of 60 mm. In this arrangement, the total recording area of the array was approximately a 1 mm by 2 mm rectangle. Each recording was performed such that either the hippocampus or the cortex was centered on the array. Action potentials (spikes) were then detected and spike-sorted using a well-established method [128]. Briefly, points in time where voltage traces exceeded 8 standard deviations calculated over a 5 second window of the voltage trace were marked as potential spikes on a given electrode. A portion of the voltage trace for the given electrode and the 6 adjacent electrodes were then utilized as spike waveforms. These waveforms were then projected into a five dimensional principal component space. Clustering of these points (each of which represents one potential spike) was then performed using a mixture of Gaussian models with maximum likelihood estimation. Duplicate neurons and neurons with many refractory period violations were then excluded from further analysis (see [128] for additional details). After spike sorting, neurons with less than 100 spikes in the 60 minute recording (firing rate ,0.028 Hz) were removed from the analysis. Then, the resulting spike times were used in the remainder of the analysis.
After electrophysiological recording, six example cultures were stained with NeuN to check for differentiation between hippocampal and cortical tissue at the point of recording (Fig. 9) (see [48] for complete staining details). The results of this staining procedure indicated that hippocampal structure was well maintained from DIV1 throughout culturing. In addition to the six example cultures that underwent staining, all live tissue from hippocampal recordings was also imaged pre-and post-recording using light microscopy. These images were aligned and overlaid with cell positions from the electrode array and the neurons were manually sorted as falling within hippocampal or cortical tissue. We did not attempt to identify from which region of the hippocampus or layer of the cortex each neuron originated because the differentiation between these tissue regions was difficult to observe under light microscopy for all cultures (see Discussion -Limitations of this Study). Cortical recordings did not require this procedure as the cortex was large enough to cover the entire array. In both cases, cell positions on the array were measured by fitting a two-dimensional Gaussian distribution to the signal strengths of each neuron on the electrodes on which they were recorded.

Burst Statistics
Network bursts of various types were observed in all of the data sets we analyzed [129,130] (Fig. 1 A and S1 Fig). To calculate burst statistics for our data, we used the general procedure described by Wagenaar et al. [129] to define individual neuron bursts, with one altered parameter. First, we detected neuron bursts by finding groups of at least 4 consecutive spikes with inter-spike intervals (ISIs) less than one-eighth (one-fourth in Wagenaar et. al.) of the average ISI for that neuron. Second, based on observations of the data and the appearance of large bursts that involved many neurons in the network, we felt it was necessary to also measure network bursts. So, we referred to a group of overlapping neuron bursts that contained at least 10% of the neurons in the network as a network burst. Note that we did not require that at least 10% of the neurons be bursting at the same time. Rather, we only required that there was no period during the network burst where no neurons were bursting and that at least 10% of the neurons burst at some point during the network burst. To insure that these bursts did not bias our analysis, we generated model data that contained bursts, but no other interactions between the neurons. In these model data, we found roughly the number of connections expected by chance in random data, while in the actual data we found significantly more connections.

Transfer Entropy Analysis
Transfer Entropy (TE) was introduced by Schreiber to measure the influence of one time series (call it I) on another time series (call it J) [49]. In our case, I and J were binary spike trains for neurons I and J that contained 0 for the time bins when the neuron did not spike and 1 for the time bins when the neuron did spike. In its most basic form, the TE from I to J is given by: By definition, the TE from neuron I to neuron I is zero, so no self connections existed in our analysis. The probabilities in Eqn. (1) are typically measured by counting all the instances of the possible combinations of spikes and no spikes in the j t , j t-1 , and i t-1 bins for all time bins. Note that this process requires the assumption that the time series are stationary and a sufficiently long recording is used to adequately approximate the probability distribution. Given the length of our recordings (1 hour), the fact that spontaneous activity was recorded, and that there are limited combinations of possible states (2 3 ) due to the binary nature of our time series, we feel TE can be used to generate meaningful results from our data.
Generally speaking, TE measures the information gained about the state of the target time series (j t ) when the past state of a transmitting time series (i t-1 ) is known, beyond the information provided by the past state of the target time series (j t-1 ) alone. TE has been used in several neuroscience applications [27, 45, 47, 50-56, 59, 60, 63, 64]. In its most basic form, TE is only able to capture interactions with delays falling within adjacent time bins. For instance, if it were the case that neuron J tended to spike ten time bins after neuron I spiked, the spikes would be so far apart that no interaction would be present in adjacent time bins and the basic TE expressed in Eqn. (1) would not detect an interaction. It would be possible to capture this ten time bin interaction by simply rebinning the data to combine multiple time bins together. However, this procedure could retain some short time scale interactions, yielding a TE result that would be polluted with short time scale interactions.
In order to measure interactions over many isolated time scales, we included a delay between the past states of the neurons (i t-d instead of i t-1 and j t-d instead of j t-interactions allowed by the resolution of the recordings. Also, we combined the i td and j t-d states with their preceding time bin (i t-d-1 and j t-d-1 ) in such a way that a spike in either or both time bins yielded an overall state of 1 and no spikes in either time bins yielded an overall state of 0. We denote these new states as i' t-d and j' t-d . This leads to a slightly altered TE expression: We also calculated the normalized TE by dividing the raw TE by the entropy of the J spike train. Doing so changes the interpretation of the TE from the amount of information being transmitted to the percentage of the target neuron's entropy that can be accounted for by the transmitting neuron. This modified version of the TE is given by: The binary connections throughout the analysis were taken to be 1 whenever the raw TE value was significant (see below for the significance testing method) for a given pair of neurons and 0 when it was not.
We chose to create ten time scales at which to analyze the isolated TE values. We created the time scales in such a way that they logarithmically spanned a large range of neurologically relevant time scales. The precise values of the bin sizes and delays for each time scale are given in Table 1. To more clearly communicate the binning structure and delay windows, the three smallest time scales are overlaid on an example spike train in Fig. 10. The interaction time scales (delay windows) for the ten time scales are presented in Fig. 1 E along with several neural phenomena and common measurement methods for easy comparison. Overall, this method allowed us to systematically examine interactions between individual neurons in vitro over time scales ranging from sub-milliseconds to seconds.
Wibral et al. recently proposed an alternative method for measuring delayed interactions with TE [131]. In their method there is no delay for the history of the J time series (i.e. j' t-d becomes j t-1 in Eqn. (2)). As they elegantly show, this alternative method is well suited to detect the delay which produces the maximum TE. Both the alternative method [63] and methods similar to the one employed in this analysis [52,54,55,132] have been used previously. Despite the advantages of the Wibral et. al. method, we were forced to utilize a method that included a delay in the J time series in order to isolate interactions in separate time scales. In comparison, the analysis by Wibral et. al. sought to find the delay that produced the maximum TE, so it did not allow a delay in the J time series to produce a fair comparison of TE values. Fortunately, our analysis did not involve the comparison of TE values at different delays, rather it sought to find significant TE values at isolated time scales. Therefore, we feel that the concerns correctly expressed by Wibral et. al. regarding the inclusion of a delay in the J time series do not apply to our method.
In order to assess which TE values were statistically significant, we employed a Monte Carlo approach to generate a distribution of TE values from randomized data. To generate the randomized data, we jittered the I spike train using a uniform distribution with a width of seven bins centered on the original location of each spike. The firing rate of the I spike train was preserved in the jittering process, as were the number of i' t-d 51 states. We only jittered the I spike train in order to preserve interactions in the J spike train. We used a uniform distribution for the jittering to prevent spikes from being jittered in front of or behind the j t state of the J spike train. Doing so prevented strong connections from neuron J to I from obscuring weaker connections from neuron I to J. The jittering and TE calculation procedure was performed 5000 times for each pair of neurons. The As the time scale increased, the bin sizes logarithmically increased. The overall state structure with regards to the bins was identical for time scales 2 through 10. Time scale 1 possessed a delay of 0 (d50) in order to capture interactions at the smallest resolution of the recordings (0.05 ms).
doi:10.1371/journal.pone.0115764.t001 p-value for the original TE measurement was then calculated as the proportion of the jittered data sets that produced TE values greater than the value from the original data. Examples of jittered TE distributions can be seen for three pairs of simple model neurons in Fig. 11 B and corresponding examples pairs from the real data can be seen in Fig. 11 C. We chose to set a p-value threshold of p,0.001 for the TE results in this analysis. Pairs of neurons with p-values above this threshold were removed from the remainder of the analysis. We chose this threshold after examining results from randomized data (Fig. 12 A). In order to test the effects of network bursts [129,130] on the analysis, we generated 8 randomized data sets (4 from hippocampal recordings and 4 from cortical recordings) that contained random Poisson firing for each neuron during network bursts (firing rate matched to bursts in real data) and no spikes for all other times. Here, we identified network bursts using a manually set threshold for total network activity in 100 ms bins. Many more connections were found to have low p-values in the real data in comparison to these randomized data, especially for p,0.001. We only compared results for the first 8 time scales because the longer time scales were on the same scale as the network bursts that were retained in the randomized data. In order to ensure that the false-positive connections did not affect the overall analysis, we repeated the entire analysis using p,0.0002 as the p-value threshold. Other than fewer viable data sets (see Topology Analysis -Sub-Networks) and the resulting degradation of the statistics used to evaluate differences between tissue types, we found no appreciable differences in the results.
Though the p-value threshold we imposed was small (p,0.001), many of the networks had many possible connections. So, to insure that the connections we found to be significant were not merely the product of multiple measurements, we calculated the ratio between the number of connections found in the analysis and the expected number of connections found by chance (Fig. 12 B). In a network with N neurons, there are N*(N -1) possible directed connections. So, by chance we would expect approximately p*N*(N -1) spurious connections, where p is the p-value threshold of 0.001. We found that the data sets with the smallest ratio still possessed roughly 4 times more connections than expected by chance. The majority of the data sets possessed between 10 and 100 times more connections than expected by chance. Therefore, we feel it is unlikely that spurious falsepositive connections from multiple measurements biased our analysis to a significant degree.

Topology Analysis -Sub-Networks
Before analyzing the full networks, we applied a sub-network creation routine to avoid biases associated with network size (N) and average degree or number of connections per neuron (k). After the initial network construction, we found that each data set and time scale possessed greatly varied numbers of neurons and connections (Fig. 12 C and D). It has been shown that network topology measures are N and k dependent [76]. To compensate for this effect and to ensure that Time Scale Multiplex Networks in Neuronal Cultures measured differences between networks are not simply due to N and k, we created 500 sub-networks for each data set and time scale with nearly matching N (target: 50) and k (target: 3) values (detailed procedure below). It was not possible to create sub-networks with precisely identical N and k values, but we feel the wide range of N and k values present in the original data, as well as the large differences between hippocampal and cortical networks in terms of N and k values, were significantly reduced using the sub-network routine (Fig. 12 C and D). We feel that this reduction in variability reduced biases associated with N and k to the highest degree possible at this time. Each sub-network was then analyzed and the resulting values were averaged over all sub-networks to obtain an unbiased result for each data set and time scale. The sub-network creation procedure was as follows: First, the neurons were randomly divided into a group of 50 neurons for the sub-network and a pool of unselected neurons. Second, if any of the 50 sub-network neurons were unconnected with the other neurons in the sub-network, an unconnected neuron was replaced with a randomly selected neuron from the pool of other neurons. This process was repeated until all neurons in the group of 50 had at least one connection or until no adequate substitution could be found, in which case, the algorithm would restart. If the algorithm could not succeed 500 times in 10 8 attempts, the data set was deemed unviable for that time scale and it was removed from the analysis. We felt 500 sub-networks of 50 neurons were sufficient to completely sample all of the networks because, even in our largest network of approximately 650 neurons, each neuron would be randomly selected for the subnetwork at least approximately 25 times. If we had been using larger data sets, it would be necessary to use larger sub-networks or more sub-networks. Third, the k value was enforce by retaining only the (k*N)/2 strongest connections, as defined by the normalized TE (Eqn. 3).
We found it was necessary to replace unconnected neurons in the sub-network creation routine to avoid creating sub-networks with many unconnected neurons. If allowed to remain in the sub-network, unconnected neurons would effectively decrease N and increase k because unconnected nodes were ignored in the topology analyses, so replacing unconnected neurons allowed us to more closely match the N and k values between sub-networks. It should be noted that it was not always possible to produce sub-networks with 50 connected neurons after setting the k value using the strongest TE values. Still, we feel this algorithm was the best solution available to the problem of N and k dependent topology measures because it substantially reduced the variance in the N and k values across the networks (Fig. 12 C and D).

Topology Analysis -Hubs
Once the sub-networks were created, hub neurons were identified by their degree. Using a binomial distribution, we estimated the likelihood to observe a neuron with a given degree in a randomly connected network with an identical number of neurons and connections. If the likelihood to observe a given degree was less than 10 24 , we marked neurons with that degree as hubs. We used a binomial distribution to ease processing time and because the number of connections was so small that replacement effects could be ignored. We identified hubs based on total degree (number of incoming and outgoing connections). For example, for sub-networks that contained 50 neurons and had k53 average connections per neuron, the degree threshold for being considered a hub was 11 total connections. We also applied thresholds of 10 22 (8 connections) and 10 23 (9 connections) to the data (S2 and S3 Figs.).
Once the hubs were identified, we calculated the percentage of the neurons that were found to be hubs for each sub-network. We then averaged these results for all 500 sub-networks. This produced a value for the percentage of neurons that were found to be hubs for each data set and time scale. We then plotted the distribution of the average values for all hippocampal and cortical data sets for each time scale (Fig. 4) and performed a multiple comparisons corrected Mann-Whitney Test on the two distributions to determine if they differed significantly.
Other authors have defined hubs using different features of nodes in the network [78,133]. We chose to use degree because it is simple and straightforward, but we also feel that other methods of identifying hubs may provide useful and interesting results.
To compare the hubs across different time scales, we examined each neuron's status in the network of a data set at different time scales (Fig. 5). While the neurons in the network were identical at different time scales, their status as connected or unconnected, as well as their status as hubs or non-hubs could change with time scale. We wished to calculate the degree to which the neurons that were hubs at one time scale were also hubs at another time scale (hub sharing), as well as the degree to which the neurons that were non-hubs at one time scale were also non-hubs at another time scale (non-hub sharing). This task was complicated by the use of sub-networks. To calculate the amount of hub sharing, we utilized the following procedure: First, we calculated the ratio of the number of times each neuron was found to be a hub in a sub-network over the number of times each neuron was found in a sub-network. This ratio was referred to as the hub score for the i th neuron and u th time scale: If a neuron was never found to be in a sub-network, its hub score was set to zero. Second, we calculated the product of the corresponding hub scores between one pair of time scales for one data set. Third, we calculated the average of the hub score products for which either the multiplicand or multiplier was non-zero. This final result was the hub sharing value for one data set and one pair of time scales: Using this method, if, for instance, one data set possessed two neurons that were always found to be hubs at two different time scales, it would yield a hub sharing value of 1. If, for instance, one data set possessed two neurons that were always found to be hubs at one time scale and two different neurons that were always found to be hubs at a different time scale, it would yield a hub sharing value of 0.
Fourth, in order to differentiate the effects of the distribution of hub scores from the influence of time scale and neuron identities, we constructed a null model by calculating the hub sharing values for randomized trials. Each trial retained the original hub scores for each neuron, but the identities of the neurons were randomized: where the sequence of j neurons is a randomized version of the sequence of the i neurons. In this way, the hub sharing values for the randomized trials produced results that estimated the hub sharing values in the case where only the distribution of hub scores is relevant. Fifth, 500 randomized trials were performed, averaged for each data set and time scale, and subtracted from the original hub sharing value. This produced the final hub sharing value: By including the randomized trial comparison, the hub sharing value described the degree of hub sharing above or below what would be expected by chance in the case where the only relevant feature is the distribution of hub scores. Sixth, we averaged the hub sharing values across all data sets for time scales that were at most four steps apart (''neighboring time scales'') and for time scales that were more than four steps apart (''distant time scales'') ( Fig. 5 B). We also averaged the hub sharing values for a given tissue type and pair of time scales (Fig. 5 C and E). Finally, we compared the distribution of hub sharing results to the distribution of hub sharing results for the randomized trials (also corrected by the mean of the sharing results for the randomized trials) to assess when the hub sharing results differed significantly from the null model (Fig. 5 B, D, and F).
The procedure to compare non-hubs across different time scales was identical to the procedure for hubs with the exception that we calculated a non-hub score by calculating the ratio of the number of times each neuron was found to be a non-hub in a sub-network over the number of times each neuron was found in a sub-network: Topology Analysis -Connection Distance Using the physical location of each neuron on the array, we were able to examine the physical distance between effectively connected neurons and hubs. To do so, we calculated the ratio of the mean physical distance between all connected neurons in a sub-network and the mean physics distance between all hub neurons (connected or not) to the mean physical distance between all possible connections in the same sub-network. We then averaged these values over all sub-networks to obtain the average normalized network size and hub distance for a single data set and time scale. Finally, we plotted the distributions of these average results for hippocampal and cortical data at all time scales (Fig. 6). We compared the distributions of normalized distance values for connections and hubs for the two tissue types using a multiple comparisons corrected Mann-Whitney Test. We also compared the hub distances to the connection distances, as well as the hub distances and connection distances across different time scales using a multiple comparisons corrected Mann-Whitney Test.

Topology Analysis -Topology Measures
In addition to analyzing the hubs and physical connection distances, we applied three topology measures to the networks using software from the Brain Connectivity Toolbox [78]. Specifically, we used modularity and assortativity. Modularity measures the degree to which the network could be divided into separate modules [134,135]. A high modularity corresponded to well divided, non-overlapping groups. In addition to the modularity, we also counted the number of modules found and root mean square of the size of the modules. Unlike assortativity, the modularity calculation was stochastic. Therefore, we calculated the modularity, the number of modules, and the root mean square of the module sizes 10 times for each sub-network and averaged the resulting values to obtain mean values for each sub-network we analyzed.
Assortativity measures the correlation between the degrees of nodes at the ends of connections [136]. In our case, we used the correlation between the out-degree of nodes at the starting point of connections and the in-degree of nodes at the end point of connections. A high positive assortativity implies that the high out-degree nodes connect to high in-degree nodes, while low out-degree nodes connect to low in-degree nodes. On the other hand, a large negative assortativity (disassortativity) implies that the high out-degree nodes connect to low in-degree nodes, while low out-degree nodes connect to high in-degree nodes.
After calculating the topology measures for the sub-networks and averaging those results to obtain a single value for each data set and time scale, we compared the distributions of the topology measures for hippocampal and cortical networks using a multiple comparisons corrected Mann-Whitney Test (Figs. 7 and 8). We also compared the distributions of topology values for identical tissue types across different time scales using a multiple comparisons corrected Mann-Whitney Test.

Correction for Multiple Comparisons
Given the large number of comparisons performed in this analysis, it was necessary to correct for the effects of multiple comparisons. We chose to use False Discovery Rate (FDR) control to limit the likelihood of spurious false-positive results. Specifically, we utilized the algorithm introduced by Benjamini and Yekutieli [137] as a modification of the earlier algorithm introduced by Benjamini and Hochberg [138]. We implemented the algorithm using software created by Groppe et al. [139]. This method has been shown to work correctly for dependent and independent measurements. We felt that it would not have been appropriate to use a familywise error rate correction (e.g. Bonferroni) for three primary reasons [139]. First, this study was generally exploratory in nature and we anticipated that noteworthy results would likely be distributed across many comparisons. Second, given the relatively few culture recordings, we anticipated that the individual comparisons would be relatively weak. Third, FDR does apply a conservative correction in the case of few significant comparisons (see Fig. 8 for a noteworthy example).
We applied the correction to the Mann-Whitney test results for each unique type of measurement that was performed across different time scales or different pairs of time scales. For instance, we applied this correction to the Mann-Whitney p-values for the hub sharing results for the cortical networks for all pairs of time scales. We did not use FDR to correct for false-positive results in the TE measurements for individual pairs of neurons due to the extremely large number of comparisons that were performed, as well as the large number of connections found in comparison to the number of connections expected by chance (Fig. 12  B).
Supporting Information S1 Fig. Burst Statistics. Neuron burst were detected using the algorithm described in [123], with one altered parameter. Briefly, a neuron was said to burst if 4 consecutive spikes had inter-spike intervals (ISIs) of less than one-eighth the average ISI for that neuron. A network burst was defined as any grouping of overlapping neuron bursts that contained at least 10% of the neurons in the network. 10% of the neurons in the network were not required to be bursting at the same time.  Fig. 4. For this threshold, we found no significant changes in percentage of neurons found to be hubs across different time scales (B and C). Notice that hub sharing values were significantly below the level in the null model for distant time scales in cortical networks (D and E), though not in hippocampal networks (F and G). This indicates that in cortical networks, hub functionality was not just randomly distributed across distant time scales, but rather, hub functionality was distributed into separate groups. doi:10.1371/journal.pone.0115764.s002 (EPS) S3 Fig. Hub Results with Alternative Degree Threshold (10 23 ). Identical analyses to those presented in Figs. 2 and 3, except with lowered degree threshold. For N550 and k53, the threshold of 10 24 (manuscript) implies a degree threshold of 11, while the threshold of 10 23 implies a degree threshold of 9. (A, D-G) Fig. 5. (B and C) Fig. 4. For this threshold, we found few significant changes in percentage of neurons found to be hubs across different time scales (B and C). Notice that hub sharing values were significantly below the level in the null model for distant time scales in cortical networks (D and E), though not in hippocampal networks (F and G). This indicates that in cortical networks, hub functionality was not just randomly distributed across distant time scales, but rather, hub functionality was distributed into separate groups. Step 3 was repeated 5,000 times using jittered data. 5 (Magenta): The p-value for each normalized TE result was calculated as the percentage of the jittered data sets that produced normalized TE values greater than or equal to the result from the data. Any connections above the p-value threshold were removed from the analysis. In this analysis, the p-value threshold was set at p,0.001. Significant connections were then labeled as binary connections in the networks. 6 (Dark Blue): The relationships between firing rate and degree, as well as the correlation between connectivity at different time scales were analyzed. 7 (Yellow): To compensate for differences in N and k between data sets, sub-networks of 50 neurons were randomly created from the binary networks and the average number of connections per neuron (k) was set by retaining only the strongest connections. Topology measures were applied to the sub-networks. 8 (Dark Green): The sub-network results were averaged and analyzed. doi:10.1371/journal.pone.0115764.s004 (EPS)