Shape variation and modularity of skull and teeth in domesticated horses and wild equids

In horses, the morphological changes induced by the process of domestication are reportedly less pronounced than in other species, such as dogs or pigs – although the horses’ disparity has rarely been empirically tested. We investigated shape differences and modularity of domesticated horses, Przewalski’s horses, donkeys and zebras. Mandibular and tooth shape have been shown to be valuable features for differentiating wild and domesticated forms in some mammals. Both mandible and teeth, show a pattern of shape space occupation analogous to that of the cranium, with domesticated horses occupying a similar extension in shape space to that of wild equids. Only cranial shape data exhibit a tendency to separate domesticated horses and Przewalski’s horses from donkeys and zebras. Maximum likelihood model-based tests confirm the horse cranium is composed of six developmental modules, as reported for placental mammals in general. The magnitude of integration in domesticated horse skull was lower than in wild equids across all six cranial modules, and lower values of integration were associated with higher disparity values across all modules. This is the first study that combines different skeletal features for the description and comparison of shape changes in all living equid groups using geometric morphometrics. We support Darwin’s hypothesis that the shape variation in the skull of domesticated horses is similar to the shape variation of all wild equid species existing today. Lower magnitudes of module integration are recovered in domesticated horses compared to their wild relatives.


Background
After being on the verge of extinction, domestication made horses one of today's most common large animal species [1]. All living species of equids belong to the genus Equus, which is divided into the caballine taxa, including domesticated horses (H) and Przewalski's horses (P), and non-caballine taxa, comprising the different donkey (D) and zebra species (Z). Within the caballine taxa, the Przewalski's horses likely represent the sistertaxon to the extinct wild ancestor of domesticated horses [2,3]. Since the early domestication of horses, reproductive isolation promoted divergence by genetic drift and natural selection. Later on, extensive selective breeding to meet human needs for certain behavioural or physiological traits resulted in a wide range of morphological variation [4][5][6]. Horses, like other domesticated species, have been shaped into diverse morphological types through artificial selection to fit specific functions, such as agricultural work, racing, or leisure. Four traditional body types are recognized: draft horses, medium horses, light horses, and ponies [5,6]. Horse disparity was already acknowledged by Charles Darwin, who noted that its intraspecific disparity is larger than the interspecific disparity of equids in general [7]. Darwin proposed that great differences among horse breeds can be found in the skull. Based on its complexity in form and origin, as well as its relation to important vital functions, the skull is the most commonly used marker of morphological variation [8]. The increase in skull shape variation following domestication has been found in different domesticated species such as dogs [9], cattle [10], and pigeons [11], where it has been measured using geometric morphometric methods, and quantified through comparisons of variance in shape space. In addition, potential shape changes in teeth are commonly used in zooarchaeological studies to determine the time and location of domestication [12][13][14][15][16]. Previous studies on skull and tooth morphology and morphometrics show the existence of intraspecific as well as interspecific shape variation in subsets of the equid clade (Table 1). However, the patterning and magnitude of variation in skull shape or tooth shape across all extant equids has so far not been examined. In order to quantify the shape variation in extant equids and to investigate the impact of domestication, we first compare domesticated horses represented by 38 different breeds and encompassing the whole size range of the species, to the extant zebra and donkey species, as well as to the Przewalski's horse ( Table 2) using twoand three-dimensional geometric morphometrics to explore cranial and mandible (3D), and teeth (2D) morphometrical variation.
As a second part of this study, we examine modularity of the equid skull. The concept of modularity [17,18] has attracted much attention in recent years (e.g. [19][20][21][22][23][24][25][26]), having emerged as a quantitative framework for exploring questions relating to facilitation and constraint in morphological evolution, with the goal of understanding how (and by how much) the direction in which variation is generated is biased [27][28][29]. Many studies have quantified patterns of modularity in the cranium using inter-trait correlations extracted from geometric morphometric data (see [30] and references therein) and, taken together, their results have supported a  Przewalski's Horses (P)  5  2  3  3  3  2   Total  216  224  192  198  144  141 common pattern in therian mammals, with some variability in the magnitude of integration among species (e.g. [20,31]). In contrast, comparatively little is known about the lability of modular patterning and integration magnitudes on relatively short time scales and under selective breeding regimes, although changes in magnitude, rather than patterning, have been implicated as the target for selection [32]. Providing examples of selective breeding for features to suit human activities, the study of domestication events offers an opportunity to empirically examine the role of modularity and integration in the generation of cranial disparity over short evolutionary timescales. A modular structure of the skull is expected to be uncovered for horses, as has been found across a wide spectrum of mammals (e.g. [31]), and we assess the fit of our shape data to functional and developmental hypotheses for modular patterning [33] that have been previously tested in the mammalian cranium. According to Darwin's hypothesis, domesticated horses should show more variation in shape than the wild equid species. If this hypothesis is supported, then we should find differences in integration and disparity measures for cranial modules between wild equids and domesticated horses. To do so, we assess whether cranial modules display a) higher or lower magnitudes of integration and b) high or low disparity for domesticated horses and wild equids, and c) we investigate whether there is a relationship between module integration/disparity and regions of the cranium showing most variability in shape among domesticated horses. Our aim is to characterize and quantify the patterning and magnitude of shape variation in the skulls and teeth of domesticated horses compared to the wild species of Equus. We use geometric morphometric methods to: a) test Darwin's hypothesis that the magnitude of intraspecific disparity in horses is larger than the interspecific disparity in equids, b) examine the extent to which domestication influenced tooth shape in equids, c) investigate whether the patterning of shape variation in horse skulls reflects a modular structure, specifically identifying the model best supported for the landmark data by evaluating four modular hypotheses that reflect developmental and functional trait interactions in the cranium, and d) quantify differences in the magnitude of modularity and integration between domesticated horses and wild equids.
Given the well-documented palaeontological record of horses [34], these animals offer the possibility to compare diversification in macroevolutionary and microevolutionary scale like few others. In fact, previous classic studies by Radinsky [35,36] investigated some of the cranial transformations with morphometric approaches typical of that time. Our study expands the studies of the extant species using newer morphometric approaches and provides the basis for future works comparing also the fossil record of the clade.

Shape variation
In the cranial symmetrised shape data, the first three principal components (PCs) account for 43.1% of the total shape variation in the cranium (Fig. 1a). PC 1 (17.6%) tends to separate the caballine equids (H, P) from the non-caballine equids (D, Z). The shape change along PC 1 from negative to positive is dominated by a narrowing and straightening of the skull in combination with an elongation of the occipital-parietal region, represented by the cranial vault (Fig. 1b). PC 2 accounts for 15.2% of the overall observed variation, and is characterized by a general broadening of the skull in combination with an elongation of the occipital-parietal region (cranial vault module) and a shortening of the nasal region (anterior oral-nasal module). Because of the large number of landmarks compare to the relatively small number of specimens we applied a dimensionality reduction of the datasets by selecting the first PCs for all further analyses following Evin et al. [13] (mevolCVP function) that also takes into account unbalanced sample size between groups. The results of the mevolCVP function suggested the reduction of the dataset to the first three PCs in all further analyses for the cranial data. Significant differences among the four groups (Procrustes ANOVA p < 0.001, F = 35.578, based on the 6 first PCs) allowed us to perform a canonical variance analysis (CVA) with a-priori defined groups (H, D, P, Z) resulting in an overall classification accuracy of 78.4% (Confidence interval CI: 60%-95%) when the four groups are analysed, and 98.2% (CI: 96.8%-100%, based on the 17 first PCs, Procrustes ANOVA p < 0.001, F = 32.723) when the Przewalski horse specimens are excluded. In this later analysis, both domestic horses and donkeys could be correctly assigned to their respective groups in 100% of the cases. Zebras were assigned correctly for 95.7% of cases and the remaining 4.3% were grouped within the donkeys. Predictive discriminant analyses detect cranial shape proximities of the five Przewalski's horses with the domestic horses (100% probabilities of identification). Domesticated and Przewalski's horses are most distinct from donkeys and zebras in Procrustes shape space, as measured by Mahalanobis distance (Table 3). Horses occupy a larger Procrustes shape space (53.12%) as determined by Foote's partial disparity, than zebras (26.22%) and donkeys (18.34%). The Przewalski's horses occupy only 2. 26% of the overall shape space. The overall Procrustes variance for the cranium is 0.0023.
The first four PCs of the mandible shape data account for 64.1% of the total shape variation in the mandible. In contrast to the cranium, none of the PCs shows separation between any of the four groups. Specimens of all groups largely overlap in PC shape space ( Fig. 1c and d), therefore we do not discuss this further. Due to , we computed a CVA with the same a-priori groups used in the cranium. The overall classification accuracy was low when the four groups were analysed (H, P, D, Z) 38.4% (CI: 12.5%-87.5%), while a classification accuracy of 87% (CI: 82.4%-91.2%, 17 first PCs, F = 15.681, p < 0.001) was reached when the two Prezwalski horses were excluded. In this later analysis, 88.2% of the donkeys, 87.9% of the horses and 89.6% of the zebras were correctly classified. The two Przewalski's specimens were both classified as horses with probabilities of 100% and 72%. Donkeys, zebras, and Przewalski's horses are similarly spaced from each other (Mahalanobis distance, Table 3). Mandible Procrustes shape space occupation is very similar to the cranial shape space with horses dominating the shape space (Foote's partial disparity 58.98%). Zebras occupy the second largest shape space with 23.69%, followed by donkeys (16.59%) and Przewalski's horses (0.71%). The Procrustes variance of the mandible (0.0024) is slightly higher than in the cranium.
All analysed teeth (L2P, U2P, L3M, and U3M) have a similar partial disparity as all other analysed features: horses showing the highest partial disparity followed by zebras, donkeys, and Przewalski's horses. However, the overall disparity (Procrustes variance) differs between the teeth, with cranial P2 showing the smallest variance (0.008) while all other teeth exhibit a total variance around 0.013. The Mahalanobis distances among the groups calculated for each tooth separately are similar for cranial and mandibular P2, and cranial and mandibular M2. Przewalski's horses and zebras are in all instances most disparate. The P2 is most similar between horses and Przewalski's horses, the cranial M3 is most similar between zebras and horses and the mandibular M3 is most similar between Przewalski's horses and horses followed by the zebra (Table 3).

Modularity
Results from EMMLi indicated that, of the models tested here, the best supported model for modularity was Goswami's mammalian module hypothesis [20] with separate within-module integration and separate between-module integration (model 2d, Additional file 1: Table S1). This model had the lowest Akaike Information Criterion (AICc) value of − 969.34 and a maximum likelihood of 507.97 (Additional file 1: Table S1). Goswami's mammalian module hypothesis contains six modules, these are anterior oral-nasal (AON), cranial base (CB), cranial vault (CV), orbit (ORB), molar (MR), and zygomaticpterygoid (ZP) (Fig. 2, [20]). Disparity and integration values were calculated for these six modules separately for domesticated horses (H) and wild equids (P/D/Z).  Table S2). In both wild equids and domesticated horses there is a general trend of increasing disparity with decreasing magnitude of integration. Further, the AON and ZP modules stand out from the other cranial modules as they both show higher disparity coupled with lower integration values (Fig. 2).

Discussion
The results of our study on skull shape variation support Darwin's hypothesis that the intraspecific disparity in horses is larger than the interspecific disparity in equids. The shape variation of domesticated horses is not only larger than that of the closest relative (Przewalski's horses) but similar to the shape variation of all the wild equid species existing today. Horses do not only dominate the Procrustes shape space when comparing crania, but also comparing mandibles or teeth -showing higher shape variation in all tested elements. The overall classification among domesticated horses (H), Przewalski's horses (P), donkeys (D), and zebras (Z) had an average accuracy of 44.5% (range: 34.5% -78.4%), which lies significantly above the random chance accuracy of 25%. When we excluded the Przewalski's specimens due to their small sample size the average accuracy increased to 89.2% (range: 63.9% -98.2%). The separation of caballine from non-caballine taxa and the clustering within these sister clades of H & P, and D & Z by the Mahalanobis distances, is in accordance with the phylogenetic relationship of equids. All four groups descend from a common ancestor around 4-4.5 myr BP, with zebras and donkeys splitting around 2.8 myr BP and Przewalski's horses and the wild ancestor of today's domesticated horses splitting 38-72 kyr BP [2,3,37]. The first PC of the cranial shape data tended to separate caballine from non-caballine taxa. The accompanying shape differences are dominated by an elongation of the occipital part of the cranium in zebras and donkeys, which previously have been shown to be a distinct character to separate these two taxa from the domesticated horses (for a more detailed morphological description of the skulls of each group see Table 4, Fig. 3, [38,39]).
In contrast to the results of the cranial analysis, we did not find clear group-distinguishing shape differences in the mandible or in any of the investigated teeth. We found the highest Procrustes variance in the mandible data set, very closely followed by the cranium. All four teeth showed a lower Procrustes variance pointing towards less variability, a result also reflected in the low magnitude of disparity for landmarks belonging to the molar (MR) module, probably related to dietary constraints. Our findings on tooth shape differences are congruent with the "long-fuse" hypothesis on teeth by Seetah et al. [14], stating that "shape changes in equids have been modest […] until the development of modern breeds in recent centuries". Cucchi et al. [16], however, found a strong taxonomic pattern in the shape of the enamel folding, allowing for a more distinct taxonomic separation at the species level. These resulting differences are most likely due to the different choice of teeth ( [14]: UP4; [16]: LP3 -LM2) , as suggested by one of the articles [16], different  wear stages, and/or the different methods used (landmarks vs. semilandmarks). Modular patterning of the cranium is well-supported by empirical evidence, representing shared development and functional associations between parts of the cranium, resulting in their parcellation into semi-autonomous units. Consistent with patterns recovered for other placental mammals [20], our results indicate that trait variation in the equid cranium is best supported by a six-module hypothesis. These six modules reflect functional groups: the anterior oral-nasal (AON) and molar (MR) modules represents the primary masticatory apparatus, the zygomaticpterygoid (ZP) module includes the region of attachment for masticatory muscles, the orbit (ORB) module contains the visual sensory organs, the cranial vault (CV) supports and protects the brain, and the cranial base (CB) supports the braincase and is the point of attachment between the skull and axial skeleton. Previous analyses of module disparity and integration for these six modules in a sample of carnivorans provided some support for highly integrated modules showing low disparity, particularly the basicranium (CB), and weakly integrated modules showing high disparity (ORB and ZP) [20,28]. Our results are broadly consistent with this trend (Fig. 2), which is suggestive of strong integration acting to limit trait variation or the direction of response to selection. The ZP module in horses is also found to be weakly integrated and showing high disparity. Among the carnivoran sample, the molar region showed an unusual pattern of high disparity and high levels of integration [28], in our sample the molar module is also recovered as the most highly integrated module but displays the lowest levels of disparity for both wild and domesticated forms. The discrepancy between these results is likely explained by the diversity of dietary habits represented by the carnivoran sample (e.g. hypercarnivores, insectivores, frugivores) in that earlier study, which resulted in a high disparity among the landmarks captured in the molar module. Selection acting on shared developmental and functional processes can result in an uncoupling of trait associations at different levels [40][41][42][43], providing evidence for complex interactions between modularity and selection. Following, it might therefore be expected that domestication events, as examples of selective breeding regimes, could alter patterns of modularity and integration, and these alterations may differ among breeds, acknowledging that the features targeted for selection (e.g. gait, conformation) are likely to differ for some breeds. There exist few empirical tests of this hypothesis and results on dogs are inconclusive, with reports that patterns of integration have remained stable despite the morphological diversification associated with domestication [9,44], but also that high module disparity is associated with greater cranial shape variation in dogs compared to wolves [45]. In contrast to Parr et al. [45] our results show highly similar magnitudes of module disparity among wild and domesticated forms, and instead lower magnitudes of module integration are recovered in domesticated horses compared to their wild relatives. Variability in integration magnitude, as recovered here, rather than patterning has been proposed to underlie cranial diversity in mammals [31,32], such that general conservatism in patterning across mammals may be explained as a product of stabilizing selection on basic developmental processes whereas directional selection could act by altering magnitudes of integration. A recent study conducted simulations to test the role of integration in generating morphological disparity and noted that integration may not affect disparity in morphospace in the way that it is usually measured (as a volume of occupied morphospace or as a measure of dissimilarity), making the relationship between morphospace occupation and modularity results potentially difficult to interpret [46]. That study did not compare shape variation and its partitioning into modules, however our PCA plots indicate that the main axes of shape variation in the equid sample are spread across landmarks located in at least three modules (CV, AON and ZP). Similarly, Parr et al. [45] found shape variance in wild and domesticated dogs to be spread across modules with different magnitudes of integration. It has been suggested that modularity may repartition variance along new directions in morphospace, thereby exploring a greater volume, however the so-far limited empirical evidence appears to raise the question of the extent to which those new directions may be aligned with the axes recovered by eigen-decomposition of shape variables into mathematically orthogonal axes, as happens in ordination techniques such as PCA.
We examined specimens from the following collections: Museum für Naturkunde Berlin (MfN Berlin, Germany), Institut für Haustierkunde (Christian-Albrechts-Universität of Kiel, Germany), Museum für Haustierkunde "Julius Kühn" (University of Halle, Germany), Naturhistorisches Museum Wien (NHW Vienna, Austria), and Museo de la Plata (MLP La Plata, Argentina). The dataset includes all recent species of the genus Equus [37]. Due to inconsistent species assignment within zebras and donkeys across museums, we analysed all zebra (cranium n = 47; mandible n = 48) and donkey (cranium n = 31; mandible n = 33) species as one group, respectively. We further included five crania and two mandibles of Przewalski's horses. The largest number of specimens in our data set belongs to the domesticated horses (cranium n = 133; mandible n = 141) including the following breeds: Ancient Breed (Roman period),  Table 2).
Analyses of cranial, mandibular and teeth size and shape were performed using landmark-based geometric morphometric (GMM) approaches. The crania and mandibles were measured in three-dimension (3D) using a MicroScribe ® MLX6 (Revware, Inc., Raleigh, North Carolina, USA), while the teeth were measured in two-dimension (2D) using high resolution photographs. A total of 62 type I and type II landmarks [47] were collected on the cranium (Table 5, Additional file 3). The dorsal and ventral sides of the crania (Fig. 4) were measured separately and were subsequently combined using three reference landmarks (numbered 1, 2, and 33, Table 5). For the mandible 24 type II landmark coordinates were measured (Table 5, Fig. 4, Additional file 3).
Phenotypic variation of the four studied teeth was assessed using 9 to 12 two-dimensional landmarks (Type II) ( Table 5, Fig. 4, Additional file 4, 5, 6 and 7) following Seetah et al. [48] for the upper teeth that was adapted for the lower teeth. The landmark coordinates were collected on high resolution photographs using TPSDig2 [49]. The photographs were all taken in a standardized manner using a Canon Eos 600d with a Canon EF 24-105 mm f/4 L S USM lens from lateral and dorsal view with a scale bar for size reference.

Geometric morphometric analyses
To eliminate the effects of size, orientation, and scaling we performed General Procrustes Analysis (GPA, [50]), Table 5 Description of the landmarks, including position and type, collected on each cranium, mandible, and tooth; Type I: discrete juxtapositions of tissue types and Type II: maxima of curvature or other local morphogenetic processes [37] Table 3); Landmarks on the g upper 3rd molar h upper 2nd premolar i lower 3rd molar j lower 2nd premolar of a zebra (specimen MfN 70,299) in occlusal view (for a detailed description of the landmarks see Table 3) which translates all specimens' coordinates so their centroid coincides, scales them to unit centroid size, and rotates them to minimize squared summed distances between matching landmarks. With the cranium and the mandible being symmetric objects, only the symmetric component of shape was analysed in subsequent procedures [51]. The Procrustes scores retained from the GPAs for each skeletal feature were subjected to Principal Component Analyses (PCA). Differences in shape among the four different equid taxa were explored using Procrustes analysis of variance (ANOVA) [52]  Due to the high dimensionality of the datasets, a dimensionality reduction was performed prior to the ANOVA and CVA analyses using the mevolCVP function in R [12]. The mevolCVP function helps to identify the appropriate number of dimensions (first PC scores) which maximize the cross-validated percentage in the subsequent analyses using leave-one-out cross-validated linear discriminant analyses (LDA) (for a more detailed explanation see [53]). We then only used the predetermined number (N) of first PC scores to test for differences in shape among the defined groups using Procrustes ANOVA. If the Procrustes ANOVA showed significant results, we performed a Canonical Variance Analysis (CVA) to identify the shape features which best characterize the different groups. Because sample sizes of Przewalski's horses were relatively small, CVA analyses were performed with and without them. When they were excluded, predictive CVAs were used to assess the proximity of these specimens with the three remaining groups (identification were based on resampled designs [15]). We determined distances among the groups by calculating squared Mahalanobis distances (D 2 ), which represents the distance of one group mean to another group mean in standard deviations.
Further, we analysed the morphological disparity (as Procrustes variance, [52]) which is the occupied space of all specimens together in multidimensional shape space [54]. First, we calculated the grand mean in unit Procrustes variances. Then we inferred and compared Foote's partial disparity (PD) [54,55] to the grand mean. PD was calculated for each group (H, P, D, Z) separately, and for wild equids (D, Z, P) and domesticated horses (H). To do so, the residuals from the regression of shape across all specimens were used and the squared residual lengths were summed over either group mean. The resulting group wise Procrustes variances were multiplied by the number of samples per group divided by total sample size minus one. We then calculated the contribution in percent of each group to the overall disparity.
Analyses were conducted using R [56] in RStudio (v.1. 0.136) and related R packages [52,57,58] (R script is available upon request). The analyses were computed separately for the cranium, the mandible, and each of the four teeth.

Modularity analyses
Cranial landmarks for the total sample (wild equids and domesticated horses) were tested for modular structure using 17 models. Wild equids and domesticated horses were pooled because modular patterning has been demonstrated to be stable across placental mammals [20,31,32]. With the exception of the simplest model (= no modularity), cranial landmarks were subdivided into modules following a priori hypotheses for modular patterning. These were: 1) Tissue origin hypothesis (neural crest vs paraxial mesoderm derived elements [33]), 2) adult module hypothesis [20], 3) Cheverud's functional module hypothesis [59], and 4) horse-specific hypothesis [35,60] (see Additional file 8: Table S3). Hypotheses #1-3 have previously been tested on a macroevolutionary scale in mammals whereas hypothesis #4 tests the face and neurocranium as two separate units based on previously recovered growth pattern differences of the face relative to the neurocranium in horse evolution [35,36,60]. For each of these competing hypotheses (#1-4) we compared the fit of our data to different model structures, allowing for variation in correlation within and between modules. As such, each hypothesis was evaluated for four variants (a-d), these were a) same withinmodule integration and same between-module integration, b) same within-module integration and separate between-module integration, c) separate within-module integration and same between-module integration, and d) separate within-module integration and separate between-module integration (see Additional file 8: Table S3). The fit of the 17 models (4 hypotheses × 4 variants [a-d, above] plus 'no modularity' hypothesis) was evaluated using the EMMLi package version 0.0.3 [22] in R, using a coordinate (Procrustes aligned) correlation matrix based on absolute values of correlations as input. EMMLi is a maximum likelihood approach that allows for the direct comparison of models of mixed complexity, and outputs a corrected Akaike Information Criterion (AICc) value and an AICc difference (dAICc), which can be used to assess the fit of the model to the data [22]. The best supported model of modularity (lowest AICc and smallest dAICc) recovered from the EMMLi analysis was chosen for further calculations of module disparity and integration and comparisons between wild and domesticated forms. The cranial landmark matrix was subdivided into matrices for domesticated horses and wild equids. The matrices for domesticated horses and wild equids were each further subdivided into modulespecific landmark sets (e.g. orbit module domesticated horses, orbit module wild equids) and subject to GPA. For each module, disparity of the landmarks within that module was defined as maximum Procrustes distance following previous studies (e.g. [45]), and was calculated using Procrustes distances between the mean shape landmark configuration and the landmark configuration of each specimen. Disparity calculations were performed using the Evomorph package version 0.9 [61] in R. For each module, integration of the landmarks within that module was calculated using relative eigenvalue standard deviation (i.e. eigenvalue dispersion), following calculations detailed in [62]. This measure assesses the variance of extracted eigenvalues, which would be maximal when all variation in the data is found in a single dimension (i.e. complete integration) and zero when all eigenvalues are equal (i.e. no integration [30]). Therefore, large values for eigenvalue dispersion reflect strong integration between the landmarks in a module. Eigenvalue dispersion has been shown to be independent of trait number and highly correlated with the mean squared correlation coefficient [32].

Conclusion
We described and compared shape changes in various skeletal features among extant equid species using geometric morphometrics. Our results support Darwin`s hypothesis that shape variation in the skull of domesticated horses is similar to the shape variation of all wild extant equid species. Our study further shows that lower magnitudes of integration among six cranial modules are found in domesticated horses compared to their wild relatives. Future research could address the relation between integration and disparity, investigating the relation between the two during the domestication process of diverse species.

Additional files
Additional file 1: Table S1. List of cranial landmarks and their placement within the module configurations tested in this study. Four modularity hypotheses were tested, see text for further details. Modules for each hypothesis are as follows; 1.

Funding
This work was supported by SNF grant 31003A_169395 to MRS-V. Support for the collection visit at the NHW Vienna, Austria was granted by SYNTHESYS funding AT-TAF-5786 "Morphological disparity and ontogenetic allometry in domesticated horses" to LH. LABW is supported by the Discovery Program of the Australian Research Council (DE150100862).

Availability of data and materials
The datasets used and/or analysed during the current study are available in the supplementary information to this article.