Introduction

Timor, an island three times the size of Hawai’i, is one of the Lesser Sunda Islands, whose western half falls within the Nusa Tenggara Timur province of the Republic of Indonesia. Unlike its more famous eastern cousin (the young country of East Timor), West Timor has not been the subject of extensive genetic study. Nevertheless, as one link in the island chain that connects mainland Asia with Australia and the Pacific, Timor acted as a stepping-stone in the proposed Southern route of early human migration from mainland Asia (Sunda Land) to the landmasses of New Guinea and Australia (Sahul Land). Based on archaeological evidence, the earliest modern human colonization in Timor dates to over 42 000 years ago.1 The island has been radically affected by more recent human migration as well: the Neolithic era saw substantial changes in tools, technology, trade and subsistence. One hypothesis suggests that the spread of farmers, ostensibly from Taiwan, carried the Austronesian languages they spoke through Island Southeast Asia and Melanesia, and later out into Polynesia, in a process known as the Austronesian expansion.2 This movement may have reached eastern Indonesia via the Philippines and Sulawesi rather than through western Indonesia.3, 4 Indeed, Neolithic sites in Timor are extremely rich in shell artifacts and decorative pieces that are rare in Island Southeast Asia outside of Taiwan and the Philippines,5 thus suggesting a close affinity between the eastern series of islands that links Taiwan, the Philippines and Timor. Previous studies of maternal mitochondrial DNA (mtDNA) lineages imply similar patterns of sharing in the biological record.6

Linguistically, Timor hosts two very different language families: Austronesian and Trans-New Guinean (‘Papuan’) languages. Two Austronesian languages—Atoni (Dawan or Uab Meto) and Tetun—dominate west Timor, whereas at least 14 distinct languages, both Austronesian and Trans-New Guinean, are spoken in the east. The Austronesian expansion left its mark in other ways as well. Matrilocal residence systems are thought to have dominated ancestral Austronesian societies,7, 8, 9 and despite a widespread regional shift to patrilocal residence, some populations in West Timor still practice matrilocality today.10 Although it is still a matter of contention whether matrilocality in West Timor forms a matrilineal descent system (where clan membership is traced exclusively through female lines to a founding ancestor)7 or is instead simply the practice of matrilocal residence, these communities still clearly exhibit sex-specific dispersal in which populations are regulated by postmarital residence rules.8

The role of Timor as a major Portuguese colonial center simply added to its complex history. Sandalwood, honey, wax and, importantly for their genetic consequences, slaves were among major historical trade commodities.11 The traditional kingdom of Wehali ruled parts of central Timor during the historical period, with Laran as its ritual center. The Wehali kingdom, the religious and political core of the Timor world, was also influential in propagating marriage alliances to outlying regions, thus likely stimulating gene flow to the provinces. Today, people in the Wehali region speak an Austronesian language and still practice matrilocality.

Although Timor is an important regional center, few studies have explored the genetic profile of its populations. Souto et al.12 studied 15 autosomal short tandem repeats (STRs) in East Timor, once a Portuguese colony and now an independent country. They found high levels of genetic diversity with close affinity to populations along the coasts of New Guinea.13 This result is consistent with recent broad-scale regional studies of Y chromosome and mtDNA diversity across the Indonesian archipelago6, 14 that characterize the islands of Wallacea, including Timor, as having closer links with populations in Papua New Guinea and island Melanesia than with mainland Asia.

Yet Timor is notable for its relative absence from these large regional surveys. Its complex history, the presence of multiple language families on a relatively small island and the persistence of ancestral social systems give Timor several unique characteristics. To study the genetic signature of its people, and to gain a better understanding of the effects of cultural practices such as language and residence rules on genetic diversity, we present a detailed genetic characterization of the Belu regency of West Timor, focusing on the historical princedom of Wehali.

Materials and methods

Samples and ethics

We studied populations from five districts in the Belu regency of West Timor, the Indonesian territory of the island of Timor. Permission to conduct research in Indonesia was granted by the Indonesian Institute of Sciences. Buccal swabs were collected from 529 consenting, closely unrelated and seemingly healthy individuals from 13 villages by JSL, with the assistance of Indonesian Public Health clinic staff. Sample collection followed protocols for the protection of human subjects established by both the Eijkman Institute and the University of Arizona institutional review boards. Participant interviews confirmed ethnic, linguistic and geographic classifications for at least two generations into the past. JSL gathered information on the languages spoken in each community, as well as video-recording 200-word Swadesh lists for later transcription and linguistic analysis. Table 1 lists the populations studied in West Timor, whereas their geographical locations are illustrated in Figure 1.

Table 1 Timor population samples in the present study
Figure 1

Location of populations studied in West Timor: (1) Fatuketi, (2) Umaklaran, (3) Tialai, (4) Raimanawe, (5) Kamanasa, (6) Kateri, (7) Kakaniuk, (8) Laran, (9) Kletek (Kletek Rainan, Kletek Suai and Kletek Wefatuk), (10) Umanen Lawalu and (11) Besikama.

Populations in West Timor mostly speak Austronesian languages as their first language. In the regency of Belu, most speak Tetun (North/Upper Tetun or South/Lower Tetun) (Table 1), although there are small clusters of non-Tetun speakers, including non-Austronesian Bunak-speaking groups.10 These latter communities are found in the area that borders East Timor, where there is more language diversity and a greater number of non-Austronesian-speaking populations (Supplementary Figure 1).

DNA extraction and genetic screening

Full experimental details are provided in Supplementary Text 1. Note that we report three newly discovered Y-chromosome markers that resolve several previously uncharacterized haplogroups in this region.

Mitochondrial DNA hypervariable region I sequences have been deposited in GenBank (accession numbers KJ936094–KJ936619). Y-chromosome STR data are provided as Supplementary Data Set 1.

Statistical and population genetic analyses

Molecular diversity, population structure estimates and genetic distances between populations were calculated using Arlequin v. 3.11 (http://cmpg.unibe.ch/software/arlequin3).15 Pairwise genetic distances between populations were computed as the linearized value FST/(1−FST).16, 17 To evaluate the correlation among linguistic, geographic and genetic distances, Mantel tests were performed in Arlequin.
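The linearization and the Mantel permutation test described above can be sketched in a few lines of Python. This is an illustrative sketch only, not Arlequin's implementation: the function names, the four-village toy matrices and the default of 999 permutations are assumptions introduced here, and the p-value returned is one-sided.

```python
import random


def linearize_fst(fst):
    """Linearized genetic distance FST / (1 - FST)."""
    if not 0.0 <= fst < 1.0:
        raise ValueError("FST must lie in [0, 1)")
    return fst / (1.0 - fst)


def upper_triangle(matrix, order):
    """Flatten the strict upper triangle after relabelling rows and columns."""
    n = len(order)
    return [matrix[order[i]][order[j]] for i in range(n) for j in range(i + 1, n)]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def mantel(a, b, permutations=999, seed=0):
    """Return (r, one-sided p) for two symmetric distance matrices."""
    rng = random.Random(seed)
    identity = list(range(len(a)))
    flat_a = upper_triangle(a, identity)
    r_obs = pearson(flat_a, upper_triangle(b, identity))
    hits = 1  # the observed arrangement counts as one permutation
    for _ in range(permutations):
        order = identity[:]
        rng.shuffle(order)  # relabel the populations of matrix b
        if pearson(flat_a, upper_triangle(b, order)) >= r_obs:
            hits += 1
    return r_obs, hits / (permutations + 1)


# Hypothetical pairwise FST values and geographic distances (km), four villages.
fst = [[0.00, 0.02, 0.05, 0.10],
       [0.02, 0.00, 0.03, 0.08],
       [0.05, 0.03, 0.00, 0.04],
       [0.10, 0.08, 0.04, 0.00]]
geo = [[0, 5, 12, 30],
       [5, 0, 8, 25],
       [12, 8, 0, 15],
       [30, 25, 15, 0]]

genetic = [[linearize_fst(v) for v in row] for row in fst]
r, p = mantel(genetic, geo)
print(f"Mantel r = {r:.3f}, p = {p:.3f}")
```

Permuting population labels, rather than individual matrix cells, keeps the dependence among pairwise distances intact, which is the reason a Mantel test is used instead of an ordinary correlation test.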

Median-joining networks were built using Network v. 4.5.1.6 (Fluxus Engineering; http://www.fluxus-engineering.com).18 Haplogroups were tentatively dated with the ρ statistic method19 using a rate of one mutation every 19 171 years.20 Dates are intended only as a rough guide for relative haplogroup ages.21
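As a worked illustration of the ρ calculation, the sketch below averages the number of mutational steps separating each sampled lineage from the root haplotype of a network and scales the result by the rate quoted above. The step counts are hypothetical and the function names are illustrative; confidence intervals, which a full analysis would report, are omitted.

```python
YEARS_PER_MUTATION = 19_171  # one mutation every 19 171 years, as quoted in the text


def rho(step_counts):
    """Mean number of mutational steps from the root haplotype.

    step_counts maps 'steps from root' to the number of sampled individuals
    whose haplotype lies that many steps away.
    """
    n = sum(step_counts.values())
    if n == 0:
        raise ValueError("no individuals supplied")
    return sum(steps * count for steps, count in step_counts.items()) / n


def rho_age(step_counts, years_per_mutation=YEARS_PER_MUTATION):
    """Convert rho into an approximate age in years."""
    return rho(step_counts) * years_per_mutation


# Hypothetical haplogroup: 10 individuals carry the root haplotype,
# 6 are one step away and 4 are two steps away.
example = {0: 10, 1: 6, 2: 4}
print(rho(example))      # (0*10 + 1*6 + 2*4) / 20 = 0.7
print(rho_age(example))  # about 13 420 years
```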

Differences in mtDNA and Y-chromosome diversity between populations were assessed with an analysis of molecular variance (AMOVA) implemented in Arlequin. A measure of interlocus differentiation,22 standardized for different mutation rates, was calculated using code implemented in R (available from the authors on request).23

For autosomal and X-chromosome analyses, we used ancestry informative markers, comprising 37 single-nucleotide polymorphisms (SNPs) selected because of their high information content to discriminate Asian-Melanesian ancestry (for marker details, see Cox et al.9). The two parental populations are Han Chinese and Papua New Guinea highlanders, representing the spectrum of ancestry from Asian to Melanesian. The Bayesian clustering algorithm implemented in STRUCTURE (v. 2.3.4)24 was employed to determine differentiation among populations and compare them with putative parental groups (southern Han Chinese and highland Papua New Guineans) from which our ancestry informative markers were initially chosen.9 We implemented a clustering process as described by Hubisz et al.25 by providing prior information about sampling locations to improve the detection of population structure.
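To make the logic of ancestry informative markers concrete, the sketch below estimates one individual's Asian ancestry fraction by maximum likelihood over a grid, given reference-allele frequencies in the two parental populations. This is a deliberately simplified stand-in and not the STRUCTURE algorithm, which is a Bayesian clustering method; the allele frequencies, genotypes and function names are hypothetical.

```python
import math


def log_likelihood(q, genotypes, freq_a, freq_b, eps=1e-9):
    """Log-likelihood of one individual's genotypes given ancestry fraction q.

    genotypes: counts (0, 1 or 2) of the reference allele at each SNP.
    freq_a, freq_b: reference-allele frequencies in parental populations A and B.
    The binomial coefficient is omitted because it does not depend on q.
    """
    total = 0.0
    for g, pa, pb in zip(genotypes, freq_a, freq_b):
        p = q * pa + (1.0 - q) * pb      # expected allele frequency under admixture
        p = min(max(p, eps), 1.0 - eps)  # guard against log(0)
        total += g * math.log(p) + (2 - g) * math.log(1.0 - p)
    return total


def estimate_ancestry(genotypes, freq_a, freq_b, steps=100):
    """Grid-search estimate of the fraction of ancestry from population A."""
    grid = [i / steps for i in range(steps + 1)]
    return max(grid, key=lambda q: log_likelihood(q, genotypes, freq_a, freq_b))


# Hypothetical frequencies at four markers chosen to differ strongly
# between the two parental populations.
freq_asian = [0.90, 0.80, 0.95, 0.10]
freq_melanesian = [0.10, 0.20, 0.05, 0.90]

print(estimate_ancestry([2, 2, 2, 0], freq_asian, freq_melanesian))  # Asian-like genotype
print(estimate_ancestry([1, 1, 1, 1], freq_asian, freq_melanesian))  # heterozygous throughout
```

Markers whose frequencies differ sharply between the parental groups make the likelihood surface steep, which is why a small panel of well-chosen SNPs can still separate the two ancestries.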

Linguistic analyses

Collection of language data and subsequent linguistic classifications were carried out as described in Lansing et al.26 The ALINE algorithm was employed to obtain quantitative distance metrics between all pairs of languages in the study.27

Results

We screened 529 individuals from 13 communities in West Timor for mtDNA, Y chromosome, X chromosome and autosomal diversity. These communities were drawn from five different districts in the regency of Belu (Table 1 and Figure 1). The Y-chromosome and mtDNA data differ in marked ways: genetic diversity is more uniform for the Y chromosome, ranging from 0.95 to 1.00 (Supplementary Table 1), in contrast to mtDNA diversity that exhibits a much wider range from 0.70 to 0.98 (Table 2) (but see values normalized for the very different mutation rates of these loci below). MtDNA diversity was found to be lowest in the vicinity of Wehali in the district of Malaka Tengah. The six lowest values of mtDNA diversity were observed here, specifically in populations from Kletek (Kletek Rainan, Wefatuk and Suai), Kateri, Umanen Lawalu and Kakaniuk (Table 2). Because of its fast mutation rate, the genetic diversity of mtDNA responds quickly to changes in the size of populations (such as growth and contraction). Summary statistics such as Fu’s Fs and Tajima’s D show that Timor has experienced at most only weak population growth (Table 2), with statistically significant signals found for only four villages (Raimanawe, Kamanasa, Laran and Besikama). This is consistent with earlier studies that suggest population sizes have been broadly static, with relatively minor increases and declines across Indonesian prehistory, including in Timor.28

Table 2 Molecular diversity indices and growth summary statistics for populations in West Timor based on mtDNA
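Diversity values of the kind summarized in Table 2 are commonly computed with Nei's unbiased estimator of haplotype diversity, h = n/(n − 1) × (1 − Σ p_i²), where p_i is the frequency of haplotype i in a sample of n individuals. The sketch below assumes this estimator; the text does not state which formula Arlequin applied, and the haplotype labels are hypothetical.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype (gene) diversity for a list of haplotype labels."""
    n = len(haplotypes)
    if n < 2:
        raise ValueError("at least two individuals are required")
    sum_sq = sum((count / n) ** 2 for count in Counter(haplotypes).values())
    return n / (n - 1) * (1.0 - sum_sq)


# Every individual carries a different haplotype: diversity is maximal.
print(haplotype_diversity(["h1", "h2", "h3", "h4"]))
# One haplotype is shared by half the sample: diversity drops.
print(haplotype_diversity(["h1", "h1", "h2", "h3"]))
```

A value near 1.00 therefore means that almost every sampled individual carries a distinct haplotype, whereas lower values indicate that a few haplotypes are shared by many individuals in a village.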

In our samples, 24 Y-chromosome haplogroups (shown in Supplementary Figure 2) were observed (Supplementary Table 2). The C-RPS4Y paragroup has been associated with very early population movements into the Indonesian archipelago.14 It has a patchy distribution throughout Southeast Asia and Indonesia, and is absent or present at low frequency further east in Melanesia and Polynesia (Supplementary Table 3).14 In Timor, this lineage is most frequent in Umaklaran (12.2%) and Kletek Rainan (11.8%) (Supplementary Table 2).

Y-chromosome haplogroups with putative Melanesian origins29, 30 (C-M38, M-P34 and S-M254) account for nearly 40% of Y-chromosome lineages in Timor (38.8%). Based on the distribution of haplotypes, Y-STR diversity and coalescent time estimates, it has been proposed that haplogroup C-M38 arose in Melanesia.29 C-M38 alone accounts for more than a quarter of Y chromosomes in Timor (26.6%) and reaches highest frequency in eastern Indonesia rather than further east (Supplementary Table 3). Interestingly, C-M38 is the most common haplogroup in Kletek Wefatuk (53.3%, Figure 2), a relatively new village in Wehali. Its inhabitants only moved to this area from East Timor around 100 years ago (JSL, unpublished survey data). C-M208, a subgroup of C-M38, is the ancestor of the P33 lineage found in Polynesians, and was previously thought to be limited to coastal New Guinea, island Melanesia and the Pacific islands.14 However, we detected C-M208 at low frequency in Timor (0.4%).

Figure 2

Frequencies of Y-chromosome haplogroup C-M38. Populations: (1) Fatuketi, (2) Umaklaran, (3) Tialai, (4) Raimanawe, (5) Kamanasa, (6) Kateri, (7) Kakaniuk, (8) Laran, (9) Kletek Rainan, (10) Kletek Suai, (11) Kletek Wefatuk, (12) Umanen Lawalu and (13) Besikama.

Surprisingly, the Y-chromosome paragroup O-M122 was not found in Timor, even though it is often associated with the Austronesian migration. However, we did observe several of its derived forms: O-M134, O-JST002611 and O-P201 (Supplementary Table 2). Lineage O-P201 has a wide geographic distribution, with close connections to Indonesia, the Philippines, Taiwan and Oceania.14 Another Asian Y-chromosome lineage, O-M119, dominates indigenous Taiwanese populations29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39 that typically carry the sub-haplogroups O-P203 and O-M110 rather than paragroup O-M119. Although these lineages are infrequent in Timor (O-M119 at 2.4% and O-P203 at 1.2%), O-M110—a haplogroup with putative Taiwanese origins—was found at 8%.

We detected two recently reported K lineage markers in Timor:40 K-P397 and K-P336. Three new haplogroups were also screened: C-P355, C-P343 and S-P377. These markers substantially improve the resolution of the C and K lineages in Island Southeast Asia, and haplogroups K-P397 and C-P355 alone account for almost a quarter of Y chromosomes in West Timor (22.4%). These lineages are shared with other islands in eastern Indonesia, as is P-P295 (10.9% in Timor) that has only been detected in the north–south chain of islands in the east of Island Southeast Asia (Timor, Sumba, Sulawesi and the Philippines; Supplementary Table 3).

A small number of Timorese Y chromosomes belong to haplogroups other than lineages C, K, M, S and O. A total of 0.6% belong to haplogroups D-M116, E-P1 and Q-P36, with the D group possibly reflecting wartime connections with Japan. The Timorese also share haplogroup F-P14 (1.8%) with other Lesser Sunda Island populations and Sulawesi. Haplogroup J-M172, potentially a signal of Arab trader contact, is found at very low frequency (0.4%) in Timor and is also shared with Sulawesi (0.6%).

A total of 31 mtDNA haplogroups were identified in Timor, with all lineages falling into macrohaplogroups M (45.4%) and N (54.6%) (Supplementary Figure 3). The predominant haplogroups are F1a4 (19.3%) and various Q lineages (14.9%; Supplementary Table 4). F1a4 is common in eastern Indonesia, but nearly absent in the west,6 and connects Indonesia with the Philippines.6, 41 Q lineages are found predominantly in New Guinea and Island Melanesia (Supplementary Table 5).30, 42

Interestingly, the high frequency of haplogroup Q in Timor is linked with a correspondingly low frequency of F1a4 and vice versa (Supplementary Figure 4). Haplogroup F1a4 is found at highest frequency in the Wehali area, where we also observed low levels of haplogroup Q.

Regionally, haplogroup P is often found associated with haplogroup Q, but P is relatively infrequent in Timor (0.9%). Two patrilocal populations have the highest Q frequencies in Timor: Umaklaran at 31.7%, followed by Fatuketi at 25.7%. Tialai, a matrilocal population located close to Umaklaran and Fatuketi, also carries a high frequency of haplogroup Q (25.0%). This latter case is perhaps less surprising as Tialai is inhabited by people who speak Bunak, a Trans-New Guinea language, whereas the other two populations primarily speak an Austronesian language as their mother tongue.

The Asian mtDNA lineage known as the Polynesian Motif is also found at moderate frequency in West Timor (7.4%), consistent with its potential origin in eastern Indonesia43 and high frequencies on neighboring islands (but see Cox44).

Finally, despite Timor’s geographical location as a stepping-stone to Australia, the Timorese show little genetic affinity with Aboriginal Australians (Supplementary Tables 3 and 5). The possible exception is mtDNA haplogroup Q2, which Hudjashov et al.45 suggest may reflect a secondary expansion into Australia, although one that still occurred 30 000 years ago. We have not identified more recent connections.

To identify patterns of variation among Timorese populations, we performed an analysis of molecular variance. The proportions of variance found within populations for mtDNA hypervariable region I (92.0%) and mtDNA SNPs (89.8%) are weakly, but consistently, lower than those for Y-chromosome STRs (97.5%) and SNPs (95.9%). This suggests that Timorese men may have dispersed more widely than women, as expected in communities that practice matrilocality (that is, women stay in their natal village, whereas men are given or sent away to surrounding communities). Moreover, when standardized for the 400-fold higher mutation rate of the Y chromosome (on the order of 10⁻⁵ mutation events/STR/year)46, 47, 48, 49 relative to mtDNA (on the order of 10⁻⁷ mutation events/base pair/year),20 Y-chromosome SNPs show notably lower population structure than mtDNA SNPs (Supplementary Table 6), thus further suggesting that men have moved more widely than women. Consistent with this finding, when the 13 populations were divided into two groups (patrilocal and matrilocal), the variance among groups was higher for mtDNA markers than for the Y chromosome (Table 3). Again, different social behaviors for men and women, in this case the practice of matrilocality, appear to have affected patterns of genetic variation differently in males and females.

Table 3 Analysis of molecular variance (AMOVA) for subsets of Timor populations

To explore wider regional relationships, multidimensional scaling was performed on seven Timor populations that are predominantly monolingual (Figure 3; note that some populations are strongly bi- or multilingual, thus precluding many of the following language-paired analyses; see Table 1 for details). Maternal lineages consistently show that Umanen Lawalu and Kakaniuk (both from the Wehali region) cluster together, separated from the other five populations: Kamanasa, Fatuketi, Raimanawe, Besikama, and Tialai. Umanen Lawalu and Kakaniuk even vary from their close geographical neighbor in the Wehali area, the village of Kamanasa. However, Umanen Lawalu and Kakaniuk are both older communities in the region, whereas Kamanasa is a new village whose inhabitants arrived from East Timor in 1911 following a period of civil war.

Figure 3

Multidimensional scaling (MDS) plot of seven populations in West Timor based on Y-chromosome single-nucleotide polymorphisms (SNPs), Y-chromosome short tandem repeats (STRs), mitochondrial DNA (mtDNA) SNPs, mtDNA hypervariable region I sequences, geography and language distances (a patrilocal village, Fatuketi, is shown as an open circle; the other villages are matrilocal).

Conversely, Y-chromosome plots show that Fatuketi is the only village that clusters far away from other populations (Figure 3). The fact that this village is patrilocal, with men remaining in their home village, might contribute to this outlier pattern. The language data present a different pattern again (Figure 3), with Tialai, the village whose inhabitants speak Bunak, a Trans-New Guinea language, unsurprisingly separated from the remaining Austronesian-speaking communities. Nevertheless, no statistically significant correlations were observed between genetic diversity, language or geography (Supplementary Table 7).

Asian admixture rates across the genome were determined using a suite of ancestry informative markers:9 18 from the autosomes and 19 from the X chromosome. Timor (64% Asian ancestry) fits with regional expectations (Supplementary Table 8), falling both geographically and in terms of Asian ancestry between Flores/Lembata (66%) and Alor (51%).9 Within Timor, Asian ancestry exhibits a surprisingly large range among populations (61–72%) (Supplementary Table 9), but there is no evidence of subdivision by language or social system (Figure 4). Nevertheless, consistent with previous research on Asian-Melanesian ancestry across the Indo-Pacific region,9, 50 we found that Asian admixture is biased toward women (that is, Asian ancestry is higher in the X chromosome (70%) relative to the autosomes (58%); Supplementary Table 9). Interestingly, such a large difference in admixture rates between the X chromosome and autosomes (11%) is only observed elsewhere on Sumba and in Vanuatu (Supplementary Table 8), thus suggesting that differences in male/female dynamics were amplified in Timor during, and likely continuing after, the initial admixture event.51

Figure 4

Admixture plot produced by STRUCTURE. Putative parental populations, (1) Southern Han Chinese (green) and (4) Papua New Guinea Highlanders (red), appear as clearly differentiated ancestral groups. Populations in West Timor are presented as sets of (2) matrilocal and (3) patrilocal villages. Despite variation in ancestry components among individuals, little difference in admixture proportions is observed between communities.

Tialai, the only population in our study whose inhabitants speak a non-Austronesian language (Bunak), had the lowest Asian admixture rate in the X chromosome (65%). Curiously, the bias in Asian admixture rates between the X chromosome and the autosomes is also lowest in Tialai. Conversely, the highest bias is found in Umanen Lawalu, in the Wehali region (Supplementary Table 9). This finding further suggests that matrilocality, which persists to the present in Wehali, may have been a driving force behind this admixture bias in the X chromosome and autosomes.
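The direction of the X-versus-autosome contrast can be illustrated with a simple toy model. Because two-thirds of X chromosomes but only half of autosomes are carried by females, one can write X = (2f + m)/3 and A = (f + m)/2, where f and m are the Asian ancestry fractions among female and male contributors, and solve for f and m. This model is an assumption introduced here for illustration (equal numbers of males and females, no selection, a single set of proportions); it is not the analysis performed in the study, and the function name is hypothetical.

```python
def sex_specific_ancestry(x_ancestry, autosomal_ancestry):
    """Solve X = (2f + m) / 3 and A = (f + m) / 2 for f and m.

    Returns (female_fraction, male_fraction). Values outside [0, 1] indicate
    that the toy model cannot reproduce the observed pair of ancestries.
    """
    female = 3.0 * x_ancestry - 2.0 * autosomal_ancestry
    male = 4.0 * autosomal_ancestry - 3.0 * x_ancestry
    return female, male


# Island-wide values quoted in the text: 70% on the X chromosome, 58% on autosomes.
female, male = sex_specific_ancestry(0.70, 0.58)
print(f"female contributors: {female:.2f}, male contributors: {male:.2f}")
```

With the island-wide values, the toy model returns roughly 0.94 for female contributors and 0.22 for male contributors. These numbers only illustrate how a modest X-autosome difference can imply a strong sex bias; they are not estimates from the study.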

Discussion

Comparison of uniparental and biparental genetic markers reveals the sheer complexity of prehistoric Timor, including periods of population isolation, long-distance contact and the effect of social systems. Mitochondrial DNA, Y chromosome, autosomal and X-linked lineages reflect different aspects of this history, but all emphasize a substantial contribution from the first settlers to reach Timor. Mitochondrial lineages P and Q, and Y-chromosome lineages C, M and S, are all associated with the first colonization of the Indonesian archipelago by modern humans 50 000 years ago14 (Supplementary Tables 10 and 11). In the autosomes and X chromosome, a little over a third of the average Timorese genome (34%) traces back to these first settlers too. Traces of the first settlers are also found in the languages spoken in Timor, where hints of ancestral languages (unrelated to either Austronesian or the Trans New Guinean language group) are preserved through loan words borrowed in the modern languages.52

And yet the primary story told by a range of genetic loci is one of more recent contact with settlers ultimately having Asian origins. Mitochondrial lineages B and F, and Y-chromosome lineage O, attest to considerable mixing with more recent Asian immigrants. This contact was neither minor nor insubstantial. Two-thirds of the average Timorese genome today (66%) has an ultimate, and relatively recent, Asian origin. Despite its current perception as an isolated outpost, Timor was once a major contact zone in eastern Island Southeast Asia.

Exactly how and when this contact occurred remains unclear. Certainly, some connection with the spread of Austronesian languages remains a major contender. Analysis of complete mtDNA genomes suggests that the Austronesian expansion is responsible for much of the dominance of Asian maternal ancestry in Oceania, whereas contact with earlier groups is demonstrated by the ongoing presence of more locally ancient lineages.53 A recent analysis of genome-wide SNPs revealed that admixture between Asian and Melanesian sources began in eastern Indonesia 4000 years ago, consistent with a mid-Holocene period of expansion in Neolithic lifeways and the spread of Austronesian languages.50 Yet, the process of transitioning to a farming lifestyle seems to have been a complex and lengthy one. Timor’s neighbor, New Guinea, has a long agricultural tradition based on root crops and, at lower altitudes, bananas.54 Bananas are still one of the main agricultural products, and an important source of income, for people living in the fertile plain of Wehali Wewiku in West Timor,10 and the dispersal of banana cultivars west from New Guinea may well have predated other elements of the Neolithic package that are presumed to have been introduced through later Asian contact.55, 56 Indeed, this westward dispersal of bananas into eastern Indonesia may be associated with the spread of Trans-New Guinea languages into Timor, a Papuan language family that is thought to be a relatively recent translocation from western New Guinea.57 Proposed Y-chromosome markers putatively associated with this expansion (M-P34 and S-M254) occur at moderate frequency in Timor, and have been dated to 6000–10 000 years ago,34, 58 thus likely pre-dating the Austronesian expansion.

Nevertheless, Asian ancestry is a dominant feature of modern Timorese. Although questions have been raised about the provenance of the language family, the spread of Austronesian languages must still have been a defining moment in Island Southeast Asian prehistory. With the family’s deepest branches in Taiwan, Austronesian languages are spoken almost without exception across modern Island Southeast Asia. All but one of our West Timor populations speak Austronesian languages, although Trans-New Guinea languages are more prevalent in East Timor. These linguistic hints are reinforced by genetic signals. The Timorese carry mtDNA lineages (such as B4a, B4b, B4c, B4c1b3, B5a, B5b, B5b1, D and E) that are distributed widely across Mainland and Island Southeast Asia,59, 60 and are thought to reflect multiple population movements from mainland Asia.6 Other maternal lineages connect Timor with the Philippines and Taiwan (F1a4, E1a1a, M7c3c and Y2). Haplogroup F1a4 is particularly noteworthy: it accounts for almost 20% of some populations and shows an almost exclusive connection with the Philippines.6, 41 Curiously, F1a4 has a higher diversity in Indonesian populations compared with the Philippines (Supplementary Figure 4), and may have greater antiquity in the southern part of this range rather than being part of any dispersal from the north (Supplementary Table 10). This north–south connection is also observed in Y-chromosome lineages (for example, P-P295, O-M110 and O-P201), and is consistent with linguistic affinity between Timor and the northern island chain, including Sulawesi and the Maluku Islands, that is perhaps explained by an eastern route of Austronesian language dispersal from the Philippines.52 Furthermore, a network of Y-chromosome haplogroup O-M110 lineages (Supplementary Figure 5) shows that Timor shares an ancestral haplotype with indigenous Taiwanese. Descendant lineages exhibit a star-like pattern indicative of population growth and/or geographical expansion. Although the network is largely uninformative about the direction of migration, a recent admixture study using genome-wide data infers that gene flow from Taiwan to Island Southeast Asia best explains admixture patterns in Austronesian-speaking populations.61

One curious disconnect is the predominance of Asian genetic lineages and languages in Timor, coupled with notable Papuan cultural traits. The rice cultivation that underpins western Indonesian society is largely absent, perhaps because of climatic conditions that make this region unsuitable for sustained rice agriculture.51, 62 Such apparent inconsistencies may also partly reflect the passage of time. Today, languages and genetics are mostly unlinked across large parts of the Austronesian contact zone. Although associations remain in Sumba,26 they are not apparent in Timor (Supplementary Table 7 and Figure 4). Similarly, no association between language and genetics is observed in the neighboring Maluku islands, where, as in Timor, there is still a clear linguistic contact zone between Austronesian and Papuan language speakers.63 Lansing et al.51 propose a process of extensive Asian admixture, emphasizing ongoing heterogeneous language and culture replacement, in which speakers of the two language families continue to influence each other.64 The history of Timor does not seem to comprise merely an ancestral Melanesian substratum with a single expansion of Asian Austronesian speakers, but is instead compounded by the bidirectional ebb and flow of later populations from western Indonesia and New Guinea and, more recently, from Arab traders and colonists venturing from much further afield. Since Indonesian independence, movements linked to economic and political concerns have also contributed to population dynamics across the Indonesian archipelago,65, 66 although our sampling scheme was designed to exclude such recent migrants.

The genetic patterns are also affected in other ways. Mitochondrial DNA and Y-chromosome data show that women and men experienced different histories and were subjected to different social pressures. In general, maternal loci are dominated by lineages with immigrant Asian origins, whereas paternal loci are dominated by lineages with local Melanesian origins. This female/male division is not simply a result of genetic drift in the haploid genetic loci, but also stands out strongly in biparental and sex-linked nuclear markers. The female dominance of Asian lineages is clear in the X chromosome compared with the autosomes (Supplementary Table 9 and Figure 4), suggesting that female migrants played a leading role during the period of Asian immigration into eastern Indonesia, including Timor. This finding provides support for an Austronesian ‘house society’ model51 in which the Austronesian expansion led to the dispersal of matrilocal societies with small numbers of neighboring non-Austronesian males marrying into Austronesian matrilocal, matrilineal houses.51 These admixture rates seem to capture not only the initial contact event during the mid-Holocene, but also ongoing interactions over the subsequent 4000 years.

Sex-specific differences in social practices continued to affect the genetic diversity of the Timorese. Recent broad-scale regional studies of both mtDNA and Y chromosomes in Indonesian populations showed that women have moved widely between communities, whereas men historically remained at home.6, 14 This pattern is typical of patrilocal societies, but ancestral Austronesian societies are thought to have been characterized by matrilocal systems7, 8, 9 that still dominate many communities in West Timor today (JSL, unpublished survey data).10 Consistent with this social relict, the genetic profile of the Timorese shows that men moved widely between communities, whereas women stayed local (Table 3). Similarly, Lansing et al.51 showed that the effective population size of Timorese women is reduced compared with that of the island’s men. A lower effective population size for women than for men is also consistent with male-biased migration in a matrilocal society.

The persistence of these Austronesian social behaviors has shaped recent population structure across the island. This is seen most clearly in the Malaka Tengah district, the site of the historical Wehali kingdom and ritual center, and still a stronghold of matrilineal, matrilocal communities.67 Wehali is considered ‘female land’, a matrilineal area where all the land, houses and property belong to women. Wehali’s politico-ideological structures extended beyond the limited area of the princedom, eventually comprising 17 domains in both the west and east of the island. The spread of Wehali’s power was primarily achieved through marriage alliances.10 Although surrounding Tetun and Atoni populations transferred wealth to Wehali, indigenous concepts of power gave Wehali a notable role as the ‘husband-giver’ to more peripheral areas in Timor. Consistent with this tradition, putative Austronesian mtDNA lineages like B4a1a1a (that is, the Polynesian motif) reach their highest frequencies in the inner Wehali villages of Kateri (36%) and Laran (18%). This process perhaps also explains the higher mobility of male lineages across West Timor compared with female lineages.

This study thus illustrates the influential roles of isolation, contact and social behavior in producing the genetic profile of modern West Timorese. Isolation led to the persistence of genetic lineages from the very first settlers in Timor; contact created genomes characterized by rampant Asian and Melanesian admixture; and social behavior, particularly a tenacious holding to the Austronesian practice of matrilocality while surrounding populations transitioned to patrilocality, created patterns of male and female dispersal that differ from neighboring regions. Far from being an island outlier, Timor stands firmly at the heart of the Austronesian world.