White-handed gibbons discriminate context-specific song compositions

Julie Andrieu; Samuel G. Penny; Hélène Bouchet; Suchinda Malaivijitnond; Ulrich H. Reichard; Klaus Zuberbühler

doi:10.7717/peerj.9477

White-handed gibbons discriminate context-specific song compositions

Julie Andrieu ¹, Samuel G. Penny^1,2, Hélène Bouchet³, Suchinda Malaivijitnond^4,5, Ulrich H. Reichard⁶, Klaus Zuberbühler^1,3

1Department of Comparative Cognition, University of Neuchâtel, Neuchâtel, NE, Switzerland

2School of Pharmacy and Biomolecular Sciences, University of Brighton, Brighton, UK

3School of Psychology and Neuroscience, University of St. Andrews, St. Andrews, UK

4National Primate Research Center of Thailand, Chulalongkorn University, Saraburi, Thailand

5Department of Biology, Faculty of Science, Chulalongkorn University, Bangkok, Thailand

6Department of Anthropology and Centre for Ecology, Southern Illinois University at Carbondale, Carbondale, IL, USA

DOI: 10.7717/peerj.9477

Published: 2020-08-03
Accepted: 2020-06-12
Received: 2019-12-05

Academic Editor: Lydia Hopper

Subject Areas: Animal Behavior, Evolutionary Studies, Zoology
Keywords: Duet song, Predator song, Playback experiment, Syntax, White-handed gibbons

Copyright: © 2020 Andrieu et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.

Cite this article: Andrieu J, Penny SG, Bouchet H, Malaivijitnond S, Reichard UH, Zuberbühler K. 2020. White-handed gibbons discriminate context-specific song compositions. PeerJ 8:e9477 https://doi.org/10.7717/peerj.9477

The authors have chosen to make the review history of this article public.

Abstract

White-handed gibbons produce loud and acoustically complex songs when interacting with their neighbours or when encountering predators. In both contexts, songs are assembled from a small number of units although their composition differs in context-specific ways. Here, we investigated whether wild gibbons could infer the ‘meaning’ when hearing exemplars recorded in both contexts (i.e. ‘duet songs’ vs. ‘predator songs’). We carried out a playback experiment by which we simulated the presence of a neighbouring group producing either its duet or a predator song in order to compare subjects’ vocal and locomotor responses. When hearing a recording of a duet song, subjects reliably responded with their own duet song, which sometimes elicited further duet songs in adjacent groups. When hearing a recording of a predator song, however, subjects typically remained silent, apart from one of six groups which replied with its own predator song. Moreover, in two of six trials, playbacks of predator songs elicited predator song replies in non-adjacent groups. Finally, all groups showed strong anti-predator behaviour to predator songs but never to duet songs. We concluded that white-handed gibbons discriminated between the two song types and were able to infer meaning from them. We discuss the implications of these findings in light of the current debate on the evolutionary origins of syntax.

Introduction

Primate vocal communication is characterised by species-specific repertoires of acoustically distinct vocalisations, some of which are given in response to specific events. The classic example is the vervet monkey (Chlorocebus pygerythrus) alarm call system, with acoustically distinct call types given to different predator types (Seyfarth, Cheney & Marler, 1980a, 1980b). However, beyond the fact that primate calls can convey relatively distinct meanings, additional complexities have recently come to light, with corresponding implications for evolutionary theories of communication.

First, it is often difficult to characterise a particular call type as an acoustically discrete structural entity. Instead, following in-depth investigation seemingly ‘discrete’ calls often display considerable amounts of acoustic variation, which may be meaningful to recipients (Keenan, Lemasson & Zuberbühler, 2013). For example, the acoustic structure of chimpanzee (Pan troglodytes) rough grunts varies depending on the perceived quality of the food resource (Slocombe & Zuberbühler, 2005), whereas Barbary macaque (Macaca sylvanus) barks differ in call duration and mean frequency range according to specific external disturbances (Fischer, Hammerschmidt & Todt, 1995, 1998; Fischer & Hammerschmidt, 2006).

Second, context can play an important role in how animals interpret each other’s calls. Evidence is in terms of how ongoing context modifies how animals react to a specific call type (Zuberbühler, 2000a, 2000b; Arnold & Zuberbühler, 2013; Seyfarth & Cheney, 2018), a mechanism already described by Smith (1977). Empirically, the way intention and external factors affect how primates infer meaning from signals is relatively poorly explored (Grice, 1969; Carnap, 1988; Scott-Phillips, 2010).

Third, call sequences can serve as powerful semantic vehicles beyond the contribution of individual calls (Zuberbühler, 2019a). For instance, the number of roaring units per sequence in guereza colobus monkey (Colobus guereza) alarm roars depends on the nature of the danger (Schel, Tranquilli & Zuberbühler, 2009). Another example is Campbell’s monkeys (Cercopithecus campbelli) alarm calling, with variation in call rates (Lemasson et al., 2010), call combinations (Ouattara et al., 2009) and call permutations (Ouattara, Lemasson & Zuberbühler, 2009a, 2009b) depending on external events. Similar phenomena have been observed in putty-nosed monkeys (Cercopithecus nictitans martini) (Arnold & Zuberbühler, 2006a, 2006b). Although these findings show remarkable similarities to some aspects of human syntax in terms of combinatorial and permutational properties, the implications for evolutionary theories of language are far from clear, suggesting that more empirical work is needed (Bolhuis et al., 2018; Townsend et al., 2018; Zuberbühler, 2019b).

A relevant primate example of complex combinatorial structure is gibbon song. In most species, mated pairs produce morning duets that appear to serve territorial and mate defence functions (Haimoff, 1984; Raemaekers & Raemaekers, 1985a; Geissmann, 2002; Terleph, Malaivijitnond & Reichard, 2015, 2016; J. Andrieu, 2012–2014, unpublished data a). Social learning seems to play some role in the acquisition of song (Koda et al., 2013) and production is subject to social influence (e.g. changes in mating partners usually result in audible differences in song coordination) (Geissmann, 1999; Terleph, Malaivijitnond & Reichard, 2017). Like most other primate calls, gibbon song contains information about caller identity (Oyakawa, Koda & Sugiura, 2007; Terleph, Malaivijitnond & Reichard, 2015; Clink et al., 2017) and the caller’s physical condition (Barelli et al., 2013; Terleph, Malaivijitnond & Reichard, 2016). Gibbon songs are audible over long distances, up to 1 km, much beyond an average gibbon home range (Mitani, 1985), suggesting that they have evolved to communicate to outgroup individuals (Raemaekers & Raemaekers, 1985a, 1985b; Mitani, 1985; Terleph, Malaivijitnond & Reichard, 2015, 2016).

Interestingly, in white-handed gibbons (Hylobates lar), there is also evidence for context-specific song types: duet songs are produced by the mated pair as part of their daily routine while predator songs are given when facing a predator, such as a clouded leopard or python (Clarke, Reichard & Zuberbühler, 2006; J. Andrieu, 2012–2014, unpublished data b). Both song types are identical in terms of their note repertoires, although there are consistent differences in the prevalence of certain notes and in how notes are combined into songs (Clarke, Reichard & Zuberbühler, 2006, 2015). Predator songs are sung for longer than duet songs and produced by most group members. They function to deter predators, recruit group members, and alert relatives in adjacent territories (Zuberbühler, Jenny & Bshary, 1999; Clarke, Reichard & Zuberbühler, 2006; Matsudaira et al., 2018). In contrast, duet songs function in mate and territorial defence (Marshall & Marshall, 1976; Raemaekers & Raemaekers, 1985a). Duet songs may also function as indicators of the strength of the social bond of the mated pair, a kind of relationship marker, evidenced by the fact that newly formed pairs appear to go through a lengthy phase of adjusting their relative vocal contributions towards a well-adjusted duet song (Haimoff, 1984; Geissmann & Orgeldinger, 2000).

Here, we investigated whether gibbons could discriminate the two functionally and structurally distinct song types (i.e. duet song and predator song), by broadcasting natural singing events of a neighbouring group simulated from a concealed speaker. We predicted that if gibbons discriminated between predator and duet songs then they should respond with the matching song types and with behaviour adequate to the situation. Specifically, in response to predator songs we predicted increased vigilance, increased defaecation rates and any other type of anti-predator behaviour already reported in the literature (Boissy, 1995; Clarke, Reichard & Zuberbühler, 2012). In response to duet songs, we predicted no changes to antipredator behaviour but duet song responses (Raemaekers & Raemaekers, 1985a, 1985b).

Materials and Methods

Study site and subjects

This study was conducted in the Mo Singto-Klong E-Tau area of Khao Yai National Park, Thailand (101°22′E, 14°26′N), 130 km North-East of Bangkok. Data were collected from December 2012 to August 2014. Thirteen fully habituated groups of white-handed gibbons were monitored, each comprising a primary male, his mated female with her offspring and (in 5 cases) a secondary male, totalling N = 53 individuals at the time of the study. Due to a number of constraints, it was only possible to conduct playback experiments with six of the 13 groups (Table 1).

Table 1:

Composition of study groups at the Mo Singto-Klong E-Tau research area (August 2014).

Group	N individuals	Group composition	Tested	Song provider for
A*	3	2AM, 1AF	–	group H
B*	5	2AM, 1AF, 1JF, 1I?	yes	–
BD	3	1AM, 1AF, 1I?	–	–
C	3	1AM, 1AF, 1I?	–	–
E	3	1AM, 1AF, 1JM	–	–
H^†	4	1AM, 1AF, 1JF, 1I?	yes	–
M	5	1AM, 1AF, 1SAF, 1JM, 1I?	yes	group R
N*	6	2AM, 1AF, 1SAM, 1JF, 1I?	yes	group M
NOS*	5	2AM, 1AF, 1J?, 1I?	–	–
R	4	1AM, 1AF, 1AF, 1I?	yes	–
S	3	1AM, 1AF, 1JM	–	group W
T	5	1AM, 1AF, 1SAM, 1JM, 1I?	–	group B
W*	4	2AM, 1AF, 1I?	yes	group N

DOI: 10.7717/peerj.9477/table-1

Notes:

* Multi-male group; M, male; F, female, ?, sex unknown; A, Adult (age > 8 years); SA, sub-adult (5–8 years); J, juvenile (2–5 years); I, infant (<2 years). yes, tested group; -, group not tested.

† No data on latency and duration of first look to speaker due to technical problems (duet playback: female filmed erroneously; predator playback: male moved out of sight).

Terminology

Following Raemaekers, Raemaekers & Haimoff’s (1984) terminology we distinguished three sequence types within each song: the introductory sequence (series of soft ‘hoo’ notes, followed by combinations of other note types, such as ‘oo’, ‘wa’, ‘leaning wa’ and ‘wa-oo’); the great call sequence (idiosyncratic female call sequence, usually followed by her male’s ‘coda’ response). In duet songs, the first great call sequence usually appears within the first 2 min. Great call sequences can be repeated multiple times (about once every 1–2 min) (Raemaekers, Raemaekers & Haimoff, 1984; Clarke, Reichard & Zuberbühler, 2006; Terleph, Malaivijitnond & Reichard, 2016), in which case they are separated by an interlude sequence (any notes given after a great call sequence, including the final one) (Ellefson, 1968; Raemaekers, Raemaekers & Haimoff, 1984) (Figs. 1 and 2A).

Figure 1: Song note repertoire of white-handed gibbons (Raemaekers, Raemaekers & Haimoff, 1984; Clarke, Reichard & Zuberbühler, 2006).
Note types (A) ‘hoo’; (B) ‘oo’; (C) ‘wa’; (D) ‘leaning wa’; (E) ‘wa-oo’; (F) ‘sharp wow’; (G) ‘other’; (H) ‘other’. Songs were digitised using Cool Edit Pro 2.1; spectrograms were drawn using 21.6 Hz filter bandwidth, 2.69 Hz frequency resolution, 33.3 ms time grid resolution and a Hanning window function.

Download full-size image

DOI: 10.7717/peerj.9477/fig-1

Figure 2: Schematic representation of the structural differences between (A) duet and (B) predator songs (Clarke, Reichard & Zuberbühler, 2006).

Download full-size image

DOI: 10.7717/peerj.9477/fig-2

The same three sequence types can also be found in predator songs although, overall, they differ in length and are produced with the contribution of most group members. When comparing predator songs with duet songs, for the introductory sequence the initial ‘hoo’ notes series are longer and contain more ‘hoo’ notes, followed by fewer ‘leaning wa’ notes and more ‘hoo’ notes (Clarke, Reichard & Zuberbühler, 2006). The great call sequence is also different, mainly because males respond more rapidly with their answering coda (Clarke, Reichard & Zuberbühler, 2006). Regarding the interlude sequence, predator songs contain more ‘sharp wow’ notes, especially towards the end of the song, compared with duet songs (Clarke, Reichard & Zuberbühler, 2006) (Figs. 1 and 2B).

Stimulus collection

Duet songs were recorded on an all-occurrence basis during all-day follows of study groups (Table 1) until at least one song suitable as playback stimulus was recorded, that is, a high-quality song with minimum background noise, singing individuals at a maximum distance of 30 m from the recording device. Predator songs were induced by presenting a realistic, life-size clouded leopard (Neofelis nebulosa) model to each group following an established protocol (Fig. 3, Clarke, Reichard & Zuberbühler, 2006). Once a group was located and before positioning the model, we ensured that on the same day the group had (a) already produced at least one duet song more than one hour earlier (to verify a basic motivation to sing), (b) not yet produced a predator song (nor its direct neighbours), (c) not had a natural predator encounter since the beginning of the day-follow, nor heard other species’ alarm calls within the last hour and (d) not had an intergroup encounter with a neighbouring group. If these conditions were met, we positioned the predator model on the group’s anticipated travel direction outside their visual range. We then continuously recorded their vocal behaviour and scored the presence of any non-vocal anti-predator behaviour on an all-occurrence basis (branch dropping, defaecation, vigilance). Duet and predator songs were recorded using directional microphones (Sennheiser MKH 815T & Sennheiser ME66) with windshields connected to a digital stereo recorder (Marantz PMD660; settings 44.1 kHz, 16 bits) from December 2012 to August 2014.

(A) Clouded leopard model used to elicit predator songs (Photo credit: Julie Andrieu); (B) real clouded leopard, Neofelis nebulosa (Image credit: goodfreephotos.com at https://www.goodfreephotos.com/animals/mammals/clouded-leopard.jpg.php). — Figure 3: (A) Clouded leopard model used to elicit predator songs (Photo credit: Julie Andrieu); (B) real clouded leopard, *Neofelis nebulosa* (Image credit: goodfreephotos.com at https://www.goodfreephotos.com/animals/mammals/clouded-leopard.jpg.php).

Download full-size image

DOI: 10.7717/peerj.9477/fig-3

Experimental protocol

Each group was tested once with each stimulus type, which resulted in a total of 12 trials (N = 6 duet songs; N = 6 predator songs, Table S1; minimum interval between trials: 1 week), all broadcasted before 12:00 local time (to match timing of natural duet song production). Prior to playback experiments, we measured the peak intensity of female great call climaxes in spontaneous duet songs (i.e. loudest notes, Terleph, Malaivijitnond & Reichard, 2016) at an estimated recording distance of 10–20 m using a REED ST-805 (REEDinstruments, Wilmington, NC, USA) sound pressure metre (frequency range 31.5 Hz–8 kHz, measuring level range 30–130 dB, 0.1 dB resolution, accuracy ± 1.5 dB). We measured three great call climaxes per female from the six song-providing groups (Table 1), which resulted in a mean sound pressure level of 78.2 ± 8.0 dB (n = 18; dB SPL, A-weighting sound pressure levels for general sound level measurements, and 125 ms fast time weighting). We then broadcasted songs such that subjects always heard recordings from one of their direct neighbours (Table S2), with comparable natural audibility (tested at each playback location with a decibel metre, matching climaxes SPL measurements, with real time adjustments in coordination with both experimenters depending on weather conditions on the testing day) and from spatially realistic locations 15–20 m within the canopy from where the song providing group had been seen before within the respective territories.

We standardised the distance between the speaker and subjects to about 150 m (mean ± SD: 149 ± 17 m), with playback conditions randomly counterbalanced (Table S2). Stimuli were broadcasted when the same conditions as for predator model presentation had been met, using a Climate CL60-T2 speaker connected to a Kenwood KAC-5203 amplifier, in conjunction with a Roland R-05 digital player.

Playback trials were carried out from spatially realistic locations, that is, from the home range of the song-providing group towards the home range of the target group. In doing so, we took a number of precautions such that the song-providing group could not overhear its own song. Before each trial, we ensured that the song-providing group was not in the vicinity of the speaker (>100 m radius). We then monitored the area for a period of 1 h to further ensure that the song-providing group was not nearby. For each trial, the speaker was positioned in the overlapping zone between the song-providing and target group, such that it was facing away from the home range centre of the song-providing group towards the target group.

Data collection

Due to the difficult visual conditions in the forest, it was impossible to continuously video-tape the entire duration of trials nor to film all group members simultaneously. We therefore decided to restrict observations to the primary male of each group. Males are easily identifiable by their body hair colouration, facial features and genitals. Primary males were video recorded as long as possible (i.e. until they moved out of sight) using a Panasonic SDR-S26 Camcorder. Videos were coded using ELAN software (ELAN (V5.2) Nijmegen: Max Planck Institute for Psycholinguistics). Because the speaker location was not visible on the video clips (outside camera range) it was necessary for the experimenter to comment on the male’s gazing direction during filming, which made blind coding redundant. All video recordings are available on figshare (https://doi.org/10.6084/m9.figshare.12363050.v1).

Regarding long-term effects, we collected 5-min scan samples of the primary male’s behavioural activities, gaze directions, body positions, elevations (m) and proximities to their female partner (m) during 1 h after each trial (i.e. 13 scans per trial; Table 2). Furthermore, we scored all defaecation/urination and branch dropping events over a two-hour period using all occurrence sampling.

Table 2:

Behavioural response variables extracted for the primary males in both playback conditions.

	Definition
Behaviour
Feeding	Handling or consuming food items
Resting	Prolonged stationary position, with or without eyes closed
Grooming	Auto- or allo-grooming (giver and receiver identity were collected)
Social	Mating, play, aggressive, or parental behaviour
Moving	Travel within or between trees (at least 2 metres)
Vigilance	Scanning the environment, head rotating by at least 45° (Koenig, 1998)
Other	Behaviour not classified into any of the above categories
Body position
Hanging	Suspended in the air, grabbing a branch or a tree part with at least one arm
Sitting/Lying down	Sitting on a branch or on the ground / Resting in horizontal position
Gaze direction (staring at a specific location/direction/animal/person for ≥ 3s)
Speaker	Staring in the direction of the speaker
Ground	Looking towards or actively scanning the ground
Canopy	Looking around, or towards a specific location in the trees at the same elevation as the animal location
Sky	Looking up at the sky
Group member	Looking at a group member (the identity of the receiver was collected)
Observer	Looking at the observer
Elsewhere	Looking in a direction that cannot be classified into any of the above categories
Nowhere	Resting with eyes closed
Other measurements
Elevation (m)	Height of the animal in relation to the ground
Proximity (m)	Distance between the two focal individuals (paired male and female)
Defaecation/Urination	Exuding faeces and/or urine
Dropping branch	Individuals shaking branch(es) so as it ended up falling on the ground
Latency of first look towards the speaker (s)	Time elapsed between stimulus onset and first look towards the speaker
Duration of first look towards the speaker (s)	Duration of first gaze directed towards the speaker location

DOI: 10.7717/peerj.9477/table-2

Vocal responses

We digitised, analysed and compared songs given in response to both playback conditions, using Raven Pro 64 1.4 (Cornell laboratory of Ornithology, Ithaca, NY, USA). For the introductory sequence, we determined the duration of the initial ‘hoo’ notes series (s) and the corresponding number of ‘hoo’ notes, the type of the first ten notes following the ‘hoo’ series, and the duration of the introductory sequence (i.e. latency to the first female great call). We measured the interval between the female great call and the male coda reply (s), the total song duration (s), and determined whether a neighbouring group also produced a song and its type. Finally, we identified the presence of ‘sharp wow’ notes and we measured the latency to the first ‘sharp wow’ note (i.e. time elapsed in seconds between the onset of the song bout and the first ‘sharp wow’ emitted).

This study was approved by the School of Psychology Ethics Committee of St. Andrews University. Approval was given on the understanding that the ASAB guidelines for the Treatment of Animals in Behavioural research and Teaching are adhered to (n°16112011). The research permit was delivered by the National Research Council of Thailand (NRCT, n°0002/5841).

Data Analysis

Behavioural responses

We compared behavioural responses within subjects and across playback conditions; the primary male’s latency and duration of first looks towards the speaker, the occurrence of defecations/urinations and branch droppings, the average distance to their female mate and the canopy heights (medians across all scan samples; Table 2). For categorical data (i.e. activity, body position and gaze), we summed up and calculated for each individual the proportion of each behaviour within the categories (see Table 2) and compared the behavioural pattern across playback conditions.

Vocal responses

We compared the number of introductory ‘hoo’ notes and the duration of the introductory ‘hoo’ notes series, the number of other relevant ‘hoo’ and ‘leaning wa’ notes within the first ten notes following the introductory ‘hoo’ series, and the introductory sequence duration, within groups and across conditions. For the great call sequence, we compared male response delays to the female great calls. Finally, we compared the total song duration between playback conditions, identified the presence of ‘sharp wow’ notes, and measured the latency to first ‘sharp wow’ note produced.

Statistical procedures

Due to small sample sizes we opted for non-parametric statistics. Wilcoxon matched-pair signed-rank tests were performed for behavioural data analysis, with exact significance levels reported (Siegel & Castellan, 1988; Mundry & Fischer, 1998). For vocal data, we used Kruskal–Wallis rank sum tests with a Benjamini & Hochberg procedure to correct for multiple testing (Benjamini & Hochberg, 1995). Post-hoc tests were either Wilcoxon rank sum tests with Benjamini & Hochberg p-value adjustments or Dunn (1964)’s tests with Benjamini & Hochberg p-value adjustments for eventual ties. To compare the type of the first 10 notes produced across contexts we used a Pearson’s Chi-squared test followed by Chi-squared post-hoc tests with Benjamini & Hochberg p-value adjustments. Statistical analyses were performed using R V3.5.1 (R core Team, 2018) with the significance level set at 0.05.

Results

Vocal behaviour

Response rates

In the duet song condition, 5 of 6 groups responded with duet counter-singing to playbacks of duet songs (Table S3). In addition, eight neighbouring groups that shared their borders with the song-providing group or the tested group also produced duet songs during 3 of 6 trials (N = 3, N = 1, N = 4 neighbouring groups, respectively, see Table S4), while none of them produced a predator song.

In the predator song condition, 1 of 6 groups responded with a predator song to playbacks of a predator song (within the first 10 min, see Table S3). The response song contained a highly delayed first great call and many ‘sharp wow’ notes, highly typical for a predator song. In addition, two distant (non-neighbouring) groups also produced predator songs during 2 of 6 trials, again characterised by a delayed first great call and ‘sharp wow’ notes (Table S4). None of the groups ever produced a duet song.

Song structure

Playbacks of duet songs reliably triggered synchronised singing by the mated pair of the target groups. To confirm that these vocal responses (N = 5) qualified as regular duet songs, we compared them to both spontaneously produced duet songs and experimentally induced predator songs (using a clouded leopard model; Table 3) by the same groups.

Table 3:

Comparison of spontaneous duet songs (N = 5), predator songs (N = 5), and songs given in response to playback of duet songs (N = 5) by the same five groups (Kruskal–Wallis rank sum test).

Variables**	Spontaneous duet song	Predator song	Response song	df	χ²	P value*
Duration introductory ‘hoo’ series (s)	8.0 ± 3.1	23.4 ± 6.7	4.7 ± 2.7	2	10.5	<0.05
N introductory ‘hoo’ notes	11.0 ± 4.5	48.8 ± 14.4	7.4 ± 2.7	2	10.2	<0.05
Song duration (s)	789.4 ± 294.8	2,396.4 ± 775.8	1,006.8 ± 122.3	2	10.2	<0.05
Latency to 1st great call (s)	101.3 ± 33.5	816.4 ± 368.0	99.0 ± 41.1	2	9.5	<0.05
Latency to 1st ‘sharp wow’ (s) ^#	78.1 ± 31.1	370.5 ± 183.2	90.8 ± 35.9	2	9.0	<0.05
N ‘sharp wows’	9.2 ± 8.0	362.2 ± 233.9	5.6 ± 6.0	2	9.8	<0.05

DOI: 10.7717/peerj.9477/table-3

Notes:

# Kruskal–Wallis rank sum test for N = 14 songs (W did not produce any ‘sharp wow’ notes in spontaneous duet; N_duet = 4, N_predator = 5, N_response = 5).

* P < 0.05 corrected.

** Means ± SD.

First, there were significant differences across all six variables tested (Table 3), while subsequent pairwise comparisons revealed significant differences between predator songs and the two other song types, but not between spontaneous duet songs and response songs elicited by playbacks (Table S5 for detailed pairwise comparisons).

Second, male latencies to reply to their female’s great calls also differed significantly between song types (χ²(2) = 33.90, P < 0.001, N = 82, Kruskal–Wallis rank sum test). Here as well, post-hoc analyses revealed that males gave earlier replies to female great calls in the predatory context (mean delay: −1.7 ± 1.6 s, n = 20) than in spontaneous duets (0.6 ± 0.7 s, n = 28) or playback duet responses (0.5 ± 0.6 s, n = 34) (P < 0.001 in both cases), with no difference between spontaneous and playback duet responses (P = 0.530; Dunn’s post-hoc test for multiple comparisons, with Benjamini & Hochberg correction).

Finally, we compared the first 10 notes produced by males and females immediately following the introductory ‘hoo’ note series (mean duration: 10.50 ± 2.8 s, n = 30, accounting for a total of 100 notes per song type). Significant differences were found between song types regarding their early note composition in ‘hoo’ and ‘leaning wa’, but also in ‘wa-oo’ notes (χ²(4) = 96.86, P < 0.001, Pearson’s Chi-squared test). Predator songs contained more ‘hoo’ notes and fewer ‘leaning wa’ notes than duet songs, with no differences between spontaneous and playback duet responses. However, ‘wa-oo’ notes were more common in playback duet responses than spontaneous duet songs, and again in spontaneous duet songs than predator songs (Table S6 for detailed pairwise comparisons).

Non-vocal behaviour

We were able to record the immediate behavioural responses of primary males in 5 of 6 groups (Table 1). All males responded by turning their heads towards the speaker, albeit with no latency differences across playback conditions (median duet: 1.1 ± 1.8 s, predator 2.8 ± 3.3 s, V = 2, P = 0.188, N_duet = 5, N_Predator = 5, Wilcoxon matched-pair signed-rank test, Fig. 4A). Additionally, we found a trend (although not significant) towards longer gaze duration in the predator than the duet song condition (median duet: 2.0 ± 1.4 s, predator: 12.6 ± 6.1 s, V= 0, P = 0.063, N_duet = 5, N_Predator = 5, Wilcoxon matched-pair signed-rank test, Fig. 4B).

Figure 4: (A) Latency and (B) duration of the male gibbons’ first gaze towards the speaker broadcasting a simulated neighbouring group’s song (duet vs. predator song condition).

Download full-size image

DOI: 10.7717/peerj.9477/fig-4

For long-term behavioural responses, we collected data on all six primary males and found no differences across playback conditions in grooming, resting and displacement activities but a significant difference in feeding, with individuals less likely to engage in feeding activities after predator than duet song playbacks (Table 4). Regarding anti-predator behaviours, we found no differences in canopy use, distance between mates, and number of branch droppings across conditions. However, males were more vigilant and defaecated significantly more often following predator compared with duet song playbacks (Table 4).

Table 4:

Comparison of male long-term behavioural responses between playback treatments (Wilcoxon matched-pair signed rank tests, N = 12 playback trials, with a total of n = 156 scan sampling observations, i.e. 13 scans per individual for 1 h).

Variables**		Duet song playback	Predator song playback	V	P value
Behavioural activity	Grooming	1.0 ± 1.6	1.7 ± 2.0	5.5	0.688
	Moving	2.0 ± 1.3	1.7 ± 1.5	13.5	0.594
	Resting	0.8 ± 0.8	0 ± 0	10	0.125
	Feeding	4.0 ± 1.4	0.3 ± 0.8	21	<0.05*
	Vigilance	2.2 ± 1.3	9.0 ± 2.0	0	<0.05*
Body position	Hanging	7.2 ± 2.5	5.0 ± 1.7	3.5	0.188
Body position	Sitting/lying	5.8 ± 2.5	8.0 ± 1.7	17.5	0.188
Gaze direction	Speaker	3.0 ± 1.7	4.5 ± 1.1	2	0.125
	Canopy	8.7 ± 1.2	3.2 ± 2.5	21	<0.05*
	Ground	0 ± 0	5.0 ± 1.3	0	<0.05*
	Group member	1.3 ± 1.5	0.3 ± 0.8	8.5	0.375
Elevation (m)		17.6 ± 6.2	25.1 ± 7.1	3	0.156
Proximity to mate (m)		8.9 ± 7.3	10.3 ± 7.7	7	0.563
Dropping branch^†		0 ± 0	0.5 ± 0.8	0	0.5
Defaecation/Urination^†		0.3 ± 0.5	3.2 ± 1.2	0	<0.05*

DOI: 10.7717/peerj.9477/table-4

Notes:

† All occurrence behaviours recorded over 2 h post trial.

* P < 0.05.

** Means ± SD.

Following playback of a predator song, males increased their vigilance activity (Fig. 5), directed more gazes towards the ground (Fig. 6A) and less towards the upper canopy (Fig. 6B) compared with duet treatment (Table 4).

Figure 5: Proportion of vigilance behaviours displayed by males in each playback condition (N = 6 males).

Download full-size image

DOI: 10.7717/peerj.9477/fig-5

Figure 6: Variation of (A) ground, (B) canopy, (C) speaker and (D) group member gazes between playback treatments (N = 6 males).

Download full-size image

DOI: 10.7717/peerj.9477/fig-6

Discussion

Summary

White-handed gibbons produce two structurally distinct songs in context-specific ways; duet songs (in non-predatory contexts) and predator songs (to clouded leopards and other predators). The two song types differ in the overall duration, frequency and distribution of specific notes (‘hoo’, ‘leaning wa’, ‘sharp wow’) and in the location of the female great calls and male replies within each song. In this study, we investigated whether individuals discriminated between these two structurally different song types and whether they could infer meaning from them. We found several lines of evidence in favour of such an ability. First, playbacks of duet songs reliably elicited natural duet song replies (identifiable by several acoustic parameters) in neighbouring groups and in more distant groups, similar to how natural duet song spread throughout the forest (Raemaekers & Raemaekers, 1985b; J. Andrieu, 2012–2014, unpublished data a). Second, playbacks of predator songs never triggered duet songs in any group, but occasionally predator song replies (identifiable by several acoustic parameters) in one of six neighbouring groups and two non-neighbouring distant groups. Finally, subjects consistently showed anti-predator behaviours (vigilance, ground scanning, defaecation) and a tendency for longer first look towards the speaker after predator compared to duet song playbacks. Based on these data, we concluded that white-handed gibbon song conveys key information about the world, which is made accessible to recipients by a number of structural regularities. This conclusion fits with previous research by Clarke, Reichard & Zuberbühler (2006) who first demonstrated the presence of structural differences in white-handed gibbon songs.

Singing as anti-predator behaviour

Similar to other large cats, clouded leopards are opportunistic predators that attack both terrestrial and arboreal species, including primates (Rabinowitz, Andau & Chai, 1987; Grassman, 2001). Hence, a somewhat surprising finding was that subjects remained mostly silent to others’ predator songs, despite showing strong anti-predator behaviour (males and females appeared to behave in the same way, i.e. ground scanning, vigilance, defaecation). The lack of vocal response may be part of a cryptic strategy to conceal the group’s location when a dangerous stalking predator is presumed in the vicinity (Aguilar de Soto et al., 2012; Grow, 2019). However, this does not explain why 1 of 6 target groups and two distant groups still responded with predator songs to the playbacks. It is possible that gibbons pursue a flexible vocal strategy, altering between ‘crypsis’ and ‘perception advertisement’ depending on perceived personal risk, the ability to benefit neighbouring relatives, and the likely dissuasive effect on the predator itself (Zuberbühler, Jenny & Bshary, 1999; Clarke, Reichard & Zuberbühler, 2006).

Equally relevant is the fact that the three predator song responses were shorter than natural predator songs (Tables S3 and S4). We can think of several explanations for this finding. First, as mentioned already, it is possible that groups tried to minimise their own exposure to the predator if they decided to respond to another group’s predator song. Second, differences in predator song duration may function as indicators for perceived urgency, with longer songs indicating more serious threats than shorter songs. We find this less likely to be an evolved function since listeners would have to wait for (and compare) considerable amounts of time periods before extracting the relevant information. Finally, differences in song duration may be linked to how callers perceive the predator (visually, linked to mobbing the predator vs. acoustically, linked to localising the predator). A Direct observation of a real encounter with a tiger is in line with this hypothesis (Uhde & Sommer, 2002). In this instance, group A uncommonly travelled backward towards the tiger’s location (spotted 50 m away) and sang for at least 1 h and a half, suggesting that singing primarily serves first and foremost as a predator deterrence device and second as a conspecific warning signal if the exact location of the predator is unknown and groups feel reasonably safe.

Singing as territorial behaviour

In related research (J. Andrieu, 2012–2014, unpublished data), we have shown that spatial proximity between two neighbouring groups tends to lead to duet song overlap, due to the fact that the second group refuses to delay singing until the first group has finished their duet song. This behaviour is attenuated by kinship, to the effect that related individuals are more likely to respect each other’s duets, even if produced at close distances. In the current study, all study groups started producing duet songs while the playback duet song was still being broadcast, suggesting that the manipulation was perceived as a territorial threat. Unfortunately, we could not statistically analyse the effect of genetic relatedness in this study because the sample size was too small (N = 6 groups).

Singing as compositional behaviour

Although our study has focussed on song comprehension, it has also generated a more detailed picture of the structural composition of white-handed gibbon songs. Clarke, Reichard & Zuberbühler (2006) already noted that the duet songs of gibbon groups that were not well habituated to human observers contained elements that were normally found in predator songs, notably ‘sharp wows’. In our study, all groups were fully habituated to human presence, yet some groups still produced ‘sharp wow’ notes in their duet song replies to playbacks of neighbouring duet songs, but also in 4 of 5 natural duet songs (Table 3), of which 3 were involved in duet counter-signing exchanges with previous duetting direct neighbours. Another structural subtlety concerned the use of ‘wa-oo’ notes. This note type was near absent in predator songs but common in the early parts of the duet songs, especially the ones given in response to duet song playbacks. We attribute these findings to the fact that our experimental design consisted of playbacks of song recordings at relatively close distances (about 150 m), which may have been perceived as a social threat by some groups, either territorial or risk of partner defection. Future work is required to test whether these notes are actively used to describe events in hierarchically structured ways (main: predatory threat y/n; subsidiary: social threat y/n), similar to how humans represent natural events as tree structures in both cognition and language (Zuberbühler, 2019b).

Conclusion

Gibbons play an interesting role in questions about the biological roots of language-related capacities in humans. Although part of the Hominoidae family, they maintain a relatively basal position in their phylogeny by diverging from the great apes some 16 million years ago (Carbone et al., 2014). Nevertheless, gibbons show interesting vocal behaviour by which a small repertoire of acoustically distinct notes are combined into higher-order structures, such as figures, phrases and sequences, assembled into different song types (Raemaekers, Raemaekers & Haimoff, 1984; Clarke, Reichard & Zuberbühler, 2006). These findings have some implications for the ongoing debate about syntax and phonology in animal communication (Bolhuis et al., 2018; Townsend et al., 2018).

In a previous study (Clarke, Reichard & Zuberbühler, 2006), structural differences between gibbon song types were explained as a case of animal syntax although this was based on a very broad definition of the term. An alternative, more restricted definition of syntax invokes semantics, notably that the units subjected to syntactic operations (e.g. the notes) are meaningful, for which there is currently no evidence in gibbon song.

Whatever definition is applied, gibbon song has several levels of complexity and future research should be directed at the acoustic variation in the different note types and their combinations. For example, in the current study we found that the production of ’wa-oo’ and ‘sharp wow’ notes might be linked with perceived social threat. So far, systematic analyses have been restricted to the early parts of the song (based on the assumption that predator information should be conveyed early on) with individual contributions not systematically studied. Traditional acoustic analysis may not suffice to make meaningful progress, suggesting that automated call extraction and categorisation techniques may offer more promise to explore the full combinatorial, hierarchical and compositional capacity of gibbon song (Kershenbaum, 2014; Kershenbaum et al., 2014, 2016; Kershenbaum & Garland, 2015; Fedurek, Zuberbühler & Dahl, 2016).

Supplemental Information

Overview of playback stimuli characteristics (N_{duet songs} = 6, N_{leopard songs} = 6).

^§ N_{duet songs} = 3. Groups A, W and T did not produce any ‘sharp wow’ note in their duet songs. ** means ± SD

DOI: 10.7717/peerj.9477/supp-1

Download

Playback experimental design.

DOI: 10.7717/peerj.9477/supp-2

Download

Overview of group vocal responses to playback treatments.

^#Negative values can emerge because all vocal responses to duet playbacks temporally overlapped the stimulus. ** means ± SD.

DOI: 10.7717/peerj.9477/supp-3

Download

Overview of other groups’ responses to playback treatments.

** means ± SD.

DOI: 10.7717/peerj.9477/supp-4

Download

Comparison of songs given in response to duet playbacks with spontaneous duets and clouded leopard songs given by the same groups (Pairwise comparisons using Wilcoxon rank sum test, with Benjamini & Hochberg corrections).

^¤ As there are ties present in the data, Dunn’s test with Benjamini & Hochberg corrections was used to perform multiple pairwise comparisons between song types on the number of ‘sharp wow’ notes so as to correct z-quantiles for ties. (*P < 0.05).

Predator songs were found to be introduced by a longer ‘hoo’ note series, that also contains more ‘hoo’ notes than spontaneous duet songs and duet playback responses (mean ‘hoo’ duration: spontaneous duet: 8.2 ± 3.5 s; playback duet: 4.7 ± 2.7 s; leopard song: 23.4 ± 6.7 s; mean ‘hoo’notes number: spontaneous duet: 11.0 ± 4.5; playback duet: 7.4 ± 2.7; leopard song: 48.8 ± 11.4). Furthermore, predator songs were found to be longer in duration with a delayed first great call production compared to spontaneous duet songs and duet playback responses (mean song duration: spontaneous duet: 794.5 ± 340.1 s; playback duet: 1,006.8 ± 122.3 s; leopard song: 2,396.4 ± 775.8 s; mean latency to first great call: spontaneous duet: 104.8 ± 37.6 s; playback duet: 99.0 ± 41.1 s; leopard song: 816.4 ± 368.0 s).

Additionally, differences emerged when analysing the production latency of the first ‘sharp wow’ and in the number of ‘sharp wow’ notes produced, with predator songs containing more ‘sharp wow’ notes with a delayed production (mean latency to first ‘sharp wow’: spontaneous duet: 78.1 ± 31.1 s; playback duet: 90.8 ± 35.9 s; leopard song: 370.5 ± 183.2 s; mean ‘sharp wow’ notes number: spontaneous duet: 11.5 ± 7.1; playback duet: 5.6 ± 6.0; leopard song: 362.2 ± 233.9).

DOI: 10.7717/peerj.9477/supp-5

Download

Comparison of the first ten notes produced across singing contexts (duet playback responses, spontaneous duets and clouded leopard songs given by the same five groups, N = 15) Pairwise comparisons using Chi-squared post-hoc tests, with Benjamini & Hochberg.

(*P < 0.05).

Leopard songs were found to contain more ‘hoo’ notes and less ‘leaning wa’ than spontaneous duets and playback duet songs, with no differences between spontaneous duets and playback duet songs (‘hoo’: spontaneous duet: 1.2 ± 1.3; playback duet: 1.2 ± 1.5; leopard song: 6.1 ± 2.5; ‘leaning wa’: spontaneous duet: 1.0 ± 1.6; playback duet: 1.0 ± 1.3; leopard song: 0.2 ± 0.6).

However, songs in response to duet playbacks contained more ‘wa-oo’ notes than spontaneous duet songs and predator songs, with spontaneous duet songs containing also more ‘wa-oo’ notes than predator songs (‘wa-oo’: spontaneous duet: 1.8 ± 2.0; playback duet: 5.0 ± 2.3; leopard song: 0.1 ± 0.3).

DOI: 10.7717/peerj.9477/supp-6

Download

Raw dataset (vocal and behavioural data).

Sheet 1: Peak intensity of female great call climaxes in spontaneous duet songs

Sheet 2: Stimuli characteristics

Sheet 3: Response overview of the different tested groups following playback experiments of a duet song or a predator song

Sheet 4: Singing responses spreading into non-tested neighbouring groups.

Sheet 5: Vocal data set for comparison of song responses to duet playback treatments, spontaneous duet songs and predator songs.

Sheet 6: Male reply latency to their female mate great calls in song responses to duet playback treatments, spontaneous duet songs and predator songs

Sheet 7: First ten notes produced in song responses to duet playback treatments, spontaneous duet songs and predator songs

Sheet 8: Primary male scan sampling data following duet song and predator song playback treatments.

Sheet 9: all occurrence behaviours following duet song and predator song playback treatments.

Sheet 10: Primary male immediate behaviour following duet song and predator song playback treatments (Latency and duration of first look towards the speaker).

DOI: 10.7717/peerj.9477/supp-7

Download

The raw data used to elaborate this study which investigates the duet song production timing between two consecutive white-handed gibbons duetting groups.

Groups can either overlap another group’s ongoing duet (counter-singing) or await the end of their song that starts their own duet song. We found that group composition (single vs multi-male groups), relatedness and spatial proximity all played significant roles in whether a group will engage in respectful or counter-singing exchange with a previous duetting group.

DOI: 10.7717/peerj.9477/supp-8

Download

The raw data used to elaborate this study which investigates whether white-handed gibbons could code differently their song according to the type of predator presented (i.e. Clouded leopard model Vs Reticulated python model).

We found consistent differences at all levels of song organisation, i.e. at the note, figure, phrase and sequence level, especially during the early part of song, that could potentially allow signallers to reliably convey external events, allowing listeners to make inference about them.

DOI: 10.7717/peerj.9477/supp-9

Download

[1] Aguilar de Soto N, Madsen PT, Tyack P, Arranz P, Marrero J, Fais A, Revelli E, Johnson M. 2012. No shallow talk: cryptic strategy in the vocal communication of Blainville’s beaked whales. Marine Mammal Science 28(2):E75-E92

[2] Arnold K, Zuberbühler K. 2006a. Language evolution: semantic combinations in primate calls. Nature 441(7091):303

[3] Arnold K, Zuberbühler K. 2006b. The alarm-calling system of adult male putty-nosed monkeys, Cercopithecus nictitans martini. Animal Behaviour 72(3):643-653

[4] Arnold K, Zuberbühler K. 2013. Female putty-nosed monkeys use experimentally altered contextual information to disambiguate the cause of male alarm calls. PLOS ONE 8(8):e65660

[5] Barelli C, Mundry R, Heistermann M, Hammerschmidt K. 2013. Cues to androgens and quality in male gibbon songs. PLOS ONE 8(12):e82748

[6] Benjamini Y, Hochberg Y. 1995. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society: Series B (Methodological) 57(1):289-300

[7] Boissy A. 1995. Fear and fearfulness in animals. Quarterly Review of Biology 70(2):165-191

[8] Bolhuis JJ, Beckers GJ, Huybregts MA, Berwick RC, Everaert MB. 2018. Meaningful syntactic structure in songbird vocalizations? PLOS Biology 16(6):e2005157

[9] Carbone L, Alan Harris R, Gnerre S, Veeramah KR, Lorente-Galdos B, Huddleston J, Meyer TJ, Herrero J, Roos C, Aken B, Anaclerio F, Archidiacono N, Baker C, Barrell D, Batzer MA, Beal K, Blancher A, Bohrson CL, Brameier M, Campbell MS, Capozzi O, Casola C, Chiatante G, Cree A, Damert A, de Jong PJ, Dumas L, Fernandez-Callejo M, Flicek P, Fuchs NV, Gut I, Gut M, Hahn MW, Hernandez-Rodriguez J, Hillier LDW, Hubley R, Ianc B, Izsvák Z, Jablonski NG, Johnstone LM, Karimpour-Fard A, Konkel MK, Kostka D, Lazar NH, Lee SL, Lewis LR, Liu Y, Locke DP, Mallick S, Mendez FL, Muffato M, Nazareth LV, Nevonen KA, O’Bleness M, Ochis C, Odom DT, Pollard KS, Quilez J, Reich D, Rocchi M, Schumann GG, Searle S, Sikela JM, Skollar G, Smit A, Sonmez K, Hallers B, Terhune E, Thomas GWC, Ullmer B, Ventura M, Walker JA, Wall JD, Walter L, Ward MC, Wheelan SJ, Whelan CW, White S, Wilhelm LJ, Woerner AE, Yandell M, Zhu B, Hammer MF, Marques-Bonet T, Eichler EE, Fulton L, Fronick C, Muzny DM, Warren WC, Worley KC, Rogers J, Wilson RK, Gibbs RA. 2014. Gibbon genome and the fast karyotype evolution of small apes. Nature 513(7517):195-201

[10] Carnap R. 1988. Meaning and necessity: a study in semantics and modal logic. Chicago: University of Chicago Press.

[11] Clarke E, Reichard UH, Zuberbühler K. 2006. The syntax and meaning of wild gibbon songs. PLOS ONE 1(1):e73

[12] Clarke E, Reichard UH, Zuberbühler K. 2012. The anti-predator behaviour of wild white-handed gibbons (Hylobates lar) Behavioral Ecology and Sociobiology 66(1):85-96

[13] Clarke E, Reichard UH, Zuberbühler K. 2015. Context-specific close-range “hoo” calls in wild gibbons (Hylobates lar) BMC Evolutionary Biology 15(1):56

[14] Clink DJ, Bernard H, Crofoot MC, Marshall AJ. 2017. Investigating individual vocal signatures and small-scale patterns of geographic variation in female Bornean gibbon (Hylobates muelleri) great calls. International Journal of Primatology 38(4):656-671

[15] Dunn OJ. 1964. Multiple comparisons using rank sums. Technometrics 6(3):241-252

[16] Ellefson JO. 1968. Territorial behavior in the common white-handed gibbon, Hylobates lar Linn. In: Jay PC, ed. Primates: Studies in adaptation and variability. New York: Holt, Rinehart and Winston. 180-199

[17] Fedurek P, Zuberbühler K, Dahl CD. 2016. Sequential information in a great ape utterance. Scientific Reports 6(1):38226

[18] Fischer J, Hammerschmidt K. 2006. Vocal communication in Barbary macaques: a comparative perspective. Barbary Macaque: Biology, Management and Conservation 73(1):33-45

[19] Fischer J, Hammerschmidt K, Todt D. 1995. Factors affecting acoustic variation in Barbary-macaque (Macaca sylvanus) disturbance calls. Ethology 101(1):51-66

[20] Fischer J, Hammerschmidt K, Todt D. 1998. Local variation in Barbary macaque shrill barks. Animal Behaviour 56(3):623-629

[21] Geissmann T. 1999. Duet songs of the siamang, Hylobates syndactylus: II. Testing the pair-bonding hypothesis during a partner exchange. Behaviour 136(8):1005-1039

[22] Geissmann T. 2002. Duet-splitting and the evolution of gibbon songs. Biological Reviews 77(1):57-76

[23] Geissmann T, Orgeldinger M. 2000. The relationship between duet songs and pair bonds in siamangs, Hylobates syndactylus. Animal Behaviour 60(6):805-809

[24] Grassman LI. 2001. Spatial ecology and conservation of the felid community in Phu Khieo Wildlife Sanctuary, Thailand. Action Treasury. Report to Cat

[25] Grice HP. 1969. Utterer’s meaning and intentions. Philosophical Review 78(2):147-177

[26] Grow NB. 2019. Cryptic communication in a montane nocturnal haplorhine, Tarsius pumilus. Folia Primatologica 90(Suppl. 5):404-421

[27] Haimoff EH. 1984. Acoustic and organizational features of gibbon songs. In: Preuschoft H, Chivers DJ, Brockelman WY, Creel N, eds. The Lesser Apes: Evolutionary and Behavioural Biology. Edinburgh: Edinburgh University Press. 333-353

[28] Keenan S, Lemasson A, Zuberbühler K. 2013. Graded or discrete? A quantitative analysis of Campbell’s monkey alarm calls. Animal Behaviour 85(1):109-118

[29] Kershenbaum A. 2014. Entropy rate as a measure of animal vocal complexity. Bioacoustics 23:195-208

[30] Kershenbaum A, Blumstein DT, Roch MA, Akçay Ç, Backus G, Bee MA, Bohn K, Cao Y, Carter G, Cäsar C, Coen M, DeRuiter SL, Doyle L, Edelman S, Ferrer-i-Cancho R, Freeberg TM, Garland EC, Gustison M, Harley HE, Huetz Cé, Hughes M, Hyland Bruno J, Ilany A, Jin DZ, Johnson M, Ju C, Karnowski J, Lohr B, Manser MB, McCowan B, Mercado E, Narins PM, Piel A, Rice M, Salmi R, Sasahara K, Sayigh L, Shiu Y, Taylor C, Vallejo EE, Waller S, Zamora-Gutierrez V. 2016. Acoustic sequences in non-human animals: a tutorial review and prospectus. Biological Reviews 91(1):13-52

[31] Kershenbaum A, Bowles AE, Freeberg TM, Jin DZ, Lameira AR, Bohn K. 2014. Animal vocal sequences: not the Markov chains we thought they were. Proceedings of the Royal Society B: Biological Sciences 281(1792):20141370

[32] Kershenbaum A, Garland EC. 2015. Quantifying similarity in animal vocal sequences: which metric performs best? Methods in Ecology and Evolution 6(12):1452-1461

[33] Koda H, Lemasson A, Oyakawa C, Pamungkas J, Masataka N. 2013. Possible role of mother-daughter vocal interactions on the development of species-specific song in gibbons. PLOS ONE 8(8):e71432

[34] Koenig A. 1998. Visual scanning by common marmosets (Callithrix jacchus): Functional aspects and the special role of adult males. Primates 39(1):85-90

[35] Lemasson A, Ouattara K, Bouchet H, Zuberbühler K. 2010. Speed of call delivery is related to context and caller identity in Campbell’s monkey males. Naturwissenschaften 97(11):1023-1027

[36] Marshall JT, Marshall ER. 1976. Gibbons and their territorial songs. Science 193(4249):235-237

[37] Matsudaira K, Ishida T, Malaivijitnond S, Reichard UH. 2018. Short dispersal distance of males in a wild white-handed gibbon (Hylobates lar) population. American Journal of Physical Anthropology 1(1):61-71

[38] Mitani JC. 1985. Gibbon song duets and intergroup spacing. Behaviour 92(1/2):59-96

[39] Mundry R, Fischer J. 1998. Use of statistical programs for nonparametric tests of small samples often leads to incorrect Pvalues: examples from animal behaviour. Animal Behaviour 56(1):256-259

[40] Ouattara K, Lemasson A, Zuberbühler K. 2009a. Campbell’s monkeys use affixation to alter call meaning. PLOS ONE 4(11):e7808

[41] Ouattara K, Lemasson A, Zuberbühler K. 2009b. Campbell’s monkeys concatenate vocalizations into context-specific call sequences. Proceedings of the National Academy of Sciences of the United States of America 106(51):22026-22031

[42] Ouattara K, Zuberbühler K, N’goran EK, Gombert J-E, Lemasson A. 2009. The alarm call system of female Campbell’s monkeys. Animal Behaviour 78(1):35-44

[43] Oyakawa C, Koda H, Sugiura H. 2007. Acoustic features contributing to the individuality of wild agile gibbon (Hylobates agilis agilis) songs. American Journal of Primatology 69(7):777-790

[44] R core Team. 2018. R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing.

[45] Rabinowitz A, Andau P, Chai PPK. 1987. The clouded leopard in Malaysian Borneo. Oryx 21(2):107-111

[46] Raemaekers JJ, Raemaekers PM. 1985a. Field playback of loud calls to gibbons (Hylobates lar): territorial, sex-specific and species-specific responses. Animal Behaviour 33(2):481-493

[47] Raemaekers PM, Raemaekers JJ. 1985b. Long-range vocal interactions between groups of gibbons (Hylobates lar) Behaviour 95(1–2):26-44

[48] Raemaekers JJ, Raemaekers PM, Haimoff EH. 1984. Loud calls of the gibbon (Hylobates lar): repertoire, organisation and context. Behaviour 91(1–3):146-189

[49] Schel AM, Tranquilli S, Zuberbühler K. 2009. The alarm call system of two species of black-and-white colobus monkeys (Colobus polykomos and Colobus guereza) Journal of Comparative Psychology 123(2):136-150

[50] Scott-Phillips TC. 2010. Animal communication: insights from linguistic pragmatics. Animal Behaviour 79(1):e1-e4

[51] Seyfarth R, Cheney D. 2018. Pragmatic flexibility in primate vocal production. Current Opinion in Behavioral Sciences 21:56-61

[52] Seyfarth RM, Cheney DL, Marler P. 1980a. Monkey responses to three different alarm calls: evidence of predator classification and semantic communication. Science 210(4471):801-803

[53] Seyfarth RM, Cheney DL, Marler P. 1980b. Vervet monkey alarm calls: semantic communication in a free-ranging primate. Animal Behaviour 28(4):1070-1094

[54] Siegel S, Castellan NJ. 1988. The Friedman two-way analysis of variance by ranks. Nonparametric Statistics for the Behavioral Sciences 174-184

[55] Slocombe KE, Zuberbühler K. 2005. Functionally referential communication in a chimpanzee. Current Biology 15(19):1779-1784

[56] Smith WJ. 1977. The behavior of communicating: an ethological approach. Cambridge: Harvard University Press.

[57] Terleph TA, Malaivijitnond S, Reichard UH. 2015. Lar gibbon (Hylobates lar) great call reveals individual caller identity: lar Gibbon Great Calls. American Journal of Primatology 77(7):811-821

[58] Terleph TA, Malaivijitnond S, Reichard UH. 2016. Age related decline in female lar gibbon great call performance suggests that call features correlate with physical condition. BMC Evolutionary Biology 16(1):4

[59] Terleph TA, Malaivijitnond S, Reichard UH. 2017. Male white-handed gibbons flexibly time duet contributions. Behavioral Ecology and Sociobiology 72(1):157

[60] Townsend SW, Engesser S, Stoll S, Zuberbühler K, Bickel B. 2018. Compositionality in animals and humans. PLOS Biology 16(8):e2006425

[61] Uhde NL, Sommer V. 2002. Antipredatory behavior in gibbons (Hylobates lar, Khao Yai, Thailand). Cambridge: Cambridge University Press.

[62] Zuberbühler K. 2000a. Referential labelling in Diana monkeys. Animal Behaviour 59(5):917-927