Rethinking templates: A syntactic analysis of verbal morphology in Emirati Arabic

This paper presents an analysis of the morphosyntax and lexical semantics of the system of verbal forms of Emirati Arabic (EA, the variety of Gulf Arabic spoken in the United Arab Emirates) in terms of syntactic decomposition of argument structure. We argue that verbal meaning is a function of at least two syntactic functional heads: Voice and little v; and a lexical head: the consonantal root. We will further show that the unified syntactic structure, resulting from the interaction of the semantics and argument structure of the root with little v and Voice, captures the regularities as well as the exceptions in the interpretation of the verb forms of EA.


Introduction
While the morphophonology of Arabic verbs has received much attention in the literature, following McCarthy's (1979) seminal work on Arabic nonconcatenative word structure, very few studies have dealt with the morphosyntax and semantics of Arabic verbs (more relevant work has been done on Hebrew; see for example Borer 1995; Doron 2003; Kastner 2016; 2018). In addition, most previous research has focused on the classical or formal variety of Arabic (e.g. Wright 1896; Ryding 2005; Fassi-Fehri 2003), generally referred to as Classical Arabic (CA) or Modern Standard Arabic (MSA). The syntax and semantics of the verbal systems of colloquial spoken dialects remain highly understudied.
In this paper, we set out to investigate the complex verbal system of EA. We present an analysis that captures the regularities in the interpretation of the nine verb forms of this variety of Arabic, while at the same time allowing for exceptions to these regularities. We also provide brief comparisons of the verbal system of EA and the better investigated verbal system of MSA (cf. McCarthy 1979; 1981; Fassi-Fehri 2003; Tucker 2010). Our analysis is also briefly evaluated against the main claims in Doron's (2003) account of Hebrew verbal templates.
Our basic assumption is that there is no separate morphological component of structure-building mechanisms and that complex verbal forms are derived in syntax through the same mechanisms that derive syntactic phrases. This assumption is more or less held by numerous syntax-based frameworks, including Distributed Morphology (DM) (Halle & Marantz 1993; Marantz 1997 and related work), Antisymmetry (Kayne 1994; Koopman & Szabolcsi 2000) and, more recently, Nanosyntax (Starke 2009).
The analysis defended in this paper is couched within the theory of Distributed Morphology (DM) (Halle & Marantz 1993; Marantz 1997 and related work). In line with DM, we take it that the different EA verbal forms are derived in the syntax, not in a separate lexical component of the grammar. We then suggest a unified syntactic structure for

Theoretical assumptions
Since work in Larson (1988), the verbal domain has been decomposed into smaller domains where different verbal arguments project and verbal properties such as valency and event and aspect structure are fixed. The higher VP-shell in Larson (1988) was later replaced by the little v head (e.g. Chomsky 1995), following ideas in Hale & Keyser (1993). Kratzer (1996) terms the projection VoiceP where Voice/v is the head hosting in its specifier the external argument of the verb, combining this argument with the verb's event structure, and assigning accusative case to the object, thus encompassing Burzio's generalization. In subsequent accounts both vP and VoiceP have been used to designate the projection hosting the external argument.
In contrast to these approaches, in DM, v is assumed to be a "verbalizer", i.e. a category-assigning head that transforms a root into a verb. Both Harley (1995) and Marantz (1997) maintain that, given this formulation, v must be present in unaccusatives and unergatives as well as in transitives. In DM approaches, Larson's lower VP-shell has been reinterpreted as the root domain where an acategorial root merges (possibly together with an internal argument, if present). This root domain can be selected by the category-assigning v head, which verbalizes the structure, creating a first phase domain where (possibly idiosyncratic) semantics are negotiated.
Starting with work in Pylkkänen (2008) and later work in Cuervo (2003), Collins (2005), Alexiadou, Anagnostopoulou, & Schäfer (2006), Merchant (2008), and Harley (2009), VoiceP and vP are assumed to occupy two different positions in the extended verbal projection. The external argument is introduced by the higher VoiceP projection, which is also the locus for voice morphology in passive voice structures. The lower vP maintains the role of verbalizing the root domain and also may carry semantic (and morphological) content with the introduction of causative or inchoative semantics (see Cuervo 2003 for detailed discussion of the different flavors of little v).
One of the main morphosyntactic functions that have been attributed to v and Voice (or related) syntactic heads in the literature is that of valency increasing and decreasing operations. This includes causatives and applicative structures which generally contain heads that introduce new arguments to the verbal argument structure and passive, middle, anticausative, and reflexive voice structures which usually either absorb one of the predicate's theta roles or assign it to an entity that has already been assigned a theta role, resulting in a reduction of the number of overt verbal arguments. For example, causative structures introduce a causative semantic component which is not available in the base structure (unaccusative, adjectival, nominal, and so on), manifested syntactically as a causative head (e.g. v or Cause), and are typically morphologically marked (Kayne 1975;Marantz 1985;Travis 1991;Chomsky 1995 and others). In terms of morphosyntactic contribution, the head CAUS (or little v with a causative flavor) introduces a structural relation between a causing event which forms the implicit argument of v CAUS and the resultant state denoted by the root predicate plus any internal arguments. Since the causative event is superimposed on the resultative event, the causative predicate is necessarily bi-eventive. Similar properties are associated with applicative heads. On the other hand, passive formation usually absorbs the external theta role of the predicate, inducing valency reduction, while being similarly morphologically marked. Similar properties emerge in reflexivization and middle voice formation.
An additional property of some of these valency-changing processes is variability in the syntactic locus of their application. Thus, for both causative and applicative verb formation, at least two different syntactic heights of causative or applicative head projection have been proposed in the relevant literature: low causatives and applicatives and high ones. Each projection level is assumed to derive verbal forms with distinct morphosyntactic and semantic properties. Let us consider an example from causatives. Lexical or low causative heads (as opposed to syntactic causatives; see Hale & Keyser 1993; Shibatani 1996; Travis 2000) are usually associated with transitivity in crosslinguistic studies (see work in Marantz 1997; Arad 1999; Travis 2000; Arad 2002; Bowers 2002; Pylkkänen 2002; Embick 2004; and others). These low causatives present a number of idiosyncratic properties, including often changing the grammatical/phrasal category of the stems they attach to and exhibiting non-transparent semantics and non-productivity. For example, in Malagasy (Travis 2000), the lexical causative prefix an- derives transitive verbs from adjectival or nominal roots, where the semantic interpretation of the resulting string is not always transparent, and several verbs are transitive without affixing the causative/transitive morpheme. On the other hand, the higher causative prefix amp- derives verbs with transparent semantics and high productivity. Travis (2000), based on Hale and Keyser (1993), assumes that both cases of causativization involve syntactic processes, but with the lower transitive an- being introduced in a lower lexical-syntax (l-syntax) domain, which is characterized by idiosyncratic properties, while the higher causative amp- merges in syntax proper (s-syntax) and displays productive syntactic properties similar to those of phrase formation.
Travis, based on evidence from Malagasy and Tagalog, assumes that the divide between l-syntax and s-syntax is an aspectual projection which introduces the event argument of the verb, termed EventP (see Travis 2010). This indicates that, while we can retain a decompositional account for the formation of causative and applicative verbs in languages, we expect several of these forms to exhibit idiosyncratic properties, both in terms of semantic interpretation as well as productivity.
Following this work, we assume the following minimal structure for the verbal domain:
(1)
The consonantal root of the verb is introduced by the RootP level of the structure. This carries the main meaning of the verb, which at this lexical level may be completely idiosyncratic. Little v attaches above the root level, fixing the categorial status as verbal, and delimiting the first phase domain. Since v is a phase head, its complement domain is unavailable for further syntactic processes and has a fixed phonological representation and semantic interpretation. For the purposes of the latter, the vP (and every first cycle structure) has been assumed to be the domain for contextual allosemy, i.e. making a semantic choice out of the various related meanings of a polysemous morpheme. Marantz (2010) defines the vP as the domain where contextual allosemy is negotiated: the meaning is fixed as soon as a morpheme is interpreted, that is, within its spell-out domain. In contrast, the domain for idiomatic interpretation, as well as other lexical properties such as productivity, may be larger (Marantz 2013a); see also the discussion of Travis (2000) above. Following Arad's (2003) work on Hebrew verbal forms, we assume that the little v projection is morphologically realized as the template of the Emirati Arabic verb. That is, little v can have different flavors, including causative, applicative and inchoative interpretations, and the morphological exponent of these flavors is realized as one of the specific forms described in the following section. The realization is manifested as prosodic changes affecting the surface form of both the root and template, namely the gemination of the second or third radical consonant (in causative Form II and inchoative Form IX, respectively) and first-syllable vowel length (in applicative Form III).
Finally, the higher VoiceP projection is spelled out as voice morphology realized as a prefix/infix on the verb form:
(2)
The analysis presented here, and the structure in (2), follow approaches that distribute verbal morphology across different projections in the lower clausal syntactic space. In particular, we espouse the system put forward by Arad (2003; see also Marantz 1997), where each of the morphemic constituents of the Arabic (and Semitic in general) verb has its own lexical and semantic contribution and is distributed in three different levels in the extended verbal projection. Following Arad (2003), we assume that the root provides the lexical content of the verb and merges at a low Root projection. We also follow Arad's (2003) proposal that the verbal template is a separate morphosyntactic entity, which merges at a higher, functional vP projection and contributes morphosyntactic content related to lower causativity (transitivity) and inchoative structures. We enrich this system based on our data from EA, in assuming that additional flavors of the little v head are available in the composition of verbal morphosyntactic structures. If little v is part of the extended morphological make-up of the Semitic verb, there is no reason to exclude other morphosyntactic features and corresponding semantic content that have been associated with the little v head in the relevant literature. Thus, following work in McGinnis (2001) and Pylkkänen (2008), we attribute applicative/associative semantics to morphosyntactic features of the v head and support this with evidence from the EA data. Finally, we depart from Arad's (2003) work in proposing that the Voice head in EA hosts a series of voice prefixes and not (at least not exclusively) the vocalic melody of the associated template or a template. In Arad's analysis certain templates may merge at the v level, selecting for the root, or at the Voice level, selecting for another lower template.
This can derive certain ambiguities in the Hebrew verbal system, as the low-merged template will derive verbs with certain lexical idiosyncrasies, while the same template, when merged at VoiceP, will derive verbal forms with more regular, compositional semantics. In our account, we assume that all templates merge at the v level and only voice/reflexive/mediopassive morphology merges at the Voice level. In our account, idiosyncrasies are derived through a division of the domain where syntactic operations apply into two distinct domains. Thus, contrary to Arad's analysis of all roots being listed in the Lexicon (the Encyclopedia in DM) together with the template in which they acquire their meaning, we assume that the root is just listed in the Lexicon with a number of possible, related meanings. One of these meanings is fixed when the first functional head, little v, is attached to the root. Further idiosyncrasy in the verb semantics may be available all the way to the VoiceP, as this seems to be the level at which idiomatic expressions are formed.
Thus, while our proposed structure in (2) is the base on which numerous verbal forms in Emirati Arabic are formed, the syntactic domain where it projects is within the lexical domain of syntax (Travis 2000) and thus subject to a number of idiosyncratic properties. This means that while we expect the specific syntactic selectional properties of a voice head to restrict the type of vPs that it selects for, in many cases there will be exceptions to the type of verbal forms that are derived by merging this specific head. We will illustrate this in our discussion of the different verbal forms in Emirati Arabic in the following sections.
This allows us to provide a more coherent model for the derivation of EA verbs, which accommodates the data, including the cases where the frequently observed semantic patterns break down, without assuming multiple distinct merger levels for the same morphemes. Each morpheme merges at a unique, clearly defined level in the extended verbal projection.
In the following section, we will explore in detail how these basic assumptions of morphosyntactic structure, together with more general syntactic operations, can derive the desired interpretations and morphosyntactic distribution of the available verbal forms in Emirati Arabic, with reference also to the corresponding MSA forms.

Previous analyses of verbal forms in MSA and EA
There are ten verbal templates associated with three-consonant root verbs in MSA and nine in EA. These are listed in (4), where, in line with the standard practice in the literature, the different forms are referred to with Roman numerals, a convention we will follow throughout the paper. Purely phonological epenthetic material appears between brackets:
and ħwall 'to be cross-eyed') in EA is more productive than in MSA, in the sense that it applies to a wider range of physical properties than just colors and physical defects (see section 5.3 for a detailed analysis). Third, Form X, e.g., staxdam 'he used (for one's benefit)', is less productive in EA and may have been borrowed from MSA. Note also that the prefix t- in EA Forms V and VI is not followed by the vowel [a]. This difference between EA and MSA can be explained in terms of the syllable structure of the two varieties. One may argue that phonotactics is involved because EA allows certain word-initial consonant clusters which MSA does not, and so the vowel after the prefix is not necessary. Note also that since EA allows word-initial consonant clusters, the epenthetic [ʔi] is not added. Studies of the phonological and morphological properties of these forms (also called templates or patterns) include McCarthy (1979; 1981) and McCarthy & Prince (1990 and subsequent work); see also Tucker (2010) for an analysis of Iraqi Arabic. However, very few researchers have investigated their semantic properties (see Benmamoun 2000; Fassi-Fehri 2003).
Within McCarthy's (1979;1981) analysis, different templates are characterized as fixed sequences of consonantal and vowel slots in what he calls a skeleton or template. During morphological derivation, root consonants are mapped onto the consonantal slots in a left-to-right fashion, while the vowel slots are filled in with the vowel melody. According to McCarthy, roots, templates, and vowel melodies all constitute separate morphemes, represented on separate, parallel autosegmental tiers.
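The left-to-right association just described can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not McCarthy's formalism: the function name `associate`, the representation of the skeleton as a string of `C` and `V` symbols, and the spreading of the last melody vowel onto any remaining V slots are our own simplifications.

```python
def associate(root, skeleton, melody):
    """Fill a CV skeleton left to right (illustrative sketch only).

    root     -- sequence of root consonants, e.g. ("k", "t", "b")
    skeleton -- string of "C" and "V" slots, e.g. "CVCVC"
    melody   -- sequence of vowels, e.g. ("a",) or ("u", "i")

    Assumption: if the melody has fewer vowels than there are V slots,
    the last vowel spreads to the remaining slots.
    """
    if skeleton.count("C") != len(root):
        raise ValueError("number of C slots must match the number of root consonants")
    if not melody:
        raise ValueError("melody must contain at least one vowel")

    consonants = iter(root)
    vowels = iter(melody)
    last_vowel = None
    output = []
    for slot in skeleton:
        if slot == "C":
            output.append(next(consonants))
        elif slot == "V":
            # Take the next melody vowel, or reuse the previous one (spreading).
            last_vowel = next(vowels, last_vowel)
            output.append(last_vowel)
        else:
            raise ValueError(f"unknown slot symbol: {slot!r}")
    return "".join(output)


print(associate(("k", "t", "b"), "CVCVC", ("a",)))      # katab
print(associate(("k", "t", "b"), "CVCVC", ("u", "i")))  # kutib
```

The three arguments correspond to the three separate tiers (root, skeleton, vowel melody); the second call uses the passive melody {u - i} purely for illustration.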
Following recent approaches towards Arabic, which attempt to derive notions such as skeleton or template as emergent properties, Tucker (2010) argues that roots are real morphemes which combine with affixes (whether vocalic, e.g., passive {u -i} or consonantal, e.g., {n} and {t}), but templates are nothing more than the outcome of the interaction of universal syllabic and prosodic constraints.
The status of templates as independent morphemes which occupy a specific category-determining v slot in the verbal functional domain relies mainly on two different assumptions about little x slots of this type (see Arad 2003; Tucker 2010):
i) how the interpretation of the root is set (Marantz 2001 and later work)
ii) argument structure (for example Embick 2004)
In both Arad's (2003) and Tucker's (2010) analyses, neither of these assumptions is confirmed by the morphosyntactic properties of Semitic verbal forms. Thus, with respect to the interpretive properties of verbal forms, in both Arabic and Hebrew, there is a many-to-many relationship between verbal form and verbal semantics. This seems to also be the case in Emirati Arabic. Thus, Form I verbs can have transitive and intransitive interpretations, as the following
The fact that there seems to be a many-to-many correspondence between verb forms and semantic content for these forms has led Tucker (2010) to propose that the templatic part of the verb make-up in Semitic languages is not a morphological entity, but rather an emergent property of words which surfaces from the necessary satisfaction of high-ranking prosodic markedness constraints. This requires a restatement of certain assumptions about linearization of morphosyntactic strings (see for example Embick 2010), allowing for the phonological component to linearize morphosyntactic strings smaller than words, while word units are linearized as before within the syntactic component (see Tucker 2010 for details of the proposal). While a prosodic linearization analysis adequately explains the many-to-many correspondence between templates and semantic interpretation in Semitic verbal forms, it fails to capture the significant degree of regularization that still exists in this correspondence (see for example Younes 2000 for Palestinian Arabic and Arad 2003 for extensive discussion based on a large number of Hebrew roots and available verbal forms).
It is for example true that when both Form I and Form II verbs are available, Form II verbs have almost always greater valency than Form I verbs. In addition, it is not clear how a prosodic linearization analysis could be extended to capture the contribution of templates in other areas of word morphology in Arabic, such as the formation of irregular plurals or the expression of comparison in adjectival forms.
For these reasons, we follow here Arad's proposal that the template is in fact the realization of morphosyntactic features on the v head. In Arad's proposal, the fact that not all roots are available for all templates (discussed here in Section 8) follows from selectional restrictions of the v head. In other words, v selects for specific roots, resulting possibly in a different v head for each available verbal pattern. We adopt this approach here, assuming different flavors of v, as has independently been established in the relevant literature, and propose that any irregularities in the correspondence between form and semantic interpretation are independently derived from the syntactic structure and the selectional restrictions of the functional heads contained in the structure, as well as the level at which the relevant syntactic rules operate. In the following sections we discuss in detail all possible verbal forms in Emirati Arabic, with occasional references to the corresponding MSA forms, and we show how the proposed structure and generally held assumptions on the syntactic operations involved capture both the form and semantics of the derived verbs straightforwardly.
Given the fact that no contemporary linguistics studies have examined EA, one of the main goals of this paper is to properly describe the EA verbal system and compare it to that of MSA. Data was collected from EA native speakers' spontaneous speech as validated against seven informants in addition to author intuitions. Verbal forms were then classified, depending on their prosodic shapes. Each verb was then analyzed along a number of criteria such as the nature of the agent and patient and transitivity.

Form I: The default form
Form I in MSA can be transitive or intransitive (Wright 1896; Ryding 2005). This transitivity is marked by the quality of the second vowel. This form has three shapes: C₁aC₂aC₃, C₁aC₂uC₃, and C₁aC₂iC₃. Generally, CaCaC is associated with transitive verbs and a few intransitive verbs (e.g., katab 'he wrote', ʒalas 'he sat'). By contrast, CaCiC and CaCuC usually go with stative intransitive verbs indicating a (permanent) quality or a (temporary) state (e.g., ħasun 'he was beautiful', kabur 'he was big/old' vs. ħazin 'he was sad', fariħ 'he was glad', maridˤ 'he was sick'). For many researchers who take Arabic verbs to be derived from whole words (as opposed to consonantal roots and templates), Form I often serves as the input for subsequent derivations.
In EA, just like its MSA counterpart, Form I is the simplest, unmarked form. However, unlike in MSA, it does not show the same semantically-driven alternations in the quality of the second vowel. Also, phonetically, the first vowel in Form I in EA is generally a schwa but turns into /a/ after a guttural consonant (kətab 'he wrote' vs. ħafaðˁ 'he memorized'). This alternation is phonologically conditioned and has no semantic implications. The examples in (7)-(15) illustrate EA Form I. As can be seen from the interpretations next to each example, EA Form I has a wide range of meanings, including transitive (7)-(11), and intransitive of the unergative (12), or unaccusative/inchoative type (13)-(14).
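The phonologically conditioned alternation in the first vowel can be made concrete with a small Python sketch. Several things here are assumptions on our part rather than claims of the analysis: the function name `ea_form_i`, the particular set of consonants counted as guttural, and the choice of [a] as the second vowel (taken from the cited examples only).

```python
# Assumed guttural inventory, for illustration only; the text above
# does not enumerate the guttural consonants.
GUTTURALS = {"ħ", "ʕ", "x", "ɣ", "h", "ʔ"}


def ea_form_i(root):
    """Build an EA Form I stem from a triconsonantal root (sketch).

    The first vowel is schwa by default and /a/ after a guttural
    first radical. The second vowel is assumed to be [a].
    """
    c1, c2, c3 = root
    first_vowel = "a" if c1 in GUTTURALS else "ə"
    return f"{c1}{first_vowel}{c2}a{c3}"


print(ea_form_i(("k", "t", "b")))    # kətab
print(ea_form_i(("ħ", "f", "ðˁ")))   # ħafaðˁ
```

Roots are passed as tuples of strings so that multi-character transcriptions such as ðˁ count as a single radical.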
(Inchoative)
det-water got cold.3m.sg
'The water got cold.'
The diversity of verb types derived through Form I in Emirati Arabic points towards this being a default form, as has been assumed in various grammars for both Emirati Arabic (Qafisheh 1977; Holes 1990) and MSA (Ryding 2005). Thus, we assume the syntactic structure in (17) for Form I (which represents Form I kəsar as it appears in (9)):
(17)
As discussed in Section 2, the external argument of the verb is introduced in the specifier of VoiceP (Kratzer 1994; Harley 2009; 2014a; 2014b).
Little v in this template carries no special selectional restrictions and simply acts as a verbalizer in the sense of Marantz (2001). The meaning of the form is fixed following locality restrictions on contextual allosemy (Marantz 2013a). That is, at this low first phase domain, containing the root and any possible internal arguments, the polysemy of the root allows for numerous choices of meaning to be negotiated and fixed. Once the little v is projected, the meaning is fixed to one of these possible interpretations. For example, the fact that the root ksr 'break' implies an action, and given that it projects an object complement, the resulting Form I is transitive. By contrast, since the root ðwb 'melt' is unaccusative, the output Form I is derived by moving the internal argument of the verb to spec-VoiceP as in standard assumptions of unaccusative formation (see for example Marantz 1985;Baker 1988;Hung 1988;Travis 1991;Koopman 1992; and many others).

Form II: The causative
Form II in MSA
Form II is morphologically characterized by the gemination of the medial root consonant: C₁aC₂C₂aC₃. Ryding (2005) describes Form II verbs as causative or transitive counterparts of Form I verbs. For example, Form I fariħ 'he was glad' corresponds to Form II farraħ 'he gladdened s.o.' (i.e., he caused s.o. to be glad). Verbs which are already transitive in Form I become causative (or, in traditional terms, double transitive) in Form II (e.g., katab 'he wrote' versus kattab 'he made someone write'). In addition, this form is typically associated with 'intensity' or 'extensity' in terms of the nature of the action (where the latter is generally violent, such as in dˤarrab 'he beat violently', as opposed to Form I dˤarab 'he beat'), temporally extensive actions (as in kassar 'he broke into pieces', as opposed to Form I kasar 'he broke'), numerically extensive actions (as in farraq 'he dispersed groups or people', as opposed to Form I faraq 'he dispersed/split'), and repeated or frequentative actions (as in tˤawwaf 'he went around often', as opposed to Form I tˤaaf, with the underlying form /tˤawaf/ 'he went around'). Following Greenberg (1991), Fassi-Fehri (2003) proposes that Form II has the meaning of 'plurality', which is realized as the reduplication of the medial consonant, and is associated, in addition to intensity, with (i) temporal repetition (or repeated action) as in (18) and (19), (ii) plural action on many with transitives (20) and by many with intransitives (21). 2
(18) kassar l-raʒul-u l-kaʔs-a.
broke-3m.sg det-man-nom det-glass.acc
'The man broke the glass into pieces.'
(19) ʒawwal l-raʒul-u.
walked-3m.sg det-man-nom
'The man took many walks.'
(20) ʒarraħ l-raʒul-u l-ʒunuud-a.
wounded-3m.sg det-man-nom det-soldiers-acc
'The man inflicted wounds on many soldiers.' (action on many)
(21) barrak l-naʕam-u.
kneeled-3m.sg det-camels-nom
'The (whole herd of) camels kneeled.' (action by many)
Form II in EA primarily bears the causative meaning. That is, the template applies to Form I verbs (or adjectival and nominal bases), introducing a causative interpretation and increasing the valency of the predicate by introducing a causative argument. The following examples illustrate this with Form I intransitive verbs:
Ahmad caused the door to break/be broken.
(36) Ahmed used the tent as it should normally be used.
In establishing a syntactic structure for Form II, it is important to emphasize what was stated above with regard to the main assumptions: the causative, just like the inchoative and applicative, is a feature of little v (see Harley 2013). Note that the example in (32) might not seem to have an obvious causative interpretation, but if it is derived from a root with a nominal nature (cf. footnote 3), then valency increasing is necessary in order to introduce the single argument of the derived verbal predicate (x be (in) tent).
In some accounts, the causative interpretation as well as the introduction of the Causer argument is mediated by a causative functional head, as in Pylkkänen (2008) and Key (2013) (cf. also Harley 2017). In other words, causative constructions involve two events: i) a causing event headed by Cause and ii) a caused event headed by the main predicate/verb, Cause being a relation between two events.
While this structure most likely derives higher causatives, in cases of low causation (i.e. transitivity alternations), the causative feature (and the causer argument) may be introduced by little v. Alternatively, one can assume that the CausP and vP projections are "bundled" in the sense of Pylkkänen (2008) and Harley (2017), i.e. that features relating to v and Cause are bundled together in EA (leaving open the possibility that in other languages these features are expressed by separate syntactic heads). We propose the following structure for the derivation of pairs of intransitive (Form I) and transitive/causative (Form II) forms of the verb wəgaf 'he stopped'/waggaf 'he caused to stop' in EA (from the examples in (23)):
As shown in (37), the root wgf is selected by a causative v, which is then spelled out as gemination, resulting in waggaf (see Kastner (2016) for a similar analysis of Hebrew verbs). We assume here that the Causer is introduced in spec-vP as in standard accounts, although it could also be introduced directly in the higher VoiceP, as in Harley (2017), where the little v has no specifier and acts simply as a verbalizer with a causative feature. Both accounts result in the same derived structure.
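As a rough illustration of the spell-out step, the following Python sketch treats the causative flavor of little v as a feature whose exponent is gemination of the medial radical. The class name `LittleV`, the flavor label `"caus"`, and the function `spell_out` are hypothetical names introduced here for exposition; the sketch models only the string outcome, not the syntactic derivation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LittleV:
    """A verbalizing head with a flavor feature (hypothetical model)."""
    flavor: str  # only "caus" is handled in this sketch


def spell_out(root, v):
    """Return the surface stem for a triconsonantal root selected by v.

    A causative v is realized as gemination of the second radical
    (C1aC2C2aC3). Other flavors are deliberately left unimplemented.
    """
    c1, c2, c3 = root
    if v.flavor == "caus":
        return f"{c1}a{c2}{c2}a{c3}"
    raise NotImplementedError(f"no exponent defined here for flavor {v.flavor!r}")


print(spell_out(("w", "g", "f"), LittleV("caus")))  # waggaf
print(spell_out(("k", "s", "r"), LittleV("caus")))  # kassar
```

Keeping the flavor as a feature on a single head, rather than as a separate Cause head, mirrors the bundling option discussed above; nothing in the sketch decides between the two accounts.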
It is important at this stage to point out that in MSA, the causative meaning is expressed mainly by Form IV and only occasionally by Form II. According to Wright (1896), Form IV is morphologically realized by the prefix ʔ-. 4 He suggests that it bears either the unmarked or the causative meanings, proposing that if the verb in Form I is intransitive, it becomes transitive in Form IV (compare (38) and (39)).  (40) and (41)):
(UR: /arʔaj/)
see-cause.3m.sg det-man-acc thing.acc
'he showed the man.' ('he made the man see.')
Wright also implies that when verbs built on Form II and Form IV are both causative, then a slight difference in meaning could emerge (e.g., ʕallam 'to teach', ʔaʕlam 'to let know' or 'to inform') or, in most cases, they just mean the same. For example, both xabbar and ʔaxbar mean 'to let know'. Interestingly, Form IV is not attested in EA, and the causative meaning is exclusively expressed by Form II and, to a lesser extent, by Form I.
We have shown clearly that Form II verbs in Emirati Arabic are causative counterparts of Form I intransitive and transitive verbs and have provided a decompositional syntactic structure which accounts for this. However, it is not always the case that a Form I verb has less valency than a Form II verb in Emirati Arabic. Thus, in a very small number of cases, Form I verbs may share the same usage and meaning with the corresponding Form II verbs. For example, both Form I kəsar and Form II kassar have exactly the same interpretation 'he broke something' in EA. 5 We return to this in our discussion in Section 8.

Form III: The associative/applicative
Form III in MSA is morphologically characterized by vowel length in the first syllable: C₁aaC₂aC₃. It mainly introduces a new participant in the action denoted by the verb and is thus termed "associative" (Ryding 2005: 503). It can express reciprocity (e.g., raafaq 'he accompanied someone, where someone also accompanied him' and sˁaafaħ 'he shook hands with someone, where someone also shook hands with him'), in addition to repeated and/or attempted actions. Wright (1896) suggests that this form could be unmarked, 6 but reciprocity is always more or less implied (e.g., saafar 'to travel', where in the old Arab tradition and context travelling was always done in pairs or groups).
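The Form III shape can likewise be stated as a simple string pattern. The sketch below is illustrative only: the function name `msa_form_iii` is ours, and it covers only the stem shape with a long first-syllable vowel, not the associative semantics.

```python
def msa_form_iii(root):
    """Build an MSA Form III stem: C1 aa C2 a C3 (illustrative sketch).

    Roots are tuples of strings so that transcriptions such as "sˁ"
    are treated as single radicals.
    """
    c1, c2, c3 = root
    return f"{c1}aa{c2}a{c3}"


for root in [("r", "f", "q"), ("s", "f", "r"), ("sˁ", "f", "ħ")]:
    print(msa_form_iii(root))
# raafaq
# saafar
# sˁaafaħ
```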
Related to the idea of reciprocity in Form III is the notion of plurality. Benmamoun (2000) argues that Form III is the plural form of Form I. He argues that this form denotes a plurality of events, and that each event involves at least one agent. He compares the morphological marker of Form III, vowel length, to that of plurals in Arabic, also marked with a long vowel.
According to Fassi-Fehri (2003), Form III in MSA expresses the meaning of 'participation' as in:
(42) ʃaarab l-rajul-u l-ʃaabb-a.
drink.3m.sg det-man-nom det-young man-acc
'the man drank with the young man.' i.e., 'he shared a drink with him.'
It is important to note that 'participation' in Fassi-Fehri's sense means that both participants are interpreted as agents of the action, when the event is the same (as in (42)). It emerges that Form III typically describes an event which involves two (or more) participants and expresses some reciprocity. From both these notions follows a sense of plurality of events/actions.
In EA, this form results in similar interpretations. However, unlike its counterpart in MSA, EA Form III does not really carry the reciprocal meaning. In EA, Form III is almost always 7 either transitive, in which case it requires a human object/patient (the 'theme' thematic role) (e.g., Ali ʕaaqab Ahmad 'Ali punished Ahmad') or dative (the 'goal' thematic role) as with verbs that require a preposition (e.g., Ali sˁaarax ʕala Ahmad 'Ali yelled at Ahmad'). In short, the most consistent feature of this form in EA is the presence of two animate participants in the event. Consider the examples in (43)
Ali waayag (min l-baab) ʕala l-walad.
Ali gazed.3m.sg (from det-door) on det-boy
'Ali gazed or looked secretly (through the door) at the boy.'
The interpretation and argument structure of Form III resemble a type of associative/applicative morphology which is sometimes overtly realized in other languages. See for example the discussion of the suffix -an in Bantu languages, which surprisingly provides the same interpretations as Form III with Arabic verbs: it denotes reciprocity with transitive verbs and a notion of "acting together", i.e. associativity, with intransitive verbs (see Schadeberg 2003 for discussion of these properties of Bantu -an). It is this second feature of associativity that we think encapsulates the semantic properties of Form III verbs in Emirati Arabic. We could analyze Form III verbs as applicatives in the sense of Pylkkänen (2008), who distinguishes two types of applicatives. The first type, high applicatives, relates new participants to the event described by the verb: it is a relation between an event and an individual. The second type, low applicatives, relates individuals to the direct object: it is a relation between two individuals.
If Form III involves an applicative head that relates an additional argument (in addition to the agent) to the main predicate, it is not clear how the different EA Form III verbs fall into either of Pylkkänen's types. In fact, Form III seems to exploit both types depending on the root. For example, Ali laaʕab l-walad 'Ali played with the boy' is a case of a high applicative, where a new participant (l-walad 'the boy') is related to the event 'play' denoted by the root (see (46)). By contrast, Ali waayag ʕala l-walad 'Ali gazed/looked secretly at the boy' or, more accurately, 'Ali gave a look to the boy' is a case of a low applicative, where Ali and the boy are related to the null argument, in (51). In most cases, however (e.g. all the examples in the table of (43)), a high applicative account can be maintained, and we will adopt this analysis here. For purely terminological reasons, and following the tradition in the MSA literature, we will term this form the associative but will assume a high applicative structure and interpretation.
(Pylkkänen 2002;
We base our treatment of Form III verbs on Pylkkänen's (2008) analysis of applicatives. However, we propose that in Form III, the applicative is a feature in little v. As can be seen in the structure in (52), Pylkkänen (2008) places the higher applicative head ApplP between the VP shell and VoiceP. This is exactly where we assume v resides and it is plausible to assume that ApplP is in fact vP with v having an applicative flavor.
The additional participant is introduced in ApplP while the external argument merges in spec-VoiceP. The structure straightforwardly explains why in most cases, in EA, Form III involves two animate participants: one is primary (the agent) and the other is secondary (a patient who is also involved in the event). However, the degree of the involvement of the so-called "secondary" participant in the event depends on the meaning and argument structure of the root itself. For example, in (49), both Fatma and Shaikha are involved in the process of becoming friends; whereas in the first sentence in (47), Ahmad undergoes the event of punishment more than he actually participates in it. Following the same pattern as discussed with causative Form II verbs in the previous section, we assume that Form III verbs are derived from the same minimal structure, albeit this time with a little v head which has an associative feature. This feature requires an additional (frequently but not always) human participant in the action denoted by the verbal root. Thus, for a transitive Form III verb, as in example (46), we assume the following structure:
As shown in (53), the structure of the associative template would be similar to the structure of the template of Form II, except that the little v in (53) has the feature [+associative] which introduces the applicative argument. This generates the desired associative interpretation and is morphologically expressed by a prosodic change (vowel length) to the unmarked template.
In other cases, the associative v does not introduce the applicative argument but rather forces movement of a lower Root argument to its specifier (perhaps due to an EPP requirement). This would derive transitive verbs of the type in (47)-(51). Thus, the derivation of (47) would proceed as follows:
The affected participant (Ahmad in the structure above) is an argument of the root, but it moves up to Spec-v because of the EPP feature of this flavor of v. We assume that in cases of denominal verbs, the EPP feature is satisfied by a null DP (roughly interpreted as 'with someone') where the additional participant is implied in the verb semantics. In sum, the presence of the associative feature straightforwardly accounts for the range of interpretations of Form III verbs in EA and the fact that the patient may not directly participate in the event.
An additional benefit of this account is that perceived reciprocity with (mainly) transitive Form III verbs is the result of the interaction of the requirement for an additional participant and the main lexical semantic contribution of the root. Thus, for verbs like raasal 'correspond', ʃaarak 'share', and saabag 'compete', the perceived reciprocity is only a side effect of the lexical contribution of the root which requires action by the additional participant (i.e. '* I corresponded with Ali but he didn't send me any letters.'). 9 In discussing the EA verbal forms in the following sections, we will see that while a certain pattern of correspondences between verbal forms and interpretations can be established, in many cases these patterns are not straightforward, but rather allow for numerous exceptional forms which do not conform to the described patterns. This is also the case with Form III verbs. While many of them show this presence of an additional participant, a small number of them are pure transitive verbs and there are also some intransitive cases such as saafar 'he travelled' and ħaawal 'he tried'. We will mention these irregularities for each form but will delay the discussion of an account of these irregularities until Section 8.

Form IX
In terms of the nature of prosodic change, Form IX in both EA and MSA is marked by the gemination of the third root consonant: C1C2aC3C3. In MSA, this form expresses states reflecting colors and physical defects (Wright 1896;Ryding 2005). Both Wright (1896) and Ryding (2005) note that this form is rare in MSA. Kouloughli (1994: 207) (as reported in Ryding 2005: 579, fn. 1) reports a 0.5% occurrence of these forms out of the total number of Forms II-X in MSA. Similarly, Tucker (2010) observes that these forms are not common in Iraqi Arabic. This form is also nonexistent in Moroccan Arabic, although a counterpart of it is attested: CCaaC (Ali Idrissi, personal communication). Arbaoui (2010) argues that MSA Form IX is deadjectival, and that it is derived, not from a root that denotes a color or physical defect, but rather from an adjective, itself derived from the root. According to her, this adjective component is what regulates the prosodic position that leads to surface gemination.
Form IX in EA is mainly associated with colors and physical defects, but it also denotes any change in physical appearance. As such, it is inchoative, and may understandably be said to involve adjectival roots (i.e., roots which are also found in adjectives). As a matter of fact, any Form IX verb can be translated as 'become/be adjective'. This however does not necessarily mean that Form IX verbs are derived from adjectives; rather, both the verbs and the corresponding adjectives are built out of the same abstract root. In contrast to MSA, Iraqi, and Moroccan Arabic, Form IX is rather productive in EA. Some examples of Form IX in this language are given in (59)-(65). Note that word order is free: the subject can also precede the verb.
It can be assumed that Form IX verbs are deadjectival, derived by adding a little v layer to an adjectival projection. However, here, following standard assumptions in the Distributed Morphology framework, we assume that Form IX verbs are derived by adding a vP layer to the root projection, while the corresponding adjectives are derived by adding a little categorial aP. Thus, Form IX is derived directly from the root, which happens to also be related to an adjective, but not derived from the adjective itself. 10
Phonologically, on a par with all other templates, Form IX may be derivable via constraints on prosodic structure. We take it that it is phonologically realized by a sub-syllabic position, a mora, which is then filled in by the spreading of the third root consonant: C1C2aC3C3. Since empty moraic positions can be filled by either consonants or vowels, this analysis predicts that some varieties of Arabic may exploit the other option and have the vowel, instead, fill in the sub-syllabic position. Ali Idrissi (p.c.) points out that Moroccan Arabic verbs expressing colors and physical properties show this pattern: wsaaʕ 'it became wide' (EA wsaʕʕ), dˤʕaaf 'he became thin/weak' (EA ðˤʕaff), and xdˤaar 'it became green' (EA xðˤarr).
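The two ways of filling the extra moraic position can be made concrete with a small sketch. This is only an illustration of the surface shapes C1C2aC3C3 and C1C2aaC3 described above, not an implementation of the prosodic analysis; the function name form_ix and its fill parameter are hypothetical labels introduced here.

```python
def form_ix(root, fill="consonant"):
    """Spell out a triconsonantal root in the Form IX shape.

    root: three consonant strings (C1, C2, C3).
    fill: how the extra mora is filled (hypothetical parameter):
          "consonant" spreads C3, giving C1C2aC3C3 (the EA option);
          "vowel" lengthens the vowel, giving C1C2aaC3 (the Moroccan option).
    """
    c1, c2, c3 = root
    if fill == "consonant":
        return f"{c1}{c2}a{c3}{c3}"
    if fill == "vowel":
        return f"{c1}{c2}aa{c3}"
    raise ValueError(f"unknown fill option: {fill}")


# EA fills the mora with the third root consonant.
print(form_ix(("w", "s", "ʕ")))
# Moroccan Arabic fills the same position with the vowel.
print(form_ix(("w", "s", "ʕ"), fill="vowel"))
```

The sketch treats the root consonants as given; it does not model the segmental differences between the varieties (e.g. EA ðˤ versus Moroccan dˤ).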

EA Form IX
Syntactically, we assume that Form IX has the same syntactic structure as all other forms, with the exception that little v bears an inchoative feature (translated as become X). Thus, the structure of the first sentence in (66): mtann l-walad 'the boy became fat' would be as shown in (71).
(71)
In accordance with basic assumptions of syntactic composition of lexical verb meaning (see for example Harley 1995;Marantz 1997), the inchoative interpretation is mediated by an inchoative verbalizing v head, which does not select for its own external argument. The combination of the root and the v head provides the inchoative interpretation and the root argument raises to spec-vP and subsequently spec-VoiceP.
There are some roots that select other patterns along with Form IX. As the examples below show, the meaning of Form IX is consistent with the analysis provided.

Voice features: Reflexives/middles and passives
In the preceding section we discussed cases in which features on little v are realized via prosodic changes on the basic verbal stem. In what follows, we will turn to verb forms where morphosyntactic information is expressed by affixal material, namely t- (in either prefixal or infixal positions), n- and st-. The major claim in this paper is that these affixal segments are exponents of Voice and express reflexive, middle, and passive valency alternations.

Form V
Form V is morphologically characterized by the prefix t- and the gemination of the medial root consonant: t-C1aC2C2aC3. Wright (1896) adopts a linear view of this form in MSA and argues that it is formed by simply prefixing t- to Form II. Semantically, he argues that it bears the reflexive (e.g., taxawwaf 'to be scared') or the passive (e.g., tafarraq 'to scatter/be scattered') meanings and expresses the 'state into which the object of the action denoted by Form II is brought by that action, as its effect or result.' (Wright 1896: 36). However, he points out that reflexivity is not a very prominent meaning and proposes instead that 'intensity' could emerge in this form as in tafarraq 'to scatter/be scattered' whose more accurate interpretation would be 'to scatter/be scattered into many small groups or into various directions'. Wright argues that Form V has an 'effective' meaning, which emerges from the reflexive meaning, in the sense that an act is done to a person whether it is caused by another agent or by the person himself/herself, as in taʕallam which can mean either 'to be taught' or 'to learn'. Looking at the EA data in (72)-(73), we can see that Form V is also derived morphologically by adding the prefix t- to Form II causative verbs. The direct result of this morphological process is valency reduction, which may be interpreted in a number of different ways, as is often the case cross-linguistically. The most frequent interpretations are those of reflexivization (79)-(80) and passivization (81)
In all these cases a mediopassive interpretation seems to arise from the verb forms with an additional reflexive meaning arising from the lexical semantics of the root. Thus, while in (74) it is plausible that Fatima is both the 'teacher' and the 'learner' in the act of 'learning', it is equally possible to assume another teacher, in which case no reflexive interpretation arises.
Even in cases where a reflexive interpretation may seem more plausible, simple reflexivity tests show that the distributional behavior of these verb forms is not the same as with verbs with anaphoric objects, which involve a bound element (see Sells, Zaenen & Zec (1987) for such tests; see also Doron (2003) for their application in Hebrew verb forms with similar semantics). Importantly, the reflexive reading of the Form V verb does not involve a bound argument (83), unlike a verb with an anaphor object (84)
The examples above indicate that Form V verbs in EA retain only a sloppy reading. We assume, following Sells, Zaenen & Zec (1987), that this is the case because a process of detransitivization has taken place. In addition, the statue test provides a similar result (see Doron 2003 (2013), that the reflexive interpretation is not the realization of a reflexive argument as the morpheme t-, but rather that t- is the realization of a mediopassive feature of VoiceP, which suppresses the causer argument introduced by the causative v (manifested as gemination of the middle root consonant). This suppression occasionally (but not necessarily) can lead to reflexive interpretations. We follow Alexiadou (2014) in assuming three different Voice heads, active, passive and middle (what we have termed here mediopassive). The middle voice head is just the non-active equivalent of active voice, both manifestations of the Voice head in Kratzer's (1994) account. Alexiadou assumes that passive projects in a higher Passive head which takes the active voice string as complement. However, for EA we assume that all voice manifestations are located in VoiceP (see also Alexiadou's 2014 similar analysis for Greek). In such an account then, EA projects two voice heads, an active voice which selects the vP (unmarked, causative or applicative/associative) and attracts (or merges) the highest projected argument to its specifier.
Middle voice on the other hand suppresses this highest argument and allows for the next lower argument to be promoted. This results in valency reduction as in the examples in (72)-(73), where the causative argument introduced by v is absorbed by Voice and the lower argument is attracted to spec-VoiceP. This results in a detransitivization of the verbal predicate, deriving an unaccusative from 2-place predicate bases. If the base is ditransitive (as in 'teach s.th. to s.o.') then the resulting predicate is transitive (e.g. 'learn s.th.').
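The valency arithmetic of this paragraph can be restated as a minimal sketch, under the simplifying assumption that a predicate is just a list of arguments ordered from highest to lowest. The function apply_voice and the role labels in the examples are hypothetical and serve only to illustrate the suppression-and-promotion pattern described above.

```python
def apply_voice(arguments, voice):
    """Map arguments (ordered highest to lowest) to a surface frame.

    "active": the highest argument surfaces in spec-VoiceP.
    "middle": the highest argument is suppressed and the next
              lower argument is promoted to spec-VoiceP.
    """
    if voice not in ("active", "middle"):
        raise ValueError(f"unknown voice: {voice}")
    if not arguments:
        raise ValueError("at least one argument is required")
    if voice == "middle":
        if len(arguments) < 2:
            raise ValueError("middle Voice needs a lower argument to promote")
        suppressed, visible = arguments[0], list(arguments[1:])
    else:
        suppressed, visible = None, list(arguments)
    return {"subject": visible[0], "objects": visible[1:], "suppressed": suppressed}


# 2-place causative base -> unaccusative ('cut' -> 'get cut')
print(apply_voice(["causer", "rope"], "middle"))
# ditransitive base -> transitive ('teach s.th. to s.o.' -> 'learn s.th.')
print(apply_voice(["causer", "learner", "thing taught"], "middle"))
```

The sketch deliberately leaves the interpretation of the promoted argument (reflexive versus mediopassive) open, since in the analysis above that choice depends on the root rather than on the structure.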
Given that Form V verbs are usually intransitive because voice morphology absorbs a verbal argument, it is natural for the remaining argument to occasionally receive both actor-patient roles and occasionally patient roles (with an implied external actor) resulting in reflexive or mediopassive interpretations. This is in clear contrast with Form I and Form II verbs where the two roles are assigned to different arguments, as in (87) and (88)
Under our analysis, the structure of Form V will be like the one proposed for the causative form, with the addition of the middle morpheme under Voice, as shown in (91), representing the structure for the sentence in (77): l-ħabl tgatˁtˁaʕ 'The rope got-cut':
In this representation, we adopt Marantz (1984) and Kayne's (1988) analyses in which they propose that Voice is itself a middle morpheme assigned the external theta role. This is also compatible with recent approaches to the locus and function of voice morphology crosslinguistically, and provides a straightforward way to capture the different interpretations that the form receives (mediopassive, reflexive and so on):
(91)
In the above configuration, the prefix t- heads a Voice projection which instantiates non-active voice (the non-active equivalent of voice in Kratzer 1996). This mediopassive voice projects no specifier (see Alexiadou 2014 and references therein) but can host lower moved elements to its specifier (see also Schäfer 2008). Thus, we assume that the non-active voice morphology absorbs the higher causative argument (which in active voice cases merges in spec-vP) and provides the landing site for the lower argument of the verb, which moves to spec-VoiceP. This results in an intransitive structure which is underspecified for the semantic interpretation it can receive and thus it is interpreted as mediopassive or reflexive, based on the semantics of the root and speaker choice.

Form VI
Form VI is characterized by the prefix t- and a long vowel [aa] after the first root consonant: t-C1aaC2aC3. Based on its surface form, one may analyze it as a combination of the prefix t- and Form III. According to Ryding (2005: 543) and Wright (1896), in MSA this form bears mainly a reciprocal meaning. Additionally, it can have the meaning of pretense (e.g., tamaawat 'he acted as if he was dead'). Fassi-Fehri (2003)
In (6a), the verbal form saabaq 'raced' suggests that both Ahmed and Ali are 'involved' in the action/event, although there is a slight difference in the degree of involvement between the two participants. The subject, Ahmed, is more active than the object, Ali. On the other hand, the verbal form in (6b), tasaabaq 'raced (each other)', suggests that both participants are "subjects" of the same event (Fassi-Fehri 2003: 16-17).
In EA, Form VI is similar to Form V in that it attaches the same mediopassive prefix t-, but this time the input is Form III verbs, which we have termed associative/applicative in section 5.2. The function of the morpheme is similar to the one for Form V verbs: it absorbs the applicative argument that the v head has introduced, reducing the valency of the verbal predicate and allowing for the lower argument to project in spec-VoiceP. As a result, Form VI in EA can give way to two interpretations: reciprocal and mediopassive. When it involves two animate participants (i.e., a plural subject, be it a dual or plural DP (e.g., l-rjajil tðˤaarb-u 'the men fought with each other'), or two conjoined DPs (e.g., Ali wa Ahmed tðˤaarb-u 'Ali and Ahmed fought with each other')), it acquires the reciprocal meaning. By contrast, when there is no implied agent, a mediopassive (with or without an external implied actor) reading is obtained.
Consider the examples in (31) that illustrate Form VI in EA.
(94) tbaar-at al-Ain maʕa l-Weħda. (Active/Transitive) tasted Ali det-soup 'Ali tasted the soup.' Examples in (94)-(98) show the reciprocal meaning of form VI, and those in (99)-(101) exemplify the mediopassive reading. One may suspect that the reciprocity reading arises from the use of the preposition maʕa 'with'. However, the example l -rjajil tðˤaarb-u 'the men fought with each other' shows that this is not true, as the verb is used with the plural subject 'men' while the reciprocal meaning still obtains, suggesting that reciprocity stems (111) nserag l-ketaab.
(Mediopassive) was_stolen det-book 'The book was stolen.'
The function of the passive prefix n- is similar to that of the Voice prefix t- discussed in the previous two sections for Forms V and VI. It absorbs the external/highest theta role of the verbal predicate as well as accusative case assignment, allowing for the lower argument to raise to spec-VoiceP. Thus, we propose that Form VII would have the same structure as Form I with the passive morpheme n-, like all affixes, realized under Voice:
We take n- to realize mediopassive/middle voice and not passive voice as in Alexiadou's (2014) analysis where Passive heads a higher projection selecting the full active voice string as its complement. The prefix n- simply realizes the non-active equivalent of active voice Form I. That is, n- is in complementary distribution with t- in that the two mediopassive affixes select for different vPs: n- selects for the unmarked vP while t- selects for vPcausative and vPapplicative.
In (113), we provide examples of all the forms covered so far formed with the same root. The reader can see that the different meanings of these verbs are consistent with the analysis developed in the preceding sections.

Form III
Ahmed gaatˤaʕ Ali.
(Reciprocal) 'The men cut ties with each other.' Form VII l-xeetˤ ngetˤaʕ.
(Passive) 'The thread was cut.' As we have seen with other forms so far, Form VII also presents a number of derived forms which present non-expected mediopassive structure and interpretations. A few examples are provided below: (114) Ahmed nxemad.
(Intransitive) Ali shut-up 'Ali shut his mouth.' We will discuss these exceptions in detail in Section 8.

Form VIII
Form VIII is morphologically marked with the infix -t- after the first root consonant: C1taC2aC3. In MSA, Form VIII is mainly associated with reflexive, mediopassive, resultative and reciprocal meanings (Wright 1896;Ryding 2005: 565). In EA, Form VIII is also widely associated with mediopassive/reflexive/reciprocal interpretations (116)-(123). In fact, as Ryding (2005: 565) notes for MSA, Form VIII verbs "express a wide range of meanings that are difficult to predict". This is also true for EA:
(Reflexive) Ali killed-himself 'Ali committed suicide.' (Transitive) bought Ahmed car 'Ahmed bought a car.' As can be seen from these interpretations, Form VIII carries a reflexive/mediopassive meaning based on the Form I morphology. However, the example in (123) bears the unmarked meaning, or the simple meaning of 'to buy'. Since Form VIII is derived from Form I, we propose that they are built on the same structure with the infix -t-being realized on the Voice head in Form VIII: The fact that Form VIII is built on Form I straightforwardly explains the polysemy of the resulting verbal strings. Following our discussion in Section 4, Form I is the default form in Arabic and, as such, negotiates its meaning locally in the first phase domain, containing the root and any possible internal arguments. At this level, the polysemy of the root allows for numerous choices of meaning to be negotiated and fixed. Once the little v is projected, the meaning is fixed to one of these possible interpretations. This allows for a wide range of interpretations to be available. Once one of these numerous possibilities is locked, the meaning remains fixed with the addition of higher functional projections.
For other Arabic verb forms, this polysemy is not available, as the little v head which verbalizes the root carries additional features (causative, associative, inchoative) which force specific meanings when combined with the root.
Once again, to highlight the meaning of Form VIII, it is worthwhile looking at cases where the same root is used in Form VIII templates and other templates. This is illustrated by the root √rfʕ 'raise' in (125)

Form X
Form X is morphologically marked by the prefixes s- and t-. 11 On the surface, Form X appears as a combination of the prefix (es) st- and MSA Form IV 'afʕal' (as in adxal 'to cause to enter'): st-aC1C2aC3.
According to Wright (1896), in MSA this form is a combination of the prefix st- and Form I. He claims, however, that its meaning is derived from Form IV (afʕal). Thus, Form X could be either the middle or reflexive of Form IV: e.g., axraʒ 'to cause something/somebody to come/go out' as opposed to staxraʒ 'to extract something for oneself'. Wright also proposes that Form X indicates 'possession' or some sort of 'beneficiariness' meaning (e.g., aħall with underlying aħlal 'to cause it to be lawful', staħall with underlying staħlal 'he made it lawful for himself'). Additionally, it may express demands/requests, as in staʔðan 'he asked for permission', staɣfar 'he asked for pardon', stasqa (underlying stasqay) 'he asked for something to drink'. Wright also states that this form could have an unmarked meaning, as in staħja (underlying staħjaj) 'he was ashamed/embarrassed', staħaqq (underlying staħqaq) 'he deserved'. However, according to Wright, a close examination of this form should show that these verbs are reflexive in nature (e.g., staħja 'to make oneself ashamed', staħaqq 'to cause something to be due to oneself as a right'; Wright 1896: 45).
Fassi-Fehri (2003) claims that Form X in MSA is the reflexive of the causative Form IV. This reflexivity is realized either as a pure reflexive (e.g., stajqaðˤ-a 'to wake oneself up' derived from Form IV ajqaðˤ-a 'to wake someone up') or as a benefactive (e.g., staktaba-haa 'he made her write for his benefit' derived from Form IV aktab-a-haa 'he made her write'). Thus, for Fassi-Fehri, Form X is either reflexive causative or 'requestative' causative: stafham-a 'he asked to explain (make himself understand)'. In fact, Fassi-Fehri proposes that Form X involves a kind of double causation. For instance, in Ali stafham-a Ahmed (meaning 'Ali asked information from Ahmed'), what we have is: Ali caused Ahmed to cause him (i.e., Ali) to know/be informed.
In EA, Form X is not very common, and most of the verbs built on it are borrowed from MSA or other Arabic varieties (particularly, Egyptian). We concur with the previous analysis of MSA in that Form X is built on Form IV (causative in MSA) but propose that Form X in EA is associated with the general meaning of reflexivity. This is consistent with our analysis of reflexive/middle forms where the morpheme t- realizes the reflexive/middle feature under Voice. In (42)
This polysemy of Form X in EA can be explained if we assume (as in Fassi-Fehri 2003) that Form X is historically the reflexive of the causative Form IV. The problem with this analysis is that the base on which Form X is built, namely Form IV, is not available in EA. Given this gap, the only available analysis for us is that Form X is built by merging a default verbalizer (with no causative, inchoative, or associative features) directly with the root, and having the semantic interpretation of the form negotiated at root level with all available options that the root contains. The affix st- merges at the Voice level and provides the mediopassive/reflexive interpretation, which is frequently associated with this form.

The distribution of roots and templates in EA
The observation that consonantal roots in Arabic generally select only a subset of the available templates has long been made (McCarthy 1979), but the phenomenon itself has never been explained. As mentioned earlier, we assume a local selection relationship between a root and features of little v and features of Voice. In this section, we will present a few cases where the same root can appear with more than one verbal template and show that this distribution follows from the structure and analysis defended in this paper. When the meaning of a given form diverges from the general meaning of the root, that meaning is provided in a footnote. Each form is indicated by the Roman numeral that corresponds to it.

Discussion
Since the seminal work on Semitic verbal morphology in McCarthy (1979;1981), decompositional accounts of the Semitic verb stem have assumed a tripartite structure consisting of the consonantal root, the template and the vocalic melody, thus allowing for three distinct morphological entities to be mapped to different phonological tiers. While this analysis has provided an elegant answer to how the attested verbal forms are derived, it has nothing to say about the semantics of the derived verbs. More recent work (Arad 2003;Doron 2003;Tucker 2010;Wallace 2013;Kastner 2018, among others) has attempted to recast these three different morphological units into a syntax-based account of verb form derivation couched mainly in the DM framework. However, while the consonantal root has easily found its place in the DM framework, which assumes that the root is the base of any lexical derivation, and while the vowels in the vocalic melody tier have been assumed to merge at the VoiceP level, in all these accounts, the status of the template as a morphological primitive has been questioned. In fact, both Kastner (2018) and Tucker (2010) take the template to be a mere phonological byproduct, derived from a syntactic structure which provides the root and template vowels through hierarchies of soft violable constraints couched within Optimality Theory (Prince & Smolensky 1993;2004). Following Arad (2003;, we argue here that the template is in fact a morpheme, occupying the little v projection, in contrast for example to Kastner (2018) who takes the v head in Hebrew to always be phonologically null. The main argument for our approach comes from the systematic regularity of meanings that the different patterns carry. Thus, the patterns C1aC2aC3 for Form I and C1aC2C2aC3 for Form II in EA almost always result in equal or greater valency for Form II with otherwise very similar meanings derived from the consonantal root.
Form V on the other hand always detransitivizes Form II using voice morphology to absorb its external/highest argument.
An additional argument for the status of the template as a morphosyntactic entity comes from selectional properties. The prefix t-, which here we have assumed to project a Voice phrase with mediopassive properties, selects for specific templates: it selects Form II verbs to derive mediopassive Form V verbs with the higher causative argument absorbed and mediopassive/reflexive interpretations and it selects Form III (the applicative/associative form) to derive Form VI verbs, absorbing the additional applicative argument and deriving again mediopassive/reciprocal verbs. No other Form (e.g. Forms I, VII, VIII, IX and X) can be the input to any of these two Forms. This is explained straightforwardly in the account presented here as the prefix t-is just the non-active version of the Voice head in the causative and applicative constructions and selects for the specific v causative and v applicative which head Forms II and III respectively.
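The selectional pattern just described can be summarized in a small lookup sketch. It assumes the pairing of v flavors and non-active Voice exponents stated in the discussion of Forms V, VI and VII (t- with causative and associative v, n- with the unmarked v) and deliberately leaves out Forms VIII and X, whose affixes are treated separately in the text. All names in the code (V_FLAVOR, NONACTIVE_EXPONENT, DERIVED_FORM, nonactive_of) are hypothetical.

```python
# v flavor heading each base form (labels follow the text).
V_FLAVOR = {"I": "unmarked", "II": "causative", "III": "associative"}

# Non-active Voice exponent selecting each v flavor.
NONACTIVE_EXPONENT = {"unmarked": "n-", "causative": "t-", "associative": "t-"}

# Derived (detransitivized) form for each base form.
DERIVED_FORM = {"I": "VII", "II": "V", "III": "VI"}


def nonactive_of(base_form):
    """Return (exponent, derived form) for a base form, or None if the
    base form is not covered by this simplified sketch."""
    flavor = V_FLAVOR.get(base_form)
    if flavor is None:
        return None
    return NONACTIVE_EXPONENT[flavor], DERIVED_FORM[base_form]


for form in ("I", "II", "III", "IX"):
    print(form, nonactive_of(form))
```

On this view the restriction that only Forms II and III feed Forms V and VI is not stipulated per form: it falls out of which v flavor the prefixal t- Voice head selects.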
The main argument against a decompositional account of verbal morphology in EA as presented here may come from the idiosyncratic character of the derived verbal forms. As has been noted in numerous studies of verbal lexical decomposition crosslinguistically (see for example Hale & Keyser 1993;Travis 2000;Marantz 2001;Travis 2010 among others), syntactic operations that apply to the lower part of the syntactic spine, namely the VP and its immediate higher projections, derive syntactic strings which exhibit a number of idiosyncratic properties which have been attributed to the lexicon in the broader literature. One such property is weak productivity, i.e. not all processes apply to all forms in order to derive the target strings with the expected interpretations. A second property is a lack of semantic transparency: the output of a morphosyntactic operation at this level may exhibit unexpected semantics. Other lexical properties may include category-changing operations (nominalizations, verbalizations) and so on. These idiosyncratic properties contrast with syntactic operations that apply to higher verbal functional projections (e.g. Tense and the CP layer) in that the latter are productive (apply to all possible strings) and exhibit predictable compositional semantics.
In order to capture this contrast, Hale and Keyser (1993) and Travis (2000) introduce a division in the syntactic spine, assuming the existence of a lexical syntax (l-syntax) and a syntax proper (s-syntax), which are subject to exactly the same set of syntactic rules/operations. The only difference is that operations applying to the lower l-syntax domain are subject to lexical idiosyncratic properties, while rules that apply to s-syntax are not. Based on evidence from Malagasy and Tagalog, Travis (2000) assumes that the boundary between l-syntax and s-syntax is the EventP projection, a higher aspectual projection which binds the event argument of the verb. Malagasy and Tagalog

This is also the case with several other languages, including Eastern Armenian, Hebrew and German. Based on these data, Alexiadou (2014), building on work in Doron (2003; see also Alexiadou & Doron 2012; Alexiadou, Anagnostopoulou & Schäfer 2015), proposes three distinct Voice heads: active, middle and passive. She assumes that passive merges above VoiceP as a separate Passive head and that VoiceP hosts two distinct Voice heads: the (active) Voice of Kratzer (1996) and the non-active/middle Voice head. The latter does not project a specifier and thus exhibits unaccusative properties. However, its interpretation may vary depending on the semantics of the root: roots with natural reflexive semantics (like 'wash' or 'shave') will trigger a reflexive interpretation, while other roots will trigger inchoative or passive interpretations. As Alexiadou (2014: 34) notes, "(such a) structure […] is thus underdetermined for the semantic interpretation it can receive: […] depending on the type of root the structure contains, it can yield a reflexive or a passive interpretation. This crucially means that middle Voice is underspecified, which leads to ambiguity with the same root, unless the context provided further specification. The former interpretation is readily available with natural reflexive roots, the latter with natural disjoint predicates. Since this structure is underspecified, speakers are relatively free to choose an interpretation that would go along with it."

A final issue that needs to be discussed is how non-linear morphology such as the verbal template is linearized at PF. Linearization accounts of Semitic verbal morphology (such as Tucker 2010 for Iraqi Arabic; Kastner 2018 for Hebrew) are essential in order to understand how the morphosyntactic structures, which are independently necessary in order to capture the semantic interpretations of the derived verbal forms, interface with the phonological component. Both accounts use an optimality theoretic analysis for translating morphosyntactic structures into surface strings, the only difference being that Tucker (2010) discusses only citation forms, while Kastner (2018) provides a more detailed analysis of fully inflected verbal forms, including Tense and Agreement morphology. We have chosen to avoid any discussion of linearization here, as our focus is the internal morphosyntactic structure of the attested verbal forms and how it maps to their semantics. We just briefly provide an example of such linearization below, taken from Kastner (2018), for illustrative purposes. Kastner (2018) assumes a standard hierarchy of verbal functional projections with a root base, followed by a verbalizing, phonologically null v head and a VoiceP projection which can have three different flavors, depending on whether or not it requires a DP in its specifier (or remains unspecified). The root domain provides the consonantal root of the verb in Arabic, v verbalizes this root and provides a projection for the internal argument if present, and Voice hosts the templatic vowels. A separate head √action may be attached at the VoiceP level, imposing a [+human] requirement on the DP in spec-VoiceP.
Passive voice (if present) merges above VoiceP, followed by a complex Tense/Agreement head.

It is crucial for Kastner's analysis that, while all other heads in the structure have a morphological exponent, v is null. When the structure is sent to Spell-Out, the DM morphological component applies an operation of Pruning (Embick 2010), eliminating null heads like v and thus establishing adjacency between heads that are otherwise too far apart to license contextual allomorphy, like Voice and the Root. As a result, the idiosyncratic lexical properties of the root can license contextual allomorphy on the vowels in Voice. In addition, if the passive is not present, Tense (and Agreement) can also license allomorphy on the stem vowels. This is not possible when passive is overtly realized. We will not go into the details of the system here, but it seems to capture elegantly the distribution of vowels in the different Hebrew verbal forms based on specific rankings of numerous optimality theoretic soft constraints, which relate to phonological and prosodic properties of the relevant forms.
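The effect of Pruning on adjacency can be illustrated with a minimal sketch in Python. This is a toy model under our own simplifying assumptions, not an implementation of Kastner's (2018) system: heads are represented as a flat sequence, a null head is one with an empty exponent, and the names `Head`, `prune` and `adjacent`, as well as the placeholder exponents, are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Head:
    label: str     # e.g. "Root", "v", "Voice"
    exponent: str  # empty string = phonologically null (assumption of this sketch)


def prune(heads):
    """Return the sequence of heads with all phonologically null heads removed."""
    return [h for h in heads if h.exponent != ""]


def adjacent(heads, a, b):
    """True if the heads labeled a and b are next to each other in the sequence."""
    labels = [h.label for h in heads]
    return abs(labels.index(a) - labels.index(b)) == 1


# Placeholder exponents: "CCC" stands for a consonantal root, "V-V" for templatic vowels.
spine = [Head("Root", "CCC"), Head("v", ""), Head("Voice", "V-V")]

if __name__ == "__main__":
    print(adjacent(spine, "Root", "Voice"))         # adjacency before Pruning
    print(adjacent(prune(spine), "Root", "Voice"))  # adjacency after Pruning
```

Before Pruning, the null v intervenes between Root and Voice; once it is removed, the two heads are adjacent, which is the configuration taken to license contextual allomorphy.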
In such an account, the template is not a morphosyntactic primitive and is derived at the syntax-phonology interface. We will not discuss the analysis here, as it would have to be significantly adjusted to capture the EA data, where the non-active, mediopassive, reflexive and reciprocal forms involve additional morphology. We will only point out that dispensing with the template results in a peculiar coincidence: VoiceP happens to contain exactly the right number of vowels to avoid consonant clusters. If the root and Voice are completely different morphological primitives, then there is a priori no requirement that they have matching length descriptions. A three-consonantal root could easily combine with three or four vowels in Voice to create forms which could potentially be optimal candidates for the rankings of soft constraints imposed in Kastner (2018). The fact that this is not the case is at least surprising. Imposing such a restriction on the number of available vowels as exponents of Voice seems to us a way to bring back the template as a significant factor in the derivation of the verb stem.

Doron (2003) adopts a syntactic-semantic analysis in order to account for the compositional structure and meanings of Hebrew templates. In particular, she introduces two syntactic heads to capture the different meanings associated with those templates: agency and voice heads. She also suggests that the root and its arguments are optionally embedded under a light verb v which introduces the agent. Under Doron's analysis, causative is a possible value of the agency head, whereas middle and passive are values of the voice head.
As far as her analysis of the causative is concerned, Doron states that a causative verb should be interpreted as a single event which has a causer participant. She argues that there is no evidence for event decomposition in the causative. Therefore, there is no need to introduce a new "causing event" in addition to the main event denoted by the verb. She argues that Semitic templates provide supporting evidence for introducing the agency head, in that both causative and intensive templates characterize an event as an action. Thus, in Doron's view, the only way to distinguish between the events/actions introduced by the intensive and the causative templates is by considering the different thematic relations in each template: the causer role in the causative template and the actor role in the intensive template. Agency, therefore, is an important syntactic head because it is mainly what distinguishes between the causative and the intensive templates.
The analysis of the causative we defend in this paper is different from Doron's. Causative verbs in our analysis are interpreted as denoting two events: one is denoted by the causative feature of little v and the other by the root. Additionally, the agency head is not necessary, as there is no intensive template in EA. However, before going any further, it is important to note that the morphological realization of the Hebrew causative is different from that of EA. Specifically, the causative in Hebrew involves prefixes, which we claim are the realization of Voice in EA. Thus, the causative in Hebrew says something about the external argument, while the Form II causative in EA says something about the causing event. In sum, the causative in Hebrew is analyzed with information about the external argument in Voice, while the causative in EA is associated with information about the causing event in little v.
All the Hebrew templates are given in (138). As shown in (138), Hebrew has two passive forms: the intensive passive and the causative passive. It appears that the intensive passive involves vowel change, whereas the causative passive is realized by vowel change and the prefix {hu-}. Doron requires the presence of a voice head to account for the passive reading of the templates. She establishes that the passive morphology essentially modifies the verb, not just the root. This would explain why passive verbs in Hebrew are derived only from roots for which the active verb exists, unlike middles, which can occur on their own.
With respect to the middle, Hebrew has two middle templates, both involving prefixation. Doron posits that middles, just like passives, could be accounted for within the voice head. However, the middle morpheme is not a modifier of the verb, but rather a modifier of the root. Thus, middles, as opposed to passives, do not require a corresponding active form. Doron defines two types of middles: unaccusative and reflexive. She points out that middles do not introduce their own argument since they are modifiers of the root, but they identify their arguments with the arguments of the root. Consequently, the middle morpheme may assign the agent thematic role to the argument of the root, which explains why certain middle verbs are interpreted as reflexives.
Our account of middles in EA is consistent with Doron's. We assume that EA has no Passive head and that passive is just one interpretation of the non-active (middle) head, alongside the middle and reflexive (and possibly anticausative) interpretations. In this respect, EA behaves exactly like Greek in Alexiadou's (2014) account, where the non-active head is unspecified and can take different interpretations depending on the root. As we showed earlier, the middle morpheme is the realization of the Voice head. Yet this morpheme can also be interpreted as reflexive, depending on the semantics and argument structure of the root.
In sum, Doron provides an analysis in which the syntactic constructions of templates and the meanings associated with them are accounted for within a unified system that employs roots and two functional heads: voice and agency. The system we propose exploits roots and an equal number of functional heads: Voice and little v.

Conclusion
In this paper, we have discussed the verbal system of EA. We have defended a unified analysis where all nine verbal forms of EA are built on the same syntactic structure, modulo differences in the featural makeup of the functional heads in the structure. The different meanings associated with each verbal template follow from the combination of an abstract root (with its lexical semantic and syntactic properties) and a limited set of morphological units (namely, little v and Voice features). While Voice features generally emerge as prefixes, little v features are realized as changes to the prosodic templates. The analysis presented accounts for the regularities in the interpretations and uses of EA verb patterns as well as for the exceptions.
While the discussion we have provided relies heavily on previous approaches to Semitic verbal morphology, we depart from these approaches in a number of significant ways. We adopt the tripartite verbal spine, with a root base, a vP layer, and a VoiceP layer. However, we attribute slightly different morphosyntactic properties to these three layers. We enrich the status of vP by assuming that it can have inchoative and applicative flavors, in addition to the causative one previously assumed in the literature, and we place the prefixes that are attested in EA verbal forms in the VoiceP layer. This allows us to capture both the emerging interpretations of the derived forms and, more importantly, the observed selectional properties of these heads.
As a result, the EA data seem to indicate that a purely morphosyntactic account of Arabic verbal forms can be maintained, with the verbal template playing a clearly defined role in morphosyntax. This contrasts, as we have seen, with a number of recent approaches, which derive the verbal template from linearization principles couched in an optimality theoretic framework. The approach here re-establishes the Semitic template as a valid morphosyntactic entity that plays an important role in the morphosyntactic make-up and semantic interpretation of the verb. The status of the template as a morphosyntactic primitive is supported, at least in EA, by the empirical data. The morphemes that reside in VoiceP in EA select for specific templates (the causative Form II, the applicative Form III and so on), a fact that would otherwise be peculiar if the template were not part of the morphosyntactic structure of the verb. In addition, the derived verbal strings present mostly predictable semantic interpretations, albeit with certain idiosyncratic properties which are familiar from syntactic approaches to the lexical domain of verb derivation (e.g. l-syntax, Travis 2000). This latter fact proves problematic for certain approaches which challenge the status of the template as a morphosyntactic primitive.