SyMiLa and the Atlas linguistique de la France : A tool for the study of Gallo-Romance syntax

This paper presents different ways in which the Gallo-Romance data of the Atlas linguistique de la France (ALF) can contribute to syntactic description, typology and analysis. After briefly presenting the SyMiLa project, which aims at making such data accessible, it presents four case-studies. The first one focuses on wh-questions, which provide a horizontal perspective over the whole Gallo-Romance varieties. The complete data set shows how the dominant wh-structures distribute over the Gallo-Romance domain, and how the rarer ones argue for an articulated CP where que ‘that’ stands in Fin, and for a compositional nature of est-ce que. The next section turns to a vertical perspective on the data: it focuses on a very local phenomenon, complementizer doubling, and shows how the ALF reveals an optional but consistent behavior, which adds evidence to a head-movement analysis and suggests that complementizer doubling and recomplementation correspond to two different configurations. The last section deals with isolated data, that could not per se lead to any clear syntactic claim. Anecdotal and unsystematic as they may seem in the atlas, these data signal structures that, as further research shows, can enrich and/or challenge previous generalizations on interrogative clause typing and Negative Concord, respectively. In the latter case, they call for typologies and analyses of Negative Concord that include both optional concord and partial concord.


Introduction
The contribution of Gallo-Romance dialect syntax to syntactic theory, typology or, as concerns the Oïl dialects, even the diachrony of French syntax, is almost null -mostly for lack of documentation.Dialectal grammars or monographs on Gallo-Romance specific dialects focus on phonology and the lexicon, sometimes (verbal) morphology.They either eschew syntax or boil it down to a few descriptive lines. 1 This is all the more regrettable since France has an old dialectological tradition, and has published various dialect atlases.Some of them, like the Atlas linguistique de la France (ALF), contain syntactic data, although this is still a little-known fact.The SyMiLa project aims at making these data accessible to the community, at analyzing them and at conducting new syntax-specific dialectal surveys.Such data, indeed, are of great value for dialectology, diachronic linguistics, and linguistic analysis.Dialectal areas are mostly defined on phonological and lexical properties: how syntax fits in is still an open question.Another question is how the diachrony of standard languages interferes with or can be clarified by the diachrony of its kin varieties.In this paper, I will focus on the interest these data may present for comparative syntax.My first aim is to show that, in spite of their shortcomings, atlases constitute an important tool to investigate Gallo-Romance syntactic micro-variation, and that they can contribute to syntactic typology and theoretical debates in several ways.I first briefly present the ALF and the SyMiLa project.I then pick up a few examples to show how these data can be used as a starting point for different syntactic investigations.In the third section, I show how studying a syntactic phenomenon, here wh-questions, in a horizontal perspective, i.e. across the whole Gallo-Romance area, can help us test or refine known generalizations and analyses.In section 4, I illustrate another way to use the ALF data, which I call a vertical perspective.I exemplify it with the study of complementizer doubling, a "rare" phenomenon which surfaces only in a small sub-area of the Picard dialect, though it can be traced back to the Middle Ages.In section 5, I focus on interrogative particles and Negative Concord: I show how sporadic occurrences in the atlas may pinpoint larger patterns that challenge usual syntactic typologies and leave way for further theoretical investigations.

French dialect atlases and the SyMiLa project
France has known two waves of large-scale dialectal surveys that have resulted in the publication of linguistic atlases.The first wave of systematic surveys targeting all Gallo-romance dialects (Édmont and Gilliéron 1902-1910) took place at the turn of the 20th century and gave birth to the Atlas linguistique de la France (ALF).A second wave, known as the Nouvel Atlas linguistique de la France, was launched in the middle of the 20th century: it consists in a series of partially unrelated surveys targeting different dialectal areas, with a strong lexical and ethnological content, and usually devoid of any syntactic dimension.2I will focus here on the Atlas linguistique de la France.
Each of the 1920 maps in the ALF features the dialectal translation of a French word or sequence of words, noted in an ad hoc phonetic alphabet.The 1421 first maps display 639 different locations (each represented by one or, sometimes, a few speakers).The 499 next ones concentrate on the Southern or South-Eastern part of France, with a few additional locations in the North.These locations spread across the Gallo-Romance dialect speaking territories of France except Corsica,3 and extend to small areas outside of France: the Picard, Walloon and Gaumais speaking parts of Belgium and Luxemburg, the Occitan speaking area of Northern Italy, and the Romance speaking part of Switzerland.On a practical basis, this area can be defined as the places where a questionnaire could be handled in French by the informants, and yield data in some version of Occitan, Oïl languages, Catalan or Franco-Provençal.This atlas has served consistently for dialectological and diachronic works.However, they concentrate mainly on phonology and lexicology, as does the synthetic commented maps published by Brun-Trigaud et al. in 2005 or the dialectometric studies based on these fields (cf.Goebl 2003).
An ill-known or forgotten fact (cf.Tuaillon 1975) is that the ALF survey, unlike most of the later ones, had a syntactic component.This was obliterated when the sentences were segmented for publication as lexical maps.For instance, the first volume starts as illustrated in Table 1, pointing to sheer lexical maps.
However, as the field notebooks show, the data for map 3, abreuvoir ('trough'), have originally been elicited thanks to the translation of a full sentence: Il mène les boeufs à l'abreuvoir 'He leads the oxen to the water trough'.The data gathered for les boeufs ('the oxen') correspond to map 141 and for il mène ('he leads') to map 835.Same thing for map 6 and 7, originally from Je vais acheter deux chevaux à la foire et vendre les deux que j'ai achetés l'an dernier 'I am going to buy two horses at the fair and sell the two that I bought last year' and map 9 quel âge, originally from Quel âge as-tu?'How old are you'.
The SyMiLa project (http://blogs.univ-tlse2.fr/symila/) 4was set up, among other things, to make these syntactic data accessible.The elicited sentences were reconstructed by Guylaine Brun-Trigaud from the unpublished notebooks and the hints present in the published atlas.They amount to 180 full sentences, some of them grouped in coherent sequences.The corresponding data were then retyped using the IPA, 5 and entered into an ad hoc database.One aim of the database is to patch together the relevant parts of each sentence for each location, respecting the right word order.This entails two technical difficulty: first, some maps feature discontinuous parts of a source sentence: for instance, map 201 features ne…ni…ni ('prt…neither…nor'), which comes from Je ne pouvais ni avancer ni reculer 'I could neither go forward nor go back'; second, word order in the answers may differ among dialects.For now, 44 sentences out of 180 have been entered into the database (see Appendix 4).The sentences and each word in the French source sentence (called stimulus in the project) are tagged with morpho-syntactic properties, to enable targeted research in the database.The tagging of their dialectal versions is under way.Additionally, variants of Occitan words are linked to a standard version.The database, in its provisional state, with very basic research functions, can be accessed there: http://symila.univ-tlse2.fr/ALF.
What are the aims of making such data available?Since the syntax of Gallo-Romance dialects and its microvariation is, for the most part outside Occitan, unknown, the first aim is descriptive.The methodology for eliciting the data, i.e. recording the spontaneous translation of a stimulus in the standard official language, has well-known shortcomings (cf.Bucheli & Glaser 2002; Cornips 2002).However, I suggest, a first approximation is still better than nothing: it can trigger and guide further investigations.Not even mentioning the picture it can sketch of microvariation in such a large territory, it provides a first hint of structures that had gone completely unnoticed so far, and thus, positive data.These data can be of great import to syntactic typology, to the study of syntactic evolution in diachrony, and, to the extent that some other access to the dialects is provided, syntactic theory.Since the initial survey was not specifically designed for syntactic inquiry, the insight into syntactic micro-variation the ALF can provide may seem 4 SyMiLa is the current name of the project.It stands for Syntactic Microvariation in the Romance Languages of France. 5 Another ANR-funded project, ECLATS, (2015-2020), https://eclats.imag.fr/wordpress/?page_id=8,aims (among other things) at developing OCR tools able to retrieve and organize data from the maps, with each answer linked to the right location number.If it succeeds, it will help enrich and check the SyMiLa database.modest.Nevertheless, the research conducted so far shows that it can have immediate contributions, that pave the way for deeper investigations.It was in particular crucial in the planning of the second part of the SyMila project, i.e. the conduct of new surveys, based on a fully syntax-oriented questionnaire, to further our insight into Gallo-Romance dialect syntax.The latter is centered on 5 syntactic targets, for which the ALF data have already shown they have non-trivial implications for syntactic typology and analysis: negation and Negative Concord, subjects, clitics, the structure of CP and agreement within DPs.The methodology is, to a large extent, kept similar to the ALF one (translation of a stimulus sentence), in order to get data more comparable to the ALF ones.The new questionnaire, however, provides contexts missing in the ALF to assess minimal pairs and control various parts of the potential analyses, as well as judgement tasks for limited crucial points.The latter can be optionally extended while passing the questionnaire.Unlike the first surveys, the new ones have been recorded: the sound files will be segmented and accessible in the database.About 50 new surveys have already been conducted,6 most of them transcribed and segmented, and they will be added to the same database.
In the next sections, I will focus on some findings provided or fostered by the ALF data.It will illustrate several ways in which they can contribute to syntactic studies, refine typologies, add independent ground to theoretical proposals and challenge others.

A horizontal perspective: Testing theoretical proposals
The surveys on which the ALF was based have interesting characteristics, especially compared to the later regional atlases.They were conducted, in a small time-span (less than 4 years), by the same interviewer, with the same questionnaire (only expanded for the Occitan area) and the same methodology, on more than 600 locations.As a consequence, lots of comparative data are available in a single place.The method, i.e. the spontaneous translation of a stimulus sentence phrased in the normative standard variety, has flaws.A major risk, especially given the very strong stigmatization of dialects in France, is that informants may produce calques of the stimulus.However, when a speaker, or, even more clearly, several speakers from a given series of neighboring locations, produce translations consistently differing from the stimulus language, there is little doubt that the data are reliable: on a regular basis, corpora and later surveys have confirmed it so far.When a given syntactic phenomenon is well-implemented in the atlas, then, we can get a pretty good image of its geographical extension, and of its variation.
Such a case is provided by direct wh-questions in the atlas.The ALF indeed features six wh-questions, ranging over the 639 survey points. 7They are given in Table 2; the data for the first three have been entered in the SyMiLa database, and can be checked on-line.
Note that the stimulus sentences only resorted to the French most standard interrogative structure at that time: wh-words immediately followed by Subject Clitic Inversion (henceforth SCLI).The data corresponding to the 3834 answers show that, while a calque or other influence effect of the stimulus on the translation cannot be discarded, it seems far from prevailing.The answers indeed display much variation, both in the CP and the TP domain.An individual speaker can produce consistently one structure for TP and CP throughout the six questions, or up to 6 different patterns (the latter case most frequently in the transition zones between known dialects).
Within the TP domain, three different patterns can be found: SVO order, like in declaratives; SCLI; or prodrop, for which SVO/VSO cannot be directly inferred from the plain data.Within the CP-domain, several patterns may also arise, according to three parameters: -the presence (or absence) of Doubly Filled Comp (henceforth DFC) interrogative clauses, in which the wh-phrase is followed by a complementizer, the dialectal version of que 'that' -I will refer to it as que for all dialects.8-the presence (or absence) of a dialectal version of est-ce 'is this', which can be reduced to /s/.In standard French, est-ce is always followed by que, and has been given various analyses.By a simple rule of thumb, if the choice of the CP structure and its combination with a TP one was unconstrained, we could expect up to 30 overall different combinations of similar frequency.Only a subset of the potential combinations of the CP and TP patterns are actually attested: they are given in Table (3), with one example for each pattern,9 followed (or preceded) by the ALF number of the relevant location.This overall view can give rise to several questions.The first one is: are these patterns geographically constrained?To a large extent, the answer is yes.The second one is: are these patterns structurally constrained?To some extent, the answer is also yes.A third question is: are they equally productive? 10 The answer is no.I will first consider questions 1 and 3, then question 2. 11 10 I assume that /s/ is the reduced form of est-ce. 11Such Walloon constructions, which occur at the second person, can be analyzed either as partial propdrop, /s/ being construed as a verbal person inflexion, or as SCLI, /s/ being an allomorph of the 2 nd person clitic tu/te when enclitic.Three patterns are clearly dominant among Gallo-Romance dialects: [1] plus SCLI, corresponding to 1082 answers, [1] plus prodrop with 1118 answers and [2] plus declarative order with 1243 answers.They are also geographically constrained: WH + prodrop occurs, unsurprisingly, in the areas that are generally prodrop in declarative clauses (roughly, the Occitan areas), and in Walloon second person questions.DFC with declarative order appears in a large Northern half of France, SCLI in its Western and Eastern central parts and in Walloon (3 rd person), as shown in Figure 1: the darker the dot, the more consistent the usage.
The presence of est-ce que (199 answers) and c'est que (12 answers) is also geographically marked, though not as clearly for est-ce que, as Figure 2 shows.
However, not all variants are geographically-driven.So-called "intonative questions" (74 answers), in which the Wh-P is immediately followed by a declarative order, can be found all over the place, though only sporadically in the prodrop zone, cf. Figure 3.
Another way to look at the data is to ask whether we can draw any structural insights from them.A first generalization is that a que can either precede or follow c'est (or both), cf.[4], [8] and [10], repeated in (1). (1) a.If we consider these data, irrespective of their dialectal location and their frequency,12 a first clear result is that SCLI only happens when no que is present, corresponding to lines Though this is only positive data, the massive distribution gives additional support to Rizzi and Roberts (1989) proposal, based on standard languages, that SCLI proceeds from V/T to C movement.In a cartographic version of this idea, such as proposed by Poletto &  Pollock (2004), the que in all these cases would stand in Fin°, where V/T raises to in its absence.Picard massively resorts to DFC; the Picartext database of Picard written texts (https://www.u-picardie.fr/LESCLaP/PICARTEXT/Public/index.php)shows, for example, that infinitive wh-questions never display que after doù 'where', cf.(3), confirming that que is linked to the finite form of the verb: 13 (3) A doù qu' ch'étoait?Doù s'acoufter?A doù s'éjuer ?where that it was where shelter.infwhere play.inf'Where was it?Where to take shelter?Where to play?' (Collectif, Ponthieu-Vimeu) Note that when est-ce is not followed by a que, cf.patterns in line [5], SCLI is possible in some dialects, a fact that, to my knowledge, had not been illustrated yet.This strongly suggests that est-ce que is not a particle in the corresponding dialects, and that in (4) est-ce is not followed by a silent version of que: T raises to Fin in the absence of que.As for pattern [8], it allows SCLI after c'est, while c'est is preceded by que.The example given is the only one to illustrate the pattern in the ALF, I repeat it in (5): 16 (i) a.
[  (5) [ki k ʃ e voe ty k ʃɑ soeʃ] 273 who that prt want you that this be.subj'Who do you want it to be?' The que preceding c'est, then, shouldn't proceed from the (Fin°) position V/T moves to.Actually, this example, as well as pattern [10a], comes from a small part of the Picard area (Artois/Ternois), in which the CP-domain has several particular properties (see section 3).The que could receive several analyses, for instance qu' ch'est being the grammaticalized particle, or que being the head of the Wh-P or the ForceP.Note also that this is a longdistance question.I leave these single pattern aside for now.
A single analysis could capture almost all the ALF data despite their diversity: I propose wh-questions in Gallo-Romance dialects all share the structure in ( 6), where IS stands for Informational Structure -but nothing prevents, at least in the dialects illustrated in Figure 2, to consider it a mere variant of est-ce, an interrogative particle.
Under this view, dialects vary as whether que can/must be present or spelled out in Fin, whether the interrogative particle est-ce must/can be present or spelled out in Force to type it as a question, and whether V/T to C must apply in the absence of que.In this analysis, the term "Doubly Filled Comp" structures is not well suited: I assume that que is not the head of the Wh-P, but stands in another projection.This seems confirmed by the fact that, as some answers in the ALF show, the wh-word and que can be separated by discourse particles, cf.(7-9) just as est-ce and que, cf. ( 10).Besides, patterns [7] and [9], WH que est-ce Ø and WH que est-ce que respectively, are then rightly expected to be unattested. 17Under this view, dialects vary as whether que can/must be present or spelled out in Fin, whether the interrogative particle est-ce must/can be present or spelled out in Force to type it as a question, and whether V/T to C must apply in the absence of que.In this analysis, the term "Doubly Filled Comp" structures is not well suited: I assume that que is not the head of the Wh-P, but stands in another projection.This seems confirmed by the fact that, as some answers in the ALF show, the wh-word and que can be separated by discourse particles, cf.(7-9) just as est-ce and que, cf.(10).Besides, patterns [7] and [9], WH que est-ce Ø and WH que est-ce que respectively, are then rightly expected to be unattested. 17 where PRT you go 17 One anonymous reviewer objects to a Wh-P higher than Force.This indeed departs from Rizzi's (1997; 2001) proposal for the left-periphery of questions. 17One anonymous reviewer objects to a Wh-P higher than Force.This indeed departs from Rizzi's (1997; 2001)  proposal for the left-periphery of questions.Along his lines, Wh-P s are expected to stand in FocP, as they are focal in some way; alternatively, they could raise to ForceP to type the sentence.However, these two assumptions are far from consensual cross-linguistically (see a.o.Though many things remain to be investigated more closely (in particular, among each given dialect, how the patterns above distribute and which predictions are made), the various structures corresponding to the elicited wh-questions in the atlas (up to 16, including est-ce que and c'est que questions) as well as the unattested ones argue for an articulated CP-domain.They also give additional evidence to the idea that the complementizer in "Doubly Filled Comp" constructions and the verb in Subject Clitic Inversion cannot co-occur since they target the same position -they provide more evidence that this position must be restated as Fin°; they also argue for restricting the interrogative particle to est-ce: the que that follows it is not part of it, and has the same properties as que in DFC.

A vertical view: Small scale data and complementizer doubling
Another way to use the ALF data is to focus on a given small subset of neighboring locations, that may appear to form a dialectal unit.Thus, we can study the whole productions of a few individual speakers, that display phenomena that had gone unnoticed.This is the case for a small Picard zone in Artois, around Saint-Pol-sur-Ternoise (ALF 284).Most dialects of Picard display systematic (or frequent) Subject Doubling (see Auger 2003a; b; Appendix 3): any subject DP is doubled by a subject clitic.The ALF has several temporal clauses, which in this area, like in some other dialects (see Appendix 1), are introduced by a doubly filled complementizer, quand que 'when that'.Two of the ALF temporal clause have a DP subject -I give them in (11).(11a) corresponds to maps 143, 458, 548A, 548B, 867, 1109 and 1405, (11b) to maps 573, 514, 469 and 972.
(11) a. Il buvait moins quand sa femme vivait encore.he drank less when his wife lived still 'He used to drink less when his wife was still alive.'b.Quand mon fils sera grand, je l'enverrai à Paris.when my son be.fut big I him=send.futto Paris 'When my son will be older, I'll send him to Paris' The ALF also has a partial map corresponding to a relative clause with DP subject, (la charrette) que le domestique a chargée '(the cart) that the servant loaded' (map 1537), and an embedded exclamative with DP subject, (regarde) comme les arbres en étaient chargés '(look) how the trees were loaded [with apples]' (maps 310, 52, 513, 240).
In the small zone we are considering, these may be realized in two fashions: the que usually following quand 'when' can be delayed after the subject DP, as in (12), suggesting that, as for questions (cf.section 2), it stands in Fin, the subject DP having moved to the CP-domain: (12)  [kɑ̃mɛ gɑrʃɔ̃k i srɑ grɑ] (287) when my son that he be.futolder 'When my son will be older (I will send him to Paris.)'However, it can also be repeated: in this case, both instances of que frame the subject DP, as in (13). (13) [kɑ̃k mɛ fju k i sro grã] (286) when that my son that he be.futolder 'When my son will be older (I will send him to Paris.)' The extension and rate of this phenomenon are given in Figure 4.
Since the two sentences are not enough to understand what is going on, and since this construction is not mentioned in the (scarce) Picard grammars and monographs, these ALF data just provide an impulse to investigate further.Corpus research confirmed that these structures are attested (though optional) in this area and, to a lesser extent, in neighboring areas, and are present in any kind of finite embedded clauses (see Dagnac 2012).It also showed that they are not limited to doubled subjects: in any kind of finite embedded clause, any left peripheral item, whatever its phonological weight or Information Structure status, can trigger ( 12) or ( 13 is expected under Rizzi's (1997) analysis of the extended CP: infinitives lack the higher C position (identified as Force here), so de would have no higher position to go to.Second, there is an exception to complementizer doubling in embedded clauses: the relative pronoun qui is not followed by que, either single or doubled.Relative qui is nominative hence [+Fin], but it is also plausibly [+Sub] since it can only head embedded clauses in this dialect. 18It is then able to check both the Fin and the Force specification through wh-movement: no que is needed, since qui checks the tense feature itself on Fin, and the Subordinate feature on Force.
This phenomenon also raises non-trivial questions for diachronic perspectives within the Oïl dialects, including French.It is indeed attested, though sporadically, by Flutre  (1970) in the first half of the 17 th century, for Picard in a slightly larger region, and can be traced back at least to 14 th century charters of the Douai/Arras region, cf. ( 14  and Foulet (1919: 266-267,  §402) presents it as frequent, though he illustrates the fact with four citations, three of which are from Picard literary texts, and one (La Chastelaine de Vergi) has been expertized as coming from the Paris region but with dialectal features from Picardy and Burgundy.Same thing for Buridant (2007), who quotes one Picard literary text from the XIII th c., and one Anglo-Normand text.Given that Picard was, after Anglo-Normand, one of the first Oïl dialect to have a written tradition (Lusignan: 2011;  2012), these facts suggest that the geography and diachrony of French and Oïl dialects would greatly benefit from being studied in parallel.It is also attested for French in the 16 th (cf.Gougenheim 1984: 149) and early 17 th centuries (Haase 1898: 136.B), then seems to disappear, at least from grammars.Complementizer doubling is also attested in Old and Middle Occitan (Lafont 1967).Such a syntactic feature could be a Romance feature that got lost in French and most Oïl dialects but kept being transmitted in a sub-area of Picard; or it could be the case that literary French inherited a feature from dominant written models from Artois & Occitan and lost it since.In any case, considering the dialectal syntactic data from the ALF as seriously as the lexical and phonological data should provide an impulse for diachronic studies.
The ALF can then also act as a small scale tool to identify infrequent, local, but theoretically important phenomena: in this case, it adds independent evidence that the complementizer can undergo head-movement within an articulated CP-domain, and confirms that, within the Romance languages, the typology of complementizer doubling (Southern Italian dialects, Ternois Picard) and recomplementation (Spanish, Portuguese) relies on different syntactic structures, correlated to different distributional properties, while bringing to light new questions to investigate within diachronic studies.

Refining typologies and evaluating competing analyzes
Another way in which the ALF data can contribute syntactic analysis is by enriching or refining typological generalizations, and, plausibly, the theoretical analyzes they may inform.The previous sections have shown how they may argue for an at least semi-free combination analysis of est-ce que, or for a sub-typology of complementizer doubling.In this section, I will deal with two more cases: the first one deals with the link between polar and wh-questions interrogatives, the second one with negation and Negative Concord (henceforth NC).

Wh-movement and yes-no particles
Cheng (1991) proposes that the typological distinctions among languages with respect to the formation of wh-questions are directly linked to the presence/absence of interrogative particles: the presence of particles correlates with the lack of wh-movement.In particular, she states and argues: "No language has yes-no particles and also whmovement" (Cheng 1991: 37).Her generalization is twofold (Cheng 1991: 21-22 -her  (3) and ( 4)): (i) In situ languages have special markings in yes-no questions.(ii) Languages with special marking in wh-questions are in-situ languages.
Data from the very same dialect we considered for complementizer doubling, in the Ternois region, show that it is a wh-movement dialect.Neither the ALF nor the corresponding written corpora display in-situ questions, and the present speakers we could test disallow them.Yet, it has a specific marking in yes-no questions.The ALF offers a few yes-no questions, whose stimuli are listed in (15 A confirmation that jou is an interrogative particle can be found in map 659 Est-ce que tu viendras?'Will you come?' of the Picard regional atlas, ALPIC (Carton & Lebègue 1998), and in written corpora: they all show that jou can also act as a verbal interrogative particle, cf. ( 18).It competes with a -ti or -tu verbal particle as it exists in Québec French ( Vecchiato 2000; Vinet 2000; Morin 2009) and in Picard: Il a jou cryi ?(Picartext: Devraine, Vermandois) he has prt shouted 'Did he shout?'Note that jou is, and can only be, found in direct questions (unlike English whether, for instance) and thus qualifies as a typing particle as defined by Cheng.Besides, Picard doesn't license multiple wh-fronting (vs Polish) though it allows multiple questions (vs Italian): along Cheng's lines, it then appears to be an actual counterexample to (ii), of a sort that is not discussed in her thesis.The in situ languages she examines can realize the same or a different interrogative particle in direct wh-questions too, or display a covert one in these contexts.As pointed out by Emrik (1966/69: 4), based on ancient written texts from the surrounding Picard areas, the typing particle jou could optionally combine with several wh-words in previous stages of Picard, as shown in (19). 199) a. Voyon èn peu c'ment jou qu' os frons ?(18 th c., Corbrie) see.subj.P4 a bit how prt that we do.fut'Let's see how we will proceed.'b.Quand jou k' ô vôz avon vu sans lôji ?(19 th c., Cambrai) when prt that we you have seen without home 'When did we see you homeless?' c.Quoi jou qu' os foessez lo ?(20 th c., G. Leroux d' Etaves-Bocquion)  what prt that you do here 'What are you doing here?'However, since this wh-+jou construction does not appear in the ALF, nor in the corpora we checked for the Ternois region for the same period onwards, we may assume that Ternois Picard currently checks [+Q] with the typing particle jou in yes-no questions and 19 Current texts in these areas may also show sporadic use of jou in wh-questions, but, as far as I can tell, always in association with quand, cf.(i-ii), and also in indirect questions, cf.(iii), though I couldn't find it in temporal clauses: this points to the grammaticalization of quanjou as the interrogative form (vs quand in temporal clauses).Further investigation is needed on this point.
(i) Quanjou qu'tu viens ?when=prt that you come 'When will you come?' (Calais, Amiénois) Quanjou, pi d'où qu'il est mort ?D'où qu'o l-l'o intérrè ?when=prt and where that he is dead where that one him=has buried 'When and where did he die?Where was he buried?' (Collectif, Ponthieu-Vimeu) O n'savoait point quanjou qu' él darin.necandelle al alloait s'éteinde… one prt=knew not when=prt that the last candle she was going to go out 'We didn't know when the last candle would go out…' (Calais, Amiénois) by a covert particle in wh-questions.As suggested by the analysis in section 2, est-ce may alternate with this covert typing particle in wh-questions.
If obligatory wh-movement in wh-questions of one (and only one) Wh-P is assumed to be triggered by Clause Typing, in order to check an interrogative feature, Cheng's generalization and analysis don't hold.However, under a cartographic approach, it has been assumed that wh-movement may happen to check both a clause-typing feature, [+Q], in ForceP and a wh-feature in Wh-P (Poletto & Pollock 2004).The Ternois Picard data support the claim that wh-items must only check a [+WH] feature, [+Q] being checked in diachrony by jou, and in synchrony either by a silent version of jou or by est-ce.Cheng's generalization can then be maintained under the strict reading in which "in situ languages" correspond to languages in which wh-words don't check the [+Q] feature.This can apply to "surface" in-situ languages or to languages where obligatory wh-movement applies (only) for other reasons -a proposal already made by Cheng for some wh-fronting languages.

Negation and Negative Concord
All dialects of Picard also challenge the typology of Negative Concord languages.The ALF includes 16 maps with propositional negation.Very sporadically, they suggest that Picard speakers may have more than one negative marker.This is in fact largely confirmed by the ALPIC data and corpus work: many speakers have (at least) 20 two markers for propositional negation (cf.Dagnac 2015).I will lemmatize one as point (though it may have several phonetic and graphic realizations), the other as mie.The first one is way more frequent and can be found in all usual syntactic contexts; the second one seems to be restrained to tensed clauses.Both are m-negations in the sense of Garzonio & Poletto (2009; 2010), i.e. they have both been grammaticalized from former minimizers, like French pas. 210) N'te casse point t'tête !A n'va mie dureu.prt=you.datbreak not your=head it prt=goes not last 'Don't worry!It won't last.' (Vasseur,Vimeu) 22 When speakers have mie, unlike French pas (Corblin et al. 2004), it enters Negative Concord 2 2 (henceforth NC) with all n-words. 23I show this in (21) with a single author from the Vimeu region.As (22) shows, the n-word can also be preverbal, unlike what happens in Quebec French, where preverbal n-words preclude Negative Concord (cf.e.g.Burnett et al. 2015): 20 Some speakers also have pas, which may in some cases be constrained to some syntactic contexts; when it is unconstrained, it usually works like French pas, and is excluded from Negative Concord.See Dagnac (2015)  for details. 21In Old French, mie and point also competed with pas.Note that Picard mie differs on crucial aspects from Old Italian and dialectal mica (Garzonio & Poletto 2012: 139-141): for instance, it cannot be clause-initial and it can appear without the preverbal particle ne.It also differs from Lorraine mie (Burnett 2015). 22Unless otherwise specified, all examples in this section are from the Picard written corpus, Picartext.
The reference gives the writer's name, followed by the geographic area. 23All dialects of Picard also have negative spread, in which multiple occurrences of n-words yield a single semantic negation.Besides, their n-words have a negative content of their own.They also seem to have double negation, in the right pragmatic conditions, and speakers seem reluctant to use them in NPI contexts (if-clauses or questions, for instance): fieldwork is under way to check these last two points.Note that, though Picard retains more preverbal ne than other Oïl dialects (Auger & Villeneuve 2008), ne cannot be negative on its own and the presence of ne is not compulsory for NC or negative spread to occur, cf.(i) and (ii) respectively, and Dagnac (2015).
(  (Vasseur, Vimeu)   However, unlike what happens in some standard languages, whether they are non-strict or strict Negative Concord languages (Giannakidou 1998), i.e. whether the presence of the negative marker depends on the syntactic position of the n-word or not, Negative Concord is never compulsory in Picard.Dagnac and Burnett (2016) show that, for a same speaker, and putting aside the cases where more than one n-word appears in the sentence, it can range from 3% to 90% of the occurrences involving a n-word, according to the n-word involved.In this respect, Picard patterns with Quebec French: together with other languages and dialects like varieties of Occitan or Catalan, they show that the strict vs non-strict concord languages typology needs refining, yielding at least a four-way distinction: "compulsory strict NC" (Romanian), "optional strict NC" (Picard), "compulsory non strict NC" (Italian), "optional non strict NC" (Québec French).Besides, analyses of NC must account for this optionality.But Picard argues for an even finer-grained approach to optionality, as shown by Dagnac (2015).
The above generalization was indeed based on the behavior of the less frequent negative marker, mie.When it comes to the main negation, point, things are different.For a subgroup of speakers, point (vs mie) cannot enter NC at all: this gives us a second instance of negative markers that are unable to enter NC, as is the case with French pas (and Picard, for those speakers that also have it).It appears in line with theories of NC that, some way or other, place the responsibility for NC in the semantic or syntactic features of the negative marker, rather than in those of the n-words (see e.g.De Swart & Sag 2002 for an implementation).But some Picard speakers present an even more challenging use of NC.While they still have (optional) concord with mie and any n-word, they have (optional) selective concord with point.By this, I mean that point can enter NC only with a subset of n-words: point is allowed to co-occur with the nominal n-words like parsonne 'nobody' or rien 'nothing', but it is disallowed with the temporal adverbial ones jamais 'never' and pu 'no longer'. 24This is hardly compatible with analyses in which NC relies on some syntactic 24 As far as we can tell, point is compatible with neune pert 'nowhere', cf.(i) at least for one speaker.But we have very few occurrences of neune pert, so its behavior for all partial NC speakers remains to be checked.
(i) O n'érplante point neune pèrt !one prt=replants not nowhere 'One doesn't replant anywhere!' (Vasseur, Vimeu)  or semantic version of resumptive quantification (cf.a.o.Zanuttini 1997; De Swart & Sag  2002): in their present state, these analyses do not predict that NC could be sensitive to the particular lexical properties of specific n-words.Conversely, analyses in which NC proceeds from the properties of n-words, for instance by assuming that they are ambiguous between negative quantifiers and Negative Polarity Items (see e.g.Herburger 2001) could account for this behavior of point: it could be the case that parsonne and rien are ambiguous, while jamais and pu are not: the latter, being only negative quantifiers, are predicted to yield double negation when combined with the negative marker.However, such an approach wrongly predicts that the same pattern would occur with mie. 25 Interestingly, Garzonio and Poletto (2012) have shown that Old Italian niente presents a different Negative Concord pattern according to its argumental vs non-argumental usage, a pattern reminiscent of the Picard opposition between, on the one side, argumental parsonne and rien, and on the other side non argumental jamais and pu. 26In its non-argumental use, niente triggers obligatory Negative Concord; in its argumental use, niente triggers NC only optionally, as does mai 'never/ever'.The authors explain this pattern in structural terms: only positions occupied by the adverb (between AspP and VoiceP) can yield NC, while VP internal positions don't yield NC. The merge position then bans NC with argumental niente.Seemingly optional NC with argument niente only occurs when it can move to the adverbial position.However, this analysis cannot straightforwardly be extended to Picard, since we get the inverse generalization: in Picard, with point, NC is optional with arguments and excluded with adverbials.Besides, in Picard, rien can move to the complex T domain above the past participle verb, as in ( 23).However, it can also stay within an argument infinitive, as in ( 24), or even in a VP internal position, as in (25).As (23b), ( 24) and (25b) show, its position does not influence its ability to enter NC -in particular, cf.(25b), NC obtains even when it is in a merge argument position.( 23 'We never did anything as beautiful since we started painting.'(Vasseur, Vimeu)   b. O n'a poé bzoin d' cangeu rien.one prt=has not need to change nothing 'We don't need to change anything.'(Vasseur, Vimeu)  Very sporadic ALF data can then be used to detect an unexpected syntactic pattern or variation.In this case, the main contribution of the ALF was to suggest lexical variation: some Picard speakers use mie instead of point.But coupled with other tools of investigation, such as localized corpora, they contributed to the uncovering that these two markers have contrasted usages: one of them, mie 'not', allows (optional) Negative Concord, while the other, point 'not' either rules it out, or allows it only with a subset of n-words.This partial concord, though in some ways converging with Quebec French Negative Concord, differs from Old Italian partial concord -thus adding to the growing bulk of evidence (Déprez & Martineau 2004; 2006; Burnett et al. 2015; Dagnac & Burnett 2016; Franco  et al. 2016) that the typology and analysis of Negative Concord must be refined to deal with both optional and partial concord, even in the Romance languages.

Conclusion
Though the ALF was not specifically conceived to study dialectal syntax, and though the methodology used to collect the data is not perfect for theoretical analysis, I have shown that it can still be an invaluable tool in a domain that dramatically lacks literature.There are many ways to use this tool.First, answers to the same stimuli over a territory and/or a given syntactic structure, produce consistent data, concerning for instance Doubly Filled Comps in Wh-questions, prodrop or subject doubling: they can give us a first rough sketch of the geographical domains where a phenomenon is expected to happen.This kind of large-scale, "horizontal" view, which the SyMiLa group will keep on establishing, can be capitalized upon at least in two manners.First, it can signal where we should go and investigate in a finer-grained way in priority: given the extent of the work to be done in order to describe and analyze the syntax of Gallo-Romance dialects, this per se is welcome.As I showed in section 2 for wh-questions and 4.2 for polar questions, it can also help draw and test generalizations on the structure of grammar, confirming the need for an articulated CP, restating what "Doubly Filled Comp" or est-ce may be, severing clause typing from wh-words in Romance.The ALF can also be used as a small scale, "vertical" tool, to spot infrequent phenomena such as complementizer doubling or the presence of alternate lexical forms such as mie, which are very isolated in the atlas; trusting the informants that produced them can then be an incentive to try and understand why these forms surface and start setting up corpora and specific surveys, at least while there are some speakers left.Doing so on these two cases lead us to add evidence that the typology of complementizer doubling must be revised, one type relying on head-movement of a Fin head, the other one on the presence of Informational Structure heads, and that the typology and analysis of Negative Concord should encompass both optional and partial concord.
Lastly, these first results point the fact that the ALF can serve at least two long-term objectives.First, it invites us to reassess, as a few have already started doing, the diachrony of French syntax by taking into account its potential links with the syntax of the dialects that co-existed with it for centuries, especially its next of kin.Second, it can, to some extent, enable us to assess how syntax connects (or not) with the better known phonological and lexical features that have been used to define dialects and sub-dialects, a non-trivial question both for dialectology and for the structure of grammars.

Figure 1 :
Figure 1: Dominant patterns for wh-questions in the ALF.

Figure 2 :
Figure 2: Est-ce que and c'est que in ALF wh-questions.

Table 1 :
First maps in the ALF.

Table 2 :
Direct wh-questions in the ALF.

Table 3 :
Attested patterns for wh-questions in the ALF.

Table 4
, while a que-construction, whether it seems to proceed from a Doubly Filled Comp construction as in [2], a est-ce construction as in [3] or a c'est one as in [4] and [10], is compatible both with declarative order and prodrop, but not with SCLI.This is materialized in Table4by the dark grey filling.
I=have not can.ptcpnothing bring.backfrom the fair 'I couldn't bring anything back from the fair.' (Dumont, Vimeu) b. […] i y o point dzoin dé rien nous dire.there is not need to nothing us=tell 'we don't need to say anything to each other.' (Calais, Amiénois)