Doublecortin engages the microtubule lattice through a cooperative binding mode involving its C-terminal domain

Doublecortin (DCX) is a microtubule (MT)-associated protein that regulates MT structure and function during neuronal development and mutations in DCX lead to a spectrum of neurological disorders. The structural properties of MT-bound DCX that explain these disorders are incompletely determined. Here, we describe the molecular architecture of the DCX–MT complex through an integrative modeling approach that combines data from X-ray crystallography, cryo-electron microscopy, and a high-fidelity chemical crosslinking method. We demonstrate that DCX interacts with MTs through its N-terminal domain and induces a lattice-dependent self-association involving the C-terminal structured domain and its disordered tail, in a conformation that favors an open, domain-swapped state. The networked state can accommodate multiple different attachment points on the MT lattice, all of which orient the C-terminal tails away from the lattice. As numerous disease mutations cluster in the C-terminus, and regulatory phosphorylations cluster in its tail, our study shows that lattice-driven self-assembly is an important property of DCX.


Sample-size estimation
• You should state whether an appropriate sample size was computed when the study was being designed • You should state the statistical method of sample size computation and any required assumptions • If no explicit power analysis was used, you should describe how you decided what sample (replicate) size (number) to use Please outline where this information can be found within the submission (e.g., sections or figure legends), or explain why this information doesn't apply to your submission:

Replicates
• You should report how often each experiment was performed • You should include a definition of biological versus technical replication • The data obtained should be provided and sufficient information should be provided to indicate the number of independent biological and/or technical replicates • If you encountered any outliers, you should describe how these were handled • Criteria for exclusion/inclusion of data should be clearly stated • High-throughput sequence data should be uploaded before submission, with a private link for reviewers provided (these are available from both GEO and ArrayExpress) Please outline where this information can be found within the submission (e.g., sections or figure legends), or explain why this information doesn't apply to your submission: No sample size estimation was applied in this study. The standard in the field of crosslinking mass spectrometry involves establishing the stability and repeatability of a method, which we reported in a separate publication (Rafiei A and Schriemer DC (2019)  We used two biological replicates. The methodology is extensively described on pages 24-27 and we reference our methodology paper mentioned above (Rafiei A and Schriemer DC (2019) Anal. Biochem. 586, 113416). Specifically, we adopted an integrative approach where strong statistical criteria were applied to crosslink identification using the Mass Spec Studio and the data from multiple experiments were aggregated, to accommodate the pseudostochastic nature of data-dependent acquisition experiments in mass spectrometry. This is justified based on the extensive validation work completed in the precursor study.

Statistical reporting • Statistical analysis methods should be described and justified
• Raw data should be presented in figures whenever informative to do so (typically when N per group is less than 10) • For each experiment, you should identify the statistical tests used, exact values of N, definitions of center, methods of multiple test correction, and dispersion and precision measures (e.g., mean, median, SD, SEM, confidence intervals; and, for the major substantive results, a measure of effect size (e.g., Pearson's r, Cohen's d) • Report exact p-values wherever possible alongside the summary statistics and 95% confidence intervals. These should be reported for all key questions and not only when the p-value is less than 0.05.
Please outline where this information can be found within the submission (e.g., sections or figure legends), or explain why this information doesn't apply to your submission: (For large datasets, or papers with a very large number of statistical tests, you may upload a single table file with tests, Ns, etc., with reference to sections in the manuscript.)

Group allocation
• Indicate how samples were allocated into experimental groups (in the case of clinical studies, please specify allocation to treatment method); if randomization was used, please also state if restricted randomization was applied • Indicate if masking was used during group allocation, data collection and/or data analysis Please outline where this information can be found within the submission (e.g., sections or figure legends), or explain why this information doesn't apply to your submission: Additional data files ("source data") • We encourage you to upload relevant additional data files, such as numerical data that are represented as a graph in a figure, or as a summary table • Where provided, these should be in the most useful format, and they can be uploaded as "Source data" files linked to a main figure or table • Include model definition files including the full list of parameters used • Include code used for data analysis (e.g., R, MatLab) • Avoid stating that data files are "available upon request" Please indicate the figures or tables for which source data files have been provided: This information does not specifically apply in our submission due to the integrative nature of results generation as described above.
Group allocation was not required for this study, which involves crosslinking of a narrowly defined set of laboratory samples and integrating the results from multiple identical preparations.
All data are made available through the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifier PXD023950. This provides the user the full set of data (32 files, multiple injections from 2 biological replicates).