USP16 counteracts mono-ubiquitination of RPS27a and promotes maturation of the 40S ribosomal subunit

Establishment of translational competence represents a decisive cytoplasmic step in the biogenesis of 40S ribosomal subunits. This involves final 18S rRNA processing and release of residual biogenesis factors, including the protein kinase RIOK1. To identify novel proteins promoting the final maturation of human 40S subunits, we characterized pre-ribosomal subunits trapped on RIOK1 by mass spectrometry, and identified the deubiquitinase USP16 among the captured factors. We demonstrate that USP16 constitutes a component of late cytoplasmic pre-40S subunits that promotes the removal of ubiquitin from an internal lysine of ribosomal protein RPS27a/eS31. USP16 deletion leads to late 40S subunit maturation defects, manifesting in incomplete processing of 18S rRNA and retarded recycling of late-acting ribosome biogenesis factors, revealing an unexpected contribution of USP16 to the ultimate step of 40S synthesis. Finally, ubiquitination of RPS27a appears to depend on active translation, pointing at a potential connection between 40S maturation and protein synthesis.


Sample-size estimation
• You should state whether an appropriate sample size was computed when the study was being designed • You should state the statistical method of sample size computation and any required assumptions • If no explicit power analysis was used, you should describe how you decided what sample (replicate) size (number) to use Please outline where this information can be found within the submission (e.g., sections or figure legends), or explain why this information doesn't apply to your submission:

Replicates
• You should report how often each experiment was performed • You should include a definition of biological versus technical replication • The data obtained should be provided and sufficient information should be provided to indicate the number of independent biological and/or technical replicates • If you encountered any outliers, you should describe how these were handled • Criteria for exclusion/inclusion of data should be clearly stated • High-throughput sequence data should be uploaded before submission, with a private link for reviewers provided (these are available from both GEO and ArrayExpress) Please outline where this information can be found within the submission (e.g., sections or figure legends), or explain why this information doesn't apply to your submission: No explicit power analysis was used to determine the sample size for each experiment. Sample size was based on empiric considerations taking into account both robustness of the respective assay and the experimental challenge of acquiring the data.
All experiments represented in the figures were performed at least in triplicates except for Figs. 4C, 5C, 7A and S2 which were performed in duplicates and Figures 1D, E and 2A . These repetitions refer to the number of times the entire experiment was performed on independent days and therefore correspond to biological replicates of the experiment. Figure 1D represents a fourth biological replicate of the MS experiment presented in Figure 1 A-C.

Statistical reporting
• Statistical analysis methods should be described and justified • Raw data should be presented in figures whenever informative to do so (typically when N per group is less than 10) • For each experiment, you should identify the statistical tests used, exact values of N, definitions of center, methods of multiple test correction, and dispersion and precision measures (e.g., mean, median, SD, SEM, confidence intervals; and, for the major substantive results, a measure of effect size (e.g., Pearson's r, Cohen's d) • Report exact p-values wherever possible alongside the summary statistics and 95% confidence intervals. These should be reported for all key questions and not only when the p-value is less than 0.05.
Please outline where this information can be found within the submission (e.g., sections or figure legends), or explain why this information doesn't apply to your submission: (For large datasets, or papers with a very large number of statistical tests, you may upload a single table file with tests, Ns, etc., with reference to sections in the manuscript.)

Group allocation
• Indicate how samples were allocated into experimental groups (in the case of clinical studies, please specify allocation to treatment method); if randomization was used, please also state if restricted randomization was applied • Indicate if masking was used during group allocation, data collection and/or data analysis Please outline where this information can be found within the submission (e.g., sections or figure legends), or explain why this information doesn't apply to your submission: Additional data files ("source data") • We encourage you to upload relevant additional data files, such as numerical data that are represented as a graph in a figure, or as a summary table • Where provided, these should be in the most useful format, and they can be uploaded as "Source data" files linked to a main figure or table • Include model definition files including the full list of parameters used • Include code used for data analysis (e.g., R, MatLab) • Avoid stating that data files are "available upon request" Please indicate the figures or tables for which source data files have been provided: For experiments that were quantified, the number of replicates, number of cells, and precision measures (SD) can be found in the figure legends (Figs 1B and C,2C,6B and D,7E). Where necessary, we have performed Student's t-test to calculate the statistical significance of our data. P-values were calculated using Prism and have been reported in the corresponding figure legends (Fig 6B and 7E). For mass spectrometry experiments, data analysis is described in the materials and methods section and p values are indicated in tables S1 and S2. NA eLife Sciences Publications, Ltd is a limited liability non-profit non-stock corporation incorporated in the State of Delaware, USA, with company number 5030732, and is registered in the UK with company number FC030576 and branch number BR015634 at the address 1st Floor, 24 Hills Road, Cambridge CB2 1JP | August 2014 3 Raw mass spectrometry data and corresponding search results have been deposited to the PRIDE (ProteomeXchange) repository under the submission number: PXD016458.