Human LINE-1 retrotransposition requires a metastable coiled coil and a positively charged N-terminus in L1ORF1p

LINE-1 (L1) is an autonomous retrotransposon, which acted throughout mammalian evolution and keeps contributing to human genotypic diversity, genetic disease and cancer. L1 encodes two essential proteins: L1ORF1p, a unique RNA-binding protein, and L1ORF2p, an endonuclease and reverse transcriptase. L1ORF1p contains an essential, but rapidly evolving N-terminal portion, homo-trimerizes via a coiled coil and packages L1RNA into large assemblies. Here, we determined crystal structures of the entire coiled coil domain of human L1ORF1p. We show that retrotransposition requires a non-ideal and metastable coiled coil structure, and a strongly basic L1ORF1p amino terminus. Human L1ORF1p therefore emerges as a highly calibrated molecular machine, sensitive to mutation but functional in different hosts. Our analysis rationalizes the locally rapid L1ORF1p sequence evolution and reveals striking mechanistic parallels to coiled coil-containing membrane fusion proteins. It also suggests how trimeric L1ORF1p could form larger meshworks and indicates critical novel steps in L1 retrotransposition.

1 Overall quality at a glance i O The following experimental techniques were used to determine the structure:

X-RAY DIFFRACTION
The reported resolution of this entry is 2.65 Å.
Percentile scores (ranging between 0-100) for global validation metrics of the entry are shown in the following graphic. The table shows the number of entries on which the scores are based.

Metric
Whole archive (#Entries) Similar resolution (#Entries, resolution range(Å)) The table below summarises the geometric issues observed across the polymeric chains and their t to the electron density. The red, orange, yellow and green segments on the lower bar indicate the fraction of residues that contain outliers for >=3, 2, 1 and 0 types of geometric quality criteria. A grey segment represents the fraction of residues that are not modelled. The numeric value for each fraction is indicated below the corresponding segment, with a dot representing fractions <=5% The upper red bar (where present) indicates the fraction of residues that have poor t to the electron density. The numeric value is given above the bar.

Mol Chain Length
Quality of chain In the tables below, the ZeroOcc column contains the number of atoms modelled with zero occupancy, the AltConf column contains the number of residues with at least one atom in alternate conformation and the Trace column contains the number of residues modelled with at most 2 atoms.
Molecule 1 is a protein. 3 Residue-property plots i O These plots are drawn for all protein, RNA and DNA chains in the entry. The rst graphic for a chain summarises the proportions of the various outlier classes displayed in the second graphic. The second graphic shows the sequence view annotated by issues in geometry and electron density. Residues are color-coded according to the number of geometric quality criteria for which they contain at least one outlier: green = 0, yellow = 1, orange = 2 and red = 3 or more. A red dot above a residue indicates a poor t to the electron density (RSRZ > 2). Stretches of 2 or more consecutive residues without any outlier are shown as a green connector. Residues present in the sample, but not in the model, are shown in grey.
There are no chirality outliers.
There are no planarity outliers.  6  D  2  0  0  0  0  7  H  64  0  0  0  0  All  All  4919  0  4851  18  0 The all-atom clashscore is dened as the number of clashes found per 1000 atoms (including hydrogen atoms). The all-atom clashscore for this structure is 2.

Mol Chain Non-H H(model) H(added) Clashes Symm-Clashes
All (18)  There are no Ramachandran outliers to report.

Protein sidechains i O
In the following table, the Percentiles column shows the percent sidechain outliers of the chain as a percentile score with respect to all X-ray entries followed by that with respect to entries of similar resolution.
The Analysed column shows the number of residues for which the sidechain conformation was analysed, and the total number of residues. There are no bond length outliers.

Mol Chain Analysed Rotameric Outliers Percentiles
There are no bond angle outliers.
There are no chirality outliers.
There are no torsion outliers.
There are no ring outliers.
No monomer is involved in short contacts.