The paths to the atomic structures of proteins and nucleic acids

Schulz, Georg E.

doi:10.1007/s40828-023-00180-x

The paths to the atomic structures of proteins and nucleic acids

Lecture Text
Open access
Published: 15 June 2023

Volume 9, article number 7, (2023)
Cite this article

Download PDF

You have full access to this open access article

ChemTexts Aims and scope Submit manuscript

The paths to the atomic structures of proteins and nucleic acids

Download PDF

Georg E. Schulz¹

1359 Accesses
Explore all metrics

Abstract

Atomic structures of large biological molecules were first established by scattering X-rays in protein crystals and later with crystals of nucleic acids. Good crystals allow for an accuracy of 0.1 Å (10⁻¹¹ m) that may reveal details of catalytic processes. The novel cryo-electron microscopy method does not need crystals and it can establish chain folds confidently. Chain folds can also be derived from NMR data to produce numerous binary atomic distances. Recently, chain folds for a given amino acid sequence were derived by mere computing, based on the presently available large library of proteins that are related by their amino acid sequences and structures.

Graphical abstract

Advances in Structural Bioinformatics

Introduction to neutron scattering

Article Open access 25 October 2023

Thermodynamics of protein folding: methodology, data analysis and interpretation of data

Article 03 April 2019

Introduction

Living material consists of mostly four types of molecules; lipids, saccharides, proteins, and nucleic acids. Lipids associate with membranes or they form long-term energy reservoirs. Saccharides form either short-term energy stores (glucose alpha bonds, starch) or solid scaffolds (glucose beta bonds, cellulose). As lipids and saccharides depend on the local activities of enzymes, they are rarely uniform and, therefore, not discussed here. In contrast, proteins and nucleic acids have defined structures based on genes. The historical pathway to the elucidation of these structures is here outlined.

Protein crystal structures

The story began more than 180 years ago when Hünefeld detected crystals in squashed blood [1]. Probably, the crystals had been formed by hemoglobin, the ubiquitous red dye coloring blood. The identity of these crystals was confirmed 22 years later by Hoppe-Seyler [2], who isolated and crystallized hemoglobin. Today, such crystals would certify the uniformity of all hemoglobin molecules. At that time, however, things were less clear. In 1869 the nucleic acids were detected by Miescher [3]. In contrast to proteins, the nucleic acids did not crystallize. Analyzable crystals of these molecules were produced only almost 100 years later (Fig. 1).

In 1895 Röntgen [4] performed experiments with cathode ray tubes, in which electrons with energies of more than 10,000 eV were shot onto a metal like tungsten, where they produced a penetrating (so-called) X-radiation. This radiation showed, e.g., the bones of a hand on a scintillation screen without hurting the tissue. Consequently, this observation was developed into an important diagnostic medical tool. When in 1912, a crystal was exposed to a thin X-ray beam, a multitude of weak beams split away from the incident beam giving rise to so-called reflections that were documented on a photographic film. This observation was correctly interpreted by von Laue [5] as an interference phenomenon involving the scattering of an electromagnetic wave by the periodically located electrons in the crystal. This confirmed both the wave nature of the X-rays and the periodic arrangement of atoms in the crystal.

The wavelength of the X-rays was around 10^–10 m = 1 Å, which corresponds to the atom–atom binding lengths and should therefore allow one to locate individual atoms in a crystal. The reflections are measurable because the scattered waves of millions of crystal unit cells add up in the interference process. Moreover, von Laue [5] showed that the electron density in the crystal (and thus the atom positions) can be calculated from a Fourier synthesis of the reflections. However, such a reconstruction requires the intensities as well as the phases of the reflections. Unfortunately, the phases cannot be measured directly, but only derived indirectly.

The phases may be established by combining several pieces of information: for instance, the electron density distribution in the crystal has to be positive everywhere, the distribution of electrons around an atomic nucleus is radial, the atom radii are known, all atomic bonds are close to a certain specific distance, the internal symmetries of crystals cause restrictions to the phase angles, partial structures may be known and accounted for (e.g., a phenyl ring), etc. In 1986, all these possibilities were compiled by Hauptman [6] under the name direct methods. Moreover, for crystals with less than a handful of atoms in the unit cell, a Fourier synthesis of the mere intensities without the phases (a Patterson function) may yield the positions of these atoms in the unit cell.

The first crystal structures, namely those of NaCl and diamond, were determined in 1913 by Bragg and Bragg [7]. They were followed by numerous other structures of larger molecules culminating in the structure of vitamin B₁₂ (M_r = 1355), which was elucidated in 1956 by Crowfoot-Hodgkin [8]. Crystals of molecules smaller than vitamin B₁₂ are usually analyzed by direct methods [6], but they do not work for larger molecules.

Around 1920, the intrinsic stability of proteins like hemoglobin was generally accepted, but proteineous enzymes remained mysterious. As enzymes catalyze chemical reactions, they should be intrinsically mobile. At that time, they were considered colloids without a stable spatial structure. The puzzle was solved in 1926 when Sumner reported crystals of the enzyme urease [9]. The crystals indicated that enzymes also have a defined spatial structure. Later on, it became clear that enzymes are indeed mobile, but can crystallize in one of their stable states. In 1995, the first movie of all states of an enzyme over a complete catalytic cycle was published [10].

The first structural knowledge on proteins did not come from crystals but from peptide fibers. In 1931, Astbury analyzed such fibers and detected two dominant X-ray scattering patterns, which he named α (observed with wool) and β (characteristic for silk) [11]. The actual structures of the α- and β-fibers remained obscure for 20 years. However, when Pauling studied the crystal structures of small peptides, he recognized that the bonds between the amino acid residues are always in the trans conformation, greatly restricting the structures of longer peptides [12]. Long all-trans-peptides can assume only two regular conformations stabilized by hydrogen bonding, the α-helix and the β-sheet, which actually corresponded to the α- and β-patterns of the fibers analyzed by Astbury [11]. These regular conformations turned out to constitute the dominant substructures (so-called secondary structures) of proteins.

The first serious X-ray diffraction experiment on a protein crystal (Fig. 2) was performed in 1934 by Bernal [13]. The crystals contained the enzyme pepsin and showed defined reflections up to high (about 2 Å) resolution, confirming the proposal of Sumner [9] and indicating that the atomic structure of pepsin could be obtained in principle. Actually, however, the protein structure remained unknown because the phases of the reflections could not be determined. A suitable method for phase determination was invented only 17 years later by Bijvoet [14], who compared the reflection intensities of the isomorphous crystals of strychnine sulfate and strychnine selenate and derived the position of the sulfur (selenium) atom in the unit cell by a Patterson function of the reflection intensity differences. The position helped decisively in determining all phases. Bijvoet named this the method of isomorphous replacement.

Three years later, Perutz [15] used a variation of this idea with hemoglobin crystals. He soaked the crystal with a solution of mercury ions that bound locally in a defined manner at the free cysteines of the protein. Soaking was possible because his protein crystal, like all others, consisted of about 50% water. As usually several cysteines were available, he called this method multiple isomorphous replacement (MIR). The localized 80 electrons of a mercury atom change all reflection intensities measurably. A Fourier synthesis of these differences (difference Patterson) reveals the mercury atom positions, which in turn can be used for determining all phases. The MIR method was applied in almost all following structure analyses of proteins and nucleic acids. Astonishingly, Perutz [15] did not quote Bijvoet [14], the initiator of this method.

Using the MIR method, Kendrew [16] produced the electron density map of a myoglobin crystal (M_r = 17,000) 6 years later. During this analysis the phases of around 10,000 reflections had been calculated, which was an extraordinary logistic achievement in those days without versatile computers. It should be noted that the reliable interpretation of the resulting electron density map required the amino acid sequence of myoglobin. After the pioneering work of Sanger [17], that sequence was available on time. It turned out that myoglobin consists exclusively of α-helices, the geometry of which confirmed the substructure proposal of Pauling [12]. Five years after the atomic structure of myoglobin, Phillips [18] determined the first structure of an enzyme, lysozyme, which had crystallized in one of its stable conformations as proposed by Sumner [9] and Bernal [13]. In the beginning, the MIR phasing method was generally applied. However, after numerous protein structures were established, the molecular replacement method, in which phases were determined in a refinement using a resembling (part of the) protein structure, became popular [19].

The 60 years following the determination of the structure of myoglobin saw a multitude of reports on atomic protein and enzyme structures, giving rise to a very large amount of structural data. Since 1971, the protein structure data were normalized and compiled in an easily accessible bank, the Protein Data Bank [20, 21]. This bank brought an exceptional stimulus for this field of research.

Until 1985, all structures were from soluble proteins, because membrane proteins failed to crystallize as they associated nonspecifically at hydrophobic surface patches. After tedious experiments, Michel [22] observed in 1982 that membrane proteins can also be crystallized if their hydrophobic surface regions were covered by detergent molecules. This expanded the field of known atomic protein structures appreciably.

The size and the importance of the published protein structures grew with time. A typical structure is shown as a ribbon plot in Fig. 3. It is the membrane channel MspA, which is the base of the modern DNA sequence analysis [23, 24]. Several important atomic structures were rewarded with a Nobel prize, beginning with the first membrane protein [25], followed by the F₁-ATPase [26], the potassium channel [27], RNA polymerase [28], and the G-protein-coupled receptor [29]. The analysis of crystallized proteins remains important because only this method allows for positional accuracies of 0.1 Å that are required for the explanation of catalytic processes.

Nucleic acid structures

After the nucleic acids were detected by Miescher [3], it took a long time before their chemical structures were established. Nucleic acids are linear chains of nucleotides that are connected via phosphodiester bonds. Each nucleotide consists of a heteroaromatic ring system (base) and a ribose (RNA) or 2′-deoxyribose (DNA) 5′-phosphate. There are essentially four different bases, the sequences of which constitute the genetic information of all protein and RNA molecules. Sanger [30] and Gilbert [31] separately designed two analytical methods for determining such DNA sequences. Nowadays, the Sanger method has been greatly simplified and extensively applied, giving rise to a very large number of known natural DNA sequences. In analogy to the spatial protein structures in the Protein Data Bank [20, 21], the linear DNA sequences were compiled in another easily accessible data bank, GenBank [32], which also brought a great stimulus for the research field.

The first report on the spatial structure of a piece of DNA was published only 70 years after Miescher [3], when Astbury [33] drew a thin fiber out of bulk DNA material and subjected it to X-rays. The resulting scattering pattern contained a very strong 3.5 Å reflection that indicated long stacks of bases along the DNA fiber. Thirteen years later in 1951 Chargaff [34] performed a detailed quantitative chemical analysis of DNA, finding that the amount of base G corresponded to that of base C and the amount of base A to that of base T. This indicated the existence of base pairs G–C and A–T in the DNA. Two years later, accounting for base stacks [33], base pairing [34], and for an unauthorized photo from Franklin [35], Watson and Crick built a DNA model that fitted biology (exact duplication via base pairing = inheritance), chemistry (base pairing via hydrogen bonds), physics (hydrophobic inside and polar outside), and informatics (the general structure was independent of the base pair sequence) [36]. Their model turned out to be correct and was a great leap forward.

In the following interim, there was no hint of a folded spatial structure of single- or double-stranded DNA. However, it became clear that RNA exists in more or less stable folded single-stranded structures, as indicated for the numerous transfer RNA molecules. In 1970 Cramer et al. [37] produced the first X-ray-grade crystals from a phenylalanine-specific transfer RNA of yeast. Three years later the crystal structure of this particular transfer RNA was determined by Rich [38] and independently by Klug [39], both using the MIR methods known from protein analyses. They found double helices resembling the Watson–Crick DNA model that interweaved with each other. As with proteins, the electron density distribution in the crystal could only be interpreted using the known base sequence. This sequence, however, had been established long before the crystal analysis and used for base pairing trials that had already indicated where the single-stranded RNA is involved in double-helical interactions.

In 1982 Cech [40] discovered RNA molecules that are active catalysts and named them ribozymes. Several groups then focused on and succeeded in crystallizing ribozymes, giving rise to a number of structures. In particular Yonath [41] tried to crystallize full ribosomes and parts thereof for a long time. After ribosomal crystals appeared, other scientists got interested and joined the endeavor, which in 2000 resulted in the ribosome structure being elucidated in three separate competing analyses by Yonath [41], Steitz [42], and Ramakrishnan [43]. The structure showed that ribosomes are ribozymes despite the large number of associated proteins. The ribosomal proteins do not participate in the formation of the peptide bond, but merely stabilize the RNA structure. This observation corroborated the hypothesis that there existed an original “RNA world”, which was superseded by our present more efficient RNA–DNA–protein world.

Cryo-electron microscopy

Besides the MIR analyses of crystals, there exist further methods which, however, in general do not reach the quality of a good crystal structure. In 1939, Ruska [44] designed and built an electron microscope. This apparatus has a theoretical accuracy far below 1 Å, but as a result of the small aperture of the electron beam and the low contrast in the sample, the real resolution remained far above 1 Å. Over the years, however, the electron microscope and the sample preparation have been greatly improved. In 1984, for instance, Dubochet [45] introduced the cryo sample, reducing dramatically the scattering background of supporting material. Following the work of Henderson [46], the sensitivity of the electron detector was greatly improved. Upon these developments Frank [47] was able in 1995 to derive the structures of large molecules from numerous projections from different angles that were appropriately added and averaged. The method reached resolutions below 3 Å that allowed one to trace the polypeptide chain with confidence. It has been applied for numerous proteins and RNAs, all of which are available from the Protein Data Bank [20, 21].

Recently, another electron microscopy method became available—micro-crystal electron diffraction—that uses essentially two-dimensional crystals and very weak electron beams [48]. Here, the third dimension is explored by stage tilting. The rate of depositions of electron microscope structures in the Protein Data Bank is presently about half of that of X-ray structures [49].

Nuclear magnetic resonance

A further crystal-free method was introduced by Wüthrich [50]. This method requires a highly concentrated mono-disperse protein solution and an apparatus suitable for the measurement of nuclear magnetic spin resonances. The measurable magnetic interaction between spatially neighboring nuclear spins can be interpreted as their local distances within the large molecule. The three-dimensional structure is then calculated from a multitude of mutual distances using an iterative algorithm. This method does not need crystals; however, the obtained structures do not reach the quality of a good crystal structure. On the other hand, the determined structure is more natural because it is not disturbed by crystal contacts. Unfortunately, the identification of contacting atoms is very tedious so that the number of such structures in the Protein Data Bank [20, 21] is rather limited [49].

Calculation of atomic protein structures from amino acid sequences

Early on, it became clear that a large number of known spatial protein structures, which are related to each other across millions of organisms, may in the future form an extensive library in which a spatial structure could be derived from a given amino acid sequence alone [51]. As such a sequence can be translated from an easily measurable underlying DNA sequence, the structure analysis should become a simple enterprise. In the beginning, the number of known sequences and structures was rather small. Despite this limitation, several groups developed methods for predicting substructures from sequences. For quite a time these methods remained unreliable. This changed, however, with a combined secondary (sub)structure prediction for the given sequence of the enzyme adenylate kinase [52]. Here, a simple addition of the submitted nine predictions outlined accurately all α-helices, β-strands, and loops. Obviously, at that time, the size of the available data library allowed for an identification of substructures.

In light of the numerous spatial protein structures published in the following years, the structure prediction methods improved appreciably. In order to establish the status of the field, Moult [53] invited all interested groups to a Conference for the Assessment of Structure Predictions (CASP) in 1994. The meeting was held in Asilomar, California and considered a success. Consequently, the participants decided to repeat it biannually, giving rise to the 15th CASP meeting held this year. At the 14th CASP meeting in 2021, Jumper [54] presented the computer program AlphaFold that predicted complete spatial structures with an astonishing accuracy. It was based on the very large data compilations presently available from GenBank [32] and the Protein Data Bank [20, 21]. Presently, it requires only about 1 day computing time for a structure. Moreover, AlphaFold has been followed up by other similar programs [55, 56]. As the data banks expand quickly, these programs are bound to improve in the future. Nowadays, any protein structure analysis will start with a DNA sequence (translated to an amino acid sequence) that is applied to one or more of the artificial intelligence programs [54,55,56]. The resulting initial model is then used for guiding all further experimental analyses.

Data availability

No data was used for the research described in the article.

References

Hünefeld FL (1840) Der Chemismus in der thierischen Organisation. Brockhaus 158–163
Hoppe-Seyler F (1862) Über das Verhalten des Blutfarbstoffs im Spektrum des Sonnenlichtes. Virchows Arch 23:446–449
Google Scholar
Miescher JF (1871) Über die chemische Zusammensetzung der Eiterzellen. Med Chem Untersuchungen (ed. Hoppe-Seyler) 4:441–460
Google Scholar
Röntgen WC (1895) Über eine neue Art von Strahlen (Vorläufige Mitteilung). Verlag Stahel’sche Buchhandlung, Würzburg
Google Scholar
von Laue M, Friedrich W, Knipping P (1912) Interferenz Erscheinungen bei Röntgenstrahlen. Verlag der Bayrischen Akademie der Wissenschaften, München
Google Scholar
Hauptman H (1986) The direct methods of X-ray crystallography. Science 233:178–183
CAS PubMed Google Scholar
Bragg WH, Bragg WL (1913) The structure of the diamond. Proc Roy Soc Lond A 89:277–291
Google Scholar
Brink C, Crowfoot-Hodgkin D, Linsay J, Pickworth J, Robertson JH, White JG (1954) X-ray crystallographic evidence on the structure of vitamin B₁₂. Nature 174:1169–1171
CAS PubMed Google Scholar
Sumner JB (1926) The isolation and crystallization of the enzyme urease: preliminary paper. J Biol Chem 69:435–441
CAS Google Scholar
Vonrhein C, Schlauderer GJ, Schulz GE (1995) Movie of the structural changes during a catalytic cycle of nucleoside monophosphate kinases. Structure 3:483–490
CAS PubMed Google Scholar
Astbury WT, Street A (1931) X-ray studies of the structure of hair, wool, and related fibers. Philos Trans Roy Soc Lond A 230:75–101
Google Scholar
Pauling L, Corey RB, Branson HR (1951) The structure of proteins: two hydrogen-bonded helical configurations of the polypeptide chain. Proc Natl Acad Sci USA 37:205–211
CAS PubMed PubMed Central Google Scholar
Bernal JD, Crowfoot D (1934) X-ray photographs of crystalline pepsin. Nature 133:794–795
CAS Google Scholar
Bokhoven C, Schoone JC, Bijvoet JM (1951) The Fourier synthesis of the crystal structure of strychnine sulphate pentahydrate. Acta Cryst 4:275–280
CAS Google Scholar
Bragg L, Perutz MF (1954) The structure of haemoglobin VI. Fourier projection on the 010 plane. Proc Roy Soc Lond A 225:315–329
CAS Google Scholar
Kendrew JC, Dickerson RE, Strandberg BE, Hart RG, Davies DR, Phillips DC, Shore VC (1960) Structure of myoglobin. A three-dimensional Fourier synthesis at 2 Å resolution. Nature 185:422–427
CAS PubMed Google Scholar
Sanger F, Thompson EOP (1953) The amino acid sequence in the glycyl chain of insulin. Biochem J 53:366–374
CAS PubMed PubMed Central Google Scholar
Blake CCF, Koenig DF, Mair GA, North ACT, Phillips DC, Sarma VR (1965) Structure of hen egg-white lysozyme. A three-dimensional Fourier synthesis at 2 Å resolution. Nature 206:757–761
CAS PubMed Google Scholar
Rossmann MG (1990) The molecular replacement method. Acta Cryst A 46:73–82
Google Scholar
Protein Data Bank (1971) Protein Data Bank. Nat New Biol 233:223–223
Google Scholar
Sussman JL, Lin D, Jiang J, Manning NO, Prilusky J, Ritter O, Abola EE (1998) Protein Data Bank (PDB): database of 3D structural information of biological macromolecules. Acta Cryst D 54:1078–1084
CAS Google Scholar
Michel H (1982) Three-dimensional crystals of a membrane protein complex. J Mol Biol 158:567–572
CAS PubMed Google Scholar
Faller M, Niederweis M, Schulz GE (2004) The structure of a mycobacterial outer membrane channel. Science 303:1189–1192
CAS PubMed Google Scholar
Derrington IM, Butler TZ, Collins MD, Manrao E, Pavlenok M, Niederweis M, Gundlach JH (2010) Nanopore DNA sequencing with MspA. Proc Natl Acad Sci USA 107:16060–16065
CAS PubMed PubMed Central Google Scholar
Deisenhofer J, Epp O, Miki M, Huber R, Michel H (1985) Structure of the protein subunits in the photosynthetic reaction centre of Rhodopseudomonas viridis at 3 Å resolution. Nature 318:618–624
CAS PubMed Google Scholar
Abrahams JP, Leslie AGW, Lutter R, Walker JE (1994) Structure at 2.8 Å resolution of F₁-ATPase from bovine heart mitochondria. Nature 370:621–628
CAS PubMed Google Scholar
Doyle DA, Cabral JM, Pfuetzner RA, Kuo A, Gulbis JM, Cohen SL, Chait BT, MacKinnon R (1998) The structure of the potassium channel: molecular basis of K⁺ conduction and selectivity. Science 280:69–77
CAS PubMed Google Scholar
Westover K, Bushnell DA, Kornberg RD (2001) Structural basis of transcription: RNA polymerase II at 2.8 Ångstrom resolution. Science 292:1863–1876
Google Scholar
Rasmussen SGF, Choi H-J, Rosenbaum DM, Kobilka TS, Thian FS, Edwards PC, Burghammer M, Ratnalla VRP, Sanishvili R, Fischetti RF, Schertler GFX, Weis WI, Kobilka BK (2007) Crystal structure of the human β₂ adrenergic G-protein-coupled receptor. Nature 450:383–387
CAS PubMed Google Scholar
Brownlee GG, Sanger F, Barrel BG (1967) Nucleotide sequence of 5S-ribosomal RNA from Escherichia coli. Nature 215:735–736
CAS PubMed Google Scholar
Gilbert W, Maxam A (1973) The nucleotide sequence of the lac operator. Proc Natl Acad Sci USA 70:3581–3584
CAS PubMed PubMed Central Google Scholar
Benson DA, Cavanaugh M, Clark K, Karsch-Mizrachi I, Lipman DJ, Ostell J, Sayers EW (2013) GenBank. Nucl Acid Res 41:D36–D42
CAS Google Scholar
Astbury WT, Bell FO (1938) X-ray study of thymonucleic acid. Nature 141:747–748
CAS Google Scholar
Chargaff E, Lipshitz R, Green C, Hodes ME (1951) The composition of the desoxyribonucleic acid of salmon sperm. J Biol Chem 192:223–230
CAS PubMed Google Scholar
Franklin RE, Gosling RG (1953) Evidence for 2-chain helix in crystalline structure of sodium deoxyribonucleate. Nature 172:156–157
CAS PubMed Google Scholar
Watson JD, Crick FHC (1953) Molecular structure of nucleic acids. Nature 171:737–738
CAS PubMed Google Scholar
Cramer F, von der Haar F, Holmes KC, Saenger W, Schlimme E, Schulz GE (1970) Crystallization of phenylalanine specific transfer ribonucleic acid. J Mol Biol 51:523–530
CAS PubMed Google Scholar
Kim SH, Quigly GJ, Suddath FL, McPherson A, Sneden D, Kim JJ, Weinzierl J, Rich A (1973) Three-dimensional structure of yeast phenylalanine transfer-RNA. Folding of the polynucleotide chain. Science 179:285–288
CAS PubMed Google Scholar
Robertus JD, Ladner JE, Finch JT, Rhodes D, Brown RS, Clark BFC, Klug A (1974) Structure of yeast phenylalanine tRNA at 3 Å resolution. Nature 250:546–551
CAS PubMed Google Scholar
Kruger K, Grabowski PJ, Zaug AJ, Sands J, Gottschling DE, Cech TR (1982) Self-splicing RNA: autoexcision and autocyclization of the ribosomal RNA intervening sequence of tetrahymena. Cell 31:147–157
CAS PubMed Google Scholar
Schluenzen F, Tocilj A, Zarivach R, Harms J, Gluehmann M, Janell D, Bashan A, Bartels H, Agmon I, Franceschi F, Yonath A (2000) Structure of functionally activated small ribosomal subunit at 3.3 Å resolution. Cell 102:615–623
CAS PubMed Google Scholar
Ban N, Nissen P, Hansen J, Moore P, Steitz TA (2000) The complete atomic structure of the large ribosomal subunit at 2.4 Å resolution. Science 289:905–920
CAS PubMed Google Scholar
Wimberly BT, Brodersen DE, Clemens WM Jr, Morgan-Warren RJ, Carter AP, Vonrhein C, Hartsch T, Ramakrishnan V (2000) Structure of the 30S ribosomal subunit. Nature 407:327–339
CAS PubMed Google Scholar
Borries Bv, Ruska E (1939) Ein Übermikroskop für Forschungsinstitute. Naturwissenschaften 27:577–582
Google Scholar
Adrian M, Dubochet J, Lepault J, McDowall AW (1984) Cryo-electron microscopy of viruses. Nature 308:32–36
CAS PubMed Google Scholar
Henderson R, Baldwin JM, Ceska TA, Zemlin F, Beckmann E, Downing KH (1990) Model for the structure of bacteriorhodopsin based on high-resolution electron cryo-microscopy. J Mol Biol 213:899–929
CAS PubMed Google Scholar
Agrawal RK, Penczek P, Grassucci RA, Li Y, Leith A, Nierhaus KH, Frank J (1996) Direct visualization of A-, P-, and E-site transfer RNAs in the Escherichia coli ribosome. Science 271:1000–1002
CAS PubMed Google Scholar
Corbett KD, Herzig MA Jr (2020) Electron counting takes microED to the next level. Nat Methods 19:649–655
Google Scholar
Protein Data Bank (2023) Number of released PDB structures per year. https://www.rcsb.org/stats/all-released-structures
Kline AD, Braun W, Wüthrich K (1988) Determination of the complete three-dimensional structure of the α-amylase inhibitor tendamistat in aqueous solution by nuclear magnetic resonance and distance geometry. J Mol Biol 204:675–724
CAS PubMed Google Scholar
Schulz GE (1974) Wird die Röntgenstrukturanalyse von Proteinen überflüssig? Nachr Chem Techn 22:431–432
Google Scholar
Schulz GE, Barry CD, Friedmann J, Chou PY, Fasman GD, Finkelstein AV, Lim VI, Ptitsyn OB, Kabat EA, Wu TT, Levitt M, Robson B, Nagano K (1974) Comparison of predicted and experimentally determined secondary structure of adenylate kinase. Nature 250:140–142
CAS PubMed Google Scholar
Moult J, Pedersen JT, Judson R, Fidelis K (1995) A large-scale experiment to assess protein structure prediction methods. Proteins Struc Func Genet 23:ii–iv
CAS Google Scholar
Jumper J, Evans R et al (2021) Highly accurate protein structure prediction with AlphaFold. Nature 596:583–589
CAS PubMed PubMed Central Google Scholar
Baek M, DiMaio F et al (2021) Accurate prediction of protein structures and interactions using a three-track neural network. Science 373:871–876
CAS PubMed PubMed Central Google Scholar
Lin Z, Akin H et al (2023) Evolutionary-scale prediction of atomic level protein structure with a language model. Science 379:1123–1130
CAS PubMed Google Scholar

Download references

Acknowledgements

A slightly different German version of this manuscript was published in BioSpektrum 02/23 page 118.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Institut für Biochemie, Universität Freiburg im Breisgau, Albertstr. 21, 79104, Freiburg, Germany
Georg E. Schulz

Authors

Georg E. Schulz
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

GES wrote the paper and produced the figures (Fig. 2 was obtained with permission from Judith A. Howard, Durham, UK).

Corresponding author

Correspondence to Georg E. Schulz.

Ethics declarations

Conflict of interest

The author declares no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Schulz, G.E. The paths to the atomic structures of proteins and nucleic acids. ChemTexts 9, 7 (2023). https://doi.org/10.1007/s40828-023-00180-x

Download citation

Received: 13 April 2023
Accepted: 09 May 2023
Published: 15 June 2023
DOI: https://doi.org/10.1007/s40828-023-00180-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

The paths to the atomic structures of proteins and nucleic acids