Corrections

BIOPHYSICS AND COMPUTATIONAL BIOLOGY Correction for “Structural defects and the diagnosis of amyloidogenic propensity,” by Ariel Fernández, József Kardos, L. Ridgway Scott, Yuji Goto, and R. Stephen Berry, which appeared in issue 11, May 27, 2003, of Proc Natl Acad Sci USA (100:6446–6451; first published May 12, 2003; 10.1073/ pnas.0731893100). The undersigned authors note the following: “We wish to bring to your attention an issue regarding our PNAS publication referenced above. Although we cite our earlier PNAS publication (see ref. 23 therein), portions of the text and figures are similar to ref. 23 and were not properly attributed. Ref. 23 reports an experimental result, while the paper indicated above reports theoretical work. Nevertheless, in the examples below we should have provided a citation to ref. 23 as the source of the information. “Fig. 2 was adapted from Fig. 1 in ref. 23. Fig. 5 was adapted from Fig. 2 in ref. 23. “The following text in the section titled ‘Structure Wrapping and Molecular Disease’ on page 6447 of our text is similar to the text in the fifth paragraph of the “Results and Discussion” section on page 2392 in ref. 23:

We have now rebuilt and refined the correct (S)-enantiomeric ICRF-187 compound into the ATPase region (see corrected Fig.  5 and legend below). Compared with the original model containing (R)-ICRF-186, the new model with (S)-ICRF-187 alters the angle at which the single methyl group extends from the chiral center of the linker. Despite this change, the well defined electron density of the two dioxopiperazine rings constrains the ethanediyl linker of ICRF-187 to adopt a slightly different twist than observed previously, shifting the coordinate position of the methyl substituent by only Ϸ0.5 Å. As a consequence, the methyl-pi interaction postulated to exist between the ICRF-187 ethanediyl linker and Tyr28 is maintained. Other interactions observed between the drug and topoisomerase II are similarly unaffected, and all discussions and conclusions of the paper still stand. Indeed, both ICRF-186 and ICRF-187 inhibit topoisomerase II with virtually the same K i values (2)(3)(4), an observation that can be explained by the absence of stereospecific interactions between these drugs and the enzyme. The new coordinates have been deposited in the Protein Data Bank, www.rcsb.org (PDB ID code 1QZR).
Group I introns are common in the 23 rRNA genes of mitochondria and chloroplasts. Often, they encode ''homing endonucleases,'' which target highly conserved gene sequences and drive interorganellar intron mobility, even across species and genus lines. Most bacterial 23S rRNA genes show these same endonuclease-sensitive target sequences. However, only two bacterial 23S rRNA genes are known to contain group I introns: that of Simkania negevensis Here we provide direct evidence for splicing, and evolutionary evidence for mobility, of group I introns in the 23S rRNA genes of several free-living hyperthermophilic bacteria of the genus Thermotoga. These bacteria do not live closely with eukaryotes, but phylogenetic analyses suggest that their introns were also acquired from eukaryotic (probably algal) organelles. In vivo, their introns must be spliced at temperatures approaching 90°C, making them the most thermostable natural ribozymes so far described. We demonstrate that at least some of these introns can also self-splice in vitro. G roup I introns constitute a distinct class of ribozyme, characterized by conserved primary and secondary structure and capable of protein-assisted or self-splicing, in vivo and in vitro (1,2). Many group I introns also carry ORFs encoding ''homing endonucleases,'' which render them mobile. Homing endonucleases in the 23S rRNA genes of eukaryotic chloroplasts and mitochondria specifically cleave conserved sequences in intron-free 23S rRNA genes. Repair of cleaved genes, templated by intron-containing intact genes, leads to unidirectional ''gene conversion,'' or copying of the intron, its endonuclease ORF, and flanking sequences into intron-free recipient genes (3,4). Group I introns can thus move between widely divergent species that retain the endonuclease target sequence, an evolutionary process that has been extensively documented among plant and algal plastids and mitochondria (3,5,6). Indeed, it can be convincingly argued that periodic homing provides the only selective pressure for retention of endonuclease activity (7). If this argument is true, then evidence for selection acting on endonuclease gene sequence (for instance, evidence that synonymous codon changes predominate, especially at sites required for activity) can by itself be taken as evidence for evolutionarily recent homing activity.
Until very recently, 23S (or 16S) rDNA introns were unknown in bacteria, a surprise given that most bacterial 23S rRNA genes contain conserved target sequences for intron-encoded homing endonucleases such as I-CeuI and I-CreI (8,9). Nikolchaeva and Woodson (10) showed that functional or retained (i.e., spliced or unspliced) introns artificially introduced into Escherichia coli 23S rRNA inhibited ribosome formation or function, and Edgell et al. (11) suggested that such deleterious consequences might be one of several ''barriers to intron promiscuity in bacteria.'' Group I introns have now been reported in two bacterial 23S rRNA genes, and at least one of them may indeed be deleterious to growth. This intron, described in 1999, is in the 23S rRNA gene of Simikania negevensis (12). It is not spliced out, but persists in the 23S rRNA, where its presence is thought to retard growth (12). The second bacterial 23S rRNA group I intron was only very recently discovered, through the complete sequencing of the genome of Coxiella burnetii (13). Structures of this intron and its ORF encoding a LAGLIDADG homing endonuclease (8,9) are consistent with splicing and nuclease activity, although neither has been demonstrated. S. negevensis and C. burnetii are both obligate intracellular pathogens, encouraging speculation that they acquired introns from the organelles of eukaryotes.

Methods
DNA from different Thermotoga strains was extracted from frozen cell mass donated by K. O. Stetter (University of Regensburg, Regensburg, Germany) by using the protocol of Charbonnier and Forterre (14). RNA from Thermotoga neapolitana NS-E was extracted from the same cell mass by using the RNeasy Mini Kit (Qiagen, Valencia, CA). Other DNAs were gifts from Y. Takahata (Marine Biotechnology Institute, Kamaishi Laboratories, Kamaishi, Japan), S. L'Haridon and C. Jeanthon (University de Bretagne Occidentale, Brest, France), and M. Madsen and T. Lien (University of Bergen, Bergen, Norway).
Amplification of the intron was carried out by using the following primers: Thermotoga23Sintron.2U, GTGACAAG-GCCCTGGCGACT, and Thermotoga23Sintron.275L, GGCATCTTCACCCAGACTGA. Amplifications were carried out in 50 l final volume containing 20-200 ng of template DNA, 1ϫ PCR buffer, 2.5 mM MgCl 2 , 0.2 mM dNTP, 1 mM each primer, 0.5-1 unit of Taq DNA polymerase or HiFi Taq DNA polymerase (Invitrogen). The reactions were submitted to an initial denaturation at 93°C for 3 min, and then 30 cycles at 93°C for 30 s, 55-57°C for 30 s, and 72°C for 1.5-2 min. The resulting PCR products were either (i) cloned by using the TOPO TA Cloning Kit (Invitrogen) with an average of five individual clones sequenced with T7 and m13rev, or (ii) cleaned by using Microcon PCR columns (Millipore) and sequenced directly by using the Thermotoga23Sintron.2U and Thermotoga23Sintron.275L primers.
RT-PCR was carried out by using the C. therm. Polymerase for Reverse Transcription in Two-Step RT-PCR Kit (Roche Applied Science) with 60°C synthesis or the Omniscript reverse transcription kit (Qiagen) with 37°C synthesis, and the Thermotoga23Sintron.275L primer in the first step. The second PCR step was carried out as described above with 57°C annealing. The resulting PCR products were cloned by using the TOPO TA Cloning Kit (Invitrogen), and individual clones were sequenced by using T7 and m13rev.
RNA for in vitro splicing was transcribed by T7 RNA polymerase (Fermentas) off PCR product obtained by using vector primers f lanking the exon-intron inserts described above (cloned into PCR-2.1 TOPO), with 3 mM NTP and 5 mM MgCl 2 to minimize self-splicing during transcription (15). Splicing was initiated by adding 0.2 mM GTP, 25 mM MgCl 2 , and 1 M KCl (16) to the transcripts, and stopped by chilling on ice. The splicing products were analyzed by RT-PCR as described above.
Transcripts for the time-course analysis of the self-splicing reaction were radioactively labeled by adding 1 l of [␣-32 P]GTP (10 Ci͞l; 1 Ci ϭ 37 GBq) in a total reaction volume of 50 l. In these transcription reactions we used 3 mM CTP, UTP, and ATP and 0.8 mM unlabeled GTP. Splicing was carried out as described above in a total volume of 5 l, electrophoresis was on a 6% polyacrylamide͞6 M urea gel, and visualization was by autoradiography.
Samples for Southern blot hybridization were prepared for electrophoresis by digesting 1-2 g of genomic DNA with restriction enzymes in a total of 20 l, and the DNA fragments were separated in a 1% agarose gel. The strains in Fig. 4b and Thermotoga subterranea SL1 were included on the blots. Two samples of each strain were electrophoresed, one cut with EcoRI and one cut with HindIII. The gels were blotted onto positively charged nylon membranes (Roche Diagnostics). A probe made from the PCR product covering both the T. neapolitana NS-E intron and the 23S rDNA was used in the hybridizations. Prehybridization and hybridization were carried out in DIG Easy Hyb (Roche Diagnostics) at 42°C in a rotary oven. Washes were performed in a 0.5ϫ SSC-equivalent buffer at 60°C. The digoxigenin (DIG)-labeled probe was detected by using CDP* (Roche Diagnostics), and exposures were 20 min to 1 h depending on the signal strength.

Results and Discussion
The 23S group I introns we describe here are in strains of species of the free-living hyperthermophilic bacterial genus Thermotoga. They were discovered in suppressive subtractive hybridization experiments (ref. 20 and unpublished data) designed to identify genes that are present in some strains͞species of Thermotoga, but not that of the sequenced Thermotoga maritima (strain MSB8), which has an intron-free 23S rRNA gene. With T. neapolitana NS-E as tester and T. maritima MSB8 as driver (unpublished data), we obtained a 403-bp clone of which base pairs 25-117 showed 60% protein identity (expected by chance 2.0 ϫ 10 Ϫ12 , denoted exp. 2e-12) to a putative group I intron site-specific DNA endonuclease from the chloroplast of the green alga Pterosperma cristatum (GenBank accession no. AAL34315), whereas base pairs 295-403 showed 100% DNA identity to base pairs 2045-2162 of the 23S rRNA of T. maritima MSB8. The junction corresponds to a highly conserved site, strongly suggesting that we had detected a group I intron. In fact, the insertion site (corresponding to position 1931 in the E. coli 23S rRNA) is identical to the insertion site of the intron from P. cristatum (21). Southern blot analyses confirmed that only one 23S rRNA gene exists in T. neapolitana NS-E [and the other Thermotoga strains tested and T. maritima MSB8 (22)], ruling out the possibility that this could be a nonfunctional 23S rRNA gene.
PCR amplification with primers flanking the insertion site showed that the T. neapolitana NS-E 23S intron (denoted Tna.bL1931 according to ref. 23; Fig. 1a) is 699 nt. Computer folding of Tna.bL1931 revealed that it is indeed a group I intron, of subgroup IB (24, 25) (Fig. 1a), as is the P. cristatum intron (21). The intron contains one 489-nt-long ORF (from nucleotide 206 to nucleotide 695) that shows 59% protein identity (exp. 8e-41) to a putative single-LAGLIDADG (26) group I homing endonuclease from the chloroplast of Chlorosarcina brevispinosa (GenBank accession no. AAL34389.1), and high similarity to endonucleases from other green algal group I introns. The endonuclease ORF is inserted into the L8 loop (Fig. 1a), as in other group IB introns, notably the IB4 introns in this position in the 23S rRNA genes in the chloroplasts and mitochondria of green algae, the mitochondria of Acanthamoeba castellanii, and the unspliced intron of S. negevensis (24,26). The length of the T. neapolitana ORF (163 aa) is similar to that of other known LAGLIDADG endonucleases, and its shows very high similarity to the 152-aa I-CpaI along its entire length (40% identities and 65% similarities). I-CpaI has been shown biochemically be a functional endonuclease, cutting at position 1931 (27). All 149 significant hits in translated BLAST searches were to putative intron-encoded homing endonucleases from eukaryotes, with the exception of the putative endonuclease from the unspliced group I intron in S. negevensis (GenBank accession no. AAD38228, exp. 2e-29), the ORF in the putative group I intron in C. burnetii (GenBank accession no. AE016960, exp. 2e-17), and an endonuclease (related to intron-encoded endonucleases) from Methanopyrus kandleri that appears to be a free-standing ORF (GenBank accession no. NP613840, exp. 5e-06). The eight best hits were all from introns found in chloroplasts and mitochondria at the same insertion site as the Thermotoga intron (E. coli 23S position 1931).
To investigate whether Tna.bL1931 indeed is an intron that is spliced from 23S rRNA, we isolated RNA from T. neapolitana NS-E and performed RT-PCR with primers flanking the insertion site. If the intron is processed out but the exons are not rejoined, no band amplification should be observed, whereas if the intron persists in the 23S rRNA, as observed in S. negevenisis (12), only a 972-bp band should be amplified. However, if the intron is properly spliced (exons rejoined), a 273-bp band is expected. This was indeed what we observed (Fig. 2). The sequence of the 273-bp fragment confirmed that Tna.bL1931 was the spliced intronless rRNA.
This intron must be stable and spliceable in vivo at temperatures at Ϸ80-90°C. [80°C is optimal and 90°C is maximal temperature for growth of T. neapolitana NS-E (28).] In accordance with this, the GϩC content of Tna.bL1931 is much higher than observed for other IB4 introns in the same 23S rRNA location (21): 58% GϩC for the intron excluding the ORF and 49% for the complete intron, compared with a GϩC content of 29-39% for other intron sequences. High GϩC content was also reported for a thermostable group I intron from Azoarcus sp. BH72 tRNA Ile (optimum splicing between 70 and 75°C) (29) and a group II intron in the hsp60 gene of Azotobacter vinelandii (optimum splicing at 65°C) (30).
To test whether the intron could self-splice in vitro, we transcribed RNA off the cloned exon-intron fragments, and incubated with Mg 2ϩ and GTP at different temperatures (15). The resulting splicing products were subjected to RT-PCR. If splicing has occurred, both a 972-bp band (intron and exon) and a 273-bp band (exon only) should be amplified. After incubation at various temperatures (30, 40, 50, 70, 80, 90, 100, and 105°C), bands corresponding to the joined exon were indeed detected (Fig. 3a, lanes 1-14). Sequences (data not shown) indicated that some ''correct'' splicing products were obtained at all temperatures tested, but that some alternatively spliced products also appeared at the higher temperatures. To confirm that in vitro splicing was in fact occurring at elevated temperatures (and not as reaction mixtures were being heated or chilled), the kinetics of product formation with radioactively labeled transcript were monitored at 60, 90, and 100°C (Fig. 3b, lanes 1-16). Reactions were too quick and͞or products too unstable at the higher temperatures to permit precise kinetic monitoring, but product formation showed clear time dependence at 60°C.
To detect introns in other members of the Thermotogales, we screened 18 additional strains (representing five genera and seven species) by using the same flanking primers. Introns were found in nine additional strains (Fig. 4), all in the genus Thermotoga. Six are very close relatives of T. neapolitana NS-E and three have been considered different species. For eight of the introns, very little sequence variation was observed compared with the T. neapolitana NS-E sequence (0.1-11% DNA divergence including the ORF and 0-3% excluding the ORF). All conformed to the same structure as the T. neapolitana NS-E intron and were inserted at the same position in the 23S, and the ORF was inserted in the same intron loop. Phylogenetic analyses showed that the phylogeny resembled that of the 16S rRNA (Fig.  4b), when both excluding and including the ORF (Fig. 4c). The only difference was the position of Thermotoga sp. strain RQ7 and Thermotoga sp. strain SG1. However, the branching pattern in the 16S rRNA tree is not strongly supported, and the intron phylogeny mirrors that obtained from two protein-coding genes (31). Hence, this intron was most likely present in the common ancestor of the Thermotoga strains that possess it presently; it was probably lost from T. maritima MSB8 and Thermotoga sp. strain RQ2 (Fig. 4).
Surprisingly, the intron found in T. subterranea SL1 (denoted Tsu.bL1926; Fig. 1b) proved to be quite different from the T. neapolitana intron, and we infer that it results from an independent acquisition. It is inserted a few base pairs upstream of Tna.bL1931, at position 1926 relative to the E. coli sequence, a position that has been found to harbor nuclear group I introns in different protists (21). Tsu.bL1926 is 774 nt, and computer folding of the intron sequence showed that the structure, although also most similar to group IB introns (25), is different from Tna.bL1931. Tsu.bl1926 lacks P2, has a homing endonuclease ORF inserted into L6, and contains additional P9 helices (Fig. 1b). As observed for Tna.bL1931, the GϩC content of this intron is higher than usual (48% excluding the ORF and 42% including the ORF). The GϩC content of this intron is nevertheless lower than that of Tna.bL1931, perhaps reflecting that the optimal growth temperature of T. subterranea is 10°C lower than the optimal temperature for T. neapolitana (32), or that it was acquired more recently. The T. subterranea SL1 endonuclease is 507 bp long (inserted between nucleotides 191 and 697) and showed highest similarity to a putative site-specific DNA endonuclease from a group I intron in the mitochondrion of the charophyte Chaetosphaeridium globosum (GenBank accession no. AAM96632, 36% protein identity, exp. 6e-23). The T. subterranea SL1 intron can also be spliced in vitro (Fig. 3a, lanes  15-24). The optimal temperature under the in vitro splicing conditions we used appears to be lower than what we observed for Tna.bL1931 (Fig. 3a). Together with the lower GϩC content, these observations are consistent with Tsu.bL1926 being adapted for function at a lower temperature than Tna.bL1931.
A maximum-likelihood tree of the two Thermotoga homing endonucleases together with their closest single-LAGLIDADG matches in BLAST searches is shown in Fig. 5. Introns and their endonuclease inserted at the same 23S rRNA position have been shown in earlier studies to be more closely related than introns in different positions, even within the same genome (26,33). The tree in Fig. 5 confirms the close relationship between the endonucleases from all introns at position 1931 in the 23S rRNA and suggests that Tna.bL1931 was acquired from a eukaryote, possibly from the chloroplast or mitochondrion of a green alga. The universally conserved proline residue in ␣3 (P93 in figure 6 in ref. 26) is replaced by a glutamate residue in all the Tna.bL1931-type ORFs, which might be an adaptation to hyperthermophily (34). The putative endonuclease from T. subterranea is only distantly related to that from Tna.bL1931. The otherwise universally conserved glutamine residue in ␤2 (Q47 in figure 6 in ref. 26) has been replaced by asparagine in the T. subterranea SL1 endonuclease and in the algal sequences that appear to be most closely related to it: the sequence from Chaetosphaeridium globosum and one of the endonucleases from the chloroplast of Chlorosarcina brevispinosa (c2.Chlorosarcina brevispinosa in Fig. 4, GenBank accession no. AAL34388). None of the other introns reported so far at position 1926 encode LAGLIDADG endonucleases (21), which probably is the reason no very close relative of the Tsu.bL1926 ORF exists. Considering the structural differences (Fig. 1), and the differences in insertion sites between the two introns, it seems certain that group I introns have been acquired at least twice by the Thermotoga lineage.
Notably, BLAST searches with the 170 nt of 23S rRNA sequence upstream of the insertion site of T. neapolitana NS-E give two cyanobacteria and a chloroplast 23S rRNA as the first hits after T. maritima MSB8 and Fervidobacterium islandicum, and about half of the 50 best hits were to cyanobacteria or chloroplasts. This pattern was not observed when searching with the whole sequenced fragment or with the 100 nt downstream of the insertion site. This pattern could be due to flanking region co-conversion at the time of intron acquisition (4), and if so, implies that the Fervidobacterium lineage also once harbored such introns.
The data presented here raise additional intriguing questions about the transfer and maintenance of these introns and their encoded endonucleases. Lateral transfer (cross-species homing) of group I introns is well documented among eukaryotes (4,5) and seems by far the most plausible scenario for initial acquisition of the Thermotoga introns described here. Transfers across species lines and much larger phylogenetic distances have been observed before, but in most cases a shared niche or some physically intimate relationship between the donor and the recipient can be inferred (3,12). All members of the Thermotoga lineage characterized are free-living, and all are hyperthermophilic or at least thermophilic (optimum growth, 66-80°C) (35). Presumably, they acquired their introns by means of free DNA, viruses, or plasmids encountered where they grow. The very high similarity of the TnaL.b1931 to endonucleases from eukaryotic introns at the same position, together with the fact that the largest reservoirs of group I introns are the nuclei and organelles of eukaryotes (21), suggests that these introns were acquired from a eukaryote. Although it is easy to imagine that hyper-   5. Maximum-likelihood tree of single-LAGLIDADG homing endonucleases carried by group I introns. The 23 first hits to single-LAGLIDADG homing endonucleases in BLAST-X searches, with the Thermotoga sequences as probes, was selected to build the phylogeny. Chloroplast and mitochondrial introns are indicated by a ''c'' or an ''m,'' respectively. Double-LAGLIDADG homing endonucleases were excluded because they have a higher rate of divergence (26). The tree was estimated by using a JTTϩ⌫ model in PHYLIP version 3.6 (17) with a substitution matrix provided by E. Tillier (personal communication), with 10 random additions of the sequences and global rearrangements. The ␣-parameter was estimated in PUZZLE version 4.0 (18). Values at nodes indicate number of times the node was recovered in 100 bootstrap replicates (bold numbers) or PUZZLE support (italic numbers). Insertion position in the 23S rRNA of the group I intron carrying the endonuclease is indicated where this was given in the GenBank entry or available at www.rna.icmb.utexas.edu (21). Specific insertion positions were not available for m. Chlorella vulgaris, m2. Chlorella vulgaris, m. Acanthamoeba castellanii, m2. Acanthamoeba castellanii, and m. Chaetosphaeridium globosum. thermophiles could acquire DNA from mesophiles unfortunate enough to fall into their special environments, it seems unlikely that a mesophilic endonuclease would function, or a that mesophilic intron would be properly spliced, in a hyperthermophilic background. Thermophilic (but not hyperthermophilic) eukaryotes are known (36), and, of course, an intron and endonuclease functioning marginally at the lower limits of a thermophile's range might subsequently adapt to higher temperatures.
Of the 100 substitutions observed among the nine Tna.b1931 endonucleases we have sequenced, at most 19 would introduce an altered amino acid (average ds͞dn in pairwise comparisons ϭ 23.02), suggesting that all are (or have until recently been) under selection for activity at the temperatures at which the organism now encoding them grow. Only two of the nonsynonymous substitutions would affect conserved positions in the internal segment, which covers all but one of the secondary elements in I-CreI (26). Both of these are conservative replacements (I 7 V and K 7 R), found in other endonucleases that cut the same site. Because strong theoretical arguments exist that endonuclease activity is selectively maintained only by and through homing (7), we infer that introns have been actively moving in genus Thermotoga since their introduction, before the diversification of the species studied here. More active within-species than betweenspecies transfer is likely, however, because striking incongruence does not occur between phylogenies for endonuclease and 16S rRNA.
If physiological ''barriers to intron promiscuity'' do exist that explain the paucity of group I introns in other bacterial 23S rRNA genes, they have been relaxed within species of this genus of free-living hyperthermophiles. Thus these introns, in addition to providing models for high-temperature ribozymology and examples for interdomain transfer, might provide a window into hyperthermophile cell and population biology.