Automatic cryo-EM particle selection for membrane proteins in spherical liposomes

https://doi.org/10.1016/j.jsb.2014.01.004Get rights and content

Abstract

Random spherically constrained (RSC) single particle reconstruction is a method to obtain structures of membrane proteins embedded in lipid vesicles (liposomes). As in all single-particle cryo-EM methods, structure determination is greatly aided by reliable detection of protein “particles” in micrographs. After fitting and subtraction of the membrane density from a micrograph, normalized cross-correlation (NCC) and estimates of the particle signal amplitude are used to detect particles, using as references the projections of a 3D model. At each pixel position, the NCC is computed with only those references that are allowed by the geometric constraint of the particle’s embedding in the spherical vesicle membrane. We describe an efficient algorithm for computing this position-dependent correlation, and demonstrate its application to selection of membrane-protein particles, GluA2 glutamate receptors, which present very different views from different projection directions.

Introduction

Random Spherically Constrained (RSC) single particle reconstruction (Jiang et al., 2001, Wang and Sigworth, 2010) is a useful approach for studying the structures of membrane proteins in membrane environments. The proteins are reconstituted into spherical lipid vesicles (liposomes), which in turn are frozen in vitreous ice and imaged by electron cryomicroscopy (cryo-EM). As in all single-particle methods, the automatic “picking”, that is selection, of protein particles in micrographs is an important first step in three-dimensional structure determination. The embedding in spherical lipid vesicles presents both disadvantages and advantages. Vesicles are large (∼30 nm diameter) but ideally have only one or two protein particles embedded each one, yielding a low overall density of particles in a micrograph. This low density has made the problem of particle selection particularly important, as in our experience a typical 4 k × 4 k pixel micrograph contains only about 50 particles. This means that many micrographs must be taken and processed to obtain large datasets. Further, the low density of particles means that the relative density of “non-particles” such as frost balls or other contaminants is high. A further disadvantage is that, to avoid distortion of vesicles, the ice layer must be relatively thick, larger than the diameter of the vesicles; thus the high contrast that is observed with very thin ice layers in cryo-EM specimens cannot be exploited.

On the other hand, an advantage arises from the fact that no particles exist in regions between vesicles, and at a given position relative to the center of a vesicle an embedded particle presents a restricted set of views. We sought to design a particle-picking algorithm that efficiently exploits these geometric constraints.

A popular approach to automatic particle picking involves the search of a micrograph using one or more templates (reference images) using the local correlation function (Roseman, 2003, Zhu et al., 2004, Voss et al., 2009) sometimes combined with other criteria for improved detection (Chen and Grigorieff, 2007, Langlois et al., 2011). The local correlation function is also known in the image-processing literature as the normalized cross-correlation (NCC). The program described here computes the NCC, and another selection criterion derived from the NCC, using reference images that are projections of an initial 3D model for the particles to be discovered. Due to the geometric constraint of embedding a protein particle in a spherical vesicle, two of the three Euler angles of orientation of a particle can be determined, within a fourfold ambiguity, from the position of the particle’s projection in the micrograph. Here we adapt an eigenimage-expansion strategy for multiple-template searching (Sigworth, 2004) to exploit these constraints.

As a test case we consider the GluA2 glutamate receptor, a 400 kDa membrane protein. We simulate images using its known X-ray crystal structure (Sobolevsky et al., 2009), and search for particles in cryo-EM images of GluA2 reconstituted into lipid vesicles.

The following notation is used in this paper. An upper-case variable denotes a matrix or an image; a bold-face variable denotes a vector, while lower-case variable denotes a scalar. The squared norm or “power” of an image X having np pixels with values xi is defined asX2=i=1npxi2,and CX denotes operator C operating on X.

Section snippets

Spherically-constrained projection geometry

We denote the position of a particle on a spherical vesicle by angles α,β,γ (Fig. 1) which, unlike the standard definition of Euler angles for electron microscopy (Heymann et al., 2005) are defined as three successive rotations of the particle rather than rotations of the coordinate system. The projection direction is the -z axis and the projection plane is the (x,y) plane. The definition of the angles is different from the one we used previously (Jiang et al., 2001) and is designed such that a

Experimental and simulated data

As a test of the algorithm, we consider images of the tetrameric GluA2 glutamate receptor reconstituted into lipid vesicles containing the lipids POPC, DOPS and cholesterol in the ratio 8:1:1. Vesicles were adsorbed to a thin (∼5 nm) carbon film overlaying a holey-carbon film and plunge-frozen. Micrographs were taken on a Tecnai F20 microscope and recorded with a Gatan Ultrascan 4000 CCD camera at an effective pixel size of 1.7 A˚. The images shown here were merged from three exposures at defocus

Discussion

We describe here a geometry-aware particle picker for lipid-vesicle-embedded membrane proteins. At each pixel position the particle-selection algorithm tests for the existence of a particle whose center of mass lies on that position. In this process only those reference images (projections of a 3D particle model) are considered that are appropriate to that pixel location in view of the constraint that the particle is oriented normal to the membrane surface. The expansion of each reference image

Acknowledgments

We thank K. Duerr and E. Gouaux (Vollum Institute, Portland, OR) for the purified GluA2 protein, Y-Y. Yan and Y. Yang for reconstituting it into lipid vesicles, and H. Shigematsu for cryo-EM imaging. We are grateful to H. Tagare and A. Barthel for helpful discussions. This work was supported by NIH grant R01NS21501 to F.S.

Cited by (20)

  • New software tools in EMAN2 inspired by EMDatabank map challenge

    2018, Journal of Structural Biology
    Citation Excerpt :

    Each of these features are available when running refinements in EMAN2 from the GUI (e2projectmanager.py) or command line via e2refine_easy.py. The problem of accurately identifying individual particles within a micrograph is as old as the field of CryoEM, and there are literally dozens of algorithms, (Liu and Sigworth, 2014; Scheres, 2015; Woolford et al., 2007; Zhu et al., 2004) to name a few, which have been developed to tackle this problem. These include some neural network-based methods (Ogura and Sato, 2001; Wang et al., 2016).

  • Big data in cryoEM: automated collection, processing and accessibility of EM data

    2018, Current Opinion in Microbiology
    Citation Excerpt :

    A schematic for the pipeline up to tomographic reconstruction is depicted in Figure 2. Although automated 3D particle picking algorithms [25,50–52] are routinely used for identifying single particles in vitrified ice for cryoET, for more complex systems it is still common to identify objects of interest manually due to the very high noise content in tomograms, high heterogeneity between particles, and/or low information content due to the missing wedge [53–60]. Particles are commonly identified first in sub-sampled tomograms comprising approximately 1 gigabyte of data, and then extracted from fully sampled tomograms.

  • SuRVoS: Super-Region Volume Segmentation workbench

    2017, Journal of Structural Biology
    Citation Excerpt :

    Model or template based searches can be considered a type of segmentation. Recent advances in cross-correlation based template searches have been successfully used to identify protein complexes (Asano et al., 2015; Förster et al., 2010; Liu and Sigworth, 2014), and to semi-automatically segment filaments such as actin or microtubules, and membranes (Rigort et al., 2012). However, in each of these cases, an accurate, correctly scaled, a priori model is necessary, limiting their usefulness.

  • Structural biology of glutamate receptor ion channel complexes

    2016, Current Opinion in Structural Biology
    Citation Excerpt :

    However, the recent crystal structure of TRPV6, which resolved large cytoplasmic domains that were disordered in cryo-EM TRPV channel structures, suggests that crystallography will continue to make major contributions to solving membrane protein structures [39]. In principle, cryo-EM structures can also be solved for ion channels in lipid vesicles allowing voltage control via imposition of ion gradients, although to date the resolution obtained is much lower than with conventional single particle techniques [40,41]. Finally, in the distant future, in-cell EM tomography might be applicable to iGluRs embedded in their native membrane environment [42].

  • Statistical modeling and removal of lipid membrane projections for cryo-EM structure determination of reconstituted membrane proteins

    2016, Journal of Structural Biology
    Citation Excerpt :

    We increase the visibility of particles by acquiring focal-pairs of exposures and merge them to form a composite micrograph (Fig. 1c). In this merged image the centers and approximate radii of vesicle images are determined, and particles are selected (Fig. 1d) as described previously (Liu and Sigworth, 2014). Meanwhile, a model of the vesicle projections is constructed (Fig. 1e) and used to remove each vesicle’s membrane projection from the micrograph (Fig. 1f).

  • Emerging structural insights into the function of ionotropic glutamate receptors

    2015, Trends in Biochemical Sciences
    Citation Excerpt :

    Other problems may be the absence of lipids or the presence of detergents in the samples and lack of voltage control. Application of new techniques such as visualization of liposome-reconstituted ion channels using cryo-EM, as done in the study of the Big potassium (BK) channel, might help to resolve this issue because the liposome can provide a more physiological lipidic environment and even a voltage control [45,46]. Another central question addressed in recent studies is what happens to the LBD in the pre-open state.

View all citing articles on Scopus
View full text