Photon statistics in single molecule orientational imaging

Optical techniques in single molecule imaging rely heavily on photon counting for data acquisition. Extraction of information from the recorded readings is often done by means of statistical signal processing; however, this requires full knowledge of the photoelectron statistics. In addition to counting statistics we include a specific form of random signal variation, namely reorientational dynamics, or wobble, to derive the general probability density function of the number of detected photons. The relative importance of the two factors depends upon the total number of photons in the system, and results are given in all regimes.

© 2007 Optical Society of America

OCIS codes: (000.5490) Probability theory, stochastic processes, and statistics; (110.4280) Noise in imaging systems; (180.2520) Fluorescence microscopy

References and links
1. K. D. Weston and L. S. Goldner, “Orientation imaging and reorientation dynamics of single dye molecules,” J. Phys. Chem. B 105, 3453-3462 (2001).
2. D. M. Warshaw, E. Hayes, D. Gaffney, A. M. Lauzon, J. R. Wu, G. Kennedy, K. Trybus, S. Lowey, and C. Berger, “Myosin conformational states determined by single fluorophore polarization,” Proc. Natl. Acad. Sci. U.S.A. 95, 8034-8039 (1998).
3. H. P. Lu, L. Y. Xun, and X. S. Xie, “Single molecule enzymatic dynamics,” Science 282, 1877-1882 (1998).
4. R. E. Dale and S. C. Hopkins, “Model-independent analysis of the orientation of fluorescent probes with restricted mobility in muscle fibers,” Biophys. J. 76, 1606-1618 (1999).
5. T. M. Jovin, M. Bartholdi, W. L. C. Vaz, and R. H. Austin, “Rotational diffusion of biological macromolecules by time-resolved delayed luminescence (phosphorescence, fluorescence) anisotropy,” Ann. N.Y. Acad. Sci. 366, 176-196 (1981).
6. T. Ha, T. Enderle, D. F. Ogletree, D. S. Chemla, P. R. Selvin, and S. Weiss, “Probing the interaction between two single molecules: Fluorescence resonance energy transfer between a single donor and a single acceptor,” Proc. Natl. Acad. Sci. U.S.A. 93, 6264-6268 (1996).
7. T. Ha, J. Glass, T. Enderle, D. S. Chemla, and S. Weiss, “Hindered rotational diffusion and rotational jumps of single molecules,” Phys. Rev. Lett. 8


Introduction
Single molecule detection (SMD) has become an important technique in recent years for studying dynamic processes, such as chemical reactions and molecular motions, at a fundamental level [1]. Historically these processes have been studied using methods based on ensemble averaging over a sample of molecules; however, such methods rest on the fundamental assumption that the sample is homogeneous. This ergodic hypothesis is invalid for many biological and chemical systems. Studies on single molecules are thus advantageous, since information such as the statistical distribution of a particular quantity is not lost by averaging.
Single molecule imaging techniques, such as fluorescence microscopy, can also be used to track biomolecular motions. This has applications in the pharmaceutical industry, where a good understanding of processes such as protein folding [2] and molecular motion [3] is vital to new drug development. Relevant tracking techniques often employ fixed-site fluorescent probe molecules [4] whose properties, such as orientation and position, vary according to their local environment. These changes can be detected with a suitable experimental setup [5,6].
Optical techniques in single molecule imaging almost always require the use of photon counting, since individual fluorescent molecules are very weak light sources. Under these conditions the accuracy of measurements is limited by random variations in the measured signal, and statistical processing must often be used to extract the desired information. This, however, requires a good understanding of the random processes present. In this work we therefore determine the probability density function (PDF) of the number of photons detected during a measurement of finite length, as a means of describing the full statistical nature of the detection process.
In the next section we consider the importance of different contributions to photon statistics and find that both noise and statistical signal variations must be considered. In Section 3 we therefore derive a general expression for the PDF of photon numbers allowing for both influences. In Section 4 we consider one form that signal fluctuations may take, namely those arising from the random rotations that fluorophores may undergo, a phenomenon we refer to as wobbling. Wobble of fluorophores, which within the framework of classical electrodynamics can be considered as electric dipole emitters, can be either a continuous angular variation or a series of discrete orientational jumps [7]. We use the Laplace transform to obtain the PDF of the time averaged intensity when changes occur discretely. Successive jumps may or may not depend on previous dipole orientations, and we consider both cases. Limiting forms for both slow and fast wobble are also given. Further results are then derived for continuous variation in dipole orientation, which is found to have a different functional form from the discrete case. A short discussion of the results is finally given in Section 5.

Signal-to-noise considerations
Statistical fluctuations in the number of detected photons can derive either from noise present in a system or from random variations in the signal itself. The relative importance of these sources is conveniently parameterised by the signal-to-noise ratio (SNR). In the context of single molecule imaging Basché [8] states that the practically obtainable SNR can be approximated by

$$\mathrm{SNR} = \frac{D\,q\,(\sigma/A)\,(P t_0/E_p)}{\sqrt{D\,q\,\sigma P t_0/(A E_p) + C_b P t_0 + N_d t_0}}, \qquad (1)$$

where $D$ is an instrument dependent collection factor typically ranging from 1-8%, $q$ is the fluorescence quantum yield, $\sigma$ is the peak absorption cross-section, $P$ is the laser power, $t_0$ is the integration time, $A$ is the beam area, $E_p$ is the energy of a photon in the beam, $C_b$ is the background count per watt of excitation power (typically around $2\times10^{8}$ photons/Ws in confocal experiments) and $N_d$ is the dark count rate of the detector. Figure 1 shows the behaviour of the SNR over a range of experimental conditions, from which it can be seen that a value no better than around 15 dB is to be expected. Consequently the noise properties of the detection process play an important role in determining the statistical behaviour of the detected signal.
Fig. 1. Contour plot of SNR (dB) vs laser power and focused spot size (as parameterised by the NA of a focusing lens) assuming a wavelength of 395 nm and the following parameter values based on use of GFP: $D = 7\%$, $q = 0.79$ [9], $t_0 = 0.01$ s, $C_b = 2\times10^{8}$ photons/Ws, $N_d = 50$. For numerical apertures greater than unity we have assumed an oil immersion lens of refractive index 1.5. Saturation effects are included such that $\sigma = \sigma_0/(1 + I/I_s)$, where $\sigma_0$ was taken to be $4\times10^{-16}$ cm$^2$, $I = P/A$ and the saturation intensity $I_s$ was assumed to be $10^{3}$ W/cm$^2$ [10]. The inset shows the variation of the SNR versus integration time for a 5 mW laser focused through a 0.95 NA lens.
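As a rough numerical illustration of the SNR estimate discussed above, the following Python sketch evaluates a shot-noise-limited form, SNR = S / sqrt(S + C_b P t_0 + N_d t_0) with S = D q (σ/A)(P t_0/E_p), using the parameter values quoted in the caption of Fig. 1. This form, the treatment of N_d as a rate in counts per second, the spot model (a disc of radius 0.61λ/NA) and all function names are assumptions made for illustration; they are not taken from the computation behind the figure.

```python
import math

H = 6.62607015e-34  # Planck constant (J s)
C = 2.99792458e8    # speed of light (m/s)


def spot_area_cm2(na, wavelength_m=395e-9):
    """Hypothetical spot model: disc of radius 0.61 * lambda / NA."""
    radius_cm = 0.61 * wavelength_m / na * 100.0
    return math.pi * radius_cm ** 2


def snr(power_w, area_cm2, t0_s, wavelength_m=395e-9, D=0.07, q=0.79,
        sigma0_cm2=4e-16, i_sat=1e3, c_b=2e8, n_d=50.0):
    """Assumed form: SNR = S / sqrt(S + background + dark counts)."""
    e_p = H * C / wavelength_m                      # photon energy (J)
    intensity = power_w / area_cm2                  # I = P / A (W/cm^2)
    sigma = sigma0_cm2 / (1.0 + intensity / i_sat)  # saturated cross-section
    signal = D * q * (sigma / area_cm2) * power_w * t0_s / e_p
    background = c_b * power_w * t0_s               # background counts
    dark = n_d * t0_s                               # dark counts (n_d as a rate)
    return signal / math.sqrt(signal + background + dark)


if __name__ == "__main__":
    area = spot_area_cm2(0.95)
    for t0 in (1e-3, 1e-2, 1e-1):
        value = snr(5e-3, area, t0)
        print(f"t0 = {t0:g} s  SNR = {value:.2f}")
```

Because every term under the square root scales linearly with t_0, this assumed form gives an SNR that grows as the square root of the integration time, so longer integration helps only slowly.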

Probability density function of the number of detected photons
Data acquisition in single molecule experiments is invariably done by means of photon counting, in which the predominant source of noise is quantisation noise. Denoting the number of photons arriving at the detector during a measurement of duration $t_0$ by $N(t_0)$, the output reading is of the form $I_{out} = GN$, where $G$ is some gain factor. The arrival of photons at the detector is a Poisson random process [11] and we can hence write the probability mass function of $N$ as

$$p_N(n) = \frac{[I(t_0)\,t_0]^{n}}{n!}\exp[-I(t_0)\,t_0], \qquad (2)$$

where $p_N(n)$ denotes the probability that $N(t_0) = n$. We note here that we use the convention whereby an upper case letter denotes a random process and/or variable, whilst the lower case equivalent denotes a particular outcome. $I(t_0)$ is the average rate of arrival of photons (intensity) or, equivalently, the time average of $\mathcal{I}(t)$, where $\mathcal{I}(t)$ is the instantaneous rate of arrival of photons at the detector at time $t$, i.e.

$$I(t_0) = \frac{1}{t_0}\int_0^{t_0}\mathcal{I}(t)\,dt. \qquad (3)$$
As an example, many experimental setups use polarisation sensitive methods (e.g. [12]) whereby the instantaneous intensity of the detected signal, $\mathcal{I}(t)$, is proportional to the square of the dot product of the illuminating field and the electric dipole moment, giving

$$\mathcal{I}(t) = A\cos^2[\theta(t) - \beta], \qquad (4)$$

where $\theta(t)$ is the transverse orientation of the dipole at time $t$, $\beta$ is the transverse angle of the plane of polarisation of the incident light and $A$ is a constant. For a stationary dipole Eq. (2) fully describes the photon statistics at the detector; however, a change in dipole orientation will cause a change in the time averaged intensity $I$. If this change is random, the arrival of photons at the detector, and hence their subsequent detection, is termed a doubly stochastic process. Possible sources of such randomness include fluctuations in the illuminating light source and/or movement of the molecule. It is this latter factor that we concentrate on here, since for tracking applications the molecule's environment is unlikely to be static. Furthermore we consider only orientational changes, since probe molecules are normally rigidly fixed to targets. Under these circumstances $I(t_0)$ is a random variable and the probabilities given by Eq. (2) differ for each possible value. As such we recast Eq. (2) by conditioning the probabilities on a particular outcome $i(t_0)$, i.e.

$$p_N(n|i) = \frac{(i\,t_0)^{n}}{n!}\exp(-i\,t_0), \qquad (5)$$

where $p_N$ is now a conditional probability and we have dropped the functional dependence on $t_0$ for clarity. Assuming knowledge of the random nature of the time average, as characterised by its PDF $f_I(i)$ (see Section 4), we can use the identity [11]

$$f_{N,I}(n,i) = p_N(n|i)\,f_I(i) \qquad (6)$$

to find the joint PDF of $N$ and $I$, i.e. the probability that $N = n$ and $I = i$. Integrating over the joint PDF gives the PDF of the number of detected photons,

$$p_N(n) = \int_0^{\infty}\frac{(\eta\,i\,t_0)^{n}}{n!}\exp(-\eta\,i\,t_0)\,f_I(i)\,di, \qquad (7)$$

where we have also included the non-ideal nature of the detector by introduction of the quantum efficiency $\eta$. Eq. (7) is equivalent to averaging the conditional probability with respect to the average intensity, and requires knowledge of $f_I(i)$, which is discussed in the following section.
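The averaging described by Eq. (7) can be checked numerically by replacing the integral over $f_I(i)$ with a weighted sum over sampled intensities, so that the photon-count distribution becomes a mixture of Poisson distributions with means $\eta i t_0$. The sketch below is a minimal illustration under that discretisation; the function names and the angular spread used in the example (a dipole fixed during each measurement at an angle uniform within ±0.2 rad of θ = 0) are hypothetical choices, not values from the text.

```python
import math


def poisson_pmf(n, mean):
    """Poisson probability of n counts, evaluated in log space for stability."""
    if mean == 0.0:
        return 1.0 if n == 0 else 0.0
    return math.exp(n * math.log(mean) - mean - math.lgamma(n + 1))


def photon_count_pmf(n, intensities, weights, t0, eta=1.0):
    """Discretised Eq. (7): average the conditional Poisson PMF over f_I(i)."""
    total = sum(weights)
    mix = sum(w * poisson_pmf(n, eta * i * t0)
              for i, w in zip(intensities, weights))
    return mix / total


def moments(pmf_values):
    """Mean and variance of a PMF tabulated for n = 0, 1, 2, ..."""
    mean = sum(n * p for n, p in enumerate(pmf_values))
    var = sum((n - mean) ** 2 * p for n, p in enumerate(pmf_values))
    return mean, var


if __name__ == "__main__":
    A, t0, beta = 1e5, 1e-3, math.pi / 4
    # Hypothetical f_I: angle uniform on [-0.2, 0.2] rad, fixed per measurement.
    K = 201
    thetas = [-0.2 + 0.4 * k / (K - 1) for k in range(K)]
    intensities = [A * math.cos(th - beta) ** 2 for th in thetas]
    mixed = [photon_count_pmf(n, intensities, [1.0] * K, t0) for n in range(250)]
    fixed = [poisson_pmf(n, A * math.cos(beta) ** 2 * t0) for n in range(250)]
    print("with angular spread (mean, var):", moments(mixed))
    print("stationary dipole   (mean, var):", moments(fixed))
```

For the stationary dipole the variance equals the mean, as expected for a Poisson variable, whereas any spread in the intensity makes the variance exceed the mean.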

Discrete reorientational jumps
In this section we turn our attention to determining the PDF of the time averaged intensity, $f_I(i)$. We first consider the case when changes in the orientation of a dipole occur discretely. This could, for example, be associated with the desorption and readsorption of fluorophores from and onto a glass surface [12]. In what follows we shall talk of an electric dipole in an orientational state, by which it is meant that the dipole makes an angle $\theta$ to the x-axis in the x-y plane, as illustrated in Fig. 2. The dipole then remains fixed at this angle for a time $\tau$ before moving to a new state. It is this transverse angle on which the signal in many experimental techniques depends (cf. Eq. (4)). Techniques based on structured illumination and total internal reflection [13,14] do, however, exist in which the signal depends upon the full three dimensional orientation of a dipole, although these are often restricted to very specific circumstances. Here we restrict our discussion to a two dimensional system for simplicity. Conceptually the full three dimensional situation is identical and requires only minor mathematical modifications, as is discussed in the appendix. Assuming that $M$ different orientational states are occupied during a single measurement, the time averaged intensity is given by

$$I = \frac{1}{t_0}\sum_{j=1}^{M} A\cos^2(\theta_j - \beta)\,\tau_j, \qquad (8)$$

where $\theta_j$ and $\tau_j$ are the parameters corresponding to the $j$th occupied angular state. Without loss of generality the dipole is assumed to be initially orientated parallel to the x-axis. It should further be noted that changes in the dipole angle are assumed to occur instantaneously.
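From the law of rare events [15] we can assert that $M$ is a Poisson random variable or, equivalently, that the length of time a dipole remains in each state is distributed according to an exponential law, i.e.

$$f_\tau(\tau) = \nu\exp(-\nu\tau), \qquad (9)$$

where $f_\tau(\tau)$ denotes the PDF of $\tau$ and $\nu$ is the average rate at which dipole jump events occur. Since a measured intensity is always positive we may use the Laplace transform to find the PDF of $I$. The Laplace transform $X^*(s)$ of a random variable $X$ is defined as

$$X^*(s) = \mathcal{L}[f_X(x)] = \int_0^{\infty} f_X(x)\exp(-sx)\,dx, \qquad (10)$$

where $\mathcal{L}[\cdot]$ denotes the Laplace transform and $f_X(x)$ is the PDF of $X$. From [11] and Eq. (8), the Laplace transform for $f_I(i)$, given that $M = m$ states are occupied, is

$$I^*_{M=m}(s) = \prod_{j=1}^{m} Z_j^*(s/t_0), \qquad (11)$$

where we have let $Z_j = A\cos^2(\Theta_j - \beta)\,\tau_j$. The PDF of the average intensity $I$ is then given by

$$f_I(i) = \sum_{m} p_M(m)\,f_I(i|m), \qquad (12)$$

where $p_M(m)$ is the probability that $M = m$, $f_I(i|m) = \mathcal{L}^{-1}[I^*_{M=m}(s)]$, and the weighted summation over the possible values of $M = m$ is required since the number of reorientations during a measurement is random.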
With this knowledge in hand it remains to find an explicit expression for $Z_j^*(s)$. From Eqs. (4) and (10) we can write

$$Z_j^*(s) = \int\!\!\int f_{\Theta,\tau}(\theta,\tau)\exp[-sA\cos^2(\theta-\beta)\,\tau]\,d\tau\,d\theta, \qquad (13)$$

where $f_{\Theta,\tau}(\theta,\tau)$ is the joint probability distribution of $\Theta$ and $\tau$. Since dipole angle and state occupancy time are independent, this is given by the product of the marginal probability distributions $f_\Theta(\theta)$ and $f_\tau(\tau)$. Using Eq. (9) and integrating over $\tau$ we can then write

$$Z_j^*(s) = \int f_\Theta(\theta)\,\frac{\nu}{\nu + sA\cos^2(\theta-\beta)}\,d\theta. \qquad (14)$$

The physical process governing the random wobble of the electric dipole will dictate the form of the probability distribution for $\Theta$. For example, rebinding of a fluorophore to a probe site may be modelled using a uniform PDF, Eq. (15), for which standard integration tables [16] give an analytic result, Eq. (16). Finding a full analytical result for $f_I(i)$ would be complicated; however, in the limits of small and large $\nu$ we can find simpler results. These limits correspond to only a few, and to many, events per measurement respectively. As the rate at which events occur decreases, the contribution from later terms in Eq. (12) becomes negligible. In the limit of slow wobble, $\nu t_0 \ll 1$, only the first term produces a significant contribution; we can consider the dipole as fixed during a single measurement and hence

$$f_I(i) = f_{\mathcal{I}}(i), \qquad (17)$$

i.e. the PDF of the average intensity is the same as the PDF of the instantaneous intensity, in agreement with intuitive expectations.
When dipole wobble is on a time scale much shorter than the duration of a measurement, i.e. for large $\nu$, we must consider many terms in our summation. Since each value of $\tau_j$ is independent, each $Z_j$ term is also independent. There are then two cases to consider: that in which each subsequent value of $\theta$ is independent of the last, and that in which it is not. In the former case we can invoke the Central Limit Theorem, which states that the PDF of a sum of independent, identically distributed random variables tends to a Gaussian distribution as the number of terms increases. As such the PDF of the average intensity in the limit of large $\nu$ tends to the Gaussian form of Eq. (18). Assuming dependence of consecutive terms means that the PDF of the dipole angle $\theta_j$ is centred on its previous outcome, $\theta_{j-1}$. For a particular realisation of $\theta$, that is to say one possible outcome of the sequence of dipole orientations, the result can be written as Eq. (19). When averaged over all possible realisations the result is similar to Eq. (15), except that now the width of the distribution increases with each subsequent jump. Consequently the condition of identical distributions required for validity of the Central Limit Theorem is not satisfied. If, however, the Lyapunov condition [17] is satisfied then the Central Limit Theorem still applies. Numerical simulations show that this is the case.
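The slow and fast limits described above can be explored with a small Monte-Carlo sketch of Eq. (8) for independent jumps. The code below draws exponentially distributed dwell times with rate ν, Eq. (9), truncates the final state at the end of the measurement so that the dwell times sum to $t_0$, and draws every angle (including the first) uniformly on [0, π). The uniform range, the random initial angle, the truncation rule and the function name are assumptions made for this illustration only.

```python
import math
import random
import statistics


def time_averaged_intensity(nu, t0, A, beta, rng):
    """One realisation of the time average for discrete, independent jumps."""
    remaining, acc = t0, 0.0
    while remaining > 0.0:
        theta = rng.uniform(0.0, math.pi)            # assumed angular PDF
        tau = min(rng.expovariate(nu), remaining)    # exponential dwell, cut at t0
        acc += A * math.cos(theta - beta) ** 2 * tau
        remaining -= tau
    return acc / t0


if __name__ == "__main__":
    rng = random.Random(2007)
    t0, A, beta = 1e-3, 1e5, math.pi / 4
    for jumps_per_measurement in (0.1, 1.0, 10.0, 100.0):
        nu = jumps_per_measurement / t0
        samples = [time_averaged_intensity(nu, t0, A, beta, rng)
                   for _ in range(5000)]
        print(f"nu*t0 = {jumps_per_measurement:6.1f}  "
              f"mean = {statistics.fmean(samples):9.1f}  "
              f"std = {statistics.pstdev(samples):9.1f}")
```

In this sketch the spread of the time averaged intensity stays close to that of the instantaneous intensity when there are few jumps per measurement, and shrinks as the number of jumps grows, consistent with the averaging argument behind the Central Limit Theorem.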

Continuous angular variation
Changes in dipole orientation may also occur continuously, and here we consider how this affects the PDF of the time averaged intensity. It can be shown [18] that the PDF of the orientation of the dipole at a time $t$, $f_\Theta(\theta,t)$, satisfies a diffusion equation, Eq. (20), subject to the initial condition $f_\Theta(\theta,t=0) = \delta(\theta-\theta_0)$, where $\delta$ represents the Dirac delta function. This diffusion equation holds when subsequent orientations are dependent on the previous orientation. Its solution, Eq. (21), applies to all time scales over which $\theta$ may vary, as set by the diffusion coefficient $\alpha$.
To find the PDF of the average intensity we first use a transformation of variables to find the PDF of the instantaneous intensity, $f_{\mathcal{I}}(i)$, which we then need only integrate over the length of a measurement to give our desired result, Eq. (22). In that expression $\theta_k$ are the solutions to the equation $i = A\cos^2(\theta-\beta)$, and a factor of $1/t_0$ ensures correct normalisation of the PDF. The integral can be evaluated using the substitution $x^2 = t^{-1}$ and integration by parts, which yields Eq. (23), expressed in terms of the complementary error function erfc(·).
For the independent case $f_\Theta(\theta,t)$ cannot depend on time (assuming the physical cause of the wobble does not vary in time) and as such Eq. (22) reduces to $f_I(i) = f_{\mathcal{I}}(i)$.
Figure 3(a) shows a histogram of the result of Monte-Carlo simulations with $10^4$ realisations for continuous variation and a diffusion coefficient of $\alpha = 5$. Various theoretical fits, based on Eq. (23), are also drawn, from which it can be seen that for $\alpha = 0$ (no dipole wobble) the PDF is identical to that of a Poisson distribution, as would be expected. Good agreement can also be seen between the simulated and theoretical results.
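A Monte-Carlo experiment of the kind described above can be sketched as follows. Each realisation diffuses the dipole angle as a discretised random walk, time-averages $A\cos^2(\theta-\beta)$ over the measurement and then draws a Poisson-distributed photon count with mean $\eta I t_0$. The convention Var[θ(t)] = 2αt, the step count, the Poisson sampler (Knuth's multiplication method) and the function names are assumptions of this sketch and may differ from the simulation actually used for Fig. 3.

```python
import math
import random
import statistics


def poisson_sample(mean, rng):
    """Knuth's multiplication method; adequate for the modest means used here."""
    limit, k, prod = math.exp(-mean), 0, rng.random()
    while prod > limit:
        k += 1
        prod *= rng.random()
    return k


def simulate_counts(alpha, A, beta, theta0, t0, steps, eta, rng):
    """One realisation: diffuse theta, time-average the intensity, draw counts."""
    dt = t0 / steps
    sigma = math.sqrt(2.0 * alpha * dt)  # assumed convention: Var[theta] = 2*alpha*t
    theta, acc = theta0, 0.0
    for _ in range(steps):
        acc += A * math.cos(theta - beta) ** 2 * dt
        theta += rng.gauss(0.0, sigma)
    i_avg = acc / t0
    return i_avg, poisson_sample(eta * i_avg * t0, rng)


if __name__ == "__main__":
    rng = random.Random(3)
    A, beta, theta0, t0 = 1e5, math.pi / 4, 0.0, 1e-3
    for alpha in (0.0, 5.0):
        counts = [simulate_counts(alpha, A, beta, theta0, t0, 50, 1.0, rng)[1]
                  for _ in range(10_000)]
        print(f"alpha = {alpha:3.1f}  mean = {statistics.fmean(counts):6.2f}  "
              f"variance = {statistics.pvariance(counts):6.2f}")
```

With α = 0 the counts should show a variance close to their mean, while a non-zero α adds variance on top of the Poisson contribution.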
Furthermore, using these PDFs it is possible to calculate the cumulative probability of $N$ taking any value below $n$, as plotted in the inset of Fig. 3(b) as a function of $n$. Confidence levels including or neglecting dipole wobble can then be calculated. Assuming the values $\beta = \pi/4$, $\theta_0 = 0$, $A = 10^{5}$ photons/s, $t_0 = 10^{-3}$ s and $\alpha = 5$, we calculated that when neglecting dipole wobble an experimental measurement can determine the orientation of a dipole within a range of 1.78° with 90% confidence. Inclusion of dipole wobble causes this to increase to 2.43°. Such a discrepancy further highlights the need to include dipole wobble in statistical processing and error analysis.

Discussion
It has been shown that the variation of the orientation of a dipole over the course of a finite duration measurement can alter the statistical properties of the number of photoelectrons induced in a photon counting detector. Although analytic evaluation of Eqs. (7) and (12) will in general not be possible, we can make some general observations regarding the probability functions involved.
Considering first the PDF of the time averaged intensity, we can expect different forms and behaviour for differing dependence conditions and time scales. More specifically, although the distribution will always be peaked around the initial angle, when subsequent orientations of the dipole depend on earlier positions the distribution is narrower for slower changes, whilst the converse is true when independence holds. This can be understood since the dependent situation is essentially a diffusion problem, and so the larger the ratio of spreading rate (as given by the diffusion coefficient) to integration time, the larger the range of angles the dipole can cover during a measurement. On the other hand, the distribution sharpens when successive orientations are independent, since the central probability peak for each $Z$ term is reinforced with each additional term in the average.
For wobbling on faster time scales the PDF of the average intensity has been shown to tend to a bell-shaped Gaussian distribution (for discrete variations). Slower time scales will exhibit a sharper, more centralised distribution, since for small $\nu$, i.e. slow variation, only a few terms contribute significantly to the average performed by the detector. In this case the peaked nature of both the exponentially distributed state occupancy times and the Poisson PDF for the number of events per measurement dominates. For larger $\nu$ the Poisson PDF becomes smoother and the position of its peak moves to larger $m$. Low $m$ terms of Eq. (12) are then negligible and the peaked nature of the exponential PDF is less dominant. Eventually the Poisson PDF itself tends to a Gaussian, whereby it acts as an envelope for the PDF of the average dipole angle.
In terms of the photoelectron statistics, we can say that for smaller angular ranges of dipole oscillation one would expect less deviation from conventional Poissonian behaviour. Furthermore, if the variations are on a time scale much longer than the integration time then the additional random behaviour will be unobservable. On the other hand, if fluctuations are much faster than the detector response the effects are again likely to go unnoticed. That said, dipole wobble has been seen at many different time scales, ranging from the subnanosecond level [19], through the millisecond regime [5] and higher [7,20]. In conjunction with the varying time resolution of different experimental setups [21] and the large angular ranges over which fluorophores can oscillate (e.g. 26° has been observed [22]), it is likely that non-Poisson behaviour will be frequently encountered.
Finally we draw attention to Fig. 3(b), which shows a plot of the expected variance of experimental data as a function of the number of photons in the system (as parameterised by $A$), when signal variations from photon counting and dipole wobble are considered separately. Quadratic behaviour can be seen for the case of dipole wobble only, whilst for photon counting the linear behaviour expected from a pure Poisson random variable is evident. The relative importance of the two factors can thus be seen. At very low light intensities, where it is likely to be impractical to conduct experiments, photon counting dominates. In the intermediate regime both influences are comparable, until eventually at higher intensities the molecular wobble dominates.
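The linear and quadratic scalings noted above follow from the law of total variance applied to a doubly stochastic Poisson count: Var(N) = η t_0 E[I] + (η t_0)² Var(I). Writing I = A c, with c the time average of $\cos^2(\theta-\beta)$, the counting term grows linearly in A and the wobble term quadratically, and the two are equal at A* = E[c] / (η t_0 Var(c)). The short sketch below tabulates both terms; the values chosen for E[c] and Var(c) are hypothetical, picked only to illustrate the crossover, and the function names are likewise illustrative.

```python
def variance_terms(A, t0, mean_c, var_c, eta=1.0):
    """Return (counting, wobble) contributions to Var(N) for I = A * c."""
    counting = eta * t0 * A * mean_c          # linear in A (Poisson part)
    wobble = (eta * t0 * A) ** 2 * var_c      # quadratic in A (signal part)
    return counting, wobble


def crossover_A(t0, mean_c, var_c, eta=1.0):
    """Signal strength at which the two contributions are equal."""
    return mean_c / (eta * t0 * var_c)


if __name__ == "__main__":
    t0 = 1e-3
    mean_c, var_c = 0.5, 3.3e-3   # hypothetical moments of the time-averaged cos^2
    print(f"crossover at A = {crossover_A(t0, mean_c, var_c):.3g} photons/s")
    for exponent in range(3, 8):
        A = 10.0 ** exponent
        counting, wobble = variance_terms(A, t0, mean_c, var_c)
        print(f"A = 1e{exponent}  counting = {counting:10.3g}  wobble = {wobble:10.3g}")
```

Below the crossover the counting term dominates and the statistics are close to Poissonian; above it the variance is governed mainly by the orientational fluctuations.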
Our work has detailed the statistical behaviour in each of these regimes (Eqs. (2), (7) and (12) or (23) respectively) and allows further development of suitable signal processing algorithms and estimation procedures, so as to maximise the precision of angular information extracted from experimental data and thus help to fulfil the potential of single molecule techniques.

Appendix
When considering the discrete case this means Eq. (13) becomes a triple integral

$$Z_j^*(s) = \int\!\!\int\!\!\int f_{\Theta,\Phi}(\theta,\phi)\,f_\tau(\tau)\exp[-s\,\mathcal{I}(\theta,\phi)\,\tau]\,d\tau\,d\phi\,d\theta; \qquad (26)$$

however, all of the subsequent working remains unchanged. In the continuous case we must solve the two dimensional diffusion equation to give the joint PDF which, when integrated according to the three dimensional analogue of Eq. (22), yields Eq. (27), in which $\Gamma(a,z) = \int_z^{\infty} x^{a-1}e^{-x}\,dx$ is the incomplete Gamma function and $\theta_k$ and $\phi_k$ are the solutions to the equation $i = \mathcal{I}(\theta,\phi)$.

Fig. 2. An electric dipole p has a transverse orientation described by the angle $\theta$. Here z describes the optical axis.

Fig. 3. (a) Histogram of the time averaged intensity for a dipole undergoing continuous angular diffusion with $\alpha = 5$, $\beta = \pi/4$, $t_0 = 10^{-3}$ s and $A = 10^{5}$ photons/s, shown with theoretical fits for differing diffusion coefficients. (b) Variance of the number of detected photons as a function of the peak signal strength $A$, and the cumulative probability functions for wobbly and stationary dipoles (inset), for the same parameter values as (a).