Experimental demonstration of superresolution of partially coherent light sources using parity sorting

Analyses based on quantum metrology have shown that the ability to localize the positions of two incoherent point sources can be significantly enhanced through the use of mode sorting. Here we theoretically and experimentally investigate the effect of partial coherence on the sub-diffraction limit localization of two sources based on parity sorting. With the prior information of a negative and real-valued degree of coherence, higher Fisher information is obtained than that for the incoherent case. Our results pave the way to clarifying the role of coherence in quantum limited metrology.

Analyses based on quantum metrology have shown that the ability to localize the positions of two incoherent point sources can be significantly enhanced through the use of mode sorting. Here we theoretically and experimentally investigate the effect of partial coherence on the sub-diffraction limit localization of two sources based on parity sorting. With the prior information of a negative and real-valued degree of coherence, higher Fisher information is obtained than that for the incoherent case. Our results pave the way to clarifying the role of coherence in quantum limited metrology.

I. INTRODUCTION
The resolution of imaging systems is limited by the size of the diffraction-limited point spread function (PSF) [1]. To quantify this resolution, the Rayleigh criterion has been proposed and widely used [2]. Recently, the analysis of optical resolution has been recast in terms of Fisher Information (FI) [3][4][5], which quantifies the precision of measurements and is inversely proportional to the parameter estimation error. Generally, the FI of the estimation of separation δ between two spatially incoherent point sources depends on the type of measurement performed on the image plane field. In the case of direct detection of image plane intensity, the FI goes to zero as δ − → 0, an effect termed as Rayleigh's curse. In their seminal work [3], Tsang et al. showed that Rayleigh's curse can be overcome if the optical field is detected by an appropriate spatial mode demultiplexer (SPADE), given prior knowledge of two equally bright and incoherent point sources versus a single emitter. The FI for such a scheme is constant as δ − → 0, as has been verified experimentally [6][7][8][9][10].
The sources, however, can have a non-zero coherence between them [14]. In fact, spatial coherence is a key parameter affecting the resolution of imaging systems [15]; coherent illumination techniques can offer enhanced resolution in microscopy [16] and two-point direct imaging [17,18]. Moreover, coherence imaging can offer significant practical advantages over conventional direct imaging systems, for example in the very long baseline radio interferometry (VLBI) used for black hole imaging [19]. It is then natural to ask how spatial coherence between the two sources affects the resolution obtained by SPADE. Recent theoretical works have extended the scope of the two-point estimation problem to include the general case of partial coherence among the two sources [11][12][13][20][21][22]. In particular, it was shown that Rayleigh's curse can still be avoided for a known degree of spatial coherence γ [11,12,22]. For the case of γ < 0, an even greater sensitivity for SPADE was predicted than the incoherent case. The increased sensitivity needs to be carefully interpreted, taking into account photon budgeting considerations [13]. Experimental demonstration of SPADE with partial coherence, however, has been lacking. The main result of our work is to experimentally demonstrate the breaking of Rayleigh's curse for partially coherent light sources using SPADE. In doing so, we also distill and connect the different elements of previous theoretical works.
In Section II, we derive the classical FI of our experimental setup for partially coherent fields. Special attention is paid to a priori assumptions and how they affect the obtained FI. The connection between previous works is also made clear in this section. Section III explains the experimental setup, the generation of spatial coherence, and a discussion of estimation statistics. Section IV summarizes the results.

II. THEORY
In this section we outline the calculation of the classical FI for parity sorting of the partially coherent field.
FIG. 1. Expected FI for Parity Sorter plotted versus δ/σ. A higher FI corresponds to a lower estimation error. a: FI prediction for the case when δ is the only unknown parameter. For this case, γ = −1 gives the highest FI (dashed blue line on top of the γ = −0.99 curve), as predicted by the Tsang-Nair model [11]. b: FI prediction for the case of unknown input photon number N0. For this case, the FI is zero for |γ| = 1. As γ − → −1, the FI curve gets concentrated near δ = 0, but is still bounded above by twice the FI for γ = 0. The curves in (a,b) are normalized by the object plane photon numbers. c: FI prediction normalized by the image plane photon number for the case of unknown N0. These curves are related to the curves in (b) by the weight factor of (1 + γd) as explained in the text. As γ − → −1, this image plane FI diverges and gets concentrated around δ = 0, a result which was predicted using a quantum calculation in [12,13]. As explained in the text, the information conveyed by curves (b,c) is the same. Note that the γ = 0 curves (green line) are same in all the figures.
Note that parity sorting falls under the scheme of binary SPADE (BSPADE), which is a family of measurements that simplifies SPADE at the cost of losing large-delta (δ > σ) information [3,23]. For δ ≪ σ, it has been shown that a measurement of the even and odd projections of the input field has an FI that converges to the quantum optimal FI [7,9]. We show explicitly how different a priori assumptions yield different FI curves. The physical problem is the following: Two point sources separated by δ and having a degree of spatial coherence γ are imaged by an imaging system with a finite-sized aperture. The goal is to perform quantum-limited estimation of δ in the sub-Rayleigh regime by performing parity sorting on the image plane field.
A partially coherent field is described by its crossspectral density (CSD) function W (x 1 , x 2 ) [14]. To proceed, we first note that W can be decomposed via the coherent mode decomposition (CMD) [24]. For our problem, the simplest choice of modes is to decompose the W in the symmetric (in phase) and antisymmetric (out of phase) combinations of the two sources. In the image plane, W (x 1 , x 2 ) is given as where N 0 is the average image plane photon number emitted by each point source, κ is a space-invariant efficiency factor dictated by the aperture loss, φ k (x) = f + (x) − e ikπ f − (x) are the symmetric (k = 1) and antisymmetric (k = 2) coherent modes, f ± (x) = f (x ± δ/2) are the two point spread functions separated by δ -the parameter to be estimated, p k is a real number such that 0 ≤ p k ≤ 1, and p 1 + p 2 = 1. In what follows, the terms even and odd modes are used interchangeably with symmetric and antisymmetric modes. We assume Gaussian The total number of photons in the image plane is given by is the overlap integral of the two shifted PSFs, and γ = p 1 − p 2 is an effective degree of spatial coherence between the two sources. It is here that we first encounter the departure from the incoherent estimation problem; for γ = 0, N t depends on the parameter δ to be estimated. Hence, it is necessary to spend some time clarifying the interpretation of the FI for partially coherent sources. For a parity sorter, the photon numbers in the even and odd ports are, respectively, Equations (2,3) are derived in the supplement. We assume that γ, κ are known a priori. If we know N 0 and the only unknown in the experiment is δ, then assuming Poisson statistics it can be shown [3] that the FI for parity sorting is given by where the subscript δ denotes that δ is the unknown parameter. Note that F δ (δ, γ) is normalized by 2N 0 κ, the total object plane photons multiplied by the loss factor. F δ (δ, γ) is plotted in Fig. (1a), and κ has been absorbed into N 0 for the plot. These curves show that the highest FI is achieved for γ = −1. The physical operation of parity sorting affords some intuition about this FI behavior. For γ = −1, all photons are routed to the odd port, and we have N 1 = 0 and N 2 = 2N 0 κ(1−d).
Knowing the total emitted photon number 2N 0 and the total detected photon number N 2 allows us to estimate δ directly. For δ ≪ σ, the power in the odd port is well approximated as N 0 κ(1 − γ)δ 2 /8σ 2 . Thus for sub-Rayleigh separation, the odd port has the most photons for γ = −1, and hence the highest FI. It is not uncommon, however, that an experimentalist only has access to image plane photons, and does not have knowledge of N 0 . When both δ and N 0 are unknown, the FI is found from the multiparameter Cramer-Rao bound (CRB); this FI is given by and is plotted in Fig. (1b). Note that as γ − → −1, the FI is effectively zero for all δ = 0 and γ = −1. Figures (1a,b) clearly show how the knowledge or ignorance of the object plane photon number affects the FI for δ estimation in the presence of partial coherence.
We can now ask the more practical question of how to estimate δ when we only detect the image plane field, and have no knowledge of N 0 ? In this case one can use the normalized modal weights p 1,2 = N 1,2 /N t which are independent of N 0 . The statistics are described in this case by a binomial likelihood function [25]. We can then calculate the image plane FI by the formula where the subscript 'img' denotes image plane and the function is plotted in Fig. (1c). We emphasize that F img is normalized per image plane photon; physically, Eq. (6) quantifies the information provided by a single photon in the image plane, and is agnostic to the number of object plane photons. Figure (1c) then shows that given equal number of photons in the image plane, γ < 0 can offer increased sensitivity in the regions δ ≪ σ. Note that Eq. (6) is related to Eq. (5) by a simple 'weight' factor of (1 + γd), which also relates the image and object plane photon number in Eq. (2). While the image plane FI might increase for γ < 0, more object plane photons are needed to maintain a constant image plane photon number, a 'cost' that is captured by the factor of (1 + γd). The image plane FI is also zero for γ = −1, in which case all clicks occur at the odd port for all δ. If the experimentalist does not know N 0 , they do not get any information about δ from just measuring clicks at the odd port. In any case, Figs. (1b) and (1c) give the same information, as there is a one-to-one correspondence between the two curves. Alternatively, the lowerbound on the variance of an unbiased estimator can equivalently be found either from Eq. (5) or Eq. (6). Incidentally, the aforementioned discussion provides clarity to the debate between, among others, the Tsang-Nair (TN) model [11] and the Larson-Saleh (LS) model [12]. Strictly speaking, the TN model assumes knowledge of N 0 , while the LS model assumes an unknown N 0 . Specifically, Fig. (1a) agrees with the TN model, and Fig. (1c) agrees with the LS model. Figure (1b) bridges the TN and LS models. We note that Hradil et. al. [13] also advocated the use of the weighted version of image plane FI to take into account the image plane photon number variation with γ, δ, and their results also imply the curves in Fig. (1b). Depending on the a priori assumptions afforded by the experimental setup, either TN or LS models will correctly describe the estimation statistics. Note that a similar observation has been made for coherent microscopy [26], which advocates the 'mandatory inclusion of information about underlying a priori assumptions' when discussing resolution claims.
Having clarified the issue of the FI interpretation for partial coherence, we can now proceed to discuss the experiment. Realistically, we will use the image plane model as it reflects a common situation in imaging, microscopy, and astronomy. Note that realistic situations have more than just δ and N 0 as possible unknowns. For example, our analysis till now has assumed the presence of only two sources, equal intensities of the two sources, a known centroid of the objects to which the parity sorter is aligned, and, most importantly, a known γ. In practice, one needs a combination of direct imaging, coherence interferometry, and parity sorting to estimate these unknown parameters. The application of quantum metrology-inspired ideas such as SPADE to practical situations is an active field of research [27][28][29][30]. These considerations, however, are not relevant to our proof-of-principle experiment in which we consider only δ and N 0 as the unknown parameters.

A. Spatial Coherence Generation
We use a parity sorter to perform SPADE on two spatially partially coherent sources. To generate partial coherence, we use the CMD [24]. Physically, such a CMD means that the spatial coherence at the input plane to the SPADE setup can be engineered by incoherently mixing appropriately scaled symmetric and antisymmet- of the coherent modes is sent to a Michelson type image inversion interferometer, which separates the even and odd components of the input field. One arm has a 4f system, which acts as an identity operator after the beam double passes it. The other arm has a 2f system, implemented by a convex mirror, and an extra quadratic phase, not shown, to cancel the defocus due to diffraction. This arm implements the transformation (x, y) − → (−x, −y). The combined beams from both arms are imaged onto a bucket detector. The power in the even and odd modes can be measured by setting the phase difference θ to 0 and π respectively. In the experiment, all modes used are symmetric about the y axis such that E(x, −y) = E(x, y). The interferometer then works as a parity sorter in the x-direction.
ric modes. This can be realized by adding a path difference between coherent modes that is larger than the laser coherence length. Alternatively, we can 'switch' between the modes in time, with the switching time longer than the laser coherence time, and add the recorded intensities digitally [31,32]. The CMD therefore allows us to generate spatial coherence 'offline', by performing the intensity summation electronically. To generate an intensity distribution corresponding to a specific γ in Eqs.
(3), we can post-select from a set of recorded intensities of φ 1,2 modes. This allows a great simplification of the experiment with respect to the precise control of γ. Note that we are not changing the temporal coherence properties; all the beams used are quasimonochromatic and therefore temporally coherent.

B. SPADE using parity sorting
After generating partially coherent fields, the next step is to perform parity sorting on the field described by Eq. (1). The experimental setup consists of an image inversion interferometer that sorts the input field based on its parity, as shown in Fig. (2). A Gaussian beam with σ = 327 ± 4µm is converted into either a symmetric or antisymmetric mode using linear optics, which includes a spatial light modulator. The beam flux can be adjusted using polarization optics. The mode is presented to a Michelson type interferometer. The top arm, which includes a 2f imaging system and an extra quadratic phase implemented to cancel the defocus due to diffraction, implements the transformation (x) − → (−x) and the arm with the 4f system images the field with unity magnification, after two reflections. Experimental details of the interferometer are described in Ref. (33). For parity sorting, we set α = π in Eqs. (1-3) of Ref. (33). The field at the output of the interferometer is where k = 1, 2, θ is the global phase difference between the two arms of the interferometer, N k is the photon number in the input mode φ k , and each φ k is spatially coherent in Eq. (7). Note that the coherent modes used are symmetric in y, so the 1D analysis is valid for the experiment.
To project onto the even and odd components of the field, we can choose θ = 0, π. As explained in Section III A, we send only one of the coherent modes φ k at a given time. To generate CSD for a given γ, we add the measured intensities offline. Details of the offline coherence generation are given in the supplement. For θ = 0(π), all of the symmetric (antisymmetric) mode power will be directed to the bucket detector, while the antisymmetric (symmetric) mode will destructively interfere at the detector. For θ = 0(π) the output is called as the even (odd) port. A bucket detector measures the photon number in each port.

C. Estimation Statistics
The goal of superresolution is to estimate δ for regions of δ < σ. To estimate δ, we use maximum likelihood estimation (MLE) on the measured normalized modal weights p 1,2 . Because we normalize the modal weights by the image plane photons, we use a binomial likelihood function for the parity sorter [25]. The estimatedδ is shown in Fig. (3a). Note that all the estimated δ's are below the Rayleigh limit (δ = σ). For δ in the interval [0.2 − 1]σ (in increments of 0.1σ), we take 100 images each of the symmetric and antisymmetric modes, thus getting 100 ML estimates and the corresponding variance. We have not observed any bias in the estimates, FIG. 3. a: Estimated shiftδ/σ for γ = 0, −0.75 using MLE on the measured modal weights. The estimated shifts are all below the Rayleigh limit (δ = σ). Each point represents the mean MLE of 100 measurements. The error bars are too small to be noticed on the graph, but are still bounded by the CRB as shown in (b). Note that the γ = 0 estimates are not biased; to distinguish the two data sets, we introduce a vertical offset between the γ = 0 and the γ = −0.75 estimates. Both γ = 0 and γ = −0.75 estimates are in good agreement with the expected shifts. b: Measured MSE for γ = 0 (green triangles) and γ = −0.75 (red crosses). For each data point, ML estimates from 100 trials were used to calculate the variance. Note that for a given δ/σ, the MSE for γ = −0.75 is consistently less than the MSE for γ = 0. The dashed green and solid red lines indicate the CRB for γ = 0, −0.75 respectively. The CRB is given by the inverse of Eq. (6). Technical noise factors causing the discrepancy between theory and experiment are explained in the main text.
as evident in Fig. (3a), where the mean of the estimates are equal to the true value of δ/σ. The variance in the MLE estimates, which is related the inverse of the FI, is too small to be noticed in Fig. (3a). Nevertheless, the variance of an unbiased estimator is lowerbound by the Cramer-Rao bound (CRB), which is related to the inverse of the FI. Formally, Var[δ] ≥ (N t F img ) −1 , where Var[δ] is the variance in the MLE estimatorδ, and F img is the image plane FI as given by Eq. (6) and shown in Fig. (1c). Figure ( shows that the MSE for γ = −0.75 is below the CRB for the γ = 0 case. In other words, not only is Rayleigh's curse avoided for γ = −0.75, the estimation is more precise than the incoherent case of γ = 0. Note that the MSE are still offset from the CRB. To truly saturate the CRB, the system must be shot noise limited, and any other noise source will raise the MSE. Another source of noise in our system are the phase fluctuations in the interferometer when it is biased at θ = 0 or π (See Fig.  (2)). Furthermore, the MSE for γ = 0, −0.75 might appear correlated, for example at δ = 0.2, 0.3. This is because the same set of images are used for CMD of both γ = 0, −0.75, and hence both γ = 0, −0.75 MSE's will be affected by the same phase fluctuations; if the γ = 0 MSE is higher, so will be the γ = −0.75 MSE. Finally, the CRB curves in Fig. (3b) are nearly equivalent to the quantum CRB predicted for δ < σ [12], and therefore our measurements represent near quantum-limited localization of partially coherent sources.
The reader might observe that no statistics for |γ| = 1 are shown in Fig. (3). As discussed in Section (II), the FI for |γ| = 1 is zero for all δ if N 0 is unknown. The likelihood function in this case is independent of δ for |γ| = 1, and hence δ cannot be estimated in principle. N 0 is unknown in our experiment because we generate the image plane field directly through unitary transformations and not through a Gaussian aperture that scales the coherent modes according to the (1±d) factor in Eqs.
(3). While our system has an effective 'aperture' loss factor that connects the source photon number to the image plane photon number, this loss factor is independent of δ for the coherent modes generated by the SLM, as also reported in the Supplement. The experiment is the generalization of previous localization experiments on incoherent beams [7,9]. This technique allows 1) a great experimental simplification with regard to avoiding the need to perform precise fabrication of point sources with different separations and 2) to circumvent issues of low photon budget and spurious diffraction effects from the source geometries. However, this technique fails to provide access to an effective object plane photon number which is related to the image plane photon number by the factor of (1 + γd), and hence does not allow us to reconstruct results of Fig. (1a). Barring these technical difficulties, our theoretical and experimental results are easily generalized to the case of a known N 0 . We note that having access to only the image plane photon number is however a common situation in optical physics, where one does not have an independent probe on the object plane photons. Our results are therefore valid for a large variety of microscopy and imaging experiments. The details of image processing, CMD, the photon number in Fig. (3) versus δ, and mode generation are given in the Supplement.

IV. CONCLUSION
We have carried out a theoretical analysis of superresolution of partially coherent light using parity sorting. For partially coherent sources, the object plane photon number was identified as a relevant parameter that affects the obtainable FI, and that connects the different results of previous works [11][12][13]. We also performed parity sorting on two Gaussian PSFs with varying degrees of spatial coherence. Our results show that partial anticorrelation of the two sources increases the FI of δ estimation. Therefore, Rayleigh's curse can be avoided for partially coherent sources. The proof-of-principle experiment paves the way to using coherence as a resource in quantum-limited metrology. Our analysis assumes a real, known value of γ. Further studies could include concurrent estimation of δ and γ, for which a vanishing FI with δ − → 0 is predicted [20,22]. The natural extension of the current work is to consider the more realistic case of multiparameter estimation of a complex γ, the centroid and intensity ratio of the two sources [22], and the effects of cross-talk in the SPADE setup [27][28][29]34]. While we have been primarily concerned with the twopoint problem, the technique of SPADE can also tackle the more general problem of imaging an extended object scene. There the problem reduces to estimation of moments of the object in the sub-diffraction limit, a case which was treated for incoherent objects [4,[35][36][37].
It is an open question as to how these theoretical works generalize to the case of partially coherent object distributions.

B. Experimental Details
Please refer to the supplement for the experimental details about the field generation, the measurement of the modal weights, mode intensity versus δ, and the data processing. The supplement is available under the 'ancillary files' link on the arXiv page.