Polarization nulling interferometry for exoplanet detection

We introduce a new concept of nulling interferometer without any achromatic device, using polarization properties of light. This type of interferometer should enable a high rejection ratio in a theoretically unlimited spectral band. We analyze several consequences of the proposed design, notably, the possibility of fast internal modulation. © 2006 Optical Society of America OCIS codes: (120.3180) Interferometry; (260.5430) Polarization; (350.1260) Astronomical optics; (999.9999) Nulling interferometry. References and links 1. M. Mayor and D. Queloz, “A Jupiter-mass companion to a solar-type star,” Nature 378, 355-359 (1995). 2. N. Woolf and J. R. Angel, “Astronomical searches for earth-like planets and signs of life,” Astron. Astrophys. 36, 507-537 (1998). 3. G. W. Marcy and R. P. Butler, “Detection of extrasolar giant planets,” Astron. Astrophys. 36, 57-97 (1998). 4. R. Bracewell, “Detecting nonsolar planets by spinning infrared interferometer,” Nature 274, 780-781 (1978) 5. J. R. Angel, A. Y. S. Cheng and N. J. Woolf, “A space telescope for IR spectroscopy of Earthlike planets,” Nature 232, 341-343 (1986) 6. N. Baba and N. Murakami, “A method to image extrasolar planets with polarized light,” Publ. Astron. Soc. Pac. 115, 1363-1366 (2003) 7. J. Spronck, S. F. Pereira and J. J. M. Braat, “Chromatism compensation in wide-band nulling interferometry for exoplanet detection,” Appl. Opt. 45 (4), 597-604 (2006). 8. R. M. A. Azzam and N. M. Bashara, “Ellipsometry and polarized light” (Elsevier, Amsterdam, 1987) 9. P. Hariharan, “Achromatic and apochromatic halfwave and quarterwave retarders,” Opt. Eng. 35 (11), 3335-3337 (1996). 10. D. Mawet, J. Baudrand, C. Lenaerts, V. Moreau, P. Riaud, D. Rouan and J. Surdej “Birefringent achromatic phase shifters for nulling interferometry and phase coronography,” Proceedings of Towards Other Earths: DARWIN/TPF and the Search for Extrasolar Terrestrial Planets, Heidelberg, Germany, 22-25 April 2003.


Introduction
The first exoplanet has been discovered in 1995 by Mayor and Queloz [1].Since that moment, more than one hundred and fifty planets have been detected within ten years.All of them were found by indirect methods [2,3], which means that only some effects that the planet has on its star have been detected.
Direct detection of an Earth-like exoplanets is not an easy task.Indeed, if our solar system was seen from a distance of 10 pc, the angular separation between Earth and Sun would be equal to 0.5 μrad and the brightness contrast between the star and the planet would be 10 6 at 10 μm and significantly larger (10 10 ) in the visible.
Nulling interferometry [4] seems a quite promising technique up to now.It consists in observing a star-planet system with an array of telescopes, and then combining the light from these telescopes in such a way that, simultaneously, destructive interference occurs for the star light and (partially) constructive interference for the planet light.The ratio between the intensities corresponding to constructive and destructive interference is called the rejection ratio.To be able to detect a planet, this ratio should be of the order of at least 10 6 , with the extra requirement that it should be achieved in a wide spectral band (6)(7)(8)(9)(10)(11)(12)(13)(14)(15)(16)(17)(18) μm or even wider [5]).Indeed, this wide band is required to obtain spectral information from the planet and to optimally exploit the photon flux from the planet.
To reach this high rejection ratio in a wide spectral band, most current nulling interferometers use a (achromatic) phase shifter.In this paper, we present a totally different approach that makes use of the polarization properties of light, leading to a new way to achieve nulling interferometry.Note that a similar analogy has been proposed in visible coronography [6].
In Section 2, we derive the generalized condition to have on-axis destructive interference for an N-telescope array, including the polarization effects.We apply this condition to a two-and a three-telescope configurations.In Section 3, we apply this concept in a wide spectral band and we propose a design of a new type of nulling interferometer.In Section 4, we look at the interference patterns that can be obtained with the proposed design.In Section 5, we establish a criterion to define an acceptable width of the spectral band.In Section 6, we look at the sensitivity of the proposed design with respect to some imperfections and misalignments.Our conclusions are then summarized in Section 7.

Generalized nulling condition
In this section, we derive the general condition to have on-axis destructive interference for an array of N telescopes and we look at some implications of this condition, in the case of a twoand a three-beam nulling interferometer.
Let us consider an array of N telescopes and let us assume that we can apply independent phases and amplitudes φ j and A j to each beam before recombination.To cancel the light from the star, we need on-axis destructive interference.We can show [7] that the condition to have such a destructive interference (nulling condition) is given by We can divide both members of this equation by the factor A 1 exp(iφ 1 ), in such a way that the amplitudes A j and the phases φ j are defined relatively to the amplitude and phase of the first beam.Note that we assumed here that the relative amplitudes A j are not wavelength-dependent but we did not make any assumption about the absolute spectra of the beams.Note also that these relative amplitudes A j and phases φ j could be wavelength-dependent [7].A more general condition can be derived assuming independent states of polarization for each beam.Using Jones formalism [8] to describe polarization, the generalized condition is given by where A x, j and A y, j are complex numbers.This generalized condition has very important consequences since it can lead to a new type of nulling interferometers, as discussed below.

Example 1: Two-beam nulling interferometer
In the case of a two-beam nulling interferometer, the generalized nulling condition in Eq. ( 2) simply amounts to In most current nulling interferometers, this condition is satisfied by applying a π-phase shift between the two beams (φ 2 = φ 1 + π).The condition in Eq. (3) could also be fulfilled without any phase shift but considering a polarization rotation of π ( A 1 = − A 2 ).This is a fundamentally different approach, as it will appear more clearly in the following example.

Example 2: Three-beam nulling interferometer
In this case, we have the following nulling condition.
If we assume that all the beams have the same phase, we have This condition can be fulfilled by rotating the polarization of the beams.For example, if we impose a horizontal linear state of polarization on the first beam, we could satisfy the condition in Eq. ( 5) using This shows that we can satisfy the nulling condition without any phase shifter by only rotating the polarization and consequently cancel the light coming from an on-axis star.If a planet is orbiting around that star, then the planetary light coming from the different telescopes will have different optical path lengths.For that reason, it is interesting to look at the detected intensity as a function of the optical path differences between the three beams.Let us first consider the monochromatic case.The detected amplitude as a function of the optical path differences is given, within a phase factor, by where OPD 21 and OPD 31 are respectively the optical path differences between beams 2 and 1 and between beams 3 and 1.The detected intensity is then given by the square modulus of the amplitude in Eq. (7).
In the case of the example of Eqs.(6), we find the detected intensity depicted in Fig. 1.The rejection ratio, defined as the ratio between the maximal and minimal intensities of the interference pattern, is theoretically infinite.
It is usually thought that beams with different coherent states of polarization cannot interfere with a high contrast.Our example shows that we can make three differently-polarized coherent beams interfere with a theoretically perfect contrast.This is also true for N beams provided that N > 2. The second consequence is that, since the intensity depends on the optical path differences, it should be possible to have constructive interference for the light coming from the planet.The important fact is that the destructive interference takes place at the zero-OPD position.In that case, there is no wavelength-dependent phase difference between the beams.

Applications in wide-band nulling interferometry
In this section, we look at the generalized condition in a wide spectral band in order to design a new type of nulling interferometer.
To reach the amplitudes in Eqs. ( 6) for every wavelength in the spectral band, we would need achromatic polarization rotators.This is possible by combining different waveplates to create achromatic half-wave plates [9] or by using zero-order gratings [10].But we choose a different approach, as explained below.
Suppose a system of N beams, initially horizontally linearly polarized (x direction).Each polarization is then changed using a simple waveplate whose principal axis makes an angle α with the horizontal (see Fig. 2).If T r and T α are the complex transmission coefficients of the waveplate in its principal directions (T r = |T r | and T α = |T α | exp(iφ o−e ), where φ o−e is the phase difference between the ordinary and extraordinary axes), the Jones matrix of that waveplate at an angle α = 0 is given by For an orientation of the waveplate α, the Jones matrix would then be where R (α) is the rotation matrix at an angle α.The polarization state after the waveplate is then given by .
(10) Since we want on-axis destructive interference without any phase difference between the beams and assuming that all the waveplates are exactly the same but with different orientations, Fig. 2. The polarization of each beam (initially linear in the x direction (horizontal)) is changed after a waveplate whose principal axis makes an angle α with the horizontal.
we must satisfy Using a simple waveplate in a wide spectral band, T r and T α will be wavelength-dependent in such a way that the first component of the vector in Eq. ( 11) cannot be equal to zero for every wavelength.The second component, on the other hand, can be canceled achromatically by a good choice of the amplitudes A j and angles α j .If we add, for each beam, a perfect vertical linear polarizer after the wave plate, the amplitude of the j th beam is then given by and the nulling condition simply amounts to This condition is wavelength-independent.Therefore the null is achromatic if we assume identical wavepelates.
In this proposed type of nulling interferometers, each beam encounters a horizontal linear polarizer, a waveplate and a vertical linear polarizer (see Fig. 3).Thus, it should be possible to reach a high rejection ratio in a wide spectral band with simple commercially available components.For example, in the case of a two-beam nulling interferometer and choosing A 1 = A 2 , the condition to have an achromatic null is that α 2 = π − α 1 or α 2 = π/2 + α 1 .In order to maximize the transmission of the interferometer, we can choose α 1 = π/4 and α 2 = 3π/4.This can also be easily applied to a three-beam nulling interferometer by choosing for example Note that we would have obtained similar results if the beams were initially vertically linearly polarized.We then could use a polarizing beam splitter instead of the first linear polarizer and apply the same principle to both outputs of the beam splitter in order to use the whole incoming intensity.

Transmission and modulation
In this section, we look at several direct consequences of the proposed design in the case of a three-beam nulling interferometer.

Transmission map
Let us consider N coplanar telescopes looking in the same direction z (see Fig. 4).The position of the j th telescope is given in polar coordinates by (L j , δ j ).For a point source located at an angular separation from the optical axis θ and at an azimuth angle ϕ, the detected complex amplitude f ϕ (θ ) is given by Note that this expression is not general in the sense that the star which we point at lies on the z-axis.If this was not the case, there would be additional delays that are not taken into account here.Note also that this reasoning is valid for a space mission where longitudinal dispersion can be neglected.For any ground-based interferometer, this effect would decrease the performances of the interferometer.In principle, the function f ϕ (θ ) should also include the polarization and birefringent properties of the individual telescopes.We have not included this effect here because we suppose that the telescopes behave identically in this respect and that, on recombination, these effects are canceled.
If we define the transmission map T ϕ (θ ) as the normalized detected intensity, we have (15)

θ -dependence of the transmission map
A star is not a point source but has some non-negligible finite size.For example, the angular diameter of our sun, seen from a distance of 10 pc, is of the order of 5 nrad.To detect an q j (L 2 cos d 2 ,L 2 sin d 2 ,0) Fig. 4. Array of telescopes (dots) situated in the plane z = 0 and looking in the z direction.
The angles θ and ϕ define the direction of the incoming light.The position of the j th telescope is given in polar coordinates by L j , δ j .
exoplanet, we need not only a high rejection ratio for θ = 0 but also for angular separations θ of a few nrad.The flatter the transmission map around θ = 0, the easier it will be to reach this "extended" rejection ratio.That is why a transmission map proportional to θ 4 or, even better, to θ 6 is preferred.We can show [7] that, in order to have a θ 4 -transmission map, we must satisfy, in addition to the nulling condition in Eq. ( 13), Since this condition should be fulfilled for all angles ϕ, Eq. ( 16) can be split into two different conditions These conditions are different from the θ 4 -conditions for other types of nulling interferometers [7].However, we can show that, in the case of a three-telescope configuration, the only possible configuration to fulfill these conditions is a linear configuration, as it is the case for other nulling interferometers.For exoplanet detection, a linear configuration is less interesting because it only gives information in one direction.A solution to this lack of information would be to rotate the whole array of telescopes but this would give rise to slow modulation, as explained in the next section.We can then conclude, that no interesting three-telescope configuration can fulfill the θ 4 -conditions.

Modulation
Another difficulty could prevent us from directly detecting an Earthlike exoplanet: the possible emission from exo-zodiacal dust near the orbital plane of the planet, as in our own solar system.We, a priori, do not know anything about the exo-zodiacal cloud, but we can assume that it is centro-symmetric.Because of this central symmetry, this problem could be handled by using modulation techniques.A possible solution is to use external modulation, which consists in rotating the whole telescope array around its center, but this gives rise to very slow modulation and it will considerably decrease the number of targets that we can observe during a space mission.A more convenient solution is internal modulation.With this technique, we do not change the positions of the telescopes.Via optical means, we create different transmission maps that we combine in order to create modulation maps.
By changing the angle α j , we can change the "weight" of the amplitude A j .Thus, we can then change the ratio between the amplitudes of the different beams by simply rotating the waveplates, provided that the nulling condition in Eq. ( 13) is satisfied.This has two consequences: the first one is that, with this type of nulling interferometer, we do not need any extra amplitude-matching device, as it is the case in most of current nulling interferometers.The amplitude-matching is inherent to the design and is simply produced by a rotation of the waveplate; the second and much more important consequence is that, since we can change the ratio between the amplitudes of the beams, we can have a continuous set of transmission maps, which could be used for fast modulation.
Fig. 5 shows an example of a set of six transmission maps in the three-telescope case that have been obtained by only rotating the waveplates.In these transmission maps, the maximal intensity has been normalized to a value given by ∑ N j=1 A j /A 1 sin 2α j 2 .Note that this is just an example, out of a continuous range of transmission maps.Nevertheless, it can be shown that any of these transmission maps can be represented by a linear combination of three others.Therefore, three different transmission maps are sufficient to get the whole continuous set (for example, Fig. 5(a), 5(c) and 5(e) or 5(b), 5(d) and 5(f)).We also have to think, in more details, about a modulation strategy between these transmission maps.

Spectral response
In some applications, besides the detection of an Earthlike exoplanet, spectral information of the light coming from the planet is needed in order to study its atmosphere.In this case, a wide spectral band is required.
In the proposed design, if we make the assumption of perfect polarizers and exactly identical waveplates, there is absolutely nothing in the nulling condition in Eq. ( 13) that limits the spectral band, so a high rejection ratio in an infinitely-wide spectral band is not unthinkable.However, in practice, this is not true since polarizers and waveplates are not perfect and are spectrally limited.Furthermore, as we will see in this section, the response of the interferometer is not the same for all wavelengths, that is, the detected intensity is wavelength-dependent.
If we assume identical waveplates for each beam, the detected intensity will be proportional to independently of the optical path length differences between the beams.The intensity for the constructive interference is then also proportional to Eq. ( 18), which, in the case of a perfect wave plate is proportional to where Δφ is the phase difference between the two states of polarization induced by the waveplate.If we furthermore consider conventional waveplates (as opposed to achromatic waveplates), we have The intensity is then maximum for Δφ = (2n + 1)π (half-wave plate) and equal to zero for Δφ = 2nπ, where n is an integer.This shows that some wavelengths will be well transmitted, while others will not be transmitted at all.
The criterion that we chose to define the acceptable spectral band is then that all the wavelengths should be transmitted with at least half the maximal intensity, which leads to the following condition We assume that the birefringence is constant in the spectral band.This is not a very realistic assumption but we can show, in the example of quartz, that it does not drastically affect the criterion.In Fig. 6, we compare the spectral response of the interferometer in the visible domain in the case of quartz waveplates and in the case of constant-birefringence approximation.We can see that the approximation is not very good but in both cases, the acceptable spectral band is of the same order of magnitude.Furthermore, the birefringence is chosen in such a way that the waveplate is a half-wave plate for the wavelength λ 0 .We then have The minimal and maximal wavelengths in the acceptable spectral band are then given by We can characterize the bandwidth by defining We can see that the bandwidth is maximum if we use zero-order waveplates (n = 0), which in this case gives M = 3.For example, in the infrared region, this technique would allow us to work from 6 to 18μm.The spectral band will then probably be limited by the polarizers.Obviously, the acceptable spectral band can be wider if we use achromatic waveplates (see Fig. 6).
Note that the presence of polarizers does not necessarily imply losses if we use achromatic waveplates and a polarizing beam splitter to separate the two orthogonal states of polarization.In this way, both outputs of the beam splitter can be used in an apart set-up.

Sensitivity to imperfections and misalignments
In this section, we will see how sensitive the proposed set-up is with respect to misalignments and imperfections and how these defects affect the rejection ratio.
Let us first derive a general expression for the on-axis detected intensity.We assume that each polarizer could have its own imperfection, that is its own extinction ratio, ε k, j , k = 1 or 2 (ε k, j represents the imperfection of the first or second polarizer for the j th beam) and that each waveplate could be different for each beam (T r, j and T α, j ).The amplitude of the j th beam is given by Fig. 6.Spectral response of the interferometer in the visible domain in the case of quartz waveplates (dash-dot lines) and in the case of constant-birefringence approximation (solid lines).We also compare zeroth-order (blue lines), first-order (green lines) and second-order (red lines) waveplates.The magenta solid line represents the spectral response in the case of an achromatic waveplate made of a combination of quartz and magnesium fluoride.
After some calculations, we find that the on-axis detected intensity is given by 2 sin 2α j (T r, j − T α, j ) + ε 1, j T r, j sin 2 α j + T α, j cos 2 α j 2 . (26) Hereafter, we will analyze each defect separately.For all the simulations, we used the following parameters for the perfect case

Amplitude and phase mismatchings
Let us first consider an amplitude mismatching A l = A l,0 + δ A l .We can show that the on-axis detected intensity is then given by The rejection ratio obtained with an amplitude mismatching for the first beam is depicted in Fig. 7.The amplitude mismatching should be of the order of 10 −3 or lower in order to have a rejection ratio of 10 6 .
If we consider now a phase mismatching A l = A l,0 exp(iδ φ l ), we will find the same expression for the on-axis intensity, We can then draw the same conclusion for the rejection ratio (not depicted here since it is identical to the rejection ratio obtained with an amplitude mismatching).The phase mismatching should be lower than 10 −3 , which is easily obtained in the infrared.

Imperfections of the polarizers
Imperfections of the polarizers are modeled by setting ε 1, j and ε 2, j different from zero.These numbers can be complex, giving rise to a certain ellipticity in the polarization state.We chose all the ε 1, j and ε 2, j in such a way that their moduli are a random number between 0 and 10 −3 and the phases are also randomly chosen.We then look at the average rejection ratio that can be obtained.The results are plotted in Fig. 8.
The rejection ratio is in average slightly higher than 10 6 .The "amplitude imperfections" of the polarizers should then also be of the order of 10 −3 , which is not easy to satisfy in a wide spectral band.Note that this requirement is less stringent if we use achromatic waveplates instead of conventional waveplates.Indeed, if we consider identical quasi-achromatic waveplates (T α,1 = T α,2 = T α,3 = T α = exp(i (π + δ )), where δ << π) and identical imperfect polarizers which shows that imperfections of the polarizers can be compensated by very achromatic waveplates and inversely, chromatic waveplates can be used combined with very good polarizers.

Rotation of the waveplates
We consider a small additional angle in the rotation of the l th waveplate, we then have α l = α l,0 + δ α l .After calculations, we find  This shows that the rejection ratio will be much less sensitive to waveplate rotations if cos 2α l = 0, as shown in Fig. 9(a).The required accuracy in waveplate rotation should not be the limiting factor in an actual set-up.

Differential birefringence
Suppose that the birefringence of the l th waveplate is slightly different from the birefringence of the other waveplates, B l = B 0 + δ B. The detected intensity is then given by As shown in Fig. 9(b), we should have δ B/λ ≤ 5 × 10 −4 in order to have a rejection ratio higher than 10 6 , which should be easier to reach in the infrared than in the visible region of the spectrum.

Conclusions
We have derived an expression for the generalized N-beam nulling condition, which includes amplitude, phase and polarization.We have applied this condition to the cases of a two-and a three-beam nulling interferometer.We have shown that interferometry with beams with different coherent states of polarization is possible with a theoretically perfect contrast.
We have seen that we can theoretically reach an infinite rejection ratio in the monochromatic case without any phase shifter, using only polarization rotation.In a wide spectral band, we can still have a high rejection ratio without any phase shifter, but we need an extra device, which could be an achromatic polarization rotator.The approach we show involves only commercial elements: linear polarizers and waveplates.We thus have introduced a totally new type of nulling interferometers, which should allow a high rejection ratio, without any achromatic device.We have derived an expression for the nulling condition of the newly-designed nulling interferometer, which should be fulfilled only by rotating waveplates.
We have looked at the θ -dependence of the transmission map and have seen that, as in regular nulling interferometers, the only three-telescope configuration to have a θ 4 -transmission map is the linear configuration, which is not very interesting for exoplanet detection.As conclusion, the proposed design does not need any amplitude-matching device and is also very suitable for fast internal modulation.Indeed, it allows to obtain a continuous set of transmission maps, by only rotating the waveplates.But a modulation strategy still has to be investigated.
Another interesting aspect of these calculations is that there is no theoretical limit for the width of the spectral band for which we reach a deep null.However, the transmission is not equal for all wavelengths.We have established a criterion to define the acceptable spectral band in such a way that the transmission over the whole spectral band is higher than half the maximal transmission.We have seen that this criterion could lead to a relatively wide band in the infrared.The ultimate limit of the spectral band will probably be set by the transmission of the linear polarizers and the waveplates.
In order to complete the analysis, we also have looked at the sensitivity of the proposed design to some imperfections and misalignments.We have seen that the factor limiting the rejection ratio will probably be the imperfections of the polarizers.Indeed, rotations can be reached very accurately.Phase mismatching and differential birefringence can be a problem in the visible region but are much less important in the infrared.In conclusion, using the approach presented in this paper, we have shown that it should be possible to reach a high rejection ratio without any achromatic device, thus opening a new promising way to detect Earthlike exoplanets.

Fig. 1 .
Fig. 1.Normalized detected intensity (simulation) as a function of the optical path differences (OPD) between the three beams.

Fig. 3 .
Fig. 3. Design of a new type of nulling interferometer, each beam encounters a horizontal linear polarizer, a waveplate and a vertical linear polarizer.

Fig. 7 .
Fig. 7. Rejection ratio as a function of an amplitude-mismatching δ A 1