Brought to you by:
Paper The following article is Open access

Divergence of an orbital-angular-momentum-carrying beam upon propagation

, , , and

Published 4 February 2015 © 2015 IOP Publishing Ltd and Deutsche Physikalische Gesellschaft
, , Citation Miles J Padgett et al 2015 New J. Phys. 17 023011 DOI 10.1088/1367-2630/17/2/023011

1367-2630/17/2/023011

Abstract

There is recent interest in the use of light beams carrying orbital angular momentum (OAM) for creating multiple channels within free-space optical communication systems. One crucial issue is that, for a given beam size at the transmitter, the beam divergence angle increases with increasing OAM. Therefore the larger the value of OAM, the larger the aperture required at the receiving optical system if the efficiency of detection is to be maintained. Confusion exists as to whether this divergence scales linearly with, or with the square root of, the beam's OAM. We clarify how both these scaling laws are valid, depending upon whether it is the radius of the waist of the beam's Gaussian term or the radius of rms intensity of the beam that is kept constant while varying the OAM.

Export citation and abstract BibTeX RIS

Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

Over the past 20 years there has been a growing interest in the orbital angular momentum (OAM) of light, which is carried by any optical beam possessing helical phase fronts [1]. For phase fronts described by an azimuthally dependent phase factor ${\rm exp} ({\rm i}\ell \phi )$, the OAM is equivalent to $\ell \hbar $ per photon [2]. This OAM of a light beam can be derived in a number of ways, but for our purposes it can be seen to arise from the local direction of the Poynting vector. At all points within the beam, the Poynting vector is perpendicular to the phase front and hence possesses an azimuthal momentum component [3]. Throughout this paper we consider only the case of free-space propagation of light beam. As confirmed by measurement [4], the helical phase fronts of an OAM-carrying beam lead to the local Poynting vector being skewed with respect to the beam axis by an angle β, which is a function of r, the radial distance from the beam axis:

Equation (1)

where ${{k}_{0}}=2\pi /\lambda $. One analytically simple and widely used form of helically phased beams is the Laguerre–Gaussian (LG) modes which have a field amplitude given as the product of a Gaussian term with beam waist w0 (defined as the radius at which the electric field of the Gaussian term is lower by a factor $1/e$), with a generalized Laguerre polynomial with indices $\mid \ell \mid $ and p. The radial index p gives the number of off-axis nodes in the radial direction.

In this work we restrict ourselves to the small numerical aperture case where the size of any spatial features in the modes or their superpositions is significantly larger than the optical wavelength. For p = 0 and $\ell =0$ one has the fundamental Gaussian mode. For p = 0 and $\ell \gt 0$ the LG modes are single-ringed annular modes with an azimuthal phase term ${\rm exp} ({\rm i}\ell \phi )$ and a radius of maximum optical intensity, $r({{I}_{{\rm max} }})$, given as [3]

Equation (2)

where $w(z)={{w}_{0}}\sqrt{1+{{z}^{2}}/z_{R}^{2}}$ and ${{z}_{R}}=\frac{1}{2}kw_{0}^{2}$ is the Rayleigh range (here $k=2\pi /\lambda $). One notes that for high OAM values the size of the LG mode is much larger than the beam waist. The restriction of our study to the paraxial regime also implies that $w(z)\gg \lambda $ and hence that β (see equation (1)) remains small for all radii at which the beam has a non-negligible intensity.

In terms of their complex amplitudes, LG beams with different mode indices are orthogonal to each other and this leads to their potential use in communication applications where the orthogonality of the modes allows for multiple states within a channel or independent channels. A simple optical communication system based on OAM was demonstrated over 10 years ago [5], but recently the idea has been revisited with reports of both high data rates [6] and outdoor demonstrations [7]. One issue of critical importance is the degree to which the use of OAM-carrying beams require the use of larger aperture optics. Confusion exists as to whether the divergence and hence the far-field size of a beam carrying OAM scales linearly or with the square root of its angular momentum.

2. Divergence of an OAM carrying beam

The the geometrically calculated skew angle of the Pointing vector calculated with respect to the beam axis given by equation (1) might be thought to approximate the angular divergence of the beam. However, an additional contribution to the divergence that has to be considered is the normal diffractive spreading arising from the finite beam diameter. Rather than the beam waist w0, or the radius of maximum intensity $r({{I}_{{\rm max} }})$, the divergence of a light beam is governed by the standard deviation of its spatial distribution, rrms [8, 9]. The calculation of these quantities is straightforward. We start from the definition of a LG mode which gives the intensity distribution for the lowest-order radial mode (p = 0) as

Equation (3)

which is normalized such that $\int {{I}_{\ell }}(r,\phi ,z)r{\rm d}r{\rm d}\phi =1$. From this distribution, we can calculate the radial position of the maximum of intensity, by solving the equation $\partial {{I}_{\ell }}/\partial r=0$, obtaining (as anticipated in equation (2) [3])

Equation (4)

However, as $r({{I}_{{\rm max} }})$ is zero for $\ell =0$, clearly one cannot generally use this quantity to derive the angular spread of the beam as it propagates. As has been reported previously, a better quantity on which to base a calculation of the divergence of a light beam is the standard deviation of the spatial distribution of the beam, rrms. This radius is a function of z and it is given by the square root of the radial variance of the intensity distribution [10]:

Equation (5)

Equation (6)

In the plane of the beam waist, z = 0, this radius is

Equation (7)

In any given plane, figure 1 shows the ℓ dependence of $r({{I}_{{\rm max} }})$ and rrms. One notes that

Equation (8)

which tends to 1 as $\mid \ell \mid $ increases, so these two radii become equal for large ℓ, see figure 1.

Figure 1.

Figure 1. A comparison between the radii of maximum intensity, $r({{I}_{{\rm max} }})$, and the rms radius, rrms, of the spatial distribution of a p = 0 LG mode. For large ℓ, the two radii get asymptotically close, as $r({{I}_{{\rm max} }})$ is proportional to $\sqrt{\mid \ell \mid }$, while rrms is proportional to $\sqrt{\mid \ell \mid +1}$.

Standard image High-resolution image

Given the radius ${{r}_{{\rm rms}}}(z)$ as a function of propagation distance, z, we can calculate the corresponding divergence angle of a p = 0 LG mode, ${{\alpha }_{\ell }}$, to be

Equation (9)

Equation (10)

This divergence angle is a function of z, but it asymptotically reaches a limiting value (when the beam is sufficiently far from the waist). Within the paraxial regime, for small ${{\alpha }_{\ell }}$, we can equate the tangent of the angle with the angle and for ${{{\rm lim} }_{z\to \infty }},z/w(z)={{z}_{R}}/{{w}_{0}}$. Hence in the far field, the divergence angle of a p = 0 LG mode is given by

Equation (11)

Equation (11) is a convenient form for expressing the ℓ dependence of the beam divergence while holding w0 of the LG mode constant, revealing the approximate square root scaling of the far-field beam size; see figure 2.

Figure 2.

Figure 2. The divergence of an OAM beam obtained by keeping the rrms radius in the launch plane constant is larger than for an OAM beam obtained by keeping the Gaussian waist w0 constant. Moreover, the first scales as $\mid \ell \mid +1$ whereas the second scales as $\sqrt{\mid \ell \mid +1}$. This log–log plot is given in units of the divergence angle ${{\alpha }_{0}}={{w}_{0}}/\sqrt{2}{{z}_{R}}$ of a fundamental Gaussian beam. The lines serve purely as a guide to the eye.

Standard image High-resolution image

Alternatively the divergence angle of the beam can be written in terms of other quantities; for instance, using equation (7), one can instead express the ℓ dependence of the beam divergence for a constant ${{r}_{{\rm rms}}}(0)$ as

Equation (12)

Under this condition, of a fixed ${{r}_{{\rm rms}}}(0)$, we see that the beam divergence scales linearly with ℓ, see figure 2. Note also for a fixed ${{r}_{{\rm rms}}}(0)$ that w0 itself becomes a function of ℓ and scales as ${{(\mid \ell \mid +1)}^{-1/2}}$.

3. Discussion

Typical of a practical optical system is that the aperture of the optics is fixed by design, with a limiting radius R. Since the radius of maximum intensity of an LG beam can be much larger than w0, in order to design a system capable of using a wide range of different OAM modes one must choose a waist ${{w}_{0}}\ll R$.

One example of an approach that produces modes of a fixed beam waist is the cylindrical-lens mode converter that converts any incoming Hermite Gaussian mode into the corresponding LG mode with the same beam waist [11]. When using a mode converter of this type to produce the LG modes one should observe a square root scaling of the far-field beam divergence, equation (11), i.e. a scaling approximately proportional to $\sqrt{\mid \ell \mid }$.

A more frequently used and flexible method of producing beams carrying OAM is the illumination of a diffractive optical element (computer generated hologram) with an expanded Gaussian mode from a conventional laser. If the diffractive element is a spatial light modulator (or similar) then the displayed kinoform can be updated to change the beam type allowing a switching of the system between different OAM states. Most common of such kinoforms is a forked diffraction grating that produces a helically phased beam in the first diffraction order [12]. In the plane of the diffractive element, the beam has the required helical phase structure, but possesses the Gaussian intensity distribution of the illuminating beam. This beam corresponds to a superposition of different LG modes, all of the same OAM ℓ, but with various values of p. The precise p-weighting of this superposition is a complicated function of both $\mid \ell \mid $ and the value of beam waist w0 chosen to perform the decomposition [13]. An obvious choice of beam waist for the decomposition is the one which maximizes the amount of power in the p = 0 mode, as referred to above this optimum beam waist scales ${{(\mid \ell \mid +1)}^{-1/2}}$. Therefore rather than the resulting beam having a particular $\mid \ell \mid $-dependent beam waist, the $\mid \ell \mid $-invariant quantity is simply the size of the beam incident on the diffraction grating and hence the system has a fixed ${{r}_{{\rm rms}}}(0)$. Therefore when using a forked diffraction grating as the beam generation component within an optical system to produce LG-like modes one should observe the approximately linear $\mid \ell \mid $-scaling of the far-field beam divergence, equation (12). This linear scaling of the beam size as a function of $\mid \ell \mid $ has previously been noted within optical tweezers and microscope systems [14].

We note that the reasoning we have followed above is consistent with an understanding of the scaling in terms of the spatial resolution of an optical system. For example, if one considers a transmitted beam comprising a superposition of $\pm \ell $ modes, then the resulting beam profile is an annular ring of $2\mid \ell \mid $ maxima, or 'petals'. In the far field the spacing between these petals cannot be smaller than the resolving power of the optical system. Since the number of petals scales with $\mid \ell \mid $, this implies that the beam size and hence the beam divergence must also scale with $\mid \ell \mid $. A square root scaling of the beam divergence ${{\alpha }_{\ell }}$ and corresponding decrease in petal separation would seemingly be in conflict with this simple resolution argument. However, a square root scaling requires a constant beam waist w0, which implies a beam size ${{r}_{{\rm rms}}}(0)$ and hence a minimum transmission aperture that must increase with the square root of $\mid \ell \mid $. This increasing aperture itself supports a higher spatial resolution, meaning that the reduced spacing of the resulting petals remains compatible with the resolution limit.

4. Conclusions

In conclusion, we have explained how optical systems exploiting the propagation of light beams carrying OAM can produce a far-field beam divergence that scales either with the square root of the OAM or linearly with the OAM, see figure 3. The square root scaling applies to systems where the beam waist of the Gaussian term, w0, is held constant for all values of ℓ, such as the case where the beams are produced using a cylindrical lens mode converter [11]. The linear scaling of the beam divergence applies to systems where the rms radius of the launch beam is held constant, such as the case where the beam is produced by a simple forked diffraction grating implemented on a spatial light modulator illuminated with a Gaussian beam of fixed beam6 .

Figure 3.

Figure 3. A schematic diagram comparing the far-field divergence for OAM carrying beams where the beams are created with a forked diffraction grating (top) illuminated with a Gaussian beam of fixed beam waist, with the case of a cylindrical lens mode converter (bottom) to transform a Hermite–Gaussian mode also of fixed beam waist.

Standard image High-resolution image

Acknowledgments

We thank the referee for their careful reading of our manuscript and we acknowledge the financial support from the UK EPSRC under a Program Grant, COAM and the ERC under an Advanced Investigator Grant, TWISTS. RWB and FMM acknowledge support from the Canada Excellence Research Chairs programme. MPJL is supported by a Royal Academy of Engineering Research Fellowship.

Footnotes

  • One notes that more sophisticated hologram designs can produce arbitrary combinations of LG modes with any values of ℓ, p and w0 (see for instance M Dennis et al Nat. Phys. 6, 118 (2010)), albeit at greatly reduced efficiency compared to the simple fork design that is widely used.

Please wait… references are loading.