Optical Diffraction in Close Proximity to Plane Apertures. IV. Test of a Pseudo-Vectorial Theory

Rayleigh’s pseudo-vectorial theory of the diffraction of polarized light by apertures which are small compared to the wavelength of light is analyzed with respect to its mathematical rigor and physical significance. It is found that the results published by Rayleigh and Bouwkamp for s-polarized incident do not obey the conditions assumed in their derivation and must therefore be dismissed. It is also found that the theory leads to paradoxical predictions concerning the polarization of the diffracted field, so that the pseudo-vectorial approach is intrinsically incapable of describing polarization effects.

. The Rayleigh-Bouwkamp diffraction problem. A very narrow aperture A in a metallic screen S which is located in the xy-plane is illuminated by a monochromatic, p-or s-polarized plane wave e ikz , and solutions are constructed so that the optical field is everywhere continuous with continuous first derivatives.
theory will be reexamined with respect to its physical significance, mathematical rigor, and accuracy. Rayleigh considered a monochromatic plane wave of unit irradiance, u 0 = e ikz , which is incident from the half space z < 0 on a circular aperture or slit A contained in an infinite, infinitesimally thin, opaque screen S that lies in the xy-plane of a cartesian coordinate system (Fig. 1). He also assumed that the diffracted field in the half space z ≥ 0 obeys the Rayleigh-Sommerfeld integral equations, where Q = (ξ, η, 0) is a point in A , P = (x, y, z) is an arbitrary point in space, QP is the distance between them, u (p,s) are scalar wave functions, and ∂ u (p) (Q)/∂ z and u (s) (Q) are zero on the opaque portion of S as if the screen is a perfect metallic reflector and the incident light is p-or s-polarized. Furthermore, Rayleigh assumed that field on the source side of S is a modified geometrical field of the form (1c) and hence the objective of the theory is to find the unknown quantities u (p,s) (P) and u -(p,s) (P) so that the overall field is continuously differentiable in the aperture plane.
From Eq. (1c) it follows that ∂ u (p) /∂ z and u (s) will be continuous for Thus, u (p) = 2 -u (p) , ∂u (s) /∂z = 2ik -∂u (s) /∂z when z = 0, so that u (p) and ∂u (s) /∂z will also be continuous if The conditions [Eq. (2d)] represent the underlying integral equations of the Rayleigh-Bouwkamp and other so-called "rigorous" diffraction theories which are intended to solve the wave equation in the presence of prescribed boundary conditions. To this author's knowledge, all existing solutions with the sole exception of Sommerfeld's half-plane theory are approximations of some sort and in the cases considered here these approximations take the form of ascending power series in kw which were derived on the assumption that the unknown aperture distributions in Eqs. (1a,b) can be expressed in the form (3) where , q is the distance of Q from the coordinate origin, and the coefficients p n and s n are to be chosen so that Eqs. (2d) are satisfied. The determination of these coefficients is tedious and will not be repeated here as the results have been detailed in Ref. [3]. In the following the corresponding solutions will be denoted by the subscript 'RB' and it will be sufficient to consider only the initial terms of the expansions [Eq. (3)] by assuming, as Rayleigh did originally, that kw → 0.

Circular Apertures
For circular apertures it is appropriate to use polar coordinates inside the aperture and in the plane of observation, so that

Fresnel Approximation
The evaluation of Eqs. (4d,e) in the Fresnel limit is simple. When z >> λ, and thus z >> q, the dependence of QP on the aperture coordinates can be ignored, so that (5a) and (5b) where q/w = sinα was substituted in both integrals. Hence, the total radiant fluxes transmitted by the aperture and incident on the inside of a large hemisphere with radius r are given by (5d) and the corresponding transmission coefficients of the aperture are (5f) where Φ 0 = πw 2 is the geometrical flux incident on the aperture.
These results can now be checked for consistency with Eqs. (2d) and (3) by using an alternative expression [3,6] for transmission coefficients, where A is the area of the aperture and u(Q) is the field inside it. According to Eq. (2d) we expect u (p) RB (Q) = 1 and therefore (6b) whereas from Eq. (3) and s 1 = -2/π it follows that (6c) The p-coefficients of Eqs. (5f) and (6b) agree reasonably well. On the other hand, there is a considerable difference between the corresponding s-coefficients.

Near-Field Solution
In the immediate proximity of the aperture plane the exponential e ikQP is effectively constant and equal to 1 inside the aperture, and Eqs. (4d,e) can be simplified as follows, As these wave functions must be rotationally symmetrical about the z-axis it suffices to evaluate the χintegrals appearing in these expressions for ϕ = 0. Substituting α = (χπ)/2, we find  where relative coordinates are used. This expression was evaluated by numerical integration to analyze its behavior in the aperture plane, using a polynomial approximation for K(m) given in Ref. [7] and assuming z/w = 0 as well as z/w = 0.01 in order to find ∂u (p) RB /∂z by numerical differentiation. The results of these computations are listed in the left-hand portion of Table 1, which shows that they are in satisfactory agreement with the values expected from Eqs. (2d) and (4d).
The corresponding expressions for u (s) RB are obtained from Eqs. (7b) and (8d) and can be expressed as follows, and E(m) can also be expressed by a polynomial approximation [7]. It was ascertained that the integrals A and B are everywhere finite, and hence it follows from the first Eq. (10a) that the aperture values of u (s) RB are identically equal to zero. This is in disagreement with Eq. (4e), and it was also found that the computed aperture values of (1/ik)∂u (s) RB /∂z do not satisfy the second integral equation, [Eq. (2d)]. These discrepancies are documented in the right-hand portion of Table 1.

Slits
The second case considered by Rayleigh and Bouwkamp is that of an infinitely long, narrow slit of width 2w which is centered on the y-axis of a rectangular coordinate system. As the resulting diffraction pattern will be independent of y, it is sufficient to compute its variation in the xz-plane by assuming  where cosθ = z/r and the two integrals were again evaluated by substituting ξ/w = sinα. These expressions can now be used to compute the quantities (12d) which represent the radiant flux emerging from a slit segment of length ∆y and falling on the corresponding annular segment of a semicylinder of radius r which is centered on the y-axis. As the geometrical flux incident on the slit segment is ∆Φ 0 = 2w∆y, this leads at once to the following expressions for the transmission coefficients of the slit, As in Sec. 2.1 alternative expressions for these transmission coefficients can be obtained from Eq. (6a), which in this case is equivalent to (13a) Hence we find, using Eqs. (2d) and (3) with s 1 = 1, Both of these results disagree with the corresponding values in Eq. (12e).

Near-Field Solution
The integrals [Eq. (11d,e)] can be evaluated without simplifying assumptions for arbitrary values of z, and it was found that they are finite in the aperture plane in spite of the singularities of H 0 (1) (β) and H 1 (1) (β)/β for β = 0.
For p-polarized light we find (14) where J n and Y n are Bessel functions of the first and second kinds and M = o 2 + π 2 /4. The integral [Eq. (14)] was evaluated numerically for kw = 0.1 and z = 0 as well as z = 0.01w, yielding the results for u RB (p) and ∂u RB (p) /∂z shown in the left-hand portion of Table 2. The table also shows the corresponding values expected from Eqs. (2d) and (3), and the agreement was judged satisfactory in view of the approximations made in Rayleigh's derivations.
The corresponding expression obtained from Eq.  Tables 1 and 2 there can be no doubt that the Rayleigh-Bouwkamp theory is mathematically flawed and fails to give a credible account of the diffraction of polarized light. The reasons for this conclusion are twofold.
The near-field solutions derived in Secs 2.2 and 3.2 for p-polarized incident light (∂u 0 /∂z = 0 on S ) are in approximate agreement with the conditions in Eqs. (2d) and (3) that were assumed in their derivation, and thus appear to be free of contradictions. On the other hand, the corresponding solutions for s-polarization (u 0 = 0 on S ) do not obey these conditions at all. This discrepancy was found for circular apertures and slits alike, and thus points to a general problem associated with the second integral equation [Eq. (2d)] as such. Bouwkamp [3] mentioned that this integral equation [Eq. (2d)] is notoriously difficult as it requires differentiation with respect to z under the integral sign and can lead to divergent solutions. He pointed out that these divergencies can be eliminated, but this was not attempted in this work. In any case, the polarization effects shown in Fig. 1 must be dismissed as wrong.
A second, and most likely uncorrectable, problem arises from the fact that the 'polarization effects' predicted by the Rayleigh-Bouwkamp theory run counter to common experience. As mentioned in the Introduction, a polarized field behind a circular aperture illuminated by normally incident, unpolarized light is impossible for reasons of symmetry. Yet the existence of such a field is implied by the upper curve in Fig. 1, and the discrepancy persists even when the suspect transmission coefficient τ RB (s) in Eq. (5f) is replaced by the improved coefficient (τ RB (s) )' in Eq. (6c). Additionally, the different values obtained in Secs. 2.1 and 3.1 for the far-and near-field transmission coefficients seemed paradoxical because the polarization of light cannot change during free-space propagation. As this discrepancy was encountered for both states of polarization, it also cannot be blamed on the mathematical problem mentioned in the previous paragraph.
Hence it is concluded that the Rayleigh-Bouwkamp theory cannot be relied upon for describing the diffraction of polarized light by small apertures. Virtually the same determination was made in an earlier paper [6] on the feasibility of using the Rayleigh-Sommerfeld boundary-value integrals for analyzing diffraction by apertures of arbitrary size, and thus the overall conclusion is that the inability to account for polarization effects is an inherent property of scalar diffraction theories, no matter what boundary conditions are assumed. The fact of the matter appears to be that, in scalar diffraction theory, the boundary conditions ∂u 0 /∂z = 0 or u 0 = 0 are necessary and sufficient to satisfy Helmholtz' theorem. As it happens, the same conditions also pertain to the metallic reflection of p-or s-polarized light but it does not follow that they are sufficient to ensure conformance with electromagnetic theory.    In Ref. [6] it was also shown that a modified theory of unpolarized diffraction can be formulated in which the Rayleigh-Sommerfeld integrals are used to describe a continuous, bidirectional flow of energy in the near zones on either side of an aperture. The transmission coefficients computed in this manner are similar to those published by Levine and Schwinger [4,5] for subwavelength aperture sizes, so that a special theory for this region may not be needed.