Self-aligning universal beam coupler

We propose a device that can take an arbitrary monochromatic input beam and, automatically and without any calculations, couple it into a single-mode guide or beam. Simple feedback loops from detectors to modulator elements allow the device to adapt to any specific input beam form. Potential applications include automatic compensation for misalignment and defocusing of an input beam, coupling of complex modes or multiple beams from fibers or free space to single-mode guides, and retaining coupling to a moving source. Straightforward extensions allow multiple different overlapping orthogonal input beams to be separated simultaneously to different single-mode guides with no splitting loss in principle. The approach is suitable for implementation in integrated optics platforms that offer elements such as phase shifters, Mach-Zehnder interferometers, grating couplers, and integrated monitoring detectors, and the basic approach is applicable in principle to other types of waves, such as microwaves or acoustics. ©2013 Optical Society of America OCIS codes: (230.3120) Integrated optics devices; (220.1080) Active or adaptive optics; (220.1140) Alignment; (140.3298) Laser beam combining; (220.4298) Nonimaging optics. References and links 1. R. Ryf, S. Randel, A. H. Gnauck, C. Bolle, A. Sierra, S. Mumtaz, M. Esmaeelpour, E. C. Burrows, R.-J. Essiambre, P. J. Winzer, D. W. Peckham, A. H. McCurdy, and R. Lingle, Jr., “Mode-division multiplexing over 96 km of few-mode fiber using coherent 6x6 MIMO processing,” J. Lightwave Technol. 30(4), 521–531 (2012). 2. A. E. Willner, J. Wang, and H. Huang, “Applied physics. A different angle on light communications,” Science 337(6095), 655–656 (2012). 3. T. Su, R. P. Scott, S. S. Djordjevic, N. K. Fontaine, D. J. Geisler, X. Cai, and S. J. B. Yoo, “Demonstration of free space coherent optical communication using integrated silicon photonic orbital angular momentum devices,” Opt. Express 20(9), 9396–9402 (2012). 4. Y. Jiao, S. Fan, and D. A. B. Miller, “Demonstration of systematic photonic crystal device design and optimization by low-rank adjustments: an extremely compact mode separator,” Opt. Lett. 30(2), 141–143 (2005). 5. A. P. Mosk, A. Lagendijk, G. Lerosey, and M. Fink, “Controlling waves in space and time for imaging and focusing in complex media,” Nat. Photonics 6(5), 283–292 (2012), doi:10.1038/nphoton.2012.88. 6. D. P. Palomar, J. M. Cioffi, and M. A. Lagunas, “Joint Tx-Rx beamforming design for multicarrier MIMO channels: A unified framework for convex optimization,” IEEE Trans. Signal Process. 51(9), 2381–2401 (2003). 7. S. Azam, T. Yasui, and K. Jinguji, “Synthesis algorithm of a multi-channel lattice-form optical delay-line circuit,” Optik (Stuttg.) 121(12), 1075–1083 (2010). 8. S. Ibrahim, N. K. Fontaine, S. S. Djordjevic, B. Guan, T. Su, S. Cheung, R. P. Scott, A. T. Pomerene, L. L. Seaford, C. M. Hill, S. Danziger, Z. Ding, K. Okamoto, and S. J. B. Yoo, “Demonstration of a fastreconfigurable silicon CMOS optical lattice filter,” Opt. Express 19(14), 13245–13256 (2011), http://www.opticsinfobase.org/oe/abstract.cfm?URI=oe-19-14-13245. 9. R. N. Mahalati, D. Askarov, J. P. Wilde, and J. M. Kahn, “Adaptive control of input field to achieve desired output intensity profile in multimode fiber with random mode coupling,” Opt. Express 20(13), 14321–14337 (2012). 10. T. M. Shay, “Theory of electronically phased coherent beam combination without a reference beam,” Opt. Express 14(25), 12188–12195 (2006). 11. S. Sohma, T. Watanabe, N. Ooba, M. Itoh, T. Shibata, and H. Takahashi, “Silica-based PLC type 32 x 32 optical matrix switch,” European Conference on Optical Communications (ECOC 2006), Cannes, France, 24 – 28 Sept. 2006, Paper OThV4. 10.1109/ECOC.2006.4801113 12. G. Roelkens, D. Vermeulen, S. Selvaraja, R. Halir, W. Bogaerts, and D. Van Thourhout, “Grating-based optical fiber interfaces for silicon-on-insulator photonic integrated circuits,” IEEE J. Quantum Electron. 17(3), 571–580 (2011). #184014 $15.00 USD Received 23 Jan 2013; revised 25 Feb 2013; accepted 26 Feb 2013; published 6 Mar 2013 (C) 2013 OSA 11 March 2013 / Vol. 21, No. 5 / OPTICS EXPRESS 6360 13. K. Van Acoleyen, H. Rogier, and R. Baets, “Two-dimensional optical phased array antenna on silicon-onInsulator,” Opt. Express 18(13), 13655–13660 (2010). 14. W. Bogaerts, D. Taillaert, P. Dumon, D. Van Thourhout, R. Baets, and E. Pluk, “A polarization-diversity wavelength duplexer circuit in silicon-on-insulator photonic wires,” Opt. Express 15(4), 1567–1578 (2007). 15. G. Roelkens, D. Van Thourhout, and R. Baets, “Silicon-on-insulator ultra-compact duplexer based on a diffractive grating structure,” Opt. Express 15(16), 10091–10096 (2007). 16. F. Van Laere, W. Bogaerts, P. Dumon, G. Roelkens, D. Van Thourhout, and R. Baets, “Focusing polarization diversity grating couplers in silicon-on-insulator,” J. Lightwave Technol. 27(5), 612–618 (2009). 17. T. Baehr-Jones, M. Hochberg, and A. Scherer, “Photodetection in silicon beyond the band edge with surface states,” Opt. Express 16(3), 1659–1668 (2008). 18. J. D. B. Bradley, P. E. Jessop, and A. P. Knights, “Silicon waveguide-integrated optical power monitor with enhanced sensitivity at 1550 nm,” Appl. Phys. Lett. 86(24), 241103 (2005), doi:10.1063/1.1947379. 19. D. F. Logan, P. E. Jessop, and A. P. Knights, “Modeling defect enhanced detection at 1550 nm in integrated silicon waveguide photodetectors,” J. Lightwave Technol. 27(7), 930–937 (2009). 20. M. W. Geis, S. J. Spector, M. E. Grein, J. U. Yoon, D. M. Lennon, and T. M. Lyszczarz, “Silicon waveguide infrared photodiodes with >35 GHz bandwidth and phototransistors with 50 AW-1 response,” Opt. Express 17(7), 5193–5204 (2009). 21. I. Goykhman, B. Desiatov, J. Khurgin, J. Shappir, and U. Levy, “Locally oxidized silicon surface-plasmon Schottky detector for telecom regime,” Nano Lett. 11(6), 2219–2224 (2011). 22. D. A. B. Miller, “All linear optical devices are mode converters,” Opt. Express 20(21), 23985–23993 (2012). 23. J. Sun, E. Timurdogan, A. Yaacobi, E. S. Hosseini, and M. R. Watts, “Large-scale nanophotonic phased array,” Nature 493(7431), 195–199 (2013), doi:10.1038/nature11727. 24. A. Biberman, M. J. Shaw, E. Timurdogan, J. B. Wright, and M. R. Watts, “Ultralow-loss silicon ring resonators,” Opt. Lett. 37(20), 4236–4238 (2012). 25. D. A. B. Miller, “How complicated must an optical component be?” J. Opt. Soc. Am. A 30(2), 238–251 (2013). 26. K. Chang, ed., Handbook of RF/Microwave Components and Engineering (Wiley, 2003). 27. L. Tang, S. E. Kocabas, S. Latif, A. K. Okyay, D.-S. Ly-Gagnon, K. C. Saraswat, and D. A. B. Miller, “Nanometre-scale germanium photodetector enhanced by a near-infrared dipole antenna,” Nat. Photonics 2(4), 226–229 (2008), doi:10.1038/nphoton.2008.30. 28. D.-S. Ly-Gagnon, K. C. Balram, J. S. White, P. Wahl, M. L. Brongersma, and D. A. B. Miller, “Routing and photodetection in subwavelength plasmonic slot waveguides,” Nanophotonics 1(1), 9–16 (2012), doi:10.1515/nanoph-2012-0002. 29. J. A. Schuller, E. S. Barnard, W. Cai, Y. C. Jun, J. S. White, and M. L. Brongersma, “Plasmonics for extreme light concentration and manipulation,” Nat. Mater. 9(3), 193–204 (2010). 30. D. A. B. Miller, “Self-configuring universal linear optical component,” Photonics Research. (to be published).


Introduction
There has recently been growing interest in exploiting multiple modes, both in optical fibers [1] and free-space [2], for expanding communications bandwidth and capabilities. Selecting and coupling to complicated mode forms such as higher fiber modes [1] or angular momentum beams [2] is challenging, however, especially if splitting losses are to be avoided. Coupling to waveguides generally remains difficult in optics, especially if alignment or precise focusing cannot be guaranteed. Simultaneous coupling of multiple overlapping input modes without splitting loss has had few known solutions [3,4]. Here we propose a novel approach to coupling that both allows complicated modes to be coupled, e.g., to single-mode waveguides, and can accommodate misalignments and movements, all without moving parts. It can also couple multiple different overlapping modes simultaneously to different output waveguides, without fundamental splitting loss. The approach can be implemented using standard integrated optical components, detectors and simple local feedback loops, and is automatic, not requiring advance knowledge of the beam form to be coupled. The approach could be applied in principle to other waves, such as radio waves, microwaves or acoustics [5]. Unlike some other approaches to optimizing wave channels in optics [5] or microwaves [6], or recent approaches to designing and operating sophisticated spectral filters [7,8], this approach requires no global multi-parameter optimizations or calculations; in fact, it requires no calculations at all. Figure 1 shows a conceptual schematic of the approach. For simplicity for the moment, we consider a beam varying only in the lateral direction. For illustration we divide the arbitrary input beam into 4 pieces, each incident on a different one of the 4 beam splitter blocks. Each block includes a variable reflector (except number 4, which is 100% reflecting) and a phase shifter. (The phase shifter PS1 is optional, allowing the overall output phase of the beam to be controlled.) We presume loss-less devices whose reflectivity and phase shift can be set independently, for example, by applied voltages for electrooptic or thermal control. For the moment, we neglect diffraction inside the optics and presume that the phase shifters, reflectors, and detectors operate equally on the whole beam going through one beamsplitter. We shine the input beam onto the beamsplitter blocks as shown. We presume this beam is monochromatic and in a single polarization. Initially, the phase shifter and reflectivity settings can be arbitrary as long as the reflectivities are non-zero so that we start with non-zero powers on the detectors. First, we adjust the phase shifter P4 to minimize the power on detector D3. Doing so ensures that the wave reflected downwards from beamsplitter 3 is in antiphase with any wave transmitted from the top through beamsplitter 3. Then we adjust the reflectivity R3 to minimize the power in detector D3 again, now completely cancelling the transmitted and reflected beams coming out of the bottom of beamsplitter 3. (If there are small phase changes associated with adjusting reflectivity, then we can iterate this process, adjusting the phase shifter again, then the reflectivity, and so on, to minimize the D3 signal.)

Device concept
We then repeat this procedure for the next beamsplitter block, adjusting first phase shifter P3 to minimize the power in detector D2, and then reflectivity R2 to minimize the D2 signal again. We repeat this procedure along the line of phase shifters, beamsplitters and detectors. Finally, all the power in the incident beam emerges from the output port on the right. A flow diagram for this algorithm for four elements as in Fig. 1 is given in Fig. 2. An equivalent algebraic derivation of this conclusion is given in Appendix A, which also gives the resulting explicit values that will have been set for the phase shifters and reflectors. (This approach could also be used to combine multiple mutually-coherent monochromatic beams of unknown relative phases, as in high-power fiber laser systems [10] based on multiple amplifiers from a single laser source, with each beam incident on a separate beamsplitter block.) Unlike typical adaptive optical schemes (see, e.g., Refs. [5,9].), this method is progressive rather than iterative -the process is complete once we have stepped once through setting the elements one by one -and only requires local feedback for minimization on one variable at a time -no global calculation of a merit function or simultaneous multiparameter optimization is required. Straightforward low-speed electronics could implement the feedback; even simple analog systems could be sufficient. One example approach to find minima would be to impose a small additional a.c. modulation at some frequency f on each phase shifter or reflectivity being adjusted; detecting the amplitude of the modulated signal at frequency f from the corresponding detector using a lock-in amplifier technique would give an error signal that would be zero at the minimum and have different signs on either side of the minimum, as required for a simple negative feedback loop to set the phase shifter or reflector to achieve the desired minimum. Note that, in terms of complexity, we only need one such analog circuit combined with a simple switch to cycle round all the phase-shifters, reflectors and detectors and a set of capacitors to hold the control voltages on the elements after they are set. Note too that no calculations need to be performed to run this system. In a modern digital implementation, a simple microcontroller would have more than enough complexity to run the entire system.
To optimize this beam coupling continually, we can leave this feedback system running as we use the device, stepping cyclically through the minimizations as discussed. This would allow real-time tracking and adjustments for misalignments or to retain coupling to moving sources. For static sources, we could use an alternate algorithm based only on maximizing output beam power (see Appendix B). Figure 3 shows a waveguide version based on Mach-Zehnder interferometers (MZIs) as the adjustable "reflectors" and phase shifters, with Fig. 3(a) corresponding to Fig. 1(a). A MZI gives variable overall phase shift of both outputs based on the common mode drive of the controllable phase elements in each arm and variable "reflectivity" (i.e., splitting between the output ports) based on the differential arm drive. Such a waveguide approach avoids diffractions inside the apparatus and allows equal path lengths for all the beam segments. Equal path lengths are important for operation over a broad wavelength range or bandwidth; otherwise the relative propagation phase changes with wavelength in the different waveguide paths. (See Appendix C for a discussion of usable wavelength ranges.) Such Mach-Zehnder implementations physically resemble optical lattice spectral filters (e.g., [7,8]), though it is important to note that, because of the equal path lengths in this device, it is deliberately not a spectral filter. Note, too, that, unlike such filters, it does not require any calculations to design or configure it because of the self-aligning algorithm described above.

Waveguide device
For further equality of beam paths and losses, we could add dummy MZIs in paths 1, 2, and 3, respectively, as shown in Fig. 3(b), to give the same number of MZIs in every beam path through the device; the dummy devices would be set so as not to couple between the adjacent waveguides (i.e., the "bar" rather than the "cross" state), and to give a standard phase shift. Note that as long as no power is lost from the system out of the "open" arms -here, the top right ports of the top two dummy devices in Fig. 3(b) -the settings of these dummy devices are not critical; the subsequent setting of MZIs 1 -4 can compensate for any such loss-less modification of the input waves. We could add further detectors at those top right ports, adjusting the dummy MZI reflectivities to minimize the signals in such detectors, ensuring loss-less operation. We note that systems with large numbers (e.g., 2048) of MZIs have been demonstrated experimentally, with low overall loss [11], so the relatively large arrays that might be required for complex beams could be feasible practically. An alternative scheme to that of Fig. 3(a) or 3(b) using a binary tree of devices for coupling to a single input beam is presented in Appendix D. To use the waveguide scheme with a spatially continuous input beam, we need to put the different portions of the beam into the different waveguides. We could use one grating coupler [12] per waveguide as explored for angular momentum beams [3] or phased-array antennas [13]. For full 2D arrays, we need space to pass the waveguides between the grating couplers [12]. We could either allow an imperfect fill factor, shining the whole beam onto the top of the grating coupler array (as in Fig. 4(a)), or we could use an array of lenslets focusing the beam portions onto the grating couplers to improve the fraction of the beam that lands on the grating couplers ( Fig. 4(b)). Grating coupler approaches are also known that can separate polarizations to two separate channels [12,[14][15][16], allowing the input mode of interest to have arbitrary polarization content at the necessary expense of twice as many channels in the device overall.

Separation of multiple orthogonal beams
We can extend this concept to detecting multiple orthogonal modes simultaneously. In this case, we would use detectors that are mostly transparent, such as silicon defect-enhanced [17][18][19][20] or internal photoemission [21] photodetectors in telecommunications wavelength ranges, sampling only a small amount of the power and transmitting the rest. Now, (Figs. 1(b) and "top" row of beamsplitters and photodetectors, becoming an input beam on the top of the second row.
To understand why this second orthogonal beam passes through the first row, note that, for any loss-less beam coupler that couples all of one input mode (e.g., our first input beam shining into our device) into a single output mode (here output beam 1), it is impossible to combine any power from any other orthogonal beam into the same output mode [22]; since there is nowhere else for the power in some second orthogonal beam to go (nothing in our optics reflects this second beam backwards), it will all pass through to the second row of beamsplitter blocks. The beam form will certainly be changed as it passes through the first row, but since our process can adapt to an arbitrary beam, we can simply use the same alignment process as before, now with the second row of phase shifters and reflectors (devices 21 -23), and the detectors D21 -D22, to direct all of this power to output beam 2.
We can repeat this process for further orthogonal beams using further rows, up to the point where the number of rows of beamsplitters (and the number of output beams) equals the number of beamsplitter blocks in the first row. (Generally, we can leave all the preceding beams on, if we wish, as we adjust for successive added orthogonal beams.) We could analogously apply the same approach to the structure of Fig. 4, adding further "rows" as in Fig. 3(c) to allow simultaneous detection of multiple orthogonal 2D beams shone on the grating couplers or lenslets.
We could also add some identifying coding to each orthogonal input beam, such as a small amplitude modulation at a different frequency for each beam; then, we can have all beams on at once, with each detector row looking only for one beam's specific modulation frequency or coding signal. Such an approach, combined with continuous cycling through the different rows as above, allows continuous tracking and alignment adjustment of all the orthogonal beams while leaving all of them on.

Discussion
Silicon photonics appears to be a very attractive platform for making the self-aligning coupler. All of the necessary components -grating couplers [12][13][14][15]23], low-loss waveguides [24], directional couplers and phase shifters [23] (as required to make the fourport Mach-Zehnder interferometers), and photodetectors (including the mostly-transparent ones [17][18][19][20][21]) -can likely be made in one integrated platform. Very large arrays (64 x 64) of directional couplers and grating couplers have recently been demonstrated [23] in a compact structure. That platform also demonstrated phase shifters controlled by integrated heaters. Low optical loss is obviously desirable. Losses are not quoted explicitly in that work [23], but the mere demonstration of such a large array shows that loss per element must be very small. Loss in the silicon waveguides themselves in such processes can range from 0.8 dB/cm to as low as 0.027 dB/cm [24], so waveguide loss is not likely to be a substantial issue. Good electronic design for low speed circuits in the feedback loops can easily work with detected signal currents as low as pico-amperes. Si-implanted Si photodetectors in similar CMOS compatible processes can show >50% efficiency at 1550 nm in converting absorbed photons to electrons of current [20] even at low photodiode biases. Hence, if we presume the device is working with the microwatt or higher optical input powers typically required for high-speed signal transmission, we need not absorb much of the signal power in the mostly-transparent detectors required for the schemes for separating multiple beams.
The main practical optical losses are likely to be in the coupling from the beam into the waveguides. First, devices such as grating couplers have some coupling loss (e.g., ~1.6 dB and possibly < 0.5 dB in optimized designs [12]). More fundamentally, the process of sampling the input beam using a finite set of regions necessarily will effectively incur some loss, at the very least because the beam will not in fact have uniform intensity and phase across each such portion. The minimum number of regions or subdivisions we need to use for a given beam depends on how complex a mode we want to select or how complicated a correction we want to apply. Generally, this self-aligning coupler corresponds to the approach to complexity counting discussed in Ref. [25]. If we want to be able to select one specific input mode form out of M I orthogonal possibilities, we need at least M I beamsplitter blocks in the (first) row. Subsequent rows to select other specific modes from this set need, progressively, one fewer beamsplitter block. The number of rows of beamsplitter blocks is the mode coupling number M C in the notation of Ref. [25]. To minimize the overall coupling loss and improve the discrimination in practice, we should expect to use more than this minimum number of beamsplitter blocks or grating couplers. Just how many sampling regions we should use will depend on the specific beam coupling problem to be solved and how much additional loss we are prepared to tolerate from this finite number of sampling regions.
At radio or microwave frequencies, we could use antennas instead of grating couplers. Various microwave splitters and phase shifters are routinely possible [26]. Use of nanometallic or plasmonic antennas [27], waveguides [28], modulators [29] and detectors [27,28] is also conceivable for subwavelength circuitry in optics, allowing possibly very small and highly functional mode separation and detection schemes. Various extensions of this idea are also possible in optics [30].

Conclusions
In conclusion, we have shown a general method for coupling an arbitrary input beam to one specific output beam, such as a waveguide mode, with an automatic method for setting the necessary coefficients in the array of adjustable reflectors and phase shifters based on signals from photodetectors, and with extensions to allow multiple orthogonal input beams to be separated without fundamental splitting loss. This should open a broad range of flexible and adaptable optical functions and components, with analogous possibilities for other forms of waves such as microwaves and acoustics.

Appendix A -Algebraic derivation of self-alignment results
Presume for simplicity of the algebra that the beamsplitter blocks in Fig. 1 (a) are perfectly square and that the path from the top of a beamsplitter block and reflecting to the input of the phase shifter on the right is an integer number of wavelengths. Because the beamsplitter is presumed lossless (i.e., unitary), the transmission from the top of a beamsplitter block through the (partially) reflecting surface to the bottom of the block necessarily has an additional phase shift of ±π/2 (see, e.g., [30]); for definiteness and simplicity here, we use +π/2 and we will call this the unitarity phase shift. Here, for simplicity of explanation also, we will consider the 4 beamsplitter blocks as in Fig. 1(a), though the approach extends analogously to arbitrary numbers of blocks. The phase shifters P1 -P4 we will set to give phase delays θ 1 to θ 4 respectively, and the (field) reflectivities of reflectors R1 -R4, which we take to be real for simplicity, will be r 1 to r 4 respectively. The corresponding (field) transmissivities will be 2 1 n n t r = − (1) where n = 1, … 4 indexes the blocks. We presume the fields incident on the top of the beamsplitter blocks are described by the complex amplitudes where E no and φ n are real numbers.
Setting the transmitted and reflected fields out of the bottom of beamsplitter 3 to be in antiphase means that 4 4 3 2 2 m π φ θ φ π π + = + − + Here the + π/2 is the unitarity phase shift and the term -π + 2mπ, where m is any integer, ensures the two beams are in antiphase. Henceforth for simplicity we drop the 2mπ, and so we have which means the phase set in phase shifter P4 to minimize the power on detector D3 is 4 3 4 / 2 θ φ φ π = − − (5) When we now minimize the power on detector D3 again (now to zero) by adjusting the reflector R3, we will be equalizing the transmitted and reflected field magnitudes so that, since these two fields are in antiphase, we will have complete cancellation on D3. Using Eqs. (1) and (6), therefore, this second power minimization on D3 will give Noting the field transmitted through beamsplitter 3 from left to right must also acquire the unitarity phase shift + π/2, the total field entering phase shifter P3 from the left is Using Eqs. (4) and (7), after some algebra, we obtain ( ) We note that Eq. (9) is in exactly the same form as the incident field amplitudes in Eq. (2), so we can proceed in a progressive and similar algebraic fashion to set the phase and reflectivity values in successive blocks. After similarly setting θ 2 and r 2 , for the field incident on phase shifter P2 from the left we would have ( ) Finally, after similarly setting θ 1 and r 2 , for the field incident on phase shifter P1 from the left (i.e., effectively the output field if we omit the P1) we would have ( ) Since the intensities I n in the various input beam segments are just given by 2 n no (at least within a dimensioning constant), the output intensity I out is which is the sum of the incident intensities in each input segment. In other words, all the input power in the beam has been coupled to the output.

Appendix B -Alternate alignment algorithm
As an alternative to the use of multiple detectors when aligning a single beam with the device, we could use only a detector in the output beam, with a different algorithm. We first set all the reflectors in the beam splitter blocks in Fig. 1(a) to be 100% transmitting, except the last one -beamsplitter block 4, which is set permanently to 100% reflection -and the second last one (block 3), which we set to some intermediate value of reflectivity. Then, monitoring a detector in the output beam (port 1 on the right), we adjust the phase shifter P4 on the right of beamsplitter block 4 to maximize the output power. We then adjust the reflectivity R3 in beamsplitter block 3 to maximize the output power again (these two steps in sequence arrange that there is no power emerging from the bottom of block 3). We then proceed along the beamsplitter blocks in a similar fashion, setting the next beamsplitter reflectivity to some initial intermediate value, adjusting the phase shifter just to its left to maximize the output, then adjusting the reflectivity in this block to maximize output again, and so on along the beamsplitter blocks. (In this case, we would not be able to do continuous feedback on the settings while the system was running because we need to set some of the reflectors temporarily to 100% transmission during the optimization steps.) This approach can be extended to multiple orthogonal beams. Once we have set the first row, we leave those settings fixed and then proceed with aligning the second orthogonal beam in a similar fashion, monitoring the power now in the output beam in the second row.

Appendix C -Wavelength range
Though the approach here is described for a monochromatic beam, in the implementations above with substantially equal path lengths, this device should be able to function similarly over significant range of wavelengths. The useful wavelength range will be limited in practice by the imperfections of the phase shifters and by the wavelength dependence of beam sizes. Phase-shifting elements in optics generally do not give the same phase shift at all wavelengths. More typically, they actually give a controllable propagation time delay by changing refractive index, and the same time delay does not correspond to the same phase delay at different wavelengths. Furthermore, the induced change in refractive index in the phase shifting elements in general may itself depend on wavelength. Just how substantial a limitation these phase shifter problems are depends on the specific implementations. Any given spatial mode is in practice likely to change in size as the wavelength is changed. For example, the precise size of spatial modes in fibers generally depends on the wavelength, and free-space diffracting modes that start out the same size at a beam waist will in general have wavelength-dependent sizes in the far field. Especially if the mode is a complicated one, such as a high-order Laguerre-Gaussian that has many phase reversals over its profile, a device self-aligned for one wavelength could easily fail to work for another. Both of these effects could well be substantial issues if we were, for example, to try to make such a coupler that would work simultaneously over the whole visible wavelength range.
We do expect the device can work for an input beam that is not monochromatic provided the beam's spectral range is narrow enough that the device would work well enough for any monochromatic beam within that spectral range when using the same settings. In such a case, just as in white light fringes in an interferometer with substantially equal path lengths, each monochromatic component essentially interferes only with itself, with beatings with other monochromatic components averaging to zero in the measured powers or intensities over the timescales of the response of the photodetectors.

Appendix D -Alternate configuration for single beam coupling
For coupling a single beam, an alternate configuration of phase shifters, MZ interferometers and detectors is shown in Fig. 5. In this approach, phase shifter P1 is adjusted to minimize the signal in detector DA1, and then the split ratio ("reflectivity") of MZI MA1 is adjusted through differential drive of the arms to minimize the DA1 signal again. Similar processes can be used simultaneously with P2, DA2 and MA2, with P3, DA3 and MA3, and with P4, DA4 and MA4. Next, the overall phase is adjusted in MA1 to minimize signal in DB1, and then the split ratio ("reflectivity") of MB1 is adjusted to minimize the DB1 signal again. A similar process can be run simultaneously with MA3, DB2 and MB2. Finally, in this example, the phase in MB1 is adjusted to minimize the DC1 signal, and then the split ratio ("reflectivity") of MC1 is adjusted to minimize the DC1 signal again. Dummy phase shifters can be incorporated in the input paths for beams 2, 4, 6, and 8, as shown to help ensure equality of path lengths in the system overall.
This approach has the advantages of requiring no dummy interferometers and allowing simultaneous feedback loop adjustments, first in the DA column of detectors, then in the DB column, and finally in the DC column. In contrast to the approach of Fig. 3(a) of the main text, the MZI devices are arranged in a binary tree rather than a linear sequence, so the device is shorter and a given beam travels through fewer MZI devices, possibly reducing loss. We could extend this approach also for coupling multiple orthogonal beams (e.g., by using beams transmitted through mostly transparent versions of the detectors into analogous, shorter trees of devices); but, unlike the approaches of Fig. 3, we would require crossing waveguides and/or multiple stacked planar circuits if we used a planar optical approach. Fig. 5. Alternative binary approach for coupling one arbitrary input beam to a single output beam. P1 -P4 are controllable phase shifters, MA1 -MC1 are controllable MZ interferometers, and DA1 -DC1 are detectors used to give the signals for feedback loops. The dummy phase shifters are optional and could be included for equality of path lengths and/or loss.