Ultrafast pulse-amplitude modulation with a femtojoule silicon photonic modulator

Ultrahigh-speed optical interconnects are essential to future cloud computing. Further increase in optical transmission speed has been hindered by power consumption and limited bandwidth resources, for which integrated optical transceivers using advanced modulation formats, such as pulse-amplitude modulation (PAM), are a promising solution. We report 80 Gb∕s PAM operation of a silicon microring modulator (MRM) with an ultralow power consumption below 7 fJ∕bit. We also report the first demonstration of PAM-8 modulation of MRMs in the Gb/s order, achieving error-free capability at 45 Gb∕s, using 1 fJ∕bit. To the best of our knowledge, these results feature the lowest power consumption, per transmitted bit, ever demonstrated at such high data rates. We further demonstrate PAM data transmission up to 64 Gb∕s over 5 km. Simultaneous achievement of ultrafast modulation and ultralow power consumption is a critical step toward next-generation optical interconnects. © 2016 Optical Society of America


INTRODUCTION
Transition to next-generation optical interconnects is driven by the demand for ultrahigh-speed data transmission in computing systems [1] and data centers for the cloud [2]. Key enablers of this transition have been identified as further advances in photonic integration and high-speed, low-power complementary metal-oxide semiconductor (CMOS) circuits. Leveraging welldeveloped CMOS fabrication processes, silicon photonics has quickly emerged as the preferred technology for large-scale photonic integration. In particular, silicon microring modulators (MRMs) are among the most promising solutions for integrated optical transmitters since they combine many desirable features, such as low power consumption, compactness, and CMOS compatibility [3]. Using a MRM, 60 Gb∕s on-off keying (OOK) transmission has been demonstrated [4]. Ultralow power of 1 fJ∕bit was also reported, but running at a relatively low data rate of 25 Gb∕s [5]. An eight-channel wavelength division multiplexing (WDM) transmitter based on MRMs, each channel operating at 40 Gb∕s at 32 fJ∕bit, was recently reported [6]. Finally, a novel approach using a Bragg grating as the optical cavity, demonstrates 60 Gb∕s operation [7].
Future interconnects desire a higher data rate per wavelength to minimize the number of WDM channels. However, the path toward higher speed has been hindered by the power consumption and limited bandwidth of electronic circuits, not to mention the difficulty of integrating laser sources on silicon. First, MRMs suffer from the intrinsic trade-off between modulation efficiency and bandwidth [5,8,9] because the photon lifetime limits how fast the optical cavity can be modulated. Consequently, high-speed MRMs are usually designed with a low quality factor (Q) for a wide bandwidth, thereby sacrificing modulation efficiency. In addition, the power consumption of driving and logic circuits scales up quickly with the frequency. High-speed, low-power drivers are very challenging above 40 GHz.
Advanced modulation formats, such as pulse-amplitude modulation and quadrature phase-shift keying (QPSK), provide higher spectral efficiency, i.e., higher bit rates, within a given bandwidth [2]. Very limited experimental results have been reported for highspeed MRMs with high-order modulation formats, including 56 Gb∕s [10] QPSK and 24 Gb∕s PAM-4 [11]. Nevertheless, these devices have relatively high power consumptions in the range of a few tens to hundreds of fJ/bit. In addition, in the case of QPSK, coherent detection introduces extra complexity and cost. Therefore, direct detection is preferred for low-power optical interconnects. Although highly desired, simultaneous achievement of an ultrahighspeed data rate beyond 40 Gb∕s and ultralow power operation approaching one femtojoule per bit has not been demonstrated.
In this paper, we examine higher-order modulation of an optimized silicon MRM with the direct detection scheme for ultrahigh-speed optical interconnects. We present PAM operation of the MRM up to 80 Gb∕s with an ultralow power consumption at the level of fJ/bit. We show significantly enhanced spectral efficiency of 2 and 3 bits per symbol with error-free capability. Transmission of a PAM-4 signal over 5 km of standard singlemode fiber (SSMF) at 64 Gb∕s is also demonstrated. Our results further demonstrate the possibility to adapt the design to comply with commercial foundry rules, such as multiproject wafer (MPW) services. This paper is organized as follows. Section 2 provides information on design and fabrication, followed by the characterization of the modulator under direct current (DC) and small-signal operation. Based on the measured results, the extinction ratio (ER) and electro-optic (EO) bandwidth are extracted from the measurements and the inherent trade-off discussed. Specifically, it is shown that the optimal operating point depends on the targeted operating speed. Section 3 presents our experimental results and bit error rate (BER) measurements regarding PAM modulation and transmission. Section 4 provides information on the evaluation of the power consumption of the modulator.

DEVICE DESIGN AND CHARACTERIZATION
We designed and optimized the modulator with the aid of the dynamical model presented in [8]. As illustrated in Fig. 1, it makes use of the plasma dispersion effect through carrier depletion in a lateral p-n junction on a 220 nm thick siliconon-insulator (SOI) wafer. A 60-nm-thick slab is used for electrical connections. The MRM has a radius of 8 μm and a coupling gap of 230 nm. The heavily doped regions for metal contacts are 500 nm away from the edge of the 220-nm-high, 500-nm-wide rib waveguide. These parameters were chosen to achieve the critical coupling condition and maximize dynamic extinction ratio, following the methodology given in [8]. Due to fabrication process variations, we observed a variation of up to 10 dB in the DC ER, i.e., from 25 to 35 dB of ER. However, we obtained similar performance, even with SOI chips having lower ER. This is in part because we operate the MRM at high frequency detuning. In this case, the optical modulation amplitude (OMA) is not significantly sensitive to variations of the DC ER.
A semiconductor heater is included in the design for wavelength tuning, to compensate for fabrication errors and temperature fluctuations. The p-n junction for intracavity modulation spans roughly 70% of the circumference, and the heater spans roughly 20% of the circumference. On-chip optical input/output (I/O) is achieved via surface grating couplers for TE-polarized light. The modulator was fabricated through the MPW service at IMEC, Belgium.
The measured static responses at various applied voltages are presented in Fig. 2(a). We measure a very high resonance depth of 35 dB at zero bias, indicating that the modulator is in the critical coupling condition, i.e., the round-trip propagation loss of light in the ring cavity is equal to the loss due to the coupling to the bus waveguide. Sufficient ER is important for achieving high-order PAM modulation. Figure 2(a) also provides further indication of the presence of the critical coupling condition. It shows that the resonance depth decreases as the forward potential increases, indicating an undercoupling condition in forward bias due to the increased free carriers in the waveguide and thus increased absorption loss. The figure also shows that the resonance depth decreases as the reverse potential increases, indicating an overcoupling condition in reverse bias. The resonance shift as a function of  voltage shows an efficiency of about 2 GHz∕V, which is among the highest values reported for a lateral p-n junction in the depletion mode. Also, the measured free spectral range (FSR) is 12.14 nm and the quality factor is 18,000, at equilibrium.

A. Bandwidth-Efficiency Trade-Off
The frequency responses of the MRM are measured and shown in Fig. 2(b). The bandwidth is measured through the S 21 scattering parameter, as a function of the frequency. Measurements are done with varied frequency detuning Δf , here defined as the frequency f op of the optical input minus the resonant frequency f res of the cavity, i.e., Δf f op − f res . Figure 2(b) clearly shows the presence of the modulation resonance [12]. This representation is useful to extract bandwidth information at a given detuning state. However, the choice of the optimal operating point is a result of the trade-off between the modulation bandwidth and modulation depth. On one hand, it is generally accepted that the EO bandwidth is proportional to the optical detuning, as per Fig. 2(b). On the other hand, the DC representation of the OMA is often used and allows one to conclude that, for an infinitesimal small-signal excitation, the OMA is proportional to the detuning up to a given point where the relation becomes inversely proportional, substantiating the so-called bandwidth-efficiency trade-off. In addition, we here corroborate the fact that this particular trade-off is also a function of the operating speed and that it cannot be specified only in terms of optical detuning, as noted by Yu et al. [9]. The relation between the OMA and bandwidth as a function of the frequency detuning is expressed, for our modulator, in Fig. 2(c), where the normalized OMA is here defined as the difference between the maximum P max and minimum P min optical power when driven by a small signal over the voltage amplitude V p−p times the input optical power P in , i.e., 1 P max − P min ∕V p−p · P in . For instance, in our case, the optimal operating point at 10 GBaud would be Δf ∼ 7 GHz, whereas it would be Δf ∼ 11 GHz at 40 GBaud.
For the BER measurements of the forthcoming section, we operate the device at a detuning of roughly 10 GHz. Since the vector network analyzer used is limited at 20 GHz, we extrapolate the S 21 curve with the aid of the dynamical model [8], and find the −3 dB bandwidth to be 25 GHz at a detuning of 10 GHz. Since we consider a non-return-to-zero modulation scheme, the spectrum of the signal is thus proportional to a sinc 2 of the modulation speed. Such signal has most of its power inside spectral components with frequency below half of the operating frequency [13]. Additionally, due to the high frequency detuning used in the current demonstration, the insertion loss in the coming transmission experiment is consistently measured below 1 dB.

HIGH-SPEED MODULATION AND TRANSMISSION
A schematic of the test bench for large-signal modulation and transmission is shown in Fig. 3. We use a BER test system as a pseudo-random binary sequence (PRBS) of 2 15 − 1 and clock source. The PRBSs are combined and regenerated with a 3 bit digital-to-analog converter (DAC). The output of the DAC needs to be carefully chosen to linearize the optical output of the PAM signals; see Supplement 1. We use two channels of the DAC for PAM-4 and three for PAM-8. The analog signal is then amplified with a 55 GHz radio-frequency (RF) amplifier and biased with a 70 GHz bias tee. The electrical signal is then sent via a 50 GHz RF, 50 Ω terminated, ground-signal-ground configured microprobe. We use a polarization maintaining fiber, fed with a tunable laser as the optical input, to ensure an optimal coupling with the surface grating couplers. A 250 μm spaced fiber array is used to input and collect the light to and from the SOI chip. The measured fiber-to-fiber insertion loss of the SOI chip is 12.8 dB. We use an optical isolator at the output of the chip followed by an erbium-doped fiber amplifier (EDFA) to amplify the modulated signal. The amplified spontaneous emission is filtered out by using an optical filter. Then, a tunable optical attenuator is used to control the received power. A 70 GHz photodetector is used to achieve optical-to-electrical conversion. The electrical signal is acquired with a 30 GHz analog bandwidth, 80 Gsamples∕s real-time oscilloscope (RTO). Finally, off-line signal processing, including filtering, resampling, and BER counting, is performed numerically.
We examine the performance of the modulator by first considering the PAM eye diagrams collected by the RTO, as shown in Fig. 4, representing examples of the data captured for BER measurements. Figure 4(a) shows a 30 Gb∕s OOK signal as observed at the RTO, whereas Fig. 4(b) shows a 80 Gb∕s PAM-4 signal after equalization. In addition, one can observe the degradation due to the transmission over 5 km by visually comparing Figs. 4(c) and 4(d). It is also possible to observe the improvement due to the equalization by considering Figs. 4(e) and 4(f ). For the purpose of demonstration, the signals have been upsampled to produce the eye-diagrams using an anti-aliasing finite-impulse response filter. Note that this filter is not applied on the signals that are used for BER computations. In addition to the electrical eye diagrams, optical eyes are provided in Supplement 1.
In the transmission experiment, the modulator is driven by peak-to-peak voltages (V p−p ) of 3.5 V and 2.2 V for PAM-4 and PAM-8, respectively. The OOK case uses 4 V of V p−p . The low voltage required to drive the PAM-8 cases demonstrates the possibility of integration between the silicon chip and the CMOS driver. Doing such an integration would be beneficial since it would dramatically reduce the cost and power consumption of the transmitter. The modulator is biased at −5.5 V and operated at Δf ∼ 10 GHz. It is also important to note that, with respect to our setup, we do not apply any kind of digital signal processing at the transmitter. The contributions from additional signal impairments, such as the power penalty due to the chirp, are investigated in Supplement 1. It is noteworthy that, under the aforementioned operating conditions, i.e., Δf > 0, the chirp leads to pulse compression. Nonetheless, the power penalty At the receiver side, we acquire the data at 80 GSa∕s and apply a super-Gaussian, fourth-order filter. The signal is then resampled to 1 sample per bit. The optimal sampling time is taken such that the probability density function (PDF) of each level is best resolved. At this point, the N − 1 optimal decision thresholds are found, where N is the PAM order. Optimal decision thresholds are taken to be at the N − 1 local minima formed by the total PDF of the PAM signal. This is valid under the assumption that the N PDFs are Gaussian and equivalent, i.e., each symbol is equiprobable. Measured PDFs taken at the optimal instant for the decision threshold, along with their Gaussian fits, are provided in Supplement 1. A first round of BERs is computed, providing the raw, or unequalized, BERs.
The resampled data stream is then filtered with a minimum mean square error (MMSE) filter. To create the MMSE estimator, we use a known sequence of transmitted bits, i.e., a training of 2000 bits. A Toeplitz matrix X is populated by the discrete autocorrelation function of the received signal, computed up to a delay τ X . In this paper, our results have been computed with τ X 50. An estimate R of X is computed, R X T X . At the same time, the cross-correlation Q between the received signal and known sequence is computed up to a delay τ Q τ X ∕2. The coefficient of the estimator E is then obtained in a straightforward manner, i.e., E RnQ. The equalized data are finally obtained by computing the convolution between the received data and E. Hence, the MMSE equalization we employ uses fixed coefficients and is not adaptive. A second round of BERs is then computed, providing the equalized BERs. Figure 5 shows the measured BERs in back-to-back and after 5 km SSMF transmission. Data before and after equalization are presented for PAM-4 up to 40 GBaud (80 Gb∕s) and for PAM-8 up to 15 GBaud (45 Gb∕s). We see that the transmission of a 40 GBaud PAM-4 signal at a reasonable BER is possible only when the equalization scheme is used. In contrast, PAM-4 at 32 GBaud and PAM-8 at 15 GBaud are possible even without using equalization. The seemingly high received power is because no RF amplifier has been used after the photodetector, and the sensitivity of our RTO is relatively low. Our setup is thus limited by a noise floor of −7 dBm. The received powers for given BERs could be reduced by improving the noise floor of our setup. Based on the OT4U standard [14], we consider that a 6.7% forward error correction (FEC) overhead can be used such that pre-FEC BER below 3.8 × 10 −3 can be regarded as error-free, i.e., the post-FEC BER will be below 10 −15 [15,16]. This FEC threshold is denoted by a black dashed lines in Fig. 5(a)-5(c). In addition, we consider Gray coding when counting the BER, i.e., the BER is further reduced by a 1∕ log 2 N factor, assuming that erroneous symbol decisions are made to the closest neighboring symbols. As shown in Fig. 5(b), data transmission over 5 km with a BER below the pre-FEC threshold has been achieved for PAM-4 up to 64 Gb∕s with equalization and up to 52 Gb∕s (26 GBaud) without equalization. We notice that the latter result is compliant with emerging IEEE802.3bs standards, specifically for 400 Gb∕s Ethernet at 2 km, where the use of PAM-4 at 26.6 GBaud before FEC has been proposed [17].

POWER CONSUMPTION
Electrical power is dissipated inside the modulator on rising transitions, charging the capacitor, of capacitance C, in the depletion region of the p-n junction. In this case, the energy E consumed by a rising transition of magnitude V is given by E CV 2 [18]. Since PAM modulation formats inherently contain multiple transitions occurring at different magnitudes, it is more convenient to find an expression for the total energy consumed E T by all rising transitions as a function of V p−p . Under the assumption that levels in a given PAM signal are equally distributed inside V p−p , E T is given by where, we recall, N is defined as the PAM order. There are N 2 possible transitions in a PAM signal and log 2 N bit(s) per symbol, so the energy consumed per bit E b is given by To evaluate the power consumption of the modulator, we measured its small-signal frequency responses, and we extracted the capacitance C pn of the p-n junction under equilibrium. The value of C pn under operating conditions, i.e., −5.5 V of applied bias, is computed at C pn 9.4 fF using the model presented in [8]. Therefore, the estimated effective power consumption, i.e., the power that goes through the p-n junction and acts as the modulating force, is 6.5 fJ∕bit at 40 GBaud, PAM-4, and is 1 fJ∕bit for PAM-8 at 15 GBaud. Under operating conditions, the electrical bandwidth is evaluated at 32 GHz. A thorough description of the measurement of C pn , as well as further estimations of the power consumption, are provided in Supplement 1.
We measured the efficiency of the heater to be 33 μW∕GHz. However, the heater has been included as a proof of concept rather than as an optimized heater. Nevertheless, this value is comparable to previous demonstrations [19,20], but at much higher modulation speeds. For instance, 130 fJ∕bit would be necessary to compensate for a 10°C at 80 Gb∕s. Details and suggestions to improve the efficiency are discussed in Supplement 1. An improved efficiency could help to bring down the power consumption of the heater to 5 fJ∕bit [21].

CONCLUSION
We have demonstrated ultrahigh-speed, ultralow-power PAM operation with a silicon photonic modulator. Direct detection below the pre-FEC threshold has been achieved up to 80 and 45 Gb∕s for PAM-4 and PAM-8, respectively. The power consumed by optical modulation has been estimated to be as low as 1 fJ∕bit and 6.5 fJ∕bit for PAM-8 and PAM-4, respectively. To the best of our knowledge, both values are the lowest yet demonstrated at such high data rates. In addition, data transmission over 5 km has been demonstrated for PAM-4 up to 64 Gb∕s with equalization and up to 52 Gb∕s (26 GBaud) without equalization. It is shown that higher-order modulation formats can significantly increase the data rate given the efficiency-bandwidth trade-off of a resonator modulator; in the present work, this is 2 and 3 bits per symbol for PAM-4 and PAM-8, respectively. The enhanced spectral efficiency may drastically reduce the required operating frequency and power consumption of driving and logic circuits for a CMOS-photonic integrated system. These results and findings reveal that silicon resonator modulators with advanced modulation formats are capable of delivering ultrahigh data rates (toward 100 Gb∕s per channel) with ultralow power consumption at the level of fJ/bit, indicating a promising path toward future ultrafast optical interconnects.