Analysis and experimental demonstration of novel 8PolSK-QPSK modulation at 5 bits/symbol for passive mitigation of nonlinear impairments

We propose a new modulation format providing 5 bits of information per recovered symbol while maintaining constant the total optical power. The proposed format applies a simple power constrain to the DP-8QAM format. This modulation format provides a passive way to mitigate nonlinear impairments due to Kerr effects occurring during propagation, and most specifically in the first 40 km. This report presents, to the authors’ knowledge, a new transmission format using solely phase and polarization as modulation methods. The performance of this format, named 8PolSK-QPSK, is experimentally compared with that of the DP8QAM format as both require equal transmitter complexity and implementation penalty, at the expense of a 20% increase in signaling baud rate. The greater nonlinear tolerance of this format is experimentally demonstrated. Moreover, thorough analysis of the Manakov-PDM propagation equation applied to both formats provides analytic explanation of the 8PolSK-QPSK's improved performance. The constant power property of the symbol set of the proposed format mitigates selfand cross-phase modulation (SPM, XPM) nonlinear effects and is experimentally validated over a long-haul transmission system in a WDM scenario. A total throughput of 7×129 Gbps is maintained for the transmission format comparison. Simulation of the same transmission system allows separate analysis of the strength of SPM, XPM and Cross-Polarization Modulation (XPolM) nonlinear effects and demonstrate reduced nonlinear impairments for the proposed format in the first span. We show an experimental reduction of the required OSNR for a BER threshold of 1.4×10 of 0.5 dB for 8PolSK-QPSK compared to DP-8QAM in back-to-back. After 1920 km of SMF fiber, we demonstrate a required OSNR (ROSNR) diminution for increasing launch power, allowing a ROSNR relief of 0.95 dB at optimal launch power of –1 dBm for the proposed format. Using the same threshold, we show an increased reach by more than 34%, or 975 km, at optimal launch power. We also demonstrate that the relative reach increase for 8PolSK-QPSK compared to DP-8QAM monotonically increases with increasing BER threshold and that the BER growth with distance, after the first span, is equal for both formats. ©2013 Optical Society of America OCIS codes: (060.0060) Fiber optics and optical communications; (060.1660) Coherent communications; (060.2330) Fiber optics communications; (190.3270) Kerr effect. References and links 1. G. P. Agrawal, Lightwave Technology Telecommunication Systems (Wiley-Interscience, 2005) 2. G. P. Agrawal, Nonlinear Fiber Optics, 3 ed. (Academic Press, 2001) 3. D. S. Milla, D. Lavery, S. Makovejs, C. Behrens, B. C. Thomsen, P. Bayvel, and Seb J. Savory, "Generation and long-haul transmission of polarization-switched QPSK at 42.9 Gb/s," Opt. Express 19(10), 9296-9302 (2011) 4. E. Agrell and M. Karlsson, "Power-efficient modulation formats in coherent transmission systems," J. Lightwave Technol. 27(22), 5115-5126 (2009) 5. Z. Chen, L. Yan, W. Pan, B. Luo, Y. Guo, H. Jiang, A. Yi, Y. Sun and X. Wu, "Transmission of multipolarization-multiplexed signals: another freedom to explore?," Opt. Express 21(9), 11590-11605 (2013) 6. I. B. Djordjevic, A. Z. Jovanovic, M. Cvijetic and Z. Peric, "Multidimensional Vector Quantization-Based Signal Constellation Design Enabling Beyond 1 Pb/s Serial Optical Transport Networks," IEEE Photonics J. 5(4), 7901312-7901312 (2013) 7. M. Chagnon, M. Osman, X. Xu, Q. Zhuge, and D. V. Plant, "Blind, fast and SOP independent polarization recovery for square dual polarization–MQAM formats and optical coherent receivers," Opt. Express 20(25), 27847-27865 (2012) 8. D. Marcuse, C. R. Menyuk, and P. K. A. Wai, "Application of the Manakov-PMD equation to studies of signal propagation in optical fibers with randomly varying birefringence," J Lightwave Technol. 15(9), 1735-1746 (1997) 9. J. Proakis, Digital Communications, 4 ed. (McGraw-Hill Science, 2000) 10. S. Benedetto, R. Gaudino, and P. Poggiolini, "Direct detection of optical digital transmission based on polarization shift keying modulation," IEEE J. Sel. Areas Commun. 13(3), 531-542 (1995) 11. S. Benedetto and P. Poggiolini, "Theory of polarization shift keying modulation," IEEE Trans. Commun. 40(4), 708-721 (1992) 12. E. Ip and J. M. Kahn, "Carrier synchronization for 3and 4-bit-per-symbol optical transmission," J. Lightwave Technol. 23(12), 4110-4124 (2005) 13. M. Winter, C.-A. Bunge, D. Setti, and K. Petermann, "A statistical treatment of cross-polarization modulation in DWDM systems," J. Lightwave Technol. 27(17), 3739-3751 (2009) 14. P. K. A. Wai and C. R. Menyuk, "Polarization mode dispersion, decorrelation, and diffusion in optical fibers with randomly varying birefringence," J. Lightwave Technol. 14(2), 148-157 (1996) 15. J. P. Gordon and H. Kogelnik, "PMD fundamentals: polarization mode dispersion in optical fibers," in Processings of the National Academy of Sciences 97(9), 4541–4550, (2000) 16. J. H. Lee, K. J. Park, C. H. Kim, and Y. C. Chung, "Effects of nonlinear crosstalk in optical PMD compensation," IEEE Photonics Technol. Lett. 14(8), 1082-1084 (2002) 17. M. Winter, "A statistical treatment of cross-polarization modulation in DWDM systems & its application," Techn. Univ, Berlin, Deutsche National Bibliothek, http://d-nb.info/1009105868 (2010) 18. M. Karlsson and H. Sunnerud, "Effects of nonlinearities on PMD-induced system impairments," J. Lightwave Technol. 24(11), 4127-4137 (2006) 19. K.-P. Ho and J. M. Kahn, "Electronic compensation technique to mitigate nonlinear phase noise," J. Lightwave Technol. 22(3), 779-783 (2004) 20. S. Kumar, "Effect of dispersion on nonlinear phase noise in optical transmission systems," Opt. Lett. 30(24), 3278-3280 (2005) 21. K.-P. Ho, "Probability density of nonlinear phase noise," J. Opt. Soc. Am. B 20(9), 1875-1879 (2003) 22. M. Nazarathy, W. V. Sorin, D. M. Baney and, S. A. Newton, "Spectral analysis of optical mixing measurements," J. Lightwave Technol. 7(7), 1083-1096 (1989) 23. M. Shtaif, "Analytical description of cross-phase modulation in dispersive optical fibers," Opt. Lett. 23(15), 1191-1193 (1998) 24. A. Carena, V. Curri, G. Bosco, P. Poggiolini, F. Forghieri, "Modeling of the impact of nonlinear propagation effects in uncompensated optical coherent transmission links," J. Lightwave Technol. 30(10) 1524-1539 (2012) 25. ITU-T Rec. G. 975.1, “Forward error correction for high bit-rate DWDM submarine systems,” (2004) 26. D. Lavery, C. Behrens, and S. J. Savory, "A comparison of modulation formats for passive optical networks," Opt. Express 19(26), B836-B841 (2011) 27. D. Chang, F. Yu, Z. Xiao, Y. Li and N. Stojanovic, "FPGA verification of a single QC-LDPC code for 100 Gb/s optical systems without error floor down to BER of 10," Proc. OFC 2011, Paper OTuN2, March 2012 28. X. Zhou, L. Nelson, R. Isaac, P. Magill, B. Zhu, and D. Peckham, "1200km transmission of 50GHz spaced, 5x504-Gb/s PDM-32-64 hybrid QAM using electrical and optical spectral shaping," Proc. OFC 2012, Paper OM2A.2, March 2012


Introduction
Standard optical fibers exhibit a nonlinear response that introduces a power-dependent phase retardation into the passing wave [1], causing a multitude of effects like self-phase modulation (SPM), cross-phase modulation (XPM) and cross-polarization modulation (XPolM).Undesired phase modulations from nonlinear impairments get converted to amplitude fluctuations during propagation [2], further deteriorating the signal quality.The greater the instantaneous power variation of the optical waveform, the greater the nonlinear impairments.
Modulation formats using only phase and polarization manipulation help reduce the nonlinear impact and increase propagation distance.Polarization Shift (PS)-Quadrature Phase Shift Keying (QPSK) proved to be a good candidate for ultra long-haul transmission allowing greater launch power and increased reach [3,4].With the advent of optical coherent receivers, polarization multiplexing (PM) rapidly gained interest, as it allows modulation on a 4 dimensional (4D) space and an increase in the binary transmission rate.As an example, a recent publication analyses legacy formats of on-off keying (OOK) and binary phase shift keying (2PSK) when multiplexed on 4 states of polarization (SOP) as a novel modulation format employing PM [5].Full dual polarization IQ modulators allow the study and use of novel 4D signaling schemes [4].Formats of higher dimensions were suggested lately, as in [6] where the authors propose a signal constellation design algorithm generating multidimensional signaling schemes that increase the number of dimensions above 4D, where spatial multiplexing in few mode fibers is employed to increase the number of degrees of freedom.
PM offers drawbacks, amongst which the inherent loss of 3dB signal power per polarization to maintain the total optical power.Some formats like Polarization Shift Keying (PolSK) rely solely on modulating the State of Polarization (SOP) to imprint information.Such format benefits from the constant power property, but lack bits per symbol from the smaller set size N SOP of possible SOPs, log 2 (N SOP ).
Higher order modulation formats are widely employed as a way to increase the data throughput by allowing a larger number of different amplitudes and phases.However, such formats inherently exhibit larger power variations from symbol to symbol, accentuating nonlinear effects, especially when polarization multiplexed is used [7].Higher order modulation formats that do not exhibit larger power variation are of great interest in order to meet the growing need of reach and data throughput.
In this paper, we present a new modulation format for single mode fibers relying on a mixture of PolSK and PSK.We entitled this format 8PolSK-QPSK.The format provides 5 bits of information per symbol where each orthogonal polarization exhibits an 8QAM constellation like Dual Polarization (DP)-8QAM format, itself providing 6 bits per symbol.One of the main characteristics differentiating the 2 formats is the waveform independence on orthogonal states: instead of having completely independent 8QAM formats on each polarization as is the case for DP-8QAM, the proposed 8PolSK-QPSK format couples part of the information on each polarization such that each symbol in the dual polarization symbol family share the property of having equal total optical power.We show both analytically and experimentally that modulating only in polarization and phase allowing a symbol set of equal power attenuates the impact of the optical Kerr effects.To tackle both intra-and interchannel nonlinear effects, we experimentally transmit 7 channels, all modulated in either DP-8QAM or 8PolSK-QPSK, and study the performance of the central channel.For a fair format comparison, the bit rate per channel is kept constant at 129 Gbit/s.To satisfy this constant bit rate the symbol rate of 8PolSK-QPSK has to be higher than that of DP-8QAM by 20%.Besides this modest baud rate difference, the transmitter architecture and the transmitter implementation penalty are the same for both formats.
This paper is organized as follows.First, the proposed 8PolSK-QPSK modulation format is presented in Section 2 and compared to the DP-8QAM format.The advantage of combining PolSK and QPSK as a modulation scheme are highlighted.We also describe a novel minimum distance detection scheme applicable to any constant power format that is less computationally demanding than the conventional complex minimum distance detection.Section 3 presents an analysis of the nonlinear Manakov-PMD equation [8] and shows how the modified DP-8QAM format can mitigate nonlinear effects.Propagation simulations using both formats demonstrate where and how the 8PolSK-QPSK format suffers less from the Kerr effect.Subsequently, Section 4 presents the experimental test bed used to compare performance of both formats.In this section, we also introduce a way to generate 8PolSK-QPSK format with a generic delay-and-add dual polarization emulator and a controlled single polarization (SP) 8QAM signal.Finally, system performance using both formats is presented in Section 5, where we show performance improvements using the proposed format, including an increased propagation distance of 34%, or 975 km, compared to DP-8QAM, and a reduction of the required OSNR for a BER of 1.4×10 -2 of 0.95dB.The paper concludes in Section 6.

Proposed 8PolSK-QPSK modulation format
The proposed modulation scheme is very similar to the polarization multiplexed 8QAM format (DP-8QAM).For DP-8QAM, each polarization carries independently an 8QAM modulation format.8QAM formats can be thought of as a QPSK format (2 bits/symbol) from a symbol family {1, i, -1, -i} on top of a binary (1 bit/symbol) format of symbol family {a, be iπ/4 }, giving a total of 3 bits/symbol/polarization.Of course, both QPSK and binary symbols are independent on each polarization for DP-8QAM.The binary symbol's amplitude corresponds to the 2 possible rings as depicted in Fig. 1.For a total mean signal power of 1, the total mean power per polarization is 0.5 and for equidistant constellation points, the 2 radii are a = 0.4597 and b = 0.8881.Consequently, for DP-8QAM formats, the total optical power jumps from symbol to symbol to any of the 3 possible power levels: a 2 +a 2 =0.4226, a 2 +b 2 =1, or b 2 +b 2 =1.5774.
In the proposed 8PolSK-QPSK format, the binary amplitude bit is not independent on each polarization.This constitutes the main difference between DP-8QAM and the proposed format.In 8PolSK-QPSK, if the x-pol.shows a symbol in the inner ring (binary symbol a), the simultaneous symbol on the y-pol.has to be a symbol in the outer ring (symbol be iπ/4 ), and vice versa.This constraint removes one bit of freedom from DP-8QAM's 6 bits, providing 5 bits per symbol for the format.As one can compute, enforcing different amplitude on each polarization restricts the total optical power to always be a 2 +b 2 =1 for any given symbol, relieving DP-8QAM from its large 3-level power fluctuation In the proposed 8PokSK-QPSK format, we only modulate information on the polarization and phase of an optical waveform.The advantage of combining polarization and phase shift keying is found in the analysis of the nonlinear (NL) Schrödinger (NLS) equation.A well known, simplistic way to study the Kerr nonlinear effects in a single mode fiber is to neglect all impairments but attenuation and Kerr nonlinearity in the NLS equation and study the equation governing the field evolution during propagation.There is a drawback, however, when using solely polarization and phase as a way to imprint information on a waveform.The 8PolSK-QPSK format is a 4D, constant symbol power modulation format that carries M=5 bits/symbol and consequently has a symbol set of size 2 M =32.By defining the dimensions as X ̂REAL , X ̂IMAG , Ŷ REAL and Ŷ IMAG , the constraint is written as X ̂REAL-i , for all symbol i in the set.Increasing the symbol set size is equivalent of packing more states in a 4D space, which naturally reduces the minimum squared Euclidean distance (SED) between neighboring states, increasing the required SNR for a target BER [9].However, by constraining the symbols to satisfy the unit power criterion we lose the independence of all dimensions in the 4D space; all 2 M symbols now have to be sitting on the surface of a 4 dimensional sphere.We end-up with a 4D space of only 3 degrees of freedom.Packing states on a 4D surface normally brings the states closer to each other than when packing them on independent 4Ds.Consequently, the drawback of constant total power formats modulating solely polarization and phase is that the SED is reduced, consequently requiring a larger SNR compared to the unconstrained 4D modulation format of equal symbol set size.However, for the specific cases of 64-states DP-star-8QAM and its proposed constrained 32-states 8PolSK-QPSK, the two formats do share the same smallest SED between closest states: a 2 +a 2 =0.4226.
As the Stokes space represents well the power and state of polarization of symbols, we depict in Fig. 2 the constellations in the Stokes space of (a) the 8PolSK-QPSK format and (b) the DP-8QAM format, both with added white Gaussian noise giving 16 dB of signal to noise ratio (SNR).In Stokes space, the radius of a point represents its power.Fig. 2(b) clearly depicts the increased power variation of the DP-8QAM format.For DP-8QAM, one fourth of all possible symbols have radii (r=0.4226)below the grey sphere of radius 1, half the symbols lie exactly on the sphere and one fourth reside outside (r=1.5774).For 8PolSK-QPSK format, however, all generated states reside on the sphere.One can observe that the 8 SOP states of DP-8QAM lying on the sphere are exactly the same 8 states for 8PolSK-QPSK.If we distinguish SOPs having the same orientation but of different magnitude, we realize that DP-8QAM has 16 SOPs.For both formats, each SOP can carry 4 different absolute phases.Hence, we can compute that 8PolSK-QPSK supports 8×4=32 symbols and DP-8QAM, 16×4=64.When considered as a pure 3 bit/symbol Stokes space constellation, 8PolSK is referred to as "cubic"-PolSK [10,11], clearly pictured in Fig. 2(a).

Minimum distance detection for 8PolSK-QPSK
The 8PolSK-QPSK format shows a total of 2 5 =32 possible symbols.Each symbol can be represented by a Jones vector of the form , where S i-x and S i-y are the information on x̂ and ŷ polarization of the i th Jones symbol.As the information on each polarization is coupled, we cannot apply an independent minimum distance detection on each polarizations over the 8 constellation points of Fig. 1, as is done for DP-8QAM.With coupled polarization information, the minimum distance detector needs to use information on both polarizations simultaneously for symbol detection.If |R⟩ is the received symbol, the minimum distance detector is [9] 2 min , where we sum the component wise subtraction of each polarization.There is a simpler, less computationally extensive way to derive the minimum distance detection.This technique uses ( ) To show this, we expand the minimization argument of Eq. ( 2) to obtain ( ) The proposed format exhibits constant total power, so ⟨S i |S i ⟩=1 for any symbol, and the first term ⟨R|R⟩ is independent of different symbols |S i ⟩.The only relevant term becomes the central one.Consequently, instead of computing three terms to obtain Eq. ( 4), the minimum distance detector's complexity is alleviated by computing only one term, as in Eq. ( 3), for the 8PolSK-QPSK format.It is noteworthy to mention that the computation of the minimum distance using Eq. ( 3) is only valid for formats having a symbol set exhibiting constant power.
A different symbol-to-bits mapping is indeed required for the two formats.For DP-8QAM, we use the optimal bit-to-symbol mapping for 8QAM [12] applied independently to both orthogonal polarizations.Optimal mapping means a bit mapping that yields the smallest average number of bits in error per symbol in error.This mapping uses identical Gray-coded QPSK mapping on each ring of Fig. 1, where one ring is rotated by 45°, with an extra amplitude bit, for a total of 6 bits.For the 8PolSK-QPSK format, the first bit of each symbol is assigned to power, defining if x̂ or ŷ was decided to have more power, equivalent of slicing Fig. 2(a) on the plane A 1 =0.Then, 2 phase bits on x̂ and on ŷ are Gray coded independently, complying with the optimal mapping scheme found in [12], for a total of 5 bits.

Nonlinearity mitigation by modulating with constant power
The idea behind the 8PolSK-QPSK is to reduce the total power variation at the transmitter while keeping a high bit per symbol efficiency.From the wave equation, we know that the nonlinear phase depends on the total optical power in the fiber.The vectorized form of the NLS equation for the modulated optical field, referred to as the Manakov-PMD equation, is described as [1,8,13,14]  ( ) ( ) where b ˆ is the unitary birefringence vector, σ → is the vector of the three Pauli spin matrices [2], |u⟩=|u(z,t)⟩=(u x , u y ) T is the field envelope in the Jones vector notation.The second and third terms on the right-hand side of Eq. ( 5) are the phase-velocity and group-velocity birefringence, the latter being commonly called PMD, with a respective strength of Δβ 0 and Δβ 1 .All b ˆ, Δβ 0 and Δβ 1 are z-dependent.All the nonlinear components are lumped in the last term and include Self Phase Modulation (SPM), Cross-Phase Modulation (XPM) and Cross Polarization Modulation (XPolM).There is one nonlinear term that is intentionally omitted in Eq. ( 5) and is the "nonlinear PMD".This term, thoroughly discussed in [8], is proven to be vanishingly small and to have negligible effects.From Eq. ( 5) it is clear that the nonlinear processes impact the phase of the waveform and that this impact is itself dependent on the power ⟨u|u⟩ of the propagating waveform.For single carrier systems, ⟨u|u⟩=u x 2 (t)+u y 2 (t) and consequently the more time varying the power of the waveform, the greater the variation of the nonlinear phase.
If |u⟩ is the sum of modulated envelopes at different frequencies, for instance in the case of a WDM transmission, |u⟩=∑ k =1 N |u k (z,t)⟩ e iω k t and one can show that the nonlinear phase term ⟨u|u⟩|u⟩ of Eq. ( 5) becomes for the n th carrier, after mathematically retaining only terms oscillating at the n th carrier frequency ω n .We used the identity [15] where u → m is the Stokes representation of |u m ⟩ and I is the 2-by-2 identity matrix.Here, ω k =kω GRID , where ω GRID is the WDM grid, e.g.50 GHz.
The third term in Eq. ( 6), representing XPolM, can be cast as is a unitary Stokes vector.Put in this form, one can realize that the XPolM term is expressed similarly as the birefringence term of Eq. ( 5), with a birefringence vector v ˆ and amplitude -8γ|∑ m≠n u → m |/9.In some literature, XPolM is in fact called nonlinear birefringence [2,16], as clearly depicted in Eq. ( 6).As a consequence of XPolM, the state of polarization of the channel of interest is modified by a rotation around the instantaneous sum of the Stokes vectors of the interfering channels, ∑ m≠n u → m , and this nonlinear birefringence term depends on the statistics of the interfering channels' cumulative power, |∑ m≠n u → m |, and that of their cumulative direction, v ˆ [17,18].This XPolM term is the only nonlinear term that is polarization dependant, and XPolM's impact depends on the SOPs of all other co-propagating channels.

Nonlinear variance study for 8PolSK-QPSK and DP-8QAM
In the following subsection, we study the variance of each nonlinear term appearing in Eq. ( 6) for the two modulation formats: the proposed format and the DP-8QAM.We study the central channel of an N-channel system using Eq. ( 5).The full split-step Fourier method is employed for propagation.We use, in the simulation, parameters that will be used in a subsequent experimental demonstration presented below, namely N=7 co-propagating channels of identical modulating format and power, launched with a pulse shape of root-raised cosine (RRC) of roll-off factor α=1/8, at a baud rate of 25.8 Gbaud for 8PolSK-QPSK and 21.5 Gbaud for DP-8QAM.The channels sit in a 50 GHz grid and we study the impact on the central channel.In the simulations, the SOPs of all channels at launch are random.In this simulation, we deliberately neglected ASE noise added by EDFAs, PMD, and birefringence.ASE is neglected because we don't want to interfere the statistics of the signal with external additive white Gaussian noise, as is shown in [19].PMD is neglected because we do not want to convolve XPolM induced rotations with frequency dependant rotation from PMD, and finally birefringence is neglected as it simply acts as a global unitary rotation equally applied to the entire waveform.Amplification is lumped at every 80 km and the fiber is standard SMF-28 with β 2 = -21.4ps 2 /km and an attenuation coefficient of α dB/km =0.2 dB/km.The launch power per channel is set at +1dBm to trigger significantly some nonlinear impairments.We also use γ = 1.3 W -1 km -1 .
We are interested in the variance of all nonlinear effects applied to the channel of interest at every distance z, with respect to the mean power of said channel at such distance.As the nonlinear noise is proportional to the signal power [20], one can understand that the NL variance will be proportional to the squared signal power.Consequently, as applied in [21], we normalize the strength of the NL impairments by the squared signal power to alleviate the NL noise from its power dependence.In both our simulations and experiments we set the power per channel to be equal for all channels, allowing normalizing the strength of different NL impairments by the squared power of a single channel.The equations allowing nonlinear monitoring are SPM: XPM: As |u⟩=|u(z,t)⟩, the nonlinear term ⟨u|u⟩ of Eq. ( 5), expended in Eq. ( 6) as ⟨u|u⟩ SPM +⟨u|u⟩ XPM+ ⟨u|u⟩ XPolM , is both time and distance dependant.The variance of those three nonlinear terms is applied over the time variable, yielding the z-dependant variances σ SPM 2 , As the solution of the propagation equation of Eq. ( 5) evolves for every dz, the local temporal variance of the NL processes also evolves with distance.The variance of the three NL processes of Eq. ( 5), namely x 1 =⟨u n |u n ⟩, x 2 =∑ m≠n 3/2⟨u m |u m ⟩ and x 3 =∑ m≠n ½u → m •σ → , are depicted in Fig. 3.It is important that the y-axis of Fig. 3 be understood as the normalized variance of x 1 , x 2 and x 3 .By removing the normalization we obtain the observable nonlinear strength variances, σ ~NL 2 =σ NL 2 ×⟨u n |u n ⟩ ---2 where the subscript NL is either SPM, XPM or XPolM.
Of course, ⟨u n |u n ⟩ ---2 decays with distance as e -2αz , where α=α dB/km ⁄ (10log 10 (e)).The averaging operator X(t) -is a statistical ensemble average and time average, where the time averaging operator is lim T→∞ 1/T∫ -T/2 T/2 X(t)dt [22].Monte Carlo simulations of 100 propagation runs were realized to determine the properties in Eqs. ( 7) through (9) with confidence.Each propagation consists of a PRBS sequence of 2 12 symbols with 8 samples per symbols for each of the 7 channels.The results are presented in Fig. 3.We observe that the behaviour of the variance of nonlinear processes at launch, when z=0, is much different than that after 40 km and more, and that this is applicable for both modulation formats.The interplay between chromatic dispersion with channel walk-off and nonlinearity greatly modifies the initial statistics.At launch, we obtain a SPM variance σ SPM 2 of 0.138 for 8PolSK-QPSK and 0.26 for DP-8QAM.Those values show that the constant power format exhibits a power variance for SPM that is roughly 1.9 times less than that of DP-8QAM.The same is applicable to XPM: at launch, XPM's variance σ XPM 2 is roughly 1.86 for 8PolSK-QPSK and 3.52 for DP-8QAM.We deliberately added markers, ▷ , at those nonlinear strengths at z=0.We will come back and relate those values to theory in Section 3.2.As the waveform propagates in the first span, the relative variances σ SPM 2 , σ XPM 2 and σ XPolM 2 increase very rapidly until they each reach a value roughly maintained for the rest of the fiber length.After about only 40 km, the interplay of CD with nonlinearity renders respective variances seemingly the same for both formats We will show in Section 5 experimental results demonstrating that the constant power 8PolSK-QPSK format performs better than DP-8QAM format.Propagation simulation results in Fig. 3 well explain the reasons.This performance improvement is due to the smaller SPM and XPM variances at launch.
It is well known that fiber dispersion plays two roles in the process of XPM induced intensity interference.The first is translating the nonlinear phase modulation into intensity fluctuations and the second is causing signals to propagate with different velocities [1,23].It was previously concluded that the nonlinear XPM phase interaction between the signals takes place only in the beginning of the fiber [1,18,23], occurring over a short length in which the walk-off between the signals is small.In that part of the fiber, the phase of the channel of interest gets modulated by neighboring channels, as the walk-off between the signals is minimal and pulses are still well defined and contained in their symbol slot, while the intensity modulation is small, as cumulative CD is almost null.After the first walk-off length, the signals then propagate linearly over the rest of the fiber, where the action of dispersion on the phase modulated signal are continuously translated into intensity fluctuations [23], appearing as a continuous additive Gaussian noise [24].As the bulk of nonlinear XPM phase shift is induced over the first walk-off length only, if that can be kept small, the overall XPM penalty will be smaller [18].Consequently, the better performance of 8PolSK-QPSK format is explicable by the fact that it exhibits a much smaller XPM strength at launch, and SPM for that matter, hence suffering less nonlinear penalty.Results in Fig. 3 also concur with those observations from the literature: with a dispersion parameter of β 2 = -21.4ps 2 /km and a grid spacing of 50 GHz, the first neighboring channels are delayed at a rate of 1.07 ps/km.With a pulse duration of 38.5 or 46.5 ps, the walk-off length is around 40 km: the required distance before nonlinear variances converge, as we can observe in Fig. 3.
Figures (c) and (d) in Fig. 3 depict the behavior of normalized nonlinear strengths after longer propagation distances by providing a closer look at σ SPM 2 and σ XPM 2 , respectively.Although almost identical, we observe that the SPM and XPM variances for DP-8QAM are slightly higher than that of 8PolSK-QPSK.Albeit not the primary argument justifying improved performance of the proposed format, this subtle reduction of σ SPM 2 and σ XPM 2 occurring at longer distances would also benefit the 8PolSK-QPSK format, as well as the principal and dominant reduction occurring in the first walk-off length.Finally, we also observe an oscillation of the normalized nonlinear strengths of period equal to the EDFA spacing of 80 km, showing a ramp up of the relative power variances for the first 20 to 25 km after each EDFAs, followed by a slow decay for the rest of the span.

Variance of SPM, XPM and XPolM at launch
The relative power variance numbers obtained at launch (z=0) describing SPM and XPM can be recovered analytically.In order to analytically compute Eq. ( 7) for SPM, we need to calculate ⟨u n |u n ⟩ 2 ---.If the launched pulse shape function is g(t) and the polarization multiplexed After keeping only terms of ⟨u m |u n ⟩⟨u k |u l ⟩ that have a non-zero statistical ensemble average, we end up with

u u u u u u g t g t mT dt T u u u u u u u u u u g t mT dt T
This equation is valid for the symbol set |S p ⟩ of any modulation format, and for any pulse shape g(t), even for a pulse shape that includes CD.The respective terms --are modulation format dependant.If the magnitude of the symbols |S p ⟩ are such that the mean total power is unitary, one can show that X kkll = 1, X kllk = 0.5 and X kkkk = 1 for 8PolSK-QPSK, whereas X kkll = 1, X kllk = 0.5 and X kkkk = 7/6 for DP-8QAM.If the pulse shape is root-raised cosine (RRC) of any roll-off factor 0<α<1, the mean power ⟨u i |u i ⟩ -can be relatively easily analytically calculated to be 1/T.However, the analytical expression for ⟨u i |u i ⟩ 2 --is more complicated and it is out of the scope of this paper to derive it.
--is both pulse shape dependant and format dependant.It is easy to obtain the numerical value of Eq. ( 11) using a mathematical software.Now that we have an expression for ⟨u i |u i ⟩ 2 --and its value, we can obtain the relative nonlinear variance of SPM and XPM from Eqs. ( 7) and ( 8) for each format.From the result of σ SPM 2 , σ XPM 2 can easily be obtained: for a N-channel co-propagation, σ XPM 2 =(N-1)(3/2) 2 σ SPM 2 .The numerical evaluation of σ SPM 2 for RRC with α=1/8 is 0.13785 and 0.26015 for 8PolSK-QPSK and DP-8QAM respectively.Consequently, with N=7, σ XPM 2 yields 1.861 and 3.512 for the 2 formats, respectively.Those values, obtained from Eqs. ( 7) and ( 8), map exactly to our propagation simulation results at z=0 in Fig. 3. Small right-pointing triangle markers " ▷ " located at those theoretical values of σ SPM 2 and σ XPM 2 for both formats show the match between simulation and theory at launch for σ SPM 2 and σ XPM 2 , when no CD nor nonlinearity is applied on the waveform.
A further discussion on the XPM impact on a channel when an increasingly larger counts of copropagating channels are added to the link is of interest.It is known that XPM occurs only if the two pulses from different channels overlap temporally [2].It is also established that XPM-induced phase shift decreases as the relative group-velocity difference (GVD) increases, where the latter is proportional to the frequency difference between channels [2].Consequently, only close frequency neighbours to a channel have a significant XPM impact on said channel.Channels added increasingly farther from a reference channel observe increasingly larger GVD, and pulses from the two different channels will traverse at an increasingly different speed, blurring the effect of XPM.As Fig. 3 only shows the evolution of the variance of the three nonlinear elements in Eq. ( 5) as it is numerically propagated, the reduced effect of XPM impairment from increasingly farther channels is not directly shown in this Fig.As the number of aggressor channels grows, located farther away from a reference channel, their XPM impact on said reference channel would be negligible compared to XPM's impact from the closest neighbours.
The variance of XPolM's rotation strength, one the other hand, is random by nature and increases to the same steady-state average, and becomes rapidly agnostic to the modulation format.The rapid settling of the 3 variances σ SPM 2 , σ XPM to an approximately steadystate value concurs with a recent conclusion that nonlinear interferences can be modeled as excess additive Gaussian noise [24].We show here that the variance of that additive Gaussian noise does tend to a coarse steady state value very rapidly.
For XPolM, the statistics of the variable X=|∑u → m | at launch, namely the mean μ X and variance σ X 2 , depend on the modulation format, the number of other co-propagating channels and the pulse shape applied to all channels.The nonlinear XPolM strength X is naturally a random process due to both the instantaneous randomness of the data of every other channel affecting one channel and due to walk-off between channels [18].As |∑u → m | is the norm of a Stokes space vector, the variable represents time varying power.The mean and variance of |∑u → m | over time can easily be numerically obtained.For a N=7 channel WDM propagation where each channel's power is normalized to 1, with RRC pulse shaped with α=1/8, one can compute that μ X ≈2.412 and σ X 2 ≈1.01 for 8PolSK-QPSK where μ X ≈2.523 and σ X 2 ≈1.19 for DP-8QAM.Consequently, σ XPolM 2 in Eq. ( 9) equals 1.01 for 8PolSK-QPSK and 1.19 for DP-8QAM.We can observe that the XPolM variance obtained in the propagation simulation in Fig. 3, at launch, match exactly those numbers.Again, this match is depicted in Fig. 3  ) is a random variable of Chidistribution with 3 degrees of freedom.It is known that for a Chi-distribution where the generating Gaussian variables all have zero-mean and a variance of σ 2 , the Chi variable X has a mean of μ X = σ√2×Γ(2)/Γ(3/2) where Γ is the Gamma function, and a variance of One can show numerically that the variance σ 2 of every u i TOT is larger by ≈10% for the DP-8QAM format compared to that of 8PolSK-QPSK.This explains why both the mean and variance of the strength of XPolM |∑u → m | is smaller for 8PolSK-QPSK at launch.Finally, we can conclude that even the impact of XPolM is lessened for 8PolSK-QPSK format during the early stage of propagation.
It is noteworthy to observe in Eq. ( 6) that the impact of XPolM is 3 times less efficient than that of XPM.Hence the benefit of focusing on reducing the XPM effect over XPolM.The above analysis validates our assumption that having constant power modulation on all wavelengths would reduce nonlinear effects.The experimental test and results presented below confirm our analysis.

Experimental Test bed
To assess the performance of the two modulation formats, we use the following test bed shown in Fig. 4. In order to use the same transmitter configuration, and consequently the same transmitter implementation penalty, the only parameter modified when switching from one format to the other is the waveform baud rate imprinted in the Digital to Analog Converters (DACs) using a fixed clock rate.The test bed begins with 7 lasers combined to generate the WDM waveform.Six of which are DFBs and the central channel of interest is an ECL with laser linewidth < 100 khz.Lasers are placed on a 50 GH grid.The 6 DFBs, of linewidths smaller than 1 MHz, are used as aggressors acting on the central channel.All channels have equal power.This WDM configuration allows assessing the full nonlinear degradation, where all nonlinear impairments are included.A SP-IQ modulator bulk modulates the laser tones using two DACs clocked at a rate of 32 GSa/s for both formats.To modulate the different formats at a their desired symbol rates, a proper Root Raised Cosine (RRC) matched pulse shaping filter with a roll-off factor of 1/8 is applied.In order to yield the same bit rate of 129 Gbit/s, the 6 bits/symbol DP-8QAM format runs at a baud rate of 21.5 Gbaud, whereas the 5 bits/symbol 8PolSK-QPSK run at 25.75 Gbaud.The reason why this latter symbol rate is not 129/5=25.8GBaud is twofold.First, the DACs' memory has to be a power of 2 and second, the DAC rate is deliberately fixed for both formats.The difference between a baudrate of 25.8 and 25.75 is assumed negligible.The DAC's pattern length is set to 2 19 .The SP-IQ modulator is driven in the linear regime.Following the IQ modulator is the dual polarization emulator with a delay set at 108 ns.This delay translates to an integer number of symbols at both 21.5 GBaud (2322 Symbols) and 25.75 GBaud (2781 Symbols).Then, two 50/100 GHz passive interleavers are used to temporally decorrelate even and odd channels.The WDM waveform is then boosted and attenuated to sweep the launch power before launching in an optical recirculating loop.The latter is composed of 4 spans of 80 km of Corning ® SMF-28 e+ LL fiber with dual-stage inline EDFAs of noise figure ≈5.5 dB.A gain flattening filter is inserted after the second span inside the loop.Chromatic dispersion is uncompensated.A noise loading stage follows the recirculating loop for OSNR sweeping followed by a channel selecting filter, isolating the central channel of interest.Finally, a preamp and a 0.4 nm filter are preceding the optical coherent receiver.The receiver consists of a dual polarization 90° optical hybrid, 4 balanced photodiodes (U 2 T BPDV2020R) and a 4 channel real-time scope sampling at 80 GSa/s.

8PolSK-QPSK signal generation with a dual polarization emulator
Generating a 8PolSK-QPSK format with a SP-IQ modulator and a dual polarization delayand-add emulator requires some handling of the signal provided to the SP-MZM.The dual polarization emulator consists of splitting the incoming SP waveform on 2 orthogonal branches, delaying one, and recombining on two orthogonal polarizations, as pictured in Fig. 4. For regular Dual Polarization (DP) formats like DP-8QAM, the delay of the DP emulator as no impact on the output format: it will always be DP-8QAM as long as the delay is an integer number of symbols and is not zero.Generating a 8PolSK-QPSK format out of a SP-8QAM format, however, requires extra manipulation on the SP waveform out of the Mach-Zehnder modulator (MZM).The SP waveform generated has to be related to the DP emulator's delay.As explained in Section 2, the symbol constellation of a single polarization of the 8PolSK-QPSK format is exactly the same as that of the DP-8QAM format, i.e. the 8QAM constellation of Fig. 1.However, the 8PolSK-QPSK signal out of the DP emulator has to exhibit the constant power constraint, not observable before the emulator.
To generate a SP-8QAM waveform that will get converted to 8PolSK-QPSK after the DP emulator having a delay of N=2781 symbols, a specific repetitive pattern has to be imprinted on the DACs.Let's mention that for a DAC sampling rate of 32 GSa/s, a DAC memory of 2 19  samples and a symbol rate of 25.75 GBaud, a total of M=421888 symbols are repeatedly generated by the DACs.Two different symbols streams are needed to properly mimic both 8PolSK-QPSK and DP-8QAM formats.The first and most simple symbol stream is a random QPSK sequence of length M. The second symbols stream is what will determine if the waveform after the DP emulator is a 8PolSK-QPSK or DP-8QAM format.If the second symbol stream is a random sequence also of length M of symbols {a, be iπ/4 } and both streams are multiplied symbol by symbol, then the waveform generated is DP-8QAM.To generate an 8PolSK-QPSK format, the second symbol stream is constructed as follows.First, we generate a random sequence of length N from symbol family {S 0 =a, S 1 =be iπ/4 }.We call this sequence S.Then, a sequence of length 2N is generated by concatenating the inverse of S to itself, generating the sequence S ' = [S S -].If S '(k)=S 0 , S '(N+k)=S 1 and vice versa.This new sequence S ' is then replicated and truncated after M symbols are generated.By multiplying this symbol stream with the QPSK stream, we will generate the desired 8PolSK-QPSK format after the DP emulator of DP delay N. One can prove that the generated dual polarization signal |u(n)⟩=( u x (n), u y (n)) T =( S '(n), S '(n+N) ) T will have the desired constant power ⟨u(n)|u(n)⟩=1 for every symbol n.Of course, this tedious way of generating this constant power signal is avoided with a full Dual Polarization-Dual Parallel MZM.

Results
In this section we present the performance of both 8PolSK-QPSK and DP-8QAM modulation formats.We assume BER free operation with 29% coding overhead (OH) for hard decision forward error correction (HD-FEC) with a BER threshold of 1.4×10 -2 , providing slightly more coding overhead than the 25% used in references [25,26].First, we present in Fig. 5 the maximum reach achievable for a BER of 1.4×10 -2 as a function of the launch power per channel.As the launch power jumps were relatively large by 2dB, the resolution and hence the optimum powers of Fig. 5 are coarse.At such granularity the optimum launch power appears to be the same for both formats at -1 dBm.However, by looking carefully at the trend of the curves, we realize that the optimum launch power for DP-8QAM would sit slightly below -1 dBm had we used a smaller launch power step size.On the other hand, the 8PolSK-QPSK format shows an apparent true optimum launch power at -1 dBm.We can observe, after interpolation of the DP-8QAM curve, that 8PolSK-QPSK allows a slightly greater optimum launch power, by around 0.4 dB.Secondly and most importantly, Fig. 5 shows that the 8PolSK-QPSK format allows a much greater propagation distance compared to DP-8QAM, 3800 km compared to 2825km, translating to an additional 975 km, or 34%.8PolSK-QPSK format propagates 34% more than DP-8QAM, or 975 km more In the following Fig. 6, we show the experimental BER as a function of the OSNR in 0.1 nm for both formats in back-to-back, along with their theoretical values.The theoretical curves in Fig. 6 show a 0.6 dB difference between 8PolSK-QPSK and DP-8QAM at a BER of 1.4×10 -2 , benefitting 8PolSK-QPSK.Experimentally, we obtain a difference of 0.5 dB.The offset from theory for both formats, mainly due to the transmitter implementation penalty, is rather large at 4.37 dB.Our claim that both formats have the same implementation penalty is well proven here.The inset (a) in Fig. 6 shows the theoretical OSNR (in 0.1 nm) difference between DP-8QAM and 8PolSK-QPSK as a function of the BER, for BERs varying between 2.5×10 -3 and 2.5×10 -2 .The independent variable, BER, is put in the y-axis to be consistent with the main outermost axes.This inset demonstrates that as the BER increases, the 8PolSK-QPSK format requires an increasingly smaller OSNR with respect to the DP-8QAM.This property is beneficial for the proposed format as improving FEC schemes increase BER thresholds; a recent FEC scheme can bring a BER of 2.4×10 -2 down to 10 -15 [27].
The theoretical BER against SNR function for DP-8QAM format can be approximated by BER(SNR)=2/3 erfc(√(3 SNR/14)), in the BER range of 10 -2 to 10 -4 , with a maximal deviation of ±0.05 dB [24].Unfortunately, to the author's knowledge, there is no such theoretical curve in the literature for 8PolSK-QPSK.By using a model of ( ) as was used for the 4 modulation formats of PM-BPSK, -QPSK, -8QAM and -16QAM in [24], we found α and β to match the theoretical curve of Fig. 6 in a least mean square fashion for the 8PolSK-QPSK format.Within a maximum deviation of ±0.07 dB, we obtain α = 0.2817 and β = 0.2225 over the larger BER range of 10 -2 to 10 -5 .We can observe in Fig. 6 that the theoretical curves cross around an OSNR in 0.1 nm of 17.2 dB, where 8PolSK-QPSK performs better for worst OSNRs and DP-8QAM starts outperforming for larger OSNRs.
Identical formats compared at different baudrates would simply yield a horizontal shift on the theoretical BER vs OSNR curve.In Fig. 6, the two formats exhibit not only a different baudrate, but also a different number of recovered bits per symbol, and a different symbol-tobits mapping, where all closest symbols do not always differ by a single bit.Hence, the two formats exhibit a different behavior in their BER against OSNR curve.Theoretical OSNR (in 0.1 nm) difference between formats as a function of BER threshold The next Fig.7 depicts the required OSNR (ROSNR) in 0.1 nm to obtain a BER of 1.4×10 -2 after propagation of 1920 km as a function of the launch power, for both formats.We can see that in the fully linear propagation regime, at -5 dBm, the 8PolSK-QPSK format requires 0.5 dB less OSNR compared to DP-8QAM format, as predicted by results in Fig. 6.As the launch power increases, the strength of nonlinear impairments accordingly increase and Fig. 7 exhibits the better nonlinear tolerance of 8PolSK-QPSK.The smaller nonlinear impairments received by using the proposed format is expressed by the clear decoupling of the 2 curves with growing launch power.At the optimal launch power of -1dBm where the simultaneous impact of linear and nonlinear effects are at minimum, the OSNR requirement is 0.95 dB less for the proposed format.At +1dBm of launch power, where nonlinearity clearly supersedes linear impairments, 8PolSK-QPSK outperform DP-QAM by 2.1 dB of ROSNR.
As shown in Fig. 3 using simulations, the better performance of 8PolSK-QPSK after 1920 km for increasing launch powers comes almost solely from the first 40 km or so, where the proposed format receives significantly less NL impairments.As the launch power increases, the NL noise applied to both formats also naturally increases, but even more NL noise is applied to the DP-8QAM format within the first 40 km; a supplemental impairment still observable after longer distances like 1920 km.Fig. 7 demonstrate very well both the linear and nonlinear advantages of the proposed format.
Finally, as a last experimental result, we present in Fig. 8 BER against Distance curves for the two formats at their optimum launch power of -1 dBm.We can see that at any fixed distance, the BER is always lower when the 8PolSK-QPSK format is employed compared to DP-8QAM.Another curve was added to the 2 BER against Distance curves of Fig. 8, showing the ratio of the BERs of the 2 formats as a function of distance, expressed as BER DP- 8QAM (z)÷BER 8PolSK-QPSK (z)×10 -6 and expressed in log-scale.An offset of 10 -6 was deliberately applied to the ratio to have it fit at the bottom of the y-axis while using the same y-scale.This curve shows that the ratio of the BERs of the 2 formats is essentially constant with distance.The fact that the BER equally increases with distance for the 2 formats concurs with the simulations observations of Fig. 3.It was shown in Fig. 3 that the variance of the nonlinear strength quickly saturates to the same value after the second span for both formats.Consequently the nonlinear noise applied to both formats can be assumed equal after a few hundred kilometers.Both NL and linear noise equally increase with distance, explaining the experimental even growth of the BER for both formats.
Another informative curve was added to Fig. 8 in inset (b), showing the excessive reach of 8PolSK-QPSK over DP-8QAM at a specific BER, calculated for different BERs.As an example, at a BER of 1.4×10 -2 , DP-8QAM's maximum reach is marked by "A" and 8PolSK-QPSK's maximum reach is marked by "B".B-A is the excessive distance when 8PolSK-QPSK is employed compared to when DP-8QAM is used for a target BER threshold of 1.4×10 -2 .This reach difference is shown on the left-hand side of Fig. 8, with the same x-axis but now representing excessive distance instead of absolute distance.Starting at a BER threshold BER th of 3×10 -3 , the added reach R + when using 8PolSK-QPSK over DP-8QAM can be well approximated by a linear relation with BER th : R + ≈ 375+43475×BER th .The inset  (b) replicates the reach difference curve with BER in linear scale and put on the abscissa.This would further benefit the proposed format as a large effort is recently put in increasing FEC thresholds for optical communications [27].With such supplemental reach monotonically increasing with BER threshold, we can predict a excessive reach by more than 1400 km at the BER threshold of 2.4×10 -2 ; a threshold currently applicable for soft-decision (SD) FEC [27,28].The excessive FEC overhead of 29% used in our experiments would be sufficiently large for such SD-FEC with extra space for protocol overhead.

Conclusion
We have presented a novel modulation format obtained by constraining the well known DP-8QAM format to exhibit constant symbol power.The proposed format, providing 5 bits of information per symbol and entitled 8PolSK-QPSK, presents a passive way to mitigate nonlinear impairments by significantly reducing the optical power variation from symbol to symbol in the first 40 km of propagation.Using both theoretical explanations, simulations and experimental validations, we confirmed the better nonlinear tolerance of the proposed format.We showed a back-to-back decrease of the required OSNR for a BER of 1.4×10 -2 by 0.5 dB for 8PolSK-QPSK, showing an improved ASE noise tolerance for 8PolSK-QPSK at such BER.After a distance of 1920 km with a launch power of -1 dBm/ch with 6 other copropagating channels, the proposed format exhibits a ROSNR reduction of 0.95 dB.We also demonstrated that at the optimum launch power of -1dBm/ch, the proposed format extends the propagation reach by 975 km, or by 34%, for a BER threshold of 1.4×10 -2 , from 2825 km to 3800 km.We also presented a relation between supplemental reach provided by the format against increasing BER threshold.Finally, to bolster the analysis of the format, we presented a detailed study, both analytically and via simulations, of a comparison of the strengths of the three main nonlinear impairments occurring during propagation, namely SPM, XPM and XPolM, where we showed significantly reduced SPM and XPM strengths by using the 8PolSK-QPSK format.

Fig. 1 .
Fig.1.Star-8QAM format.In DP-8QAM, the 2 polarizations can both independently be at amplitude a or b.For 8PolSK-QPSK format, that independence is removed.

Fig. 2 .
Fig. 2. Stokes space representation of the a) 8PolSK-QPSK and b) DP-8QAM format, both with added noise giving 16 dB of SNR.The axis of both figures are equal.The higher power variations from symbol to symbol is clearly observed for DP-8QAM

Fig. 3 .
Fig. 3. Temporal variance of SPM, XPM and XPolM, normalized by mean power squared, for 8PolSK-QPSK (solid lines: ---) and DP-8QAM (circled solid lines: -○-) a) Main figure showing all NL strengths for first two spans b) Zoom-in from 0 to 40 km showing SPM and XPolM strengths.c) SPM only, from 3 rd to 9 th span.d) XPM only between 400 and 650 km

Fig. 6 .
Fig. 6.BER against OSNR in 0.1 nm for both 8PolSK-QPSK and DP-8QAM formats.Inset a):Theoretical OSNR (in 0.1 nm) difference between formats as a function of BER threshold

Fig. 8 .
Fig. 8. BER against Distance [km] for both 8PolSK-QPSK and DP-8QAM.Aslo shown in inset (b) is the reach difference of the two formats as a function of BER in linear scale.
If we call the optical field at z=0 |u(z=0,t)⟩=|u o ⟩, the field |u(z,t)⟩ evolves following |u(z,t)⟩=|u o ⟩exp(iϕ NL (z,t)), where ϕ NL (z,t)=γ(8/9)∫ 0 z ⟨u o |u o ⟩e -αz' dz' [2].The SPM-induced spectral broadening, or the NL impairment, is a consequence of the time dependence of the nonlinear phase ϕ NL (z,t) [2].If the temporal variation of the total power ⟨u o |u o ⟩ is reduced, the strength of the NL impairments during propagation is also reduced.When modulating in polarization, the total power does not change as PolSK relies on applying a unitary rotation R(t) to the input field |u CW ⟩ such that |u o ⟩=R(t)|u CW ⟩, where R † R=I.Additionally, modulating the phase of a continuous wave by applying e -iϕ(t) |u CW ⟩ also maintains the power.The combination of both formats, |u o ⟩=e -iϕ(t) R(t)|u CW ⟩, give a total power of P o (t), and the NL phase noise ϕ NL grows as by markers ▷ added at those theoretical values for XpolM.The variance difference of |∑u