Joint spectral efficiency optimization of uplink and downlink for massive MIMO-enabled wireless energy harvesting systems

This paper investigated the spectral efficiency (SE) in massive multiple-input multiple-output systems, where all terminals have no fixed power supply and thus need to replenish energy via the received signals from the base station. The hybrid wireless energy harvesting (EH) protocol is applied for each terminal, which can switch to either existing time-switching (TS) protocol or power-splitting (PS) protocol. Based on the hybrid wireless EH protocol, a general system model is developed, which can switch to either only uplink data transmission or only downlink data transmission. As a result, a general analytical framework is formulated. Then, closed-form lower bound expressions on SE for each terminal are obtained on the uplink and downlink, respectively. According to these expressions, the joint SE of uplink and downlink maximization problem is designed with some practical constraints. As the designed optimization problem is non-linear and non-convex, it is hard to solve directly. To provide a solution, an iteration algorithm is proposed by utilizing one-dimensional search technique and successive approximation method based on geometric program. Additionally, the convergence and complexity of the proposed algorithm are discussed as well. Finally, the feasibility of the proposed algorithm is analyzed by simulations. Numerical results manifest that the proposed algorithm can provide good SE by optimizing relevant system parameters, and the system model can help to discuss the TS, PS or hybrid protocol for only uplink data transmission, only downlink data transmission or joint data transmission of uplink and downlink in the considered system.

a promising approach to prolong the lifetime of energy-constrained terminals [1][2][3][4]. Although some natural energy sources such as sunshine and wind can be considered, they are usually not as effective as expected due to the inconsistent and unforeseeable nature of ambient sources [5]. Compared with natural energy sources, radio frequency (RF) EH is nominated as the best EH scheme due to the stability and the ability of transforming the RF signals into voltage to charge the terminal battery. To perform RF EH, two main protocols adopted at terminals are time-switching (TS) protocol and powersplitting (PS) protocol.
One transmission scheme based on TS protocol is wireless powered communication network (WPCN) in which each terminal harvests power on the downlink in the first slot and then transmits its information on the uplink in the second slot. In this scheme, TS ratio is an important parameter to evaluate system performance. For example, in broadcast channels scenario, the authors in [6] focused on the trade-off of wireless energy and information transfer by adjusting TS ratio, and the authors in [7] maximized the minimum throughput among all terminals by optimizing the downlink/uplink TS ratio, the downlink energy beamforming, and the uplink power allocation (PA) as well as receive beamforming. In relay channels, the authors in [8] maximized instantaneous throughput by optimizing beamformers and TS ratio at the relay, and the authors in [9] addressed the problems of maximizing throughput for fixed supplementary battery energy and minimizing supplementary battery energy consumption for target throughput performance by optimizing the TS ratio.
Another transmission scheme, simultaneous wireless information and power transfer (SWIPT), is based on PS protocol. In this scheme, a transmitter uses the same waveform to transfer wireless energy and information, and each terminal divides the received signal power by PS ratio between EH and information decoding (ID). Here, PS ratio is of great importance because it can directly affect the system performance. For example, the authors in [10] jointly designed transmit beamforming vector, PS ratio and transmit power to minimize the weighted sum transmit power in full-duplex (FD) networks. The authors in [11] designed the optimal PS and TS ratios to maximize the weighted sum rate over all users under some constraints in orthogonal frequency division multiplexing (OFDM) systems. To study the performance of relay networks based on wireless EH, PS and TS ratios are also intensively investigated to obtain the optimal throughput in an amplify-and-forward (AF) relaying network [12] and a decode-and-forward (DF) relaying network [13].
In practice, RF signals decay quickly over a long distance. One feasible solution to that is utilizing energy beamforming that can focus RF signals into a narrow beam to enhance transmission efficiency [14]. Based on this fact, massive multiple-input multiple-output (MIMO) is considered as a strong candidate for energy beamforming as the large-scale antenna array equipped at base station (BS) can provide a sharp beam to enhance the received signal strength at each terminal [15,16]. Therefore, the combination of RF EH and massive MIMO is a more practical scheme and attracts intensive research interests.
Among them, the authors in [17] studied a massive MIMO WPCN where PA weights, energy-splitting fraction and TS ratio were optimized respectively for maximizing the minimum rate among terminals on the uplink. The authors in [18] investigated a downlink multi-user massive MIMO system based on SWIPT where an iterative algorithm was proposed to optimize PA coefficients at BS and PS ratios at all terminals for maximizing the minimum achievable rate among all terminals on the downlink. To study the influence of line of sight path, the authors in [19] investigated the downlink transmission of massive MIMO-enabled SWIPT systems in Rician fading channels where PA, channel estimation duration and PS ratios were optimized to maximize the downlink sum rate and the minimum rate among all terminals, respectively. In [20], the authors investigated the beam-domain SWIPT in a massive MIMO system in which the transmit power at BS and the TS ratios at all terminals were optimized under the constraints of the current available energy and minimum transmission rate of terminals for achieving maximum sum rate. In integrated data and energy communication networks, the authors optimized the resource allocation and PS ratios for uplink throughput maximization in [21]. The optimal energy-rate trade-offs based on SWIPT were investigated in a relay-assisted downlink massive MIMO system [22].
Nevertheless, the aforementioned studies [17][18][19][20][21][22] have been limited to investigating the system performance on the uplink or on the downlink via TS protocol or PS protocol. Moreover, the authors in [23] proposed a novel hybrid wireless EH protocol which is a combination of TS and PS protocols in wireless relay networks. Based on these observations and inspired by the analyses of [17][18][19][20][21][22][23], in this paper, we extend the existing system model from only uplink or downlink transmission to joint uplink and downlink transmission, and each terminal is equipped with the hybrid wireless EH protocol, which is a combination of TS protocol and PS protocol. Our aim is to investigate joint spectral efficiency (SE) of uplink and downlink with some practical constraints. To the best of our knowledge, such the study has not been found in the existing studies yet. The main contributions of this work are summarized as follows: • We propose a joint uplink and downlink transmission scheme in massive MIMO systems and each terminal is equipped with the hybrid wireless EH protocol. On the downlink, the BS delivers RF signals to all terminals and each terminal uses the hybrid wireless EH protocol to coordinates the processes of EH and ID. On the uplink, a fraction of the harvested energy is used for uplink pilot transmission and the remaining fraction is used for uplink data transmission. Specially, this scheme provides unified system model as it can switch to only uplink data transmission or only downlink data transmission. The hybrid wireless EH protocol can run in three modes, i.e., TS, PS or hybrid protocol. • An optimization problem is designed to maximize joint SE of uplink and downlink for massive MIMO systems under some practical constraints. As the designed optimization problem is non-convex and non-linear, which poses huge challenges to solve directly, an algorithm is proposed to solve such a complex problem by utilizing one-dimensional search method and successive approximation method based on geometric program (GP). In addition, the proposed algorithm is illustrated in details with rigorous mathematic analyses and its computational complexity and convergence are also discussed. • Numerical results reveal that the performance of TS protocol is far below that of PS protocol and hybrid wireless EH protocol, and that the performance of the hybrid wireless EH protocol closely approaches that of PS protocol for the joint SE of uplink and downlink maximization problem in the considered system. Moreover, time resources have a more significant impact on system performance than energy resources. In addition, wireless EH protocol, channel state information (CSI), PA coefficients and the number of antennas at BS, and energy allocation ratio at each terminal are all effective means to improve the system performance.
The rest of this paper is organized as follows: Section 2 briefly describes the system and signal model, and then formulates the joint SE of uplink and downlink optimization problem. In Sect. 3, an algorithm is proposed for solving this optimization problem. Additionally, the complexity of the propose algorithm is also discussed in this section. Furthermore, numerical results are conducted to demonstrate our proposed algorithm in Sect. 4. Finally, we conclude the whole paper in Sect. 5.
Notations: Scalars are denoted by lowercase or uppercase letters. Vectors and matrices are denoted by bold lowercase and bold uppercase letters, respectively. I K and I M are the identity matrices of size K × K and M × M , respectively. The operator E{·} stands for the expectation of a random variable. The notation � · � represents the Euclidean norm. The Hermitian and regular transpose are denoted by (·) H and (·) T , respectively. Finally, CN (., .) is the circular symmetric complex-Gaussian distribution.

System and signal model
We consider a massive MIMO-enabled wireless EH system as shown in Fig. 1, where the BS employing a compact array of M antennas communicates simultaneously with K active single-antenna terminals. It is assumed that the BS is connected to a continuous stable power supply while each terminal can be empowered by the energy harvested from the received RF signals via the hybrid wireless EH protocol in [23] as shown in Fig. 2. We also assume that the channels between the BS and all terminals are constant and frequency-flat in each frame, and the system operates in a time-division-duplexing (TDD) mode. In detail, each frame lasts T seconds and consists of four phases based on the idea of the hybrid wireless EH protocol. Their operations are explained below.
• In the phase of uplink pilots, all terminals simultaneously transmit mutually orthogonal uplink pilots to the BS and then the BS estimates uplink channels. By exploiting uplink channels reciprocity, the downlink CSI is obtained easily. This phase lasts τ T (0 ≤ τ ≤ 1) seconds. • In the phase of length αT (0 ≤ α ≤ 1 − τ ) seconds, based on the idea of TS protocol, the BS simply transfers power to all terminals without any information exchange and each terminal charges its own battery. This phase is named downlink wireless power transfer (WPT). • In the phase of length (1 − τ − α)T /2 seconds, based on the idea of PS protocol, the BS transmits information and enery using the same RF signals to all terminals and each terminal divides the received signal power into two parts. A ρ(0 ≤ ρ ≤ 1) part of the power is used for ID and the remaining 1 − ρ part is used for EH. This phase is named downlink SWIPT. • In the phase of uplink data, the total harvested energy in the phases of downlink WPT and SWIPT is split such that a (0 ≤ ≤ 1) fraction of total harvested energy is used to transmit uplink pilots and the remaining 1 − fraction is used to send uplink data. This phase lasts (1 − τ − α)T /2 seconds.
When = 1 or ρ = 0 , the joint transmission system model of uplink and downlink can switch to only downlink transmission system model as shown in [18,19] or only uplink transmission system model as shown in [17], respectively. When α = 0 or ρ = 1 , the hybrid wireless EH protocol is converted to the PS or TS protocol, respectively. Therefore, this paper provides unified system model and analytical model for massive MIMO-enabled wireless EH system.
It is assumed that the channels between all terminals and BS antennas follow independent and identically distributed (i.i.d.) Rayleigh fading. Let g k = √ β k h k ∈ C M×1 , for k = 1 · · · K , denote the channel between the kth terminal and BS antennas, where β k represents the large-scale fading coefficient and h k ∈ C M×1 contains the i.i.d. CN (0, 1) small-scale fading coefficients. In this way, the channel matrix between all terminals and BS antennas accounting for both large-scale fading and small-scale fading can be modeled as

Uplink channel estimation
In practice, the BS needs CSI in order to take advantage of a large-scale antennas at BS in every frame. The typical of doing this is to utilize uplink pilots. In this phase, each terminal transmits an assigned uplink pilot sequence of length p l symbols, where p l ≥ K is required to avoid pilot contamination. Clearly, τ T = p l T s should be satisfied, where T s is the sampling period. Denote that the kth terminal pilot sequence is φ k , for k = 1, 2, · · · K , which is the kth column of ∈ C p l ×K , satisfying H = I K . The pilot signals propagate through the uplink channel. The received pilot signals at BS can be written as where P p = diag(p p,1 , p p,2 , · · · , p p,K ) is the pilot transmit power diagonal matrix, and N b is the additive white Gaussian noise (AWGN) matrix with i.i.d. elements following CN (0, σ 2 b ) , which is introduced by BS antennas. According to (2), BS can apply minimum mean square error (MMSE) to obtain a channel estimate of g k as follows: Thus, the MMSE estimate ĝ k of the channel g k is and the estimation error is defined as Consequently, according to [24,25], the channel estimate and the estimation error are independent and distributed as and where (1) By exploiting channels reciprocity, the downlink CSI can be obtained easily. Note that the uplink pilot transmit power of each terminal will be provided according to the energy allocated for uplink pilot transmission at that terminal. With the estimated channel, the BS can perform decoding on the uplink and precoding on the downlink.

Downlink WPT phase
In the downlink WPT phase, each terminal only harvests energy via TS protocol and does not demodulate the received signals. Therefore, all terminals can share the same constant symbol x e with |x e | = 1 , which is known to all terminals. According to [17], the maximum ratio (MR) precoding is the optimal for energy transfer in the context of massive MIMO. Then the MR precoding vector at the BS for energy transfer can be written as where the scaling is used to satisfy the normalization constraint E{�v k � 2 } = 1 . After the precoded signals are transmitted, the signals received by the kth terminal antenna can be given as where p dl b is the transmit power at the BS, θ k (0 ≤ θ k ≤ 1) is the PA coefficient assigned to the kth terminal, satisfying K k=1 θ k = 1 , and n a,k ∼ CN (0, σ 2 a,k ) is the AWGN introduced by the terminal antenna at the kth terminal.

Downlink SWIPT phase
In this phase, the BS broadcasts the information signals to all terminals simultaneously. Here, MR precoding is adopted again as it can approach the optimal beamforming solution in the context of massive MIMO [15]. According to PS protocol, a fraction of the received signal power is used for ID while the remaining fraction is used for EH. Denote ρ k (0 ≤ ρ k ≤ 1) as the PS ratio of the kth terminal, the signal split for ID and EH can be respectively expressed as is the message-bearing downlink data symbol and n c,k ∼ CN (0, σ 2 c,k ) is the additional AWGN introduced by RF to the baseband conversion at the kth terminal [26].

Uplink data transmission
In this phase, all terminals simultaneously transmit uplink data signals to the BS antennas. The signal vector received by the BS antennas is given by where p ul k is the transmit power of the kth terminal, x ul k ∼ CN (0, 1) , is the messagebearing uplink data symbol transmitted by the kth terminal, and n b is the AWGN vector at the BS antennas whose elements follow CN (0, σ 2 b ) . For the kth terminal, the BS processes the received signal through multiplication of the vector y b ∈ C M×1 by a decoding vector a k ∈ C M×1 that is a function of the channel estimate. The result of processing (13) is expressed as It is worth noting that the uplink data transmit power of each terminal will be obtained according to the energy allocated for uplink data transmission at that terminal.

Optimization problem formulation
In this section, the closed-form lower bound expressions on ergodic capacity for all terminals are first derived and then used to design a net sum SE maximization problem under some practical constraints.

Downlink ergodic capacity analysis
Since the CSI is not available at each terminal, similar to [27], (11) can be rewritten as +n a,k ] + n c,k .
Here, A1, A2, A3 represents the desired signal, the beamforming gain uncertainty and terminal interference, respectively. According to [28], the A2, A3 and the other terms are treated as uncorrelated noise. By assuming uncorrelated noise as independent Gaussian noise, a lower bound on ergodic capacity for the kth terminal can be expressed as where the desired signal power E{|DS k | 2 } is calculated as and the uncorrelated noise power E{|UN k | 2 } is calculated as The results in (17) and (18) can be obtained according to the proof for Theorem 1 in [29]. By substituting of (17) and (18) into (16), a lower bound on downlink ergodic capacity for the kth terminal can be recalculated as where γ k will be calculated in the sequel.

Uplink ergodic capacity analysis
Before deriving uplink ergodic capacity for each terminal, we first calculate the total harvested energy in the WPT and SWIPT phases. Similar to (15), (10) and (12) can be respectively rewritten as and With the help of (17) and (18), the harvested energy by the k terminal in WPT and SWIPT phases can be respectively calculated as and where η k (0 ≤ η k ≤ 1) is the energy conversion efficiency of the EH circuits at the kth terminal.
Obviously, the first terms in the right-hand sides of (22) and (23) are proportional to the number of BS antennas while the second terms are independent with the number of BS antennas. In the context of massive MIMO, the first terms are dominate over the second terms. Similar to [17][18][19], the lower bound on total harvested energy is used to transmit uplink pilots and data and it is given as On average, each terminal can work normally all the time as sometimes it may consume its inherent power supply to compensate for power shortage [18]. We assume that a fraction k (0 ≤ k ≤ 1) of E k is used to transmit uplink pilots and the remaining (1 − )E k energy is used to send uplink data at the kth terminal. Thus, the transmit power of the kth terminal for uplink pilot transmission is and the transmit power of the kth terminal for uplink data transmission is By substituting (25) into (8), the channel estimate can be recalculated as As the maximum-ratio combining (MRC) detection has lower computational complexity, which is compared with the zero-forcing (ZF) detection [17], the MRC detection is used to decode the received signals on the uplink. For the kth terminal, the MRC decoding vector is a k =ĝ k and then substituting it to (14), we can obtain On the uplink, the CSI is known by BS and consequently the ergodic capacity expression involves inconvenient expectation outside the logarithm. To obtain a closed-form expression, an alternative lower bound expression is derived by utilizing the technique of "use and then forget CSI" in [30]. Thus, (28) is rewritten as By following the similar derivation of (19), a closed-form lower bound expression on uplink ergodic capacity for the kth terminal can be given as where p ul k is from (26) and γ k is from (27).

Net sum spectral efficiency maximization problem
As the factor of samples per frame that are used for transmission of uplink data and downlink data, respectively, is (1 − τ − α)/2 shown as Fig. 2, the joint SE of uplink and downlink, namely, net sum SE, is Denote θ , ρ , and as K × 1 vectors that gather the elements of θ k , ρ k and k , for k = 1 · · · K , respectively. The net sum SE maximization problem can be formulated as (29) In the above, (32b) and (32c) ensure the minimum data rate for each terminal on the uplink and on the downlink, respectively. (32d) specifies the constraint for each terminal and the sum constraint for all terminals on PA coefficient. (32e) and (32f ) represent the constraints on PS ratio and energy allocation ratio for each terminal, respectively. (32g) and (32h) denote the constraints on the channel estimation duration and WPT duration for all terminals, respectively, where the lower bound KT s of τ is the minimum value to avoid pilot contamination.
Note that (32b) and (32c) can be satisfied when the transmit power of BS and the number of BS antennas are large enough. Thus, we assume that P1 is feasible.

Proposed optimization algorithm
It can be observed from P1 that the objective function and constraints are non-convex and non-linear. Moreover, the optimization variables are mutually coupled as follows: • The WPT duration α and PS ratio ρ affect the total harvested energy in WPT and SWIPT phases. • The channel estimation duration τ and the fraction of the total harvested energy for uplink pilot transmission affect CSI accuracy. • The CSI accuracy affects the amount of total harvested energy and ergodic capacity for each terminal on the uplink and downlink. • The CSI accuracy, the amount of total harvested and ergodic capacity for each terminal on the uplink and downlink are also closely related to the PA coefficients θ.
Consequently, solving problem P1 directly is a huge challenge. To provide a solution, we solve it in three steps. First, as all terminal share the same channel estimation duration τ and WPT duration α , τ and α can be calculated numerically with one-dimensional search method in their respective feasible intervals. Then, for any given τ and α , the optimal θ , ρ , and can be obtained by successive approximation method based on GP. Finally, when the search on τ and α ends, the optimal τ and α can be selected by comparison. In the following, we will discuss them in detail.
Since log 2 (·) is a monotonic increasing function, P1 can be equivalently rewritten as P2 when τ and α are given, Here, χ ul k and χ dl k are the power ratio of "desired signal" to "uncorrelated noise" on the uplink and downlink, respectively, and γ , χ ul , χ dl and p ul are denoted as K × 1 vectors that gather the elements of γ k , χ ul k , χ dl k and p ul k , for k = 1 · · · K , respectively, which are auxiliary variables. By inspecting P2 , we observe the following cases: • We have replaced "=" with " ≤ " in (33b), (33c), (33d) and (33e). However, this does not affect the original problem P2 because the objective function is monotonic decreasing with respect to χ ul k , χ dl k , γ k and p ul k respectively when other variables hold constant. If we transform (33a), (33d), and (33e) into monomial or posynomial functions respectively, the objective function and all constraints in P2 are monomial or posynomial functions. As a result, P2 becomes a GP problem, which can be solved efficiently with standard convex optimization tools.
For (33a), we transform it into a monomial function by an approximation method in [31, lemma 1]. The key idea is to utilize a monomial function ϕ k (χ k ) φ k to approximate (1 + χ k ) near an arbitrary point

The approximated result is
In this way, the objective function is a monomial function.
For (33d), we deal with it as follows: Here, we introduce new variable t k and replace "=" with " ≤ ". As γ k is an increasing function of t k and the objective function is decreasing function of γ k , the objective function is decreasing with respect to t k . Consequently, the above operations do not change the original problem P2 . Obviously, (35) and (36) are posynomial functions.
Following the similar treatment of (33d), we can obtain equivalent expressions for (33e) as follows: Here, ¯ k is the new introduced variable. Clearly, (37) and (38) are posynomial functions.
With the treatment above, P2 is transformed into a GP problem. For subsequent convenience, we denote t , and ¯ as K × 1 vectors that gather the elements of t k , k and ¯ k for k = 1, · · · , K , respectively. According to [31][32][33], a successive approximation algorithm based on GP to solve P2 is proposed in Algorithm 1.
The parameter µ is used to control approximation accuracy and is set to be 1.1 in most practical cases [31] and the convergence of Algorithm 1 is also guaranteed [34].
After solving out the problem P2 , the optimal τ and α can be obtained by one-dimensional search method, which is described in Algorithm 2.

Computational complexity
Finally, we discuss the complexity of Algorithm 2, which is mainly dependent on the complexities of two outer loops and inner Algorithm 1. If assume that � τ and � α are the search step sizes for τ and α , respectively, the complexity of two outer loops approximately is O((1 − KT s )(1 − KT s + � τ )/(2� α � τ )) . On the other hand, the complexity of Algorithm 1 can be approximately expressed as O(N ap KN gp /ε 2 ) , where N gp is the number of required iteration for solving GP and N ap is the number of required iteration for successive approximation. Based on the above analyses, the complexity of Algorithm 2 is approximately given as O((1 − KT s )(1 − KT s + � τ )N ap KN gp /(2� α � τ ε 2 )) . As the computational efficiency of GP modeling is very high even for large-scale problems [31], Algorithm 2 can converge quickly once the step size of one-dimensional search and

Results and discussion
In this section, numerical results based on matlab software are conducted to validate the proposed algorithm. We set p dl b = 1.5Watt M = 300 and K = 3 . Each frame length T is normalized to be 1 and the sampling period T s is assumed to be 0.005. The noise power is set to be σ 2 b = −90dBm, σ 2 a,k = −70dBm and σ 2 c,k = −50dBm, respectively, ∀k , [35,36]. The energy conversion efficiency is set to be η k = 80% , ∀k . The minimum data requirement is set to be C ul min = 1bit/s/Hz and C dl min = 2bit/s/Hz, respectively. The largescale fading is modeled as β k = 10 −3 d −3 k , where the distance away from BS d 1 = 10 m, d 2 = 15 m and d 3 = 20 m. The tolerance ε for Algorithm 2 is chosen as 10 −4 . The above parameters are used throughout the simulations unless otherwise stated. Figure 3 shows the running process of Algorithm 1 when channel estimation duration τ is set to be 0.015 and WPT duration α is set to be 0.005. It can be seen that Algorithm 1 achieves the maximum net sum SE C ns = 18.8 bit/s/Hz with 31 iterations, which means that Algorithm 1 is able to converge quickly to match the channel condition. Meanwhile, we can obtain the optimal PA coefficient θ = [0.28, 0.29, 0.43] T , PS ratio ρ = [0.76, 0.49, 0.24] T and energy allocation ratio = [0.91, 0.53, 0.27] T , respectively. Thus, we can conclude that Algorithm 1 solves successfully the optimization problem P2 . In addition, the optimal τ and α can be obtained by one-dimensional search method in Algorithm 2. To sum up, the original optimization problem P1 can be effectively solved by Algorithm 2, which also indicates that the proposed algorithms are feasible and effective.
Although spending more time on channel estimation results in more accurate CSI, which can improve the net sum SE, this also leads to reduced duration for data transmission on the uplink and downlink, which degrades the net sum SE. Thus, there exists an optimal value for channel estimation duration. Figure 4 depicts the net sum SE versus the channel estimation duration, to show its impact on the system performance under different transmission models. Here, only uplink transmission model or only downlink transmission model can be obtained by setting ρ = [0, 0, 0] T or = [1, 1, 1] T , respectively. The step size for one-dimensional search is set to be 0.005. It is observed from Fig. 4 that the net sum SE decreases with channel estimation duration τ for three transmission models. Although we can find a very slight rise of the net sum SE when the step size of one-dimensional search is set to be small enough, it is tiny and can be negligible. For example, the improvement of the net sum SE is only the level of 10 −11 when the step size for one-dimensional search is set to be 2 × 10 −14 , as shown in Fig. 5. This means that the improvement of the net sum SE from longer channel estimation duration can not compensate for its decline from reduced duration for data transmission. This is due to the fact that the channel of each terminal fluctuates only slightly around its expected value because of the channel harding phenomenon in massive MIMO systems, and hence the longer channel estimation duration has little effect on improving CSI. Therefore, the optimal channel estimation duration τ tends to its minimum value. In addition, as there exist the minimum data rate requirements of each terminal on the uplink and downlink, the value of τ , which makes Algorithm 2 work normally, can not cover the entire feasible interval. Figure 6 captures the impact of PS, TS and hybrid EH protocols on the net sum SE under various values of the transmit power of BS. Here, the PA coefficient is set to be θ = [1, 1, 1] T /3 and the energy allocation ratio is set to be = [0.5, 0.5, 0.5] T . The channel estimation duration τ is set to be 0.015 and the search step size for α is set to be 0.005. It is observed from Fig. 6 that the net sum SE based on TS protocol is always lower than that based on PS and hybrid protocols under the same transmit power of BS. Moreover, the performance of PS protocol is nearly the same as that of hybrid protocol. When the search step size of α is set to be small enough, it can be found that the hybrid protocol outperforms weakly PS protocol, but the difference between them is tiny and thus can be negligible. This indicates that although spending more time on EH results in more energy for channel estimation and uplink transmission, which can improve the net sum SE, it can not compensate for the decline of the net sum SE due to reduced duration for data transmission on the uplink and downlink. Therefore, the optimal WPT duration α tends to zero. As a result, the hybrid protocol is very close to PS protocol for the net sum SE maximization in massive MIMO systems.
We capture the impact of PS ratio ρ and energy allocation ratio on the net SE of each terminal in Figs. 7 and 8, respectively. We focus on the impact of ρ solely in Fig. 7 and it is seen that the net SE of each terminal is a quasi-concave function with respect to its PS ratio. This is due to the fact that the higher PS ratio increases the downlink SE of each terminal, but at the same time decreases the harvested energy for channel estimation and uplink transmission, which degrades the uplink SE of each terminal. Thus there  exists an optimal PS ratio for each terminal to maximize its net SE. Similarly, we focus on the impact of solely in Fig. 8 and it is observed that the net SE of each terminal is a quasi-concave function with respect to its energy allocation ratio. This is because that the higher energy allocation ratio results in more accurate CSI, which can improve the net SE of each terminal, but meanwhile decreases the transmit power of each terminal on the uplink, which degrades the uplink SE of each terminal. Thus there also exists an optimal energy allocation ratio for each terminal to maximize its net SE. The above analyses indicate the feasibility of the optimization problem P1. Figure 9 illustrates the the impact of PA coefficient θ on the net sum SE under various values of the transmit power of BS. For comparison, the equal PA method is considered as a baseline scheme in which the BS equally allocates transmit power to each terminal it serves. The PA obtained by Algorithm 1 is denoted as "optimal PA" in Fig. 9. It is observed that the optimal PA method achieves a higher net sum SE than the equal PA method, which shows the optimality of the propose algorithm. Moreover, we can also see that the transmit power of BS and the number of BS antennas are efficient way to improve the net sum SE.
As the maximal minimum rate algorithm is common in the existing literatures, in Fig. 10, the performance of the proposed algorithm is compared with that of the maximal minimum rate algorithm for uplink and downlink data transmission, respectively. On the uplink, the asymptotically maximal minimum rate is obtained according to the analytical result derived in [17]. On the downlink, the maximal minimum rate is obtained according to the algorithm proposed in [18]. For interpreting the results easily, the maximum and minimum values of the proposed algorithm for uplink and downlink data transmission are used respectively. We denote the maximal minimum rate by "MMR" and the proposed algorithm by "SRM" in Fig. 10. It is observed that MMR is located between SRM-max and SRM-min for uplink and downlink, respectively. This is due to the fact that the MMR algorithm has to compensate the performance of the terminal with minimum rate with that of the terminal with maximum rate, indicating that MMR is a special case of the proposed algorithm with the minimum rate constraint.

Conclusions
The paper has proposed a joint uplink and downlink transmission scheme in massive MIMO systems and it can conveniently switch to only uplink transmission model or only downlink transmission model. Each terminal uses the hybrid wireless EH protocol to harvest energy and it can run in three modes, i.e., TS, PS or hybrid protocol. According to the derived low bound expressions on ergodic capacity for all terminals on the uplink and downlink, we have investigated how to jointly optimize related system parameters to maximize the net sum SE of the whole system. As the formulated problem is the non-convex and non-linear, an algorithm utilizing one-dimensional search method and successive approximation method based GP has been proposed and its convergence and complexity have also been discussed. Finally, numerical results have manifested the feasibility of the proposed algorithm. Nevertheless, there are still some issues related to practical scenarios, such as BS antenna correlation, nonlinearity of energy collection model, fading channel model with line-of-sight, and they are left for future work.