1 Introduction

Drug prices, transaction costs, and regulations usually vary across countries, even for the same medicine. Under suitable conditions, if a patented drug is sold in at least two countries, differences in such factors may create an arbitrage opportunity even before patent expiration, providing an incentive to the emergence of so-called gray markets (Scott Morton and Kyle 2012). This phenomenon (which is also referred to in the literature as parallel tradeFootnote 1, see Danzon 1998) consists in the practice of buying a patented product at one price in one country, then re-selling it at a higher price in a second country, on a parallel distribution chain where it competes with the official distribution chain. Different regulations at a national level may make this price heterogeneity last in time, possibly generating several inefficiencies, as highlighted in various theoretical and empirical works, such as Danzon (1998), Danzon and Towse (2003), Danzon et al. (2015), Duso et al. (2014), Gnecco et al. (2018), Kanavos et al. (2011), Li and Robles (2006), Maskus and Ganslandt (2002) and Towse et al. (2015). For instance, when parallel trade is legally permitted by policymakers, such heterogeneity may allow parallel traders (e.g., distributors or agents trading with authorized distributors) to obtain a profit from parallel trade itself, representing a serious threat to the investments of the manufacturers in Research and Development (R&D).Footnote 2 As a consequence, the analysis and the optimization of pricing schemes for which parallel trade does not occur (hence, for which its well-documented negative effects are not expected to arise) is a relevant research topic.Footnote 3 In this context, it is important to distinguish between uniform pricing schemes (under which manufacturers set the same price in different countries), and suitable differential pricing (or price discrimination) schemes (which allow manufacturers to set even a different price in each country, taking into account its specific features).

Contributions of the work Given this framework, in this paper, we investigate three specific pricing schemes—a uniform pricing scheme, and two differential pricing schemes—for which parallel trade does not occur, either because it is prohibited, or as a consequence of the strategy adopted by the manufacturer. Then, for a two-country model of production and trade, we evaluate and compare their global welfare effects, and the associated manufacturer’s incentive to invest in R&D. In the uniform pricing scheme considered in the paper (which was studied, in a less general setting, in other works, e.g., in Müller-Langer (2009b), where the marginal cost of production was neglected), the manufacturer sets the same drug price in both countries, making parallel trade unprofitable. Then, the common price is optimized. Conversely, in the case of differential pricing, the manufacturer has the opportunity of choosing different prices in the two countries. The two differential pricing schemes investigated in the paper refer, respectively, to the case in which parallel trade is legally forbidden, and to the one in which the manufacturer sets a price difference lower than or equal to a positive parallel trade cost per-unit,Footnote 4 in case it decides to serve both countries. In this way, potential parallel traders are discouraged.

Our analysis is mainly based on constrained quadratic optimization techniques applied to pricing. The optimization problems we solve (in closed form) take into account, at the same time, the marginal cost of production, the parallel trade cost per-unit (only for one model, in which its knowledge is needed), the decision whether to invest or not in R&D, and the total fixed cost of R&D. In contrast, other models in the literature (for instance, Maskus and Chen 2004; Müller-Langer 2009b) neglect some of the above-mentioned factors (e.g., the presence of a positive constant marginal cost of production), in order to simplify the resulting optimization problems. Taking into account such factors can be useful to assess the robustness of the conclusions obtained from the analysis of these models with respect to changes in their assumptions.

The main qualitative economic insights obtained from our analysis can be summarized as follows: for the specific model of production and trade considered in the paper (which, differently from most specialized literature, takes into account the possible presence of a positive constant marginal cost of production), we

  1. 1.

    extend the pricing analysis done in previous studies, e.g., highlighting the influence of a positive constant marginal cost of production on the optimal strategies of the manufacturer (hence, also on the optimal prices, quantities, and surpluses), under the three different pricing schemes (see Propositions 2, 3, and 4, and the respective Corollaries 2, 3, and 4);

  2. 2.

    clarify the conditions that allow/do not allow one to neglect in the analysis the presence of a positive constant marginal cost of production. Such conditions are related, e.g., to the comparison between the positive constant marginal cost of production, the parallel trade cost per-unit, and the maximal price that can be effectively charged to the consumers in the lower price country (see Remark 3 in the appendix for details);

  3. 3.

    take into account the additional possibility for the manufacturer in the various pricing schemes (particularly, in the uniform pricing scheme and in the second differential pricing scheme) to decide not to serve one of the two countries. This splits effectively the corresponding optimization problem into three optimization subproblems (see Sects. 5.1, 5.2, 5.3);

  4. 4.

    prove (by an analysis of the optimal prices) that the second differential pricing scheme, for which the manufacturer sets the price difference between the two countries to be sufficiently low to prevent parallel trade occurrence, is a “hybrid” between the uniform pricing scheme and the first differential pricing scheme, for which parallel trade is forbidden (see Remark 2, and Corollaries 2, 3, and 4 in the appendix);

  5. 5.

    prove that, at optimality, the two differential pricing schemes increase the manufacturer’s incentive to invest in R&D with respect to the uniform pricing scheme, and quantify such incentive for the three cases (see Proposition 5);

  6. 6.

    prove that, at optimality, for some choices of the model parameters, compared to the uniform pricing scheme, the two differential pricing schemes serve both countries under a larger range of values for the relative market size of the two countries. This is important from a society’s wellbeing perspective, because in this way the product becomes more accessible to consumers in the lower price country. For two pricing schemes, suitable marginal cost-dependent thresholds on the relative market size are identified, above which the lower price country is not served at optimality (see Proposition 6(i));

  7. 7.

    prove that for some other choices of the model parameters, differential pricing schemes are more efficient at optimality than the uniform pricing scheme from a global welfare perspective. Suitable marginal cost-dependent ranges of values for the relative market size for which this occurs are also identified (see Proposition 6(ii)).

Possible policy implications are discussed in Sect. 6 (where the pricing schemes are compared at optimality in terms of manufacturer’s surplus, accessibility of the product in the two countries, global welfare, and loss of efficiency with respect to an idealized situation) and in the final section, where possible developments and extensions to game-theoretic models are also discussed.

Organization of the work The paper is organized as follows. Section 2 provides a literature review on the mostly negative effects of parallel trade, and on possible ways to prevent its occurrence. Section 3 presents the two-country model of production and trade used in the analysis. In Sect. 4, we express in closed form the optimal value of the global welfare for the two countries, solving a suitable quadratic optimization problem. Then, in Sect. 5 we find the corresponding value of the global welfare obtained when the manufacturer applies at optimality, respectively, the uniform pricing scheme and the two differential pricing schemes (solving other quadratic optimization problems). In Sect. 6, we compare the three pricing schemes at optimality, focusing on sufficient conditions under which the two differential pricing schemes dominate the uniform pricing one. Finally, Sect. 7 concludes the paper and discusses possible extensions. All the proofs are reported in Appendix A. Due to their length, the expressions of the optimal prices and quantities for the various models appear in Appendix B. Some technical remarks about the various models considered are reported in Appendix C. All these appendices may be skipped at a first reading of the article.

2 Literature review

In the literature, there is a strong debate about the opportunity of permitting or not parallel trade, and on its effects on the global welfare of the countries involved (Danzon 1998; Jelovac and Bordoy 2005; Li and Robles 2006). This is motivated by the fact that, on one side, manufacturers see parallel trade as a serious threat to their investments in R&D, whereas, on the other side, parallel trade can induce price convergence in the different countries, e.g., by decreasing prices in re-importing countries, where pharmaceuticals are more expensive (Towse 1998).Footnote 5 This was empirically confirmed, e.g., in Ganslandt and Maskus (2004), where it was found that the prices of drugs subject to competition from parallel trade fell relative to those of other drugs by 12–19%. Hence, the former products turned out to be more accessible to consumers (this is important from a society’s wellbeing perspective). On the other hand, in Duso et al. (2014), the reduction of the manufacturers’ incentive for product innovation was confirmed empirically (referring to the case of the German market of oral anti-diabetics) through a structural approach, by comparing a counterfactual scenario with the status quo market. In contrast, it was confirmed empirically that this incentive tends to increase as the market size gets larger (Acemoglu and Linn 2004). This issue is particularly relevant, since endogenous sunk costs, such as those associated with R&D, can improve the product quality, raising its demand (Sutton 1998). Another negative consequence of parallel trade is the associated increase in the risk of drug shortage, for all the countries involved, limiting patients’ access to drugs.

According to the analysis presented in Danzon (1998), trade usually increases global welfare, when it makes consumers in higher price countries benefit from lower prices in other countries, and such lower prices are motivated by a better technology. Nevertheless, in the case of pharmaceuticals, lower prices are often the result of a more intensive regulation, not of lower production costs. Hence, parallel trade of pharmaceuticals is expected to reduce global welfare in strictly regulated markets. According to Kanavos et al. (2011), another negative aspect of parallel trade is that most of the revenues it originates are accrued not to the consumers, but to the parallel traders themselves. It is also worth mentioning that, following Jelovac and Bordoy (2005), parallel trade may even positively affect global welfare, e.g., when it implies a reallocation of consumption from individuals with relatively less drug needs to individuals with relatively more such needs. Nevertheless, still according to Jelovac and Bordoy (2005), parallel trade decreases global welfare when it implies, instead, a reallocation of consumption from individuals with relatively more drug needs to individuals with relatively less such needs, and also when one market is not served due to parallel trade. Concluding, the effect of parallel trade on global welfare is considered in Jelovac and Bordoy (2005) to be ambiguous, in the sense that its positive/negative influence on global welfare often depends on which issues are taken into account in the model (e.g., the need for the manufacturer to recover from the cost of R&D).

Since parallel traders and manufacturers can be modeled as different agents with their own objectives, it is natural to investigate parallel trade using noncooperative game-theoretic models (Chen and Maskus 2005; Guo et al. 2013; Maskus and Chen 2004; Müller-Langer 2012). As an example, according to the two-country model presented in Maskus and Chen (2004), forbidding parallel trade is always advantageous from the manufacturer’s point of view, even though the effect of such a prohibition on the global welfare of the two countries is ambiguous (see, again, Jelovac and Bordoy 2005). In Müller-Langer (2012), two dynamic noncooperative games are discussed, in order to analyze the equilibrium behavior of a manufacturer (which is located in one of the two countries) and a distributor (which belongs to the other country), assuming that parallel trade from the distributor is, respectively, allowed/prohibited. In the first model, it is shown therein that parallel trade actually does not even show up at the equilibrium (in another words, the quantity of the re-imported product from the parallel trader/distributor is zero). Differently, in the noncooperative game-theoretic model considered in Maskus and Chen (2004), parallel trade actually occurs at the equilibrium, when parallel trade is allowed. For other noncooperative games proposed in the literature to model parallel trade, the reader is referred to the monograph (Müller-Langer 2009a), and to our recent work (Gnecco et al. 2018), where the efficiency of their equilibria is compared (in a framework where the marginal cost of production is neglected, in order to simplify the models).

However, in case one can prevent parallel trade occurrence from the beginning of the analysis, it is possible to switch to a single-agent model, to be studied using suitable single-agent optimization techniques. In order to justify this simplification, it is worth mentioning that, in several game-theoretic models considered in the literature (see, e.g., the already mentioned article Müller-Langer 2012), the equilibrium pricing schemes adopted by one player are found to make parallel trade not occur at those equilibria. This reduces effectively such models to single-agent optimization models, where the equilibrium pricing schemes above are optimized. A second justification is that single-agent optimization models are typically more easily analyzed, which allows increasing their complexity by introducing some other parameters, often without losing the possibility to get closed-form optimal solutions. Single-agent optimization models are also justified, of course, when one desires to remove from the beginning—at the possible cost of reducing the efficiency—at least some of the negative effects of parallel trade occurrence discussed above.

Focusing on the manufacturer’s side, a possible way to prevent the occurrence of parallel trade even when it is permitted by law consists in making the price of the product be equal in different countries (Müller-Langer 2009b). Then, the common price is optimized.Footnote 6 In such a case, potential parallel traders would be discouraged, due to the presence of positive parallel trade costs (associated, e.g., with transportation/repackaging).

Although the choice of the above-mentioned uniform pricing scheme by the manufacturer may have good global welfare properties (Valletti 2006), it often has a negative effect on the manufacturer’s profit (because price uniformity may be too much constraining for the manufacturer) and also on the global welfare. This depends on the fact that the quantities sold in the markets may not be the optimal ones from the perspective of a global planner, who maximizes the global welfare. In this case, more efficient alternative pricing schemes are needed, and price discrimination schemes (Danzon and Towse 2003; Danzon et al. 2015; Liu and Shuai 2013; Towse et al. 2015)—in which possibly different prices are used in different countries—can be good candidates, and may also provide a higher incentive for the manufacturer to invest in R&D (Alexandrov and Deb 2012). For instance, price discrimination is considered in Danzon and Towse (2003) as a good way to minimize global welfare losses, when such prices are set according to the so-called Ramsey pricing model (Ramsey 1927). For the pharmaceutical sector, price discrimination is also considered in Danzon et al. (2015) as global welfare-superior to uniform pricing schemes, when both static and dynamic efficiency (related, respectively, to the optimal use of existing products, and to the optimal investment in R&D) are taken into account, as price discrimination can increase the manufacturer’s incentive to invest in R&D.

A final remark has to do with the implementation issues of price discrimination schemes. According to Towse et al. (2015), price discrimination could be implemented in European Union pharmaceutical markets:

  1. 1.

    either by a Treaty change or a voluntary agreement to power centralization;

  2. 2.

    or through the creation of different blocks of high-income/low-income countries, with parallel trade permitted only in the same block;

  3. 3.

    or under discounts/voluntary contractual agreements implemented confidentially.

A partial price discrimination, still able to prevent parallel trade, can be implemented by a variation of the uniform pricing scheme, in which the manufacturer takes explicitly into account the parallel trade cost per-unit, and sets the price difference in the two countries to be low enough to discourage potential parallel traders to adopt that practice. This second differential pricing scheme has the advantage of not needing any explicit prohibition of parallel trade. Nevertheless, the parallel trade cost per-unit has to be known (or at least a positive lower bound on it has to be known). Moreover, it should be high enough to make the resulting pricing scheme sufficiently differentiated from the already presented uniform pricing scheme.

3 Modeling framework

In this short section, we summarize (and extend, through the insertion of suitable costs) the model of production and trade considered in Müller-Langer (2009b), involving two countries, characterized by different demand functions of one product. More precisely, the following is assumed in the model. The first country (named “Country A” in the following) is the one in which a product is produced by a manufacturer with a constant total fixed cost of R&D equal to \(C_F \ge 0\) (or, more generally, a total fixed cost of market entry, which does not depend on the quantity produced), and a constant marginal cost of production equal to \(k \ge 0\).Footnote 7 Since the product can also be sold in a second country (named “Country B” in the following), the Country A is the exporting country, whereas the Country B is the importing country. The demand functions of the product in the two countries are modeled by the following piecewise-linear functionsFootnote 8 (with \(p_A,p_B \ge 0\)):

$$\begin{aligned} q_A:=\max \{0,\gamma a-b p_A\},\, q_B:= \max \{0,a-b p_B\}. \end{aligned}$$
(1)

In the above, \(q_A\) (respectively, \(q_B\)) is the quantity of the product that the consumers in Country A (respectively, B) are willing to buy at the price \(p_A\) (respectively, \(p_B\)), \(a,b>0\) are two constants (the same for both countries), and \(\gamma >1\) (relative market sizeFootnote 9) is another constant, which describes the heterogeneity of Countries A and B with respect to their market size. After it is produced, the product can be (see also Fig. 1 to further illustrate the meaning of the following notations): (i) sold by the manufacturer of Country A to the consumers of Country A in quantity \(q_A\) at the retail price \(p_A\); (ii) sold by the manufacturer of Country A to the consumers of Country B in quantity \(q_B\) at the retail price \(p_B\). No parallel trade is possible under the strategies considered in the paper. Finally, differently from previous literature, the analysis of the model reported in the next sections includes also the possibility for the manufacturer to decide not to invest in R&D. Hence, the total cost of R&D is really a fixed cost, not a sunk cost.

Fig. 1
figure 1

The model of production and trade among the manufacturer and the consumers in the Countries A and B (see Sect. 3 for details about the notation)

4 Optimization of the global welfare by the global planner

In this section, for a specific and well-known form of the global welfare function, we find its optimal value for a hypothetical global planner, whose goal is to maximize the global welfare itself. To start the analysis, we need the following definition.

Definition 1

In this paper:

  1. 1.

    the manufacturer’s surplus (profit) when selling a given quantity of a good is the difference between the total amount of money received from selling the good and its total cost of production;

  2. 2.

    for each country, the consumers’ surplus is the definite integral (from 0 to the total quantity actually bought by the consumers) of the difference between the maximum price per-unit they would pay for any quantity of the good between 0 and the total quantity actually bought, and the price per-unit of the good.

For the two-country model of production and trade of pharmaceuticals described in Sect. 3, one obtains immediately from Definition 1 the following expressions for the manufacturer’s surplus \(\varPi \), the consumers’ surplus \(CS_{A}\) in the Country A, and the consumers’ surplus \(CS_{B}\) in the Country B:

$$\begin{aligned} \varPi= & {} (p_A -k) q_A + (p_B - k) q_B -C_F \,; \end{aligned}$$
(2)
$$\begin{aligned} CS_{A}= & {} \int _0^{q_A}\left( \frac{\gamma a - q}{b} - p_A\right) \,dq=\left( \frac{\gamma a}{b} -p_A\right) q_A -\frac{q_A^2}{2 b}=\frac{q_A^2}{4b}\,; \end{aligned}$$
(3)
$$\begin{aligned} CS_{B}= & {} \int _0^{q_B}\left( \frac{a - q}{b} - p_B\right) \,dq=\left( \frac{a}{b} -p_B\right) q_B -\frac{q_B^2}{2 b}=\frac{q_B^2}{4b}. \end{aligned}$$
(4)

In Eqs. (3) and (4) above, the last equality follows from Eq. (1) when \(q_A, q_B>0\), otherwise it holds trivially when \(q_A, q_B=0\).

We assume, for the global welfare function, the Bentham model, defined as the sum of all the surpluses. When the manufacturer does R&D, it has the expression \(GW=W(q_A,q_B)-C_F\), where

$$\begin{aligned} W(q_A,q_B):= & {} (\varPi +C_F) + CS_{A} + CS_{B}\nonumber \\= & {} \left( \frac{\gamma a}{b} -k\right) q_A + \left( \frac{a}{b} -k\right) q_B - \frac{q_A^2 + q_B^2}{2 b} \end{aligned}$$
(5)

is a function of the quantities only. When the manufacturer does not invest in R&D, the total fixed cost \(C_F\) is not incurred and the quantities sold are 0, then one has \(GW=0\). In the paper, we use the symbol “\(\star \)” to denote global welfare values, or values of other expressions of interest, when they are evaluated at optimality (with respect to a suitable optimization problem). Since in some cases they are quite lengthy, the precise formulations of the optimization problems considered in the paper are reported in Appendix A. Here, we focus on the values of the global welfare at optimality (and in the next sections, also on the values of the manufacturer’s surplus at optimality, since the manufacturer is the decision maker for the optimization problems considered therein).

In the following Proposition 1, we express the optimal value of the Bentham global welfare function for a global planner who maximizes such global welfare function under suitable assumptions. In general, two situations have to be considered: the first is when the manufacturer invests in R&D, the other one is when the manufacturer does not invest in R&D. In the proofs, we use the notations “RD” and “NRD” to refer to the two cases, respectively. The overall optimization problem solved by the global planner can be stated as a dynamic optimization problem (Bertsekas 2015) with two decisional stages in which:

  1. 1.

    in the first stage, the global planner decides (through a binary decision) whether it is convenient to invest or not in R&D, comparing 0 (the value of the global welfare when R&D is not performed) with the value of the global welfare obtained from the second stage;

  2. 2.

    in the second stage, assuming that R&D is performed, the global planner maximizes the global welfare (optimizing quantities), assuming that R&D is done.

Equivalently, the two stages above could be merged into a single stage, due to the simplicity of the problem to be solved in the first stage (once the problem in the second stage has been solved). However, when solving this and the other optimization problems considered in the paper, we have preferred to use the first approach, since its first stage models explicitly the individual rationality constraint of the optimizer.

Proposition 1

The optimal value of the global welfare \(GW^{\star }\) found by the global planner is

$$\begin{aligned} GW^{\star }= & {} \left\{ \begin{array}{ll} \max \left\{ 0, W\left( \gamma a - kb, a - kb\right) -C_F \right\} , &{}\quad \mathrm{if} \;0 \le k< \frac{a}{b},\\ \max \left\{ 0, W\left( \gamma a - kb, 0\right) -C_F \right\} , &{}\quad \mathrm{if}\; \frac{a}{b} \le k < \frac{\gamma a}{b},\\ 0, &{}\quad \mathrm{if}\; k \ge \frac{\gamma a}{b}.\\ \end{array} \right. \end{aligned}$$
(6)

The proof of Proposition 1, provided in Appendix A, shows that, in case the optimal value of the global welfare \(GW^{\star }\) is higher than 0, the global planner suggests to sell in both countries if \(0 \le k < \frac{a}{b}\), and to sell only in the Country A if \(\frac{a}{b} \le k < \frac{\gamma a}{b}\). The proof provides also the expressions of the optimal quantities, for an optimal objective value higher than 0 (likewise the proofs of Propositions 2, 3, and 4, for the optimization problems studied in Sect. 5). Moreover, an inspection of the proof of Proposition 1 shows that, when \(C_F=0\), the corresponding optimal solution for the global planner is compatible with non-negative surpluses for all the entities involved (the manufacturer and the consumers of both countries), which is obtained by taking \(p_A=p_B=k\). When \(C_F>0\), in case the global planner suggests the manufacturer to invest in R&D (because this is preferable for the global planner) and side paymentsFootnote 10 to the manufacturer are inserted in the trade model, the optimal global welfare can be still implemented by prices such that all the surpluses are non-negative, the surplus of the manufacturer being higher than or equal to the 0 surplus that it would obtain in case of no R&D. Finally, the optimal value of the global welfare for the global planner is always non-negative (since the option \(C_F=0\) is always available to the global planner), and when it is 0, it is not possible for the manufacturer to have a surplus higher than 0, even under a suboptimal allocation for the global planner.

5 Optimization of the surplus by the manufacturer under uniform and differential pricing schemes

In general, the prices and quantities of the production and trade model presented in Sect. 3 cannot be chosen realistically by a global planner; because, given the two demand functions, they depend on the pricing choices of the manufacturer, and side payments may not be allowed. In the most general case, one should also take into account the possible interaction between the manufacturer and potential parallel traders, which can be studied by using game-theoretic tools (Gibbons 1992), determining suitable equilibrium solutions. Such an investigation has been done for some noncooperative game models using, for instance, a Cournot or Bertrand duopoly (Chen and Maskus 2005; Guo et al. 2013; Maskus and Chen 2004; Müller-Langer 2012), to model the competition between the manufacturer and a single parallel trader, or a (deterministic or stochastic) Stackelberg leader–follower game (Ahmadi et al. 2015; Zhang 2016). When the manufacturer adopts a pricing scheme for which parallel trade does not occur, the equilibrium analysis is simplified, because one has to take into account only its behavior (for this reason, the investigation of cases for which parallel trade can occur at equilibrium is outside the scope of this paper).Footnote 11 However, the prices and quantities at equilibrium are usually different from the ones determined by the global planner, causing a loss of efficiency.

In the following subsections, we investigate three possible pricing schemes for the manufacturer, under which parallel trade does not occur: one uniform pricing scheme, and two differential pricing schemes. In each among these three schemes, the manufacturer’s behavior is determined as the optimal strategy of a corresponding dynamic optimization problem with two decisional stages, defined as follows:

  1. 1.

    in the first stage, the manufacturer decides (through a binary decision) whether to invest or not in R&D, comparing 0 (its surplus when R&D is not performed) with its optimal surplus obtained when one assumes that R&D is performed (the latter is obtained from the second stage);

  2. 2.

    in the second stage, assuming that R&D is performed, the manufacturer maximizes its own surplus (optimizing prices and quantities) in three possible subcases (characterized by different constraints on the prices):

    1. (a)

      selling the product potentially in both the Countries A and B;

    2. (b)

      selling the product potentially only in the Country A;

    3. (c)

      not selling the product at all;

    then chooses the one associated with its highest surplus (in the following, when multiple optimal solutions are obtained, we select one corresponding to the lowest value of the global welfare among such solutions, since the manufacturer maximizes its own profit, but not necessarily the global welfare).

The three pricing schemes differ in the way the optimizations in the three subproblems of the second stage are performed, due to the presence, in the various schemes, of different constraints on the prices in the two countries. In summary,

  1. 1.

    in the uniform pricing scheme, the manufacturer applies the same price in both countries. This clearly prevents parallel trade occurrence, because any potential parallel trader would get negative profit from parallel trade itself (due to a positive parallel trade cost per-unit). However, the precise value of the parallel trade cost per-unit is not taken explicitly into account (the manufacturer knows only that it is positive);

  2. 2.

    in the first differential pricing scheme, parallel trade is forbidden (e.g., by law), so the manufacturer can choose any two possibly different feasible prices in the two countries without incurring competition coming from parallel trade;

  3. 3.

    in the second differential pricing scheme, parallel trade is not forbidden (it may occur in principle), but the manufacturer can still prevent it, by taking explicitly into account the value of the parallel trade cost per-unit. Indeed, by making the price difference low enough, the manufacturer has still the possibility to prevent parallel trade (in a similar way as in the uniform pricing scheme, but with two possibly different prices in the two countries).

5.1 Uniform pricing scheme

As a first example of a pricing scheme for which parallel trade does not occur, when the reimbursement regulations and transaction costs are the same in the two countries and R&D has been already performed, the manufacturer can simply set \(p_A=p_B=\bar{p}\ge k\) to prevent parallel trade occurrence, optimizing simultaneously the choice of the common price \(\bar{p}\). So, this is uniform pricing parallel trade-preventing pricing scheme.Footnote 12 In this scheme, due to the presence of positive parallel trade costs, it would be impossible for a potential parallel trader to obtain a positive profit from buying the product in the Country B from the manufacturer, then re-selling it in the Country A. In doing so, indeed, after buying the product in the Country B at the retail price \(p_B\), the potential parallel trader should choose a retail price in the Country A higher than \(p_B(=p_A)\), so the consumers in the Country A would have no incentive to buy the parallel traded product. In this case, parallel trade would not occur not as a consequence of a specific law forbidding parallel trade, but because of the pricing scheme chosen by the manufacturer. In the proof of the following Proposition 2, we describe how the manufacturer can optimize the common price \(\bar{p}\) to maximize its own surplus \(\varPi \), under the uniform pricing scheme described previously, taking into account also the decision whether to invest or not in R&D in the first decisional stage. The proposition reports the optimal value of the manufacturer’s surplus in the uniform pricing scheme, and the associated global welfare value, by considering various ranges of the relative market size \(\gamma \) (such ranges depend on the other parameters). We use the notation “UPS” to refer to this uniform pricing scheme.

Proposition 2

  1. (i)

    The optimal value of the manufacturer’s surplus \(\varPi _{UPS}^{\star }\) of the uniform pricing scheme is, for \(0 \le k < \frac{a}{b}\),

    $$\begin{aligned}&\varPi _{UPS}^{\star }=&\left\{ \begin{array}{llll} &{}\max \left\{ 0,\frac{b}{8}\left( \frac{(\gamma +1)a}{b}-2k\right) ^2 - C_F\right\} , &{}\mathrm{if\,\,} 1< \gamma < 1+\sqrt{2}\left( 1-\frac{kb}{a}\right) ,&{}\\ &{}\max \left\{ 0,\frac{b}{4}\left( \frac{\gamma a}{b}-k\right) ^2 - C_F\right\} , &{}\mathrm{if\,\,} \gamma \ge 1+\sqrt{2}\left( 1-\frac{kb}{a}\right) ,&{} \end{array} \right. \nonumber \\ \end{aligned}$$
    (7)

    and, for \(k \ge \frac{a}{b}\),

    $$\begin{aligned} \varPi _{UPS}^{\star }= & {} \left\{ \begin{array}{llll} &{} 0, &{}\mathrm{if\,\,} 1 < \gamma \le \frac{k b}{a},&{}\\ &{}\max \left\{ 0,\frac{b}{4}\left( \frac{\gamma a}{b}-k\right) ^2 - C_F\right\} , &{}\mathrm{if\,\,} \gamma > \frac{k b}{a}.&{} \end{array} \right. \end{aligned}$$
    (8)
  2. (ii)

    In case \(\varPi _{UPS}^{\star }{>0}\),Footnote 13 the associated global welfare value is, for \(0 \le k < \frac{a}{b}\),

    $$\begin{aligned}&GW_{UPS}^{\star }=&\left\{ \begin{array}{lll} &{}W\left( \gamma a - b \left( \frac{(\gamma +1) a}{4b}+ \frac{k}{2}\right) ,a - b \left( \frac{(\gamma +1) a}{4b}+ \frac{k}{2}\right) \right) - C_F, \\ &{} \,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\mathrm{if\,\,} 1< \gamma < 1+\sqrt{2}\left( 1-\frac{kb}{a}\right) ,&{}\\ &{}\min \bigg \{W\left( \gamma a - b \left( \frac{(\gamma +1) a}{4b}+ \frac{k}{2}\right) ,a - b \left( \frac{(\gamma +1) a}{4b}+ \frac{k}{2}\right) \right) , &{}\\ &{} \,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,W\left( \frac{\gamma a}{2}- \frac{kb}{2},0\right) \bigg \} - C_F, \\ &{} \,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\mathrm{if\,\,} \gamma = 1+\sqrt{2}\left( 1-\frac{kb}{a}\right) ,&{}\\ &{}W\left( \frac{\gamma a}{2}- \frac{kb}{2},0\right) - C_F, \\ &{}\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\mathrm{if\,\,} \gamma > 1+\sqrt{2}\left( 1-\frac{kb}{a}\right) ,&{} \end{array} \right. \end{aligned}$$
    (9)

    and, for \(k \ge \frac{a}{b}\),

    $$\begin{aligned} GW_{UPS}^{\star }= & {} \begin{aligned}&W\left( \frac{\gamma a}{2}- \frac{kb}{2},0\right) - C_F,&\mathrm{if\,\,} \gamma > \frac{k b}{a}.&\end{aligned} \end{aligned}$$
    (10)
  3. (iii)

    In case \(\varPi _{UPS}^{\star }{=0}\), the associated global welfare value is \(GW_{UPS}^{\star }=0\).

Our analysis reported above and its results differ from the ones detailed in Müller-Langer (2009b) because:

  1. 1.

    to make the model more complete, we also take into account the presence of a constant marginal cost of production k;

  2. 2.

    we allow the manufacturer to choose a price even higher than the threshold price above which one of the two markets is not served anymore, making it possible for the manufacturer to exploit such an information to take an optimal decision;

  3. 3.

    we consider two decisional stages, to take into account the possible decision of the manufacturer not to invest in R&D, and also the total fixed cost \(C_F\) of R&D incurred by the manufacturer in case R&D is performed.

5.2 1st differential pricing scheme

In the following, we investigate how the manufacturer changes the optimal decisions in case the following specific differential pricing scheme is adopted, which requires the additional assumption that parallel trade is forbidden.Footnote 14 In this pricing scheme, we require that, if the manufacturer has decided to invest in R&D and incurred the total fixed cost \(C_F\), then it maximizes its own surplus by optimizing the prices (and quantities) in the two countries without constraints on the relationship between the two prices. The following proposition reports the optimal value of the manufacturer’s surplus in this 1st differential pricing scheme, and the associated global welfare value, by considering various ranges of the relative market size \(\gamma \) (such ranges depend on the other parameters). We use the notation “\(DPS_I\)” to refer to this 1st differential pricing scheme.

Proposition 3

  1. (i)

    The optimal value of the manufacturer’s surplus \(\varPi _{DPS_I}^{\star }\) of the 1st differential pricing scheme is, for \(0 \le k < \frac{a}{b}\),

    $$\begin{aligned} \varPi _{DPS_I}^{\star }= & {} \max \left\{ 0,\frac{b}{4}\left( \frac{\gamma a}{b}-k\right) ^2 + \frac{b}{4}\left( \frac{a}{b}-k\right) ^2 - C_F\right\} , \end{aligned}$$
    (11)

    and, for \(k \ge \frac{a}{b}\),

    $$\begin{aligned} \varPi _{DPS_I}^{\star }= & {} \left\{ \begin{array}{llll} &{} 0, &{}\mathrm{if\,\,} 1 < \gamma \le \frac{k b}{a},&{}\\ &{}\max \left\{ 0,\frac{b}{4}\left( \frac{\gamma a}{b}-k\right) ^2 - C_F\right\} , &{}\mathrm{if\,\,} \gamma > \frac{k b}{a}.&{} \end{array} \right. \end{aligned}$$
    (12)
  2. (ii)

    In case \(\varPi _{DPS_I}^{\star }{>0}\), the associated global welfare value is, for \(0 \le k < \frac{a}{b}\),

    $$\begin{aligned} GW_{DPS_I}^{\star }= & {} \begin{aligned}&W\left( \frac{\gamma a}{2}- \frac{kb}{2},\frac{a}{2}- \frac{kb}{2}\right) - C_F,&\end{aligned} \end{aligned}$$
    (13)

    and, for \(k \ge \frac{a}{b}\),

    $$\begin{aligned} GW_{DPS_I}^{\star }= & {} \begin{aligned}&W\left( \frac{\gamma a}{2}- \frac{kb}{2},0\right) - C_F,&\mathrm{if\,\,} \gamma > \frac{k b}{a}.&\end{aligned} \end{aligned}$$
    (14)
  3. (iii)

    In case \(\varPi _{DPS_I}^{\star }{=0}\), the associated global welfare value is \(GW_{DPS_I}^{\star }=0\).

Other comments about the 1st differential pricing scheme, and its comparison with both the uniform pricing scheme and the 2nd differential pricing scheme, are reported in Sect. 6. Here, we anticipate that, according to the results of that section, differential pricing has often several advantages over uniform pricing in terms of the manufacturer’s incentive to invest in R&D, product accessibility, and efficiency from a global welfare perspective.

5.3 2nd differential pricing scheme

We now consider the case in which parallel trade is allowed, but the manufacturer, rather than adopting a uniform pricing scheme, sets a price difference lower than or equal to the parallel trade cost per-unit t (assumed to be positive, constant, and known to the manufacturer, otherwise replaced in the following analysis by a known positive lower bound on it), in case both markets are served. In this way, potential parallel traders are discouraged, and parallel trade does not occur. The following proposition reports the optimal value of the manufacturer’s surplus in this 2nd differential pricing scheme, and the associated global welfare value, by considering various ranges of the relative market size \(\gamma \) (such ranges depend on the other parameters). We write “\(DPS_{II}\)” to refer to this 2nd differential pricing scheme. As shown in Sect. 6, in several cases, this 2nd differential pricing scheme is preferred by the manufacturer to the uniform pricing scheme ( moreover, differently from the 1st differential pricing scheme, it does not require the additional assumption that parallel trade is forbidden).

Proposition 4 shows that, for \(0 \le k< \frac{a}{b}\), the 2nd differential pricing scheme transforms into the 1st differential pricing scheme, when \(\gamma \) is sufficiently small, and to the uniform pricing scheme, when \(\gamma \) is sufficiently large. Finally, when \(k \ge \frac{a}{b}\), all the pricing schemes behave in the same way. Some other technical remarks about the various pricing schemes are reported in Appendix C, which focuses especially on their common aspects. Other important differences between the three pricing schemes (e.g., in terms of product accessibility and global welfare) are examined in the next section.

Proposition 4

  1. (i)

    The optimal value of the manufacturer’s surplus \(\varPi _{DPS_{II}}^{\star }\) of the 2nd differential pricing scheme is, for \(0 \le k < \frac{a}{b}\),

    $$\begin{aligned}&\varPi _{DPS_{II}}^{\star }=&\left\{ \begin{array}{ll} &{}\max \bigg \{0,\frac{b}{4}\left( \frac{\gamma a}{b}-k\right) ^2+\frac{b}{4}\left( \frac{a}{b}-k\right) ^2- C_F\bigg \}, \\ &{}\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\mathrm{if\,\,} 1< \gamma \le 1+ \frac{2 t b}{a}, \\ &{}\max \bigg \{0,\left( \frac{(\gamma +1)a}{b}+ \frac{t-k}{2} \right) \left( \gamma a - b \left( \frac{(\gamma +1) a}{4 b}+\frac{k+t}{2}\right) \right) \\ &{}\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,+ \left( \frac{(\gamma +1)a}{b}- \frac{t+k}{2} \right) \left( a - b \left( \frac{(\gamma +1) a}{4 b}+\frac{k-t}{2}\right) \right) - C_F\bigg \}, \\ &{}\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\mathrm{if\,\,} 1+ \frac{2 t b}{a}< \gamma < 1+ \frac{2 t b}{a}+ \sqrt{2}\left( 1-\frac{kb}{a}\right) ,\\ &{}\max \left\{ 0,\frac{b}{4}\left( \frac{\gamma a}{b}-k\right) ^2 - C_F\right\} , \\ &{}\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\mathrm{if\,\,} \gamma \ge 1+\frac{2 t b}{a} + \sqrt{2}\left( 1-\frac{kb}{a}\right) , \\ \end{array} \right. \end{aligned}$$
    (15)

    and, for \(k \ge \frac{a}{b}\),

    $$\begin{aligned} \varPi _{DPS_{II}}^{\star }= & {} \left\{ \begin{array}{llll} &{} 0, &{}\mathrm{if\,\,} 1 < \gamma \le \frac{k b}{a},&{}\\ &{}\max \left\{ 0,\frac{b}{4}\left( \frac{\gamma a}{b}-k\right) ^2 - C_F\right\} , &{}\mathrm{if\,\,} \gamma > \frac{k b}{a}.&{} \end{array} \right. \end{aligned}$$
    (16)
  2. (ii)

    In case \(\varPi _{DPS_{II}}^{\star }{>0}\), the associated global welfare value is, for \(0 \le k < \frac{a}{b}\),

    $$\begin{aligned}&GW_{DPS_{II}}^{\star }=&\left\{ \begin{array}{l} W\left( \frac{\gamma a}{2}-\frac{kb}{2},\frac{a}{2}-\frac{kb}{2}\right) - C_F,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\mathrm{if\,\,} 1< \gamma \le 1+\frac{2 t b}{a},\\ W\left( \gamma a -b \left( \frac{(\gamma +1)a}{4 b}+ \frac{k+t}{2}\right) , a -b \left( \frac{(\gamma +1)a}{4 b}+ \frac{k-t}{2}\right) \right) - C_F, \,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\\ \,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\mathrm{if\,\,} 1+\frac{2 t b}{a}< \gamma < 1+\frac{2 t b}{a} + \sqrt{2} \left( 1-\frac{kb}{a}\right) ,\\ \min \bigg \{W\left( \gamma a -b \left( \frac{(\gamma +1)a}{4 b}+ \frac{k+t}{2}\right) , a -b \left( \frac{(\gamma +1)a}{4 b}+ \frac{k-t}{2}\right) \right) , \,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\\ \,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,W\left( \frac{\gamma a}{2}- \frac{kb}{2},0\right) \bigg \} - C_F,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\\ \,\,\,\,\mathrm{if\,\,} \gamma = 1+\frac{2 t b}{a} + \sqrt{2} \left( 1-\frac{kb}{a}\right) ,\\ W\left( \frac{\gamma a}{2}- \frac{kb}{2},0\right) - C_F,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\\ \,\,\,\,\mathrm{if\,\,} \gamma > 1+\frac{2 t b}{a} + \sqrt{2} \left( 1-\frac{kb}{a}\right) , \end{array} \right. \end{aligned}$$
    (17)

    and, for \(k \ge \frac{a}{b}\),

    $$\begin{aligned} GW_{DPS_{II}}^{\star }= & {} \begin{aligned}&W\left( \frac{\gamma a}{2}- \frac{kb}{2},0\right) - C_F,&\mathrm{if\,\,} \gamma > \frac{k b}{a}.&\end{aligned} \end{aligned}$$
    (18)
  3. (iii)

    In case \(\varPi _{DPS_{II}}^{\star }{=0}\), the associated global welfare value is \(GW_{DPS_{II}}^{\star }=0\).

6 Comparison of the three pricing schemes at optimality in terms of manufacturer’s surplus, accessibility of the product, global welfare, and loss of efficiency

6.1 Theoretical comparison

In the following, the three pricing schemes are compared at optimality in terms of manufacturer’s surplus, accessibility of the product in the two countries, global welfare, and loss of efficiency with respect to the optimization performed by the global planner. The next comparisons are based on the results of the analyses of the three pricing schemes, made in Sect. 5.

As a first comparison, in the next Proposition 5 we confront the three pricing schemes from the perspective of the manufacturer’s surplus. The proposition shows that, at optimality for each pricing scheme, such surplus increases when moving from the uniform pricing scheme to the 2nd differential pricing scheme, then to the 1st differential pricing scheme. The proposition, which refers to the optimal manufacturer’s surpluses under the three pricing schemes, does not necessarily extend to the corresponding values of the global welfare.

Proposition 5

The optimal manufacturer’s surpluses under the “UPS”, “\(DPS_I\)”, and “\(DPS_{II}\)” pricing schemes are related as follows:

$$\begin{aligned} \varPi ^\star _{UPS} \le \varPi ^\star _{DPS_{II}} \le \varPi ^\star _{DPS_{I}}. \end{aligned}$$
(19)

Expressions of \(\varPi ^\star _{UPS}\), \(\varPi ^\star _{DPS_{I}}\), and \(\varPi ^\star _{DPS_{II}}\) have been reported in Propositions 2, 3, and 4, respectively. When combined with such expressions, Proposition 5 shows that the uniform pricing scheme and the 1st differential pricing scheme have at optimality, respectively, the lowest and the highest threshold on the total fixed cost \(C_F\) of R&D under which there is no incentive for the manufacturer to invest in R&D (in this case, a global welfare value equal to 0 is obtained). Hence, in our framework, the manufacturer is always interested in having parallel trade forbidden. This is a first important consequence of the analysis made in the paper. The proof of this result relies on the fact that more constrained optimization problems provide a lower (or at most the same) optimal surplus to the manufacturer. Although the uniform pricing scheme is always the least preferable from the manufacturer’s viewpoint, it has been worth including such a pricing scheme in the analysis of the paper, because it does not require the manufacturer to know the exact value of the parallel trade cost per-unit t. Moreover, a comparison of the expressions of \(\varPi ^\star _{UPS}\), \(\varPi ^\star _{DPS_{II}}\) and \(\varPi ^\star _{DPS_{I}}\) (provided, respectively, in Propositions 2, 3, and 4) would easily quantify the degree of suboptimality (for the manufacturer) of the first two pricing schemes with respect to the last one. It is also worth mentioning that all these expressions are piecewise-quadratic with respect to k (and t, for the 2nd differential pricing scheme), and piecewise-affine with respect to \(C_F\). Hence, their partial derivatives are piecewise-affine with respect to k (and t, for the 2nd differential pricing scheme), and piecewise-constant with respect to \(C_F\). This would simplify a comparison of their sensitivity analyses with respect to such parameters. A similar remark holds for the comparison of the global welfare values associated with the three pricing schemes, whose expressions have been also given in Propositions 2, 3, and 4.

The following result shows other two important advantages at optimality of differential pricing over uniform pricing, which hold under specific assumptions. They concern, respectively, accessibility of the product to consumers in the country with the smallest market size, and global welfare.

Proposition 6

Let \(0 \le k < \frac{a}{b}\), \(C_F \ge 0\) and \(\gamma >1\) such that \(\varPi _{UPS}^\star ,\varPi _{DPS_I}^\star ,\varPi _{DPS_{II}}^\star {>0}\). Then,

  1. (i)

    the threshold on \(\gamma \) above which the Country B is not served at optimality but the Country A is still served is \(\gamma _{UPS}=1+\sqrt{2} \left( 1-\frac{kb}{a}\right) \) for the uniform pricing scheme, and \(\gamma _{DPS_{II}}=1+ \frac{2 t b}{a} + \sqrt{2} \left( 1-\frac{kb}{a}\right) \) for the 2nd differential pricing scheme. There is no such threshold for the 1th differential pricing scheme;

  2. (ii)

    for \(\gamma > \gamma _{UPS}\), one has \(GW_{DPS_{I}}^{\star }>GW_{UPS}^{\star }\); if \(\gamma _{UPS} < \gamma \le 1 + \frac{2 t b}{a}\), one has also \(GW_{DPS_{II}}^{\star }>GW_{UPS}^{\star }\).

Proposition 6(i) shows that, for both the uniform pricing scheme and the 2nd differential pricing scheme, the country with the smallest market size is never served at optimality when the relative market size is larger than a suitable threshold (\(\gamma _{UPS}\) for the uniform pricing scheme, \(\gamma _{DPS_{II}}\) for the 2nd differential pricing scheme). It is worth mentioning that the assumption \(0 \le k < \frac{a}{b}\) is realistic, since the marginal cost of production k is typically low, e.g., in the pharmaceuticals sector. Still, even a low k can have a non-negligible effect on the thresholds \(\gamma _{UPS}\) and \(\gamma _{DPS_{II}}\). Both thresholds are decreasing functions of k, and \(\gamma _{DPS_{II}}\) is also an increasing function in t (intuitively, this means that the parallel trade “threat” for the 2nd differential pricing scheme becomes less “fearsome” when t increases). Moreover, \(\gamma _{DPS_{II}}>\gamma _{UPS}\) holds. Hence, compared to the uniform pricing scheme, the 2nd differential pricing scheme serves the country with the smallest market size (i.e., it makes the product accessible to consumers in that country) under a larger interval of values for \(\gamma \). No such threshold is present for the case of the 1st differential pricing scheme. It has also to be noticed that, for the uniform pricing scheme and \(k=0\), the threshold \(\gamma _{UPS}' = 3\) is obtained in Müller-Langer (2009b), rather than \(\gamma _{UPS}=1+\sqrt{2} < \gamma _{UPS}'\), where \(\gamma _{UPS}\) comes from Proposition 6(i). This depends on the fact that Müller-Langer (2009b) uses a different model than ours for the uniform pricing scheme. Indeed, as discussed at the end of Sect. 5.1, in our model the manufacturer can choose a price even higher than the threshold price above which one of the two markets is not served anymore.

Proposition 6(ii) shows that, under certain conditions on the relative market size, the two differential pricing schemes produce at optimality global welfare values higher than the one obtained for the uniform pricing scheme, making them preferable from a society’s wellbeing perspective. In particular, in this case, compared to the uniform pricing scheme, they show lower losses of efficiency with respect to the optimal global welfare value obtained by the global planner. Nevertheless, one should notice that Proposition 6(ii) does not exclude the existence of other choices of the parameters for which, e.g., \(GW_{DPS_{II}}^{\star }>GW_{UPS}^{\star }\) still holds, or of other cases for which even the opposite inequality is satisfied.

Finally, our results allow one to compare the losses of efficiency of the three pricing schemes. Each of them is formally defined as the ratio between the optimal value of the global welfare function found by the global planner (see Proposition 1), and the global welfare value associated with the specific pricing scheme (see Propositions 2, 3, and 4, respectively). When an indeterminate expression of the form \(\frac{0}{0}\) is obtained, by convention, we set the loss of efficiency to be equal to 1, as the numerator and the denominator are equal.

The loss of efficiency (which is always higher than or equal to 1, by its definition) is useful to detect when a pricing scheme adopted by the manufacturer is satisfactory from the perspective of global welfare optimization. This occurs when the loss of efficiency is near 1. When this is not the case, an intervention by a policymaker (e.g., a government regulation) would be needed to increase significantly the value of the global welfare, by inducing a suitable change in the pricing scheme adopted by the manufacturer.

6.2 Numerical comparison of the three pricing schemes

Although in principle closed-form expressions for the losses of efficiency can be obtained using Propositions 1, 2, 3, and 4, in the last part of this section we prefer to do a numerical comparison. In the following, we denote, respectively, by \(LoE_{UPS}\left( \gamma ,C_F,k\right) \), \(LoE_{DPS_I}(\gamma ,C_F,k)\), and \(LoE_{DPS_{II}}(\gamma ,C_F,k,t)\) the losses of efficiency associated with the uniform pricing scheme, the 1st differential pricing scheme, and the 2nd differential pricing scheme (in each case, at optimality), highlighting their dependence on the respective parameters.

As an illustrative example, we first consider here, for simplicity, the case of a zero total fixed cost. The behavior of the function \(LoE_{UPS}\left( \gamma ,0,0\right) \) corresponding to a zero marginal cost of production is illustrated in Fig. 2a, and compared to the one of the function \(LoE_{UPS}(\gamma ,0,\frac{a}{10b})\) corresponding to our choice of a low positive constant marginal cost of production. Figure 2a shows that, the relative market size \(\gamma \) being the same, the loss of efficiency of the uniform pricing scheme is higher for the case of the zero marginal cost, with the exception of a small interval of values for \(\gamma \), i.e., the ones between the discontinuity points \(\gamma =1+\frac{9}{5 \sqrt{2}}\) and \(\gamma =1+\sqrt{2}\) associated with the functions \(LoE_{UPS}\left( \gamma ,0,0\right) \) and \(LoE_{UPS}\left( \gamma ,0,\frac{a}{10b}\right) \). In both cases, however, for small values of \(\gamma \) the loss of efficiency is close to 1, indicating that the uniform pricing scheme is quite efficient (i.e., the reduction with respect to the optimal efficiency is small), and no intervention by a policymaker is needed to increase significantly the value of the global welfare, by inducing a suitable change in the pricing scheme adopted by the manufacturer. However, for larger values of \(\gamma \), an intervention by a policy maker would be needed to make the manufacturer switch to a better pricing scheme. Figure 2b, c illustrate, for \(t=k\) and the same values for the other parameters as in Fig. 2a, the losses of efficiency \(LoE_{DPS_I}(\gamma ,C_F,k)\) and \(LoE_{DPS_{II}}(\gamma ,C_F,k,t)\) associated, respectively, with the 1st and 2nd differential pricing schemes. These figures show that the 1st differential pricing scheme is more efficient than the uniform pricing scheme for large \(\gamma \), and slightly less efficient for small \(\gamma \). This depends on the fact that, for large \(\gamma \), both schemes provide the same price and quantity for the Country A, but the uniform pricing scheme is not able to serve Country B, whereas this is served by the 1st differential pricing scheme. It is also worth mentioning that, for large \(\gamma \), the consumers’ surplus in Country A is the same for both schemes, whereas both the manufacturer’s surplus and the consumers’ surplus in Country B are higher under the 1st differential pricing scheme. Hence, in this case, the 1st differential pricing scheme implies a Pareto improvement.Footnote 15 For small \(\gamma \), instead, the quantity of the product sold in Country A under the 1st differential pricing scheme is lower than the one sold in the same country under the uniform pricing scheme. Nevertheless, the quantity sold in Country B under that differential pricing scheme is higher than the one sold in the same country under the uniform pricing scheme, and, in the specific case, the uniform pricing scheme is more efficient. However, in this case, other pricing schemes, that give the manufacturer the incentive to produce in the two countries quantities more similar to the ones suggested by the global planner, would be even more efficient. Finally, Fig. 2c shows that the 2nd differential pricing scheme has the same loss of efficiency as the uniform pricing scheme for \(k=t=0\), and a similar loss of efficiency for \(k=t=\frac{a}{10 b}\) (i.e., slightly higher than the loss of efficiency of the uniform pricing scheme for \(\gamma \) lower than the discontinuity point associated with that scheme, the same for \(\gamma \) higher than the discontinuity point associated with the 2nd differential pricing scheme, and much lower for \(\gamma \) between the two discontinuity points). Concluding, the results of this comparison show that the three pricing schemes are ranked in various ways with respect to the loss of efficiency, depending on the specific choices of their parameters. Some additional insights about the results of this comparison have been provided before by Proposition 6(ii).

Fig. 2
figure 2

For \(k=0\) (zero marginal cost of production) and \(k=\frac{a}{10b}\) (low positive constant marginal cost of production): plots of the loss of efficiency (as a function of the relative market size \(\gamma \)) associated with a the uniform pricing scheme; b the 1st differential pricing scheme; c the 2nd differential pricing scheme. The figure refers to the case \(C_F=0\), \(t=k\)

At this point, we investigate numerically the case in which either the total fixed cost \(C_F\) or the marginal cost k are random variables, which become known to the manufacturer (and also to the global planner) after taking the decision whether to invest or not in R&D. In this stochastic extension, the global welfare is replaced by its expected value. In this situation, the extension of our analysis is straightforward because, when the manufacturer decides to invest in R&D, one can simply replace the (deterministic) total fixed cost \(C_F\) and marginal cost k in various equations in Appendix A (such as (26) and (32)–(37) for the case of the loss of efficiency associated with the uniform pricing scheme) by realizations, respectively, of a stochastic total fixed cost and a stochastic marginal cost. In general, also the knowledge of the a-priori probability distributions of such random variables is needed, in order to evaluate the expected value of the optimal global welfare. With this premise, Fig. 3 shows, under various assumptions on all the costs: the optimal value of the global welfare for the global planner; the value of the global welfare associated with each pricing scheme (at optimality); the loss of efficiency associated with each pricing scheme (again, at optimality). All these quantities are plotted as functions of the relative market size \(\gamma \), for fixed choices of the other parameters. The blue curves refer to the deterministic case \(C_F=0\), \(k=0\), and \(t=0\), the dashed red curves to the deterministic case \(C_F=\frac{a^2}{20b}\), \(k=\frac{a}{10b}\), and \(t=k\), and the dash-dotted green curves to the stochastic case where \(C_F\) and k are realizations of independent random variables such that \(C_F\) assumes the two values 0 and \(\frac{a^2}{20b}\), respectively with a-priori probabilities 0.3 and 0.7, k assumes the two values 0 and \(\frac{a}{10b}\), respectively with a-priori probabilities 0.4 and 0.6, and \(t=k\). The parameters a and b are fixed, respectively, to 2 and 1.5. An inspection of the specific cases shown in the figure reveals that the optimal value of the global welfare for the global planner is the highest when all costs are 0, and in the stochastic case, it is in between the two values obtained in the deterministic case.

Fig. 3
figure 3

Plots (as functions of the relative market size \(\gamma \)) of ac the optimal value of the global welfare for the global planner; df the value of the global welfare associated with each pricing scheme; gi its loss of efficiency. In the second and third rows, the first column refers to the uniform pricing scheme, the second column to the 1st differential pricing scheme, and the third column to the 2nd differential pricing scheme. See the main text for a description of the values assumed by the various parameters

7 Conclusions

In the paper, we have investigated the welfare and R&D incentive effects of three pricing schemes under which parallel trade does not occur: a uniform pricing scheme, and two differential pricing schemes. This comparison has been motivated by the fact that price discrimination is often considered as welfare-superior to uniform pricing (Danzon and Towse 2003; Danzon et al. 2015; Towse et al. 2015). We reach a similar conclusion, considering different models, for which optimal decisions by both the global planner and the manufacturer have been obtained by formulating and solving in closed form suitable quadratic (and dynamic) optimization problems.

First, for the specific model of production and trade considered in the paper, we have obtained in closed form the optimal value of the global welfare, then we have found the prices and quantities produced by the manufacturer assuming that a specific uniform pricing scheme is applied. Our investigation of the uniform pricing scheme has been obtained by extending its analysis performed in previous works. Differently from such literature, we have taken into account in the model both the marginal cost of production and the total fixed cost of R&D. Moreover, we have allowed the manufacturer to choose a price even higher than the threshold price above which one market (the smallest one) is not served. Taking into account such factors in the analysis allows one to determine whether and how much the conclusions obtained by existing models change due to modifications in their assumptions.

Second, we have extended the analysis to the two differential pricing schemes. In both cases, parallel trade cannot occur by construction (likewise for the uniform pricing scheme).

Third, we have compared the various models with respect to the manufacturer’s incentive to invest in R&D, the loss of efficiency, and more generally, the incentive to enter the market of each country. By a comparison of the optimal objective values of the various price optimization problems considered in the paper, and an analysis of the structure of the respective optimal solutions, we have shown that, compared to the uniform pricing scheme, the two differential pricing schemes increase the incentive for the manufacturer to invest in R&D (as being associated with a higher profit for the manufacturer). We have also found a sufficient condition under which they serve both countries under a larger range of values for the relative market size of the two countries. In this case, the product becomes more accessible to the consumers in the lower price country. Moreover, for the specific model of production and trade considered in the paper, we have found a sufficient condition under which differential pricing dominates uniform pricing from a global welfare perspective.

The results of our comparisons are in accordance with the ones already obtained in the specialized literature about pricing of pharmaceuticals (see Jelovac and Bordoy 2005; Müller-Langer 2009b and the references therein), and extend them to the specific models considered in the analysis. In particular, one of the novel aspects is in quantifying, in the analysis of the specific model of production and trade, the effects on the manufacturer’s behavior of a positive marginal cost of production (a cost which is often neglected in such literature). One such effect is in changing some thresholds on the relative market size above which the lower price country is not served. As an important contribution, our analysis clarifies the conditions—which have been overlooked in the related literature—that allow/do not allow one to neglect the marginal cost of production in the analysis. In any case, the results provide a robustness check with respect to those models in the literature about pricing of pharmaceuticals that do not take into account the presence of that factor in the analysis.

The analysis of each of the pricing schemes considered in the paper is based on the following assumptions. While the uniform pricing scheme always prevents parallel trade, the first differential pricing scheme assumes additionally an effective prohibition of parallel trade, which requires an explicit intervention by policymakers. The second differential pricing scheme requires the knowledge (by the manufacturer) of the parallel trade cost per-unit (or at least of a positive lower bound on it). For what concerns policy implications, this calls for the possibility of enforcing, in each situation, the assumption (and, as a consequence, the choice of the manufacturer’s pricing scheme) that provides the lowest loss of efficiency from a global welfare perspective. Such an external intervention, indeed, would influence the manufacturer’s behavior, making it possible also to extend the models considered in the article to their possible variations. Nevertheless, it has also to be mentioned that, in cases in which, for certain choices of the parameters and of the pricing scheme, the loss of efficiency is already low, no external intervention is really needed to improve the efficiency significantly.

Our results can be applied to specific empirical case studies after identifying the values of their parameters (e.g., the relative market size, the total fixed cost, the constant marginal cost, and the parallel trade cost per-unit). Our framework is applicable also in extensions to more than two countries, to stochastic costs,Footnote 16 and to nonlinear models for the demand functions (rather than to the piecewise-linear ones studied in the paper). However, in these cases more complex expressions for the optimal strategies are expected, possibly not available in closed form (hence, differently from this paper, an analytical comparison may not be possible, and numerical approaches would be needed, see Boyd and Vandenberghe 2004; Nocedal and Wright 2006, including, e.g., those based on approximations either of the optimal strategies, see Gnecco and Sanguineti 2010 or of the value functions, see Gaggero et al. 2014). For instance, Proposition 5 extends directly to the nonlinear case. In that case, for each pricing scheme, the Lagrange multipliers associated with the various subproblems of the second decisional stage could be used to perform a sensitivity analysis of their optimal objective values. This would allow to characterize for which of these subproblems the maximum among their optimal manufacturer’s surpluses is achieved, when changing some parameters. This kind of analysis was not needed in the paper, having already obtained closed-form optimal solutions. Moreover, our analysis could be applied, with no significant changes in the adopted methodology, to the case in which the demand functions depend on the total fixed cost of Research and Development \(C_F\), by modeling the parameters a, b, and \(\gamma \) of the demands as functions of \(C_F\), and considering \(C_F\) itself as an optimization variable.Footnote 17 In this case, one should take into account that the various optimization problems modeling the manufacturer’s behavior may have different total fixed costs of R&D at optimality. Another possible extension would consist in replacing the hypothetical global planner with a national government, whose objective is to maximize the welfare—now defined as the sum of the domestic consumers’ surplus and of the global profits of the manufacturer—in the country where the manufacturer is located. In this case, closed-form optimal solutions are still expected. However, such an extension would not change the optimization problems defining the optimal strategies of the manufacturer under the three pricing schemes, but only their common term of comparison.

Finally, in the paper, we have assumed that the manufacturer is the only decision maker. This has been possible due to the absence of parallel trade. However, its main ideas can be applied also to other models, e.g., in a noncooperative game-theoretic setting (Gibbons 1992) with two or more agents. In particular, the manufacturer’s surpluses associated with its optimal strategies for the three models examined in the paper could be used as terms of comparison for more complex models involving game theory and modeling the possible presence of parallel trade at the game-theoretic equilibrium. Indeed, when moving from a single-agent to a multiple-agent setting, the manufacturer would accept changing its strategy (possibly allowing parallel trade occurrence at the equilibrium) only if this resulted in its higher profit. Some results about the comparison of such noncooperative game-theory models have been recently obtained in Gnecco et al. (2018). However, differently from the present paper, that work does not include the marginal cost of production in the noncooperative game-theoretic models examined. Including such a cost, and considering the single-agent optimal strategies of the present paper as terms of comparison, would allow a quantification of the effect of the marginal cost of production in such noncooperative game-theoretic extensions.