Network revenue management with inventory-sensitive bid prices and customer choice

doi:10.1016/j.ejor.2011.06.033

European Journal of Operational Research

Volume 216, Issue 2, 16 January 2012, Pages 459-468

https://doi.org/10.1016/j.ejor.2011.06.033

Abstract

We develop an approximate dynamic programming approach to network revenue management models with customer choice that approximates the value function of the Markov decision process with a non-linear function which is separable across resource inventory levels. This approximation can exhibit significantly improved accuracy compared to currently available methods. It further allows for arbitrary aggregation of inventory units, thereby reducing the computational workload, and yields upper bounds on the optimal expected revenue that are provably at least as tight as those obtained from previous approaches. Computational experiments for the multinomial logit choice model with distinct consideration sets show that policies derived from our approach can outperform some recently proposed alternatives, and we demonstrate how aggregation can be used to balance solution quality and runtime.

Highlights

► Solution method for network revenue management problems with improved accuracy compared to other methods.
► Approximate dynamic programming approach with an arbitrary aggregation of inventory units.
► The algorithm allows a trade-off between solution quality and runtime.

Introduction

A particular area of revenue management (RM) that currently receives much interest is the approximate solution of the network RM problem including models of customer choice behavior. Network problems arise in many applications such as hospitality or transportation, where the managed products might require more than one resource, for example a hotel that sells rooms over several nights. While network models have been around for some time, only in recent years have researchers devoted themselves to advancing discrete choice models in which purchase decisions also depend on the offered product alternatives. The need for such models is heightened by the rise of low-cost service providers, since they cut many of the traditional restrictions meant to segment the market, leaving the customer with similar products whose only distinguishing feature is essentially the price. Even where some restrictions remain, customers increasingly tend to ignore them in their purchase decisions, so that in some business applications demand can only be observed for the product with the lowest available price, as pointed out by Boyd and Kallesen (2004). Such behavior is in stark contrast to the traditional independent demand setting, where it is assumed that demand is associated with a product and does not depend on market conditions such as which other products the firm offers. It is therefore crucial to incorporate customer choice models into RM; more on the advantages of customer choice in the RM context can be found in van Ryzin (2005) and, for a comprehensive treatment of RM, in Talluri and van Ryzin (2004b).

We base our investigations on the work of Zhang and Adelman (2009), who extend the independent demand RM model of Adelman (2007) to incorporate customer choice behavior. Their approach differs from others in that they use an affine function of the state vector to approximate the value function of the exact dynamic programming formulation with a linear program (LP), in such a way that it yields time-dependent estimates of the marginal capacity values. The optimal objective of this LP constitutes an upper bound on the exact optimal expected revenue which is tighter than those obtained by several other currently available methods. Since the LP has many variables, it is solved by column generation; for the multinomial logit (MNL) choice model with disjoint consideration sets, the column generation subproblem is shown to reduce essentially to solving smaller mixed integer linear programs, which makes the approach implementable in practice. They construct policies directly from the dual solution as well as through a dynamic programming decomposition scheme and show that both perform very well. The most important reason for the improved performance is that the LP naturally generates time-dependent marginal capacity value estimates, which gives this approach an edge over methods that generate static values.

However, intuitively these values should depend not only on time to departure (for ease of presentation we will stick to airline terminology), but also on the inventory levels. This dependence on intermediate capacity levels of the resources is not captured by current approaches to network RM with choice behavior. In the independent demand setting, a suitable approximation function was recently proposed by Farias and Van Roy (2007). Instead of using constraint generation to deal with the many constraints of the arising linear program, they propose a constraint sampling procedure based on the work of de Farias and Van Roy (2003, 2004). The same approximation was independently proposed by Talluri (2008) under the name of strong affine relaxation and shown to provide tighter upper bounds on the optimal expected revenue than other available methods for the no-choice setting. Topaloglu (2009) also recently focused on time- and capacity-dependent bid prices: he proposed a network RM approach based on Lagrangian relaxation, but again without inclusion of choice behavior.

Our key contributions are the following:

• We propose a new linear programming approach to approximate dynamic programming that approximates the value function with a nonlinear function of the state vector which is separable over arbitrarily chosen ranges of resource inventory levels. As a special case, we can choose this approximation to be separable over each possible inventory level, which then corresponds to the approximation proposed by Farias and Van Roy (2007); in contrast to their approach, however, our model also accounts for customer choice behavior.
• We show that the linear programs of Liu and van Ryzin (2008), Zhang and Adelman (2009) and Kunnumkal and Topaloglu (2008) can all be seen as special cases of our linear programming formulation. As a consequence, we obtain upper bounds on the objective value that are tighter than those of these other approaches and that are asymptotically optimal as time horizon, demand and capacities are linearly scaled up.
• We prove that column generation essentially reduces to solving small mixed integer linear programs. Policies for the MNL model with disjoint consideration sets are numerically tested and show significantly improved results.
• Due to the larger number of constraints, our approach is considerably more expensive than others if we allow the marginal capacity value estimates to change from any possible inventory level to another. However, we find that sensitivity to inventory levels is pronounced mainly close to departure. To cut down computational requirements for large networks without much deterioration of solution quality, we can therefore exploit the flexibility of our model with respect to arbitrary aggregations of inventory levels: we solve it with high inventory aggregation at the beginning of the booking horizon and later re-solve it with lower aggregation, and thus higher accuracy, so that we capture the typically more pronounced nonlinearity of the value function in inventory levels closer to the end of the time horizon.
• A seemingly new upper bound relationship between the approaches of Zhang and Adelman (2009) and Kunnumkal and Topaloglu (2008) is shown, namely that the former provides a tighter upper bound on the objective value than the latter.
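For orientation, the two approximation families contrasted in the contributions above can be written side by side. The affine form corresponds to the description of Zhang and Adelman (2009) given earlier; the separable form is only a sketch of one way to express separability over inventory ranges. The symbols $\theta_t$, $V_{t,i}$, $V_{t,i,r}$ and the range map $r_i(\cdot)$ are notation assumed here for illustration, not taken from the paper.

```latex
% Affine approximation: one time-dependent marginal capacity value V_{t,i} per leg.
v_t(x) \approx \theta_t + \sum_{i=1}^{m} V_{t,i}\, x_i

% Separable approximation (assumed notation): the marginal value of the k-th
% unit on leg i depends on the inventory range r_i(k) that contains k.
v_t(x) \approx \theta_t + \sum_{i=1}^{m} \sum_{k=1}^{x_i} V_{t,i,r_i(k)}
```

In this sketch, a single range per leg collapses the second expression to the affine one, while one range per inventory level lets every unit carry its own marginal value.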

The paper is organized as follows. In the next section we briefly review the related literature. In Section 3 we present our model including the required notation, followed by the resulting Markov decision process and its equivalent linear programming form in Section 4. We introduce the linear programming models that we compare our approach with in Sections 4.1 (choice-based deterministic LP), 4.2 (alternative deterministic LP) and 5 (approximation based on the equivalent LP). Our own approach is derived in Section 5 as well. We show that the column generation subproblem is reducible to a mixed integer linear program in Section 6 and describe bid price policies in Section 7. Finally, we present the computational results in Section 8 and conclude in Section 9.

Section snippets

Literature review

The earliest contributions to single-leg RM with choice behavior include Brumelle et al. (1990) and Belobaba and Weatherford (1996), amongst others, and for networks the PODS simulation studies by Belobaba and Hopperstad (1999). Zhang and Cooper (2005) consider an inventory control problem for a set of parallel flights including a customer choice model, yielding a stochastic optimization problem which is solved by simulation-based methods. Zhang and Cooper (2009) develop a pricing model

Products

Let our network consist of m resources (flight legs in the airline application) and n products. A product consists of a seat on one or several flight legs in combination with a fare class and departure date. Each resource i has a fixed capacity of c_i, and the network capacity is given by the corresponding vector c = [c₁, … , c_m]^T. The capacity is homogeneous, meaning that all seats are perfectly substitutable and do not differ, hence allowing us to accommodate all kinds of requests from the

Current solution approaches

Let v_t(x) denote the expected revenue-to-go from time period t until the final period τ, given the vector x ∈ X of still available resources in the network. The well-known optimality equation for maximizing expected revenue is then given by $v_{t}(x) = \max_{S \subseteq N(x)} \left\{ \sum_{j \in S} \lambda P_{j}(S) \left( f_{j} + v_{t+1}(x - A^{j}) \right) + \left( \lambda P_{0}(S) + 1 - \lambda \right) v_{t+1}(x) \right\} = \max_{S \subseteq N(x)} \left\{ \sum_{j \in S} \lambda P_{j}(S) \left[ f_{j} - \left( v_{t+1}(x) - v_{t+1}(x - A^{j}) \right) \right] \right\} + v_{t+1}(x), \quad \forall t, x, \qquad (1)$ with boundary condition $v_{\tau+1}(x) = 0$ for all x. The decision to be made within each time period is which set of products to offer before we can
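The recursion above can be evaluated by brute force on a very small network. The following is a minimal sketch, assuming a multinomial logit choice model and a toy instance whose numbers (capacities, fares, preference weights, arrival probability, horizon) are all hypothetical and not taken from the paper. It enumerates every offer set S ⊆ N(x), which is only viable because the instance is tiny.

```python
from itertools import combinations, product

# Hypothetical toy instance (illustrative numbers, not from the paper).
CAPACITY = (2, 2)                   # c: seats on each of the two legs
FARES = (100.0, 150.0, 220.0)       # f_j
USAGE = ((1, 0), (0, 1), (1, 1))    # A^j: seats product j consumes per leg
WEIGHTS = (1.0, 0.8, 0.6)           # assumed MNL preference weights
W0 = 1.0                            # assumed no-purchase weight
LAM = 0.9                           # lambda: arrival probability per period
TAU = 5                             # number of time periods


def choice_probs(offer):
    """MNL purchase probabilities P_j(S) for the offered set S."""
    denom = W0 + sum(WEIGHTS[j] for j in offer)
    return {j: WEIGHTS[j] / denom for j in offer}


def feasible_products(x):
    """N(x): products for which every required leg still has capacity."""
    return [j for j, a in enumerate(USAGE)
            if all(xi >= ai for xi, ai in zip(x, a))]


def solve_dp():
    """Backward recursion over t = tau, ..., 1 with v_{tau+1}(x) = 0."""
    states = list(product(*(range(c + 1) for c in CAPACITY)))
    v = {(TAU + 1, x): 0.0 for x in states}
    for t in range(TAU, 0, -1):
        for x in states:
            candidates = feasible_products(x)
            best = float("-inf")
            for size in range(len(candidates) + 1):
                for offer in combinations(candidates, size):
                    probs = choice_probs(offer)
                    gain = 0.0
                    for j in offer:
                        x_next = tuple(xi - ai for xi, ai in zip(x, USAGE[j]))
                        # fare minus opportunity cost of the seats used
                        opp_cost = v[t + 1, x] - v[t + 1, x_next]
                        gain += LAM * probs[j] * (FARES[j] - opp_cost)
                    best = max(best, gain + v[t + 1, x])
            v[t, x] = best
    return v


value = solve_dp()
print(round(value[1, CAPACITY], 2))  # expected revenue from full capacity
```

The state space grows with the product of (c_i + 1) over all legs and the action space with 2^n, which is why exact evaluation of this kind does not scale beyond toy instances.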

Approximation based on the equivalent LP

The following linear programming formulation will serve as the starting point of our considerations. It is equivalent to the dynamic program (1) and, for that reason, we denote it by (EQ). The equivalence can be derived from fundamental results of value iteration; see Powell (2007), for example. $(EQ) \quad \min_{v(\cdot)} v_{1}(c)$ subject to $v_{t}(x) \geq \lambda \sum_{j \in S} P_{j}(S) \left[ f_{j} - \left( v_{t+1}(x) - v_{t+1}(x - A^{j}) \right) \right] + v_{t+1}(x), \quad \forall t, x, S \subseteq N(x).$ The decision variables are v_t(x), for all t, x, and therefore the problem is also intractable for a large state space. The
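The relationship between (EQ) and the dynamic program can be checked numerically without an LP solver. The sketch below uses a hypothetical single-leg instance with an assumed MNL choice model; all numbers are illustrative. It verifies that the dynamic programming solution satisfies every (EQ) constraint with at least one constraint tight, and that a uniformly inflated value function is also feasible but has a strictly larger objective v_1(c).

```python
from itertools import combinations

# Hypothetical single-leg instance (illustrative numbers, not from the paper).
C = 3                     # capacity of the only leg
FARES = (100.0, 160.0)    # f_j; every product uses one seat (A^j = 1)
WEIGHTS = (1.2, 0.7)      # assumed MNL preference weights
W0 = 1.0                  # assumed no-purchase weight
LAM = 0.8                 # lambda
TAU = 4                   # number of time periods


def offer_sets(x):
    """All S subset of N(x); with no seats left only the empty set remains."""
    items = list(range(len(FARES))) if x >= 1 else []
    return [s for size in range(len(items) + 1)
            for s in combinations(items, size)]


def rhs(v, t, x, offer):
    """Right-hand side of the (EQ) constraint for the triple (t, x, S)."""
    denom = W0 + sum(WEIGHTS[j] for j in offer)
    gain = sum(WEIGHTS[j] / denom
               * (FARES[j] - (v[t + 1][x] - v[t + 1][x - 1]))
               for j in offer)
    return LAM * gain + v[t + 1][x]


def solve_dp():
    """Value function of the dynamic program, stored as v[t][x]."""
    v = {TAU + 1: [0.0] * (C + 1)}
    for t in range(TAU, 0, -1):
        v[t] = [max(rhs(v, t, x, s) for s in offer_sets(x))
                for x in range(C + 1)]
    return v


def min_slack(v):
    """Smallest value of lhs - rhs over all (EQ) constraints."""
    return min(v[t][x] - rhs(v, t, x, s)
               for t in range(1, TAU + 1)
               for x in range(C + 1)
               for s in offer_sets(x))


v_star = solve_dp()
EPS = 1.0
# Shift period t by (tau + 1 - t) * EPS: inventory differences are unchanged,
# so every constraint gains slack EPS while the objective rises by tau * EPS.
v_shift = {t: [val + (TAU + 1 - t) * EPS for val in vals]
           for t, vals in v_star.items()}

print(min_slack(v_star), min_slack(v_shift))
print(v_star[1][C], v_shift[1][C])
```

The second value function is feasible for (EQ) but not optimal, which illustrates why minimizing v_1(c) over the feasible set singles out the dynamic programming value at the initial state.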

Solution via column generation

The problem (P) has $O (τ \prod_{i = 1}^{m} (c_{i} + 1) 2^{n})$ variables and, for realistic network sizes, cannot be solved in moderate time unless techniques such as column generation are used to deal with the problem size. This method builds on the observation that, for large problems, most columns never enter the basis matrix and therefore do not need to be stored. The main task is then to find the next column to enter the basis without having to generate the whole coefficient matrix.
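To get a feel for the growth rate in the bound above, the expression can be evaluated directly. The network dimensions below are hypothetical and chosen only to illustrate the order of magnitude; the function name is an assumption as well.

```python
from math import prod


def column_count_bound(tau, capacities, n_products):
    """Evaluate tau * prod_i (c_i + 1) * 2^n for given problem dimensions."""
    return tau * prod(c + 1 for c in capacities) * 2 ** n_products


# Hypothetical small network: 100 periods, 3 legs with 10 seats, 8 products.
small = column_count_bound(tau=100, capacities=[10, 10, 10], n_products=8)
print(f"{small:,}")  # 34,073,600

# Doubling the number of products multiplies the bound by 2^8 = 256.
larger = column_count_bound(tau=100, capacities=[10, 10, 10], n_products=16)
print(larger // small)  # 256
```

Even this small example runs into tens of millions of columns, which is consistent with the remark that the full coefficient matrix should never be generated explicitly.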

Policies

In this section, we address the question of how the solution to (D) can actually be used to obtain a control policy that tells us which set of fares S to offer at any given time t and state x of the network. We again use the notation r(x_i) to denote the inventory range of x_i on leg i for an arbitrary aggregation.
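As an illustration of the range map r(x_i), the sketch below looks up the inventory range of each leg and feeds range-dependent bid prices into a simple offer-set rule. Everything beyond the idea of r(x_i) is an assumption made for this example: the breakpoints, the bid price values, the MNL weights, and the rule itself (offer the set that maximizes expected fare net of bid prices), which is a generic choice-based bid price heuristic and not necessarily the policy defined in the paper.

```python
from bisect import bisect_right
from itertools import combinations

# Hypothetical aggregation: each list holds the first inventory level of a
# range, so with capacity 10 the ranges are {0}, {1, 2, 3} and {4, ..., 10}.
BREAKPOINTS = {0: [0, 1, 4], 1: [0, 1, 4]}

# Assumed bid prices per leg and range for one fixed time period. The entry
# for range 0 is never used because sold-out legs are filtered out below.
BID = {0: [0.0, 200.0, 40.0], 1: [0.0, 90.0, 30.0]}

FARES = (100.0, 150.0, 220.0)   # f_j (illustrative)
LEGS = ((0,), (1,), (0, 1))     # legs used by each product
WEIGHTS = (1.0, 0.8, 0.6)       # assumed MNL preference weights
W0 = 1.0                        # assumed no-purchase weight


def r(leg, x_i):
    """Index of the inventory range that contains level x_i on this leg."""
    return bisect_right(BREAKPOINTS[leg], x_i) - 1


def offer_set(x):
    """Offer set maximizing expected fare net of range-dependent bid prices."""
    candidates = [j for j in range(len(FARES))
                  if all(x[i] >= 1 for i in LEGS[j])]
    net = {j: FARES[j] - sum(BID[i][r(i, x[i])] for i in LEGS[j])
           for j in candidates}
    best, best_val = (), 0.0    # offering nothing earns zero
    for size in range(1, len(candidates) + 1):
        for s in combinations(candidates, size):
            denom = W0 + sum(WEIGHTS[j] for j in s)
            val = sum(WEIGHTS[j] * net[j] for j in s) / denom
            if val > best_val:
                best, best_val = s, val
    return best


print(r(0, 3), r(0, 4))      # 1 2
print(offer_set((10, 10)))   # (1, 2)
print(offer_set((2, 10)))    # (1,)
```

In the second call, leg 0 has dropped into a scarcer range with a higher assumed bid price, so the products using that leg are closed. This is the kind of inventory sensitivity that a purely time-dependent bid price cannot express.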

Numerical results

In this section, we present the results of numerical experiments that shed light on the quality of the upper bounds and the performance of the policies obtained from our approach, compared with the above-mentioned alternative approaches. We consider TISA with different aggregations. The rationale is that we intend to demonstrate the gains obtainable by splitting up the inventory while balancing the computational effort required to solve (P). Our numerical examples provide a framework of what

Conclusion and future research

In the context of quantity-based network revenue management, we presented a linear programming approach to approximate dynamic programming with a nonlinear approximation of the value function. Its specific feature is that it incorporates both customer choice behavior and estimates of marginal capacity values that depend on time and resource inventory level. As a result of the improved approximation, we obtain a better estimate of the opportunity cost, which is reflected in provably

References (33)

L. Chen et al. Mathematical programming models for revenue management under customer choice. European Journal of Operational Research (2010)
D. Zhang et al. Pricing substitutable flights in airline revenue management. European Journal of Operational Research (2009)
D. Adelman. Dynamic bid prices in revenue management. Operations Research (2007)
Belobaba, P.P., Hopperstad, C., 1999. Boeing/MIT simulation study: PODS results update. In: 1999 AGIFORS Reservations...
P.P. Belobaba et al. Comparing decision rules that incorporate customer diversion in perishable asset revenue management situations. Decision Sciences (1996)
D.P. Bertsekas et al. Neuro-Dynamic Programming (1996)
E.A. Boyd et al. The science of revenue management when passengers purchase the lowest available fare. Journal of Revenue and Pricing Management (2004)
S.P. Boyd et al. Convex Optimization (2004)
S. Brumelle et al. Allocation of airline seats between stochastically dependent demands. Transportation Science (1990)
J. Camm et al. Cutting Big M down to size. Interfaces (1990)
W.-C. Chiang et al. An overview of research revenue management: current issues and future research. International Journal of Revenue Management (2007)
D. de Farias et al. The linear programming approach to approximate dynamic programming. Operations Research (2003)
D.P. de Farias et al. On constraint sampling in the linear programming approach to approximate dynamic programming. Mathematics of Operations Research (2004)
Farias, V.F., Van Roy, B., 2007. An approximate dynamic programming approach to network revenue management, working...
Gallego, G., Iyengar, G., Phillips, R., Dubey, A., 2004. Managing flexible products on a network. Technical Report...
S. Kunnumkal et al. A refined deterministic linear program for the network revenue management problem with customer choice behavior. Naval Research Logistics (2008)


Innovative Applications of O.R.
