Predictability and identifiability assessment of models for prostate cancer under androgen suppression therapy

The past two decades have seen the development of numerous mathematical models to study various aspects of prostate cancer in clinical settings. These models often contain large sets of parameters and rely on limited data sets for validation. The quantitative analysis of the dynamics of prostate cancer under treatment may be hindered by the lack of identifiability of the parameters from the available data, which limits the predictive ability of the model. Using three ordinary differential equation models as case studies, we carry out a numerical investigation of the identifiability and uncertainty quantification of the model parameters. In most cases, the parameters are not identifiable from time series of prostate-specific antigen, which is used as a clinical proxy for tumor progression. It may not be possible to define a finite confidence bound on an unidentifiable parameter, and the relative uncertainties in even identifiable parameters may be large in some cases. The Fisher information matrix may be used to determine identifiable parameter subsets for a given model. The use of biological constraints and additional types of measurements, should they become available, may reduce these uncertainties. Ensemble Kalman filtering may provide clinically useful, short-term predictions of patient outcomes from imperfect models, though care must be taken when estimating “patient-specific” parameters. Our results demonstrate the importance of parameter identifiability in the validation and predictive ability of mathematical models of prostate tumor treatment. Observing-system simulation experiments, widely used in meteorology, may prove useful in the development of biomathematical models intended for future clinical application.


Introduction
By 1941, Huggins and Hodes [1] had demonstrated that castration induces the regression of prostate tumors, which implies that the growth of prostate cancer cells is highly dependent on androgen (male sex hormones).Their work showed, perhaps for the first time, that some cancers potentially could be clinically treatable by chemical means, and Huggins shared the 1966 Nobel Prize in Medicine and Physiology for this discovery [2].Nowadays, hormone therapy-specifically, androgen suppression therapy-is a common treatment for localized and locally advanced prostate cancer and has been shown to have a significant survival benefit, albeit with considerable side effects [3].Typically, continuous androgen suppression (CAS) therapy is given until the tumor evolves resistance, at which point the patient's prognosis becomes unfavorable [4,5].
Intermittent androgen suppression (IAS) therapy aims to reduce the side effects of CAS and potentially delay the onset of hormonal resistance [6].During IAS, the patient is put on treatment until his blood serum level of prostate-specific antigen (PSA, a biomarker for prostate cancer) falls below a predetermined lower threshold.Treatment is halted until the patient's PSA level later rises above a predetermined upper threshold, when it is restarted.The on-off cycles are repeated until the tumor becomes resistant.Although IAS reduces the side effects of treatment, its ability to forestall treatment resistance is not yet clear [7].
Over the last decade, various mathematical models have been developed to describe the dynamics of prostate cancer under IAS, as measured by clinical time series of blood serum levels of PSA, and possibly also of androgen, of individual patients in clinical trials.The models are clinically validated insofar as initial conditions and rate parameters can be selected so that the numerical solutions reproduce the observed time series with reasonable fidelity.Several such models are described in Section 1.2.

Motivating example
One of the main clinical problems in any individual patient case, and one for which a validated mathematical model of prostate cancer might provide insight, is whether another round of IAS therapy will be helpful.Thus it is reasonable to ask, if a given mathematical model accurately reproduces past observations of PSA, can it predict future observations with an acceptable degree of reliability?If not, then why not?
To illustrate, we consider a particular patient in a clinical trial of IAS whose serum PSA levels are measured approximately every 30 days.This patient starts treatment on day 0, goes off treatment around day 300, restarts around day 600, and so on for five intervals of approximately 300 days each.Using a fitting procedure described in Section 2.4, parameters for two models, which we call Model H and Model P in Section 1.2, are selected to provide "optimal" values of the model parameters, insofar as each model solution, starting from a fixed initial condition and integrated for 900 days, yields the smallest root-mean-square difference with the data.In addition, we vary the relative value of one of the model parameters by up to 10 percent from the "optimal" one.
Figure 1 shows the results.The solid curves in each panel show the respective model's solutions obtained from both the "optimal" parameters and from varying one of the parameters slightly as just described.The models are integrated for 1500 days from a suitable initial condition to demonstrate 900 days of approximate agreement with clinical measurements of PSA (the "fitted" interval) and another 600 days of predicted PSA levels (the "predicted" interval), assuming that the last cycle of IAS therapy Results of fitting model parameters to approximately 900 days of clinical measurements of PSA, and the corresponding model predictions, for two models of prostate cancer dynamics under IAS for the same patient.Red dots: clinical measurements.Solid curves: model solutions for "optimal" parameters chosen according to the procedure described in Section 2.4 and for a parameter value with a relative difference of 10% from optimal.(a) Model H; (b) Model P. begins around day 1200.Although Model P (Figure 1(b)) reproduces the observed PSA levels through day 900 better than Model H, its predictions of PSA levels for days 900-1500 are much worse.Both Model H and Model P significantly overestimate-by factors of greater than 2 and 4, respectively-the observed PSA level in the second off-treatment interval, depending on the choice of parameter.One choice of parameter vector for Model P yields a pretty good prediction of the effect of the next cycle of treatment, but the others do not, even though all the choices accurately reproduce the observations over the "fitted" interval.
We investigate two principal mathematical questions in this paper.First, can the parameters of a given model be estimated reliably from available clinical data, which typically consists of serial measurements of serum PSA?This question motivates our investigation into parameter identifiability, which includes issues of structural and practical identifiability [8,9].Second, what are some useful approaches to quantify the uncertainties in parameter estimates and the reliability of "patient-specific" forecasts of treatment response, based on a particular model?This question motivates our use of an ensemble Kalman filter (EnKF), which is widely used for data assimilation and parameter estimation in weather forecasting and similar predictive settings.
Our main conclusions are that, while parameters for a deterministic model may reproduce clinical measurements well, they may be subject to large uncertainties, as may the predictions of such a model.Additional types of clinical measurements, if available, may help constrain the values of more parameters.To be clinically practicable, model design must take into consideration the nature of the available observations.The next sections provide an overview of some dynamical models of IAS therapy and some background on parameter identifiability and uncertainty quantification.

Mathematical models of prostate cancer treatment
Jackson [10] laid the groundwork for several data-driven mathematical models for prostate cancer [11,12,13].The three models considered in this paper all assume that the tumor is composed of two or more subpopulations of cells that differ in their degree of androgen dependence.However, the models differ in the way they represent the biological details, particularly the role in which androgen fuels the growth of the tumor cells.[14,15,16] introduced a piecewise-linear model ("Model H") of the dynamics of prostate cancer under IAS therapy.Model H proposes that there are three cancer cell subpopulations, x 1 , x 2 , and x 3 , that represent, respectively, treatment-sensitive, -insensitive, and irreversibly treatmentinsensitive cells.It has been used to study parameter estimation and optimal control of IAS therapy as well as forecasting of treatment resistance and uncertainty quantification [14,17,18,19].
Model H is constructed by assuming that the rates of biphasic increase and decrease of PSA levels during the respective off-and on-treatment intervals are linear combinations of the tumor cell subpopulations.At any given instant, cells in subpopulation j grow at the rate w j j x j ; cells transition from subpopulation j to k at the rate w k j x j , j, k ∈ {1, 2, 3}, and w 3 j = 0, j = 1, 2. There are two versions of each parameter: w 0 jk during the off-treatment intervals and w 1 jk during treatment.During the on-treatment intervals, Model H takes the form This form implicitly takes into account the lack of androgen, which retards the growth of the x 1 and x 2 subpopulations and also selects for the fully treatment-resistant phenotype, x 3 .
During off-treatment intervals, Model H assumes that the partially treatment-resistant (x 2 ) subpopulation reverts back to the sensitive one (x 1 ) at a certain rate, while the growth of the x 3 population depends only on its present size: Model H also assumes that the level of serum PSA at time t is proportional to the sum of the three tumor subpopulations, For simplicity, the units of tumor cell populations and PSA level are scaled such that α = 1.Although the validity of the biological assumptions in Model H is debatable, it reproduces clinical time series of PSA data well [14,16,20].Model H has 10 parameters and 3 initial conditions that must be estimated from PSA data (androgen is not explicitly incorporated).There are no a priori constraints on the parameters, other than clinically realistic bounds on the net tumor growth rate.

Model P: The Portz et al. model
In contrast to the phenomenological approach in Model H, Portz, Kuang, and Nagy [12] have formulated a mechanistic model ("Model P"), inspired by Droop's nutrient-limiting theory [21], that regards androgen as the limiting nutrient for cancer growth and transformation.Model P has inspired several variants [9,20,22,23,24,25], but we focus on the original formulation here.Model P presumes that the tumor is comprised of treatment-sensitive and -insensitive prostate cancer cells, each of which requires a minimum level of androgen, called the cell quota.When the available androgen falls below the cell quota, the respective subpopulation declines.Cells may mutate from one subpopulation to the other in response to selection pressure from IAS therapy.
Model P contains explicit equations that govern the intracellular androgen levels, Q 1 and Q 2 , of the treatment-sensitive and -insensitive subpopulations x 1 and x 2 , respectively.Androgen data is used to interpolate values for the serum androgen function A(t), which acts as an external forcing to represent the on-and off-treatment cycles.The growth and transformation rates for both cell subpopulations depend on the intracellular androgen levels, while the death rates are constant [12].
Model P is defined by the following system of differential equations: x 1 androgen uptake − bQ 1 degradation (1.6) baseline PSA production The cell quota for each subpopulation is q j , the death rate is d j , and the mutation rate from subpopulation i is λ j (Q j ) = K j /(Q j + K j ), j = 1, 2. Since x 2 is the less treatment-sensitive subpopulation, q 2 < q 1 .The PSA production rate depends on the tumor cell subpopulations.Further details about the development of this model are found in [12].
Model P contains 21 parameters and a 5-dimensional state vector.It is possible to find parameters so that model solutions replicate clinically observed time series of serum PSA levels [12,20].While the PSA data is utilized for estimating the model parameters, the available androgen data is used only for the interpolation of A(t).The parameters in Model P are biologically meaningful, and ranges for most of them can be found in the literature.

Model T: the Phan et al. model
One difficulty with Model P is that it cannot reproduce PSA relapse under CAS [26] within the parameter ranges given in [12].The problem may arise from the assumption that the cell subpopulations can mutate from one to the other [9].To address these issues, Baez and Kuang [22] introduced a simplification of Model P, which has been further refined into Model T, below.(See [24] for the derivation.)Key changes include the addition of an irreversibly treatment-resistant tumor subpopulation (similar to that of Model H) and the addition of a compartment for serum androgen, A: androgen uptake (1.11) suppression of production (1.12) tumor PSA production where the respective androgen-dependent death and mutation rates are D j (Q) = d j R j /(Q + R j ), j = 1, 2 and λ(Q) = cK/(Q + K).The unit step function u(t) is 1 when when the patient is on treatment and is 0 otherwise; the treatment time intervals are determined directly from the data.Model T has 21 parameters and a 5-dimensional state vector.Biologically realistic estimates of the parameters are given in Table 1.In this paper, we fix the parameters c = 0.00015, K = 1, δ 1 = 5, δ 2 = 5, and γ 2 = 0.005 in all simulations, as their values have relatively little effect on the model output [24].For simplicity, we also set µ 1 = µ 2 = µ m , which leaves a model with 13 adjustable parameters that we call T-13.Based on the results of a prior a sensitivity analysis [24], we also consider a version of Model T, which we call T-5, in which all but 5 of the parameters are fixed.(The adjustable parameters in Eqs.1.9-1.13 in Model T-5 are µ m , q 2 , d 1 , γ 1 , and A 0 .)The parameter estimation and time series prediction procedures processes described in Section 2 are carried out on versions T-13 and T-5 using time series of serum PSA and androgen levels.

Materials and methods
As described in the introduction, we are concerned with two main questions: the reliability with which model parameters can be estimated from the available data, and the trustworthiness of model predictions after the parameters have been selected to maximize the agreement between model observables and prior clinical measurements of PSA.In this section, we briefly survey the data sources that we use and the mathematical approaches that we apply to each question.Table 1.Estimated biologically realistic ranges of the parameters for Model T. The values of parameters with a single asterisk are fixed in all simulations described here (see Section 1.2.3), and column T-5 shows the values of the parameters that are fixed in Model T-5.Parameters with double asterisks may vary considerably between patients [24].

Data
The data for this study comes from a clinical trial at the Vancouver Prostate Center, which admitted patients who demonstrated a rising serum PSA level after they received radiotherapy and had no evidence of metastasis [27].The study admitted patients who had not had hormonal suppression therapy, with the exception of less than 3 months of neoadjuvant androgen suppression, and who experienced PSA relapses after radiotherapy with no evidence of distant metastasis (e.g., bone metastasis) [27].The study contains patients with high serum PSA levels prior to androgen suppression therapy (i.e., PSA > 6 µg/L).
The data used in the parameter-fitting and prediction efforts in this paper come from two disjoint subsets totaling 54 patients, for whom we have approximately monthly measurements of serum PSA and androgen levels.One subgroup of 28 patients has data for at least one cycle of androgensuppression treatment, and the second subgroup of 26 patients, 2.5 cycles.We do not have imaging data or other independent measures of tumor volume, nor any biopsy, genomic, or cell receptor information from these patients.

Parameter identifiability and associated metrics
Each of the models considered here defines a vector field F of an m-dimensional state vector x(t) that depends upon a fixed q-vector of parameters p.At each measurement time, the clinical observations are assumed to be a function H : R m → R s of the tumor state; in this paper, s = 1 or 2 depending on whether both androgen and PSA levels are observed.We may represent the mathematical setup as a system of the form Here y(t, p) represents the observables (predicted observations) from the given initial condition and parameter vector at time t.We represent by the s-vector y obs (t) the actual data (observations) at selected measurement times.If any component of p is an implicit function of some of the others, then, in general, different values of p yield the same solution vector x(t, p) for a given initial condition.In such a case, p is said to be structurally non-identifiable because p cannot be estimated uniquely from a given set of observables.In general, it is not possible to define a confidence interval for numerical estimates of structurally non-identifiable parameters.
If no implicit functional relationship exists between the elements, then p is structurally identifiable [28,29].Practical identifiability refers to the question of whether the components of p can be estimated with suitably small confidence bounds from observations with noise of finite variance.Structural identifiability is a necessary condition for practical identifiability [30,31,32,33].

The Fisher information matrix
The Fisher information matrix (FIM) measures the sensitivity of the model output at measurement times t n with respect to the model's parameter vector.It can detect combinations of identifiable parameters [34,35,36].Given a set of measurements at N distinct time points and an ODE model with an m-dimensional state vector and q-vector of parameters, the Nm × q sensitivity matrix S consists of N time-dependent m × q blocks A(t n ): . . . (2. 3) The q × q Fisher information matrix is M = S T S. (In practice, each A jk must be approximated numerically.)The parameter vector p is structurally identifiable if and only if M has full rank [31].The rank of M corresponds to the number of identifiable parameter combinations in p.The condition number of M is a measure of the practical identifiability of the parameters: if the condition number is large, then small amounts of measurement noise can render reliable estimation of p impossible.The Cramér-Rao bound covariance matrix is C = M −1 .The diagonal element C j j is the asymptotic variance of the jth parameter; C 1/2 j j is the standard deviation (S D j ) of p j .The coefficient of variation [34] for the jth parameter is If CV j ≥ 1, then the uncertainty in p j is at least as great as its value, which may be considered a useful upper bound for practical identifiability.If CV j is large (say, CV j > 10 4 ), then p j simply may not be estimable from the data, which renders it structurally unidentifiable [8,34].

The profile likelihood method
The available observations in this study are time series of serum PSA, and, in some cases, of serum androgen, so the observation vector y obs (t n ) has at most 2 components (with respective standard errors σ jn ).For a fixed time series of such observation vectors, and a given set of observables y(t, p) (Eq.2.9), the negative log likelihood function takes the form where p is the model parameter vector.There are N measurement time points and s observations at each time point (s = 1 or 2 for the models discussed here).When the measurement variances σ jn are all equal to the same known constant, the minimization of −LL is equivalent to the minimization of the third term on the right-hand side of Eq. 2.5, and, thus, parameter estimation with the negative log likelihood is equivalent to least-squares minimization.(We take σ jn = 1 for the PSA and androgen time series used in this paper.) The profile likelihood is computed by fixing one element of p and choosing the remaining elements over appropriate ranges to minimize Eq. 2.5.The profile likelihood PL j is computed as where all but the jth component of p is varied over an appropriate interval.The confidence interval α for the profile likelihood of parameter p j is For a sufficiently large data set, ∆(α) takes the following form asymptotically: Here icdf(χ 2 , α, d f ) is the inverse cumulative distribution function for χ 2 with α-quantiles (e.g., α = 0.95 for a 95% confidence interval) and d f degrees of freedom, which can be taken as 1 when all but one element of the parameter parameter vector is fixed, or as q for a simultaneous confidence interval for all components of the model parameter vector [8,29,37].

Uncertainty quantification: the ensemble transform Kalman filter
The ensemble Kalman filter (EnKF) is a common approach to data assimilation [38,39].It is used to estimate the m-dimensional state vector of a dynamical model given a set of noisy observations that can be regarded as a "predictor-corrector" scheme.
The simulations presented here use the ensemble transform Kalman filter [40], which is one of several possible square-root filters.Details of the implementation are given in the appendix, but we briefly summarize the approach here.Starting with a set of K initial conditions at t n , we integrate the model to time t n+1 to produce a background ensemble x(t n+1 ) b(k) K k=1 .Let y obs (t n+1 ) denote the vector of measurements at t n+1 .(Depending on the time series, the measurement is either a 1-vector of serum PSA level or a 2-vector of serum PSA and androgen levels.)Here we assume that the observables (i.e., predicted observations) are related to the model state vector by the function H, sometimes called the forward operator, as y(t n+1 ) = H(x(t n+1 )). (2.9) In the models considered in this paper, H may be a linear (e.g., Eq. 1. 3) or a nonlinear (e.g., Eqs.1.8 and 1.13) function of the model state vector.
The objective of the ensemble transform Kalman filter is to find a linear combination of the elements of the background ensemble that best match the data, in the sense that the weight vector w minimizes the sum of squares where is the ensemble mean, X b is the m× K ensemble perturbation matrix whose ith column is is the covariance matrix of the errors in the observations, and H is the forward operator associated with the model.(The ensemble mean and perturbation matrix vary with time, as may H and R, but the explicit time dependence is omitted here to simplify the notation.)The filter produces the analysis ensemble {w a(k) } K k=1 , which in model space becomes (2.12) (See the appendix for further details.)The analysis ensemble defined in Eq. 2.12 forms a new set of initial conditions, which are then integrated to the next observation time t n+2 , and the process is repeated until the end of the data series.
The process begins with a preselected set of initial conditions at t 0 .The results described below include patients for whom there are at least 20 observation time points.
State augmentation.The Kalman filter can be used to estimate q model parameters with the rest of the m-dimensional state vector by considering the (m + q)-dimensional vector x * = (x, p) T that solves the augmented system

.13)
In our case, the associated forward operator is H * = (H, 0) T because the parameter vector is not observed directly.Here G can be any model for the evolution of the parameter vector p.In the simplest case, which is considered here, the parameters are assumed constant over the course of treatment, so G = 0, but a more sophisticated model of parameter drift could be used.

Mean-squared-error parameter estimation
Our metric for the agreement between the observations and the model observables at the nth measurement time, n = 1, 2, . . ., N is the mean squared error (MSE) defined by where σ jn is the variance of the measurement error in the kth component of the observation vector at time t n .(We take σ jn = 1 for the data used in this paper.)If the measurement errors are normally distributed, then E has a χ 2 distribution, and minimizing E is equivalent to maximum likelihood estimation [29].In the numerical investigations described in Section 3, the parameter estimation is carried out using the MATLAB built-in function fmincon.
Because the exact analytical expressions for the solutions of the models in question are difficult or impossible to find, the blocks of the sensitivity matrix S (Eq.2.3) are estimated numerically at the time of each clinical measurement.We illustrate the approach first with Model H, Eqs.(1.1)-(1.2),which has 10 parameters.
Initially, the total tumor cell population (x 1 + x 2 + x 3 ) in Model H numerically equals the first clinical measurement of serum PSA level.For parameter identification purposes, we fix the initial cell populations x 1 (0), x 2 (0), and x 3 (0) to be 0.95%, 0.05%, and 0%, respectively, of the initial PSA measurement.For our numerical computations, we use the time series of serum PSA data from Patient 1 in the Vancouver Prostate Center public dataset [27] (any patient will do).Clinical measurements are taken approximately at monthly intervals; P(t 0 ) is the initial clinical measurement of PSA level at the start of treatment, and P(t n ), the nth subsequent measurement.We select the parameters to minimize E(x 0 , p), Eq. (2.14), using the built-in MATLAB function fmincon.
All 10 parameters are minimized simultaneously using the time series of PSA data from the first 1.5 cycles of treatment (i.e., the initial on-off-on treatment intervals).The allowed parameter ranges for Model H, used as constraints in fmincon, are taken from [20].The output is an estimated parameter vector, for which a model integration starting from the initial condition above produces observables (i.e., predicted serum PSA levels, Eq. 1. 3), that minimize E(x 0 , p).
An analogous procedure is performed for the 21 parameters of Model P, using the initial condition x 1 (0) = 99, x 2 (0) = 1, Q 1 (0) = 0.5, Q 2 (0) = 0.5.The initial PSA level, P(0), is taken as the first clinical measurement of PSA.The process is similar for Model T using the initial conditions in Section 3.2.

Numerical estimation of the Fisher information matrix
After the parameter estimates are obtained from fmincon, we estimate the uncertainty in each parameter as follows.At time t 0 , i.e., at the time of the first clinical measurement of PSA after treatment begins, we perturb the first parameter, p * 1 = w 1 * 1,1 , to generate two new values, w 1 * ,± 1,1 = (1 ± 0.01)w 1 * 1,1 , from which we integrate the model to the first measurement time, t 1 .Then we approximate 3) by finite differencing the two solutions at t 1 with respect to the jth component of the model state vector, j = 1, 2, 3. We proceed likewise for w 1 * 2,1 through w 1 * 3,3 to estimate the first block A(t 0 ) of the sensitivity matrix.During the on-treatment intervals, the elements of A corresponding to parameters w 0 * 1,1 through w 0 * 3,3 are zero.This process begins anew at t 1 , starting with the state vector x(t 1 , p * ), to estimate A(t 1 ).When we reach the end of the first on-treatment interval at t n 1 , the process continues with components w 0 * 1,1 , w 0 * 1,2 , w 0 * 2,2 , and w 0 * 3,3 of p * through the first off-treatment interval at t n 2 ; the elements of A corresponding to w 1 * 1,1 through w 1 * 3,3 are zero.Finally, we proceed as above with parameters w 1 * 1,1 through w 1 * 3,3 for the second on-treatment interval through t n 3 .The numerical estimates of A(t n ), n = 1, . . ., N 3 yield the sensitivity matrix S, from which we compute the Fisher information matrix M = S T S and the Cramér-Rao bound covariance matrix, C = M −1 .

Results and discussion
One naïve way to estimate the uncertainty in model parameters is to run MATLAB's fmincon function to determine a vector of parameters for which the model's observables of PSA agree closely with a given patient's clinical time series.For this purpose, we have used Model T-13 and minimized Eq. 2.14 subject to the biological constraints in Table 1.Such a calculation was run for 28 different patients over their first on-off-on treatment intervals.
Table 2 shows the sample means and variances of the parameters returned by fmincon for this set of patients.The "optimal" parameter values are about the same for each patient for the most part.One may wonder whether the sample variances obtained in this manner are realistic.The analyses in Sections 3.1-3.2suggest that they are not, and, in fact, that the actual uncertainties are much larger than the results in Table 2 might suggest.
In this section, we present the results obtained from studying the identifiability and uncertainty quantification of parameters and predictions from Models H, P, T-13, and T-5.We show that, based on the available time series data, the parameters of models H, P, and T-13 are not practically identifiable.The method of profile likelihood shows that the parameters of Model T-5 are identifiable under the biological constraints given in Table 1.
We also use the ensemble Kalman filter on the associated augmented systems to assess the shortand long-term predictive capability of Model T-5.Although we obtain some promising initial results for many patients, there is evidence of significant model error.

Parameter identifiability of Model H and Model P
Table 3 shows the rank of the Fisher information matrix M obtained using the numerical procedure described in Section 2.5.In particular, the rank of M for Models H and P is 9 and 15, respectively.Because the rank is less than the number of model parameters, we conclude that neither model has a parameter vector that can be identified uniquely from the available observations.
To illustrate one of the implicit functional relationships between some of the model parameters, Figure 2 shows a plot of the profile likelihood (Eq.2.6) between a selected pair of parameters from each of Models H and P. The negative log likelihood function can be minimized by multiple choices of the two selected parameters within the ranges specified on each graph.The two parameters are related by an implicit function within and beyond the indicated bounds on the axes, and it is not possible to associate a confidence interval to them.These results suggest that the parameters in Models H and P are structurally unidentifiable when the only observables are time series of PSA levels.
Next we turn to the uncertainties in the parameters returned by fmincon, as estimated from the Crámer-Rao covariance matrix, M −1 .Table 4 shows the resulting coefficients of variation (Eq.2.4).The coefficient of variation of all parameters except w 1 1,1 in Model H and µ m in Model P is greater than unity (i.e., the uncertainty is at least as large as the estimated parameter value).Thus, the parameters are practically unidentifiable given the available clinical time series.
The numerical results presented here are based on time series from one representative patient (Patient 39) in the Vancouver Prostate Center clinical dataset; analogous results are obtained for other patients.Although it is possible to find estimates of the parameter vector for each of Model H and P that provide reasonably good agreement with clinical observations, such estimates are not unique, and they may be subject to large relative errors.The uncertainties in the parameters lead to unreliable predictions for the future clinical course of individual patients (Figure 1).

Parameter identifiability in Models T-13 and T-5
The numerical estimation of the Fisher information matrix for these models proceeds in the same manner as described in Section 2.5.We fix the initial values of x 1 and x 2 to 0.01 and 0.0001, respectively, and A(0) and P(0) are taken as the first value of recorded androgen and PSA data; we assume that Q(0) = 0.90A(0).The results shown in Table 3 indicate that Model T-13 has 2 unidentifiable parameters, as the rank of the Fisher information matrix is 11.In contrast, the rank of the Fisher information matrix is 5 for Model T-5, so all of the adjustable parameters are identifiable.
The coefficients of variation in Table 4 indicate that estimates of all but two of the parameters in Model T-13 are subject to large uncertainties.In contrast, 4 of the 5 adjustable parameters in Model T-5 may be practically identifiable.
The computations reported in Table 4 do not account for biologically realistic bounds on the parameters.To assess the practical identifiability of model T-5 with respect to the biological constraints in Table 1, we use the clinical time series from Patient 39 in the Vancouver database as a representative case to compute the profile likelihood for all 5 model parameters, shown in Figure 3.Each black curve represents the value of the negative log likelihood, Eq. 2.5, when all parameters are set to the values returned by fmincon (Section 2.4) except for the one that is varied within the interval shown on the horizontal axis.
Also shown in the graphs is the 95% confidence threshold, Eq. 2.7, as a dotted red line.The corresponding confidence interval in the parameter consists of those parameter values for which the profile likelihood lies below the dotted line.In some cases, the confidence interval lies outside a biologically realistic range.For example, only nonnegative values of parameter d 1 (Figure 3(c   Figure 3.The negative log likelihood (vertical axes), Eq. 2.6, of all adjustable parameters in Model T-5.Intervals where the black curves lie below the red dotted line are the 95% confidence intervals, which in some cases lie outside of the biologically realistic ranges indicated in Table 1.
biologically meaningful, yet the left endpoint of the 95% confidence interval is a negative value.The parameter q 2 corresponds to the cell quota for the androgen-independent call subpopulation, and, by hypothesis, is less than q 1 .That the right endpoint of the 95% confidence interval extends beyond the biologically meaningful upper limit for q 2 suggests that q 2 may not be practically identifiable from the available observational data.

Data assimilation for Model T-5
The ensemble Kalman filter (EnKF) provides another way to quantify the uncertainties in model predictions and parameters, provided that the latter are identifiable.As outlined in Section 2.3, the Kalman filter updates the model state vector whenever new observations become available, and the updated state vector becomes the new initial condition for the subsequent time interval.We use an augmented system, Eq. 2.13, for the vector field F defined by Model T, Eqs.1.9-1.13, in the results described here.We consider only version T-5 of this model, as the 5 adjustable parameters form an identifiable subset.Of these 5 parameters, we fix µ m and d 1 , as their values do not vary significantly between patients.Only A 0 , γ 1 , and q 2 are estimated as part of the augmented state vector.In general, a smaller state vector helps to reduce the ensemble variance, all other factors being equal.In this case, however, including q 2 as a parameter to be estimated proves problematic, as we discuss in more detail below.
An open question is a suitable choice of the starting ensemble.The sample variance of the initial conditions should reflect that of the underlying state vector, but we know neither the initial fraction of treatment-resistant cells nor the amount of intracellular androgen in each patient's tumor.Furthermore, the uncertainty in "patient-specific" model parameters may be large.As a guess, we form the ensemble by choosing 100 initial values of the initial state vector from 0.7 to 1.6 times the value of each component of the initial condition x 0 estimated for each patient in [24].The initial ensemble mean values of the parameters A 0 and γ 1 are given by the output of fmincon to minimize Eq. 2.14 for each respective patient, and the standard errors are approximated from the respective 95% confidence intervals in Figure 3. Altogether, the augmented state vector is x * = (x 1 , . . ., x 5 , A 0 , γ 1 , q 2 ) T .We assume that the errors in the observations of serum PSA and androgen levels are independent and have variance 1.
We integrate each initial condition of the 100-member ensemble to the first observation time t 1 , then compute the linear combination of ensemble members that minimizes Eq. 2.10, which becomes the new initial condition.The integration continues to the second observation time t 2 , and the process is repeated until the end of the dataset or until another prespecified ending time.Altogether, we ran the EnKF on time series from 26 patients from the Vancouver Prostate Center database.This set of patients is disjoint from the set used to estimate model parameters in Sections 3.1-3.2.
Figure 4 shows some representative results for the PSA component of the ensemble state vectors for 3 different patients using the EnKF as a data assimilation system.The red curve in each plot shows the ensemble mean value of the PSA level, and the gray curves show each ensemble realization of the PSA levels.(Analogous plots can be made of the other components of the model state vector.) To return to one of the questions asked in the introduction, we consider what happens when the EnKF is run for the first on-off-on treatment intervals to compute an ensemble estimate of the augmented state vector.Using the analysis at the end of the second on-treatment interval, we run the model for a period of 400 to 800 days, depending on the patient, to predict the serum PSA levels for the next cycle, that is, over the next treatment hiatus and subsequent on-treatment interval.The  The panels on the right, Figure 4(d)-(f), show the outputs of the filter as it is run over the entire time series of clinical PSA observations (shown as round circles).At each observation time t n , we compute the mean of the predicted PSA level from the analysis at t n−1 over all 100 ensemble members, i.e., ȳn = 1 100 and, at the end of the time series, the root mean square (RMS) of the one-step ahead prediction error of serum PSA level averaged over all N observation time points is .
In the case of Patient 15, the EnKF produces good agreement with observations throughout the clinical data record.When the forecast/update cycle is halted shortly after day 800 and the model is run freely for the next 500 days or so, the model provides a good prediction of the patient's clinical course during the next off-and on-treatment cycle.When the EnKF is run continuously throughout Patient 15's clinical course, the ensemble variance increases notably toward the end of the second offtreatment interval, and the filter takes 5 or so update cycles for the ensemble mean to settle near the observed clinical PSA values.As mentioned above, the RMS error of the ensemble mean's one-step ahead prediction of serum PSA is about 1.80 µg/L.
The results are similar for Patient 39, panels (b) and (e), although the ensemble mean tends to underestimate the observed serum PSA levels towards the end of each on-treatment interval.As with Patient 15, the EnKF produces an augmented state vector at the end of the second on-treatment interval that yields a reasonable 450-day prediction of Patient 39's clinical course.
Serious problems are evident in the case of Patient 29.It is not clear whether the filter is even convergent after more than a handful of observation time points.How can a filter initialized in the same way as for the other patients produce such poor results?The answer turns on the inclusion of Model T-5's adjustable parameter q 2 , the cell quota for the treatment-resistant tumor cell subpopulation, in the augmented state vector.The analysis in Section 3.2 suggests that q 2 may not be practically identifiable; in any case, its 95% confidence interval as estimated from the profile likelihood includes values that are not biologically meaningful.A similar problem bedevils the augmented ensemble for this patient case.
Figure 5 shows the time evolution of the EnKF's estimates of the adjustable parameters A 0 , γ 1 , and q 2 for Patient 15 (Figure 4(a)).The green curve shows the ensemble mean value of each parameter as the forecast/update cycle proceeds, and the blue starred curves at the bottom and top represent the ensemble minimum and maximum values at each analysis time point.In particular, the EnKF's estimate of q 2 remains within the biologically realistic interval given in Table 1 and is less than the value of q 1 = 0.613 that is fixed for Model T-5.
In the case of Patient 29, however, the ensemble estimates of q 2 are much larger.After a few analysis cycles, some of the ensemble estimates of q 2 exceed q 1 (not shown in the figure).At this point, the dynamical model no longer makes sense, because for these ensemble members, the model includes

Mathematical Biosciences and Engineering
Volume 16, Issue 5, 3512-3536.a treatment-resistant cell subpopulation with a greater cell quota for androgen than the treatmentsensitive one.The forecast ensemble quickly diverges from reality, and of course the model's predictions are not believable.We include this last example as a cautionary note in the application of ensemble Kalman filters.One must be careful that the parameters being estimated have a sufficiently small variance so that their ensemble updates remain within a sensible range for the model under consideration.It is possible to modify the EnKF to add a suitable penalty term to Eq. 2.10 to further constrain the parameter values, but the resulting minimization problem becomes nonlinear and is more expensive to solve numerically.

Conclusions
Prostate cancer has attracted the development of numerous mathematical models to study various aspects of the cancer dynamics under different treatment scenarios, especially in the last two decades.However, deterministic ODE models tend to contain many rate parameters [12,15,22,24].Because clinical data used for parameter calibration typically is limited to time series of serum PSA and possibly also androgen levels, parameter identifiability is a significant issue.It is not possible to quantify the uncertainty in unidentifiable parameters.Furthermore, the ability of the model to fit prior clinical data with suitable choices of parameters does not imply that the model's predictions of the future clinical course can be trusted.
In this paper, we investigate three existing models (H, P, and T) for prostate cancer, each of which has been validated with clinical data [12,15,24].All of the models divide the tumor cell population into two or more classes that differ in their sensitivity to androgen deprivation therapy and presume an autonomous relationship between the size of the tumor cell classes and the rate of serum PSA production.
The limited set of observables leads to unidentifiable sets of parameters in each of Models H, P, and T. By restricting the set of adjustable parameters in Model T using a sensitivity analysis in [24], we show that a 5-parameter version of Model T is identifiable (Table 3) given only clinical time series of PSA levels.Nevertheless, given the available time series, 3 of the 5 identifiable parameters are subject to relative uncertainties of at least 40 percent (Table 4).Models H and P have only one practically identifiable parameter, i.e., only one of the parameters is estimable from the available data with a relative uncertainty of less than unity.
Past is not prologue.Although it may be possible to find a set of parameters so that the model observables agree well with previous clinical measurements, the model's predictions of future time series of PSA need not be reliable.
In some cases, fixing many of the parameters in a complex model may yield an identifiable version that can produce reasonable predictions of PSA dynamics.This is the approach taken with Model T, which contains 21 parameters, all but 5 of which can be fixed to yield an identifiable subset and for which confidence intervals can be constructed using the method of profile likelihood (Section 3.2).However, the confidence intervals may not lie within biologically realistic bounds.Conversely, however, biological constraints may help to reduce the uncertainty in numerical estimates of a given parameter.
The ensemble Kalman filter is a useful way to assimilate data and quantify the uncertainty in model predictions.The EnKF provides a systematic and consistent method to account for new observations and update the associated model state vector accordingly.Nevertheless, for best results, one needs good estimates of the initial uncertainty in the state vector, which are not available for the models considered here.Systematic model errors (biases) can be accounted for in EnKF schemes [41], but we have not pursued this approach here.
Improved models are needed.Resistance to androgen deprivation therapy, or, alternatively, the androgen cell quota, probably exists on a continuum in a typical prostate tumor.The division of the tumor cell population into two or three distinct subpopulations may be too much of an oversimplification for clinically useful predictions.Although the notion of a nutrient-limiting mechanism to tumor growth is biologically reasonable, the dynamics of androgen metabolism probably are not well characterized in Models P and T.More work is needed to improve them.
The development of clinically applicable dynamical models of prostate cancer treatment, and of many other cancers, may benefit from the application of observing-system simulation experiments.OSSEs have been widely used in numerical weather prediction for over half a century.Broadly speaking, in an OSSE, synthetic data is assimilated into a complex dynamical model to assess measures like prediction accuracy (forecast skill) as a function of the accuracy, sparsity, and types of observations.An historical overview may be found in [42]; one early example was an attempt to test whether the wind field of an atmospheric model could be estimated from measurements of temperature alone.Another important application concerns "the potential improvements in climate analysis and weather prediction to be gained by augmenting the present atmospheric observing system with additional envisioned types of observations that do not yet exist" [43].In the present context, OSSEs might be used to quantify errors in parameter estimates from data with various noise levels or to assess the impact of a new assay or other tumor diagnostic on the predictive ability of a given model.In the authors' view, such efforts will be necessary to develop clinically validated mathematical models that will be useful tools for physicians and patients.
At time t n , we begin with an ensemble of initial conditions, which can be regarded as a sample drawn from a normal distribution about a mean state that we wish to estimate.A forecast (often called the background) is made by integrating the model of interest from each ensemble member to time t n+1 , at which point additional measurements have become available.The EnKF seeks a linear combination of the forecasts that best fits the data in a weighted least-squares sense; the linear combination serves as a new set of initial conditions, called the analysis, at t n+1 from which the model is integrated to time t n+2 , when the process is repeated.The mean of the analysis ensemble is the updated estimate of the "true" state of the system, and the sample covariance represents the uncertainty in the analysis mean.
All the computations below are done at t n+1 , so we omit the time dependence to simplify the notation.Let {x b(k) } K k=1 be the set of K state vectors (each of dimension m) that represent the background ensemble, and let x b denote the ensemble mean, Eq. 2.11.Let X b be the m × K matrix whose kth column is x b(k) − x b , k = 1, . . ., K. If w is a Gaussian K-vector with mean 0 and covariance (K − 1) −1 I, then x = x b + X b w has mean x b and covariance (K − 1)X b X T b , which, in model space, is the ensemble estimate of the forecast uncertainty.
Let y obs be the s-vector of observations, which is assumed to be related to the "true" model state vector x t by y obs = H(x t ) + , ( where is a Gaussian random s-vector of mean 0 and covariance R (cf.Eq. 2.9).The ensemble transform Kalman filter minimizes the cost function given in Eq. 2.10, where, as necessary, we approximate a nonlinear forward operator H as As mentioned above, the analysis ensemble is constructed from a linear combination of the background ensemble members.The first step in this procedure is to compute the symmetric square root W a = (K − 1) P a 1/2 .(.8) One rationale for the symmetric square root in Eq. .8 is that the elements of the resulting matrix vary continuously with the observational data.The analysis ensemble {w a(k) } K k=1 is formed by adding w a to each column of W a ; in model space, the analysis ensemble is x a(k) = x b + X b w a(k) , k = 1, . . ., K. (.9) Further details on the derivation of the ensemble transform Kalman filter are given in [40].
In the results described in the paper, we apply the ensemble transform Kalman filter to the augmented dynamical system defined by Eq. 2.13.Let F = (F, G) T denote the augmented vector field (F is the vector field representing the ODE model in question, and G is the model for the time evolution of the parameters).The (m + q)-dimensional state vector of the augmented system is x * = (x, p) T .
The forecast-update cycle may be expressed in the following steps: 1. Begin with the analysis ensemble at t n , {x n a(k) } k k=1 with the associated analysis covariance matrix P n a , and set each x a(k) , k = 1, . . ., K as the initial condition for the next forecast.2. Integrate the model F(t, x * ) from t n to t n+1 for each initial ensemble member x n a(k) , k = 1, . . ., K. The K solutions now become the background ensemble at time Figure1.Results of fitting model parameters to approximately 900 days of clinical measurements of PSA, and the corresponding model predictions, for two models of prostate cancer dynamics under IAS for the same patient.Red dots: clinical measurements.Solid curves: model solutions for "optimal" parameters chosen according to the procedure described in Section 2.4 and for a parameter value with a relative difference of 10% from optimal.(a) Model H; (b) Model P.

Figure 2 .
Figure 2. Profile likelihoods from selected pairs of parameters in Models H and P. The color coding represents the values of the profile likelihood, Eq. 2.6, as a function of each parameter.(a) Profile likelihood obtained from Model H during an off-treatment interval when fixing w 0 1,2 and w 0 3,3 (cf.Eq. 1.2).(b) Similarly for Model P (Eqs.1.4-1.8),when all parameters are fixed except for µ m and c 1 .

Figure 4 .
Figure 4. Representative ensemble Kalman filtering results from slightly different initial ensembles for three patients.Black dots: clinical observations.Gray curves: individual ensemble solutions.Red curves: ensemble mean.Left panels, (a)-(c): the Kalman update is run for the first on-off-on treatment intervals for each patient, marked "filtered," and the last analysis update ensemble is used as initial conditions for a free run of the model for the next off-on treatment interval, marked "predicted."Right panels, (d)-(f): the ensemble Kalman filter is run for the patient's entire clinical course.

Figure 5 .
Figure 5. Augmented ensemble Kalman filter parameter estimates of Patient 15 from Model T-5. Green curve: ensemble mean; blue curves: minimum and maximum ensemble values.

H 1 (. 5 )
(x b + X b w) ≈ y b + Y b w, (.2)where the s-vector y b is the mean of the background observation ensembley b(k) = H(x b(k) ), k = 1, . . ., K (.3)and the kth column of thes × K matrix Y b is y b(k) − y b .The minimizer of Eq. 2.10 is found fromw a = P a Y T b R −1 (y obs − y b ) (.4) P a = (K − 1)I + Y T b R −1 Y b −which in model space yields the analysis mean and covariancex a = x b + X b w a (.6) P a = X b P a X T b .(

. 3 .
t n+1 : x n+1 b(k) K k=1 Interpolate between the background ensemble and observations at time t n+1 by applying the forward operator H * to each of the background ensemble member and form a new ensemble y b(k) = H * (x b(k) ), k = 1, . . ., K. 4. Minimize the Kalman filter cost function, Eq. 2.10, of the current time step to produce the analysis at time t n+1 : conditions for the next background ensemble and repeat the steps.c 2019 the Author(s), licensee AIMS Press.This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0) 1.2.1.Model H: the Hirata et al. model Hirata et al.

Table 2 .
Sample mean and variance of parameter vector components of Model T-13 from running fmincon on a set of 28 patients from the Vancouver Prostate Center over their first on-off-on treatment intervals.

Table 3 .
Rank of the numerically estimated Fisher information matrix for a fixed set of initial conditions.Only the parameters in Model T-5 are identifiable from the available data.

Table 4 .
Coefficients of variation (CV) for Models H, P, T-13, and T-5.A parameter for which CV ≥ 1 has an uncertainty that is at least as large as its estimated value.The estimated uncertainties do not account for biological constraints on the parameter values.