Optimized scaling of translational factors in oncology: from xenografts to RECIST

Baaz, Marcus; Cardilin, Tim; Lignet, Floriane; Jirstrand, Mats

doi:10.1007/s00280-022-04458-8

Original Article
Open access
Published: 03 August 2022

Volume 90, pages 239–250, (2022)

Cancer Chemotherapy and Pharmacology

Marcus Baaz^1,2,
Tim Cardilin¹,
Floriane Lignet³ &
…
Mats Jirstrand¹

Abstract

Purpose

Tumor growth inhibition (TGI) models are regularly used to quantify the PK–PD relationship between drug concentration and in vivo efficacy in oncology. These models are typically calibrated with data from xenograft mice, and translational methods have to be applied before they can be used for clinical predictions. Currently, such methods are commonly based on replacing model components or scaling model parameters. However, difficulties remain in how to accurately account for inter-species differences. Therefore, more research must be done before xenograft data can be fully utilized to predict clinical response.

Method

To contribute to this research, we have calibrated TGI models to xenograft data for three drug combinations using the nonlinear mixed effects framework. The models were translated by replacing mouse exposure with human exposure and used to make predictions of clinical response. Furthermore, in search of a better way of translating these models, we estimated an optimal way of scaling model parameters given the available clinical data.

Results

The predictions were compared with clinical data and we found that clinical efficacy was overestimated. The estimated optimal scaling factors were similar to a standard allometric scaling exponent of −0.25.

Conclusions

We believe that given more data, our methodology could contribute to increasing the translational capabilities of TGI models. More specifically, an appropriate translational method could be developed for drugs with the same mechanism of action, which would allow for all preclinical data to be leveraged for new drugs of the same class. This would ensure that fewer clinically inefficacious drugs are tested in clinical trials.

Introduction

A major problem in the drug development process in oncology is translating results from preclinical studies to a clinical setting [1, 2]. Clinical efficacy is frequently overpredicted, which means that test compounds showing promising preclinical results fail when they enter clinical trials [3, 4]. This is one of the main reasons for the high attrition rates seen for anticancer drugs [5]. However, there exists a correlation between preclinical efficacy, estimated from studies using either patient-derived xenografts (PDXs) or traditional xenografts based on cell lines, and clinical efficacy [6,7,8]. This shows the potential of using xenograft mice for testing compounds, in particular PDXs as they represent the human disease condition better [9], but also highlights the need for further translational research.

Combination therapies have come to play a leading role in anticancer treatment during the last decades [10]. The strengths of this type of treatment are, e.g., synergistic effects between the drugs and slower onset of resistance [11, 12]. However, giving two drugs concomitantly leads to complex pharmacokinetic (PK) as well as pharmacodynamic (PD) interactions that need to be analyzed [13]. Moreover, these effects can differ between species, making translational efforts even more challenging [14].

Mathematical modeling is a powerful tool for drug development and, in particular, for evaluating combination therapies [15]. Typically, a tumor growth inhibition (TGI) model is developed and calibrated to xenograft tumor volume data [16,17,18], and then used to investigate the efficacy of alternative treatment scenarios such as different drug doses or treatment schedules [19]. However, inter-species differences have to be accounted for to make clinical predictions [20]. The translational methods that are currently used can primarily be divided into two categories: replacement of model components, e.g., growth rate parameter, exposure, or even the entire PK model; and scaling of model parameters [21, 22]. However, these methods are often insufficient to accurately translate the relationship between drug dose or concentration and in vivo efficacy due to the physiological differences between tumor xenografts in mice and cancer progression in humans [23]. A potential contributing factor is also inadequate experimental design [24]. Therefore, additional model-based translational approaches are needed to make full use of preclinical data and minimize drug attrition rates.

In this paper, we calibrate preclinical TGI models using xenograft data from the literature for three drug combinations. We then replace mouse PK with human PK, accounting for differences in protein binding, and formulate a mathematical optimization problem to find how to best scale the PD rate parameters to describe published clinical data. We hypothesize that the optimal scaling factors could be drug/cancer type specific and could thus be used to leverage all preclinical data when developing new drug combinations for the same cancer type and with the same drug mechanisms of action. Finally, we compare the optimal scaling factors with the standard allometric scaling factor for rate parameters.

Methods

Data

Preclinical data

We analyzed PDX data for combination therapies for which we were also able to find clinical data in the literature. The PDXs had either cutaneous melanoma (CM) or colorectal cancer (CRC) and data for combinations of binimetinib/encorafenib (CM), binimetinib/ribociclib (CM), and cetuximab/encorafenib (CRC) were taken from Gao et al., 2015 [7]. Data for vehicle groups of the two cancer types and single agent data were also extracted. All time series were cut at day 60 to better reflect a typical xenograft study. Exposure data for encorafenib and ribociclib in mice were extracted from the same publication, whereas data for the other two drugs were gathered from other sources [25, 26]. Treatment schedules and sample size of each treatment group can be found in the Supplementary Information (Table S1).

Anticancer drugs can have different efficacy depending on the specific mutations carried by the patient's cancer cells [27]. We have, therefore, stratified the data into BRAF-mutants, NRAS-mutants, and all other mutants. In the binimetinib/ribociclib combination group, there were five CM PDXs that had a mutation in the BRAF gene and five that had a mutation in the NRAS gene. There were 13 BRAF mutants and nine NRAS mutants in all other CM treatment groups. Among the CRC PDXs, there were only six BRAF mutants and a single NRAS mutant.

Clinical data

In clinical oncology studies, patient response is categorized using the RECIST criteria. The sum of the longest diameters for all target lesions (SLD) is measured at the start of treatment (baseline) and at subsequent checkups. Each patient is categorized based on their best response using four response categories: Complete Response (CR), Partial Response (PR), Progressive Disease (PD), and Stable Disease (SD) [28].

Clinical RECIST data were obtained from ClinicalTrials.gov. Data for the following treatment groups were available: binimetinib (NRAS/BRAF, CM) [29, 30], binimetinib/ribociclib (NRAS, CM) [31], encorafenib (BRAF, CM) [32], binimetinib/encorafenib (BRAF, CM) [32], cetuximab (CRC) [33], and encorafenib/cetuximab (CRC, BRAF) [34]. All drugs were given orally, except for cetuximab, which was given intravenously. Treatment schedule, sample size, checkup time, response rate, cancer type, and mutations for each clinical trial can be found in the Supplementary Information (Table S1). For more information regarding each study, the reader is referred to the corresponding article.

Preclinical modeling

Exposure to anticancer drugs

Daily unbound average concentration, ${C}_{avg,u}$, was used to describe the exposure to all drugs except binimetinib, for which unbound maximum concentration, ${C}_{max,u}$, was used instead, as maximal concentration has been shown to correlate better with clinical efficacy than overall exposure for this particular compound [26]. The unbound concentrations were computed by first estimating the total average or maximum concentration, ${C}_{avg,tot}$ or ${C}_{max,tot}$, and then adjusting for the in vitro mean unbound protein fraction in mice, ${f}_{u, Mouse}$, according to

$$C_{avg,u} = C_{avg,tot} \cdot f_{u, Mouse} ,\,C_{max,u} = C_{max,tot} \cdot f_{u, Mouse} ,$$

(1)

${f}_{u, Mouse}$ for each drug was extracted from the literature [25, 35,36,37]. Compartmental models were fitted to the extracted exposure data of encorafenib and ribociclib. One-compartment models were sufficient to describe the PK data of both compounds. For cetuximab, we used a one-compartment model from the literature [25]. These three models were used to estimate ${C}_{avg,tot}$ of encorafenib, ribociclib, and cetuximab in the TGI model. Due to lack of an adequate PK model for binimetinib, the ${C}_{max,tot}$ value was gathered from the literature [26]. Total and unbound exposure for each drug and treatment schedule are summarized in Table 1. A detailed description of how these values were derived is available in the Supplementary Information.
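As a minimal sketch of Eq. 1, the unbound exposure is simply the total exposure multiplied by the unbound fraction. The function name and the numerical values below are hypothetical and chosen for illustration only; they are not taken from Table 1.

```python
def unbound_concentration(c_total: float, f_unbound: float) -> float:
    """Eq. 1: unbound exposure = total exposure times the unbound fraction."""
    if not 0.0 <= f_unbound <= 1.0:
        raise ValueError("unbound fraction must lie in [0, 1]")
    return c_total * f_unbound


# Hypothetical values, for illustration only (not from Table 1).
c_avg_tot = 2.0    # total average concentration, arbitrary units
f_u_mouse = 0.05   # in vitro unbound fraction in mouse plasma
c_avg_u = unbound_concentration(c_avg_tot, f_u_mouse)
```

The same function applies to ${C}_{max,tot}$, and to the human case when ${f}_{u, Human}$ is supplied instead.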

Table 1 Preclinical and clinical drug exposure

Full size table

Tumor growth inhibition model

To quantify the preclinical anticancer efficacy of each drug and drug combination, a one-compartment TGI model was calibrated to each tumor type. The choice of this relatively simple model was made to balance model complexity with the amount of available data. In the model, all tumor cells are assumed to be proliferating and located in a single compartment. A schematic representation of the model is shown in Fig. 1.

Turnover of tumor cells exposed to drug i as single agent is described by the following differential equation,

$$\frac{dV}{dt} = \left( k_{ng} - a_{i} \cdot C_{i} \right) V\left( t \right), \quad V\left( 0 \right) = V_{0} ,$$

(2)

where $V$ is the volume of tumor cells, ${V}_{0}$ the initial tumor volume, ${k}_{ng}$ the net tumor growth rate constant, ${a}_{i}$ the potency of drug i, and ${C}_{i}$ the average or maximum unbound drug concentration.

When two drugs, i and j, are given in combination, the turnover is instead described by,

$$\frac{dV}{dt} = \left( k_{ng} - a_{i} \cdot C_{i} - a_{j} \cdot C_{j} - \gamma_{i,j} \cdot C_{i} \cdot C_{j} \right) V\left( t \right),$$

(3)

where ${\gamma }_{i,j}$ is included to describe a potential synergistic or antagonistic effect between the drugs [9].
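Because the exposures ${C}_{i}$ and ${C}_{j}$ enter Eqs. 2 and 3 as constants, the model has the closed-form solution $V(t) = V_0 e^{rt}$, where $r$ is the bracketed net rate. The sketch below illustrates this under that assumption; the function names and all parameter values are hypothetical and are not estimates from this study.

```python
import math


def net_rate(k_ng, a_i, c_i, a_j=0.0, c_j=0.0, gamma_ij=0.0):
    """Bracketed net rate of Eq. 3; reduces to Eq. 2 when drug j is absent."""
    return k_ng - a_i * c_i - a_j * c_j - gamma_ij * c_i * c_j


def tumor_volume(t, v0, rate):
    """Closed-form solution V(t) = V0 * exp(rate * t) for constant exposure."""
    return v0 * math.exp(rate * t)


# Hypothetical parameter values (not estimates from the paper).
k_ng, a_i, a_j, gamma_ij = 0.05, 0.4, 0.2, 1.0
c_i, c_j = 0.1, 0.1

single = net_rate(k_ng, a_i, c_i)                      # 0.05 - 0.04
combo = net_rate(k_ng, a_i, c_i, a_j, c_j, gamma_ij)   # 0.05 - 0.04 - 0.02 - 0.01
volume_day_10 = tumor_volume(10.0, 200.0, combo)
```

With the sign convention of Eq. 3, a positive ${\gamma }_{i,j}$ adds to the inhibition (synergy) and a negative value reduces it (antagonism). In this hypothetical example the single agent only slows growth (positive net rate), whereas the combination produces shrinkage (negative net rate).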

Mathematical modeling and parameter estimation were performed using an NLME framework (see Computational methods for details). One TGI model for each cancer type was fitted to the data and log-normal between-subject variability (BSV) was accounted for on the parameters ${k}_{ng}$ and ${V}_{0}$ in both models and on the potency parameter of binimetinib, ${a}_{Bini}$, in the CM model. No correlation between random effects was assumed and a proportional observation error was used in the model based on residual analysis. We also investigated whether parameter estimates differed significantly when treatment groups were stratified into BRAF-mutants, NRAS-mutants, and others.

Clinical modeling

Translational

To predict clinical response, translational methods were applied to the preclinical TGI models. Initially, we only replaced mouse exposure with human exposure, after accounting for differences in protein binding [20, 21]. For each drug, reported $AU{C}_{tot}$ or ${C}_{max,tot}$ values were taken from the clinical study if available, or otherwise values from similar studies. The exposure was then adjusted by in vitro mean unbound protein fraction in humans, ${f}_{u, Human}$ [6, 24, 35, 36]. Total and unbound exposures for each drug and treatment schedule are summarized in Table 1. A detailed description of how values were derived is available in the Supplementary Information.

Clinical predictions

We used our translated preclinical TGI models to predict the proportion of patients in each RECIST category. To do this, two important aspects first had to be considered. First, the RECIST criteria are based on SLD, whereas predictions from the models are on volumes. Therefore, we converted the volume predictions to SLD by assuming either spherical or ellipsoid tumors [39]. In the ellipsoid case, prolate ellipsoids were assumed as well as that tumor growth or shrinkage only occurs along the longest radius. This leads to the volumetric change being the same as the change in SLD between two time points. For the spherical case, the volumetric change has to be greater than the SLD change to achieve CR/PR or PD [28, 39, 40]. Both assumptions of spherical and ellipsoid tumors were evaluated in this paper.
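The two shape assumptions can be made concrete with a short sketch. For a sphere the diameter scales as $V^{1/3}$, whereas for a prolate ellipsoid that changes only along its longest axis the diameter scales linearly with $V$. The code assumes, as a simplification, that SLD behaves like a single longest diameter; the function names are hypothetical.

```python
def sld_ratio(volume_ratio: float, shape: str = "sphere") -> float:
    """Ratio SLD_T / SLD_0 implied by a volume ratio V_T / V_0."""
    if volume_ratio < 0.0:
        raise ValueError("volume ratio must be non-negative")
    if shape == "sphere":
        return volume_ratio ** (1.0 / 3.0)   # diameter scales as V^(1/3)
    if shape == "ellipsoid":
        return volume_ratio                  # only the longest axis changes
    raise ValueError(f"unknown shape: {shape}")


def volume_change_for_sld_change(delta_sld_percent: float,
                                 shape: str = "sphere") -> float:
    """Percentage volume change that produces a given percentage SLD change."""
    ratio = 1.0 + delta_sld_percent / 100.0
    if shape == "sphere":
        return 100.0 * (ratio ** 3 - 1.0)
    if shape == "ellipsoid":
        return 100.0 * (ratio - 1.0)
    raise ValueError(f"unknown shape: {shape}")


pr_sphere = volume_change_for_sld_change(-30.0)   # 0.7**3 - 1, about -65.7 %
pd_sphere = volume_change_for_sld_change(20.0)    # 1.2**3 - 1, about +72.8 %
```

Under the spherical assumption, a 30% decrease in SLD therefore requires a volume decrease of roughly 66%, and a 20% increase in SLD requires a volume increase of roughly 73%, which is why the volumetric change has to exceed the SLD change in that case.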

Second, only the best response, which can occur at any checkup, for each patient is reported in the clinical studies. Therefore, we made the simplifying assumption that the best response occurred at the first evaluation, i.e., at week 6 or 8, and we called this time $T$. We subsequently investigated how the predictions were affected if a different $T$ was chosen.

To make the predictions, we used the translated preclinical model (formed by the preclinical tumor model combined with the human PK) to generate 1000 studies with the same number of individuals as in the original study. The time evolution of tumor volume of each individual was simulated and converted to SLD. After that, the percentage change between baseline and week $T$ was calculated, using the following equation,

$${\Delta }SLD = 100 \cdot \frac{{SLD_{T} - SLD_{0} }}{{SLD_{0} }}.$$

(5)

A patient is classified as CR/PR if $\Delta SLD\le -30$, as PD if $\Delta SLD\ge 20$, and as SD if $-30<\Delta SLD<20$ [28]. This process of generating and categorizing individuals is illustrated in Fig. 2.

Each individual’s $\Delta SLD$ was compared with the RECIST thresholds, and the proportion of patients in each RECIST category was thus estimated for each study. Subsequently, the mean and 95% confidence interval (PCI) of each RECIST category were calculated. We considered a prediction adequate if the PCI covered the clinical observation.
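The simulation and classification procedure can be sketched as follows. This is an illustrative simplification, not the computational framework used in the study: BSV is placed on ${k}_{ng}$ only, the drug effect is lumped into a single hypothetical constant `drug_effect`, tumors are assumed spherical, and the interval is taken as the empirical 2.5th and 97.5th percentiles across virtual studies. All numerical values are hypothetical.

```python
import math
import random


def delta_sld(sld_0: float, sld_t: float) -> float:
    """Eq. 5: percentage change in SLD from baseline to evaluation time T."""
    return 100.0 * (sld_t - sld_0) / sld_0


def recist_category(change: float) -> str:
    """Map a percentage SLD change to the response categories in the text."""
    if change <= -30.0:
        return "CR/PR"
    if change >= 20.0:
        return "PD"
    return "SD"


def simulate_study(rng, n_patients, k_typ, omega, drug_effect, t_eval):
    """Percentage of simulated patients per category in one virtual study."""
    counts = {"CR/PR": 0, "SD": 0, "PD": 0}
    for _ in range(n_patients):
        k_ng = k_typ * math.exp(rng.gauss(0.0, omega))   # log-normal BSV
        volume_ratio = math.exp((k_ng - drug_effect) * t_eval)
        sld_t = volume_ratio ** (1.0 / 3.0)              # sphere, SLD_0 = 1
        counts[recist_category(delta_sld(1.0, sld_t))] += 1
    return {cat: 100.0 * n / n_patients for cat, n in counts.items()}


def summarize(studies, category):
    """Mean and empirical 2.5th/97.5th percentiles across virtual studies."""
    values = sorted(s[category] for s in studies)
    lower = values[int(0.025 * (len(values) - 1))]
    upper = values[int(0.975 * (len(values) - 1))]
    return sum(values) / len(values), (lower, upper)


rng = random.Random(1)  # fixed seed for reproducibility
# Hypothetical settings: 1000 virtual studies of 50 patients, T = 42 days.
studies = [simulate_study(rng, 50, 0.02, 0.5, 0.04, 42.0) for _ in range(1000)]
mean_pr, pci_pr = summarize(studies, "CR/PR")
```

A prediction for a category would then be judged adequate if the clinically observed percentage lies between `pci_pr[0]` and `pci_pr[1]`.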

Optimization

After making our predictions with the translated models, we wanted to investigate how the parameters in the model should be scaled to describe the clinical data better. The parameters that we focused on were the PD rate parameters, ${k}_{ng}$, ${a}_{i}$, and ${\gamma }_{i,j}$. We allowed the scaling of these parameters to be different and denoted the optimal scaling factors for them by A, B, and C, respectively. The optimal scaling factors were introduced to the model using the following expressions and were found by formulating and solving an optimization problem.

$$k_{ng}^{H} = A \cdot k_{ng}^{M}$$

$$a_{i}^{H} = B \cdot a_{i}^{M}$$

$$\gamma_{i,j}^{H} = C \cdot \gamma_{i,j}^{M}$$

(6)

Here the superscript, $M$, denotes that the parameter is estimated from PDX data and, $H$, that the parameter is scaled for human predictions.

To formulate the optimization problem, we denoted the clinically observed and predicted percentage of patients in RECIST category i for treatment group j by ${y}_{ij}$ and ${y}_{ij}^{*}$, respectively. Furthermore, ${y}_{ij}^{*}$ is a function of the scaling factors $x=(A, B, C)$. A least-squares problem was formulated to find x such that the difference between ${y}_{ij}$ and ${y}_{ij}^{*}$ is minimized for all i and j. Mathematically this is described by the equation,

$$f\left( x \right) = \sum_{i,j} \left( y_{ij}^{*} \left( x \right) - y_{ij} \right)^{2} .$$

(7)

However, this objective function can lead to optimal solutions where some RECIST categories are not adequately predicted, which is compensated by very accurate predictions of other categories. Thus, to improve the predictions, on a study level, we penalized the solution for each RECIST category in y that was not covered by the PCI. This promotes solutions with as many adequate predictions as possible and was done by introducing the following penalty term,

$$\lambda \sum_{i,j} g_{ij} \left( x \right),$$

(8)

where $\lambda$ is a penalty constant and,

$$g_{ij} \left( x \right) = \begin{cases} 0 & \text{if } y_{ij} \in PCI_{j} \\ 1 & \text{if } y_{ij} \notin PCI_{j} . \end{cases}$$

(9)

Combining this penalty term with Eq. 7 results in the following equation,

$$L\left( x \right) = f\left( x \right) + \lambda \sum_{i,j} g_{ij} \left( x \right).$$

(10)
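As a hedged illustration of Eq. 10, the penalized objective can be evaluated once the predicted percentages and their intervals are available for a given $x$. In the sketch below these are passed in directly as dictionaries keyed by (category, treatment group); in practice they would come from re-simulating the virtual studies with the scaled parameters. The function name, the data layout, and the numbers are all hypothetical, while the default penalty of 1000 is the value of $\lambda$ stated under Computational methods.

```python
def penalized_loss(predicted, intervals, observed, penalty=1000.0):
    """Eq. 10: squared error plus a fixed penalty per observation outside its PCI."""
    squared_error = 0.0
    misses = 0
    for key, y_obs in observed.items():
        y_pred = predicted[key]
        lower, upper = intervals[key]
        squared_error += (y_pred - y_obs) ** 2      # term of f(x), Eq. 7
        if not lower <= y_obs <= upper:             # g_ij(x) = 1, Eq. 9
            misses += 1
    return squared_error + penalty * misses


# Hypothetical percentages for one treatment group "A".
observed = {("CR/PR", "A"): 60.0, ("SD", "A"): 30.0, ("PD", "A"): 10.0}
predicted = {("CR/PR", "A"): 70.0, ("SD", "A"): 25.0, ("PD", "A"): 5.0}
intervals = {("CR/PR", "A"): (55.0, 85.0),
             ("SD", "A"): (12.0, 38.0),
             ("PD", "A"): (0.0, 8.0)}

loss = penalized_loss(predicted, intervals, observed)
```

In this example the squared error is 100 + 25 + 25 = 150 and one observation (PD) falls outside its interval, so the loss is 150 + 1000 = 1150. Because the penalty is large relative to the squared error, a solution that covers one more category is preferred over one that merely reduces the residuals.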

The optimization problem was formulated as,

$${\text{minimize}} \; L\left( x \right),\,{\text{subject to }} - \infty < x \le 0.$$

(11)

The optimization procedure was validated by first synthesizing data with known optimal scaling factors and then re-estimating these known factors. To give an idea of the uncertainty of the estimates, a non-parametric bootstrap was performed to calculate RSE % of each optimal scaling factor.

Allometric scaling

The heart rate of organisms has been shown to be proportional to the body weight of the organism raised to the power of −0.25 [41]. This is the underlying rationale for the proposal that parameters associated with tumor growth can also be scaled allometrically with an exponent of −0.25 [42]. Standard values of the body weight of a human and a mouse are assumed to be 70 kg and 20 g, respectively, which results in a scaling factor of approximately 0.13. We compared this scaling factor with the optimal scaling factors we found through our optimization procedure.
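The arithmetic behind the factor of approximately 0.13 can be checked with a short sketch; the function name is hypothetical, and the body weights and exponent are those stated above.

```python
def allometric_factor(bw_human_kg: float = 70.0,
                      bw_mouse_kg: float = 0.020,
                      exponent: float = -0.25) -> float:
    """Scaling factor for a rate parameter: (BW_human / BW_mouse) ** exponent."""
    return (bw_human_kg / bw_mouse_kg) ** exponent


factor = allometric_factor()   # (70 / 0.020) ** -0.25 = 3500 ** -0.25

# Applying the factor to a hypothetical mouse growth rate (per day).
k_ng_mouse = 0.05
k_ng_human = factor * k_ng_mouse
```

The body-weight ratio is 3500, and its fourth root is about 7.7, so the factor is about 1/7.7, that is, approximately 0.13.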

Computational methods

Mathematical modeling and parameter estimation were performed using an NLME modeling approach based on the first-order conditional estimation (FOCE) method. The computational framework used was developed at the Fraunhofer-Chalmers Research Centre for Industrial Mathematics (Gothenburg, Sweden) [43]. The preclinical TGI models were simultaneously fitted to tumor volume data from all treatment groups of the same cancer type. The models were evaluated based on the precision of estimated parameters, individual fits, empirical Bayes estimates (EBEs), Akaike information criterion (AIC), and visual predictive checks (VPC). We used simulated annealing with $\lambda$ set to 1000 to solve the optimization problem. Mathematica was used to create all figures and to perform all computations.