RBesT for a Normal Endpoint

Yue Li

2019-04-04

Introduction

The R Bayesian evidence synthesis Tools (RBesT) have been created to facilitate the use of historical information in clinical trials. Once relevant historical information has been identified, RBesT supports the derivation of informative priors via the Meta-Analytic-Predictive (MAP) approach [1], the evaluation of the trial’s operating characteristics, and the data analysis with the actual trial data. RBesT has been developed for endpoints from a number of well known distributions. Here we consider an example for a normally distributed response variable.

Trial Design with Historical Control Data

Historical Data

Let’s look at the Crohn’s disease example data [2] (data-set crohn in RBesT). The primary endpoint is the change from baseline in Crohn’s Disease Activity Index (CDAI), which is assumed to be normally distributed. Note that for CDAI, an improved outcome corresponds to a negative change from baseline.

First from historical studies we get the estimated standard deviation of the response variable as \(\sigma\) = 88, which is used to obtain the standard errors of the effect estimates.

study n y y.se
Gastr06 74 -51 10.229792
AIMed07 166 -49 6.830125
NEJM07 328 -36 4.858987
Gastr01a 20 -47 19.677398
APhTh04 25 -90 17.600000
Gastr01b 58 -54 11.554966

Derivation of MAP Prior

The MAP prior can be derived with the function gMAP. The between-trial heterogeneity parameter \(\tau\) governs how much information will be shared from the historical trials into the design and analysis of the future trials. In the normal case with a known sampling standard deviation \(\sigma\), the amount of borrowing from historical data depends on the ratio \(\tau/\sigma\). A conservative choice for the prior on \(\tau\) is a HalfNormal(0, \(\sigma/2\)) distribution. For the prior on the intercept we recommend a “unit-information” prior [3] which is set to a precision corresponding to a single observation and centered here at no change from baseline. Please refer to the help page ?gMAP for detailed information. The set.seed function is used to make the results exactly reproducible.

## Generalized Meta Analytic Predictive Prior Analysis
## 
## Call:  gMAP(formula = cbind(y, y.se) ~ 1 | study, family = gaussian, 
##     data = dat, weights = n, tau.dist = "HalfNormal", tau.prior = cbind(0, 
##         crohn_sigma/2), beta.prior = cbind(0, crohn_sigma))
## 
## Exchangeability tau strata: 1 
## Prediction tau stratum    : 1 
## Maximal Rhat              : 1 
## Estimated reference scale : 88 
## 
## Between-trial heterogeneity of tau prediction stratum
##  mean    sd  2.5%   50% 97.5% 
## 14.50  9.79  1.23 12.70 38.90 
## 
## MAP Prior MCMC sample
##  mean    sd  2.5%   50% 97.5% 
## -50.1  19.8 -95.2 -48.6 -10.5
## [1] "densityThetaStar"     "densityThetaStarLink" "forest_model"

Approximation of MAP Prior using a Mixture Distribution

Next, the MCMC MAP prior from the previous section is converted to a parametric representation with the automixfit function. This function fits a parametric mixture representation using expectation-maximization (EM). The number of mixture components is chosen automatically using AIC. One can also specify the number of components for the mixture via mixfit function and compare with the automixfit outcome.

## EM for Normal Mixture Model
## Log-Likelihood = -17213.33
## 
## Univariate normal mixture
## Reference scale: 88
## Mixture Components:
##   comp1       comp2       comp3       comp4      
## w   0.3450238   0.3304501   0.2238776   0.1006485
## m -46.0922976 -57.1189870 -53.5474204 -33.5582472
## s   6.7830507  12.3283282  34.2780050  10.2729225

## EM for Normal Mixture Model
## Log-Likelihood = -17239.6
## 
## Univariate normal mixture
## Reference scale: 88
## Mixture Components:
##   comp1       comp2      
## w   0.6559396   0.3440604
## m -48.3050490 -53.6486480
## s  10.5431993  30.1344332

Effective Sample Size (ESS)

The main advantage of using historical information is the possibility to reduce the number of control patients, as the informative prior is effectively equivalent to a certain number of control patients. This is called the effective sample size (ESS) and can be calculated in RBesT with the ess function.

Note, however, that various methods to calculate ESS have been proposed, and that they can give fairly different answers. The default moment-based approach yields conservative ESS, while the Morita et. al [4] approach normally gives an over optimistic ESS. From experience, a realistic ESS can be obtained by applying the Morita et. al method to a MAP approximation with few mixture components.

In the study protocol of this Crohn’s disease data example, the moment-based ESS of 20 was used to reduce the planned sample size of the control group.

## Using default prior reference scale 88
## [1] 20
## Using default prior reference scale 88
## [1] 107
## Using default prior reference scale 88
## [1] 60

Robustification of MAP Prior

We recommend robustifying [5] the prior with the robustify function, which protects against type-I error inflation in presence of prior-data conflict. For the normal case we strongly recommend explicitly choosing the mean of the robust component. We use \(-50\) consistent with the mean of the MAP prior. Furthermore, 20% probability is used for the additional robust (unit-information) mixture component. The choice of such probability reflects the confidence about the validitiy of the model assumptions, i.e. the possibility of a non-exchangable control group to be enrolled per inclusion/exclusion criteria in the current trial as compared to the historical control group population. Note that robustification decreases the ESS (numbers not shown).

## Using default prior reference scale 88
## Univariate normal mixture
## Reference scale: 88
## Mixture Components:
##   comp1        comp2        comp3        comp4        robust      
## w   0.27601904   0.26436007   0.17910208   0.08051881   0.20000000
## m -46.09229761 -57.11898704 -53.54742037 -33.55824722 -50.00000000
## s   6.78305075  12.32832819  34.27800497  10.27292253  88.00000000

Operating Characteristics of Design Options

Typically, operating characteristics are required to evaluate the proposed design or compare a few design options. RBesT requires the input of decision rules via decision2S function and then calculates the operating characteristics with the oc2S function. The calculation is expedited based on analytic expressions. In the following we compare a few design options, which differ in the choice of the control priors and the sample size of the control group for the planned trial. There may be other design factors (such as the outcome standard deviation), which should be considered when comparing design options. Such factors are not considered here for simplicity purpose.

Decision Rules

Consider this 2-arm design of placebo (with an informative prior) against an experimental treatment. The dual-criterion for success is defined as follows: \[ \begin{align*} \textrm{Criterion 1:} & \Pr(y_{act} - y_{pbo} \lt 0) &> 0.95 \\ \textrm{Criterion 2:} & \Pr(y_{act} - y_{pbo} \lt -50) &> 0.50. \end{align*} \]

Equivalently, the second criterion requires that the posterior median difference exceeds -50. The dual-criteria account for statistical significance as well as clinical relevance. Note that a negative change from baseline in CDAI corresponds to improvement.

## 2 sample decision function
## Conditions for acceptance:
## P(x1 - x2 <= 0) > 0.95
## P(x1 - x2 <= -50) > 0.5
## Link: identity

Final Analysis with Trial Data

When the actual trial data are available, the final analysis can be run with RBesT via the postmix function. The real data are used for this example, where the trial data led to a negative conclusion. However, note that postmix assumes that the sampling standard deviation is known and fixed. Therefore, no uncertainty in its estimate is taken into account.

## one can either use summary data or individual data. See ?postmix.
y.act <- -29.15
y.act.se <- 16.69
n.act <- 39

y.pbo <- -76.01
y.pbo.se <- 21.93
n.pbo <- 20

## first obtain posterior distributions
post_act <- postmix(weak_prior, m=y.act, se=y.act.se)
post_pbo <- postmix(map_robust, m=y.pbo, se=y.pbo.se)

## then calculate probability for the dual criteria
## and compare to the predefined threshold values
p1 <- pmixdiff(post_act, post_pbo, 0); print(p1)
## [1] 0.08406058
p2 <- pmixdiff(post_act, post_pbo, -50); print(p2)
## [1] 8.970822e-05
print(p1>0.95 & p2>0.5)
## [1] FALSE
## or we can use the decision function
poc(post_act, post_pbo)
## [1] 0

References

[1] Neuenschwander B et. al, Clin Trials. 2010; 7(1):5-18
[2] Hueber W. et. al, Gut, 2012, 61(12):1693-1700
[3] Kass RE, Wasserman L, J Amer Statist Assoc; 1995, 90(431):928-934.
[4] Morita S. et. al, Biometrics 2008;64(2):595-602
[5] Schmidli H. et. al, Biometrics 2014;70(4):1023-1032

R Session Info

## R version 3.5.1 (2018-07-02)
## Platform: x86_64-pc-linux-gnu (64-bit)
## Running under: Ubuntu 16.04.5 LTS
## 
## Matrix products: default
## BLAS: /usr/lib/libblas/libblas.so.3.6.0
## LAPACK: /usr/lib/lapack/liblapack.so.3.6.0
## 
## locale:
## [1] C
## 
## attached base packages:
## [1] stats     graphics  grDevices utils     datasets  methods   base     
## 
## other attached packages:
##  [1] forcats_0.3.0   stringr_1.3.1   readr_1.3.0     tidyr_0.8.2    
##  [5] tibble_1.4.2    tidyverse_1.2.1 scales_1.0.0    bindrcpp_0.2.2 
##  [9] ggplot2_3.1.0   purrr_0.2.5     dplyr_0.7.8     bayesplot_1.6.0
## [13] knitr_1.21      RBesT_1.3-8     Rcpp_1.0.0     
## 
## loaded via a namespace (and not attached):
##  [1] httr_1.4.0           jsonlite_1.6         modelr_0.1.2        
##  [4] StanHeaders_2.18.0-1 Formula_1.2-3        assertthat_0.2.0    
##  [7] highr_0.7            stats4_3.5.1         cellranger_1.1.0    
## [10] yaml_2.2.0           pillar_1.3.0         backports_1.1.3     
## [13] lattice_0.20-38      glue_1.3.0           digest_0.6.18       
## [16] checkmate_1.8.5      rvest_0.3.2          colorspace_1.3-2    
## [19] htmltools_0.3.6      plyr_1.8.4           pkgconfig_2.0.2     
## [22] rstan_2.18.2         broom_0.5.1          haven_2.0.0         
## [25] mvtnorm_1.0-8        processx_3.2.1       generics_0.0.2      
## [28] withr_2.1.2          lazyeval_0.2.1       cli_1.0.1           
## [31] magrittr_1.5         crayon_1.3.4         readxl_1.1.0        
## [34] evaluate_0.12        ps_1.2.1             nlme_3.1-137        
## [37] xml2_1.2.0           pkgbuild_1.0.2       tools_3.5.1         
## [40] loo_2.0.0            prettyunits_1.0.2    hms_0.4.2           
## [43] matrixStats_0.54.0   munsell_0.5.0        callr_3.1.0         
## [46] compiler_3.5.1       rlang_0.3.0.1        grid_3.5.1          
## [49] ggridges_0.5.1       rstudioapi_0.8       labeling_0.3        
## [52] rmarkdown_1.11       gtable_0.2.0         codetools_0.2-15    
## [55] inline_0.3.15        reshape2_1.4.3       R6_2.3.0            
## [58] gridExtra_2.3        lubridate_1.7.4      bindr_0.1.1         
## [61] stringi_1.2.4        parallel_3.5.1       tidyselect_0.2.5    
## [64] xfun_0.4