Augmenting the logrank test in the design of clinical trials in which non-proportional hazards of the treatment effect may be anticipated
Open Access
- 11 February 2016
- journal article
- research article
- Published by Springer Science and Business Media LLC in BMC Medical Research Methodology
- Vol. 16 (1), 1-13
- https://doi.org/10.1186/s12874-016-0110-x
Abstract
Most randomized controlled trials with a time-to-event outcome are designed assuming proportional hazards (PH) of the treatment effect. The sample size calculation is based on a logrank test. However, non-proportional hazards are increasingly common. At analysis, the estimated hazards ratio with a confidence interval is usually presented. The estimate is often obtained from a Cox PH model with treatment as a covariate. If non-proportional hazards are present, the logrank and equivalent Cox tests may lose power. To safeguard power, we previously suggested a ‘joint test’ combining the Cox test with a test of non-proportional hazards. Unfortunately, a larger sample size is needed to preserve power under PH. Here, we describe a novel test that unites the Cox test with a permutation test based on restricted mean survival time. We propose a combined hypothesis test based on a permutation test of the difference in restricted mean survival time across time. The test involves the minimum of the Cox and permutation test P-values. We approximate its null distribution and correct it for correlation between the two P-values. Using extensive simulations, we assess the type 1 error and power of the combined test under several scenarios and compare with other tests. We investigate powering a trial using the combined test. The type 1 error of the combined test is close to nominal. Power under proportional hazards is slightly lower than for the Cox test. Enhanced power is available when the treatment difference shows an ‘early effect’, an initial separation of survival curves which diminishes over time. The power is reduced under a ‘late effect’, when little or no difference in survival curves is seen for an initial period and then a late separation occurs. We propose a method of powering a trial using the combined test. The ‘insurance premium’ offered by the combined test to safeguard power under non-PH represents about a single-digit percentage increase in sample size. The combined test increases trial power under an early treatment effect and protects power under other scenarios. Use of restricted mean survival time facilitates testing and displaying a generalized treatment effect.Keywords
Funding Information
- Medical Research Council (GB)
This publication has 22 references indexed in Scilit:
- An approach to trial design and analysis in the era of non-proportional hazards of the treatment effectTrials, 2014
- Penicillin to Prevent Recurrent Leg CellulitisThe New England Journal of Medicine, 2013
- A Phase 3 Trial of Bevacizumab in Ovarian CancerThe New England Journal of Medicine, 2011
- The use of restricted mean survival time to estimate the treatment effect in randomized clinical trials when the proportional hazards assumption is in doubtStatistics in Medicine, 2011
- The estimation of average hazard ratios by weighted Cox regressionStatistics in Medicine, 2009
- SAS and R functions to compute pseudo-values for censored data regressionComputer Methods and Programs in Biomedicine, 2008
- Iterated Residuals and Time-Varying Covariate Effects in Cox RegressionJournal of the Royal Statistical Society Series B: Statistical Methodology, 2003
- Cyclophosphamide and Cisplatin Compared with Paclitaxel and Cisplatin in Patients with Stage III and Stage IV Ovarian CancerThe New England Journal of Medicine, 1996
- Proportional hazards tests and diagnostics based on weighted residualsBiometrika, 1994
- The standard error of an estimate of expectation of life, with special reference to expectation of tumourless life in experiments with miceEpidemiology and Infection, 1949