Augmenting the logrank test in the design of clinical trials in which non-proportional hazards of the treatment effect may be anticipated

Open Access

11 February 2016

journal article
research article
Published by Springer Science and Business Media LLC in BMC Medical Research Methodology

Vol. 16 (1), 1-13
https://doi.org/10.1186/s12874-016-0110-x

Abstract

Most randomized controlled trials with a time-to-event outcome are designed assuming proportional hazards (PH) of the treatment effect. The sample size calculation is based on a logrank test. However, non-proportional hazards are increasingly common. At analysis, the estimated hazards ratio with a confidence interval is usually presented. The estimate is often obtained from a Cox PH model with treatment as a covariate. If non-proportional hazards are present, the logrank and equivalent Cox tests may lose power. To safeguard power, we previously suggested a ‘joint test’ combining the Cox test with a test of non-proportional hazards. Unfortunately, a larger sample size is needed to preserve power under PH. Here, we describe a novel test that unites the Cox test with a permutation test based on restricted mean survival time. We propose a combined hypothesis test based on a permutation test of the difference in restricted mean survival time across time. The test involves the minimum of the Cox and permutation test P-values. We approximate its null distribution and correct it for correlation between the two P-values. Using extensive simulations, we assess the type 1 error and power of the combined test under several scenarios and compare with other tests. We investigate powering a trial using the combined test. The type 1 error of the combined test is close to nominal. Power under proportional hazards is slightly lower than for the Cox test. Enhanced power is available when the treatment difference shows an ‘early effect’, an initial separation of survival curves which diminishes over time. The power is reduced under a ‘late effect’, when little or no difference in survival curves is seen for an initial period and then a late separation occurs. We propose a method of powering a trial using the combined test. The ‘insurance premium’ offered by the combined test to safeguard power under non-PH represents about a single-digit percentage increase in sample size. The combined test increases trial power under an early treatment effect and protects power under other scenarios. Use of restricted mean survival time facilitates testing and displaying a generalized treatment effect.

Keywords

Funding Information

Medical Research Council (GB)

This publication has 22 references indexed in Scilit:

An approach to trial design and analysis in the era of non-proportional hazards of the treatment effect
Trials, 2014
Penicillin to Prevent Recurrent Leg Cellulitis
The New England Journal of Medicine, 2013
A Phase 3 Trial of Bevacizumab in Ovarian Cancer
The New England Journal of Medicine, 2011
The use of restricted mean survival time to estimate the treatment effect in randomized clinical trials when the proportional hazards assumption is in doubt
Statistics in Medicine, 2011
The estimation of average hazard ratios by weighted Cox regression
Statistics in Medicine, 2009
SAS and R functions to compute pseudo-values for censored data regression
Computer Methods and Programs in Biomedicine, 2008
Iterated Residuals and Time-Varying Covariate Effects in Cox Regression
Journal of the Royal Statistical Society Series B: Statistical Methodology, 2003
Cyclophosphamide and Cisplatin Compared with Paclitaxel and Cisplatin in Patients with Stage III and Stage IV Ovarian Cancer
The New England Journal of Medicine, 1996
Proportional hazards tests and diagnostics based on weighted residuals
Biometrika, 1994
The standard error of an estimate of expectation of life, with special reference to expectation of tumourless life in experiments with mice
Epidemiology and Infection, 1949

Cited by 50 articles