Efficiency of two sample tests via the restricted mean survival time for analyzing event time observations

1 June 2018

journal article
research article
Published by Oxford University Press (OUP) in Biometrics

Vol. 74 (2), 694-702
https://doi.org/10.1111/biom.12770

Abstract

In comparing two treatments with the event time observations, the hazard ratio (HR) estimate is routinely used to quantify the treatment difference. However, this model dependent estimate may be difficult to interpret clinically especially when the proportional hazards (PH) assumption is violated. An alternative estimation procedure for treatment efficacy based on the restricted means survival time or t-year mean survival time (t-MST) has been discussed extensively in the statistical and clinical literature. On the other hand, a statistical test via the HR or its asymptotically equivalent counterpart, the logrank test, is asymptotically distribution-free. In this article, we assess the relative efficiency of the hazard ratio and t-MST tests with respect to the statistical power under various PH and non-PH models theoretically and empirically. When the PH assumption is valid, the t-MST test performs almost as well as the HR test. For non-PH models, the t-MST test can substantially outperform its HR counterpart. On the other hand, the HR test can be powerful when the true difference of two survival functions is quite large at end but not the beginning of the study. Unfortunately, for this case, the HR estimate may not have a simple clinical interpretation for the treatment effect due to the violation of the PH assumption.

Funding Information

NIH
NHLBI (R01 HL089778)
AHRQ (R00 HS022193)
NIA (R21 AG049385)

This publication has 28 references indexed in Scilit:

Improved Logrank‐Type Tests for Survival Data Using Adaptive Weights
Biometrics, 2010
Lenalidomide plus high-dose dexamethasone versus lenalidomide plus low-dose dexamethasone as initial therapy for newly diagnosed multiple myeloma: an open-label randomised controlled trial
The Lancet Oncology, 2010
Power and sample size calculation for log‐rank test with a time lag in treatment effect
Statistics in Medicine, 2009
Supremum weighted log-rank test and sample size for comparing two-stage adaptive treatment strategies
Biometrika, 2008
The Versatility of Function-Indexed Weighted Log-Rank Statistics
Journal of the American Statistical Association, 1999
Restricted Mean Life with Covariates: Modification and Extension of a Useful Survival Analysis Method
Journal of the American Statistical Association, 1998
The Robust Inference for the Cox Proportional Hazards Model
Journal of the American Statistical Association, 1989
Restricted Mean Life with Adjustment for Covariates
Journal of the American Statistical Association, 1987
Supremum Versions of the Log-Rank and Generalized Wilcoxon Statistics
Journal of the American Statistical Association, 1987
Efficiencies of some two–sample location tests for a broad class of distributions
Communications in Statistics - Theory and Methods, 1986

Cited by 58 articles