Efficiency of two sample tests via the restricted mean survival time for analyzing event time observations
- 1 June 2018
- journal article
- research article
- Published by Oxford University Press (OUP) in Biometrics
- Vol. 74 (2), 694-702
- https://doi.org/10.1111/biom.12770
Abstract
In comparing two treatments with the event time observations, the hazard ratio (HR) estimate is routinely used to quantify the treatment difference. However, this model dependent estimate may be difficult to interpret clinically especially when the proportional hazards (PH) assumption is violated. An alternative estimation procedure for treatment efficacy based on the restricted means survival time or t-year mean survival time (t-MST) has been discussed extensively in the statistical and clinical literature. On the other hand, a statistical test via the HR or its asymptotically equivalent counterpart, the logrank test, is asymptotically distribution-free. In this article, we assess the relative efficiency of the hazard ratio and t-MST tests with respect to the statistical power under various PH and non-PH models theoretically and empirically. When the PH assumption is valid, the t-MST test performs almost as well as the HR test. For non-PH models, the t-MST test can substantially outperform its HR counterpart. On the other hand, the HR test can be powerful when the true difference of two survival functions is quite large at end but not the beginning of the study. Unfortunately, for this case, the HR estimate may not have a simple clinical interpretation for the treatment effect due to the violation of the PH assumption.Funding Information
- NIH
- NHLBI (R01 HL089778)
- AHRQ (R00 HS022193)
- NIA (R21 AG049385)
This publication has 28 references indexed in Scilit:
- Improved Logrank‐Type Tests for Survival Data Using Adaptive WeightsBiometrics, 2010
- Lenalidomide plus high-dose dexamethasone versus lenalidomide plus low-dose dexamethasone as initial therapy for newly diagnosed multiple myeloma: an open-label randomised controlled trialThe Lancet Oncology, 2010
- Power and sample size calculation for log‐rank test with a time lag in treatment effectStatistics in Medicine, 2009
- Supremum weighted log-rank test and sample size for comparing two-stage adaptive treatment strategiesBiometrika, 2008
- The Versatility of Function-Indexed Weighted Log-Rank StatisticsJournal of the American Statistical Association, 1999
- Restricted Mean Life with Covariates: Modification and Extension of a Useful Survival Analysis MethodJournal of the American Statistical Association, 1998
- The Robust Inference for the Cox Proportional Hazards ModelJournal of the American Statistical Association, 1989
- Restricted Mean Life with Adjustment for CovariatesJournal of the American Statistical Association, 1987
- Supremum Versions of the Log-Rank and Generalized Wilcoxon StatisticsJournal of the American Statistical Association, 1987
- Efficiencies of some two–sample location tests for a broad class of distributionsCommunications in Statistics - Theory and Methods, 1986