Distributed Cox proportional hazards regression using summary-level information

Biostatistics. 2023 Jul 14;24(3):776-794. doi: 10.1093/biostatistics/kxac006.

Abstract

Individual-level data sharing across multiple sites can be infeasible due to privacy and logistical concerns. This article proposes a general distributed methodology to fit Cox proportional hazards models without sharing individual-level data in multi-site studies. We make inferences on the log hazard ratios based on an approximated partial likelihood score function that uses only summary-level statistics. This approach can be applied to both stratified and unstratified models, accommodate both discrete and continuous exposure variables, and permit the adjustment of multiple covariates. In particular, the fitting of stratified Cox models can be carried out with only one file transfer of summary-level information. We derive the asymptotic properties of the proposed estimators and compare the proposed estimators with the maximum partial likelihood estimators using pooled individual-level data and meta-analysis methods through simulation studies. We apply the proposed method to a real-world data set to examine the effect of sleeve gastrectomy versus Roux-en-Y gastric bypass on the time to first postoperative readmission.

Keywords: Distributed Cox PH regression; Meta-analysis; Multi-site study; Summary-level information.

Publication types

  • Meta-Analysis

MeSH terms

  • Computer Simulation
  • Gastrectomy / methods
  • Gastric Bypass* / methods
  • Humans
  • Probability
  • Proportional Hazards Models