Indices of non-ignorable selection bias for proportions estimated from non-probability samples

Rebecca R Andridge; Brady T West; Roderick J A Little; Philip S Boonstra; Fernanda Alvarado-Leiton

doi:10.1111/rssc.12371

Indices of non-ignorable selection bias for proportions estimated from non-probability samples

J R Stat Soc Ser C Appl Stat. 2019 Nov;68(5):1465-1483. doi: 10.1111/rssc.12371. Epub 2019 Aug 2.

Authors

Rebecca R Andridge¹, Brady T West², Roderick J A Little², Philip S Boonstra², Fernanda Alvarado-Leiton²

Affiliations

¹ Ohio State University, Columbus, USA.
² University of Michigan, Ann Arbor, USA.

Abstract

Rising costs of survey data collection and declining response rates have caused researchers to turn to non-probability samples to make descriptive statements about populations. However, unlike probability samples, non-probability samples may produce severely biased descriptive estimates due to selection bias. The paper develops and evaluates a simple model-based index of the potential selection bias in estimates of population proportions due to non-ignorable selection mechanisms. The index depends on an inestimable parameter ranging from 0 to 1 that captures the amount of deviation from selection at random and is thus well suited to a sensitivity analysis. We describe modified maximum likelihood and Bayesian estimation approaches and provide new and easy-to-use R functions for their implementation. We use simulation studies to evaluate the ability of the proposed index to reflect selection bias in non-probability samples and show how the index outperforms a previously proposed index that relies on an underlying normality assumption. We demonstrate the use of the index in practice with real data from the National Survey of Family Growth.

Keywords: Non-ignorable selection bias; Non-probablity samples; Selection at random; Survey data collection.

Grants and funding

R21 HD090366/HD/NICHD NIH HHS/United States