On using a non-probability sample for the estimation of population parameters
Articles
Ieva Burakauskaitė
Vilnius University
Andrius Čiginas
Vilnius University
Published 2023-11-20
https://doi.org/10.15388/LMR.2003.33587
PDF

Keywords

data integration
not missing at random
propensity score adjustment
population census

How to Cite

Burakauskaitė, I. and Čiginas, A. (2023) “On using a non-probability sample for the estimation of population parameters”, Lietuvos matematikos rinkinys, 64(A), pp. 1–11. doi:10.15388/LMR.2003.33587.

Abstract

We aim to find a way to effectively integrate a non-probability (voluntary) sample under the data framework, where the study variable is also observed in a probability sample of some statistical survey. The selection bias that arises from voluntary participation in the survey is corrected by estimating the inclusion into the sample probabilities (propensity scores) for the units in the non-probability sample. The estimators for the propensity scores are constructed using a parametric logistic regression model. We consider two modeling scenarios: with an assumption that the willingness to participate in the voluntary survey does not depend on the survey variable itself and that such a variable does contribute to whether the individual responds or not. The maximum likelihood method is applied in both scenarios to estimate the propensity scores. The estimators of the population mean based on the estimated propensity scores are linearly combined with the unbiased estimator using the probability sample data. We compare the constructed estimators in the simulation study, where we estimate the population proportions using data from the Population and Housing Census surveys.

PDF
Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Downloads

Download data is not yet available.