Impact of selection bias on polygenic risk score estimates in healthcare settings


Journal article


Y. Lee, Tanayott Thaweethai, Y. Sheu, Y. Feng, E. Karlson, T. Ge, P. Kraft, J. Smoller
Psychological Medicine, 2022

Semantic Scholar DOI PubMed
Cite

Cite

APA   Click to copy
Lee, Y., Thaweethai, T., Sheu, Y., Feng, Y., Karlson, E., Ge, T., … Smoller, J. (2022). Impact of selection bias on polygenic risk score estimates in healthcare settings. Psychological Medicine.


Chicago/Turabian   Click to copy
Lee, Y., Tanayott Thaweethai, Y. Sheu, Y. Feng, E. Karlson, T. Ge, P. Kraft, and J. Smoller. “Impact of Selection Bias on Polygenic Risk Score Estimates in Healthcare Settings.” Psychological Medicine (2022).


MLA   Click to copy
Lee, Y., et al. “Impact of Selection Bias on Polygenic Risk Score Estimates in Healthcare Settings.” Psychological Medicine, 2022.


BibTeX   Click to copy

@article{y2022a,
  title = {Impact of selection bias on polygenic risk score estimates in healthcare settings},
  year = {2022},
  journal = {Psychological Medicine},
  author = {Lee, Y. and Thaweethai, Tanayott and Sheu, Y. and Feng, Y. and Karlson, E. and Ge, T. and Kraft, P. and Smoller, J.}
}

Abstract

Abstract Background Hospital-based biobanks are being increasingly considered as a resource for translating polygenic risk scores (PRS) into clinical practice. However, since these biobanks originate from patient populations, there is a possibility of bias in polygenic risk estimation due to overrepresentation of patients with higher frequency of healthcare interactions. Methods PRS for schizophrenia, bipolar disorder, and depression were calculated using summary statistics from the largest available genomic studies for a sample of 24 153 European ancestry participants in the Mass General Brigham (MGB) Biobank. To correct for selection bias, we fitted logistic regression models with inverse probability (IP) weights, which were estimated using 1839 sociodemographic, clinical, and healthcare utilization features extracted from electronic health records of 1 546 440 non-Hispanic White patients eligible to participate in the Biobank study at their first visit to the MGB-affiliated hospitals. Results Case prevalence of bipolar disorder among participants in the top decile of bipolar disorder PRS was 10.0% (95% CI 8.8–11.2%) in the unweighted analysis but only 6.2% (5.0–7.5%) when selection bias was accounted for using IP weights. Similarly, case prevalence of depression among those in the top decile of depression PRS was reduced from 33.5% (31.7–35.4%) to 28.9% (25.8–31.9%) after IP weighting. Conclusions Non-random selection of participants into volunteer biobanks may induce clinically relevant selection bias that could impact implementation of PRS in research and clinical settings. As efforts to integrate PRS in medical practice expand, recognition and mitigation of these biases should be considered and may need to be optimized in a context-specific manner.


Share

Tools
Translate to