SORA

Advancing, promoting and sharing knowledge of health through excellence in teaching, clinical practice and research into the prevention and treatment of illness

A Machine Learning Model to Aid Detection of Familial Hypercholesterolemia

Gratton, J; Futema, M; Humphries, SE; Hingorani, AD; Finan, C; Schmidt, AF (2023) A Machine Learning Model to Aid Detection of Familial Hypercholesterolemia. JACC: Advances, 2 (4). p. 100333. ISSN 2772-963X https://doi.org/10.1016/j.jacadv.2023.100333
SGUL Authors: Futema, Marta

[img]
Preview
PDF Published Version
Available under License Creative Commons Attribution.

Download (1MB) | Preview
[img] Microsoft Word (.docx) (Supplementary information) Published Version
Available under License Creative Commons Attribution.

Download (764kB)

Abstract

Background People with monogenic familial hypercholesterolemia (FH) are at an increased risk of premature coronary heart disease and death. With a prevalence of 1:250, FH is relatively common; but currently there is no population screening strategy in place and most carriers are identified late in life, delaying timely and cost-effective interventions. Objectives The purpose of this study was to derive an algorithm to identify people with suspected monogenic FH for subsequent confirmatory genomic testing and cascade screening. Methods A least absolute shrinkage and selection operator logistic regression model was used to identify predictors that accurately identified people with FH in 139,779 unrelated participants of the UK Biobank. Candidate predictors included information on medical and family history, anthropometric measures, blood biomarkers, and a low-density lipoprotein cholesterol (LDL-C) polygenic score (PGS). Model derivation and evaluation were performed in independent training and testing data. Results A total of 488 FH variant carriers were identified using whole-exome sequencing of the low-density lipoprotein receptor, apolipoprotein B, apolipoprotein E, proprotein convertase subtilisin/kexin type 9 genes. A 14-variable algorithm for FH was derived, with an area under the curve of 0.77 (95% CI: 0.71-0.83), where the top 5 most important variables included triglyceride, LDL-C, apolipoprotein A1 concentrations, self-reported statin use, and LDL-C PGS. Excluding the PGS as a candidate feature resulted in a 9-variable model with a comparable area under the curve: 0.76 (95% CI: 0.71-0.82). Both multivariable models (w/wo the PGS) outperformed screening-prioritization based on LDL-C adjusted for statin use. Conclusions Detecting individuals with FH can be improved by considering additional predictors. This would reduce the sequencing burden in a 2-stage population screening strategy for FH.

Item Type: Article
Additional Information: © 2023 The Authors. Published by Elsevier on behalf of the American College of Cardiology Foundation. This is an Open Access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/)
SGUL Research Institute / Research Centre: Academic Structure > Molecular and Clinical Sciences Research Institute (MCS)
Journal or Publication Title: JACC: Advances
ISSN: 2772-963X
Language: en
Dates:
DateEvent
30 June 2023Published
24 May 2023Published Online
17 March 2023Accepted
Publisher License: Creative Commons: Attribution 4.0
Projects:
Project IDFunderFunder ID
FS/17/70/33482British Heart Foundationhttp://dx.doi.org/10.13039/501100000274
PG 08/008British Heart Foundationhttp://dx.doi.org/10.13039/501100000274
UNSPECIFIEDNational Institute for Health Researchhttp://dx.doi.org/10.13039/501100000272
PG/18/5033837British Heart Foundationhttp://dx.doi.org/10.13039/501100000274
PG/22/10989British Heart Foundationhttp://dx.doi.org/10.13039/501100000274
AA/18/6/34223British Heart Foundationhttp://dx.doi.org/10.13039/501100000274
URI: https://openaccess.sgul.ac.uk/id/eprint/115453
Publisher's version: https://doi.org/10.1016/j.jacadv.2023.100333

Actions (login required)

Edit Item Edit Item