SORA

Advancing, promoting and sharing knowledge of health through excellence in teaching, clinical practice and research into the prevention and treatment of illness

Exploring the potential benefits of stratified false discovery rates for region-based testing of association with rare genetic variation.

Xu, C; Ciampi, A; Greenwood, CMT; UK10K Consortium (2014) Exploring the potential benefits of stratified false discovery rates for region-based testing of association with rare genetic variation. Front Genet, 5. p. 11. ISSN 1664-8021 https://doi.org/10.3389/fgene.2014.00011
SGUL Authors: Jamshidi, Yalda

[img]
Preview
PDF Published Version
Available under License Creative Commons Attribution.

Download (1MB) | Preview

Abstract

When analyzing the data that arises from exome or whole-genome sequencing studies, window-based tests, (i.e., tests that jointly analyze all genetic data in a small genomic region), are very popular. However, power is known to be quite low for finding associations with phenotypes using these tests, and therefore a variety of analytic strategies may be employed to potentially improve power. Using sequencing data of all of chromosome 3 from an interim release of data on 2432 individuals from the UK10K project, we simulated phenotypes associated with rare genetic variation, and used the results to explore the window-based test power. We asked two specific questions: firstly, whether there could be substantial benefits associated with incorporating information from external annotation on the genetic variants, and secondly whether the false discovery rate (FDRs) would be a useful metric for assessing significance. Although, as expected, there are benefits to using additional information (such as annotation) when it is associated with causality, we confirmed the general pattern of low sensitivity and power for window-based tests. For our chosen example, even when power is high to detect some of the associations, many of the regions containing causal variants are not detectable, despite using lax significance thresholds and optimal analytic methods. Furthermore, our estimated FDR values tended to be much smaller than the true FDRs. Long-range correlations between variants-due to linkage disequilibrium-likely explain some of this bias. A more sophisticated approach to using the annotation information may improve power, however, many causal variants of realistic effect sizes may simply be undetectable, at least with this sample size. Perhaps annotation information could assist in distinguishing windows containing causal variants from windows that are merely correlated with causal variants.

Item Type: Article
Additional Information: Copyright © 2014 Xu, Ciampi and Greenwood. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
Keywords: SNV, false discovery rate, genomic annotation, multiple testing, rare genetic variants, stratified false discovery rate, whole genome sequencing, window-based tests, UK10K Consortium
SGUL Research Institute / Research Centre: Academic Structure > Molecular and Clinical Sciences Research Institute (MCS)
Journal or Publication Title: Front Genet
ISSN: 1664-8021
Language: eng
Dates:
DateEvent
29 January 2014Published
13 January 2014Accepted
Publisher License: Creative Commons: Attribution 4.0
PubMed ID: 24523729
Web of Science ID: WOS:000346979500001
Go to PubMed abstract
URI: https://openaccess.sgul.ac.uk/id/eprint/112982
Publisher's version: https://doi.org/10.3389/fgene.2014.00011

Actions (login required)

Edit Item Edit Item