Kernel machine SNP-set testing under multiple candidate kernels

Michael C Wu; Arnab Maity; Seunggeun Lee; Elizabeth M Simmons; Quaker E Harmon; Xinyi Lin; Stephanie M Engel; Jeffrey J Molldrem; Paul M Armistead

doi:10.1002/gepi.21715

Kernel machine SNP-set testing under multiple candidate kernels

Genet Epidemiol. 2013 Apr;37(3):267-75. doi: 10.1002/gepi.21715. Epub 2013 Mar 7.

Authors

Michael C Wu¹, Arnab Maity, Seunggeun Lee, Elizabeth M Simmons, Quaker E Harmon, Xinyi Lin, Stephanie M Engel, Jeffrey J Molldrem, Paul M Armistead

Affiliation

¹ Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7420, USA. mwu@bios.unc.edu

Abstract

Joint testing for the cumulative effect of multiple single-nucleotide polymorphisms grouped on the basis of prior biological knowledge has become a popular and powerful strategy for the analysis of large-scale genetic association studies. The kernel machine (KM)-testing framework is a useful approach that has been proposed for testing associations between multiple genetic variants and many different types of complex traits by comparing pairwise similarity in phenotype between subjects to pairwise similarity in genotype, with similarity in genotype defined via a kernel function. An advantage of the KM framework is its flexibility: choosing different kernel functions allows for different assumptions concerning the underlying model and can allow for improved power. In practice, it is difficult to know which kernel to use a priori because this depends on the unknown underlying trait architecture and selecting the kernel which gives the lowest P-value can lead to inflated type I error. Therefore, we propose practical strategies for KM testing when multiple candidate kernels are present based on constructing composite kernels and based on efficient perturbation procedures. We demonstrate through simulations and real data applications that the procedures protect the type I error rate and can lead to substantially improved power over poor choices of kernels and only modest differences in power vs. using the best candidate kernel.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Computer Simulation
Female
Genetic Association Studies
Genetic Predisposition to Disease
Humans
Infant, Newborn
Models, Genetic*
Phenotype
Polymorphism, Single Nucleotide*
Pregnancy
Premature Birth / genetics*
Software*

Abstract

Publication types

MeSH terms

Grants and funding