PaintorPipe: a pipeline for genetic variant fine-mapping using functional annotations

Bioinform Adv. 2023 Dec 21;4(1):vbad188. doi: 10.1093/bioadv/vbad188. eCollection 2024.

Abstract

Motivation: Genome-wide association studies (GWAS) have identified thousands of genetic variants associated with common diseases. These results include a mix of causal and non-causal variants related through strong linkage disequilibrium (LD, i.e. highly correlated). Fine-mapping methods have been developed to decipher the causal from non-causal variants using GWAS results and LD information, assigning to each variant a probability of being causal. In this field, the PAINTOR program has become a standard, one of its advantages being its ability to take into account functional annotations. This approach requires many pre- and post-processing steps. Here, we developed a Nextflow pipeline called PaintorPipe that wraps all these steps and the fine-mapping itself together. PaintorPipe uses three independent sources of information: GWAS summary statistics, LD information and functional annotations, to rank the variants according to their susceptibility to be involved in the disease development. The PAINTOR framework is used to calculate the posterior probability of each variant (single nucleotide polymorphism) to be causal (a.k.a. Bayesian fine-mapping). The resulting credible sets of variants are annotated with their biological functions and visualized using CANVIS. This pipeline requires minimal input from users (a GWAS summary statistics file and a set of functional annotation files) and is designed to be modular and customizable, allowing for an easy integration of diverse functional annotations.

Availability and implementation: PaintorPipe is implemented in the Nextflow pipeline specific language, can be run locally or on a slurm cluster and handles containerization using Singularity. PaintorPipe is freely available on GitHub (https://github.com/sdjebali/PaintorPipe).