ERAPlotype determines ERAP1 haplotype from human whole exome sequencing data.
Endoplasmic reticulum aminopeptidase 1 (ERAP1) is an important protein for processing and transport in the endoplasmic reticulum. Due to its involvement in the processing of peptides prior to MHC-I presentation, mutations in ERAP1 that alter its activity are associated predisposition to certain autoinflammatory diseases.
There are defined naturally occuring haplotypes of ERAP1, which reflect different combinations of single nucleotide polymorphisms. Although the full implications of these haplotypes remain unknown, certain haplotypes have been associated with specific diseases. For example, HLA-B-51+ individuals homozygous for ERAP1 Hap10 have been found to have around a 3x higher predisposition to Behcet's Disease than HLA-B-51 individuals with other ERAP1 haplotypes.
Where ERAP1 haplotypes are traditionally determined through genotyping assays or manual sequence scrutiny, ERAPlotype determines both ERAP1 haplotypes from whole exome sequencing data.
ERAPlotype takes input of an annotated csv file (which can be generated quickly and freely from a vcf file through wANNOVAR. ERAPlotype filters this file for ERAP1 variants, and then computes both ERAP1 haplotypes based on the combination of mutations present.
ERAPloype uses the definition of ERAP1 haplotypes as described in Ombrello et al, 2015.
N.B. It is not possible to distinguish between Hap1/Hap4 and Hap2/Hap6 genotypes using only short read sequencing data. In this specific scenario, both haplotype combinations are returned along with a warning message.
ERAPlotype uses RStudio and the following R packages:
- Dplyr
- Readr
- Stringr
- Tidyr
- Create a file named 'ERAPlotype'.
- Download the ERAPlotype R script and install packages accordingly using install.packages().
- Download the ERAP_ref_df.csv to ERAPlotype folder.
- If file for analysis is in VCF format, annotate the file at wANNOVAR.
- Download the resulting exome csv file to ERAPlotype folder.
- Enter file name where indicated within script.
- Run script.
- ERAP1 haplotypes HapA/HapB are printed.
Alice Burleigh
University College London Great Ormond Street Institute of Child Health
If you use ERAPlotype, please reference ANNOVAR accordingly:
Wang, K., M. Li, and H. Hakonarson. 2010. 'ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data', Nucleic Acids Research, 38: e164-e64.