Accepted author manuscript, 287 KB, PDF document
Available under license: CC BY: Creative Commons Attribution 4.0 International License
Final published version
Research output: Contribution to Journal/Magazine › Journal article › peer-review
Research output: Contribution to Journal/Magazine › Journal article › peer-review
}
TY - JOUR
T1 - Genotyping, characterization, and imputation of known and novel CYP2A6 structural variants using SNP array data
AU - Langlois, Alec W R
AU - El-Boraie, Ahmed
AU - Pouget, Jennie G
AU - Cox, Lisa Sanderson
AU - Ahluwalia, Jasjit S
AU - Fukunaga, Koya
AU - Mushiroda, Taisei
AU - Knight, Jo
AU - Chenoweth, Meghan J
AU - Tyndale, Rachel F
PY - 2023/8/31
Y1 - 2023/8/31
N2 - CYP2A6 metabolically inactivates nicotine. Faster CYP2A6 activity is associated with heavier smoking and higher lung cancer risk. The CYP2A6 gene is polymorphic, including functional structural variants (SV) such as gene deletions (CYP2A6*4), duplications (CYP2A6*1 × 2), and hybrids with the CYP2A7 pseudogene (CYP2A6*12, CYP2A6*34). SVs are challenging to genotype due to their complex genetic architecture. Our aims were to develop a reliable protocol for SV genotyping, functionally phenotype known and novel SVs, and investigate the feasibility of CYP2A6 SV imputation from SNP array data in two ancestry populations. European- (EUR; n = 935) and African- (AFR; n = 964) ancestry individuals from smoking cessation trials were genotyped for SNPs using an Illumina array and for CYP2A6 SVs using Taqman copy number (CN) assays. SV-specific PCR amplification and Sanger sequencing was used to characterize a novel SV. Individuals with SVs were phenotyped using the nicotine metabolite ratio, a biomarker of CYP2A6 activity. SV diplotype and SNP array data were integrated and phased to generate ancestry-specific SV reference panels. Leave-one-out cross-validation was used to investigate the feasibility of CYP2A6 SV imputation. A minimal protocol requiring three Taqman CN assays for CYP2A6 SV genotyping was developed and known SV associations with activity were replicated. The first domain swap CYP2A6-CYP2A7 hybrid SV, CYP2A6*53, was identified, sequenced, and associated with lower CYP2A6 activity. In both EURs and AFRs, most SV alleles were identified using imputation (>70% and >60%, respectively); importantly, false positive rates were <1%. These results confirm that CYP2A6 SV imputation can identify most SV alleles, including a novel SV.
AB - CYP2A6 metabolically inactivates nicotine. Faster CYP2A6 activity is associated with heavier smoking and higher lung cancer risk. The CYP2A6 gene is polymorphic, including functional structural variants (SV) such as gene deletions (CYP2A6*4), duplications (CYP2A6*1 × 2), and hybrids with the CYP2A7 pseudogene (CYP2A6*12, CYP2A6*34). SVs are challenging to genotype due to their complex genetic architecture. Our aims were to develop a reliable protocol for SV genotyping, functionally phenotype known and novel SVs, and investigate the feasibility of CYP2A6 SV imputation from SNP array data in two ancestry populations. European- (EUR; n = 935) and African- (AFR; n = 964) ancestry individuals from smoking cessation trials were genotyped for SNPs using an Illumina array and for CYP2A6 SVs using Taqman copy number (CN) assays. SV-specific PCR amplification and Sanger sequencing was used to characterize a novel SV. Individuals with SVs were phenotyped using the nicotine metabolite ratio, a biomarker of CYP2A6 activity. SV diplotype and SNP array data were integrated and phased to generate ancestry-specific SV reference panels. Leave-one-out cross-validation was used to investigate the feasibility of CYP2A6 SV imputation. A minimal protocol requiring three Taqman CN assays for CYP2A6 SV genotyping was developed and known SV associations with activity were replicated. The first domain swap CYP2A6-CYP2A7 hybrid SV, CYP2A6*53, was identified, sequenced, and associated with lower CYP2A6 activity. In both EURs and AFRs, most SV alleles were identified using imputation (>70% and >60%, respectively); importantly, false positive rates were <1%. These results confirm that CYP2A6 SV imputation can identify most SV alleles, including a novel SV.
U2 - 10.1038/s10038-023-01148-y
DO - 10.1038/s10038-023-01148-y
M3 - Journal article
C2 - 37059825
VL - 68
SP - 533
EP - 541
JO - Journal of Human Genetics
JF - Journal of Human Genetics
SN - 1434-5161
ER -