Genome-wide association studies (GWAS) have evolved over the last ten years into a powerful tool for investigating the genetic architecture of human disease. In this work, we review the key concepts ...underlying GWAS, including the architecture of common diseases, the structure of common human genetic variation, technologies for capturing genetic information, study designs, and the statistical methods used for data analysis. We also look forward to the future beyond GWAS.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Advances in genotyping technology have, over the past decade, enabled the focused search for common genetic variation associated with human diseases and traits. With the recently increased ...availability of detailed phenotypic data from electronic health records and epidemiological studies, the impact of one or more genetic variants on the phenome is starting to be characterized both in clinical and population-based settings using phenome-wide association studies (PheWAS). These studies reveal a number of challenges that will need to be overcome to unlock the full potential of PheWAS for the characterization of the complex human genome-phenome relationship.
Transcriptional regulation is complex, requiring multiple cis (local) and trans acting mechanisms working in concert to drive gene expression, with disruption of these processes linked to multiple ...diseases. Previous computational attempts to understand the influence of regulatory mechanisms on gene expression have used prediction models containing input features derived from cis regulatory factors. However, local chromatin looping and trans-acting mechanisms are known to also influence transcriptional regulation, and their inclusion may improve model accuracy and interpretation. In this study, we create a general model of transcription factor influence on gene expression by incorporating both cis and trans gene regulatory features.
We describe a computational framework to model gene expression for GM12878 and K562 cell lines. This framework weights the impact of transcription factor-based regulatory data using multi-omics gene regulatory networks to account for both cis and trans acting mechanisms, and measures of the local chromatin context. These prediction models perform significantly better compared to models containing cis-regulatory features alone. Models that additionally integrate long distance chromatin interactions (or chromatin looping) between distal transcription factor binding regions and gene promoters also show improved accuracy. As a demonstration of their utility, effect estimates from these models were used to weight cis-regulatory rare variants for sequence kernel association test analyses of gene expression.
Our models generate refined effect estimates for the influence of individual transcription factors on gene expression, allowing characterization of their roles across the genome. This work also provides a framework for integrating multiple data types into a single model of transcriptional regulation.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Pharmacogenomic and Statistical Analysis Bai, Haimeng; Zhang, Xueyi; Bush, William S
Methods in molecular biology (Clifton, N.J.),
2023, Letnik:
2629
Journal Article
Genetic variants can alter response to drugs and other therapeutic interventions. The study of this phenomenon, called pharmacogenomics, is similar in many ways to other types of genetic studies but ...has distinct methodological and statistical considerations. Genetic variants involved in the processing of exogenous compounds exhibit great diversity and complexity, and the phenotypes studied in pharmacogenomics are also more complex than typical genetic studies. In this chapter, we review basic concepts in pharmacogenomic study designs, data generation techniques, statistical analysis approaches, and commonly used methods and briefly discuss the ultimate translation of findings to clinical care.
Many modern human genomes retain DNA inherited from interbreeding with archaic hominins, such as Neandertals, yet the influence of this admixture on human traits is largely unknown. We analyzed the ...contribution of common Neandertal variants to over 1000 electronic health record (EHR)–derived phenotypes in ~28,000 adults of European ancestry. We discovered and replicated associations of Neandertal alleles with neurological, psychiatric, immunological, and dermatological phenotypes. Neandertal alleles together explained a significant fraction of the variation in risk for depression and skin lesions resulting from sun exposure (actinic keratosis), and individual Neandertal alleles were significantly associated with specific human phenotypes, including hypercoagulation and tobacco use. Our results establish that archaic admixture influences disease risk in modern humans, provide hypotheses about the effects of hundreds of Neandertal haplotypes, and demonstrate the utility of EHR data in evolutionary analyses.
To evaluate the validity of, characterize the usage of, and propose potential research applications for International Classification of Diseases, Ninth Revision (ICD-9) tobacco codes in clinical ...populations.
Using data on cancer cases and cancer-free controls from Vanderbilt's biorepository, BioVU, we evaluated the utility of ICD-9 tobacco use codes to identify ever-smokers in general and high smoking prevalence (lung cancer) clinic populations. We assessed potential biases in documentation, and performed temporal analysis relating transitions between smoking codes to smoking cessation attempts. We also examined the suitability of these codes for use in genetic association analyses.
ICD-9 tobacco use codes can identify smokers in a general clinic population (specificity of 1, sensitivity of 0.32), and there is little evidence of documentation bias. Frequency of code transitions between 'current' and 'former' tobacco use was significantly correlated with initial success at smoking cessation (p<0.0001). Finally, code-based smoking status assignment is a comparable covariate to text-based smoking status for genetic association studies.
Our results support the use of ICD-9 tobacco use codes for identifying smokers in a clinical population. Furthermore, with some limitations, these codes are suitable for adjustment of smoking status in genetic studies utilizing electronic health records.
Researchers should not be deterred by the unavailability of full-text records to determine smoking status if they have ICD-9 code histories.
Recently, novel biotechnologies to quantify RNA modifications became an increasingly popular choice for researchers who study epitranscriptome. When studying RNA methylations such as ...N6-methyladenosine (m6A), researchers need to make several decisions in its experimental design, especially the sample size and a proper statistical power. Due to the complexity and high-throughput nature of m6A sequencing measurements, methods for power calculation and study design are still currently unavailable. In this work, we propose a statistical power assessment tool, magpie, for power calculation and experimental design for epitranscriptome studies using m6A sequencing data. Our simulation-based power assessment tool will borrow information from real pilot data, and inspect various influential factors including sample size, sequencing depth, effect size, and basal expression ranges. We integrate two modules in magpie: (i) a flexible and realistic simulator module to synthesize m6A sequencing data based on real data; and (ii) a power assessment module to examine a set of comprehensive evaluation metrics.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Hand position in the visual field influences performance in several visual tasks. Recent theoretical accounts have proposed that hand position either (a) influences the allocation of spatial ...attention, or (b) biases processing toward the magnocellular visual pathway. Comparing these accounts is difficult as some studies manipulate the distance of one hand in the visual field while others vary the distance of both hands, and it is unclear whether single and dual hand manipulations have the same impact on perception. We ask if hand position affects the spatial distribution of attention, with a broader distribution of attention when both hands are near a visual display and a narrower distribution when one hand is near a display. We examined the effects of four hand positions near the screen (left hand, right hand, both hands, no hands) on both temporal and spatial discrimination tasks. Placing two hands near the display compared to two hands distant resulted in improved sensitivity for the temporal task and reduced sensitivity in the spatial task, replicating previous results. However, the single hand manipulations showed the opposite pattern of results. Together these results suggest that visual attention is focused on the graspable space for a single hand, and expanded when two hands frame an area of the visual field.
Hypertension is more prevalent in African Americans (AA) than other ethnic groups. Genome-wide association studies (GWAS) have identified loci associated with hypertension and other cardio-metabolic ...traits like type 2 diabetes, coronary artery disease, and body mass index (BMI), however the AA population is underrepresented in these studies. In this study, we examined a large AA cohort for the generalizability of 14 Metabochip array SNPs with previously reported European hypertension associations.
To evaluate associations, we analyzed genotype data of 14 SNPs for their associations with a diagnosis of hypertension, systolic blood pressure (SBP), and diastolic blood pressure (DBP) in a case-control study of an AA population (N = 9,534). We also performed an age-stratified analysis (>30, 30≥59 and ≥60 years) following the hypertension definition described by the 8th Joint National Committee (JNC). Associations were adjusted for BMI, age, age2, sex, clinical confounders, and genetic ancestry using multivariable regression models to estimate odds ratios (ORs) and beta-coefficients. Analyses stratified by sex were also conducted. Meta-analyses (including both BioVU and COGENT-BP cohorts) were performed using a random-effects model.
We found rs880315 to be associated with systolic hypertension (SBP≥140 mmHg) in the entire cohort (OR = 1.14, p = 0.003) and within women only (OR = 1.16, p = 0.012). Variant rs17080093 associated with lower SBP and DBP (β = -2.99, p = 0.0352 and - β = 1.69, p = 0.0184) among younger individuals, particularly in younger women (β = -3.92, p = 0.0025 and β = -1.87, p = 0.0241 for SBP and DBP respectively). SNP rs1530440 associated with higher SBP and DBP measurements (younger individuals β = 4.1, p = 0.039 and β = 2.5, p = 0.043 for SBP and DBP; (younger women β = 4.5, p = 0.025 and β = 2.9, p = 0.028 for SBP and DBP), and hypertension risk in older women (OR = 1.4, p = 0.050). rs16948048 increases hypertension risk in younger individuals (OR = 1.31, p = 0.011). Among mid-age women rs880315 associated with higher risk of hypertension (OR = 1.20, p = 0.027). rs1361831 associated with DBP (β = -1.96, p = 0.02) among individuals older than 60 years. rs3096277 increases hypertension risk among older individuals (OR = 1.26 p = 0.0015), however, this variant also reduces SBP among younger women (β = -2.63, p = 0.0102).
These findings suggest that European-descent and AA populations share genetic loci that contribute to blood pressure traits and hypertension. However, the OR and beta-coefficient estimates differ, and some are age-dependent. Additional genetic studies of hypertension in AA are warranted to identify new loci associated with hypertension and blood pressure traits in this population.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
The importance of epistasis—or statistical interactions between genetic variants—to the development of complex disease in humans has been controversial. Genome-wide association studies of statistical ...interactions influencing human traits have recently become computationally feasible and have identified many putative interactions. However, statistical models used to detect interactions can be confounded, which makes it difficult to be certain that observed statistical interactions are evidence for true molecular epistasis. In this study, we investigate whether there is evidence for epistatic interactions between genetic variants within the cis-regulatory region that influence gene expression after accounting for technical, statistical, and biological confounding factors. We identified 1,119 (FDR = 5%) interactions that appear to regulate gene expression in human lymphoblastoid cell lines, a tightly controlled, largely genetically determined phenotype. Many of these interactions replicated in an independent dataset (90 of 803 tested, Bonferroni threshold). We then performed an exhaustive analysis of both known and novel confounders, including ceiling/floor effects, missing genotype combinations, haplotype effects, single variants tagged through linkage disequilibrium, and population stratification. Every interaction could be explained by at least one of these confounders, and replication in independent datasets did not protect against some confounders. Assuming that the confounding factors provide a more parsimonious explanation for each interaction, we find it unlikely that cis-regulatory interactions contribute strongly to human gene expression, which calls into question the relevance of cis-regulatory interactions for other human phenotypes. We additionally propose several best practices for epistasis testing to protect future studies from confounding.