A multivariate test of association Ferreira, Manuel A. R.; Purcell, Shaun M.
Bioinformatics,
01/2009, Letnik:
25, Številka:
1
Journal Article
Recenzirano
Odprti dostop
Although genetic association studies often test multiple, related phenotypes, few formal multivariate tests of association are available. We describe a test of association that can be efficiently ...applied to large population-based designs. Availability: A C++ implementation can be obtained from the authors. Contact: manuel.ferreira@qimr.edu.au Supplementary information: Supplementary figures are available at Bioinformatics online.
Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets ...in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis.
The study of continuously varying, quantitative traits is important in evolutionary biology, agriculture, and medicine. Variation in such traits is attributable to many, possibly interacting, genes ...whose expression may be sensitive to the environment, which makes their dissection into underlying causative factors difficult. An important population parameter for quantitative traits is heritability, the proportion of total variance that is due to genetic factors. Response to artificial and natural selection and the degree of resemblance between relatives are all a function of this parameter. Following the classic paper by R. A. Fisher in 1918, the estimation of additive and dominance genetic variance and heritability in populations is based upon the expected proportion of genes shared between different types of relatives, and explicit, often controversial and untestable models of genetic and non-genetic causes of family resemblance. With genome-wide coverage of genetic markers it is now possible to estimate such parameters solely within families using the actual degree of identity-by-descent sharing between relatives. Using genome scans on 4,401 quasi-independent sib pairs of which 3,375 pairs had phenotypes, we estimated the heritability of height from empirical genome-wide identity-by-descent sharing, which varied from 0.374 to 0.617 (mean 0.498, standard deviation 0.036). The variance in identity-by-descent sharing per chromosome and per genome was consistent with theory. The maximum likelihood estimate of the heritability for height was 0.80 with no evidence for non-genetic causes of sib resemblance, consistent with results from independent twin and family studies but using an entirely separate source of information. Our application shows that it is feasible to estimate genetic variance solely from within-family segregation and provides an independent validation of previously untestable assumptions. Given sufficient data, our new paradigm will allow the estimation of genetic variation for disease susceptibility and quantitative traits that is free from confounding with non-genetic factors and will allow partitioning of genetic variation into additive and non-additive components.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
The extent to which genetic risk factors are shared between childhood-onset (COA) and adult-onset (AOA) asthma has not been estimated. On the basis of data from the UK Biobank study (n = 447,628), we ...found that the variance in disease liability explained by common variants is higher for COA (onset at ages between 0 and 19 years; h2g = 25.6%) than for AOA (onset at ages between 20 and 60 years; h2g = 10.6%). The genetic correlation (rg) between COA and AOA was 0.67. Variation in age of onset among COA-affected individuals had a low heritability (h2g = 5%), which we confirmed in independent studies and also among AOA-affected individuals. To identify subtype-specific genetic associations, we performed a genome-wide association study (GWAS) in the UK Biobank for COA (13,962 affected individuals) and a separate GWAS for AOA (26,582 affected individuals) by using a common set of 300,671 controls for both studies. We identified 123 independent associations for COA and 56 for AOA (37 overlapped); of these, 98 and 34, respectively, were reproducible in an independent study (n = 262,767). Collectively, 28 associations were not previously reported. For 96 COA-associated variants, including five variants that represent COA-specific risk factors, the risk allele was more common in COA- than in AOA-affected individuals. Conversely, we identified three variants that are stronger risk factors for AOA. Variants associated with obesity and smoking had a stronger contribution to the risk of AOA than to the risk of COA. Lastly, we identified 109 likely target genes of the associated variants, primarily on the basis of correlated expression quantitative trait loci (up to n = 31,684). GWAS informed by age of onset can identify subtype-specific risk variants, which can help us understand differences in pathophysiology between COA and AOA and so can be informative for drug development.
We present a method for testing overrepresentation of biological pathways, indexed by gene-ontology terms, in lists of significant SNPs from genome-wide association studies. This method corrects for ...linkage disequilibrium between SNPs, variable gene size, and multiple testing of nonindependent pathways. The method was applied to the Wellcome Trust Case-Control Consortium Crohn disease (CD) data set. At a general level, the biological basis of CD is relatively well known for a complex genetic trait, and it thus acted as a test of the method. The method, known as ALIGATOR (Association LIst Go AnnoTatOR), successfully detected biological pathways implicated in CD. The method was also applied to a meta-analysis of bipolar disorder, and it implicated the modulation of transcription and cellular activity, including that which occurs via hormonal action, as an important player in pathogenesis.
Micro/nanomotors represent a burgeoning field of research featuring small devices capable of autonomous movement in liquid environments through catalytic reactions and/or external stimuli. This ...review delves into recent advancements in light-driven semiconductor-based micro/nanomotors (LDSM), focusing on optimized syntheses, enhanced motion mechanisms, and emerging applications in the environmental and biomedical domains. The survey commences with a theoretical introduction to micromotors and their propulsion mechanisms, followed by an exploration of commonly studied LDSM, emphasizing their advantages. Critical properties affecting propulsion, such as surface features, morphology, and size, are presented alongside discussions on external conditions related to light sources and intensity, which are crucial for optimizing the propulsion speed. Each property is accompanied by a theoretical background and conclusions drawn up to 2018. The review further investigates recent adaptations of LDSM, uncovering underlying mechanisms and associated benefits. A brief discussion is included on potential synergistic effects between different external conditions, aiming to enhance efficiency-a relatively underexplored topic. In conclusion, the review outlines emerging applications in biomedicine and environmental monitoring/remediation resulting from recent LDSM research, highlighting the growing significance of this field. The comprehensive exploration of LDSM advancements provides valuable insights for researchers and practitioners seeking to leverage these innovative micro/nanomotors in diverse applications.
The causes of autism are largely unknown. This study establishes that aberrant dosage of a large genomic segment is associated with autism spectrum disorder. Deletion or duplication of the segment, ...which encompasses 25 known genes, was present in approximately 1% of case subjects and less than 0.1% of unscreened control subjects.
This study establishes that an aberrant dosage of a large genomic segment is associated with autism spectrum disorder. Deletion or duplication of the segment, which encompasses 25 known genes, was present in approximately 1% of case subjects and less than 0.1% of unscreened control subjects.
Autism is a pervasive developmental disorder defined by a neurobehavioral phenotype that includes social disability, communication impairment, repetitive behaviors, and restricted interests. The onset is generally before the age of 3 years, and the disorder has a prevalence of 0.6% in the population, affecting many more boys than girls.
1
Results of twin and family studies have shown that the heritability of autism is approximately 90%, making it one of the most heritable complex disorders.
2
In approximately 10% of patients, autism can be explained by genetic syndromes and known chromosomal anomalies (most of which have recognizable features in addition to autism), . . .
A major goal in human genetics is to use natural variation to understand the phenotypic consequences of altering each protein-coding gene in the genome. Here we used exome sequencing
to explore ...protein-altering variants and their consequences in 454,787 participants in the UK Biobank study
. We identified 12 million coding variants, including around 1 million loss-of-function and around 1.8 million deleterious missense variants. When these were tested for association with 3,994 health-related traits, we found 564 genes with trait associations at P ≤ 2.18 × 10
. Rare variant associations were enriched in loci from genome-wide association studies (GWAS), but most (91%) were independent of common variant signals. We discovered several risk-increasing associations with traits related to liver disease, eye disease and cancer, among others, as well as risk-lowering associations for hypertension (SLC9A3R2), diabetes (MAP3K15, FAM234A) and asthma (SLC27A3). Six genes were associated with brain imaging phenotypes, including two involved in neural development (GBE1, PLD1). Of the signals available and powered for replication in an independent cohort, 81% were confirmed; furthermore, association signals were generally consistent across individuals of European, Asian and African ancestry. We illustrate the ability of exome sequencing to identify gene-trait associations, elucidate gene function and pinpoint effector genes that underlie GWAS signals at scale.
Background To date, no genome-wide association study (GWAS) has considered the combined phenotype of asthma with hay fever. Previous analyses of family data from the Tasmanian Longitudinal Health ...Study provide evidence that this phenotype has a stronger genetic cause than asthma without hay fever. Objective We sought to perform a GWAS of asthma with hay fever to identify variants associated with having both diseases. Methods We performed a meta-analysis of GWASs comparing persons with both physician-diagnosed asthma and hay fever (n = 6,685) with persons with neither disease (n = 14,091). Results At genome-wide significance, we identified 11 independent variants associated with the risk of having asthma with hay fever, including 2 associations reaching this level of significance with allergic disease for the first time: ZBTB10 (rs7009110; odds ratio OR, 1.14; P = 4 × 10−9 ) and CLEC16A (rs62026376; OR, 1.17; P = 1 × 10−8 ). The rs62026376:C allele associated with increased asthma with hay fever risk has been found to be associated also with decreased expression of the nearby DEXI gene in monocytes. The 11 variants were associated with the risk of asthma and hay fever separately, but the estimated associations with the individual phenotypes were weaker than with the combined asthma with hay fever phenotype. A variant near LRRC32 was a stronger risk factor for hay fever than for asthma, whereas the reverse was observed for variants in/near GSDMA and TSLP . Single nucleotide polymorphisms with suggestive evidence for association with asthma with hay fever risk included rs41295115 near IL2RA (OR, 1.28; P = 5 × 10−7 ) and rs76043829 in TNS1 (OR, 1.23; P = 2 × 10−6 ). Conclusion By focusing on the combined phenotype of asthma with hay fever, variants associated with the risk of allergic disease can be identified with greater efficiency.
Hair morphology is highly differentiated between populations and among people of European ancestry. Whereas hair morphology in East Asian populations has been studied extensively, relatively little ...is known about the genetics of this trait in Europeans. We performed a genome-wide association scan for hair morphology (straight, wavy, curly) in three Australian samples of European descent. All three samples showed evidence of association implicating the Trichohyalin gene (TCHH), which is expressed in the developing inner root sheath of the hair follicle, and explaining ∼6% of variance (p = 1.5 × 10−31). These variants are at their highest frequency in Northern Europeans, paralleling the distribution of the straight-hair EDAR variant in Asian populations.