Polygenic risk scores (PRS) have been widely applied in research studies, showing how population groups can be stratified into risk categories for many common conditions. As healthcare systems ...consider applying PRS to keep their populations healthy, little work has been carried out demonstrating their implementation at an individual level. Our results highlight the need for further standardisation in the way PRS are developed and shared, the importance of individual risk assessment rather than the assumption of inherited averages, and the challenges currently posed when translating PRS into risk metrics.
Primary membranoproliferative GN, including complement 3 (C3) glomerulopathy, is a rare, untreatable kidney disease characterized by glomerular complement deposition. Complement gene mutations can ...cause familial C3 glomerulopathy, and studies have reported rare variants in complement genes in nonfamilial primary membranoproliferative GN.
We analyzed whole-genome sequence data from 165 primary membranoproliferative GN cases and 10,250 individuals without the condition (controls) as part of the National Institutes of Health Research BioResource-Rare Diseases Study. We examined copy number, rare, and common variants.
Our analysis included 146 primary membranoproliferative GN cases and 6442 controls who were unrelated and of European ancestry. We observed no significant enrichment of rare variants in candidate genes (genes encoding components of the complement alternative pathway and other genes associated with the related disease atypical hemolytic uremic syndrome; 6.8% in cases versus 5.9% in controls) or exome-wide. However, a significant common variant locus was identified at 6p21.32 (rs35406322) (
=3.29×10
; odds ratio OR, 1.93; 95% confidence interval 95% CI, 1.53 to 2.44), overlapping the HLA locus. Imputation of HLA types mapped this signal to a haplotype incorporating DQA1*05:01, DQB1*02:01, and DRB1*03:01 (
=1.21×10
; OR, 2.19; 95% CI, 1.66 to 2.89). This finding was replicated by analysis of HLA serotypes in 338 individuals with membranoproliferative GN and 15,614 individuals with nonimmune renal failure.
We found that HLA type, but not rare complement gene variation, is associated with primary membranoproliferative GN. These findings challenge the paradigm of complement gene mutations typically causing primary membranoproliferative GN and implicate an underlying autoimmune mechanism in most cases.
Although best practices have emerged on how to analyse and interpret personal genomes, the utility of whole genome screening remains underdeveloped. A large amount of information can be gathered from ...various types of analyses via whole genome sequencing including pathogenicity screening, genetic risk scoring, fitness, nutrition, and pharmacogenomic analysis. We recognize different levels of confidence when assessing the validity of genetic markers and apply rigorous standards for evaluation of phenotype associations. We illustrate the application of this approach on a family of five. By applying analyses of whole genomes from different methodological perspectives, we are able to build a more comprehensive picture to assist decision making in preventative healthcare and well-being management. Our interpretation and reporting outputs provide input for a clinician to develop a healthcare plan for the individual, based on genetic and other healthcare data.
To identify novel causes of hereditary thrombocytopenia, we performed a genetic association analysis of whole-genome sequencing data from 13 037 individuals enrolled in the National Institute for ...Health Research (NIHR) BioResource, including 233 cases with isolated thrombocytopenia. We found an association between rare variants in the transcription factor-encoding gene IKZF5 and thrombocytopenia. We report 5 causal missense variants in or near IKZF5 zinc fingers, of which 2 occurred de novo and 3 co-segregated in 3 pedigrees. A canonical DNA-zinc finger binding model predicts that 3 of the variants alter DNA recognition. Expression studies showed that chromatin binding was disrupted in mutant compared with wild-type IKZF5, and electron microscopy revealed a reduced quantity of α granules in normally sized platelets. Proplatelet formation was reduced in megakaryocytes from 7 cases relative to 6 controls. Comparison of RNA-sequencing data from platelets, monocytes, neutrophils, and CD4+ T cells from 3 cases and 14 healthy controls showed 1194 differentially expressed genes in platelets but only 4 differentially expressed genes in each of the other blood cell types. In conclusion, IKZF5 is a novel transcriptional regulator of megakaryopoiesis and the eighth transcription factor associated with dominant thrombocytopenia in humans.
•Mutations in the transcription factor IKZF5 cause autosomal dominant thrombocytopenia and a paucity of α granules.•Although IKZF5 is expressed across hematopoietic lineages, misregulation in IKZF5 cases is restricted to the megakaryocytic lineage.
Display omitted
A germline heterozygous gain-of-function
.E527K variant in tyrosine kinase SRC was previously found to cause thrombocytopenia, myelofibrosis, bleeding, bone pathologies, premature edentulism and mild ...facial dysmorphia in nine patients of a single pedigree. Because of this variant, SRC loses its self-inhibitory capacity, causing constitutively active SRC expression in platelets. These patients have fewer and heterogeneous-sized platelets that are hyporeactive to collagen. We now report a 5-year-old girl with syndromic thrombocytopenia due to the same SRC-E527K variant that occurs
. A bone marrow biopsy, blood smear analysis, platelet aggregations, flow cytometric analysis of
-selectin, SRC expression and tyrosine phosphorylation studies were performed to confirm the similarities between the two families. This study strengthens our previous finding that hyperactive SRC kinase results in mild platelet dysfunction and thrombocytopenia with hypogranular platelets and further expands the clinical description of this syndrome to improve early recognition.
To facilitate early deployment of whole-genome sequencing (WGS) for severely ill children, a standardized pipeline for WGS analysis with timely turnaround and primary care pediatric uptake is needed. ...We developed a bioinformatics pipeline for comprehensive gene-agnostic trio WGS analysis of children suspected of having an undiagnosed monogenic disease that included detection and interpretation of primary genetic mechanisms of disease, including SNVs/indels, CNVs/SVs, uniparental disomy (UPD), imprinted genes, short tandem repeat expansions, mobile element insertions, SMN1/2 copy number calling, and mitochondrial genome variants. We assessed primary care practitioner experience and competence in a large cohort of 521 families (comprising 90% WGS trios). Children were identified by primary practitioners for recruitment, and we used the UK index of multiple deprivation to confirm lack of patient socio-economic status ascertainment bias. Of the 521 children sequenced, 176 (34%) received molecular diagnoses, with rates as high as 45% for neurology clinics. Twenty-three of the diagnosed cases (13%) required bespoke methods beyond routine SNV/CNV analysis. In our multidisciplinary clinician user experience assessment, both pediatricians and clinical geneticists expressed strong support for rapid WGS early in the care pathway, but requested further training in determining patient selection, consenting, and variant interpretation. Rapid trio WGS provides an efficacious single-pass screening test for children when deployed by primary practitioners in clinical settings that carry high a priori risk for rare pediatric disease presentations.
We describe a comprehensive, rapid, and gene-agnostic whole-genome sequence bioinformatics pipeline for analysis of DNA from severely ill children. Of 521 children, 34% received molecular diagnoses, with rates as high as 45% in pediatric neurology clinics. Both pediatricians and geneticist expressed strong support for this method.
Studies have shown that complex structural variants (cxSVs) contribute to human genomic variation and can cause Mendelian disease. We aimed to identify cxSVs relevant to Mendelian disease using ...short-read whole-genome sequencing (WGS), resolve the precise variant configuration and investigate possible mechanisms of cxSV formation.
We performed short-read WGS and analysis of breakpoint junctions to identify cxSVs in a cohort of 1324 undiagnosed rare disease patients. Long-read WGS and gene expression analysis were used to resolve one case.
We identified three pathogenic cxSVs: a de novo duplication-inversion-inversion-deletion affecting ARID1B, a de novo deletion-inversion-duplication affecting HNRNPU and a homozygous deletion-inversion-deletion affecting CEP78. Additionally, a de novo duplication-inversion-duplication overlapping CDKL5 was resolved by long-read WGS demonstrating the presence of both a disrupted and an intact copy of CDKL5 on the same allele, and gene expression analysis showed both parental alleles of CDKL5 were expressed. Breakpoint analysis in all the cxSVs revealed both microhomology and longer repetitive elements.
Our results corroborate that cxSVs cause Mendelian disease, and we recommend their consideration during clinical investigations. We show that resolution of breakpoints can be critical to interpret pathogenicity and present evidence of replication-based mechanisms in cxSV formation.
The discovery and mapping of cis-regulatory elements is important for understanding regulation of gene transcription in mosquito vectors of human diseases. Genome sequence data are available for 3 ...species, Aedes aegypti, Anopheles gambiae, and Culex quinquefasciatus (Diptera: Culicidae), representing 2 subfamilies (Culicinae and Anophelinae) that are estimated to have diverged 145 to 200 million years ago. Comparative genomics tools were used to screen genomic DNA fragments located in the 5'-end flanking regions of orthologous genes. These analyses resulted in the identification of 137 sequences, designated "mosquito motifs," 7 to 9 nucleotides in length, representing 18 families of putative cis-regulatory elements conserved significantly among the 3 species when compared to the fruit fly, Drosophila melanogaster. Forty-one of the motifs were implicated previously in experiments as sites for binding transcription factors or functioning in the regulation of mosquito gene expression. Further analyses revealed associations between specific motifs and expression profiles, particularly in those genes that show increased or decreased mRNA abundance in females following a blood meal, and those accumulating transcription products exclusively or preferentially in the midgut, fat bodies, or ovaries. These results validate the methodology and support a relationship between the discovered motifs and the conservation of hematophagy in mosquitoes.
The ArrayExpress Archive of Functional Genomics Data (http://www.ebi.ac.uk/arrayexpress) is an international functional genomics database at the European Bioinformatics Institute (EMBL-EBI) ...recommended by most journals as a repository for data supporting peer-reviewed publications. It contains data from over 7000 public sequencing and 42,000 array-based studies comprising over 1.5 million assays in total. The proportion of sequencing-based submissions has grown significantly over the last few years and has doubled in the last 18 months, whilst the rate of microarray submissions is growing slightly. All data in ArrayExpress are available in the MAGE-TAB format, which allows robust linking to data analysis and visualization tools and standardized analysis. The main development over the last two years has been the release of a new data submission tool Annotare, which has reduced the average submission time almost 3-fold. In the near future, Annotare will become the only submission route into ArrayExpress, alongside MAGE-TAB format-based pipelines. ArrayExpress is a stable and highly accessed resource. Our future tasks include automation of data flows and further integration with other EMBL-EBI resources for the representation of multi-omics data.