Current diagnostic testing for genetic disorders involves serial use of specialized assays spanning multiple technologies. In principle, genome sequencing (GS) can detect all genomic pathogenic ...variant types on a single platform. Here we evaluate copy-number variant (CNV) calling as part of a clinically accredited GS test.
We performed analytical validation of CNV calling on 17 reference samples, compared the sensitivity of GS-based variants with those from a clinical microarray, and set a bound on precision using orthogonal technologies. We developed a protocol for family-based analysis of GS-based CNV calls, and deployed this across a clinical cohort of 79 rare and undiagnosed cases.
We found that CNV calls from GS are at least as sensitive as those from microarrays, while only creating a modest increase in the number of variants interpreted (~10 CNVs per case). We identified clinically significant CNVs in 15% of the first 79 cases analyzed, all of which were confirmed by an orthogonal approach. The pipeline also enabled discovery of a uniparental disomy (UPD) and a 50% mosaic trisomy 14. Directed analysis of select CNVs enabled breakpoint level resolution of genomic rearrangements and phasing of de novo CNVs.
Robust identification of CNVs by GS is possible within a clinical testing environment.
We report the first whole-genome sequences for five strains, two carried and three pathogenic, of the emerging pathogen Haemophilus haemolyticus. Preliminary analyses indicate that these genome ...sequences encode markers that distinguish H. haemolyticus from its closest Haemophilus relatives and provide clues to the identity of its virulence factors.
Predicting Patients health is a critical task in the Healthcare Industry. Healthcare datasets show a high degree of imbalance especially for rare diseases. The current work aims at predicting the ...post operative survival rate in thoracic surgery datasets. The dataset exhibits data imbalance with around 15% positive cases and remaining 85% negative cases. The commonly applicable machine learning techniques for prediction score poorly in predicting the positive cases in spite of high accuracy of the predictions for the negative cases. We use SMOTE (synthetic minority oversampling technique) to reduce the degree of imbalance and increase the positive samples proportion before the application of the following classifiers: Naive Bayes, Neural Networks, Random Forest, Boosting algorithms Adaboost, Extreme Gradient boosting and Support Vector Machines and examine the results. The study shows that SVM and Naïve Bayes show significantly better performance on the imbalanced datasets than other models using synthetic datasets than under normal conditions.
Repeat expansion disorders affect about 1 in 3000 individuals and are clinically heterogeneous diseases caused by expansions of short tandem DNA repeats. Genetic testing is often locus-specific, ...resulting in underdiagnosis of people who have atypical clinical presentations, especially in paediatric patients without a previous positive family history. Whole genome sequencing is increasingly used as a first-line test for other rare genetic disorders, and we aimed to assess its performance in the diagnosis of patients with neurological repeat expansion disorders.
We retrospectively assessed the diagnostic accuracy of whole genome sequencing to detect the most common repeat expansion loci associated with neurological outcomes (AR, ATN1, ATXN1, ATXN2, ATXN3, ATXN7, C9orf72, CACNA1A, DMPK, FMR1, FXN, HTT, and TBP) using samples obtained within the National Health Service in England from patients who were suspected of having neurological disorders; previous PCR test results were used as the reference standard. The clinical accuracy of whole genome sequencing to detect repeat expansions was prospectively examined in previously genetically tested and undiagnosed patients recruited in 2013–17 to the 100 000 Genomes Project in the UK, who were suspected of having a genetic neurological disorder (familial or early-onset forms of ataxia, neuropathy, spastic paraplegia, dementia, motor neuron disease, parkinsonian movement disorders, intellectual disability, or neuromuscular disorders). If a repeat expansion call was made using whole genome sequencing, PCR was used to confirm the result.
The diagnostic accuracy of whole genome sequencing to detect repeat expansions was evaluated against 793 PCR tests previously performed within the NHS from 404 patients. Whole genome sequencing correctly classified 215 of 221 expanded alleles and 1316 of 1321 non-expanded alleles, showing 97·3% sensitivity (95% CI 94·2–99·0) and 99·6% specificity (99·1–99·9) across the 13 disease-associated loci when compared with PCR test results. In samples from 11 631 patients in the 100 000 Genomes Project, whole genome sequencing identified 81 repeat expansions, which were also tested by PCR: 68 were confirmed as repeat expansions in the full pathogenic range, 11 were non-pathogenic intermediate expansions or premutations, and two were non-expanded repeats (16% false discovery rate).
In our study, whole genome sequencing for the detection of repeat expansions showed high sensitivity and specificity, and it led to identification of neurological repeat expansion disorders in previously undiagnosed patients. These findings support implementation of whole genome sequencing in clinical laboratories for diagnosis of patients who have a neurological presentation consistent with a repeat expansion disorder.
Medical Research Council, Department of Health and Social Care, National Health Service England, National Institute for Health Research, and Illumina.
Whole-genome sequencing (WGS) shows promise as a first-line genetic test for acutely ill infants, but widespread adoption and implementation requires evidence of an effect on clinical management.
To ...determine the effect of WGS on clinical management in a racially and ethnically diverse and geographically distributed population of acutely ill infants in the US.
This randomized, time-delayed clinical trial enrolled participants from September 11, 2017, to April 30, 2019, with an observation period extending to July 2, 2019. The study was conducted at 5 US academic medical centers and affiliated children's hospitals. Participants included infants aged between 0 and 120 days who were admitted to an intensive care unit with a suspected genetic disease. Data were analyzed from January 14 to August 20, 2020.
Patients were randomized to receive clinical WGS results 15 days (early) or 60 days (delayed) after enrollment, with the observation period extending to 90 days. Usual care was continued throughout the study.
The main outcome was the difference in the proportion of infants in the early and delayed groups who received a change of management (COM) 60 days after enrollment. Additional outcome measures included WGS diagnostic efficacy, within-group COM at 90 days, length of hospital stay, and mortality.
A total of 354 infants were randomized to the early (n = 176) or delayed (n = 178) arms. The mean participant age was 15 days (IQR, 7-32 days); 201 participants (56.8%) were boys; 19 (5.4%) were Asian; 47 (13.3%) were Black; 250 (70.6%) were White; and 38 (10.7%) were of other race. At 60 days, twice as many infants in the early group vs the delayed group received a COM (34 of 161 21.1%; 95% CI, 15.1%-28.2% vs 17 of 165 10.3%; 95% CI, 6.1%-16.0%; P = .009; odds ratio, 2.3; 95% CI, 1.22-4.32) and a molecular diagnosis (55 of 176 31.0%; 95% CI, 24.5%-38.7% vs 27 of 178 15.0%; 95% CI, 10.2%-21.3%; P < .001). At 90 days, the delayed group showed a doubling of COM (to 45 of 161 28.0%; 95% CI, 21.2%-35.6%) and diagnostic efficacy (to 56 of 178 31.0%; 95% CI, 24.7%-38.8%). The most frequent COMs across the observation window were subspecialty referrals (39 of 354; 11%), surgery or other invasive procedures (17 of 354; 4%), condition-specific medications (9 of 354; 2%), or other supportive alterations in medication (12 of 354; 3%). No differences in length of stay or survival were observed.
In this randomized clinical trial, for acutely ill infants in an intensive care unit, introduction of WGS was associated with a significant increase in focused clinical management compared with usual care. Access to first-line WGS may reduce health care disparities by enabling diagnostic equity. These data support WGS adoption and implementation in this population.
ClinicalTrials.gov Identifier: NCT03290469.
Identifying large expansions of short tandem repeats (STRs), such as those that cause amyotrophic lateral sclerosis (ALS) and fragile X syndrome, is challenging for short-read whole-genome sequencing ...(WGS) data. A solution to this problem is an important step toward integrating WGS into precision medicine. We developed a software tool called ExpansionHunter that, using PCR-free WGS short-read data, can genotype repeats at the locus of interest, even if the expanded repeat is larger than the read length. We applied our algorithm to WGS data from 3001 ALS patients who have been tested for the presence of the
repeat expansion with repeat-primed PCR (RP-PCR). Compared against this truth data, ExpansionHunter correctly classified all (212/212, 95% CI 0.98, 1.00) of the expanded samples as either expansions (208) or potential expansions (4). Additionally, 99.9% (2786/2789, 95% CI 0.997, 1.00) of the wild-type samples were correctly classified as wild type by this method with the remaining three samples identified as possible expansions. We further applied our algorithm to a set of 152 samples in which every sample had one of eight different pathogenic repeat expansions, including those associated with fragile X syndrome, Friedreich's ataxia, and Huntington's disease, and correctly flagged all but one of the known repeat expansions. Thus, ExpansionHunter can be used to accurately detect known pathogenic repeat expansions and provides researchers with a tool that can be used to identify new pathogenic repeat expansions.
Here we report whole exome sequencing (WES) on a cohort of 71 patients with persistently unresolved white matter abnormalities with a suspected diagnosis of leukodystrophy or genetic ...leukoencephalopathy. WES analyses were performed on trio, or greater, family groups. Diagnostic pathogenic variants were identified in 35% (25 of 71) of patients. Potentially pathogenic variants were identified in clinically relevant genes in a further 7% (5 of 71) of cases, giving a total yield of clinical diagnoses in 42% of individuals. These findings provide evidence that WES can substantially decrease the number of unresolved white matter cases. Ann Neurol 2016;79:1031–1037
There is mounting evidence of the value of clinical genome sequencing (cGS) in individuals with suspected rare genetic disease (RGD), but cGS performance and impact on clinical care in a diverse ...population drawn from both high-income countries (HICs) and low- and middle-income countries (LMICs) has not been investigated. The iHope program, a philanthropic cGS initiative, established a network of 24 clinical sites in eight countries through which it provided cGS to individuals with signs or symptoms of an RGD and constrained access to molecular testing. A total of 1,004 individuals (median age, 6.5 years; 53.5% male) with diverse ancestral backgrounds (51.8% non-majority European) were assessed from June 2016 to September 2021. The diagnostic yield of cGS was 41.4% (416/1,004), with individuals from LMIC sites 1.7 times more likely to receive a positive test result compared to HIC sites (LMIC 56.5% 195/345 vs. HIC 33.5% 221/659, OR 2.6, 95% CI 1.9–3.4, p < 0.0001). A change in diagnostic evaluation occurred in 76.9% (514/668) of individuals. Change of management, inclusive of specialty referrals, imaging and testing, therapeutic interventions, and palliative care, was reported in 41.4% (285/694) of individuals, which increased to 69.2% (480/694) when genetic counseling and avoidance of additional testing were also included. Individuals from LMIC sites were as likely as their HIC counterparts to experience a change in diagnostic evaluation (OR 6.1, 95% CI 1.1–∞, p = 0.05) and change of management (OR 0.9, 95% CI 0.5–1.3, p = 0.49). Increased access to genomic testing may support diagnostic equity and the reduction of global health care disparities.
iHope provided clinical genome testing to a global population of 1,004 underserved individuals with suspected rare genetic disease. Diagnostic yield, and the proportion with a change of management, was >40% for individuals from both low- and middle-income or high-income countries, suggesting that widespread availability of genomic testing may reduce health disparities.