Haemoglobinopathies are the commonest monogenic diseases worldwide and are caused by variants in the globin gene clusters. With over 2400 variants detected to date, their interpretation using the ...American College of Medical Genetics and Genomics (ACMG)/Association for Molecular Pathology (AMP) guidelines is challenging and computational evidence can provide valuable input about their functional annotation. While many in silico predictors have already been developed, their performance varies for different genes and diseases. In this study, we evaluate 31 in silico predictors using a dataset of 1627 variants in
,
and
. By varying the decision threshold for each tool, we analyse their performance (a) as binary classifiers of pathogenicity and (b) by using different non-overlapping pathogenic and benign thresholds for their optimal use in the ACMG/AMP framework. Our results show that CADD, Eigen-PC, and REVEL are the overall top performers, with the former reaching moderate strength level for pathogenic prediction. Eigen-PC and REVEL achieve the highest accuracies for missense variants, while CADD is also a reliable predictor of non-missense variants. Moreover, SpliceAI is the top performing splicing predictor, reaching strong level of evidence, while GERP++ and phyloP are the most accurate conservation tools. This study provides evidence about the optimal use of computational tools in globin gene clusters under the ACMG/AMP framework.
Hemoglobinopathies are among the most common autosomal-recessive disorders worldwide. A comprehensive next-generation sequencing (NGS) test would greatly facilitate screening and diagnosis of these ...disorders. An NGS panel targeting the coding regions of hemoglobin genes and four modifier genes was designed. We validated the assay by using 2522 subjects affected with hemoglobinopathies and applied it to carrier testing in a cohort of 10,111 couples who were also screened through traditional methods. In the clinical genotyping analysis of 1182 β-thalassemia subjects, we identified a group of additional variants that can be used for accurate diagnosis. In the molecular screening analysis of the 10,111 couples, we detected 4180 individuals in total who carried 4840 mutant alleles, and identified 186 couples at risk of having affected offspring. 12.1% of the pathogenic or likely pathogenic variants identified by our NGS assay, which were undetectable by traditional methods. Compared with the traditional methods, our assay identified an additional at-risk 35 couples. We describe a comprehensive NGS-based test that offers advantages over the traditional screening/molecular testing methods. To our knowledge, this is among the first large-scale population study to systematically evaluate the application of an NGS technique in carrier screening and molecular diagnosis of hemoglobinopathies.
•In 10,111 couples in prevention programs for thalassemia, 186 at-risk couples were identified by NGS and 151 by traditional routine method.•NGS assay had better performance than traditional routine screening approach in carrier screening for hemoglobinopathies.•NGS assay can characterize patients with Thalassemia Major or Thalassemia Intermedia more accurately.
Hemoglobinopathies represent a major cause of birth defects in China. Traditional carrier screening strategy is phenotype-based that is associated with risk of miss phenotype-negative carriers. In this study, we validated an effective method for molecular screening and clinical genotyping of hemoglobinopathies using NGS assay. Considering the broad mutation spectrum and high prevalence of hemoglobinopathy-causing mutations across ethnic groups, this technology could make it technically feasible to offer more effective preconception screening and diagnosis in large populations worldwide. Therefore, the current phenotype-based screening guideline might be modified to a better genotype-based screening guideline.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NLZOH, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UILJ, UL, UM, UPCLJ, UPUK, ZAGLJ, ZRSKP
Multidrug-resistant tuberculosis (MDR-TB) has emerged as a major clinical public health threat and challenges the national TB control program in Malaysia. Data that elaborates on the risk factors ...associated with the development of MDR-TB is highly limited in this country. This study was aimed to determine the risk factors associated with the development of MDR-TB patients in peninsular Malaysia.
This was a case control study; the data were collected from medical records of all the registered MDR-TB patients at five referral TB hospitals in peninsular Malaysia from January 2010 to April 2014. The 105 cases were all confirmed by a positive sputum culture of Mycobacterium tuberculosis for MDR-TB and extensively drug-resistant (XDR)-TB. As a comparison, a total of 209 non-MDR-TB cases were randomly selected as controls.
A total of 105 MDR-TB and 209 non MDR-TB patients were studied. The risk factors associated with MDR-TB within the multivariate analysis were previous tuberculosis treatment, HIV infection, being an immigrant, and high load of positive for acid-fast bacillus (AFB) smear.
The findings of this study revealed that patients who had received previous treatment for tuberculosis, were infected with HIV, were immigrants, and had a high burden of positive testing for AFB smear were more likely to have MDR-TB. An enhanced understanding of the risk factors associated with MDR-TB strains is imperative in the development of a national policy for public health interventions.
Full text
Available for:
IZUM, KILJ, NUK, PILJ, PNG, SAZU, UL, UM, UPUK
HbE/Beta thalassemia (HbE/β-thalassemia) is one of the common genetic disorders in South East Asia. It is heterogeneous in its clinical presentation and molecular defects. There are genetic modifiers ...which have been reported to influence the disease severity of this disorder. The aim of this study was to determine the genetic polymorphisms which were responsible for the disease clinical diversity. A case-control study was conducted among Malay transfusion-dependent HbE/β-thalassemia patients. Patients who were confirmed HbE/β-thalassemia were recruited and genotyping study was performed on these subjects. Ninety-eight patients were selected and divided into moderate and severe groups based on clinical parameters using Sripichai scoring system (based on hemoglobin level, spleen size, growth development, the age of first transfusion and age of disease presentation). Forty-three (44.9%) and 55 (56.1%) patients were found to have moderate and severe clinical presentation, respectively. Genotyping analysis was performed using Affymetrix 6.0 microarray platform. The SNPs were filtered using PLINK and Manhattan plot by R software. From the GWAS results, 20 most significant SNPs were selected based on disease severity when compared between moderate and severe groups. The significant SNPs found in this study were mostly related to thalassemia complications such as rs7372408, associated with
KCNMB2-AS1
and SNPs associated with disease severity. These findings could be used as genetic predictors in managing patients with HbE/β-thalassemia and served as platform for future study.
Full text
Available for:
EMUNI, FIS, FZAB, GEOZS, GIS, IJS, IMTLJ, KILJ, KISLJ, MFDPS, NLZOH, NUK, OBVAL, OILJ, PNG, SAZU, SBCE, SBJE, SBMB, SBNM, UKNU, UL, UM, UPUK, VKSCE, ZAGLJ
Ancestry-informative markers (AIMs) can be used to infer the ancestry of an individual to minimize the inaccuracy of self-reported ethnicity in biomedical research. In this study, we describe three ...methods for selecting AIM SNPs for the Malay population (Malay AIM panel) using different approaches based on pairwise F
ST
, informativeness for assignment (
I
n
), and PCA-correlated SNPs (PCAIMs). These Malay AIM panels were extracted from genotype data stored in SNP arrays hosted by the Malaysian node of the Human Variome Project (MyHVP) and the Singapore Genome Variation Project (SGVP). In particular, genotype data from a total of 165 Malay individuals were analyzed, comprising data on 117 individual genotypes from the Affymetrix SNP-6 SNP array platform and data on 48 individual genotypes from the OMNI 2.5 Illumina SNP array platform. The HapMap phase 3 database (1397 individuals from 11 populations) was used as a reference for comparison with the Malay genotype data. The accuracy of each resulting Malay AIM panel was evaluated using a machine learning “ancestry-predictive model” constructed by using WEKA, a comprehensive machine learning platform written in Java. A total of 1250 SNPs were finally selected, which successfully identified Malay individuals from other world populations with an accuracy of 90%, but the accuracy decreased to 80% using 157 SNPs according to the pairwise F
ST
method, while a panel of 200 SNPs selected using
I
n
and PCAIMs could be used to identify Malay individuals with an accuracy of approximately 80%.
Patterns of modern human population structure are helpful in understanding the history of human migration and admixture. We conducted a study on genetic structure of the Malay population in Malaysia, ...using 54,794 genome-wide single nucleotide polymorphism genotype data generated in four Malay sub-ethnic groups in peninsular Malaysia (Melayu Kelantan, Melayu Minang, Melayu Jawa and Melayu Bugis). To the best of our knowledge this is the first study conducted on these four Malay sub-ethnic groups and the analysis of genotype data of these four groups were compiled together with 11 other populations' genotype data from Indonesia, China, India, Africa and indigenous populations in Peninsular Malaysia obtained from the Pan-Asian SNP database. The phylogeny of populations showed that all of the four Malay sub-ethnic groups are separated into at least three different clusters. The Melayu Jawa, Melayu Bugis and Melayu Minang have a very close genetic relationship with Indonesian populations indicating a common ancestral history, while the Melayu Kelantan formed a distinct group on the tree indicating that they are genetically different from the other Malay sub-ethnic groups. We have detected genetic structuring among the Malay populations and this could possibly be accounted for by their different historical origins. Our results provide information of the genetic differentiation between these populations and a valuable insight into the origins of the Malay sub-ethnic groups in Peninsular Malaysia.
Full text
Available for:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Management of nasopharyngeal carcinoma (NPC) remains elusive despite new developments and advancement that has been made in the current management approaches. A patient's survival and prognosis ...remain dismal especially for a late-stage disease. This is highly attribute to the chemoradiation resistance. Arrays of genes and molecular mechanisms underlie the development of chemoradiation resistance in NPC. Imperatively, unravelling the true pathogenesis of chemoradiation resistance is crucial as these significant proteins and genes can be modulated to produce an effective therapeutic target. It is pivotal to identify the chemoradiation resistance at the very beginning in order to combat the chemoradiation resistance efficiently. Intense research in the genetic ecosphere is critical, as the discovery and development of novel therapeutic targets can be used for screening, diagnosis, and treating the chemoradiation resistance aggressively. This will escalate the management trajectory of NPC patients. This article highlights the significance of genetic and molecular factors that play critical roles in the chemoradiation resistance and how these factors may be modified for next-generation targeted therapy products.
•A total of eight Malay sub-ethnic groups were used to study the genetic structure of the Malay population and to construct a panel of AIM SNPs for Malay population.•This study utilized PCA, ipPCA ...and ADMIXTURE algorithms to explore the genetic structure pattern of the Malay population.•The AIMs panel for Malays in this study was constructed using In, and KNN algorithm to teach the classification models.
Malay, the main ethnic group in Peninsular Malaysia, is represented by various sub-ethnic groups such as Melayu Banjar, Melayu Bugis, Melayu Champa, Melayu Java, Melayu Kedah Melayu Kelantan, Melayu Minang and Melayu Patani. Using data retrieved from the MyHVP (Malaysian Human Variome Project) database, a total of 135 individuals from these sub-ethnic groups were profiled using the Affymetrix GeneChip Mapping Xba 50-K single nucleotide polymorphism (SNP) array to identify SNPs that were ancestry-informative markers (AIMs) for Malays of Peninsular Malaysia. Prior to selecting the AIMs, the genetic structure of Malays was explored with reference to 11 other populations obtained from the Pan-Asian SNP Consortium database using principal component analysis (PCA) and ADMIXTURE. Iterative pruning principal component analysis (ipPCA) was further used to identify sub-groups of Malays. Subsequently, we constructed an AIMs panel for Malays using the informativeness for assignment (In) of genetic markers, and the K-nearest neighbor classifier (KNN) was used to teach the classification models. A model of 250 SNPs ranked by In, correctly classified Malay individuals with an accuracy of up to 90%. The identified panel of SNPs could be utilized as a panel of AIMs to ascertain the specific ancestry of Malays, which may be useful in disease association studies, biomedical research or forensic investigation purposes.
Full text
Available for:
GEOZS, IJS, IMTLJ, KILJ, KISLJ, NUK, OILJ, PNG, SAZU, SBCE, SBJE, UL, UM, UPCLJ, UPUK, ZRSKP