Somatic mosaicism in the human brain may alter function of individual neurons. We analyzed genomes of single cells from the forebrains of three human fetuses (15 to 21 weeks postconception) using ...clonal cell populations. We detected 200 to 400 single-nucleotide variations (SNVs) per cell. SNV patterns resembled those found in cancer cell genomes, indicating a role of background mutagenesis in cancer. SNVs with a frequency of >2% in brain were also present in the spleen, revealing a pregastrulation origin. We reconstructed cell lineages for the first five postzygotic cleavages and calculated a mutation rate of ~1.3 mutations per division per cell. Later in development, during neurogenesis, the mutation spectrum shifted toward oxidative damage, and the mutation rate increased. Both neurogenesis and early embryogenesis exhibit substantially more mutagenesis than adulthood.
SARS-CoV-2 is a member of the Coronavirus family which recently originated from the Wuhan province of China and spread very rapidly through the world infecting more than 4 million people. In the ...past, other Coronaviruses have also been found to cause human infection, but not as widespread as COVID-19. Since Coronavirus sequences constantly change due to mutation and recombination, it is important to understand the pattern of changes and likely path the virus can take in the future. In this study, we have used the Shewhart control chart to identify and analyze hypervariable (hotspots) and hypovariable (coldspots) regions of the virus. Our analysis shows that SARS-CoV-2 has changed in a few regions of the genome. Analysis of SARS-CoV-1 and MERS sequences suggests that over time, mutations start accumulating in different regions and most likely SARS-CoV-2 may also follow a similar path. The results suggest a possible emergence of modified viruses over some time.
Many trematode parasites cause infection in humans and are thought to be a major public health problem. Their ecological diversity in different regions provides challenging questions on evolution of ...these organisms. In this report, we perform transcriptome analysis of the giant intestinal fluke, Fasciolopsis buski, using next generation sequencing technology. Short read sequences derived from polyA containing RNA of this organism were assembled into 30,677 unigenes that led to the annotation of 12,380 genes. Annotation of the assembled transcripts enabled insight into processes and pathways in the intestinal fluke, such as RNAi pathway and energy metabolism. The expressed kinome of the organism was characterized by identifying all protein kinases. A rough draft genome assembly for Fasciolopsis buski is also reported herewith with SRA accessions for crosschecking the findings in the analyzed transcriptome data. Transcriptome data also helped us to identify some of the expressed transposable elements. Though many Long Interspersed elements (LINEs) were identified, only two Short Interspersed Elements (SINEs) were visible. Overall transcriptome and draft genome analysis of F. buski helped us to characterize some of its important biological characteristics and provided enormous resources for development of a suitable diagnostic system and anti-parasitic therapeutic molecules.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
The cereal cyst nematode (CCN, Heterodera avenae) is a major pest of wheat (Triticum spp) that reduces crop yields in many countries. Cyst nematodes are obligate sedentary endoparasites that ...reproduce by amphimixis. Here, we report the first transcriptome analysis of two stages of H. avenae. After sequencing extracted RNA from pre parasitic infective juvenile and adult stages of the life cycle, 131 million Illumina high quality paired end reads were obtained which generated 27,765 contigs with N50 of 1,028 base pairs, of which 10,452 were annotated. Comparative analyses were undertaken to evaluate H. avenae sequences with those of other plant, animal and free living nematodes to identify differences in expressed genes. There were 4,431 transcripts common to H. avenae and the free living nematode Caenorhabditis elegans, and 9,462 in common with more closely related potato cyst nematode, Globodera pallida. Annotation of H. avenae carbohydrate active enzymes (CAZy) revealed fewer glycoside hydrolases (GHs) but more glycosyl transferases (GTs) and carbohydrate esterases (CEs) when compared to M. incognita. 1,280 transcripts were found to have secretory signature, presence of signal peptide and absence of transmembrane. In a comparison of genes expressed in the pre-parasitic juvenile and feeding female stages, expression levels of 30 genes with high RPKM (reads per base per kilo million) value, were analysed by qRT-PCR which confirmed the observed differences in their levels of expression levels. In addition, we have also developed a user-friendly resource, Heterodera transcriptome database (HATdb) for public access of the data generated in this study. The new data provided on the transcriptome of H. avenae adds to the genetic resources available to study plant parasitic nematodes and provides an opportunity to seek new effectors that are specifically involved in the H. avenae-cereal host interaction.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Abstract
Structural variations (SVs) in the human genome originate from different mechanisms related to DNA repair, replication errors, and retrotransposition. Our analyses of 26 927 SVs from the ...1000 Genomes Project revealed differential distributions and consequences of SVs of different origin, e.g. deletions from non-allelic homologous recombination (NAHR) are more prone to disrupt chromatin organization while processed pseudogenes can create accessible chromatin. Spontaneous double stranded breaks (DSBs) are the best predictor of enrichment of NAHR deletions in open chromatin. This evidence, along with strong physical interaction of NAHR breakpoints belonging to the same deletion suggests that majority of NAHR deletions are non-meiotic i.e. originate from errors during homology directed repair (HDR) of spontaneous DSBs. In turn, the origin of the spontaneous DSBs is associated with transcription factor binding in accessible chromatin revealing the vulnerability of functional, open chromatin. The chromatin itself is enriched with repeats, particularly fixed Alu elements that provide the homology required to maintain stability via HDR. Through co-localization of fixed Alus and NAHR deletions in open chromatin we hypothesize that old Alu expansion had a stabilizing role on the human genome.
Insertion sequence (IS) 6110 is found at multiple sites in the Mycobacterium tuberculosis genome and displays a high degree of polymorphism with respect to copy number and insertion sites. Therefore, ...IS6110 is considered to be a useful molecular marker for diagnosis and strain typing of M. tuberculosis. Generally IS6110 elements are identified using experimental methods, useful for analysis of a limited number of isolates. Since short read genome sequences generated using next-generation sequencing (NGS) platforms are available for a large number of isolates, a computational pipeline for identification of IS6110 elements from these datasets was developed. This study shows results from analysis of NGS data of 1377 M. tuberculosis isolates. These isolates represent all seven major global lineages of M. tuberculosis. Lineage specific copy number patterns and preferential insertion regions were observed. Intra-lineage differences were further analyzed for identifying spoligotype specific variations. Copy number distribution and preferential locations of IS6110 in different lineages imply independent evolution of IS6110, governed mainly through ancestral insertion, fitness (gene truncation, promoter activity) and recombinational loss of some copies. A phylogenetic tree based on IS6110 insertion data of different isolates was constructed in order to understand genome level variations of different markers across different lineages.
In a context specific manner, Intra-species genomic variation plays an important role in phenotypic diversity observed among pathogenic microbes. Efficient classification of these pathogens is ...important for diagnosis and treatment of several infectious diseases. NGS technologies have provided access to wealth of data that can be utilized to discover important markers for pathogen classification. In this paper, we described three different approaches (Jensen-Shannon divergence, random forest and Shewhart control chart) for identification of a minimal set of SNPs that can be used for classification of organisms. These methods are generic and can be implemented for analysis of any organism. We have shown usefulness of these approaches for analysis of Mycobacterium tuberculosis and Escherichia coli isolates. We were able to identify a minimal set of 18 SNPs that can be used as molecular markers for phylogroup based classification and 8 SNPs for pathogroup based classification of E. coli.
•Classification of pathogenic microbes based on their phylogroup or pathogroup•Minimal set of SNPs for the classification•Use of random forest, JSD and Shewhart control chart to identify minimal set from WGS data•Minimal set of 18 SNPs for phylogroup based classification of Escherichia coli
Abstract only
Introduction:
Observational analyses have linked circulating inflammatory biomarkers with an increased risk of abdominal aortic aneurysms (AAA). Although observational studies implicate ...inflammatory biomarkers in AAA, whether these are a cause or consequence of the disease is unknown. A limited number of trials have explored agents that reduce inflammation in slowing the progression of AAA. Observational study quality may be limited by residual confounding and reverse causality. We sought to leverage genetic instruments and Mendelian Randomization (MR) to evaluate the human genetic evidence supporting inflammatory biomarkers in the pathogenesis of AAA.
Methods:
We leveraged a meta-analysis of 6 genome-wide association studies (GWAS) of 66 circulating inflammatory markers including up to 59,969 participants of European ancestry. We tested the association for each biomarker with AAA using summary statistics from a meta-analysis of 17 individual GWAS from 14 discovery cohorts in the AAAgen consortium comprising 39,221 individuals with AAA. We employed Wald ratio or inverse-variance weighted MR; weighted median and weighted mode methods were applied as sensitivity analyses.
Results:
After accounting for multiple testing by controlling the false discovery rate (q < 0.05), we identified a total of 6/66 (9.1%) inflammatory biomarkers that had a significant association with AAA: sIL6R, CCL8, CCL7, ILRN, F2, and PIGF. These results were consistent across a range of linkage-disequilibrium thresholds to identify independent genetic instruments. Some of these biomarkers have been previously implicated in the development of AAA but have not been validated in human RCTs of aneurysm development, growth, or rupture.
Conclusions:
This analysis highlights the role of several circulating inflammatory markers in the development of AAA. These findings represent potential therapeutic targets for further investigation in mechanistic trials.
Abstract only Aneurysmal diseases of the aorta are among the most morbid cardiovascular diseases without effective medical therapies. Many genetic and environmental factors have been linked to ...thoracic and abdominal aortic aneurysmal disease; however, there is little data identifying shared and distinct genetic factors underlying aortic aneurysmal disease. In both diseases, degeneration of the medial smooth muscle layer leads to dilation of the vessel and predisposition to and rupture. Whether there are common and distinct genetic factors that function within the vasculature to predispose to aortic aneurysms is unknown. We utilized structural equation modeling to identify a common genetic signature associated with cardiovascular diseases and traits including: abdominal aortic aneurysms (AAA), thoracic aortic aneurysms (TAA), thoracic aortic diameter, peripheral arterial disease (PAD), and coronary artery disease (CAD) as a means to identify novel targets to study. We identified that AAA displays a positive genetic correlation with TAA as well as CAD and PAD. We then constructed a structural equation model that contained four novel factors: 1. Atherosclerotic factor, 2. Aneurysmal factor, 3. AAA-specific factor, 4. Aortic dimension factor. We identified that the atherosclerotic factor explained a large amount of genetic variance in CAD and PAD as well as a modest amount of variance in AAA. GWAS of the ‘atherosclerotic factor’ identified loci implicated in lipid homeostasis and inflammation. The ‘aneurysmal factor’ explained a large amount of genetic variance in TAA and aortic dimension as well as a modest amount in AAA. Notably GWAS for the ‘aneurysmal factor’ identified loci enriched in ECM components and smooth muscle contractile genes. Finally, we identified a factor explained the ‘residual’ genetic heritability underlying AAA. GWAS for this ‘residual factor’ identified a smaller number of hits enriched in matrix metalloprotease biology. Taken together, this study has highlighted the shared and distinct hereditability underlying aortic aneurysmal phenotypes and highlights a novel strategy for prioritizing loci for future study.
Whole genome sequences are ideally suited for deriving evolutionary relationship among organisms. With the availability of Next Generation sequencing (NGS) datasets in an unprecedented scale, it will ...be highly desirable if phylogenetic analysis can be carried out using short read NGS data. We described here an anchor based approach NexABP for phylogenetic construction of closely related strains/isolates from NGS data. This approach can be used even in the absence of a fully assembled reference genome and works by reducing the complexity of the datasets without compromising results. NexABP was used for constructing phylogeny of different strains of some of the common pathogens, such as Mycobacterium tuberculosis, Vibrio cholera and Escherichia coli. In addition to classification into distinct lineages, NexABP could resolve inner branches and also allow statistical testing using bootstrap analysis. We believe that there are some clear advantages of using NexABP based phylogenetic analysis as compared to other methods.