DNA samples derived from vertebrate skin, bodily cavities and body fluids contain both host and microbial DNA; the latter often present as a minor component. Consequently, DNA sequencing of a ...microbiome sample frequently yields reads originating from the microbe(s) of interest, but with a vast excess of host genome-derived reads. In this study, we used a methyl-CpG binding domain (MBD) to separate methylated host DNA from microbial DNA based on differences in CpG methylation density. MBD fused to the Fc region of a human antibody (MBD-Fc) binds strongly to protein A paramagnetic beads, forming an effective one-step enrichment complex that was used to remove human or fish host DNA from bacterial and protistan DNA for subsequent sequencing and analysis. We report enrichment of DNA samples from human saliva, human blood, a mock malaria-infected blood sample and a black molly fish. When reads were mapped to reference genomes, sequence reads aligning to host genomes decreased 50-fold, while bacterial and Plasmodium DNA sequences reads increased 8-11.5-fold. The Shannon-Wiener diversity index was calculated for 149 bacterial species in saliva before and after enrichment. Unenriched saliva had an index of 4.72, while the enriched sample had an index of 4.80. The similarity of these indices demonstrates that bacterial species diversity and relative phylotype abundance remain conserved in enriched samples. Enrichment using the MBD-Fc method holds promise for targeted microbiome sequence analysis across a broad range of sample types.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Massively parallel sequencing technology is revolutionizing approaches to genomic and genetic research. Since its advent, the scale and efficiency of Next-Generation Sequencing (NGS) has rapidly ...improved. In spite of this success, sequencing genomes or genomic regions with extremely biased base composition is still a great challenge to the currently available NGS platforms. The genomes of some important pathogenic organisms like Plasmodium falciparum (high AT content) and Mycobacterium tuberculosis (high GC content) display extremes of base composition. The standard library preparation procedures that employ PCR amplification have been shown to cause uneven read coverage particularly across AT and GC rich regions, leading to problems in genome assembly and variation analyses. Alternative library-preparation approaches that omit PCR amplification require large quantities of starting material and hence are not suitable for small amounts of DNA/RNA such as those from clinical isolates. We have developed and optimized library-preparation procedures suitable for low quantity starting material and tolerant to extremely high AT content sequences.
We have used our optimized conditions in parallel with standard methods to prepare Illumina sequencing libraries from a non-clinical and a clinical isolate (containing ~53% host contamination). By analyzing and comparing the quality of sequence data generated, we show that our optimized conditions that involve a PCR additive (TMAC), produces amplified libraries with improved coverage of extremely AT-rich regions and reduced bias toward GC neutral templates.
We have developed a robust and optimized Next-Generation Sequencing library amplification method suitable for extremely AT-rich genomes. The new amplification conditions significantly reduce bias and retain the complexity of either extremes of base composition. This development will greatly benefit sequencing clinical samples that often require amplification due to low mass of DNA starting material.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Avian malaria parasites are prevalent around the world and infect a wide diversity of bird species. Here, we report the sequencing and analysis of high-quality draft genome sequences for two avian ...malaria species,
and
We identify 50 genes that are specific to avian malaria, located in an otherwise conserved core of the genome that shares gene synteny with all other sequenced malaria genomes. Phylogenetic analysis suggests that the avian malaria species form an outgroup to the mammalian
species, and using amino acid divergence between species, we estimate the avian- and mammalian-infective lineages diverged in the order of 10 million years ago. Consistent with their phylogenetic position, we identify orthologs of genes that had previously appeared to be restricted to the clades of parasites containing
and
, the species with the greatest impact on human health. From these orthologs, we explore differential diversifying selection across the genus and show that the avian lineage is remarkable in the extent to which invasion-related genes are evolving. The subtelomeres of the
and
genomes contain several novel gene families, including an expanded
multigene family. We also identify an expansion of reticulocyte binding protein homologs in
, and within these proteins, we detect distinct regions that are specific to nonhuman primate, humans, rodent, and avian hosts. For the first time in the
lineage, we find evidence of transposable elements, including several hundred fragments of LTR-retrotransposons in both species and an apparently complete LTR-retrotransposon in the genome of
.
Genetic evolution of Rift Valley fever virus (RVFV) in Africa has been shaped mainly by environmental changes such as abnormal rainfall patterns and climate change that has occurred over the last few ...decades. These gradual environmental changes are believed to have effected gene migration from macro (geographical) to micro (reassortment) levels. Presently, 15 lineages of RVFV have been identified to be circulating within the Sub-Saharan Africa. International trade in livestock and movement of mosquitoes are thought to be responsible for the outbreaks occurring outside endemic or enzootic regions. Virus spillover events contribute to outbreaks as was demonstrated by the largest epidemic of 1977 in Egypt. Genomic surveillance of the virus evolution is crucial in developing intervention strategies. Therefore, we have developed a computational tool for rapidly classifying and assigning lineages of the RVFV isolates. The computational method is presented both as a command line tool and a web application hosted at https://www.genomedetective.com/app/typingtool/rvfv/. Validation of the tool has been performed on a large dataset using glycoprotein gene (Gn) and whole genome sequences of the Large (L), Medium (M) and Small (S) segments of the RVFV retrieved from the National Center for Biotechnology Information (NCBI) GenBank database. Using the Gn nucleotide sequences, the RVFV typing tool was able to correctly classify all 234 RVFV sequences at species level with 100% specificity, sensitivity and accuracy. All the sequences in lineages A (n = 10), B (n = 1), C (n = 88), D (n = 1), E (n = 3), F (n = 2), G (n = 2), H (n = 105), I (n = 2), J (n = 1), K (n = 4), L (n = 8), M (n = 1), N (n = 5) and O (n = 1) were also correctly classified at phylogenetic level. Lineage assignment using whole RVFV genome sequences (L, M and S-segments) did not achieve 100% specificity, sensitivity and accuracy for all the sequences analyzed. We further tested our tool using genomic data that we generated by sequencing 5 samples collected following a recent RVF outbreak in Kenya. All the 5 samples were assigned lineage C by both the partial (Gn) and whole genome sequence classifiers. The tool is useful in tracing the origin of outbreaks and supporting surveillance efforts.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
•Antibody discovery in African bovine breeds using a nested PCR strategy.•A methodology for sequencing of bovine ultra-long Ig VH transcripts using low error rate short-read Illumina technology.•Two ...indigenous African bovine breeds demonstrate application in ultra-long VH sequencing.
The advances in high-throughput DNA sequencing and recombinant antibody technologies has presented new methods for characterizing antibody repertoires and significantly increased our understanding on the functional role of antibodies in immunity and their use in diagnostics, vaccine antigen design and as biological therapeutics. A subset of Bos taurus antibodies possesses unique ultra-long third complementary-determining region of the heavy chain (CDRH3) and are of special interest because they are thought to have unique functional abilities of broadly neutralizing properties – a functional role that has not been fully explored in vaccine development. Next generation sequencing technologies that are widely used to profile immunoglobulin (Ig) repertoires are based on short-read methods such as the Illumina technology. Although this technology has worked well in sequencing Ig V-D-J regions of most jawed vertebrates, it has faced serious technical challenges with sequencing regions in bovine Ig bearing ultra-long CDRH3 sequences, which are longer than 120 bp. To overcome this limitation, we have developed a sequencing strategy based on nested PCR products that allows sequence assembly of full-length bovine Ig heavy-chain (IgH) V-D-J regions. We have used this strategy to sequence IgH V-D-J regions of two Bos indicus breeds, Ankole and Boran. We confirm the presence of ultra-long CDRH3 sequences in IgG transcripts in both African cattle breeds, and provide preliminary evidence for differences and preferences in germline VH, DH and JH allele gene usage as well as differences in the length of the VH region in the two bovine breeds. Our method provides tools that should allow more robust analyses of ultra-long CDRH3 sequences aiding antibody and epitope discovery in different cattle breeds and their role in mediating immunity.
We analyzed the whole genomes of cecum microbiomes of Ethiopian indigenous chickens from two distinct geographical zones: Afar (AF) district (Dulecha, 730 m above sea level) and Amhara (AM) district ...(Menz Gera Midir, 3300 m). Through metagenomic analysis we found that microbial populations were mainly dominated by Bacteroidetes and Firmicutes. We identified 2210 common genes in the two groups. LEfSe showed that the distribution of Coprobacter, Geobacter, Cronobacter, Alloprevotella, and Dysgonomonas were more abundant in AF than AM. Analyses using KEGG, eggNOG, and CAZy databases indicated that the pathways of metabolism, genetic information processing, environmental information processing, and cellular process were significantly enriched. Functional abundance was found to be associated with the nutrient absorption and microbial localization of indigenous chickens. We also investigated antibiotic resistant genes and found antibiotics like LSM, cephalosporin, and tetracycline were significantly more abundant in AF than AM.
•Microbial comparison of scavenging chicken's cecum content between high land and low land region of Ethiopia.•Differential functional annotations of identified microbiota from both regions.•Antibiotic resistant genes identification and their differential abundance.
•The Pangolin classification clustered the South Kivu sequences into seven lineages.•The Delta variant is responsible for COVID-19 outbreaks during the third wave.•D614G is the most frequent mutation ...observed, followed by L4715L.•There are multiple introductions of SARS-CoV-2 and subsequent circulating variants.
We used whole-genome sequencing of SARS-CoV-2 to identify variants circulating in the Democratic Republic of the Congo and obtain molecular information useful for diagnosis, improving treatment, and general pandemic control strategies.
A total of 74 SARS-CoV-2 isolates were sequenced using Oxford Nanopore platforms. Generated reads were processed to obtain consensus genome sequences. Sequences with more than 80% genome coverage were used for variant calling, phylogenetic analysis, and classification using Pangolin lineage annotation nomenclature.
Phylogenetic analysis based on Pangolin classification clustered South Kivu sequences into seven lineages (A.23.1, B.1.1.6, B.1.214, B.1.617.2, B.1.351, C.16, and P.1). The Delta (B.1.617.2) variant was the most dominant and responsible for outbreaks during the third wave. Based on the Wuhan reference genome, 289 distinct mutations were detected, including 141 missenses, 123 synonymous, and 25 insertions/deletions when our isolates were mapped to the Wuhan reference strain. Most of these point mutations were located within the coding sequences of the SARS-CoV-2 genome that includes spike, ORF1ab, ORF3, and nucleocapsid protein genes. The most common mutation was D614G (1841A>G) observed in 61 sequences, followed by L4715L (14143 C>T) found in 60 sequences.
Our findings highlight multiple introductions of SARS-CoV-2 into South Kivu through different sources and subsequent circulation of variants in the province. These results emphasize the importance of timely monitoring of genetic variation and its effect on disease severity. This work set a foundation for the use of genomic surveillance as a tool for future global pandemic management and control.
Translating genomic technologies into healthcare applications for the malaria parasite Plasmodium falciparum has been limited by the technical and logistical difficulties of obtaining high quality ...clinical samples from the field. Sampling by dried blood spot (DBS) finger-pricks can be performed safely and efficiently with minimal resource and storage requirements compared with venous blood (VB). Here, the use of selective whole genome amplification (sWGA) to sequence the P. falciparum genome from clinical DBS samples was evaluated, and the results compared with current methods that use leucodepleted VB.
Parasite DNA with high (>95%) human DNA contamination was selectively amplified by Phi29 polymerase using short oligonucleotide probes of 8-12 mers as primers. These primers were selected on the basis of their differential frequency of binding the desired (P. falciparum DNA) and contaminating (human) genomes.
Using sWGA method, clinical samples from 156 malaria patients, including 120 paired samples for head-to-head comparison of DBS and leucodepleted VB were sequenced. Greater than 18-fold enrichment of P. falciparum DNA was achieved from DBS extracts. The parasitaemia threshold to achieve >5× coverage for 50% of the genome was 0.03% (40 parasites per 200 white blood cells). Over 99% SNP concordance between VB and DBS samples was achieved after excluding missing calls.
The sWGA methods described here provide a reliable and scalable way of generating P. falciparum genome sequence data from DBS samples. The current data indicate that it will be possible to get good quality sequence on most if not all drug resistance loci from the majority of symptomatic malaria patients. This technique overcomes a major limiting factor in P. falciparum genome sequencing from field samples, and paves the way for large-scale epidemiological applications.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK