This report describes the isolation, molecular characterization, and phylogenetic analysis of the first three complete genomes of severe acute respiratory syndrome coronavirus 2 (SARS‐CoV‐2) isolated ...from three patients involved in the first outbreak of COVID‐19 in Lombardy, Italy. Early molecular epidemiological tracing suggests that SARS‐CoV‐2 was present in Italy weeks before the first reported cases of infection.
•Medicago sativa alphapartitivirus genome sequences were redefined.•Medicago sativa alphapartitivirus 1 genome was revisited.•Medicago sativa alphapartitivirus 2 genome described for first ...time.•Alfalfa RNA datasets were analyzed to corroborate the identification of both alphapartitiviruses.•Both medicago sativa alphapartitviruses were detected in several alfalfa cultivars.
In alfalfa samples analyzed by hightroughput sequencing, four de novo assembled contigs encoding gene products showing identities to alphapartitiviruses proteins were found based on BlastX analysis. The predicted amino acid (aa) sequences of two contigs presented 99–100% identity to the RNA-dependent RNA polymerase (RdRp) and the capsid protein (CP) of the recently reported medicago sativa alphapartitivirus 1 (MsAPV1). In addition, the remaining two contigs shared only 56% (CP) and 70% (RdRp) pairwise aa identity with the proteins of MsAPV1, suggesting that these samples presented also a novel Alphapartitivirus species. Further analyses based on complete genome segments termini and the presence/absence of alphapartitivirus RNA in several samples and public alfalfa RNA datasets corroborated the identification of two different alphapartitivirus members. Our results likely indicate that the reported MsAPV1 genome was previously reconstructed with genome segments of two different alphapartitiviruses. Overall, we not only revisited the MsAPV1 genome sequence but also report a new tentative alphapartitivirus species, which we propose the name medicago sativa alphapartitivirus 2. In addition, the RT-PCR detection of both MsAPV1 and MsAPV2 in several alfalfa cultivars suggests a broad distribution of both viruses.
Since 2005, novel duck reoviruses have been outbreaks in duck breeding areas such as central China and South China. In recent years, the incidence rate of this disease is still increasing, bringing ...serious economic losses to waterfowl breeding industry. This study isolated 3 novel duck reoviruses (NDRV-SDLS, NDRV-SDWF, and NDRV-SDYC) from sick ducks in 3 local duck farms in Shandong Province. The study aimed to investigate the characteristics of these viruses. The virus is inoculated into duck embryo fibroblasts, where the virus replicates to produce syncytium and dies within 3 to 5 d. The viruses were also isolated from infected ducks, and RT-PCR amplified the whole genomes after passage purification in duck embryos. The resulting whole genome was analyzed for genetic evolution. The total length of the gene sequencing was 23,418 bp, divided into 10 fragments. Gene sequence comparison showed that the 3 strains had high similarity with novel duck reoviruses (NDRV) but low similarity with chicken-origin reovirus (chicken ARV) and Muscovy duck reovirus (MDRV), especially in the σC segment. Phylogenetic analysis of the 10 fragments showed that the 3 isolates constituted the same evolutionary clade as other DRV reference strains and were far related to ARV and MDRV in different evolutionary clades. The results of all 10 segments indicate that the isolates are in the evolutionary branch of NDRV, suggesting that the novel waterfowl reovirus is the dominant circulating strain in Shandong. This study complements the gene bank information of NDRV and provides references for vaccine research and disease prediction of NDRV in Shandong.
The
species comprises phytopathogenic bacteria that can cause serious damage to cereals and to forage grasses. So far, the genomic resources for
were limited, which hindered further understanding of ...the host-pathogen interactions at the molecular level and the development of disease-resistant cultivars. To this end, we complemented the available complete genome sequence of the
pv.
pathotype strain DSM 18974 by sequencing the genomes of all the other 10
pathotype strains using PacBio long-read technology and assembled complete genome sequences. Phylogeny based on average nucleotide identity (ANI) revealed three distinct clades within the species, which we propose to classify as clades Xt-I, Xt-II, and Xt-III. In addition to 2,181 core
genes, a total of 190, 588, and 168 genes were found to be exclusive to each clade, respectively. Moreover, 29 non-transcription activator-like effector (TALE) and 21 TALE type III effector classes were found, and clade- or strain-specific effectors were identified. Further investigation of these genes could help to identify genes that are critically involved in pathogenicity and/or host adaptation, setting the grounds for the development of new resistant cultivars.
•21 new complete cacao badnavirus sequences reconstructed de novo thanks to the NGS.•The taxonomic status of the molecular groups of cacao badnaviruses elucidated.•Ten species associated with cacao ...swollen shoot disease.
Cacao swollen shoot virus is a member of the family Caulimoviridae, genus Badnavirus and is naturally transmitted to Theobroma cacao (L.) by several mealybug species. CSSV populations in West African countries are highly variable and genetically structured into several different groups based on the diversity in the first part of ORF3 which encodes the movement protein. To unravel the extent of isolate diversity and address the problems of low titer and mixed viral sequences in samples, we used Illumina MiSeq and HiSeq technology. We were able to reconstruct de novo 20 new complete genomes from cacao samples collected in the Cocoa Research Institute of Ghana (CRIG) Museum and from the field samples collected in Côte d’Ivoire or Ghana. Based on the 20% threshold of nucleotide divergence in the reverse transcriptase/ribonuclease H (RT/RNase H) region which denotes species demarcation, we conclude there exist seven new species associated with the cacao swollen shoot disease. These new species along with the three already described leads to ten, the total number of the complex of viral species associated with the disease. A sample from Sri Lanka exhibiting similar leaf symptomology to West African CSSD-affected plants was also included in the study and the corresponding sequence represents the genome of a new virus named cacao bacilliform SriLanka virus (CBSLV).
Bovine herpesvirus-1 (BoHV-1) is a major pathogen affecting cattle worldwide causing primarily respiratory illness referred to as infectious bovine rhinotracheitis (IBR), along with reproductive ...disorders including abortion and infertility in cattle. While modified live vaccines (MLVs) effectively induce immune response against BoHV-1, they are implicated in disease outbreaks in cattle. Current diagnostic methods cannot distinguish between MLVs and field strains of BoHV-1. We performed whole genome sequencing of 18 BoHV-1 isolates from Pennsylvania and Minnesota along with five BoHV-1 vaccine strains using the Illumina Miseq platform. Based on nucleotide polymorphisms (SNPs) the sequences were clustered into three groups with two different vaccine groups and one distinct cluster of field isolates. Using this information, we developed a novel SNP-based PCR assay that can allow differentiation of vaccine and clinical strains and help accurately determine the incidence of BoHV-1 and the association of MLVs with clinical disease in cattle.
Although fossil remains show that anatomically modern humans dispersed out of Africa into the Near East ∼100 to 130 ka, genetic evidence from extant populations has suggested that non-Africans ...descend primarily from a single successful later migration. Within the human mitochondrial DNA (mtDNA) tree, haplogroup L3 encompasses not only many sub-Saharan Africans but also all ancient non-African lineages, and its age therefore provides an upper bound for the dispersal out of Africa. An analysis of 369 complete African L3 sequences places this maximum at ∼70 ka, virtually ruling out a successful exit before 74 ka, the date of the Toba volcanic supereruption in Sumatra. The similarity of the age of L3 to its two non-African daughter haplogroups, M and N, suggests that the same process was likely responsible for both the L3 expansion in Eastern Africa and the dispersal of a small group of modern humans out of Africa to settle the rest of the world. The timing of the expansion of L3 suggests a link to improved climatic conditions after ∼70 ka in Eastern and Central Africa rather than to symbolically mediated behavior, which evidently arose considerably earlier. The L3 mtDNA pool within Africa suggests a migration from Eastern Africa to Central Africa ∼60 to 35 ka and major migrations in the immediate postglacial again linked to climate. The largest population size increase seen in the L3 data is 3-4 ka in Central Africa, corresponding to Bantu expansions, leading diverse L3 lineages to spread into Eastern and Southern Africa in the last 3-2 ka.
The Roma Diaspora-traditionally known as Gypsies-remains among the least explored population migratory events in historical times. It involved the migration of Roma ancestors out-of-India through the ...plateaus of Western Asia ultimately reaching Europe. The demographic effects of the Diaspora-bottlenecks, endogamy, and gene flow-might have left marked molecular traces in the Roma genomes. Here, we analyze the whole-genome sequence of 46 Roma individuals pertaining to four migrant groups in six European countries. Our analyses revealed a strong, early founder effect followed by a drastic reduction of ∼44% in effective population size. The Roma common ancestors split from the Punjabi population, from Northwest India, some generations before the Diaspora started, <2,000 years ago. The initial bottleneck and subsequent endogamy are revealed by the occurrence of extensive runs of homozygosity and identity-by-descent segments in all Roma populations. Furthermore, we provide evidence of gene flow from Armenian and Anatolian groups in present-day Roma, although the primary contribution to Roma gene pool comes from non-Roma Europeans, which accounts for >50% of their genomes. The linguistic and historical differentiation of Roma in migrant groups is confirmed by the differential proportion, but not a differential source, of European admixture in the Roma groups, which shows a westward cline. In the present study, we found that despite the strong admixture Roma had in their diaspora, the signature of the initial bottleneck and the subsequent endogamy is still present in Roma genomes.
Coronavirus disease 2019 (COVID‐19) is a pandemic infectious disease caused by novel severe acute respiratory syndrome coronavirus‐2 (SARS CoV‐2). The SARS CoV‐2 is transmitted more rapidly and ...readily than SARS CoV. Both, SARS CoV and SARS CoV‐2 via their glycosylated spike proteins recognize the human angiotensin converting enzyme‐2 (ACE‐2) receptor. We generated multiple sequence alignments and phylogenetic trees for representative spike proteins of SARS CoV and SARS CoV‐2 from various host sources in order to analyze the specificity in SARS CoV‐2 spike proteins required for causing infection in humans. Our results show that among the genomes analyzed, two sequence regions in the N‐terminal domain “MESEFR” and “SYLTPG” are specific to human SARS CoV‐2. In the receptor‐binding domain, two sequence regions “VGGNY“ and ”EIYQAGSTPCNGV” and a disulfide bridge connecting 480C and 488C in the extended loop are structural determinants for the recognition of human ACE‐2 receptor. The complete genome analysis of representative SARS CoVs from bat, civet, human host sources, and human SARS CoV‐2 identified the bat genome (GenBank code: MN996532.1) as closest to the recent novel human SARS CoV‐2 genomes. The bat SARS CoV genomes (GenBank codes: MG772933 and MG772934) are evolutionary intermediates in the mutagenesis progression toward becoming human SARS CoV‐2.
: Although thousands of clinical isolates of
are being sequenced and analysed by short read technology, the data do not resolve the highly variable subtelomeric regions of the genomes that contain ...polymorphic gene families involved in immune evasion and pathogenesis. There is also no current standard definition of the boundaries of these variable subtelomeric regions.
: Using long-read sequence data (Pacific Biosciences SMRT technology), we assembled and annotated the genomes of 15
isolates, ten of which are newly cultured clinical isolates. We performed comparative analysis of the entire genome with particular emphasis on the subtelomeric regions and the internal
genes clusters.
: The nearly complete sequence of these 15 isolates has enabled us to define a highly conserved core genome, to delineate the boundaries of the subtelomeric regions, and to compare these across isolates. We found highly structured variable regions in the genome. Some exported gene families purportedly involved in release of merozoites show copy number variation. As an example of ongoing genome evolution, we found a novel CLAG gene in six isolates. We also found a novel gene that was relatively enriched in the South East Asian isolates compared to those from Africa.
: These 15 manually curated new reference genome sequences with their nearly complete subtelomeric regions and fully assembled genes are an important new resource for the malaria research community. We report the overall conserved structure and pattern of important gene families and the more clearly defined subtelomeric regions.