We used long read sequencing data generated from Knightia excelsa, a nectar‐producing Proteaceae tree endemic to Aotearoa (New Zealand), to explore how sequencing data type, volume and workflows can ...impact final assembly accuracy and chromosome reconstruction. Establishing a high‐quality genome for this species has specific cultural importance to Māori and commercial importance to honey producers in Aotearoa. Assemblies were produced by five long read assemblers using data subsampled based on read lengths, two polishing strategies and two Hi‐C mapping methods. Our results from subsampling the data by read length showed that each assembler tested performed differently depending on the coverage and the read length of the data. Subsampling highlighted that input data with longer read lengths but perhaps lower coverage constructed more contiguous, kmers and gene‐complete assemblies than short read length input data with higher coverage. The final genome assembly was constructed into 14 pseudochromosomes using an initial flye long read assembly, a racon/medaka/pilon combined polishing strategy, salsa2 and allhic scaffolding, juicebox curation, and Macadamia linkage map validation. We highlighted the importance of developing assembly workflows based on the volume and read length of sequencing data and established a robust set of quality metrics for generating high‐quality assemblies. Scaffolding analyses highlighted that problems found in the initial assemblies could not be resolved accurately by Hi‐C data and that assembly scaffolding was more successful when the underlying contig assembly was of higher accuracy. These findings provide insight into how quality assessment tools can be implemented throughout genome assembly pipelines to inform the de novo reconstruction of a high‐quality genome assembly for nonmodel organisms.
The Earth BioGenome Project (EBP) is an audacious endeavor to obtain whole-genome sequences of representatives from all eukaryotic species on Earth. In addition to the project's technical and ...organizational challenges, it also faces complicated ethical, legal, and social issues. This paper, from members of the EBP's Ethical, Legal, and Social Issues (ELSI) Committee, catalogs these ELSI concerns arising from EBP. These include legal issues, such as sample collection and permitting; the applicability of international treaties, such as the Convention on Biological Diversity and the Nagoya Protocol; intellectual property; sample accessioning; and biosecurity and ethical issues, such as sampling from the territories of Indigenous peoples and local communities, the protection of endangered species, and cross-border collections, among several others. We also comment on the intersection of digital sequence information and data rights. More broadly, this list of ethical, legal, and social issues for large-scale genomic sequencing projects may be useful in the consideration of ethical frameworks for future projects. While we do not-and cannot-provide simple, overarching solutions for all the issues raised here, we conclude our perspective by beginning to chart a path forward for EBP's work.
Rewarewa (Knightia excelsa, Proteaceae) is a tree species endemic to Aotearoa New Zealand, with a natural distribution spanning Te Ika-a-Māui (North Island) and the top of Te Waipounamu (South ...Island). We used the pseudo-chromosome genome assembly of rewarewa as a reference and whole genome pooled sequencing from 35 populations sampled across Aotearoa New Zealand, including trees growing on Māori-owned land, to identify 1,443,255 single nucleotide polymorphisms (SNPs). Four genetic clusters located in the northern North Island (NNI), eastern North Island (NIE), western and southern North Island (NIWS), and the South Island (SI) were identified. Gene flow was revealed between the SI and NIE genetic clusters, plus bottleneck and contraction events within the genetic clusters since the mid-late Pleistocene, with divergence between North and South Island clusters estimated to have occurred ~115,000–230,000 years ago. Genotype environment analysis (GEA) was used to identify loci and genes linked with altitude, soil pH, soil carbon, slope, soil size, annual mean temperature, mean diurnal range, isothermality, annual precipitation, and precipitation seasonality. The location of the SNPs associated with these environmental variables was compared with the position of 52,192 gene-coding sequences that were predicted in the rewarewa genome using RNA sequencing. This new understanding of the genetic variation present in rewarewa and insights into the genetic control of adaptive traits will inform efforts to incorporate the species in restoration plantings and for marketing rewarewa honey based on provenance.
Despite its incorporation into research studies, the safety aspects of segmental allergen bronchoprovocation and differences in cellular response among different allergens have received limited ...consideration.
We performed 87 segmental challenges in 77 allergic asthma subjects. Allergen dose was based on each subject's response to whole lung allergen challenge. Bronchoalveolar lavage was performed at 0 and 48 hours. Safety indicators included spirometry, oxygen saturation, heart rate, and symptoms.
Among subjects challenged with ragweed, cat dander, or house dust mite, there were no differences in safety indicators. Subjects demonstrated a modest oxygen desaturation and tachycardia during the procedure that returned to normal prior to discharge. We observed a modest reduction in forced vital capacity and forced expiratory volume in one second following bronchoscopy. The most common symptoms following the procedure were cough, sore throat and fatigue. Total bronchoalveolar lavage fluid cell numbers increased from 13±4 to 106±108×10(4) per milliliter and eosinophils increased from 1±2 to 44±20 percent, with no significant differences among the three allergens.
In mild allergic asthma, segmental allergen bronchoprovocation, using individualized doses of aeroallergens, was safe and yielded similar cellular responses.
Gene fusion occurs when two or more individual genes with independent open reading frames becoming juxtaposed under the same open reading frame creating a new fused gene. A small number of gene ...fusions described in detail have been associated with novel functions, for example, the hominid-specific PIPSL gene, TNFSF12, and the TWE-PRIL gene family. We use Sequence Similarity Networks and species level comparisons of great ape genomes to identify 45 new genes that have emerged by transcriptional readthrough, that is, transcription-derived gene fusion. For 35 of these putative gene fusions, we have been able to assess available RNAseq data to determine whether there are reads that map to each breakpoint. A total of 29 of the putative gene fusions had annotated transcripts (9/29 of which are human-specific). We carried out RT-qPCR in a range of human tissues (placenta, lung, liver, brain, and testes) and found that 23 of the putative gene fusion events were expressed in at least one tissue. Examining the available ribosome foot-printing data, we find evidence for translation of three of the fused genes in human. Finally, we find enrichment for transcription-derived gene fusions in regions of known segmental duplication in human. Together, our results implicate chromosomal structural variation brought about by segmental duplication with the emergence of novel transcripts and translated protein products.
Complete genomic and epigenetic maps of human centromeres Altemose, Nicolas; Logsdon, Glennis A; Bzikadze, Andrey V ...
Science (American Association for the Advancement of Science),
04/2022, Volume:
376, Issue:
6588
Journal Article
Peer reviewed
Open access
Existing human genome assemblies have almost entirely excluded repetitive sequences within and near centromeres, limiting our understanding of their organization, evolution, and functions, which ...include facilitating proper chromosome segregation. Now, a complete, telomere-to-telomere human genome assembly (T2T-CHM13) has enabled us to comprehensively characterize pericentromeric and centromeric repeats, which constitute 6.2% of the genome (189.9 megabases). Detailed maps of these regions revealed multimegabase structural rearrangements, including in active centromeric repeat arrays. Analysis of centromere-associated sequences uncovered a strong relationship between the position of the centromere and the evolution of the surrounding DNA through layered repeat expansions. Furthermore, comparisons of chromosome X centromeres across a diverse panel of individuals illuminated high degrees of structural, epigenetic, and sequence variation in these complex and rapidly evolving regions.
Although substantial attention has focused on efforts by the new Administration to block environmental policies, climate politics have been contentious in the US since well before the election of ...Donald Trump. In this paper, we extend previous work on empirical examinations of echo chambers in US climate politics using new data collected on the federal climate policy network in summer 2016. We test for the similarity and differences at two points in time in homophily and echo chambers using Exponential Random Graph Models (ERGM) to compare new findings from 2016 to previous work on data from 2010. We show that echo chambers continue to play a significant role in the network of information exchange among policy elites working on the issue of climate change. In contrast to previous findings where echo chambers centered on a binding international commitment to emission reductions, we find that the pre-existing echo chambers have almost completely disappeared and new structures have formed around one of the main components of the Obama Administration's national climate policy: the Clean Power Plan. These results provide empirical evidence that science communication and policymaking at the elite level shift in relation to the policy instruments under consideration.
Biological aging estimators derived from DNA methylation data are heritable and correlate with morbidity and mortality. Consequently, identification of genetic and environmental contributors to the ...variation in these measures in populations has become a major goal in the field.
Leveraging DNA methylation and SNP data from more than 40,000 individuals, we identify 137 genome-wide significant loci, of which 113 are novel, from genome-wide association study (GWAS) meta-analyses of four epigenetic clocks and epigenetic surrogate markers for granulocyte proportions and plasminogen activator inhibitor 1 levels, respectively. We find evidence for shared genetic loci associated with the Horvath clock and expression of transcripts encoding genes linked to lipid metabolism and immune function. Notably, these loci are independent of those reported to regulate DNA methylation levels at constituent clock CpGs. A polygenic score for GrimAge acceleration showed strong associations with adiposity-related traits, educational attainment, parental longevity, and C-reactive protein levels.
This study illuminates the genetic architecture underlying epigenetic aging and its shared genetic contributions with lifestyle factors and longevity.
Delayed or impaired language development is a common developmental concern, yet there is little agreement about the criteria used to identify and classify language impairments in children. Children's ...language difficulties are at the interface between education, medicine and the allied professions, who may all adopt different approaches to conceptualising them. Our goal in this study was to use an online Delphi technique to see whether it was possible to achieve consensus among professionals on appropriate criteria for identifying children who might benefit from specialist services. We recruited a panel of 59 experts representing ten disciplines (including education, psychology, speech-language therapy/pathology, paediatrics and child psychiatry) from English-speaking countries (Australia, Canada, Ireland, New Zealand, United Kingdom and USA). The starting point for round 1 was a set of 46 statements based on articles and commentaries in a special issue of a journal focusing on this topic. Panel members rated each statement for both relevance and validity on a seven-point scale, and added free text comments. These responses were synthesised by the first two authors, who then removed, combined or modified items with a view to improving consensus. The resulting set of statements was returned to the panel for a second evaluation (round 2). Consensus (percentage reporting 'agree' or 'strongly agree') was at least 80 percent for 24 of 27 round 2 statements, though many respondents qualified their response with written comments. These were again synthesised by the first two authors. The resulting consensus statement is reported here, with additional summary of relevant evidence, and a concluding commentary on residual disagreements and gaps in the evidence base.