Sequencing of target-enriched libraries is an efficient and cost-effective method for obtaining DNA sequence data from hundreds of nuclear loci for phylogeny reconstruction. Much of the cost of ...developing targeted sequencing approaches is associated with the generation of preliminary data needed for the identification of orthologous loci for probe design. In plants, identifying orthologous loci has proven difficult due to a large number of whole-genome duplication events, especially in the angiosperms (flowering plants). We used multiple sequence alignments from over 600 angiosperms for 353 putatively single-copy protein-coding genes identified by the One Thousand Plant Transcriptomes Initiative to design a set of targeted sequencing probes for phylogenetic studies of any angiosperm group. To maximize the phylogenetic potential of the probes, while minimizing the cost of production, we introduce a k-medoids clustering approach to identify the minimum number of sequences necessary to represent each coding sequence in the final probe set. Using this method, 5–15 representative sequences were selected per orthologous locus, representing the sequence diversity of angiosperms more efficiently than if probes were designed using available sequenced genomes alone. To test our approximately 80,000 probes, we hybridized libraries from 42 species spanning all higher-order groups of angiosperms, with a focus on taxa not present in the sequence alignments used to design the probes. Out of a possible 353 coding sequences, we recovered an average of 283 per species and at least 100 in all species. Differences among taxa in sequence recovery could not be explained by relatedness to the representative taxa selected for probe design, suggesting that there is no phylogenetic bias in the probe set. Our probe set, which targeted 260 kbp of coding sequence, achieved a median recovery of 137 kbp per taxon in coding regions, a maximum recovery of 250 kbp, and an additional median of 212 kbp per taxon in flanking non-coding regions across all species. These results suggest that the Angiosperms353 probe set described here is effective for any group of flowering plants and would be useful for phylogenetic studies from the species level to higher-order groups, including the entire angiosperm clade itself.
Mosses are a highly diverse lineage of land plants, whose diversification, spanning at least 400 million years, remains phylogenetically ambiguous due to the lack of fossils, massive early ...extinctions, late radiations, limited morphological variation, and conflicting signal among previously used markers. Here, we present phylogenetic reconstructions based on complete organellar exomes and a comparable set of nuclear genes for this major lineage of land plants. Our analysis of 142 species representing 29 of the 30 moss orders reveals that relative average rates of non-synonymous substitutions in nuclear versus plastid genes are much higher in mosses than in seed plants, consistent with the emerging concept of evolutionary dynamism in mosses. Our results highlight the evolutionary significance of taxa with reduced morphologies, shed light on the relative tempo and mechanisms underlying major cladogenic events, and suggest hypotheses for the relationships and delineation of moss orders.
Premise of the study: Using sequence data generated via target enrichment for phylogenetics requires reassembly of high-throughput sequence reads into loci, presenting a number of bioinformatics ...challenges. We developed HybPiper as a user-friendly platform for assembly of gene regions, extraction of exon and intron sequences, and identification of paralogous gene copies. We test HybPiper using baits designed to target 333 phylogenetic markers and 125 genes of functional significance in Artocarpus (Moraceae). Methods and Results: HybPiper implements parallel execution of sequence assembly in three phases: read mapping, contig assembly, and target sequence extraction. The pipeline was able to recover nearly complete gene sequences for all genes in 22 species of Artocarpus. HybPiper also recovered more than 500 bp of nontargeted intron sequence in over half of the phylogenetic markers and identified paralogous gene copies in Artocarpus. Conclusions: HybPiper was designed for Linux and Mac OS X and is freely available at https://github.com/mossmatters/HybPiper.
Reconstructing phylogenetic relationships at the micro- and macroevoutionary levels within the same tree is problematic because of the need to use different data types and analytical frameworks. We ...test the power of target enrichment to provide phylogenetic resolution based on DNA sequences from above species to within populations, using a large herbarium sampling and Euphorbia balsamifera (Euphorbiaceae) as a case study.
Target enrichment with custom probes was combined with genome skimming (Hyb-Seq) to sequence 431 low-copy nuclear genes and partial plastome DNA. We used supermatrix, multi-species-coalescent approaches, and Bayesian dating to estimate phylogenetic relationships and divergence times.
Euphorbia balsamifera, with a disjunct Rand Flora-type distribution at opposite sides of Africa, comprises three well-supported subspecies: western Sahelian sepium is sister to eastern African-southern Arabian adenensis and Macaronesian-southwest Moroccan balsamifera. Lineage divergence times support Late Miocene to Pleistocene diversification and climate-driven vicariance to explain the Rand Flora pattern.
We show that probes designed using genomic resources from taxa not directly related to the focal group are effective in providing phylogenetic resolution at deep and shallow evolutionary levels. Low capture efficiency in herbarium samples increased the proportion of missing data but did not bias estimation of phylogenetic relationships or branch lengths.
Area-level measures are often used to approximate socioeconomic status (SES) when individual-level data are not available. However, no national studies have examined the validity of these measures in ...approximating individual-level SES.
Data came from ~ 3,471,000 participants in the Mortality Disparities in American Communities study, which links data from 2008 American Community Survey to National Death Index (through 2015). We calculated correlations, specificity, sensitivity, and odds ratios to summarize the concordance between individual-, census tract-, and county-level SES indicators (e.g., household income, college degree, unemployment). We estimated the association between each SES measure and mortality to illustrate the implications of misclassification for estimates of the SES-mortality association.
Participants with high individual-level SES were more likely than other participants to live in high-SES areas. For example, individuals with high household incomes were more likely to live in census tracts (r = 0.232; odds ratio OR = 2.284) or counties (r = 0.157; OR = 1.325) whose median household income was above the US median. Across indicators, mortality was higher among low-SES groups (all p < .0001). Compared to county-level, census tract-level measures more closely approximated individual-level associations with mortality.
Moderate agreement emerged among binary indicators of SES across individual, census tract, and county levels, with increased precision for census tract compared to county measures when approximating individual-level values. When area level measures were used as proxies for individual SES, the SES-mortality associations were systematically underestimated. Studies using area-level SES proxies should use caution when selecting, analyzing, and interpreting associations with health outcomes.
Abstract
The tree of life is the fundamental biological roadmap for navigating the evolution and properties of life on Earth, and yet remains largely unknown. Even angiosperms (flowering plants) are ...fraught with data gaps, despite their critical role in sustaining terrestrial life. Today, high-throughput sequencing promises to significantly deepen our understanding of evolutionary relationships. Here, we describe a comprehensive phylogenomic platform for exploring the angiosperm tree of life, comprising a set of open tools and data based on the 353 nuclear genes targeted by the universal Angiosperms353 sequence capture probes. The primary goals of this article are to (i) document our methods, (ii) describe our first data release, and (iii) present a novel open data portal, the Kew Tree of Life Explorer (https://treeoflife.kew.org). We aim to generate novel target sequence capture data for all genera of flowering plants, exploiting natural history collections such as herbarium specimens, and augment it with mined public data. Our first data release, described here, is the most extensive nuclear phylogenomic data set for angiosperms to date, comprising 3099 samples validated by DNA barcode and phylogenetic tests, representing all 64 orders, 404 families (96$\%$) and 2333 genera (17$\%$). A “first pass” angiosperm tree of life was inferred from the data, which totaled 824,878 sequences, 489,086,049 base pairs, and 532,260 alignment columns, for interactive presentation in the Kew Tree of Life Explorer. This species tree was generated using methods that were rigorous, yet tractable at our scale of operation. Despite limitations pertaining to taxon and gene sampling, gene recovery, models of sequence evolution and paralogy, the tree strongly supports existing taxonomy, while challenging numerous hypothesized relationships among orders and placing many genera for the first time. The validated data set, species tree and all intermediates are openly accessible via the Kew Tree of Life Explorer and will be updated as further data become available. This major milestone toward a complete tree of life for all flowering plant species opens doors to a highly integrated future for angiosperm phylogenomics through the systematic sequencing of standardized nuclear markers. Our approach has the potential to serve as a much-needed bridge between the growing movement to sequence the genomes of all life on Earth and the vast phylogenomic potential of the world’s natural history collections. Angiosperms; Angiosperms353; genomics; herbariomics; museomics; nuclear phylogenomics; open access; target sequence capture; tree of life.
We assess the importance of interpersonal income comparisons using data on suicide deaths. We examine whether suicide risk is related to others' income, holding own income and other individual and ...environmental factors fixed. We estimate models of the suicide hazard using two independent data sets: the National Longitudinal Mortality Study and the National Center for Health Statistics' Multiple Cause of Death Files combined with the 5% Public Use Micro Sample of the 1990 decennial census. Results from both data sources show that, controlling for own income and individual characteristics, individual suicide risk rises with others' income.
Hyb-Seq for Flowering Plant Systematics Dodsworth, Steven; Pokorny, Lisa; Johnson, Matthew G. ...
Trends in plant science,
October 2019, 2019-10-00, 20191001, Letnik:
24, Številka:
10
Journal Article
Recenzirano
Odprti dostop
High-throughput DNA sequencing (HTS) presents great opportunities for plant systematics, yet genomic complexity needs to be reduced for HTS to be effectively applied. We highlight Hyb-Seq as a ...promising approach, especially in light of the recent development of probes enriching 353 low-copy nuclear genes from any flowering plant taxon.
The widespread use of combinational antiretroviral therapies (cART) in developed countries has changed the course of Human Immunodeficiency Virus (HIV) infection from an almost universally fatal ...disease to a chronic infection for the majority of individuals. Although cART has reduced the severity of neurological damage in HIV-infected individuals, the likelihood of cognitive impairment increases with age, and duration of infection. As cART does not suppress the expression of HIV non-structural proteins, it has been proposed that a constitutive production of HIV regulatory proteins in infected brain cells may contribute to neurological damage. However, this assumption has never been experimentally tested. Here we take advantage of the leaky tetracycline promoter system in the Tat-transgenic mouse to show that a chronic very low-level expression of Tat is associated with astrocyte activation, inflammatory cytokine expression, ceramide accumulation, reductions in brain volume, synaptic, and axonal damage that occurs over a time frame of 1 year. These data suggest that a chronic low-level production of Tat may contribute to progressive neurological damage in virally suppressed HIV-infected individuals.
What is the recommended assessment and management of women with polycystic ovary syndrome (PCOS), based on the best available evidence, clinical expertise, and consumer preference?
International ...evidence-based guidelines including 166 recommendations and practice points, addressed prioritized questions to promote consistent, evidence-based care and improve the experience and health outcomes of women with PCOS.
Previous guidelines either lacked rigorous evidence-based processes, did not engage consumer and international multidisciplinary perspectives, or were outdated. Diagnosis of PCOS remains controversial and assessment and management are inconsistent. The needs of women with PCOS are not being adequately met and evidence practice gaps persist.
International evidence-based guideline development engaged professional societies and consumer organizations with multidisciplinary experts and women with PCOS directly involved at all stages. Appraisal of Guidelines for Research and Evaluation (AGREE) II-compliant processes were followed, with extensive evidence synthesis. The Grading of Recommendations, Assessment, Development, and Evaluation (GRADE) framework was applied across evidence quality, feasibility, acceptability, cost, implementation and ultimately recommendation strength.
Governance included a six continent international advisory and a project board, five guideline development groups, and consumer and translation committees. Extensive health professional and consumer engagement informed guideline scope and priorities. Engaged international society-nominated panels included pediatrics, endocrinology, gynecology, primary care, reproductive endocrinology, obstetrics, psychiatry, psychology, dietetics, exercise physiology, public health and other experts, alongside consumers, project management, evidence synthesis, and translation experts. Thirty-seven societies and organizations covering 71 countries engaged in the process. Twenty face-to-face meetings over 15 months addressed 60 prioritized clinical questions involving 40 systematic and 20 narrative reviews. Evidence-based recommendations were developed and approved via consensus voting within the five guideline panels, modified based on international feedback and peer review, with final recommendations approved across all panels.
The evidence in the assessment and management of PCOS is generally of low to moderate quality. The guideline provides 31 evidence based recommendations, 59 clinical consensus recommendations and 76 clinical practice points all related to assessment and management of PCOS. Key changes in this guideline include: i) considerable refinement of individual diagnostic criteria with a focus on improving accuracy of diagnosis; ii) reducing unnecessary testing; iii) increasing focus on education, lifestyle modification, emotional wellbeing and quality of life; and iv) emphasizing evidence based medical therapy and cheaper and safer fertility management.
Overall evidence is generally low to moderate quality, requiring significantly greater research in this neglected, yet common condition, especially around refining specific diagnostic features in PCOS. Regional health system variation is acknowledged and a process for guideline and translation resource adaptation is provided.
The international guideline for the assessment and management of PCOS provides clinicians with clear advice on best practice based on the best available evidence, expert multidisciplinary input and consumer preferences. Research recommendations have been generated and a comprehensive multifaceted dissemination and translation program supports the guideline with an integrated evaluation program.
The guideline was primarily funded by the Australian National Health and Medical Research Council of Australia (NHMRC) supported by a partnership with ESHRE and the American Society for Reproductive Medicine. Guideline development group members did not receive payment. Travel expenses were covered by the sponsoring organizations. Disclosures of conflicts of interest were declared at the outset and updated throughout the guideline process, aligned with NHMRC guideline processes. Full details of conflicts declared across the guideline development groups are available at https://www.monash.edu/medicine/sphpm/mchri/pcos/guideline in the Register of disclosures of interest. Of named authors, Dr Costello has declared shares in Virtus Health and past sponsorship from Merck Serono for conference presentations. Prof. Laven declared grants from Ferring, Euroscreen and personal fees from Ferring, Euroscreen, Danone and Titus Healthcare. Prof. Norman has declared a minor shareholder interest in an IVF unit. The remaining authors have no conflicts of interest to declare. The guideline was peer reviewed by special interest groups across our partner and collaborating societies and consumer organizations, was independently assessed against AGREEII criteria and underwent methodological review. This guideline was approved by all members of the guideline development groups and was submitted for final approval by the NHMRC.