To assess the accuracy of machine learning models in predicting kidney stone composition using variables extracted from the electronic health record (EHR).
We identified kidney stone patients (
= ...1296) with both stone composition and 24-hour (24H) urine testing. We trained machine learning models (XGBoost XG and logistic regression LR) to predict stone composition using 24H urine data and EHR-derived demographic and comorbidity data. Models predicted either binary (calcium
noncalcium stone) or multiclass (calcium oxalate, uric acid, hydroxyapatite, or other) stone types. We evaluated performance using area under the receiver operating curve (ROC-AUC) and accuracy and identified predictors for each task.
For discriminating binary stone composition, XG outperformed LR with higher accuracy (91%
71%) with ROC-AUC of 0.80 for both models. Top predictors used by these models were supersaturations of uric acid and calcium phosphate, and urinary ammonium. For multiclass classification, LR outperformed XG with higher accuracy (0.64
0.56) and ROC-AUC (0.79
0.59), and urine pH had the highest predictive utility. Overall, 24H urine analyte data contributed more to the models' predictions of stone composition than EHR-derived variables.
Machine learning models can predict calcium stone composition. LR outperforms XG in multiclass stone classification. Demographic and comorbidity data are predictive of stone composition; however, including 24H urine data improves performance. Further optimization of performance could lead to earlier directed medical therapy for kidney stone patients.
Sex and sexual differentiation are pervasive across the tree of life. Because females and males often have substantially different functional requirements, we expect selection to differ between the ...sexes. Recent studies in diverse species, including humans, suggest that sexually antagonistic viability selection creates allele frequency differences between the sexes at many different loci. However, theory and population-level simulations indicate that sex-specific differences in viability would need to be very large to produce and maintain reported levels of between-sex allelic differentiation. We address this contradiction between theoretical predictions and empirical observations by evaluating evidence for sexually antagonistic viability selection on autosomal loci in humans using the largest cohort to date (UK Biobank, n = 487,999) along with a second large, independent cohort (BioVU, n = 93,864). We performed association tests between genetically ascertained sex and autosomal loci. Although we found dozens of genome-wide significant associations, none replicated across cohorts. Moreover, closer inspection revealed that all associations are likely due to cross-hybridization with sex chromosome regions during genotyping. We report loci with potential for mis-hybridization found on commonly used genotyping platforms that should be carefully considered in future genetic studies of sex-specific differences. Despite being well powered to detect allele frequency differences of up to 0.8% between the sexes, we do not detect clear evidence for this signature of sexually antagonistic viability selection on autosomal variation. These findings suggest a lack of strong ongoing sexually antagonistic viability selection acting on single locus autosomal variation in humans.
Drip irrigation, combined with wastewater reuse, may offer the most effective and efficient way to cope with water shortage for crops and protect the environment receiving wastewater. Emitter and ...filter clogging are the main problems in the operation of drip systems in developing countries and small communities where treated wastewater is of poor quality. The main results of experimental trials on the behaviour of several kinds of filter and drip emitters using poor quality municipal wastewater are: the performance of the emitters and filters depends on the quality of the wastewater; Total Suspended Solids (TSS) influence the percentage of totally clogged emitters, the mean discharge emitted, the emission uniformity, and the operating time of the filter between cleaning operations; vortex emitters were more sensitive to clogging than labyrinth emitters; no significant difference was observed between the same kind of emitter placed on soil or sub-soil; gravel media and disk filters assured better performance than screen filters. The use of wastewater with a TSS greater than 50
mg
l
−1, did not permit optimal emission uniformity to be achieved.
Placental dysfunction is implicated in many pregnancy complications, including preeclampsia and preterm birth (PTB). While both these syndromes are influenced by environmental risk factors, they also ...have a substantial genetic component that is not well understood. Precisely controlled gene expression during development is crucial to proper placental function and often mediated through gene regulatory enhancers. However, we lack accurate maps of placental enhancer activity due to the challenges of assaying the placenta and the difficulty of comprehensively identifying enhancers. To address the gap in our knowledge of gene regulatory elements in the placenta, we used a two-step machine learning pipeline to synthesize existing functional genomics studies, transcription factor (TF) binding patterns, and evolutionary information to predict placental enhancers. The trained classifiers accurately distinguish enhancers from the genomic background and placental enhancers from enhancers active in other tissues. Genomic features collected from tissues and cell lines involved in pregnancy are the most predictive of placental regulatory activity. Applying the classifiers genome-wide enabled us to create a map of 33,010 predicted placental enhancers, including 4,562 high-confidence enhancer predictions. The genome-wide placental enhancers are significantly enriched nearby genes associated with placental development and birth disorders and for SNPs associated with gestational age. These genome-wide predicted placental enhancers provide candidate regions for further testing in vitro, will assist in guiding future studies of genetic associations with pregnancy phenotypes, and aid interpretation of potential mechanisms of action for variants found through genetic studies.
microRNAs (miRNAs) are essential to the regulation of gene expression in eukaryotes, and improper expression of miRNAs contributes to hundreds of diseases. Despite the essential functions of miRNAs, ...the evolutionary dynamics of how they are integrated into existing gene regulatory and functional networks is not well understood. Knowledge of the origin and evolutionary history a gene has proven informative about its functions and disease associations; we hypothesize that incorporating the evolutionary origins of miRNAs into analyses will help resolve differences in their functional dynamics and how they influence disease.
We computed the phylogenetic age of miRNAs across 146 species and quantified the relationship between human miRNA age and several functional attributes. Older miRNAs are significantly more likely to be associated with disease than younger miRNAs, and the number of associated diseases increases with age. As has been observed for genes, the miRNAs associated with different diseases have different age profiles. For example, human miRNAs implicated in cancer are enriched for origins near the dawn of animal multicellularity. Consistent with the increasing contribution of miRNAs to disease with age, older miRNAs target more genes than younger miRNAs, and older miRNAs are expressed in significantly more tissues. Furthermore, miRNAs of all ages exhibit a strong preference to target older genes; 93% of validated miRNA gene targets were in existence at the origin of the targeting miRNA. Finally, we find that human miRNAs in evolutionarily related families are more similar in their targets and expression profiles than unrelated miRNAs.
Considering the evolutionary origin and history of a miRNA provides useful context for the analysis of its function. Consistent with recent work in Drosophila, our results support a model in which miRNAs increase their expression and functional regulatory interactions over evolutionary time, and thus older miRNAs have increased potential to cause disease. We anticipate that these patterns hold across mammalian species; however, comprehensively evaluating them will require refining miRNA annotations across species and collecting functional data in non-human systems.
The importance of epistasis—or statistical interactions between genetic variants—to the development of complex disease in humans has been controversial. Genome-wide association studies of statistical ...interactions influencing human traits have recently become computationally feasible and have identified many putative interactions. However, statistical models used to detect interactions can be confounded, which makes it difficult to be certain that observed statistical interactions are evidence for true molecular epistasis. In this study, we investigate whether there is evidence for epistatic interactions between genetic variants within the cis-regulatory region that influence gene expression after accounting for technical, statistical, and biological confounding factors. We identified 1,119 (FDR = 5%) interactions that appear to regulate gene expression in human lymphoblastoid cell lines, a tightly controlled, largely genetically determined phenotype. Many of these interactions replicated in an independent dataset (90 of 803 tested, Bonferroni threshold). We then performed an exhaustive analysis of both known and novel confounders, including ceiling/floor effects, missing genotype combinations, haplotype effects, single variants tagged through linkage disequilibrium, and population stratification. Every interaction could be explained by at least one of these confounders, and replication in independent datasets did not protect against some confounders. Assuming that the confounding factors provide a more parsimonious explanation for each interaction, we find it unlikely that cis-regulatory interactions contribute strongly to human gene expression, which calls into question the relevance of cis-regulatory interactions for other human phenotypes. We additionally propose several best practices for epistasis testing to protect future studies from confounding.
Phenome-wide association is a novel reverse genetic strategy to analyze genome-to-phenome relations in human clinical cohorts. Here we test this approach using a large murine population segregating ...for ∼5 million sequence variants, and we compare our results to those extracted from a matched analysis of gene variants in a large human cohort. For the mouse cohort, we amassed a deep and broad open-access phenome consisting of ∼4,500 metabolic, physiological, pharmacological and behavioural traits, and more than 90 independent expression quantitative trait locus (QTL), transcriptome, proteome, metagenome and metabolome data sets--by far the largest coherent phenome for any experimental cohort (www.genenetwork.org). We tested downstream effects of subsets of variants and discovered several novel associations, including a missense mutation in fumarate hydratase that controls variation in the mitochondrial unfolded protein response in both mouse and Caenorhabditis elegans, and missense mutations in Col6a5 that underlies variation in bone mineral density in both mouse and human.
The regulation of replication is essential to preserve genome integrity. Mms1 is part of the E3 ubiquitin ligase complex that is linked to replication fork progression. By identifying Mms1 binding ...sites genome-wide in Saccharomyces cerevisiae we connected Mms1 function to genome integrity and replication fork progression at particular G-rich motifs. This motif can form G-quadruplex (G4) structures in vitro. G4 are stable DNA structures that are known to impede replication fork progression. In the absence of Mms1, genome stability is at risk at these G-rich/G4 regions as demonstrated by gross chromosomal rearrangement assays. Mms1 binds throughout the cell cycle to these G-rich/G4 regions and supports the binding of Pif1 DNA helicase. Based on these data we propose a mechanistic model in which Mms1 binds to specific G-rich/G4 motif located on the lagging strand template for DNA replication and supports Pif1 function, DNA replication and genome integrity.
Replicative DNA helicases expose the two strands of the double helix to the replication apparatus, but accessory helicases are often needed to help forks move past naturally occurring ...hard-to-replicate sites, such as tightly bound proteins, RNA/DNA hybrids, and DNA secondary structures. Although the Schizosaccharomyces pombe 5'-to-3' DNA helicase Pfh1 is known to promote fork progression, its genomic targets, dynamics, and mechanisms of action are largely unknown. Here we address these questions by integrating genome-wide identification of Pfh1 binding sites, comprehensive analysis of the effects of Pfh1 depletion on replication and DNA damage, and proteomic analysis of Pfh1 interaction partners by immunoaffinity purification mass spectrometry. Of the 621 high confidence Pfh1-binding sites in wild type cells, about 40% were sites of fork slowing (as marked by high DNA polymerase occupancy) and/or DNA damage (as marked by high levels of phosphorylated H2A). The replication and integrity of tRNA and 5S rRNA genes, highly transcribed RNA polymerase II genes, and nucleosome depleted regions were particularly Pfh1-dependent. The association of Pfh1 with genomic integrity at highly transcribed genes was S phase dependent, and thus unlikely to be an artifact of high transcription rates. Although Pfh1 affected replication and suppressed DNA damage at discrete sites throughout the genome, Pfh1 and the replicative DNA polymerase bound to similar extents to both Pfh1-dependent and independent sites, suggesting that Pfh1 is proximal to the replication machinery during S phase. Consistent with this interpretation, Pfh1 co-purified with many key replisome components, including the hexameric MCM helicase, replicative DNA polymerases, RPA, and the processivity clamp PCNA in an S phase dependent manner. Thus, we conclude that Pfh1 is an accessory DNA helicase that interacts with the replisome and promotes replication and suppresses DNA damage at hard-to-replicate sites. These data provide insight into mechanisms by which this evolutionarily conserved helicase helps preserve genome integrity.
This letter focuses on the problem of handling impacts by means of an aerial manipulator and proposes a solution that combines the control of the aerial manipulator's end-effector position with an ...innovative design approach of aerial manipulation systems, consisting of both active and passive joints. The approach aims at limiting the influence of impacts on the controlled attitude dynamics in order to allow the aerial manipulator to remain stable during and after impact. The developed concept is intended to convert kinetic energy into potential energy, which is permanently stored into elastic elements by means of directional locking mechanisms. The proposed approach has been tested on a 2 d.o.f. manipulator mounted on a quadrotor UAV. The manipulation system has one active rotational d.o.f. compensating for pitch movements of the UAV and one passive linear joint which is in charge of absorbing the impact energy. The device has been used to validate the method through experiments, in comparison with a rigid manipulator. The results show that the proposed approach and the developed mechanical system achieve stable impact absorption without bouncing away from the interacting environment. Our work has the ambition to propose a new direction towards aerial manipulators that are capable of performing highly dynamic physical interaction tasks.