In psychology and the behavioral sciences generally, the use of the hierarchical linear model (HLM) and its extensions for discrete outcomes are popular methods for modeling clustered data. HLM and ...its discrete outcome extensions, however, are certainly not the only methods available to model clustered data. Although other methods exist and are widely implemented in other disciplines, it seems that psychologists have yet to consider these methods in substantive studies. This article compares and contrasts HLM with alternative methods including generalized estimating equations and cluster-robust standard errors. These alternative methods do not model random effects and thus make a smaller number of assumptions and are interpreted identically to single-level methods with the benefit that estimates are adjusted to reflect clustering of observations. Situations where these alternative methods may be advantageous are discussed including research questions where random effects are and are not required, when random effects can change the interpretation of regression coefficients, challenges of modeling with random effects with discrete outcomes, and examples of published psychology articles that use HLM that may have benefitted from using alternative methods. Illustrative examples are provided and discussed to demonstrate the advantages of the alternative methods and also when HLM would be the preferred method.
Display omitted
•We introduce software package RASP, which is a tool for historical biogeography.•We improve the widely used S-DIVA method, the DEC model, the Statistical DEC model and BayArea.•RASP ...also contains tools for combing the results from multiple runs and tools for removing widely distributed outgroups.
We announce the release of Reconstruct Ancestral State in Phylogenies (RASP), a user-friendly software package for inferring historical biogeography through reconstructing ancestral geographic distributions on phylogenetic trees. RASP utilizes the widely used Statistical-Dispersal Vicariance Analysis (S-DIVA), the Dispersal–Extinction–Cladogenesis (DEC) model (Lagrange), a Statistical DEC model (S-DEC) and BayArea. It provides a graphical user interface (GUI) to specify a phylogenetic tree or set of trees and geographic distribution constraints, draws pie charts on the nodes of a phylogenetic tree to indicate levels of uncertainty, and generates high-quality exportable graphical results. RASP can run on both Windows and Mac OS X platforms. All documentation and source code for RASP is freely available at http://mnh.scu.edu.cn/soft/blog/RASP.
Survival analyses are commonly applied to study death or other events of interest. In such analyses, so-called competing risks may form an important problem. A competing risk is an event that either ...hinders the observation of the event of interest or modifies the chance that this event occurs. For example, when studying death on dialysis, receiving a kidney transplant is an event that competes with the event of interest. Conventional methods for survival analysis ignoring the competing event(s), such as the Kaplan-Meier method and standard Cox proportional hazards regression, may be inappropriate in the presence of competing risks, and alternative methods specifically designed for analysing competing risks data should then be applied. This problem deserves more attention in nephrology research and in the current article, we therefore explain the problem of competing risks in survival analysis and how using different techniques may affect study results.
Scale-free networks are rare Broido, Anna D; Clauset, Aaron
Nature communications,
03/2019, Letnik:
10, Številka:
1
Journal Article
Recenzirano
Odprti dostop
Real-world networks are often claimed to be scale free, meaning that the fraction of nodes with degree k follows a power law k
, a pattern with broad implications for the structure and dynamics of ...complex systems. However, the universality of scale-free networks remains controversial. Here, we organize different definitions of scale-free networks and construct a severe test of their empirical prevalence using state-of-the-art statistical tools applied to nearly 1000 social, biological, technological, transportation, and information networks. Across these networks, we find robust evidence that strongly scale-free structure is empirically rare, while for most networks, log-normal distributions fit the data as well or better than power laws. Furthermore, social networks are at best weakly scale free, while a handful of technological and biological networks appear strongly scale free. These findings highlight the structural diversity of real-world networks and the need for new theoretical explanations of these non-scale-free patterns.
We have previously estimated that respiratory syncytial virus (RSV) was associated with 22% of all episodes of (severe) acute lower respiratory infection (ALRI) resulting in 55 000 to 199 000 deaths ...in children younger than 5 years in 2005. In the past 5 years, major research activity on RSV has yielded substantial new data from developing countries. With a considerably expanded dataset from a large international collaboration, we aimed to estimate the global incidence, hospital admission rate, and mortality from RSV-ALRI episodes in young children in 2015.
We estimated the incidence and hospital admission rate of RSV-associated ALRI (RSV-ALRI) in children younger than 5 years stratified by age and World Bank income regions from a systematic review of studies published between Jan 1, 1995, and Dec 31, 2016, and unpublished data from 76 high quality population-based studies. We estimated the RSV-ALRI incidence for 132 developing countries using a risk factor-based model and 2015 population estimates. We estimated the in-hospital RSV-ALRI mortality by combining in-hospital case fatality ratios with hospital admission estimates from hospital-based (published and unpublished) studies. We also estimated overall RSV-ALRI mortality by identifying studies reporting monthly data for ALRI mortality in the community and RSV activity.
We estimated that globally in 2015, 33·1 million (uncertainty range UR 21·6-50·3) episodes of RSV-ALRI, resulted in about 3·2 million (2·7-3·8) hospital admissions, and 59 600 (48 000-74 500) in-hospital deaths in children younger than 5 years. In children younger than 6 months, 1·4 million (UR 1·2-1·7) hospital admissions, and 27 300 (UR 20 700-36 200) in-hospital deaths were due to RSV-ALRI. We also estimated that the overall RSV-ALRI mortality could be as high as 118 200 (UR 94 600-149 400). Incidence and mortality varied substantially from year to year in any given population.
Globally, RSV is a common cause of childhood ALRI and a major cause of hospital admissions in young children, resulting in a substantial burden on health-care services. About 45% of hospital admissions and in-hospital deaths due to RSV-ALRI occur in children younger than 6 months. An effective maternal RSV vaccine or monoclonal antibody could have a substantial effect on disease burden in this age group.
The Bill & Melinda Gates Foundation.
Exploratory graph analysis (EGA) is a new technique that was recently proposed within the framework of network psychometrics to estimate the number of factors underlying multivariate data. Unlike ...other methods, EGA produces a visual guide-network plot-that not only indicates the number of dimensions to retain, but also which items cluster together and their level of association. Although previous studies have found EGA to be superior to traditional methods, they are limited in the conditions considered. These issues are addressed through an extensive simulation study that incorporates a wide range of plausible structures that may be found in practice, including continuous and dichotomous data, and unidimensional and multidimensional structures. Additionally, two new EGA techniques are presented: one that extends EGA to also deal with unidimensional structures, and the other based on the triangulated maximally filtered graph approach (EGAtmfg). Both EGA techniques are compared with 5 widely used factor analytic techniques. Overall, EGA and EGAtmfg are found to perform as well as the most accurate traditional method, parallel analysis, and to produce the best large-sample properties of all the methods evaluated. To facilitate the use and application of EGA, we present a straightforward R tutorial on how to apply and interpret EGA, using scores from a well-known psychological instrument: the Marlowe-Crowne Social Desirability Scale.
Translational Abstract
Understanding the structure and composition of data is an important undertaking for a wide range of scientific domains. An initial step in this endeavor is to determine how the data can be summarized into a smaller set of meaningful variables (i.e., dimensions). In this article, we extend a state-of-the-art network science approach, called exploratory graph analysis (EGA), used to identify the dimensions that exist in multivariate data. Using Monte Carlo methods, we compared EGA with several traditional eigenvalue-based approaches that are commonly used in the psychological literature including parallel analysis. Additionally, the simulation study evaluated the performance of new variants of the EGA method and considered a wider set of realistic conditions, such as unidimensional structures and variables of continuous and categorical levels of measurement. We found that EGA performed as well as or better than the most accurate traditional method (i.e., parallel analysis). Importantly, EGA offers a few advantages over traditional methods: (a) it provides an intuitive visual representation of the results, (b) this representation offers a more complex understanding of the data's structure, and (c) the algorithm is deterministic meaning there are fewer researcher degrees of freedom. In sum, our study demonstrates that EGA can accurately identify the underlying structure of multivariate data, while retaining the complexity of the data's structure. This implies that researchers can meaningfully summarize their data without sacrificing the finer details.
The scale of genetic, epigenomic, transcriptomic, cheminformatic and proteomic data available today, coupled with easy-to-use machine learning (ML) toolkits, has propelled the application of ...supervised learning in genomics research. However, the assumptions behind the statistical models and performance evaluations in ML software frequently are not met in biological systems. In this Review, we illustrate the impact of several common pitfalls encountered when applying supervised ML in genomics. We explore how the structure of genomics data can bias performance evaluations and predictions. To address the challenges associated with applying cutting-edge ML methods to genomics, we describe solutions and appropriate use cases where ML modelling shows great potential.