There is a rich amount of information in co‐occurrence (presence–absence) data that could be used to understand community assembly. This proposition first envisioned by Forbes (1907) and then Diamond ...(1975) prompted the development of numerous modelling approaches (e.g. null model analysis, co‐occurrence networks and, more recently, joint species distribution models). Both theory and experimental evidence support the idea that ecological interactions may affect co‐occurrence, but it remains unclear to what extent the signal of interaction can be captured in observational data. It is now time to step back from the statistical developments and critically assess whether co‐occurrence data are really a proxy for ecological interactions. In this paper, we present a series of arguments based on probability, sampling, food web and coexistence theories supporting that significant spatial associations between species (or lack thereof) is a poor proxy for ecological interactions. We discuss appropriate interpretations of co‐occurrence, along with potential avenues to extract as much information as possible from such data.
We present seven arguments highlighting why co‐occurrence data is not, and should never be considered, evidence ecological interaction. These arguments are based on probability, sampling, food web and coexistence theories. We discuss appropriate interpretations of co‐occurrence, along with potential avenues to extract as much information as possible from co‐occurrence data. In addition, we present ideas to better describe, understand and predict ecological interactions.
Forward selection of explanatory variables Blanchet, F. Guillaume; Legendre, Pierre; Borcard, Daniel
Ecology (Durham),
September 2008, Letnik:
89, Številka:
9
Journal Article
Recenzirano
This paper proposes a new way of using forward selection of explanatory variables in regression or canonical redundancy analysis. The classical forward selection method presents two problems: a ...highly inflated Type I error and an overestimation of the amount of explained variance. Correcting these problems will greatly improve the performance of this very useful method in ecological modeling. To prevent the first problem, we propose a two-step procedure. First, a global test using all explanatory variables is carried out. If, and only if, the global test is significant, one can proceed with forward selection. To prevent overestimation of the explained variance, the forward selection has to be carried out with two stopping criteria: (1) the usual alpha significance level and (2) the adjusted coefficient of multiple determination ($R_{a}^{2}$) calculated using all explanatory variables. When forward selection identifies a variable that brings one or the other criterion over the fixed threshold, that variable is rejected, and the procedure is stopped. This improved method is validated by simulations involving univariate and multivariate response data. An ecological example is presented using data from the Bryce Canyon National Park, Utah, USA.
Community ecology aims to understand what factors determine the assembly and dynamics of species assemblages at different spatiotemporal scales. To facilitate the integration between conceptual and ...statistical approaches in community ecology, we propose Hierarchical Modelling of Species Communities (HMSC) as a general, flexible framework for modern analysis of community data. While non‐manipulative data allow for only correlative and not causal inference, this framework facilitates the formulation of data‐driven hypotheses regarding the processes that structure communities. We model environmental filtering by variation and covariation in the responses of individual species to the characteristics of their environment, with potential contingencies on species traits and phylogenetic relationships. We capture biotic assembly rules by species‐to‐species association matrices, which may be estimated at multiple spatial or temporal scales. We operationalise the HMSC framework as a hierarchical Bayesian joint species distribution model, and implement it as R‐ and Matlab‐packages which enable computationally efficient analyses of large data sets. Armed with this tool, community ecologists can make sense of many types of data, including spatially explicit data and time‐series data. We illustrate the use of this framework through a series of diverse ecological examples.
Technological advances have enabled a new class of multivariate models for ecology, with the potential now to specify a statistical model for abundances jointly across many taxa, to simultaneously ...explore interactions across taxa and the response of abundance to environmental variables. Joint models can be used for several purposes of interest to ecologists, including estimating patterns of residual correlation across taxa, ordination, multivariate inference about environmental effects and environment-by-trait interactions, accounting for missing predictors, and improving predictions in situations where one can leverage knowledge of some species to predict others. We demonstrate this by example and discuss recent computation tools and future directions.
Many ecological questions require the joint analysis of abundances collected simultaneously across many taxonomic groups, and, if organisms are identified using modern tools such as metabarcoding, their number can be in the thousands.
While historically such data have been analyzed using ad hoc algorithms, it is now possible to fully specify joint statistical models for abundance using multivariate extensions of generalized linear mixed models.
These modern ‘joint modeling’ approaches allow the study of correlation patterns across taxa, at the same time as studying environmental response, to tease the two apart.
Latent variable models are an especially exciting tool that has recently been used for ordination as well as for studying the factors driving co-occurrence.
Weather and land use can significantly impact mosquito abundance and presence, and by consequence, mosquito-borne disease (MBD) dynamics. Knowledge of vector ecology and mosquito species response to ...these drivers will help us better predict risk from MBD. In this study, we evaluated and compared the independent and combined effects of weather and land use on mosquito species occurrence and abundance in Eastern Ontario, Canada. Data on occurrence and abundance (245,591 individuals) of 30 mosquito species were obtained from mosquito capture at 85 field sites in 2017 and 2018. Environmental variables were extracted from weather and land use datasets in a 1-km buffer around trapping sites. The relative importance of weather and land use on mosquito abundance (for common species) or occurrence (for all species) was evaluated using multivariate hierarchical statistical models. Models incorporating both weather and land use performed better than models that include weather only for approximately half of species (59% for occurrence model and 50% for abundance model). Mosquito occurrence was mainly associated with temperature whereas abundance was associated with precipitation and temperature combined. Land use was more often associated with abundance than occurrence. For most species, occurrence and abundance were positively associated with forest cover but for some there was a negative association. Occurrence and abundance of some species (47% for occurrence model and 88% for abundance model) were positively associated with wetlands, but negatively associated with urban (Culiseta melanura and Anopheles walkeri) and agriculture (An. quadrimaculatus, Cs. minnesotae and An. walkeri) environments. This study provides predictive relationships between weather, land use and mosquito occurrence and abundance for a wide range of species including those that are currently uncommon, yet known as arboviruses vectors. Elucidation of these relationships has the potential to contribute to better prediction of MBD risk, and thus more efficiently targeted prevention and control measures.
Summary
Leaves interact with a wealth of microorganisms. Among these, fungi are highly diverse and are known to contribute to plant health, leaf senescence and early decomposition. However, patterns ...and drivers of the seasonal dynamics of foliar fungal communities are poorly understood.
We used a multifactorial experiment to investigate the influence of warming and tree genotype on the foliar fungal community on the pedunculate oak Quercus robur across one growing season.
Fungal species richness increased, evenness tended to decrease, and community composition strongly shifted during the growing season. Yeasts increased in relative abundance as the season progressed, while putative fungal pathogens decreased. Warming decreased species richness, reduced evenness and changed community composition, especially at the end of the growing season. Warming also negatively affected putative fungal pathogens. We only detected a minor imprint of tree genotype and warming × genotype interactions on species richness and community composition.
Overall, our findings demonstrate that warming plays a larger role than plant genotype in shaping the seasonal dynamics of the foliar fungal community on oak. These warming‐induced shifts in the foliar fungal community may have a pronounced impact on plant health, plant–fungal interactions and ecosystem functions.
Recently developing hierarchical community models (HCMs) accounting for incomplete sampling are promising approaches to understand community organization. However, pros and cons of incorporating ...incomplete sampling in the analysis and related design issues remain unknown. In this study, we compared HCM and canonical redundancy analysis (RDA) carried out with 10 different dissimilarity coefficients to evaluate how each approach restores true community abundance data sampled with imperfect detection. We conducted simulation experiments with varying numbers of sampling sites, visits, mean detectability and mean abundance. Performance of HCM was measured by estimates of “expected” (mean) abundance (λ̂ij) and realized abundance (N̂ij : direct estimate of site- and species-specific abundance). We also compared HCM and different types of RDA (normal, partial, and weighted), all performed with the same ten different dissimilarity coefficients, with unequal number of visits to sampling sites. In addition, we applied the models to a virtual survey carried out on the Barro Colorado Island tree plot data for which we know true community abundance. Simulation experiments showed that N̂ij yielded by HCM best restored the underlying abundance of constituent species among 12 abundance estimates by HCM and RDA regardless if the sampling was equal or unequal. Mean abundance predominantly affected the performance of HCM and RDA while λ̂ij yielded by HCM had comparable performance to percentage difference and Gower dissimilarity coefficients of RDA. Relative performance of RDA types depended on the combination of dissimilarity coefficients and the distribution of sampling effort. Best performance of N̂ij followed by λ̂ij, percentage difference and Gower dissimilarity were also observed for the analysis of tree plot data, and graphical plots (triplots) based on λ̂ij rather than N̂ij clearly separated the effects of two environmental covariates on the abundance of constituent species. Under our conditions of model evaluation and the method, we concluded that, in terms of assessing the environmental dependence of abundance, HCMs and RDA can have comparable performance if we can choose appropriate dissimilarity coefficients for RDA. However, since HCMs provide straightforward biological interpretations of parameter estimates and flexibility of the analysis, HCMs would be useful in many situations as well as conventional canonical ordinations.
Arbuscular mycorrhizal (AM) fungi form diverse communities and are known to influence above-ground community dynamics and biodiversity. However, the multiscale patterns and drivers of AM fungal ...composition and diversity are still poorly understood.
We sequenced DNA markers from roots and root-associated soil from Plantago lanceolata plants collected across multiple spatial scales to allow comparison of AM fungal communities among neighbouring plants, plant subpopulations, nearby plant populations, and regions. We also measured soil nutrients, temperature, humidity, and community composition of neighbouring plants and nonAM root-associated fungi.
AM fungal communities were already highly dissimilar among neighbouring plants (c. 30 cm apart), albeit with a high variation in the degree of similarity at this small spatial scale. AM fungal communities were increasingly, and more consistently, dissimilar at larger spatial scales. Spatial structure and environmental drivers explained a similar percentage of the variation, from 7% to 25%. A large fraction of the variation remained unexplained, which may be a result of unmeasured environmental variables, species interactions and stochastic processes.
We conclude that AM fungal communities are highly variable among nearby plants. AM fungi may therefore play a major role in maintaining small-scale variation in community dynamics and biodiversity.
A large array of species distribution model (SDM) approaches has been developed for explaining and predicting the occurrences of individual species or species assemblages. Given the wealth of ...existing models, it is unclear which models perform best for interpolation or extrapolation of existing data sets, particularly when one is concerned with species assemblages. We compared the predictive performance of 33 variants of 15 widely applied and recently emerged SDMs in the context of multispecies data, including both joint SDMs that model multiple species together, and stacked SDMs that model each species individually combining the predictions afterward. We offer a comprehensive evaluation of these SDM approaches by examining their performance in predicting withheld empirical validation data of different sizes representing five different taxonomic groups, and for prediction tasks related to both interpolation and extrapolation. We measure predictive performance by 12 measures of accuracy, discrimination power, calibration, and precision of predictions, for the biological levels of species occurrence, species richness, and community composition. Our results show large variation among the models in their predictive performance, especially for communities comprising many species that are rare. The results do not reveal any major trade-offs among measures of model performance; the same models performed generally well in terms of accuracy, discrimination, and calibration, and for the biological levels of individual species, species richness, and community composition. In contrast, the models that gave the most precise predictions were not well calibrated, suggesting that poorly performing models can make overconfident predictions. However, none of the models performed well for all prediction tasks. As a general strategy, we therefore propose that researchers fit a small set of models showing complementary performance, and then apply a cross-validation procedure involving separate data to establish which of these models performs best for the goal of the study.
Summary
Landscape patterns influence a range of ecological processes at multiple spatial scales. Landscape pattern metrics are often used to study the patterns that result from the linear and ...nonlinear interactions between spatial aggregation and abundance of habitat.
However, many class‐level pattern metrics are highly correlated with habitat abundance, making their use as a measure of habitat fragmentation problematic.
We argue that a class‐level pattern metric should be (1) able to differentiate landscapes across a range of spatial aggregations, and (2) independent of habitat abundance, if it is to be used to distinguish between effects of habitat amount and fragmentation.
Based on these criteria and using both simulated and actual landscapes, we evaluated 64 class‐level pattern metrics. These metrics were reclassified into four groups based on their correlation with aggregation and abundance.
Among all these metrics, nine were considered robust for fragmentation measurements, which cover most of the characteristics that define pattern, including core area, shape, proximity / isolation, contrast, and contagion / interspersion.
Optimal metrics for individual studies will depend on both biological rationales and statistically robust metrics that are appropriate for achieving each study objectives.