Geostatistics is concerned with estimation and prediction problems for spatially continuous phenomena, using data obtained at a limited number of spatial locations. The name reflects its origins in ...mineral exploration, but the methods are now used in a wide range of settings including public health and the physical and environmental sciences. Model-based geostatistics refers to the application of general statistical principles of modeling and inference to geostatistical problems. This volume is the first book-length treatment of model-based geostatistics. The authors have written an expository text, emphasizing statistical methods and applications rather than the underlying mathematical theory. Analyses of datasets from a range of scientific contexts feature prominently, and simulations are used to illustrate theoretical results. Readers can reproduce most of the computational results in the book by using the authors' R-based software package, geoR, whose usage is illustrated in a computation section at the end of each chapter. The book assumes a working knowledge of classical and Bayesian methods of inference, linear models, and generalized linear models, but does not require previous exposure to spatial statistical models or methods. The authors have used the material in MSc-level statistics courses.
In this paper we first describe the class of log-Gaussian Cox processes (LGCPs) as models for spatial and spatio-temporal point process data. We discuss inference, with a particular focus on the ...computational challenges of likelihood-based inference. We then demonstrate the usefulness of the LGCP by describing four applications: estimating the intensity surface of a spatial point process; investigating spatial segregation in a multi-type process; constructing spatially continuous maps of disease risk from spatially discrete data; and real-time health surveillance. We argue that problems of this kind fit naturally into the realm of geostatistics, which traditionally is defined as the study of spatially continuous processes using spatially discrete observations at a finite number of locations. We suggest that a more useful definition of geostatistics is by the class of scientific problems that it addresses, rather than by particular models or data formats.
In low-resource settings, prevalence mapping relies on empirical prevalence data from a finite, often spatially sparse, set of surveys of communities within the region of interest, possibly ...supplemented by remotely sensed images that can act as proxies for environmental risk factors. A standard geostatistical model for data of this kind is a generalized linear mixed model with binomial error distribution, logistic link, and a combination of explanatory variables and a Gaussian spatial stochastic process in the linear predictor. In this article, we first review statistical methods and software associated with this standard model, then consider several methodological extensions whose development has been motivated by the requirements of specific applications. These include: methods for combining randomized survey data with data from nonrandomized, and therefore potentially biased, surveys; spatio-temporal extensions; and spatially structured zero-inflation. Throughout, we illustrate the methods with disease mapping applications that have arisen through our involvement with a range of African public health programs.
In this paper we introduce a new R package, PrevMap, for the analysis of spatially referenced prevalence data, including both classical maximum likelihood and Bayesian approaches to parameter ...estimation and plug-in or Bayesian prediction. More specifically, the new package implements fitting of geostatistical models for binomial data, based on two distinct approaches. The first approach uses a generalized linear mixed model with logistic link function, binomial error distribution and a Gaussian spatial process as a stochastic component in the linear predictor. A simpler, but approximate, alternative approach consists of fitting a linear Gaussian model to empirical-logit-transformed data. The package also includes implementations of convolution-based low-rank approximations to the Gaussian spatial process to enable computationally efficient analysis of large spatial datasets. We illustrate the use of the package through the analysis of Loa loa prevalence data from Cameroon and Nigeria. We illustrate the use of the low rank approximation using a simulated geostatistical dataset.
The unprecedented rise in severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infections during December 2021 was concurrent with rapid spread of the Omicron variant in England and globally. ...We analyzed the prevalence of SARS-CoV-2 and its dynamics in England from the end of November to mid-December 2021 among almost 100,000 participants in the REACT-1 study. Prevalence was high with rapid growth nationally and particularly in London during December 2021, with an increasing proportion of infections due to Omicron. We observed large decreases in swab positivity among mostly vaccinated older children (12 to 17 years) relative to unvaccinated younger children (5 to 11 years), and in adults who received a third (booster) vaccine dose versus two doses. Our results reinforce the importance of vaccination and booster campaigns, although additional measures have been needed to control the rapid growth of the Omicron variant.
User-friendly interfaces have been increasingly used to facilitate the learning of advanced statistical methodology, especially for students with only minimal statistical training. In this paper, we ...illustrate the use of MBGapp for teaching geostatistical analysis to population health scientists. Using a case-study on Loa loa infections, we show how MBGapp can be used to teach the different stages of a geostatistical analysis in a more interactive fashion. For wider accessibility and usability, MBGapp is available as an R package and as a Shiny web-application that can be freely accessed on any web browser. In addition to MBGapp, we also present an auxiliary Shiny app, called VariagramApp, that can be used to aid the teaching of Gaussian processes in one and two dimensions using simulations.
Celotno besedilo
Dostopno za:
DOBA, IZUM, KILJ, NUK, PILJ, PNG, SAZU, SIK, UILJ, UKNU, UL, UM, UPUK
Summary
We consider the analysis of continuous repeated measurement outcomes that are collected longitudinally. A standard framework for analysing data of this kind is a linear Gaussian mixed effects ...model within which the outcome variable can be decomposed into fixed effects, time invariant and time‐varying random effects, and measurement noise. We develop methodology that, for the first time, allows any combination of these stochastic components to be non‐Gaussian, using multivariate normal variance–mean mixtures. To meet the computational challenges that are presented by large data sets, i.e. in the current context, data sets with many subjects and/or many repeated measurements per subject, we propose a novel implementation of maximum likelihood estimation using a computationally efficient subsampling‐based stochastic gradient algorithm. We obtain standard error estimates by inverting the observed Fisher information matrix and obtain the predictive distributions for the random effects in both filtering (conditioning on past and current data) and smoothing (conditioning on all data) contexts. To implement these procedures, we introduce an R package: ngme. We reanalyse two data sets, from cystic fibrosis and nephrology research, that were previously analysed by using Gaussian linear mixed effects models.
Population antibody surveillance helps track immune responses to COVID-19 vaccinations at scale, and identify host factors that may affect antibody production. We analyse data from 212,102 vaccinated ...individuals within the REACT-2 programme in England, which uses self-administered lateral flow antibody tests in sequential cross-sectional community samples; 71,923 (33.9%) received at least one dose of BNT162b2 vaccine and 139,067 (65.6%) received ChAdOx1. For both vaccines, antibody positivity peaks 4-5 weeks after first dose and then declines. At least 21 days after second dose of BNT162b2, close to 100% of respondents test positive, while for ChAdOx1, this is significantly reduced, particularly in the oldest age groups (72.7% 70.9-74.4 at ages 75 years and above). For both vaccines, antibody positivity decreases with age, and is higher in females and those with previous infection. Antibody positivity is lower in transplant recipients, obese individuals, smokers and those with specific comorbidities. These groups will benefit from additional vaccine doses.