Food security is an important policy issue in India. As India recently ranked 107th out of 121 countries in the 2022 Global Hunger Index, there is an urgent need to dissect, and gain insights into, ...such a major decline at the national level. However, the existing surveys, due to small sample sizes, cannot be used directly to produce reliable estimates at local administrative levels such as districts.
The latest round of available data from the Household Consumer Expenditure Survey (HCES 2011-12) done by the National Sample Survey Office of India used stratified multi-stage random sampling with districts as strata, villages as first stage and households as second stage units.
Our Small Area Estimation approach estimated food insecurity prevalence, gap, and severity of each rural district of the Eastern Indo-Gangetic Plain (EIGP) region by modeling the HCES data, guided by local covariates from the 2011 Indian Population Census.
In HCES, 5915 (34429), 3310 (17534) and 3566 (15223) households (persons) were surveyed from the 71, 38 and 18 districts of the EIGP states of Uttar Pradesh, Bihar and West Bengal respectively.
We estimated the district-specific food insecurity indicators, and mapped their local disparities over the EIGP region. By comparing food insecurity with indicators of climate vulnerability, poverty and crop diversity, we shortlisted the vulnerable districts in EIGP.
Our district-level estimates and maps can be effective for informed policy-making to build local resiliency and address systemic vulnerabilities where they matter most in the post-pandemic era.
Our study computed, for the Indian states in the EIGP region, the first area-level small area estimates of food insecurity as well as poverty over the past decade, and generated a ranked list of districts upon combining these data with measures of crop diversity and climatic vulnerability.
Cancer cells may overcome growth factor dependence by deregulating oncogenic and/or tumor-suppressor pathways that affect their metabolism, or by activating metabolic pathways de novo with targeted ...mutations in critical metabolic enzymes. It is unknown whether human prostate tumors develop a similar metabolic response to different oncogenic drivers or a particular oncogenic event results in its own metabolic reprogramming. Akt and Myc are arguably the most prevalent driving oncogenes in prostate cancer. Mass spectrometry-based metabolite profiling was performed on immortalized human prostate epithelial cells transformed by AKT1 or MYC, transgenic mice driven by the same oncogenes under the control of a prostate-specific promoter, and human prostate specimens characterized for the expression and activation of these oncoproteins. Integrative analysis of these metabolomic datasets revealed that AKT1 activation was associated with accumulation of aerobic glycolysis metabolites, whereas MYC overexpression was associated with dysregulated lipid metabolism. Selected metabolites that differentially accumulated in the MYC-high versus AKT1-high tumors, or in normal versus tumor prostate tissue by untargeted metabolomics, were validated using absolute quantitation assays. Importantly, the AKT1/MYC status was independent of Gleason grade and pathologic staging. Our findings show how prostate tumors undergo a metabolic reprogramming that reflects their molecular phenotypes, with implications for the development of metabolic diagnostics and targeted therapeutics.
Data fusion involves the integration of multiple related datasets. The statistical file-matching problem is a canonical data fusion problem in multivariate analysis, where the objective is to ...characterise the joint distribution of a set of variables when only strict subsets of marginal distributions have been observed. Estimation of the covariance matrix of the full set of variables is challenging given the missing-data pattern. Factor analysis models use lower-dimensional latent variables in the data-generating process, and this introduces low-rank components in the complete-data matrix and the population covariance matrix. The low-rank structure of the factor analysis model can be exploited to estimate the full covariance matrix from incomplete data via low-rank matrix completion. We prove the identifiability of the factor analysis model in the statistical file-matching problem under conditions on the number of factors and the number of shared variables over the observed marginal subsets. Additionally, we provide an EM algorithm for parameter estimation. On several real datasets, the factor model gives smaller reconstruction errors in file-matching problems than the common approaches for low-rank matrix completion.
Progressive optic neuropathies such as glaucoma are major causes of blindness globally. Multiple sources of subjectivity and analytical challenges are often encountered by clinicians in the process ...of early diagnosis and clinical management of these diseases. In glaucoma, the structural damage is often characterized by neuroretinal rim (NRR) thinning of the optic nerve head, and other clinical parameters. Baseline structural heterogeneity in the eyes can play a key role in the progression of optic neuropathies, and present challenges to clinical decision-making. We generated a dataset of Optical Coherence Tomography (OCT) based high-resolution circular measurements on NRR phenotypes, along with other clinical covariates, of 3973 healthy eyes as part of an established clinical cohort of Asian Indian participants. We introduced CIFU, a new computational pipeline for CIrcular FUnctional data modeling and analysis. We demonstrated CIFU by unsupervised circular functional clustering of the OCT NRR data, followed by meta-clustering to characterize the clusters using clinical covariates, and presented a circular visualization of the results. Upon stratification by age, we identified a healthy NRR phenotype cluster in the age group 40-49 years with predictive potential for glaucoma. Our dataset also addresses the disparity of representation of this particular population in normative OCT databases.
After an epidemic outbreak, the infection persists in a community long enough to engulf the entire susceptible population. Local extinction of the disease could be possible if the susceptible ...population gets depleted. In large communities, the tendency of eventual damp down of recurrent epidemics is balanced by random variability. But, in small communities, the infection would die out when the number of susceptible falls below a certain threshold. Critical community size (CCS) is considered to be the mentioned threshold, at which the infection is as likely as not to die out after a major epidemic for small communities unless reintroduced from outside. The determination of CCS could aid in devising systematic control strategies to eradicate the infectious disease from small communities. In this article, we have come up with a simplified computation based approach to deduce the CCS of HIV disease dynamics. We consider a deterministic HIV model proposed by Silva and Torres, and following Nåsell, introduce stochasticity in the model through time-varying population sizes of different compartments. Besides, Metcalf's group observed that the relative risk of extinction of some infections on islands is almost double that in the mainlands i.e. infections cease to exist at a significantly higher rate in islands compared to the mainlands. They attributed this phenomenon to the greater recolonization in the mainlands. Interestingly, the application of our method on demographic facts and figures of countries in the AIDS belt of Africa led us to expect that existing control measures and isolated locations would assist in temporary eradication of HIV infection much faster. For example, our method suggests that through systematic control strategies, after 7.36 years HIV epidemics will temporarily be eradicated from different communes of island nation Madagascar, where the population size falls below its CCS value, unless the disease is reintroduced from outside.
Predicting drug response in cancer patients remains a major challenge in the clinic. We have perfected an ex vivo, reproducible, rapid and personalized culture method to investigate antitumoral ...pharmacological properties that preserves the original cancer microenvironment. Response to signal transduction inhibitors in cancer is determined not only by properties of the drug target but also by mutations in other signaling molecules and the tumor microenvironment. As a proof of concept, we, therefore, focused on the PI3K/Akt signaling pathway, because it plays a prominent role in cancer and its activity is affected by epithelial-stromal interactions. Our results show that this culture model preserves tissue 3D architecture, cell viability, pathway activity, and global gene-expression profiles up to 5 days ex vivo. In addition, we show pathway modulation in tumor cells resulting from pharmacologic intervention in ex vivo culture. This technology may have a significant impact on patient selection for clinical trials and in predicting response to small-molecule inhibitor therapy.
We generated Optical Coherence Tomography (OCT) data of much higher resolution than usual on retinal nerve fiber layer (RNFL) thickness of a given eye. These consist of measurements made at hundreds ...of angular-points defined on a circular coordinate system. Traditional analysis of OCT RNFL data does not utilize insightful characteristics such as its circularity and granularity for common downstream applications. To address this, we present a new circular statistical framework that defines an Angular Decay function and thereby provides a directionally precise representation of an eye with attention to patterns of focused RNFL loss. By applying to a clinical cohort of Asian Indian eyes, the generated circular data were modeled with a finite mixture of von Mises distributions, which led to an unsupervised identification in different age-groups of recurrent clusters of glaucomatous eyes with distinct directional signatures of RNFL decay. New indices of global and local RNFL loss were computed for comparing the structural differences between these glaucoma clusters across the age-groups and improving classification. Further, we built a catalog of directionally precise statistical distributions of RNFL thickness for the said population of normal eyes as stratified by their age and optic disc size.
Intratumor heterogeneity (ITH) is associated with therapeutic resistance and poor prognosis in cancer patients, and attributed to genetic, epigenetic, and microenvironmental factors. We developed a ...new computational platform, GATHER, for geostatistical modeling of single cell RNA-seq data to synthesize high-resolution and continuous gene expression landscapes of a given tumor sample. Such landscapes allow GATHER to map the enriched regions of pathways of interest in the tumor space and identify genes that have spatial differential expressions at locations representing specific phenotypic contexts using measures based on optimal transport. GATHER provides new applications of spatial entropy measures for quantification and objective characterization of ITH. It includes new tools for insightful visualization of spatial transcriptomic phenomena. We illustrate the capabilities of GATHER using real data from breast cancer tumor to study hallmarks of cancer in the phenotypic contexts defined by cancer associated fibroblasts.
In disease modeling, a key statistical problem is the estimation of lower and upper tail probabilities of health events from given data sets of small size and limited range. Assuming such ...constraints, we describe a computational framework for the systematic fusion of observations from multiple sources to compute tail probabilities that could not be obtained otherwise due to a lack of lower or upper tail data. The estimation of multivariate lower and upper tail probabilities from a given small reference data set that lacks complete information about such tail data is addressed in terms of pertussis case count data. Fusion of data from multiple sources in conjunction with the density ratio model is used to give probability estimates that are non-obtainable from the empirical distribution. Based on a density ratio model with variable tilts, we first present a univariate fit and, subsequently, improve it with a multivariate extension. In the multivariate analysis, we selected the best model in terms of the Akaike Information Criterion (AIC). Regional prediction, in Washington state, of the number of pertussis cases is approached by providing joint probabilities using fused data from several relatively small samples following the selected density ratio model. The model is validated by a graphical goodness-of-fit plot comparing the estimated reference distribution obtained from the fused data with that of the empirical distribution obtained from the reference sample only.