Improved prediction accuracy for disease risk mapping using Gaussian process stacked generalization.
Author(s)
Type
Journal Article
Abstract
Maps of infectious disease-charting spatial variations in the force of infection, degree of endemicity and the burden on human health-provide an essential evidence base to support planning towards global health targets. Contemporary disease mapping efforts have embraced statistical modelling approaches to properly acknowledge uncertainties in both the available measurements and their spatial interpolation. The most common such approach is Gaussian process regression, a mathematical framework composed of two components: a mean function harnessing the predictive power of multiple independent variables, and a covariance function yielding spatio-temporal shrinkage against residual variation from the mean. Though many techniques have been developed to improve the flexibility and fitting of the covariance function, models for the mean function have typically been restricted to simple linear terms. For infectious diseases, known to be driven by complex interactions between environmental and socio-economic factors, improved modelling of the mean function can greatly boost predictive power. Here, we present an ensemble approach based on stacked generalization that allows for multiple nonlinear algorithmic mean functions to be jointly embedded within the Gaussian process framework. We apply this method to mapping Plasmodium falciparum prevalence data in sub-Saharan Africa and show that the generalized ensemble approach markedly outperforms any individual method.
Date Issued
2017-09
Date Acceptance
2017-08-30
Citation
Interface, 2017, 14 (134)
ISSN
1742-5662
Publisher
Royal Society, The
Journal / Book Title
Interface
Volume
14
Issue
134
Copyright Statement
© 2017 The Author(s)
Published by the Royal Society. All rights reserved.
Published by the Royal Society. All rights reserved.
Sponsor
Medical Research Council (MRC)
Identifier
https://www.ncbi.nlm.nih.gov/pubmed/28931634
Grant Number
MR/K010174/1B
Subjects
Gaussian process
disease mapping
malaria
stacked generalization
Publication Status
Published
Coverage Spatial
England