Probabilistic prediction of oceanographic velocities with multivariate gaussian natural gradient boosting
Author(s)
O'Malley, Michael
Sykulski, Adam M
Lumpkin, Rick
Schuler, Alejandro
Type
Journal Article
Abstract
Many single-target regression problems require estimates of uncertainty along
with the point predictions. Probabilistic regression algorithms are well-suited
for these tasks. However, the options are much more limited when the prediction
target is multivariate and a joint measure of uncertainty is required. For
example, in predicting a 2D velocity vector a joint uncertainty would quantify
the probability of any vector in the plane, which would be more expressive than
two separate uncertainties on the x- and y- components. To enable joint
probabilistic regression, we propose a Natural Gradient Boosting (NGBoost)
approach based on nonparametrically modeling the conditional parameters of the
multivariate predictive distribution. Our method is robust, works
out-of-the-box without extensive tuning, is modular with respect to the assumed
target distribution, and performs competitively in comparison to existing
approaches. We demonstrate these claims in simulation and with a case study
predicting two-dimensional oceanographic velocity data. An implementation of
our method is available at https://github.com/stanfordmlgroup/ngboost.
with the point predictions. Probabilistic regression algorithms are well-suited
for these tasks. However, the options are much more limited when the prediction
target is multivariate and a joint measure of uncertainty is required. For
example, in predicting a 2D velocity vector a joint uncertainty would quantify
the probability of any vector in the plane, which would be more expressive than
two separate uncertainties on the x- and y- components. To enable joint
probabilistic regression, we propose a Natural Gradient Boosting (NGBoost)
approach based on nonparametrically modeling the conditional parameters of the
multivariate predictive distribution. Our method is robust, works
out-of-the-box without extensive tuning, is modular with respect to the assumed
target distribution, and performs competitively in comparison to existing
approaches. We demonstrate these claims in simulation and with a case study
predicting two-dimensional oceanographic velocity data. An implementation of
our method is available at https://github.com/stanfordmlgroup/ngboost.
Date Issued
2023-05-02
Date Acceptance
2023-03-22
Citation
Environmental Data Science, 2023, 2
ISSN
2634-4602
Publisher
Cambridge University Press
Journal / Book Title
Environmental Data Science
Volume
2
Copyright Statement
© The Author(s), 2023. Published by Cambridge University Press. This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
License URL
Identifier
https://www.cambridge.org/core/journals/environmental-data-science/article/probabilistic-prediction-of-oceanographic-velocities-with-multivariate-gaussian-natural-gradient-boosting/F26F2BD51213758208B0EBAE51D1A973
Subjects
cs.LG
stat.CO
stat.ML
stat.ML
Publication Status
Published
Article Number
ARTN e10
Date Publish Online
2023-05-02