-
Rational kernel-based interpolation for complex-valued frequency response functions
Authors:
Julien Bect,
Niklas Georg,
Ulrich Römer,
Sebastian Schöps
Abstract:
This work is concerned with the kernel-based approximation of a complex-valued function from data, where the frequency response function of a partial differential equation in the frequency domain is of particular interest. In this setting, kernel methods are employed more and more frequently; however, standard kernels do not perform well. Moreover, the role and mathematical implications of the underlying pair of kernels, which arises naturally in the complex-valued case, remain to be addressed. We introduce new reproducing kernel Hilbert spaces of complex-valued functions, and formulate the problem of complex-valued interpolation with a kernel pair as minimum norm interpolation in these spaces. Moreover, we combine the interpolant with a low-order rational function, where the order is adaptively selected based on a new model selection criterion. Numerical results on examples from different fields, including electromagnetics and acoustics, illustrate the performance of the method, also in comparison to available rational approximation methods.
Submitted 1 September, 2023; v1 submitted 25 July, 2023;
originally announced July 2023.
-
Bayesian sequential design of computer experiments for quantile set inversion
Authors:
Romain Ait Abdelmalek-Lomenech,
Julien Bect,
Vincent Chabridon,
Emmanuel Vazquez
Abstract:
We consider an unknown multivariate function representing a system, such as a complex numerical simulator, taking both deterministic and uncertain inputs. Our objective is to estimate the set of deterministic inputs leading to outputs whose probability (with respect to the distribution of the uncertain inputs) of belonging to a given set is less than a given threshold. This problem, which we call Quantile Set Inversion (QSI), occurs for instance in the context of robust (reliability-based) optimization problems, when looking for the set of solutions that satisfy the constraints with sufficiently large probability. To solve the QSI problem we propose a Bayesian strategy, based on Gaussian process modeling and the Stepwise Uncertainty Reduction (SUR) principle, to sequentially choose the points at which the function should be evaluated to efficiently approximate the set of interest. We illustrate the performance and interest of the proposed SUR strategy through several numerical experiments.
Submitted 6 June, 2024; v1 submitted 2 November, 2022;
originally announced November 2022.
-
Bayesian multi-objective optimization for stochastic simulators: an extension of the Pareto Active Learning method
Authors:
Bruno Barracosa,
Julien Bect,
Héloïse Dutrieux Baraffe,
Juliette Morin,
Josselin Fournel,
Emmanuel Vazquez
Abstract:
This article focuses on the multi-objective optimization of stochastic simulators with high output variance, where the input space is finite and the objective functions are expensive to evaluate. We rely on Bayesian optimization algorithms, which use probabilistic models to make predictions about the functions to be optimized. The proposed approach is an extension of the Pareto Active Learning (PAL) algorithm for the estimation of Pareto-optimal solutions that makes it suitable for the stochastic setting. We named it Pareto Active Learning for Stochastic Simulators (PALS). The performance of PALS is assessed through numerical experiments over a set of bi-dimensional, bi-objective test problems. PALS exhibits superior performance when compared to other scalarization-based and random-search approaches.
Submitted 20 July, 2022; v1 submitted 8 July, 2022;
originally announced July 2022.
-
Relaxed Gaussian process interpolation: a goal-oriented approach to Bayesian optimization
Authors:
Sébastien J Petit,
Julien Bect,
Emmanuel Vazquez
Abstract:
This work presents a new procedure for obtaining predictive distributions in the context of Gaussian process (GP) modeling, with a relaxation of the interpolation constraints outside some ranges of interest: the mean of the predictive distributions no longer necessarily interpolates the observed values when they are outside ranges of interest, but is simply constrained to remain outside. This method, called relaxed Gaussian process (reGP) interpolation, provides better predictive distributions in ranges of interest, especially in cases where a stationarity assumption for the GP model is not appropriate. It can be viewed as a goal-oriented method and becomes particularly interesting in Bayesian optimization, for example, for the minimization of an objective function, where good predictive distributions for low function values are important. When the expected improvement criterion and reGP are used for sequentially choosing evaluation points, the convergence of the resulting optimization algorithm is theoretically guaranteed (provided that the function to be optimized lies in the reproducing kernel Hilbert space attached to the known covariance of the underlying Gaussian process). Experiments indicate that using reGP instead of stationary GP models in Bayesian optimization is beneficial.
Submitted 22 July, 2022; v1 submitted 7 June, 2022;
originally announced June 2022.
-
Integration of bounded monotone functions: Revisiting the nonsequential case, with a focus on unbiased Monte Carlo (randomized) methods
Authors:
Subhasish Basak,
Julien Bect,
Emmanuel Vazquez
Abstract:
In this article we revisit the problem of numerical integration for monotone bounded functions, with a focus on the class of nonsequential Monte Carlo methods. We first provide a new lower bound on the maximal $L^p$ error of nonsequential algorithms, improving upon a theorem of Novak when $p > 1$. Then we concentrate on the case $p = 2$ and study the maximal error of two unbiased methods, namely a method based on the control variate technique and the stratified sampling method.
Submitted 21 June, 2022; v1 submitted 1 March, 2022;
originally announced March 2022.
-
Parameter selection in Gaussian process interpolation: an empirical study of selection criteria
Authors:
Sébastien Petit,
Julien Bect,
Paul Feliot,
Emmanuel Vazquez
Abstract:
This article revisits the fundamental problem of parameter selection for Gaussian process interpolation. By choosing the mean and the covariance functions of a Gaussian process within parametric families, the user obtains a family of Bayesian procedures to perform predictions about the unknown function, and must choose a member of the family that will hopefully provide good predictive performances. We base our study on the general concept of scoring rules, which provides an effective framework for building leave-one-out selection and validation criteria, and a notion of extended likelihood criteria based on an idea proposed by Fasshauer and co-authors in 2009, which makes it possible to recover standard selection criteria such as, for instance, the generalized cross-validation criterion. Under this setting, we empirically show on several test problems of the literature that the choice of an appropriate family of models is often more important than the choice of a particular selection criterion (e.g., the likelihood versus a leave-one-out selection criterion). Moreover, our numerical results show that the regularity parameter of a Matérn covariance can be selected effectively by most selection criteria.
Submitted 8 August, 2023; v1 submitted 13 July, 2021;
originally announced July 2021.
-
On the quantification of discretization uncertainty: comparison of two paradigms
Authors:
Julien Bect,
Souleymane Zio,
Guillaume Perrin,
Claire Cannamela,
Emmanuel Vazquez
Abstract:
Numerical models based on partial differential equations (PDE), or integro-differential equations, are ubiquitous in engineering and science, making it possible to understand or design systems for which physical experiments would be expensive, sometimes impossible, to carry out. Such models usually construct an approximate solution of the underlying continuous equations, using discretization methods such as finite differences or the finite element method. The resulting discretization error introduces a form of uncertainty on the exact but unknown value of any quantity of interest (QoI), which affects the predictions of the numerical model alongside other sources of uncertainty such as parametric uncertainty or model inadequacy. The present article deals with the quantification of this discretization uncertainty. A first approach to this problem, now standard in the V&V (Verification and Validation) literature, uses the grid convergence index (GCI) originally proposed by P. Roache in the field of computational fluid dynamics (CFD), which is based on the Richardson extrapolation technique. Another approach, based on Bayesian inference with Gaussian process models, was more recently introduced in the statistical literature. In this work we present and compare these two paradigms for the quantification of discretization uncertainty, which have been developed in different scientific communities, and assess the potential of the younger Bayesian approach to provide a replacement for the well-established GCI-based approach, with better probabilistic foundations. The methods are illustrated and evaluated on two standard test cases from the literature (lid-driven cavity and Timoshenko beam).
Submitted 25 March, 2021;
originally announced March 2021.
-
Numerical issues in maximum likelihood parameter estimation for Gaussian process interpolation
Authors:
Subhasish Basak,
Sébastien Petit,
Julien Bect,
Emmanuel Vazquez
Abstract:
This article investigates the origin of numerical issues in maximum likelihood parameter estimation for Gaussian process (GP) interpolation and investigates simple but effective strategies for improving commonly used open-source software implementations. This work targets a basic problem, but a host of studies, particularly in the literature of Bayesian optimization, rely on off-the-shelf GP implementations. For the conclusions of these studies to be reliable and reproducible, robust GP implementations are critical.
Submitted 27 July, 2021; v1 submitted 24 January, 2021;
originally announced January 2021.
-
Sequential design of multi-fidelity computer experiments: maximizing the rate of stepwise uncertainty reduction
Authors:
Rémi Stroh,
Julien Bect,
Séverine Demeyer,
Nicolas Fischer,
Damien Marquis,
Emmanuel Vazquez
Abstract:
This article deals with the sequential design of experiments for (deterministic or stochastic) multi-fidelity numerical simulators, that is, simulators that offer control over the accuracy of simulation of the physical phenomenon or system under study. Very often, accurate simulations correspond to high computational efforts whereas coarse simulations can be obtained at a smaller cost. In this setting, simulation results obtained at several levels of fidelity can be combined in order to estimate quantities of interest (the optimal value of the output, the probability that the output exceeds a given threshold, etc.) in an efficient manner. To do so, we propose a new Bayesian sequential strategy called Maximal Rate of Stepwise Uncertainty Reduction (MR-SUR), which selects additional simulations to be performed by maximizing the ratio between the expected reduction of uncertainty and the cost of simulation. This generic strategy unifies several existing methods, and provides a principled approach to develop new ones. We assess its performance on several examples, including a computationally intensive problem of fire safety analysis where the quantity of interest is the probability of exceeding a tenability threshold during a building fire.
Submitted 28 May, 2021; v1 submitted 27 July, 2020;
originally announced July 2020.
-
Towards new cross-validation-based estimators for Gaussian process regression: efficient adjoint computation of gradients
Authors:
Sébastien Petit,
Julien Bect,
Sébastien da Veiga,
Paul Feliot,
Emmanuel Vazquez
Abstract:
We consider the problem of estimating the parameters of the covariance function of a Gaussian process by cross-validation. We suggest using new cross-validation criteria derived from the literature of scoring rules. We also provide an efficient method for computing the gradient of a cross-validation criterion. To the best of our knowledge, our method is more efficient than what has been proposed in the literature so far. It makes it possible to lower the complexity of jointly evaluating leave-one-out criteria and their gradients.
Submitted 6 August, 2020; v1 submitted 26 February, 2020;
originally announced February 2020.
-
User preferences in Bayesian multi-objective optimization: the expected weighted hypervolume improvement criterion
Authors:
Paul Feliot,
Julien Bect,
Emmanuel Vazquez
Abstract:
In this article, we present a framework for taking into account user preferences in multi-objective Bayesian optimization in the case where the objectives are expensive-to-evaluate black-box functions. A novel expected improvement criterion to be used within Bayesian optimization algorithms is introduced. This criterion, which we call the expected weighted hypervolume improvement (EWHI) criterion, is a generalization of the popular expected hypervolume improvement to the case where the hypervolume of the dominated region is defined using an absolutely continuous measure instead of the Lebesgue measure. The EWHI criterion takes the form of an integral for which no closed-form expression exists in the general case. To deal with its computation, we propose an importance sampling approximation method. A sampling density that is optimal for the computation of the EWHI for a predefined set of points is crafted and a sequential Monte Carlo (SMC) approach is used to obtain a sample approximately distributed from this density. The ability of the criterion to produce optimization strategies oriented by user preferences is demonstrated on a simple bi-objective test problem in the cases of a preference for one objective and of a preference for certain regions of the Pareto front.
Submitted 14 September, 2018;
originally announced September 2018.
-
Integrating hyper-parameter uncertainties in a multi-fidelity Bayesian model for the estimation of a probability of failure
Authors:
Rémi Stroh,
Julien Bect,
Séverine Demeyer,
Nicolas Fischer,
Emmanuel Vazquez
Abstract:
A multi-fidelity simulator is a numerical model in which one of the inputs controls a trade-off between the realism and the computational cost of the simulation. Our goal is to estimate the probability of exceeding a given threshold on a multi-fidelity stochastic simulator. We propose a fully Bayesian approach based on Gaussian processes to compute the posterior probability distribution of this probability. We pay special attention to the hyper-parameters of the model. Our methodology is illustrated on an academic example.
Submitted 20 September, 2017;
originally announced September 2017.
-
Sequential design of experiments to estimate a probability of exceeding a threshold in a multi-fidelity stochastic simulator
Authors:
Rémi Stroh,
Séverine Demeyer,
Nicolas Fischer,
Julien Bect,
Emmanuel Vazquez
Abstract:
In this article, we consider a stochastic numerical simulator to assess the impact of some factors on a phenomenon. The simulator is seen as a black box with inputs and outputs. The quality of a simulation, hereafter referred to as fidelity, is assumed to be tunable by means of an additional input of the simulator (e.g., a mesh size parameter): high-fidelity simulations provide more accurate results, but are time-consuming. Using a limited computation-time budget, we want to estimate, for any value of the physical inputs, the probability that a certain scalar output of the simulator will exceed a given critical threshold at the highest fidelity level. The problem is addressed in a Bayesian framework, using a Gaussian process model of the multi-fidelity simulator. We consider a Bayesian estimator of the probability, together with an associated measure of uncertainty, and propose a new multi-fidelity sequential design strategy, called Maximum Speed of Uncertainty Reduction (MSUR), to select the value of physical inputs and the fidelity level of new simulations. The MSUR strategy is tested on an example.
Submitted 26 July, 2017;
originally announced July 2017.
-
Adaptive Design of Experiments for Conservative Estimation of Excursion Sets
Authors:
Dario Azzimonti,
David Ginsbourger,
Clément Chevalier,
Julien Bect,
Yann Richet
Abstract:
We consider the problem of estimating the set of all inputs that lead a system to some particular behavior. The system is modeled by an expensive-to-evaluate function, such as a computer experiment, and we are interested in its excursion set, i.e. the set of points where the function takes values above or below some prescribed threshold. The objective function is emulated with a Gaussian Process (GP) model based on an initial design of experiments enriched with evaluation results at (batch-)sequentially determined input points. The GP model provides conservative estimates for the excursion set, which control false positives while minimizing false negatives. We introduce adaptive strategies that sequentially select new evaluations of the function by reducing the uncertainty on conservative estimates. Following the Stepwise Uncertainty Reduction approach we obtain new evaluations by minimizing adapted criteria. Tractable formulae for the conservative criteria are derived, which allow more convenient optimization. The method is benchmarked on random functions generated under the model assumptions in different scenarios of noise and batch size. We then apply it to a reliability engineering test case. Overall, the proposed strategy of minimizing false negatives in conservative estimation achieves competitive performance both in terms of model-based and model-free indicators.
Submitted 4 February, 2020; v1 submitted 22 November, 2016;
originally announced November 2016.
-
Design of a commercial aircraft environment control system using Bayesian optimization techniques
Authors:
Paul Feliot,
Yves Le Guennec,
Julien Bect,
Emmanuel Vazquez
Abstract:
In this paper, we present the application of a recently developed algorithm for Bayesian multi-objective optimization to the design of a commercial aircraft environment control system (ECS). In our model, the ECS is composed of two cross-flow heat exchangers, a centrifugal compressor and a radial turbine, the geometries of which are simultaneously optimized to achieve minimal weight and entropy generation of the system. While both objectives impact the overall performance of the aircraft, they are shown to be antagonistic and a set of trade-off design solutions is identified. The algorithm used for optimizing the system implements a Bayesian approach to the multi-objective optimization problem in the presence of non-linear constraints and the emphasis is on conducting the optimization using a limited number of system simulations. Noteworthy features of this particular application include a non-hypercubic design domain and the presence of hidden constraints due to simulation failures.
Submitted 7 October, 2016;
originally announced October 2016.
-
A supermartingale approach to Gaussian process based sequential design of experiments
Authors:
Julien Bect,
François Bachoc,
David Ginsbourger
Abstract:
Gaussian process (GP) models have become a well-established framework for the adaptive design of costly experiments, and notably of computer experiments. GP-based sequential designs have been found practically efficient for various objectives, such as global optimization (estimating the global maximum or maximizer(s) of a function), reliability analysis (estimating a probability of failure) or the estimation of level sets and excursion sets. In this paper, we study the consistency of an important class of sequential designs, known as stepwise uncertainty reduction (SUR) strategies. Our approach relies on the key observation that the sequence of residual uncertainty measures, in SUR strategies, is generally a supermartingale with respect to the filtration generated by the observations. This observation enables us to establish generic consistency results for a broad class of SUR strategies. The consistency of several popular sequential design strategies is then obtained by means of this general result. Notably, we establish the consistency of two SUR strategies proposed by Bect, Ginsbourger, Li, Picheny and Vazquez (Stat. Comp., 2012); to the best of our knowledge, these are the first proofs of consistency for GP-based sequential design algorithms dedicated to the estimation of excursion sets and their measure. We also establish a new, more general proof of consistency for the expected improvement algorithm for global optimization which, unlike previous results in the literature, applies to any GP with continuous sample paths.
Submitted 30 August, 2018; v1 submitted 3 August, 2016;
originally announced August 2016.
-
Gaussian process modeling for stochastic multi-fidelity simulators, with application to fire safety
Authors:
Rémi Stroh,
Julien Bect,
Séverine Demeyer,
Nicolas Fischer,
Emmanuel Vazquez
Abstract:
To assess the possibility of evacuating a building in case of a fire, a standard method consists in simulating the propagation of the fire using finite difference methods, taking into account the random behavior of the fire, so that the result of a simulation is non-deterministic. The mesh fineness tunes the quality of the numerical model, and its computational cost. Depending on the mesh fineness, one simulation can last anywhere from a few minutes to several weeks. In this article, we focus on predicting the behavior of the fire simulator at fine meshes, using cheaper results at coarser meshes. In the literature of the design and analysis of computer experiments, such a problem is referred to as multi-fidelity prediction. Our contribution is to extend to the case of stochastic simulators the Bayesian multi-fidelity model proposed by Picheny and Ginsbourger (2013) and Tuo et al. (2014).
Submitted 9 May, 2016;
originally announced May 2016.
-
Bayesian subset simulation
Authors:
Julien Bect,
Ling Li,
Emmanuel Vazquez
Abstract:
We consider the problem of estimating a probability of failure $α$, defined as the volume of the excursion set of a function $f:\mathbb{X} \subseteq \mathbb{R}^{d} \to \mathbb{R}$ above a given threshold, under a given probability measure on $\mathbb{X}$. In this article, we combine the popular subset simulation algorithm (Au and Beck, Probab. Eng. Mech. 2001) and our sequential Bayesian approach for the estimation of a probability of failure (Bect, Ginsbourger, Li, Picheny and Vazquez, Stat. Comput. 2012). This makes it possible to estimate $α$ when the number of evaluations of $f$ is very limited and $α$ is very small. The resulting algorithm is called Bayesian subset simulation (BSS). A key idea, as in the subset simulation algorithm, is to estimate the probabilities of a sequence of excursion sets of $f$ above intermediate thresholds, using a sequential Monte Carlo (SMC) approach. A Gaussian process prior on $f$ is used to define the sequence of densities targeted by the SMC algorithm, and drive the selection of evaluation points of $f$ to estimate the intermediate probabilities. Adaptive procedures are proposed to determine the intermediate thresholds and the number of evaluations to be carried out at each stage of the algorithm. Numerical experiments illustrate that BSS achieves significant savings in the number of function evaluations with respect to other Monte Carlo approaches.
Submitted 24 April, 2017; v1 submitted 11 January, 2016;
originally announced January 2016.
-
A Bayesian approach to constrained single- and multi-objective optimization
Authors:
Paul Feliot,
Julien Bect,
Emmanuel Vazquez
Abstract:
This article addresses the problem of derivative-free (single- or multi-objective) optimization subject to multiple inequality constraints. Both the objective and constraint functions are assumed to be smooth, non-linear and expensive to evaluate. As a consequence, the number of evaluations that can be used to carry out the optimization is very limited, as in complex industrial design optimization problems. The method we propose to overcome this difficulty has its roots in both the Bayesian and the multi-objective optimization literatures. More specifically, an extended domination rule is used to handle objectives and constraints in a unified way, and a corresponding expected hyper-volume improvement sampling criterion is proposed. This new criterion is naturally adapted to the search of a feasible point when none is available, and reduces to existing Bayesian sampling criteria (the classical Expected Improvement (EI) criterion and some of its constrained/multi-objective extensions) as soon as at least one feasible point is available. The calculation and optimization of the criterion are performed using Sequential Monte Carlo techniques. In particular, an algorithm similar to the subset simulation method, which is well known in the field of structural reliability, is used to estimate the criterion. The method, which we call BMOO (for Bayesian Multi-Objective Optimization), is compared to state-of-the-art algorithms for single- and multi-objective constrained optimization.
Submitted 9 May, 2016; v1 submitted 2 October, 2015;
originally announced October 2015.
-
The Informational Approach to Global Optimization in presence of very noisy evaluation results. Application to the optimization of renewable energy integration strategies
Authors:
Héloïse Dutrieux,
Ivana Aleksovska,
Julien Bect,
Emmanuel Vazquez,
Delille Gauthier,
Bruno François
Abstract:
We consider the problem of global optimization of a function f from very noisy evaluations. We adopt a Bayesian sequential approach: evaluation points are chosen so as to reduce the uncertainty about the position of the global optimum of f, as measured by the entropy of the corresponding random variable (Informational Approach to Global Optimization, Villemonteix et al., 2009). When evaluations are very noisy, the error coming from the estimation of the entropy using conditional simulations becomes non-negligible compared to its variations on the input domain. We propose a solution to this problem by choosing evaluation points as if several evaluations were going to be made at these points. The method is applied to the optimization of a strategy for the integration of renewable energies into an electrical distribution network.
Submitted 15 June, 2015;
originally announced June 2015.
-
Quantifying uncertainties on excursion sets under a Gaussian random field prior
Authors:
Dario Azzimonti,
Julien Bect,
Clément Chevalier,
David Ginsbourger
Abstract:
We focus on the problem of estimating and quantifying uncertainties on the excursion set of a function under a limited evaluation budget. We adopt a Bayesian approach where the objective function is assumed to be a realization of a Gaussian random field. In this setting, the posterior distribution on the objective function gives rise to a posterior distribution on excursion sets. Several approaches exist to summarize the distribution of such sets based on random closed set theory. While the recently proposed Vorob'ev approach exploits analytical formulae, further notions of variability require Monte Carlo estimators relying on Gaussian random field conditional simulations. In the present work we propose a method to choose Monte Carlo simulation points and obtain quasi-realizations of the conditional field at fine designs through affine predictors. The points are chosen optimally in the sense that they minimize the posterior expected distance in measure between the excursion set and its reconstruction. The proposed method reduces the computational costs due to Monte Carlo simulations and enables the computation of quasi-realizations on fine designs in large dimensions. We apply this reconstruction approach to obtain realizations of an excursion set on a fine grid which allow us to give a new measure of uncertainty based on the distance transform of the excursion set. Finally we present a safety engineering test case where the simulation method is employed to compute a Monte Carlo estimate of a contour line.
Submitted 13 April, 2016; v1 submitted 15 January, 2015;
originally announced January 2015.
-
A new integral loss function for Bayesian optimization
Authors:
Emmanuel Vazquez,
Julien Bect
Abstract:
We consider the problem of maximizing a real-valued continuous function $f$ using a Bayesian approach. Since the early work of Jonas Mockus and Antanas Žilinskas in the 1970s, the problem of optimization is usually formulated by considering the loss function $\max f - M_n$ (where $M_n$ denotes the best function value observed after $n$ evaluations of $f$). This loss function puts emphasis on the value of the maximum, at the expense of the location of the maximizer. In the special case of a one-step Bayes-optimal strategy, it leads to the classical Expected Improvement (EI) sampling criterion. This is a special case of a Stepwise Uncertainty Reduction (SUR) strategy, where the risk associated with a certain uncertainty measure (here, the expected loss) on the quantity of interest is minimized at each step of the algorithm. In this article, assuming that $f$ is defined over a measure space $(\mathbb{X}, \lambda)$, we propose to consider instead the integral loss function $\int_{\mathbb{X}} (f - M_n)_{+}\, d\lambda$, and we show that this leads, in the case of a Gaussian process prior, to a new numerically tractable sampling criterion that we call $\rm EI^2$ (for Expected Integrated Expected Improvement). A numerical experiment illustrates that a SUR strategy based on this new sampling criterion reduces the error on both the value and the location of the maximizer faster than the EI-based strategy.
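As a rough illustration of the classical EI criterion mentioned in this abstract (not the $\rm EI^2$ criterion that the paper proposes), the sketch below evaluates the standard closed form of $E[(\xi - M_n)_+]$ when the posterior of $f$ at a candidate point is Gaussian. The function name `expected_improvement` and its arguments `mu`, `sigma` and `best` are hypothetical and do not come from the paper.

```python
import math


def expected_improvement(mu, sigma, best):
    """Closed-form E[(xi - best)_+] for xi ~ N(mu, sigma^2), maximization setting.

    mu, sigma: posterior mean and standard deviation at a candidate point.
    best: best value observed so far (M_n in the abstract's notation).
    """
    if sigma <= 0.0:
        # Degenerate posterior: the improvement is known exactly.
        return max(mu - best, 0.0)
    z = (mu - best) / sigma
    pdf = math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)  # standard normal density
    cdf = 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))          # standard normal CDF
    return (mu - best) * cdf + sigma * pdf
```

The criterion stays strictly positive wherever `sigma > 0`, even if `mu` is below `best`, which is what lets an EI-based strategy keep exploring uncertain regions.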
Submitted 20 August, 2014;
originally announced August 2014.
-
Relabeling and Summarizing Posterior Distributions in Signal Decomposition Problems when the Number of Components is Unknown
Authors:
Alireza Roodaki,
Julien Bect,
Gilles Fleury
Abstract:
This paper addresses the problems of relabeling and summarizing posterior distributions that typically arise, in a Bayesian framework, when dealing with signal decomposition problems with an unknown number of components. Such posterior distributions are defined over a union of subspaces of differing dimensionality and can be sampled from using modern Monte Carlo techniques, for instance the increasingly popular RJ-MCMC method. No generic approach is available, however, to summarize the resulting variable-dimensional samples and extract from them component-specific parameters. We propose a novel approach, named Variable-dimensional Approximate Posterior for Relabeling and Summarizing (VAPoRS), to this problem, which consists in approximating the posterior distribution of interest by a "simple"---but still variable-dimensional---parametric distribution. The distance between the two distributions is measured using the Kullback-Leibler divergence, and a Stochastic EM-type algorithm, driven by the RJ-MCMC sampler, is proposed to estimate the parameters. Two signal decomposition problems are considered, to show the capability of VAPoRS both for relabeling and for summarizing variable-dimensional posterior distributions: the classical problem of detecting and estimating sinusoids in white Gaussian noise on the one hand, and a particle counting problem motivated by the Pierre Auger project in astrophysics on the other hand.
Submitted 8 January, 2013;
originally announced January 2013.
-
Bayesian Subset Simulation: a kriging-based subset simulation algorithm for the estimation of small probabilities of failure
Authors:
Ling Li,
Julien Bect,
Emmanuel Vazquez
Abstract:
The estimation of small probabilities of failure from computer simulations is a classical problem in engineering, and the Subset Simulation algorithm proposed by Au & Beck (Prob. Eng. Mech., 2001) has become one of the most popular methods for solving it. Subset simulation has been shown to provide significant savings in the number of simulations needed to achieve a given accuracy of estimation, with respect to many other Monte Carlo approaches. The number of simulations remains quite high, however, and this method can be impractical for applications where an expensive-to-evaluate computer model is involved. We propose a new algorithm, called Bayesian Subset Simulation, that takes the best from the Subset Simulation algorithm and from sequential Bayesian methods based on kriging (also known as Gaussian process modeling). The performance of this new algorithm is illustrated using a test case from the literature. We are able to report promising results. In addition, we provide a numerical study of the statistical properties of the estimator.
Submitted 9 July, 2012;
originally announced July 2012.
-
Summarizing posterior distributions in signal decomposition problems when the number of components is unknown
Authors:
Alireza Roodaki,
Julien Bect,
Gilles Fleury
Abstract:
This paper addresses the problem of summarizing the posterior distributions that typically arise, in a Bayesian framework, when dealing with signal decomposition problems with an unknown number of components. Such posterior distributions are defined over a union of subspaces of differing dimensionality and can be sampled from using modern Monte Carlo techniques, for instance the increasingly popular RJ-MCMC method. No generic approach is available, however, to summarize the resulting variable-dimensional samples and extract from them component-specific parameters.
We propose a novel approach to this problem, which consists in approximating the complex posterior of interest by a "simple"---but still variable-dimensional---parametric distribution. The distance between the two distributions is measured using the Kullback-Leibler divergence, and a Stochastic EM-type algorithm, driven by the RJ-MCMC sampler, is proposed to estimate the parameters. The proposed algorithm is illustrated on the fundamental signal processing example of joint detection and estimation of sinusoids in white Gaussian noise.
Submitted 27 November, 2011;
originally announced November 2011.
-
Note on the computation of the Metropolis-Hastings ratio for Birth-or-Death moves in trans-dimensional MCMC algorithms for signal decomposition problems
Authors:
Alireza Roodaki,
Julien Bect,
Gilles Fleury
Abstract:
Reversible jump MCMC (RJ-MCMC) sampling techniques, which allow model selection and parameter estimation problems to be tackled jointly in a coherent Bayesian framework, have become increasingly popular in the signal processing literature since the seminal paper of Andrieu and Doucet (IEEE Trans. Signal Process., 47(10), 1999). Crucial to the implementation of any RJ-MCMC sampler is the computation of the so-called Metropolis-Hastings-Green (MHG) ratio, which determines the acceptance probability for the proposed moves.
It turns out that the expression of the MHG ratio that was given in the paper of Andrieu and Doucet for "Birth-or-Death" moves---the simplest kind of trans-dimensional move, used in virtually all applications of RJ-MCMC to signal decomposition problems---was erroneous. Unfortunately, this mistake has been reproduced in many subsequent papers dealing with RJ-MCMC sampling in the signal processing literature.
This note discusses the computation of the MHG ratio, with a focus on the case where the proposal kernel can be decomposed as a mixture of simpler kernels, for which the MHG ratio is easy to compute. We provide sufficient conditions under which the MHG ratio of the mixture can be deduced from the MHG ratios of the elementary kernels of which it is composed. As an application, we consider the case of Birth-or-Death moves, and provide a corrected expression for the erroneous ratio in the paper of Andrieu and Doucet.
Submitted 9 August, 2012; v1 submitted 27 November, 2011;
originally announced November 2011.
-
Bayesian optimization using sequential Monte Carlo
Authors:
Romain Benassi,
Julien Bect,
Emmanuel Vazquez
Abstract:
We consider the problem of optimizing a real-valued continuous function $f$ using a Bayesian approach, where the evaluations of $f$ are chosen sequentially by combining prior information about $f$, which is described by a random process model, and past evaluation results. The main difficulty with this approach is to be able to compute the posterior distributions of quantities of interest which are used to choose evaluation points. In this article, we decide to use a Sequential Monte Carlo (SMC) approach.
Submitted 21 November, 2011;
originally announced November 2011.
-
Sequential search based on kriging: convergence analysis of some algorithms
Authors:
Emmanuel Vazquez,
Julien Bect
Abstract:
Let $\FF$ be a set of real-valued functions on a set $\XX$ and let $S:\FF \to \GG$ be an arbitrary mapping. We consider the problem of making inference about $S(f)$, with $f\in\FF$ unknown, from a finite set of pointwise evaluations of $f$. We are mainly interested in the problems of approximation and optimization. In this article, we briefly review results concerning average error bounds of Bayesian search methods that use a random process prior about $f$.
Submitted 16 November, 2011;
originally announced November 2011.
-
Sequential design of computer experiments for the estimation of a probability of failure
Authors:
Julien Bect,
David Ginsbourger,
Ling Li,
Victor Picheny,
Emmanuel Vazquez
Abstract:
This paper deals with the problem of estimating the volume of the excursion set of a function $f:\mathbb{R}^d \to \mathbb{R}$ above a given threshold, under a probability measure on $\mathbb{R}^d$ that is assumed to be known. In the industrial world, this corresponds to the problem of estimating a probability of failure of a system. When only an expensive-to-simulate model of the system is available, the budget for simulations is usually severely limited and therefore classical Monte Carlo methods ought to be avoided. One of the main contributions of this article is to derive SUR (stepwise uncertainty reduction) strategies from a Bayesian-theoretic formulation of the problem of estimating a probability of failure. These sequential strategies use a Gaussian process model of $f$ and aim at performing evaluations of $f$ as efficiently as possible to infer the value of the probability of failure. We compare these strategies to other strategies also based on a Gaussian process model for estimating a probability of failure.
Submitted 24 April, 2012; v1 submitted 27 September, 2010;
originally announced September 2010.
-
Pointwise consistency of the kriging predictor with known mean and covariance functions
Authors:
Emmanuel Vazquez,
Julien Bect
Abstract:
This paper deals with several issues related to the pointwise consistency of the kriging predictor when the mean and the covariance functions are known. These questions are of general importance in the context of computer experiments. The analysis is based on the properties of approximations in reproducing kernel Hilbert spaces. We fix an erroneous claim of Yakowitz and Szidarovszky (J. Multivariate Analysis, 1985) that the kriging predictor is pointwise consistent for all continuous sample paths under some assumptions.
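For readers unfamiliar with the setting, the following is a minimal sketch of the kriging predictor with known mean and covariance functions (often called simple kriging), restricted to one-dimensional inputs. The names `simple_kriging`, `gaussian_cov` and `zero_mean`, as well as the Gaussian covariance and its length scale, are assumptions chosen for illustration and are not taken from the paper.

```python
import numpy as np


def simple_kriging(x_obs, y_obs, x_new, mean, cov):
    """Kriging predictor with known mean and covariance, for 1-D inputs.

    Returns mean(x_new) + k(x_new)^T K^{-1} (y_obs - mean(x_obs)).
    """
    x_obs = np.asarray(x_obs, dtype=float)
    y_obs = np.asarray(y_obs, dtype=float)
    x_new = np.asarray(x_new, dtype=float)
    K = cov(x_obs[:, None], x_obs[None, :])  # covariances between observations
    k = cov(x_obs[:, None], x_new[None, :])  # covariances observations vs. new points
    weights = np.linalg.solve(K, k)          # kriging weights, shape (n_obs, n_new)
    return mean(x_new) + weights.T @ (y_obs - mean(x_obs))


def gaussian_cov(a, b, length=0.5):
    """Illustrative Gaussian (squared-exponential) covariance; an assumed choice."""
    return np.exp(-0.5 * ((a - b) / length) ** 2)


def zero_mean(x):
    """Illustrative known mean function: identically zero."""
    return np.zeros_like(x)
```

With noise-free observations the predictor interpolates the data, so `simple_kriging(x, y, x, zero_mean, gaussian_cov)` returns `y` up to rounding. Pointwise consistency concerns what happens at points that are not observed as the design becomes dense.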
Submitted 8 December, 2009;
originally announced December 2009.
-
A unifying formulation of the Fokker-Planck-Kolmogorov equation for general stochastic hybrid systems (extended version)
Authors:
Julien Bect
Abstract:
This paper has been withdrawn from the arXiv. It is now published by Elsevier in Nonlinear Analysis: Hybrid Systems, see http://dx.doi.org/10.1016/j.nahs.2009.07.008 .
A general formulation of the Fokker-Planck-Kolmogorov (FPK) equation for stochastic hybrid systems is presented, within the framework of Generalized Stochastic Hybrid Systems (GSHS). The FPK equation describes the time evolution of the probability law of the hybrid state. Our derivation is based on the concept of mean jump intensity, which is related to both the usual stochastic intensity (in the case of spontaneous jumps) and the notion of probability current (in the case of forced jumps). This work unifies all previously known instances of the FPK equation for stochastic hybrid systems, and provides GSHS practitioners with a tool to derive the correct evolution equation for the probability law of the state in any given example.
Submitted 16 September, 2010; v1 submitted 6 January, 2009;
originally announced January 2009.
-
Probabilistic computation of wind farm power generation based on wind turbine dynamic modeling
Authors:
Herman Bayem,
Yannick Phulpin,
Philippe Dessante,
Julien Bect
Abstract:
This paper addresses the problem of predicting a wind farm's power generation when little or no statistical data is available. The study is based on a time-series wind speed model and on a simple dynamic model of a DFIG wind turbine including cut-off and cut-in behaviours. The wind turbine is modeled as a stochastic hybrid system with three operation modes. Numerical results, obtained using Monte Carlo simulations, provide the annual distribution of a wind farm's active power generation. For different numbers of wind turbines, we compare the numerical results obtained using the dynamic model with those obtained considering the wind turbine's steady-state power curve. Simulations show that the wind turbine's dynamics do not need to be considered for analyzing the annual distribution of a wind farm's generation.
Submitted 13 April, 2008; v1 submitted 9 April, 2008;
originally announced April 2008.
-
A unifying formulation of the Fokker-Planck-Kolmogorov equation for general stochastic hybrid systems
Authors:
Julien Bect
Abstract:
A general formulation of the Fokker-Planck-Kolmogorov (FPK) equation for stochastic hybrid systems is presented, within the framework of Generalized Stochastic Hybrid Systems (GSHS). The FPK equation describes the time evolution of the probability law of the hybrid state. Our derivation is based on the concept of mean jump intensity, which is related to both the usual stochastic intensity (in the case of spontaneous jumps) and the notion of probability current (in the case of forced jumps). This work unifies all previously known instances of the FPK equation for stochastic hybrid systems, and provides GSHS practitioners with a tool to derive the correct evolution equation for the probability law of the state in any given example.
Submitted 26 February, 2008; v1 submitted 24 January, 2008;
originally announced January 2008.
-
Convergence properties of the expected improvement algorithm
Authors:
Emmanuel Vazquez,
Julien Bect
Abstract:
This paper has been withdrawn from the arXiv. It is now published by Elsevier in the Journal of Statistical Planning and Inference, under the modified title "Convergence properties of the expected improvement algorithm with fixed mean and covariance functions". See http://dx.doi.org/10.1016/j.jspi.2010.04.018
An author-generated post-print version is available from the HAL repository of SUPELEC at http://hal-supelec.archives-ouvertes.fr/hal-00217562
Abstract : "This paper deals with the convergence of the expected improvement algorithm, a popular global optimization algorithm based on a Gaussian process model of the function to be optimized. The first result is that under some mild hypotheses on the covariance function k of the Gaussian process, the expected improvement algorithm produces a dense sequence of evaluation points in the search domain, when the function to be optimized is in the reproducing kernel Hilbert space generated by k. The second result states that the density property also holds for P-almost all continuous functions, where P is the (prior) probability distribution induced by the Gaussian process."
Submitted 13 June, 2010; v1 submitted 21 December, 2007;
originally announced December 2007.
-
Fokker-Planck-Kolmogorov equation for stochastic differential equations with boundary hitting resets
Authors:
Julien Bect,
Hana Baili,
Gilles Fleury
Abstract:
We consider a Markov process on a Riemannian manifold, which solves a stochastic differential equation in the interior of the manifold and jumps according to a deterministic reset map when it reaches the boundary. We derive a partial differential equation for the probability density function, involving a non-local boundary condition which accounts for the jumping behaviour of the process. This is a generalisation of the usual Fokker-Planck-Kolmogorov equation for diffusion processes. The result is illustrated with an example in the field of stochastic hybrid systems.
Submitted 28 April, 2005;
originally announced April 2005.