GAUCHE: a library for Gaussian processes in chemistry

RR Griffiths, L Klarner, H Moss… - Advances in …, 2024 - proceedings.neurips.cc
Advances in Neural Information Processing Systems, 2024proceedings.neurips.cc
We introduce GAUCHE, an open-source library for GAUssian processes in CHEmistry.
Gaussian processes have long been a cornerstone of probabilistic machine learning,
affording particular advantages for uncertainty quantification and Bayesian optimisation.
Extending Gaussian processes to molecular representations, however, necessitates kernels
defined over structured inputs such as graphs, strings and bit vectors. By providing such
kernels in a modular, robust and easy-to-use framework, we seek to enable expert chemists …
Abstract
We introduce GAUCHE, an open-source library for GAUssian processes in CHEmistry. Gaussian processes have long been a cornerstone of probabilistic machine learning, affording particular advantages for uncertainty quantification and Bayesian optimisation. Extending Gaussian processes to molecular representations, however, necessitates kernels defined over structured inputs such as graphs, strings and bit vectors. By providing such kernels in a modular, robust and easy-to-use framework, we seek to enable expert chemists and materials scientists to make use of state-of-the-art black-box optimization techniques. Motivated by scenarios frequently encountered in practice, we showcase applications for GAUCHE in molecular discovery, chemical reaction optimisation and protein design. The codebase is made available at https://github. com/leojklarner/gauche.
proceedings.neurips.cc