Estimating Probability Densities with Transformer and Denoising Diffusion

Leung, Henry W.; Bovy, Jo; Speagle, Joshua S.

arXiv:2407.15703 (cs)

[Submitted on 22 Jul 2024]

Abstract: Transformers are often the go-to architecture for building foundation models that ingest large amounts of training data. However, when trained on regression problems, these models do not estimate a probability density, yet full probabilistic outputs are crucial in many fields of science, where the distribution of the answer can be non-Gaussian and multimodal. In this work, we demonstrate that training a probabilistic model with a denoising diffusion head on top of the Transformer provides reasonable probability density estimation even for high-dimensional inputs. The combined Transformer+Denoising Diffusion model allows the output probability density to be conditioned on arbitrary combinations of inputs, making it a highly flexible density function emulator of all possible input/output combinations. We illustrate the Transformer+Denoising Diffusion model by training it on a large dataset of astronomical observations and measured labels of stars within our Galaxy, and we apply it to a variety of inference tasks to show that the model can infer labels accurately with reasonable distributions.
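The abstract does not give implementation details, so the following is only a minimal sketch of how a denoising diffusion head is commonly trained, not the authors' code. It assumes a standard DDPM-style noise-prediction objective with a linear beta schedule, a scalar label per example, and a `context` array standing in for the Transformer's output embedding. All function names, parameters, and the schedule values are hypothetical choices made for illustration.

```python
import numpy as np

def make_alpha_bar(num_steps=100, beta_start=1e-4, beta_end=0.02):
    """Cumulative signal fraction for a linear beta schedule (an assumed choice)."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    return np.cumprod(1.0 - betas)

def add_noise(y0, t, alpha_bar, rng):
    """Forward process: y_t = sqrt(abar_t) * y0 + sqrt(1 - abar_t) * eps."""
    eps = rng.standard_normal(y0.shape)
    a = alpha_bar[t]  # one value per example; y0 is assumed 1-D (scalar labels)
    y_t = np.sqrt(a) * y0 + np.sqrt(1.0 - a) * eps
    return y_t, eps

def denoising_loss(predict_noise, y0, context, alpha_bar, rng):
    """Mean squared error between the true and predicted noise.

    predict_noise(y_t, t, context) is a hypothetical stand-in for the
    diffusion head; context stands in for the Transformer embedding.
    """
    t = rng.integers(0, len(alpha_bar), size=y0.shape[0])  # random timestep per example
    y_t, eps = add_noise(y0, t, alpha_bar, rng)
    eps_hat = predict_noise(y_t, t, context)
    return float(np.mean((eps_hat - eps) ** 2))

# Usage with toy data and a trivial head that always predicts zero noise.
rng = np.random.default_rng(0)
alpha_bar = make_alpha_bar()
y0 = rng.standard_normal(8)             # toy labels
context = rng.standard_normal((8, 16))  # toy conditioning embeddings
zero_head = lambda y_t, t, ctx: np.zeros_like(y_t)
loss = denoising_loss(zero_head, y0, context, alpha_bar, rng)
```

In a real model, `predict_noise` would be a trainable network that takes the conditioning embedding as input. Sampling the label distribution would then run the learned reverse process starting from Gaussian noise, which this sketch does not cover.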

Comments:	Accepted at the ICML 2024 Workshop on Foundation Models in the Wild
Subjects:	Machine Learning (cs.LG); Instrumentation and Methods for Astrophysics (astro-ph.IM); Machine Learning (stat.ML)
Cite as:	arXiv:2407.15703 [cs.LG]
	(or arXiv:2407.15703v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2407.15703

Submission history

From: Henry Leung
[v1] Mon, 22 Jul 2024 15:10:41 UTC (2,229 KB)
