(Translated by https://www.hiragana.jp/)
Introduction | Nomic Atlas Documentation
Skip to main content

Introduction

Nomic Atlas is a platform for interacting with unstructured datasets of text, image, video, audio, and embeddings at scale.

New Release: Nomic Embedding Vision and API

We've launched Nomic Embed Vision, a vision model aligned to Nomic Embed Text! All existing Nomic Embed Text embeddings are now multimodel; Nomic Embed Text embeddings can be used query the new Nomic Embed Vision embeddings out of the box, and visa versa. Together, Nomic Embed Text and Nomic Embed Vision project data into the only unified embedding space that achieves state of the art performance on vision, language, and multimodal tasks.

You can use it as the image embedding model powering your AtlasDataset and it is available in the Nomic Embedding API.

Read more in our official blog post and learn how to use it in the API Reference.

Resources

Key Concepts

Unstructured data

Nomic Atlas enables anyone to find insights in and build with unstructured data. Unstructured data is anything that you normally would not store in a spreadsheet: large collections of text documents, galleries of images, audio files, videos and the training/evaluation datasets of AI models. Nomic Atlas uses AI and Embeddings to help you quickly understand, build with and share your unstructured datasets.

Embeddings

An embedding is a vector representation of an unstructured datapoint that enables computers to manipulate the data based on semantics and meaning. All data uploaded to Nomic Atlas has a corresponding embedding assigned to it. Nomic Embedding Models are used to assign embeddings to your uploaded data if you do not specify embeddings during data upload. Embeddings are key to powering the Unstructured Data Map and organizing your data by meaning.

Unstructured data map

The fastest way to understand and work with unstructured data is to look at it. Anytime you upload a datapoint, Nomic Atlas organizes it in an AI powered data interface called the map. The map groups together similar datapoints in your dataset spatially. You can collaborate on your dataset with others by sharing a browser link to the map and developers can access its data and operations programmatically.