-
GPU-Accelerating Apache Spark @NVIDIA
- United States
Stars
Simple, portable, and self-contained stacktrace library for C++11 and newer
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app
Distributed data engine for Python/SQL designed for the cloud, powered by Rust
stdgpu: Efficient STL-like Data Structures on the GPU
Slint is a declarative GUI toolkit to build native user interfaces for Rust, C++, or JavaScript apps.
Create a Movie animation plus Audio plus Subtitle from a text file
Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.
Notes talking about the design and implementation of Apache Spark
Examples of single-cell genomic analysis accelerated with RAPIDS
The fastest logging library in the world. Built from scratch in Scala and programmatically configurable.
Scala library for boilerplate-free, type-safe data transformations
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
Spark RAPIDS Container – Docker containers for Spark RAPIDS
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
A creator library for procedural 2D noises and patterns in Rust.
A repo for all spark examples using Rapids Accelerator including ETL, ML/DL, etc.
NVIDIA Federated Learning Application Runtime Environment
Notes from books and other interesting things that I've read. Table of contents at the end 👇
All Algorithms implemented in Python
Spark RAPIDS Benchmarks – benchmark sets and utilities for the RAPIDS Accelerator for Apache Spark
Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask without any rewrites.
Data science interview questions and answers
An open source list of developer questions to ask prospective employers
Carbon Language's main repository: documents, design, implementation, and related tools. (NOTE: Carbon Language is experimental; see README)