-
Databricks, @databricks
- Boise, ID
Stars
utility to migrate Runkeeper data (GPX and CSV) to Strava
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
This project provides fully automated one-click experience to create Cloud and Kubernetes environment to run Data Analytics workload like Apache Spark.
Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to your big data workloads.
Profiler for large-scale distributed java applications (Spark, Scalding, MapReduce, Hive,...) on YARN.
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
Vagrant project to run a development/test instance of codefoundry
An easy, fast, efficient way to backup multiple subversion repositories.
RVM::FW - Exposing hidden Rubies for firewalled RVMs
adamonduty / marker
Forked from rdblue/markerA markup parser that outputs html and text. Syntax is similar to MediaWiki.
A markup parser that outputs html and text. Syntax is similar to MediaWiki.
Determines which markup library to use to render a content file (e.g. README) on GitHub