(Translated by https://www.hiragana.jp/)
spark · GitHub Topics · GitHub
Skip to content
#

Apache Spark

spark logo

Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.

Here are 8,555 public repositories matching this topic...

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • Updated Mar 20, 2024
  • Python
flink-learning

flink learning blog. http://www.54tianzhisheng.cn/ 含 Flink にゅう门、概念がいねん原理げんり、实战、性能せいのう调优、みなもと解析かいせきとう内容ないようわたる及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL とう内容ないようてきがく习案れい,还有 Flink 落地おろち应用てき大型おおがた项目あんれい(PVUV、にちこころざしそん储、百亿数据实时去重、监控つげ警)ぶんとおる。欢迎大家たいか支持しじてき专栏《大数たいすうすえ实时计算引擎 Flink 实战与性能せいのう优化》

  • Updated May 25, 2024
  • Java

Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learn...

  • Updated Sep 13, 2024
  • Java

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

  • Updated Oct 9, 2024
  • Jupyter Notebook

Created by Matei Zaharia

Released May 26, 2014

Followers
421 followers
Repository
apache/spark
Website
spark.apache.org
Wikipedia
Wikipedia

Related Topics

hadoop scala