Topic: apache-spark
Something interesting about apache-spark
apache-spark,Infrastructures™ for Machine Learning Training/Inference in Production.
User: 1duo
apache-spark,Fully managed Apache Parquet implementation
User: aloneguid
Home Page: https://aloneguid.github.io/parquet-dotnet/
apache-spark,A curated list of awesome Apache Spark packages and resources.
Organization: awesome-spark
apache-spark,Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks
Organization: awesome-spark
apache-spark,Apache Spark docker image
Organization: big-data-europe
apache-spark,PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster
User: cartershanklin
apache-spark,Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
Organization: cerndb
Home Page: http://joerihermans.com/work/distributed-keras/
apache-spark,Use SQL to build ELT pipelines on a data lakehouse.
Organization: cuebook
Home Page: https://cuelake.cuebook.ai
apache-spark,This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Organization: databricks
Home Page: https://learning.oreilly.com/library/view/learning-spark-2nd/9781492050032/
apache-spark,(Deprecated) Scikit-learn integration package for Apache Spark
Organization: databricks
apache-spark,A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.
Organization: datamechanics
Home Page: https://www.datamechanics.co/delight
apache-spark,Train and run PyTorch models on Apache Spark.
User: dmmiller612
apache-spark,.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Organization: dotnet
Home Page: https://dot.net/spark
apache-spark,A boilerplate for writing PySpark Jobs
User: ekampf
apache-spark,Feathr – A scalable, unified data and AI engineering platform for enterprise
Organization: feathr-ai
Home Page: https://join.slack.com/t/feathrai/shared_invite/zt-1ffva5u6v-voq0Us7bbKAw873cEzHOSg
apache-spark,A Spark Atlas connector to track data lineage in Apache Atlas
Organization: hortonworks-spark
apache-spark,Serverless proxy for Spark cluster
Organization: hydrospheredata
Home Page: http://hydrosphere.io/mist/
apache-spark,A list about Apache Kafka
User: infoslack
apache-spark,BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray
Organization: intel-analytics
Home Page: https://bigdl.readthedocs.io
apache-spark,The Internals of Apache Spark
Organization: japila-books
Home Page: https://books.japila.pl/apache-spark-internals
apache-spark,The Internals of Spark SQL
Organization: japila-books
Home Page: https://books.japila.pl/spark-sql-internals
apache-spark,The Internals of Spark Structured Streaming
Organization: japila-books
Home Page: https://books.japila.pl/spark-structured-streaming-internals
apache-spark,Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Organization: kubeflow
apache-spark,PySpark + Scikit-learn = Sparkit-learn
Organization: lensacom
apache-spark,Easy-to-use library to bring TensorFlow to Apache Spark
Organization: lifeomic
apache-spark,Includes notes on using Apache Spark, Spark for Physics, a tool for running TPCDS on PySpark, a tool for performance testing CPUs, Jupyter notebook examples for Spark, Oracle and other DB systems.
User: lucacanali
apache-spark,This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination of Spark metrics, making it a practical choice for both developers and data engineers.
User: lucacanali
apache-spark,Coolplay Spark: Spark source code analysis, Spark libraries, and more
User: lw-lin
apache-spark,Papers and reading material related to streaming systems
User: lw-lin
apache-spark,MapReduce, Spark, Java, and Scala for Data Algorithms Book
User: mahmoudparsian
Home Page: http://mapreduce4hackers.com
apache-spark,Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Organization: microsoft
apache-spark,Simple and Distributed Machine Learning
Organization: microsoft
Home Page: http://aka.ms/spark
apache-spark,[PROJECT IS NO LONGER MAINTAINED] Code examples that show how to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
User: miguno
Home Page: http://www.michael-noll.com/blog/2014/05/27/kafka-storm-integration-example-tutorial/
apache-spark,[PROJECT IS NO LONGER MAINTAINED] Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data tech like Kafka.
User: miguno
apache-spark,Notes on Apache Spark (pyspark)
User: mingchen0919
apache-spark,Open source platform for the machine learning lifecycle
Organization: mlflow
Home Page: https://mlflow.org
apache-spark,pyspark methods to enhance developer productivity 📣 👯 🎉
User: mrpowers
Home Page: https://mrpowers.github.io/quinn/
apache-spark,A command-line tool for launching Apache Spark clusters.
User: nchammas
apache-spark,Morpheus brings the leading graph query language, Cypher, onto the leading distributed processing platform, Spark.
Organization: opencypher
apache-spark,REST web service for the true real-time scoring (<1 ms) of Scikit-Learn, R and Apache Spark models
Organization: openscoring
apache-spark,Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
Organization: oryxproject
Home Page: http://oryx.io
apache-spark,SQL data analysis & visualization projects using MySQL, PostgreSQL, SQLite, Tableau, Apache Spark and PySpark.
User: ptyadana
apache-spark,Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
User: rjurney
Home Page: http://bit.ly/agile_data_science
apache-spark,An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
User: san089
apache-spark,Interactive and Reactive Data Science using Scala and Spark.
Organization: spark-notebook
apache-spark,R interface for Apache Spark
Organization: sparklyr
Home Page: https://spark.rstudio.com/
apache-spark,Fundamentals of Spark with Python (using PySpark), code examples
User: tirthajyoti
apache-spark,lakeFS - Data version control for your data lake | Git for data
Organization: treeverse
Home Page: https://docs.lakefs.io