Topic: spark
Something interesting about Spark
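spark,[Big Tech Interview Column] A technical guide for Java programmers, covering interview questions, system architecture, career tips, mainstream middleware, and more, to help you become a better version of yourself!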
User: aalansehaiyang
Home Page: https://offercome.cn/
spark,Alluxio, data orchestration for analytics and machine learning in the cloud
Organization: alluxio
Home Page: https://www.alluxio.io
spark,A Flexible and Powerful Parameter Server for large-scale machine learning
Organization: angel-ml
spark,Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Organization: apache
Home Page: https://kyuubi.apache.org/
spark,Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
Organization: apache
Home Page: https://linkis.apache.org/
spark,Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
Organization: apache
Home Page: https://paimon.apache.org/
spark,Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Organization: awslabs
spark,The Hunting ELK
User: cyb3rward0g
spark,Koalas: pandas API on Apache Spark
Organization: databricks
spark,Free Data Engineering course!
Organization: datatalksclub
spark,Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learn...
Organization: deeplearning4j
Home Page: http://deeplearning4j.konduit.ai
spark,An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Organization: delta-io
Home Page: https://delta.io
spark,Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
User: donnemartin
spark,macOS development environment setup: Easy-to-understand instructions with automated setup scripts for developer tools like Vim, Sublime Text, Bash, iTerm, Python data analysis, Spark, Hadoop MapReduce, AWS, Heroku, JavaScript web development, Android development, common data stores, and dev-based OS X defaults.
User: donnemartin
spark,.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Organization: dotnet
Home Page: https://dot.net/spark
spark,List of Data Science Cheatsheets to rule the world
User: faviovazquez
spark,A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
Organization: fugue-project
Home Page: https://fugue-tutorials.readthedocs.io/
spark,GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
User: gaizhenbiao
Home Page: https://huggingface.co/spaces/JohnSmith9982/ChuanhuChatGPT
spark,Shenzhen Metro big data passenger flow analysis system 🚇🚄🌟
User: geekyouth
Home Page: https://github.com/geekyouth/SZT-bigdata
spark,Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Organization: getredash
Home Page: http://redash.io/
spark,H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Organization: h2oai
Home Page: http://h2o.ai
spark,A beginner's guide to big data :star:
User: heibaiying
spark,Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
Organization: horovod
Home Page: http://horovod.ai
spark,State of the Art Natural Language Processing
Organization: johnsnowlabs
Home Page: https://sparknlp.org/
spark,Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Organization: kubeflow
spark,LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
Organization: lakesoul-io
Home Page: https://lakesoul-io.github.io/
spark,🔨 Generate structured SQL statements from JSON. Built with Vue3 + TypeScript + Vite + Ant Design + MonacoEditor. A simple project (heavy on logic, light on pages) that is well suited for practice.
User: liyupi
Home Page: http://sql.yupi.icu
spark,Coolplay Spark: Spark source code analysis, Spark libraries, and more
User: lw-lin
spark,🧙 Build, run, and manage data pipelines for integrating and transforming data.
Organization: mage-ai
Home Page: https://www.mage.ai/
spark,Simple and Distributed Machine Learning
Organization: microsoft
Home Page: http://aka.ms/spark
spark,PipelineAI
Organization: pipelineai
Home Page: https://generativeaionaws.com
spark,A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Apache Pinot, Tablesaw, and many others
Organization: roaringbitmap
Home Page: http://roaringbitmap.org/
spark,TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
Organization: salesforce
Home Page: https://transmogrif.ai
spark,REST job server for Apache Spark
Organization: spark-jobserver
spark,Interactive and Reactive Data Science using Scala and Spark.
Organization: spark-notebook
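spark,cube studio is an open-source, cloud-native, one-stop machine learning / deep learning / large-model AI platform. It supports SSO login, multi-tenancy, big data platform integration, online notebook development, drag-and-drop task-flow pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, VGPU inference serving, edge computing, serverless, a labeling platform, automated labeling, dataset management, large-model fine-tuning, vllm large-model inference, llmops, private knowledge bases, and an AI model app store. It also supports one-click model development/inference/fine-tuning, domestic (Chinese) cpu/gpu/npu chips, RDMA, and distributed pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano.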
Organization: tencentmusic
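spark,:herb: Quick-start learning examples based on springboot, integrating open-source frameworks the author has come across, such as rabbitmq (delay queues), Kafka, jpa, redis, oauth2, swagger, jsp, docker, k3s, k3d, k8s, a mybatis encryption/decryption plugin, exception handling, log output, multi-module development, multi-environment packaging, caching, web crawlers, jwt, GraphQL, dubbo, zookeeper, Async, and more :pushpin: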
User: vector4wang
Home Page: http://blog.wangxc.club
spark,Focused on big data learning and interviews; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...
User: wangzhiwubigdata
spark,DataSphereStudio is a one-stop data application development and management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Organization: webankfintech
Home Page: https://github.com/WeBankFinTech/DataSphereStudio-Doc
spark,TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
Organization: yahoo
spark,Learn and understand Docker and container technologies, with real DevOps practice!
User: yeasy
Home Page: https://yeasy.gitbook.io/docker_practice/
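spark,Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, principles, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, and Table API & SQL, plus large-scale production Flink project cases (PV/UV, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). You are welcome to support my column 《大数据实时计算引擎 Flink 实战与性能优化》 (Big Data Real-Time Computing Engine: Flink in Practice and Performance Optimization).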
User: zhisheng17
Home Page: http://www.54tianzhisheng.cn/tags/Flink/