john8628 Goto Github PK
Name: john
Type: User
Bio: Open Source Enthusiast; Focus on big data ecosystem;
Location: shanghai
Name: john
Type: User
Bio: Open Source Enthusiast; Focus on big data ecosystem;
Location: shanghai
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data every day.
🔥 🔥 🔥 An intelligent and versatile general-purpose SQL client and reporting tool for databases which integrates ChatGPT capabilities.
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca
ClickHouse® is a free analytics DBMS for big data
《Designing Data-Intensive Application》DDIA中文翻译
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
One API for plugins and datasets, one interface for prompt engineering and visual operation, all for creating powerful AI applications.
Fine-tune Mistral-7b on the Enlighten codebase
An open platform for training, serving, and evaluating large languages. Release repo for Vicuna and FastChat-T5.
:book: [译] 面向机器学习的特征工程
Feature Store for Machine Learning
forcast
An open-source, cloud-native, distributed time-series database with PromQL/SQL/Python supported.
Upserts, Deletes And Incremental Processing on Big Data.
Apache Iceberg
Apache Iceberg
Apache OpenDAL: access data freely.
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
Uniffle is a high performance, general purpose Remote Shuffle Service.
welcome to my friends
Demo code for implementing and showcasing a Fraud Detection Engine with Apache Flink.
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM) QA app with langchain
python examples and machine learning examples using python
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.