Software engineer focused on big data and data integration.
E-mail: [email protected]
Type: User
Bio: Keep loving, keep hungry.
An experimental open-source attempt to make GPT-4 fully autonomous.
Alibaba's MySQL binlog incremental subscription and consumption component
Based on Apache Flink. Support data synchronization/integration.
DataVines makes it easier to know your data
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
FlechazoW's blog
Apache Flink
Change Data Capture (CDC) Connectors for Apache Flink
Apache Flink connector repository
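An extension of real-time SQL built on open-source Flink; it mainly implements joins between streams and dimension tables and supports all native Flink SQL syntax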
Git Source Code Mirror - This is a publish-only repository and all pull requests are ignored. Please follow Documentation/SubmittingPatches procedure for any of your improvements.
Google core libraries for Java
Apache Hadoop
Upserts, Deletes And Incremental Processing on Big Data.
Apache Iceberg
Apache Paimon (incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
StreamPark makes stream processing easier: an easy-to-use streaming application development framework and operations platform
Mirror of Apache Kafka
Linux kernel learning materials: 200+ classic kernel articles, 100+ kernel papers, 50+ kernel projects, 500+ kernel interview questions, 80+ kernel videos
A static blog built with NextJS and the Notion API, supporting multiple deployment options. No server required, zero barrier to setting up a website. Designed for Notion and all creators.
Chinese translation of "Patterns of Distributed Systems"
Apache Pulsar - distributed pub-sub messaging system
Apache SeaTunnel
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
Apache Spark - A unified analytics engine for large-scale data processing
Shenzhen Metro big data passenger flow analysis system 🚇🚄🌟
Big data platform: distributed task scheduling system
Document templates for open-source projects (README, CONTRIBUTING, GitHub templates)