Pengcheng Huang's Projects
All for the software engineer interview. Leetcode, System Design, Data Structure, Design Pattern, Concurrency, etc.
Materials and Reference for Distributed
Ansible web UI with a simpler, better layout
A curated list of awesome JSON datasets that don't require authentication.
Sohu TV (搜狐视频) Redis private cloud platform
This is a consolidated list of CKAD practice questions
A simple configuration library for Java applications that can handle a variety of formats and provides a node-based data structure able to handle a wide variety of configuration schemas
A demo of how to customize FragmentTabHost
MapReduce, Spark, Java, and Scala for Data Algorithms Book
A Django application for remotely monitoring servers using SSH
Internal automated operations and maintenance management system
The IK Analysis plugin integrates the Lucene IK analyzer into Elasticsearch and supports customized dictionaries.
ELK Stack Chinese guide
Build configuration-driven ETL pipelines on Apache Spark
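Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, principles, hands-on practice, performance tuning, source code analysis, and more. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, Table API & SQL, plus case studies of large-scale production Flink projects (PV/UV, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). Support for my column "Flink in Action and Performance Optimization: A Big Data Real-Time Computing Engine" is welcome.
A web platform for real-time stream computing based on flink-sql
Apache Flink Training Exercises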
Source of Flume NG for tailing files in multiple directories
first version
:octocat: A high-quality Chinese-language Git tutorial, drawn from excellent articles in overseas communities and from personal practice
Gobblin is a distributed big data integration framework (ingestion, replication, compliance, retention) for batch and streaming systems. Gobblin features integrations with Apache Hadoop, Apache Kafka, Salesforce, S3, MySQL, Google etc.
This tool installs v2ray on Doprax, including the VMess and VLess protocols, and automatically switches IPs. To use it, fork this project, read readme.md, and run it. Created by ifeng.
A Java implementation of several commonly used Logstash inputs/filters/outputs, aiming for a large improvement in efficiency
Spark-based hive proxy with HTTP interface
Dolphin Scheduler is a distributed and easy-to-expand visual DAG workflow scheduling system, dedicated to solving the complex dependencies in data processing so that the scheduling system works out of the box for data processing. (Distributed, easily extensible visual workflow task scheduling)