
rdp's Introduction

RDP


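Vipshop's distributed data pipeline system

Introduction

RDP stands for Real-Time Data Pipeline. It is a data pipeline that synchronizes data from MySQL to Kafka. As the name suggests, RDP does not produce data; it only "carries" it.

RDP pulls Binlog data from the upstream MySQL, analyzes and parses it, serializes it in Protocol Buffers (PB) format, compresses it, and pushes it to a downstream system such as Apache Kafka, so that users receive data change messages in a friendly format. Downstream services can subscribe to or query this data and then process it according to their own business logic; RDP acts as the data pipeline between upstream and downstream services. From fetching data upstream through pushing it downstream, RDP sustains a throughput of 200,000+ transactions per second (roughly 1,000,000+ events per second), which meets the needs of high-concurrency workloads. In addition, RDP's processing latency is at the millisecond level, which also meets the requirements of real-time stream computing.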
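The stages described above (parse, serialize, compress, push) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not RDP's implementation: `RowEvent`, `Transaction`, and `InMemorySink` are hypothetical names, JSON stands in for the Protocol Buffers serialization so that the sketch runs with only the standard library, zlib is an assumed codec (the text does not name the one RDP uses), and an in-memory list stands in for a Kafka topic.

```python
import json
import zlib
from dataclasses import dataclass, asdict
from typing import Dict, List


@dataclass
class RowEvent:
    """Hypothetical, simplified stand-in for one parsed binlog row change."""
    database: str
    table: str
    action: str  # "insert", "update" or "delete"
    row: Dict[str, object]


@dataclass
class Transaction:
    """Hypothetical container: one upstream transaction and its row events."""
    gtid: str
    events: List[RowEvent]


def serialize(txn: Transaction) -> bytes:
    # RDP serializes to Protocol Buffers; JSON is swapped in here only so
    # that the sketch needs nothing beyond the standard library.
    return json.dumps(asdict(txn), sort_keys=True).encode("utf-8")


def compress(payload: bytes) -> bytes:
    # zlib is an assumption; the source does not say which codec RDP uses.
    return zlib.compress(payload)


class InMemorySink:
    """Stand-in for a downstream topic (e.g. Kafka): it only collects messages."""

    def __init__(self) -> None:
        self.messages: List[bytes] = []

    def push(self, message: bytes) -> None:
        self.messages.append(message)


def run_pipeline(transactions: List[Transaction], sink: InMemorySink) -> None:
    # One message per transaction: serialize, compress, then push downstream.
    for txn in transactions:
        sink.push(compress(serialize(txn)))


def decode(message: bytes) -> dict:
    """What a downstream consumer would do: decompress, then deserialize."""
    return json.loads(zlib.decompress(message).decode("utf-8"))


if __name__ == "__main__":
    txn = Transaction(
        gtid="uuid:1",
        events=[RowEvent("shop", "orders", "insert", {"id": 1, "status": "paid"})],
    )
    sink = InMemorySink()
    run_pipeline([txn], sink)
    print(decode(sink.messages[0]))
```

A downstream consumer reverses the last two stages, as `decode` does. The quoted throughput figures also imply an average of about five events per transaction (1,000,000 / 200,000).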

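Features

High availability

  • Comprehensive failover capability ensures continuous service for business systems. Data consistency is guaranteed (data loss caused by a MySQL switchover is outside this scope).

Low latency

  • A streamlined I/O path improves timeliness. Parallel data processing ensures high performance.

Traceability

  • Whether data was lost is transparent and verifiable. If data was lost, there is a record of exactly what was lost.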


Documentation

Click to view

rdp's People

Contributors

firnsan, tom-tangjp


rdp's Issues

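Configuring filter rules and position information

Two questions:

  1. Filter rule configuration
    According to the documentation, filter rules work like a whitelist. Is a blacklist style of rule also supported?

  2. Position configuration
    If I want rdp to start synchronizing data from a specified position, can I set it directly in ZooKeeper (zk), the same way filter rules are set?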
