tufanrakshit Goto Github PK
Type: User
Company: Big data
Bio: Senior Data Architect
Location: Poland
Type: User
Company: Big data
Bio: Senior Data Architect
Location: Poland
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Airflow helm chart for AWS EKS
Project repository of Apache Airflow, deployed on Docker in Amazon EC2 via GitLab.
Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested data pipelines(DAGs) :desktop_computer: >> [ :rocket:, :ship: ]
Alpakka Kafka connector - Alpakka is a Reactive Enterprise Integration library for Java and Scala, based on Reactive Streams and Akka.
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
Code Repository for Apache Kafka Series - Learn Apache Kafka for Beginners, Published by Packt
:atom: The hackable text editor
Autoscaling components for Kubernetes
A curated list of awesome big data frameworks, ressources and other awesomeness.
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
A curated list of awesome Apache Spark packages and resources.
Pointers to useful, well-written, and otherwise beautiful documentation.
Kafka Consumer Lag Checking
A complete computer science study plan to become a software engineer.
The Confluent Platform Helm charts enable you to deploy Confluent Platform services on Kubernetes for development, test, and proof of concept environments.
The Python programming language
Easy and safe way to manage your crontab file
My CV / Resume
Dynamically generate Apache Airflow DAGs from YAML configuration files
Extensible Rules Engine for custom Dataframe / Dataset validation
A Metadata Platform for the Modern Data Stack
a dbt package to make auditing dbt runs easy.
Collection of dbt Tips and Tricks
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Dione - a Spark and HDFS indexing library
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.