Topic: data-pipelines Goto Github
Some thing interesting about data-pipelines
Some thing interesting about data-pipelines
data-pipelines,Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Organization: apache
Home Page: https://airflow.apache.org/
data-pipelines,Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Organization: apache
Home Page: https://dolphinscheduler.apache.org/
data-pipelines,Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflake, BigQuery, Redshift) in real-time.
Organization: artie-labs
Home Page: https://artie.so
data-pipelines,Explore Apache Kafka data pipelines in Kubernetes.
Organization: bakdata
data-pipelines,Beneath is a serverless real-time data platform ⚡️
Organization: beneath-hq
Home Page: https://beneath.dev
data-pipelines,Bruin is a data pipeline tool that is designed to be easy-to-use. It allows building data pipelines using SQL and Python, and has built-in data quality checks.
Organization: bruin-data
Home Page: https://getbruin.com
data-pipelines,Building data processing pipelines for documents processing with NLP using Apache NiFi and related services
Organization: cogstack
Home Page: https://hub.docker.com/r/cogstacksystems/cogstack-nifi/
data-pipelines,MLeap: Deploy ML Pipelines to Production
Organization: combust
Home Page: https://combust.github.io/mleap-docs/
data-pipelines,Conductor OSS SDK for Python programming language
Organization: conductor-sdk
data-pipelines,Learn the basics of Apache Kafka® from leaders in the Kafka community with these video courses covering the Kafka ecosystem and hands-on exercises.
Organization: confluentinc
Home Page: https://developer.confluent.io/
data-pipelines,An orchestration platform for the development, production, and observation of data assets.
Organization: dagster-io
Home Page: https://dagster.io
data-pipelines,The practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow.
User: danilbaibak
data-pipelines,The best place to learn data engineering. Built and maintained by the data engineering community.
Organization: data-engineering-community
Home Page: https://dataengineering.wiki
data-pipelines,The developer-friendly ETL platform for transforming data in real-time. Based on Apache Kafka® and Kubernetes®.
Organization: datacater
Home Page: https://datacater.io
data-pipelines,Performance Observability for Apache Spark
Organization: dataflint
data-pipelines,Dataform is a framework for managing SQL based data operations in BigQuery
Organization: dataform-co
Home Page: https://cloud.google.com/dataform/docs
data-pipelines,Relational data pipelines for the science lab
Organization: datajoint
Home Page: https://datajoint.com/docs
data-pipelines,Dataplane is an Airflow inspired unified data platform with additional data mesh and RPA capability to automate, schedule and design data pipelines and workflows. Dataplane is written in Golang with a React front end.
Organization: dataplane-app
Home Page: https://dataplane.app
data-pipelines,The open source, standalone, fullstack .NET job orchestrator that we've been missing.
Organization: didacthq
Home Page: https://www.didact.dev
data-pipelines,The REST API and execution engine for the Didact Platform.
Organization: didacthq
Home Page: https://www.didact.dev
data-pipelines,dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Organization: elementary-data
Home Page: https://www.elementary-data.com/
data-pipelines,The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
Organization: elementary-data
Home Page: https://www.elementary-data.com/
data-pipelines,Spark-Transformers: Library for exporting Apache Spark MLLIB models to use them in any Java application with no other dependencies.
Organization: flipkart-incubator
data-pipelines,Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
User: fmind
Home Page: https://fmind.github.io/mlops-python-package/
data-pipelines,A kedro plugin to use pandera in your kedro projects
User: galileo-galilei
Home Page: https://kedro-pandera.readthedocs.io/en/latest/
data-pipelines,Cloud-native, data onboarding architecture for Google Cloud Datasets
Organization: googlecloudplatform
Home Page: https://cloud.google.com/solutions/datasets
data-pipelines,Classwork projects and home works done through Udacity data engineering nano degree
User: immu0001
data-pipelines,RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Organization: infiniflow
Home Page: https://ragflow.io
data-pipelines,Lean and mean distributed stream processing system written in rust and web assembly.
Organization: infinyon
Home Page: https://www.fluvio.io/
data-pipelines,Udacity Data Engineering Nanodegree Program
User: kenthsu
data-pipelines,A lightweight CLI tool for versioning data alongside source code and building data pipelines.
User: kevin-hanselman
Home Page: https://kevin-hanselman.github.io/dud/
data-pipelines,An Open Source PHP Reporting Framework that helps you to write perfect data reports or to construct awesome dashboards in PHP. Working great with all PHP versions from 5.6 to latest 8.0. Fully compatible with all kinds of MVC frameworks like Laravel, CodeIgniter, Symfony.
User: koolreport
Home Page: https://www.koolreport.com/
data-pipelines,Multi-hop declarative data pipelines
Organization: linkedin
data-pipelines,🧙 Build, run, and manage data pipelines for integrating and transforming data.
Organization: mage-ai
Home Page: https://www.mage.ai/
data-pipelines,Example of an ETL Pipeline using Airflow
User: mdh266
data-pipelines,Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
Organization: meltano
Home Page: https://meltano.com/
data-pipelines,Found a data engineering challenge or participated in a selection process ? Share with us!
User: minhadona
data-pipelines,Move your data with ease.
Organization: mycelial
Home Page: https://mycelial.com
data-pipelines,First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.
Organization: opendatadiscovery
Home Page: https://opendatadiscovery.org
data-pipelines,Build data pipelines, the easy way 🛠️
Organization: orchest
Home Page: https://orchest.readthedocs.io/en/stable/
data-pipelines,Data pipelines from re-usable components
Organization: patterns-app
data-pipelines,Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
Organization: raystack
Home Page: https://raystack.github.io/optimus
data-pipelines,Work with your web service, database, and streaming schemas in a single format.
Organization: recap-build
Home Page: https://recap.build
data-pipelines,The framework for fast development and deployment of RAG systems.
Organization: sciphi-ai
Home Page: https://r2r-docs.sciphi.ai/
data-pipelines,Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation, validation and loading of data from S3 -> Redshift -> S3
User: shravan-kuchkula
data-pipelines,Smart Automation Tool for building modern Data Lakes and Data Pipelines
Organization: smart-data-lake
Home Page: https://www.smartdatalake.io
data-pipelines,A curated list of awesome projects and resources related to Kubeflow (a CNCF incubating project)
User: terrytangyuan
Home Page: https://terrytangyuan.github.io/awesome-kubeflow/
data-pipelines,Main repo including core data model, data marts, reference data, terminology, and the clinical concept library
Organization: tuva-health
Home Page: https://thetuvaproject.com/
data-pipelines,Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Organization: unstructured-io
Home Page: https://www.unstructured.io/
data-pipelines,One framework to develop, deploy and operate data workflows with Python and SQL.
Organization: vmware
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.