Giter VIP home page Giter VIP logo

leehuwuj / dagster Goto Github PK

View Code? Open in Web Editor NEW

This project forked from dagster-io/dagster

0.0 0.0 0.0 438.85 MB

An orchestration platform for the development, production, and observation of data assets.

Home Page: https://dagster.io

License: Apache License 2.0

Shell 0.05% JavaScript 0.32% Python 83.19% TypeScript 14.94% CSS 0.02% Makefile 0.04% HTML 0.02% Smarty 0.12% Mako 0.01% Jupyter Notebook 1.05% Dockerfile 0.14% Mustache 0.09%

dagster's Introduction

dagster logo

Dagster is an orchestration platform for the development, production, and observation of data assets.

  • Develop and test locally, then deploy anywhere: With Dagster, the same computations can run in-process against your local file system or on a distributed work queue against your production data lake. Choose to locally develop on your laptop, deploy on-premise, or run in any cloud.
  • Model the data produced and consumed: In your orchestration graph, Dagster models data dependencies and handles how data passes between steps. Gradual typing on inputs and outputs catches bugs early.
  • Link data to computations: Dagster’s Asset Catalog tracks the data sets and ML models produced by your jobs. Understand how they were generated and trace issues when asset declarations do not match their materializations in storage.
  • Build a self-service data platform: Dagster helps platform teams build systems for data practitioners. Jobs are built from shared, reusable, configurable data processing components. Dagit, Dagster’s web interface, lets anyone inspect these objects and discover how to use them.
  • Declare and isolate dependencies: Dagster’s server model enables you to isolate codebases. Problems in one job will not bring down the system or other jobs. Each job can have its own package dependencies and Python version.
  • Debug jobs from a rich interface: Dagit includes expansive facilities for understanding the jobs it orchestrates. When inspecting a run of your job, you can query over logs, discover the most time-consuming tasks via a Gantt chart, re-execute subsets of steps, and more.

Installation

Dagster is available on PyPI and officially supports Python 3.6+.

pip install dagster dagit

This installs two modules:

  • Dagster: The core programming model.
  • Dagit: The web interface for developing and operating Dagster jobs. It includes a DAG browser, a type-aware interface to launch runs, a live view for in-progress runs, a catalog to view your data assets, and more.

For a quick overview, check out our Getting Started page.

Documentation

You can find the Dagster documentation on the website.

We've divided up the documentation into several sections:

Community

Connect with thousands of other data practitioners building with Dagster. Share knowledge, get help, and contribute to the open-source project. To see featured material and upcoming events, check out our Dagster Community page.

Join our community here:

Contributing

For details on contributing or running the project for development, check out our contributing guide.

License

Dagster is Apache 2.0 licensed.

dagster's People

Contributors

abegong avatar ajnadel avatar alangenfeld avatar asingh16 avatar bengotow avatar benpankow avatar catherinewu avatar clairelin135 avatar dpeng817 avatar elementl-devtools avatar erinkcochran87 avatar fishmanl avatar freiksenet avatar gibsondan avatar hellendag avatar helloworld avatar jbrambledc avatar jmsanders avatar johannkm avatar mgasner avatar natekupp avatar owenkephart avatar prha avatar rexledesma avatar salazarm avatar schrockn avatar sidkmenon avatar smackesey avatar sryza avatar yuhan avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.