Giter VIP home page Giter VIP logo

sqlflow_public's Introduction

SQLFlow - A tool that tracks column-level data lineage

Track Column-Level Data Lineage for more than 20 major databases including Snowflake, Hive, SparkSQL, Teradata, Oracle, SQL Server, AWS redshift, BigQuery, etc.

Build and visualize lineage from SQL script from query history, ETL script, Github/Bitbucket, Local filesystem and remote databases.

Exploring lineage using an interactive diagram or programmatically using Restful APIs or SDKs.

Discover data lineage in this query:

insert into emp (id,first_name,last_name,city,postal_code,ph)
  select a.id,a.first_name,a.last_name,a.city,a.postal_code,b.ph
  from emp_addr a
  inner join emp_ph b on a.id = b.id;

SQLFlow presents a nice clean graph to you that tells where the data came from, what transformations it underwent along the way, and what other data items are derived from this data value.

SQLFlow Introduce

What SQLFlow can do for you

  • Scan your database and discover the data lineage instantly.
  • Automatically collect SQL script from github/bitbucket or local file system.
  • Provide a nice cleam diagram to the end-user to understand the data lineage quickly.
  • programmatically using Restful APIs or SDKs to get lineage in CSV, JSON, Graphml format.
  • Incorporate the lineage metadata decoded from the complex SQL script into your own metadata database for further processing.
  • Visualize the metadata already existing in your database to release the power of data.
  • Perform impact analysis and root-cause analysis by tracing lineage backwards or forwards with several mouse click.
  • Able to process SQL script from more than 20 major database vendors.

How to use SQLFlow

  • Open the official website of the SQLFlow and paste your SQL script or metadata to get a nice clean lineage diagram.
  • Call the Restful API of the SQLFlow in your own code to get data lineage metadata decoded by the SQLFlow from the SQL script.
  • The on-premise version of SQLflow enables you to use it on your own server to keep the data safer.

Restful APIs

SQLFlow architecture

User manual and FAQ

sqlflow_public's People

Contributors

sqlparser avatar shenhuan2021 avatar cnfree avatar lake2 avatar ktdynamic avatar isfd avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.