Topic: dataquality Goto Github
Some thing interesting about dataquality
Some thing interesting about dataquality
dataquality,Datailot-cli is the command line interface for accessing the AI teammate for engineers to ensure best practices in their SQL and dbt projects.
Organization: altimateai
Home Page: https://datapilot.readthedocs.io/en/latest/
dataquality,OLAP in TSQL and Python
User: ammarsahyoun
dataquality,data and pipeline testing with and for SQL
User: andrjas
Home Page: https://andrjas.github.io/data_check/
dataquality,The core library of osDQ
Organization: arrahtech
Home Page: http://arrahtech.com/
dataquality,CSV Data Validator is a tool to validate csv file. It parse csv and validate the data with .hdr(csv meta data) before ingestion to Data Lake. It checks data file availability for every day load and validate data with respective meta data like File Size, Checksum, Delimiter, Record count etc. It ensure landed data conformity before give go ahead for data ingestion to Data Lake. It generate complete stats or error log.
User: ashishkr007
dataquality,Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.
User: autoviml
dataquality,Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Organization: awslabs
dataquality,Tutorial and examples of Data Quality in Big Data System
User: bikash
dataquality,BirdiDQ leverages the power of the Python Great Expectations open-source library and combines it with the simplicity of natural language queries to effortlessly identify and report data quality issues, all at the tip of your fingers.
User: birdid
dataquality,Possibly the fastest DataFrame-agnostic quality check library in town.
User: canimus
dataquality,ML powered analytics engine for outlier detection and root cause analysis.
Organization: chaos-genius
Home Page: https://www.chaosgenius.io
dataquality,The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Organization: cleanlab
Home Page: https://cleanlab.ai
dataquality,Data analysis of Axiell thesauri RCE (B&AC and KC)
Organization: cultureelerfgoed
dataquality,MongoDB connector for Data Culpa - monitor data quality automatically with Data Culpa Validator
Organization: data-culpa
Home Page: https://www.dataculpa.com/
dataquality,Snowflake connectors for Data Culpa - monitor data quality automatically with Data Culpa Validator
Organization: data-culpa
Home Page: https://www.dataculpa.com/
dataquality,Open source clients for working with Data Culpa Validator services from data pipelines
Organization: data-culpa
dataquality,The premier open source Data Quality solution
Organization: datacleaner
dataquality,Compare tables within or across databases
Organization: datafold
Home Page: https://docs.datafold.com
dataquality,🕵️♀️ Enrich Metadata Features with AI Insights through Data Inspection
Organization: datateratechnology
dataquality,Know your data better!Datavines is Next-gen Data Observability Platform, support metadata manage and data quality.
Organization: datavane
Home Page: https://datavane.github.io/datavines-website/
dataquality,Always know what to expect from your data.
Organization: great-expectations
Home Page: https://docs.greatexpectations.io/
dataquality,Run greatexpectations.io on ANY SQL Engine using REST API. Supported by FastAPI, Pydantic and SQLAlchemy as best data quality tool
User: grillazz
dataquality,Huemul BigDataGovernance, es una framework que trabaja sobre Spark, Hive y HDFS. Permite la implementación de una estrategia corporativa de dato único, basada en buenas prácticas de Gobierno de Datos. Permite implementar tablas con control de Primary Key y Foreing Key al insertar y actualizar datos utilizando la librería, Validación de nulos, largos de textos, máximos/mínimos de números y fechas, valores únicos y valores por default. También permite clasificar los campos en aplicabilidad de derechos ARCO para facilitar la implementación de leyes de protección de datos tipo GDPR, identificar los niveles de seguridad y si se está aplicando algún tipo de encriptación. Adicionalmente permite agregar reglas de validación más complejas sobre la misma tabla.
Organization: huemulsolutions
dataquality,Library for Semi-Automated Data Science
Organization: ibm
Home Page: https://lale.readthedocs.io
dataquality,A Practical Approach for Population Data Quality Assessment
Organization: icescentral
dataquality,Demonstration of Data Quality with dbt
Organization: infinitelambda
Home Page: https://infinitelambda-data-quality-score.streamlit.app/
dataquality,Make simple storing test results and visualisation of these in a BI dashboard
Organization: infinitelambda
Home Page: https://infinitelambda.github.io/dq-tools/
dataquality,Data Quality Framework provides by Jabar Digital Service
Organization: jabardigitalservice
Home Page: https://pypi.org/project/DataSae/
dataquality,Luzzu Quality Assessment Framework
Organization: luzzu
dataquality,Udacity Data Engineering Capstone Project. The purpose of this project is to establish a single source of truth database around I94 US immigration data considering basic immigration profiles, purpose of travel, visa status and weather impacts.
User: marcus-repo
dataquality,SQL based data profiling & data quality checks, which will help you to perform data profiling & data quality checks on SQL database at table & database level.
User: martandsingh
dataquality,数据质量校验集成调度系统 告警模版
User: miyazakihayao
dataquality,Amora Data Build Tool enables analysts and engineers to transform data on the data warehouse (BigQuery) by writing Amora Models that describe the data schema using Python's "PEP484 - Type Hints" and select statements with SQLAlchemy. Amora is able to transform Python code into SQL data transformation jobs that run inside the warehouse.
Organization: mundipagg
Home Page: https://mundipagg.github.io/amora-data-build-tool
dataquality,Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
Organization: open-metadata
Home Page: https://open-metadata.org
dataquality,Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
Organization: open-metadata
Home Page: https://open-metadata.org
dataquality,🦆 Blazing Fast and highly customizable Github Action to setup a DuckDb runtime
Organization: opt-nc
Home Page: https://dev.to/optnc/effortless-data-quality-wduckdb-on-github-2mkb
dataquality, Frontend for the osmcha-django REST API
Organization: osmcha
Home Page: https://osmcha.org
dataquality,Codes&Datasets
User: qizhixinhit
dataquality,re_data - fix data issues before your users & CEO would discover them 😊
Organization: re-data
Home Page: https://docs.getre.io/latest/docs/start_here
dataquality,Quality Aware Feature Store
User: rodrigobaron
Home Page: https://pypi.org/project/qafs/
dataquality,The main code repository of Referencing Quality Scoring System metrics
User: seyedahbr
dataquality,:zap: Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
Organization: sodadata
Home Page: https://go.soda.io/core-docs
dataquality,:zap: Prevent downstream data quality issues by integrating the Soda Library into your CI/CD pipeline.
Organization: sodadata
Home Page: https://www.soda.io/
dataquality,ADVICE: Save yourself and colleagues loads of time, by taking a few suggestions into account before using Excel.
User: steltenpower
dataquality,IDEA: Your list of files is actually a network of data to cooperate through, which the UI should reflect …
User: steltenpower
dataquality,Open Source Data Quality Monitoring.
Organization: waterdipai
Home Page: https://datachecks.io
dataquality,A real business case aiming for optimisation of stock management. Cleanse - Calculate - Visualize - Drive insights based on current dataset. Recommend new KPIs and metrics for operations.
User: xfzhang-zoey
dataquality,Make your dataset talk to you. The AI assistant for data preparation.
Organization: ydataai
Home Page: https://datacentricai.community
dataquality,Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Organization: zinggai
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.