Albert Franzi's Projects

airbyte

Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.

awesome-spark

A curated list of awesome Apache Spark packages and resources.

aws-glue-data-catalog-client-for-apache-hive-metastore

The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore-compatible metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-the-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms, such as other Hadoop and Apache Spark distributions.

datahub

A Generalized Metadata Search & Discovery Tool

datahub-helm

Repository of Helm charts for deploying DataHub on a Kubernetes cluster

dbt-core

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

elementary

Open-source data observability for analytics engineers.

json-schema

JSON Schema validator for Java, based on the org.json API

prefect

The easiest way to automate your data

presto

Distributed SQL query engine for big data

quinn

PySpark methods to enhance developer productivity 📣 👯 🎉

redis-poc

Notification system PoC using Redis sorted sets (ZSETs), with each notification's send time as its score

rudderstack-helm

Open-source, warehouse-first Customer Data Pipeline and Segment-alternative. Collects and routes clickstream data and builds your customer data lake on your data warehouse.

spark-daria

Essential Spark extensions and helper methods ✨😲
