gwaffen-mck Goto Github PK
Type: User
Type: User
The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms such as other Hadoop and Apache Spark distributions
Apache Camel is an open source integration framework that empowers you to quickly and easily integrate various systems consuming or producing data.
Apache camel examples
Prometheus instrumentation library for JVM applications
Command Line Interface for Databricks
🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.
Implementations of open source Apache Hadoop/Hive interfaces which allow for ingesting data from Amazon DynamoDB
FHIR Resources https://www.hl7.org/fhir/resourcelist.html
Apache Flink
Fork of tagtraum industries' GCViewer. Tagtraum stopped development in 2008, I aim to improve support for Sun's / Oracle's java 1.6+ garbage collector logs (including G1 collector)
An open-source toolkit for large-scale genomic analysis
Apache Hive
Tool to generate a Hive schema from a JSON example doc
Upserts, Deletes And Incremental Processing on Big Data.
Highly configurable converter from JSON-schema to XML-schema (XSD).
Mirror of Apache Kafka
🔗 A multipurpose Kafka Connect connector that makes it easy to parse, transform and stream any file, in any format, into Apache Kafka
Kafka Connect connector for JDBC-compatible databases
Kafka Connect connector for reading CSV files into Kafka.
A CLI to manage and monitor permissions in AWS Lake Formation
Java library for creating text-based GUIs
💻 A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline!
Sample projects to demonstrate LocalStack Pro features
Apache Parquet
Data validation using Python type hints
Simple Python version management
Confluent Schema Registry for Kafka
Apache Spark - A unified analytics engine for large-scale data processing
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.