awesomedatatool
Type: Organization
DBus
A distributed in-memory NoSQL system based on the TARS framework that supports the LRU algorithm and persists data to a back-end database. Users can easily deploy, publish, and scale services through the web interface.
Change data capture for a variety of databases. https://debezium.io Please log issues in our JIRA at https://issues.jboss.org/projects/DBZ/issues
Easy to use .NET library for data and time series manipulation and for scientific programming
DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Fast, Distributed Graph DB
A Map/Reduce framework for distributed computing
A distributed task scheduler for Dask
A high-performance replicated log service. (Development has moved to the Apache Incubator.)
Dinky is an out-of-the-box, one-stop real-time computing platform dedicated to building and practicing Unified Batch & Streaming and Unified Data Lake & Data Warehouse architectures. Based on Apache Flink, Dinky can connect to many big data frameworks, including OLAP and Data Lake systems.
Python clone of Spark, a MapReduce-like framework in Python
Embeddable, replicated and fault tolerant SQL engine.
Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
Dremio - the missing link in modern data
Apache Drill
Apache Druid: a high performance real-time analytics database.
DuckDB is an in-process SQL OLAP Database Management System
A generic dynamo implementation for different k-v storage engines
The next generation relational database.
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Elassandra = Elasticsearch + Apache Cassandra
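ElasticCTR is a Kubernetes-based, enterprise-grade recommendation system solution. It combines CTR models that have been repeatedly validated and refined in Baidu's business scenarios, large-scale distributed training built on the PaddlePaddle framework, and an industrial-grade sparse-parameter serving component. It helps users deploy a recommendation system architecture on Kubernetes in one step and quickly build and validate CTR model training and prediction, offering high performance, industrial-grade deployment, an end-to-end experience, and support for in-depth secondary development.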
Open Source, Distributed, RESTful Search Engine
:elephant: Elasticsearch real-time search and analytics natively integrated with Hadoop
Monitoring and Management Web Application for ElasticSearch instances and clusters.
Embulk: Pluggable Bulk Data Loader.
Fast and light Redis C client library built over Hiredis, thread-safe, write replication, auto-reconnect, sync pool, async libev.
A lightweight ETL (extract, transform, load) library and data integration toolbox for .NET.
An open source event analytics platform