Junhao Liu's Projects
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
Apache Arrow DataFusion SQL Query Engine
Apache Arrow DataFusion Comet Spark Accelerator
Apache Beam is a unified programming model for Batch and Streaming data processing.
Ceph is a distributed object, block, and file storage platform
eBPF-based Networking, Security, and Observability
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
Compositional, streaming I/O library for Scala
they see me ringin'
Apache Iceberg
Kubebuilder - SDK for building Kubernetes APIs using CRDs
👩🏿🎓👨🏽🎓👩🏻🎓CNCF Mentoring: LFX Mentorship + Summer of Code
OneTable is an omni-directional converter for table formats that facilitates interoperability across data processing systems and query engines.
Open Control Plane for Tables in Data Lakehouse
Apache Pinot - A realtime distributed OLAP datastore
The official home of the Presto distributed SQL query engine for big data
A cluster computing framework for processing large-scale geospatial data
Test infrastructure for the Kubernetes project.
A C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.
wazero: the zero dependency WebAssembly runtime for Go developers