Starred repositories
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))
Distributed stream processing engine in Rust
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
brpc is an Industrial-grade RPC framework using C++ Language, which is often used in high performance system such as Search, Storage, Machine learning, Advertisement, Recommendation etc. "brpc" mea…
ByConity is an open source cloud data warehouse
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text
An open-source columnar data format designed for fast & realtime analytic with big data.
Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.
Presentations, meetups and talks about ClickHouse
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
OLAP Database Performance Tuning Guide
An efficient storage and compute engine for both on-prem and cloud-native data analytics.
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
CockroachDB — the cloud native, distributed SQL database designed for high availability, effortless scale, and control over data placement.
Pluggable in-process caching engine to build and scale high performance services
PolarDB-X is a cloud native distributed SQL Database designed for high concurrency, massive storage, complex querying scenarios.
Snowflake dataset containing statistics for 70 million queries over 14 day period
Neon: Serverless Postgres. We separated storage and compute to offer autoscaling, code-like database branching, and scale to zero.
cpuworker - A Customized Goroutine Scheduler over Golang Runtime
An open-source, cloud-native, unified time series database for metrics, logs and events, supporting SQL/PromQL/Streaming. Available on GreptimeCloud.
NoSQL data store using the Seastar framework, compatible with Apache Cassandra and Amazon DynamoDB