Documentation | QuickStart | Development
Fluss is a streaming storage built for real-time analytics which can serve as the real-time data layer for Lakehouse architectures.
It bridges the gap between data streaming and data Lakehouse by enabling low-latency, high-throughput data ingestion and processing while seamlessly integrating with popular compute engines like Apache Flink, while Apache Spark, and StarRocks are coming soon.
Fluss (German: river, pronounced /flus/
) enables streaming data continuously converging, distributing and flowing into lakes, like a river 🌊
- Sub-Second Latency: Low-latency streaming reads/writes optimized for real-time applications with Apache Flink.
- Columnar Stream: 10x improvement in streaming read performance with efficient pushdown projections.
- Streaming & Lakehouse Unification: Unified data streaming and Lakehouse with low latencies for powerful analytics.
- Real-Time Updates: Cost-efficient partial updates for large-scale data without expensive join operations.
- Changelog Generation: Complete changelogs for streaming processors, streamlining analytics workflows.
- Lookup Queries: Ultra-high QPS for primary key lookups, enabling efficient dimension table serving.
Prerequisites for building Fluss:
- Unix-like environment (we use Linux, Mac OS X, Cygwin, WSL)
- Git
- Maven (we require version >= 3.8.6)
- Java 8 or 11
git clone https://github.com/alibaba/fluss.git
cd fluss
./mvnw clean package -DskipTests
Fluss is now installed in build-target
. The build command uses Maven Wrapper (mvnw
) which ensures the correct Maven version is used.
Fluss is open-source, and we’d love your help to keep it growing! Join the discussions, open issues if you find a bug or request features, contribute code and documentation, or help us improve the project in any way. All contributions are welcome!
Fluss project is licensed under the Apache License 2.0.