An end-to-end streaming data pipeline: Kafka ingests events, Flink processes them in real-time into Iceberg tables, Trino provides SQL analytics, and Superset visualizes the results. This builds the ...
This repository aims to introduce the Data Lakehouse pattern as a suitable and flexible solution to transit small companies to established enterprises, allowing to implement a local data lakehouse ...