Open Source Lakehouse

The recommended data architecture built from free and open source tools is:

  • Data Format: Parquet

  • Storage: Local folders or MinIO (S3-compatible object storage)

  • ETL: Apache Airflow

  • Lakehouse: DuckDB
