Dremio
Free tier availableLakehouse platform with SQL query engine
📖 Overview
Dremio is a lakehouse platform that provides a fast SQL query engine directly on your data lake. Built on Apache Arrow, it offers excellent performance without requiring data movement. Strong focus on Apache Iceberg and open formats.
✨ Key Features
- ✓ Arrow-based Engine: Fast columnar execution
- ✓ Data Reflections: Automatic query acceleration
- ✓ Iceberg Native: First-class Iceberg support
- ✓ Semantic Layer: Curated datasets and views
- ✓ Federation: Query across sources
- ✓ Nessie Integration: Git-like data versioning
💰 Pricing
Model
open source
Starting Price
$0
Cloud/Pro
Pay-as-you-go
✓ Free tier available
🏢 Enterprise plans available
👍 Pros
- + Query data lake directly (no ETL)
- + Excellent Arrow Flight performance
- + Strong open-source commitment
- + Good Iceberg integration
- + Self-service data access
👎 Cons
- − Less mature than Databricks/Snowflake
- − Cloud offering still growing
- − Can be complex to tune
- − Smaller ecosystem
🎯 Best For
Organizations wanting to query data lakes directly without moving data to a warehouse. Ideal for Iceberg-centric architectures.
🔗 Works With
📁 More Data Lakes Tools
Apache Hudi
Delta Lake
Open-source storage layer bringing reliability to data lakes
Apache Iceberg
Open table format for huge analytic datasets
Onehouse