Overview
Amundsen is an open-source data discovery platform originally built at Lyft. It helps data teams find, understand, and trust data through search, metadata management, and usage-based ranking. Named after the explorer who first reached the South Pole.
Key Features
- ✓ Search Interface: Find tables, dashboards, and people
- ✓ Table Detail Pages: Schema, stats, owners, descriptions
- ✓ Column-level Metadata: Descriptions, badges, stats
- ✓ Lineage: Basic data lineage support
- ✓ Usage Ranking: Popular tables surfaced first
- ✓ Programmatic Descriptions: Auto-generated from queries
Pros
- 👍 True open-source with active community
- 👍 Production-proven at Lyft scale
- 👍 Good search experience
- 👍 Usage-based relevance ranking
- 👍 Simpler than DataHub
Cons
- 👎 Less feature-rich than commercial tools
- 👎 Requires engineering to deploy
- 👎 Limited governance features
- 👎 Smaller ecosystem than competitors
- 👎 Development pace slowed
Best For
Organizations wanting open-source data discovery without DataHub's complexity. Good starting point for data cataloging.
Founded: 2019 HQ: Lyft (open-sourced)