What’s a data lakehouse?
At a high level, the data lakehouse describes a data warehouse-like service that runs against a data lake, which sits on top of an object storage service. But how does it work? And is it a viable option for your organization?
This comprehensive five-part eBook on the data lakehouse aims to introduce the concept, describe what is new and different about it, and assess the strengths and weaknesses of data lakehouse architecture as a complement to the data warehouse.
- Part 1 defines the data lakehouse and explains what differentiates it
- Part 2 in this series explores this architecture as an attempt to adapt (or to retrofit) the core requirements of data warehouse architecture to accord with the priorities of cloud-native software design
- Part 3 assesses the viability of the lakehouse for typical data warehouse workloads
- Part 4 explores the use of data modeling with the data lakehouse. It evaluates the claim that the data lakehouse comprises a lightly modeled alternative to the conventional enterprise data warehouse
- Part 5 compares the data lakehouse with the platform-as-a-service (PaaS) data warehouse, which is also based on cloud-native design principles