Prerequisite: None
This session will include a moderated Q&A featuring questions from the live audience.
The data lakehouse is the new kid on the block in the world of data architectures. In a nutshell, the data lakehouse is a combination of a data warehouse and a data lake; it’s an architecture developed to support a typical data warehouse workload plus a data lake workload.
A data lakehouse holds structured, semistructured, and unstructured data. Data is stored in files that can be accessed by any type of tool and database server. SQL engines are able to access that data efficiently for more traditional business intelligence workloads, and data scientists can create their descriptive and prescriptive models directly on the data. It makes a lot of sense to combine these two worlds because they are sharing the same data and they are sharing logic.
In this talk, Rick van der Lans will discuss all aspects of data warehouses and data lakes—including data quality, data governance, auditability, performance, immutability, historical data, and data integration—to help you determine if the data lakehouse is a valuable and realistic data architecture for your business.