By using tdwi.org website you agree to our use of cookies as described in our cookie policy. Learn More

Spotlight Session: The Future Is Open — The Rise of the Cloud Data Lake

October 14, 2020

Prerequisite: None

Ryan Murray

OSS Developer

Dremio

The rise of cloud data lake storage (e.g., S3, ADLS) as the default bit bucket in the cloud, combined with the infinite supply and elasticity of cloud compute (e.g., EC2, Azure VMs), has ushered in a new era in data analytics architectures. In this new world, data can be stored and managed in open source file and table formats, such as Apache Parquet and Apache Iceberg, and accessed by best-of-breed elastic compute engines such as Dremio, Databricks, and EMR. As a result, companies can now avoid becoming locked into monolithic systems such as cloud data warehouses and Hadoop distributions, and instead enjoy the flexibility of using the best-of-breed technologies of today and tomorrow. In this session, we explore these secular trends and the building blocks that have come together to enable this new open architecture.

Subscribe to Receive summit updates via email