RESEARCH & RESOURCES

BPR Data Lakes cover

Best Practices Report | Data Lakes: Purposes, Practices, Patterns, and Platforms

March 29, 2017

When designed well, a data lake is an effective data-driven design pattern for capturing a wide range of data types, both old and new, at large scale. By definition, a data lake is optimized for the quick ingestion of raw, detailed source data plus on-the-fly processing of such data for exploration, analytics, and operations. Even so, traditional, latent data practices are possible, too.

Organizations are adopting the data lake design pattern (whether on Hadoop or a relational database) because lakes provision the kind of raw data that users need for data exploration and discovery-oriented forms of advanced analytics. A data lake can also be a consolidation point for both new and traditional data, thereby enabling analytics correlations across all data. 

To help users prepare, this TDWI Best Practices Report defines data lake types, then discusses their emerging best practices, enabling technologies, and real-world use cases. The report’s survey quantifies users’ trends and readiness for data lakes, and the report’s user stories document real-world activities.


Your e-mail address is used to communicate with you about your registration, related products and services, and offers from select vendors. Refer to our Privacy Policy for additional information.

TDWI Membership

Get immediate access to training discounts, video library, research, and more.

Find the right level of Membership for you.