A data lake ingests data in its raw, original state, straight from data sources, with little or no cleansing, standardization, remodeling, or transformation. These and other data management best practices can then be applied flexibly as diverse use cases demand.
Most data lakes are built atop Hadoop, which enables a data lake to capture, process, and repurpose a wide range of data types and structures with linear scalability and high availability.
File Type: .pdf
File Size: 474 KB
Duration: 12 pages