A data pipeline is the set of processes that move data from sources to destinations for analysis and storage. Pipelines may include ingestion, transformation, quality checks, and orchestration, supporting analytics, machine learning, and business reporting.