Big Data Preparation Cloud Service (BDP)
January 1, 2016
Preparing data for analysis at any scale is a notoriously time-consuming and error-prone process. It is estimated that up to 90% of the time spent on data analysis projects is spent on data preparation. The problem is that data originates from an ever growing number of sources; comes in a wide variety of complex formats; and can span the range from structured, semistructured, and more often unstructured content. In this environment, each data set takes weeks or months of effort to process, frequently requiring programmers writing custom scripts. Accelerating and automating data preparation is the key to unlocking the potential of all your data.
This white paper describes how Oracle Big Data Preparation Cloud Service (BDP) provides a set of coordinated services that automate, streamline, and guide the process of data ingestion, preparation, enrichment, and governance without costly manual intervention. BDP is available in the Oracle Cloud and powered by Apache Spark and Hadoop. It provides a highly intuitive and interactive user experience, guiding business users with a rich set of recommendations, which results in a significant cost advantage in analytical and big data projects by reducing the amount of time and resources required to ingest and prepare data sets for multiple downstream processes. Typically complex operations are made easy; and error-prone setup and configuration are resolved.
In summary, BDP renders the hardest parts of today’s business data ecosystem simple, scalable, and automated via the Oracle Cloud, reducing noise and boosting signal quality that tremendously improves your data for downstream applications.