A Gentle Introduction to Apache Spark
January 1, 2018
Learn how to get started with Apache Spark
Apache Spark’s ability to speed analytics applications by orders of magnitude, its versatility, and its ease of use are quickly winning the market. Due to Spark’s appeal to developers, end users, and integrators to solve complex data problems at scale, it is now the most active open source project with the big data community.
With rapid adoption by enterprises across a wide range of industries, Spark has been deployed at massive scale, collectively processing multiple petabytes of data on clusters of over 8,000 nodes. If you are a developer or data scientist interested in big data, learn how Spark may be the tool for you. Databricks is happy to present this e-book as a practical introduction to Spark.
Download this e-book to learn:
- Spark’s basic architecture
- Why Spark is a popular choice for data analytics
- What tools and features are available
- How to get started right away through interactive sample code