Practical Apache Spark: Using the Scala API

Practical Apache Spark

eBook Details:

  • Paperback: 296 pages
  • Publisher: WOW! eBook; 1st edition (December 13, 2018)
  • Language: English
  • ISBN-10: 1484236513
  • ISBN-13: 978-1484236512

eBook Description:

Practical Apache Spark: Using the Scala API

Work with Apache Spark using Scala to deploy and set up single-node, multi-node, and high-availability clusters. This book discusses various components of Spark such as Spark Core, DataFrames, Datasets and SQL, Spark Streaming, Spark MLib, and R on Spark with the help of practical code snippets for each topic. Practical Apache Spark also covers the integration of Apache Spark with Kafka with examples. You’ll follow a learn-to-do-by-yourself approach to learning – learn the concepts, practice the code snippets in Scala, and complete the assignments given to get an overall exposure.

What You Will Learn

  • Discover the functional programming features of Scala
  • Understand the complete architecture of Spark and its components
  • Integrate Apache Spark with Hive and Kafka
  • Use Spark SQL, DataFrames, and Datasets to process data using traditional SQL queries
  • Work with different machine learning concepts and libraries using Spark’s MLlib packages

On completion, you’ll have knowledge of the functional programming aspects of Scala, and hands-on expertise in various Spark components. You’ll also become familiar with machine learning algorithms with real-time usage.


Leave a Reply

Your email address will not be published. Required fields are marked *