Apache Spark is a unified computing engine and a set of libraries for parallel data processing on computer clusters.
Spark can be used from Python, Java, Scala, R, or SQL. Spark itself is written in Scala and runs on the Java Virtual Machine (JVM), so to run Spark, whether on your laptop or on a cluster, all you need is an installation of Java 6 or newer. If you wish to use the Python API, you will also need a Python interpreter (version 2.6 or newer), and if you wish to use R, you will need an R installation on your machine.
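As a quick sketch, the prerequisites above can be checked from a terminal before installing Spark. Java is the only hard requirement; Python and R matter only if you intend to use those APIs.

```shell
# Check which of Spark's language prerequisites are already installed.
# Only Java is strictly required to run Spark itself.
for tool in java python R; do
  if command -v "$tool" >/dev/null 2>&1; then
    echo "$tool: found"
  else
    echo "$tool: not found (only needed for that language's API)"
  fi
done
```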
Downloading Spark for your Hadoop version: you don't need Hadoop to run Spark, but if you have an existing Hadoop cluster or HDFS installation, download the Spark build that matches your Hadoop version.
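A minimal sketch of fetching a matching build from the Apache archive follows. The version numbers here are assumptions for illustration; substitute the Spark release and the Hadoop version of your own cluster.

```shell
# Assemble the file name of a Spark release pre-built for a given Hadoop version.
# These version numbers are placeholders; pick the ones matching your cluster.
SPARK_VERSION=2.4.8
HADOOP_VERSION=2.7
TARBALL="spark-${SPARK_VERSION}-bin-hadoop${HADOOP_VERSION}.tgz"
echo "$TARBALL"
# Uncomment to actually download from the Apache archive:
# wget "https://archive.apache.org/dist/spark/spark-${SPARK_VERSION}/${TARBALL}"
```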
In this book, the worked examples use Spark with Scala version 2.11.12, since Spark is primarily written in Scala, making it Spark's "default" language.
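For readers new to Scala, a hedged sketch of the style the examples will use: Spark's Scala API follows the same map/filter/reduce pattern as ordinary Scala collections, so plain-Scala practice carries over directly. This snippet runs without any Spark dependency; in a real cluster job the List would instead be an RDD or Dataset obtained through a SparkSession.

```scala
// Plain Scala collections mirror the transformation style of Spark's API.
// No Spark installation is needed to run this sketch.
object CollectionStyle {
  def main(args: Array[String]): Unit = {
    val nums = List(1, 2, 3, 4, 5)
    // filter keeps the even numbers, map squares each one
    val squaresOfEvens = nums.filter(_ % 2 == 0).map(n => n * n)
    println(squaresOfEvens)     // List(4, 16)
    // reduce folds the list down to a single sum
    println(nums.reduce(_ + _)) // 15
  }
}
```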
Suripeddi Koundinya
Suripeddi Koundinya is a data analyst, blogger, financial advisor, and psychologist. He holds master's degrees in Biotechnology, Astrology, and Psychology. He has authored books on data science topics including Hive, Scala, SQL, Big Data, and Sqoop, and writes both fiction and non-fiction.