Installing Spark on MapR

This topic describes how to install Apache Spark on a MapR instance.

You must download the Apache Spark which matches the Spark version of your MapR (MapR Converged Data Platform) cluster. You can get the Spark version information from your installed cluster or from the MapR official website.

To install the Spark 1.5.x or 1.6.x component:

  1. Make sure you have a MapR client on the BDD Shell machine.
  2. Get Spark in one of two ways:
    • Either install MapR's Spark:
      yum install mapr-spark mapr-spark-historyserver
    • Or copy Spark from the MapR cluster:
      scp -r bdduser@abc.us.example.com:/opt/mapr/spark/ /opt/mapr/spark/
  3. Copy MapR's hive-site.xml file:
    scp -r bdduser@abc.us.example.com:/opt/mapr/hive/hive-1.2/conf/hive-site.xml /opt/mapr/spark/spark-1.6.1/conf/