Things on this page are fragmentary and immature notes/thoughts of the author. Please read with your own judgement!
You can download prebuilt binary Spark at
https://
https://
Clone Spark 2.4.5.
:::bash git clone --depth 1 --branch 2.4.5 https://github.com/apache/spark.gitUse Scala 2.11.
:::bash ./dev/change-scala-version.sh 2.11Run dev/make-distribution.sh
:::bash ./dev/make-distribution.sh \ --name hadoop2.7 \ --tgz \ -Pyarn \ -Phive \ -Phive-thriftserver \ -Phadoop-2.7 \ -Dhadoop.version=2.7.3 \ -Dscala-2.11Build Spark.
:::bash ./build/mvn \ -Pyarn \ -Phive \ -Phive-thriftserver \ -Phadoop-2.7 \ -Dhadoop.version=2.7.3 \ -Dscala-2.11 \ -DskipTests \ clean packageThis will generate a
Spark-2.4.5-*.tgzfile that you use to deploy. Just copy it to your Spark client and unzip it to the right location.If you are building Spark for using in your company, you probably need to replace the directory
confwith the customized one for your company.