Install Apache Spark 3.5 on Linux (Ubuntu, CentOS)
A walkthrough of installing Apache Spark 3.5 on modern Linux, from prerequisites through a working standalone cluster. Earlier versions are covered in Install Apache Spark 1.0 on Ubuntu 14.04 and Install Apache Spark 2 on Ubuntu 16.04 and macOS.
Prerequisites Requirement Version / Recommendation Java OpenJDK 17 (Spark 3.5 supports 8, 11, 17) Python 3.8+ for PySpark Memory 4 GB minimum, 8 GB+ for comfortable work Storage 10 GB free for install + logs OS Ubuntu 20.04+, CentOS 7+, or equivalent …
Continue Reading
: Install Apache Spark 3.5 on Linux (Ubuntu, CentOS)