Suttipong Kullawattana
How to set up PySpark with Jupyter Notebook for Data Engineers

This post summarizes, step by step, how I set up PySpark with Jupyter Notebook for building ETL pipelines.

First step: install Python 3.9.1, which PySpark will run on.

Second step: install Scala with Homebrew: $ brew install scala
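
Before moving on, it can help to confirm that the first two steps worked. The snippet below is a minimal sketch that uses only the Python standard library. The function name `check_environment` and the minimum version `(3, 9)` are my own choices for illustration. It also looks for `java`, which this post does not install explicitly, on the assumption that Spark needs a JVM to run.

```python
import importlib.util
import shutil
import sys


def check_environment(min_python=(3, 9)):
    """Collect basic facts about the local setup without starting Spark."""
    return {
        # Version of the interpreter running this cell, e.g. "3.9.1"
        "python_version": sys.version.split()[0],
        # True when the interpreter is at least min_python (major, minor)
        "python_ok": sys.version_info[:2] >= min_python,
        # True when the executable can be found on PATH
        "java_on_path": shutil.which("java") is not None,
        "scala_on_path": shutil.which("scala") is not None,
        # True when `import pyspark` would find the package
        "pyspark_importable": importlib.util.find_spec("pyspark") is not None,
    }


for name, value in check_environment().items():
    print(f"{name}: {value}")
```

If any line prints `False`, that part of the setup is probably the first thing to revisit.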

Third step: start the PySpark shell with $ pyspark
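
The `pyspark` shell creates a session object named `spark` for you. In a plain Jupyter kernel that object does not exist yet, so one way to get it is to build the session yourself. The following is a sketch under that assumption: the helper names `local_spark_settings` and `get_spark`, the app name, and the setting values are illustrative, not something the original setup prescribes.

```python
def local_spark_settings(app_name="etl-notebook"):
    """Settings for a single-machine session; the values are illustrative."""
    return {
        "spark.master": "local[*]",           # run locally on all available cores
        "spark.app.name": app_name,           # name shown in the Spark UI
        "spark.sql.shuffle.partitions": "4",  # small value suited to small local data
    }


def get_spark(app_name="etl-notebook"):
    """Create (or reuse) a local SparkSession. Requires pyspark and Java."""
    # Imported here so that local_spark_settings() works even without pyspark.
    from pyspark.sql import SparkSession

    builder = SparkSession.builder
    for key, value in local_spark_settings(app_name).items():
        builder = builder.config(key, value)
    return builder.getOrCreate()
```

In a notebook cell, `spark = get_spark()` followed by `spark.version` should show the installed Spark version once the session is up.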

Fourth step: create a DataFrame and run it.
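
The original screenshot for this step is not reproduced here, so the example below is a stand-in rather than the exact code I ran. It assumes a `SparkSession` named `spark` already exists (the `pyspark` shell provides one). The sample rows, column names, and the function `average_salary_by_department` are made up for illustration.

```python
# Made-up sample data standing in for a real source table.
SAMPLE_ROWS = [
    ("alice", "engineering", 5200),
    ("bob", "engineering", 4100),
    ("carol", "finance", 3900),
    ("dave", "finance", 2500),
]
COLUMNS = ["name", "department", "salary"]


def average_salary_by_department(spark, min_salary=3000):
    """Tiny extract/transform example; returns a DataFrame, does not print it."""
    from pyspark.sql import functions as F

    # Extract: build a DataFrame from in-memory rows.
    df = spark.createDataFrame(SAMPLE_ROWS, schema=COLUMNS)

    # Transform: keep rows at or above the threshold, then aggregate.
    filtered = df.filter(F.col("salary") >= min_salary)
    return (
        filtered.groupBy("department")
        .agg(F.avg("salary").alias("avg_salary"))
        .orderBy("department")
    )
```

Calling `average_salary_by_department(spark).show()` triggers the computation and prints the result, since Spark only evaluates a DataFrame when an action such as `show()` is called.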

References: apache-spark; Getting started with mongodb, pyspark and jupyter-notebook; How to install pyspark on mac
