Pyspark working issues


#1

from pyspark.sql import SparkSession

spark = SparkSession.
builder.
master(‘yarn’).
appName(‘Data Frame’).
getOrCreate()

ModuleNotFoundError Traceback (most recent call last)
in ()
----> 1 from pyspark.sql import SparkSession
2
3 spark = SparkSession. builder. master(‘yarn’). appName(‘Data Frame’). getOrCreate()

ModuleNotFoundError: No module named ‘pyspark’


#2

Check this blog post: https://cloudxlab.com/blog/running-pyspark-jupyter-notebook/
It should help.