I am running Spark in Microsoft Azure. The cluster size I configured in Azure is -
6 Nodes (2 HEAD + 4 WORKER) 40 CORES.
In my spark submit job, I give -
spark-submit --master=yarn --num-executors=9 $script $task2_outdir $task3_outdir
Question - What is the role of num-executors? How does it work with these worker nodes when an application job is submitted?