Spark job vs stage vs task in simple terms(with cheat sheet)

Aditya
2 min readSep 20, 2022

When a spark application invoke an action, such as collect() or take() on your DataFrame or Dataset, the action will create a job. Below is the Cheat sheet to remember.

One job results one or more stages

One stage results one or more tasks

one task operates on one partition

So what does executors run ? → Executor(for interview or concept perspective) is one jvm on one physical node(each physical node can contain or or more executors). One executor can…

--

--

Aditya

Principal data engineer → Distributed Threat hunting security platform | aws certified solutions architect | gssp-java | Chicago-IL