Shuffle: why it dominates your job runtime

When data crosses the network between executors, you pay for it in serialization, disk I/O, and network transfer time.

This concept is being written.

The detailed explanation, examples, and diagrams for this topic are not published yet. In the meantime, here is what you can do: