Skew handling in distributed batch
One key has 90% of the rows. The job waits on one executor while the rest idle.
This concept is being written.
The detailed explanation, examples, and diagrams for this topic are not published yet. In the meantime, here is what you can do:
- Browse the rest of the System Design Concepts library.
- Follow the System Design Roadmap for a structured path through these topics.
- Try a practice problem that uses this concept in context.