stream processing systems
Load balancing with partial key grouping
I studied the problem of workload skew also in the context of stream processing systems. I proposed one of the first load balancing techniques to address this problem, which is called partial key grouping. The technique has been incorporated into Apache Storm and became a popular baseline for other load balancing algorithms.
References
2016
-
When Two Choices are not Enough: Balancing at Scale in Distributed Stream ProcessingIn 2016 IEEE 32nd International Conference on Data Engineering (ICDE), 2016
2015
-
The Power of Both Choices: Practical Load Balancing for Distributed Stream Processing EnginesIn IEEE 31st International Conference on Data Engineering (ICDE), 2015