Explains a few Spark buzzwords: External Shuffle Service (ESS) and Remote Shuffle Service (RSS), along with a few existing ESS and RSS solutions for cloud use cases.
Stability and resilience are always the hardest goals to achieve in a distributed system, but also the most important. In this post, we’ll see how to leverage some chaos monkey tools to uncover weaknesses.
Golang tests run fine locally but become flaky in the CI/CD pipeline? This post introduces a few possible causes and helps you improve the stability of your unit tests.
A tutorial on how to set up a local k8s cluster on macOS for development.
This post introduces the Kubernetes scheduler from a newbie’s perspective.
More posts here …