"Cannot connect to the Docker daemon" errors appear more frequently as more runners we deploy #3828
Open
4 tasks done
Labels
bug
Something isn't working
gha-runner-scale-set
Related to the gha-runner-scale-set mode
needs triage
Requires review from the maintainers
Checks
Controller Version
0.9.3
Deployment Method
ArgoCD
Checks
To Reproduce
3.1. If less than around 200 runners, only 1 or 2 runners fail through the week because of docker.
3.2. If more than around 200 runners, around a third or the runners fail and jobs have to be re-runned.
Describe the bug
Pipeline step complains that docker is not running:
Describe the expected behavior
It should do any of one of these two options:
Additional Context
Controller Logs
Runner name to copy&search on logs: build-s-99pst-runner-bnhsl https://gist.github.com/snavarro-factorial/ee965f37114d0ac4589169012cc098a6
Runner Pod Logs
Export format in csv: https://gist.github.com/snavarro-factorial/796fba24ba5c7f854d3b95f04b636021
The text was updated successfully, but these errors were encountered: