Monitor pod lifeness
At the moment systemd monitors the PID of the last container in the pod.
When this container exits, then the unit is going to be restarted. When this container stays alive the pod is seen as running, even when any other container fails.
2 issues with that:
- We might not restart if for example the socat container files, which provides connectivity to the pod
- We might restart although the overall pod is healthy, but just one container fails.
What would be the expected behavior?
- Monitor all containers and fail if one exits?
- What else?
- Should we consider other health checks?