I’m still new to some of the internals of Airflow. However, if state and logging are tracked in a remote database (like Postgres), then the (in theory), you might be fine after reloading/rebooting the container from a fresh image.
In terms of stopping the existing instance safely though, without interrupting jobs, do you have a time window where you know everything would be complete?