Vaidyanathan Srinivasan <svaidy at linux.vnet.ibm.com> writes:

> There should be a restart limit.  If we actually crash starting
> opal-prd, then we will get stuck infinitely trying to start it.
> This could happen if there are events pending and actually the event
> action is crashing prd.  We will keep getting the same attention and
> not make any progress.
> Does systemd unit file allow a limited restart attempts?

By default systemd rate-limits how often a unit may be started, via the
StartLimitBurst and StartLimitIntervalSec options, which default to
5 starts per 10 seconds.

Also, the default RestartSec interval is 100ms. So during an error
spike with the default RestartSec, opal-prd may be restarted too quickly,
hit the rate limit, and then miss the reported prd errors.

So I would suggest RestartSec=1 and StartLimitIntervalSec=5 so that
opal-prd is restarted at about 1 second intervals without any rate
limiting.
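
Roughly, the unit file fragment I have in mind would look like the sketch
below. Restart=on-failure is only my assumption about how the unit is set
up to restart, and StartLimitBurst=5 just spells out the systemd default.
On recent systemd versions StartLimitIntervalSec goes in the [Unit]
section; older versions used StartLimitInterval in [Service] instead.

```ini
[Unit]
# Count start attempts over a 5 second window.
StartLimitIntervalSec=5
# Systemd default, written out for clarity: at most 5 starts per window.
StartLimitBurst=5

[Service]
# Assumption: restart only when opal-prd exits abnormally.
Restart=on-failure
# Wait 1 second between restarts instead of the default 100ms.
RestartSec=1
```

With a 1 second delay between restarts, the sixth start should fall
outside the 5 second window, so the start limit would not be reached.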

