At the agent or monitored system level, ELM is designed with two different levels of fault tolerance protection.
Caching– When Service Agents are unable to connect to an ELM Server they will cache data until a connection is re-established to maintain data collection of all events configured for monitoring. The cache size can be configured as needed.
Agent Monitor- This monitor item performs regular checks on the ELM Service Agents installed. If the Service Agent fails to respond or responds slowly, actions such as a restart can be taken or a notification triggered so that monitoring can be resumed as quickly as possible.