Checkpoint
Checkpointing is a technique that provides fault tolerance for computing systems. It consists of:
- saving a snapshot of the application's state
- restarting the application from that snapshot in case of failure, instead of from the beginning
It is mainly useful for long-running applications that execute on failure-prone computing systems, where restarting from scratch would waste the work already done.
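
The two steps above can be sketched in a few lines of Python. This is only an illustrative sketch, not a reference implementation: the function names (`save_checkpoint`, `load_checkpoint`, `run`), the JSON file format, and the `fail_at` parameter used to simulate a crash are all assumptions made for the example. The snapshot is written to a temporary file and then renamed, so a failure during the save does not corrupt the previous checkpoint.

```python
import json
import os
import tempfile


def save_checkpoint(path, state):
    """Save a snapshot of the state (hypothetical helper).

    The data goes to a temporary file first and is then renamed over
    the target, so a crash mid-write leaves the old checkpoint intact.
    """
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    with os.fdopen(fd, "w") as f:
        json.dump(state, f)
        f.flush()
        os.fsync(f.fileno())
    os.replace(tmp_path, path)


def load_checkpoint(path, default):
    """Return the last saved state, or `default` if no checkpoint exists."""
    try:
        with open(path) as f:
            return json.load(f)
    except FileNotFoundError:
        return default


def run(path, total_steps, fail_at=None):
    """Toy long-running job: sums the step indices 0..total_steps-1.

    `fail_at` is an assumption of this sketch; it simulates a failure
    just before the given step is processed.
    """
    # Restart from the last snapshot if one exists.
    state = load_checkpoint(path, {"step": 0, "total": 0})
    while state["step"] < total_steps:
        if fail_at is not None and state["step"] == fail_at:
            raise RuntimeError("simulated failure")
        state["total"] += state["step"]
        state["step"] += 1
        # Checkpoint after every completed step.
        save_checkpoint(path, state)
    return state["total"]
```

Calling `run("job.ckpt", 10, fail_at=5)` raises after five steps have been checkpointed; calling `run("job.ckpt", 10)` afterwards resumes at step 5 rather than step 0. Checkpointing after every step keeps the example simple; a real application would usually checkpoint less often to balance the cost of saving against the amount of work lost on failure.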