Checkpoint - WYnnTheDays/Arvid GitHub Wiki

Checkpoint

is a technique that provides fault tolerance for copmuting systems consists of

  1. saving a snapshot of the application's state
  2. applications can restart from that point in case of failure

the long running applications that are executed in the failure-prone computing systems