Glossary - VertebrateResequencing/vr-pipe GitHub Wiki

This document explains key terminology.

  • Pipeline: an ordered series of Steps that must be carried out to analyse a given dataset in a desired way.
  • Step: An individual step of a pipeline that cannot be subdivided; usually corresponds to providing input to a single piece of analysis software and generating some output (the execution of single command line).
  • Job: A DRM term describing a single request to execute something on the computer cluster. What is executed in a Job typically corresponds to the command line of a Step.
  • DRM: Distributed Resource Management system. A third-party system used for running software across a compute cluster. Examples include LSF and Sun Grid Engine.