Pilot job
In computer science, a pilot job is a type of multilevel scheduling, in which a resource is acquired by an application so that the application can schedule work into that resource directly, rather than going through a local job scheduler, which might lead to queue waits for each work unit. This term comes from the Condor High-Throughput Computing System, in which Condor GlideIns provides this functionality. Another example of pilot jobs is the BigJob implemented in SAGA. Yet another example is Swift Coasters, part of the Swift parallel scripting system.
Pilot jobs are most often used on systems that have queues, as part of their purpose is, in some sense, to avoid multiple waits in these queues. These are most often found in parallel computing systems, but pilot jobs are usually part of a distributed application.
References
- Sfiligoi 2008 J. Phys.: Conf. Ser. 119 062044 doi:10.1088/1742-6596/119/6/062044
- André Luckow, Lukasz Lacinski, Shantenu Jha. SAGA BigJob: An Extensible and Interoperable Pilot-Job Abstraction for Distributed Applications and Systems, 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing, 2010
- Michael Wilde et al., Swift: A language for distributed parallel scripting Parallel Computing (2011), Volume: 37, Issue: May, Publisher: Elsevier B.V., Pages: 633-652, doi: 10.1016/j.parco.2011.05.005