|Stable release||9.1.2 / February 2013|
|Operating system||Unix, Linux, Windows|
Platform Load Sharing Facility (or simply LSF) is a workload management platform, job scheduler, for distributed HPC environments. It can be used to execute batch jobs on networked Unix and Windows systems on many different architectures.
In 2007, Platform released Platform Lava, which is a simplified version of LSF based on an old version of LSF release, licensed under GNU General Public License v2. LSF was based on the Utopia research project at the University of Toronto.
LSF Scheduling Policies
- Fair share, preemptive, backfill and SLA scheduling
- High throughput scheduling
- Multicluster scheduling
- Topology-, resource-, and energy-aware scheduling
LSF Addon Products
- IBM Platform Application Center
- Web interfaces for job submission, management and remote visualization.
- IBM Platform RTM
- A real-time dashboard for monitoring global workloads and resource.
- IBM Platform License Scheduler
- License management tool with policy-driven allocation and tracking of software licenses.
- IBM Platform Analytic
- Analytic tool for visualizing and analyzing workload data.
- IBM Platform Process Manager
- An interface for designing complex engineering computational processes
- IBM Platform Session Scheduler
- High-throughput low-latency scheduling solution for LSF environments.
- IBM Platform Dynamic Cluster
- Cloud management solution to change static cluster into dynamic share cloud resources.
LSF Extensions and integrations
LSF Extensions includes some major extensions:
- The Distributed Resource Management Application API handles job management in a range of distributed resource management systems.
- HPC Profile Basic
- This describes how JSDL, Basic Execution Service (BES) and existing web services security mechanisms can be used interoperable to address batch job scheduling use case.
- LSF Perl API
- This comprises two modules, Base and Batch, allowing Platform's LSF APIs to be called by Perl.
- Base module allows Perl applications to link the Load Information Manager (LIM) and Remote Execution Server (RES) daemons for LSF services e.g. retrieving system configuration and dynamic load information for distributed clusters hosts, task placement advice via LIM, and other related functions, thereby improving application performance and resources accessibility.
- Batch module allows Perl applications to retrieve information as well as the submission of information about the hosts, queues, users, jobs and configuration of the batch system.
- SAGA (Simple API for Grid Applications)
- The SAGA C++ Reference Implementation provides an LSF plug-in (adaptor) for its standardized job submission, control and monitoring API. The API is available for C++ and Python.
- Python LSF wrappers
- LSF's API written in C can be easily accessed using Python. Several implementations of LSF Python APIs exist.
- Michael R. Ault, Mike Ault, Madhu Tumma, and Ranko Mosic (2004). Oracle 10g Grid & Real Application Clusters. Rampant TechPress. p. 24. ISBN 9780974435541.
- Goering, Richard (March 8, 1999). "Load sharing brings kudos". EE Times Online. Retrieved 2007-11-12. "LSF ... enables load sharing by distributing jobs to available CPUs in heterogeneous networks ... but don't tell them that; they'll just want to raise their prices"
- Platform Lava
- "Utopia: A Load Sharing Facility for Large, Heterogeneous Distributed Computer Systems". John Wiley & Sons. Retrieved 2007-12-29.
- IBM Closes on Acquisition of Platform Computing
- pylsf on GitHub
- Robert Marcus (2002). Great Global Grid. Trafford Publishing. pp. 258–259. ISBN 9781553698845.
- Grabowski, Tomasz (December 5, 2001). "Platform LSF SetUID Executables Vulnerability". Juniper Networks. Retrieved 2007-11-12.
- Ahmar Abbas (2004). Grid Computing. Charles River Media. pp. 110–117. ISBN 9781584502760.