Worldwide LHC Computing Grid
The Worldwide LHC Computing Grid (WLCG), formerly (until 2006) the LHC Computing Grid (LCG), is an international collaborative project that consists of a grid-based computer network infrastructure incorporating over 170 computing centers in 36 countries, as of 2012. It was designed by CERN to handle the prodigious volume of data produced by Large Hadron Collider (LHC) experiments.
By 2012, data from over 300 trillion (3 x 1014) LHC proton-proton collisions had been analyzed, and LHC collision data was being produced at approximately 25 petabytes per year. As of 2012, The LHC Computing Grid had become the world's largest computing grid comprising over 170 computing facilities in a worldwide network across 36 countries.
The Large Hadron Collider at CERN was designed to prove or disprove the existence of the Higgs boson, an important but elusive piece of knowledge that had been sought by particle physicists for over 40 years. A very powerful particle accelerator was needed, because Higgs bosons might not be seen in lower energy experiments, and because vast numbers of collisions would need to be studied. Such a collider would also produce unprecedented quantities of collision data requiring analysis. Therefore advanced computing facilities were needed to process the data.
A design report was published in 2005. It was announced to be ready for data on 3 October 2008. A popular 2008 press article predicted "the internet could soon be made obsolete" by its technology. CERN had to publish its own articles trying to clear up the confusion. It incorporates both private fiber optic cable links and existing high-speed portions of the public Internet. At the end of 2010, the Grid consisted of some 200,000 processing cores and 150 petabytes of disk space, distributed across 34 countries.
The data stream from the detectors provides approximately 300 GByte/s of data, which after filtering for "interesting events", results in a "raw data" stream of about 300 MByte/s. The CERN computer center, considered "Tier 0" of the LHC Computing Grid, has a dedicated 10 Gbit/s connection to the counting room.
The project was expected to generate 27 TB of raw data per day, plus 10 TB of “event summary data”, which represents the output of calculations done by the CPU farm at the CERN data center. This data is sent out from CERN to eleven Tier 1 academic institutions in Europe, Asia, and North America, via dedicated 10 Gbit/s links. This is called the LHC Optical Private Network. More than 150 Tier 2 institutions are connected to the Tier 1 institutions by general-purpose national research and education networks. The data produced by the LHC on all of its distributed computing grid is expected to add up to 10–15 PB of data each year. In total, the four main detectors at the LHC produced 13 petabytes of data in 2010.
The Tier 1 institutions receive specific subsets of the raw data, for which they serve as a backup repository for CERN. They also perform reprocessing when recalibration is necessary. The primary configuration for the computers used in the grid is based on Scientific Linux.
- Hayes, Jacqui (21 December 2011). "Happy 10th Birthday, WLCG!". International Grid Science This Week. Retrieved 2012-12-20.
- What is the Worldwide LHC Computing Grid?, CERN, January 2011, retrieved 2012-01-11
- Welcome, CERN, January 2011, retrieved 2012-01-11
- Hunt for Higgs boson hits key decision point
- Worldwide LHC Computing Grid main page 14 November 2012: "[A] global collaboration of more than 170 computing centres in 36 countries ... to store, distribute and analyse the ~25 Petabytes (25 million Gigabytes) of data annually generated by the Large Hadron Collider"
- What is the Worldwide LHC Computing Grid? (Public 'About' page) 14 November 2012: "Currently WLCG is made up of more than 170 computing centers in 36 countries...The WLCG is now the world's largest computing grid"
- "LHC Computing Grid: Technical Design Report". document LCG-TDR-001, CERN-LHCC-2005-024 (The LCG TDR Editorial Board). 20 June 2005. ISBN 92-9083-253-3. Retrieved 2 October 2011.
- "LHC GridFest". CERN. 2008.
- Jonathan Leake (6 April 2008). "Coming soon: superfast internet". The Times (London). Retrieved 25 January 2013.
- "The Grid: separating fact from fiction". CERN. May 2008. Retrieved 25 January 2013. Adapted from an article originally published in Symmetry Breaking.
- Geoff Brumfiel (19 January 2011). "High-energy physics: Down the petabyte highway". Nature 469. pp. 282–283. doi:10.1038/469282a. Retrieved 2 October 2011.
- "Network transfer architecture". CERN. Retrieved 2 October 2011.
- final-draft-4-key[dead link]
- Brodkin, Jon (28 April 2008). "Parallel Internet: Inside the Worldwide LHC computing grid". Techworld.com.