TY - JOUR
T1 - Total cost of ownership and evaluation of Google cloud resources for the ATLAS experiment at the LHC
AU - ATLAS Collaboration
AU - Newman, Paul
AU - Allport, Phil
AU - Bellos, Panagiotis
AU - Bracinik, Juraj
AU - Charlton, David
AU - Chisholm, Andrew
AU - Gonella, Laura
AU - Hawkes, Chris
AU - Hillier, Stephen
AU - Krizka, Karol
AU - Lomas, Josh
AU - Marinescu, Mihaela
AU - Neep, Thomas
AU - Nikolopoulos, Konstantinos
AU - Skorda, Eleni
AU - Thomas, Jurgen
AU - Thompson, Paul
AU - Watson, Alan
AU - Watson, Miriam
AU - Wu, Chonghao
N1 - Not yet published as of 08/10/2024
48 pages in total, author list starting page 31, 7 figures, 1 table, submitted to Computing and Software for Big Science. All figures including auxiliary figures are available at https://atlas.web.cern.ch/Atlas/GROUPS/PHYSICS/PAPERS/SOFT-2023-02/
PY - 2024/10/1
Y1 - 2024/10/1
N2 - The ATLAS Google Project was established as part of an ongoing evaluation of the use of commercial clouds by the ATLAS Collaboration, in anticipation of the potential future adoption of such resources by WLCG grid sites to fulfil or complement their computing pledges. Seamless integration of Google cloud resources into the worldwide ATLAS distributed computing infrastructure was achieved at large scale and for an extended period of time, and hence cloud resources are shown to be an effective mechanism to provide additional, flexible computing capacity to ATLAS. For the first time a total cost of ownership analysis has been performed, to identify the dominant cost drivers and explore effective mechanisms for cost control. Network usage significantly impacts the costs of certain ATLAS workflows, underscoring the importance of implementing such mechanisms. Resource bursting has been successfully demonstrated, whilst exposing the true cost of this type of activity. A follow-up to the project is underway to investigate methods for improving the integration of cloud resources in data-intensive distributed computing environments and reducing costs related to network connectivity, which represents the primary expense when extensively utilising cloud resources.
AB - The ATLAS Google Project was established as part of an ongoing evaluation of the use of commercial clouds by the ATLAS Collaboration, in anticipation of the potential future adoption of such resources by WLCG grid sites to fulfil or complement their computing pledges. Seamless integration of Google cloud resources into the worldwide ATLAS distributed computing infrastructure was achieved at large scale and for an extended period of time, and hence cloud resources are shown to be an effective mechanism to provide additional, flexible computing capacity to ATLAS. For the first time a total cost of ownership analysis has been performed, to identify the dominant cost drivers and explore effective mechanisms for cost control. Network usage significantly impacts the costs of certain ATLAS workflows, underscoring the importance of implementing such mechanisms. Resource bursting has been successfully demonstrated, whilst exposing the true cost of this type of activity. A follow-up to the project is underway to investigate methods for improving the integration of cloud resources in data-intensive distributed computing environments and reducing costs related to network connectivity, which represents the primary expense when extensively utilising cloud resources.
KW - cs.DC
KW - hep-ex
UR - https://www.springer.com/physics/particle+and+nuclear+physics/journal/41781
M3 - Article
SN - 2510-2036
JO - Computing and Software for Big Science
JF - Computing and Software for Big Science
ER -