Published January 12, 2024 | Version 1.0
Publication Open

NSF Major Facilities Cloud Use Cases and Considerations

  • 1. ROR icon California Institute of Technology
  • 2. ROR icon National Center for Atmospheric Research
  • 3. ROR icon Indiana University Bloomington
  • 4. ROR icon Internet2
  • 5. ROR icon Woods Hole Oceanographic Institution
  • 6. ROR icon Ocean Observatories Initiative
  • 7. ROR icon University of Southern California
  • 8. ROR icon Oregon State University
  • 9. IceCube Neutrino Observatory
  • 10. ROR icon University of Wisconsin–Madison
  • 11. ROR icon University of North Carolina at Chapel Hill
  • 12. ROR icon Renaissance Computing Institute
  • 13. ROR icon University of Arizona
  • 14. CyVerse
  • 15. ROR icon EarthScope Consortium
  • 16. ROR icon University of Notre Dame

Description

The National Science Foundation (NSF) supports over 25 Major Facilities (MFs) that serve as cornerstones for the science community. These MFs, characterized by their continuous operations, large-scale and sophisticated data collection, and broad user communities, represent long-term investments intended for multi-decade operations. 

Cloud computing presents a wealth of possibilities for these MFs. It offers an extensive range of services, including data storage, archival, processing, and sophisticated data access. Scalability is a distinct advantage of cloud platforms, allowing facilities to transition smoothly from initial development stages to high-demand phases. This shift, however, does not have to be all-or-nothing. MFs can leverage the cloud selectively, optimizing their resources without resorting to an all-in approach.

Case studies, such as those of the National Center for Atmospheric Research (NCAR), IceCube Neutrino Observatory (IceCube), and the combined Seismological Facility for the Advancement of Geoscience (SAGE) and Geodetic Facility for the advancement of Geoscience (GAGE) facilities, operated by the EarthScope Consortium, offer valuable insights. They underline the benefits of the cloud while also highlighting challenges, particularly in comparison to traditional on-premises (on-prem) infrastructure.

When considering cloud adoption, cost emerges as a pivotal factor. Costs are incurred for data storage, egress, compute resources, serverless functions, and application programming interface (API) requests. It is critical for MFs to understand these costs and factor them into operating budgets and planning.

A significant challenge that MFs might encounter is "vendor lock-in." Transitioning between vendors can become costly, primarily because of differing APIs, tools, and services offered by various cloud providers. MFs adopting cloud tools should employ strategies to mitigate vendor lock-in. 

As the cloud becomes more integral to operations, training assumes paramount importance. A workforce adept at navigating cloud technologies will be crucial for MFs to harness the full potential of these platforms.

In essence, cloud computing offers a transformative opportunity for NSF MFs. While it introduces myriad complexities, its advantages are manifold. By viewing cloud adoption as a flexible, case-by-case decision rather than an absolute “all-or-nothing” option, MFs can make informed choices about the level and extent of cloud adoption, which has the potential to significantly amplify their capabilities, enhance research quality, and ensure efficient resource utilization. This report encourages MFs to educate themselves on the fundamentals of cloud computing in order to better understand and leverage the cloud tools and computing model where they best fit MF’s specific needs, optimizing benefits while effectively managing challenges and complexities.

Notes (English)

This project is supported by the National Science Foundation Office of Advanced Cyberinfrastructure in the Directorate for Computer Information Science under Grant #2127548. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the funding agency.

Files

MajorFacilities-Cloud-UseCases-Considerations-2024.pdf

Files (2.2 MB)

Additional details

Funding

CI CoE: CI Compass: An NSF Cyberinfrastructure (CI) Center of Excellence for Navigating the Major Facilities Data Lifecycle 2127548
National Science Foundation