Published October 27, 2025 | Version v1
Journal article Open

Artificial Intelligence-Powered Raman Spectroscopy through Open Science and FAIR Principles

  • 1. ROR icon Instituto de Catálisis y Petroleoquímica
  • 2. ROR icon Istituto di Fotonica e Nanotecnologie
  • 3. CSIC, Consejo Superior de Investigaciones Cientificas
  • 4. CSIC
  • 5. ROR icon Humboldt-Universität zu Berlin
  • 6. ROR icon King Abdullah University of Science and Technology
  • 7. Ideaconsult Ltd
  • 8. ROR icon Plovdiv University
  • 9. ELODIZ Ltd
  • 10. ROR icon Politecnico di Milano
  • 11. Technische Informationsbibliothek TIB
  • 12. ROR icon National Research Council
  • 13. ROR icon Consejo Superior de Investigaciones Científicas

Description

Raman spectroscopy is a fast-growing and increasingly powerful analytical technique applied across diverse disciplines such as materials science, chemistry, biology and medicine. This growth is driven by advances in Raman instrumentation and greatly supported by the flourishing of chemometrics and artificial intelligence (AI). However, the full potential of this technique is often hampered by challenges related to data acquisition, processing, interpretation, and sharing. This review paper addresses how a concerted effort toward digitalization, incorporating principles of Open Science and FAIR data (Findable, Accessible, Interoperable, and Reusable), is essential to develop and implement robust, standardized, and accessible digital workflows. These workflows are key to unlock the full power of Raman spectroscopy in combination with AI. We explore the current landscape of digital tools and open resources in Raman spectroscopy, highlighting both existing solutions as well as critical gaps. Despite these advances, the field remains fragmented, with many initiatives developed in isolation, limiting interoperability and slowing progress. In this regard, we assess the trends in Raman spectroscopy hardware and control software as well as the role of AI in improving data collection, automating data analysis, extracting meaningful insights, and enabling predictive modeling. We review challenges such as data quality and model interpretability that constrain the effectiveness and applicability of AI in Raman spectroscopy. Furthermore, we emphasize the importance of standardized data formats, metadata schemas, and domain-specific ontologies to ensure machine-actionability, database federation and interoperability as well as to facilitate collaborative research. We provide curated lists of existing open hardware, databases and standards relevant to Raman spectroscopy. Finally, we propose a roadmap toward an open and FAIR ecosystem for Raman spectroscopy, emphasizing the need for sustainable infrastructure, collaborative development, and community involvement.

Files

CocaLopez2025 - artificial-intelligence-powered-raman-spectroscopy-through-open-science-and-fair-principles.pdf