Open Raman Library: A Leap for AI in Diagnostics
The Universitat Oberta de Catalunya (UOC) and the Institute of Photonic Sciences (ICFO), led by Marcelo Terán, have announced an open-source Raman spectral library scheduled for release in late 2025. The database will contain high-quality spectra for 140 biomolecules. Raman spectroscopy records molecular vibrations as spectral fingerprints, making it well suited to identify biomolecules without labels or destructive processing. The new library is billed as a public resource to support AI-driven diagnostic research.
Solving the Data Gap in Biomedical AI
One major barrier to clinical adoption of Raman methods has been limited access to standardized, validated spectral references. Research groups often use proprietary or small in-house collections, which impedes reproducibility and model generalization. The UOC and ICFO library addresses that gap by providing consistent, well-documented spectra to train and benchmark machine learning models. Early validation from the team reports high spectral matching accuracy across test samples, indicating the data can support robust classification workflows.
Driving Precision in Disease Detection
With reliable reference spectra, AI models can identify specific biomolecules linked to disease states. Immediate use cases include non-invasive analysis of biofluids and tissue biopsies to detect markers related to cancer and to monitor therapeutic responses. Over time the resource could enable faster, more objective diagnostic reads, reduce observer variability, and support point-of-care tools that classify molecular signatures rather than relying only on morphology.
The Power of Open Science for Future AI Models
An open-access approach invites community contributions to expand coverage, add metadata and standardize data formats. That communal growth will help machine learning teams develop models that distinguish subtle disease states and track treatment effects. By removing data bottlenecks, the UOC-ICFO initiative aims to accelerate reproducible research and practical AI applications in clinical diagnostics.
Questions answered: the library is an open spectral database by UOC and ICFO led by Marcelo Terán; it targets the lack of shared spectral data; it improves identification by supplying validated fingerprints for model training; applications span cancer detection to therapy monitoring; and open access lets the community scale and validate AI models for healthcare.




