EACR25-1857

Non-invasive bladder cancer detection: identification of a urinary volatile biomarker panel using GC-MS metabolomics and machine learning

A. Carapito1, V. Fernandes Ferreira2, A. Silva Ferreira3, A. Teixeira-Marques4, R. Henrique5, C. Jerónimo6, A. Roque7, F. Carvalho1, J. Pinto1, P. Guedes de Pinho1
1UCIBIO – Applied Molecular Biosciences Unit, Laboratory of Toxicology, Faculty of Pharmacy, University of Porto, Porto, Portugal
2Cork Supply Portugal, S.A., São Paio de Oleiros, Portugal, São Paio de Oleiros, Portugal
3Cork Supply Portugal, S.A., São Paio de Oleiros, Portugal; Centro de Biotecnologia e Química Fina (CBQF), Laboratório Associado, Escola Superior de Biotecnologia, Universidade Católica Portuguesa, Porto, Portugal; Institute for Wine Biotechnology (IWBT), Department of Viticulture and Oenology (DVO), University of Stellenbosch, South Africa, Porto, Portugal
4Cancer Biology and Epigenetics Group, Research Center (CI-IPOP), Porto Comprehensive Cancer Center (P.CCC), Portuguese Oncology Institute of Porto (IPO Porto), Porto, Portugal; Department of Pathology and Molecular Immunology, ICBAS-School of Medicine and Biomedical Sciences, University of Porto, Porto, Portugal, Porto, Portugal
5Cancer Biology and Epigenetics Group, Research Center (CI-IPOP), Porto Comprehensive Cancer Center (P.CCC), Portuguese Oncology Institute of Porto (IPO Porto), Porto, Portugal; Department of Pathology and Molecular Immunology, ICBAS-School of Medicine and Biomedical Sciences, University of Porto, Porto, Portugal; Department of Pathology, Portuguese Oncology Institute of Porto (IPO Porto), Porto.CCC Porto Comprehensive Cancer Center, Porto, Portugal, Porto, Portugal
6Cancer Biology and Epigenetics Group, Research Center (CI-IPOP), Porto Comprehensive Cancer Center (P.CCC), Portuguese Oncology Institute of Porto (IPO Porto), Porto, Portugal; Department of Pathology and Molecular Immunology, ICBAS-School of Medicine and Biomedical Sciences, University of Porto, Porto, Portugal;, Porto, Portugal
7Associate Laboratory i4HB – Institute for Health and Bioeconomy, NOVA School of Science and Technology, NOVA University of Lisbon, Caparica, Portugal; UCIBIO – Applied Molecular Biosciences Unit, Department of Chemistry, NOVA School of Science and Technology, NOVA University of Lisbon, Caparica, Portugal, Caparica/Lisbon, Portugal
Introduction:

Bladder cancer (BC) remains a major public health concern, for which early detection is essential for improving patient prognosis. Emerging metabolomics research has highlighted the potential of urinary volatile organic compounds (VOCs) as non-invasive biomarkers for the early detection of BC. Cancer-related metabolic alterations give rise to distinct VOC profiles, facilitating the differentiation between malignant and non-malignant conditions. Advanced analytical techniques, such as gas chromatography-mass spectrometry (GC-MS), in combination with machine learning (ML) algorithms, offer a powerful approach for biomarker discovery and validation. This study aimed to leverage ML algorithms applied to GC-MS-based metabolomics data to discover a panel of volatile biomarkers able to accurately discriminate BC patients from cancer-free controls.

Material and method:

Urine samples were collected from 177 participants, including 87 patients diagnosed with BC and 90 cancer-free controls. This study was approved by both the Data Protection Officer and the Ethics Committee of the Portuguese Institute of Oncology of Porto (CES 82/022) and the Ethics Committee of the Faculty of Pharmacy of the University of Porto (CEFFUP). Urine sample analysis was conducted using headspace solid-phase microextraction coupled with GC-MS. Five distinct ML algorithms were employed for data analysis: support vector machine (SVM), random forest (RF), partial least squares-discriminant analysis (PLS-DA), extreme gradient boosting (XGBoost), and k-nearest neighbors (k-NN). Model performance was evaluated using receiver operating characteristics (ROC) analysis, with key metrics including area under the curve (AUC), sensitivity, specificity, and accuracy.

Result and discussion:

Among the various ML algorithms tested, the RF model demonstrated the highest performance in distinguishing BC patients from cancer-free controls. A panel of eight potential biomarkers was identified through RF for the overall detection of BC, achieving 89% sensitivity, 92% specificity, 91% accuracy, with an AUC of 0.872. This biomarker panel consisted of three ketones, three aldehydes, one fatty alcohol, and one phenolic compound. Of these eight compounds, seven were found at higher levels in the urine of BC patients compared to controls, while one compound was found at lower levels.

Conclusion:

Our study identified a panel of eight volatile biomarkers that accurately detect BC. These findings underscore the potential of metabolomics, combined with ML techniques in cancer biomarker discovery. Implementation of this urinary biomarker panel in clinical practice may revolutionize BC detection by offering a non-invasive and rapid screening method.