Unsupervised Neural Networks for Breast Cancer Clustering: A Comparative Study of RBMs and SOMs with Interpretability Metrics

Mekki Soundes, Labdaoui Ahlam

Abstract


This study presents a comparative analysis of two unsupervised neural network models, Restricted Boltzmann Machines (RBMs) and Self-Organizing Maps (SOMs), applied to breast cancer data clustering. The primary objective is to benchmark these models on latent feature extraction, clustering accuracy, and interpretability in a medical diagnostic context. Using a preprocessed breast cancer dataset comprising 569 patient records and 30 clinical features, the models were trained and evaluated with two internal clustering metrics: the Silhouette Score and the Davies-Bouldin Index (DBI). The proposed methodology, implemented in Python, emphasizes reproducibility and diagnostic relevance. RBMs achieved a Silhouette Score of 0.88 and a DBI of 0.52, indicating compact and well-separated clusters, while SOMs performed markedly worse, with a Silhouette Score of 0.34 and a DBI of 1.47. Classification performance, obtained by mapping clusters to diagnostic labels, shows RBMs yielding precision between 0.82 and 0.92 and recall between 0.87 and 0.89 across benign and malignant cases. SOMs, although less accurate, offer superior visualization of high-dimensional data, which aids exploratory analysis and interpretability. The key contribution of this work is a standardized evaluation framework for unsupervised neural clustering in healthcare that combines quantitative clustering metrics with qualitative insights into clinical applicability. The findings indicate that RBMs are better suited to diagnostic tasks that depend on strong pattern recognition, whereas SOMs retain value for data exploration and decision explanation. This research introduces a novel integration of RBM-based clustering into medical analytics, highlighting its potential to support decision-making in oncology. Future work will extend this approach to hybrid models and multi-modal datasets, aiming to balance performance and explainability in complex diagnostic environments.
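The evaluation described in the abstract (latent features from an RBM, internal clustering metrics, then precision and recall via cluster-label mapping) can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not name the dataset source, the libraries, or the hyperparameters. The sketch assumes scikit-learn's `load_breast_cancer` data (which also has 569 records and 30 features), `BernoulliRBM` for feature extraction, `KMeans` with two clusters on the hidden activations, and a majority-vote mapping from clusters to labels. The function name `rbm_cluster_report` and all parameter values are hypothetical, and the output is not expected to reproduce the reported scores.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import load_breast_cancer
from sklearn.metrics import (
    davies_bouldin_score,
    precision_score,
    recall_score,
    silhouette_score,
)
from sklearn.neural_network import BernoulliRBM
from sklearn.preprocessing import MinMaxScaler


def rbm_cluster_report(n_hidden=16, seed=0):
    """Cluster RBM hidden features and score the result (illustrative only)."""
    # Assumption: scikit-learn's bundled dataset stands in for the paper's data.
    X, y = load_breast_cancer(return_X_y=True)  # y: 0 = malignant, 1 = benign

    # BernoulliRBM expects inputs in [0, 1], so rescale each feature.
    X_scaled = MinMaxScaler().fit_transform(X)

    # Hypothetical hyperparameters; the abstract does not report any.
    rbm = BernoulliRBM(n_components=n_hidden, learning_rate=0.05,
                       n_iter=30, random_state=seed)
    hidden = rbm.fit_transform(X_scaled)  # latent features, shape (569, n_hidden)

    # Assumption: clusters are formed by k-means on the hidden activations.
    clusters = KMeans(n_clusters=2, n_init=10,
                      random_state=seed).fit_predict(hidden)

    # Cluster-label mapping: assign each cluster its majority true label.
    mapped = np.empty_like(y)
    for c in np.unique(clusters):
        mask = clusters == c
        mapped[mask] = np.bincount(y[mask], minlength=2).argmax()

    return {
        "n_samples": X.shape[0],
        "n_features": X.shape[1],
        # Internal metrics are computed in the latent space, without labels.
        "silhouette": silhouette_score(hidden, clusters),
        "dbi": davies_bouldin_score(hidden, clusters),
        # Per-class scores, ordered [malignant, benign].
        "precision": precision_score(y, mapped, average=None,
                                     labels=[0, 1], zero_division=0),
        "recall": recall_score(y, mapped, average=None,
                               labels=[0, 1], zero_division=0),
    }


if __name__ == "__main__":
    for name, value in rbm_cluster_report().items():
        print(name, value)
```

A higher Silhouette Score (its maximum is 1) and a lower DBI (its minimum is 0) both indicate more compact, better-separated clusters, which is how the reported values of 0.88 and 0.52 for RBMs compare favorably with 0.34 and 1.47 for SOMs. Computing the metrics in the latent space rather than on the raw features is itself a design choice made here for illustration.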


Keywords


Breast Cancer; Clustering; Restricted Boltzmann Machine; Self-Organizing Map; Unsupervised Learning; Python; Pattern Recognition; Silhouette Score; Applied Data Science


Journal of Applied Data Sciences

ISSN : 2723-6471 (Online)
Collaborated with : Computer Science and Systems Information Technology, King Abdulaziz University, Kingdom of Saudi Arabia.
Publisher : Bright Publisher
Website : http://bright-journal.org/JADS
Email : taqwa@amikompurwokerto.ac.id (principal contact)
    support@bright-journal.org (technical issues)

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0