%L uninimx28577 %T An Integrated Machine Learning and Genomic Framework for Precise Detection of Gastric Cancer %J The American Journal of Pathology %K Gastric cancer histological images k-means clustering unsupervised learning convolutional neural networks image processing %D 2026 %R doi:10.1016/j.ajpath.2026.04.014 %X This study presents a novel integrative approach for the analysis of high-dimensional gene expression data, leveraging the complementary strengths of unsupervised clustering and supervised classification. Using K-means clustering, the dataset is stratified into three distinct clusters, revealing intrinsic biological patterns and relationships. The resulting cluster assignments are subsequently employed as pseudo-labels to train machine learning models, including support vector machines, random forest, and a stacking ensemble classifier. To validate and enhance the robustness of clustering, complementary methodologies such as hierarchical clustering and DBSCAN are employed, with results visualized through PCA-driven dimensionality reduction. The high predictive accuracy achieved by the classifiers underscores the separability and reliability of the identified clusters. Furthermore, feature importance analysis highlighted key genetic determinants within each cluster, offering actionable insights into potential biomarkers and critical genomic features. This framework bridges the gap between exploratory unsupervised learning and predictive supervised modeling, providing a scalable and interpretable methodology for analyzing complex genomic datasets. Its applicability extends to biomarker discovery, patient stratification, and other precision medicine applications, emphasizing its utility in advancing genomic research and clinical practice. %A Eshmal Iman %A Sohail Jabbar %A Shabana Ramzan %A Ali Raza %A Farwa Raoof %A Stefanía Carvajal-Altamiranda %A Vivian Lipari %A Imran Ashraf