Token Mixing for Breast Cancer Diagnosis: Pre-Trained MLP-Mixer Models on Mammograms

Abstract

Breast cancer remains a leading cause of mortality among women, necessitating accurate and computationally efficient diagnostic solutions. Deep learning, particularly convolutional neural networks (CNNs), has significantly advanced mammographic analysis by automating feature extraction and improving early detection. However, CNNs rely on localised feature extraction, limiting their ability to capture long-range dependencies essential for robust classification. This study introduces and evaluates the effectiveness of pre-trained MLP-Mixer models using transfer learning as an alternative to CNN-based approaches, utilising their token-mixing and channel-mixing mechanisms to integrate local and global spatial features in mammograms. Four MLP-Mixer variants (B/16, L/16, B/32, and L/32) were systematically assessed on three benchmark datasets: CBIS-DDSM, INbreast, and MIAS. The results demonstrate that MLP-Mixer models, particularly those with smaller patch sizes (L/16 and B/16), consistently achieve state-of-the-art accuracy and sensitivity, while also offering 30 – 50% faster inference times compared to leading CNNs such as ResNet and DenseNet. These models demonstrate strong generalisation across multiple benchmark datasets and strike an effective balance between diagnostic accuracy and computational efficiency, which are essential requirements for clinical deployment. Their performance underscores the importance of fine-grained feature extraction in mammographic analysis. Comparative results indicate that MLP-Mixer models offer a compelling alternative to conventional CNNs by efficiently capturing both local and global dependencies without the high computational demands of deep convolutional network architectures. These findings highlight the promise of token-based models for AI-assisted breast cancer diagnosis and suggest that MLP-Mixer architectures are well-suited for real-time medical imaging applications. By enabling direct global spatial interaction, reducing architectural complexity, and improving diagnostic precision across varied imaging conditions, MLP-Mixers offer a computationally efficient alternative to traditional CNNs without compromising accuracy.

Publication DOI: https://doi.org/10.1109/ACCESS.2025.3586139
Divisions: College of Engineering & Physical Sciences > Aston Digital Futures Institute
Aston University (General)
Funding Information: This work was supported in part by the Brunel University of London Research Funding Scheme.
Additional Information: Copyright © 2025 The Authors. This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
Uncontrolled Keywords: Breast cancer diagnosis,computer-aided diagnosis,deep learning,mammography,pre-trained multi-layer perceptron (MLP)-mixer models,pretrained convolution neural network models,General Computer Science,General Materials Science,General Engineering
Publication ISSN: 2169-3536
Data Access Statement: In this study, we use three publicly available datasets: MIAS (Mammographic Image Analysis Society database) (https://www.repository.cam.ac.uk/items/b6a97f0c-3b9b-40ad-8f18-3d121eef1459), CBIS-DDSM (Curated Breast Imaging Subset of the Digital Database for Screening Mammography) (https://www.cancerimagingarchive.net/collection/cbis-ddsm/), and INbreast https://medicalresearch.inescporto.pt/breastresearch/index.php/Get_INbreast_Database).
Last Modified: 09 Jan 2026 08:09
Date Deposited: 08 Jan 2026 15:12
Full Text Link:
Related URLs: https://ieeexpl ... cument/11075669 (Publisher URL)
http://www.scop ... tnerID=8YFLogxK (Scopus URL)
PURE Output Type: Article
Published Date: 2025-07-16
Published Online Date: 2025-07-10
Accepted Date: 2025-06-30
Authors: Ahmed, Hosameldin O.A. (ORCID Profile 0000-0002-8523-1099)
Nandi, Asoke K.

Download

[img]

Version: Published Version

License: Creative Commons Attribution


Export / Share Citation


Statistics

Additional statistics for this record