Hybrid CNN–Vision Transformer Architecture forAccurate Liver Cancer Diagnosis from MedicalImaging

Satyendra  Sharma; Pradeep  Laxkar

doi:10.5281/zenodo.20047881

pdf

Published: 2026-04-18

DOI: https://doi.org/10.5281/zenodo.20047881

Keywords:

Hybrid Deep Learning, CNN, Vision Transformer, Liver Cancer, Medical Imaging, Feature Fusion, Transfer Learning

Satyendra Sharma

ITM (SLS) Baroda University

Pradeep Laxkar

ITM (SLS) Baroda University

Abstract

Detecting liver cancer early remains challenging because medical images can vary widely between patients, and differences
in scan contrast are often subtle. This study describes a hybrid model that combines a CNN with a Vision Transformer, aiming to
capture both fine, local image details and broader contextual information. In this setup, the CNN is used to focus on nearby visual
signals such as edges and textures, while the transformer analyzes the full image to learn longer-range relationships between different
regions. The method is evaluated on public datasets, including LiTS and TCGA-LIHC, with consistent preprocessing applied across
all data. The reported accuracy is 94.8%, which is higher than the results from models using only a CNN or only a transformer.
These findings indicate that leveraging both local and global features may lead to better performance in liver cancer detection

Downloads

Download data is not yet available.

Issue

Vol. 2 No. 4 (2026): April-2026

Section

Review Article

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

This work is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) License. Authors retain the copyright of their work and grant the Journal of Global Research in Multidisciplinary Studies (JGRMS) the right of first publication. This license permits unrestricted use, distribution, adaptation, and reproduction in any medium or format, provided the original author(s), source, and publication are properly credited. Users may copy, redistribute, remix, transform, and build upon the published material for any purpose, including commercial use, in accordance with the terms of the CC BY 4.0 License.

How to Cite

Hybrid CNN–Vision Transformer Architecture forAccurate Liver Cancer Diagnosis from MedicalImaging (S. . Sharma & P. Laxkar , Trans.). (2026). Journal of Global Research in Multidisciplinary Studies(JGRMS), 2(4), 07-11. https://doi.org/10.5281/zenodo.20047881

Article Sidebar

Main Article Content

Abstract

Downloads

Article Details

Issue

Section

How to Cite

Similar Articles