An Overview of Vision Transformers and Deep Learning Methods for Classifying Remote Sensing Images

Keerthishree P V, Suhas G K, S G Gollagi, Yathisha L

Abstract

The diverse, heterogeneous, and high-dimensional nature of remote-sensing images makes remote-sensing image scene classification (RSISC) an important and challenging task for understanding changes on Earth's surface. The main goal of RSISC is to assign semantic labels to acquired images so that they can be organized according to their semantic content. Deep learning frameworks, especially for image analysis, have seen a sharp rise in interest and development in recent years. Although deep learning approaches are more computationally expensive than conventional machine learning techniques, they have demonstrated great potential in this field. This study provides a thorough evaluation of several deep learning (DL) methods, including Vision Transformers (ViTs) and Convolutional Neural Networks (CNNs) such as ResNet, VGG16, InceptionV3, and DenseNet. We use the publicly available NWPU-RESISC45 and RSI-CB256 remote sensing datasets to assess how well these models perform. The findings show that although conventional CNN architectures perform competitively, Vision Transformers are better at capturing intricate spatial correlations in the data for remote sensing image classification. Because Vision Transformers rely on self-attention, they efficiently model complex spatial relationships and long-range dependencies, which makes them perform exceptionally well on remote sensing imagery. Furthermore, their patch-based processing enables multi-scale feature extraction, which improves accuracy, particularly on high-resolution images.
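
To make the patch-based, self-attention pipeline described above concrete, the following is a minimal, illustrative PyTorch sketch of a ViT-style scene classifier. It is not code from the study; the class name `MiniViT`, the image/patch sizes, and the layer widths are assumptions chosen for brevity, and the 45-class head simply mirrors the number of scene categories in NWPU-RESISC45.

```python
import torch
import torch.nn as nn

class MiniViT(nn.Module):
    """Minimal ViT-style classifier: patch embedding + transformer encoder + linear head."""
    def __init__(self, image_size=256, patch_size=16, dim=192, depth=4,
                 heads=3, num_classes=45):
        super().__init__()
        num_patches = (image_size // patch_size) ** 2
        # Patch embedding: a strided convolution splits the image into
        # non-overlapping patches and projects each patch to a `dim`-d token.
        self.patch_embed = nn.Conv2d(3, dim, kernel_size=patch_size, stride=patch_size)
        self.cls_token = nn.Parameter(torch.zeros(1, 1, dim))
        self.pos_embed = nn.Parameter(torch.zeros(1, num_patches + 1, dim))
        encoder_layer = nn.TransformerEncoderLayer(
            d_model=dim, nhead=heads, dim_feedforward=dim * 4,
            batch_first=True, norm_first=True)
        # Self-attention lets every patch attend to every other patch,
        # capturing long-range spatial dependencies across the whole scene.
        self.encoder = nn.TransformerEncoder(encoder_layer, num_layers=depth)
        self.head = nn.Linear(dim, num_classes)

    def forward(self, x):
        tokens = self.patch_embed(x).flatten(2).transpose(1, 2)   # (B, N, dim)
        cls = self.cls_token.expand(x.shape[0], -1, -1)
        tokens = torch.cat([cls, tokens], dim=1) + self.pos_embed
        tokens = self.encoder(tokens)
        return self.head(tokens[:, 0])                            # classify from the [CLS] token

# Example: a batch of four 256x256 RGB scenes -> logits over 45 scene classes.
logits = MiniViT()(torch.randn(4, 3, 256, 256))
print(logits.shape)  # torch.Size([4, 45])
```

In practice the reviewed ViT models are pretrained on large image corpora and fine-tuned on the remote sensing datasets, rather than trained from scratch as this sketch implies.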
