A new CNN training approach with application to hyperspectral image classification

dc.contributor.authorKutluk, Sezer
dc.contributor.authorKayabol, Koray
dc.contributor.authorAkan, Aydin
dc.date.accessioned2025-10-29T11:29:24Z
dc.date.issued2021
dc.departmentFakülteler, Mühendislik Fakültesi, Elektronik Mühendisliği Bölümü
dc.description.abstractThree main requirements of a successful application of deep learning are the network architecture, a large enough training dataset, and a good optimization algorithm. In this paper we mainly focus on the optimization part. We propose a training algorithm for convolutional neural networks which makes use of both first and second order derivatives for training different layers. We utilize an approximate second order algorithm for the classification layer while we train the rest of the network with the conventional approach which is backpropagation with first order derivatives. We show that this approach helps us achieve a higher classification accuracy with a much smaller number of training iterations compared to training the whole network with gradient descent based algorithms. Moreover, although second order optimization is generally costlier, we show that the proposed approach is trained faster not only in terms of the number of iterations but also training duration. We also present the integration of CNNs with a probabilistic spatial model and apply this to the land cover classification problem in hyperspectral images. The results show that the algorithm allows us to achieve superior results with a simple network even with limited training data compared to existing approaches. (C) 2021 Elsevier Inc. All rights reserved.
dc.identifier.doi10.1016/j.dsp.2021.103016
dc.identifier.issn1051-2004
dc.identifier.issn1095-4333
dc.identifier.orcid0000-0002-3048-5526
dc.identifier.scopus2-s2.0-85102642648
dc.identifier.scopusqualityQ1
dc.identifier.urihttps://doi.org/10.1016/j.dsp.2021.103016
dc.identifier.urihttps://hdl.handle.net/20.500.14854/11074
dc.identifier.volume113
dc.identifier.wosWOS:000640937700006
dc.identifier.wosqualityQ2
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherAcademic Press Inc Elsevier Science
dc.relation.ispartofDigital Signal Processing
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.snmzKA_WOS_20251020
dc.subjectDeep learning
dc.subjectConvolutional neural networks (CNN)
dc.subjectLogistic regression
dc.subjectOptimization
dc.subjectHyperspectral image classification
dc.titleA new CNN training approach with application to hyperspectral image classification
dc.typeArticle

Dosyalar