A hybrid classification-regression approach for 3D hand pose estimation using graph convolutional networks

dc.contributor.authorKourbane, Ikram
dc.contributor.authorGenç, Yakup
dc.date.accessioned2025-10-29T11:27:13Z
dc.date.issued2022
dc.departmentFakülteler, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü
dc.description.abstractHand pose estimation is a crucial part of a wide range of augmented reality and human-computer interaction applications. Predicting the 3D hand pose from a single RGB image is challenging due to occlusion and depth ambiguities. GCN-based (Graph Convolutional Networks) methods exploit the structural relationship similarity between graphs and hand joints to model kinematic dependencies between joints. These techniques use predefined or global learned joint relationships, which may fail to capture pose dependent constraints. To address this problem, we propose a two-stage GCN-based framework that learns per-pose (per-image) relationship constraints. Specifically, the first phase quantizes the 2D/3D space to classify the joints into 2D/3D blocks based on their locality. This spatial dependency information guides the regression branch to estimate reliable 2D and 3D poses. The second stage further improves the 3D estimation through a GCN-based module that uses an adaptative nearest neighbor algorithm to determine joint relationships. Extensive experiments show that our multi-stage GCN approach yields an efficient model that produces accurate 2D/3D hand poses and outperforms the state-of-the-art on two public datasets.
dc.identifier.doi10.1016/j.image.2021.116564
dc.identifier.issn0923-5965
dc.identifier.issn1879-2677
dc.identifier.orcid0000-0002-6952-6735
dc.identifier.orcid0000-0001-8753-6710
dc.identifier.scopus2-s2.0-85119936926
dc.identifier.scopusqualityQ1
dc.identifier.urihttps://doi.org/10.1016/j.image.2021.116564
dc.identifier.urihttps://hdl.handle.net/20.500.14854/10642
dc.identifier.volume101
dc.identifier.wosWOS:000724345500007
dc.identifier.wosqualityQ2
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherElsevier
dc.relation.ispartofSignal Processing-Image Communication
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/openAccess
dc.snmzKA_WOS_20251020
dc.subject3D hand pose estimation
dc.subjectGraph convolutional networks
dc.subjectClassification
dc.subjectMulti-stage learning
dc.subjectMonocular RGB image
dc.titleA hybrid classification-regression approach for 3D hand pose estimation using graph convolutional networks
dc.typeArticle

Dosyalar