A graph-based approach for absolute 3D hand pose estimation using a single RGB image

dc.contributor.authorKourbane, Ikram
dc.contributor.authorGenç, Yakup
dc.date.accessioned2025-10-29T11:32:57Z
dc.date.issued2022
dc.departmentFakülteler, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü
dc.description.abstractMonocular RGB-based 3D hand pose estimation is n is crucial for a wide range of augmented reality and human-computer interaction applications. However, this task is highly challenging due to occlusion, scale, and depth ambiguities. Most existing methods mainly focus on estimating a scale-normalized root-relative 3D pose from the cropped hand image. In this work, we propose a multi-stage GCN-based (Graph Convolutional Networks) approach to estimate the absolute 3D hand pose from a single RGB image. We exploit both the cropped hand and the global scene image, which provides clues about the hand scale and location in the camera space. Our network consists of three main stages: 2D key-points, 3D root-relative, and 3D absolute pose estimation. To achieve better performance, we propose a new loss function. It separates the extracted image features based on 3D joint locations to simplify the regression task. Extensive experiments on five public datasets show that our efficient model estimates accurate global 3D hand poses and performs favorably against several baselines and state-of-the-art methods. Also, we validate the proposed approach on a newly created dataset. It contains RGB hand images with accurate 3D pose annotations and high lighting and poses variations.
dc.identifier.doi10.1007/s10489-022-03390-x
dc.identifier.endpage16682
dc.identifier.issn0924-669X
dc.identifier.issn1573-7497
dc.identifier.issue14
dc.identifier.orcid0000-0002-6952-6735
dc.identifier.orcid0000-0001-8753-6710
dc.identifier.scopus2-s2.0-85127261235
dc.identifier.scopusqualityQ2
dc.identifier.startpage16667
dc.identifier.urihttps://doi.org/10.1007/s10489-022-03390-x
dc.identifier.urihttps://hdl.handle.net/20.500.14854/12193
dc.identifier.volume52
dc.identifier.wosWOS:000778053200005
dc.identifier.wosqualityQ2
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherSpringer
dc.relation.ispartofApplied Intelligence
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.snmzKA_WOS_20251020
dc.subject3D hand pose estimation
dc.subjectGraph convolutional networks
dc.subjectLoss function
dc.subjectMulti-stage learning
dc.subjectMonocular RGB image
dc.subjectGlobal coordinates
dc.titleA graph-based approach for absolute 3D hand pose estimation using a single RGB image
dc.typeArticle

Dosyalar