Towards Trustworthy and Fresh Data Delivery in 6G IoT: A DRL-aided Cognitive NOMA and Backscatter Framework

dc.contributor.authorMazhar, Neha
dc.contributor.authorUllah, Syed Asad
dc.contributor.authorBasheer, Shakila
dc.contributor.authorJung, Haejoon
dc.contributor.authorSolaija, Muhammad Sohaib J.
dc.contributor.authorMahmood, Aamir
dc.contributor.authorGidlund, Mikael
dc.date.accessioned2025-10-29T12:08:23Z
dc.date.issued2025
dc.departmentGebze Teknik Üniversitesi
dc.description.abstractThe proliferation of large-scale Internet-of-things (IoT) deployments and the emergence of 6G wireless technologies have created a pressing need for intelligent, energy-aware, and low-latency communication frameworks. In this work, we propose a novel two-phase reinforcement learning (RL)-based architecture designed to minimize the age of information (AoI) in 6G-enabled IoT networks. Our approach integrates (i) a deep deterministic policy gradient (DDPG)-driven backscatter-assisted cognitive radio non-orthogonal multiple access (CR-NOMA) scheme in the uplink, and (ii) a lightweight Q-learning-based power-domain NOMA (PD-NOMA) strategy for the downlink. In the uplink, energy harvesting (EH) sensors employ deep RL to jointly optimize backscatter reflection coefficients and transmission scheduling over shared spectrum using CR-NOMA. This enables energy-efficient communication and reduced AoI under dynamic energy and channel conditions. In the downlink, the edge node serves multiple IoT users simultaneously using PD-NOMA, where a Q-learning agent intelligently decides whether to transmit fresh or cached data to each user based on battery levels, channel quality, and information freshness. Both phases are modeled as Markov decision processes (MDPs), allowing agents to learn independently and converge toward optimal policies that balance information freshness, spectral efficiency (SE), and energy constraints. Extensive simulations demonstrate that the proposed framework effectively reduces AoI across both phases, with consistent convergence even under varying sensor densities and EH conditions. Moreover, by relying on explainable and verifiable learning mechanisms, our model addresses emerging concerns around reliability and trustworthiness in artificial intelligence (AI)-driven 6G-IoT systems. This framework represents a step toward scalable, adaptive, and responsible AI integration for future mission-critical IoT applications. © 2025 Elsevier B.V., All rights reserved.
dc.identifier.doi10.1109/JIOT.2025.3611868
dc.identifier.isbn9781728176055
dc.identifier.issn2327-4662
dc.identifier.scopus2-s2.0-105016718705
dc.identifier.scopusqualityQ1
dc.identifier.urihttps://doi.org/10.1109/JIOT.2025.3611868
dc.identifier.urihttps://hdl.handle.net/20.500.14854/14469
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherInstitute of Electrical and Electronics Engineers Inc.
dc.relation.ispartofIEEE Internet of Things Journal
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.snmzKA_Scopus_20251020
dc.subjectage of information (AoI)
dc.subjectcognitive radio non-orthogonal multiple access (CR-NOMA)
dc.subjectdeep deterministic policy gradient (DDPG)
dc.subjectInternet-of-things (IoT)
dc.subjectMarkov decision processes (MDPs)
dc.subjectpower-domain NOMA (PD-NOMA)
dc.subjectreinforcement learning (RL)
dc.subjectspectral efficiency (SE) and artificial intelligence (AI)
dc.titleTowards Trustworthy and Fresh Data Delivery in 6G IoT: A DRL-aided Cognitive NOMA and Backscatter Framework
dc.typeArticle

Dosyalar