Towards Trustworthy and Fresh Data Delivery in 6G IoT: A DRL-aided Cognitive NOMA and Backscatter Framework

Mazhar, Neha; Ullah, Syed Asad; Basheer, Shakila; Jung, Haejoon; Solaija, Muhammad Sohaib J.; Mahmood, Aamir; Gidlund, Mikael

doi:10.1109/JIOT.2025.3611868

Towards Trustworthy and Fresh Data Delivery in 6G IoT: A DRL-aided Cognitive NOMA and Backscatter Framework

dc.contributor.author	Mazhar, Neha
dc.contributor.author	Ullah, Syed Asad
dc.contributor.author	Basheer, Shakila
dc.contributor.author	Jung, Haejoon
dc.contributor.author	Solaija, Muhammad Sohaib J.
dc.contributor.author	Mahmood, Aamir
dc.contributor.author	Gidlund, Mikael
dc.date.accessioned	2025-10-29T12:08:23Z
dc.date.issued	2025
dc.department	Gebze Teknik Üniversitesi
dc.description.abstract	The proliferation of large-scale Internet-of-things (IoT) deployments and the emergence of 6G wireless technologies have created a pressing need for intelligent, energy-aware, and low-latency communication frameworks. In this work, we propose a novel two-phase reinforcement learning (RL)-based architecture designed to minimize the age of information (AoI) in 6G-enabled IoT networks. Our approach integrates (i) a deep deterministic policy gradient (DDPG)-driven backscatter-assisted cognitive radio non-orthogonal multiple access (CR-NOMA) scheme in the uplink, and (ii) a lightweight Q-learning-based power-domain NOMA (PD-NOMA) strategy for the downlink. In the uplink, energy harvesting (EH) sensors employ deep RL to jointly optimize backscatter reflection coefficients and transmission scheduling over shared spectrum using CR-NOMA. This enables energy-efficient communication and reduced AoI under dynamic energy and channel conditions. In the downlink, the edge node serves multiple IoT users simultaneously using PD-NOMA, where a Q-learning agent intelligently decides whether to transmit fresh or cached data to each user based on battery levels, channel quality, and information freshness. Both phases are modeled as Markov decision processes (MDPs), allowing agents to learn independently and converge toward optimal policies that balance information freshness, spectral efficiency (SE), and energy constraints. Extensive simulations demonstrate that the proposed framework effectively reduces AoI across both phases, with consistent convergence even under varying sensor densities and EH conditions. Moreover, by relying on explainable and verifiable learning mechanisms, our model addresses emerging concerns around reliability and trustworthiness in artificial intelligence (AI)-driven 6G-IoT systems. This framework represents a step toward scalable, adaptive, and responsible AI integration for future mission-critical IoT applications. © 2025 Elsevier B.V., All rights reserved.
dc.identifier.doi	10.1109/JIOT.2025.3611868
dc.identifier.isbn	9781728176055
dc.identifier.issn	2327-4662
dc.identifier.scopus	2-s2.0-105016718705
dc.identifier.scopusquality	Q1
dc.identifier.uri	https://doi.org/10.1109/JIOT.2025.3611868
dc.identifier.uri	https://hdl.handle.net/20.500.14854/14469
dc.indekslendigikaynak	Scopus
dc.language.iso	en
dc.publisher	Institute of Electrical and Electronics Engineers Inc.
dc.relation.ispartof	IEEE Internet of Things Journal
dc.relation.publicationcategory	Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rights	info:eu-repo/semantics/closedAccess
dc.snmz	KA_Scopus_20251020
dc.subject	age of information (AoI)
dc.subject	cognitive radio non-orthogonal multiple access (CR-NOMA)
dc.subject	deep deterministic policy gradient (DDPG)
dc.subject	Internet-of-things (IoT)
dc.subject	Markov decision processes (MDPs)
dc.subject	power-domain NOMA (PD-NOMA)
dc.subject	reinforcement learning (RL)
dc.subject	spectral efficiency (SE) and artificial intelligence (AI)
dc.title	Towards Trustworthy and Fresh Data Delivery in 6G IoT: A DRL-aided Cognitive NOMA and Backscatter Framework
dc.type	Article

Koleksiyon

Scopus İndeksli Yayınlar Koleksiyonu

Towards Trustworthy and Fresh Data Delivery in 6G IoT: A DRL-aided Cognitive NOMA and Backscatter Framework

Dosyalar

Koleksiyon