Dr Mahdi Boloursaz Mashhadi

Lecturer in Communications and AI

m.boloursazmashhadi@surrey.ac.uk

Academic and research departments

Institute for Communication Systems, School of Computer Science and Electronic Engineering.

About

Biography

Mahdi Boloursaz Mashhadi (Senior Member, IEEE) is a Lecturer at the 5G/6G Innovation Centre (5G/6GIC) at the Institute for Communication Systems (ICS), School of Computer Science and Electronic Engineering (CSEE), University of Surrey (UoS), UK, and a Surrey AI fellow. His research is focused at the intersection of AI/ML with wireless communication, learning and communication co-design, generative AI for telecommunications, and collaborative machine learning. Prior to joining ICS, he was a postdoctoral research associate at the Intelligent Systems and Networks (ISN) Research Group, Imperial College London, 2019-2021. He received B.S., M.S., and Ph.D. degrees in mobile telecommunications from the Sharif University of Technology (SUT), Tehran, Iran. He has more than 40 peer reviewed publications and patents in the areas of wireless communications, machine learning, and signal processing. He is a PI/Co-PI for various government and industry funded projects including the UKTIN/DSIT 12M£ national project TUDOR. He received the Best Paper Award from the IEEE EWDTS conference, and the Exemplary Reviewer Award from the IEEE ComSoc in 2021 and 2022. He served as a panel judge for the International Telecommunication Union (ITU) on the “AI/ML in 5G” challenge 2021- 2022. He is an editor for the Springer Nature Wireless Personal Communications Journal.

Affiliations and memberships

IEEE Senior Member, Surrey AI Fellow
Fellow of the Higher Education Academy (FHEA)
Editor, Springer Nature Wireless Personal Communications Journal

News

10 JUN 2022

Spotlight on ‘Wireless AI’ in the 6GIC Selected Advanced Topics Workshop Series

In the media

20 February 2025

Invited Talk at IEEE Future Networks on "Goal-Oriented Generative Semantic Communications with Multimodal LLMs"

Invited Speaker

IEEE Future Networks Webinar Series

05 October 2023

1st Rank in ITU AI/ML in 5G challenge

Research Supervisor

ITU AI/ML in 5G challenge

23 June 2022

First 6GIC-CLICK Selected Advanced Topics Workshop on "Wireless AI"

Chair

6GIC Selected Advanced Topics Workshop Series (6GIC-CLICK)

01 November 2021

Special Session on "Machine Learning for Communications"

Chair

55'th Annual Asilomar Conference on Signals, Systems, and Computers

Research

Research interests

My current research is focused at the intersection of AI and machine learning and wireless communications. I'm interested in the specific role of AI and machine learning in future generation wireless networks. I am working on generative AI for telecommunications, AIoT systems, and the joint design of smart machine learning agents and the underlying wireless network to achieve goal oriented and semantic communications. I am looking at the interactions between AI and wireless communications which can be either AI for wireless communications or wireless communications for collaborative/distributed/federated machine learning.

Research projects

TOWARDS UBIQUITOUS 3D OPEN RESILIENT NETWORK (TUDOR) (Co-PI)
The TUDOR Project is a £12M UK flagship research project funded by the Department of Science, Innovation and Technology (DSIT). It targets low technology readiness level (TRL) research, aiming to tackle strategic technical challenges oriented to the design of future 6G paradigm.
Start date: February 2023 - End date: January 2025.

Supervision

Postgraduate research supervision

Post Doctoral Researchers:

I am looking to recruit a motivated Post Doctoral researcher working on "Emerging AI Technologies for 6G".

PhD Students:

-Jingxuan Men (PG/R - Comp Sci & Elec Eng)

-Farshad Zeinali Esfahani (PG/R - Comp Sci & Elec Eng)

-Xinkai Liu (PG/R - Comp Sci & Elec Eng, ICS)

-Sotiris Chatzimiltis (PG/R - Comp Sci & Elec Eng, ICS)

Visiting Researchers:

-Yunqi Gao (Visiting researcher from Zhejiang University, China)

-Ruoqi Wen (Visiting researcher from Zhejiang University, China)

Alumni:

-Li Qiao (PG/R - Comp Sci & Elec Eng, ICS)

-Dr. Daesung Yu, PDRA in AI for Communications

-Dr. Chunmei Xu, PDRA in Semantic Communications

-Tatsuya Kikuzuki, Visiting Researcher from Fujitsu Japan

-Mahnoosh Mahdavimoghadam (PG/R - Comp Sci & Elec Eng, ICS)

I am recruiting PhD students in topics of Communications, AI and Signal processing. Interested applicants send CVs to: m.boloursazmashhadi@surrey.ac.uk

Teaching

FUNDAMENTALS OF MACHINE LEARNING FOR COMMUNICATIONS - 2025/6 - University of Surrey

DATA & INTERNET NETWORKING - 2025/6 - University of Surrey

LABORATORIES, DESIGN & PROFESSIONAL STUDIES I - 2025/6 - University of Surrey

Publications

Mojtaba Amiri, Amirhosein Mohammadzadeh, Farshad Zeinali, Mohammad Robat Mili, Mahdi Boloursaz Mashhadi, Pei Xiao (2025)Movable Antenna SWIPT Systems with STAR-RIS: A Meta DRL Approach, In: IEEE Transactions on Vehicular Technology Institute of Electrical and Electronics Engineers (IEEE)

DOI: 10.1109/TVT.2025.3622305

The movable antenna (MA) technology has recently demonstrated significant potential in improving the communication performance by enabling local movement of the antennas to obtain better channel conditions. Simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) has also emerged as a promising technology enabling full-space coverage by dividing incident signals into independently tunable transmitted and reflected components. In this paper, we propose a novel STAR-RIS-assisted multiuser communication system with MAs for simultaneous wireless information and power transfer (SWIPT). In order to achieve maximum sum-rate and energy harvesting efficiency, the proposed system optimizes beamforming, transmission/reflection coefficients (RCs), and MA positions simultaneously using a meta twin delay deep deterministic policy gradient (MTD3) reinforcement learning (RL) approach. The numerical results demonstrate the superiority of MTD3 over conventional baselines across various system settings, including different numbers of RIS elements, power budget, and minimum energy constraints, making it more robust and efficient for real-time resource allocation in SWIPT systems.

Yunqi Gao, Bing Hu, Mahdi Boloursaz Mashhadi, A-Long Jin, Yanfeng Zhang, Pei Xiao, Rahim Tafazolli, Merouane Debbah (2025)FlowMoE: A Scalable Pipeline Scheduling Framework for Distributed Mixture-of-Experts Training, In: Advances in Neural Information Processing Systems 38 (NeurIPS 2025) NeurIPS

The parameter size of modern large language models (LLMs) can be scaled up via the sparsely-activated Mixture-of-Experts (MoE) technique to avoid excessive increase of the computational costs. To further improve training efficiency, pipelining computation and communication has become a promising solution for distributed MoE training. However, existing work primarily focuses on scheduling tasks within the MoE layer, such as expert computing and all-to-all (A2A) communication , while neglecting other key operations including multi-head attention (MHA) computing, gating, and all-reduce communication. In this paper, we propose FlowMoE, a scalable framework for scheduling multi-type task pipelines. First, FlowMoE constructs a unified pipeline to consistently scheduling MHA computing , gating, expert computing, and A2A communication. Second, FlowMoE introduces a tensor chunk-based priority scheduling mechanism to overlap the all-reduce communication with all computing tasks. We implement FlowMoE as an adaptive and generic framework atop PyTorch. Extensive experiments with 675 typical MoE layers and four real-world MoE models across two GPU clusters demonstrate that our proposed FlowMoE framework outperforms state-of-the-art MoE training frameworks, reducing training time by 13%-57%, energy consumption by 10%-39%, and memory usage by 7%-32%. FlowMoE's code is available at https://github.com/ZJU-CNLAB/FlowMoE.

Mengmeng Ren, Li Qiao, Long Yang, Zhen Gao, Jian Chen, Mahdi Boloursaz Mashhadi, Pei Xiao, Rahim Tafazolli, Mehdi Bennis (2505)Generative Semantic Communication via Textual Prompts: Latency Performance Tradeoffs, In: IEEE Transactions on Vehicular Technology74(9) IEEE

DOI: 10.1109/TVT.2025.3566488

This paper develops an edge-device collaborative Generative Semantic Communications (Gen SemCom) framework leveraging pre-trained Multi-modal/Vision Language Models (M/VLMs) for ultra-low-rate semantic communication via textual prompts. The proposed framework optimizes the use of M/VLMs on the wireless edge/device to generate high-fidelity textual prompts through visual captioning/question answering, which are then transmitted over a wireless channel for SemCom. Specifically, we develop a multiuser Gen SemCom framework using pre-trained M/VLMs, and formulate a joint optimization problem of prompt generation offloading, communication and computation resource allocation to minimize the latency and maximize the resulting semantic quality. Due to the non-convex nature of the problem with highly coupled discrete and continuous variables, we decompose it as a two-level problem and propose a low-complexity swap/leaving/joining (SLJ)-based matching algorithm. Simulation results demonstrate significant performance improvements over the conventional semantic-unaware/non-collaborative generation offloading benchmarks. Index Terms—Pre-trained multi-modal/vision language models (M/VLMs), semantic communication, zero/few-shot captioning, collaborative edge-device generative AI.

Jingxuan Men, Carl Chinonso Udora, Ning Wang, Mahdi Boloursaz Mashhadi, Yi Ma, Mike Nilsson (2024)Demo Abstract: Semantic Communications for Immersive Multi-view Media Delivery

Our demonstration highlights the use of semantic communication to transmit stereo frame streams and its application in immersive media. Considering the limitations of traditional stereo compression with digital transmission under poor channel conditions, such as the cliff effect and high latency in computation and transmission, we employ the Deep Joint Source and Channel Coding (Deep JSCC) framework to transmit semantic information of stereo streams between the sender and receiver. To address channel instability, we propose a dynamic rate adjustment method that adapts to channel conditions while maintaining transmission efficiency and reconstruction quality. Furthermore, we extend this work to stereo stream applications, enabling the real-time synthesis of multiple novel view streams of the streamer. The overall design of this demo enables an immersive multi-view experience by transmitting rate-controlled semantic features from only two perspectives.

Chunmei Xu, Mahdi Boloursaz Mashhadi, Yi Ma, Rahim Tafazolli, Jiangzhou Wang (2025)Generative Semantic Communications With Foundation Models: Perception-Error Analysis and Semantic-Aware Power Allocation, In: IEEE Journal on Selected Areas in Communications43(7)pp. 2493-2505 Institute of Electrical and Electronics Engineers (IEEE)

DOI: 10.1109/JSAC.2025.3559120

Generative foundation models can revolutionize the design of semantic communication (SemCom) systems allowing high fidelity exchange of semantic information at ultra low rates. In this work, a generative SemCom framework with pretrained foundation models is proposed, where both uncoded forward-with-error and coded discard-with-error schemes are developed for the semantic decoder. To characterize the impact of transmission reliability on the perceptual quality of the regenerated signal, their mathematical relationship is analyzed from a rate-distortion-perception perspective, which is proved to be non-decreasing. The semantic values are defined to measure the semantic information of multimodal semantic features accordingly. We also investigate semantic-aware power allocation problems aiming at power consumption minimization for ultra low rate and high fidelity SemComs. To solve these problems, two semantic-aware power allocation methods are proposed by leveraging the non-decreasing property of the perception-error relationship. Numerically, perception-error functions and semantic values of semantic data streams under both schemes for image tasks are obtained based on the Kodak dataset. Simulation results show that our proposed semanticaware method significantly outperforms conventional approaches, particularly in the channel-coded case (up to 90% power saving).

Li Qiao, Mahdi Boloursaz Mashhadi, Zhen Gao, Rahim Tafazolli, Mehdi Bennis, Dusit (Tao) Niyato (2025)Token Communications: A Large Model-Driven Framework for Cross-modal Context-aware Semantic Communications, In: Pre-print

In this paper, we introduce token communications (TokCom), a large model-driven framework to leverage cross-modal context information in generative semantic communications (GenSC). TokCom is a new paradigm, motivated by the recent success of generative foundation models and multimodal large language models (GFM/MLLMs), where the communication units are tokens, enabling efficient transformer-based token processing at the transmitter and receiver. In this paper, we introduce the potential opportunities and challenges of leveraging context in GenSC, explore how to integrate GFM/MLLMs-based token processing into semantic communication systems to leverage cross-modal context effectively at affordable complexity, present the key principles for efficient TokCom at various layers in future wireless networks. In a typical image semantic communication setup, we demonstrate a significant improvement of the bandwidth efficiency, achieved by TokCom by leveraging the context information among tokens. Finally, the potential research directions are identified to facilitate adoption of TokCom in future wireless networks.

Behrad Mahmoudi, Ahmad Khonsari, Farshad Zeinali, Mohammad Robat Mili, Mahdi Boloursaz Mashhadi, Pei Xiao (2025)Beamforming and Reflection Design for Short Packet ISAC With Non-Ideal RIS: An A3C-Based Approach, In: IEEE transactions on vehicular technology74(5)pp. 1-6 IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TVT.2025.3525525

Integrated sensing and communication (ISAC) is a promising solution to mitigate the increasing congestion of the wireless spectrum. In this paper, we investigate the short packet communication regime within an ISAC system assisted by a reconfigurable intelligent surface (RIS) to meet the low latency ultra-reliable requirements in the next-generation wireless networks. We consider a non-ideal RIS model that captures effects of the phase-dependent amplitude variations in the reflection coefficients, and we have incorporated the near-field model into the channels between the RIS and the users or targets. In this setup, we jointly design the transmit beamforming and the RIS phase shifts to maximize the sum rate while satisfying the sensing signal-to-noise ratio (SNR) requirement. The system simultaneously carries out multitarget sensing and multi-user short packet communications with the help of the RIS. Considering the nonconvex and dynamic nature of the resulting optimization problem, we propose an asynchronous advantage actor-critic (A3C) based method for beamforming and reflection design in this setup. Numerical results demonstrate the superiority of the proposed scheme over the benchmarks.

Yunqi Gao, Zechao Zhuang, Bing Hu, Mahdi Boloursaz Mashhadi, A-Long Jin, Pei Xiao (2025)NetPlacer+: Model Parallelism based on Load Balance in Distributed Deep Learning, In: IEEE transactions on emerging topics in computational intelligenceEarly Access IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TETCI.2025.3543765

The importance of Model Parallelism in Distributed Deep Learning continues to grow due to the increase in the Deep Neural Network (DNN) scale and the demand for higher training speed. Different from all the existing works, we propose a model-parallel strategy called NetPlacer+ based on load balance. The major idea in NetPlacer+ is to partition the DNN model into multiple devices by balancing each device’s computation and communication load. We build the mathematical model of NetPlacer+. We transform the mathematical model of NetPlacer+ and obtain its approximate optimal solution using the interior point method. Extensive experiments in two GPU clusters and eight modern DNNs are conducted to verify the effectiveness of NetPlacer+. Experimental results show that the model-parallel strategy of NetPlacer+ achieves up to 1.25x speedup compared to NVIDIA’s DLPlacer.

Yunqi Gao, Bing Hu, Mahdi Boloursaz Mashhadi, Wei Wang, Mehdi Bennis (2025)PipeSFL: A Fine-Grained Parallelization Framework for Split Federated Learning on Heterogeneous Clients, In: IEEE transactions on mobile computing24(3)pp. 1774-1791 IEEE

DOI: 10.1109/TMC.2024.3489642

Split Federated Learning (SFL) improves scalability of Split Learning (SL) by enabling parallel computing of the learning tasks on multiple clients. However, state-of-the-art SFL schemes neglect the effects of heterogeneity in the clients’ computation and communication performance as well as the computation time for the tasks offloaded to the cloud server. In this paper, we propose a fine-grained parallelization framework, called PipeSFL, to accelerate SFL on heterogeneous clients. PipeSFL is based on two key novel ideas. First, we design a server-side priority scheduling mechanism to minimize per-iteration time. Second, we propose a hybrid training mode to reduce per-round time, which employs asynchronous training within rounds and synchronous training between rounds. We theoretically prove the optimality of the proposed priority scheduling mechanism within one round and analyze the total time per round for PipeSFL, SFL and SL. We implement PipeSFL on PyTorch. Extensive experiments on seven 64-client clusters with different heterogeneity demonstrate that at training speed, PipeSFL achieves up to 1.65x and 1.93x speedup compared to EPSL and SFL, respectively. At energy consumption, PipeSFL saves up to 30.8% and 43.4% of the energy consumed within each training round compared to EPSL and SFL, respectively. PipeSFL’s code is available at https://github.com/ZJU-CNLAB/PipeSFL.

Li Qiao, Mahdi Boloursaz Mashhadi, Zhen Gao, Chuan Heng Foh, Pei Xiao, Mehdi Bennis (2024)Latency-Aware Generative Semantic Communications with Pre-Trained Diffusion Models, In: IEEE wireless communications letters13(10)pp. 2652-2656 IEEE

DOI: 10.1109/LWC.2024.3429295

—Generative foundation AI models have recently shown great success in synthesizing natural signals with high perceptual quality using only textual prompts and conditioning signals to guide the generation process. This enables semantic communications at extremely low data rates in future wireless networks. In this paper, we develop a latency-aware semantic communications framework with pre-trained generative models. The transmitter performs multi-modal semantic decomposition on the input signal and transmits each semantic stream with the appropriate coding and communication schemes based on the intent. For the prompt, we adopt a re-transmission-based scheme to ensure reliable transmission, and for the other semantic modalities we use an adaptive modulation/coding scheme to achieve robustness to the changing wireless channel. Furthermore , we design a semantic and latency-aware scheme to allocate transmission power to different semantic modalities based on their importance subjected to semantic quality constraints. At the receiver, a pre-trained generative model synthesizes a high fidelity signal using the received multi-stream semantics. Simulation results demonstrate ultra-low-rate, low-latency, and channel-adaptive semantic communications.

Kaidi Wang, Yi Ma, Mahdi Boloursaz Mashhadi, Chuan Heng Foh, Rahim Tafazolli, Zhi Ding (2025)Convergence Acceleration in Wireless Federated Learning: A Stackelberg Game Approach, In: IEEE Transactions on Vehicular Technology74(1)pp. 714-729 Institute of Electrical and Electronics Engineers (IEEE)

DOI: 10.1109/TVT.2024.3452933

This paper studies issues that arise with respect to the joint optimization for convergence time in federated learning over wireless networks (FLOWN). We consider the criterion and protocol for selection of participating devices in FLOWN under the energy constraint and derive its impact on device selection. In order to improve the training efficiency, age-of-information (AoI) enables FLOWN to assess the freshness of gradient updates among participants. Aiming to speed up convergence, we jointly investigate global loss minimization and latency minimization in a Stackelberg game based framework. Specifically, we formulate global loss minimization as a leader-level problem for reducing the number of required rounds, and latency minimization as a follower-level problem to reduce time consumption of each round. By decoupling the follower-level problem into two sub-problems, including resource allocation and sub-channel assignment, we achieve an optimal strategy of the follower through monotonic optimization and matching theory. At the leader-level, we derive an upper bound of convergence rate and subsequently refor-mulate the global loss minimization problem and propose a new age-of-update (AoU) based device selection algorithm. Simulation results indicate the superior performance of the proposed AoU based device selection scheme in terms of the convergence rate, as well as efficient utilization of available sub-channels.

Yunqi Gao, Bing Hu, Mahdi Boloursaz Mashhadi, A-Long Jin, Pei Xiao, Chunming Wu (2024)US-Byte: An Efficient Communication Framework for Scheduling Unequal-sized Tensor Blocks in Distributed Deep Learning, In: IEEE transactions on parallel and distributed systems : a publication of the IEEE Computer Society35(1)pp. 123-139 IEEE

DOI: 10.1109/TPDS.2023.3331372

The communication bottleneck severely constrains the scalability of distributed deep learning, and efficient communication scheduling accelerates distributed DNN training by overlapping computation and communication tasks. However, existing approaches based on tensor partitioning are not efficient and suffer from two challenges: (1) the fixed number of tensor blocks transferred in parallel can not necessarily minimize the communication overheads; (2) although the scheduling order that preferentially transmits tensor blocks close to the input layer can start forward propagation in the next iteration earlier, the shortest per-iteration time is not obtained. In this paper, we propose an efficient communication framework called US-Byte. It can schedule unequal-sized tensor blocks in a near-optimal order to minimize the training time. We build the mathematical model of US-Byte by two phases: (1) the overlap of gradient communication and backward propagation, and (2) the overlap of gradient communication and forward propagation. We theoretically derive the optimal solution for the second phase and efficiently solve the first phase with a low-complexity algorithm. We implement the US-Byte architecture on PyTorch framework. Extensive experiments on two different 8-node GPU clusters demonstrate that US-Byte can achieve up to 1.26x and 1.56x speedup compared to ByteScheduler and WFBP, respectively. We further exploit simulations of 128 GPUs to verify the potential scaling performance of US-Byte. Simulation results show that US-Byte can achieve up to 1.69x speedup compared to the state-of-the-art communication framework.

Li Qiao, Zhen Gao, Mahdi Boloursaz Mashhadi, Deniz Gunduz (2024)Massive Digital Over-the-Air Computation for Communication-Efficient Federated Edge Learning, In: IEEE journal on selected areas in communications : a publication of the IEEE Communications Society42(11)pp. 3078-3094 IEEE

DOI: 10.1109/JSAC.2024.3431572

Over-the-air computation (AirComp) is a promising technology converging communication and computation over wireless networks, which can be particularly effective in model training, inference, and more emerging edge intelligence applications. AirComp relies on uncoded transmission of individual signals, which are added naturally over the multiple access channel thanks to the superposition property of the wireless medium. Despite significantly improved communication efficiency, how to accommodate AirComp in the existing and future digital communication networks, that are based on discrete modulation schemes, remains a challenge. This paper proposes a massive digital AirComp (MD-AirComp) scheme, that leverages an unsourced massive access protocol, to enhance compatibility with both current and next-generation wireless networks. MD-AirComp utilizes vector quantization to reduce the uplink communication overhead, and employs shared quantization and modulation codebooks. At the receiver, we propose a near-optimal approximate message passing-based algorithm to compute the model aggregation results from the superposed sequences, which relies on estimating the number of devices transmitting each code sequence, rather than trying to decode the messages of individual transmitters. We apply MD-AirComp to the federated edge learning (FEEL), and show that it significantly accelerates FEEL convergence compared to state-of-the-art while using the same amount of communication resources.

Somayeh Aghashahi, Zolfa Zeinalpour-Yazdi, Aliakbar Tadaion, Mahdi Boloursaz Mashhadi, Ahmed Elzanaty (2024)Single Antenna Tracking and Localization of RIS-enabled Vehicular Users, In: IEEE transactions on vehicular technology74(3)pp. 1-13 IEEE

DOI: 10.1109/TVT.2024.3492542

Reconfigurable Intelligent Surfaces (RISs) are envisioned to be employed in next generation wireless networks to enhance the communication and radio localization services. In this paper, we propose novel localization and tracking algorithms exploiting reflections through RISs at multiple receivers. We utilize a single antenna transmitter (Tx) and multiple single antenna receivers (Rxs) to estimate the position and the velocity of users (e.g. vehicles) equipped with RISs. Then, we design the RIS phase shifts to separate the signals from different users. The proposed algorithms exploit the geometry information of the signal at the RISs to localize and track the users. We also conduct a comprehensive analysis of the Cramer-Rao lower bound (CRLB) of the localization system. Compared to the time of arrival (ToA)-based localization approach, the proposed method reduces the localization error by a factor up to three. Also, the simulation results show the accuracy of the proposed tracking approach.

Yunqi Gao, Bing Hu, Mahdi Boloursaz Mashhadi, Wei Wang, Rahim Tafazolli, Merouane Debbah (2024)A Dynamic Sliding Window based Tensor Communication Scheduling Framework for Distributed Deep Learning, In: IEEE Transactions on Network Science and Engineering12(2) Institute of Electrical and Electronics Engineers (IEEE)

DOI: 10.1109/TNSE.2024.3523320

Simultaneous tensor communication can effectively improve the scalability of distributed deep learning on large clusters. However, a fixed number of tensor blocks communicated concurrently violates the priority-based scheduling strategy and cannot minimize communication overheads. In this paper, we propose a novel simultaneous tensor communication framework, namely D-Credit, which transmits tensor blocks based on dynamic sliding windows to minimize per-iteration time in distributed DNN training. We build the mathematical model of D-Credit in two phases: (1) the overlap of gradient communication and backward propagation, and (2) the overlap of gradient communication and forward computation. We drive the optimal window sizes for the second phase analytically, and develop a greedy algorithm to efficiently determine the dynamic window sizes for the first phase of D-Credit. We implement the D-Credit architecture on PyTorch framework. Experimental results on two different GPU clusters demonstrate that at training speed, D-Credit can achieve up to 1.26x, 1.21x, 1.48x and 1.53x speedup compared to ByteScheduler, DeAR, PyTorch-DDP and WFBP, respectively. At energy consumption, D-Credit saves up to 17.8% and 25.1% of the training energy consumption compared to ByteScheduler and WFBP, respectively. D-Credit’s code is available at https://github.com/ZJU-CNLAB/D-Credit.

Chunmei Xu, Mahdi Boloursaz Mashhadi, Yi Ma, Rahim Tafazolli (2024)Semantic-Aware Power Allocation for Generative Semantic Communications with Foundation Models, In: Proc. of IEEE Global Communications Conference (GLOBECOM) IEEE

—Recent advancements in diffusion models have led to a significant breakthrough in generative modeling. The combination of the generative model and semantic communication (SemCom) enables high-fidelity semantic information exchange at ultra-low rates. In this paper, a novel generative SemCom framework for image tasks is proposed, utilizing pre-trained foundation models as semantic encoders and decoders for semantic feature extraction and image regeneration, respectively. The mathematical relationship between transmission reliability and the perceptual quality of regenerated images is modeled and the semantic values of extracted features are defined accordingly. This relationship is derived through numerical simulations on the Kodak dataset. Furthermore, we investigate the semantic-aware power allocation problem, aiming to minimize total power consumption while guaranteeing semantic performance. To solve this problem, two semantic-aware power allocation methods are proposed by constraint decoupling and bisection search, respectively. Numerical results demonstrate that the proposed semantic-aware methods outperform conventional approach in terms of total power consumption.

Tatsuya Kikuzuki, Mahdi Boloursaz Mashhadi, Yi Ma, Rahim Tafazolli (2024)Attention on the Preambles: Sensing with mmWave CSI, In: IEEE open journal of the Communications Society

DOI: 10.1109/OJCOMS.2024.3475989

The ubiquitous availability of wireless networks and devices provides a unique opportunity to leverage the corresponding communication signals to enable wireless sensing applications. In this article, we develop a new framework for environment sensing by opportunistic use of the mmWave communication signals. The proposed framework is based on a mixture of the conventional and Neural Network (NN) signal processing techniques for simultaneous counting and localization of multiple targets in the environment in a bi-static setting. In this framework, multi-modal delay, Doppler, angular features are first derived from the Channel State Information (CSI) estimated at the receiver, and then a transformer-based NN architecture exploiting attention mechanisms, called CSIformer, is designed to extract the most effective features for sensing. We also develop a novel post-processing technique based on Kullback-Leibler (KL) minimization to transfer knowledge between the counting and localization tasks, thereby simplifying the NN architecture. Our numerical results show accurate counting and localization capabilities that significantly outperform the existing works based on pure conventional signal processing techniques, as well as NN-based approaches. The simulation codes are available at: https://github.com/University-of-Surrey-Mahdi/Attention-on-the-Preambles-Sensing-with-mmWave-CSI.

Qianqian Yang, Mandi Boloursaz Mashhadi, Deniz Gunduz (2019)DEEP CONVOLUTIONAL COMPRESSION FOR MASSIVE MIMO CSI FEEDBACK, In: 2019 IEEE 29TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP)2019-8918798pp. 1-6 IEEE

DOI: 10.1109/mlsp.2019.8918798

Soheil Salehi, Mandi Boloursaz Mashhadi, Alireza Zaeemzadeh, Nazanin Rahnavard, Ronald F. DeMara, Mahdi Boloursaz Mashhadi (2018)Energy-Aware Adaptive Rate and Resolution Sampling of Spectrally Sparse Signals Leveraging VCMA-MTJ Devices, In: IEEE journal on emerging and selected topics in circuits and systems8(4)8418353pp. 679-692 IEEE

DOI: 10.1109/JETCAS.2018.2857998

This paper devises a novel adaptive framework for the energy-aware acquisition of spectrally sparse signals. The adaptive quantized compressive sensing (CS) techniques, beyond-complementary metal-oxide-semiconductor (CMOS) hardware architecture, and corresponding algorithms which utilize them have been designed concomitantly to minimize the overall of signal acquisition. First, a spin-based adaptive intermittent quantizer (AIQ) is developed to facilitate the realization of the adaptive sampling technique. Next, a framework for smart and adaptive determination of the sampling rate and quantization resolution based on the instantaneous signal and hardware constraints is introduced. Finally, signal reconstruction algorithms which process the quantized CS samples are investigated. Simulation results indicate that an AIQ architecture using a spin-based quantizer incurs only 20.98-mu W power dissipation on average using 22-nm technology for 1-8 bits uniform output. Furthermore, in order to provide 8-bit quantization resolution, 85.302-mu W maximum power dissipation is attained. Our results indicate that the proposed AIQ design provides up to 6.18-mW power savings on average compared to other adaptive rate and resolution CMOS-based CS analog-to-digital converter designs. In addition, the mean square error values achieved by the simulation results confirm efficient reconstruction of the signal based on the proposed approach.

Reza Kazemi, Mahdi Boloursaz, Seyed M. Etemadi, Fereydoon Behnia, Mahdi Boloursaz Mashhadi (2016)Capacity Bounds and Detection Schemes for Data Over Voice, In: IEEE transactions on vehicular technology65(11)7387776pp. 8964-8977 IEEE

DOI: 10.1109/TVT.2016.2519926

Cellular networks provide widespread and reliable voice communications among subscribers through mobile voice channels. These channels benefit from superior priority and higher availability compared with conventional cellular data communication services, such as General Packet Radio Service, Enhanced Data Rates for GSM Evolution, and High-Speed Downlink Packet Access. These properties are of major interest to applications that require transmitting small volumes of data urgently and reliably, such as an emergency call in vehicular applications. This encourages excessive research to make digital communication through voice channels feasible, leading to the emergence of Data over Voice (DoV) technology. In this research, we investigate the challenges of transmitting data through mobile voice channels. We introduce a simplified information-theoretic model of the vocoder channel and derive bounds on its capacity. By invoking detection theory concepts and conjecturing Weibull and chi-square distributions for approximately modeling the probability distribution of channel output, we propose improved detection schemes based on the mentioned distributions and compare the achieved performances with the calculated bounds and other state-of-the-art DoV structures. Moreover, in common mobile networks, the vocoder compression rate is adopted in accordance with the network traffic adaptively. Although this phenomenon affects the overall capacity significantly, it has been overlooked by previous research studies. In this research, we apply the Gilbert-Elliott (GE) model to the voice channel, extract the required model parameters from the Markov model, and bound the overall voice channel capacity by considering the adaptive rate adjustment phenomenon.

Sahar Sadrizadeh, Shahrzad Kiani, Mahdi Boloursaz, Farokh Marvasti, Mahdi Boloursaz Mashhadi Iterative method for simultaneous sparse approximation, In: arXiv.org

DOI: 10.48550/arxiv.1707.08310

This paper studies the problem of Simultaneous Sparse Approximation (SSA). This problem arises in many applications which work with multiple signals maintaining some degree of dependency such as radar and sensor networks. In this paper, we introduce a new method towards joint recovery of several independent sparse signals with the same support. We provide an analytical discussion on the convergence of our method called Simultaneous Iterative Method with Adaptive Thresholding (SIMAT). Additionally, we compare our method with other group-sparse reconstruction techniques, i.e., Simultaneous Orthogonal Matching Pursuit (SOMP), and Block Iterative Method with Adaptive Thresholding (BIMAT) through numerical experiments. The simulation results demonstrate that SIMAT outperforms these algorithms in terms of the metrics Signal to Noise Ratio (SNR) and Success Rate (SR). Moreover, SIMAT is considerably less complicated than BIMAT, which makes it feasible for practical applications such as implementation in MIMO radar systems.

Reza Kazemi, Mahdi Boloursaz Mashhadi, Mohsen Heidari Khoozani, Fereydoon Behnia (2015)Modem based on sphere packing techniques in high-dimensional Euclidian sub-space for efficient data over voice communication through mobile voice channels, In: IET communications9(4)pp. 508-516 The Institution of Engineering and Technology

DOI: 10.1049/iet-com.2014.0610

The increased penetration of cellular networks has made voice channels widely available ubiquitously. On the other hand, mobile voice channels possess properties that make them an ideal choice for high priority, low-rate real-time communications. Mobile voice channel with the mentioned properties, could be utilised in emergency applications in vehicular communications area such as the standardised emergency call system planned to be launched in 2015. This study aims to investigate the challenges of data transmission through these channels and proposes an efficient data transfer structure. To this end, a proper statistical model for the channel distortion is proposed and an optimum detector is derived considering the proposed channel model. Optimum symbols are also designed according to the derived rule and analytical bounds on error probability are obtained for the orthogonal signaling and sphere packing techniques. Moreover, analytical evaluation is performed and appropriate simulation results are presented. Finally, it is observed that the proposed structure based on the sphere packing technique achieves superior performance compared with prior works in this field. Although the ideas offered in this study are utilised to cope with voice channel non-idealities, the steps taken in this study could also be applied to channels with similar conditions.

Mahdi Boloursaz Mashhadi, Hadi Zayyani, Saeed Gazor, Farokh Marvasti (2018)Iterative Reconstruction of Spectrally Sparse Signals from Level Crossings, In: 2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO)2018-8553367pp. 435-439 IEEE

DOI: 10.23919/EUSIPCO.2018.8553367

This paper considers the problem of sparse signal reconstruction from the timing of its Level Crossings (LC)s. We formulate the sparse Zero Crossing (ZC) reconstruction problem in terms of a single 1-bit Compressive Sensing (CS) model. We also extend the Smoothed L0 (SL0) sparse reconstruction algorithm to the 1-bit CS framework and propose the Binary SL0 (BSL0) algorithm for iterative reconstruction of the sparse signal from ZCs in cases where the number of sparse coefficients is not known to the reconstruction algorithm a priori. Similar to the ZC case, we propose a system of simultaneously constrained signed-CS problems to reconstruct a sparse signal from its Level Crossings (LC) s and modify both the Binary Iterative Hard Thresholding (BIHT) and BSL0 algorithms to solve this problem. Simulation results demonstrate superior performance of the proposed LC reconstruction techniques in comparison with the literature.

M. Boloursaz, A. H. Hadavi, R. Kazemi, F. Behnia (2012)Secure data communication through GSM Adaptive Multi Rate voice channel, In: 6th International Symposium on Telecommunications (IST)6483136pp. 1021-1026 IEEE

DOI: 10.1109/ISTEL.2012.6483136

This paper considers the problem of digital data transmission through the Global System for Mobile communications (GSM) for security applications. A data modem is presented that utilizes codebooks of Speech-Like (SL) symbols to transmit data through the GSM Adaptive Multi Rate (AMR) voice codec. Using this codebook of finite alphabet, the continuous vocoder channel is modeled by a Discrete Memory less Channel (DMC). A heuristic optimization algorithm is proposed to select codebook symbols from a database of observed human speech such that the capacity of DMC is maximized. Using the DMC capacity, a lower bound on the capacity of the considered voice channel can be achieved. Simulation results show that the proposed data modem achieves higher data rates and lower symbol error rates compared to previously reported results while requiring lower computational complexity for codebook optimization.

Mahdi Boloursaz Mashhadi, Mahmoud Essalat, Mohammad Ahmadi, Farokh Marvasti (2016)An improved algorithm for Heart Rate tracking during physical exercise using simultaneous wrist-type photoplethysmographic (PPG) and acceleration signals, In: 2016 23rd Iranian Conference on Biomedical Engineering and 2016 1st International Iranian Conference on Biomedical Engineering (ICBME)7890946pp. 146-149 IEEE

DOI: 10.1109/ICBME.2016.7890946

Causal Heart Rate (HR) monitoring using photoplethysmographic (PPG) signals recorded from wrist during physical exercise is a challenging task because the PPG signals in this scenario are highly contaminated by artifacts caused by hand movements of the subject. This paper proposes a novel algorithm for this problem, which consists of two main blocks of Noise Suppression and Peak Selection. The Noise Suppression block removes Motion Artifacts (MAs) from the PPG signals utilizing simultaneously recorded 3D acceleration data. The Peak Selection block applies some decision mechanisms to correctly select the spectral peak corresponding to HR in PPG spectra. Experimental results on benchmark dataset recorded from 12 subjects during fast running at the peak speed of 15 km/hour showed that the proposed algorithm achieves an average absolute error of 1.50 beats per minute (BPM), which outperforms state of the art.

Mahdi Boloursaz Mashhadi, Majid Farhadi, Mahmoud Essalat, Farokh Marvasti (2018)LOW COMPLEXITY HEART RATE MEASUREMENT FROM WEARABLE WRIST-TYPE PHOTOPLETHYSMOGRAPHIC SENSORS ROBUST TO MOTION ARTIFACTS, In: 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)2018-8461520pp. 921-924 IEEE

DOI: 10.1109/ICASSP.2018.8461520

This paper presents a low complexity while accurate Heart Rate (HR) estimation technique from signals captured by Photoplethysmographic (PPG) sensors worn on the wrist during intensive physical exercise. Wrist-type PPG signals experience severe Motion Artifacts (MA) that hinder efficient HR estimation especially during intensive physical exercises. To suppress the motion artifacts efficiently, simultaneous 3 dimensional acceleration signals are used as reference MAs. The proposed method achieves an Average Absolute Error (AAE) of 1.19 Beats Per Minute (BPM) on the 12 benchmark PPG recordings in which subjects run at speeds of up to 15 km/h. This method also achieves an AAE of 2.17 BPM on the whole benchmark database of 23 recordings that include both running and arm movement activities. This performance is comparable with state-of-the-art algorithms while at a significantly reduced computational cost which makes its standalone implementation on wearable devices feasible. The proposed algorithm achieves an average processing time of 32 milliseconds per input frames of length 8 seconds (2 channel PPG and 3D ACC signals) on a 3.2 GHz processor.

Mahmoud Essalat, Mahdi Boloursaz Mashhadi, Farokh Marvasti, Mahdi Boloursaz Mashhadi (2016)Supervised heart rate tracking using wrist-type photoplethysmographic (PPG) signals during physical exercise without simultaneous acceleration signals, In: 2016 IEEE Global Conference on Signal and Information Processing (GlobalSIP)7906025pp. 1166-1170 IEEE

DOI: 10.1109/GlobalSIP.2016.7906025

PPG based heart rate (HR) monitoring has recently attracted much attention with the advent of wearable devices such as smart watches and smart bands. However, due to severe motion artifacts (MA) caused by wristband stumbles, PPG based HR monitoring is a challenging problem in scenarios where the subject performs intensive physical exercises. This work proposes a novel approach to the problem based on supervised learning by Neural Network (NN). By simulations on the benchmark datasets [1], we achieve acceptable estimation accuracy and improved run time in comparison with the literature. A major contribution of this work is that it alleviates the need to use simultaneous acceleration signals. The simulation results show that although the proposed method does not process the simultaneous acceleration signals, it still achieves the acceptable Mean Absolute Error (MAE) of 1.39 Beats Per Minute (BPM) on the benchmark data set.

S Sadrizadeh, Sh Kianidehkordi, M Mashhadi, F Marvasti, Mahdi Boloursaz Mashhadi (2019)Iterative method for simultaneous sparse approximation, In: Scientia Iranica. Transaction D, Computer science & engineering, electrical engineering26(3)pp. 1601-1607 Sharif University of Technology

DOI: 10.24200/sci.2018.5564.1347

This paper studies the problem of Simultaneous Sparse Approximation (SSA). This problem arises in many applications that work with multiple signals maintaining some degree of dependency, e.g., radar and sensor networks. We introduce a new method towards joint recovery of several independent sparse signals with the same support. We provide an analytical discussion of the convergence of our method, called Simultaneous Iterative Method (SIM). In this study, we compared our method with other group-sparse reconstruction techniques, namely Simultaneous Orthogonal Matching Pursuit (SOMP) and Block Iterative Method with Adaptive Thresholding (BIMAT), through numerical experiments. The simulation results demonstrated that SIM outperformed these algorithms in terms of the metrics Signal to Noise Ratio (SNR) and Success Rate (SR). Moreover, SIM is considerably less complicated than BIMAT, which makes it feasible for practical applications such as implementation in MIMO radar systems.

Mahdi Boloursaz Mashhadi, Fereidoon Behnia (2016)Efficient codebook design for digital communication through compressed voice channels, In: IET communications10(18)pp. 2613-2620 The Institution of Engineering and Technology

DOI: 10.1049/iet-com.2015.0827

The common voice channels existing in cellular communication networks provide reliable, ubiquitously available and top priority communication mediums. These properties make voice dedicated channels an ideal choice for high priority, real time communication. However, such channels include voice codecs that hamper the data flow by compressing the waveforms prior to transmission. This study designs codebooks of speech-like symbols for reliable data transfer through the voice channel of cellular networks. An efficient algorithm is proposed to select proper codebook symbols from a database of natural speech to optimise a desired objective. Two variants of this codebook optimisation algorithm are presented: One variant minimises the symbol error rate and the other maximises the capacity achievable by the codebook. It is shown both analytically and by the simulation results that under certain circumstances, these two objective functions reach the same performance. Simulation results also show that the proposed codebook optimisation algorithm achieves higher data rates and lower symbol error rates compared with previously reported results while requiring lower computational complexity for codebook optimisation. The Gilbert–Elliot channel model is utilised to study the effects of adaptive compression rate adjustment of the vocoder on overall voice channel capacity. Finally, practical implementation issues are addressed.

Mahdi Boloursaz Mashhadi, Nikan Salarieh, Ehsan Shahrabi Farahani, Farokh Marvasti, Mahdi Boloursaz Mashhadi (2017)Level crossing speech sampling and its sparsity promoting reconstruction using an iterative method with adaptive thresholding, In: IET signal processing11(6)721pp. 721-726 Inst Engineering Technology-Iet

DOI: 10.1049/iet-spr.2016.0569

The authors propose asynchronous level crossing (LC) A/D converters for low redundancy voice sampling. They propose to utilise the family of iterative methods with adaptive thresholding (IMAT) for reconstructing voice from non-uniform LC and adaptive LC (ALC) samples thereby promoting sparsity. The authors modify the basic IMAT algorithm and propose the iterative method with adaptive thresholding for level crossing (IMATLC) algorithm for improved reconstruction performance. To this end, the authors analytically derive the basic IMAT algorithm by applying the gradient descent and gradient projection optimisation techniques to the problem of square error minimisation subjected to sparsity. The simulation results indicate that the proposed IMATLC reconstruction method outperforms the conventional reconstruction method based on low-pass signal assumption by 6.56dBs in terms of reconstruction signal-to-noise ratio (SNR) for LC sampling. In this scenario, IMATLC outperforms orthogonal matching pursuit, least absolute shrinkage and selection operator and smoothed L0 sparsity promoting algorithms by average amounts of 12.13, 10.31, and 10.28dBs, respectively. Finally, the authors compare the performance of the proposed LC/ALC-based A/Ds with the conventional uniform sampling-based A/Ds and their random sampling-based counterparts both in terms of perceptual evaluation of speech quality and reconstruction SNR.

Mahdi Boloursaz Mashhadi, Maryam Fallah, Farokh Marvasti (2017)Interpolation of sparse graph signals by sequential Adaptive Thresholds, In: 2017 International Conference on Sampling Theory and Applications (SampTA)8024339pp. 266-270 IEEE

DOI: 10.1109/SAMPTA.2017.8024339

This paper considers the problem of interpolating signals defined on graphs. A major presumption considered by many previous approaches to this problem has been low-pass/band-limitedness of the underlying graph signal. However, inspired by the findings on sparse signal reconstruction, we consider the graph signal to be rather sparse/compressible in the Graph Fourier Transform (GFT) domain and propose the Iterative Method with Adaptive Thresholding for Graph Interpolation (IMATGI) algorithm for sparsity promoting interpolation of the underlying graph signal. We analytically prove convergence of the proposed algorithm. We also demonstrate efficient performance of the proposed IMATGI algorithm in reconstructing randomly generated sparse graph signals. Finally, we consider the widely desirable application of recommendation systems and show by simulations that IMATGI outperforms state-of-the-art algorithms on the benchmark datasets in this application.

R. Kazemi, R. Mosayebi, S. M. Etemadi, M. Boloursaz, F. Behnia, Mahdi Boloursaz Mashhadi (2012)A lower capacity bound of secure end to end data transmission via GSM network, In: 6th International Symposium on Telecommunications (IST)6483135pp. 1015-1020 IEEE

DOI: 10.1109/ISTEL.2012.6483135

Global System for Mobile communications (GSM) is a widely spread, reliable and P2P channel through all over the world. These characteristics make GSM a channel suitable for a variety of applications in different domains especially security applications such as secure voice communication. Performance and usage of GSM applications extremely depends on the transmission data rate. Hence, transmitting data over GSM is still an attractive topic for research. This paper considers the problem of digital data transmission through the GSM voice channel. A lower capacity bound for data transmission through the GSM Adaptive Multi Rate (AMR) voice codec is presented. The GSM channel is modeled in a simple manner to overcome it's memory and non-linearity effects. A new statistic based on received samples is extracted and a novel method to transmit data over that channel which asymptotically tends to the achieved lower bound is offered.

M. Boloursaz, R. Kazemi, D. Nashtaali, M. Nasiri, F. Behnia (2013)Secure data over GSM based on algebraic codebooks, In: East-West Design & Test Symposium (EWDTS 2013)6673148pp. 1-4 IEEE

DOI: 10.1109/EWDTS.2013.6673148

This paper considers the problem of secure data communication through the Global System for Mobile communications (GSM). The algebraic codebook method for data transmission through the Adaptive Multi Rate 12.2Kbps voice channel is investigated and its maximum achievable data rate is calculated. Based on the vocoder channel properties, the method's Bit Error Rate (BER) performance is improved by repetition coding and classification methods. Simulation results show that by simultaneous application of repetition coding and clustering methods, the decoder's performance improves about 6.5% compared to the case of no clustering for 1Kbps data communication in AMR 4.75 voice codec.

M. Boloursaz, R. Kazemi, B. Barazandeh, F. Behnia (2014)Bounds on Compressed Voice Channel Capacity, In: 2014 IRAN WORKSHOP ON COMMUNICATION AND INFORMATION THEORY (IWCIT)6842487pp. 1-6 IEEE

DOI: 10.1109/IWCIT.2014.6842487

The voice channels present in cellular communication networks provide reliable, widespread and high priority communication mediums. Using these voice channels as a bearer for data transmission allows to deliver high Quality of Service data. But voice channels include vocoders that hinder the data flow by compressing the waveforms prior to transmission. Calculating vocoder channel capacity remains a challenging problem since no analytical model has been proposed for the vocoder channel so far. In this research, simplified models for the vocoder channel are proposed and bounds on vocoder channel capacity are derived based on them. In common cellular networks, the vocoder compression rate is adjusted adaptively according to the network's traffic conditions which further complicates calculating an overall capacity for the voice channel. In this research, the Gilbert-Elliot channel model is applied to the cellular voice channel to enable the study of the effect of adaptive vocoder rate adjustment on overall voice channel capacity. Modeling the voice channel and calculating its capacity provides reference bounds for comparison with any newly proposed communication scheme over this channel.

Mahdi Boloursaz Mashhadi, Saeed Gazor, Nazanin Rahnavard, Farokh Marvasti (2018)Feedback Acquisition and Reconstruction of Spectrum-Sparse Signals by Predictive Level Comparisons, In: IEEE signal processing letters25(4)pp. 496-500 IEEE

DOI: 10.1109/LSP.2018.2801836

In this letter, we propose a sparsity promoting feedback acquisition and reconstruction scheme for sensing, encoding and subsequent reconstruction of spectrally sparse signals. In the proposed scheme, the spectral components are estimated utilizing a sparsity-promoting, sliding-window algorithm in a feedback loop. Utilizing the estimated spectral components, a level signal is predicted and sign measurements of the prediction error are acquired. The sparsity promoting algorithm can then estimate the spectral components iteratively from the sign measurements. Unlike many batch-based compressive sensing algorithms, our proposed algorithm gradually estimates and follows slow changes in the sparse components utilizing a sliding-window technique. We also consider the scenario in which possible flipping errors in the sign bits propagate along iterations (due to the feedback loop) during reconstruction. We propose an iterative error correction algorithm to cope with this error propagation phenomenon considering a binary-sparse occurrence model on the error sequence. Simulation results show effective performance of the proposed scheme in comparison with the literature.

M. Boloursaz, A. H. Hadavi, R. Kazemi, F. Behnia, Mahdi Boloursaz Mashhadi (2013)A data modem for GSM Adaptive Multi Rate voice channel, In: East-West Design & Test Symposium (EWDTS 2013)6673152pp. 1-4 IEEE

DOI: 10.1109/EWDTS.2013.6673152

This paper considers the problem of digital data transmission through the Global System for Mobile communications (GSM). A data modem is presented that utilizes codebooks of Speech-Like (SL) symbols to transmit data through the GSM Adaptive Multi Rate (AMR) voice codec. Using this codebook of finite alphabet, the continuous vocoder channel is modeled by a Discrete Memory less Channel (DMC). A heuristic optimization algorithm is proposed to select codebook symbols from a database of observed human speech such that the capacity of DMC is maximized. Simulation results show that the proposed data modem achieves higher data rates and lower symbol error rates compared to previously reported results while requiring lower computational complexity for codebook optimization.

Mandi Boloursaz Mashhadi, Qianqian Yang, Deniz Gunduz, Mahdi Boloursaz Mashhadi (2020)CNN-BASED ANALOG CSI FEEDBACK IN FDD MIMO-OFDM SYSTEMS, In: 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING2020-9053850pp. 8579-8583 IEEE

DOI: 10.1109/icassp40776.2020.9053850

Massive multiple-input multiple-output (MIMO) systems require downlink channel state information (CSI) at the base station (BS) to better utilize the available spatial diversity and multiplexing gains. However, in a frequency division duplex (FDD) massive MIMO system, CSI feedback overhead degrades the overall spectral efficiency. Deep Learning (DL)based CSI feedback compression schemes have received a lot of attention recently as they provide significant improvements in compression efficiency; however, they still require reliable feedback links to convey the compressed CSI information to the BS. Instead, we propose here a Convolutional neural network (CNN)-based analog feedback scheme, called AnalogDeepCMC, which directly maps the downlink CSI to uplink channel input. Corresponding noisy channel outputs are used by another CNN to reconstruct the downlink channel estimate. The proposed analog scheme not only outperforms existing digital CSI feedback schemes in terms of the achievable downlink rate, but also simplifies the feedback transmission as it does not require explicit quantization, coding, and modulation, and provides a low-latency alternative particularly in rapidly changing MIMO channels, where the CSI needs to be estimated and fed back periodically.

Tatsuya Kikuzuki, Mahdi Boloursaz Mashhadi, Yi Ma, Rahim Tafazolli (2023)Feature Selection for Automated QoE Prediction, In: 2023 IEEE 34th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)pp. 1-6 IEEE

DOI: 10.1109/PIMRC56721.2023.10293923

—With the huge number of broadband users, automated network management becomes of huge interest to service providers. A major challenge is automated monitoring of user Quality of Experience (QoE), where Artificial Intelligence (AI) and Machine Learning (ML) models provide powerful tools to predict user QoE from basic protocol indicators such as Round Trip Time (RTT), retransmission rate, etc. In this paper, we introduce an effective feature selection method along with the corresponding classification algorithms to address this challenge. The simulation results show a prediction accuracy of 78% on the benchmark ITU ML5G-PS-012 dataset, improving 11% over the state-of-the-art result whilst reducing the model complexity at the same time. Moreover, we show that the local area network round trip time (LAN RTT) value during daytime and midweek plays the most prominent factor affecting the user QoE.

Samaneh Aghashahi, Aliakbar Tadaion, Zolfa Zeinalpour-Yazdi, Mahdi Boloursaz Mashhadi, Ahmed Elzanaty (2023)EMF-aware Energy Efficient MU-SIMO Systems with Multiple RISs, In: IEEE Transactions on Vehicular Technology IEEE

DOI: 10.1109/TVT.2023.3339830

In this paper, we consider an uplink transmission of a multiuser single-input multiple-output (SIMO) system assisted with multiple reconfigurable intelligent surfaces (RISs). We investigate the energy efficiency (EE) maximization problem with an electromagnetic field (EMF) exposure constraint. In order to solve the problem, we present a lower bound for the EE and adopt an alternate optimization problem. Then, we propose the Energy Efficient Multi-RIS (EEMR) algorithm to obtain the optimal transmit power of the users and phase shifts of the RISs. Moreover, we study this problem for a system with a central RIS and compare the results. The simulation results show that for a sufficient total number of RIS elements, the system with distributed RISs is more energy efficient compared to the system with a central RIS. In addition, for both the systems the EMF exposure constraints enforce a trade-off between the EE and EMF-awareness of the system.

Somayeh Aghashahi, Zolfa Zeinalpour-Yazdi, Aliakbar Tadaion Aliakbar Tadaion, Mahdi Boloursaz Mashhadi, Ahmed Elzanaty (2023)MU-Massive MIMO with Multiple RISs: SINR Maximization and Asymptotic Analysis, In: IEEE Wireless Communications Letters Institute of Electrical and Electronics Engineers (IEEE)

DOI: 10.1109/LWC.2023.3256187

In this letter, we investigate the signal-to-interference-plus-noise-ratio (SINR) maximization problem in a multi-user massive multiple-input-multiple-output (massive MIMO) system enabled with multiple reconfigurable intelligent surfaces (RISs). We examine two zero-forcing (ZF) beamforming approaches for interference management namely BS-UE-ZF and BS-RIS-ZF that enforce the interference to zero at the users (UEs) and the RISs, respectively. Then, for each case, we resolve the SINR maximization problem to find the optimal phase shifts of the elements of the RISs. Also, we evaluate the asymptotic expressions for the optimal phase shifts and the maximum SINRs when the number of the base station (BS) antennas tends to infinity. We show that if the channels of the RIS elements are independent and the number of the BS antennas tends to infinity, random phase shifts achieve the maximum SINR using the BS-UE-ZF beamforming approach. The simulation results illustrate that by employing the BS-RIS-ZF beamforming approach, the asymptotic expressions of the phase shifts and maximum SINRs achieve the rate obtained by the optimal phase shifts even for a small number of the BS antennas.

Dr. Mahdi Boloursaz Mashhadi, Qianqian Yang, Deniz Gunduz (2021)Distributed Deep Convolutional Compression for Massive MIMO CSI Feedback, In: IEEE transactions on wireless communications20(4)pp. 2621-2633 IEEE

DOI: 10.1109/TWC.2020.3043502

Massive multiple-input multiple-output (MIMO) systems require downlink channel state information (CSI) at the base station (BS) to achieve spatial diversity and multiplexing gains. In a frequency division duplex (FDD) multiuser massive MIMO network, each user needs to compress and feedback its downlink CSI to the BS. The CSI overhead scales with the number of antennas, users and subcarriers, and becomes a major bottleneck for the overall spectral efficiency. In this paper, we propose a deep learning (DL)-based CSI compression scheme, called DeepCMC , composed of convolutional layers followed by quantization and entropy coding blocks. In comparison with previous DL-based CSI reduction structures, DeepCMC proposes a novel fully-convolutional neural network (NN) architecture, with residual layers at the decoder, and incorporates quantization and entropy coding blocks into its design. DeepCMC is trained to minimize a weighted rate-distortion cost, which enables a trade-off between the CSI quality and its feedback overhead. Simulation results demonstrate that DeepCMC outperforms the state of the art CSI compression schemes in terms of the reconstruction quality of CSI for the same compression rate. We also propose a distributed version of DeepCMC for a multi-user MIMO scenario to encode and reconstruct the CSI from multiple users in a distributed manner. Distributed DeepCMC not only utilizes the inherent CSI structures of a single MIMO user for compression, but also benefits from the correlations among the channel matrices of nearby users to further improve the performance in comparison with DeepCMC. We also propose a reduced-complexity training method for distributed DeepCMC, allowing to scale it to multiple users, and suggest a cluster-based distributed DeepCMC approach for practical implementation.

Sotiris Chatzimiltis, Mohammad Shojafar, Mahdi Boloursaz Mashhadi, Rahim Tafazolli (2024)A Collaborative Software Defined Network-based Smart Grid Intrusion Detection System, In: IEEE open journal of the Communications Society5pp. 700-711 IEEE

DOI: 10.1109/OJCOMS.2024.3351088

Current technological advancements in Software Defined Networks (SDN) can provide efficient solutions for smart grids (SGs). An SDN-based SG promises to enhance the efficiency, reliability and sustainability of the communication network. However, new security breaches can be introduced with this adaptation. A layer of defence against insider attacks can be established using machine learning based intrusion detection system (IDS) located on the SDN application layer. Conventional centralised practises, violate the user data privacy aspect, thus distributed or collaborative approaches can be adapted so that attacks can be detected and actions can be taken. This paper proposes a new SDN-based SG architecture, highlighting the existence of IDSs in the SDN application layer. We implemented a new smart meter (SM) collaborative intrusion detection system (SM-IDS), by adapting the split learning methodology. Finally, a comparison of a federated learning and split learning neighbourhood area network (NAN) IDS was made. Numerical results showed, a five class classification accuracy of over 80.3% and F1-score 78.9 for a SM-IDS adapting the split learning technique. Also, the split learning NAN-IDS exhibit an accuracy of over 81.1% and F1-score 79.9.

Mahdi Boloursaz Mashhadi, Deniz Gunduz, Alberto Perotti, Branislav M. Popovic (2023)DRF codes: Deep SNR-robust feedback codes, In: ITU Journal on Future and Evolving Technologies4(3)pp. 447-460 International Telecommunication Union

DOI: 10.52953/DAPE6014

We present a new Deep Neural Network (DNN)-based error correction code for fading channels with output feedback, called the Deep SNR-Robust Feedback (DRF) code. At the encoder, parity symbols are generated by a Long Short Term Memory (LSTM) network based on the message, as well as the past forward channel outputs observed by the transmitter in a noisy fashion. The decoder uses a bidirectional LSTM architecture along with a Signal to Noise Ratio (SNR)-aware attention NN to decode the message. The proposed code overcomes two major shortcomings of DNN-based codes over channels with passive output feedback: (i) the SNR-aware attention mechanism at the decoder enables reliable application of the same trained NN over a wide range of SNR values; (ii) curriculum training with batch size scheduling is used to speed up and stabilize training while improving the SNR-robustness of the resulting code. We show that the DRF codes outperform the existing DNN-based codes in terms of both the SNR-robustness and the error rate in an Additive White Gaussian Noise (AWGN) channel with noisy output feedback. In fading channels with perfect phase compensation at the receiver, DRF codes learn to efficiently exploit knowledge of the instantaneous fading amplitude (which is available to the encoder through feedback) to reduce the overhead and complexity associated with channel estimation at the decoder. Finally, we show the effectiveness of DRF codes in multicast channels with feedback, where linear feedback codes are known to be strictly suboptimal. These results show the feasibility of automatic design of new channel codes using DNN-based language models.

Katarina Vuckovic, Mahdi Boloursaz Mashhadi, Farzam Hejazi, Nazanin Rahnavard, Ahmed Alkhateeb (2023)PARAMOUNT: Towards generalizable deeP leARning for mmwAve beaM selectiOn using sUb-6GHz chaNnel measuremenTs, In: IEEE Transactions on Wireless Communications Institute of Electrical and Electronics Engineers (IEEE)

DOI: 10.1109/TWC.2023.3324916

Deep neural networks (DNNs) in the wireless communication domain have been shown to be hardly generalizable to scenarios where the train and test datasets follow a different distribution. This lack of generalization poses a significant hurdle to the practical utilization of DNNs in wireless communication. In this paper, we propose a generalizable deep learning approach for millimeter wave (mmWave) beam selection using sub-6 GHz channel state information (CSI) measurements, referred to as PARAMOUNT. First, we provide a detailed discussion on physical aspects of the electromagnetic wave scattering in the mmWave and sub-6 GHz bands. Based on this discussion, we develop the augmented discrete angle delay profile (ADADP) which is a novel linear transformation for the sub-6 GHz CSI that extracts the angle-delay attributes and provides a semantic visual representation of the multi-path clusters. Next, we introduce a convolutional neural network (CNN) structure that can learn the signatures of the path clusters in the sub-6 GHz ADADP representation and transform it to mmWave band beam indices. We demonstrate by extensive simulations on several different datasets that PARAMOUNT can generalize beyond the training dataset which is mainly due to transfer learning principles that allow transferring information from previously learned tasks to the learning of new unseen tasks.

Mahdi Boloursaz Mashhadi, Mahnoosh Mahdavimoghadam, Rahim Tafazolli, Walid Saad (2023)Collaborative Learning with a Drone Orchestrator, In: IEEE Transactions on Vehicular Technologypp. 1-12 Institute of Electrical and Electronics Engineers (IEEE)

DOI: 10.1109/TVT.2023.3303630

In this paper, the problem of drone-assisted collaborative learning is considered. In this scenario, swarm of intelligent wireless devices train a shared neural network (NN) model with the help of a drone. Using its sensors, each device records samples from its environment to gather a local dataset for training. The training data is severely heterogeneous as various devices have different amount of data and sensor noise level. The intelligent devices iteratively train the NN on their local datasets and exchange the model parameters with the drone for aggregation. For this system, the convergence rate of collaborative learning is derived while considering data heterogeneity, sensor noise levels, and communication errors, then, the drone trajectory that maximizes the final accuracy of the trained NN is obtained. The proposed trajectory optimization approach is aware of both the devices data characteristics (i.e., local dataset size and noise level) and their wireless channel conditions, and significantly improves the convergence rate and final accuracy in comparison with baselines that only consider data characteristics or channel conditions. Compared to state-of-the-art baselines, the proposed approach achieves an average 3.85 improvement in the final accuracy of the trained NN on benchmark datasets for image recognition and semantic segmentation tasks, respectively. Moreover, the proposed framework achieves a significant speedup in training, leading to an average 24% and 87% saving in the drone's hovering time, communication overhead, and battery usage, respectively for these tasks.

Mahdi Boloursaz Mashhadi, Nir Shlezinger, Yonina C. Eldar, Deniz Gunduz, Mahdi Boloursaz Mashhadi (2021)Fedrec: Federated Learning of Universal Receivers Over Fading Channels, In: 2021 IEEE Statistical Signal Processing Workshop (SSP)2021-9513736pp. 576-580 IEEE

DOI: 10.1109/SSP49050.2021.9513736

Wireless communications is often subject to channel fading. Various statistical models have been proposed to capture the inherent randomness in fading, and conventional model-based receiver designs rely on accurate knowledge of this underlying distribution, which, in practice, may be complex and intractable. In this work, we propose a neural network-based symbol detection technique for down-link fading channels, which is based on the maximum a-posteriori probability (MAP) detector. To enable training on a diverse ensemble of fading realizations, we propose a federated training scheme, in which multiple users collaborate to jointly learn a universal data-driven detector, hence the name FedRec. The performance of the resulting receiver is shown to approach the MAP performance in diverse channel conditions without requiring knowledge of the fading statistics, while inducing a substantially reduced communication overhead in its training procedure compared to centralized training.

Matteo Zecchin, Mahdi Boloursaz Mashhadi, Mikolaj Jankowski, Deniz Gunduz, Marios Kountouris, David Gesbert (2022)LIDAR and Position-Aided mmWave Beam Selection with Non-local CNNs and Curriculum Training, In: IEEE Transactions on Vehicular Technology Institute of Electrical and Electronics Engineers (IEEE)

DOI: 10.1109/TVT.2022.3142513

Efficient millimeter wave (mmWave) beam selection in vehicle-to-infrastructure (V2I) communication is a crucial yet challenging task due to the narrow mmWave beamwidth and high user mobility. To reduce the search overhead of iterative beam discovery procedures, contextual information from light detection and ranging (LIDAR) sensors mounted on vehicles has been leveraged by data-driven methods to produce useful side information. In this paper, we propose a lightweight neural network (NN) architecture along with the corresponding LIDAR preprocessing, which significantly outperforms previous works. Our solution comprises multiple novelties that improve both the convergence speed and the final accuracy of the model. In particular, we define a novel loss function inspired by the knowledge distillation idea, introduce a curriculum training approach exploiting line-of-sight (LOS)/non-line-of-sight (NLOS) information, and we propose a non-local attention module to improve the performance for the more challenging NLOS cases. Simulation results on benchmark datasets show that utilizing solely LIDAR data and the receiver position, our NN-based beam selection scheme can achieve 79.9% throughput of an exhaustive beam sweeping approach without any beam search overhead and 95% by searching among as few as 6 beams. In a typical mmWave V2I scenario, our proposed method considerably reduces the beam search time required to achieve a desired throughput, in comparison with the inverse fingerprinting and hierarchical beam selection schemes.

Mahdi Boloursaz Mashhadi, Deniz Gunduz (2020)Deep Learning for Massive MIMO Channel State Acquisition and Feedback, In: Journal of the Indian Institute of Science100(2)pp. 369-382 Springer Nature

DOI: 10.1007/s41745-020-00169-2

Massive multiple-input multiple-output (MIMO) systems are a main enabler of the excessive throughput requirements in 5G and future generation wireless networks as they can serve many users simultaneously with high spectral and energy efficiency. To achieve this massive MIMO systems require accurate and timely channel state information (CSI), which is acquired by a training process that involves pilot transmission, CSI estimation, and feedback. This training process incurs a training overhead, which scales with the number of antennas, users, and subcarriers. Reducing the training overhead in massive MIMO systems has been a major topic of research since the emergence of the concept. Recently, deep learning (DL)-based approaches have been proposed and shown to provide significant reduction in the CSI acquisition and feedback overhead in massive MIMO systems compared to traditional techniques. In this paper, we present an overview of the state-of-the-art DL architectures and algorithms used for CSI acquisition and feedback, and provide further research directions.

Dr. Mahdi Boloursaz Mashhadi, Deniz Gunduz (2021)Pruning the Pilots: Deep Learning-Based Pilot Design and Channel Estimation for MIMO-OFDM Systems, In: IEEE transactions on wireless communications20(10)pp. 6315-6328 IEEE

DOI: 10.1109/TWC.2021.3073309

With the large number of antennas and subcarriers the overhead due to pilot transmission for channel estimation can be prohibitive in wideband massive multiple-input multiple-output (MIMO) systems. This can degrade the overall spectral efficiency significantly, and as a result, curtail the potential benefits of massive MIMO. In this paper, we propose a neural network (NN)-based joint pilot design and downlink channel estimation scheme for frequency division duplex (FDD) MIMO orthogonal frequency division multiplex (OFDM) systems. The proposed NN architecture uses fully connected layers for frequency-aware pilot design, and outperforms linear minimum mean square error (LMMSE) estimation by exploiting inherent correlations in MIMO channel matrices utilizing convolutional NN layers. Our proposed NN architecture uses a non-local attention module to learn longer range correlations in the channel matrix to further improve the channel estimation performance.We also propose an effective pilot reduction technique by gradually pruning less significant neurons from the dense NN layers during training. This constitutes a novel application of NN pruning to reduce the pilot transmission overhead. Our pruning-based pilot reduction technique reduces the overhead by allocating pilots across subcarriers non-uniformly and exploiting the inter-frequency and inter-antenna correlations in the channel matrix efficiently through convolutional layers and attention module.

Mahdi Boloursaz Mashhadi, Mikolaj Jankowski, Tze-Yang Tung, Szymon Kobus, Deniz Gunduz (2021)Federated mmWave Beam Selection Utilizing LIDAR Data, In: IEEE Wireless Communications Letters10(10)pp. 2269-2273 Institute of Electrical and Electronics Engineers (IEEE)

DOI: 10.1109/LWC.2021.3099136

Efficient link configuration in millimeter wave (mmWave) communication systems is a crucial yet challenging task due to the overhead imposed by beam selection. For vehicle-to-infrastructure (V2I) networks, side information from LIDAR sensors mounted on the vehicles has been leveraged to reduce the beam search overhead. In this letter, we propose a federated LIDAR aided beam selection method for V2I mmWave communication systems. In the proposed scheme, connected vehicles collaborate to train a shared neural network (NN) on their locally available LIDAR data during normal operation of the system. We also propose a reduced-complexity convolutional NN (CNN) classifier architecture and LIDAR preprocessing, which significantly outperforms previous works in terms of both the performance and the complexity.

Additional publications

For a comprehensive list of my publications please refer to my Google Scholar.