Human Activity Analysis in Smart Environments

Abstract

Human Activity Recognition (HAR) in smart environments is central to Ambient Assisted Living (AAL), which focuses on supporting independent living, safety, and well-being for older adults and individuals requiring assistance. Vision-based HAR systems using cameras capture rich contextual data but face limited user acceptance due to privacy concerns. This thesis presents contributions to the design of vision-based HAR systems aimed at addressing end users’ concerns to engender trust and enhance acceptance. The first contribution addresses the lack of empirical evidence on user perspectives on vision-based HAR systems. An online survey explored the perspectives of prospective users, identifying several design requirements such as communication and transparency, system flexibility and output modes, reliability and interpretability, privacy and data security, user control, and clarity and simplicity. The second contribution presents skeleton-based HAR as a privacy-preserving alternative to RGB data. Two models are proposed. The Hierarchical Temporal Convolution Network (HT-ConvNet) introduces feature-weighted fusion and a method for capturing long-range temporal dependencies without added computational cost, improving both accuracy and efficiency. The Robust and Efficient Temporal Convolution Network (RE-TCN) combines adaptive temporal weighting, depthwise separable convolutions, and data augmentation to enhance robustness and efficiency in unstructured real-world settings. Both models outperform existing approaches on multiple datasets. The third contribution explores millimetre-wave (mmWave) radar as an alternative sensing modality. The proposed Stochastic Edge Network (SEdgeNet) enables direct activity recognition from sparse point clouds using adaptive neighbour sampling and efficient edge feature learning. Complementing this, the proposed mmPoint-PoseNet estimates 3D human poses from mmWave radar point clouds, providing 3D pose data to support downstream HAR. Both methods achieve state-of-the-art results on activity recognition and pose estimation, addressing the challenges of noise, sparsity, and irregularity in mmWave radar data. These contributions advance the design of HAR systems in smart environments by aligning technical development with user expectations. The work demonstrates how AAL systems can be designed to foster trust and enhance adoption.

Publication DOI: https://doi.org/10.48780/publications.aston.ac.uk.00048816
Divisions: College of Engineering & Physical Sciences > School of Computer Science and Digital Technologies
Additional Information: Copyright © Vincent Gbouna Zakka, 2025. Vincent Gbouna Zakka asserts their moral right to be identified as the author of this thesis. This copy of the thesis has been supplied on condition that anyone who consults it is understood to recognise that its copyright rests with its author and that no quotation from the thesis and no information derived from it may be published without appropriate permission or acknowledgement. If you have discovered material in Aston Publications Explorer which is unlawful e.g. breaches copyright, (either yours or that of a third party) or any other law, including but not limited to those relating to patent, trademark, confidentiality, data protection, obscenity, defamation, libel, then please read our Takedown Policy and contact the service immediately.
Institution: Aston University
Uncontrolled Keywords: Human Activity Recognition,Ambient Assisted Living,Skeleton-based Activity Recognition,mmWave Radar,Privacy Preservation,Temporal Convolution Networks,Graph Neural Networks,Pose Estimation,Computational Efficiency,User Acceptance
Last Modified: 16 Mar 2026 18:32
Date Deposited: 16 Mar 2026 18:30
Completed Date: 2025-09
Authors: Zakka, Vincent Gbouna
Thesis Supervisor: Dai, Zhuangzhuang
Manso, Luis J.

Export / Share Citation


Statistics

Additional statistics for this record