Deep Learning in Graph Domains for Sensorised Environments

Abstract

As our society moves swiftly towards an era where technology seamlessly integrates into our daily lives, our homes and cities are becoming increasingly sensorised. This change is fueled by advancements in artificial intelligence that facilitate harnessing the potential of smart environments. The main focus of this thesis is to investigate how Graph Neural Networks (GNNs) can be effectively applied to these environments, with a focus on those where humans and robots share the space. In these scenarios, integrating and exploiting data from multiple sources and analysing interactions between individuals, objects, sensors and robots is paramount. As the literature shows, GNNs have advantageous properties to process this kind of data when compared to more established deep learning approaches. This thesis presents a range of methods and applications in sensorised environments that leverage GNNs’ properties. The main contributions span applications in three main fields: human-aware robot navigation, human pose estimation, and the generation of traffic images. For human-aware navigation, this thesis proposes a model capable of estimating the level of discomfort caused by a robot’s presence among people and objects, considering not only the entities themselves but also the interactions happening. This model is later improved to yield discomfort maps that can be used as cost maps for motion planning. In the domain of human pose estimation, two different solutions are presented: a model capable of estimating the position and orientation of the people in the environment, and a multi-camera and multi-person 3D human full pose estimator. This last model, which does not require a labelled dataset for training, can be used for tracking people and feed their poses into the aforementioned cost map generator, as seen in the experimentation of this thesis. These works exhibit superior results in terms of precision, accuracy, and time efficiency when compared to similar state-of-the-art works. Finally, in the field of image generation, the thesis explores an application within the context of smart cities: generating realistic traffic images conditioned with graphs. This work leverages the strengths of GNNs when working with semantic data. The model can generate realistic images based on the properties of the items expected in them –namely their position, size and colour– and global properties such as the time of day. GNNs can be time-inefficient due to the added complexity of dealing with heterogeneously structured data. Consequently, the success of the applications presented in this thesis is the result of the effective integration of this networks, often in conjunction with other well-known approaches. One notable example is the fusion of convolutional networks with GNNs, which in this thesis leads to more efficient image generation when compared to pure GNN architectures. These methods constitute the central contribution of this thesis, as they allow GNNs to fully exploit their potential while mitigating inefficiencies.

Publication DOI: https://doi.org/10.48780/publications.aston.ac.uk.00046074
Divisions: College of Engineering & Physical Sciences > School of Computer Science and Digital Technologies > Applied AI & Robotics
Additional Information: Copyright © Daniel Rodriguez Criado, 2023. Daniel Rodriguez Criado asserts their moral right to be identified as the author of this thesis. This copy of the thesis has been supplied on condition that anyone who consults it is understood to recognise that its copyright rests with its author and that no quotation from the thesis and no information derived from it may be published without appropriate permission or acknowledgement. If you have discovered material in Aston Publications Explorer which is unlawful e.g. breaches copyright, (either yours or that of a third party) or any other law, including but not limited to those relating to patent, trademark, confidentiality, data protection, obscenity, defamation, libel, then please read our Takedown Policy and contact the service immediately.
Institution: Aston University
Uncontrolled Keywords: Graph Neural Networks,Sensorised Environments,Human-Aware Navigation,Multi-Camera Pose Estimation,Robotics,Image Generation
Last Modified: 19 Feb 2024 15:41
Date Deposited: 19 Feb 2024 15:41
Completed Date: 2023
Authors: Rodriguez-Criado, Daniel

Export / Share Citation


Statistics

Additional statistics for this record