Understanding Domain Adaptation:Applications in Reinforcement Learning and Extrapolation

Abstract

Deep Learning has made impressive progress in a number of data processing domains. A large part of this progress comes from building large and complex models which are costly to train and, when data is limited, prone to over-fitting. We explore solutions to this problem through the domain adaptation paradigm. Domain adaptation assumes that although data may be visually dissimilar, drawn from differing distributions such as photographs versus paintings, they still contain the same content and so share a representation space. We propose a model for domain adaptation building on the recent concept of generative adversarial networks. We show how this model can be used for domain adaptation with applications to reinforcement learning. We further investigate the utility of the model for explicit extrapolation problems. Having proposed a model, we develop a further understanding of the conditions that lead to better shared embedding spaces. With this understanding, we propose a penalty term for the adversarial domain adaptation problem which we demonstrate achieves state of the art performance on a number of benchmark domain adaptation datasets. We also consider the problem of adaptation from the perspective of extrapolation along the sampling space. Domain adaptation research does not explicitly consider this case and we propose two new datasets to examine the concept. Further to this, we take a broader view of the potential applications of various transfer learning techniques and apply one-shot learning to a recently proposed extrapolation task. Through our extrapolation experiments we demonstrate the need for new datasets and testing protocols in order to appropriately verify model generalisation performance, which may otherwise be difficult to judge.

Additional Information: © Thomas Patrick Carr, 2020. Thomas Patrick Carr asserts their moral right to be identified as the author of this thesis. This copy of the thesis has been supplied on condition that anyone who consults it is understood to recognise that its copyright rests with its author and that no quotation from the thesis and no information derived from it may be published without appropriate permission or acknowledgement. If you have discovered material in Aston Publications Explorer which is unlawful e.g. breaches copyright, (either yours or that of a third party) or any other law, including but not limited to those relating to patent, trademark, confidentiality, data protection, obscenity, defamation, libel, then please read our Takedown Policy and contact the service immediately.
Institution: Aston University
Uncontrolled Keywords: Domain Adaptation,Deep Learning,Transfer Learning,Generalisation
Last Modified: 08 Dec 2023 08:59
Date Deposited: 23 May 2022 11:59
Completed Date: 2020-12
Authors: Carr, Thomas Patrick

Export / Share Citation


Statistics

Additional statistics for this record