Belmahi, O. (1997). On-line Learning in a Changing Environment. Masters thesis, Aston University.
Abstract
On-line learning of non-stationary tasks by two-layer neural networks is studied within the framework of statistical mechanics. A fully connected network with K hidden units and fixed hidden-to-output weights (a soft committee machine) learns a non-stationary task represented by a network of similar architecture having M hidden nodes. The network is trained via gradient descent (standard back-propagation) on randomly drawn inputs and the corresponding outputs generated by the teacher network representing the task. This work employs a general framework for the dynamics of on-line learning obtained earlier for the fixed environment case. We describe a general task non-stationarity and investigate the learning process in these learning scenarios where on-line methods have been found to be most useful. The dynamics are first analysed for K = M = 2 which is the building block of the general case (any K and M). The learning processes of stationary and non-stationary tasks are found to be qualitatively similar. However, for non-stationarities the transient stage of the dynamics becomes shorter and there is some residual error after convergence. These phases are investigated both numerically and analytically. The insight gained from the non-stationary case leads to a new learning rule which seems to be more efficient than basic gradient descent in escaping the symmetric subspace related to the transient part of the dynamics. These effects are studied in arbitrary realisable scenarios (K = M).
Publication DOI: | https://doi.org/10.48780/publications.aston.ac.uk.00021425 |
---|---|
Additional Information: | Copyright © Belmahi, O, 1997. Belmahi, O asserts their moral right to be identified as the author of this thesis. This copy of the thesis has been supplied on condition that anyone who consults it is understood to recognise that its copyright rests with its author and that no quotation from the thesis and no information derived from it may be published without appropriate permission or acknowledgement. If you have discovered material in Aston Publications Explorer which is unlawful e.g. breaches copyright, (either yours or that of a third party) or any other law, including but not limited to those relating to patent, trademark, confidentiality, data protection, obscenity, defamation, libel, then please read our Takedown Policy and contact the service immediately. |
Institution: | Aston University |
Uncontrolled Keywords: | online learning |
Last Modified: | 16 Apr 2025 13:52 |
Date Deposited: | 19 Mar 2014 11:20 |
Completed Date: | 1997 |
Authors: |
Belmahi, O.
|