Information geometric measurements of generalisation

Zhu, Huaiyu and Rohwer, Richard (1995). Information geometric measurements of generalisation. Technical Report. Aston University, Birmingham, UK.

Abstract

Neural networks can be regarded as statistical models, and can be analysed in a Bayesian framework. Generalisation is measured by the performance on independent test data drawn from the same distribution as the training data. Such performance can be quantified by the posterior average of the information divergence between the true and the model distributions. Averaging over the Bayesian posterior guarantees internal coherence; Using information divergence guarantees invariance with respect to representation. The theory generalises the least mean squares theory for linear Gaussian models to general problems of statistical estimation. The main results are: (1)~the ideal optimal estimate is always given by average over the posterior; (2)~the optimal estimate within a computational model is given by the projection of the ideal estimate to the model. This incidentally shows some currently popular methods dealing with hyperpriors are in general unnecessary and misleading. The extension of information divergence to positive normalisable measures reveals a remarkable relation between the dlt dual affine geometry of statistical manifolds and the geometry of the dual pair of Banach spaces Ld and Ldd. It therefore offers conceptual simplification to information geometry. The general conclusion on the issue of evaluating neural network learning rules and other statistical inference methods is that such evaluations are only meaningful under three assumptions: The prior P(p), describing the environment of all the problems; the divergence Dd, specifying the requirement of the task; and the model Q, specifying available computing resources.

Divisions:	Aston University (General)
Uncontrolled Keywords:	Neural networks,Bayesian framework,internal coherence,statistical estimation,information geometry,statistical inference,computing resources
ISBN:	NCRG/95/005
Last Modified:	09 Jul 2025 07:30
Date Deposited:	22 Sep 2009 14:48
PURE Output Type:	Technical report
Published Date:	1995
Authors:	Zhu, Huaiyu Rohwer, Richard

Download

Information geometric measurements of generalisation

Abstract

Download

Export / Share Citation

Explore Further

Statistics