Bayesian invariant measurements of generalisation for discrete distributions

Zhu, Huaiyu and Rohwer, Richard (1995). Bayesian invariant measurements of generalisation for discrete distributions. Technical Report. Aston University, Birmingham, UK. (Unpublished)

Abstract

Neural network learning rules can be viewed as statistical estimators. They should be studied in a Bayesian framework even if they are not Bayesian estimators. Generalisation should be measured by the divergence between the true distribution and the estimated distribution. Information divergences are invariant measurements of the divergence between two distributions. The posterior average information divergence is used to measure the generalisation ability of a network. The optimal estimators for multinomial distributions with Dirichlet priors are studied in detail. This confirms that the definition is compatible with intuition. The results also show that many commonly used methods can be put under this unified framework by assuming special priors and special divergences.
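As an illustration of the multinomial case with a Dirichlet prior, the sketch below computes the posterior mean estimate, which is the estimate that minimises the posterior average Kullback-Leibler divergence K(p||q) over q. It is not taken from the report: the function names, the example counts, and the choice of this particular divergence are assumptions made for illustration only. With a uniform prior (all parameters equal to 1) the estimate reduces to Laplace's rule of succession, one example of a commonly used method arising from a special prior.

```python
import math

def dirichlet_posterior_mean(counts, alpha):
    """Posterior mean of multinomial probabilities under a Dirichlet(alpha) prior.

    Hypothetical helper for illustration: returns (n_i + a_i) / (N + A),
    where N = sum(counts) and A = sum(alpha).
    """
    total = sum(counts) + sum(alpha)
    return [(n + a) / total for n, a in zip(counts, alpha)]

def kl_divergence(p, q):
    """Kullback-Leibler divergence K(p||q) between two discrete distributions."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Assumed example data: three outcomes observed 3, 0 and 1 times.
counts = [3, 0, 1]
uniform_prior = [1.0, 1.0, 1.0]  # special prior giving Laplace's rule

estimate = dirichlet_posterior_mean(counts, uniform_prior)
relative_freq = [n / sum(counts) for n in counts]

# The divergence from the raw frequencies to the smoothed estimate is finite
# because the estimate assigns non-zero probability to the unseen outcome.
gap = kl_divergence(relative_freq, estimate)
```

The unseen second outcome receives a non-zero probability in the estimate, whereas the raw relative frequencies assign it zero.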

Divisions:	Aston University (General)
Additional Information:	Copyright © 1995, Huaiyu Zhu and Richard Rohwer. This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/).
Uncontrolled Keywords:	Neural network, learning rules, Bayesian framework, distribution
Last Modified:	10 Dec 2025 13:45
Date Deposited:	21 Jul 2009 12:12
PURE Output Type:	Technical report
Published Date:	1995-08-31
Authors:	Zhu, Huaiyu; Rohwer, Richard
