The Use of Formal Grammars in Automatic Speech Recognition

Abstract

An automatic isolated-word recognition (IWR) system normally consists of a feature extractor (FH) followed by a recognition processor. Some form of 'training' is usually required in order to combat problems of variations in speech. This thesis presents the application of formal grammars to model a FE in an IWR system. The method is to construct, in the training mode, one grammar for each word in the vocabulary, directly from a set of sample strings of 'features' represented by symbols. In the recognition mode, an incoming string is analysed to determine which grammar, if any, could have generated it. Inference algorithms for both finite-state grammars (FSG's) and context-free grammars (CFG's) considered here are based on the eriterion of maximizing the similarity between various strings of the same word. The classification of a string involves the use of the ‘weighted matching network' technique in the FSG approach and the computation of the minimisation matrix M for the CFG approach. Both the FSG and CFG models offer comparable recognition performances whilst the use of the CFG approach results in an increase in the amount of computation required. It appears, therefore, that there is no advantage gained, in terms of recognition performance and computational requirement, from the use of CFG approach over that of the FSG in the recognition of isolated words. The use of formal grammar approach over the direct storage of strings in isolated-word application makes possible the *generalisation' of strings in the training set. This can reduce the number of strings required by the learning process. Another advantage of the linguistic method is the reduction in the amount of computation in the FSG approach which is a result of the merging between similar segments of various strings during the training process.

Publication DOI: https://doi.org/10.48780/publications.aston.ac.uk.00008029
Divisions: College of Engineering & Physical Sciences
Additional Information: Copyright © Chaiyaporn Chirathamjaree, 1979. Chaiyaporn Chirathamjaree asserts their moral right to be identified as the author of this thesis. This copy of the thesis has been supplied on condition that anyone who consults it is understood to recognise that its copyright rests with its author and that no quotation from the thesis and no information derived from it may be published without appropriate permission or acknowledgement. If you have discovered material in Aston Publications Explorer which is unlawful e.g. breaches copyright, (either yours or that of a third party) or any other law, including but not limited to those relating to patent, trademark, confidentiality, data protection, obscenity, defamation, libel, then please read our Takedown Policy and contact the service immediately.
Institution: Aston University
Uncontrolled Keywords: formal grammars,automatic,speech recognition
Last Modified: 18 Mar 2025 08:33
Date Deposited: 12 May 2010 13:05
Completed Date: 1979-02
Authors: Chirathamjaree, Chaiyaporn

Export / Share Citation


Statistics

Additional statistics for this record