King Saud University Repository >
King Saud University >
Science Colleges >
College of Computer and Information Sciences >
College of Computer and Information Sciences >

Please use this identifier to cite or link to this item: http://hdl.handle.net/123456789/14961

Title: Which one is dominant for neural network based speech recognition – Δ or ΔΔ articulatory parameters?
Authors: Ghulam Muhammad
Keywords: multilayer neural networks, Gram-Schmidt orthogonalization, articulatory velocity coefficient, articulatory acceleration coefficient, Inhibition/Enhancement network
Issue Date: 2010
Publisher: IEEE Computer Society
Abstract: This paper presents a method that describes the effect of articulatory velocity coefficient (Δ) on neural network based speech recognition. The method consists of three stages: a) two multilayer neural networks (MLNs), where second MLN takes Δ articulatory parameters as input b) Inhibition/Enhancement (In/En) network and c) Gram-Schmidt orthogonalization before connecting with a hidden Markov model (HMM) based classifier. From the experiments on Japanese Newspaper Article Sentences (JNAS), it is shown that velocity coefficient has more effect on the phoneme recognition performance. Moreover, the proposed phoneme recognizer with articulatory velocity coefficient provides higher phoneme accuracy with fewer mixture components in HMMs.
URI: http://hdl.handle.net/123456789/14961
Appears in Collections:College of Computer and Information Sciences

Files in This Item:

File Description SizeFormat
Dr Gulam-27-conf.docx14.9 kBMicrosoft Word XMLView/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.


DSpace Software Copyright © 2002-2009 MIT and Hewlett-Packard - Feedback