G - Physics – 10 – L
Patent
G - Physics
10
L
354/47
G10L 15/14 (2006.01)
Patent
CA 1332195
RAPIDLY TRAINING A SPEECH RECOGNIZER TO A SUBSEQUENT SPEAKER GIVEN TRAINING DATA OF A REFERENCE SPEAKER ABSTRACT OF THE DISCLOSURE Apparatus and method for training the statistics of a Markov Model speech recognizer to a subsequent speaker who utters part of a training text after the recognizer has been trained for the statistics of a reference speaker who utters a full training text. Where labels generated by an acoustic processor in response to uttered speech serve as outputs for Markov models, the present apparatus and method determine label output probabilities at transitions in the Markov mod- els corresponding to the subsequent speaker where there is sparse training data. Specifically, label output probabili- ties for the subsequent speaker are re-parameterized based on confusion matrix entries having values indicative of the similarity between an ?th label output of the subsequent speaker and a kth label output for the reference speaker. The label output probabilities based on re-parameterized data are combined with initialized label output probabilities to form "smoothed" label output probabilities which feature smoothed probability distributions. Based on label outputs generated when the subsequent speaker utters the shortened training text, "basic" label output probabilities computed by conventional methodology are linearly averaged against the smoothed label output probabilities to produce improved label output probabilities.
570927
Bahl Lalit R.
Mercer Robert L.
Nahamoo David
International Business Machines Corporation
Rosen Arnold
LandOfFree
Rapidly training a speech recognizer to a subsequent speaker... does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Rapidly training a speech recognizer to a subsequent speaker..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Rapidly training a speech recognizer to a subsequent speaker... will most certainly appreciate the feedback.
Profile ID: LFCA-PAI-O-1318946