G10L 15/02 (2006.01) G10L 19/02 (2006.01) G10L 15/06 (2006.01)
Patent
CA 2290185
Systems and methods for processing acoustic speech signals that use the wavelet transform (or, alternatively, the Fourier transform) as a fundamental tool. The method essentially involves "synchrosqueezing" spectral component data obtained by performing a wavelet transform (or Fourier transform) on digitized speech signals. In one aspect, spectral components of the synchrosqueezed plane are dynamically tracked via a K-means clustering algorithm. The amplitude, frequency and bandwidth of each of the components are thus extracted. The cepstrum generated from this information is referred to as "K-mean Wastrum." In another aspect, the result of the K-means clustering process is further processed to limit the set of primary components to formants. The resulting features are referred to as "formant-based wastrum." Formants are interpolated in unvoiced regions, and the contribution of the unvoiced turbulent part of the spectrum is added. This method requires adequate formant tracking. The resulting robust formant extraction has a number of applications in speech processing and analysis, including vocal tract normalization.
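The clustering step described in the abstract can be illustrated with a minimal sketch. This is not the patented procedure: it assumes a single analysis frame already reduced to (frequency, energy) pairs, skips synchrosqueezing entirely, and uses a plain energy-weighted one-dimensional K-means. The function name `kmeans_components` and its parameters are hypothetical, chosen only for illustration.

```python
import numpy as np

def kmeans_components(freqs, energy, k, iters=50):
    """Energy-weighted 1-D K-means over the frequency bins of one frame.

    Hypothetical helper for illustration only (not the patent's method).
    Returns (centers, amplitudes, bandwidths), each of length k,
    ordered by increasing center frequency.
    """
    freqs = np.asarray(freqs, dtype=float)
    energy = np.asarray(energy, dtype=float)

    # Assumption: start the centers at energy-weighted quantiles so that
    # each one begins close to a region that actually carries energy.
    cdf = np.cumsum(energy) / np.sum(energy)
    centers = np.interp((np.arange(k) + 0.5) / k, cdf, freqs)

    for _ in range(iters):
        # Assign every frequency bin to its nearest center.
        labels = np.argmin(np.abs(freqs[:, None] - centers[None, :]), axis=1)
        updated = centers.copy()
        for j in range(k):
            mask = labels == j
            weight = energy[mask].sum()
            if weight > 0:
                # New center is the energy-weighted mean frequency.
                updated[j] = np.sum(energy[mask] * freqs[mask]) / weight
        if np.allclose(updated, centers):
            break
        centers = updated

    centers = np.sort(centers)
    labels = np.argmin(np.abs(freqs[:, None] - centers[None, :]), axis=1)

    amplitudes = np.zeros(k)
    bandwidths = np.zeros(k)
    for j in range(k):
        mask = labels == j
        weight = energy[mask].sum()
        amplitudes[j] = weight  # total energy captured by the cluster
        if weight > 0:
            # Bandwidth taken as the energy-weighted standard deviation.
            var = np.sum(energy[mask] * (freqs[mask] - centers[j]) ** 2) / weight
            bandwidths[j] = np.sqrt(var)
    return centers, amplitudes, bandwidths
```

Applied frame by frame, a routine like this yields one (frequency, amplitude, bandwidth) triple per cluster. Tracking those triples over time and turning them into cepstral features would need further steps that the abstract does not detail.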
Basu Sankar
Maes Stephane H.
Bereskin & Parr Llp/s.e.n.c.r.l.,s.r.l.
International Business Machines Corporation
Nuance Communications Inc.
Wavelet-based energy binning cepstral features for automatic...