G - Physics – 06 – F
Patent
G - Physics
06
F
G06F 17/16 (2006.01) G10L 13/02 (2006.01) G10L 13/08 (2006.01)
Patent
CA 2177863
A system and method are described for determining a near-optimum subset of data, based on a selected model, from a large corpus of data. Sets of feature vectors corresponding to natural or other preselected divisions of the data corpus are mapped into matrices representative of such divisions. The invention operates to find a submatrix of full rank formed as a union of one or more of those division-based matrices. A greedy algorithm utilizing Gram-Schmidt orthonormalization operates on the division matrices to find a near optimum submatrix and in a time bound representing a substantial improvement over prior-art methods. An important application of the invention is the selection of a small number of sentences from a corpus of a very large number of such sentences from which the parameters of a duration model for speech synthesis can be estimated.
Buchsbaum Adam Louis
Vansanten Jan Pieter
At&t Ipm Corp.
Kirby Eades Gale Baker
LandOfFree
System and method for selecting training text does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with System and method for selecting training text, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and System and method for selecting training text will most certainly appreciate the feedback.
Profile ID: LFCA-PAI-O-1724217