System and method for selecting training text


Patent


Details

G06F 17/16 (2006.01) G10L 13/02 (2006.01) G10L 13/08 (2006.01)


CA 2177863

A system and method are described for determining a near-optimum subset of data, based on a selected model, from a large corpus of data. Sets of feature vectors corresponding to natural or other preselected divisions of the data corpus are mapped into matrices representative of those divisions. The invention operates to find a submatrix of full rank formed as a union of one or more of those division-based matrices. A greedy algorithm using Gram-Schmidt orthonormalization operates on the division matrices to find a near-optimum submatrix, within a time bound that is a substantial improvement over prior-art methods. An important application of the invention is the selection of a small number of sentences, from a corpus containing a very large number of sentences, from which the parameters of a duration model for speech synthesis can be estimated.
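The idea in the abstract can be illustrated with a minimal sketch. It is not the patented procedure: the abstract does not state the selection criterion or the claimed time bound, so the code below assumes a simple rule (at each step, pick the division whose rows add the most new directions to the current orthonormal basis) and makes no attempt to match the claimed running time. The names `greedy_select`, `extend_basis`, `residual`, and `TOL` are hypothetical, as is the toy data.

```python
from math import sqrt

TOL = 1e-9  # assumed tolerance below which a residual counts as zero


def residual(vec, basis):
    """Remove from vec its components along each orthonormal basis vector."""
    r = list(vec)
    for b in basis:
        coeff = sum(x * y for x, y in zip(r, b))
        r = [x - coeff * y for x, y in zip(r, b)]
    return r


def extend_basis(basis, rows):
    """Gram-Schmidt step: return basis plus any new directions found in rows."""
    new_basis = list(basis)
    for row in rows:
        r = residual(row, new_basis)
        norm = sqrt(sum(x * x for x in r))
        if norm > TOL:
            new_basis.append([x / norm for x in r])
    return new_basis


def greedy_select(divisions, n_features):
    """Pick divisions greedily until the stacked rows reach rank n_features.

    divisions: list of matrices (lists of feature-vector rows), one per
    division of the corpus, e.g. one per sentence.
    Returns (indices of chosen divisions, rank reached).
    """
    basis, chosen = [], []
    remaining = set(range(len(divisions)))
    while len(basis) < n_features and remaining:
        best, best_basis = None, basis
        for i in sorted(remaining):
            candidate = extend_basis(basis, divisions[i])
            # Assumed criterion: largest rank gain, ties go to the lowest index.
            if len(candidate) > len(best_basis):
                best, best_basis = i, candidate
        if best is None:
            break  # no remaining division adds rank
        chosen.append(best)
        remaining.discard(best)
        basis = best_basis
    return chosen, len(basis)


# Toy corpus: four "sentences", each a small matrix of 3-dimensional feature vectors.
sentences = [
    [[1, 0, 0], [1, 0, 0]],
    [[1, 0, 0], [0, 1, 0]],
    [[0, 0, 1]],
    [[1, 1, 0]],
]
chosen, rank = greedy_select(sentences, n_features=3)
print(chosen, rank)  # prints: [1, 2] 3
```

In this toy case two of the four sentences already span all three feature dimensions, so a model with three parameters could in principle be estimated from those two alone. A practical implementation would more likely use a library QR factorization than hand-written Gram-Schmidt.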
