G - Physics – 06 – K
Patent
G - Physics
06
K
G06K 9/18 (2006.01) G06K 9/62 (2006.01)
Patent
CA 2108536
Pseudo two-dimensional hidden Markov models (HMMs) are used to represent text elements, such as characters or words. Observation vectors for each text element are based on pixel maps obtained by optical scanning. A character is represented by a pseudo two-dimensional HMM having a number of superstates, with each superstate having at least one state. Text elements are compared with such models by using the Viterbi algorithm, first in connection with the states in each superstate, then the superstates themselves, to calculate the probability that a particular model represents the text element. Parameters for the models are generated by training routines. Probabilities can be adjusted to compensate for changes in scale, translations, slant, and rotation. An embodiment is also disclosed for identifying keywords in a body of text. A first pseudo two-dimensional HMM is created for the words that may appear in the text. Each word in the text is compared with both models, again using the Viterbi algorithm, to calculate probabilities that the model represents the subject word. If the probability for the keyword is greater than that for the extraneous words, the subject word is identified as being the keyword. Preprocessing steps for reducing the number of words to be compared can be added.
Agazzi Oscar Ernesto
Kuo Shyh-Shiaw
American Telephone And Telegraph Company
Kirby Eades Gale Baker
LandOfFree
Text recognition using two-dimensional stochastic models does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Text recognition using two-dimensional stochastic models, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Text recognition using two-dimensional stochastic models will most certainly appreciate the feedback.
Profile ID: LFCA-PAI-O-2081156