Text categorization based on co-classification learning from...


Patent


Details

G06F 17/27 (2006.01) G06F 15/18 (2006.01)

Patent

CA 2718579

The present document describes a method and a system for generating classifiers from multilingual corpora that include subsets of content-equivalent documents written in different languages. When the documents are translations of each other, their classifications must be substantially the same. Embodiments of the invention use this similarity to improve the accuracy of classification in one language based on the classification results in the other language, and vice versa. A system in accordance with the present embodiments implements a method which comprises generating a first classifier from a first subset of the corpora in a first language; generating a second classifier from a second subset of the corpora in a second language; and re-training each of the classifiers on its respective subset based on the classification results of the other classifier, until a training cost between the classification results produced by successive iterations reaches a local minimum.
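The alternating re-training loop described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract does not specify the classifier type, the form of the training cost, or the stopping rule, so everything concrete here is an assumption: logistic-regression classifiers, a cost equal to the log-loss of both classifiers plus a squared-disagreement penalty weighted by a hypothetical parameter `lam`, a tolerance-based stop, and a synthetic two-language corpus. The names `make_corpus`, `train_view`, `total_cost` and `co_train` are likewise hypothetical.

```python
import math
import random


def sigmoid(z):
    z = max(-30.0, min(30.0, z))  # clamp so math.exp cannot overflow
    return 1.0 / (1.0 + math.exp(-z))


def predict(w, x):
    """Probability of the positive class for feature vector x."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)))


def total_cost(X1, X2, y, w1, w2, lam):
    """Assumed cost: mean log-loss of both classifiers + lam * squared disagreement."""
    eps = 1e-9
    c = 0.0
    for x1, x2, t in zip(X1, X2, y):
        p1, p2 = predict(w1, x1), predict(w2, x2)
        c -= t * math.log(p1 + eps) + (1 - t) * math.log(1 - p1 + eps)
        c -= t * math.log(p2 + eps) + (1 - t) * math.log(1 - p2 + eps)
        c += lam * (p1 - p2) ** 2
    return c / len(y)


def train_view(X, y, peer, lam, w, steps=100, lr=0.3):
    """Gradient descent for one language; `peer` holds the other classifier's outputs."""
    n = len(X)
    for _ in range(steps):
        grad = [0.0] * len(w)
        for x, t, q in zip(X, y, peer):
            p = predict(w, x)
            # d/dz of log-loss is (p - t); d/dz of lam*(p - q)^2 is 2*lam*(p - q)*p*(1 - p)
            g = (p - t) + 2.0 * lam * (p - q) * p * (1.0 - p)
            for j, xj in enumerate(x):
                grad[j] += g * xj / n
        w = [wj - lr * gj for wj, gj in zip(w, grad)]
    return w


def co_train(X1, X2, y, lam=1.0, tol=1e-4, max_rounds=30):
    """Alternately re-train each classifier against the other's current predictions."""
    neutral = [0.5] * len(y)
    # Initial classifiers: one per language, trained independently (lam = 0).
    w1 = train_view(X1, y, neutral, 0.0, [0.0] * len(X1[0]))
    w2 = train_view(X2, y, neutral, 0.0, [0.0] * len(X2[0]))
    history = [total_cost(X1, X2, y, w1, w2, lam)]
    for _ in range(max_rounds):
        w1 = train_view(X1, y, [predict(w2, x) for x in X2], lam, w1)
        w2 = train_view(X2, y, [predict(w1, x) for x in X1], lam, w2)
        history.append(total_cost(X1, X2, y, w1, w2, lam))
        # Stop once the cost no longer changes between successive iterations.
        if abs(history[-2] - history[-1]) < tol:
            break
    return w1, w2, history


def make_corpus(n=40, seed=0):
    """Synthetic stand-in for translated document pairs: [bias, one noisy feature]."""
    rng = random.Random(seed)
    X1, X2, y = [], [], []
    for _ in range(n):
        t = rng.randint(0, 1)
        centre = 1.0 if t else -1.0
        X1.append([1.0, centre + rng.gauss(0, 0.5)])
        X2.append([1.0, centre + rng.gauss(0, 0.5)])
        y.append(t)
    return X1, X2, y


def accuracy(w, X, y):
    return sum((predict(w, x) >= 0.5) == bool(t) for x, t in zip(X, y)) / len(y)


if __name__ == "__main__":
    X1, X2, y = make_corpus()
    w1, w2, history = co_train(X1, X2, y)
    print("rounds:", len(history) - 1, "final cost:", round(history[-1], 4))
```

In this sketch each row of `X1` and the matching row of `X2` stand for the same document in two languages, which is why the disagreement penalty compares predictions pairwise. A real system would replace the synthetic features with per-language text features and could substitute any classifier that can be re-trained against the other view's outputs.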

