Method for domain identification of documents in a document...

G - Physics – 06 – F

Patent

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

G06F 17/30 (2006.01) G06F 17/27 (2006.01)

Patent

CA 2651217

A method for processing documents in a document database includes determining vocabulary words for each document, and determining a respective relevancy for each vocabulary word based upon occurrences thereof in all of the documents. Similarities are determined between the documents based upon the vocabulary words and their respective relevancies. At least one domain identification is determined for the documents based upon the determined similarities.

L'invention concerne un procédé de traitement de documents dans une base de données documentaire qui comprend la détermination des termes de vocabulaire pour chaque document, ainsi que la détermination de la pertinence respective pour chaque terme de vocabulaire sur la base du nombre d'occurrences relevé pour chacun d'eux dans l'ensemble des documents. Le procédé comprend également la détermination des similitudes entre les différents documents sur la base des termes de vocabulaire et de leur pertinence respective. Au moins une identification de domaine est déterminée pour les documents sur la base des similitudes ainsi déterminées.

LandOfFree

Say what you really think

Search LandOfFree.com for Canadian inventors and patents. Rate them and share your experience with other people.

Rating

Method for domain identification of documents in a document... does not yet have a rating. At this time, there are no reviews or comments for this patent.

If you have personal experience with Method for domain identification of documents in a document..., we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Method for domain identification of documents in a document... will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFCA-PAI-O-1620237

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.