G - Physics – 10 – L
Patent
G - Physics
10
L
G10L 15/00 (2006.01) G06F 17/30 (2006.01) G11B 20/10 (2006.01)
Patent
CA 2662564
A speech processing system divides a spoken audio stream into partial audio streams ("snippets"). The system may divide a portion of the audio stream into two snippets at a position at which the speaker performed an editing operation, such as pausing and then resuming recording, or rewinding and then resuming recording. The snippets may be transmitted sequentially to a consumer, such as an automatic speech recognizer or a playback device, as the snippets are generated. The consumer may process (e.g., recognize or play back) the snippets as they are received. The consumer may modify its output in response to editing operations reflected in the snippets. The consumer may process the audio stream while it is being created and transmitted even if the audio stream includes editing operations that invalidate previously-transmitted partial audio streams, thereby enabling shorter turnaround time between dictation and consumption of the complete audio stream.
La présente invention concerne un système de traitement de la parole qui divise un flux audio parlé en flux audio partiels ('snippets'). Le système peut diviser une partie du flux audio en deux snippets selon une position sur laquelle l'orateur a effectué une modification, comme faire une pause puis reprendre l'enregistrement, ou rembobiner et reprendre l'enregistrement. Les snippets peuvent être transmis séquentiellement à un utilisateur, tel un dispositif de reconnaissance vocale automatique ou un dispositif de lecture, lors de la génération des snippets. L'utilisateur peut traiter (reconnaître ou lire, par exemple) les snippets dès réception. L'utilisateur peut modifier sa sortie en réponse aux modifications reflétées dans les snippets. L'utilisateur peut traiter le flux audio lors de sa création et de sa transmission même si le flux audio comprend des opérations de modification qui invalident les flux audio partiels transmis précédemment, ce qui permet de parvenir à un délai d'exécution plus court entre la dictée et l'utilisation du flux audio complet.
Carraux Eric
Koll Detlef
Fasken Martineau Dumoulin Llp
Multimodal Technologies Inc.
Multimodal Technologies Llc
LandOfFree
Recognition of speech in editable audio streams does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Recognition of speech in editable audio streams, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Recognition of speech in editable audio streams will most certainly appreciate the feedback.
Profile ID: LFCA-PAI-O-2045637