G - Physics – 10 – L
Patent
G - Physics
10
L
G10L 15/02 (2006.01) G10L 17/00 (2006.01)
Patent
CA 2311439
A method for collecting data associated with the voice of a voice system user includes conducting a conversation with the user, capturing and digitizing a speech waveform of the user, extracting at least one acoustic feature from the digitized speech waveform and storing attribute data corresponding to the acoustic feature, together with an identifying indicia, in the data warehouse in a form to facilitate subsequent data mining. User attributes can include gender, age, accent, native language, dialect, socioeconomic classification, educational level and emotional state. Data gathering can be repeated for a large number of users, until sufficient data is present. The attribute data to be stored can include raw acoustic features, or processed features, such as the user's emotional state, age, gender, socioeconomic group, and the like. In an alternative form of method, the user attribute can be used to real-time modify behavior of the voice system, with or without storage of data for subsequent data mining. An apparatus for collecting data associated with a voice of a user includes a dialog management unit, an audio capture module, an acoustic front end, a processing module and a data warehouse. The acoustic front end receives and digitizes a speech waveform from the user and extracts at least one acoustic feature from the digitized speech waveform. The feature is correlated with at least one user attribute. The processing module analyzes the acoustic feature to determine the user attribute, which can then be stored in the data warehouse. The dialog management unit can include, for example, a telephone interactive voice response system. The processor can be an application specific circuit, a separate general purpose computer with appropriate software, or a processor portion of the IVR. The processing module can include an emotional state classifier, a speaker clusterer and classifier, a speech recognizer, and/or an accent identifier. Alternatively, the apparatus can be configured as a real-time- modifiable voice system for interaction with a user, which can be used to practice the method for tailoring a voice system response.
Kanevsky Dimitri
Maes Stephan H.
Sorensen Jeffrey S.
International Business Machines Corporation
Wang Peter
LandOfFree
Conversational data mining does not yet have a rating. At this time, there are no reviews or comments for this patent.
If you have personal experience with Conversational data mining, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Conversational data mining will most certainly appreciate the feedback.
Profile ID: LFCA-PAI-O-1674689