Match Document Document Title
8600750 Speaker-cluster dependent speaker recognition (speaker-type automated speech recognition)  
In an example embodiment, there is disclosed herein an automatic speech recognition (ASR) system that employs speaker clustering (or speaker type) for transcribing audio. A large corpus of audio...
8600746 Speech recognition parameter adjustment  
Audio data that encodes an utterance of a user is received. It is determined that the user has been classified as a novice user of a speech recognizer. A speech recognizer setting is selected that...
8600747 Method for dialog management  
A spoken dialog system and method having a dialog management module are disclosed. The dialog management module includes a plurality of dialog motivators for handling various operations during a...
8600745 System and method for processing speech files  
A system and method for speech file processing which provides users with differentially selectable speech file transcripts which can be sent to one or more other users. The speech files may be...
8600762 Mobile terminal and method for recognizing voice thereof  
A method for detecting a character or a word emphasized by a user from a voice inputted in a mobile terminal to refer it as meaningful information for a voice recognition, or emphatically...
8600743 Noise profile determination for voice-related feature  
Systems, methods, and devices for noise profile determination for a voice-related feature of an electronic device are provided. In one example, an electronic device capable of such noise profile...
8593501 Voice-controlled labeling of communication session participants  
In general, this disclosure describes techniques for providing identification, such as a name, to participants of a communication session. In one example, a method includes establishing, by a...
8595015 Audio communication assessment  
A device may include a communication interface configured to receive audio signals associated with audible communications from a user; an output device; and logic. The logic may be configured to...
8589170 Single control message device  
A message device records and plays messages. In a first aspect, actuation is effected by way of a single control. The control generally corresponds to the whole cover. Relatedly, in a second...
8589168 Method and apparatus for analyzing discussion regarding media programs  
A process and system including a device including a controller to detect a plurality of users engaging in a voice conference related to a presentation of a media program, convert speech dialog...
8589159 Keyword display system, keyword display method, and program  
The present invention is a keyword display system that includes a speaker specifier for specify a speaker; a weight determinator for determining a weight of the specified speaker; a keyword...
8589160 Systems and methods for providing an electronic dictation interface  
Some embodiments disclosed herein store a target application and a dictation application. The target application may be configured to receive input from a user. The dictation application interface...
8589334 Robust information fusion methods for decision making for multisource data  
Methods and systems are provided for developing decision information relating to a single system based on data received from a plurality of sensors. The method includes receiving first data from a...
8589379 Report generation support system  
The report generation support system according to the embodiment comprises an input history recording part, operation history recording part, selection part, extraction part, and a display...
8589158 Application server for reducing ambiance noise in an auscultation signal, and for recording comments while auscultating a patient with an electronic stethoscope  
An application server for reducing ambiance noise in an auscultation signal, and for recording comments while auscultating a patient with an electronic stethoscope This application server (AS)...
8589157 Replying to text messages via automated voice search techniques  
An automated “Voice Search Message Service” provides a voice-based user interface for generating text messages from an arbitrary speech input. Specifically, the Voice Search Message Service...
8583430 Semi-automated intermodal voice to data transcription method and apparatus  
A semi-automated, intermodal transcription-formatted data input system utilizing one or more interconnected servers which receive communications links. The system identifies and validates a user,...
8583432 Dialect-specific acoustic language modeling and speech recognition  
Methods and systems for automatic speech recognition and methods and systems for training acoustic language models are disclosed. One system for automatic speech recognition includes a dialect...
8583434 Methods for statistical analysis of speech  
Computer-implemented methods and apparatus are provided to facilitate the recognition of the content of a body of speech data. In one embodiment, a method for analyzing verbal communication is...
8583431 Communications system with speech-to-text conversion and associated methods  
A communications system includes a first communications device cooperating with a second communications device. The first communications device multiplexes a digital speech message and a...
8583433 System and method for efficiently transcribing verbal messages to text  
A system and method for efficiently transcribing verbal messages to text is provided. Verbal messages are received and at least one of the verbal messages is divided into segments. Automatically...
8577671 Method of and system for using conversation state information in a conversational interaction system  
A method of using conversation state information in a conversational interaction system is disclosed. A method of inferring a change of a conversation session during continuous user interaction...
8577679 Symbol insertion apparatus and symbol insertion method  
Enables symbol insertion evaluation in consideration of a difference in speaking style features between speakers. For a word sequence transcribing voice information, the symbol insertion...
8571863 Apparatus and methods for identifying a media object from an audio play out  
In one example, a device captures a segment of audio played out over an audio source in response to a control signal from a user interface of the device. The device causes any speech of the...
8571849 System and method for enriching spoken language translation with prosodic information  
Disclosed herein are systems, methods, and computer readable-media for enriching spoken language translation with prosodic information in a statistical speech translation framework. The method...
8571862 Multimodal interface for input of text  
The disclosure describes an overall system/method for text-input using a multimodal interface with a combination of speech recognition and text prediction. Specifically, an “always listening” mode...
8571851 Semantic interpretation using user gaze order  
Methods, and systems, including computer programs encoded on computer-readable storage mediums, including a method for performing semantic interpretation using gaze order. The method includes...
8571864 Automatic identification of repeated material in audio signals  
A system and method are described for recognizing repeated audio material within at least one media stream without prior knowledge of the nature of the repeated material. The system and method are...
8566088 System and method for automatic speech to text conversion  
Speech recognition is performed in near-real-time and improved by exploiting events and event sequences, employing machine learning techniques including boosted classifiers, ensembles, detectors...
8566090 System and method for referring to entities in a discourse domain  
Systems, methods, and non-transitory computer-readable media for referring to entities. The method includes receiving domain-specific training data of sentences describing a target entity in a...
8566078 Game based method for translation data acquisition and evaluation  
A method of generating a statistical machine translation database through a game in which a monolingual structure is provided to a plurality of players. A first translation attempt is received...
8566103 Multi-modal web interaction over wireless network  
A system, apparatus, and method is disclosed for receiving user input at a client device, interpreting the user input to identify a selection of at least one of a plurality of web interaction...
8560323 Providing contextual information for spoken information  
Techniques are described for providing relevant information to users (e.g., information that is at least potentially of interest to the users). Relevant information for a user may be automatically...
8560310 Method and apparatus providing improved voice activated functions  
A method, apparatus and computer program product for providing improved voice activated functions is presented. A grammar is provided from a collection of names for use in a voice activated...
8560301 Apparatus and method for language expression using context and intent awareness  
A language expression apparatus and a method based on a context and a intent awareness, are provided. The apparatus and method may recognize a context and an intent of a user and may generate a...
8560314 Applying service levels to transcripts  
Speech is transcribed to produce a draft transcript of the speech. Portions of the transcript having a high priority are identified. For example, particular sections of the transcript may be...
8560315 Conference support device, conference support method, and computer-readable medium storing conference support program  
A conference support device includes an image receiving portion that receives captured images from conference terminals, a voice receiving portion that receives, from one of the conference...
8554566 Training and applying prosody models  
Techniques for training and applying prosody models for speech synthesis are provided. A speech recognition engine processes audible speech to produce text annotated with prosody information. A...
8554558 Visualizing automatic speech recognition and machine translation output  
An automated speech processing method, system and computer program product are disclosed. In one embodiment, a speech-to-text (STT) engine is used for converting an audio input to text data in a...
8554567 Multi-channel interactive self-help application platform and method  
An interactive voice response (IVR) platform running a voice application for use with a voice client is extended to support text messaging clients and other clients of other media types on other...
8554541 Virtual pet system, method and apparatus for virtual pet chatting  
A virtual pet system includes: a virtual pet client, adapted to receive a sentence in natural language and send the sentence to a Q&A server the Q&A server, adapted to receive the sentence,...
8554559 Localized speech recognition with offload  
A local computing device may receive an utterance from a user device. In response to receiving the utterance, the local computing device may obtain a text string transcription of the utterance,...
8548809 Voice guidance system and voice guidance method using the same  
A voice guidance system for providing a guidance by voice concerning operations of an information processing apparatus, comprises a detector that detects that a predetermined function of the...
8548807 System and method for adapting automatic speech recognition pronunciation by acoustic model restructuring  
Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for recognizing speech by adapting automatic speech recognition pronunciation by acoustic model...
8548806 Voice recognition device, voice recognition method, and voice recognition program  
A voice recognition device, a voice recognition method and a voice recognition program capable of appropriately restricting recognition objects based on voice input from a user to recognize the...
8543397 Mobile device voice activation  
A mobile computerized device receives an indication of a first user input comprising a button actuation to initiate a push-to-talk voice search. The device receives from the user a spoken search...
8543398 Training an automatic speech recognition system using compressed word frequencies  
Respective word frequencies may be determined from a corpus of utterance-to-text-string mappings that contain associations between audio utterances and a respective text string transcription of...
8543395 Methods and systems for performing synchronization of audio with corresponding textual transcriptions and determining confidence values of the synchronization  
Methods and systems for performing audio synchronization with corresponding textual transcription and determining confidence values of the timing-synchronization are provided. Audio and a...
8543396 Continuous speech transcription performance indication  
Audio data that includes speech may be transcribed to text by a speech recognition engine. One or more metrics associated with the audio data and/or the text may be determined. An indicator...
8543404 Proactive completion of input fields for automated voice enablement of a web page  
Embodiments of the present invention provide a method and computer program product for the proactive completion of input fields for automated voice enablement of a Web page. In an embodiment of...