Document Number  Document Title
8970656 Static and dynamic video calling avatars  
A communication device may include logic configured to detect a request to initiate a video call by the user of the communication device; select an avatar for the video call, wherein the avatar...
8972260 Speech recognition using multiple language models  
In accordance with one embodiment, a method of generating language models for speech recognition includes identifying a plurality of utterances in training data corresponding to speech, generating...
8972269 Methods and systems for interfaces allowing limited edits to transcripts  
A transcript interface for displaying a plurality of words of a transcript in a text editor can be provided and configured to receive a command to edit the transcript. Limited edits to data...
8972258 Sparse maximum a posteriori (MAP) adaptation  
Techniques disclosed herein include using a Maximum A Posteriori (MAP) adaptation process that imposes sparseness constraints to generate acoustic parameter adaptation data for specific users...
8965759 Digital voice memo transfer and processing  
Systems, methods, apparatuses, and computer programs for transfer of recorded digital voice memos to a computing system and processing of the transferred digital voice memos by the computing...
8965760 Communication device, method, non-transitory computer readable medium, and system of a remote conference  
A communication device may acquire material data to share with a particular communication device, when one or more of first image data and first audio data is outputted. The communication device...
8965763 Discriminative language modeling for automatic speech recognition with a weak acoustic model and distributed training  
Training data from a plurality of utterance-to-text-string mappings of an automatic speech recognition (ASR) system may be selected. Parameters of the ASR system that characterize the utterances...
8965761 Differential dynamic content delivery with text display in dependence upon simultaneous speech  
Differential dynamic content delivery including providing a session document for a presentation, wherein the session document includes a session grammar and a session structured document;...
8965764 Electronic apparatus and voice recognition method for the same  
Disclosed are an electronic apparatus and a voice recognition method for the same. The voice recognition method for the electronic apparatus includes: receiving an input voice of a user;...
8964946 Identifying recorded call data segments of interest  
A method and apparatus of processing a voice call are disclosed. One example method of operation may include recording at least a portion of a voice call, and storing the portion of the voice call...
8959024 Visualizing, navigating and interacting with audio content  
Methods and arrangements for visually representing audio content in a voice application. A display is connected to a voice application, and an image is displayed on the display, the image...
8954317 Method and apparatus of processing user text input information  
A method and apparatus of processing communications with a mobile station are disclosed. One example method may include receiving a message from the mobile station and processing the message by...
8953753 Mass-scale, user-independent, device-independent voice messaging system  
A mass-scale, user-independent, device-independent, voice messaging system that converts unstructured voice messages into text for display on a screen is disclosed. The system comprises (i)...
8954318 Method of and system for using conversation state information in a conversational interaction system  
A method of using conversation state information in a conversational interaction system is disclosed. A method of inferring a change of a conversation session during continuous user interaction...
8954326 Apparatus and method for voice command recognition based on a combination of dialog models  
Provided are a voice command recognition apparatus and method capable of figuring out the intention of a voice command input through a voice dialog interface, by combining a rule based dialog...
8947596 Alignment of closed captions  
In embodiments, apparatuses, methods and storage media are described that are associated with alignment of closed captions. Video content (along with associated audio) may be analyzed to determine...
8949130 Internal and external speech recognition use with a mobile communication facility  
In embodiments of the present invention improved capabilities are described for a user interacting with a mobile communication facility, where speech presented by the user is recorded using a...
8949125 Annotating maps with user-contributed pronunciations  
Systems and methods are provided to select a most typical pronunciation of a location name on a map from a plurality of user pronunciations. A server generates a reference speech model based on...
8949124 Automated learning for speech-based applications  
Systems and methods for modifying a computer-based speech recognition system. A speech utterance is processed with the computer-based speech recognition system using a set of internal...
8949128 Method and apparatus for providing speech output for speech-enabled applications  
Techniques for providing speech output for speech-enabled applications. A synthesis system receives from a speech-enabled application a text input including a text transcription of a desired...
8949134 Method and apparatus for recording/replaying application execution with recorded voice recognition utterances  
A diagnostic tool for speech recognition applications is provided, which enables an administrator to collect multiple recorded speech sessions. The administrator can then search for various failure...
8942981 Natural language call router  
A natural language call router forwards an incoming call from a caller to an appropriate destination. The call router has a speech recognition mechanism responsive to words spoken by a caller for...
8942977 System and method for speech recognition using pitch-synchronous spectral parameters  
The present invention defines a pitch-synchronous parametrical representation of speech signals as the basis of speech recognition, and discloses methods of generating the said pitch-synchronous...
8935163 Automatic conversation system and conversation scenario editing device  
A conversation scenario editor generates/edits a conversation scenario for an automatic conversation system. The system includes a conversation device and a conversation server. The conversation...
8935166 Systems and methods for providing an electronic dictation interface  
Some embodiments disclosed herein store a target application and a dictation application. The target application may be configured to receive input from a user. The dictation application interface...
8935165 Method for displaying words and processing device and computer program product thereof  
The disclosure provides a method for displaying words. In the method, a speech signal is received. A pitch contour and an energy contour of the speech signal are extracted. Speech recognition is...
8930189 Distributed user input to text generated by a speech to text transcription service  
A particular method includes receiving, at a representational state transfer endpoint device, a first user input related to a first speech to text conversion performed by a speech to text...
8922617 Systems, methods, and devices for time-shifting playback of a live online meeting  
In various embodiments, an attendee of a live online meeting selects screen data from an earlier point in time in the online meeting for playback while the meeting is still ongoing. Automatically...
8924210 Text processing using natural language understanding  
Techniques for converting spoken speech into written speech are provided. The techniques include transcribing input speech via speech recognition, mapping each spoken utterance from input speech...
8918318 Extended recognition dictionary learning device and speech recognition system  
Speech recognition of even a speaker who uses a speech recognition system is enabled by using an extended recognition dictionary suited to the speaker without requiring any previous learning using...
8918317 Decoding-time prediction of non-verbalized tokens  
Non-verbalized tokens, such as punctuation, are automatically predicted and inserted into a transcription of speech in which the tokens were not explicitly verbalized. Token prediction may be...
8914286 Speech recognition with hierarchical networks  
Provided are systems and methods for using hierarchical networks for recognition, such as speech recognition. Conventional automatic recognition systems may not be both efficient and flexible....
8914283 System and method for unsupervised and active learning for automatic speech recognition  
A system and method are provided for combining active and unsupervised learning for automatic speech recognition. This process enables a reduction in the amount of human supervision required for...
8914291 Method and apparatus for generating synthetic speech with contrastive stress  
Techniques for generating synthetic speech with contrastive stress. In one aspect, a speech-enabled application generates a text input including a text transcription of a desired speech output,...
8914284 Methods and apparatus for conducting internet protocol telephony communication  
IP telephony communications are conducted by sending both data produced by a CODEC that represents received spoken audio input, and a textual representation of the spoken audio input. A receiving...
8914278 Automatic context sensitive language correction and enhancement using an internet corpus  
A computer-assisted language correction system including spelling correction functionality, misused word correction functionality, grammar correction functionality and vocabulary enhancement...
8914292 ***WITHDRAWN PATENT AS PER THE LATEST USPTO WITHDRAWN LIST*** Internal and external speech recognition use with a mobile communication facility  
In embodiments of the present invention improved capabilities are described for a user interacting with a mobile communication facility, where speech presented by the user is recorded using a...
8914003 System and method for processing a voicemail  
Described is a system and method for processing a voice mail. The method comprises receiving a voice mail, converting the voice mail into a text message using a predefined speech-to-text...
8914277 Speech and language translation of an utterance  
According to example configurations, a speech-processing system parses an uttered sentence into segments. The speech-processing system translates each of the segments in the uttered sentence into...
RE45289 Selective noise/channel/coding models and recognizers for automatic speech recognition  
An apparatus and method for the robust recognition of speech during a call in a noisy environment is presented. Specific background noise models are created to model various background noises...
8909532 Supporting multi-lingual user interaction with a multimodal application  
Methods, apparatus, and products are disclosed for supporting multi-lingual user interaction with a multimodal application, the application including a plurality of VoiceXML dialogs, each dialog...
8909525 Interactive voice recognition electronic device and method  
An interactive voice recognition electronic device converts a received voice signal to a text, and searches a voice database to find a matched voice text of the converted text. The matched voice...
8909516 Functionality for normalizing linguistic items  
Computing functionality converts an input linguistic item into a normalized linguistic item, representing a normalized counterpart of the input linguistic item. In one environment, the input...
8903847 Digital media voice tags in social networks  
A voice tagging system includes a client computing device that includes a media object capture device and a voice capture device and runs a client application that associates media objects to...
8903716 Personalized vocabulary for digital assistant  
Methods, systems, and computer readable storage medium related to operating an intelligent digital assistant are disclosed. A text string is obtained from a speech input received from a user. The...
8903723 Audio synchronization for document narration with user-selected playback  
Disclosed are techniques and systems to provide a narration of a text. In some aspects, the techniques and systems described herein include generating a timing file that includes elapsed time...
8898061 ***WITHDRAWN PATENT AS PER THE LATEST USPTO WITHDRAWN LIST*** Distributed user input to text generated by a speech to text transcription service  
A particular method includes receiving, at a representational state transfer endpoint device, a first user input related to a first speech to text conversion performed by a speech to text...
8892442 System and method for answering a communication notification  
Disclosed herein are systems, methods, and computer-readable media for answering a communication notification. The method for answering a communication notification comprises receiving a...
8892662 Call completion via instant communications client  
A system is disclosed for achieving completion of a telephone call by way of an instant communications client.
8892437 Method and apparatus of providing semi-automated classifier adaptation for natural language processing  
Example embodiments of the present invention may include a method that provides transcribing spoken utterances occurring during a call and assigning each of the spoken utterances with a...