Match Document Document Title
8751230 Method and device for generating vocabulary entry from acoustic data  
A method and a device (1) for automatically generating vocabulary entry from input acoustic data (3), comprising a vocabulary entry type-specific acoustic phonetic transcription module (2; T) and...
8751240 Apparatus and method for forming search engine queries based on spoken utterances  
A combination and a method are provided. Automatic speech recognition is performed on a received utterance. A meaning of the utterance is determined based, at least in part, on the recognized...
8744839 Recognition of target words using designated characteristic values  
Target word recognition includes: obtaining a candidate word set and corresponding characteristic computation data, the candidate word set comprising text data, and characteristic computation data...
8744848 Methods and systems for training dictation-based speech-to-text systems using recorded samples  
A method and apparatus useful to train speech recognition engines is provided. Many of today's speech recognition engines require training to particular individuals to accurately convert speech to...
8738374 System and method for the secure, real-time, high accuracy conversion of general quality speech into text  
Described is a speech-to-text conversion system and method that provides secure, real-time and high-accuracy conversion of general-quality speech into text. The system is designed to interface...
8738377 Predicting and learning carrier phrases for speech input  
Predicting and learning users' intended actions on an electronic device based on free-form speech input. Users' actions can be monitored to develop of a list of carrier phrases having one or more...
8738375 System and method for optimizing speech recognition and natural language parameters with user feedback  
Disclosed herein are systems, methods, and non-transitory computer-readable storage media for assigning saliency weights to words of an ASR model. The saliency values assigned to words within an...
8731920 Document transcription system training  
A system is provided for training an acoustic model for use in speech recognition. In particular, such a system may be used to perform training based on a spoken audio stream and a non-literal...
8731919 Methods and system for capturing voice files and rendering them searchable by keyword or phrase  
A system for capturing voice files and rendering them searchable, comprising one or more devices capable of capturing audio speech electronically, a recorder coupled to the devices for retrieving...
8731926 Spoken term detection apparatus, method, program, and storage medium  
A spoken term detection apparatus includes: processing performed by a processor includes a feature extraction process extracting an acoustic feature from speech data accumulated in an accumulation...
8725513 Providing expressive user interaction with a multimodal application  
Methods, apparatus, and products are disclosed for providing expressive user interaction with a multimodal application, the multimodal application operating in a multimodal browser on a multimodal...
8725505 Verb error recovery in speech recognition  
A computer implemented method and system for speech recognition are provided. The method and system generally maintain a set of verbs for speech recognition commands. Upon recognizing utterance of...
8725507 Systems and methods for synthesis of motion for animation of virtual heads/characters via voice processing in portable devices  
Systems and methods consistent with the innovations herein relate to communication using a virtual humanoid animated during call processing. According to one exemplary implementation, the...
8719022 Compressed phonetic representation  
An audio processing system makes use of a number of levels of compression or data reduction, thereby providing reduced storage requirements while maintaining a high accuracy of keyword detection...
8719035 Method and apparatus for recognizing and reacting to user personality in accordance with speech recognition system  
Techniques are disclosed for recognizing user personality in accordance with a speech recognition system. For example, a technique for recognizing a personality trait associated with a user...
8718242 Method and apparatus for automatically building conversational systems  
A system and method provides a natural language interface to world-wide web content. Either in advance or dynamically, webpage content is parsed using a parsing algorithm. A person using a...
8719016 Speech analytics system and system and method for determining structured speech  
A method for converting speech to text in a speech analytics system is provided. The method includes receiving audio data containing speech made up of sounds from an audio source, processing the...
8712772 Method and system for processing dictated information  
A method and system for processing dictated information into a dynamic form are disclosed. The method comprises presenting an image (3) belonging to an image category to a user, dicatating a first...
8712778 Systems and methods for archiving and retrieving navigation points in a voice command platform  
A method and system for identifying, saving and utilizing bookmarks in a voice-command platform. The system allows a user to bookmark objects specified within voice-markup filed resulting in the...
8712757 Methods and apparatus for monitoring communication through identification of priority-ranked keywords  
A method for communication management includes receiving at least one keyword and receiving a replay time span input. Further, the method includes receiving a plurality of communication inputs...
8712781 System and method for customized prompting  
A method for providing an audible prompt to a user within a vehicle. The method includes retrieving one or more data files from a memory device. The data files define certain characteristics of an...
8706486 Voice data leakage detection and prevention systems  
An exemplary system for detecting and preventing voice data leakage may comprise one or more servers running a packet payload converter module, a transcript generator module, and a detection logic...
8706485 Method and device for mnemonic contact image association  
The present invention pertains to method and a communication device (100) for associating a contact record pertaining to a remote speaker (220) with a mnemonic image (191) based on attributes of...
8706499 Periodic ambient waveform analysis for enhanced social functions  
Client devices periodically capture ambient audio waveforms, generate waveform fingerprints, and upload the fingerprints to a server for analysis. The server compares the waveforms to a database...
8705705 Voice rendering of E-mail with tags for improved user experience  
Tags, such as XML tags, are inserted into email to separate email content from signature blocks, privacy notices and confidentiality notices, and to separate original email messages from replies...
8700395 Transcription data extraction  
A computer program product, for performing data determination from medical record transcriptions, resides on a computer-readable medium and includes computer-readable instructions for causing a...
8700408 In-vehicle apparatus and information display system  
An in-vehicle apparatus receives an image data representative of a screen image from a portable terminal with a touch panel. The apparatus extracts a text code data from the image data, and...
8700396 Generating speech data collection prompts  
This document generally describes computer technologies relating to generating speech data collection prompts, such as textual scripts and/or textual scenarios. Speech data collection prompts for...
8700403 Unified treatment of data-sparseness and data-overfitting in maximum entropy modeling  
A method of statistical modeling is provided which includes constructing a statistical model and incorporating Gaussian priors during feature selection and during parameter optimization for the...
8694537 Systems and methods for enabling natural language processing  
Systems and methods for searching databases by sound data input are provided herein. A service provider may have a need to make their database(s) searchable through search technology. However, the...
8694312 Discriminative training of document transcription system  
A system is provided for training an acoustic model for use in speech recognition. In particular, such a system may be used to perform training based on a spoken audio stream and a non-literal...
8694309 Automatic speech recognition tuning management  
A method, a computer readable medium and a system for automatic speech recognition tuning management that comprises, collecting an utterance, analyzing the utterance, correlating the collected...
8692862 System and method for selection of video data in a video conference environment  
A method is provided in one embodiment and includes establishing a communication session involving a first endpoint and a second endpoint that are associated with a video conference in a network...
8694319 Dynamic prosody adjustment for voice-rendering synthesized data  
Methods, systems, and products are disclosed for dynamic prosody adjustment for voice-rendering synthesized data that include retrieving synthesized data to be voice-rendered; identifying, for the...
8694321 Image-to-speech system  
Apparatus for communicating includes a processor, a memory, a storage, a display, a manual input arrangement and audio output. Image data elements are stored in the storage and the processor is...
8694317 Methods and apparatus relating to searching of spoken audio data  
Methods for processing audio data containing speech to produce a searchable index file and for subsequently searching such an index file are provided. The processing method uses a phonetic...
8688447 Method and system for domain-specific noisy channel natural language processing (NLP)  
A method for processing transcriptions using natural language processing (NLP), the method includes obtaining transcriptions corresponding to an utterance from a user device, where each of the...
8688443 Multimodal augmented reality for location mobile information service  
In one or more embodiments, one or more methods and/or systems described can perform producing a lattice of object hypotheses based on multiple reference objects from image information; receiving...
8688445 Multi-core processing for parallel speech-to-text processing  
This specification describes technologies relating to multi core processing for parallel speech-to-text processing. In some implementations, a computer-implemented method is provided that includes...
8689251 Content recognition for targeting video advertisements  
Methods, systems, and apparatus, including computer program products, for providing advertisements. A plurality of advertisement targeting criteria is determined from a video stream or file. A...
8688446 Providing text input using speech data and non-speech data  
Systems, methods, and computer readable media providing a speech input interface. The interface can receive speech input and non-speech input from a user through a user interface. The speech input...
8682668 Language model score look-ahead value imparting device, language model score look-ahead value imparting method, and program storage medium  
A speech recognition apparatus that performs frame synchronous beam search by using a language model score look-ahead value prevents the pruning of a correct answer hypothesis while suppressing an...
8682661 Robust speech recognition  
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for recognizing speech input. In one aspect, a method includes receiving a user input and a...
8682672 Synchronous transcript display with audio/video stream in web cast environment  
A system and method is described that permits synchronization of a transcript with an audio/video stream of a webcast. The system also permits a user to perform a search of the transcript and then...
8682665 Concise dynamic grammars using N-best selection  
A method and apparatus derive a dynamic grammar composed of a subset of a plurality of data elements that are each associated with one of a plurality of reference identifiers. The present...
8682663 Performing speech recognition over a network and using speech recognition results based on determining that a network connection exists  
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating, distributing, and using speech recognition models are disclosed. The methods,...
8676572 Computer-implemented system and method for enhancing audio to individuals participating in a conversation  
A computer-implemented system and method for enhancing audio to individuals participating in a conversation is provided. Audio data for individuals participating in one or more conversations is...
8676579 Dual microphone voice authentication for mobile device  
A method of authenticating a user of a mobile device having a first microphone and a second microphone, the method comprising receiving voice input from the user at the first and second...
8675854 Multi-modal communications with conferencing and clients  
A system and method for merging multi-modal communications are disclosed. The multi-modal communications can be synchronous, asynchronous and semi-synchronous. By way of a non-limiting example, at...
8675638 Method and apparatus for enabling dual tone multi-frequency signal processing in the core voice over internet protocol network  
The invention provides a method and apparatus for enabling DTMF signal processing in the core VoIP network. More specifically, the present invention enables a VoIP network to recognize and respond...