Match Document Document Title
9043206 System and methods for matching an utterance to a template hierarchy  
A system and methods for matching at least one word of an utterance against a set of template hierarchies to select the best matching template or set of templates corresponding to the utterance....
9037469 Automated communication integrator  
An apparatus includes a plurality of applications and an integrator having a voice recognition module configured to identify at least one voice command from a user. The integrator is configured to...
9026442 System and method for adapting automatic speech recognition pronunciation by acoustic model restructuring  
Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for recognizing speech by adapting automatic speech recognition pronunciation by acoustic model...
9020816 Hidden Markov model for speech processing with training method  
A method, system and apparatus are shown for identifying non-language speech sounds in a speech or audio signal. An audio signal is segmented and feature vectors are extracted from the segments of...
9020820 State detecting apparatus, communication apparatus, and storage medium storing state detecting program  
A state detecting apparatus includes: a processor to execute acquiring utterance data related to uttered speech, computing a plurality of statistical quantities for feature parameters regarding...
9020818 Format based speech reconstruction from noisy signals  
Implementations of systems, methods and devices described herein enable enhancing the intelligibility of a target voice signal included in a noisy audible signal received by a hearing aid device or...
9015044 Formant based speech reconstruction from noisy signals  
Implementations of systems, methods and devices described herein enable enhancing the intelligibility of a target voice signal included in a noisy audible signal received by a hearing aid device or...
9009040 Training a transcription system  
According to certain embodiments, training a transcription system includes accessing recorded voice data of a user from one or more sources. The recorded voice data comprises voice samples. A...
9009045 Model-driven candidate sorting  
Methods and systems for model-driven candidate sorting for evaluating digital interviews are described. In one embodiment, a model-driven candidate-sorting tool selects a data set of digital...
9002708 Speech recognition system and method based on word-level candidate generation  
A speech recognition system and method based on word-level candidate generation are provided. The speech recognition system may include a speech recognition result verifying unit to verify a word...
9002703 Community audio narration generation  
The community-based generation of audio narrations for a text-based work leverages collaboration of a community of people to provide human-voiced audio readings. During the community-based...
8996371 Method and system for automatic domain adaptation in speech recognition applications  
A system and method for adapting a language model to a specific environment by receiving interactions captured in the specific environment, generating a collection of documents from documents...
8996372 Using adaptation data with cloud-based speech recognition  
Speech recognition may be improved using data derived from an utterance. In some embodiments, audio data is received by a user device. Adaptation data may be retrieved from a data store accessible...
8996373 State detection device and state detecting method  
A state detection device includes: a first model generation unit to generate a first specific speaker model obtained by modeling speech features of a specific speaker in an undepressed state; a...
8996376 Intelligent text-to-speech conversion  
Techniques for improved text-to-speech processing are disclosed. The improved text-to-speech processing can convert text from an electronic document into an audio output that includes speech...
8996380 Methods and systems for synchronizing media  
Systems and methods of synchronizing media are provided. A client device may be used to capture a sample of a media stream being rendered by a media rendering source. The client device sends the...
8990084 Method of active learning for automatic speech recognition  
State-of-the-art speech recognition systems are trained using transcribed utterances, preparation of which is labor-intensive and time-consuming. The present invention is an iterative method for...
8983836 Captioning using socially derived acoustic profiles  
Mechanisms for performing dynamic automatic speech recognition on a portion of multimedia content are provided. Multimedia content is segmented into homogeneous segments of content with regard to...
8977551 Parametric speech synthesis method and system  
The present invention provides a parametric speech synthesis method and a parametric speech synthesis system. The method comprises sequentially processing each frame of speech of each phone in a...
8972259 System and method for teaching non-lexical speech effects  
A method and system for teaching non-lexical speech effects includes delexicalizing a first speech segment to provide a first prosodic speech signal and data indicative of the first prosodic...
8972260 Speech recognition using multiple language models  
In accordance with one embodiment, a method of generating language models for speech recognition includes identifying a plurality of utterances in training data corresponding to speech, generating...
8972258 Sparse maximum a posteriori (MAP) adaptation  
Techniques disclosed herein include using a Maximum A Posteriori (MAP) adaptation process that imposes sparseness constraints to generate acoustic parameter adaptation data for specific users...
8965761 Differential dynamic content delivery with text display in dependence upon simultaneous speech  
Differential dynamic content delivery including providing a session document for a presentation, wherein the session document includes a session grammar and a session structured document;...
8959019 Efficient empirical determination, computation, and use of acoustic confusability measures  
Efficient empirical determination, computation, and use of an acoustic confusability measure comprises: (1) an empirically derived acoustic confusability measure, comprising a means for...
8949125 Annotating maps with user-contributed pronunciations  
Systems and methods are provided to select a most typical pronunciation of a location name on a map from a plurality of user pronunciations. A server generates a reference speech model based on...
8947220 Speech recognition functionality in a vehicle through an extrinsic device  
Speech recognition in a vehicle through an extrinsic device includes detecting, via the vehicle, a presence of a mobile communications device that is configured with a speech recognition...
8935167 Exemplar-based latent perceptual modeling for automatic speech recognition  
Methods, systems, and computer-readable media related to selecting observation-specific training data (also referred to as “observation-specific exemplars”) from a general training corpus, and...
8924213 Detecting potential significant errors in speech recognition results  
In some embodiments, the recognition results produced by a speech processing system (which may include two or more recognition results, including a top recognition result and one or more...
8918408 Candidate generation for predictive input using input history  
A computing device maintains an input history in memory. This input history includes input strings that have been previously entered into the computing device. When the user begins entering...
8918318 Extended recognition dictionary learning device and speech recognition system  
Speech recognition of even a speaker who uses a speech recognition system is enabled by using an extended recognition dictionary suited to the speaker without requiring any previous learning using...
8914290 Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment  
Method and apparatus that dynamically adjusts operational parameters of a text-to-speech engine in a speech-based system. A voice engine or other application of a device provides a mechanism to...
8914283 System and method for unsupervised and active learning for automatic speech recognition  
A system and method is provided for combining active and unsupervised learning for automatic speech recognition. This process enables a reduction in the amount of human supervision required for...
8914278 Automatic context sensitive language correction and enhancement using an internet corpus  
A computer-assisted language correction system including spelling correction functionality, misused word correction functionality, grammar correction functionality and vocabulary enhancement...
8909528 Method and system for prompt construction for selection from a list of acoustically confusable items in spoken dialog systems  
A method (and system) of determining confusable list items and resolving this confusion in a spoken dialog system includes receiving user input, processing the user input and determining if a list...
8909534 Speech recognition training  
A method may include selecting, by a computing device, sets of two or more text candidates from a plurality of text candidates corresponding to vocal input. The method may further include for each...
8909529 Method and system for automatically detecting morphemes in a task classification system using lattices  
The invention concerns a method and corresponding system for building a phonotactic model for domain independent speech recognition. The method may include recognizing phones from a user's input...
8892437 Method and apparatus of providing semi-automated classifier adaptation for natural language processing  
Example embodiments of the present invention may include a method that provides transcribing spoken utterances occurring during a call and assigning each of the spoken utterances with a...
8892436 Front-end processor for speech recognition, and speech recognizing apparatus and method using the same  
A method of recognizing speech is provided. The method includes the operations of (a) dividing first speech that is input to a speech recognizing apparatus into frames; (b) converting the frames...
8886533 System and method for combining frame and segment level processing, via temporal pooling, for phonetic classification  
Disclosed herein are systems, methods, and non-transitory computer-readable storage media for combining frame and segment level processing, via temporal pooling, for phonetic classification. A...
8886535 Utilizing multiple processing units for rapid training of hidden Markov models  
A method of optimizing the calculation of matching scores between phone states and acoustic frames across a matrix of an expected progression of phone states aligned with an observed progression...
8886543 Frequency ratio fingerprint characterization for audio matching  
System and methods for characterizing interest points within a fingerprint are disclosed herein. The systems include generating a set of interest points and an anchor point related to an audio...
8886534 Speech recognition apparatus, speech recognition method, and speech recognition robot  
A speech recognition apparatus includes a speech input unit that receives input speech, a phoneme recognition unit that recognizes phonemes of the input speech and generates a first phoneme...
8880402 Automatically adapting user guidance in automated speech recognition  
A speech recognition method includes receiving input speech from a user, processing the input speech to obtain at least one parameter value, and determining an experience level of the user using...
8880397 Systems, devices and methods for list display and management  
Exemplary embodiments provide systems, devices and methods that allow creation and management of lists of items in an integrated manner on an interactive graphical user interface. A user may speak...
8868423 System and method for controlling access to resources with a spoken CAPTCHA test  
Systems and methods for controlling access to resources using spoken Completely Automatic Public Turing Tests To Tell Humans And Computers Apart (CAPTCHA) tests are disclosed. In these systems and...
8868410 Non-dialogue-based and dialogue-based learning apparatus by substituting for uttered words undefined in a dictionary with word-graphs comprising of words defined in the dictionary  
The invention provides a dialogue-based learning apparatus through dialogue with users comprising: a speech input unit (10) for inputting speeches; a speech recognition unit (20) for recognizing...
8862468 Leveraging back-off grammars for authoring context-free grammars  
A system and method of refining context-free grammars (CFGs). The method includes deriving back-off grammar (BOG) rules from an initially developed CFG and utilizing the initial CFG and the...
8856005 Location based responses to telephone requests  
A method for receiving processed information at a remote device is described. The method includes transmitting from the remote device a verbal request to a first information provider and receiving...
8856002 Distance metrics for universal pattern processing tasks  
A universal pattern processing system receives input data and produces output patterns that are best associated with said data. The system uses input means receiving and processing input data, a...
8849660 Training of voice-controlled television navigation  
Systems and methods for training voice activation control of electronic equipment are disclosed. One example method includes receiving a selection corresponding to at least one command used to...