Match Document Document Title
9037465 Automatic disclosure detection  
A method of detecting pre-determined phrases to determine compliance quality is provided. The method includes determining whether at least one of an event or a precursor event has occurred based...
9037462 User intention based on N-best list of recognition hypotheses for utterances in a dialog  
Disclosed herein are systems, computer-implemented methods, and tangible computer-readable media for using alternate recognition hypotheses to improve whole-dialog understanding accuracy. The...
9026437 Location determination system and mobile terminal  
A location determination system includes a first mobile terminal and a second mobile terminal. The first mobile terminal includes a first processor to acquire a first sound signal, analyze the...
9026442 System and method for adapting automatic speech recognition pronunciation by acoustic model restructuring  
Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for recognizing speech by adapting automatic speech recognition pronunciation by acoustic model...
9020816 Hidden Markov model for speech processing with training method  
A method, system and apparatus are shown for identifying non-language speech sounds in a speech or audio signal. An audio signal is segmented and feature vectors are extracted from the segments of...
9015044 Formant based speech reconstruction from noisy signals  
Implementations of systems, methods and devices described herein enable enhancing the intelligibility of a target voice signal included in a noisy audible signal received by a hearing aid device or...
9009041 Systems and methods for improving the accuracy of a transcription using auxiliary data such as personal data  
A method is described for improving the accuracy of a transcription generated by an automatic speech recognition (ASR) engine. A personal vocabulary is maintained that includes replacement words....
9009043 Pattern processing system specific to a user group  
Methods and apparatus for identifying a user group in connection with user group-based speech recognition. An exemplary method comprises receiving, from a user, a user group identifier that...
9008329 Noise reduction using multi-feature cluster tracker  
Provided are methods and systems for noise suppression within multiple time-frequency points of spectral representations. A multi-feature cluster tracker is used to track signal and noise sources...
8990081 Method of analysing an audio signal  
A method of analyzing an audio signal is disclosed. A digital representation of an audio signal is received and a first output function is generated based on a response of a physiological model to...
8990076 Front-end difference coding for distributed speech recognition  
In automated speech recognition (ASR), multiple devices may be employed to perform the ASR in a distributed environment. To reduce bandwidth use in transmitting between devices, ASR information is...
8990082 Non-scorable response filters for speech scoring systems  
A method for scoring non-native speech includes receiving a speech sample spoken by a non-native speaker and performing automatic speech recognition and metric extraction on the speech sample to...
8977547 Voice recognition system for registration of stable utterances  
A voice recognition system includes: a voice input unit 11 for inputting a voice uttered a plurality of times; a registering voice data storage unit 12 for storing voice data uttered the plurality...
8972416 Management of content items  
Disclosed are various embodiments of a content management application that facilitates a content management system. Content items that can include audio and/or video can be stored in the content...
8972259 System and method for teaching non-lexical speech effects  
A method and system for teaching non-lexical speech effects includes delexicalizing a first speech segment to provide a first prosodic speech signal and data indicative of the first prosodic...
8972260 Speech recognition using multiple language models  
In accordance with one embodiment, a method of generating language models for speech recognition includes identifying a plurality of utterances in training data corresponding to speech, generating...
8965762 Bimodal emotion recognition method and system utilizing a support vector machine  
A method is disclosed in the present disclosure for recognizing emotion by setting different weights to at least two kinds of unknown information, such as image and audio information, based on...
8959019 Efficient empirical determination, computation, and use of acoustic confusability measures  
Efficient empirical determination, computation, and use of an acoustic confusability measure comprises: (1) an empirically derived acoustic confusability measure, comprising a means for...
8948466 Biometric identification and verification  
In real biometric systems, false match rates and false non-match rates of 0% do not exist. There is always some probability that a purported match is false, and that a genuine match is not...
8942978 Parameter learning in a hidden trajectory model  
Parameters for distributions of a hidden trajectory model including means and variances are estimated using an acoustic likelihood function for observation vectors as an objective function for...
8938390 System and method for expressive language and developmental disorder assessment  
In one embodiment, a method for detecting autism in a natural language environment using a microphone, sound recorder, and a computer programmed with software for the specialized purpose of...
8924212 System and method for robust access and entry to large structured data using voice form-filling  
A method, apparatus and machine-readable medium are provided. A phonotactic grammar is utilized to perform speech recognition on received speech and to generate a phoneme lattice. A document...
8918406 Intelligent analysis queue construction  
A method of processing content files may include receiving the content file, employing processing circuitry to determine an identity score of a source of at least a portion of the...
8918319 Speech recognition device and speech recognition method using space-frequency spectrum  
In a speech recognition device and a speech recognition method, a key phrase containing at least one key word is received. The speech recognition method comprises steps: receiving a sound source...
8909522 Voice activity detector based upon a detected change in energy levels between sub-frames and a method of operation  
A voice activity detector (100) includes a frame divider (201) for dividing frames of an input signal into consecutive sub-frames, an energy level estimator (202) for estimating an energy level of...
8903724 Speech recognition device and method outputting or rejecting derived words  
A speech recognition device includes a speech recognition section that conducts a search, by speech recognition, on audio data stored in a first memory section to extract word-spoken portions...
8892424 Audio analysis terminal and system for emotion estimation of a conversation that discriminates utterance of a user and another person  
An audio analysis system includes a terminal apparatus and a host system. The terminal apparatus acquires an audio signal of a sound containing utterances of a user and another person,...
8892436 Front-end processor for speech recognition, and speech recognizing apparatus and method using the same  
A method of recognizing speech is provided. The method includes the operations of (a) dividing first speech that is input to a speech recognizing apparatus into frames; (b) converting the frames...
8886533 System and method for combining frame and segment level processing, via temporal pooling, for phonetic classification  
Disclosed herein are systems, methods, and non-transitory computer-readable storage media for combining frame and segment level processing, via temporal pooling, for phonetic classification. A...
8886532 Leveraging interaction context to improve recognition confidence scores  
On a computing device a speech utterance is received from a user. The speech utterance is a section of a speech dialog that includes a plurality of speech utterances. One or more features from the...
8880399 Utterance verification and pronunciation scoring by lattice transduction  
In the field of language learning systems, proper pronunciation of words and phrases is an integral aspect of language learning, determining the proximity of the language learner's pronunciation...
8856002 Distance metrics for universal pattern processing tasks  
A universal pattern processing system receives input data and produces output patterns that are best associated with said data. The system uses input means receiving and processing input data, a...
8843367 Adaptive equalization system  
An adaptive equalization system that adjusts the spectral shape of a speech signal based on an intelligibility measurement of the speech signal may improve the intelligibility of the output speech...
8831941 System and method for tracking fraudulent electronic transactions using voiceprints of uncommon words  
Disclosed are systems, methods, and computer readable media for comparing customer voice prints comprising uncommonly spoken words with a database of known fraudulent voice signatures and...
8825479 System and method for recognizing emotional state from a speech signal  
A computerized method, software, and system for recognizing emotions from a speech signal, wherein statistical and MFCC features are extracted from the speech signal, the MFCC features are sorted...
8812315 System and method for adapting automatic speech recognition pronunciation by acoustic model restructuring  
Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for recognizing speech by adapting automatic speech recognition pronunciation by acoustic model...
8812923 Error concealment for sub-band coded audio signals  
A decoder and method of decoding a sub-band coded digital audio signal. The decoder comprises: an input, for receiving sub-band coefficients for a plurality of sub-bands of the audio signal; an...
8812321 System and method for combining speech recognition outputs from a plurality of domain-specific speech recognizers via machine learning  
Disclosed herein are systems, methods and non-transitory computer-readable media for performing speech recognition across different applications or environments without model customization or...
8812291 Large language models in machine translation  
Systems, methods, and computer program products for machine translation are provided. In some implementations a system is provided. The system includes a language model including a collection of...
8788266 Language model creation device, language model creation method, and computer-readable storage medium  
The present invention uses a language model creation device 200 that creates a new language model using a standard language model created from standard language text. The language model creation...
8781825 Reducing false positives in speech recognition systems  
Embodiments of the present invention improve methods of performing speech recognition. In one embodiment, the present invention includes a method comprising receiving a spoken utterance,...
8768700 Voice search engine interface for scoring search hypotheses  
A system may receive a voice search query and may determine word hypotheses for the voice query. Each word hypothesis may include one or more terms. The system may obtain a search query log and...
8768706 Content-based audio playback emphasis  
Techniques are disclosed for facilitating the process of proofreading draft transcripts of spoken audio streams. In general, proofreading of a draft transcript is facilitated by playing back the...
8768692 Speech recognition method, speech recognition apparatus and computer program  
A speech recognition apparatus predicts, based on the occurrence cycle and duration time of impulse noise that occurs periodically, a segment in which impulse noise occurs, and executes speech...
8768697 Method for measuring speech characteristics  
In some embodiments, a method includes measuring a disparity between two speech samples by segmenting both a reference speech sample and a student speech sample into speech units. A duration...
8762147 Consonant-segment detection apparatus and consonant-segment detection method  
A signal portion is extracted from an input signal for each frame having a specific duration to generate a per-frame input signal. The per-frame input signal in a time domain is converted into a...
8762152 Speech recognition system interactive agent  
Methods and systems for performing speech recognition using an electronic interactive agent are disclosed. In embodiments of the invention, an electronic agent is presented in a form perceptible...
8751229 System and method for handling missing speech data  
Disclosed herein are systems, computer-implemented methods, and tangible computer-readable media for handling missing speech data. The computer-implemented method includes receiving speech with a...
8744845 Method for processing noisy speech signal, apparatus for same and computer-readable recording medium  
A noise estimation method for a noisy speech signal according to an embodiment of the present invention includes the steps of approximating a transformation spectrum by transforming an input noisy...
8737571 Methods and apparatus providing call quality testing  
A method, apparatus and computer readable medium for call quality testing is presented. A query is transmitted over a communications network from a first location to a second location. The query...