Match Document Document Title
9043209 Language model creation device  
This device 301 stores a first content-specific language model representing a probability that a specific word appears in a word sequence representing a first content, and a second...
9037464 Computing numeric representations of words in a high-dimensional space  
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for computing numeric representations of words. One of the methods includes obtaining a set of...
9037460 Dynamic long-distance dependency with conditional random fields  
Dynamic features are utilized with CRFs to handle long-distance dependencies of output labels. The dynamic features present a probability distribution involved in explicit distance from/to a...
9037462 User intention based on N-best list of recognition hypotheses for utterances in a dialog  
Disclosed herein are systems, computer-implemented methods, and tangible computer-readable media for using alternate recognition hypotheses to improve whole-dialog understanding accuracy. The...
9031844 Full-sequence training of deep structures for speech recognition  
A method includes an act of causing a processor to access a deep-structured model retained in a computer-readable medium, the deep-structured model includes a plurality of layers with respective...
9026442 System and method for adapting automatic speech recognition pronunciation by acoustic model restructuring  
Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for recognizing speech by adapting automatic speech recognition pronunciation by acoustic model...
9020820 State detecting apparatus, communication apparatus, and storage medium storing state detecting program  
A state detecting apparatus includes: a processor to execute acquiring utterance data related to uttered speech, computing a plurality of statistical quantities for feature parameters regarding...
9020818 Format based speech reconstruction from noisy signals  
Implementations of systems, method and devices described herein enable enhancing the intelligibility of a target voice signal included in a noisy audible signal received by a hearing aid device or...
9015044 Formant based speech reconstruction from noisy signals  
Implementations of systems, method and devices described herein enable enhancing the intelligibility of a target voice signal included in a noisy audible signal received by a hearing aid device or...
9009039 Noise adaptive training for speech recognition  
Technologies are described herein for noise adaptive training to achieve robust automatic speech recognition. Through the use of these technologies, a noise adaptive training (NAT) approach may...
8972254 Turbo processing for speech recognition with local-scale and broad-scale decoders  
Environmental recognition systems may improve recognition accuracy by leveraging local and nonlocal features in a recognition target. A local decoder may be used to analyze local features, and a...
8972260 Speech recognition using multiple language models  
In accordance with one embodiment, a method of generating language models for speech recognition includes identifying a plurality of utterances in training data corresponding to speech, generating...
8964948 Method for setting voice tag  
A method for setting a voice tag is provided, which comprises the following steps. First, counting a number of phone calls performed between a user and a contact person. If the number of phone...
8959014 Training acoustic models using distributed computing techniques  
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training acoustic models. Speech data and data identifying a transcription for the speech...
8959022 System for media correlation based on latent evidences of audio  
A method for determining a relatedness between a query video and a database video is provided. A processor extracts an audio stream from the query video to produce a query audio stream, extracts...
8949127 Recognizing the numeric language in natural spoken dialogue  
A system and a method are provided. A speech recognition processor receives unconstrained input speech and outputs a string of words. The speech recognition processor is based on a numeric...
8949130 Internal and external speech recognition use with a mobile communication facility  
In embodiments of the present invention improved capabilities are described for a user interacting with a mobile communication facility, where speech presented by the user is recorded using a...
8949125 Annotating maps with user-contributed pronunciations  
Systems and methods are provided to select a most typical pronunciation of a location name on a map from a plurality of user pronunciations. A server generates a reference speech model based on...
8942975 Noise suppression in a Mel-filtered spectral domain  
Techniques are described herein that suppress noise in a Mel-filtered spectral domain. For example, a window may be applied to a representation of a speech signal in a time domain. The windowed...
8935170 Speech recognition  
A speech recognition system, according to an example embodiment, includes a data storage to store speech training data. A training engine determines consecutive breakout periods in the speech...
8930183 Voice conversion method and system  
A method of converting speech from the characteristics of a first voice to the characteristics of a second voice, the method comprising: receiving a speech input from a first voice, dividing said...
8924214 Radar microphone speech recognition  
A method for detecting and recognizing speech is provided that remotely detects body motions from a speaker during vocalization with one or more radar sensors. Specifically, the radar sensors...
8914292 Internal and external speech recognition use with a mobile communication facility ***WITHDRAWN PATENT AS PER THE LATEST USPTO WITHDRAWN LIST***
In embodiments of the present invention improved capabilities are described for a user interacting with a mobile communication facility, where speech presented by the user is recorded using a...
8914277 Speech and language translation of an utterance  
According to example configurations, a speech-processing system parses an uttered sentence into segments. The speech-processing system translates each of the segments in the uttered sentence into...
8909538 Enhanced interface for use with speech recognition  
Improved methods of presenting speech prompts to a user as part of an automated system that employs speech recognition or other voice input are described. The invention improves the user interface...
8909518 Frequency axis warping factor estimation apparatus, system, method and program  
A warping factor estimation system comprises label information generation unit that outputs voice/non-voice label information, warp model storage unit in which a probability model representing...
8888494 Interactive environment for performing arts scripts  
One or more embodiments present a script to a user in an interactive script environment. A digital representation of a manuscript is analyzed. This digital representation includes a set of roles...
8886533 System and method for combining frame and segment level processing, via temporal pooling, for phonetic classification  
Disclosed herein are systems, methods, and non-transitory computer-readable storage media for combining frame and segment level processing, via temporal pooling, for phonetic classification. A...
8880402 Automatically adapting user guidance in automated speech recognition  
A speech recognition method includes receiving input speech from a user, processing the input speech to obtain at least one parameter value, and determining an experience level of the user using...
8868423 System and method for controlling access to resources with a spoken CAPTCHA test  
Systems and methods for controlling access to resources using spoken Completely Automatic Public Turing Tests To Tell Humans And Computers Apart (CAPTCHA) tests are disclosed. In these systems and...
8856002 Distance metrics for universal pattern processing tasks  
A universal pattern processing system receives input data and produces output patterns that are best associated with said data. The system uses input means receiving and processing input data, a...
8849667 Method and apparatus for speech recognition  
A computer-implemented method, apparatus and computer program product. The computer-implemented method performed by a computerized device, comprising: transforming a hidden Markov model to qubits;...
8849663 Systems and methods for segmenting and/or classifying an audio signal from transformed audio information  
A system and method may be provided to segment and/or classify an audio signal from transformed audio information. Transformed audio information representing a sound may be obtained. The...
8843370 Joint discriminative training of multiple speech recognizers  
Adjusting model parameters is described for a speech recognition system that combines recognition outputs from multiple speech recognition processes. Discriminative adjustments are made to model...
8838433 Selection of domain-adapted translation subcorpora  
An architecture is discussed that provides the capability to subselect the most relevant data from an out-domain corpus to use either in isolation or in conjunction with in-domain...
8818802 Real-time data pattern analysis system and method of operation thereof  
A method for real-time data-pattern analysis. The method includes receiving and queuing at least one data-pattern analysis request by a data-pattern analysis unit controller. At least one data...
8812315 System and method for adapting automatic speech recognition pronunciation by acoustic model restructuring  
Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for recognizing speech by adapting automatic speech recognition pronunciation by acoustic model...
8812322 Semi-supervised source separation using non-negative techniques  
Systems and methods for semi-supervised source separation using non-negative techniques are described. In some embodiments, various techniques disclosed herein may enable the separation of signals...
8798990 Methods and systems for natural language understanding using human knowledge and collected data  
Disclosed herein are systems and methods to incorporate human knowledge when developing and using statistical models for natural language understanding. The disclosed systems and methods embrace a...
8793124 Speech processing method and apparatus for deciding emphasized portions of speech, and program therefor  
A scheme to judge emphasized speech portions, wherein the judgment is executed by a statistical processing in terms of a set of speech parameters including a fundamental frequency, power and a...
8793137 Method for processing the output of a speech recognizer  
A method for processing speech, comprising semantically parsing a received natural language speech input with respect to a plurality of predetermined command grammars in an automated speech...
8788256 Multiple language voice recognition  
Computer implemented speech processing generates one or more pronunciations of an input word in a first language by a non-native speaker of the first language who is a native speaker of a second...
8775177 Speech recognition process  
A speech recognition process may perform the following operations: performing a preliminary recognition process on first audio to identify candidates for the first audio; generating first...
8774261 Soft linear and non-linear interference cancellation  
A two stage interference cancellation (IC) process includes a linear IC stage that suppresses co-channel interference (CCI) and adjacent channel interference (ACI). The linear IC stage...
8775179 Speech-based speaker recognition systems and methods  
The illustrative embodiments described herein provide systems and methods for authenticating a speaker. In one embodiment, a method includes receiving reference speech input including a reference...
8768706 Content-based audio playback emphasis  
Techniques are disclosed for facilitating the process of proofreading draft transcripts of spoken audio streams. In general, proofreading of a draft transcript is facilitated by playing back the...
8762148 Reference pattern adaptation apparatus, reference pattern adaptation method and reference pattern adaptation program  
A method and apparatus for carrying out adaptation using input speech data information even at a low reference pattern recognition performance. A reference pattern adaptation device 2 includes a...
8756062 Male acoustic model adaptation based on language-independent female speech data  
A method of generating proxy acoustic models for use in automatic speech recognition includes training acoustic models from speech received via microphone from male speakers of a first language,...
8744855 Determining reading levels of electronic books  
Architectures and techniques are described to determine a reading level of an electronic book. In particular, words, phrases, clauses, and parts of speech of an electronic book may be tagged and...
8744849 Microphone-array-based speech recognition system and method  
A microphone-array-based speech recognition system combines a noise cancelling technique for cancelling noise of input speech signals from an array of microphones, according to at least an...