Matches 1 - 50 out of 432 1 2 3 4 5 6 7 8 9 >


Match Document Document Title
9026446 System for generating captions for live video broadcasts  
An adaptive workflow system can be used to implement captioning projects, such as projects for creating captions or subtitles for live and non-live broadcasts. Workers can repeat words spoken...
9026441 Spoken control for user construction of complex behaviors  
A device interface system is presented. Contemplated device interfaces allow for construction of complex device behaviors by aggregating device functions. The behaviors are triggered based on...
9026442 System and method for adapting automatic speech recognition pronunciation by acoustic model restructuring  
Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for recognizing speech by adapting automatic speech recognition pronunciation by acoustic model...
9020818 Format based speech reconstruction from noisy signals  
Implementations of systems, method and devices described herein enable enhancing the intelligibility of a target voice signal included in a noisy audible signal received by a hearing aid device or...
9015044 Formant based speech reconstruction from noisy signals  
Implementations of systems, method and devices described herein enable enhancing the intelligibility of a target voice signal included in a noisy audible signal received by a hearing aid device or...
9015045 Method for refining a search  
A method for refining a search is provided. Embodiments may include receiving a first speech signal corresponding to a first utterance and receiving a second speech signal corresponding to a...
9009041 Systems and methods for improving the accuracy of a transcription using auxiliary data such as personal data  
A method is described for improving the accuracy of a transcription generated by an automatic speech recognition (ASR) engine. A personal vocabulary is maintained that includes replacement words....
8996371 Method and system for automatic domain adaptation in speech recognition applications  
A system and method for adapting a language model to a specific environment by receiving interactions captured the specific environment, generating a collection of documents from documents...
8996372 Using adaptation data with cloud-based speech recognition  
Speech recognition may be improved using data derived from an utterance. In some embodiments, audio data is received by a user device. Adaptation data may be retrieved from a data store accessible...
8996373 State detection device and state detecting method  
A state detection device includes: a first model generation unit to generate a first specific speaker model obtained by modeling speech features of a specific speaker in an undepressed state; a...
8996380 Methods and systems for synchronizing media  
Systems and methods of synchronizing media are provided. A client device may be used to capture a sample of a media stream being rendered by a media rendering source. The client device sends the...
8990080 Techniques to normalize names efficiently for name-based speech recognition grammars  
Techniques to normalize names for name-based speech recognition grammars are described. Some embodiments are particularly directed to techniques to normalize names for name-based speech...
8990085 System and method for handling repeat queries due to wrong ASR output by modifying an acoustic, a language and a semantic model  
Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for handling expected repeat speech queries or other inputs. The method causes a computing device to...
8983836 Captioning using socially derived acoustic profiles  
Mechanisms for performing dynamic automatic speech recognition on a portion of multimedia content are provided. Multimedia content is segmented into homogeneous segments of content with regard to...
8972261 Computer-implemented system and method for voice transcription error reduction  
A computer-implemented system and method for voice transcription error reduction is provided. Speech utterances are obtained from a voice stream and each speech utterance is associated with a...
8972260 Speech recognition using multiple language models  
In accordance with one embodiment, a method of generating language models for speech recognition includes identifying a plurality of utterances in training data corresponding to speech, generating...
8964948 Method for setting voice tag  
A method for setting a voice tag is provided, which comprises the following steps. First, counting a number of phone calls performed between a user and a contact person. If the number of phone...
8965763 Discriminative language modeling for automatic speech recognition with a weak acoustic model and distributed training  
Training data from a plurality of utterance-to-text-string mappings of an automatic speech recognition (ASR) system may be selected. Parameters of the ASR system that characterize the utterances...
8954325 Speech recognition in automated information services systems  
The present invention allows feedback from operator workstations to be used to update databases used for providing automated information services. When an automated process fails, recorded speech...
8953889 Object datastore in an augmented reality environment  
An augmented reality environment allows interaction between virtual and real objects and enhances an unstructured real-world environment. An object datastore comprising attributes of an object...
8949130 Internal and external speech recognition use with a mobile communication facility  
In embodiments of the present invention improved capabilities are described for a user interacting with a mobile communication facility, where speech presented by the user is recorded using a...
8949126 Creating statistical language models for spoken CAPTCHAs  
Methods for creating statistical language models (SLMs) for spoken Completely Automated Turing Tests for Telling Computers and Humans Apart (CAPTCHAs) are disclosed. In these methods, candidate...
8938392 Configuring a speech engine for a multimodal application based on location  
Methods, apparatus, and products are disclosed for configuring a speech engine for a multimodal application based on location. The multimodal application operates on a multimodal device supporting...
8935167 Exemplar-based latent perceptual modeling for automatic speech recognition  
Methods, systems, and computer-readable media related to selecting observation-specific training data (also referred to as “observation-specific exemplars”) from a general training corpus, and...
8918318 Extended recognition dictionary learning device and speech recognition system  
Speech recognition of even a speaker who uses a speech recognition system is enabled by using an extended recognition dictionary suited to the speaker without requiring any previous learning using...
8914286 Speech recognition with hierarchical networks  
Provided are systems and methods for using hierarchical networks for recognition, such as speech recognition. Conventional automatic recognition systems may not be both efficient and flexible....
8914292 ***WITHDRAWN PATENT AS PER THE LATEST USPTO WITHDRAWN LIST***
Internal and external speech recognition use with a mobile communication facility
 
In embodiments of the present invention improved capabilities are described for a user interacting with a mobile communication facility, where speech presented by the user is recorded using a...
8909529 Method and system for automatically detecting morphemes in a task classification system using lattices  
The invention concerns a method and corresponding system for building a phonotactic mode for domain independent speech recognition. The method may include recognizing phones from a user's input...
8892436 Front-end processor for speech recognition, and speech recognizing apparatus and method using the same  
A method of recognizing speech is provided. The method includes the operations of (a) dividing first speech that is input to a speech recognizing apparatus into frames; (b) converting the frames...
8886540 Using speech recognition results based on an unstructured language model in a mobile communication facility application  
A method and system for entering information into a software application resident on a mobile communication facility is provided. The method and system may include recording speech presented by a...
8886535 Utilizing multiple processing units for rapid training of hidden markov models  
A method of optimizing the calculation of matching scores between phone states and acoustic frames across a matrix of an expected progression of phone states aligned with an observed progression...
8886534 Speech recognition apparatus, speech recognition method, and speech recognition robot  
A speech recognition apparatus includes a speech input unit that receives input speech, a phoneme recognition unit that recognizes phonemes of the input speech and generates a first phoneme...
8880495 Search query expansion and group search  
Audio information is recorded in an overwriteable circular buffer of a computing device. Construction of a search query is initiated by receiving a user input. The user input includes one or more...
8868423 System and method for controlling access to resources with a spoken CAPTCHA test  
Systems and methods for controlling access to resources using spoken Completely Automatic Public Turing Tests To Tell Humans And Computers Apart (CAPTCHA) tests are disclosed. In these systems and...
8862468 Leveraging back-off grammars for authoring context-free grammars  
A system and method of refining context-free grammars (CFGs). The method includes deriving back-off grammar (BOG) rules from an initially developed CFG and utilizing the initial CFG and the...
8856002 Distance metrics for universal pattern processing tasks  
A universal pattern processing system receives input data and produces output patterns that are best associated with said data. The system uses input means receiving and processing input data, a...
8849664 Realtime acoustic adaptation using stability measures  
Methods, systems, and computer programs encoded on a computer storage medium for real-time acoustic adaptation using stability measures are disclosed. The methods include the actions of receiving...
8843367 Adaptive equalization system  
An adaptive equalization system that adjusts the spectral shape of a speech signal based on an intelligibility measurement of the speech signal may improve the intelligibility of the output speech...
8843371 Speech recognition adaptation systems based on adaptation data  
The instant application includes computationally-implemented systems and methods that include managing adaptation data, the adaptation data is at least partly based on at least one speech...
8838448 Forced/predictable adaptation for speech recognition  
A method is described for use with automatic speech recognition using discriminative criteria for speaker adaptation. An adaptation evaluation is performed of speech recognition performance data...
8818809 Methods and apparatus for generating, updating and distributing speech recognition models  
Techniques for generating, distributing, and using speech recognition models are described. A shared speech processing facility is used to support speech recognition for a wide variety of devices...
8812317 Signal processing apparatus capable of learning a voice command which is unsuccessfully recognized and method of recognizing a voice command thereof  
Provided are an apparatus and method for recognizing voice commands, the apparatus including: a voice command recognition unit which recognizes an input voice command; a voice command recognition...
8805684 Distributed speaker adaptation  
Automatic speech recognition (ASR) may be performed on received utterances. The ASR may be performed by an ASR module of a computing device (e.g., a client device). The ASR may include: generating...
8798995 Key word determinations from voice data  
Topics of potential interest to a user, useful for purposes such as targeted advertising and product recommendations, can be extracted from voice content produced by a user. A computing device can...
8798994 Resource conservative transformation based unsupervised speaker adaptation  
The present invention discloses a solution for conserving computing resources when implementing transformation based adaptation techniques. The disclosed solution limits the amount of speech data...
8781831 System and method for standardized speech recognition infrastructure  
Disclosed herein are systems, methods, and computer-readable storage media for selecting a speech recognition model in a standardized speech recognition infrastructure. The system receives speech...
8781821 Voiced interval command interpretation  
A method is disclosed for controlling a voice-activated device by interpreting a spoken command as a series of voiced and non-voiced intervals. A responsive action is then performed according to...
8775177 Speech recognition process  
A speech recognition process may perform the following operations: performing a preliminary recognition process on first audio to identify candidates for the first audio; generating first...
8775178 Updating a voice template  
Updating a voice template for recognizing a speaker on the basis of a voice uttered by the speaker is disclosed. Stored voice templates indicate distinctive characteristics of utterances from...
8768698 Methods and systems for speech recognition processing using search query information  
Methods and systems for speech recognition processing are described. In an example, a computing device may be configured to receive information indicative of a frequency of submission of a search...

Matches 1 - 50 out of 432 1 2 3 4 5 6 7 8 9 >