|
Match
|
Document |
Document Title |
|
|
8185400 |
System and method for isolating and processing common dialog cues
A method, system and machine-readable medium are provided. Speech input is received at a speech recognition component and recognized output is produced. A common dialog cue from the received speech...
|
|
|
8180641 |
Sequential speech recognition with two unequal ASR systems
Sequential speech recognition using two unequal automatic speech recognition (ASR) systems may be provided. The system may provide two sets of vocabulary data. A determination may be made as to...
|
|
|
8180640 |
Grapheme-to-phoneme conversion using acoustic data
Described is the use of acoustic data to improve grapheme-to-phoneme conversion for speech recognition, such as to more accurately recognize spoken names in a voice-dialing system. A joint model of...
|
|
|
8180147 |
Robust pattern recognition system and method using Socratic agents
A computer-implemented pattern recognition method, system and program product, the method comprising in one embodiment: creating electronically a linkage between a plurality of models within a...
|
|
|
8179289 |
Handheld electronic device with text disambiguation
A handheld electronic device includes a reduced QWERTY keyboard and is enabled with disambiguation software. The device provides output in the form of a default output and a number of variants. The...
|
|
|
8175865 |
Method and apparatus of generating text script for a corpus-based text-to speech system
A method of text script generation for a corpus-based text-to-speech system includes searching in a source corpus having L sentences, selecting N sentences with a best integrated efficiency as N...
|
|
|
8170866 |
System and method for increasing accuracy of searches based on communication network
Disclosed are systems, methods and computer-readable media for using a local communication network to generate a speech model. The method includes retrieving for an individual a list of numbers in...
|
|
|
8165870 |
Classification filter for processing data for creating a language model
The method and apparatus utilize a filter to remove a variety of non-dictated words from data based on probability and improve the effectiveness of creating a language model.
|
|
|
8155960 |
System and method for unsupervised and active learning for automatic speech recognition
A system and method is provided for combining active and unsupervised learning for automatic speech recognition. This process enables a reduction in the amount of human supervision required for...
|
|
|
8150694 |
System and method for providing an acoustic grammar to dynamically sharpen speech interpretation
The system and method described herein may provide an acoustic grammar to dynamically sharpen speech interpretation. In particular, the acoustic grammar may be used to map one or more phonemes...
|
|
|
8145485 |
Grammar weighting voice recognition information
A device receives a voice recognition statistic from a voice recognition application and applies a grammar improvement rule based on the voice recognition statistic. The device also automatically...
|
|
|
8135590 |
Position-dependent phonetic models for reliable pronunciation identification
A representation of a speech signal is received and is decoded to identify a sequence of position-dependent phonetic tokens wherein each token comprises a phone and a position indicator that...
|
|
|
8131544 |
System for distinguishing desired audio signals from noise
A system distinguishes a primary audio source and background noise to improve the quality of an audio signal. A speech signal from a microphone may be improved by identifying and dampening...
|
|
|
8126712 |
Information communication terminal, information communication system, information communication method, and storage medium for storing an information communication program thereof for recognizing speech information
An information communication terminal (100) that includes: a speech recognition module (6) for recognizing speech information to identify a plurality of words in the recognized speech information;...
|
|
|
8121837 |
Adjusting a speech engine for a mobile computing device based on background noise
Methods, apparatus, and products are disclosed for adjusting a speech engine for a mobile computing device based on background noise, the mobile computing device operatively coupled to a...
|
|
|
8117036 |
Non-disruptive side conversation information retrieval
Information is exchanged between a user of a communications device and an application during an ongoing conversation between the user using the communications device and a party, without disrupting...
|
|
|
8103502 |
Systems and methods for extracting meaning from multimodal inputs using finite-state devices
Multimodal utterances contain a number of different modes. These modes can include speech, gestures, and pen, haptic, and gaze inputs, and the like. This invention use recognition results from one...
|
|
|
8103503 |
Speech recognition for determining if a user has correctly read a target sentence string
Systems and methods for processing a user speech input to determine whether the user has correctly read a target sentence string are provided. One disclosed method may include receiving a sentence...
|
|
|
8099280 |
Speech recognition method and speech recognition apparatus
A speech recognition method in which, upon speech recognition with use of a model composed of subwords such as triphones depending on a plurality of context, inhibiting hypotheses from extending...
|
|
|
8082150 |
Method and apparatus for identifying an unknown work
A system for determining an identity of a received work. The system receives audio data for an unknown work. The audio data is divided into segments. The system generates a signature of the unknown...
|
|
|
8082148 |
Testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise
Methods, systems, and products for testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise that include: receiving...
|
|
|
8078467 |
Device and method for language model switching and adaptation
This invention provides a device and method for language model switching and adaptation, wherein the device comprises a notification manager which notifies a language model switching section of the...
|
|
|
8077975 |
Handwriting symbol recognition accuracy using speech input
Described is a bimodal data input technology by which handwriting recognition results are combined with speech recognition results to improve overall recognition accuracy. Handwriting data and...
|
|
|
8069045 |
Hierarchical approach for the statistical vowelization of Arabic text
The present invention relates to the field of computer-aided text and speech processing, and in particular to a method and respective system for converting an input text given in an incomplete...
|
|
|
8065147 |
Gramma generation for password recognition
A password grammar for speech recognition is described. A password is normalized into a list of strings of a plurality of character types such as letters and numerals. For each string of letters,...
|
|
|
8065144 |
Multilingual speech recognition
A method for speech recognition. The method uses a single pronunciation estimator to train acoustic phoneme models and recognize utterances from multiple languages. The method includes accepting...
|
|
|
8064573 |
Computer generated prompting
A method and apparatus for generating appropriate confirmatory prompts in a speech-enabled, interactive computer system. The method can be incorporated in an interactive voice response system that...
|
|
|
8050922 |
Voice recognition with dynamic filter bank adjustment based on speaker categorization
Voice recognition methods and systems are disclosed. A voice signal is obtained for an utterance of a speaker. The speaker is categorized as a male, female, or child and the categorization is used...
|
|
|
8046222 |
Segmenting words using scaled probabilities
Systems, methods, and apparatuses including computer program products for segmenting words using scaled probabilities. In one implementation, a method is provided. The method includes receiving a...
|
|
|
8046224 |
Speaker adaptation of vocabulary for speech recognition
A phonetic vocabulary for a speech recognition system is adapted to a particular speaker's pronunciation. A speaker can be attributed specific pronunciation styles, which can be identified from...
|
|
|
8032375 |
Using generic predictive models for slot values in language modeling
A generic predictive argument model that can be applied to a set of shot values to predict a target slot value is provided. The generic predictive argument model can predict whether or not a...
|
|
|
8014591 |
Robust pattern recognition system and method using socratic agents
A computer-implemented pattern recognition method, system and program product, the method comprising in one embodiment: creating electronically a linkage between a plurality of models within a...
|
|
|
8010358 |
Voice recognition with parallel gender and age normalization
Methods and apparatus for voice recognition are disclosed. A voice signal is obtained and two or more voice recognition analyses are performed on the voice signal. Each voice recognition analysis...
|
|
|
8000971 |
Discriminative training of multi-state barge-in models for speech processing
Disclosed are systems and methods for training a barge-in-model for speech processing in a spoken dialogue system comprising the steps of (1) receiving an input having at least one speech segment...
|
|
|
8000965 |
Information-processing device and method that attains speech-recognition to recognize data input via speech
An information-processing device and method that attains speech-recognition to recognize data input via speech. The information-processing device and method includes analyzing...
|
|
|
7996214 |
System and method of exploiting prosodic features for dialog act tagging in a discriminative modeling framework
Disclosed are a system and method for exploiting information in an utterance for dialog act tagging. An exemplary method includes receiving a user utterance, computing at periodic intervals at...
|
|
|
7991615 |
Grapheme-to-phoneme conversion using acoustic data
Described is the use of acoustic data to improve grapheme-to-phoneme conversion for speech recognition, such as to more accurately recognize spoken names in a voice-dialing system. A joint model of...
|
|
|
7983916 |
Sampling rate independent speech recognition
A sampling-rate-independent method of automated speech recognition (ASR). Speech energies of a plurality of codebooks generated from training data created at an ASR sampling rate are compared to...
|
|
|
7983910 |
Communicating across voice and text channels with emotion preservation
Communicating across channels with emotion preservation includes: receiving, by a processor in a communication device, a voice communication; analyzing, by the processor in the communication...
|
|
|
7969329 |
Handheld electronic device with text disambiguation
A handheld electronic device includes a reduced QWERTY keyboard and is enabled with disambiguation software. The device provides output in the form of a default output and a number of variants. The...
|
|
|
7970613 |
Method and system for Gaussian probability data bit reduction and computation
Use of runtime memory may be reduced in a data processing algorithm that uses one or more probability distribution functions. Each probability distribution function may be characterized by one or...
|
|
|
7966171 |
System and method for increasing accuracy of searches based on communities of interest
Disclosed are systems, methods and computer-readable media for using a local communication network to generate a speech model. The method includes retrieving for an individual a list of numbers in...
|
|
|
7962343 |
Method and system of building a grammar rule with baseforms generated dynamically from user utterances
A method (200) of building a grammar with baseforms generated dynamically from user utterances can include the steps of recording (205) a user utterance, generating (210) a baseform using the user...
|
|
|
7957971 |
System and method of spoken language understanding using word confusion networks
Word lattices that are generated by an automatic speech recognition system are used to generate a modified word lattice that is usable by a spoken language understanding module. In one embodiment,...
|
|
|
7957969 |
Systems and methods for building a native language phoneme lexicon having native pronunciations of non-native words derived from non-native pronunciatons
Systems and methods are provided for automatically building a native phonetic lexicon for a speech-based application trained to process a native (base) language, wherein the native phonetic lexicon...
|
|
|
7953598 |
Grammar weighting voice recognition information
A device receives a voice recognition statistic from a voice recognition application and applies a grammar improvement rule based on the voice recognition statistic. The device also automatically...
|
|
|
7949528 |
System and method for spelling recognition using speech and non-speech input
A system and method for non-speech input or keypad-aided word and spelling recognition is disclosed. The method includes generating an unweighted grammar, selecting a database of words, generating...
|
|
|
7949517 |
Dialogue system with logical evaluation for language identification in speech recognition
A method and device are provided for classifying at least two languages in an automatic dialogue system, which processes digitized speech input. At least one speech recognition method and at least...
|
|
|
7945445 |
Hybrid lexicon for speech recognition
Methods and apparatus for speech recognition based on a hidden Markov model are disclosed. A disclosed method of speech recognition is based on a hidden Markov model in which words to be recognized...
|
|
|
7941312 |
Dynamic mixed-initiative dialog generation in speech recognition
Disclosed are a method (500), apparatus (100) and computer program product for generating a mixed-initiative dialog to obtain information for dialog slots. A composite grammar dependent upon a set...
|