Match Document Document Title
7617106 Error detection for speech to text transcription systems  
A method, a system and a computer program product detects errors within text generated by a speech to text transcription system. The transcribed text is re-transformed into an artificial speech...
7617093 Authoring speech grammars  
A method and apparatus are provided for automatically forming a grammar. Example text strings are received and N-grams are formed based on the text strings. A rule in the grammar is then generated...
7606775 Mobile communication terminal using MOBP learning  
A scheduling apparatus and method of an intelligent mobile communication terminal are provided. The scheduling apparatus of the mobile communication terminal may include a Real Time Clock (RTC) for...
7599837 Creating a speech recognition grammar for alphanumeric concepts  
A method and system to generate a grammar adapted for use by a speech recognizer includes receiving a representation of an alphanumeric expression. For instance, the representation can take the...
7590941 Communication and collaboration system using rich media environments  
A system that enables communication and collaboration among individuals using rich media environments. A system according to the present techniques includes a set of rich media environments each...
7590536 Voice language model adjustment based on user affinity  
Methods, systems and computer readable medium for improving the accuracy of voice processing are provided. Embodiments of the present invention generally provide methods, systems and articles of...
7590533 New-word pronunciation learning using a pronunciation graph  
A method and computer-readable medium convert the text of a word and a user's pronunciation of the word into a phonetic description to be added to a speech recognition lexicon. Initially, a...
7590224 Automated task classification system  
The invention concerns an automated task classification system that operates on a task objective of a user. The system may include a meaningful phrase generator that generates a plurality of...
7587320 Automatic segmentation in speech synthesis  
Systems and methods for automatically segmenting speech inventories. A set of Hidden Markov Models (HMMs) are initialized using bootstrap data. The HMMs are next re-estimated and aligned to produce...
7587318 Correlating video images of lip movements with audio signals to improve speech recognition  
A speech recognition device can include an audio signal receiver configured to receive audio signals from a speech source, a video signal receiver configured to receive video signals from the...
7587317 Word training interface  
A method for exposing speech engine features to one or more independent applications wherein the features relate to word training and/or wherein the method optionally exposes the speech engine...
7584098 Vocabulary-independent search of spontaneous speech  
A method of identifying a location of a query string in an audio signal is provided. Under the method, a segment of the audio signal is selected. A score for a query string in the segment of the...
7584097 System and method for noisy automatic speech recognition employing joint compensation of additive and convolutive distortions  
A system for, and method of, noisy automatic speech recognition employing joint compensation of additive and convolutive distortions and a digital signal processor incorporating the system or the...
7574411 Low memory decision tree  
Management of a low memory treelike data structure is shown. The method according to the invention comprises steps for creating a decision tree including a parent node and at least one leaf node,...
7574357 Applications of sub-audible speech recognition based upon electromyographic signals  
Method and system for generating electromyographic or sub-audible signals (“SAWPs”) and for transmitting and recognizing the SAWPs that represent the original words and/or phrases. The SAWPs...
7574356 System and method for spelling recognition using speech and non-speech input  
A system and method for non-speech input or keypad-aided word and spelling recognition is disclosed. The method comprises performing spelling recognition via automatic speech recognition (ASR) on...
7567906 Systems and methods for generating an annotation guide  
Systems and methods for generating an annotation guide. Speech data is organized and presented to a user. After the user selects some of the utterances in the speech data, the selected utterances...
7567905 Method for identifying and verifying an element using a voice system  
An identification and verification method for identifying and verifying an element. The method uses a voice system. The method includes the steps of associating at least one of an at least 4 unit...
7567900 Harmonic structure based acoustic speech interval detection method and device  
A harmonic structure acoustic signal detection device not depending on the level fluctuation of the input signal including: an FFT unit which performs FFT on an input signal and calculates a power...
7567899 Methods and apparatus for audio recognition  
A method, apparatus and computer memory are provided for recognizing an audio fingerprint of an unknown audio recording. A database stores a plurality of audio recording identifiers corresponding...
7562019 Automated testing of voice recognition software  
A method and a system for testing a voice enabled application on a target device, the method including conducting one or more interactions with the target device, at least some of the interactions...
7562014 Active learning process for spoken dialog systems  
A large amount of human labor is required to transcribe and annotate a training corpus that is needed to create and update models for automatic speech recognition (ASR) and spoken language...
7562010 Generating confidence scores from word lattices  
Systems and methods for determining word confidence scores. Speech recognition systems generate a word lattice for speech input. Posterior probabilities of the words in the word lattice are...
7555426 Method and apparatus for dynamic grammars and focused semantic parsing  
The present invention provides a dialogue system in which semantic ambiguity is reduced by selectively choosing which semantic structures are to be made available for parsing based on previous...
7552050 Speech recognition system and method utilizing adaptive cancellation for talk-back voice  
A voice recognition system includes an adaptive filter and a subtractor. The adaptive filter generates a simulated talk-back voice y(n) by setting a filter coefficient simulating a transfer system...
7552049 Noise adaptation system of speech model, noise adaptation method, and noise adaptation program for speech recognition  
An object of the present invention is to enable optimal clustering for many types of noise data and to improve the accuracy of estimation of a speech model sequence of input speech. Noise is added...
7548863 Adaptive context sensitive analysis  
A method, apparatus, system, and signal-bearing medium that converts input data, such as speech or phonetic characters, to text by finding documents that are similar to the input data and using the...
7542902 Information provision for call centres  
A voice platform monitors a conversation between a call center agent and a caller to identify any predetermined keywords or phrases used in the conversation therebetween. These keywords or phrases...
7542900 Noise reduction using correction vectors based on dynamic aspects of speech and noise normalization  
A method and apparatus are provided for reducing noise in a signal. Under one aspect of the invention, a correction vector is selected based on a noisy feature vector that represents a noisy...
7536303 Audio restoration apparatus and audio restoration method  
An audio restoration apparatus is provided which restores an audio to be restored having a missing audio part and being included in a mixed audio. The audio restoration apparatus includes: a mixed...
7533023 Intermediary speech processor in network environments transforming customized speech parameters  
A speech processing system is provided for customizing speech parameters across speech applications in a networked environment. The speech processing system includes: a speech processing...
7533014 Method and system for concurrent use of two or more closely coupled communication recognition modalities  
A method and system are provided in which a speech recognition system and one or more other input modalities are run in parallel. The system is especially useful in easing the Chinese input...
7529676 Audio device control device, audio device control method, and program  
A language analyzer 2 performs speech recognition on a speech input by a speech input unit 1 , specifies a possible word which is represented by the speech, and the score thereof, and supplies...
7529670 Automatic speech recognition system for people with speech-affecting disabilities  
A speech recognition system is provided that, in one embodiment, includes an input 104 operable to receive voice utterances from a user, a speaker monitoring agent 132 operable to determine a...
7529669 Voice-based multimodal speaker authentication using adaptive training and applications thereof  
A voice based multimodal speaker authentication method and telecommunications application thereof employing a speaker adaptive method for training phenome specific Gaussian mixture models. Applied...
7529666 Minimum bayes error feature selection in speech recognition  
In connection with speech recognition, the design of a linear transformation θε p×n , of rank p×n, which projects the features of a classifier xε n onto y=θxε p such as to achieve minimum...
7529665 Two stage utterance verification device and method thereof in speech recognition system  
A two stage utterance verification device and a method thereof are provided. The two stage utterance verification method includes performing a first utterance verification function based on a SVM...
7526431 Speech recognition using ambiguous or phone key spelling and/or filtering  
Alphabetic filtering of the speech recognition of words uses a key press to indicate a desired character in an alphabetic filter string, where each key press represents two or more letters. The key...
7519536 System and method for providing network coordinated conversational services  
A system and method for providing automatic and coordinated sharing of conversational resources, e.g., functions and arguments, between network-connected servers and devices and their corresponding...
7516077 Voice control system  
A voice control system receives a voice command from a user via a microphone. A command executability determination circuit determined whether the voice command is executable in the current...
7516068 Optimized collection of audio for speech recognition  
A method for audio collection and speech recognition includes providing a plurality of listening devices, the listening devices including at least one microphone capsule, a signal pre-amplifier, a...
7516067 Method and apparatus using harmonic-model-based front end for robust speech recognition  
A system and method are provided that reduce noise in speech signals. The system and method decompose a noisy speech signal into a harmonic component and a residual component. The harmonic...
7505909 Device control device and device control method  
A language analyzer ( 2 ) performs speech recognition on a speech input by a speech input unit ( 1 ), specifies a possible word which is represented by the speech, and the score thereof, and...
7505903 Speech recognition dictionary creation method and speech recognition dictionary creating device  
A speech recognition dictionary creation method is provided for creating a speech recognition dictionary that is used for creating document data such as electronic mails through voice input in an...
7505902 Discrimination of components of audio signals based on multiscale spectro-temporal modulations  
An audio signal ( 172 ) representative of an acoustic signal is provided to an auditory model ( 105 ). The auditory model ( 105 ) produces a high-dimensional feature set based on physiological...
7505901 Intelligent acoustic microphone fronted with speech recognizing feedback  
Voice command operated systems are being installed in modern motor vehicles with increasing frequency. Such systems should be operable by various vehicle occupants and from various seating...
7502731 System and method for performing speech recognition by utilizing a multi-language dictionary  
The present invention comprises a system and method for speech recognition utilizing a multi-language dictionary, and may include a recognizer that is configured to compare input speech data to a...
7502632 Text messaging device  
A hand-held wireless communication device for creating and sending text messages including ideograms, said communication device including: an input interface for a user to make a phonetic input;...
7496501 System and method for identifying base noun phrases  
A system and method identify base noun phrases (baseNP) in a linguistic input. A part-of-speech tagger identifies N-best part-of-speech tag sequences corresponding to the linguistic input. A baseNP...
7493257 Method and apparatus handling speech recognition errors in spoken dialogue systems  
To handle portions of a recognized sentence having an error, a user is questioned about contents associated with portions. According to a user's answer, a result is obtained. Speech recognition...