| Match | Document | Document Title |
|
|
7403896 |
Speech recognition system and program thereof
Speech recognition is performed by matching between a characteristic quantity of an inputted speech and a composite HMM obtained by synthesizing a speech HMM (hidden Markov model) and a noise HMM...
|
|
|
7401019 |
Phonetic fragment search in speech data
A method of searching audio data is provided including receiving a query defining multiple phonetic possibilities. The method also includes comparing the query with a lattice of phonetic hypotheses...
|
|
|
7398210 |
System and method for performing analysis on word variants
A computer-readable medium stores a first lexicon data structure for lexicon words. The first data structure includes a host form variant field containing a host form variant such as a clitic host...
|
|
|
7392185 |
Speech based learning/training system using semantic decoding
An intelligent query system for processing voice-based queries is disclosed, which uses a combination of both statistical and semantic based processing to identify the question posed by the user...
|
|
|
7392184 |
Arrangement of speaker-independent speech recognition
A method needed in speech recognition for forming a pronunciation model in a telecommunications system comprising at least one portable electronic device and server. The electronic device is...
|
|
|
7389230 |
System and method for classification of voice signals
A system and method for classifying a voice signal to one of a set of predefined categories, based upon a statistical analysis of features extracted from the voice signal. The system includes an...
|
|
|
7379870 |
Contextual filtering
The present invention relates to techniques for contextual filtering for improving an output of a speech recognizer. The techniques comprise receiving a representation of a speech utterance into a...
|
|
|
7373297 |
Automated speech recognition filter
An automated speech recognition filter is disclosed. The automated speech recognition filter device provides a speech signal to an automated speech platform that approximates an original speech...
|
|
|
7366673 |
Selective enablement of speech recognition grammars
A method for processing speech audio in a network connected client device can include selecting a speech grammar for use in a speech recognition system in the network connected client device;...
|
|
|
7346511 |
Method and apparatus for recognizing multiword expressions
Words of an input string are morphologically analyzed to identify their alternative base forms and parts of speech. The analyzed words of the input string are used to compile the input string into...
|
|
|
7346495 |
Method and system for building a domain specific statistical language model from rule based grammar specifications
A method and system providing a statistical representation from rule-based grammar specifications. The language model is generated by obtaining a statistical representation of a rule-based language...
|
|
|
7340390 |
Mobile communication terminal and method therefore
A method for organizing data records in a memory in a mobile telecommunication terminal is disclosed. The method comprises receiving a plurality of digits, which identify a subscriber terminal in a...
|
|
|
7340396 |
Method and apparatus for providing a speaker adapted speech recognition model set
Speech feature vectors (10) are provided and utilized to develop a corresponding estimated speaker dependent speech feature space model (20) (in one embodiment, it is not necessary that this...
|
|
|
7324941 |
Method and apparatus for discriminative estimation of parameters in maximum a posteriori (MAP) speaker adaptation condition and voice recognition method and apparatus including these
A method and apparatus for discriminative estimation of parameters in a maximum a posteriori (MAP) speaker adaptation condition, and a voice recognition apparatus having the apparatus and a voice...
|
|
|
7319960 |
Speech recognition method and system
A speech recognition system uses a phoneme counter to determine the length of a word to be recognized. The result is used to split a lexicon into one or more sub-lexicons containing only words...
|
|
|
7319964 |
Method and apparatus for segmenting a multi-media program based upon audio events
The present invention provides for a method and apparatus for segmenting a multi-media program based upon audio events. In an embodiment a method of classifying an audio stream is provided. This...
|
|
|
7319959 |
Multi-source phoneme classification for noise-robust automatic speech recognition
A system and method are disclosed for processing an audio signal including separating the audio signal into a plurality of streams which group sounds from a same source prior to classification and...
|
|
|
7318032 |
Speaker recognition method based on structured speaker modeling and a “Pickmax” scoring technique
A technique for improved score calculation and normalization in a framework of recognition with phonetically structured speaker models. The technique involves determining, for each frame and each...
|
|
|
7313526 |
Speech recognition using selectable recognition modes
The present invention relates to speech recognition using selectable recognition modes. This includes innovations such as: large vocabulary speech recognition programming that supplies recognized...
|
|
|
7310601 |
Speech recognition apparatus and speech recognition method
The present invention provides a speech recognition apparatus which appropriately performs speech recognition by generating, in real time, language models adapted to a new topic even in the case...
|
|
|
7308404 |
Method and apparatus for speech recognition using a dynamic vocabulary
A method and apparatus are provided for performing speech recognition using a dynamic vocabulary. Results from a preliminary speech recognition pass can be used to update or refine a language model...
|
|
|
7299180 |
Name entity extraction using language models
A name entity extraction technique using language models is provided. A general language model is provided for the natural language understanding domain. A language model is also provided for each...
|
|
|
7295979 |
Language context dependent data labeling
Bootstrapping of a system from one language to another often works well when the two languages share the similar acoustic space. However, when the new language has sounds that do not occur in the...
|
|
|
7289958 |
Automatic language independent triphone training using a phonetic table
A method for training acoustic models for a new target language is provided using a phonetic table, which characterizes the phones, used in one or more reference language(s) with respect to their...
|
|
|
7283959 |
Compact easily parseable binary format for a context-free grammar
A computer-loadable data structure is provided that represents a state-and-transition-based description of a speech grammar. The data structure includes first and second transition entries that...
|
|
|
7280966 |
Electronic mail replies with speech recognition
A method for responding to an electronic mail message with a limited input device such as a phone includes audibly rendering the question and a set of proposed answers typically provided in the...
|
|
|
7280967 |
Method for detecting misaligned phonetic units for a concatenative text-to-speech voice
A method of filtering phonetic units to be used within a concatenative text-to-speech (CTTS) voice. Initially, a normality threshold can be established. At least one phonetic unit that has been...
|
|
|
7275033 |
Method and system for using rule-based knowledge to build a class-based domain specific statistical language model
A method and system for providing a class-based statistical language model representation from rule-based knowledge is disclosed. The class-based language model is generated from a statistical...
|
|
|
7272558 |
Speech recognition training method for audio and video file indexing on a search engine
A method and a related system to index audio and video documents and to automatically train the language model of a speech recognition system according to the context of the documents being indexed.
|
|
|
7263484 |
Phonetic searching
An improved method and apparatus is disclosed which uses probabilistic techniques to map an input search string with a prestored audio file, and recognize certain portions of a search string...
|
|
|
7260531 |
Interactive system, method, and program performing data search using pronunciation distance and entropy calculations
Disclosed are an interactive system and method of controlling the same for achieving a task more efficiently. Items of data to be searched are stored in a memory (107) in a form classified...
|
|
|
7249017 |
Speech recognition with score calculation
In order to prevent degradation of speech recognition accuracy due to an unknown word, a dictionary database has stored therein a word dictionary in which are stored, in addition to words for the...
|
|
|
7240004 |
Systems and methods for determining the determinizability of finite-state automata and transducers
Finite-state transducers and weighted finite-state automata may not be determinizable. The twins property can be used to characterize the determinizability of such devices. For a weighted...
|
|
|
7240008 |
Speech recognition system, program and navigation system
Voice of a user is inputted to a speech recognition section until a start of a no-voice domain from depression of a talk-switch. LPC cepstrum coefficients are calculated from the voice in an LPC...
|
|
|
7231019 |
Automatic identification of telephone callers based on voice characteristics
A method and apparatus are provided for identifying a caller of a call from the caller to a recipient. A voice input is received from the caller, and characteristics of the voice input are applied...
|
|
|
7228277 |
Mobile communications terminal, voice recognition method for same, and record medium storing program for voice recognition
A voice input section receives voice of the user designating a name etc. and outputs a voice signal to a speech recognition section. The speech recognition section analyzes and recognizes the voice...
|
|
|
7225130 |
Methods, systems, and programming for performing speech recognition
The present invention relates to: speech recognition using selectable recognition modes; using choice lists in large-vocabulary speech recognition; enabling users to select word transformations;...
|
|
|
7203644 |
Automating tuning of speech recognition systems
Embodiments of a speech recognition system are disclosed. The system includes at least one recognizer to produce output signals from audio input signals and a feedback module to collect feedback...
|
|
|
7197457 |
Method for statistical language modeling in speech recognition
A system for generating language modeling data for a speech recognition system includes an expression extractor to extract expression from domain-specific data of an existing domain using a base of...
|
|
|
7194410 |
Generation of a reference-model directory for a voice-controlled communications device
For a voice recognition system in a voice-controlled communication appliance, command words from a vocabulary are entered in text form and are transmitted to a separate converter station via a...
|
|
|
7155392 |
Context free grammar engine for speech recognition system
The present invention includes a context-free grammar (CFG) engine which communicates through an exposed interface with a speech recognition engine. The context-free grammar engine, in one...
|
|
|
7149688 |
Multi-lingual speech recognition with cross-language context modeling
An approach to multi-lingual speech recognition that permits different words in an utterance to be from different languages. Words from different languages are represented using different sets of...
|
|
|
7149689 |
Two-engine speech recognition
A speech recognition system comprises exactly two automated speech recognition (ASR) engines connected to receive the same inputs. Each engine produces a recognition output, a hypothesis. The...
|
|
|
7139708 |
System and method for speech recognition using an enhanced phone set
A system and method for speech recognition using an enhanced phone set comprises speech data, an enhanced phone set, and a transcription generated by a transcription process. The transcription...
|
|
|
7139688 |
Method and apparatus for classifying unmarked string substructures using Markov Models
A technique for structurally classifying substructures of at least one unmarked string utilizing at least one training data set with inserted markers identifying labeled substructures. A model of...
|
|
|
7136815 |
Method for voice recognition
A method improves voice recognition by improving storage of voice recognition (VR) templates. The improved storage means that more VR models can be stored in memory. The more VR models that are...
|
|
|
7127394 |
Assigning meanings to utterances in a speech recognition system
Assigning meanings to spoken utterances in a speech recognition system. A plurality of speech rules is generated, each of the speech rules comprising a language model and an expression...
|
|
|
7124081 |
Method and apparatus for speech recognition using latent semantic adaptation
A method and apparatus for speech recognition using latent semantic adaptation is described herein. According to one aspect of the present invention, a method for recognizing speech comprises using...
|
|
|
7120582 |
Expanding an effective vocabulary of a speech recognition system
The invention provides techniques for creating and using fragmented word models to increase the effective size of an active vocabulary of a speech recognition system. The active vocabulary...
|
|
|
7120580 |
Method and apparatus for recognizing speech in a noisy environment
An apparatus and a concomitant method for speech recognition. In one embodiment, the present method is referred to as a “Dynamic Noise Compensation” (DNC) method where the method estimates the...
|