|
Match
|
Document |
Document Title |
|
|
7613612 |
Voice synthesizer of multi sounds
In a voice synthesizer, an envelope acquisition portion obtains a spectral envelope of a reference frequency spectrum of a given voice. A spectrum acquisition portion obtains a collective frequency...
|
|
|
7606710 |
Method for text-to-pronunciation conversion
A method for text-to-pronunciation conversion includes a process for searching grapheme-phoneme segments and a three-stage process of text-to-pronunciation conversion. This method looks for a...
|
|
|
7599838 |
Speech animation with behavioral contexts for application scenarios
Methods and systems, including computer program products, for speech animation. The system includes a speech animation server and one or more speech animation clients. The speech animation server...
|
|
|
7590540 |
Method and system for statistic-based distance definition in text-to-speech conversion
A method for distance definition in a text-to-speech conversion system by applying Gaussian Mixture Model (GMM) to a distance definition. According to an embodiment, the text that is to be...
|
|
|
7587320 |
Automatic segmentation in speech synthesis
Systems and methods for automatically segmenting speech inventories. A set of Hidden Markov Models (HMMs) are initialized using bootstrap data. The HMMs are next re-estimated and aligned to produce...
|
|
|
7584104 |
Method and system for training a text-to-speech synthesis system using a domain-specific speech database
A system, method and computer readable medium that trains a text-to-speech synthesis system for use in speech synthesis is disclosed. The method may include recording audio files of one or more...
|
|
|
7574360 |
Unit selection module and method of Chinese text-to-speech synthesis
A unit selection module for Chinese Text-to-Speech (TTS) synthesis includes a probabilistic context free grammar (PCFG) parser, a latent semantic indexing (LSI) module, and a modified...
|
|
|
7565293 |
Seamless hybrid computer human call service
A Voice User Interface is provided for interactively responding in a synthesized voice to a call from a human caller, a Text to Speech system by which text entered by an agent and interactive data...
|
|
|
7555433 |
Voice generator, method for generating voice, and navigation apparatus
A main controller feeds a spelling translator with a text item representing a place name stored in a map database. The spelling translator translates the spelling of the text item according to...
|
|
|
7546241 |
Speech synthesis method and apparatus, and dictionary generation method and apparatus
In a speech synthesis process, micro-segments are cut from acquired waveform data and a window function. The obtained micro-segments are re-arranged to implement a desired prosody, and superposed...
|
|
|
7519535 |
Frame erasure concealment in voice communications
A voice decoder configured to receive a sequence of frames, each of the frames having voice parameters. The voice decoder includes a speech generator that generates speech from the voice...
|
|
|
7502739 |
Intonation generation method, speech synthesis apparatus using the method and voice server
In generation of an intonation pattern of a speech synthesis, a speech synthesis system is capable of providing a highly natural speech and capable of reproducing speech characteristics of a...
|
|
|
7487093 |
Text structure for voice synthesis, voice synthesis method, voice synthesis apparatus, and computer program thereof
In a voice synthesis apparatus, by bounding a desired range of input text to be output by, e.g., a start tag “<morphing type=“emotion” start=“happy” end=“angry”>” and end tag...
|
|
|
7483832 |
Method and system for customizing voice translation of text to speech
A method and system of customizing voice translation of a text to speech includes digitally recording speech samples of a known speaker, correlating each of the speech samples with a standardized...
|
|
|
7472066 |
Automatic speech segmentation and verification using segment confidence measures
An automatic speech segmentation and verification system and method is disclosed, which has a known text script and a recorded speech corpus corresponding to the known text script. A speech unit...
|
|
|
7472065 |
Generating paralinguistic phenomena via markup in text-to-speech synthesis
Converting marked-up text into a synthesized stream includes providing marked-up text to a processor-based system, converting the marked-up text into a text stream including vocabulary items,...
|
|
|
7472061 |
Systems and methods for building a native language phoneme lexicon having native pronunciations of non-native words derived from non-native pronunciations
Systems and methods are provided for automatically building a native phonetic lexicon for a speech-based application trained to process a native (base) language, wherein the native phonetic lexicon...
|
|
|
7464034 |
Voice converter for assimilation by frame synthesis with temporal alignment
A voice converting apparatus is constructed for converting an input voice into an output voice according to a target voice. The apparatus includes a storage section, an analyzing section including...
|
|
|
7460997 |
Method and system for preselection of suitable units for concatenative speech
A system and method for improving the response time of text-to-speech synthesis utilizes “triphone contexts” (i.e., triplets comprising a central phoneme and its immediate context) as the basic...
|
|
|
7454348 |
System and method for blending synthetic voices
A system and method for generating a synthetic text-to-speech (TTS) voice are disclosed. A user is presented with at least one TTS voice and at least one voice characteristic. A new synthetic TTS...
|
|
|
7454345 |
Word or collocation emphasizing voice synthesizer
A voice synthesizer, which obtains a voice by emphasizing a specific part of a sentence, includes an emphasis degree deciding unit that extracts a word or a collocation to be emphasized from among...
|
|
|
7454341 |
Method, apparatus, and system for building a compact model for large vocabulary continuous speech recognition (LVCSR) system
According to one aspect of the invention, a method is provided in which a mean vector set and a variance vector set of a set of N Gaussians are divided into multiple mean sub-vector sets and...
|
|
|
7451087 |
System and method for converting text-to-voice
A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules is provided. The method includes receiving and expanding text data to form a...
|
|
|
7415118 |
System and method for distributed gain control
In accordance with an embodiment, the invention provides a spectral enhancement system that includes a plurality of distributed filters, a plurality of energy distribution units, and a...
|
|
|
7406417 |
Method for conditioning a database for automatic speech processing
A neural network can be trained for synthesizing or recognizing speech with the aid of a database produced by automatically matching graphemes and phonemes. First, graphemes and phonemes are...
|
|
|
7400651 |
Device and method for interpolating frequency components of signal
A frequency interpolation apparatus is provided which reproduces a signal similar to an original signal by approximately recovering suppressed frequency components, from an input signal having the...
|
|
|
7365260 |
Apparatus and method for reproducing voice in synchronism with music piece
Music piece sequence data are composed of a plurality of event data which include performance event data and user event data designed for linking a voice to progression of a music piece. A...
|
|
|
7346507 |
Method and apparatus for training an automated speech recognition-based system
A method and apparatus for building a training set for an automated speech recognition-based system, which determines the statistically optimal number of frequently requested responses to automate...
|
|
|
7328159 |
Interactive speech recognition apparatus and method with conditioned voice prompts
An improved system for an interactive voice recognition system (400) includes a voice prompt generator (401) for generating voice prompt in a first frequency band (501). A speech detector (...
|
|
|
7328157 |
Domain adaptation for TTS systems
Embodiments of the present invention pertain to adaptation of a corpus-driven general-purpose TTS system to at least one specific domain. The domain adaptation is realized by adding a limited...
|
|
|
7308408 |
Providing services for an information processing system using an audio interface
A method and system for providing efficient menu services for an information processing system that uses a telephone or other form of audio user interface. In one embodiment, the menu services...
|
|
|
7308407 |
Method and system for generating natural sounding concatenative synthetic speech
A method for generating synthetic speech can include identifying a recording of conversational speech and creating a transcription of the conversational speech. Using the transcription, rather than...
|
|
|
7280968 |
Synthetically generated speech responses including prosodic characteristics of speech inputs
A method for digitally generating speech with improved prosodic characteristics can include receiving a speech input, determining at least one prosodic characteristic contained within the speech...
|
|
|
7277856 |
System and method for speech synthesis using a smoothing filter
A speech synthesis system for controlling a discontinuous distortion that occurs at the transition portion between concatenated phonemes which are speech units of a synthesized speech using a...
|
|
|
7266497 |
Automatic segmentation in speech synthesis
Systems and methods for automatically segmenting speech inventories. A set of Hidden Markov Models (HMMs) are initialized using bootstrap data. The HMMs are next re-estimated and aligned to produce...
|
|
|
7171362 |
Assignment of phonemes to the graphemes producing them
The assignment of phonemes to graphemes producing them in a lexicon having words (grapheme sequences) and their associated phonetic transcription (phoneme sequences) for the preparation of patterns...
|
|
|
7139712 |
Speech synthesis apparatus, control method therefor and computer-readable memory
A second phoneme is generated in consideration of a phonemic context with respect to a first phoneme as a search target. Phonemic piece data corresponding to the second phoneme is searched out from...
|
|
|
7124083 |
Method and system for preselection of suitable units for concatenative speech
A system and method for improving the response time of text-to-speech synthesis utilizes “triphone contexts” (i.e., triplets comprising a central phoneme and its immediate context) as the basic...
|
|
|
7120584 |
Method and system for real time audio synthesis
A method and system for synthesizing audio speech is provided. A synthesis engine receives from a host, compressed and normalized speech units and prosodic information. The synthesis engine...
|
|
|
7082396 |
Methods and apparatus for rapid acoustic unit selection from a large speech corpus
A speech synthesis system can select recorded speech fragments, or acoustic units, from a very large database of acoustic units to produce artificial speech. The selected acoustic units are chosen...
|
|
|
7076426 |
Advance TTS for facial animation
An enhanced system is achieved by allowing bookmarks which can specify that the stream of bits that follow corresponds to phonemes and a plurality of prosody information, including duration...
|
|
|
7069217 |
Waveform synthesis
A synthesizer is disclosed in which a speech waveform is synthesized by selecting a synthetic starting waveform segment and then generating a sequence of further segments. The further waveform...
|
|
|
7062440 |
Monitoring text to speech output to effect control of barge-in
A speech system has a speech input channel including a speech recognizer, and a speech output channel including a text-to-speech converter. Associated with the input channel is a barge-in control...
|
|
|
7031919 |
Speech synthesizing apparatus and method, and storage medium therefor
A speech synthesizing apparatus for synthesizing a speech waveform stores speech data, which is obtained by adding attribute information onto phoneme data, in a database. In accordance with...
|
|
|
7016841 |
Singing voice synthesizing apparatus, singing voice synthesizing method, and program for realizing singing voice synthesizing method
A singing voice synthesizing apparatus is provided, which enables achievement of a natural sounding synthesized singing voice with a good level of comprehensibility. A phoneme database stores a...
|
|
|
7003461 |
Method and apparatus for an adaptive codebook search in a speech processing system
An adaptive codebook search (ACS) algorithm is based on a set of matrix operations suitable for data processing engines supporting a single instruction multiple data (SIMD) architecture. The result...
|
|
|
6970820 |
Voice personalization of speech synthesizer
The speech synthesizer is personalized to sound like or mimic the speech characteristics of an individual speaker. The individual speaker provides a quantity of enrollment data, which can be...
|
|
|
6959277 |
Voice feature extraction device
In a conventional device for extracting voice features accurately without being influenced by noises, such as a voice recognition device, usually an input voice signal is processed first by a noise...
|
|
|
6876968 |
Run time synthesizer adaptation to improve intelligibility of synthesized speech
A method and system provide for run-time modification of synthesized speech. The method includes the step of generating synthesized speech based on textual input and a plurality of run-time control...
|
|
|
6847932 |
Speech synthesis device handling phoneme units of extended CV
Given phonetic information is divided into speech units of extended CV which is a contiguous sequence of phonemes without clear distinction containing a vowel or some vowels. Contour of vocal tract...
|