Matches 1 - 50 out of 146 1 2 3 >
Match Document Document Title
7613612 Voice synthesizer of multi sounds  
In a voice synthesizer, an envelope acquisition portion obtains a spectral envelope of a reference frequency spectrum of a given voice. A spectrum acquisition portion obtains a collective frequency...
7606710 Method for text-to-pronunciation conversion  
A method for text-to-pronunciation conversion includes a process for searching grapheme-phoneme segments and a three-stage process of text-to-pronunciation conversion. This method looks for a...
7599838 Speech animation with behavioral contexts for application scenarios  
Methods and systems, including computer program products, for speech animation. The system includes a speech animation server and one or more speech animation clients. The speech animation server...
7590540 Method and system for statistic-based distance definition in text-to-speech conversion  
A method for distance definition in a text-to-speech conversion system by applying Gaussian Mixture Model (GMM) to a distance definition. According to an embodiment, the text that is to be...
7587320 Automatic segmentation in speech synthesis  
Systems and methods for automatically segmenting speech inventories. A set of Hidden Markov Models (HMMs) are initialized using bootstrap data. The HMMs are next re-estimated and aligned to produce...
7584104 Method and system for training a text-to-speech synthesis system using a domain-specific speech database  
A system, method and computer readable medium that trains a text-to-speech synthesis system for use in speech synthesis is disclosed. The method may include recording audio files of one or more...
7574360 Unit selection module and method of chinese text-to-speech synthesis  
A unit selection module for Chinese Text-to-Speech (TTS) synthesis includes a probabilistic context free grammar (PCFG) parser, a latent semantic indexing (LSI) module, and a modified...
7565293 Seamless hybrid computer human call service  
A Voice User Interface is provided for interactively responding in a synthesized voice to a call from a human caller, a Text to Speech system by which text entered by an agent and interactive data...
7555433 Voice generator, method for generating voice, and navigation apparatus  
A main controller feeds a spelling translator with a text item representing a place name stored in a map database. The spelling translator translates the spelling of the text item according to...
7546241 Speech synthesis method and apparatus, and dictionary generation method and apparatus  
In a speech synthesis process, micro-segments are cut from acquired waveform data and a window function. The obtained micro-segments are re-arranged to implement a desired prosody, and superposed...
7519535 Frame erasure concealment in voice communications  
A voice decoder configured to receive a sequence of frames, each of the frames having voice parameters. The voice decoder includes a speech generator that generates speech from the voice...
7502739 Intonation generation method, speech synthesis apparatus using the method and voice server  
In generation of an intonation pattern of a speech synthesis, a speech synthesis system is capable of providing a highly natural speech and capable of reproducing speech characteristics of a...
7487093 Text structure for voice synthesis, voice synthesis method, voice synthesis apparatus, and computer program thereof  
In a voice synthesis apparatus, by bounding a desired range of input text to be output by, e.g., a start tag “<morphing type=“emotion” start=“happy” end=“angry”>” and end tag...
7483832 Method and system for customizing voice translation of text to speech  
A method and system of customizing voice translation of a text to speech includes digitally recording speech samples of a known speaker, correlating each of the speech samples with a standardized...
7472066 Automatic speech segmentation and verification using segment confidence measures  
An automatic speech segmentation and verification system and method is disclosed, which has a known text script and a recorded speech corpus corresponding to the known text script. A speech unit...
7472065 Generating paralinguistic phenomena via markup in text-to-speech synthesis  
Converting marked-up text into a synthesized stream includes providing marked-up text to a processor-based system, converting the marked-up text into a text stream including vocabulary items,...
7472061 Systems and methods for building a native language phoneme lexicon having native pronunciations of non-native words derived from non-native pronunciations  
Systems and methods are provided for automatically building a native phonetic lexicon for a speech-based application trained to process a native (base) language, wherein the native phonetic lexicon...
7464034 Voice converter for assimilation by frame synthesis with temporal alignment  
A voice converting apparatus is constructed for converting an input voice into an output voice according to a target voice. The apparatus includes a storage section, an analyzing section including...
7460997 Method and system for preselection of suitable units for concatenative speech  
A system and method for improving the response time of text-to-speech synthesis utilizes “triphone contexts” (i.e., triplets comprising a central phoneme and its immediate context) as the basic...
7454348 System and method for blending synthetic voices  
A system and method for generating a synthetic text-to-speech TTS voice are disclosed. A user is presented with at least one TTS voice and at least one voice characteristic. A new synthetic TTS...
7454345 Word or collocation emphasizing voice synthesizer  
A voice synthesizer, which obtains a voice by emphasizing a specific part of a sentence, includes an emphasis degree deciding unit that extracts a word or a collocation to be emphasized from among...
7454341 Method, apparatus, and system for building a compact model for large vocabulary continuous speech recognition (LVCSR) system  
According to one aspect of the invention, a method is provided in which a mean vector set and a variance vector set of a set of N Gaussians are divided into multiple mean sub-vector sets and...
7451087 System and method for converting text-to-voice  
A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules is provided. The method includes receiving and expanding text data to form a...
7415118 System and method for distributed gain control  
In accordance with an embodiment, the invention provides a spectral enhancement system that includes a plurality of distributed filters, a plurality of energy distribution units, and a...
7406417 Method for conditioning a database for automatic speech processing  
A neural network can be trained for synthesizing or recognizing speech with the aid of a database produced by automatically matching graphemes and phonemes. First, graphemes and phonemes are...
7400651 Device and method for interpolating frequency components of signal  
A frequency interpolation apparatus is provided which reproduces a signal similar to an original signal by approximately recovering suppressed frequency components, from an input signal having the...
7365260 Apparatus and method for reproducing voice in synchronism with music piece  
Music piece sequence data are composed of a plurality of event data which include performance event data and user event data designed for linking a voice to progression of a music piece. A...
7346507 Method and apparatus for training an automated speech recognition-based system  
A method and apparatus for building a training set for an automated speech recognition-based system, which determines the statistically optimal number of frequently requested responses to automate...
7328159 Interactive speech recognition apparatus and method with conditioned voice prompts  
An improved system for an interactive voice recognition system ( 400 ) includes a voice prompt generator ( 401 ) for generating voice prompt in a first frequency band ( 501 ). A speech detector (...
7328157 Domain adaptation for TTS systems  
Embodiments of the present invention pertain to adaptation of a corpus-driven general-purpose TTS system to at least one specific domain. The domain adaptation is realized by adding a limited...
7308408 Providing services for an information processing system using an audio interface  
A method and system for providing efficient menu services for an information processing system that uses a telephone or other form of audio user interface. In one embodiment, the menu services...
7308407 Method and system for generating natural sounding concatenative synthetic speech  
A method for generating synthetic speech can include identifying a recording of conversational speech and creating a transcription of the conversational speech. Using the transcription, rather than...
7280968 Synthetically generated speech responses including prosodic characteristics of speech inputs  
A method for digitally generating speech with improved prosodic characteristics can include receiving a speech input, determining at least one prosodic characteristic contained within the speech...
7277856 System and method for speech synthesis using a smoothing filter  
A speech synthesis system for controlling a discontinuous distortion that occurs at the transition portion between concatenated phonemes which are speech units of a synthesized speech using a...
7266497 Automatic segmentation in speech synthesis  
Systems and methods for automatically segmenting speech inventories. A set of Hidden Markov Models (HMMs) are initialized using bootstrap data. The HMMs are next re-estimated and aligned to produce...
7171362 Assignment of phonemes to the graphemes producing them  
The assignment of phonemes to graphemes producing them in a lexicon having words (grapheme sequences) and their associated phonetic transcription (phoneme sequences) for the preparation of patterns...
7139712 Speech synthesis apparatus, control method therefor and computer-readable memory  
A second phoneme is generated in consideration of a phonemic context with respect to a first phoneme as a search target. Phonemic piece data corresponding to the second phoneme is searched out from...
7124083 Method and system for preselection of suitable units for concatenative speech  
A system and method for improving the response time of text-to-speech synthesis utilizes “triphone contexts” (i.e., triplets comprising a central phoneme and its immediate context) as the basic...
7120584 Method and system for real time audio synthesis  
A method and system for synthesizing audio speech is provided. A synthesis engine receives from a host, compressed and normalized speech units and prosodic information. The synthesis engine...
7082396 Methods and apparatus for rapid acoustic unit selection from a large speech corpus  
A speech synthesis system can select recorded speech fragments, or acoustic units, from a very large database of acoustic units to produce artificial speech. The selected acoustic units are chosen...
7076426 Advance TTS for facial animation  
An enhanced system is achieved by allowing bookmarks which can specify that the stream of bits that follow corresponds to phonemes and a plurality of prosody information, including duration...
7069217 Waveform synthesis  
A synthesizer is disclosed in which a speech waveform is synthesized by selecting a synthetic starting waveform segment and then generating a sequence of further segments. The further waveform...
7062440 Monitoring text to speech output to effect control of barge-in  
A speech system has a speech input channel including a speech recognizer, and a speech output channel including a text-to-speech converter. Associated with the input channel is a barge-in control...
7031919 Speech synthesizing apparatus and method, and storage medium therefor  
A speech synthesizing apparatus for synthesizing a speech waveform stores speech data, which is obtained by adding attribute information onto phoneme data, in a database. In accordance with...
7016841 Singing voice synthesizing apparatus, singing voice synthesizing method, and program for realizing singing voice synthesizing method  
A singing voice synthesizing apparatus is provided, which enables achievement of a natural sounding synthesized singing voice with a good level of comprehensibility. A phoneme database stores a...
7003461 Method and apparatus for an adaptive codebook search in a speech processing system  
An adaptive codebook search (ACS) algorithm is based on a set of matrix operations suitable for data processing engines supporting a single instruction multiple data (SIMD) architecture. The result...
6970820 Voice personalization of speech synthesizer  
The speech synthesizer is personalized to sound like or mimic the speech characteristics of an individual speaker. The individual speaker provides a quantity of enrollment data, which can be...
6959277 Voice feature extraction device  
In a conventional device for extracting voice features accurately without being influenced by noises, such as a voice recognition device, usually an input voice signal is processed first by a noise...
6876968 Run time synthesizer adaptation to improve intelligibility of synthesized speech  
A method and system provide for run-time modification of synthesized speech. The method includes the step of generating synthesized speech based on textual input and a plurality of run-time control...
6847932 Speech synthesis device handling phoneme units of extended CV  
Given phonetic information is divided into speech units of extended CV which is a contiguous sequence of phonemes without clear distinction containing a vowel or some vowels. Contour of vocal tract...
Matches 1 - 50 out of 146 1 2 3 >