|
Match
|
Document |
Document Title |
|
|
7603280 |
Speech output apparatus, speech output method, and program
A speech output apparatus is disclosed, which can allow the user to easily catch synthetic speech when the synthetic speech is output upon being superposed on a music output. The apparatus output...
|
|
|
7603278 |
Segment set creating method and apparatus
A segment set before updating is read, and clustering considering a phoneme environment is performed to it. For each cluster obtained by the clustering, a representative segment of a segment set...
|
|
|
7596497 |
Speech synthesis apparatus and speech synthesis method
A speech synthesis apparatus and a speech synthesis method, in which a waveform of a desired formant shape may be generated with a small volume of computing operations. A voiced sound generating...
|
|
|
7571104 |
Dynamic real-time cross-fading of voice prompts
A system and method are provided for creating shorter more natural sounding voice messages and prompts from a plurality of pre-recorded sound segments, the prerecorded sound segments are...
|
|
|
7552052 |
Voice synthesis apparatus and method
A plurality of voice segments, each including one or more phonemes are acquired in a time-serial manner, in correspondence with desired singing or speaking words. As necessary, a boundary is...
|
|
|
7529672 |
Speech synthesis using concatenation of speech waveforms
A method of synthesizing a speech signal by providing a first speech unit signal having an end interval and a second speech unit signal having a front interval, wherein at least some of the periods...
|
|
|
7526430 |
Speech synthesis apparatus
A speech synthesis apparatus, which can embed unchangeable additional information into synthesized speech without causing a deterioration of speech quality and restriction by bands, includes a...
|
|
|
7523038 |
Voice controlled system and method
A voice controlled system includes a microphone for receiving voice commands and for converting each voice command to an electrical output; a filter system connected to receive the electrical...
|
|
|
7472066 |
Automatic speech segmentation and verification using segment confidence measures
An automatic speech segmentation and verification system and method is disclosed, which has a known text script and a recorded speech corpus corresponding to the known text script. A speech unit...
|
|
|
7451087 |
System and method for converting text-to-voice
A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules is provided. The method includes receiving and expanding text data to form a...
|
|
|
7418389 |
Defining atom units between phone and syllable for TTS systems
A method for identifying common multiphone units to add to a unit inventory for a text-to-speech generator is disclosed. The common multiphone units are units that are larger than a phone, but...
|
|
|
7412390 |
Method and apparatus for speech synthesis, program, recording medium, method and apparatus for generating constraint information and robot apparatus
The emotion is to be added to the synthesized speech as the prosodic feature of the language is maintained. In a speech synthesis device 200 , a language processor 201 generates a string of...
|
|
|
7409347 |
Data-driven global boundary optimization
Portions from segment boundary regions of a plurality of speech segments are extracted. Each segment boundary region is based on a corresponding initial unit boundary. Feature vectors that...
|
|
|
7373299 |
Variable voice rate apparatus and variable voice rate method
A variable voice rate apparatus to control a reproduction rate of voice, includes a voice data generation unit configured to generate voice data from the voice, a text data generation unit...
|
|
|
7299182 |
Text-to-speech (TTS) for hand-held devices
There is provided an Ebook. The Ebook includes a memory device, a text-to-speech (TTS) module, and at least one speaker. The memory device stores files. The files include text. The TTS module...
|
|
|
7275035 |
System and method for speech generation from brain activity
In a method of assisting a subject to generate speech, at least one first neural impulse is sensed from a first preselected location in the subject's brain. A first preselected sound is associated...
|
|
|
7249022 |
Singing voice-synthesizing method and apparatus and storage medium
There are provided a singing voice-synthesizing method and apparatus capable of performing synthesis of natural singing voices close to human singing voices based on performance data being input in...
|
|
|
7240005 |
Method of controlling high-speed reading in a text-to-speech conversion system
A method of high-speed reading in a text-to-speech conversion system including a text analysis module ( 101 ) for generating a phoneme and prosody character string from an input text; a prosody...
|
|
|
7219065 |
Emphasis of short-duration transient speech features
A sound processor including a microphone ( 1 ), a pre-amplifier ( 2 ), a bank of N parallel filters ( 3 ), means for detecting short-duration transitions in the envelope signal of each filter...
|
|
|
7219061 |
Method for detecting the time sequences of a fundamental frequency of an audio response unit to be synthesized
Predetermined macrosegments of the fundamental frequency are determined by a neural network, and these predefined macrosegments are reproduced by fundamental-frequency sequences stored in a...
|
|
|
7171362 |
Assignment of phonemes to the graphemes producing them
The assignment of phonemes to graphemes producing them in a lexicon having words (grapheme sequences) and their associated phonetic transcription (phoneme sequences) for the preparation of patterns...
|
|
|
7124084 |
Singing voice-synthesizing method and apparatus and storage medium
There are provided a singing voice-synthesizing method and apparatus capable of performing synthesis of natural singing voices close to human singing voices based on performance data being input in...
|
|
|
RE39336 |
Formant-based speech synthesizer employing demi-syllable concatenation with independent cross fade in the filter parameter and source domains
The concatenative speech synthesizer employs demi-syllable subword units to generate speech. The synthesizer is based on a source-filter model that uses source signals that correspond closely to...
|
|
|
7117156 |
Method and apparatus for performing packet loss or frame erasure concealment
The invention concerns a method and apparatus for performing packet loss or Frame Erasure Concealment (FEC) for a speech coder that does not have a built-in or standard FEC process. A receiver with...
|
|
|
7113909 |
Voice synthesizing method and voice synthesizer performing the same
A stereotypical sentence is synthesized into a voice of an arbitrary speech style. A third party is able to prepare prosody data and a user of a terminal device having a voice synthesizing part can...
|
|
|
7089187 |
Voice synthesizing system, segment generation apparatus for generating segments for voice synthesis, voice synthesizing method and storage medium storing program therefor
A voice synthesizing system can make necessary calculation amount satisfactorily small and can make necessary file size small. The system includes a compressed pitch segment database storing...
|
|
|
7065485 |
Enhancing speech intelligibility using variable-rate time-scale modification
The method and preprocessor enhances the intelligibility of narrowband speech without essentially lengthening the overall time duration of the signal. Both spectral enhancements and variable-rate...
|
|
|
7054815 |
Speech synthesizing method and apparatus using prosody control
A speech synthesizing apparatus extracts small speech segments from a speech waveform as a prosody control target and adds inhibition information for inhibiting a predetermined prosody change...
|
|
|
7031919 |
Speech synthesizing apparatus and method, and storage medium therefor
A speech synthesizing apparatus for synthesizing a speech waveform stores speech data, which is obtained by adding attribute information onto phoneme data, in a database. In accordance with...
|
|
|
7010491 |
Method and system for waveform compression and expansion with time axis
With the goal of presenting a waveform compression and expansion apparatus with which the sound quality of such things as musical tones that are expressed by waveforms is satisfactory following the...
|
|
|
6999922 |
Synchronization and overlap method and system for single buffer speech compression and expansion
The present invention ( 110 ) permits a user to speed up and slow down speech without changing the speakers pitch ( 102, 110, 112, 128, 402–416 ). It is a user adjustable feature to change the...
|
|
|
6961704 |
Linguistic prosodic model-based text to speech
An arrangement is provided for text to speech processing based on linguistic prosodic models. Linguistic prosodic models are established to characterize different linguistic prosodic...
|
|
|
6950798 |
Employing speech models in concatenative speech synthesis
A text-to-speech synthesizer employs database that includes units. For each unit there is a collection of unit selection parameters and a plurality of frames. Each frame has a set of model...
|
|
|
6879957 |
Method for producing a speech rendition of text from diphone sounds
A text-to-speech system utilizes a method for producing a speech rendition of text based on dividing some or all words of a sentence into component diphones. A phonetic dictionary is aligned so...
|
|
|
6873955 |
Method and apparatus for recording/reproducing or producing a waveform using time position information
Partial waveform data representative of a waveform shape variation are extracted from supplied waveform data, and the extracted partial waveform data are stored along with time position information...
|
|
|
6847932 |
Speech synthesis device handling phoneme units of extended CV
Given phonetic information is divided into speech units of extended CV which is a contiguous sequence of phonemes without clear distinction containing a vowel or some vowels. Contour of vocal tract...
|
|
|
6823309 |
Speech synthesizing system and method for modifying prosody based on match to database
A speech synthesis system for storing in advance a degree of modification of prosodic data in a prosodic data modifying rule apparatus, the degree of modification corresponding to an approximate...
|
|
|
6813604 |
Methods and apparatus for speaker specific durational adaptation
A text to speech system modeling durational characteristics of a target speaker is addressed herein. A body of target speaker training text is selected having maximum possible information about...
|
|
|
6785652 |
Method and apparatus for improved duration modeling of phonemes
A method and an apparatus for improved duration modeling of phonemes in a speech synthesis system are provided. According to one aspect, text is received into a processor of a speech synthesis...
|
|
|
6658382 |
Audio signal coding and decoding methods and apparatus and recording media with programs therefor
An input signal is time-frequency transformed, then the frequency-domain coefficients are divided into coefficient segments of about 100 Hz width to generate a sequence of coefficient segments, and...
|
|
|
6647280 |
Method and apparatus for processing a physiological signal
A signal processing method, preferably for extracting a fundamental period from a noisy, low-frequency signal, is disclosed. The signal processing method generally comprises calculating a numerical...
|
|
|
6629067 |
Range control system
A range control system includes an input section for inputting a singing voice, a fundamental frequency extracting section for extracting a fundamental frequency of the inputted voice, and a pitch...
|
|
|
6553344 |
Method and apparatus for improved duration modeling of phonemes
A method and an apparatus for improved duration modeling of phonemes in a speech synthesis system are provided. According to one aspect, text is received into a processor of a speech synthesis...
|
|
|
6546367 |
Synthesizing phoneme string of predetermined duration by adjusting initial phoneme duration on values from multiple regression by adding values based on their standard deviations
Statistical data including an average value, a standard deviation, and a minimum value of a phoneme duration of each phoneme is stored in a memory. When speech production time is determined for a...
|
|
|
6542867 |
Speech duration processing method and apparatus for Chinese text-to-speech system
The duration of speech varies according to the characteristics of pronounced speech and pronouncing habit of the speaker. In the speech duration processing method and apparatus of this invention, a...
|
|
|
6499014 |
Speech synthesis apparatus
The speech synthesis apparatus of the present invention includes: a text analyzer operable to generate a phonetic and prosodic symbol string from character information of an input text; a word...
|
|
|
6496801 |
Speech synthesis employing concatenated prosodic and acoustic templates for phrases of multiple words
A speech synthesis system for generating voice dialog for a message frame having a fixed and a variable portion. A prosody module selects a prosodic template for each of the fixed and variable...
|
|
|
6490553 |
Apparatus and method for controlling rate of playback of audio data
The disclosed method and apparatus controls the rate of playback of audio data corresponding to a stream of speech. Using speech recognition, the rate of speech of the audio data is determined. The...
|
|
|
6484137 |
Audio reproducing apparatus
An audio reproducing apparatus comprises: audio decoding means for decoding an input audio signal frame by frame; data expanding/compressing means for subjecting data in a decoded frame to...
|
|
|
6470316 |
Speech synthesis apparatus having prosody generator with user-set speech-rate- or adjusted phoneme-duration-dependent selective vowel devoicing
The speech synthesis apparatus according to the present invention includes a text analyzer operable to generate a phonetic and prosodic symbol string from text information of an input text; a word...
|