Matches 1 - 50 out of 200 1 2 3 4 >
Match Document Document Title
7603280 Speech output apparatus, speech output method, and program  
A speech output apparatus is disclosed, which can allow the user to easily catch synthetic speech when the synthetic speech is output upon being superposed on a music output. The apparatus output...
7603278 Segment set creating method and apparatus  
A segment set before updating is read, and clustering considering a phoneme environment is performed to it. For each cluster obtained by the clustering, a representative segment of a segment set...
7596497 Speech synthesis apparatus and speech synthesis method  
A speech synthesis apparatus and a speech synthesis method, in which a waveform of a desired formant shape may be generated with a small volume of computing operations. A voiced sound generating...
7571104 Dynamic real-time cross-fading of voice prompts  
A system and method are provided for creating shorter more natural sounding voice messages and prompts from a plurality of pre-recorded sound segments, the prerecorded sound segments are...
7552052 Voice synthesis apparatus and method  
A plurality of voice segments, each including one or more phonemes are acquired in a time-serial manner, in correspondence with desired singing or speaking words. As necessary, a boundary is...
7529672 Speech synthesis using concatenation of speech waveforms  
A method of synthesizing a speech signal by providing a first speech unit signal having an end interval and a second speech unit signal having a front interval, wherein at least some of the periods...
7526430 Speech synthesis apparatus  
A speech synthesis apparatus, which can embed unchangeable additional information into synthesized speech without causing a deterioration of speech quality and restriction by bands, includes a...
7523038 Voice controlled system and method  
A voice controlled system includes a microphone for receiving voice commands and for converting each voice command to an electrical output; a filter system connected to receive the electrical...
7472066 Automatic speech segmentation and verification using segment confidence measures  
An automatic speech segmentation and verification system and method is disclosed, which has a known text script and a recorded speech corpus corresponding to the known text script. A speech unit...
7451087 System and method for converting text-to-voice  
A method for converting text to concatenated voice by utilizing a digital voice library and a set of playback rules is provided. The method includes receiving and expanding text data to form a...
7418389 Defining atom units between phone and syllable for TTS systems  
A method for identifying common multiphone units to add to a unit inventory for a text-to-speech generator is disclosed. The common multiphone units are units that are larger than a phone, but...
7412390 Method and apparatus for speech synthesis, program, recording medium, method and apparatus for generating constraint information and robot apparatus  
The emotion is to be added to the synthesized speech as the prosodic feature of the language is maintained. In a speech synthesis device 200 , a language processor 201 generates a string of...
7409347 Data-driven global boundary optimization  
Portions from segment boundary regions of a plurality of speech segments are extracted. Each segment boundary region is based on a corresponding initial unit boundary. Feature vectors that...
7373299 Variable voice rate apparatus and variable voice rate method  
A variable voice rate apparatus to control a reproduction rate of voice, includes a voice data generation unit configured to generate voice data from the voice, a text data generation unit...
7299182 Text-to-speech (TTS) for hand-held devices  
There is provided an Ebook. The Ebook includes a memory device, a text-to-speech (TTS) module, and at least one speaker. The memory device stores files. The files include text. The TTS module...
7275035 System and method for speech generation from brain activity  
In a method of assisting a subject to generate speech, at least one first neural impulse is sensed from a first preselected location in the subject's brain. A first preselected sound is associated...
7249022 Singing voice-synthesizing method and apparatus and storage medium  
There are provided a singing voice-synthesizing method and apparatus capable of performing synthesis of natural singing voices close to human singing voices based on performance data being input in...
7240005 Method of controlling high-speed reading in a text-to-speech conversion system  
A method of high-speed reading in a text-to-speech conversion system including a text analysis module ( 101 ) for generating a phoneme and prosody character string from an input text; a prosody...
7219065 Emphasis of short-duration transient speech features  
A sound processor including a microphone ( 1 ), a pre-amplifier ( 2 ), a bank of N parallel filters ( 3 ), means for detecting short-duration transitions in the envelope signal of each filter...
7219061 Method for detecting the time sequences of a fundamental frequency of an audio response unit to be synthesized  
Predetermined macrosegments of the fundamental frequency are determined by a neural network, and these predefined macrosegments are reproduced by fundamental-frequency sequences stored in a...
7171362 Assignment of phonemes to the graphemes producing them  
The assignment of phonemes to graphemes producing them in a lexicon having words (grapheme sequences) and their associated phonetic transcription (phoneme sequences) for the preparation of patterns...
7124084 Singing voice-synthesizing method and apparatus and storage medium  
There are provided a singing voice-synthesizing method and apparatus capable of performing synthesis of natural singing voices close to human singing voices based on performance data being input in...
RE39336 Formant-based speech synthesizer employing demi-syllable concatenation with independent cross fade in the filter parameter and source domains  
The concatenative speech synthesizer employs demi-syllable subword units to generate speech. The synthesizer is based on a source-filter model that uses source signals that correspond closely to...
7117156 Method and apparatus for performing packet loss or frame erasure concealment  
The invention concerns a method and apparatus for performing packet loss or Frame Erasure Concealment (FEC) for a speech coder that does not have a built-in or standard FEC process. A receiver with...
7113909 Voice synthesizing method and voice synthesizer performing the same  
A stereotypical sentence is synthesized into a voice of an arbitrary speech style. A third party is able to prepare prosody data and a user of a terminal device having a voice synthesizing part can...
7089187 Voice synthesizing system, segment generation apparatus for generating segments for voice synthesis, voice synthesizing method and storage medium storing program therefor  
A voice synthesizing system can make necessary calculation amount satisfactorily small and can make necessary file size small. The system includes a compressed pitch segment database storing...
7065485 Enhancing speech intelligibility using variable-rate time-scale modification  
The method and preprocessor enhances the intelligibility of narrowband speech without essentially lengthening the overall time duration of the signal. Both spectral enhancements and variable-rate...
7054815 Speech synthesizing method and apparatus using prosody control  
A speech synthesizing apparatus extracts small speech segments from a speech waveform as a prosody control target and adds inhibition information for inhibiting a predetermined prosody change...
7031919 Speech synthesizing apparatus and method, and storage medium therefor  
A speech synthesizing apparatus for synthesizing a speech waveform stores speech data, which is obtained by adding attribute information onto phoneme data, in a database. In accordance with...
7010491 Method and system for waveform compression and expansion with time axis  
With the goal of presenting a waveform compression and expansion apparatus with which the sound quality of such things as musical tones that are expressed by waveforms is satisfactory following the...
6999922 Synchronization and overlap method and system for single buffer speech compression and expansion  
The present invention ( 110 ) permits a user to speed up and slow down speech without changing the speakers pitch ( 102, 110, 112, 128, 402–416 ). It is a user adjustable feature to change the...
6961704 Linguistic prosodic model-based text to speech  
An arrangement is provided for text to speech processing based on linguistic prosodic models. Linguistic prosodic models are established to characterize different linguistic prosodic...
6950798 Employing speech models in concatenative speech synthesis  
A text-to-speech synthesizer employs database that includes units. For each unit there is a collection of unit selection parameters and a plurality of frames. Each frame has a set of model...
6879957 Method for producing a speech rendition of text from diphone sounds  
A text-to-speech system utilizes a method for producing a speech rendition of text based on dividing some or all words of a sentence into component diphones. A phonetic dictionary is aligned so...
6873955 Method and apparatus for recording/reproducing or producing a waveform using time position information  
Partial waveform data representative of a waveform shape variation are extracted from supplied waveform data, and the extracted partial waveform data are stored along with time position information...
6847932 Speech synthesis device handling phoneme units of extended CV  
Given phonetic information is divided into speech units of extended CV which is a contiguous sequence of phonemes without clear distinction containing a vowel or some vowels. Contour of vocal tract...
6823309 Speech synthesizing system and method for modifying prosody based on match to database  
A speech synthesis system for storing in advance a degree of modification of prosodic data in a prosodic data modifying rule apparatus, the degree of modification corresponding to an approximate...
6813604 Methods and apparatus for speaker specific durational adaptation  
A text to speech system modeling durational characteristics of a target speaker is addressed herein. A body of target speaker training text is selected having maximum possible information about...
6785652 Method and apparatus for improved duration modeling of phonemes  
A method and an apparatus for improved duration modeling of phonemes in a speech synthesis system are provided. According to one aspect, text is received into a processor of a speech synthesis...
6658382 Audio signal coding and decoding methods and apparatus and recording media with programs therefor  
An input signal is time-frequency transformed, then the frequency-domain coefficients are divided into coefficient segments of about 100 Hz width to generate a sequence of coefficient segments, and...
6647280 Method and apparatus for processing a physiological signal  
A signal processing method, preferably for extracting a fundamental period from a noisy, low-frequency signal, is disclosed. The signal processing method generally comprises calculating a numerical...
6629067 Range control system  
A range control system includes an input section for inputting a singing voice, a fundamental frequency extracting section for extracting a fundamental frequency of the inputted voice, and a pitch...
6553344 Method and apparatus for improved duration modeling of phonemes  
A method and an apparatus for improved duration modeling of phonemes in a speech synthesis system are provided. According to one aspect, text is received into a processor of a speech synthesis...
6546367 Synthesizing phoneme string of predetermined duration by adjusting initial phoneme duration on values from multiple regression by adding values based on their standard deviations  
Statistical data including an average value, a standard deviation, and a minimum value of a phoneme duration of each phoneme is stored in a memory. When speech production time is determined for a...
6542867 Speech duration processing method and apparatus for Chinese text-to-speech system  
The duration of speech varies according to the characteristics of pronounced speech and pronouncing habit of the speaker. In the speech duration processing method and apparatus of this invention, a...
6499014 Speech synthesis apparatus  
The speech synthesis apparatus of the present invention includes: a text analyzer operable to generate a phonetic and prosodic symbol string from character information of an input text; a word...
6496801 Speech synthesis employing concatenated prosodic and acoustic templates for phrases of multiple words  
A speech synthesis system for generating voice dialog for a message frame having a fixed and a variable portion. A prosody module selects a prosodic template for each of the fixed and variable...
6490553 Apparatus and method for controlling rate of playback of audio data  
The disclosed method and apparatus controls the rate of playback of audio data corresponding to a stream of speech. Using speech recognition, the rate of speech of the audio data is determined. The...
6484137 Audio reproducing apparatus  
An audio reproducing apparatus comprises: audio decoding means for decoding an input audio signal frame by frame; data expanding/compressing means for subjecting data in a decoded frame to...
6470316 Speech synthesis apparatus having prosody generator with user-set speech-rate- or adjusted phoneme-duration-dependent selective vowel devoicing  
The speech synthesis apparatus according to the present invention includes a text analyzer operable to generate a phonetic and prosodic symbol string from text information of an input text; a word...
Matches 1 - 50 out of 200 1 2 3 4 >