Match Document Document Title
7010489 Method for guiding text-to-speech output timing using speech recognition markers  
A method for guiding text-to-speech output timing with speech recognition markers can include the following steps. First, tokens can be retrieved in a TTS system. The tokens can include words,...
7010488 System and method for compressing concatenative acoustic inventories for speech synthesis  
A system and method is used to compress concatenative acoustic inventories for speech. Instead of using general purpose signal compression methods such as vector quantization, the method of the...
6999922 Synchronization and overlap method and system for single buffer speech compression and expansion  
The present invention (110) permits a user to speed up and slow down speech without changing the speakers pitch (102, 110, 112, 128, 402–416). It is a user adjustable feature to change the spoken...
6996529 Speech synthesis with prosodic phrase boundary information  
Text-to-speech conversion uses pattern-matching to predict the position of phrase boundaries in spoken output. Text input to the is analyzed to identify groups of words (known as “chunks”) which...
6988069 Reduced unit database generation based on cost information  
An arrangement is provided for generating a reduced unit database of a desired size to be used in text to speech operations. A reduced unit database with a desired size is generated based on a...
6983249 Systems and methods for voice synthesis  
Systems and methods for voice synthesis are disclosed for providing a synthesized voice message that is consonant with the taste of a customer and a program storage device readable by machine to...
6975987 Device and method for synthesizing speech  
The present invention provides pitch conversion processing technology capable of minimizing the distortion of speech sound naturalness. A speech waveform in a pitch-unit is considered to be...
6975989 Text to speech synthesizer with facial character reading assignment unit  
There is provided a text analyzer for analyzing Japanese text data, a facial character reading assignment unit for assigning facial character readings to character string portions of text analysis...
6970819 Speech synthesis device  
The principal object of this invention is to provide a suitable control method for closing length with respect to phonemes (such as unvoiced plosive consonants) having a closing interval, and as a...
6970820 Voice personalization of speech synthesizer  
The speech synthesizer is personalized to sound like or mimic the speech characteristics of an individual speaker. The individual speaker provides a quantity of enrollment data, which can be...
6968309 Method and system for speech frame error concealment in speech decoding  
A method and system for concealing errors in one or more bad frames in a speech sequence as part of an encoded bit stream received in a decoder. When the speech sequence is voiced, the...
6965862 Reading machine  
A portable reading machine has a scanner for scanning an image comprising text. The scanner has a scanning area occupying a maximum width and an active width defined by a scanning width limiting...
6963838 Adaptive hosted text to speech processing  
Techniques are provided performing text-to-speech translation in situations in which the input texts may contain unanticipated content. According to one aspect of the invention, text-to-speech...
6959279 Text-to-speech conversion system on an integrated circuit  
A text-to-speech conversion system that includes a first module to convert text into words, a second module to convert words into phonemes, a third module to map phonemes to sound units, and a...
6950798 Employing speech models in concatenative speech synthesis  
A text-to-speech synthesizer employs database that includes units. For each unit there is a collection of unit selection parameters and a plurality of frames. Each frame has a set of model...
6947731 Method for converting status messages output in spoken form  
A method for conversion of a voice output of appliance statuses, wherein three spoken phrases are stored for each appliance to be controlled, with the first spoken phrase being allocated to a...
6947893 Acoustic signal transmission with insertion signal for machine control  
The acoustic signal transmission includes electrically synthesizing an audible sound signal and an insertion signal to generate a synthesized sound electrical signal at the sending side,...
6941267 Speech data compression/expansion apparatus and method  
Waveform data is extracted by referring to an existing waveform dictionary. Regarding the waveform data, a use frequency used for speech synthesis is accumulated and stored. A compression method...
6934680 Method for generating a statistic for phone lengths and method for determining the length of individual phones for speech synthesis  
A statistic for phone lengths is generated by determining the length of individual phones for speech synthesis. A primary statistic is based on primary clusters (for example triphones), and a...
6925437 Electronic mail device and system  
An amount of character setting information inserted in a mail text is minimized. A character is selected from a list and a given text letter string of the inputted information is inserted in the...
6876968 Run time synthesizer adaptation to improve intelligibility of synthesized speech  
A method and system provide for run-time modification of synthesized speech. The method includes the step of generating synthesized speech based on textual input and a plurality of run-time...
6871175 Voice encoding apparatus and method therefor  
A voice encoding method includes the steps of encoding a first frame that contains a plurality of voice data into encoded parameters, locally decoding the encoded parameters of the first frame...
6868380 Speech recognition system and method for generating phonotic estimates  
A speech recognition system for transforming an acoustic signal into a stream of phonetic estimates includes a frequency analyzer for generating a short-time frequency representation of the...
6859775 Joint optimization of excitation and model parameters in parametric speech coders  
A speech synthesis system is provided that optimizes a synthesis filter. Optimization is achieved by minimizing a synthesis error between the original speech sample and a synthesized speech...
6856958 Methods and apparatus for text to speech processing using language independent prosody markup  
Techniques are described for employing a set of tags to model phenomena which are smooth and subject to constraints. Tags may be used to model, for example, muscular movement producing speech. In...
6845359 FFT based sine wave synthesis method for parametric vocoders  
A Fast Fourier Transform (FFT) based voice synthesis method 110, program product and vocoder. Sounds, e.g., speech and audio, are synthesized from multiple sine waves. Each sine wave component is...
6829581 Method for prosody generation by unit selection from an imitation speech database  
A method is provided for prosody generation by unit selection from an imitation speech database. A rule based method of text to speech conversion is used to produce a set of intonation events by...
6826530 Speech synthesis for tasks with word and prosody dictionaries  
A plurality of tasks are set in a speech synthesizing process, in which at least one of speakers, emotion or situation at the time speeches are made, and contents of the speeches, is different,...
6813604 Methods and apparatus for speaker specific durational adaptation  
A text to speech system modeling durational characteristics of a target speaker is addressed herein. A body of target speaker training text is selected having maximum possible information about...
6813607 Translingual visual speech synthesis  
A computer implemented method in a language independent system generates audio-driven facial animation given the speech recognition system for just one language. The method is based on the...
6810379 Client/server architecture for text-to-speech synthesis  
A client/server text-to-speech synthesis system and method divides the method optimally between client and server. The server stores large databases for pronunciation analysis, prosody generation,...
6810378 Method and apparatus for controlling a speech synthesis system to provide multiple styles of speech  
A method and apparatus for synthesizing speech from text whereby the speech may be generated in a manner so as to effectively convey a particular, selectable style. Repeated patterns of one or...
6804649 Expressivity of voice synthesis by emphasizing source signal features  
Voice synthesis with improved expressivity is obtained in a voice synthesiser of source-filter type by making use of a library of source sound categories in the source module. Each source sound...
6801499 Diversity schemes for packet communications  
A process (111,101) of sending packets of real-time information at a sender (311) includes steps of initially generating at the sender the packets of real-time information with a source rate (s11)...
6801894 Speech synthesizer that interrupts audio output to provide pause/silence between words  
A speech synthesizer includes a data memory having a plurality of address areas, which stores a plurality of phases in the address areas and an address designating circuit designating one of the...
6766298 Application server configured for dynamically generating web pages for voice enabled web applications  
A unified web-based voice messaging system provides voice application control between a web browser and an application server via an hypertext transport protocol (HTTP) connection on an Internet...
6757653 Reassembling speech sentence fragments using associated phonetic property  
A method of composing messages for speech output and the improvement of the quality of reproduction of speech outputs. A series of original sentences for messages is segmented and stored as audio...
6757654 Forward error correction in speech coding  
An improved forward error correction (FEC) technique for coding speech data provides an encoder module which primary-encodes an input speech signal using a primary synthesis model to produce...
6754630 Synthesis of speech from pitch prototype waveforms by time-synchronous waveform interpolation  
In a method of synthesizing voiced speech from pitch prototype waveforms by time-synchronous waveform interpolation (TSWI), one or more pitch prototypes is extracted from a speech signal or a...
6751592 Speech synthesizing apparatus, and recording medium that stores text-to-speech conversion program and can be read mechanically  
A text analysis section reads, from a text file, a text to be subjected to speech synthesis, and analyzes the text using a morphological analysis section, a syntactic structure analysis section, a...
6748357 Device and method for reproduction of sounds with independently variable duration and pitch  
The waveform generation device can reproduce waveform data of various sounds stored in a memory at a reproduction velocity of reproducing waveforms at a real time and is provided with a storage...
6748355 Method of sound synthesis  
A sound synthesis method for modeling and synthesizing dynamic, parameterized sounds. The sound synthesis method yields perceptually convincing sounds and provides flexibility through model...
6748358 ELECTRONIC SPEAKING DOCUMENT VIEWER, AUTHORING SYSTEM FOR CREATING AND EDITING ELECTRONIC CONTENTS TO BE REPRODUCED BY THE ELECTRONIC SPEAKING DOCUMENT VIEWER, SEMICONDUCTOR STORAGE CARD AND INFORMATION PROVIDER SERVER  
An improved electronic speaking document viewer is provided in order that a user can readily use electronic texts in the same manner as reading text images printed on paper. The electronic...
6738457 Voice processing system  
A voice processing system 10 is connected to the telephone network 110, and runs one or more applications 220 for controlling interaction with calls to or from the telephone network. The system...
6735567 Encoding and decoding speech signals variably based on signal classification  
A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the...
6728675 Data processor controlled display system with audio identifiers for overlapping windows in an interactive graphical user interface  
There is provided a user friendly display interface system for the interactive handling and sorting out of windows in complex window hierarchical graphical user interfaces. The system provides for...
6725199 Speech synthesis apparatus and selection method  
A speech synthesizer includes plural synthesis engines each having different characteristics and converting text-form utterances into speech form. One of the synthesis engines is selected as the...
6704711 System and method for modifying speech signals  
A system and method for speech signal enhancement upsamples a narrowband speech signal at a receiver to generate a wideband speech signal. The lower frequency range of the wideband speech signal...
6697780 Method and apparatus for rapid acoustic unit selection from a large speech corpus  
A speech synthesis system can select recorded speech fragments, or acoustic units, from a very large database of acoustic units to produce artificial speech. The selected acoustic units are chosen...
6691083 Wideband speech synthesis from a narrowband speech signal  
Wideband speech is synthesized from a bandlimited speech signal, for example from speech which has been transmitted via the public switched telephone network. Due to the nature of the vocal tract,...