Matches 1 - 50 out of 50


Document  Document Title
8972258 Sparse maximum a posteriori (MAP) adaption  
Techniques disclosed herein include using a Maximum A Posteriori (MAP) adaptation process that imposes sparseness constraints to generate acoustic parameter adaptation data for specific users...
8930200 Vector joint encoding/decoding method and vector joint encoder/decoder  
A vector joint encoding/decoding method and a vector joint encoder/decoder are provided. More than two vectors are jointly encoded, and an encoding index of at least one vector is split and then...
8788268 Speech synthesis from acoustic units with default values of concatenation cost  
A speech synthesis system can select recorded speech fragments, or acoustic units, from a very large database of acoustic units to produce artificial speech. When a pair of acoustic units in the...
8768704 Methods and systems for automated generation of nativized multi-lingual lexicons  
An input signal that includes linguistic content in a first language may be received by a computing device. The linguistic content may include text or speech. Based on an acoustic feature...
8738376 Sparse maximum a posteriori (MAP) adaptation  
Techniques disclosed herein include using a Maximum A Posteriori (MAP) adaptation process that imposes sparseness constraints to generate acoustic parameter adaptation data for specific users...
8731938 Computer-implemented system and method for identifying and masking special information within recorded speech  
A computer-implemented system and method for identifying and masking special information within recorded speech is provided. A field for entry of special information is identified. Movement of a...
8706493 Controllable prosody re-estimation system and method and computer program product thereof  
In one embodiment of a controllable prosody re-estimation system, a TTS/STS engine consists of a prosody prediction/estimation module, a prosody re-estimation module and a speech synthesis module....
8576961 System and method for adaptive overlap and add length estimation  
A method for determining an overlap and add length estimate comprises determining a plurality of correlation values of a plurality of ordered frequency domain samples obtained from a data frame;...
8566106 Method and device for fast algebraic codebook search in speech and audio coding  
A method and device for searching an algebraic codebook during encoding of a sound signal, wherein the algebraic codebook comprises a set of codevectors formed of a number of pulse positions and a...
8494845 Signal distortion elimination apparatus, method, program, and recording medium having the program recorded thereon  
Provided is a signal distortion elimination apparatus comprising: an inverse filter application means that outputs the signal obtained by applying an inverse filter to an observed signal as a...
8370153 Speech analyzer and speech analysis method  
A speech analyzer includes a vocal tract and sound source separating unit which separates a vocal tract feature and a sound source feature from an input speech, based on a speech generation model,...
8321225 Generating prosodic contours for synthesized speech  
The subject matter of this specification can be implemented in, among other things, a computer-implemented method including receiving text to be synthesized as a spoken utterance. The method...
8315872 Methods and apparatus for rapid acoustic unit selection from a large speech corpus  
A speech synthesis system can select recorded speech fragments, or acoustic units, from a very large database of acoustic units to produce artificial speech. The selected acoustic units are chosen...
8214216 Speech synthesis for synthesizing missing parts  
A simply configured speech synthesis device and the like for producing a natural synthetic speech at high speed. When data representing a message template is supplied, a voice unit editor (5)...
8145477 Systems, methods, and apparatus for computationally efficient, iterative alignment of speech waveforms  
Systems, methods, and apparatus described include waveform alignment operations in which a single set of evaluated cosines and sines is used to calculate cross-correlations of two periodic...
8078466 Coarticulation method for audio-visual text-to-speech synthesis  
A method for generating animated sequences of talking heads in text-to-speech applications wherein a processor samples a plurality of frames comprising image samples. The processor reads first...
7835909 Method and apparatus for normalizing voice feature vector by backward cumulative histogram  
A method and apparatus for normalizing a histogram utilizing a backward cumulative histogram which can cumulate a probability distribution function in an order from a greatest to smallest value so...
7756715 Apparatus, method, and medium for processing audio signal using correlation between bands  
Apparatus, method, and medium for processing an audio signal using a correlation between bands are provided. The apparatus includes an encoding unit encoding an input audio signal and a decoding...
7747441 Method and apparatus for speech decoding based on a parameter of the adaptive code vector  
A high quality speech is reproduced with a small data amount in speech coding and decoding for performing compression coding and decoding of a speech signal to a digital signal. In speech coding...
7003461 Method and apparatus for an adaptive codebook search in a speech processing system  
An adaptive codebook search (ACS) algorithm is based on a set of matrix operations suitable for data processing engines supporting a single instruction multiple data (SIMD) architecture. The...
6996291 Systems and methods for correlating images in an image correlation system with reduced computational loads  
After one or both of a pair of images are obtained, an auto-correlation function for one of those images is generated to determine a smear amount and possibly a smear direction. The smear amount...
6937977 Method and apparatus for processing an input speech signal during presentation of an output audio signal  
A start of an input speech signal is detected during presentation of an output audio signal and an input start time, relative to the output audio signal, is determined. The input start time is...
6804649 Expressivity of voice synthesis by emphasizing source signal features  
Voice synthesis with improved expressivity is obtained in a voice synthesiser of source-filter type by making use of a library of source sound categories in the source module. Each source sound...
6681202 Wide band synthesis through extension matrix  
The invention describes a system that generates a wide band signal (100-7000 Hz) from a telephony band (or narrow band: 300-3400 Hz) speech signal to obtain an extended band speech signal...
6513007 Generating synthesized voice and instrumental sound  
There is provided a synthesized sound generating apparatus and method which can achieve responsive and high-quality speech synthesis based on a real-time convolution operation. Coefficients are...
6421636 Frequency converter system  
An apparatus and method are disclosed for converting an input signal having frequency related information sustained over a first duration of time into an output signal sustained over a second...
6208958 Pitch determination apparatus and method using spectro-temporal autocorrelation  
A pitch determination apparatus and method using spectro-temporal autocorrelation to prevent pitch determination errors are provided. The pitch determination apparatus using spectro-temporal...
5983181 Method and apparatus for reading-out/collating a table document, and computer-readable recording medium with program making computer execute method stored therein  
The table document preparation module prepares a table document containing cells, the read-out attribute setting module sets a read-out attribute specifying a way of reading-out cell data supplied...
5949961 Word syllabification in speech synthesis system  
The present invention relates to a system and method of word syllabification. The present invention receives a word to be syllabified and determines therefrom all possible substrings capable of...
5832442 High-efficiency algorithms using minimum mean absolute error splicing for pitch and rate modification of audio signals  
A method is disclosed of modification of parameters of audio signals by dividing a digital signal converted from an original analog signal into sound frames, modifying a pitch and a playing rate...
5809456 Voiced speech coding and decoding using phase-adapted single excitation  
The present invention relates to a method and to equipment for coding and decoding a sampled speech signal. It belongs to systems used in speech processing, in particular for compression of speech...
5761635 Method and apparatus for implementing a long-term synthesis filter  
A synthesis filter is disclosed which models the effect of the fundamental frequency of speech for digital speech coders operating on the analysis-by-synthesis principle. High fundamental...
5727125 Method and apparatus for synthesis of speech excitation waveforms  
A speech vocoder device and corresponding method synthesizes speech excitation waveforms. The method entails reconstructing (216) an excitation target from decoded speech data, creating (220)...
5696873 Vocoder system and method for performing pitch estimation using an adaptive correlation sample window  
An improved vocoder system and method for estimating pitch in a speech waveform. The method comprises an improved correlation method for estimating the pitch parameter which more accurately...
5684923 Methods and apparatus for compressing and quantizing signals  
A high efficiency coding method and apparatus includes quantization which takes into account correlation of input signals of plural channels in coding the input signals of the respective channels...
5659658 Method for converting speech using lossless tube models of vocal tracts  
A method of converting speech, in which reflection coefficients are calculated from a speech signal of a speaker. From these coefficients, characteristics of cross-sectional areas of cylinder...
5546498 Method of and device for quantizing spectral parameters in digital speech coders  
A method of and a device for speech signal digital coding are described, where spectral parameters are quantized at each frame in order to exploit the actual correlation inside a frame or between...
5473759 Sound analysis and resynthesis using correlograms  
A system for reconstructing a signal waveform from a correlogram is based upon the recognition that the information in each channel of the correlogram is equivalent to the magnitude of the Fourier...
4845754 Pole-zero analyzer  
A pole-zero analyzer includes an autocorrelation impulse response calculator, first and second buffers, a pole parameter value table, a first coefficient calculator, a first inverse filter, a...
4633500 Speech synthesizer  
A speech synthesizer includes a lattice-type multi-stage digital filter modified in the next to last stage thereof by incorporating an increasing circuit for slightly effectively increasing the...
4618982 Digital speech processing system having reduced encoding bit requirements  
A digitized speech signal is divided into sections and each section is analyzed by the linear prediction method to determine the coefficients of a sound formation model, a sound volume parameter,...
4489437 Speech synthesizer  
A speech synthesizer, using linear predictive coding technique, obtains variable frame lengths by multiplying a pitch period by the number of repetitions, and interpolates PARCOR coefficients at a...
4459674 Voice input/output apparatus  
An apparatus with both voice recognition and voice synthesis provides simultaneous and non-interfering operation by inserting reverse stop filters in the recognition system controlled by the...
4435832 Speech synthesizer having speech time stretch and compression functions  
A speech synthesizer is disclosed with the capability of stretching and compressing the speech time base without changing the pitch of the synthesized speech. One frame of speech is represented...
4398262 Time multiplexed n-ordered digital filter  
An n-ordered digital filter is disclosed in the form of a Partial Autocorrelation (PARCOR) lattice structure having two multipliers and two adders. Time multiplexing eliminates the use of...
4349699 Speech synthesizer  
This PARCOR-type speech synthesizer replaces a ten-stage lattice type filter with a pipeline multiplier and feedback loop, and provides a loss circuit (for bandwidth broadening) using subtraction...
4304965 Data converter for a speech synthesizer  
Data converter for a speech synthesizer system wherein encoded formant parameters as stored in a memory are decoded and transformed or converted to reflection coefficients in real time by means of...
4209844 Lattice filter for waveform or speech synthesis circuits using digital logic  
A digital filter of the type which may be used in circuits for generating complex waveforms, such as human speech. The filter has a multiplier, an adder coupled to the output of the multiplier and...
3786188 Synthesis of pure speech from a reverberant signal  
Speech that has been reverberated by the transfer function of a reverberant enclosure is analyzed to detect parameters from which an unreverberative synthetic version of the original speech may be...
3069507 Autocorrelation vocoder  
