Match Document Document Title
7315813 Method of speech segment selection for concatenative synthesis based on prosody-aligned distance measure  
A method of speech segment selection for concatenative synthesis based on prosody-aligned distance measure is disclosed. This method is based on comparison of speech segments segmented from a...
7305337 Method and apparatus for speech coding and decoding  
The present invention includes a method for speech encoding and decoding and a design of speech coder and decoder. The characteristic of speech encoding method relies on the type of data with high...
7289951 Method for improving the coding efficiency of an audio signal  
The invention relates to a method for improving the coding accuracy and transmission efficiency of an audio signal. According to the method, a part of the audio signal to be coded is compared with...
7286986 Method and apparatus for smoothing fundamental frequency discontinuities across synthesized speech segments  
A method of smoothing fundamental frequency discontinuities at boundaries of concatenated speech segments includes determining, for each speech segment, a beginning fundamental frequency value and...
7286562 System and method for dynamically changing error algorithm redundancy levels  
The invention is concerned with improvements in full duplex Internet telephone systems with a system architecture having low latency and permitting voice communication with telephone to telephone...
7286980 Speech processing apparatus and method for enhancing speech information and suppressing noise in spectral divisions of a speech signal  
A speech processing apparatus and method may identify divisions of a signal spectrum as having a speech component or having no speech component. A comb filter is generated, based on a high-accuracy...
7280958 Method and system for suppressing receiver audio regeneration  
The invention concerns a method ( 500 ) and system ( 100 ) for suppressing receiver audio regeneration. The method ( 500 ) includes the steps of receiving a communication signal ( 502 ), at a Radio...
7277916 Dynamic translation between data network-based protocol in a data-packet-network and interactive voice response functions of a telephony network  
A system for emulating interaction with an interactive voice response unit is provided. The system comprises, a client node connected to the network, the client node soliciting interaction with the...
7275030 Method and apparatus to compensate for fundamental frequency changes and artifacts and reduce sensitivity to pitch information in a frame-based speech processing system  
A method, computer program product, and data processing system for compensating for fundamental frequency changes in a frame-based speech processing system is disclosed. In a preferred embodiment...
7272551 Computational effectiveness enhancement of frequency domain pitch estimators  
Estimating a speech signal pitch frequency by determining a speech signal frame line spectrum including spectral lines having respective line amplitudes and frequencies, selecting a predefined...
7266493 Pitch determination based on weighting of pitch lag candidates  
There is provided a method of selecting a pitch lag value from a plurality of pitch lag candidates for coding a speech signal. The method comprises identifying the plurality of pitch lag candidates...
7251597 Method for tracking a pitch signal  
A method for tracking pitch signal, including receiving a detected pitch signal that consists of a succession of pitch values, and for each current pitch value in the detected signal perform the...
7246059 Method for fast dynamic estimation of background noise  
The invention provides a method and system for dynamically estimating background noise. The system includes a portable communication device, a vocoder, and a voice activated detector. Based on...
7239999 Speed control playback of parametric speech encoded digital audio  
A method of pitch corrected speed control (PCSC) playback in which a decoder rate controller receives a desired playback speed from a PCSC controller and determines the number of decoded digital...
7236927 Pitch extraction methods and systems for speech coding using interpolation techniques  
A method of searching for an interpolated peak of a Normalized Correlation Square (NCS) signal derived from an audio signal, comprises: producing quadratically interpolated correlation (QIC) signal...
7233897 Method and apparatus for performing packet loss or frame erasure concealment  
The invention concerns a method and apparatus for performing packet loss or Frame Erasure Concealment (FEC) for a speech coder that does not have a built-in or standard FEC process. A receiver with...
7233894 Low-frequency band noise detection  
A pitch estimation system including a low-frequency band noise detector (LBND) operative to detect the presence of low-frequency band noise in a first audio frame, a frequency-domain pitch...
7231346 Speech section detection apparatus  
A speech section detection apparatus capable of reliably detecting a speech section even for a word containing a glottal stop sound or for a word containing a succession of ā€œsā€ column sounds or...
7219061 Method for detecting the time sequences of a fundamental frequency of an audio response unit to be synthesized  
Predetermined macrosegments of the fundamental frequency are determined by a neural network, and these predefined macrosegments are reproduced by fundamental-frequency sequences stored in a...
7212517 Method and apparatus for jitter and frame erasure correction in packetized voice communication systems  
The invention comprises a system and method for correcting jitter and frame erasure in packet voice communication systems with out severely affecting the voice quality of the signal to be easily...
7203639 ***WITHDRAWN PATENT AS PER THE LATEST USPTO WITHDRAWN LIST***
2-D processing of speech
 
Acoustic signals are analyzed by two-dimensional (2-D) processing of the one-dimensional (1-D) speech signal in the time-frequency plane. The short-space 2-D Fourier transform of a...
7191105 Characterizing, synthesizing, and/or canceling out acoustic signals from sound sources  
A system for characterizing, synthesizing, and/or canceling out acoustic signals from inanimate and animate sound sources. Electromagnetic sensors monitor excitation sources in sound producing...
7177810 Method and apparatus for performing prosody-based endpointing of a speech signal  
A method and apparatus for finding endpoints in speech by utilizing information contained in speech prosody. Prosody denotes the way speakers modulate the timing, pitch and loudness of phones,...
7155313 Robot apparatus  
A robot apparatus is provided. A CPU 15 determines an output of a feeling model based on signals supplied from a touch sensor 20 . The CPU 15 also deciphers whether or not an output value of...
7155314 Robot apparatus  
A robot apparatus is provided. A CPU 15 determines an output of a feeling model based on signals supplied from a touch sensor 20 . The CPU 15 also deciphers whether or not an output value of...
7155386 Adaptive correlation window for open-loop pitch  
An approach for adaptively adjusting the correlation window for open-loop pitch determination is presented. Correlation between a windowed reference signal (or target signal) and a candidate signal...
7151983 Robot apparatus  
A robot apparatus is provided. A CPU 15 determines an output of a feeling model based on signals supplied from a touch sensor 20 . The CPU 15 also deciphers whether or not an output value of...
7151802 High frequency content recovering method and device for over-sampled synthesized wideband signal  
In a method and device for recovering the high frequency content of a wideband signal previously down-sampled, and for injecting this high frequency content in an over-sampled synthesized version...
7146250 Robot apparatus  
A robot apparatus is provided. A CPU 15 determines an output of a feeling model based on signals supplied from a touch sensor 20 . The CPU 15 also deciphers whether or not an output value of...
7146251 Robot apparatus  
A robot apparatus is provided. A CPU 15 determines an output of a feeling model based on signals supplied from a touch sensor 20 . The CPU 15 also deciphers whether or not an output value of...
7146249 Robot apparatus  
A robot apparatus is provided. A CPU 15 determines an output of a feeling model based on signals supplied from a touch sensor 20 . The CPU 15 also deciphers whether or not an output value of...
7146252 Robot apparatus  
A robot apparatus is provided. A CPU 15 determines an output of a feeling model based on signals supplied from a touch sensor 20. The CPU 15 also deciphers whether or not an output value of...
7142946 Robot apparatus  
A robot apparatus is provided. A CPU 15 determines an output of a feeling model based on signals supplied from a touch sensor 20 . The CPU 15 also deciphers whether or not an output value of...
7139700 Hybrid speech coding and system  
Linear predictive speech coding system with classification of frames and a hybrid coder using both waveform coding and parametric coding for different classes of frames. Phase alignment for a...
7127389 Method for encoding and decoding spectral phase data for speech signals  
A speech decoder and a segment aligner are provided in the present invention. The speech decoder may include a spectrum reconstructor operative to reconstruct the spectrum of a speech segment from...
7124075 Methods and apparatus for pitch determination  
Methods and apparatus for detecting periodicity and/or for determining the fundamental period of a signal such as speech. The methods include embedding a portion of a sampled digitized signal into...
7120575 Method and system for the automatic segmentation of an audio stream into semantic or syntactic units  
A digitized speech signal ( 600 ) is input to an F0 (fundamental frequency) processor that computes ( 610 ) a continuous F0 data from the speech signal. By the criterion voicing state transition...
7117147 Method and system for improving voice quality of a vocoder  
The invention concerns a method ( 300 ) and system ( 100 ) for improving voice quality of a vocoder ( 138, 158 ). The method includes the steps of monitoring ( 312 ) a pitch of a voice signal ( 400...
7117146 System for improved use of pitch enhancement with subcodebooks  
A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the...
7107213 Device for normalizing voice pitch for voice recognition  
In a voice pitch normalization device equipped in a voice recognition device VRAp for recognizing an incoming command voice Sva uttered by any speaker, and used to normalize the incoming command...
7103539 Enhanced coded speech  
According to the invention, a method for increasing quality of an enhanced output signal to approximate an undistorted sound signal is disclosed. In one step, a distorted input signal is received...
H002172 Pitch-synchronous speech processing  
Pitch-synchronous speech processing invention involves two main steps: 1) divide the speech into pitch periods, or into pseudo pitch periods for unvoiced speech, where the breaks occur, for...
7099820 Method and apparatus for concealing jitter buffer expansion and contraction  
Methods for concealing audible distortions resulting from changes in jitter buffer size include receiving an audio stream, storing the audio stream in a jitter buffer, and determining a pitch...
7092881 Parametric speech codec for representing synthetic speech in the presence of background noise  
A system and method are provided for processing audio and speech signals using a pitch and voicing dependent spectral estimation algorithm (voicing algorithm) to accurately represent voiced speech,...
7092874 Method and device for speech analysis  
A device and a method for speech analysis are provided, comprising measuring fundamental notes of a speech sequence to be analysed and identifying frequency intervals between at least some of said...
7085722 System and method for menu-driven voice control of characters in a game environment  
In a gaming system, a user controls actions of characters in the game environment using speech commands. In a learning mode, available speech commands are displayed in a command menu on a display...
7054806 Speech synthesis apparatus using pitch marks, control method therefor, and computer-readable memory  
The distance between the first two pitch marks of a voiced portion of speech data to be processed is calculated. The difference between the adjacent inter-pitch-mark distances is calculated. The...
7047190 Method and apparatus for performing packet loss or frame erasure concealment  
The invention concerns a method and apparatus for performing packet loss or Frame Erasure Concealment (FEC) for a speech coder that does not have a built-in or standard FEC process. A receiver with...
7047184 Speech coding apparatus and speech decoding apparatus  
A speech coding apparatus comprises a repetition period pre-selecting unit for generating a plurality of candidates for the repetition period of a driving excitation source by multiplying the...
7043424 Pitch mark determination using a fundamental frequency based adaptable filter  
A method of pitch mark determination for a speech includes the following steps. First, a fundamental frequency and fundamental frequency passband signals are acquired by using an adaptable filter....