| Match | Document | Document Title |
|
|
7617094 |
Methods, apparatus, and products for identifying a conversation
One aspect of the invention is a method of using a computer to identify a conversation. Another aspect is a method for an audio processing system that identifies conversations and enhances each...
|
|
|
7596488 |
System and method for real-time jitter control and packet-loss concealment in an audio signal
An “adaptive audio playback controller” operates by decoding and reading received packets of an audio signal into a signal buffer. Samples of the decoded audio signal are then played out of the...
|
|
|
7596487 |
Method of detecting voice activity in a signal, and a voice signal coder including a device for implementing the method
A method of detecting voice activity in a signal smoothes the “voice” or “noise” decision to avoid loss of speech segments. The method is particularly suitable for situations in which the...
|
|
|
7590524 |
Method of filtering speech signals to enhance quality of speech and apparatus thereof
The present invention relates to enhancing a quality of speech wherein speech quality degradation is reduced by removing noise from an unvoiced speech. The present invention comprises dividing an...
|
|
|
7577565 |
Adaptive voice playout in VOP
Packetized CELP-encoded speech playout with frame truncation during silence and frame expansion method dependent upon voicing classification with voiced frame expansion maintaining phase alignment.
|
|
|
7577564 |
Method and apparatus for detecting illicit activity by classifying whispered speech and normally phonated speech according to the relative energy content of formants and fricatives
Method and apparatus for the classification of speech signals. Speech is classified into two broad classes of speech production—whispered speech and normally phonated speech. Speech classified in...
|
|
|
7574451 |
System and method for speeding up database lookups for multiple synchronized data streams
A “Media Identifier” operates on concurrent media streams to provide large numbers of clients with real-time server-side identification of media objects embedded in streaming media, such as...
|
|
|
7574357 |
Applications of sub-audible speech recognition based upon electromyographic signals
Method and system for generating electromyographic or sub-audible signals (“SAWPs”) and for transmitting and recognizing the SAWPs that represent the original words and/or phrases. The SAWPs...
|
|
|
7567908 |
Differential dynamic content delivery with text display in dependence upon simultaneous speech
Differential dynamic content delivery including providing a session document for a presentation, wherein the session document includes a session grammar and a session structured document; selecting...
|
|
|
7567900 |
Harmonic structure based acoustic speech interval detection method and device
A harmonic structure acoustic signal detection device not depending on the level fluctuation of the input signal including: an FFT unit which performs FFT on an input signal and calculates a power...
|
|
|
7565286 |
Method for recovery of lost speech data
A method for lost speech samples recovery in speech transmission systems is disclosed. The method employs a waveform coder operating on digital speech samples. It exploits the composite model of...
|
|
|
7555310 |
Electronic apparatus and computer readable medium recorded voice operating program
An electronic apparatus configured, on the assumption of its sharing by a plurality of users, to provide a plurality of functions, in which normal operations and interrupt operations based on a...
|
|
|
7542787 |
Apparatus and method for providing hands-free operation of a device
The present invention provides an apparatus and method for providing hands-free operation of a device. A hands-free adapter is provided that communicates with a device and a headset. The hands-free...
|
|
|
7536298 |
Method of comfort noise generation for speech communication
An embodiment of the invention improves upon the International Telecommunication Union's ITU-T G.729 Annex B comfort noise generation algorithm by reducing the computational complexity of the...
|
|
|
7529664 |
Signal decomposition of voiced speech for CELP speech coding
An approach for improving quality of synthesized speech is presented. The input speech or residual is first separated into a voiced portion and a noise portion. The voice portion is coded using...
|
|
|
7505950 |
Soft alignment based on a probability of time alignment
Systems and methods are provided for performing soft alignment in Gaussian mixture model (GMM) based and other vector transformations. Soft alignment may assign alignment probabilities to source...
|
|
|
7505594 |
Discontinuous transmission (DTX) controller system and method
A method and apparatus for controlling a discontinuous transmission process. Audio information is digitized and provided to a vocoder. A voice activity level is determined from the digitized audio...
|
|
|
7478041 |
Speech recognition apparatus, speech recognition apparatus and program thereof
Provided is a method for canceling background noise of a sound source other than a target direction sound source in order to realize highly accurate speech recognition, and a system using the same....
|
|
|
7478040 |
Method for adaptive filtering
A method for adaptive long-term filtering of an audio signal, such as a decoded speech signal. The method includes measuring a smoothed periodicity of an audio signal segment, such as an audio...
|
|
|
7457746 |
Pitch prediction for packet loss concealment
There is provided a pitch lag predictor for use by a speech decoder to generate a predicted pitch lag parameter. The pitch lag predictor comprises a summation calculator configured to generate a...
|
|
|
7424427 |
Systems and methods for classifying audio into broad phoneme classes
An audio classification system classifies sounds in an audio stream as belonging to one of a relatively small number of classes. The audio classification system includes a signal analysis component...
|
|
|
7412379 |
Time-scale modification of signals
Techniques utilising Time Scale Modification (TSM) of signals are described. The signal is analysed and divided into frames of similar signal types. Techniques specific to the signal type are then...
|
|
|
7386444 |
Hybrid speech coding and system
Hybrid linear predictive speech coding system with phase alignment predictive quantization zero phase alignment of speech prior to waveform coding aligns synthesized speech frames of a waveform...
|
|
|
7376557 |
Method and apparatus of overlapping and summing speech for an output that disrupts speech
A privacy apparatus adds a privacy sound based on a speaker's own voice into the environment, thereby confusing listeners as to which of the sounds is the real source. This permits disruption of...
|
|
|
7366658 |
Noise pre-processor for enhanced variable rate speech codec
An enhanced noise pre-processor in a speech codec smoothes channel energy estimate moving toward a first smoothing constant if a prior signal to noise ratio estimate for more than five channels are...
|
|
|
7343284 |
Method and system for speech processing for enhancement and detection
A method for discriminating noise from signal in a noise-contaminated signal involves decomposing a frame of samples of the signal into decorrelated components, and using a difference between...
|
|
|
7337108 |
System and method for providing high-quality stretching and compression of a digital audio signal
An adaptive “temporal audio scaler” is provided for automatically stretching and compressing frames of audio signals received across a packet-based network. Prior to stretching or compressing...
|
|
|
7337107 |
Perceptual harmonic cepstral coefficients as the front-end for speech recognition
Pitch estimation and classification into voiced, unvoiced and transitional speech were performed by a spectro-temporal auto-correlation technique. A peak picking formula was then employed. A...
|
|
|
7295982 |
System and method for automatic verification of the understandability of speech
The present invention relates to a system and method for automatically verifying that a message received from a user is intelligible. In an exemplary embodiment, a message is received from the...
|
|
|
7277916 |
Dynamic translation between data network-based protocol in a data-packet-network and interactive voice response functions of a telephony network
A system for emulating interaction with an interactive voice response unit is provided. The system comprises, a client node connected to the network, the client node soliciting interaction with the...
|
|
|
7272551 |
Computational effectiveness enhancement of frequency domain pitch estimators
Estimating a speech signal pitch frequency by determining a speech signal frame line spectrum including spectral lines having respective line amplitudes and frequencies, selecting a predefined...
|
|
|
7266493 |
Pitch determination based on weighting of pitch lag candidates
There is provided a method of selecting a pitch lag value from a plurality of pitch lag candidates for coding a speech signal. The method comprises identifying the plurality of pitch lag candidates...
|
|
|
7246059 |
Method for fast dynamic estimation of background noise
The invention provides a method and system for dynamically estimating background noise. The system includes a portable communication device, a vocoder, and a voice activated detector. Based on...
|
|
|
7243062 |
Audio segmentation with energy-weighted bandwidth bias
A method (200) and apparatus (100) for segmenting a sequence of audio samples into homogeneous segments (550 and 555) are disclosed. The method (200) forms a sequence of frames (701 to ...
|
|
|
7233899 |
Speech recognition system using normalized voiced segment spectrogram analysis
Computer comparison of one or more dictionary entries with a sound record of a human utterance to determine whether and where each dictionary entry is contained within the sound record. The record...
|
|
|
7233894 |
Low-frequency band noise detection
A pitch estimation system including a low-frequency band noise detector (LBND) operative to detect the presence of low-frequency band noise in a first audio frame, a frequency-domain pitch...
|
|
|
7228271 |
Telephone apparatus
The telephone apparatus of the present invention comprises a first voice band expander for generating a voiced signal frequency component by shifting the frequency of the voice signal received, a...
|
|
|
7206739 |
Excitation codebook search method in a speech coding system
A method for searching an excitation (or fixed) codebook in a speech coding system. In a speech coding system including a synthesis filter for synthesizing a speech signal, a fixed codebook...
|
|
|
7191128 |
Method and system for distinguishing speech from music in a digital audio signal in real time
The present invention relates to method and system for distinguishing speech from music in a digital audio signal in real time. A method for distinguishing speech from music in a digital audio...
|
|
|
7191123 |
Gain-smoothing in wideband speech and audio signal decoder
The gain smoothing method and device modify the amplitude of an innovative codevector in relation to background noise present in a previously sampled wideband signal. The gain smoothing device...
|
|
|
7171357 |
Voice-activity detection using energy ratios and periodicity
A voice activity detector (100) filters (204) out noise energy and then computes a high-frequency (2400 Hz to 4000 Hz) versus low-frequency (100 Hz to 2400 Hz) signal energy ratio (224),...
|
|
|
7149683 |
Method and device for robust predictive vector quantization of linear prediction parameters in variable bit rate speech coding
The present invention relates to a method and device for quantizing linear prediction parameters in variable bit-rate sound signal coding, in which an input linear prediction parameter vector is...
|
|
|
7146310 |
Low bit-rate coding of unvoiced segments of speech
A low-bit-rate coding technique for unvoiced segments of speech includes the steps of extracting high-time-resolution energy coefficients from a frame of speech, quantizing the energy coefficients,...
|
|
|
7139700 |
Hybrid speech coding and system
Linear predictive speech coding system with classification of frames and a hybrid coder using both waveform coding and parametric coding for different classes of frames. Phase alignment for a...
|
|
|
7127392 |
Device for and method of detecting voice activity
The present invention is a device for and method of detecting voice activity. First, the AM envelope of a segment of a signal of interest is determined. Next, the number of times the AM envelope...
|
|
|
7120578 |
Silence description coding for multi-rate speech codecs
Speech coding systems include multi-rate speech codecs having an encoder and a decoder. Silence description coding for multi-rate speech coding systems that employ discontinued transmission is...
|
|
|
7120576 |
Low-complexity music detection algorithm and system
A method for detecting music in a speech signal having a plurality of frames. The method comprises defining a music threshold value for a first parameter extracted from a frame of the speech...
|
|
|
7117150 |
Voice detecting method and apparatus using a long-time average of the time variation of speech features, and medium thereof
A first filter (2061 in FIG. 1) calculates a long-time average of first change quantities based on a difference between a line spectral frequency of an input voice signal and a long-time...
|
|
|
7113522 |
Enhanced conversion of wideband signals to narrowband signals
Wideband speech signals must be converted to narrowband speech signals if the transmission medium or the destination terminal is constructed with narrowband constraints. A typical...
|
|
|
7103349 |
Method, system and network entity for providing text telephone enhancement for voice, tone and sound-based network services
The invention is a system, a method of transmitting messages selectively as text or non-text from an entity (104) in a network (100 and 102), and an entity in a network. A system in...
|