Matches 1 - 50 out of 330 1 2 3 4 5 6 7 >


Match Document Document Title
9026440 Method for identifying speech and music components of a sound signal  
The present invention relates to means and methods of automated difference recognition between speech and music signals in voice communication systems, devices, telephones, and methods, and more...
9009052 System and method for singing synthesis capable of reflecting voice timbre changes  
Herein provided is a system for singing synthesis capable of reflecting not only pitch and dynamics changes but also timbre changes of a user's singing. A spectral transform surface generating...
8977552 Method and system for enhancing a speech database  
A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database,...
8924204 Method and apparatus for wind noise detection and suppression using multiple microphones  
Unlike sound based pressure waves that go everywhere, air turbulence caused by wind is usually a fairly local event. Therefore, in a system that utilizes two or more spatially separated...
8909538 Enhanced interface for use with speech recognition  
Improved methods of presenting speech prompts to a user as part of an automated system that employs speech recognition or other voice input are described. The invention improves the user interface...
8898062 Strained-rough-voice conversion device, voice conversion device, voice synthesis device, voice conversion method, voice synthesis method, and program  
A strained-rough-voice conversion unit (10) is included in a voice conversion device that can generate a “strained rough” voice produced in a part of a speech when speaking forcefully with...
8898057 Encoding apparatus, decoding apparatus and methods thereof  
Disclosed is an encoding apparatus that can efficiently encode a signal that is a broad or extra-broad band signal or the like, thereby improving the quality of a decoded signal. This encoding...
8886539 Prosody generation using syllable-centered polynomial representation of pitch contours  
The present invention discloses a parametrical representation of prosody based on polynomial expansion coefficients of the pitch contour near the center of each syllable. The said syllable pitch...
8886538 Systems and methods for text-to-speech synthesis using spoken example  
Systems and methods for speech synthesis and, in particular, text-to-speech systems and methods for converting a text input to a synthetic waveform by processing prosodic and phonetic content of a...
8868422 Storing a representative speech unit waveform for speech synthesis based on searching for similar speech units  
According to one embodiment, a method for editing speech is disclosed. The method can generate speech information from a text. The speech information includes phonologic information and prosody...
8868432 Audio signal bandwidth extension in CELP-based speech coder  
A method for decoding an audio signal having a bandwidth that extends beyond a bandwidth of a CELP excitation signal in an audio decoder including a CELP-based decoder element. The method includes...
8868431 Recognition dictionary creation device and voice recognition device  
A recognition dictionary creation device identifies the language of a reading of an inputted text which is a target to be registered and adds a reading with phonemes in the language identified...
8856012 Apparatus and method of encoding and decoding signals  
A method of encoding an audio signal, where signals including two or more channel signals are downmixed to a mono signal, the mono signal is divided into a low-frequency signal and a...
8856008 Training and applying prosody models  
Techniques for training and applying prosody models for speech synthesis are provided. A speech recognition engine processes audible speech to produce text annotated with prosody information. A...
8812324 Coding, modification and synthesis of speech segments  
The invention relates to a method for speech signal analysis, modification and synthesis comprising a phase for the location of analysis windows by means of an iterative process for the...
8744851 Method and system for enhancing a speech database  
A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database,...
8738381 Prosody generating devise, prosody generating method, and program  
A prosody generation apparatus capable of suppressing distortion that occurs when generating prosodic patterns and therefore generating a natural prosody is provided. A prosody changing point...
8706497 Speech signal restoration device and speech signal restoration method  
A synthesis filter 106 synthesizes a plurality of wide-band speech signals by combining wide-band phoneme signals and sound source signals from a speech signal code book 105, and a distortion...
8706496 Audio signal transforming by utilizing a computational cost function  
A sequence is received of time domain digital audio samples representing sound (e.g., a sound generated by a human voice or a musical instrument). The time domain digital audio samples are...
8706493 Controllable prosody re-estimation system and method and computer program product thereof  
In one embodiment of a controllable prosody re-estimation system, a TTS/STS engine consists of a prosody prediction/estimation module, a prosody re-estimation module and a speech synthesis module....
8682654 Systems and methods for classifying sports video  
Disclosed are systems, methods, and computer readable media having programs for classifying sports video. In one embodiment, a method includes: extracting, from an audio stream of a video clip, a...
8655659 Personalized text-to-speech synthesis and personalized speech feature extraction  
A personalized text-to-speech synthesizing device includes: a personalized speech feature library creator, configured to recognize personalized speech features of a specific speaker by comparing a...
8655650 Multiple stream decoder  
A method is provided for decoding data streams in a voice communication system. The method includes: receiving two or more data streams having voice data encoded therein; decoding each data stream...
8645126 Apparatus and method of encoding and decoding signals  
A method of encoding an audio signal, where signals including two or more channel signals are downmixed to a mono signal, the mono signal is divided into a low-frequency signal and a...
8645140 Electronic device and method of associating a voice font with a contact for text-to-speech conversion at the electronic device  
A method of associating a voice font with a contact for text-to-speech conversion at an electronic device includes obtaining, at the electronic device, the voice font for the contact, and storing...
8634783 Communication device with reduced noise speech coding  
A communication device includes memory, an input interface, a processing module, and a transmitter. The processing module receives a digital signal from the input interface, wherein the digital...
8635065 Apparatus and method for automatic extraction of important events in audio signals  
The present invention discloses an apparatus for automatic extraction of important events in audio signals comprising: signal input means for supplying audio signals;audio signal fragmenting means...
8630857 Speech synthesizing apparatus, method, and program  
Disclosed is a speech synthesizing apparatus including a segment selection unit that selects a segment suited to a target segment environment from candidate segments, includes a prosody change...
8620661 System for controlling digital effects in live performances with vocal improvisation  
A system for controlling digital effects in live performances with vocal improvisation is described. The system features a controller that utilizes several switches attached to clothing that is...
8612239 Apparatus and method for coding audio data based on input signal distribution characteristics of each channel  
Provided is an audio coding apparatus and method that can selectively apply a operation mode of a coding module for stereo or multi-channel representation according to input signal characteristics...
8606569 Automatic determination of multimedia and voice signals  
The present invention relates to means and methods of classifying speech and music signals in voice communication systems, devices, telephones, and methods, and more specifically, to systems,...
8583439 Enhanced interface for use with speech recognition  
Improved methods of presenting speech prompts to a user as part of an automated system that employs speech recognition or other voice input are described. The invention improves the user interface...
8583442 Rhythm processing and frequency tracking in gradient frequency nonlinear oscillator networks  
A method for mimicking the auditory system's response to rhythm of an input signal having a time varying structure comprising the steps of receiving a time varying input signal x(t) to a network...
8583443 Recording and reproducing apparatus  
Disclosed is a recording and reproducing apparatus comprising: an apparatus main body; and a remote controller to perform remote control of the apparatus main body, wherein the remote controller...
8566092 Method and apparatus for extracting prosodic feature of speech signal  
The present invention discloses a method and an apparatus for extracting a prosodic feature of a speech signal, the method including: dividing the speech signal into speech frames; transforming...
8554566 Training and applying prosody models  
Techniques for training and applying prosody models for speech synthesis are provided. A speech recognition engine processes audible speech to produce text annotated with prosody information. A...
8510112 Method and system for enhancing a speech database  
A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database,...
8510113 Method and system for enhancing a speech database  
A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database,...
8494856 Speech synthesizer, speech synthesizing method and program product  
According to one embodiment, a speech synthesizer includes an analyzer, a first estimator, a selector, a generator, a second estimator, and a synthesizer. The analyzer analyzes text and extracts a...
8484026 Portable audio control system and audio control device thereof  
A portable audio control system that controls an audio signal transmitted from an electronic device, including an earphone device and an audio control device. The audio control device includes an...
8484037 Bandwidth extension apparatus for automatically adjusting the bandwidth of inputted signal and a method therefor  
A bandwidth extension apparatus can generate from an inputted speech signal a bandwidth-extended signal whose bandwidth is automatically adjusted according to the surrounding hearing situation by...
8478587 Voice analysis device, voice analysis method, voice analysis program, and system integration circuit  
A sound analysis device comprises: a sound parameter calculation unit operable to acquire an audio signal and calculate a sound parameter for each of partial audio signals, the partial audio...
8463412 Method and apparatus to facilitate determining signal bounding frequencies  
A signal processing platform (300) presents (101) a signal to be processed and identifies (102) signal portions with specific characteristics that are used (103) to automatically determine at...
8457969 Audio pitch changing device  
An effect device may be configured such that when an input audio signal switches from a consonant to a vowel and an input level of the switched vowel is greater than a threshold value Lc (and a...
8457115 Method and apparatus for concealing lost frame  
A method for concealing lost frame includes: using history signals before the lost frame that corresponds to a lost MDCT coefficient to generate a first synthesized signal when it is detected that...
8442831 Sound envelope deconstruction to identify words in continuous speech  
A speech recognition capability in which words of spoken text are identified based on the contour of sound waves representing the spoken text. Variations in the contour of the sound waves are...
8438033 Voice conversion apparatus and method and speech synthesis apparatus and method  
A voice conversion apparatus stores, in a parameter memory, target speech spectral parameters of target speech, stores, in a voice conversion rule memory, a voice conversion rule for converting...
8438017 Method and apparatus for encoding/decoding audio signal using adaptive LPC coefficient interpolation  
Provided are a method and apparatus for encoding or decoding an audio signal by adaptively interpolating a linear predictive coding (LPC) coefficient. In the method and apparatus of encoding or...
8433573 Prosody modification device, prosody modification method, and recording medium storing prosody modification program  
A prosody modification device includes: a real voice prosody input part that receives real voice prosody information extracted from an utterance of a human; a regular prosody generating part that...
8428958 Apparatus and method of encoding and decoding signals  
A method of encoding an audio signal, where signals including two or more channel signals are downmixed to a mono signal, the mono signal is divided into a low-frequency signal and a...

Matches 1 - 50 out of 330 1 2 3 4 5 6 7 >