Matches 1 - 50 out of 292 1 2 3 4 5 6 >


Match Document Document Title
8983842 Apparatus, process, and program for combining speech and audio data  
There is provided a speech processing apparatus including: a data obtaining unit which obtains music progression data defining a property of one or more time points or one or more time periods...
8977551 Parametric speech synthesis method and system  
The present invention provides a parametric speech synthesis method and a parametric speech synthesis system. The method comprises sequentially processing each frame of speech of each phone in a...
8977550 Information providing apparatus and information providing method  
Part units of speech information are arranged in a predetermined order to generate a sentence unit of a speech information set. To each of a plurality of speech part units of the speech...
8977552 Method and system for enhancing a speech database  
A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database,...
8909538 Enhanced interface for use with speech recognition  
Improved methods of presenting speech prompts to a user as part of an automated system that employs speech recognition or other voice input are described. The invention improves the user interface...
8898057 Encoding apparatus, decoding apparatus and methods thereof  
Disclosed is an encoding apparatus that can efficiently encode a signal that is a broad or extra-broad band signal or the like, thereby improving the quality of a decoded signal. This encoding...
8888494 Interactive environment for performing arts scripts  
One or more embodiments present a script to a user in an interactive script environment. A digital representation of a manuscript is analyzed. This digital representation includes a set of roles...
8868431 Recognition dictionary creation device and voice recognition device  
A recognition dictionary creation device identifies the language of a reading of an inputted text which is a target to be registered and adds a reading with phonemes in the language identified...
8862477 Menu hierarchy skipping dialog for directed dialog speech recognition  
A method and a processing device for managing an interactive speech recognition system is provided. Whether a voice input relates to expected input, at least partially, of any one of a group of...
8856008 Training and applying prosody models  
Techniques for training and applying prosody models for speech synthesis are provided. A speech recognition engine processes audible speech to produce text annotated with prosody information. A...
8838441 Time warped modified transform coding of audio signals  
A representation of an audio signal having a first, a second and a third frame is derived by estimating first warp information for the first and second frames and second warp information for the...
8775185 Speech samples library for text-to-speech and methods and apparatus for generating and using same  
A method for converting translating text into speech with a speech sample library is provided. The method comprises converting translating an input text to a sequence of triphones; determining...
8751237 Text-to-speech device and text-to-speech method  
A sound control section (114) selects and outputs a text-to-speech item from items included in program information multiplexed with a broadcast signal; and starts or stops outputting the...
8744851 Method and system for enhancing a speech database  
A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database,...
8744841 Adaptive time and/or frequency-based encoding mode determination apparatus and method of determining encoding mode of the apparatus  
An adaptive time/frequency-based encoding mode determination apparatus including a time domain feature extraction unit to generate a time domain feature by analysis of a time domain signal of an...
8731933 Speech synthesis apparatus and method utilizing acquisition of at least two speech unit waveforms acquired from a continuous memory region by one access  
A speech synthesizing apparatus includes a selector configured to select a plurality of speech units for synthesizing a speech of a phoneme sequence by referring to speech unit information stored...
8719030 System and method for speech synthesis  
The present invention is a method and system to convert speech signal into a parametric representation in terms of timbre vectors, and to recover the speech signal thereof. The speech signal is...
8706497 Speech signal restoration device and speech signal restoration method  
A synthesis filter 106 synthesizes a plurality of wide-band speech signals by combining wide-band phoneme signals and sound source signals from a speech signal code book 105, and a distortion...
8706493 Controllable prosody re-estimation system and method and computer program product thereof  
In one embodiment of a controllable prosody re-estimation system, a TTS/STS engine consists of a prosody prediction/estimation module, a prosody re-estimation module and a speech synthesis module....
8700388 Audio transform coding using pitch correction  
A processed representation of an audio signal having a sequence of frames is generated by sampling the audio signal within first and second frames of the sequence of frames, the second frame...
8655156 Auxiliary audio transmission for preserving synchronized playout with paced-down video  
In one method embodiment, providing a multiplex of compressed versions of a first video stream and a first audio stream, each corresponding to an audiovisual (A/V) program, the first video stream...
8630857 Speech synthesizing apparatus, method, and program  
Disclosed is a speech synthesizing apparatus including a segment selection unit that selects a segment suited to a target segment environment from candidate segments, includes a prosody change...
8612225 Voice recognition device, voice recognition method, and voice recognition program  
A voice recognition device that recognizes a voice of an input voice signal, comprises a voice model storage unit that stores in advance a predetermined voice model having a plurality of detail...
8604327 Apparatus and method for automatic lyric alignment to music playback  
There is provided an information processing device including a storage unit that stores music data for playing music and lyrics data indicating lyrics of the music, a display control unit that...
8583439 Enhanced interface for use with speech recognition  
Improved methods of presenting speech prompts to a user as part of an automated system that employs speech recognition or other voice input are described. The invention improves the user interface...
8583442 Rhythm processing and frequency tracking in gradient frequency nonlinear oscillator networks  
A method for mimicking the auditory system's response to rhythm of an input signal having a time varying structure comprising the steps of receiving a time varying input signal x(t) to a network...
8576961 System and method for adaptive overlap and add length estimation  
A method for determining an overlap and add length estimate comprises determining a plurality of correlation values of a plurality of ordered frequency domain samples obtained from a data frame;...
8554566 Training and applying prosody models  
Techniques for training and applying prosody models for speech synthesis are provided. A speech recognition engine processes audible speech to produce text annotated with prosody information. A...
8538758 Electronic apparatus  
An electronic apparatus includes a communication module, a storage module, a manipulation module, voice output control module, and a control module. The communication module receives book data...
8510112 Method and system for enhancing a speech database  
A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database,...
8510113 Method and system for enhancing a speech database  
A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database,...
8468020 Speech synthesis apparatus and method wherein more than one speech unit is acquired from continuous memory region by one access  
An apparatus for synthesizing a speech including a waveform memory that stores a plurality of speech unit waveforms, an information memory that correspondingly stores speech unit information and...
8447609 Adjustment of temporal acoustical characteristics  
Embodiments may be a standalone module or part of mobile devices, desktop computers, servers, stereo systems, or any other systems that might benefit from condensed audio presentations of item...
8438017 Method and apparatus for encoding/decoding audio signal using adaptive LPC coefficient interpolation  
Provided are a method and apparatus for encoding or decoding an audio signal by adaptively interpolating a linear predictive coding (LPC) coefficient. In the method and apparatus of encoding or...
8433575 Augmenting an audio signal via extraction of musical features and obtaining of media fragments  
A system and method is described in which a multimedia story is rendered to a consumer in dependence on features extracted from an audio signal representing for example a musical selection of the...
8433573 Prosody modification device, prosody modification method, and recording medium storing prosody modification program  
A prosody modification device includes: a real voice prosody input part that receives real voice prosody information extracted from an utterance of a human; a regular prosody generating part that...
8428953 Audio decoding device, audio decoding method, program, and integrated circuit  
An audio decoding device of the present invention includes: a decoding unit decoding a stream to a spectrum coefficient, and outputting stream information when a frame included in the stream...
8423358 Method and apparatus for performing packet loss or frame erasure concealment  
A method for performing packet loss or Frame Erasure Concealment (FEC) for a speech coder receives encoded frames of compressed speech information transmitted from an encoder. The method...
8423367 Apparatus and method for creating singing synthesizing database, and pitch curve generation apparatus and method  
Variation over time in fundamental frequency in singing voices is separated into a melody-dependent component and a phoneme-dependent component, modeled for each of the components and stored into...
8412518 Time warped modified transform coding of audio signals  
A representation of an audio signal having a first frame, a second frame following the first frame, and a third frame following the second frame, is derived by estimating first warp information...
8406432 Apparatus and method for automatic gain control using phase information  
An apparatus and a method for automatically controlling a gain using phase information are provided. The apparatus includes a frequency conversion unit converting each of input signals received...
8401856 Automatic normalization of spoken syllable duration  
A very common problem is when people speak a language other than the language which they are accustomed, syllables can be spoken for longer or shorter than the listener would regard as...
8386166 Apparatus for text-to-speech delivery and method therefor  
A method and apparatus for determining the manner in which a navigation device should produce sounds from data is described. One embodiment, includes a device for synthesizing sounds from digital...
8374873 Training and applying prosody models  
Techniques for training and applying prosody models for speech synthesis are provided. A speech recognition engine processes audible speech to produce text annotated with prosody information. A...
8346548 Aural similarity measuring system for text  
The aural similarity measuring system and method provides a measure of the aural similarity between a target text (10) and one or more reference texts (11). Both the target text (10) and the...
8340972 Psychoacoustic method and system to impose a preferred talking rate through auditory feedback rate adjustment  
The use of SOLA speech time compression/expansion in the present invention method as a means to alter a speaker's talking rate by adjusting the speech rate at which people hear their own voice. A...
8340967 Speech samples library for text-to-speech and methods and apparatus for generating and using same  
A method of recording speech for use in a speech samples library. In an exemplary embodiment, the method comprises recording a speaker pronouncing a phoneme with musical parameters characterizing...
8332215 Dynamic range control module, speech processing apparatus, and method for amplitude adjustment for a speech signal  
The invention provides a dynamic range control module installed in a speech processing apparatus. In one embodiment, the dynamic range control module comprises a buffer, a voice activity detector,...
8321216 Time-warping of audio signals for packet loss concealment avoiding audible artifacts  
Packet loss concealment (PLC) systems and methods are described that use time-warping to merge a concealment signal generated to replace one or more bad frames of an audio signal with a received...
8301279 Signal processing apparatus, signal processing method, and program therefor  
A signal processing apparatus subjects an audio signal to musical pitch analysis using different analysis techniques for the higher and lower frequencies. When an audio signal is input, a first...

Matches 1 - 50 out of 292 1 2 3 4 5 6 >