Matches 1 - 50 out of 236 1 2 3 4 5 >


Match Document Document Title
9043213 Speech recognition and synthesis utilizing context dependent acoustic models containing decision trees  
A speech recognition method including the steps of receiving a speech input from a known speaker of a sequence of observations and determining the likelihood of a sequence of words arising from...
9026438 Detecting barge-in in a speech dialogue system  
A method for detecting barge-in in a speech dialog system comprising determining whether a speech prompt is output by the speech dialog system, and detecting whether speech activity is present in...
9026445 Text-to-speech user's voice cooperative server for instant messaging clients  
A system and method to allow an author of an instant message to enable and control the production of audible speech to the recipient of the message. The voice of the author of the message is...
9020812 Audio signal processing method and device  
Disclosed is an audio signal processing method comprising the steps of: receiving an audio signal containing current frame data; generating a first temporary output signal for the current frame...
9002711 Speech synthesis apparatus and method  
According to an embodiment, a speech synthesis apparatus includes a selecting unit configured to select speaker's parameters one by one for respective speakers and obtain a plurality of speakers'...
8996384 Transforming components of a web page to voice prompts  
Embodiments of the invention address the deficiencies of the prior art by providing a method, apparatus, and program product to of converting components of a web page to voice prompts for a user....
8990087 Providing text to speech from digital content on an electronic device  
A method for providing text to speech from digital content in an electronic device is described. Digital content including a plurality of words and a pronunciation database is received....
8977552 Method and system for enhancing a speech database  
A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database,...
8954328 Systems and methods for document narration with multiple characters having multiple moods  
Disclosed are techniques and systems to provide a narration of a text in multiple different voices. Further disclosed are techniques and systems for providing a plurality of characters at least...
8898062 Strained-rough-voice conversion device, voice conversion device, voice synthesis device, voice conversion method, voice synthesis method, and program  
A strained-rough-voice conversion unit (10) is included in a voice conversion device that can generate a “strained rough” voice produced in a part of a speech when speaking forcefully with...
8888494 Interactive environment for performing arts scripts  
One or more embodiments present a script to a user in an interactive script environment. A digital representation of a manuscript is analyzed. This digital representation includes a set of roles...
8868423 System and method for controlling access to resources with a spoken CAPTCHA test  
Systems and methods for controlling access to resources using spoken Completely Automatic Public Turing Tests To Tell Humans And Computers Apart (CAPTCHA) tests are disclosed. In these systems and...
8868418 Receiver intelligibility enhancement system  
Embodiments of the invention provide a communication device and methods for enhancing audio signals. A first audio signal buffer and a second audio signal buffer are acquired. Thereafter, the...
8868431 Recognition dictionary creation device and voice recognition device  
A recognition dictionary creation device identifies the language of a reading of an inputted text which is a target to be registered and adds a reading with phonemes in the language identified...
8856008 Training and applying prosody models  
Techniques for training and applying prosody models for speech synthesis are provided. A speech recognition engine processes audible speech to produce text annotated with prosody information. A...
8849669 System for tuning synthesized speech  
An embodiment of the invention is a software tool used to convert text, speech synthesis markup language (SSML), and/or extended SSML to synthesized audio. Provisions are provided to create, view,...
8831950 Automated voice enablement of a web page  
Embodiments of the present invention provide a method, system and computer program product for the automated voice enablement of a Web page. In an embodiment of the invention, a method for voice...
8825485 Text to speech method and system converting acoustic units to speech vectors using language dependent weights for a selected language  
A text-to-speech method for use in a plurality of languages, including: inputting text in a selected language; dividing the inputted text into a sequence of acoustic units; converting the sequence...
8781835 Methods and apparatuses for facilitating speech synthesis  
Methods and apparatuses are provided for facilitating speech synthesis. A method may include generating a plurality of input models representing an input by using a statistical model synthesizer...
8751239 Method, apparatus and computer program product for providing text independent voice conversion  
An apparatus for providing text independent voice conversion may include a first voice conversion model and a second voice conversion model. The first voice conversion model may be trained with...
8751236 Devices and methods for speech unit reduction in text-to-speech synthesis systems  
A device may receive a plurality of speech sounds that are indicative of pronunciations of a first linguistic term. The device may determine concatenation features of the plurality of speech...
8744851 Method and system for enhancing a speech database  
A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database,...
8731933 Speech synthesis apparatus and method utilizing acquisition of at least two speech unit waveforms acquired from a continuous memory region by one access  
A speech synthesizing apparatus includes a selector configured to select a plurality of speech units for synthesizing a speech of a phoneme sequence by referring to speech unit information stored...
8719030 System and method for speech synthesis  
The present invention is a method and system to convert speech signal into a parametric representation in terms of timbre vectors, and to recover the speech signal thereof. The speech signal is...
8706497 Speech signal restoration device and speech signal restoration method  
A synthesis filter 106 synthesizes a plurality of wide-band speech signals by combining wide-band phoneme signals and sound source signals from a speech signal code book 105, and a distortion...
8655659 Personalized text-to-speech synthesis and personalized speech feature extraction  
A personalized text-to-speech synthesizing device includes: a personalized speech feature library creator, configured to recognize personalized speech features of a specific speaker by comparing a...
8655664 Text presentation apparatus, text presentation method, and computer program product  
According to an embodiment, a text presentation apparatus presenting text for a speaker to read aloud for voice recording includes: a text storing unit for storing first text; a presenting unit...
8639511 Robot, method and program of correcting a robot voice in accordance with head movement  
A robot may include a driving control unit configured to control a driving of a movable unit that is connected movably to a body unit, a voice generating unit configured to generate a voice, and a...
8630971 System and method of using Multi Pattern Viterbi Algorithm for joint decoding of multiple patterns  
Systems, devices, and methods for using Multi-Pattern Viterbi Algorithm for joint decoding of multiple patterns are disclosed. An exemplary method may receive a plurality of sets of...
8630857 Speech synthesizing apparatus, method, and program  
Disclosed is a speech synthesizing apparatus including a segment selection unit that selects a segment suited to a target segment environment from candidate segments, includes a prosody change...
8600753 Method and apparatus for combining text to speech and recorded prompts  
An arrangement provides for improved synthesis of speech arising from a message text. The arrangement stores prerecorded prompts and speech related characteristics for those prompts. A message is...
8583437 Speech synthesis with incremental databases of speech waveforms on user terminals over a communications network  
Service architecture for providing to a user terminal of a communications network textual information and relative speech synthesis, the user terminal being provided with a speech synthesis engine...
8571849 System and method for enriching spoken language translation with prosodic information  
Disclosed herein are systems, methods, and computer readable-media for enriching spoken language translation with prosodic information in a statistical speech translation framework. The method...
8566099 Tabulating triphone sequences by 5-phoneme contexts for speech synthesis  
A system and method for improving the response time of text-to-speech synthesis using triphone contexts. The method includes identifying a set of triphone sequences, tabulating the set of triphone...
8560317 Voice recognition apparatus and recording medium storing voice recognition program  
A vocabulary dictionary storing unit for storing a plurality of words in advance, a vocabulary dictionary managing unit for extracting recognition target words, a matching unit for calculating a...
8554566 Training and applying prosody models  
Techniques for training and applying prosody models for speech synthesis are provided. A speech recognition engine processes audible speech to produce text annotated with prosody information. A...
8543404 Proactive completion of input fields for automated voice enablement of a web page  
Embodiments of the present invention provide a method and computer program product for the proactive completion of input fields for automated voice enablement of a Web page. In an embodiment of...
8537164 Animation retargeting  
Systems and methods are described, which create a mapping from a space of a source object (e.g., source facial expressions) to a space of a target object (e.g., target facial expressions). In...
8510112 Method and system for enhancing a speech database  
A system, method and computer readable medium that enhances a speech database for speech synthesis is disclosed. The method may include labeling audio files in a primary speech database,...
8494854 CAPTCHA using challenges optimized for distinguishing between humans and machines  
An audible based electronic challenge system is used to control access to a computing resource by using a test to identify an origin of a voice. The test is based on analyzing a spoken utterance...
8494849 Method and apparatus for transmitting speech data to a remote device in a distributed speech recognition system  
A method of transmitting speech data to a remote device in a distributed speech recognition system, includes the steps of: dividing an input speech signal into frames; calculating, for each frame,...
8489399 System and method for verifying origin of input through spoken language analysis  
An audible based electronic challenge system is used to control access to a computing resource by using a test to identify an origin of a voice. The test is based on analyzing a spoken utterance...
8478595 Fundamental frequency pattern generation apparatus and fundamental frequency pattern generation method  
A fundamental frequency pattern generation apparatus includes a first storage including representative vectors each corresponding to a prosodic control unit and having a section for changing the...
8468020 Speech synthesis apparatus and method wherein more than one speech unit is acquired from continuous memory region by one access  
An apparatus for synthesizing a speech including a waveform memory that stores a plurality of speech unit waveforms, an information memory that correspondingly stores speech unit information and...
8438032 System for tuning synthesized speech  
An embodiment of the invention is a software tool used to convert text, speech synthesis markup language (SSML), and or extended SSML to synthesized audio. Provisions are provided to create, view,...
8428952 Text-to-speech user's voice cooperative server for instant messaging clients  
A system and method to allow an author of an instant message to enable and control the production of audible speech to the recipient of the message. The voice of the author of the message is...
8423366 Automatically training speech synthesizers  
A method includes receiving, by a system, a voice recording associated with a user, transcribing, the voice recording into text that includes a group of words, and storing an association between a...
8412529 Method and system for enhancing verbal communication sessions  
An approach is provided for enhancing verbal communication sessions. A verbal component of a communication session is converted into textual information. The converted textual information is...
8407054 Speech synthesis device, speech synthesis method, and speech synthesis program  
A speech synthesis device is provided with: a central segment selection unit for selecting a central segment from among a plurality of speech segments; a prosody generation unit for generating...
8374873 Training and applying prosody models  
Techniques for training and applying prosody models for speech synthesis are provided. A speech recognition engine processes audible speech to produce text annotated with prosody information. A...

Matches 1 - 50 out of 236 1 2 3 4 5 >