Match Document Document Title
8396708 Facial expression representation apparatus  
An avatar facial expression representation technology is provided. The avatar facial expression representation technology estimates changes in emotion and emphasis in a user's voice from vocal...
8392191 Chinese prosodic words forming method and apparatus  
The present invention provides a method and apparatus of forming Chinese prosodic words, which method comprises the steps of inputting Chinese text; performing process of word segmentation and...
8392194 System and method for machine-based determination of speech intelligibility in an aircraft during flight operations  
A method for effecting a machine-based determination of speech intelligibility in an aircraft during flight operations includes: (a) in no particular order: (1) providing a representation of a...
8380484 Method and system of dynamically changing a sentence structure of a message  
A method (50) of dynamically changing a sentence structure of a message can include the step of receiving (51) a user request for information, retrieving (52) data based on the information...
8374859 Automatic answering device, automatic answering system, conversation scenario editing device, conversation server, and automatic answering method  
An automatic answering device and an automatic answering method for automatically answering to a user utterance are configured: to prepare a conversation scenario that is a set of input sentences...
8374872 Dynamic update of grammar for interactive voice response  
A device provides a question to a user, and receives, from the user, an unrecognized voice response to the question. The device also provides the unrecognized voice response to an utterance agent...
8374876 Speech generation user interface  
A system and a method for speech generation which assist the speech of those with a disability or a medical condition such as cerebral palsy, motor neurone disease or a dysarthia following a...
8374873 Training and applying prosody models  
Techniques for training and applying prosody models for speech synthesis are provided. A speech recognition engine processes audible speech to produce text annotated with prosody information. A...
8370151 Systems and methods for multiple voice document narration  
Disclosed are techniques and systems to provide a narration of a text in multiple different voices where the portions of the text narrated using the different voices are selected by a user.
8370150 Character information presentation device  
The text information presentation device calculates an optimum readout speed on the basis of the content of text information being input, its arriving time, and its previous arriving time;...
8370148 System and method for answering a communication notification  
Disclosed herein are systems, methods, and computer readable-media for answering a communication notification. The method for answering a communication notification comprises receiving a...
8370149 Speech synthesis system, speech synthesis program product, and speech synthesis method  
Waveform concatenation speech synthesis with high sound quality. Prosody with both high accuracy and high sound quality is achieved by performing a two-path search including a speech segment...
8364466 Fast-and-engaging, real-time translation using a network environment  
The teachings described herein generally relate to a multilingual electronic translation of a source phrase to a destination language selected from multiple languages, and this can be accomplished...
8364487 Speech recognition system with display information  
A language processing system may determine a display form of a spoken word by analyzing the spoken form using a language model that includes dictionary entries for display forms of homonyms. The...
8364488 Voice models for document narration  
Disclosed are techniques and systems to provide a narration of a text in multiple different voices. Further disclosed are techniques and systems for modifying a voice model associated with a...
8364472 Voice encoding device and voice encoding method  
Provided is an audio encoding device which can detect an optimal pitch pulse when using pitch pulse information as redundant information. The device includes: a search start decision unit (121)...
8359202 Character models for document narration  
Disclosed are techniques and systems to provide a narration of a text in multiple different voices where the portions of the text narrated using the different voices are selected by a user. Also...
8352268 Systems and methods for selective rate of speech and speech preferences for text to speech synthesis  
Algorithms for synthesizing speech used to identify media assets are provided. Speech may be selectively synthesized form text strings associated with media assets. A text string may be normalized...
8352267 Information processing system and method for reading characters aloud  
A plurality of input devices each includes a speaker, an operation data transmitter, a voice data receiver, and a voice controller. An information processing apparatus includes a voice storing...
8352269 Systems and methods for processing indicia for document narration  
Disclosed are techniques and systems to provide a narration of a text in multiple different voices. Further disclosed are techniques and systems for processing indicia in a document to determine a...
8352271 Facilitating text-to-speech conversion of a username or a network address containing a username  
To facilitate text-to-speech conversion of a username, a first or last name of a user associated with the username may be retrieved, and a pronunciation of the username may be determined based at...
8346548 Aural similarity measuring system for text  
The aural similarity measuring system and method provides a measure of the aural similarity between a target text (10) and one or more reference texts (11). Both the target text (10) and the...
8346557 Systems and methods document narration  
Disclosed are techniques and systems to provide a narration of a text in multiple different voices. In some aspects, systems and methods described herein can include receiving a user-based...
8340797 Method and system for generating and processing digital content based on text-to-speech conversion  
A method and system is provided for generating digital content using text-to-speech (TTS) conversion. A predetermined script is selected using a portable terminal or user personal computer (PC). A...
8340972 Psychoacoustic method and system to impose a preferred talking rate through auditory feedback rate adjustment  
The use of SOLA speech time compression/expansion in the present invention method as a means to alter a speaker's talking rate by adjusting the speech rate at which people hear their own voice. A...
8338687 Apparatus and method for creating singing synthesizing database, and pitch curve generation apparatus and method  
Waveform data representative of singing voices of a singing music piece are analyzed to generate melody component data representative of variation over time in fundamental frequency component...
8340956 Information provision system, information provision method, information provision program, and information provision program recording medium  
An information provision system capable of providing attribute information to a contextually appropriate portion as well as a linguistic unit including a specific character string expression is...
8340967 Speech samples library for text-to-speech and methods and apparatus for generating and using same  
A method of recording speech for use in a speech samples library. In an exemplary embodiment, the method comprises recording a speaker pronouncing a phoneme with musical parameters characterizing...
8340965 Rich context modeling for text-to-speech engines  
Embodiments of rich context modeling for speech synthesis are disclosed. In operation, a text-to-speech engine refines a plurality of rich context models based on decision tree-tied Hidden Markov...
8332215 Dynamic range control module, speech processing apparatus, and method for amplitude adjustment for a speech signal  
The invention provides a dynamic range control module installed in a speech processing apparatus. In one embodiment, the dynamic range control module comprises a buffer, a voice activity detector,...
8332212 Method and system for efficient pacing of speech for transcription  
A method and system for improving the efficiency of real-time and non-real-time speech transcription by machine speech recognizers, human dictation typists, and human voicewriters using speech...
8332225 Techniques to create a custom voice font  
Techniques to create and share custom voice fonts are described. An apparatus may include a preprocessing component to receive voice audio data and a corresponding text script from a client and to...
8326613 Method of synthesizing of an unvoiced speech signal  
The present invention relates to a method of synthesizing a signal comprising the steps of determining a required pitch bell locations,mapping the required pitch bell locations onto the signal to...
8326629 Dynamically changing voice attributes during speech synthesis based upon parameter differentiation for dialog contexts  
A method of speech synthesis can include automatically identifying spoken passages and non-spoken passages within a text source and converting the text source to speech by applying different voice...
8326628 Method of auditory display of sensor data  
A method of auditory communication is provided. The method includes measuring physiological data from at least one sensor to form a data set; identifying a type of the data set; identifying an...
8326635 Method and system for message alert and delivery using an earpiece  
Earpieces and methods for an earpiece to manage a delivery of a message are provided. A method can include receiving a notice that a message is available at a communication device, parsing the...
8321208 Speech processing and speech synthesis using a linear combination of bases at peak frequencies for spectral envelope information  
An information extraction unit extracts spectral envelope information of L-dimension from each frame of speech data by discrete Fourier transform. The spectral envelope information is represented...
8321227 Methods and devices for appending an address list and determining a communication profile  
Disclosed are methods and electronic communication devices, such as an in-car speaker device, that can receive via a downloading process, a communication address list from another device to the...
8321224 Text-to-speech method and system, computer program product therefor  
A text-to-speech system adapted to operate on text in a first language including sections in a second language, includes a grapheme/phoneme transcriptor for converting the sections in the second...
8321222 Synthesis by generation and concatenation of multi-form segments  
A speech synthesis system and method is described. A speech segment database references speech segments having various different speech representational structures. A speech segment selector...
8321225 Generating prosodic contours for synthesized speech  
The subject matter of this specification can be implemented in, among other things, a computer-implemented method including receiving text to be synthesized as a spoken utterance. The method...
8315871 Hidden Markov model based text to speech systems employing rope-jumping algorithm  
A rope-jumping algorithm is employed in a Hidden Markov Model based text to speech system to determine start and end models and to modify the start and end models by setting small co-variances....
8315872 Methods and apparatus for rapid acoustic unit selection from a large speech corpus  
A speech synthesis system can select recorded speech fragments, or acoustic units, from a very large database of acoustic units to produce artificial speech. The selected acoustic units are chosen...
8315873 Sentence reading aloud apparatus, control method for controlling the same, and control program for controlling the same  
An apparatus for voice synthesis includes: a word database for storing words and voices; a syllable database for storing syllables and voices; a processor for executing a process including:...
8311830 System and method for client voice building  
Provided is a system and method for building and managing a customized voice of an end-user, comprising the steps of designing a set of prompts for collection from the user, wherein the prompts...
8311811 Method and apparatus for detecting pitch by using subharmonic-to-harmonic ratio  
A method and an apparatus for detecting a pitch in input voice signals by using a subharmonic-to-harmonic ratio (SHR). The pitch detection method includes performing a Fourier transform on the...
8301451 Speech synthesis with dynamic constraints  
A method is disclosed for providing speech parameters to be used for synthesis of a speech utterance. In at least one embodiment, the method includes receiving an input time series of first speech...
8302151 Improving comprehension of information in a security enhanced environment by representing the information in audio form  
In a software environment wherein one or more subjects respectively seek to access one or more objects, and wherein a security policy having rules is associated with the environment, a method is...
8296143 Audio signal processing apparatus, audio signal processing method, and program for having the method executed by computer  
An audio waveform processing not imparting any feeling of strangeness and high in definition, in which time stretch and pitch shift are performed by a vocoder method, and the variation of phase...
8285547 Audio font output device, font database, and language input front end processor  
An audio font output device is disclosed that is able to effectively convert characters or text into an audio signal recognizable by the acoustic sense of human beings. The audio font output...