Match Document Document Title
7617106 Error detection for speech to text transcription systems  
A method, a system and a computer program product detect errors within text generated by a speech to text transcription system. The transcribed text is re-transformed into an artificial speech...
7613612 Voice synthesizer of multi sounds  
In a voice synthesizer, an envelope acquisition portion obtains a spectral envelope of a reference frequency spectrum of a given voice. A spectrum acquisition portion obtains a collective frequency...
7610201 Method and apparatus for synthesizing speech  
A method for synthesizing speech includes an obtaining step of obtaining a speech message, and a resuming step of resuming speech output of the speech message according to resumption data...
7610200 System and method for controlling sound data  
A system and method for controlling access to parameter blocks of a sound processor. According to the method and system disclosed herein, the present invention includes a host, a sound processor...
7606709 Voice converter with extraction and modification of attribute data  
An apparatus is constructed for converting an input voice signal into an output voice signal according to a target voice signal. In the apparatus, an input device provides the input voice signal...
7603280 Speech output apparatus, speech output method, and program  
A speech output apparatus is disclosed, which can allow the user to easily catch synthetic speech when the synthetic speech is output upon being superposed on a music output. The apparatus output...
7603278 Segment set creating method and apparatus  
A segment set before updating is read, and clustering considering a phoneme environment is performed to it. For each cluster obtained by the clustering, a representative segment of a segment set...
7599838 Speech animation with behavioral contexts for application scenarios  
Methods and systems, including computer program products, for speech animation. The system includes a speech animation server and one or more speech animation clients. The speech animation server...
7596497 Speech synthesis apparatus and speech synthesis method  
A speech synthesis apparatus and a speech synthesis method, in which a waveform of a desired formant shape may be generated with a small volume of computing operations. A voiced sound generating...
7590626 Distributional similarity-based models for query correction  
A distributional similarity between a word of a search query and a term of a candidate word sequence is used to determine an error model probability that describes the probability of the search...
7590540 Method and system for statistic-based distance definition in text-to-speech conversion  
A method for distance definition in a text-to-speech conversion system by applying Gaussian Mixture Model (GMM) to a distance definition. According to an embodiment, the text that is to be...
7587320 Automatic segmentation in speech synthesis  
Systems and methods for automatically segmenting speech inventories. A set of Hidden Markov Models (HMMs) are initialized using bootstrap data. The HMMs are next re-estimated and aligned to produce...
7587319 Speech recognition circuit using parallel processors  
A speech recognition circuit comprises an input buffer for receiving processed speech parameters. A lexical memory contains lexical data for word recognition. The lexical data comprises a plurality...
7587310 Sound processor architecture using single port memory unit  
A system and method for implementing a sound processor. The sound processor includes a first voice engine, a second voice engine, and at least one single-port memory unit. An operation of the first...
7584104 Method and system for training a text-to-speech synthesis system using a domain-specific speech database  
A system, method and computer readable medium that trains a text-to-speech synthesis system for use in speech synthesis is disclosed. The method may include recording audio files of one or more...
7580839 Apparatus and method for voice conversion using attribute information  
A speech processing apparatus according to an embodiment of the invention includes a conversion-source-speaker speech-unit database; a voice-conversion-rule-learning-data generating means; and a...
7577568 Methods and system for creating voice files using a VoiceXML application  
Methods and systems for automating the assembly or creation of audio files for providing to listeners or for use in voice interactive services are provided. A voice application script is prepared...
7571104 Dynamic real-time cross-fading of voice prompts  
A system and method are provided for creating shorter more natural sounding voice messages and prompts from a plurality of pre-recorded sound segments, the prerecorded sound segments are...
7571099 Voice synthesis device  
A voice synthesis device for generating synthetic voice having great freedom in voice quality and good sound quality from text data is provided. The voice synthesis device is provided with: voice...
7565293 Seamless hybrid computer human call service  
A Voice User Interface is provided for interactively responding in a synthesized voice to a call from a human caller, a Text to Speech system by which text entered by an agent and interactive data...
7565291 Synthesis-based pre-selection of suitable units for concatenative speech  
The instructions on the computer-readable medium control a computing device to perform the steps: selecting at least one phoneme from a triphone unit selection database as at least candidate...
7562018 Speech synthesis method and speech synthesizer  
A language processing portion (31) analyzes a text from a dialogue processing section (20) and transforms the text to information on pronunciation and accent. A prosody generation portion (32...
7558732 Method and system for computer-aided speech synthesis  
Method and system for computer-aided speech synthesis for synthesizing electronic text by performing a predefined series of rules-based analyses in a predefined order, each of the analyses operating...
7555433 Voice generator, method for generating voice, and navigation apparatus  
A main controller feeds a spelling translator with a text item representing a place name stored in a map database. The spelling translator translates the spelling of the text item according to...
7552053 Techniques for aiding speech-to-speech translation  
Techniques for assisting in translation are provided. A speech recognition hypothesis is obtained, corresponding to a source language utterance. Information retrieval is performed on a supplemental...
7552052 Voice synthesis apparatus and method  
A plurality of voice segments, each including one or more phonemes are acquired in a time-serial manner, in correspondence with desired singing or speaking words. As necessary, a boundary is...
7546241 Speech synthesis method and apparatus, and dictionary generation method and apparatus  
In a speech synthesis process, micro-segments are cut from acquired waveform data and a window function. The obtained micro-segments are re-arranged to implement a desired prosody, and superposed...
7542905 Method for synthesizing a voice waveform which includes compressing voice-element data in a fixed length scheme and expanding compressed voice-element data of voice data sections  
A method for synthesizing a voice waveform includes compressing voice-element data in a fixed length scheme that uses data from a preceding or succeeding frame. The compressed voice-element data of...
7533021 Speech processing for telephony API  
Systems, methods, and structures are discussed that enhance media processing. One aspect of the present invention includes a data structure to enhance media processing. The data structure includes...
7529674 Speech animation  
Methods and systems, including computer program products, for speech animation. The system includes a speech animation engine and a client application in communication with the speech animation...
7529672 Speech synthesis using concatenation of speech waveforms  
A method of synthesizing a speech signal by providing a first speech unit signal having an end interval and a second speech unit signal having a front interval, wherein at least some of the periods...
7526430 Speech synthesis apparatus  
A speech synthesis apparatus, which can embed unchangeable additional information into synthesized speech without causing a deterioration of speech quality and restriction by bands, includes a...
7523037 Data synthesis apparatus and program  
A data synthesis apparatus detects the start of a period of voice waveform data, stores the voice waveform data in a first storage device, starting with its part indicative of the start of the...
7523035 Hands-free circuit and method for communicating with a wireless device  
A hands-free circuit (10) and method produces audio information (90) corresponding to voice tag information (60) stored either in the hands-free circuit (10) or in a wireless device (320...
7516073 Electronic-book read-aloud device and electronic-book read-aloud method  
A control unit of an electronic-book read-aloud device reads book data and electronic-book data from an electronic bookmark and stores the read data in a storage unit. Further, the control unit...
7507894 Sound data encoding apparatus and sound data decoding apparatus  
A processing load at the time of playing back sound data having a loop part is reduced. A sound data encoding apparatus comprises a block dividing means that divides the sound data into blocks...
7502740 Communication apparatus  
A communication apparatus includes a registration unit for registering setting data specifying a type of email message to be read, a synthetic-speech output unit for outputting resulting speech...
7487093 Text structure for voice synthesis, voice synthesis method, voice synthesis apparatus, and computer program thereof  
In a voice synthesis apparatus, by bounding a desired range of input text to be output by, e.g., a start tag “<morphing type="emotion" start="happy" end="angry">” and end tag...
7487092 Interactive debugging and tuning method for CTTS voice building  
A speech recognition device which can preferably be used for reducing the memory capacity required for speaker-independent speech recognition is provided. A matching unit loads speech models...
7483834 Method and apparatus for audio navigation of an information appliance  
The invention includes an apparatus and method of providing information using an information appliance coupled to a network. The method includes storing text files in a database at a remote...
7483832 Method and system for customizing voice translation of text to speech  
A method and system of customizing voice translation of a text to speech includes digitally recording speech samples of a known speaker, correlating each of the speech samples with a standardized...
7478047 Interactive character system  
A system and method for controlling a synthetic character using a control system displays the character engaged in an activity, receives a first input from a user, determines whether the input is...
7478039 Stochastic modeling of spectral adjustment for high quality pitch modification  
Natural-sounding synthesized speech is obtained from pieced elemental speech units that have their super-class identities known (e.g. phoneme type), and their line spectral frequencies (LSF) set in...
7475016 Speech segment clustering and ranking  
A system, method, and apparatus for identifying problematic speech segments is provided. The system includes a clustering module for generating a first cluster of one or more consecutive speech...
7475007 Expression extraction device, expression extraction method, and recording medium  
Provided is an expression extraction device for extracting evaluation expressions from text having descriptions on evaluations of a specific evaluation target, which includes a registered...
7472066 Automatic speech segmentation and verification using segment confidence measures  
An automatic speech segmentation and verification system and method is disclosed, which has a known text script and a recorded speech corpus corresponding to the known text script. A speech unit...
7472065 Generating paralinguistic phenomena via markup in text-to-speech synthesis  
Converting marked-up text into a synthesized stream includes providing marked-up text to a processor-based system, converting the marked-up text into a text stream including vocabulary items,...
7460997 Method and system for preselection of suitable units for concatenative speech  
A system and method for improving the response time of text-to-speech synthesis utilizes “triphone contexts” (i.e., triplets comprising a central phoneme and its immediate context) as the basic...
7457752 Method and apparatus for controlling the operation of an emotion synthesizing device  
Method and apparatus for controlling the operation of an emotion synthesizing device, notably of the type where the emotion is conveyed by a sound, having at least one input parameter whose value...
7457748 Method of automatic processing of a speech signal  
Method of automatically processing a speech signal which comprises the steps of: determining a sequence of probability models corresponding to a given text; determining a sequence of...