Match Document Document Title
7653540 Speech signal compression device, speech signal compression method, and program  
The present invention provides a speech signal compression device which allows a storage capacity of data representing speech to be efficiently compressed. In the present invention, a computer C1...
7649135 Sound synthesis  
A device for synthesizing sound having sinusoidal components includes a selector for selecting a limited number of the sinusoidal components from each of a number of frequency bands using a...
7643990 Global boundary-centric feature extraction and associated discontinuity metrics  
Portions from time-domain speech segments are extracted. Feature vectors that represent the portions in a vector space are created. The feature vectors incorporate phase information of the...
7640161 Wordspotting system  
An approach to improving the performance of a wordspotting system includes providing an interface for interactive improvement of a phonetic representation of a query based on an operator...
7636662 System and method for audio-visual content synthesis  
A system and method is provided for synthesizing audio-visual content in a video image processor. A content synthesis application processor extracts audio features and video features from...
7634405 Palette-based classifying and synthesizing of auditory information  
The subject invention leverages spectral “palettes” or representations of an input sequence to provide recognition and/or synthesizing of a class of data. The class can include, but is not limited...
7630896 Speech synthesis system and method  
A speech synthesis system in a preferred embodiment includes a speech unit storage section, a phonetic environment storage section, a phonetic sequence/prosodic information input section, a...
7630901 Multimodal input method  
In a multimodal input method, input information input from at least two input sources is received, control of the recognition of input from a second input source is performed based on the number...
7624020 Adapter for allowing both online and offline training of a text to text system  
An adapter for a text to text training. A main corpus is used for training, and a domain specific corpus is used to adapt the main corpus according to the training information in the domain...
7624017 System and method for configuring voice synthesis  
Systems and methods for providing synthesized speech in a manner that takes into account the environment where the speech is presented. A method embodiment includes, based on a listening...
7617106 Error detection for speech to text transcription systems  
A method, a system and a computer program product detect errors within text generated by a speech to text transcription system. The transcribed text is re-transformed into an artificial speech...
7613612 Voice synthesizer of multi sounds  
In a voice synthesizer, an envelope acquisition portion obtains a spectral envelope of a reference frequency spectrum of a given voice. A spectrum acquisition portion obtains a collective...
7610200 System and method for controlling sound data  
A system and method for controlling access to parameter blocks of a sound processor. According to the method and system disclosed herein, the present invention includes a host, a sound processor...
7610201 Method and apparatus for synthesizing speech  
A method for synthesizing speech includes an obtaining step of obtaining a speech message, and a resuming step of resuming speech output of the speech message according to resumption data...
7606709 Voice converter with extraction and modification of attribute data  
An apparatus is constructed for converting an input voice signal into an output voice signal according to a target voice signal. In the apparatus, an input device provides the input voice signal...
7603278 Segment set creating method and apparatus  
A segment set before updating is read, and clustering considering a phoneme environment is performed on it. For each cluster obtained by the clustering, a representative segment of a segment set...
7603280 Speech output apparatus, speech output method, and program  
A speech output apparatus is disclosed, which can allow the user to easily catch synthetic speech when the synthetic speech is output upon being superposed on a music output. The apparatus output...
7599838 Speech animation with behavioral contexts for application scenarios  
Methods and systems, including computer program products, for speech animation. The system includes a speech animation server and one or more speech animation clients. The speech animation server...
7596497 Speech synthesis apparatus and speech synthesis method  
A speech synthesis apparatus and a speech synthesis method, in which a waveform of a desired formant shape may be generated with a small volume of computing operations. A voiced sound generating...
7590540 Method and system for statistic-based distance definition in text-to-speech conversion  
A method for distance definition in a text-to-speech conversion system by applying Gaussian Mixture Model (GMM) to a distance definition. According to an embodiment, the text that is to be...
7587310 Sound processor architecture using single port memory unit  
A system and method for implementing a sound processor. The sound processor includes a first voice engine, a second voice engine, and at least one single-port memory unit. An operation of the...
7587319 Speech recognition circuit using parallel processors  
A speech recognition circuit comprises an input buffer for receiving processed speech parameters. A lexical memory contains lexical data for word recognition. The lexical data comprises a...
7584104 Method and system for training a text-to-speech synthesis system using a domain-specific speech database  
A system, method and computer readable medium that trains a text-to-speech synthesis system for use in speech synthesis is disclosed. The method may include recording audio files of one or more...
7580839 Apparatus and method for voice conversion using attribute information  
A speech processing apparatus according to an embodiment of the invention includes a conversion-source-speaker speech-unit database; a voice-conversion-rule-learning-data generating means; and a...
7577568 Methods and system for creating voice files using a VoiceXML application  
Methods and systems for automating the assembly or creation of audio files for providing to listeners or for use in voice interactive services are provided. A voice application script is prepared...
7571104 Dynamic real-time cross-fading of voice prompts  
A system and method are provided for creating shorter, more natural-sounding voice messages and prompts from a plurality of pre-recorded sound segments. The pre-recorded sound segments are...
7571099 Voice synthesis device  
A voice synthesis device for generating synthetic voice having great freedom in voice quality and good sound quality from text data is provided. The voice synthesis device is provided with: voice...
7565291 Synthesis-based pre-selection of suitable units for concatenative speech  
The instructions on the computer-readable medium control a computing device to perform the steps: selecting at least one phoneme from a triphone unit selection database as at least candidate...
7565293 Seamless hybrid computer human call service  
A Voice User Interface is provided for interactively responding in a synthesized voice to a call from a human caller, a Text to Speech system by which text entered by an agent and interactive data...
7562018 Speech synthesis method and speech synthesizer  
A language processing portion (31) analyzes a text from a dialogue processing section (20) and transforms the text to information on pronunciation and accent. A prosody generation portion (32)...
7558732 Method and system for computer-aided speech synthesis  
Method and system for computer-aided speech synthesis for synthesizing electronic text by performing a predefined series of rules-based analyses in a predefined order, each of the analyses...
7555433 Voice generator, method for generating voice, and navigation apparatus  
A main controller feeds a spelling translator with a text item representing a place name stored in a map database. The spelling translator translates the spelling of the text item according to...
7552052 Voice synthesis apparatus and method  
A plurality of voice segments, each including one or more phonemes, are acquired in a time-serial manner, in correspondence with desired singing or speaking words. As necessary, a boundary is...
7552053 Techniques for aiding speech-to-speech translation  
Techniques for assisting in translation are provided. A speech recognition hypothesis is obtained, corresponding to a source language utterance. Information retrieval is performed on a...
7546241 Speech synthesis method and apparatus, and dictionary generation method and apparatus  
In a speech synthesis process, micro-segments are cut from acquired waveform data and a window function. The obtained micro-segments are re-arranged to implement a desired prosody, and superposed...
7542905 Method for synthesizing a voice waveform which includes compressing voice-element data in a fixed length scheme and expanding compressed voice-element data of voice data sections  
A method for synthesizing a voice waveform includes compressing voice-element data in a fixed length scheme that uses data from a preceding or succeeding frame. The compressed voice-element data...
7533021 Speech processing for telephony API  
Systems, methods, and structures are discussed that enhance media processing. One aspect of the present invention includes a data structure to enhance media processing. The data structure includes...
7529672 Speech synthesis using concatenation of speech waveforms  
A method of synthesizing a speech signal by providing a first speech unit signal having an end interval and a second speech unit signal having a front interval, wherein at least some of the...
7529674 Speech animation  
Methods and systems, including computer program products, for speech animation. The system includes a speech animation engine and a client application in communication with the speech animation...
7526430 Speech synthesis apparatus  
A speech synthesis apparatus, which can embed unchangeable additional information into synthesized speech without causing a deterioration of speech quality and restriction by bands, includes a...
7523035 Hands-free circuit and method for communicating with a wireless device  
A hands-free circuit (10) and method produces audio information (90) corresponding to voice tag information (60) stored either in the hands-free circuit (10) or in a wireless device (320). The...
7523037 Data synthesis apparatus and program  
A data synthesis apparatus detects the start of a period of voice waveform data, stores the voice waveform data in a first storage device, starting with its part indicative of the start of the...
7516073 Electronic-book read-aloud device and electronic-book read-aloud method  
A control unit of an electronic-book read-aloud device reads book data and electronic-book data from an electronic bookmark and stores the read data in a storage unit. Further, the control unit...
7507894 Sound data encoding apparatus and sound data decoding apparatus  
A processing load at the time of playing back sound data having a loop part is reduced. A sound data encoding apparatus comprises a block dividing means that divides the sound data into blocks...
7502740 Communication apparatus  
A communication apparatus includes a registration unit for registering setting data specifying a type of email message to be read, a synthetic-speech output unit for outputting resulting speech...
7487093 Text structure for voice synthesis, voice synthesis method, voice synthesis apparatus, and computer program thereof  
In a voice synthesis apparatus, by bounding a desired range of input text to be output by, e.g., a start tag and an end tag, a feature of synthetic voice is continuously changed while gradually...
7487092 Interactive debugging and tuning method for CTTS voice building  
A speech recognition device which can preferably be used for reducing the memory capacity required for speaker-independent speech recognition is provided. A matching unit loads speech models...
7483834 Method and apparatus for audio navigation of an information appliance  
The invention includes an apparatus and method of providing information using an information appliance coupled to a network. The method includes storing text files in a database at a remote...
7483832 Method and system for customizing voice translation of text to speech  
A method and system of customizing voice translation of a text to speech includes digitally recording speech samples of a known speaker, correlating each of the speech samples with a standardized...
7478039 Stochastic modeling of spectral adjustment for high quality pitch modification  
Natural-sounding synthesized speech is obtained from pieced elemental speech units that have their super-class identities known (e.g. phoneme type), and their line spectral frequencies (LSF) set...