Match Document Document Title
9043205 Dynamic language model  
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for speech recognition. One of the methods includes receiving a base language model for speech...
9043349 Image-based character recognition  
Various embodiments enable a device to perform tasks such as processing an image to recognize and locate text in the image, and providing the recognized text to an application executing on the device...
9043204 Thought recollection and speech assistance device  
Some embodiments of the inventive subject matter include a method for detecting speech loss and supplying appropriate recollection data to the user. Such embodiments include detecting a speech...
9037469 Automated communication integrator  
An apparatus includes a plurality of applications and an integrator having a voice recognition module configured to identify at least one voice command from a user. The integrator is configured to...
9037460 Dynamic long-distance dependency with conditional random fields  
Dynamic features are utilized with CRFs to handle long-distance dependencies of output labels. The dynamic features present a probability distribution involved in explicit distance from/to a...
9037461 Methods and systems for dictation and transcription  
Automated delivery and filing of transcribed material prepared from dictated audio files into a central record-keeping system are presented. A user dictates information from any location, uploads...
9037459 Selection of text prediction results by an accessory  
A method for entering text in a text input field using a non-keyboard type accessory includes selecting a character for entry into the text field presented by a portable computing device. The...
9031831 Method and system for looking up words on a display screen by OCR comprising a set of base forms of recognized inflected words  
Embodiments of the present invention disclose a dictionary lookup method and an electronic device that implements the dictionary lookup method. The dictionary lookup method allows a user to...
9031828 Systems and methods for multi-user multi-lingual communications  
Various embodiments described herein facilitate multi-lingual communications. The systems and methods of some embodiments may enable multi-lingual communications through different modes of...
9031829 Systems and methods for multi-user multi-lingual communications  
Various embodiments described herein facilitate multi-lingual communications. The systems and methods of some embodiments enable multi-lingual communications through different modes of...
9031840 Identifying media content  
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving (i) audio data that encodes a spoken natural language query, and (ii) environmental...
9031839 Conference transcription based on conference data  
In one implementation, a collaboration server is a conference bridge or other network device configured to host an audio and/or video conference among a plurality of conference participants. The...
9026182 Communication device  
The communication device comprising a voice communication implementer, a multiple & real-time & chronological speech-to-text implementer, and a caller ID.
9026446 System for generating captions for live video broadcasts  
An adaptive workflow system can be used to implement captioning projects, such as projects for creating captions or subtitles for live and non-live broadcasts. Workers can repeat words spoken...
9026431 Semantic parsing with multiple parsers  
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for semantic parsing with multiple parsers. One of the methods includes obtaining one or more...
9026442 System and method for adapting automatic speech recognition pronunciation by acoustic model restructuring  
Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for recognizing speech by adapting automatic speech recognition pronunciation by acoustic model...
9020816 Hidden markov model for speech processing with training method  
A method, system and apparatus are shown for identifying non-language speech sounds in a speech or audio signal. An audio signal is segmented and feature vectors are extracted from the segments of...
9020817 Using speech to text for detecting commercials and aligning edited episodes with transcripts  
Methods and apparatus, including computer program products, for using speech to text for detecting commercials and aligning edited episodes with transcripts. A method includes, receiving an...
9020818 Format based speech reconstruction from noisy signals  
Implementations of systems, method and devices described herein enable enhancing the intelligibility of a target voice signal included in a noisy audible signal received by a hearing aid device or...
9015044 Formant based speech reconstruction from noisy signals  
Implementations of systems, method and devices described herein enable enhancing the intelligibility of a target voice signal included in a noisy audible signal received by a hearing aid device or...
9015043 Choosing recognized text from a background environment  
A computer-implemented method includes receiving an electronic representation of one or more human voices, recognizing words in a first portion of the electronic representation of the one or more...
9015046 Methods and apparatus for real-time interaction analysis in call centers  
A method and system for indicating in real time that an interaction is associated with a problem or issue, comprising: receiving a segment of an interaction in which a representative of the...
9009695 Method for changing over from a first adaptive data processing version to a second adaptive data processing version  
The invention relates to a method and to a system for changing over from a first adaptive data processing version (V1) on data processing means using at least one data model (dm) which is...
9009041 Systems and methods for improving the accuracy of a transcription using auxiliary data such as personal data  
A method is described for improving the accuracy of a transcription generated by an automatic speech recognition (ASR) engine. A personal vocabulary is maintained that includes replacement words....
9009043 Pattern processing system specific to a user group  
Methods and apparatus for identifying a user group in connection with user group-based speech recognition. An exemplary method comprises receiving, from a user, a user group identifier that...
9009042 Machine translation of indirect speech  
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating direct speech messages based on voice commands that include indirect speech...
9009040 Training a transcription system  
According to certain embodiments, training a transcription system includes accessing recorded voice data of a user from one or more sources. The recorded voice data comprises voice samples. A...
9009606 Instant messaging association to remote desktops  
A remote desktop capability includes a message area on the agent's remote desktop display. Incoming instant messages on an instant messaging application on the agent's primary desktop are passed...
9002708 Speech recognition system and method based on word-level candidate generation  
A speech recognition system and method based on word-level candidate generation are provided. The speech recognition system may include a speech recognition result verifying unit to verify a word...
9002702 Confidence level assignment to information from audio transcriptions  
Embodiments of the present invention provide an approach for automatically assigning a confidence level to information extracted from a transcription of a voice recording. Specifically, in a...
9002703 Community audio narration generation  
The community-based generation of audio narrations for a text-based work leverages collaboration of a community of people to provide human-voiced audio readings. During the community-based...
8996369 System and method for transcribing audio files of various languages  
System, method and program product for transcribing an audio file included in or referenced by a web page. A language of text in the web page is determined. Then, voice recognition software of the...
8996371 Method and system for automatic domain adaptation in speech recognition applications  
A system and method for adapting a language model to a specific environment by receiving interactions captured in the specific environment, generating a collection of documents from documents...
8996370 Transferring data via audio link  
Transferring data via audio link is described. In an example a short sequence of data can be transferred between two devices by encoding the sequence of data as an audio sequence. For example, the...
8996386 Method and system for creating a voice recognition database for a mobile device using image processing and optical character recognition  
A method and system for controlling a mobile device from a head unit using voice control is disclosed. The head unit receives a graphical representation of a current user interface screen of the...
8996382 Lips blockers, headsets and systems  
Systems and methods for inhibiting access to the lips of speaking person including a sound receiving device for receiving speech of a person speaking, the person having lips that move when the...
8996373 State detection device and state detecting method  
A state detection device includes: a first model generation unit to generate a first specific speaker model obtained by modeling speech features of a specific speaker in an undepressed state; a...
8994522 Human-machine interface (HMI) auto-steer based upon-likelihood to exceed eye glance guidelines  
The described method and system provide for HMI steering for a telematics-equipped vehicle based on likelihood to exceed eye glance guidelines. By determining whether a task is likely to cause the...
8996376 Intelligent text-to-speech conversion  
Techniques for improved text-to-speech processing are disclosed. The improved text-to-speech processing can convert text from an electronic document into an audio output that includes speech...
8996380 Methods and systems for synchronizing media  
Systems and methods of synchronizing media are provided. A client device may be used to capture a sample of a media stream being rendered by a media rendering source. The client device sends the...
8990077 Method and system for sharing portable voice profiles  
An embodiment of the present invention provides a speech recognition engine that utilizes portable voice profiles for converting recorded speech to text. Each portable voice profile includes...
8990090 Script compliance using speech recognition  
A system and method for evaluating the compliance of an agent reading a script to a client comprises conducting a voice interaction between the agent and the client wherein the agent follows a...
8989713 Selection of a link in a received message for speaking reply, which is converted into text form for delivery  
A link, called an X-Link™ and is placed in a message (SMS, MMS, email etc.) that is sent to a user and displayed on their device (e.g. mobile telephone). When the link is selected by the user, it...
8983840 Intent discovery in audio or text-based conversation  
Techniques, an apparatus and an article of manufacture identifying one or more utterances that are likely to carry the intent of a speaker, from a conversation between two or more parties. A...
8983835 Electronic device and server for processing voice message  
An electronic device includes a voice processing unit, a wireless communication unit, and a combining unit. The voice processing unit receives speech signals. The wireless communication unit sends...
8983838 Global speech user interface  
A global speech user interface (GSUI) comprises an input system to receive a user's spoken command, a feedback system along with a set of feedback overlays to give the user information on the...
8983836 Captioning using socially derived acoustic profiles  
Mechanisms for performing dynamic automatic speech recognition on a portion of multimedia content are provided. Multimedia content is segmented into homogeneous segments of content with regard to...
8983841 Method for enhancing the playback of information in interactive voice response systems  
A network communication node includes an audio outputter that outputs an audible representation of data to be provided to a requester. The network communication node also includes a processor that...
8977555 Identification of utterance subjects  
Features are disclosed for generating markers for elements or other portions of an audio presentation so that a speech processing system may determine which portion of the audio presentation a...
8972257 Systems and methods to present voice message information to a user of a computing device  
Systems and methods to process and/or present information relating to voice messages for a user that are received from other persons. In one embodiment, a method implemented in a data processing...