20070203903 | METHODS AND APPARATUS FOR VISUALIZING, MANAGING, MONETIZING, AND PERSONALIZING KNOWLEDGE SEARCH RESULTS ON A USER INTERFACE | August, 2007 | Attaran Rezaei et al. |
20090248734 | SUGGESTING CONCEPT-BASED DOMAIN NAMES | October, 2009 | Adelman et al. |
20070094296 | Document management system for vehicle sales | April, 2007 | Peters III |
20080097981 | RANKING IMAGES FOR WEB IMAGE RETRIEVAL | April, 2008 | Williams et al. |
20050256863 | Context management system | November, 2005 | Crivella et al. |
20030220891 | Matter management computer software | November, 2003 | Fish |
20090119352 | Method for Optimizing Generational Garbage Collection Through Object Life Heuristics | May, 2009 | Branda et al. |
20070112860 | METHOD AND SYSTEM FOR ONLINE SUBMITTAL EXCHANGE | May, 2007 | Ostanik |
20040133567 | Run-time optimizations of queries with SQL spreadsheet | July, 2004 | Witkowski et al. |
20050216489 | Dynamic reference repository | September, 2005 | Young et al. |
20060059135 | Conducting a search directed by a hierarchy-free set of topics | March, 2006 | Palmon et al. |
[0001] This invention relates to systems and methods of generating and rendering annotated video files.
[0002] Individuals and organizations are rapidly accumulating large collections of video content. As these collections grow, individuals and organizations increasingly will require systems and methods for organizing and summarizing the video content in their collections so that desired video content may be found quickly and easily. To meet this need, a variety of different systems and methods for creating and summarizing video content have been proposed.
[0003] For example, storyboard summarization has been developed to enable full-motion video content to be browsed. In accordance with this technique, video information is condensed into meaningful representative snapshots and corresponding audio content. One known video browser of this type divides a video sequence into equal length segments and denotes the first frame of each segment as its key frame. Another known video browser of this type stacks every frame of the sequence and provides the user with information regarding the camera and object motions.
[0004] Content-based video summarization techniques also have been proposed. In these techniques, a long video sequence typically is classified into story units based on video content. In some approaches, scene change detection (also called temporal segmentation of video) is used to give an indication of when a new shot starts and ends. Scene change detection algorithms, such as scene transition detection algorithms based on DCT (Discrete Cosine Transform) coefficients of an encoded image, and algorithms that are configured to identify both abrupt and gradual scene transitions using the DCT coefficients of an encoded video sequence are known in the art.
[0005] In one video summarization approach, Rframes (representative frames) are used to organize the visual contents of video clips. Rframes may be grouped according to various criteria to aid the user in identifying the desired material. In this approach, the user may select a key frame, and the system then uses various criteria to search for similar key frames and present them to the user as a group. The user may search representative frames from the groups, rather than the complete set of key frames, to identify scenes of interest. Language-based models have been used to match incoming video sequences with the expected grammatical elements of a news broadcast. In addition, a priori models of the expected content of a video clip have been used to parse the clip.
[0006] In another approach, a hierarchical decomposition of a complex video selection is extracted for video browsing purposes. This technique combines visual and temporal information to capture the important relations within a scene and between scenes in a video, thus allowing the analysis of the underlying story structure with no a priori knowledge of the content. A general model of a hierarchical scene transition graph is applied to an implementation for browsing. Video shots are first identified and a collection of key frames is used to represent each video segment. These collections then are classified according to gross visual information. A platform is built on which the video is presented as directed graphs to the user, with each category of video shots represented by a node and each edge denoting a temporal relationship between categories. The analysis and processing of video is carried out directly on the compressed videos.
[0007] In each of the above-described video summarization approaches, the video summary information is stored separately from the original video content. Consequently, in these approaches there is risk that information enabling video summaries to be rendered may become disassociated from the corresponding original video files when the original video files are transmitted from one video rendering system to another.
[0008] The invention features systems and methods of generating and rendering annotated video files.
[0009] In one aspect, the invention features a method of generating an annotated video file. In accordance with this inventive method, an original video file is annotated by embedding therein information enabling rendering of at least one video summary that is contained in the annotated video file and comprises digital content summarizing at least a portion of the original video file.
[0010] In another aspect, the invention features a computer program for implementing the above-described annotated video file generation method.
[0011] Another aspect of the invention features a computer-readable medium tangibly storing an annotated video file having embedded therein information enabling rendering of at least one video summary that is contained in the annotated video file and comprises digital content summarizing at least a portion of an original video file.
[0012] In another aspect, the invention features a system for rendering an annotated video file that includes a video rendering engine. The video rendering engine is operable to identify information that is embedded in the annotated video file and enables rendering of at least one video summary that is contained in the annotated video file and comprises digital content summarizing at least a sequence of video frames contained in the video file. The video rendering engine is operable to render the at least one video summary.
[0013] Other features and advantages of the invention will become apparent from the following description, including the drawings and the claims.
[0014]
[0015]
[0016]
[0017]
[0018]
[0019]
[0020]
[0021] In the following description, like reference numbers are used to identify like elements. Furthermore, the drawings are intended to illustrate major features of exemplary embodiments in a diagrammatic manner. The drawings are not intended to depict every feature of actual embodiments nor relative dimensions of the depicted elements, and are not drawn to scale.
[0022] The embodiments described below feature systems and methods of generating annotated video files from original video files, which may or may not have been previously annotated. The annotated video files include embedded information enabling the rendering of at least one video summary that is contained in the annotated video file and includes digital content summarizing at least a portion of the original video file. In this way, the video summaries are is always accessible to a rendering system because the resulting annotated video files contain the contents of both the original video file and the video summaries. Users may therefore quickly and efficiently browse through a collection of annotated video files without risk that the video summaries will become disassociated from the corresponding video files, regardless of the way in which the video files are transmitted from one rendering system to another.
[0023] As used herein, “video summary” refers to any digital content that summarizes (i.e., represents, symbolizes, or brings to mind) the content of an associated sequence of video frames of an original video file. The digital content of a video summary may be in the form of one or more of text, audio, graphics, animated graphics, and full-motion video. For example, in some implementations, a video summary may include one or more images representative of original video file content and digital audio content synchronized to the one or more representative images.
[0024] I. System Overview
[0025] Referring to
[0026] In a computer-based implementation, both video file annotating engine
[0027] As shown in
[0028] II. Generating Annotated Video Files
[0029] Referring to
[0030] Referring to
[0031] In some implementations, the video summary rendering information that is embedded in the original video file corresponds to the video summary itself. As mentioned above, a video summary is any digital content (e.g., text, audio, graphics, animated graphics, and full-motion video) that summarizes (i.e., represents, symbolizes, or brings to mind) the content of the associated sequence of video frames of the original video file. Accordingly, in these implementations, the digital content of the video summaries are embedded in the original video files. In some implementations, a video summary may be derived from the original video file (e.g., keyframes of the original video file, short segments of the original video file, or an audio clip from the original video file). In other implementations, a video summary may be obtained from sources other than the original video file yet still be representative of the original video file (e.g., a trailer of a commercial motion picture, an audio or video clip, or a textual description of the original video).
[0032] Referring to
[0033] In some embodiments, the keyframes of each shot are organized into a hierarchy to allow a user to browse video summaries at multiple levels of detail. For example, in the illustrated embodiment, the first level of detail corresponds to the first keyframes
[0034] Referring to
[0035] Referring back to
[0036] III. Rendering Annotated Video Files
[0037] Referring to
[0038] In some implementations, while the user is browsing video summaries, the user may select a particular summary (e.g., keyframe) as corresponding to the starting point for rendering the original video file. In response, video file rendering engine
[0039] IV. Conclusion
[0040] Other embodiments are within the scope of the claims.
[0041] The systems and methods described herein are not limited to any particular hardware or software configuration, but rather they may be implemented in any computing or processing environment, including in digital electronic circuitry or in computer hardware, firmware, or software.