Plaque It!
Sponsored by: Flash of Genius |
This application is a divisional application of Ser. No. 10/552,371, which is a National Stage Application of International Application Serial No. PCT/JP2004/004997, filed Apr. 7, 2004.
The present invention relates generally to a readable, writable data recording medium, and relates more particularly to a data recording medium for recording moving picture (video) data, still image data, audio data, and other types of multimedia data in data broadcasting and various other formats. The invention also relates to an apparatus and method for recording data to such a data recording medium.
Rewritable optical discs have had a maximum storage capacity of approximately 650 MB, but this limit has been pushed to several gigabytes by the introduction of DVD-RAM discs, a phase-change type of storage medium. Used in conjunction with practical implementations of MPEG (particularly MPEG-2), a digital AV data encoding standard, DVD-RAM is not limited to computer applications and will soon find widespread use as a recording and playback medium in the audio-video (AV) and even home entertainment industries.
With the start of digital broadcasts in Japan it has become possible to multiplex and simultaneously transmit the video, audio, and data portions of plural programs to the MPEG transport stream (“MPEG_TS” below). Digital broadcast recorders using hard discs or DVD media to record these programs are also available.
These next-generation digital broadcast recorders typically record digital broadcasts in the original broadcast format without converting the MPEG_TS of the broadcast, and are expected to record AV data from an external line input using the MPEG_TS so that the recorder does not need to internally handle both the MPEG program stream (“MPEG_PS” below) and the MPEG_TS.
However, because the current DVD logic standards (including the DVD-Video standard, DVD-Audio standard, DVD Video Recording standard, and DVD Stream Recording standard) use the MPEG_PS for AV stream recording, MPEG_TS to MPEG_PS conversion (TS2PS conversion) is required in order to convert content recorded in the MPEG_TS format, such as by the above-noted digital broadcast recorder, to the DVD-Video format, for example (see for example, JP2002-344888A).
Converting a stream multiplexed to the MPEG_TS to MPEG_PS, however, involves a complex recalculation for decoder buffer management, the TS2PS conversion is time-consuming, and often involves re-encoding the elementary stream, resulting in degraded image quality and sound quality.
The present invention is directed to solving these problems and an object of the invention is to provide a data recording medium for recording an MPEG_TS stream enabling fast, simple conversion when converting content recorded in the MPEG_TS format to the MPEG_PS format. A further object is to provide an apparatus and a method for recording data using the data recording medium of the invention.
In a first aspect of the invention, provided is a recording medium for storing system stream including video elementary stream generated by encoding video information and audio elementary stream generated by encoding audio information with the video elementary stream and the audio elementary stream being multiplexed. In the recording medium, the system stream is allowed to have a first format (TS) and a second format (PS). The first format is allowed to have a constrained format used for converting the system stream from the first format (TS) to the second format (PS). According to the constrained format, a presentation order of the video information including continuous complete data blocks starts at a top field and ends at a bottom field, in which the continuous complete data blocks are included in continuous reference presentation time for video information and audio information, and the continuous reference presentation time includes at least one data block.
The elementary stream may be encoded in an encoding method which is allowed for both the first format and the second format.
The video elementary stream and the audio elementary stream may be multiplexed to the system stream in the same order as a presentation order in the second format (PS) that is converted according to the constrained format
The first format (TS) may have a structure for storing data segmented in packets, and each packet may be provided with time stamp information indicating relative transfer timing. The second format (PS) may have a structure for storing data segmented in packs, and each pack may be provided with time stamp information indicating transfer timing. The pack is larger than the packet in size. In this case, a predetermined number of packets are grouped and managed as a multiplexing unit, and total data size of packets managed in the multiplexing unit is not larger than data size of the pack.
The system stream may include encode information that indicates whether or not the video information including continuous complete data block starts at a top field and ends at a bottom field. The encode information may also be included in management information for managing data stored in the recording medium.
In a second aspect of the invention, provided is an information recording apparatus for encoding video information and audio information to system stream and recording the system stream to a recording medium. The system stream is allowed to have a first format (TS) and a second format (PS). The information recording apparatus includes a first encoding section operable to encode video information and audio information in a predetermined encoding manner according to the first format (TS) to generate video elementary stream and audio elementary stream; a second encoding section operable to perform system-encoding by multiplexing the video elementary stream and the audio elementary stream to generate the system stream according to the first format (TS); and a control section operable to control the first encoding section and the second encoding section.
The first format (TS) is allowed to have a constrained format used for converting the system stream from the first format (TS) to the second format (PS). The control section controls the first and second encoding section so that each encoding is done according to the constrained format. According to the constrained format, a presentation order of the video information including continuous complete data block starts at a top field and ends at a bottom field, the continuous complete data blocks are included in continuous reference presentation time for video information and audio information, and the continuous reference presentation time includes at least one data block.
In a third aspect of the invention, provided is an information recording method including encoding video information and audio information to system stream and recording the system stream to a recording medium.
The system stream is allowed to have a first format (TS) and a second format (PS). The first format (TS) is allowed to have a constrained format used for converting the system stream from the first format (TS) to the second format (PS).
The information recording method includes encoding video information and audio information in a predetermined encoding manner according to the first format (TS) to generate video elementary stream and audio elementary stream; and performing system-encoding by multiplexing the video elementary stream and the audio elementary stream to generate the system stream according to the first format (TS). According to the constrained format, a presentation order of the video information including continuous complete data blocks starts at a top field and ends at a bottom field, the continuous complete data blocks are included in continuous reference presentation time for video information and audio information, and the continuous reference presentation time includes at least one data block.
According to the present invention, video information is recorded so that, according to the constrained format, the video information including continuous complete data blocks starts at a top field and ends at a bottom field, the continuous complete data blocks being included in continuous reference presentation time for video information and audio information, the continuous reference presentation time including at least one data block. Such an arrangement causes re-encoding of video stream to be unnecessary and provides high speed conversion of a signal input externally from MPEG-TS to MPEG-PS complying with DVD standard.
FIG. 1 is a schematic diagram showing a DVD recording apparatus and an exemplary interface between the DVD recording apparatus and other components used in conjunction therewith.
FIG. 2 is a block diagram of the drive apparatus of a DVD recorder.
FIG. 3A illustrates a contiguous area on the disc, and
FIG. 3B is a graph illustrating the data accumulation in a track buffer.
FIG. 4 is a block diagram of a DVD recorder having a semiconductor memory card and hard disk drive.
FIGS. 5A and 5B show a data region of a disc and data structure of the disc, respectively.
FIGS. 6A and 6B show the logical data space of the disc.
FIG. 7 shows the disc directory and file structure.
FIG. 8 shows the structure of a video object.
FIG. 9 shows the MPEG system stream.
FIGS. 10A to 10 show the MPEG transport stream (MPEG_TS).
FIGS. 11A to 11 C show the MPEG program stream (MPEG_PS).
FIGS. 12A to 12 D show a TS packet.
FIGS. 13A to 13 C 2 shows a PAT table.
FIGS. 14A to 14 C show the arrangement of video objects on disc.
FIGS. 15A and 15B show the data structure of video management information.
FIGS. 16A and 16B show the data structure of video management information.
FIG. 17 shows the relationship between an object, object information, and PGC information in the video management information.
FIG. 18 is a block diagram showing the functional configuration of a playback apparatus.
FIG. 19 is a block diagram showing the functional configuration of a recording apparatus.
FIG. 20 is a block diagram showing the configuration of a data recording and reproducing apparatus according to the present invention.
FIG. 21 shows the structure of a self-encoding stream.
FIGS. 22A and 22B describes the packet transfer time interval.
FIG. 23 describes a storage method for a User Private packet.
FIG. 24 describes a storage method for a User Private packet.
FIG. 25 describes a storage method for a User Private packet.
FIG. 26 describes a storage method for a User Private packet.
FIGS. 27A to 27 H described conversion of an MPEG_TS to an MPEG_PS.
FIGS. 28A to 28 G show an encoding method for an MPEG_TS enabling easy conversion to an MPEG_PS.
FIG. 29 shows conversion to a DVD Video format (NTSC).
FIG. 30 shows conversion to a DVD Video format (PAL).
FIG. 31 shows the internal data structure of a User Private packet.
FIG. 32 shows the correlation between an MPEG_TS encoded for easy conversion to an MPEG_PS and the MPEG_PS after conversion.
FIG. 33 is a block diagram of the encoder of a data recording apparatus according to the present invention.
FIG. 34 shows differences in processes for converting from a self-encoded MPEG_TS to DVD formats due to differences in system encoding.
FIG. 35 shows the Tip packet data structure.
FIG. 36 shows the adaptation field data structure.
FIG. 37 shows the Data_ID data structure.
FIG. 38 shows the display_and_copy_info data structure.
FIG. 39 shows the encode_info data structure.
FIG. 40 shows the PES_info data structure.
FIG. 41 shows the MakersPrivateData data structure.
FIG. 42A shows PID of the Tip packet.
FIG. 42B shows the stream_type of Tip packet.
FIG. 43 shows field values of the PES packet header in a Constrained SESF stream.
FIG. 44 shows the PES_extension_flag and PES_header_data_length in a Constrained SESF stream.
FIG. 45 shows an example of an MPEG_TS self-encoded such that it does not satisfy T_STD model.
FIGS. 46A and 46B show an example of an MPEG_PS converted from a MPEG_TS such that the MPEG_PS does not satisfy the P_STD model.
FIG. 47 shows SCR calculation.
FIG. 48 shows the elementary stream attributes of a Constrained SESF when encode_condition =11b.
FIG. 49 shows the elementary stream attributes of a Constrained SESF when encode_condition=01b.
FIG. 50 shows the stream structure of a format conforming to the DVD Video standard.
FIG. 51 shows the structure of PCI data in NV_PCK.
FIG. 52 shows the structure of PCI_GI data in NV_PCK.
FIG. 53 shows the structure of DSI data in NV_PCK.
FIG. 54 shows the structure of DSI_GI data in NV_PCK.
FIG. 55 shows the structure of SML_PBI data in NV_PCK.
FIG. 56 shows the structure of SYNCI data in NV_PCK.
FIG. 57 shows the stream structure of a format conforming to the DVD Video Recording standard.
FIG. 58 is a flow chart of the TS packet (RD_PCK) conversion process.
FIG. 59 is a flow chart of the TS packet (V_PCK, A_PCK) conversion process.
FIG. 60 shows a part of the data structure of the pack header in an MPEG-2 program stream pack.
FIG. 61 shows a DVD format system header.
FIG. 62A shows the structure of a packet header stored in RDI_PCK.
FIG. 62B shows the structure of a packet header stored in RDI_PCK.
FIG. 63 shows a part of the data structure of the packet header in an MPEG-2 program stream packet.
FIG. 64 shows the structure of an AC-3 standard private header in the DVD format.
FIGS. 65A and 65B show converting a Constrained SESF to an MPEG_PS for a video pack.
FIGS. 66A and 66B show converting a Constrained SESF to an MPEG_PS for an audio pack.
FIG. 67 is a table of audio bit rates allowed by the Constrained SESF, and the maximum payload length stored to one audio PES packet for AC-3 and MPEG-1 Audio at the corresponding bit rates.
FIG. 68 is a flow chart of overall TS2PS conversion process.
FIG. 69 is a flow chart of initialization process in the TS2PS conversion process.
FIG. 70 is a flow chart of the capsule unit process in the TS2PS conversion process.
FIG. 71 is a flow chart of the pack unit process.
FIG. 72 is a flow chart of the SCR calculation process.
FIG. 73 is a flow chart of the pack header process.
FIG. 74 is a flow chart of the packet header process.
FIG. 75 is a flow chart of the stream ID process.
FIG. 76A is a flow chart of the video PES packet leading process.
FIG. 76B is a flow chart of the video PES packet non-leading process.
FIG. 77A is a flow chart of the audio PES packet leading process.
FIG. 77B is a flow chart of the audio PES packet non-leading process.
FIG. 78 is a flow chart of the payload process.
FIG. 79 is a flow chart of the padding packet process.
FIG. 80 shows the Constrained SESF stream format.
FIG. 81 shows the data structure of an MPEG standard PES packet.
FIG. 82 shows a method of generating NV_PCK data.
FIG. 83A shows efficiency multiplexing method using Multiplexing Unit with audio frames aligned.
FIG. 83B shows efficiency multiplexing method using Multiplexing Unit with I-picture aligned in the lead.
FIG. 84A shows encode conditions related to the order of video presentation fields in Constrained SESF (when satisfying DVD-Video standard).
FIG. 84B shows encode conditions related to the order of video presentation fields in Constrained SESF (when not satisfying DVD-Video standard).
FIG. 85 is a flow chart of the recording process according to Constrained SESF provided with constraints for a top field and a bottom field.
FIG. 86 is a flow chart of the end recording process.
A DVD disc, DVD recorder, and DVD player are described with reference to the accompanying figures in the sequence shown below as preferred embodiments of a data recording medium, recording apparatus, and playback apparatus according to the present invention.
Key points of the present invention are described particularly in the following section 8 , outline of the invention, and section 9 , detailed embodiments of the invention. While the relationship to the present invention may vary, all of the following describe various aspects of the invention.
1. Outline of the DVD recorder system
2. Function outline of the DVD recorder
3. Outline of the DVD disc
4. Outline of reproduced AV data
5. AV data management information and playback control
6. Basic operation of the playback function
7. Basic operation of the recording function
8. Outline of the invention
9. Detailed embodiments of the invention
The following terminology is used below.
“TS2PS conversion” refers to converting the MPEG transport stream (MPEG_TS) to the MPEG program stream (MPEG_PS).
“DVD format” refers to both the DVD-Video standard format and the DVD-Video Recording standard format, each being an MPEG_PS implementation.
FIG. 1 shows a typical DVD recorder in relation to other systems and devices used with the DVD recorder.
As shown in FIG. 1 a DVD, which is a type of optical disc, is loaded to the DVD recorder for recording video data to the disc and reproducing video data from the disc. A remote control device is typically used to operate the DVD recorder.
The video data input to the DVD recorder could be an analog signal or a digital signal with analog broadcasts exemplary of analog signals and digital broadcasts exemplary of digital signals. Generally speaking, analog broadcasts are received and demodulated by the receiver built in to a television, and input as an NTSC or other analog video signal to the DVD recorder for recording. Digital broadcasts are demodulated to a digital signal by the digital broadcast receiver (set-top box (STB)) input to the DVD recorder for recording.
Video data recorded to a DVD is reproduced by the DVD recorder and externally output. Like the video input, video output may be an analog signal or digital signal. Analog signals are input directly to the television. Digital signals are passed through the STB and converted to an analog signal, which is then input to the television for video presentation.
Video data may also be recorded to and reproduced from a DVD by a device other than a DVD recorder, such as a DVD camcorder or personal computer. A DVD disc storing video data recorded by a device other than a DVD recorder will also be reproduced by the DVD recorder when loaded therein.
It should be noted that audio data is normally associated with the video data of an analog broadcast or digital broadcast, and this audio data is likewise recorded and reproduced by the DVD recorder.
Furthermore, the video data is generally moving picture data, but could also include still images such as when a still image (photograph) is captured using the snapshot function of a DVD camcorder.
IEEE 1394, ATAPI, SCSI, or other standard could be used for the digital interface between the STB and DVD recorder.
It should also be noted that an NTSC signal is referred to above as the type of component video signal passed between the DVD recorder and television, but a component signal sending separate luminance and color difference signals could be used. Furthermore, changing the interface for transmitting video between AV components and televisions from an analog interface to a digital interface such as DVI is currently being researched, and we anticipate that a digital interface can also be used to connect DVD recorders and televisions.
FIG. 2 is a function block diagram of a DVD recorder. The drive device has an optical pickup 101 for reading data from a DVD-RAM disc 100 , an ECC (error correction code) processor 102 , track buffer 103 , switch 104 for changing track buffer 103 input and output, an encoder 105 , and a decoder 106 .
As shown in the figure, data is recorded to the DVD-RAM disc 100 with the smallest recording unit being one sector (=2 KB). Furthermore, 16 sectors equal 1 ECC block, and the ECC processor 102 applies error correction processing using ECC block units.
The DVD recorder could also use semiconductor memory cards or hard disk drives in addition to DVDs as data storage media. FIG. 4 is a block diagram of a DVD recorder having a semiconductor memory card and hard disk drive.
It should also be noted that 1 sector could be 512 bytes, 8 KB, or other size. The ECC block could also contain 1 sector, 16 sectors, 32 sectors, or other configuration. It is expected that the sector size and number of sectors in each ECC block will also increase as the recordable data capacity increases.
The track buffer 103 is a buffer for recording AV data at a variable bit rate (VBR) in order to record AV data more efficiently to the DVD-RAM disc 100 . The DVD-RAM disc 100 write rate (Va) is a fixed rate but the bit rate (Vb) of the AV data varies according to the complexity of the AV content (images in the case of video content). The track buffer 103 is used to absorb this bit rate difference.
In order to use this track buffer 103 even more effectively, the AV data can be distributively recorded to the disc 100 . This is further described with reference to FIGS. 3A and 3B.
FIG. 3A shows the disc address space. As shown in FIG. 3A, continuous playback of the AV data is enabled when the AV data is recorded to separate contiguous spaces [a 1 , a 2 ] and [a 3 , a 4 ] by supplying data accumulated in the track buffer to the decoder 106 while seeking from a 2 to a 3 . The change in the amount of data stored to the track buffer at this time is shown in FIG. 3B.
When reading starts at address a 1 , the AV data is input from time t 1 to the track buffer 103 and data output from the track buffer 103 also starts. Data then accumulates in the track buffer 103 at the rate (Va−Vb), that is, the difference between the input rate (Va) to the track buffer 103 and the track buffer output rate (Vb). This continues until the search area reaches a 2 , that is, until time t 2 . If the data accumulated in the track buffer 103 during this time is B(t 2 ), data can be supplied to the decoder 106 by gradually depleting the data B(t 2 ) accumulated in the track buffer 103 from time t 2 to the time t 3 at which reading from the address a 3 begins.
In other words, a continuous supply of AV data can be maintained during seek operations insofar as at least a specified amount of data ([a 1 , a 2 ]) has been read before the seek operation starts.
The size of the contiguous area required to enable continuous AV data output when converted to an ECC block count (N_ecc) is shown by the following equation:
N _ecc=Vb* Tj /(( N _sec*8* S _size)*(1−Vb/Va))
where N_sec is the number of sectors in an ECC block, S_size is the sector size, and Tj is the seek performance (maximum seek time).
A defective sector could also occur in a contiguous area. The required size of the contiguous area in this case is shown by the following equation:
N _ecc= dN _ecc+Vb* Tj /(( N _sec*8* S _size)*(1−Vb/Va))
where dN_ecc is the size of the allowed defective sector, and Ts is the time needed to skip the defective sector within the contiguous area. This equation also returns the size of the contiguous area as the number of ECC blocks.
The above example is described using reading data from a DVD-RAM disc, that is, data playback, by way of example, but it will be obvious that writing, that is, recording, data to the DVD-RAM disc can be handled in the same way.
Continuous data playback and recording can thus be achieved with a DVD-RAM disc even when the AV data is recorded to separate recording areas on the disc insofar as the data is recorded in blocks of a specific size or more. These contiguous areas are referred to as Contiguous Data Areas (CDA) in DVD terminology.
FIGS. 5A and 5B show the physical structure and a plan view of a DVD-RAM, i.e., a recordable optical disc. DVD-RAM discs are typically housed in a cartridge for loading to a DVD recorder. The purpose of the cartridge is to protect the disc. The DVD-RAM disc can, however, be loaded directly to the DVD recorder without being housed in a cartridge if the recording surface can be protected in some other way.
DVD-RAM discs are recorded using a phase-change recording technique. Data on the disc is managed by sector unit, and addresses are added for data access. Groups of 16 sectors are used for error correction, have an error correction code added thereto, and are referred to as ECC blocks.
FIG. 5A shows the recording area of a DVD-RAM disc, i.e., a recordable optical disc. As shown in the figure, a DVD-RAM disc has a lead-in area at the inside circumference, a lead-out area at the outside circumference, and a data area between the lead-in and lead-out areas.
Reference signals for stabilizing the servo when accessing the disc with the optical pickup, and an ID signal for distinguishing a DVD-RAM disc from other types of media, are recorded to the lead-in area.
The same reference signals are also recorded to the lead-out area.
The data area is segmented into sectors (each 2048 bytes) as the smallest access unit. The data area is also segmented into a plurality of zones in order to apply a rotational control technique known as Zone Constant Linear Velocity (Z-CLV) during recording and playback.
FIG. 5A shows plural zones formed concentrically on the DVD-RAM disc. In this example the DVD-RAM disc is divided into 24 zones, labelled zone 0 to zone 23 . The rotational angular velocity of the DVD-RAM is set differently in each zone such that it increases in proximity to the inside circumference and is constant while the optical pickup accesses data in the same zone. This increases the recording density of the DVD-RAM and enables easier rotational control during recording and playback.
FIG. 5B shows the lead-in area, lead-out area, and zones 0 to 23 concentrically arranged in FIG. 5A when viewed in a line through the disc radius.
The lead-in area and lead-out area each include a defect management area (DMA). The defect management area is for recording position information indicating the location of a sector containing a defect, and substitute sector position information indicating in which substitute area the sector substituted for the defective sector is located.
Each zone includes a user area between a substitute area and an unused area. The user area is the area that can be used by the file system as a recording area. The substitute area is the area substitutionally used when there is a defective sector. The unused area is an area not used for data recording, and is approximately two tracks wide. The sector address is recorded to the same position in adjacent tracks within each zone, but with Z-CLV the sector address is recorded to a different position in tracks adjacent to the zone boundary. This unused area is therefore provided to prevent sector address detection errors in tracks adjacent to the zone boundary.
There are, therefore, sectors not used for data recording at the zone boundaries. A logical sector number (LSN) is therefore assigned to each physical sector in the user area of a DVD-RAM disc sequentially from the inside circumference in order to continuously identify only those sectors used for data recording.
FIGS. 6A and 6B show the logical data space of a DVD-RAM disc comprising logical sectors. The logical data space is called the “volume space” and is used to record user data.
Data recorded in the volume space is managed with a file system. More specifically, a group of sectors storing data is a “file,” and volume structure information managing a group of files as a “directory” is recorded to the beginning and end of the volume area. The UDF file system is used in the present embodiment and conforms to ISO 13346.
The above-noted group of sectors is not necessarily contiguous within the volume space, and can be split into separate parts. Of the sectors constituting each file, the file system therefore manages each group of contiguous sectors in the volume space as an extent, and manages each file as a set of related extents.
FIG. 7 shows the structure of a directory and file recorded to DVD-RAM. Below the root is the VIDEO_RT directory, and below VIDEO_RT are the various object files containing the playback data and a VIDEO Manager file containing management information such as the playback sequence and various attributes.
Objects are data structures conforming to MPEG standards, and include PS_VOB, TS 1 _VOB, TS 2 _VOB, AOB, POB, and MNF (Manufacturer's Private Data).
PS_VOB, AOB, and POB are MPEG program streams (PS), and TS 1 _VOB and TS 2 _VOB are MPEG transport streams (TS). The program stream has a data structure designed for storing AV data to package media. The transport stream has a data structure intended for communications media.
PS_VOB, TS 1 _VOB and TS 2 _VOB are objects of primarily video data but containing both video data and audio data. In principle, TS 1 _VOB objects are encoded by the DVD recorder with an explicitly managed internal picture structure. TS 2 _VOB objects are encoded externally to the DVD recorder, and part of the internal picture structure and data structure is unknown.
Typically, TS 1 _VOB objects are externally input analog video signals encoded by the DVD recorder to the transport stream, and TS 2 _VOB objects are externally input digital video signal objects recorded directly to disc without further encoding by the DVD recorder. That is, when a DVD recorder records digital broadcasting, TS 2 _VOB is generally used.
AOB and POB are MPEG program streams. AOB, objects contain primarily audio data, and POB objects contain primarily still images.
The MNF (Manufacturer's Private Data) block is used to store information specific to a particular manufacturer.
“Primarily video data” and “primarily audio data” above indicate that a high bit rate is allocated. VOB are used in video and similar applications, and AOB are used in music applications.
FIG. 8 shows the structure of MPEG data recorded as AV objects to a DVD.
As shown in FIG. 8, the video stream and audio stream are segmented and multiplexed. The MPEG standard refers to the multiplexed streams as the system stream. In the case of DVD, a system stream containing DVD specific settings is called a VOB (Video OBject). The segmentation units are called pocks and packets, and are approximately 2 KB in size.
The video stream is encoded according to the MPEG standard, variable bit rate compressed such that the bit rate is increased in complex images such as images containing much movement. The pictures in an MPEG stream are encoded as I-pictures, P-pictures, or B-pictures. I-pictures are spatially compressed and complete within each frame. P-pictures and B-pictures are temporally compressed using inter-frame correlations. A series of pictures including at least one I-picture is referred to as a Group of Pictures (GOP) in MPEG. A GOP is the access point for fast play and other special play modes, which are made possible by the presence of at least one intra-frame compressed I-picture.
In addition to using MPEG audio, the audio stream of a DVD can be encoded using AC-3, LPCM, or other encoding technique.
As also shown in FIG. 8 the Video Object Unit (VOBU) is the data unit multiplexing the video data of a GOP with the associated audio data. Video management data can also be included in a VOBU as header information.
A program stream (PS) and transport stream (TS) are included in the system stream described with reference to FIG. 8. As noted above, the program stream has a data structure intended for package media and the transport stream data structure is intended for communications media.
FIG. 9 shows the concept of the program stream and transport stream data structures.
The program stream comprises fixed length packs that are the smallest unit for data transfer and multiplexing. Each pack contains one or more packets. Both packs and packets comprise a header part and a data part. The data part is referred to as the payload in MPEG. For compatibility with the sector size, the fixed length of a pack in DVD is 2 KB. A pack can contain multiple packets, but because packs storing DVD video and audio contain only one packet, 1 pack equals 1 packet except in special cases.
The data transfer and units for multiplexing of the transport stream comprises fixed length TS packets. TS packet size is 188 bytes for compatibility with ATM transmissions, a communications standard. One or more TS packets form a PES packet.
PES packets are a concept common to both the program stream and transport stream, and the data structure is the same. Packets stored in program stream packs directly form FES packets, and a group of one or more transport stream TS packets form a PES packet.
The PES packet is the smallest encoding unit and stores video data and audio data with common encoding. More specifically, video data and audio data encoded with different coding methods are not present in a same PES packet. However, if the coding method is the same, it is not necessary to ensure the picture boundaries and audio frame boundaries. As shown in FIG. 9 one frame is stored to plural PES packets, and plural frames may be stored to one PES packet.
FIGS. 10A to 10 C and FIGS. 11A to 11 C show the data structures of the transport stream and program stream.
As shown in FIGS. 10A to 10 C and FIGS. 12A to 12 D, each TS packet comprises a TS packet header, adaptation field, and payload. The TS packet header stores a Packet Identifier (PID) whereby the video, audio, or other stream to which the TS packet belongs can be identified.
The Program Clock Reference (PCR) is stored to the adaptation field. The PCR is the reference value for the system time clock (STC) of the device decoding the stream. The device typically demultiplexes the system stream based on the PCR timing, and then reassembles the video stream and other streams.
The Decoding Time Stamp (DTS) and Presentation Time Stamp (PTS) are stored to the PES header. The DTS indicates the decoding timing of the picture or audio frame stored to the PES packet, and the PTS indicates the presentation timing of the video or audio output.
It should be noted that the PTS and DTS need not be written to every PES packet header. Decoding and output are possible insofar as the PTS and DTS are written to the header of the PES packet where the first data of the I-picture is stored.
The TS packet structure is shown in detail in FIGS. 12A to 12 D.
As shown in FIGS. 12A to 12 D the adaptation field stores the PCR and a random access presentation flag. This flag indicates whether data that is at the beginning of the video or audio frame and can be used as an access point is stored in the corresponding payload. In addition to the above-noted PID, the TS packet header also stores a unit start presentation flag indicating the beginning of a PES packet, and adaptation field control data indicating whether an adaptation field follows.
FIGS. 11A to 11 C show the structure of packs in the program stream. The pack contains the SCR in the pack header and a stream_id in the packet header of packets stored in the pack. The SCR is effectively identical to the transport stream PCR, and the stream_id to the PID. The PES packet data structure is also the same as in the transport stream, and the PTS and DTS are stored in the PES header.
One major difference between the program stream and transport stream is that the transport stream allows for multiple programs. That is, in terms of program units, the program stream can carry only one program but the transport stream can simultaneously transmit multiple programs. This means that the playback device must be able to identify the video streams and audio streams constituting each program carried in the transport stream.
FIGS. 13A to 13 C 2 show the PAT table and PMAP table used to transmit structure information for the audio stream and video stream of each program. As shown in FIGS. 13 C 1 and 13 C 2 the PMAP table stores information relating to the combination of video and audio streams used in each program, and the PAT table stores information correlating programs and PMAP tables. The playback device can therefore reference the PAT table and PMAP table to detect the video and audio streams for the program to be output.
How the program stream packs and transport stream TS packets described above are arranged on the disc is described next with reference to FIGS. 14A to 14 C.
As shown in FIG. 14A there are 32 sectors in an ECC block.
As shown in FIG. 14B, the packs (PS Packs) forming a video object (PS_VOB) of a program stream type are located at the sector boundaries. This is because the pack size and sector size are both 2 KB.
Video objects (TS 1 _VOB, TS 2 _VOB) of the transport stream type, however, are 8 KB units and are therefore contained in the ECC block. Each 8 KB unit contains an 18 byte header area and 43 TS packets containing Arrival Time Stamp (ATS) information in the data area. The ATS information is data generated and added by the DVD recorder, and indicates the timing at which the packet was received by the DVD recorder from an external source.
It should be noted that an MPEG_TS storage format continuously recording combinations of fixed-byte length ATS and MPEG_TS packets is also possible as shown in FIG. 14C.
FIGS. 15A to 15 B and FIGS. 16A to 16 B show the data structure of the video management information file (Video Manager) shown in FIG. 7.
The video management information includes object information describing such management information as where objects are recorded on disc, and presentation control information describing the playback sequence of the objects.
FIG. 15A shows an example in which the objects recorded to the disc include PS_VOB# 1 -PS_VOB#n, TS 1 _VOB# 1 -TS 1 _VOB#n, and TS 2 _VOB# 1 -TS 2 _VOB#n.
As shown in FIG. 15A, a PS_VOB information table, TS 1 _VOB information table, and TS 2 _VOB information table are separately recorded according to the object types. Each of these tables stores VOB information for each object.
The VOB information includes general information about the corresponding object, object attribute data, an access map for converting the object playback time to a disc address value, and management information for the access map. The general information includes identification information for the corresponding object and object recording time. The attributes include video stream attributes (V_ATR) such as the video stream coding mode, the number of audio streams (AST_Ns), and audio stream attributes (A_ATR) such as the audio stream coding mode.
There are two reasons why an access map is required. The first is so that the playback path information avoids directly referencing object recording positions based on a sector address value, for example, and instead can indirectly reference object locations based on the object playback time. Object recording positions can change with RAM media as a result of editing the object, for example. This increases the amount of playback path information that must be updated if the playback path information references object recording positions directly based on the sector address. If the objects are referenced indirectly based on the playback time, however, it is not necessary to update the playback path information and only the access map needs to be updated.
The second reason is that the audio stream typically has two reference bases, the time base and data (bit stream) base, but the correlation therebetween is not complete.
For example, using a variable bit rate (a method of changing the bit rate according to the complexity of the image) is becoming the norm with MPEG-2 Video, an international standard for video stream encoding. In this case there is no proportional relationship between the amount of data from the stream start and playback time, and random access based on the time base is therefore not possible. An access map is used to resolve this problem by converting between the time base and data (bit stream) base.
As shown in FIG. 15A, the presentation control information includes a user-defined playback path information table, original playback path information table, and title search pointer.
As shown in FIG. 16A there are two types of playback paths data: originally defined playback path information generated automatically by the DVD recorder to describe all objects recorded during object recording, and user-defined playback path information enabling a user to freely define a particular playback sequence. The playback path information is uniformly referred to as Program Chain Information (PGC information) on a DVD, the user-defined playback path information is referred to as the U_PGC information, and the original playback path information as the O_PGC information. The U_PGC information and O_PGC information are tables listing the cell information describing the cells in the object playback period. The object playback period indicated by the O_PGC information is called an original cell (O_CELL), and the object playback period indicated by the U_PGC information is called a user cell (U_CELL).
A cell indicates the object playback period using the object playback start time and playback end time; the playback start and end times are converting by the access map described above to the actual location where the object is recorded on disc.
As shown in FIG. 16B, a cell group indicated by the PGC information defines a continuous playback sequence reproduced sequentially according to the order of entries in the table.
FIG. 17 shows a specific relationship between objects, cells, PGC, and access map.
As shown in FIG. 17 the original PGC information 50 contains at least one cell information entry 60 , 61 , 62 , 63 .
Each cell information entry defines the object to reproduce as well as the object type, and object playback period. The order of the cell information entries in the PGC information 50 defines the playback sequence of the objects defined by each cell when the objects are reproduced.
Each cell information entry (cell information 60 , for example) includes a Type 60 a indicating the type of specific object, an Object ID 60 b identifying a particular object, and a start presentation time Start_PTM 60 c and end presentation time End_PTM 60 d in the object on the time base.
During data playback, the cell information 60 is sequentially read from the PGC information 50 , and the objects specified by each cell are reproduced for the playback period defined by the cell.
The access map 80 c converts the start and end time information contained in the cell information to the object address on disc.
This access map is the map information described above and is generated and recorded when the objects are recorded. The picture structure of the object data must be analyzed in order to generate the map. More specifically, it is necessary to detect the I-picture location shown in FIG. 9, and detect the PTS and other time stamp information, that is, the I-picture playback time shown in FIGS. 10A to 10 C and FIGS. 11A to 11 C.
Problems occurring when generating the PS_VOB, TS 1 _VOB, and TS 2 _VOB map information are described next.
As described with reference to FIG. 1, the PS_VOB and TS 1 _VOB are primarily generated by the DVD recorder encoding a received analog broadcast to an MPEG stream. The I-picture and time stamp information is therefore auto-generated by the DVD recorder, the internal data structure of the stream is known to the DVD recorder, and the map information can be generated with no problem.
As also described with reference to FIG. 1, the TS 2 _VOB is a received digital broadcast recorded directly to disc by the DVD recorder with no intermediate encoding. Because the recorder thus does not generate the time stamp information and determine the I-picture locations as it does when recording a PS_VOB, the DVD recorder does not know the internal data structure of the stream and must therefore detect this information from the recorded digital stream.
To do this the DVD recorder detects the I-picture and time stamp information for the map information of a TS 2 _VOB recording a stream encoded externally to the recorder as follows.
First, I-pictures are detected by detecting the random access presentation information of the TS packet adaptation field shown in FIGS. 12A to 12 D. The time stamp information is detected by detecting the PTS in the PES header. Note that the PCR from the adaptation field or the ATS indicating the TS packet arrival time at the DVD recorder can be used instead of the PTS for the time stamp. In any case, the DVD recorder detects I-picture locations based on information in a high level system layer and does not need to analyze the data structure of the MPEG stream video layer. This is because the system overhead required to analyze the video layer in order to generate the map information is great.
There are also cases in which system layer detection is not possible. The map information cannot be generated in such cases and it is therefore necessary to indicate that there is no valid map information. The DVD recorder indicates this using the map management information shown in FIG. 15B.
The map management information shown in FIG. 15B contains map validity information and a self-encoding flag. The self-encoding flag indicates that an object was encoded by the DVD recorder, and thus indicates that the internal picture structure is known and that the map information time stamp information and I-picture location information is accurate. The map validity information indicates whether or not there is a valid access map.
Examples of when the system layer cannot be detected include when the adaptation field is not set and when the digital stream is not an MPEG transport stream. Various digital broadcasting standards and formats are used around the world, and there will naturally be cases in which the DVD recorder records objects for which it cannot generate a map. For example, if a DVD recorder designed for the Japanese market and recording digital broadcasts in Japan is used in the United States to record digital broadcasts in the United States, there will likely be cases in which the DVD recorder cannot generate a map for the recorded objects.
The DVD recorder can, however, sequentially reproduce from the beginning objects for which map information is not generated. In this case video from the recorded digital stream can be reproduced by outputting it through a digital interface to a STB appropriate to the stream.
The playback operation of a DVD recorder/player for reproducing content recorded to an optical disc as described above is described next below with reference to FIG. 18.
As shown in FIG. 18 the DVD player has an optical pickup 201 for reading data from the optical disc 100 , an ECC processor 202 for error correction processing of the read data, a track buffer 203 for temporarily storing the read data after error correction, a PS decoder 205 for reproducing video objects (PS_VOB) and other program streams, a TS decoder 206 for reproducing digital broadcast objects (TS 2 _VOB) and other transport streams, an audio decoder 207 for reproducing audio objects (AOB), a still picture decoder 208 for decoding still picture objects (POB), a switching means 210 for changing data input to the decoders 205 to 208 , and a controller 211 for controlling the various parts of the player.
Data recorded to the optical disc 100 is read by the optical pickup 201 , passed through the ECC processor 202 and stored to track buffer 203 . Data stored to the track buffer 203 is then input to and decoded and output by the PS decoder 205 , TS decoder 206 , audio decoder 207 , or still picture decoder 208 .
The controller 211 determines what data to be read based on the playback sequence defined by the playback path information (PGC) shown in FIGS. 16A and 16B. Using the example shown in FIGS. 16A and 16B, the controller 211 thus first reproduces part (CELL # 1 ) of VOB # 1 , then part (CELL # 2 ) of VOB # 3 , and finally VOB # 2 (CELL # 3 ).
Using the cell information of the playback path information (PGC) shown in FIG. 17, the controller 211 can also capture the type of cell reproduced, corresponding objects, and the playback start and end times of the objects. The controller 211 inputs the data for the period of the object specified by the cell information to the appropriate decoder.
The controller 211 also identifies the objects to be reproduced based on the Object ID of the cell information. The controller 211 also identifies the a cell, which is the playback period of the identified object, by converting the Start_PTM and End_PTM of the cell information to a disc address value by referencing the access map of the corresponding VOB information.
A player according to this embodiment of the invention also has a digital interface 204 for supplying the AV stream to an external device. It is therefore possible to supply the AV stream to an external device through an IEEE 1394, IEC 958, or other communications means. This is so that, for example, when the player does not have an internal decoder for decoding a TS 2 _VOB not encoded by the recorder/player the TS 2 _VOB can be output directly without decoding through the digital interface 204 to an external STB for decoding and presentation via the STB.
When the digital data is directly output to an external device, the controller 211 determines whether random access playback is possible based on the map information shown in FIG. 15B. If the access point data flag (random access presentation flag) is valid, the access map contains I-picture location information. In this case the controller 211 is able to access and output digital data containing an I-picture to an external device through the digital interface in response to fast play and other requests from the external device. Furthermore, time-base access is also possible if the time access information flag is valid. In this case the controller 211 can access and output digital data including the picture data at a specified playback time to an external device through the digital interface in response to a time-base access request from an external device.
The configuration and operation of a DVD recorder according to the present invention for recording and reproducing an optical disc as described above is described next below with reference to FIG. 19.
As shown in FIG. 19 the DVD recorder has a user interface 222 for receiving user requests and displaying information and prompts to the user, a system controller 212 handling the overall management and control of the DVD recorder, an analog broadcast tuner 213 for receiving VHF and VHF broadcasts, an encoder 214 for converting analog signals to digital signals and encoding the digital signals to an MPEG program stream, a digital broadcast tuner 215 for receiving digital satellite broadcasts, an analyzer 216 for interpreting the MPEG transport stream sent from a digital satellite, a display unit 217 such as a television and speakers, and a decoder 218 for decoding the AV stream. The decoder 218 has first and second decoders, for example, such as shown in FIG. 18. The DVD recorder also has a digital interface 219 , track buffer 220 for temporarily storing write data, and a drive 221 for writing data to the disc. The digital interface 219 is an IEEE 1394 or other communications interface for outputting data to an external device.
With a DVD recorder thus comprised the user interface 222 first receives a request from the user. The user interface 222 then passes the request to the system controller 212 , and the system controller 212 interprets the user request and instructs the various modules to run appropriate processes.
Recording includes self-encoding in which the DVD recorder encodes the input digital data, and outside encoding for recording already encoded digital data to disc without further encoding.
Recording with self-encoding is described specifically first below using by way of example encoding and recording an analog broadcast to a PS_VOB stream.
The system controller 212 sends a receive command to the analog broadcast tuner 213 and an encode command to the encoder 214 .
The encoder 214 then video encodes, audio encodes, and system encodes the AV data from the analog broadcast tuner 213 , and passes the encoded data to the track buffer 220 .
Immediately after encoding starts, the encoder 214 sends the time stamp information at the beginning of the MPEG program stream being encoded to the system controller 212 as the playback start time (PS_VOB_V_S_PTM), and parallel to the encoding process sends the data required to create the access map to the system controller 212 . This value is set as the Start_PTM of the cell information shown in FIG. 17 and generated later. The time stamp information is generally the PTS, but the SCR can be used instead.
The system controller 212 then sends a record command to the drive 221 , and the drive 221 thus extracts and records data accumulated in the track buffer 220 to the DVD-RAM disc 100 . A contiguous data area (CDA) as described above is also found in the recordable area of the disc and the data is recorded to the located contiguous data area.
Recording typically ends when the user inputs a stop recording command. Stop recording commands from the user are input through the user interface 222 to the system controller 212 , and the system controller 212 then sends a stop command to the analog broadcast tuner 213 and encoder 214 .
The encoder 214 stops encoding when it receives the stop encoding cord from the system controller 212 , and sends the time stamp information of the last data in the last encoded MPEG program stream to the system controller 212 as the playback end time (PS_VOB_V_E_PTM). This value is set as the End_PTM of the cell information shown in FIG. 17. The PTS is normally used for the time stamp information but the SCR can be used instead.
After ending the encoding process, the system controller 212 generates the presentation control information and VOB information (PS_VOBI) for the PS_VOB shown in FIGS. 15A and 15B.
The VOB information generated here includes map management information and an access map appropriate to the object type. The system controller 212 sets the map validity information of the map management information to “valid,” and sets the self-encoding flag ON.
Original playback information (O_PGC information) as shown in FIG. 16A for the recorded object as one of the playback objects is generated as the presentation control information. This O_PGC information is added to the original playback path table. The original playback path (O_PGC information) contains cell information. The cell information Type is set to PS_VOB.
The system controller 212 then instructs the drive 221 to stop recording data accumulated in the track buffer 220 and to record the PS_VOB VOB information (PS_VOBI) and presentation control information. The drive 221 thus records the remaining data in the track buffer 220 and this information to the optical disc 100 , and the recording process ends.
It will be obvious that an analog broadcast could be encoded to TS 1 _VOB. In this case the encoder 214 must be an encoder for converting the analog signal to a digital signal and encoding the digital signal to the MPEG transport stream, and the cell information Type is set to TS 1 _VOB.
The PTS or PCR can be used for the Start_PTM and End_PTM.
Recording with outside encoding is described specifically next below with reference to recording a digital broadcast. The recorded object type in this case is TS 2 _VOB.
A digital broadcast recording request from the user is passed from the user interface 222 to the system controller 212 . The system controller 212 then instructs the digital broadcast tuner 215 to receive and instructs the analyzer 216 to interpret the received data.
An MPEG transport stream sent from the digital broadcast tuner 215 is passed through the analyzer 216 to the track buffer 220 .
To generate the VOB information (TS 2 _VOBI) of the encoded MPEG transport stream (TS 2 _VOB) received as a digital broadcast, the analyzer 216 first extracts the time stamp information at the beginning of the transport stream as the start time information (TS 2 _VOB_V_S_PTM) and sends it to the system controller 212 . This start time value is set as the Start_PTM of the cell information shown in FIG. 17 and generated later. The time stamp information is the PCR or PTS. The ATS indicating the timing at which the object is sent to the DVD recorder could alternatively be used.
The analyzer 216 then analyzes the system layer of the MPEG transport stream to detect the information needed for access map generation. The I-picture locations in the object are detected based on the random access indicator (random_access_indicator) in the adaptation field of the TS packet header as described above.
The system controller 212 then outputs a record command to the drive 221 , and the drive 221 thus extracts and records data accumulated in the track buffer 220 to the DVD-RAM disc 100 . The system controller 212 also instructs the drive 221 where to record on the disc based on the allocation data of the file system. A contiguous data area (CDA) as described above is also found in the recordable area of the disc and the data is recorded to the located contiguous data area.
Recording typically ends when the user inputs a stop recording command. Stop recording commands from the user are input through the user interface 222 to the system controller 212 , and the system controller 212 then sends a stop commend to the digital broadcast tuner 215 and analyzer 216 .
In response to the received stop command from the system controller 212 , the analyzer 216 stops interpreting the received data and sends the time stamp information at the end of the last interpreted MPEG transport stream to the system controller 212 as the playback end time (TS 2 _VOB_V_E_PTM)). This value is set as the End_PTM of the cell information shown in FIG. 17. The PCR or PTS is used for the time stamp information but the ATS indicating the time when the object was sent to the DVD recorder can be used instead.
After ending the digital broadcast reception process, the system controller 212 generates the presentation control information and VOB information (TS 2 _VOBI) for the TS 2 _VOB as shown in FIGS. 15A and 15B based on the information received from the analyzer 216 .
The VOB information generated here includes map management information and an access map appropriate to the object type. The system controller 212 sets the map validity information of the map management information to “valid” if the I-picture locations in the objects were detected and the access map could be generated. The self-encoding flag is set OFF. If a valid access map could not be generated the map validity information is set to an “invalid” state. Examples of when a valid access map cannot be generated include when a corresponding digital broadcast is not received and when there is no random access data set in the adaptation field. If the signal is input directly through the digital interface the signal may also not be an MPEG transport stream, and in this case, too, the map validity flag is set to “invalid.”
Original playback information (O_PGC information) as shown in FIGS. 16A and 16B for the recorded object as one of the playback objects is generated as the presentation control information. This O_PGC information is added to the original playback path table. The original playback path (O_PGC information) contains cell information. The cell information Type is set to TS 2 _VOB.
The system controller 212 then instructs the drive 221 to stop recording data accumulated in the track buffer 220 and to record the TS 2 _VOB VOB information (TS 2 _VOBI) and presentation control information. The drive 221 thus records the remaining data in the track buffer 220 and this information to the optical disc 100 , and the recording process ends.
While the above recording operations are described with reference to user-input recording start and end commands, it will be obvious that the same essential operation applies to timer recordings controlled by a VCR, for example. In this case the system controller automatically issues the recording start and end commands instead of the user, and there is no essential change in DVD recorder operation.
A data recording medium according to the present invention is a medium for recording data of various different formats, including analog broadcast or digital broadcast content and various types of data input through an analog/digital interface. A data recording apparatus according to the present invention is an apparatus for recording AV data to and reproducing AV data from the same data recording medium.
More particularly, externally input AV data is recorded as an MPEG_TS, and a stream adding decoder input time data for each MPEG_TS packet to each MPEG_TS packet is recorded to the data recording medium of the present invention.
Recorder specific or content specific information and the locations of PSI (Program Specific Information) packets containing MPEG_TS control information are also embedded as a user private stream (UP packet), and the decoder input time of each packet is added in a format suitable for accumulation.
Furthermore, so simplify conversion to an MPEG_PS when multiplexing the MPEG_TS, data less than one pack (2048 bytes) is system encoded as one continuous multiplexing unit, and an MPEG_TS is recorded while allocating each continuous multiplexing unit to one or plural MPEG_TS packets.
The basic recording and playback operations of a data recording and reproducing apparatus according to the present invention are substantially as described above, and only the basic operation for recording analog line input is therefore described specifically below with reference to FIG. 20. The recorded object type in this case is TS 1 _VOB.
Analog line input recording requests from a user are passed from the user interface 222 to the system controller 212 . The system controller 212 then sends a receive command to the line input unit 223 and a data encoding command to the encoder 214 .
The MPEG transport stream from the encoder 214 is sent to the track buffer 220 .
To generate the VOB information (TS 1 _VOBI) of the encoded MPEG transport stream (TS 1 _VOB), the encoder 214 first sets the time stamp information as the presentation start time (TS 1 _VOB_V_S_PTM) and sends it to the system controller 212 . This start time value is set as the Start_PTM of the cell information generated later and shown in FIG. 17. The time stamp information is the PCR or PTS.
The encoder 214 also generates the data needed for access map generation while generating the MPEG transport stream. This is done by, for example, storing the adaptation field in the first MPEG transport packet of the I-picture, setting the random_access_indicator bit, and notifying the system controller 212 of the start of a VOBU.
The system controller 212 then sends a record command to the drive 221 , and the drive 221 extracts and records data from the track buffer 220 to the DVD-RAM disc 100 . The system controller 212 also instructs the drive 221 where to record on the disc based on the allocation data of the file system. A contiguous data area (CDA) as described above is also found in the recordable area of the disc and the data is recorded to the located contiguous data area.
Recording typically ends when the user inputs a stop recording command. Stop recording commands from the user are input through the user interface 222 to the system controller 212 , and the system controller 212 then sends a stop command to the encoder 214 .
In response to the received stop command from the system controller 212 , the encoder 214 stops the encoding process and sends the time stamp information included in data at the end of the last encoded MPEG transport stream to the system controller 212 as the end presentation time (TS 1 _VOB_V_E_PTM). This value is set as the End_PTM of the cell information shown in FIG. 17. The time stamp information becomes PCR or PTS.
After ending the recording process, the system controller 212 generates the playback control information and VOB information (TS 1 _VOBI) for the TS 1 _VOB as shown in FIGS. 15A and 15B based on the information received from the encoder 214 .
The VOB information generated here includes an access map and map management information those adapted to the object type. The system controller 212 sets the map validity information of the map management information to “valid”. The self-encoding flag is set ON.
Original playback path information (O_PGC information) as shown in FIGS. 16A and 16B for the recorded object as one of the playback objects is generated as the presentation control information. This O_PGC information is added to the original playback path table. The original playback path information (O_PGC information) contains cell information. Type information of the cell information is set to “TS 1 _VOB”.
The system controller 212 then instructs the drive 221 to stop recording data accumulated in the track buffer 220 and to record the VOB information (TS 1 _VOBI) and playback control information for TS 1 _VOB. The drive 221 thus records the remaining data in the track buffer 220 and this information to the optical disc 100 , and the recording process ends.
The self-encoding MPEG transport stream generated by the encoder 214 is described in further detail below.
The structure of the self-encoding. MPEG transport stream is shown in FIGS. 21A and 21B. As shown in the figure the self-encoding MPEG transport stream is segmented into VOBU units. Each VOBU starts with a PAT packet, PMT packet, and a User Private packet (UP packet) embedded with stream-specific data. A PAT packet and PMT packet at least are also located at the beginning of the VOB.
As shown in FIG. 21B an ATS indicating the decoder input time is also added to each packet, and each packet is input to the decoder at the time intended by the ATS.
The self-encoding program information (such as the PMT packet PID) is stored to the PAT packet of the first packet and input to the decoder at the time indicated by ATS 1 .
The PID for each elementary stream composing the program is stored to the PMT packet of the second packet. In this example PIDs for the video, audio, data broadcast (“Data” in the figure), and user private (“private” in the figure) packets are stored.
Information added to the stream is stored to the user private packet in the third packet. This added information could, for example, include: stream title information; recording date and time information; stream attributes, that is, stream encoding information such as the bit rate, video resolution, frame rate, aspect ratio, or encoding method; input source identification information for identifying whether the line input is analog or digital; information indicating the AV data encoding method if the data is digital; copyright protection information indicating whether copying is allowed or prohibited; Vertical Blanking Interval (VBI) signals such as closed caption (CC) data, teletext data, or Wide_Screen Signaling (WSS) data used for display control; information indicating system encoding conditions; DVD standard compatibility information; menu information provided for user convenience using specific data provided by the manufacturer that recorded the stream; and data useful for conversion to various DVD standard MPEG program streams (MPEG_PS).
The decoder input time for a packet stored in this added information and located in the MPEG transport stream as above is described next with reference to FIGS. 22A and 22B.
FIG. 22A is a block diagram showing the basic configuration of a decoder referred to as a transport stream system target decoder (T_STD). This figure further shows a system decoder 235 for interpreting a PSI packet and providing decoder control (not described above).
When a PAT (PSI packet), or PMT packet, as PSI packet, is input to the T_STD, the packet is discriminated according to packet type by demultiplexer 232 , and the PSI packet which is used for system control is sent immediately to a transport buffer 233 .
Data accumulated in the transport buffer 233 is then streamed to the system buffer 234 at a rate of 1,000,000 bits/second (=Rsys).
The PSI data becomes valid the moment the required PSI data is accumulated in the system buffer 234 .
This T_STD model in MPEG thus defines an operating model for the decoder and defines standards for the MPEG transport stream transfer rate, for example.
There are several restrictions on PSI packet transfer because the data recording apparatus must self-encode the transport stream according to an MPEG transport stream format that assures the T_STD can correctly decode the transport stream. A method of determining the ATS that determines the packet transfer rate is described next with reference to FIG. 22B.
When reproducing a self-encoding stream the leading PAT, PMT, and UP packets are input to the T_STD at the time indicated by ATS 1 , ATS 2 , and ATS 3 , respectively.
The PMT packet and UP packet are now considered, in order to interpret, by the T_STD, the PID of the UP packet specified by the PMT packet and valid it, the last byte (byte m) of the TS_program_map_section must be stored in the system buffer 234 .
That is, for the PMT to be valid (m+n+5)×8/Rsys seconds must have passed from ATS 2 as the PMT packet input time. Note that n is the byte length of the PMT packet adaptation_field.
Because the System Clock Frequency (SCF) as the T_STD reference clock is 27,000,000 Hz (with a defined tolerance range of ±810 Hz for error), the following relationship between ATS 3 and ATS 2 must be true if the ATS is a time expressed to the precision of the System Clock Frequency.
ATS3≧ATS2+(( m+n+ 5)*8/Rsys)*SCF
Because the shortest interval between ATS 2 and ATS 3 is only when there is no adaptation_field (n=0) in the PMT packet and the smallest TS_program_map_section (21 bytes) is stored in the PMT packet, a time interval of 208/Rsys×SCF is shortest.
The following relationship is likewise required for the input time ATS 1 of the PAT packet and input time ATS 2 of the PMT packet
ATS2≧ATS1+(( m 0+ n 0+5)*8/Rsys)*SCF
where m0 is the byte length of the Program association section in the PAT packet, and n0 is the byte length of the adaptation_field in the PAT packet.
Furthermore, because the shortest interval between ATS 1 and ATS 2 is only when there is no adaptation_field (n=0) in the PAT packet and the smallest Program association section (16 bytes) is stored to the PAT packet, a time interval of 168/Rsys×SCF is shortest.
If time is expressed with a precision of 27 MHz using a System Clock Frequency (SCF) of 27 MHz, the shortest time interval between ATS 1 and ATS 2 and between ATS 2 and ATS 3 is 4536 and 5616, respectively.
Storing the User Private packet to the self-encoding transport stream is described next with reference to FIGS. 23 to 26 .
FIG. 23 shows storing the UP packet when the UP packet is defined as a User Private stream. In this case, an identification number greater than or equal to “0x80” and less than or equal to “0xFF” is allocated to stream_type of the PMT corresponding to the UP packet. A unique PID is assigned to the UP packet. The internal data structure of the UP packet does not conform to the MPEG standard. Note that in this example the UP packet includes a section structure called the DVD_attribute_section( ).
FIG. 24 shows a further storage method whereby a private_section structure is included in the UP packet and a unique PID is assigned. The data structure of the private_section will vary somewhat according to the value of the section syntax indicator in the private_section, but data specific to the UP packet is stored in the private_data_byte of the private_section. In this case, identification number of 0x00 is assigned to stream_type.
FIG. 25 shows a method of storing a UP packet as a packet with the same PID as the PMT packet. In this case the UP packet data structure conforms to the private_section structure. The stream type is not defined, and PID of PMT packet is assigned to UP packet.
FIG. 26 shows an example in which the UP packet is not stored separately but is enclosed in the PMT packet. In this case the specific data equivalent to the UP packet has a private_section structure, and the private_section is written after the TS_program_map_section. That is, PMT packet includes both TS_program_map_section and private_section.
The specific data stored to the MPEG_TS by the above-noted methods is described next.
As shown in FIGS. 23 to 26 , this specific data includes the Real-time Data Information General Information (RDI_GI) of the RDI Unit and the Display Control Information and Copy Control Information (DCI_CCI) of the DVD Video Recording standard.
The RDI_GI stores the first presentation start tire (VOBU_S_PMT) of the VOBU and the recording date and time information. The DCI_CCI stores, for example, the VOBU aspect ratio information, subtitle mode information, film or camera mode information and other information related to display control, copy generation management information, APS information, and input source information. (For further information about RDI_GI and DCI_CCI, see the DVD Video Recording standard.)
The V_ATR field stores the video bit rate, resolution, frame rate (or video format such as NTSC or PAL), aspect ratio, and encoding method (an MPEG2_Video or MPEG1_Video identifier).
Likewise, the A_ATR field stores the bit rate for all or part of the audio, encoding method, channel count, quantization bits, and dynamic range control information according to the number of audio streams.
The CC field stores the closed caption data for the VOBU. To improve the transferability of PS conversion, closed caption data can be written in an extension_and_user_data (1) format (a method of storing user data to the GOP layer), or the closed caption data could be written separately.
Storing the closed caption data to the user data of the GOP layer improves MPEG_PS conversion efficiency because the DVD Video and DVD Video Recording standards are defined for this purpose.
The C_SE field stores information relating to some problems associated with TS2PS conversion of the VOBU or VOB.
Regarding the CC, WSS, or teletext data storage location information, that information indicates whether, for example, closed caption data is contained in the UP packet, whether closed caption data is written as user data to the picture headers, or whether there is no closed caption data in the particular VOBU (or VOB).
Regarding the WSS storage location information, that information further indicates whether it is stored as specific data in the UP packet, or whether it is written to the user data in the picture headers.
Regarding the teletext storage location information, it indicates whether a TS packet is provided for storing the teletext data, or whether it is written to the user data in the picture headers.
Regarding the multiplexed block structure and transfer information, that information includes information indicating if the number of TS packets in the multiplex block (a data block in which only one elementary stream is stored without being mixed with another elementary stream) as shown in FIGS. 27A to 27 H is fixed or variable, the number of packets if the number is fixed, information indicating whether a PTS/DTS is added to the first TS packet in the multiplex block, or the transfer rate within the same multiplex block. During MPEG_TS encoding imposing no conditions on conventional multiplexing, the multiplex block can be written with a fixed length including only one TS packet.
The decoder buffer control information includes vbv_delay, a parameter of the video verifying buffer, information such as vbv_buffer_size indicating the remaining video buffer capacity (this information is used to determine how far ahead of the ATS input time the video data can be read), and the time difference between the decoding time and the input completion time of the VOBU frame for which the buffer input time is closest to the frame decoding time (this information is used to determine how far back from the ATS input time the video or audio data can be read).
The DVD_Compatibility information indicates the overhead involved with system transcoding a MPEG_TS to a MPEG_PS conforming to a DVD standard.
The DVD_Compatibility information indicates how easy it is to convert a MPEG_TS to other DVD formats. For example, if the multiplex blocks are 2 KB or less, a level 1 indicator is set; if there is closed caption, WSS, or teletext data, the closed caption or WSS data is stored to an UP packet, and the teletext data is stored as a teletext packet in a multiplex block storing video data, a level 2 indicator is set; if it is not necessary to consider buffer management when the closed caption, WSS, or teletext data is stored to the area specified by the DVD standard, a level 3 indicator is set; and if it is not necessary to consider buffer management when the ATS of the first TS packet in the multiplex block is replaced by the SCR, a level 4 indicator is set.
This DVD_Compatibility information is thus a data set indicating the ease of convertibility to various DVD formats, including DVD Video, DVD Audio, DVD Video Recording, and DVD Stream Recording.
FIGS. 27A to 27 H show the structure of an MPEG_TS using multiplex blocks, and the data structure when this MPEG_TS is converted to DVD Video and DVD Video Recording formats.
The self-encoded TS stream shown in FIG. 27A comprises the VOBU (playback and decoding units) of the self-encoded TS stream shown in FIG. 27B. As shown in FIG. 27C one VOBU includes multiple multiplex blocks (corresponding to MPEG_PS packs). Each multiplex block can be segmented into fixed length data units as shown in FIG. 27D (enabling easy packaging in the device) or into variable length data units as shown in FIG. 27E (thereby consuming less disc space). In the cases shown in FIGS. 27D and 27E, the multiplex blocks are respectively formed by segmenting non-elementary steams such as PSI/SI packets or UP packets and the elementary stream, but as shown in FIG. 27F a multiplex block could store both an elementary stream and non-elementary stream objects such as PSI/SI packets or UP packets. Note that in FIG. 27F multiplex block # 1 and multiplex block # 2 are one multiplex block.
The above streams can be easily converted to the DVD Video format shown in FIG. 27G or the DVD Video Recording format shown in FIG. 27H.
In this case it is important for simple TS2PS conversion that the MPEG_PS packs are formed in the multiplex block sequence and one multiplex block is the unit storing one pack of data.
It should be noted that the capsule header and ATS are only loosely related to the present invention and are therefore omitted in FIGS. 27A to 27 H. In addition, the packs in the converted MPEG_PS shown in FIGS. 27G and 27H are also stuffed or padded as appropriate according to the byte length and VOBU alignment of the stored elementary.
FIGS. 28A to 28 G describes the multiplexing method of the present invention, comparing with the conventional stream multiplexing method shown in FIG. 8. As shown in the figure the final format conforms to the MPEG_TS format shown in FIG. 28G. The video stream (FIG. 28A) comprises plural GOP (FIG. 28B). Each GOP contains specific picture data, and a TS packet group of a data size equivalent to the data size of one pack when converted to an MPEG_PS is one multiplex block (FIG. 28C). That is, one multiplex block is segmented into plural TS packets equivalent to the data size of one pack as shown in FIG. 28D. The audio stream is likewise packed in one multiplex block group having a plurality of TS packets. As shown in FIG. 28E, a VOBU is formed by multiplexing by multiplex block unit. The greatest difference between the present invention and the prior art shown in FIG. 8 is in that data units of a size equivalent to the data size of one MPEG_PS pack are grouped to form the multiplex blocks (see FIG. 28E)).
Furthermore, the ATS may be added to each MPEG_TS packet while increased by a specific amount (ΔATS) in each packet within the same multiplex block as shown in FIG. 29. This is effective to avoid complex buffer management during TS2PS conversion, and convert ATS to SCR using a simple offset or no offset. ATSi (i=0, 1, 2 . . . ) in this case satisfies the following equation.
ATSi+(packet count in the multiplex block)×ΔATS≦ATSi+1
When the multiplex block is a fixed length, the number of TS packets in one multiplex block is fixed and thus the multiplex block boundaries are easily known. However, when the multiplex block is variable length, the number of TS packets in one multiplex block is also variable and thus the multiplex block boundaries are not easily known. Therefore, the increase (BATS) in the ATS at the multiplex block boundary is set to a specific value different from the (constant) increase within the multiplex block. That is, the difference (ΔATS) between the ATS of the last packet in the previous multiplex block and the ATS of the first packet in the immediately following multiplex block is set to a specific value which is not the constant value. This makes it possible to know the multiplex block boundaries by monitoring ΔATS. A 1:1 correlation between packs and TS packets when converting to an MPEG_PS can therefore be assured. ATSi in this case satisfies the following equation.
ATSi+(packet count in the multiplex block)×ΔATS<ATSi+1
Furthermore, the ATSi added to the first packet in the MPEG_TS multiplex block corresponds to SCRi added to each pack in the MPEG_PS after conversion.
Furthermore, as also shown in FIG. 29, closed caption, DSI, or other text information can also be stored in the UP packet. The DSI in the UP packet is used to generate NV_PCK data after conversion, and the closed caption data is stored to the video pack. To enable compatibility with the PAL standard used in Europe, packets storing teletext data in the multiplex block can be inserted between the video data packets as shown in FIG. 30. In this case the teletext data packets are located immediately before the simultaneously presented picture having the same PTS. After conversion the teletext data is stored to the video pack.
FIG. 31 shows the data structure of a UP packet storing the DSI as described above.
Information (such as a relative number from the beginning of the VOBU) identifying the TS packet storing the last byte of the first I-picture in the VOBU can also be described in the added information of the UP packet to enable efficient special playback modes. Special playback modes can also be supported by also describing picture encoding type information of same of I- and P-pictures or all pictures in the VOBU, the data size of each picture (such as information identifying the TS packet containing the last byte), and information indicative of the DTS/PTS for each picture.
It should be noted that if encoding is done so that TS packet containing the PTS/DTS is located at the beginning of the multiplex block in the present embodiment, the beginning of an access unit will be located at the beginning of the packs after TS2PS conversion, and simplified DVD-specific header processing can be expected.
To prevent an overflow of data stored to MPEG_PS packs and ease conversion to an MPEG_PS, the TS packets of the multiplex blocks can be appropriately stuffed or a necessary number of stuffing bytes can be inserted after the last TS packet in the multiplex block.
The present embodiment has been described primarily with reference to recording to DVD, but the invention will obviously not be so limited. More specifically, after recording a self-encoded transport stream to a hard disk, semiconductor memory, or other data recording medium, a stream converted to an MPEG program stream can be recorded to the same medium or to a different medium.
Furthermore, the PAT, PMT, and UP packets are described as recorded to the beginning of each VOBU in the present embodiment, but they can be recorded to the beginning of at least a VOB or to the beginning of a Cell which is the playback management unit.
Yet further, this embodiment is described recording PAT, PMT, and UP packets, but the UP packet can be omitted.
Yet further, the PAT, PMT, and UP packets are described as fixed at the beginning in the present embodiment, but the invention shall not be so limited, and a packet storing a Null packet can be recorded inserted therebetween.
Yet further, a self-encoded stream is described starting from a PAT packet, but the invention shall not be so limited and the stream could start from a Null packet.
Furthermore, the system transfer rate can be set to a fixed rate by appropriately inserting Null packets in the self-encoded stream.
It should also be noted that a data area for storing manufacturer-private information can be provided as shown in FIG. 7, and MPEG_TS system encoding conditions can be written to this data area.
It should also be noted that all or part of the information written to the UP packet in the above embodiment can be written to the TS 1 _VOB information shown in FIG. 15.
It will also be noted that the DVD Video format does not allow for dual mono audio. It is, however, possible to convert a self-encoding transport stream recorded with dual mono audio channels to the DVD Video format by separating the dual mono audio channels into two separate audio streams recorded as left and right monaural audio channels.
Part or all of the parameters written to the UP packet in the above embodiment could also be written into the management information. By thus avoiding recording a parameter that does not change within a self-encoding transport stream multiple times, recording space is not wasted and the decoder does not need to waste processing time trying to determine whether or not the parameter changed each tire a UP packet is detected.
<Encoder Configuration>
An alternative embodiment of the present invention is described next below. The description is made to an encoder of a data recording apparatus according to the present invention by focusing first the encoding process to receive and self-encode AV input to an MPEG transport stream.
FIG. 33 shows the configuration of the encoder in a data recording apparatus according to the present invention. As shown in the figure, the encoder 214 includes elementary stream encoders 230 a , 230 b and 230 c , and a system encoder 232 . The encoder 214 receives a control signal from the system controller 212 and then runs the encoding process with the elementary stream encoders 230 a , 230 b and 230 c , or the system encoder 232 while switching between elementary encoding and system encoding. Each of the elementary stream encoders 230 a , 230 b and 230 c receives video, audio, and VBI (Vertical Blanking Interval) signals for encoding.
The video encoder 230 a receives a control signal from the system controller 212 and based thereon encodes the bit rate, resolution, aspect ratio, and other attributes of the video stream within a predefined range. More specifically, the video encoder 230 a receives a control signal from the system controller 212 specifying the operating mode as the “DVD Video compatible mode,” DVD Video Recording compatible mode,” or “normal mode.” If the mode specified by the control signal is the DVD Video compatible mode, the video encoder 230 a generates a video stream conforming to the video attributes of the DVD Video standard; if the DVD Video Recording compatibility mode, it generates a video stream conforming to the video attributes of the DVD Video Recording (“DVD VR” below) standard; and if the normal mode, generates a video stream conforming to a specific attribute range.
The audio encoder 230 b likewise receives a control signal from the system controller 212 and based thereon encodes the bit rate, quantization rate, channel count, and other attributes of the audio stream within a predefined range. Like the video encoder 230 a , the audio encoder 230 b specifically receives a control signal from the system controller 212 specifying the operating mode. If the mode specified by the control signal is the DVD Video compatibility mode, the audio encoder 230 b generates an audio stream conforming to the audio attributes of the DVD Video standard; if the DVD VR compatibility mode, it generates an audio stream conforming to the audio attributes of the DVD Video Recording (“DVD VR” below) standard; and if the normal mode, generates an audio stream conforming to a specific attribute range.
The VBI data encoder 230 c likewise receives a control signal specifying the operating mode from the system controller 212 and encodes the VBI data accordingly. Specifically, if the elementary stream encoding control signal input from the system controller 212 to the VBI data encoder 230 c indicates the DVD Video compatible mode or DVD VR compatible mode, it additionally encodes VBI data according to the VBI data storage method specified by the respective standards. There is a case that a VBI data storage method is separately defined even in the original normal mode, and in that case “additionally encode” means that VBI data is redundantly stored to the elementary stream.
The encoded elementary streams are then multiplexed to the MPEG_TS system stream by the system encoder 232 .
Like the elementary stream encoders 230 a , 230 b and 230 c , the system encoder 232 also receives an encoding control signal from the system controller 212 to encode according to the received signal.
The control signal from the system controller 212 to the system stream encoder 232 is either a system encoding control signal for encoding a normal MPEG_TS, or a system encoding control signal (DVD-Video mode or DVD-Video Recording mode) applying constraints on the normal MPEG_TS in order to enable easy conversion to an MPEG_PS (particularly a specific DVD format).
If the control signal is for encoding a normal MPEG_TS, the system stream encoder 232 applies the system encoding to the elementary streams input from the elementary stream encoders 230 a , 230 b and 230 c while managing the buffers so that the input streams are not corrupted by the decoder model (T_STD) which is a reference for the MPEG_TS system stream.
If the control signal from the system controller 212 is a control signal specifying system encoding to an MPEG_TS enabling easy conversion to an MPEG_PS, the encoding is conducted while also following additional special system encoding rules.
The encoder 214 then outputs the resulting self-encoding MPEG_TS system stream.
The data recording apparatus according to the present invention is thus characterized by switching the encoding mode at the elementary stream and system stream encoding levels. The processes applied in each encoding mode to convert to a particular DVD format when the encoding mode is changed as described above are shown in the table in FIG. 34.
An MPEG_TS enabling easy conversion to an MPEG_PS is thus generated by driving the elementary stream encoders 230 a , 230 b and 230 c and system encoder 232 to encode the respective streams assuming the conversion to an MPEG_PS.
<A Self-Encoded MPEG_TS>
A detailed embodiment of the format of an MPEG_TS self-encoded by a data recording apparatus according to the present invention is described next below. The differences between a normal MPEG_TS (“SESF” below) and an MPEG_TS enabling easy conversion to an MPEG_PS (a “Constrained SESF” below) are also described.
In the following example, information presenting the stream encoding conditions is stored to the VOBI storing attributes and other information in MPEG_TS stream units. By thus storing information about the encoding conditions to the management information and not in the stream, it is possible to quickly determine whether a stream can be easily converted to a DVD Video or DVD VR format without analyzing the stream. Note that this information presenting the stream encoding conditions can be stored to a Tip packet which is described further below.
The information presenting the stream encoding conditions is represented by an “encode_condition” flag which has two bits. The flag value is described below.
00b; normal MPEG_TS (SESF)
01b: MPEG_TS enabling easy conversion to a DVD VR stream format (Constrained SESF)
10b: reserved
11b: MPEG_TS enabling easy conversion to a DVD Video stream format (Constrained SESF)
Two cases are possible if the encode_condition flag is set to 00b in the stream management information: the stream is originally encoded without considering high speed conversion to MPEG_PS, and a sequence of MPEG program stream are linked by user editing for easy conversion to individual MPEG program stream.
If the encode_condition flag is also set in the stream, it is meaningless to set encode_condition=00b indicating a normal MPEG_TS in the stream. It is therefore also possible for the encode_condition flag to be used differently inside and outside the stream, reserving the encode_condition=00b setting so that it is not used in the stream (in the Tip packet described below).
By thus setting this flag, it is possible to determine from the value of the VOBI encode_condition field whether the stream can be easily converted to a DVD Video or DVD VR format. “Easily converted” as used herein means convertible by the conversion method described further below.
<Constrained SESF Stream Structure>
FIG. 80 shows the complete stream structure of a Constrained SESF. A Constrained SESF includes plural SESF capsules. An SESF capsule contains specific multiplexing units, and a Tip packet (detailed below) at the head. The presentation tire stamp (PTS) of each SESF capsule and an address of the Tip packet are correlated in the access map 80 c . As described below, for TS2PS conversion, a conversion process is accomplished in SESF capsule units.
FIG. 32 shows the correlation between MPEG_PS packs and packets in on SESF capsule. As shown in FIG. 32 a TS packet (Tip packet below) storing stream-specific information is inserted to a Constrained SESF. The Tip packet embedded in a Constrained SESF is described below with reference to FIG. 35 to FIG. 41.
<Tip Packet>
FIG. 35 shows the overall Tip packet structure. As shown in FIG. 35, a Tip packet stores a Data_ID identifying the packet as a Tip packet, display and copy info corresponding to the DVD VR DCI_CCI field and including display control and copy control information, encode_info storing stream encoding information, and MakersPrivateData for storing additional information unique to the manufacturer.
As shown in FIG. 35 and FIG. 36, the PCR value needed for the SCR calculations described below is written to the adaptation field of the Tip packet. This adaptation field is a fixed byte length, and thereby enables accessing information in the Tip packet using a fixed address.
The Data_ID structure is shown in FIG. 37. The Data_ID has a Data_Identifier for identifying whether the corresponding packet is a Tip packet. The Data_Identifier is a 3-byte field holding a value of “0x544950” expressing “TIP” in the ASCII code. The decoder of the playback device reads the value of this field to identify that it is a Tip packet.
The display_and_copy_info structure is shown in FIG. 38. Generating the RDI pack when converting a Constrained SESF to the DVD VR format is simplified by providing the same structure and information an the DCI_CCI field of the RDI Unit in the DVD VR standard in display_and_copy_info. (Note that the DCI_CCI field of the DVD VR standard is fully described in “DVD Specifications for Rewritable/Re-recordable Disc, Part 3, VIDEO RECORDING,” and in Japanese patent No. 3162044. While some of the field names are different in these documents, the field definitions are the same so as to enable direct copying when converting to the DVD VR format.)
The encode_info field structure is shown in FIG. 39. Resolution information for the video stream following the Tip packet is written to the video resolution field. The value of encode_info is shown below.
0000b: 720×480(NTSC), 720×576(PAL)
0001b: 704×480(NTSC), 704×576(PAL)
0010b: 352×480(NTSC), 352×576(PAL)
0011b: 352×240(NTSC), 352×288(PAL)
0100b: 544×480(NTSC), 544×576(PAL)
0101b: 480×480(NTSC), 480×576(PAL)
Others: reserved
Resolution can vary during a single continuous recording in the DVD VR format. However, streams of different resolutions are managed as separate VOBs and it assures seamless connection during playback by a certain recorder. This field is therefore used to determine where it is necessary to split the VOB when converting to the DVD VR format, if there is a resolution change during Constrained SESF recording.
In a Constrained SESF recorded with consideration for converting to the DVD Video format (encode_condition=11b), the resolution does not change within a single stream.
The encode_condition field is the sane as the value stored to the VOBI (except when 00b). The reason why the encode_condition field is stored and embedded in the stream and not only in the stream management information is to enable the recorder to easily determine if it is possible to convert to the DVD format by referencing the encode_condition field in the Tip packet when, for example, a stream is copied through a digital interface such as IEEE 1394.
VOBU_S_PTM of the DVD VR standard is recorded to the FVFPST field. This is to eliminate the process of analyzing the video stream encoded after the Tip packet and calculating the presentation time of the first appearing video field when converting a Constrained SESF to a DVD Video or VR format.
The FVFPST field includes a 32-bit field denoting the video field presentation time at 90 KHz precision, and a 16-bit field denoted at 27 MHz precision.
The PES_info structure is shown in FIG. 40. PES_info is needed to convert a Constrained SESF to the DVD Video format without analyzing the elementary streams. This information is needed to generate the information inserted to the DVD Video stream and stored in the packs, referred to as NV_PCK, supporting special playback modes.
The PES_info can store information for 136 PES packets each storing video data or audio data units. Four bits are assigned to each PES packet, and the NV_PCK information can be generated without analyzing PES packet content. PES packets not storing video or audio data are ignored.
In a SESF capsule being the data unit from one Tip packet to the packet immediately preceding the next Tip packet, a PES_existence_flag declares if the j-th PES packet is present in the SESF capsule. The value of PES_existence_flag is set as follows.
0b: j-th PES packet is not in the SESF capsule
1b: j-th PES packet is in the SESF capsule
If the PES_extension_flag=0b (when there is no PES packet), all remaining fields in the PES packet are set to 0b.
The PES_payload_identifier identifies whether the data stored in the PES packet is video or audio data. PES_payload_identifier values are set as follows.
0b: video stream
1b: audio stream
The PES_existence_flag and PES_payload_identifier fields are set for all relevant PES packets.
When it is determined from the PES_payload_identifier whether video or audio data is stored, the remaining field definitions vary according to the type of stream stored in the PES packet.
If the PES packet stores a video stream (PES_payload_identifier=0b), picture_coding_type indicating the type of picture stored in the PES packet is defined after the PES_payload_identifier field.
The value of the picture_coding_type field is set as follows.
00b: a picture encoded with encoding other than 01b or 10b
01b: a frame encoded I-picture; a pair of field encoded I-pictures; or a pair of field encoded I-picture and field encoded P-picture
10b: a pair of frame encoded P-pictures or a pair of field encoded P-pictures
11b: reserved
In other words, a picture with 01b or 10b is a picture used as the reference picture defined by the DVD Video standard. The above description is for information added to PES packets storing video.
If the PES packet stores an audio stream (PES_payload_identifier=1b), the PES_payload_identifier is followed by a stream_identifier and a sync_presentation_flag. The stream_identifier identifies whether the audio stream in the PES packet is a first audio stream or a second audio stream. The sync_presentation_flag is a flag to identify whether there is an audio frame for which presentation begins simultaneously to or immediately following the FVFPST field (the presentation start time of the video field presented first) written to each Tip packet.
The value of stream identifier is set as follows.
0b: first audio stream
1b: second audio stream
The first and second audio stream can be discriminated by the PID setting rules and the order of elementary stream declaration in the PMT.
The value of sync_presentation_flag is set as follows.
0b: an audio frame for which presentation begins simultaneously to or immediately following the FVFPST is not stored in the audio PES packet
1b: an audio frame for which presentation begins simultaneously to or immediately following the FVFPST is stored in the audio PES packet
Information added to PES packets storing audio is as described above.
The PES_info field thus extracts and stores information for each PES packet following a Tip packet.
FIG. 41 shows the MakersPrivateData. As shown in the figure, the MakersPrivateData has a maker_ID field identifying the manufacturer of the Constrained SESF, and maker_private_data field containing specific additional information described by the manufacturer.
FIGS. 42A and 42B shows an example of a value of PID of the Tip packet and a value of stream_type indicating the stream type. Other PID and stream_type values are reserved by the MPEG standard and other standards, and these values were selected to indicate private data beyond the scope of the MPEG standard without interfering with reserved values.
Various stream attribute information is thus extracted and stored to the Tip packets stored in a Constrained SESF. How the fields described above are used during conversion to a different DVD format is descried in further detail below.
<System Encoding Conditions>
The system encoding conditions for a Constrained SESF, are described in detail next below. Note that the following system encoding conditions do not apply to a normal SESF.
<Multiplexing Unit>
TS packet storing elementary Streams in a Constrained SESF is composed of a Multiplexing Unit which is a unit of data stored in 2 KB packs according to a DVD format. Note that this Multiplexing Unit corresponds to the multiplex block of the first embodiment.
Only TS packets storing one type of elementary stream are stored to each Multiplexing Unit, and these TS packets are not mixed with TS packets storing another type of elementary stream. Mixing TS packets with Null packets is not prohibited because it may be necessary to include one or more Null packets in order to generate a Multiplexing Unit (such as the Multiplexing Unit storing the last part of a stream). This is necessary to clarify the relationship between Multiplexing Units and packs.
One Multiplexing Unit contains eleven continuous TS packets, and the elementary stream (payload data) in