<pre>
DRAFT TIFF Technical Note #2                            17-Mar-95
============================

This Technical Note describes serious problems that have been found in
TIFF 6.0's design for embedding JPEG-compressed data in TIFF (Section 22
of the TIFF 6.0 spec of 3 June 1992).  A replacement TIFF/JPEG
specification is given.  Some corrections to Section 21 are also given.

To permit TIFF implementations to continue to read existing files, the 6.0
JPEG fields and tag values will remain reserved indefinitely.  However,
TIFF writers are strongly discouraged from using the 6.0 JPEG design.  It
is expected that the next full release of the TIFF specification will not
describe the old design at all, except to note that certain tag numbers
are reserved.  The existing Section 22 will be replaced by the
specification text given in the second part of this Tech Note.


Problems in TIFF 6.0 JPEG
=========================

Abandoning a published spec is not a step to be taken lightly.  This
section summarizes the reasons that have forced this decision.
TIFF 6.0's JPEG design suffers from design errors and limitations,
ambiguities, and unnecessary complexity.


Design errors and limitations
-----------------------------

The fundamental design error in the existing Section 22 is that JPEG's
various tables and parameters are broken out as separate fields which the
TIFF control logic must manage.  This is bad software engineering: that
information should be treated as private to the JPEG codec
(compressor/decompressor).  Worse, the fields themselves are specified
without sufficient thought for future extension and without regard to
well-established TIFF conventions.  Here are some of the significant
problems:

* The JPEGxxTables fields do not store the table data directly in the
IFD/field structure; rather, the fields hold pointers to information
elsewhere in the file.  This requires special-purpose code to be added to
*every* TIFF-manipulating application, whether it needs to decode JPEG
image data or not.  Even a trivial TIFF editor, for example a program to
add an ImageDescription field to a TIFF file, must be explicitly aware of
the internal structure of the JPEG-related tables, or else it will probably
break the file.  Every other auxiliary field in the TIFF spec contains
data, not pointers, and can be copied or relocated by standard code that
doesn't know anything about the particular field.  This is a crucial
property of the TIFF format that must not be given up.

* To manipulate these fields, the TIFF control logic is required to know a
great deal about JPEG details, for example such arcana as how to compute
the length of a Huffman code table --- the length is not supplied in the
field structure and can only be found by inspecting the table contents.
This is again a violation of good software practice.  Moreover, it will
prevent easy adoption of future JPEG extensions that might change these
low-level details.

* The design neglects the fact that baseline JPEG codecs support only two
sets of Huffman tables: it specifies a separate table for each color
component.  This implies that encoders must waste space (by storing
duplicate Huffman tables) or else violate the well-founded TIFF convention
that prohibits duplicate pointers.  Furthermore, baseline decoders must
test to find out which tables are identical, a waste of time and code
space.

* The JPEGInterchangeFormat field also violates TIFF's proscription against
duplicate pointers: the normal strip/tile pointers are expected to point
into the larger data area pointed to by JPEGInterchangeFormat.  All TIFF
editing applications must be specifically aware of this relationship, since
they must maintain it or else delete the JPEGInterchangeFormat field.  The
JPEGxxTables fields are also likely to point into the JPEGInterchangeFormat
area, creating additional pointer relationships that must be maintained.

* The JPEGQTables field is fixed at a byte per table entry; there is no
way to support 16-bit quantization values.  This is a serious impediment
to extending TIFF to use 12-bit JPEG.

* The 6.0 design cannot support using different quantization tables in
different strips/tiles of an image (so as to encode some areas at higher
quality than others).  Furthermore, since quantization tables are tied
one-for-one to color components, the design cannot support table switching
options that are likely to be added in future JPEG revisions.
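
To make the Huffman table point above concrete: in the ISO JPEG layout, a
Huffman table specification is 16 count bytes (the number of codes of each
length from 1 to 16) followed by one symbol byte per code.  The following
Python sketch shows the length computation that 6.0-style TIFF control
logic is forced to perform.  The function name and the sample table are
hypothetical and are not part of any specification.

```python
def huffman_table_length(data: bytes, offset: int) -> int:
    """Return the byte length of a Huffman table spec starting at offset.

    Assumed layout: 16 count bytes (codes of length 1..16), then one
    symbol byte for every code counted.
    """
    counts = data[offset:offset + 16]
    if len(counts) != 16:
        raise ValueError("truncated Huffman table")
    # The length is not stored anywhere; it must be derived from the counts.
    return 16 + sum(counts)


# Hypothetical table: 2 codes of length 2 and 3 codes of length 3.
table = bytes([0, 2, 3] + [0] * 13) + bytes([0, 1, 2, 3, 4])
print(huffman_table_length(table, 0))
```

A generic TIFF editor has no reason to contain code like this, which is
the objection raised above.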


Ambiguities
-----------

Several incompatible interpretations are possible for 6.0's treatment of
JPEG restart markers:

  * It is unclear whether restart markers must be omitted at TIFF segment
    (strip/tile) boundaries, or whether they are optional.

  * It is unclear whether the segment size is required to be chosen as
    a multiple of the specified restart interval (if any); perhaps the
    JPEG codec is supposed to be reset at each segment boundary as if
    there were a restart marker there, even if the boundary does not fall
    at a multiple of the nominal restart interval.

  * The spec fails to address the question of restart marker numbering:
    do the numbers begin again within each segment, or not?

That last point is particularly nasty.  If we make numbering begin again
within each segment, we give up the ability to impose a TIFF strip/tile
structure on an existing JPEG datastream with restarts (which was clearly a
goal of Section 22's authors).  But the other choice interferes with random
access to the image segments: a reader must compute the first restart
number to be expected within a segment, and must have a way to reset its
JPEG decoder to expect a nonzero restart number first.  This may not even
be possible with some JPEG chips.
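
As a rough illustration of the computation a reader would face under the
"continuous numbering" interpretation, consider the sketch below.  It
rests on assumptions that 6.0 does not guarantee: every segment holds the
same whole number of restart intervals, and RSTn numbers cycle modulo 8
through the whole image as they do in a single ISO JPEG scan.  The
function name is hypothetical.

```python
def first_restart_number(segment_index: int, intervals_per_segment: int) -> int:
    """First RSTn number (0..7) found inside the given segment.

    Assumes numbering is NOT reset per segment.  Restart interval j
    (counting from 0 over the whole image) is followed by RST(j mod 8),
    so the marker after the first interval of a segment depends on how
    many intervals precede that segment.
    """
    intervals_before = segment_index * intervals_per_segment
    return intervals_before % 8


# Segment 3 of an image with 5 restart intervals per segment.
print(first_restart_number(3, 5))
```

Only segment 0 is certain to start at RST0 under this interpretation, so
a decoder that cannot be told to expect a nonzero first restart number
cannot decode the other segments independently.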

The tile height restriction found on page 104 contradicts Section 15's
general description of tiles.  For an image that is not vertically
downsampled, page 104 specifies a tile height of one MCU or 8 pixels; but
Section 15 requires tiles to be a multiple of 16 pixels high.

This Tech Note does not attempt to resolve these ambiguities, so
implementations that follow the 6.0 design should be aware that
inter-application compatibility problems are likely to arise.


Unnecessary complexity
----------------------

The 6.0 design creates problems for implementations that need to keep the
JPEG codec separate from the TIFF control logic --- for example, consider
using a JPEG chip that was not designed specifically for TIFF.  JPEG codecs
generally want to produce or consume a standard ISO JPEG datastream, not
just raw compressed data.  (If they were to handle raw data, a separate
out-of-band mechanism would be needed to load tables into the codec.)
With such a codec, the TIFF control logic must parse JPEG markers emitted
by the codec to create the TIFF table fields (when writing) or synthesize
JPEG markers from the TIFF fields to feed the codec (when reading).  This
means that the control logic must know a great deal more about JPEG details
than we would like.  The parsing and reconstruction of the markers also
represents a fair amount of unnecessary work.

Quite a few implementors have proposed writing "TIFF/JPEG" files in which
a standard JPEG datastream is simply dumped into the file and pointed to
by JPEGInterchangeFormat.  To avoid parsing the JPEG datastream, they
suggest not writing the JPEG auxiliary fields (JPEGxxTables etc.) nor even
the basic TIFF strip/tile data pointers.  This approach is incompatible
with implementations that handle the full TIFF 6.0 JPEG design, since they
will expect to find strip/tile pointers and auxiliary fields.  Indeed this
is arguably not TIFF at all, since *all* TIFF-reading applications expect
to find strip or tile pointers.  A subset implementation that is not
upward-compatible with the full spec is clearly unacceptable.  However,
the frequency with which this idea has come up makes it clear that
implementors find the existing Section 22 too complex.


Overview of the solution
========================

To solve these problems, we adopt a new design for embedding
JPEG-compressed data in TIFF files.  The new design uses only complete,
uninterpreted ISO JPEG datastreams, so it should be much more forgiving of
extensions to the ISO standard.  It should also be far easier to implement
using unmodified JPEG codecs.

To reduce overhead in multi-segment TIFF files, we allow JPEG overhead
tables to be stored just once in a JPEGTables auxiliary field.  This
feature does not violate the integrity of the JPEG datastreams, because it
uses the notions of "tables-only datastreams" and "abbreviated image
datastreams" as defined by the ISO standard.

To prevent confusion with the old design, the new design is given a new
Compression tag value, Compression=7.  Readers that need to handle
existing 6.0 JPEG files may read both old and new files, using whatever
interpretation of the 6.0 spec they did before.  Compression tag value 6
and the field tag numbers defined by 6.0 section 22 will remain reserved
indefinitely, even though detailed descriptions of them will be dropped
from future editions of the TIFF specification.


Replacement TIFF/JPEG specification
===================================

[This section of the Tech Note is expected to replace Section 22 in the
next release of the TIFF specification.]

This section describes TIFF compression scheme 7, a high-performance
compression method for continuous-tone images.

Introduction
------------

This TIFF compression method uses the international standard for image
compression ISO/IEC 10918-1, usually known as "JPEG" (after the original
name of the standards committee, Joint Photographic Experts Group).  JPEG
is a joint ISO/CCITT standard for compression of continuous-tone images.

The JPEG committee decided that because of the broad scope of the standard,
no one algorithmic procedure was able to satisfy the requirements of all
applications.  Instead, the JPEG standard became a "toolkit" of multiple
algorithms and optional capabilities.  Individual applications may select
a subset of the JPEG standard that meets their requirements.

The most important distinction among the JPEG processes is between lossy
and lossless compression.  Lossy compression methods provide high
compression but allow only approximate reconstruction of the original
image.  JPEG's lossy processes allow the encoder to trade off compressed
file size against reconstruction fidelity over a wide range.  Typically,
10:1 or more compression of full-color data can be obtained while keeping
the reconstructed image visually indistinguishable from the original.  Much
higher compression ratios are possible if a low-quality reconstructed image
is acceptable.  Lossless compression provides exact reconstruction of the
source data, but the achievable compression ratio is much lower than for
the lossy processes; JPEG's rather simple lossless process typically
achieves around 2:1 compression of full-color data.

The most widely implemented JPEG subset is the "baseline" JPEG process.
This provides lossy compression of 8-bit-per-channel data.  Optional
extensions include 12-bit-per-channel data, arithmetic entropy coding for
better compression, and progressive/hierarchical representations.  The
lossless process is an independent algorithm that has little in
common with the lossy processes.

It should be noted that the optional arithmetic-coding extension is subject
to several US and Japanese patents.  To avoid patent problems, use of
arithmetic coding processes in TIFF files intended for inter-application
interchange is discouraged.

All of the JPEG processes are useful only for "continuous tone" data,
in which the difference between adjacent pixel values is usually small.
Low-bit-depth source data is not appropriate for JPEG compression, nor
are palette-color images good candidates.  The JPEG processes work well
on grayscale and full-color data.

Describing the JPEG compression algorithms in sufficient detail to permit
implementation would require more space than we have here.  Instead, we
refer the reader to the References section.


What data is being compressed?
------------------------------

In lossy JPEG compression, it is customary to convert color source data
to YCbCr and then downsample it before JPEG compression.  This gives
2:1 data compression with hardly any visible image degradation, and it
permits additional space savings within the JPEG compression step proper.
However, these steps are not considered part of the ISO JPEG standard.
The ISO standard is "color blind": it accepts data in any color space.

For TIFF purposes, the JPEG compression tag is considered to represent the
ISO JPEG compression standard only.  The ISO standard is applied to the
same data that would be stored in the TIFF file if no compression were
used.  Therefore, if color conversion or downsampling is used, it must
be reflected in the regular TIFF fields; these steps are not considered to
be implicit in the JPEG compression tag value.  PhotometricInterpretation
and related fields shall describe the color space actually stored in the
file.  With the TIFF 6.0 field definitions, downsampling is permissible
only for YCbCr data, and it must correspond to the YCbCrSubSampling field.
(Note that the default value for this field is not 1,1; so the default for
YCbCr is to apply downsampling!)  It is likely that future versions of TIFF
will provide additional PhotometricInterpretation values and a more general
way of defining subsampling, so as to allow more flexibility in
JPEG-compressed files.  But that issue is not addressed in this Tech Note.
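
The warning about the YCbCrSubSampling default can be sketched as follows.
This is a minimal Python illustration, not a definitive implementation:
the function name is hypothetical, while the value 6 for
PhotometricInterpretation=YCbCr and the default of [2,2] come from the
TIFF 6.0 field definitions.

```python
PHOTOMETRIC_YCBCR = 6   # TIFF 6.0 PhotometricInterpretation value for YCbCr


def effective_subsampling(photometric, ycbcr_subsampling=None):
    """Return the (horizontal, vertical) subsampling a reader must assume.

    ycbcr_subsampling is the YCbCrSubSampling field value, or None when
    the field is absent from the IFD.
    """
    if photometric != PHOTOMETRIC_YCBCR:
        # Under the 6.0 definitions only YCbCr data may be downsampled.
        return (1, 1)
    if ycbcr_subsampling is None:
        # Absent field: the TIFF 6.0 default is 2,2, NOT 1,1.
        return (2, 2)
    return tuple(ycbcr_subsampling)


print(effective_subsampling(PHOTOMETRIC_YCBCR))          # field absent
print(effective_subsampling(PHOTOMETRIC_YCBCR, [2, 1]))  # field present
```

A writer that stores non-downsampled YCbCr data must therefore write
YCbCrSubSampling = [1,1] explicitly.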

Implementors should note that many popular JPEG codecs
(compressor/decompressors) provide automatic color conversion and
downsampling, so that the application may supply full-size RGB data which
is nonetheless converted to downsampled YCbCr.  This is an implementation
convenience which does not excuse the TIFF control layer from its
responsibility to know what is really going on.  The
PhotometricInterpretation and subsampling fields written to the file must
describe what is actually in the file.

A JPEG-compressed TIFF file will typically have PhotometricInterpretation =
YCbCr and YCbCrSubSampling = [2,1] or [2,2], unless the source data was
grayscale or CMYK.


Basic representation of JPEG-compressed images
----------------------------------------------

JPEG compression works in either strip-based or tile-based TIFF files.
Rather than repeating "strip or tile" constantly, we will use the term
"segment" to mean either a strip or a tile.

When the Compression field has the value 7, each image segment contains
a complete JPEG datastream which is valid according to the ISO JPEG
standard (ISO/IEC 10918-1).  Any sequential JPEG process can be used,
including lossless JPEG, but progressive and hierarchical processes are not
supported.  Since JPEG is useful only for continuous-tone images, the
PhotometricInterpretation of the image shall not be 3 (palette color) nor
4 (transparency mask).  The bit depth of the data is also restricted as
specified below.

Each image segment in a JPEG-compressed TIFF file shall contain a valid
JPEG datastream according to the ISO JPEG standard's rules for
interchange-format or abbreviated-image-format data.  The datastream shall
contain a single JPEG frame storing that segment of the image.  The
required JPEG markers within a segment are:
        SOI     (must appear at very beginning of segment)
        SOFn
        SOS     (one for each scan, if there is more than one scan)
        EOI     (must appear at very end of segment)
The actual compressed data follows SOS; it may contain RSTn markers if DRI
is used.

Additional JPEG "tables and miscellaneous" markers may appear between SOI
and SOFn, between SOFn and SOS, and before each subsequent SOS if there is
more than one scan.  These markers include:
        DQT
        DHT
        DAC     (not to appear unless arithmetic coding is used)
        DRI
        APPn    (shall be ignored by TIFF readers)
        COM     (shall be ignored by TIFF readers)
DNL markers shall not be used in TIFF files.  Readers should abort if any
other marker type is found, especially the JPEG reserved markers;
occurrence of such a marker is likely to indicate a JPEG extension.
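
The marker rules above can be checked with a short scan of the segment.
The following Python sketch is one possible reading of those rules, not a
definitive validator: the function names are hypothetical, fill bytes
(runs of 0xFF before a marker) are not handled, and the zero-filled
payloads in the example are placeholders that no decoder would accept.

```python
SOI, EOI, SOS = 0xD8, 0xD9, 0xDA
SOF_TYPES = {0xC0, 0xC1, 0xC3, 0xC9, 0xCB}      # SOF0, SOF1, SOF3, SOF9, SOF11
# DQT, DHT, DAC, DRI, COM, plus APP0..APP15
TABLES_MISC = {0xDB, 0xC4, 0xCC, 0xDD, 0xFE} | set(range(0xE0, 0xF0))


def is_marker_start(stream, pos):
    """True if the 0xFF at pos begins a marker that ends the scan data."""
    nxt = stream[pos + 1]
    # 0xFF00 is a stuffed data byte; 0xFFD0..0xFFD7 are RSTn markers.
    return stream[pos] == 0xFF and nxt != 0x00 and not (0xD0 <= nxt <= 0xD7)


def check_segment_markers(stream):
    """Return the marker codes of one segment, or raise ValueError."""
    if len(stream) < 4 or stream[:2] != b"\xff\xd8" or stream[-2:] != b"\xff\xd9":
        raise ValueError("segment must begin with SOI and end with EOI")
    end = len(stream) - 2                        # offset of the closing EOI
    pos, seen_sof, seen_sos = 2, False, False
    markers = [SOI]
    while pos != end:
        if pos + 4 > end or stream[pos] != 0xFF:
            raise ValueError("expected a marker segment at offset %d" % pos)
        code = stream[pos + 1]
        if code in SOF_TYPES:
            if seen_sof:
                raise ValueError("more than one frame in segment")
            seen_sof = True
        elif code == SOS:
            if not seen_sof:
                raise ValueError("SOS before SOFn")
            seen_sos = True
        elif code not in TABLES_MISC:
            # Covers DNL, reserved markers, and unsupported SOFn types.
            raise ValueError("unsupported marker 0xFF%02X" % code)
        markers.append(code)
        # Marker lengths are 2 bytes, MSB first, and include themselves.
        pos += 2 + int.from_bytes(stream[pos + 2:pos + 4], "big")
        if code == SOS:
            # Skip the entropy-coded data that follows the scan header.
            while pos < end and not is_marker_start(stream, pos):
                pos += 1
    if not seen_sos:
        raise ValueError("segment contains no SOS")
    markers.append(EOI)
    return markers


def marker_segment(code, payload=b""):
    """Build one marker segment (hypothetical helper for the example)."""
    return bytes([0xFF, code]) + (len(payload) + 2).to_bytes(2, "big") + payload


example = (b"\xff\xd8" + marker_segment(0xDB, bytes(65))
           + marker_segment(0xC0, bytes(9)) + marker_segment(0xC4, bytes(20))
           + marker_segment(0xDA, bytes(6)) + b"\x12\xff\x00\x34\xff\xd0\x56"
           + b"\xff\xd9")
print([hex(m) for m in check_segment_markers(example)])
```

Note that the scan never interprets table contents; it only walks marker
lengths, which is all the TIFF layer needs to know about the datastream.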

The tables/miscellaneous markers may appear in any order.  Readers are
cautioned that although the SOFn marker refers to DQT tables, JPEG does not
require those tables to precede the SOFn, only the SOS.  Missing-table
checks should be made when SOS is reached.

If no JPEGTables field is used, then each image segment shall be a complete
JPEG interchange datastream.  Each segment must define all the tables it
references.  To allow readers to decode segments in any order, no segment
may rely on tables being carried over from a previous segment.

When a JPEGTables field is used, image segments may omit tables that have
been specified in the JPEGTables field.  Further details appear below.

The SOFn marker shall be of type SOF0 for strict baseline JPEG data, of
type SOF1 for non-baseline lossy JPEG data, or of type SOF3 for lossless
JPEG data.  (SOF9 or SOF11 would be used for arithmetic coding.)  All
segments of a JPEG-compressed TIFF image shall use the same JPEG
compression process, in particular the same SOFn type.

The data precision field of the SOFn marker shall agree with the TIFF
BitsPerSample field.  (Note that when PlanarConfiguration=1, this implies
that all components must have the same BitsPerSample value; when
PlanarConfiguration=2, different components could have different bit
depths.)  For SOF0 only precision 8 is permitted; for SOF1, precision 8 or
12 is permitted; for SOF3, precisions 2 to 16 are permitted.
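
The precision rules just stated reduce to a small table lookup.  A minimal
Python sketch follows; the names are hypothetical, and SOF9 and SOF11 are
deliberately left out because the text above gives no precision limits
for them.

```python
# Marker codes (second byte of the marker) mapped to permitted precisions.
ALLOWED_PRECISION = {
    0xC0: {8},                  # SOF0: strict baseline
    0xC1: {8, 12},              # SOF1: non-baseline lossy
    0xC3: set(range(2, 17)),    # SOF3: lossless, 2 to 16 bits
}


def check_precision(sof_code, sof_precision, bits_per_sample):
    """Raise ValueError unless the SOFn precision is legal and matches TIFF."""
    allowed = ALLOWED_PRECISION.get(sof_code)
    if allowed is None:
        raise ValueError("SOFn type 0x%02X not covered by this sketch" % sof_code)
    if sof_precision not in allowed:
        raise ValueError("precision %d not permitted for this SOFn type"
                         % sof_precision)
    if sof_precision != bits_per_sample:
        raise ValueError("SOFn precision disagrees with BitsPerSample")


check_precision(0xC1, 12, 12)   # 12-bit extended sequential data: accepted
```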

The image dimensions given in the SOFn marker shall agree with the logical
dimensions of that particular strip or tile.  For strip images, the SOFn
image width shall equal ImageWidth and the height shall equal RowsPerStrip,
except in the last strip; its SOFn height shall equal the number of rows
remaining in the ImageLength.  (In other words, no padding data is counted
in the SOFn dimensions.)  For tile images, each SOFn shall have width
TileWidth and height TileLength; adding and removing any padding needed in
the edge tiles is the concern of some higher level of the TIFF software.
(The dimensional rules are slightly different when PlanarConfiguration=2,
as described below.)
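
As a worked example of the strip rule, the SOFn dimensions a reader should
expect can be computed as below.  This Python sketch covers only
PlanarConfiguration=1, and the function name is hypothetical.

```python
def expected_strip_sof_dims(strip_index, image_width, image_length,
                            rows_per_strip):
    """Return the (width, height) that a strip's SOFn marker should carry."""
    rows_before = strip_index * rows_per_strip
    if rows_before >= image_length:
        raise ValueError("strip index beyond end of image")
    # The last strip carries only the rows that remain; padding is not counted.
    height = min(rows_per_strip, image_length - rows_before)
    return (image_width, height)


# A 100 x 250 image with RowsPerStrip = 64 has strips of 64, 64, 64, 58 rows.
for i in range(4):
    print(i, expected_strip_sof_dims(i, 100, 250, 64))
```

For tiled images no such computation is needed: every tile's SOFn carries
the full tile dimensions, edge tiles included.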

The ISO JPEG standard only permits images up to 65535 pixels in width or
height, due to 2-byte fields in the SOFn markers.  In TIFF, this limits
the size of an individual JPEG-compressed strip or tile, but the total
image size can be greater.

The number of components in the JPEG datastream shall equal SamplesPerPixel
for PlanarConfiguration=1, and shall be 1 for PlanarConfiguration=2.  The
components shall be stored in the same order as they are described at the
TIFF field level.  (This applies both to their order in the SOFn marker,
and to the order in which they are scanned if multiple JPEG scans are
used.)  The component ID bytes are arbitrary so long as each component
within an image segment is given a distinct ID.  To avoid any possible
confusion, we require that all segments of a TIFF image use the same ID
code for a given component.

In PlanarConfiguration 1, the sampling factors given in SOFn markers shall
agree with the sampling factors defined by the related TIFF fields (or with
the default values that are specified in the absence of those fields).

When DCT-based JPEG is used in a strip TIFF file, RowsPerStrip is required
to be a multiple of 8 times the largest vertical sampling factor, i.e., a
multiple of the height of an interleaved MCU.  (For simplicity of
specification, we require this even if the data is not actually
interleaved.)  For example, if YCbCrSubSampling = [2,2] then RowsPerStrip
must be a multiple of 16.  An exception to this rule is made for
single-strip images (RowsPerStrip >= ImageLength): the exact value of
RowsPerStrip is unimportant in that case.  This rule ensures that no data
padding is needed at the bottom of a strip, except perhaps the last strip.
Any padding required at the right edge of the image, or at the bottom of
the last strip, is expected to occur internally to the JPEG codec.

When DCT-based JPEG is used in a tiled TIFF file, TileLength is required
to be a multiple of 8 times the largest vertical sampling factor, i.e.,
a multiple of the height of an interleaved MCU; and TileWidth is required
to be a multiple of 8 times the largest horizontal sampling factor, i.e.,
a multiple of the width of an interleaved MCU.  (For simplicity of
specification, we require this even if the data is not actually
interleaved.)  All edge padding required will therefore occur in the course
of normal TIFF tile padding; it is not special to JPEG.

Lossless JPEG does not impose these constraints on strip and tile sizes,
since it is not DCT-based.
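
The strip and tile size rules for DCT-based processes can be expressed as
two small predicates.  The Python sketch below is illustrative only; the
function names and the max_h/max_v parameters (the largest horizontal and
vertical sampling factors) are hypothetical.

```python
def strip_rows_ok(rows_per_strip, image_length, max_v):
    """Check RowsPerStrip for a DCT-based, strip-organized image."""
    if rows_per_strip >= image_length:
        # Single-strip image: the exact value is unimportant.
        return True
    # Otherwise it must be a multiple of the interleaved MCU height.
    return rows_per_strip % (8 * max_v) == 0


def tile_size_ok(tile_width, tile_length, max_h, max_v):
    """Check TileWidth and TileLength for a DCT-based, tiled image."""
    return (tile_width % (8 * max_h) == 0
            and tile_length % (8 * max_v) == 0)


# With YCbCrSubSampling = [2,2] the interleaved MCU is 16 x 16 pixels.
print(strip_rows_ok(16, 100, 2), strip_rows_ok(8, 100, 2))
print(tile_size_ok(256, 256, 2, 2), tile_size_ok(256, 8, 2, 2))
```

These checks do not apply when the SOFn type indicates lossless JPEG.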

Note that within JPEG datastreams, multibyte values appear in the MSB-first
order specified by the JPEG standard, regardless of the byte ordering of
the surrounding TIFF file.
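
For example, the fixed part of an SOFn marker payload is always read
MSB-first, whether the TIFF header says "II" or "MM".  The Python sketch
below assumes the ISO JPEG frame header layout (precision, number of
lines, samples per line, component count, then three bytes per
component); the function name and the sample bytes are hypothetical.

```python
import struct


def parse_sof_payload(payload):
    """Decode an SOFn payload (the bytes after the 2-byte length field)."""
    # ">" forces big-endian (MSB-first) decoding, independent of the
    # byte order of the enclosing TIFF file.
    precision, height, width, ncomp = struct.unpack(">BHHB", payload[:6])
    components = []
    for i in range(ncomp):
        comp_id, hv, tq = payload[6 + 3 * i:9 + 3 * i]
        # The sampling factors share one byte: high nibble H, low nibble V.
        components.append((comp_id, hv >> 4, hv & 0x0F, tq))
    return precision, height, width, components


sample = bytes([8, 0x00, 0x40, 0x01, 0x00, 3,
                1, 0x22, 0, 2, 0x11, 1, 3, 0x11, 1])
print(parse_sof_payload(sample))
```

Here the bytes 0x00 0x40 decode to a height of 64 and 0x01 0x00 to a
width of 256 in either kind of TIFF file.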


JPEGTables field
----------------

The only auxiliary TIFF field added for Compression=7 is the optional
JPEGTables field.  The purpose of JPEGTables is to predefine JPEG
quantization and/or Huffman tables for subsequent use by JPEG image
segments.  When this is done, these rather bulky tables need not be
duplicated in each segment, thus saving space and processing time.
JPEGTables may be used even in a single-segment file, although there is no
space savings in that case.

JPEGTables:
        Tag = 347 (15B.H)
        Type = UNDEFINED
        N = number of bytes in tables datastream, typically a few hundred
JPEGTables provides default JPEG quantization and/or Huffman tables which
are used whenever a segment datastream does not contain its own tables, as
specified below.

Notice that the JPEGTables field is required to have type code UNDEFINED,
not type code BYTE.  This is to cue readers that expanding individual bytes
to short or long integers is not appropriate.  A TIFF reader will generally
need to store the field value as an uninterpreted byte sequence until it is
fed to the JPEG decoder.
*5c402d22SFrank Warmerdam
*5c402d22SFrank WarmerdamMultibyte quantities within the tables follow the ISO JPEG convention of
*5c402d22SFrank WarmerdamMSB-first storage, regardless of the byte ordering of the surrounding TIFF
*5c402d22SFrank Warmerdamfile.
*5c402d22SFrank Warmerdam
*5c402d22SFrank WarmerdamWhen the JPEGTables field is present, it shall contain a valid JPEG
*5c402d22SFrank Warmerdam"abbreviated table specification" datastream.  This datastream shall begin
*5c402d22SFrank Warmerdamwith SOI and end with EOI.  It may contain zero or more JPEG "tables and
*5c402d22SFrank Warmerdammiscellaneous" markers, namely:
*5c402d22SFrank Warmerdam	DQT
*5c402d22SFrank Warmerdam	DHT
*5c402d22SFrank Warmerdam	DAC	(not to appear unless arithmetic coding is used)
*5c402d22SFrank Warmerdam	DRI
*5c402d22SFrank Warmerdam	APPn	(shall be ignored by TIFF readers)
*5c402d22SFrank Warmerdam	COM	(shall be ignored by TIFF readers)
*5c402d22SFrank WarmerdamSince JPEG defines the SOI marker to reset the DAC and DRI state, these two
*5c402d22SFrank Warmerdammarkers' values cannot be carried over into any image datastream, and thus
*5c402d22SFrank Warmerdamthey are effectively no-ops in the JPEGTables field.  To avoid confusion,
*5c402d22SFrank Warmerdamit is recommended that writers not place DAC or DRI markers in JPEGTables.
*5c402d22SFrank WarmerdamHowever readers must properly skip over them if they appear.
*5c402d22SFrank Warmerdam
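The layout just described can be walked with a few lines of code.  The
following Python sketch is illustrative only: scan_jpegtables is a
hypothetical helper, it collects DQT and DHT payloads without decoding
them, and for brevity it assumes there are no 0xFF fill bytes between
marker segments (which JPEG permits).

```python
SOI, EOI = 0xD8, 0xD9
DQT, DHT, DAC, DRI, COM = 0xDB, 0xC4, 0xCC, 0xDD, 0xFE
APP0, APP15 = 0xE0, 0xEF


def scan_jpegtables(data):
    """Return [(marker_code, payload), ...] for the DQT/DHT segments.

    Raises ValueError if the datastream is not SOI ... EOI or holds a
    marker that is not allowed in an abbreviated table specification.
    """
    if data[:2] != bytes([0xFF, SOI]):
        raise ValueError("JPEGTables must begin with SOI")
    tables = []
    pos = 2
    while True:
        if len(data) - pos < 2 or data[pos] != 0xFF:
            raise ValueError("expected a marker at offset %d" % pos)
        code = data[pos + 1]
        if code == EOI:
            if pos + 2 != len(data):
                raise ValueError("unexpected data after EOI")
            return tables
        # The 16-bit length is MSB-first and counts its own two bytes.
        length = int.from_bytes(data[pos + 2:pos + 4], "big")
        payload = data[pos + 4:pos + 2 + length]
        if length < 2 or len(payload) != length - 2:
            raise ValueError("truncated marker segment")
        if code in (DQT, DHT):
            tables.append((code, payload))
        elif code in (DAC, DRI, COM) or APP0 <= code <= APP15:
            pass  # skipped: a no-op here, or to be ignored by readers
        else:
            raise ValueError("unexpected marker 0xFF%02X" % code)
        pos += 2 + length


# SOI, a (pointless) DRI, one 8-bit quantization table in slot 0, EOI.
dri = bytes([0xFF, DRI, 0x00, 0x04, 0x00, 0x08])
dqt = bytes([0xFF, DQT, 0x00, 0x43, 0x00]) + bytes([16] * 64)
stream = bytes([0xFF, SOI]) + dri + dqt + bytes([0xFF, EOI])
found = scan_jpegtables(stream)
print([(hex(code), len(payload)) for code, payload in found])
# [('0xdb', 65)]
```
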
When JPEGTables is present, readers shall load the table specifications
contained in JPEGTables before processing image segment datastreams.
Image segments may simply refer to these preloaded tables without defining
them.  An image segment can still define and use its own tables, subject to
the restrictions below.

An image segment may not redefine any table defined in JPEGTables.  (This
restriction is imposed to allow readers to process image segments in random
order without having to reload JPEGTables between segments.)  Therefore, use
of JPEGTables divides the available table slots into two groups: "global"
slots are defined in JPEGTables and may be used but not redefined by
segments; "local" slots are available for local definition and use in each
segment.  To permit random access, a segment may not reference any local
tables that it does not itself define.

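The two restrictions amount to simple set arithmetic.  In the Python sketch
below, which is an illustration and not part of the specification, a table
slot is written as a (kind, number) pair such as ("DQT", 0); that notation
and the function name are assumptions of the example, and extracting the
slots from the marker segments is left out.

```python
def check_segment_tables(global_slots, defined, referenced):
    """Return a list of rule violations for one image segment.

    global_slots: slots defined in JPEGTables.
    defined:      slots defined inside the segment itself.
    referenced:   slots the segment's scans actually use.
    """
    problems = []
    for slot in sorted(defined & global_slots):
        problems.append("redefines global table %s %d" % slot)
    for slot in sorted(referenced - global_slots - defined):
        problems.append("references undefined table %s %d" % slot)
    return problems


global_slots = {("DQT", 0), ("DHT-DC", 0), ("DHT-AC", 0)}

# Legal: uses the global tables plus one local quantization table.
ok = check_segment_tables(
    global_slots,
    defined={("DQT", 1)},
    referenced={("DQT", 0), ("DQT", 1), ("DHT-DC", 0), ("DHT-AC", 0)},
)

# Illegal twice over: redefines DQT 0 and uses DQT 1 without defining it.
bad = check_segment_tables(
    global_slots,
    defined={("DQT", 0)},
    referenced={("DQT", 0), ("DQT", 1)},
)

print(ok)   # []
print(bad)
```
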

Special considerations for PlanarConfiguration 2
------------------------------------------------

In PlanarConfiguration 2, each image segment contains data for only one
color component.  To avoid confusing the JPEG codec, we wish the segments
to look like valid single-channel (i.e., grayscale) JPEG datastreams.  This
means that different rules must be used for the SOFn parameters.

In PlanarConfiguration 2, the dimensions given in the SOFn of a subsampled
component shall be scaled down by the sampling factors compared to the SOFn
dimensions that would be used in PlanarConfiguration 1.  This is necessary
to match the actual number of samples stored in that segment, so that the
JPEG codec doesn't complain about too much or too little data.  In strip
TIFF files the computed dimensions may need to be rounded up to the next
integer; in tiled files, the restrictions on tile size make this case
impossible.

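The scaling rule is a division rounded up.  A minimal Python sketch follows;
the function name is invented for the example, and the inputs are assumed
to be the PlanarConfiguration 1 SOFn dimensions, measured in luminance
samples.

```python
def planar2_sof_dimensions(width, rows, subsample_h, subsample_v):
    """SOFn (width, height) for one subsampled component's segment.

    -(-a // b) is integer division rounded up to the next integer.
    """
    return -(-width // subsample_h), -(-rows // subsample_v)


# A 101-sample-wide image with 16-row strips and 2,2 subsampling.
print(planar2_sof_dimensions(101, 16, 2, 2))  # (51, 8)   chroma strip
print(planar2_sof_dimensions(101, 16, 1, 1))  # (101, 16) luminance strip
print(planar2_sof_dimensions(101, 7, 2, 2))   # (51, 4)   short last strip
```
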
Furthermore, all SOFn sampling factors shall be given as 1.  (This is
merely to avoid confusion, since the sampling factors in a single-channel
JPEG datastream have no real effect.)

Any downsampling will need to happen externally to the JPEG codec, since
JPEG sampling factors are defined with reference to the full-precision
component.  In PlanarConfiguration 2, the JPEG codec will be working on
only one component at a time and thus will have no reference component to
downsample against.


Minimum requirements for TIFF/JPEG
----------------------------------

ISO JPEG is a large and complex standard; most implementations support only
a subset of it.  Here we define a "core" subset of TIFF/JPEG which readers
must support to claim TIFF/JPEG compatibility.  For maximum
cross-application compatibility, we recommend that writers confine
themselves to this subset unless there is very good reason to do otherwise.

Use the ISO baseline JPEG process: 8-bit data precision, Huffman coding,
with no more than 2 DC and 2 AC Huffman tables.  Note that this implies
BitsPerSample = 8 for each component.  We recommend deviating from baseline
JPEG only if 12-bit data precision or lossless coding is required.

Use no subsampling (all JPEG sampling factors = 1) for color spaces other
than YCbCr.  (This is, in fact, required with the TIFF 6.0 field
definitions, but may not be so in future revisions.)  For YCbCr, use one of
the following choices:
	YCbCrSubSampling field		JPEG sampling factors
	1,1				1h1v, 1h1v, 1h1v
	2,1				2h1v, 1h1v, 1h1v
	2,2  (default value)		2h2v, 1h1v, 1h1v
We recommend that RGB source data be converted to YCbCr for best compression
results.  Other source data colorspaces should probably be left alone.
Minimal readers need not support JPEG images with colorspaces other than
YCbCr and grayscale (PhotometricInterpretation = 6 or 1).

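The table of recommended YCbCr choices can be expressed as a small lookup,
sketched here in Python.  The function name and the decision to raise an
error for other values are choices made for this example only; values
outside the table are legal TIFF but fall outside the recommended subset.

```python
def jpeg_sampling_factors(ycbcr_subsampling):
    """Map a YCbCrSubSampling value to per-component (h, v) factors."""
    horiz, vert = ycbcr_subsampling
    if (horiz, vert) not in {(1, 1), (2, 1), (2, 2)}:
        raise ValueError("outside the recommended subset: %r"
                         % ((horiz, vert),))
    # Luminance carries the subsampling ratio; Cb and Cr are always 1h1v.
    return [(horiz, vert), (1, 1), (1, 1)]


print(jpeg_sampling_factors((2, 2)))  # [(2, 2), (1, 1), (1, 1)]
print(jpeg_sampling_factors((2, 1)))  # [(2, 1), (1, 1), (1, 1)]
```
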
A minimal reader also need not support JPEG YCbCr images with nondefault
values of YCbCrCoefficients or YCbCrPositioning, nor with values of
ReferenceBlackWhite other than [0,255,128,255,128,255].  (These values
correspond to the RGB<=>YCbCr conversion specified by JFIF, which is widely
implemented in JPEG codecs.)

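For reference, the forward half of that conversion can be sketched in
Python as follows.  This assumes the default YCbCrCoefficients (0.299,
0.587, 0.114), full-range 8-bit samples, and chroma centred on 128; the
rounding and clamping shown are choices made for the example rather than
requirements.

```python
# Default YCbCrCoefficients (LumaRed, LumaGreen, LumaBlue).
LUMA_RED, LUMA_GREEN, LUMA_BLUE = 0.299, 0.587, 0.114


def rgb_to_ycbcr(r, g, b):
    """Full-range 8-bit RGB to YCbCr with chroma centred on 128."""
    y = LUMA_RED * r + LUMA_GREEN * g + LUMA_BLUE * b
    cb = (b - y) / (2 - 2 * LUMA_BLUE) + 128
    cr = (r - y) / (2 - 2 * LUMA_RED) + 128
    # Round to integers and clamp to the 8-bit range.
    return tuple(min(255, max(0, round(v))) for v in (y, cb, cr))


print(rgb_to_ycbcr(255, 255, 255))  # (255, 128, 128)
print(rgb_to_ycbcr(0, 0, 0))        # (0, 128, 128)
print(rgb_to_ycbcr(255, 0, 0))      # (76, 85, 255)
```
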
Writers are reminded that a ReferenceBlackWhite field *must* be included
when PhotometricInterpretation is YCbCr, because the default
ReferenceBlackWhite values are inappropriate for YCbCr.

If any subsampling is used, PlanarConfiguration=1 is preferred to avoid the
possibly-confusing requirements of PlanarConfiguration=2.  In any case,
readers are not required to support PlanarConfiguration=2.

If possible, use a single interleaved scan in each image segment.  This is
not legal JPEG if there are more than 4 SamplesPerPixel or if the sampling
factors are such that more than 10 blocks would be needed per MCU; in that
case, use a separate scan for each component.  (The recommended color
spaces and sampling factors will not run into that restriction, so a
minimal reader need not support more than one scan per segment.)

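The two limits on a single interleaved scan are easy to test in code.  The
Python sketch below is illustrative; the function name is invented, and
sampling factors are assumed to be given as (h, v) pairs, one per
component, so that each component contributes h*v blocks to an MCU.

```python
def single_scan_is_legal(sampling_factors):
    """True if all components may share one interleaved scan."""
    blocks_per_mcu = sum(h * v for h, v in sampling_factors)
    return len(sampling_factors) <= 4 and blocks_per_mcu <= 10


print(single_scan_is_legal([(2, 2), (1, 1), (1, 1)]))  # True  (6 blocks)
print(single_scan_is_legal([(4, 2), (1, 1), (1, 1)]))  # True  (10 blocks)
print(single_scan_is_legal([(4, 4), (1, 1), (1, 1)]))  # False (18 blocks)
print(single_scan_is_legal([(1, 1)] * 5))              # False (5 components)
```
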
To claim TIFF/JPEG compatibility, readers shall support multiple-strip TIFF
files and the optional JPEGTables field; it is not acceptable to read only
single-datastream files.  Support for tiled TIFF files is strongly
recommended but not required.


Other recommendations for implementors
--------------------------------------

The TIFF tag Compression=7 guarantees only that the compressed data is
represented as ISO JPEG datastreams.  Since JPEG is a large and evolving
standard, readers should apply careful error checking to the JPEG markers
to ensure that the compression process is within their capabilities.  In
particular, to avoid being confused by future extensions to the JPEG
standard, it is important to abort if unknown marker codes are seen.

The point of requiring that all image segments use the same JPEG process is
to ensure that a reader need check only one segment to determine whether it
can handle the image.  For example, consider a TIFF reader that has access
to fast but restricted JPEG hardware, as well as a slower, more general
software implementation.  It is desirable to check only one image segment
to find out whether the fast hardware can be used.  Thus, writers should
try to ensure that all segments of an image look as much "alike" as
possible: there should be no variation in scan layout, use of options such
as DRI, etc.  Ideally, segments will be processed identically except
perhaps for using different local quantization or entropy-coding tables.

Writers should avoid including "noise" JPEG markers (COM and APPn markers).
Standard TIFF fields provide a better way to transport any non-image data.
Some JPEG codecs may change behavior if they see an APPn marker they
think they understand; since the TIFF spec requires these markers to be
ignored, this behavior is undesirable.

It is possible to convert an interchange-JPEG file (e.g., a JFIF file) to
TIFF simply by dropping the interchange datastream into a single strip.
(However, designers are reminded that the TIFF spec discourages huge
strips; splitting the image is somewhat more work but may give better
results.)  Conversion from TIFF to interchange JPEG is more complex.  A
strip-based TIFF/JPEG file can be converted fairly easily if all strips use
identical JPEG tables and no RSTn markers: just delete the overhead markers
and insert RSTn markers between strips.  Converting tiled images is harder,
since the data will usually not be in the right order (unless the tiles are
only one MCU high).  This can still be done losslessly, but it will require
undoing and redoing the entropy coding so that the DC coefficient
differences can be updated.

There is no default value for JPEGTables: standard TIFF files must define all
tables that they reference.  For some closed systems in which many files will
have identical tables, it might make sense to define a default JPEGTables
value to avoid actually storing the tables.  Or even better, invent a
private field selecting one of N default JPEGTables settings, so as to allow
for future expansion.  Either of these must be regarded as a private
extension that will render the files unreadable by other applications.


References
----------

[1] Wallace, Gregory K.  "The JPEG Still Picture Compression Standard",
Communications of the ACM, April 1991 (vol. 34 no. 4), pp. 30-44.

This is the best short technical introduction to the JPEG algorithms.
It is a good overview but does not provide sufficiently detailed
information to write an implementation.

[2] Pennebaker, William B. and Mitchell, Joan L.  "JPEG Still Image Data
Compression Standard", Van Nostrand Reinhold, 1993, ISBN 0-442-01272-1.
638pp.

This textbook is by far the most complete exposition of JPEG in existence.
It includes the full text of the ISO JPEG standards (DIS 10918-1 and draft
DIS 10918-2).  No would-be JPEG implementor should be without it.

[3] ISO/IEC IS 10918-1, "Digital Compression and Coding of Continuous-tone
Still Images, Part 1: Requirements and guidelines", February 1994.
ISO/IEC DIS 10918-2, "Digital Compression and Coding of Continuous-tone
Still Images, Part 2: Compliance testing", final approval expected 1994.

These are the official standards documents.  Note that the Pennebaker and
Mitchell textbook is likely to be cheaper and more useful than the official
standards.


Changes to Section 21: YCbCr Images
===================================

[This section of the Tech Note clarifies section 21 to make clear the
interpretation of image dimensions in a subsampled image.  Furthermore,
the section is changed to allow the original image dimensions not to be
multiples of the sampling factors.  This change is necessary to support use
of JPEG compression on odd-size images.]

Add the following paragraphs to the Section 21 introduction (p. 89),
just after the paragraph beginning "When a Class Y image is subsampled":

	In a subsampled image, it is understood that all TIFF image
	dimensions are measured in terms of the highest-resolution
	(luminance) component.  In particular, ImageWidth, ImageLength,
	RowsPerStrip, TileWidth, TileLength, XResolution, and YResolution
	are measured in luminance samples.

	RowsPerStrip, TileWidth, and TileLength are constrained so that
	there are an integral number of samples of each component in a
	complete strip or tile.  However, ImageWidth/ImageLength are not
	constrained.  If an odd-size image is to be converted to subsampled
	format, the writer should pad the source data to a multiple of the
	sampling factors by replication of the last column and/or row, then
	downsample.  The number of luminance samples actually stored in the
	file will be a multiple of the sampling factors.  Conversely,
	readers must ignore any extra data (outside the specified image
	dimensions) after upsampling.

	When PlanarConfiguration=2, each strip or tile covers the same
	image area despite subsampling; that is, the total number of strips
	or tiles in the image is the same for each component.  Therefore
	strips or tiles of the subsampled components contain fewer samples
	than strips or tiles of the luminance component.

	If there are extra samples per pixel (see field ExtraSamples),
	these data channels have the same number of samples as the
	luminance component.

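The pad-then-downsample step for an odd-size image can be sketched in
Python as below.  This is an illustration only: planes are plain lists of
rows, the function names are invented, and the box-average filter is an
assumption of the example, since the text above does not prescribe a
particular downsampling filter.

```python
def pad_by_replication(plane, factor_h, factor_v):
    """Pad a plane (list of rows) to multiples of the sampling factors."""
    extra_cols = -len(plane[0]) % factor_h
    extra_rows = -len(plane) % factor_v
    # Repeat the last column, then repeat the last (already widened) row.
    padded = [row + [row[-1]] * extra_cols for row in plane]
    padded += [list(padded[-1]) for _ in range(extra_rows)]
    return padded


def downsample(plane, factor_h, factor_v):
    """Box-average each factor_h x factor_v cell (an assumed filter)."""
    out = []
    for top in range(0, len(plane), factor_v):
        out_row = []
        for left in range(0, len(plane[0]), factor_h):
            cell = [plane[y][x]
                    for y in range(top, top + factor_v)
                    for x in range(left, left + factor_h)]
            out_row.append(round(sum(cell) / len(cell)))
        out.append(out_row)
    return out


chroma = [[10, 20, 30],
          [40, 50, 60],
          [70, 80, 90]]              # 3x3: odd in both directions
padded = pad_by_replication(chroma, 2, 2)
print(padded[0])                     # [10, 20, 30, 30]
print(len(padded), len(padded[0]))   # 4 4
print(downsample(padded, 2, 2))      # [[30, 45], [75, 90]]
```
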
Rewrite the YCbCrSubSampling field description (pp. 91-92) as follows
(largely to eliminate possibly-misleading references to
ImageWidth/ImageLength of the subsampled components):

	(first paragraph unchanged)

	The two elements of this field are defined as follows:

	Short 0: ChromaSubsampleHoriz:

	1 = there are equal numbers of luma and chroma samples horizontally.

	2 = there are twice as many luma samples as chroma samples
	horizontally.

	4 = there are four times as many luma samples as chroma samples
	horizontally.

	Short 1: ChromaSubsampleVert:

	1 = there are equal numbers of luma and chroma samples vertically.

	2 = there are twice as many luma samples as chroma samples
	vertically.

	4 = there are four times as many luma samples as chroma samples
	vertically.

	ChromaSubsampleVert shall always be less than or equal to
	ChromaSubsampleHoriz.  Note that Cb and Cr have the same sampling
	ratios.

	In a strip TIFF file, RowsPerStrip is required to be an integer
	multiple of ChromaSubsampleVert (unless RowsPerStrip >=
	ImageLength, in which case its exact value is unimportant).
	If ImageWidth and ImageLength are not multiples of
	ChromaSubsampleHoriz and ChromaSubsampleVert respectively, then the
	source data shall be padded to the next integer multiple of these
	values before downsampling.

	In a tiled TIFF file, TileWidth must be an integer multiple of
	ChromaSubsampleHoriz and TileLength must be an integer multiple of
	ChromaSubsampleVert.  Padding will occur to tile boundaries.

	The default values of this field are [ 2,2 ].  Thus, YCbCr data is
	downsampled by default!
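
The constraints in the rewritten field description can be collected into a
single check, sketched here in Python.  The function name, argument names,
and message strings are invented for the illustration; a file is assumed
to be tiled if tile_size is given and stripped otherwise.

```python
def check_subsampling(horiz, vert, image_length,
                      rows_per_strip=None, tile_size=None):
    """Return a list of rule violations (empty if the layout is legal)."""
    if horiz not in (1, 2, 4) or vert not in (1, 2, 4):
        return ["sampling factors must be 1, 2, or 4"]
    problems = []
    if vert > horiz:
        problems.append("ChromaSubsampleVert exceeds ChromaSubsampleHoriz")
    if tile_size is not None:
        tile_width, tile_length = tile_size
        if tile_width % horiz != 0:
            problems.append("TileWidth is not a multiple of "
                            "ChromaSubsampleHoriz")
        if tile_length % vert != 0:
            problems.append("TileLength is not a multiple of "
                            "ChromaSubsampleVert")
    elif rows_per_strip is not None:
        # A strip that covers the whole image is exempt from the rule.
        if rows_per_strip < image_length and rows_per_strip % vert != 0:
            problems.append("RowsPerStrip is not a multiple of "
                            "ChromaSubsampleVert")
    return problems


print(check_subsampling(2, 2, 480, rows_per_strip=16))   # []
print(check_subsampling(2, 2, 480, rows_per_strip=15))
print(check_subsampling(2, 2, 480, rows_per_strip=999))  # []
print(check_subsampling(1, 2, 480, tile_size=(64, 64)))
```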
</pre>