
Image Data Codecs

This appendix describes the ImageN image data compression coders and decoders (codecs).

11.1 Introduction

The org.eclipse.imagen.codec package provides a set of interfaces and classes for encoding and decoding compressed image data files. Java Advanced Imaging currently supports the following codecs:

| Codec | Description |
|-------|-------------|
| BMP | Microsoft Windows bitmap image file format |
| FPX | FlashPix, by the Digital Imaging Group (DIG) |
| JPEG | Joint Photographic Experts Group (JPEG) format |
| PNG | Portable Network Graphics file format |
| PNM | Portable aNy Map file format |

The org.eclipse.imagen.codec package also enables you to create your own additional codecs if those listed above are not sufficient. See Chapter 12, "Extending the API," for more information.

The Java Advanced Imaging codecs are used to both read (decode) and write (encode) compressed images. Decode operations convert compressed image data back into the uncompressed form. Encode operations convert original image data into a compressed data form. These are often called image coding operations, because they use data coding methods to make the conversion to and from the compressed form.

As shown in Figure 11-1, the image decode operations transform an InputStream into a BufferedImage or Raster object. The image encode operations transform a BufferedImage or Raster into an OutputStream. Various parameters affect the encode and decode operations. These parameters are usually different for each codec, but include such things as the bit depth, choice of conversion tables, and factors that affect image quality and compression ratio.

If you need to know more about image data compression than is presented here, see "Related Documentation" in the Preface.

11.2 Interfaces and Classes

The following tables list the interfaces and classes used for image data compression and decompression. These interfaces and classes are all part of the org.eclipse.imagen.codec package.

| Interface | Description |
|-----------|-------------|
| ImageDecoder | Describes objects that transform an InputStream into a BufferedImage or Raster. |
| ImageEncoder | Describes objects that transform a BufferedImage or Raster into an OutputStream. |
| ImageDecodeParam | Extends: java.lang.Cloneable. An empty (marker) interface to be implemented by all image decoder parameter classes. |
| ImageEncodeParam | Extends: ImageDecodeParam, java.lang.Cloneable. An empty (marker) interface to be implemented by all image encoder parameter classes. |

Table 11-1 Codec Interfaces

| Class | Description |
|-------|-------------|
| ImageCodec | Extends: java.lang.Object. Allows the creation of image decoders and encoders. Instances of ImageCodec may be registered by name. Once a codec has been registered, the name associated with it may be used as the name parameter in the createImageEncoder and createImageDecoder methods. |
| ImageDecoderImpl | Extends: java.lang.Object. Implements: ImageDecoder. An implementation of the ImageDecoder interface useful for subclassing. |
| ImageEncoderImpl | Extends: java.lang.Object. Implements: ImageEncoder. An implementation of the ImageEncoder interface useful for subclassing. |
| BMPEncodeParam | Extends: java.lang.Object. Implements: ImageEncodeParam. Specifies parameters for encoding BMP format image files. |
| JPEGEncodeParam | Extends: java.lang.Object. Implements: ImageEncodeParam, com.sun.image.codec.jpeg.JPEGEncodeParam. Specifies parameters for encoding JPEG format image files. |
| PNMEncodeParam | Extends: java.lang.Object. Implements: ImageEncodeParam. Specifies parameters for encoding PNM format image files. |

Table 11-2 Codec Classes

11.3 Encoding a Compressed Image File

The encoding operation is usually performed at the sink end of a rendering graph or chain, after any other operations have been performed and the image is converted into a RenderedImage or a Raster. At this point, the image is ready to be written to a file in one of the supported compressed image data formats.

The encoding of an image file into a supported image format is done with one of the ImageCodec.createImageEncoder methods. To encode a compressed image file, you:

1. Declare the RenderedImage (or Raster) to be encoded and the name of the destination file:

             RenderedImage im;
             String filename;

2. Create a FileOutputStream for the destination file:

             FileOutputStream dst = new FileOutputStream(filename);

3. Call the ImageCodec.createImageEncoder method, which returns an ImageEncoder for the named codec:

             ImageEncoder enc =
                 ImageCodec.createImageEncoder("codec", dst, null);

   The "codec" parameter specifies the name of the codec to be used ("BMP", "JPEG", "PNG", or "PNM"). The dst parameter specifies the OutputStream to write to.

4. Encode the image:

             enc.encode(im);

5. Close the OutputStream:

             dst.close();

The following sample code is an example of encoding a BMP file.

     RenderedImage im;
     String filename;

     FileOutputStream dst = new FileOutputStream(filename);
     ImageEncoder enc =
         ImageCodec.createImageEncoder("BMP", dst, null);
     enc.encode(im);
     dst.close();

API: org.eclipse.imagen.codec.ImageCodec

static ImageEncoder createImageEncoder(java.lang.String name, 
java.io.OutputStream dst, ImageEncodeParam param)

: returns an ImageEncoder object suitable for encoding to the supplied OutputStream, using the supplied ImageEncodeParam object. Parameters: name - the name associated with the codec ("BMP", "JPEG", or "PNM"); dst - the OutputStream to write to; param - an instance of ImageEncodeParam suitable for use with the named codec, or null.

static ImageEncoder createImageEncoder(java.lang.String name, 
java.io.OutputStream dst)

: returns an ImageEncoder object suitable for encoding to the supplied OutputStream object. A null ImageEncodeParam is used.

API: org.eclipse.imagen.codec.ImageEncoder

void encode(java.awt.image.Raster ras)

: encodes a Raster and writes the output to the OutputStream associated with this ImageEncoder.

void encode(java.awt.image.RenderedImage im)

: encodes a RenderedImage and writes the output to the OutputStream associated with this ImageEncoder.

11.4 Decoding a Compressed Image File

The decoding operation is usually performed at the source end of a rendering graph or chain, before any other operations have been performed.

The decoding of any of the supported image formats is done with one of the ImageCodec.createImageDecoder methods. To decode a compressed image file, you open an InputStream on the file, call ImageCodec.createImageDecoder to obtain an ImageDecoder for the appropriate codec, call one of the decoder's decodeAs methods to produce the image, and then close the stream.

The following sample code is an example of decoding a JPEG file.


     String filename;

     FileInputStream src = new FileInputStream(filename);
     ImageDecoder dec =
         ImageCodec.createImageDecoder("JPEG", src, null);
     RenderedImage im = dec.decodeAsRenderedImage();
     src.close();

API: org.eclipse.imagen.codec.ImageCodec

static ImageDecoder createImageDecoder(java.lang.String name, 
java.io.InputStream src, ImageDecodeParam param)

: returns an ImageDecoder object suitable for decoding from the supplied InputStream, using the supplied ImageDecodeParam object. Parameters: name - the name associated with the codec ("BMP", "JPEG", or "PNM"); src - the InputStream to read from; param - an instance of ImageDecodeParam suitable for use with the named codec, or null.

static ImageDecoder createImageDecoder(java.lang.String name, 
java.io.InputStream src)

: returns an ImageDecoder object suitable for decoding from the supplied InputStream. A null ImageDecodeParam is used.

API: org.eclipse.imagen.codec.ImageDecoder

java.awt.image.Raster decodeAsRaster()

: returns a Raster that contains the decoded contents of the InputStream associated with this ImageDecoder.

java.awt.image.RenderedImage decodeAsRenderedImage()

: returns a RenderedImage that contains the decoded contents of the InputStream associated with this ImageDecoder.
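
As a sketch of the Raster path, the following reads a file into a Raster using decodeAsRaster; the file name and the codec choice ("PNM") are placeholders, and only the methods documented above are used.

     import java.awt.image.Raster;
     import java.io.FileInputStream;
     import java.io.IOException;

     import org.eclipse.imagen.codec.ImageCodec;
     import org.eclipse.imagen.codec.ImageDecoder;

     public class DecodeRasterExample {
         public static void main(String[] args) throws IOException {
             // Placeholder input file; any supported codec name may be used.
             FileInputStream src = new FileInputStream("input.pnm");

             // Obtain a decoder for the named codec; a null ImageDecodeParam is used.
             ImageDecoder dec = ImageCodec.createImageDecoder("PNM", src, null);

             // decodeAsRaster() yields raw pixel data rather than a RenderedImage.
             Raster ras = dec.decodeAsRaster();
             System.out.println("Decoded " + ras.getWidth() + " x " + ras.getHeight()
                     + " pixels, " + ras.getNumBands() + " band(s)");

             src.close();
         }
     }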

11.5 Standard Image Compression Schemes

Several standardized image compression techniques have evolved to support the requirements of different segments of the imaging industry. Java Advanced Imaging supports a small number of these standards. Due to the great variety of standards available, it is not possible for any imaging API to support them all. However, Java Advanced Imaging makes it possible to add any number of image coders and decoders to support special needs. See Chapter 12, "Extending the API," for more information.

11.5.1 BMP Coding

The BMP (Microsoft Windows bitmap image file) file format is a commonly used file format on IBM PC-compatible computers. BMP can also refer to the OS/2 bitmap format, which is a strict superset of the Windows format. The OS/2 2.0 format allows for multiple bitmaps in the same file, for the CCITT G3 1bpp encoding, and also an RLE24 encoding.

Java Advanced Imaging currently reads and writes Version 2, Version 3, and some Version 4 images, as defined in the Microsoft Windows BMP file format.

Version 4 of the BMP format allows for the specification of alpha values, gamma values, and CIE color spaces. These are not currently handled, but the relevant properties are emitted, if they are available from the BMP image file.

Versions 3 and later support run-length encoded (RLE) formats for compressing bitmaps that use four or eight bits per pixel.

BMP files are stored in a device-independent bitmap (DIB) format that allows the bitmap to be displayed on any type of display device. The format specifies pixel color in a form independent of the method used by a display to represent color.

The BMP file contains the following:

  • Bitmap-file header - describes the type, size, and layout of the bitmap file.

  • Bitmap-information header - specifies the dimensions, compressed type, and color format for the bitmap.

  • Color table - an array of RGB quad structures that describe a color consisting of relative intensities of red, green, and blue. The array contains as many elements as there are colors in the bitmap. There is no color table for bitmaps with 24 color bits since each pixel is represented by 24-bit red-green-blue (RGB) values in the actual bitmap data area.

  • Bitmap data - an array of bytes that defines the bitmap bits. The bytes represent consecutive rows (or scan lines) of the bitmap. Each scan line consists of consecutive bytes representing the pixels in the scan line, in left-to-right order. The number of bytes representing a scan line depends on the color format and the width, in pixels, of the bitmap.

The bit count information in the bitmap-information header determines the number of bits that define each pixel and the maximum number of colors in the bitmap. The bit count information can have any of the following values:

| Value | Description |
|-------|-------------|
| 1 | Bitmap is monochrome and the color table contains two entries. Each bit in the bitmap array represents a pixel. |
| 4 | Bitmap has a maximum of 16 colors. Each pixel in the bitmap is represented by a four-bit index into the color table. |
| 8 | Bitmap has a maximum of 256 colors. Each pixel in the bitmap is represented by a one-byte index into the color table. |
| 24 | Bitmap has a maximum of 2^24^ colors. Each three-byte sequence in the bitmap array represents the relative intensities of red, green, and blue, respectively, for a pixel. |

11.5.1.1 BMP Coding Parameters

Java Advanced Imaging provides four parameters for defining BMP coding: version, compression type, data layout, and the number of bits per pixel.

11.5.1.2 BMP Version

Java Advanced Imaging currently reads and writes Version 2, Version 3, and some Version 4 images. The BMP version number is read and specified with the getVersion and setVersion methods in the BMPEncodeParam class. The BMP version parameters are as follows:

| Parameter | Description |
|-----------|-------------|
| VERSION_2 | Specifies BMP Version 2 |
| VERSION_3 | Specifies BMP Version 3 |
| VERSION_4 | Specifies BMP Version 4 |

If not specifically set, VERSION_3 is the default version.

API: org.eclipse.imagen.codec.BMPEncodeParam

int getVersion()

: returns the BMP version to be used.

void setVersion(int versionNumber)

: sets the BMP version to be used.

11.5.1.3 BMP Compression Type

The BMP compression type is read and specified with getCompression and setCompression methods in the BMPEncodeParam class. Java Advanced Imaging currently supports four compression types:

| Parameter | Description |
|-----------|-------------|
| BI_RGB | Specifies that the bitmap is not compressed. |
| BI_RLE8 | Specifies a run-length encoded format for bitmaps with eight bits per pixel. |
| BI_RLE4 | Specifies a run-length encoded format for bitmaps with four bits per pixel. |
| BI_BITFIELDS | Specifies bitfield compression. |

If not specifically set, BI_RGB is the default type (no compression).

API: org.eclipse.imagen.codec.BMPEncodeParam

int getCompression()

: returns the compression type to be used.

void setCompression(int compressionType)

: sets the compression type to be used.

11.5.1.4 BMP Data Layout

The scan lines in the BMP bitmap are stored from the bottom up. This means that the first byte in the array represents the pixels in the lower-left corner of the bitmap, and the last byte represents the pixels in the upper-right corner.

The BMP bitmap data layout is read and specified with getDataLayout and setDataLayout methods in the BMPEncodeParam class. The data layout is one of the following:

| Parameter | Description |
|-----------|-------------|
| TOP_DOWN | The constant for top-down layout |
| BOTTOM_UP | The constant for bottom-up layout |

API: org.eclipse.imagen.codec.BMPEncodeParam

int getDataLayout()

: returns the data layout to be used.

void setDataLayout(int dataLayout)

: sets the data layout to be used.

11.5.1.5 BMP Bits Per Pixel

The number of bits per pixel is specified with the getBitsPerPixel and setBitsPerPixel methods in the BMPEncodeParam class. Valid values are 1, 4, 8, and 24. Support for 16- and 32-bit images has also been implemented in Java Advanced Imaging, though such BMP images are not very common.

API: org.eclipse.imagen.codec.BMPEncodeParam

int getBitsPerPixel()

: returns the bit depth to be used.

void setBitsPerPixel(int bitsPerPixel)

: sets the bit depth to be used.
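
A minimal sketch that puts the BMP parameters described above together, assuming the listed constants (VERSION_3, BI_RGB, BOTTOM_UP) are exposed as static fields of BMPEncodeParam and that the class has a no-argument constructor; the image and file name are supplied by the caller.

     import java.awt.image.RenderedImage;
     import java.io.FileOutputStream;
     import java.io.IOException;

     import org.eclipse.imagen.codec.BMPEncodeParam;
     import org.eclipse.imagen.codec.ImageCodec;
     import org.eclipse.imagen.codec.ImageEncoder;

     public class BMPEncodeExample {
         public static void encodeAsBMP(RenderedImage im, String filename)
                 throws IOException {
             // Configure the BMP-specific encoding parameters.
             BMPEncodeParam param = new BMPEncodeParam();
             param.setVersion(BMPEncodeParam.VERSION_3);    // the default version
             param.setCompression(BMPEncodeParam.BI_RGB);   // no compression
             param.setDataLayout(BMPEncodeParam.BOTTOM_UP); // conventional BMP layout
             param.setBitsPerPixel(24);                     // true-color output

             // Pass the parameter object in place of the null used earlier.
             FileOutputStream dst = new FileOutputStream(filename);
             ImageEncoder enc = ImageCodec.createImageEncoder("BMP", dst, param);
             enc.encode(im);
             dst.close();
         }
     }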

11.5.2 FPX (FlashPix) Coding

The FlashPix standard was developed by the Digital Imaging Group (DIG), a not-for-profit consortium of several companies whose purpose is to grow the marketplace for digital imaging solutions through the promotion and ongoing development of open standards. FlashPix is a multi-resolution, tiled file format that allows images to be stored at different resolutions for different purposes, such as editing or printing. Each resolution is divided into 64 x 64 blocks, or tiles. Within a tile, pixels can be either uncompressed, JPEG compressed, or single-color compressed.

FlashPix objects are stored in structured storage container files. The image data is stored in defined color spaces. By defining the color space options and providing standard ICC color management profiles, colors remain consistent when viewed across various displays and printers.

FlashPix viewing parameters include area selection, a filtering parameter, a spatial orientation matrix, a colortwist matrix, and a contrast parameter. FlashPix images also include non-image data definitions, such as content description, camera information, and scan description.

The FlashPix file format is designed to work with the Internet Imaging Protocol (IIP). IIP takes a FlashPix object and transports it over a network using a request-response protocol. For example, a command such as "TIL" requests a single tile or ranges of tiles that include coding information. IIP improves the efficiency of transporting FlashPix image files over the Internet by transferring just the data needed. This eliminates the need to create and store multiple files on a server, since one file contains both low and high resolutions - the IIP server delivers the correct resolution needed for the application.

The image information is transferred over the Internet via CGI or server modules. A user then makes a connection through TCP/IP sockets, and IIP does the rest. If a problem occurs, IIP returns an error message.

11.5.3 JPEG Coding

The JPEG standard was developed by a working group, known as the Joint Photographic Experts Group (JPEG). The JPEG image data compression standard handles grayscale and color images of varying resolution and size.

JPEG compression identifies and discards "extra" data that is beyond what the human eye can see. Since it discards data, the JPEG compression algorithm is considered "lossy." This means that once an image has been compressed and then decompressed, it will not be identical to the original image. In most cases, the difference between the original and compressed version of the image is indistinguishable.

An advantage of JPEG compression is the ability to select the quality when compressing the image. The lower the quality, the smaller the image file, but the more the decompressed image will differ from the original.

The JPEG File Interchange Format (JFIF) is a minimal file format that enables JPEG bitstreams to be exchanged between a wide variety of platforms and applications. This minimal format does not include any of the advanced features found in the TIFF JPEG specification or any application-specific file format. The sole purpose of this simplified format is to allow the exchange of JPEG compressed images.

The JFIF features are:

  • Uses the JPEG baseline image compression algorithm

  • Uses JPEG interchange format compressed image representation

  • Compatible with most platforms (PC, Mac, or Unix)

  • Standard color space: one or three components. For three components, YC~b~C~r~ (CCIR 601-256 levels)

  • APP0 marker used to specify Units, x pixel density, y pixel density, and thumbnail. The APP0 marker may also be used to specify JFIF extensions and application-specific information

The APP0 marker is used to identify a JFIF file. The marker provides information that is missing from the JPEG stream, such as version number, x and y pixel density (dots per inch or dots per cm.), pixel aspect ratio (derived from x and y pixel density), and thumbnail.

11.5.3.1 JPEG Encoding Overview

Java Advanced Imaging uses the JPEG baseline DCT coding process, shown in Figure 11-2.

For encoding, the image array is divided into 8 x 8 pixel blocks and a discrete cosine transform (DCT) is taken of each block, resulting in an 8 x 8 array of transform coefficients. The DCT is a mathematical operation that takes the block of image samples as its input and converts the information from the spatial domain to the frequency domain. The 8 x 8 matrix input to the DCT represents brightness levels at specific x, y coordinates. The resulting 8 x 8 matrix values represent relative amounts of the 64 spatial frequencies that make up the spectrum of the input data.

The next stage in the encoder quantizes the transform coefficients by dividing each DCT coefficient by a value from a quantization table. The quantization operation discards the smaller-valued frequency components, leaving only the larger-valued components.

After an image block has been quantized, it enters the entropy encoder, which creates the actual JPEG bitstream. The entropy encoder assigns a binary Huffman code to coefficient values. The length of each code is chosen to be inversely proportional to the expected probability of occurrence of a coefficient amplitude - frequently-occurring coefficient values get short code words, seldom-occurring coefficient values get long code words. The entropy encoder uses two tables, one for the AC frequency components and one for the DC frequency components.

The JPEG decoding process is essentially the inverse of the encoding process. The compressed image array data stream passes through the entropy decoder, which recreates the quantized coefficient values. Then, the quantized coefficients are reconstructed by multiplication with the quantizer table values. Finally, an inverse DCT is performed and the reconstructed image array is produced.

11.5.3.2 JPEG Coding Parameters

The following are the parameters that may be specified for JPEG DCT compression.

Quantization Table

The setQTable and getQTable methods are used to specify and retrieve the quantization table that will be used in encoding a particular band of the image. There are, by default, two quantizer tables:

| Table | Band |
|-------|------|
| 0 | Band 0 |
| 1 | All other bands |

The parameter tableNum is usually a value between 0 and 3. This value indicates which of four quantization tables you are specifying. Table 0 is designed to be used with the luminance band of eight-bit YCC images. Table 1 is designed to be used with the chrominance bands of eight-bit YCC images. Tables 2 and 3 are not normally used.

API: org.eclipse.imagen.codec.JPEGEncodeParam

void setQTable(int tableNum, 
com.sun.image.codec.jpeg.JPEGQTable qTable)

: sets the quantization table at index tableNum.

com.sun.image.codec.jpeg.JPEGQTable getQTable(int tableNum)

: returns the quantization table at index tableNum.

void setQTableComponentMapping(int component, int table)

: assigns the quantization table with index table to the specified image component (band).

com.sun.image.codec.jpeg.JPEGQTable getQTableForComponent(int 
component)

: returns the quantization table used for the specified component.

int getQTableComponentMapping(int component)

: returns the index of the quantization table assigned to the specified component.
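
An illustrative sketch only: the following installs a custom quantization table in slot 0 and maps band 0 to it. The 64-entry int[] constructor for com.sun.image.codec.jpeg.JPEGQTable is an assumption (it is not documented in this guide), and the JPEGEncodeParam instance is supplied by the caller.

     import com.sun.image.codec.jpeg.JPEGQTable;
     import org.eclipse.imagen.codec.JPEGEncodeParam;

     public class QTableExample {
         // Installs a flat quantization table (all entries 16) for band 0.
         public static void useCustomLuminanceTable(JPEGEncodeParam param) {
             int[] entries = new int[64];
             java.util.Arrays.fill(entries, 16);          // mild, uniform quantization

             JPEGQTable qTable = new JPEGQTable(entries); // assumed constructor
             param.setQTable(0, qTable);                  // install in table slot 0
             param.setQTableComponentMapping(0, 0);       // band 0 uses table 0
         }
     }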

AC Huffman Table

Parameters enable you to associate a particular AC Huffman table with a particular band of the image. The JPEG compressor supports four tables for the AC discrete cosine transform coefficients, as listed in Table 11-3.

| Table | Band |
|-------|------|
| 0 | Encodes AC coefficients in band 0 (luminance band of eight-bit YCC images). This table contains the values specified in Table K.5 of the ISO JPEG specification. |
| 1 | Encodes AC coefficients of all other bands (chrominance bands of eight-bit YCC images). This table contains the values specified in Table K.6 of the ISO JPEG specification. |
| 2 | Not used. |
| 3 | Not used. |

Table 11-3 AC Huffman Tables

API: org.eclipse.imagen.codec.JPEGEncodeParam

void setACHuffmanTable(int tableNum, 
com.sun.image.codec.jpeg.JPEGHuffmanTable huffTable)

: sets the AC Huffman table at index tableNum.

com.sun.image.codec.jpeg.JPEGHuffmanTable 
getACHuffmanTable(int tableNum)

: returns the AC Huffman table at index tableNum.

void setACHuffmanComponentMapping(int component, int table)

: assigns the AC Huffman table with index table to the specified component.

int getACHuffmanComponentMapping(int component)

: returns the index of the AC Huffman table assigned to the specified component.

DC Huffman Table

Parameters enable you to associate a particular DC Huffman table with a particular band of the image. The JPEG compressor supports four tables for the DC discrete cosine transform coefficients, as listed in Table 11-4.

| Table | Band |
|-------|------|
| 0 | Encodes DC coefficients in band 0 (luminance band of eight-bit YCC images). This table contains the values specified in Table K.3 of the ISO JPEG specification. |
| 1 | Encodes DC coefficients of all other bands (chrominance bands of eight-bit YCC images). This table contains the values specified in Table K.4 of the ISO JPEG specification. |
| 2 | Not used. |
| 3 | Not used. |

Table 11-4 DC Huffman Tables

API: org.eclipse.imagen.codec.JPEGEncodeParam

void setDCHuffmanTable(int tableNum, 
com.sun.image.codec.jpeg.JPEGHuffmanTable huffTable)

: sets the DC Huffman table at index tableNum.

void setDCHuffmanComponentMapping(int component, int table)

: assigns the DC Huffman table with index table to the specified component.

com.sun.image.codec.jpeg.JPEGHuffmanTable 
getDCHuffmanTable(int tableNum)

: returns the DC Huffman table at index tableNum.

com.sun.image.codec.jpeg.JPEGHuffmanTable 
getDCHuffmanTableForComponent(int component)

: returns the DC Huffman table used for the specified component.

int getDCHuffmanComponentMapping(int component)

: returns the index of the DC Huffman table assigned to the specified component.

Horizontal and Vertical Subsampling

Proper image post-processing and accurate presentation of JFIF files require specification of the spatial positioning of pixel samples within components relative to the samples of other components. This is known as subsampling.

In JFIF files, the position of the pixels in subsampled components is defined with respect to the highest resolution component. Since components must be sampled orthogonally (along rows and columns), the spatial position of the samples in a given subsampled component may be determined by specifying the horizontal and vertical offsets of the first sample, i.e., the sample in the upper left corner, with respect to the highest resolution component.

The horizontal and vertical offsets of the first sample in a subsampled component, Xoffset~i~[0,0] and Yoffset~i~[0,0], are defined as:

Xoffset~i~[0,0] = (Nsamples~ref~ / Nsamples~i~) / 2 - 0.5\
Yoffset~i~[0,0] = (Nlines~ref~ / Nlines~i~) / 2 - 0.5

where

Nsamples~ref~ is the number of samples per line in the largest component\
Nsamples~i~ is the number of samples per line in the ith component\
Nlines~ref~ is the number of lines in the largest component\
Nlines~i~ is the number of lines in the ith component

For example, for a component subsampled 2:1 in both directions, each ratio is 2, so the offsets are 2/2 - 0.5 = 0.5, i.e., the first subsampled sample is centered between the first two samples of the highest resolution component.

Proper subsampling of components incorporates an antialiasing filter that reduces the spectral bandwidth of the full resolution components. Subsampling can easily be accomplished using a symmetrical digital filter with an even number of taps (coefficients). A commonly used filter for 2:1 subsampling uses two taps (1/2, 1/2).

API: org.eclipse.imagen.codec.JPEGEncodeParam

void setHorizontalSubsampling(int component, int subsample)

: sets the horizontal subsampling value (number of samples per line).

void setVerticalSubsampling(int component, int subsample)

: sets the vertical subsampling value (number of lines).

int getHorizontalSubsampling(int component)

: returns the horizontal subsampling value.

int getVerticalSubsampling(int component)

: returns the vertical subsampling value.
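
As a sketch of a typical 4:2:0 layout for a three-band YCC image, the following treats the subsample argument as a per-component subsampling factor (an assumption based on the method names): the luminance band is left at full resolution and the two chrominance bands are subsampled by 2 in each direction.

     import org.eclipse.imagen.codec.JPEGEncodeParam;

     public class SubsamplingExample {
         // Configures 4:2:0 subsampling on an existing JPEGEncodeParam.
         public static void configure420(JPEGEncodeParam param) {
             param.setHorizontalSubsampling(0, 1);   // band 0 (luminance): no subsampling
             param.setVerticalSubsampling(0, 1);
             for (int band = 1; band <= 2; band++) { // bands 1 and 2 (chrominance)
                 param.setHorizontalSubsampling(band, 2);
                 param.setVerticalSubsampling(band, 2);
             }
         }
     }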

Marker Data

The JPEG APP0 marker is used to specify units, x pixel density, y pixel density, and thumbnail. The APP0 marker may also be used to specify JFIF extensions and application-specific information. The APP0 marker syntax is defined in Annex B of ISO DIS 10918-1. In addition, a JFIF file uses APP0 marker segments and constrains certain parameters in the frame header as defined below.

     X'FF', SOI
     X'FF', APP0, length, identifier, version, units, XDensity,
         YDensity, Xthumbnail, Ythumbnail, (RGB)n

Table 11-5 describes the marker parameters.

| Parameter | Size | Description |
|-----------|------|-------------|
| length | 2 bytes | Total APP0 field byte count, including the byte count value (2 bytes), but excluding the APP0 marker itself. |
| identifier | 5 bytes | X'4A', X'46', X'49', X'46', X'00': a zero-terminated string ("JFIF") that uniquely identifies this APP0 marker. |
| version | 2 bytes | X'0102': the most significant byte is used for major revisions, the least significant byte for minor revisions. Version 1.02 is the current released revision. |
| DensityUnit | 1 byte | Units for the x and y densities: 0 = no units, x and y specify the pixel aspect ratio; 1 = x and y are dots per inch; 2 = x and y are dots per cm. |
| XDensity | 2 bytes | The horizontal pixel density. |
| YDensity | 2 bytes | The vertical pixel density. |
| Xthumbnail | 1 byte | The thumbnail horizontal pixel count. |
| Ythumbnail | 1 byte | The thumbnail vertical pixel count. |
| (RGB)n | 3n bytes | Packed (24-bit) RGB values for the thumbnail pixels, n = Xthumbnail * Ythumbnail. |

Table 11-5 Marker Data

API: org.eclipse.imagen.codec.JPEGEncodeParam

void setMarkerData(int marker, byte[][] data)

: sets the marker data.

void addMarkerData(int marker, byte[] data)

: adds data to the marker.

boolean getMarker(int marker)

: returns true if data is present for the specified marker.

byte[][] getMarkerData(int marker)

: returns the data associated with the specified marker.

Density

You can set density parameters, such as the units (inches or centimeters) for the pixel densities and the pixel density for both the horizontal and vertical dimensions.

The setDensityUnit method sets the unit of measure for the x and y values. The following values are legal for the setDensityUnit method:

| Value | Meaning |
|-------|---------|
| 0 | No units; x and y specify the pixel aspect ratio |
| 1 | x and y are dots per inch |
| 2 | x and y are dots per cm |

The setXDensity and setYDensity methods set the horizontal and vertical pixel density, respectively.

To specify a pixel aspect ratio, use the setDensityUnit method with a value of 0 (no units) and set XDensity and YDensity for the desired aspect ratio. For example, set XDensity = 1 and YDensity = 1 to specify a 1:1 aspect ratio. The values for XDensity and YDensity should always be non-zero.
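
For example, the following sketch sets the density values described above on an existing JPEGEncodeParam; the parameter object itself is assumed to have been created elsewhere.

     import org.eclipse.imagen.codec.JPEGEncodeParam;

     public class DensityExample {
         // Tags the encoded image with an absolute density, in dots per inch.
         public static void setDotsPerInch(JPEGEncodeParam param, int dpi) {
             param.setDensityUnit(1);   // 1 = dots per inch
             param.setXDensity(dpi);
             param.setYDensity(dpi);
         }

         // Specifies a square (1:1) pixel aspect ratio with no absolute units.
         public static void setSquarePixels(JPEGEncodeParam param) {
             param.setDensityUnit(0);   // 0 = no units; x and y give the aspect ratio
             param.setXDensity(1);
             param.setYDensity(1);
         }
     }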

API: org.eclipse.imagen.codec.JPEGEncodeParam

void setDensityUnit(int unit)

: sets the unit of measure for the x and y values.

void setXDensity(int density)

: sets the horizontal pixel density.

void setYDensity(int density)

: sets the vertical pixel density.

int getDensityUnit()

: returns the unit-of-measure setting.

int getXDensity()

: returns the horizontal pixel density.

int getYDensity()

: returns the vertical pixel density.

Compression Quality

Compression quality specifies a factor that relates to the desired tradeoff between image quality and the image data compression ratio. The quality value is a float between 1 and 100. A setting of 100 produces the highest quality image at a lower compression ratio. A setting of 1 produces the highest compression ratio, with a sacrifice to image quality. The quality value is typically set to 50.

The compression quality value controls image quality and compression ratio by determining a scale factor the encoder will use in creating scaled versions of the quantization tables. A quality value of 50 defines a scale factor of 1, which means that the scaled quantization tables are identical to the original tables. A quality value of less than 50 produces a scale factor greater than 1, and a quality value greater than 50 produces a scale factor less than 1.


Note: The values stored in the quantization table also affect image quality and compression ratio. See also "Quantization Table."


API: org.eclipse.imagen.codec.JPEGEncodeParam

void setQuality(float quality, boolean forceBaseline)

: sets the compression quality factor. If forceBaseline is true, the scaled quantization table entries are limited to the range allowed by the baseline JPEG process.
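
A minimal sketch of quality-controlled JPEG encoding, assuming JPEGEncodeParam has a public no-argument constructor (only methods documented in this section are used); the image and file name are supplied by the caller.

     import java.awt.image.RenderedImage;
     import java.io.FileOutputStream;
     import java.io.IOException;

     import org.eclipse.imagen.codec.ImageCodec;
     import org.eclipse.imagen.codec.ImageEncoder;
     import org.eclipse.imagen.codec.JPEGEncodeParam;

     public class JPEGQualityExample {
         public static void encodeJPEG(RenderedImage im, String filename,
                                       float quality) throws IOException {
             // Quality ranges from 1 (smallest file) to 100 (best fidelity);
             // 50 is the typical setting discussed above.
             JPEGEncodeParam param = new JPEGEncodeParam();
             param.setQuality(quality, true);  // true = stay within baseline limits

             FileOutputStream dst = new FileOutputStream(filename);
             ImageEncoder enc = ImageCodec.createImageEncoder("JPEG", dst, param);
             enc.encode(im);
             dst.close();
         }
     }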

Miscellaneous

The JPEGEncodeParam class also provides the following methods for querying and controlling additional properties of the encoded stream.

API: org.eclipse.imagen.codec.JPEGEncodeParam

void setImageInfoValid(boolean flag)

: specifies whether the image information (frame and scan headers) is to be written to the output stream.

void setTableInfoValid(boolean flag)

: specifies whether the table information (quantization and Huffman tables) is to be written to the output stream.

void setRestartInterval(int restartInterval)

: sets the restart interval, in MCUs; a value of 0 disables restart markers.

int getWidth()

: returns the image width, in pixels.

int getHeight()

: returns the image height, in pixels.

boolean isImageInfoValid()

: returns true if the image information is to be written to the output stream.

boolean isTableInfoValid()

: returns true if the table information is to be written to the output stream.

int getEncodedColorID()

: returns the color ID of the encoded image data.

int getNumComponents()

: returns the number of components (bands) in the image.

int getRestartInterval()

: returns the restart interval.

11.5.4 PNG Coding

PNG (Portable Network Graphics) is an extensible file format for the lossless, portable, compressed storage of raster images. PNG was developed as a patent-free alternative to GIF and can also replace many common uses of TIFF. Indexed-color, grayscale, and truecolor images are supported, plus an optional alpha channel. Sample depths range from 1 to 16 bits.

11.5.4.1 PNG Hints

Table 11-6 lists the PNG hints.

| Hint Key | Class | Default Value |
|----------|-------|---------------|
| KEY_PNG_EMIT_ALPHA | Boolean | Boolean.FALSE |
| KEY_PNG_EMIT_16BITS | Boolean | Boolean.FALSE |
| KEY_PNG_EMIT_SQUARE_PIXELS | Boolean | Boolean.FALSE |

Table 11-6 PNG Hints

PNG images stored with a bit depth of 16 are truncated to eight bits of output unless the KEY_PNG_EMIT_16BITS hint is set to Boolean.TRUE. Similarly, the output image will not have an alpha channel unless the KEY_PNG_EMIT_ALPHA hint is set.

Normally no correction is made for images with non-square pixels. To automatically perform a simple scale to correct for such distortion, use the KEY_PNG_EMIT_SQUARE_PIXELS hint. The correct aspect ratio may be retrieved by means of the pixel_aspect_ratio property.


11.5.5 PNM Coding

The PNM format is one of the extensions of the PBM file format (PBM, PGM, and PPM). The portable bitmap format is a lowest-common-denominator monochrome file format. It was originally designed to make it reasonable to mail bitmaps between different types of machines. It now serves as the common language of a large family of bitmap conversion filters.

The PNM format comes in six variants:

  • PBM ASCII - single-banded (bilevel) images

  • PBM raw - single-banded (bilevel) images

  • PGM ASCII - single-banded (grayscale) images

  • PGM raw - single-banded (grayscale) images

  • PPM ASCII - three-banded (color) images

  • PPM raw - three-banded (color) images

API: org.eclipse.imagen.codec.PNMEncodeParam

void setRaw(boolean raw)

: sets the RAWBITS option flag.

boolean getRaw()

: retrieves the RAWBITS option flag.
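
A short sketch of writing a raw-format PNM file, assuming PNMEncodeParam has a no-argument constructor; the image and file name are supplied by the caller.

     import java.awt.image.RenderedImage;
     import java.io.FileOutputStream;
     import java.io.IOException;

     import org.eclipse.imagen.codec.ImageCodec;
     import org.eclipse.imagen.codec.ImageEncoder;
     import org.eclipse.imagen.codec.PNMEncodeParam;

     public class PNMEncodeExample {
         public static void encodeAsPNM(RenderedImage im, String filename)
                 throws IOException {
             // Select the raw (binary) variant rather than the ASCII variant.
             PNMEncodeParam param = new PNMEncodeParam();
             param.setRaw(true);

             FileOutputStream dst = new FileOutputStream(filename);
             ImageEncoder enc = ImageCodec.createImageEncoder("PNM", dst, param);
             enc.encode(im);
             dst.close();
         }
     }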

11.5.6 TIFF Coding

Note: TIFF image compression and decompression is not yet implemented.

A TIFF image file consists of several entries, each of which has a tag and some associated data. The tag indicates the purpose of the associated data. The tags and data define the image's format, size, and other parameters, including the image data itself.

As shown in Figure 11-3, a TIFF image file is divided into a header, one or more Image File Directories (IFD), IFD entries, and data.

The file header consists of eight bytes of data, containing the following information:

  • Byte-order indicator - two bytes that specify whether the image data is ordered by least-significant byte first (as used in Intel processors), or most-significant byte first (as used in Sun SPARC and Motorola processors). The indicator is II for Intel platforms, MM for Sun SPARC and Motorola processors. The byte-order indicator is what makes the image file portable between platforms.

  • Version number - two bytes that indicate the TIFF version number (not to be confused with the revision number, which changes with each release of the TIFF standard).

  • IFD address - four bytes that specify the address of the first IFD relative to the beginning of the file. A TIFF image file generally includes a single IFD, but may contain more.

The Image file directory contains the following:

  • Number of entries - two bytes that indicate the number of entries in the IFD.

  • Directory entries - one or more IFD entries, which consist of the following:

  • Tag number - a tag that indicates the meaning of the following data. The Java Advanced Imaging TIFF reader reads all the tags listed in Table 11-7.

  • Field type - defines the type of data. Java Advanced Imaging supports all 12 of the TIFF 6.0 standard data types listed in Table 11-8.

  • Field length - four bytes that indicate the number of data items in the field data.

  • Data address - four bytes that contain the address of the entry field data, relative to the start of the file.

11.5.6.1 Image Tags

The tags listed in Table 11-7 define the parameters and data of the image stored in the TIFF file. The TIFF_ImageWidth, TIFF_ImageLength, and TIFF_BitsPerSample tags define the number of pixels in the image, the number of lines, and the pixel depth, respectively. The TIFF_Compression tag defines the data compression algorithm used to store the image, such as run-length coding, Lempel-Ziv-Welch (LZW), or JPEG.

| Tag Name | Tag Number |
|----------|------------|
| TIFF_NewSubfileType | 254 |
| TIFF_SubfileType | 255 |
| TIFF_ImageWidth | 256 |
| TIFF_ImageLength | 257 |
| TIFF_BitsPerSample | 258 |
| TIFF_Compression | 259 |
| TIFF_PhotometricInterpretation | 262 |
| TIFF_Thresholding | 263 |
| TIFF_CellWidth | 264 |
| TIFF_CellLength | 265 |
| TIFF_FillOrder | 266 |
| TIFF_DocumentName | 269 |
| TIFF_ImageDescription | 270 |
| TIFF_Make | 271 |
| TIFF_Model | 272 |
| TIFF_StripOffsets | 273 |
| TIFF_Orientation | 274 |
| TIFF_SamplesPerPixel | 277 |
| TIFF_RowsPerStrip | 278 |
| TIFF_StripByteCounts | 279 |
| TIFF_MinSampleValue | 280 |
| TIFF_MaxSampleValue | 281 |
| TIFF_XResolution | 282 |
| TIFF_YResolution | 283 |
| TIFF_PlanarConfiguration | 284 |
| TIFF_PageName | 285 |
| TIFF_XPosition | 286 |
| TIFF_YPosition | 287 |
| TIFF_FreeOffsets | 288 |
| TIFF_FreeByteCounts | 289 |
| TIFF_GrayResponseUnit | 290 |
| TIFF_GrayResponseCurve | 291 |
| TIFF_T4Options | 292 |
| TIFF_T6Options | 293 |
| TIFF_ResolutionUnit | 296 |
| TIFF_PageNumber | 297 |
| TIFF_TransferFunction | 301 |
| TIFF_Software | 305 |
| TIFF_DateTime | 306 |
| TIFF_Artist | 315 |
| TIFF_HostComputer | 316 |
| TIFF_Predictor | 317 |
| TIFF_WhitePoint | 318 |
| TIFF_PrimaryChromaticities | 319 |
| TIFF_ColorMap | 320 |
| TIFF_HalftoneHints | 321 |
| TIFF_TileWidth | 322 |
| TIFF_TileLength | 323 |
| TIFF_TileOffsets | 324 |
| TIFF_TileByteCounts | 325 |
| TIFF_InkSet | 332 |
| TIFF_InkNames | 333 |
| TIFF_NumberOfInks | 334 |
| TIFF_DotRange | 336 |
| TIFF_TargetPrinter | 337 |
| TIFF_ExtraSamples | 338 |
| TIFF_SampleFormat | 339 |
| TIFF_SMinSampleValue | 340 |
| TIFF_SMaxSampleValue | 341 |
| TIFF_TransferRange | 342 |
| TIFF_JPEGProc | 512 |
| TIFF_JPEGInterchangeFormat | 513 |
| TIFF_JPEGInterchangeFormatLngth | 514 |
| TIFF_JPEGRestartInterval | 515 |
| TIFF_JPEGLosslessPredictors | 517 |
| TIFF_JPEGPointTransforms | 518 |
| TIFF_QTables | 519 |
| TIFF_DCTables | 520 |
| TIFF_ACTables | 521 |
| TIFF_YCbCrCoefficients | 529 |
| TIFF_YCbCrSubSampling | 530 |
| TIFF_YCbCrPositioning | 531 |
| TIFF_ReferenceBlackWhite | 532 |
| TIFF_Copyright | 33432 |
| TIFF_ModelPixelScaleTag | 33550 |
| TIFF_ModelTransformationTag | 33920 |
| TIFF_ModelTiepointTag | 33922 |
| TIFF_GeoKeyDirectoryTag | 34735 |
| TIFF_GeoDoubleParamsTag | 34736 |
| TIFF_GeoAsciiParamsTag | 34737 |

Table 11-7 Standard TIFF Image Tags

The TIFF_PhotometricInterpretation tag defines the image type. For example, for a bilevel or grayscale image, whether the image is quantized with black = 0 or with white = 0. For color images, this tag identifies the image as RGB full color, palette color, or transparency mask.

TIFF extensions, such as GeoTIFF (see the GeoTIFF Web Page at http://home.earthlink.net/~ritter/geotiff/geotiff.html), operate by defining additional private tags, usually referenced by a small number of globally-registered TIFF tags. Java Advanced Imaging allows such tags to be decoded into first-class tags by adding suitable PropertyGenerator objects.

11.5.6.2 Data Types

Java Advanced Imaging supports all of the TIFF 6.0 standard data types listed in Table 11-8.

| TIFF Data Type | Java Data Type | Description |
|----------------|----------------|-------------|
| TIFF_ASCII | byte | Null-terminated ASCII strings |
| TIFF_BYTE | byte | 8-bit unsigned integers |
| TIFF_SBYTE | byte | 8-bit signed integers |
| TIFF_SHORT | char | 16-bit unsigned integers |
| TIFF_SSHORT | short | 16-bit signed integers |
| TIFF_LONG | long | 32-bit unsigned integers |
| TIFF_SLONG | int | 32-bit signed integers |
| TIFF_FLOAT | float | 32-bit IEEE floating-point values |
| TIFF_DOUBLE | double | 64-bit IEEE double-precision floating-point values |
| TIFF_RATIONAL | long[2] | Pairs of 32-bit unsigned integers |
| TIFF_SRATIONAL | int[2] | Pairs of 32-bit signed integers |
| TIFF_UNDEFINED | byte | 8-bit uninterpreted bytes |

Table 11-8 TIFF Data Types

11.6 Transform Coding

Transform coding is a form of lossy block coding that transforms blocks of the image from the spatial domain to the frequency domain. The fundamental frequency component values of the frequency domain image are the codes that are stored as the compressed image.

Java Advanced Imaging supports a form of transform coding known as the discrete cosine transform (DCT). The DCT is one of several frequency transforms that may be used in transform image compression. One of the things that makes the DCT unique is that it works well for a wide range of arbitrary image types. It produces the fewest frequency components and, thus, the smallest compressed image file size.

The DCT is similar to the discrete Fourier transform (see Section 6.6.1, "Discrete Fourier Transform"). However, the DCT is better at compactly representing very small images. The drawback is that the DCT requires more computational power than the discrete Fourier transform.

The DCT operation breaks the source image into pixel blocks of 8 x 8 pixels. The discrete cosine transform then is performed on each block, transforming each block into frequency-domain representation. Minimally-valued frequency components are discarded, leaving only those components that contribute significantly to the image. The DCT image compression causes some image degradation that increases proportionally with increased compression ratios. Typically, DCT provides a compression ratio of 10:1 without serious image quality degradation.

Like the discrete Fourier transform (DFT), the DCT also has an inverse operation, the inverse discrete cosine transform (IDCT).

11.6.1 Discrete Cosine Transform (DCT)

The DCT operation computes the even discrete cosine transform of an image. Each band of the destination image is derived by performing a two-dimensional DCT on the corresponding band of the source image.

The DCT operation does not take any parameters.

API: org.eclipse.imagen.JAI

static RenderedOp create("dct", RenderedImage im)

: creates a RenderedOp as a result of performing a DCT operation on the specified image.

static RenderableOp createRenderable("dct", 
ParameterBlock paramBlock, RenderingHints hints)

: creates a RenderableImage as the result of applying the DCT operation to a tuple (source, parameters, hints).

11.6.2 Inverse Discrete Cosine Transform (IDCT)

The IDCT operation computes the inverse even discrete cosine transform of an image. Each band of the destination image is derived by performing a two-dimensional inverse DCT on the corresponding band of the source image.

The IDCT operation does not take any parameters.

API: org.eclipse.imagen.JAI

static RenderedOp create("idct", RenderedImage im)

: creates a RenderedOp as a result of performing an IDCT operation on the specified image.

static RenderableOp createRenderable("idct", 
ParameterBlock paramBlock, RenderingHints hints)

: creates a RenderableImage as the result of applying the IDCT operation to a tuple (source, parameters, hints).
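
A brief sketch of a DCT/IDCT round trip using the ParameterBlock form of JAI.create, assuming the JAI convenience class and RenderedOp live in org.eclipse.imagen as elsewhere in this guide; the source image is loaded elsewhere.

     import java.awt.image.RenderedImage;
     import java.awt.image.renderable.ParameterBlock;

     import org.eclipse.imagen.JAI;
     import org.eclipse.imagen.RenderedOp;

     public class DCTExample {
         public static RenderedOp roundTrip(RenderedImage source) {
             // Forward transform: spatial domain to frequency domain.
             ParameterBlock dctPB = new ParameterBlock();
             dctPB.addSource(source);
             RenderedOp dct = JAI.create("dct", dctPB);

             // Inverse transform: back to the spatial domain.
             ParameterBlock idctPB = new ParameterBlock();
             idctPB.addSource(dct);
             return JAI.create("idct", idctPB);
         }
     }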