AV Prototype Project Working Documents

Data Dictionary for Administrative Metadata
for Audio, Image, Text, and Video Content

To Support the Revision of Extension Schemas for METS, March 2002


This data dictionary lists the principal elements and attributes proposed as specialized administrative metadata for audio, images (and things that are the sources for images, e.g., documents on paper), machine-readable texts (ASCII or Unicode), or video content. This data dictionary has been excerpted from the data dictionary (downloadable MS-Excel file) for the relational database used to capture the Library of Congress Audio-Visual Prototyping Project metadata for later conversion to XML. This fact may result in some infelicities in presentation, e.g., the fields are listed in alphabetical rather than logical order.
Audio metadata.
Image metadata. Also, related notes from the February 2002 meeting at Harvard.
Text metadata. Also, related notes from the February 2002 meeting at Harvard.
Video metadata.

AUDIO
analog_digi_flag Indicator of how this entity is described in the METS document instance.
audio_block_size Size of an audio block in bytes.
audio_data_encoding Structure for audio data; current known types are pulse code modulation (PCM) and SONY's DSD structure.
base_material Base material of the source sound recording, e.g., plastic, glass, vinyl, metal, wax, polycarbonate, etc. MAVIS codes exist.
binder Type of adhesive used to bind recording surface to base material, e.g., CA (cellulose acetate), PVC (polyvinyl chloride), etc. MAVIS codes exist.
bits_per_sample Number of bits per audio sample, e.g., 16, 20, 24, etc.
calibration_ext_int Indicator of whether the calibration information is contained within the file or stored externally.
calibration_location Temporary location of the calibration file if it is external, e.g., a URL.
calibration_type Type of calibration, e.g., the ITU test sequences known as CCITT 0.33.00 (mono) and CCITT 0.33.01 (stereo).
codec_creator_app The name of the application used to apply the codec, e.g., SoundForge.
codec_creator_app_version The version of the application used to apply the codec, e.g., 5.2.
codec_name The name and version (or subtype) of the compression algorithm used, e.g., Fraunhofer xyz ["xyz" is a placeholder for versioning information].
codec_quality Indication whether the codec is lossy or lossless.
condition Narrative description of the physical condition of the item, e.g., brittle paper, sticky shed tape, chipped disc, etc.
data_rate Data rate of the audio in an MP3 or other compressed file, expressed in kbps, e.g., 64, 128, 256, etc.
data_rate_mode Indicator whether the data rate is fixed or variable.
dimensions_depth Depth of the object in the unit of measure indicated in dimensions_unit, e.g., 7, 12.
dimensions_diameter Diameter of any circular object expressed in the unit of measure indicated in dimensions_unit, e.g., 3.5, 5, 7.
dimensions_height Height of the object in the unit of measure indicated in dimensions_unit, e.g., 23.
dimensions_note Description of odd-shaped objects that cannot be described using the standard dimensions fields.
dimensions_unit Unit of measurement of the source object, e.g., inches.
dimensions_width Width of any non-circular object expressed in the unit of measure indicated in dimensions_unit, e.g., 3.5, 5, 7.
disposition What became of the source item, e.g., reshelved on shelf number 1234, discarded, loaned to XYZ organization, destroyed, etc.
disc_surface Information regarding the disc surface of the analog recording, e.g., aluminium, celluloid, etc. MAVIS codes exist.
duration Elapsed time of the entire file, expressed using ISO 8601 syntax; see http://www.w3.org/TR/NOTE-datetime.
equalization Equalization system inherent in source recording, e.g., FFRR (Full Frequency Range Recording), RIAA (Recording Industry Association Of America), etc. MAVIS codes exist.
first_sample_offset Location of the first valid sound byte in the file.
first_valid_byte_block Location of the first valid sound byte in the block.
gauge Gauge or width of source tape, including indication of unit of measure, e.g., 8 mm, 0.5 inch, 0.25 inch, etc. MAVIS codes exist.
generation Generation of physical source item which was digitized, e.g., studio master, original disc, preservation tape copy, etc. MAVIS codes exist.
groove Groove type of audio source, e.g., standard groove, hill and dale cutting, microgroove, etc. MAVIS codes exist.
last_valid_byte_block Location of the last valid sound byte in the block.
length Length of source open-reel tape recording, including indication of unit of measure, e.g., 700 feet, 1200 feet, etc.
noise_reduction Noise reduction system inherent in source recording, e.g., DA (Dolby A), DB (Dolby S), FL (Flat), etc. MAVIS codes exist.
note Additional information about the audio source item.
num_channels Number of audio channels, e.g., 1, 2, 4, 5, etc.
num_sample_frames The number of frames within an audio file.
oxide Type of oxide used for the coating of a tape recording, e.g., cobalt modified, chromium dioxide, etc. MAVIS codes exist.
phys_format Name for the physical format of the source, e.g., Record, Audiocassette, etc.
reflective_layer The substrate of an optical disk.
sampling_frequency Rate at which the audio was sampled, expressed in kHz, e.g., 22, 44.1, 48, 96, etc.
sound_channel_map Information about the channel configuration, e.g., mapping the audio channels to their intended aural positions/loudspeakers. The values represent parseable compound metadata using commas as separators, e.g., 1=left_front, 2=right_front, 3=center, 4=left_rear, 5=right_rear; or 1=French_narration, 2=Spanish_narration, 3=English_narration, 4=synch_sound, 5=music.
sound_field Indicates aural space arrangement of the sound recording, e.g., monaural, stereo, joint stereo, surround sound DTS 5.1, etc. MAVIS codes exist.
speed Nominal speed of the source recording, including indication of the unit of measure, e.g., 1 7/8 ips, 4.75 cm/s, 15 ips, 78 rpm, 45 rpm, etc. MAVIS codes exist.
speed_adjustment Speed actually used at playback expressed as a percentage of nominal speed, e.g., 90 or 110.
speed_note Note to state actual speed as a number (73 rpm) or to state a deviation such as "corrected to C sharp pitch."
stock_brand Manufacturer and stock number for source recording, e.g., Scotch 208, Ampex 407, Webco steel wire, Quik-cut 3215 acetate disc, etc. MAVIS codes exist.
tape_thickness The thickness of a tape.
temp_dimensions_note Temporary field for use in upgrading LC legacy data pertaining to the dimension fields of the legacy SOURCE table.
time_stamp Exact location of calibration tones within a file.
track_format Track format of a magnetic tape, e.g., full track, half track, quarter track stereo, video HiFi, etc. MAVIS codes exist.
tracking_type The type of tracking code, e.g., MAVIS number, actual shelf numbers, bar-code, etc.
tracking_value Shelf number or other identifier for source, e.g., MAVIS number, actual shelf numbers, etc.
word_size Number of bytes that comprise a single sample of audio data, which generally maps to bits_per_sample. Files with a bit depth of 24 will usually be expressed as a 3-byte word_size; however, some applications may store 24-bit audio in a 4-byte word.
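
Two of the audio fields above lend themselves to a short illustration: sound_channel_map is described as parseable compound metadata with commas as separators, and word_size is described as generally following from bits_per_sample. The Python sketch below is only one possible reading of those descriptions. The function names parse_sound_channel_map and minimum_word_size are hypothetical and are not part of the data dictionary, and the parser assumes every entry has the form number=label.

```python
from typing import Dict


def parse_sound_channel_map(value: str) -> Dict[int, str]:
    """Split a sound_channel_map value into {channel number: label}.

    Hypothetical helper: assumes each comma-separated entry looks like
    '1=left_front'. Anything else raises ValueError.
    """
    channel_map: Dict[int, str] = {}
    for entry in value.split(","):
        entry = entry.strip()
        if not entry:
            continue  # tolerate a trailing comma
        number, separator, label = entry.partition("=")
        number, label = number.strip(), label.strip()
        if not separator or not number.isdigit() or not label:
            raise ValueError(f"malformed channel entry: {entry!r}")
        channel_map[int(number)] = label
    return channel_map


def minimum_word_size(bits_per_sample: int) -> int:
    """Smallest whole number of bytes that can hold one sample.

    A recorded word_size may be larger, e.g., 24-bit audio stored
    in a 4-byte word.
    """
    return (bits_per_sample + 7) // 8


surround = parse_sound_channel_map(
    "1=left_front, 2=right_front, 3=center, 4=left_rear, 5=right_rear"
)
print(surround[3])            # center
print(minimum_word_size(24))  # 3
```

A word_size greater than the value returned by minimum_word_size is not an error; it only signals padding of the kind described in the word_size entry.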
IMAGE
analog_digi_flag Indicator of how this entity is described in the METS document instance.
bits_per_sample Bit depth at each pixel, e.g., 8, 16, 24, etc.
chromaticities_primary Chromaticities of the primary colors of the image.
chromaticities_white_point White point chromaticity of the effective illumination source.
color_map_location The location of the color map, e.g., image file, auxiliary file, associated file, metadata.
color_map_value Color lookup table for non-RGB images.
compression Designates the compression scheme used to store the image data. The NISO-image term set is based on TIFF 6.0, and examples include Uncompressed, CCITT Group 3, CCITT Group 4, LZW, JPEG, PackBits, and None.
condition Narrative description of the physical condition of the item, e.g., brittle paper, foxing, staining, etc.
dimensions_depth Depth of the object expressed in the unit of measure indicated in dimensions_unit, e.g., 4, 12.
dimensions_diameter Diameter of any circular object expressed in the unit of measure indicated in dimensions_unit, e.g., 3.5, 5, 7.
dimensions_height Height of the object expressed in the unit of measure indicated in dimensions_unit, e.g., 6, 12, 24.
dimensions_note Description of odd-shaped objects that cannot be described using the standard dimensions fields.
dimensions_unit Unit of measurement of the source object, e.g., inches.
dimensions_width Width of any non-circular object expressed in the unit of measure indicated in dimensions_unit, e.g., 3.5, 5, 7.
disposition What became of the source item, e.g., reshelved on shelf number 1234, discarded, loaned to XYZ organization, destroyed, etc.
extra_samples Specifies that each pixel has extra components.
generation Generation of physical source item which was digitized, e.g., photostat copy, slide copy, film interpositive from negative, etc.
gray_response_loc The location of the gray response curve values, e.g., image file, auxiliary file, associated file, metadata.
gray_response_unit Unit of measure of gray response curve values.
gray_response_value Gray response curve values, used when a response curve is generated at the start of a capture session.
image_data Temporary location of the target file if not contained within the image file (external), e.g., URN, URL, etc. See also target_id and target_type. (NISO-image terminology)
note Additional information about the image source item.
orientation_disk Orientation of the image saved on disk, e.g., normal, normal rotated 180°, normal rotated cw 90°, etc.
orientation_display Orientation of the image as displayed, e.g., landscape or portrait.
photometric_interp Designates the color space of the decompressed image data as defined in the NISO image data dictionary: 0 = White Is Zero; 1 = Black Is Zero; 2 = RGB.
phys_format Name for the physical format of the source, e.g., Oil Painting, Photograph, etc.
pixels_horizontal Horizontal dimension of image in pixels, e.g., 1024.
pixels_vertical Vertical dimension of image in pixels, e.g., 768, called ImageLength in the NISO image data dictionary.
planar Designates how the components of each pixel are stored: 1 = Chunky Format; 2 = Planar Format.
samples_per_pixel The number of color components (samples) per pixel, e.g., 3 (RGB), 4 (CMYK), etc.
sampling_freq_unit Unit of measure for horizontal and vertical sampling frequency, e.g., inches, centimeters, or unknown.
sampling_frequency_horizontal Number of pixels per sampling frequency unit by width.
sampling_frequency_plane Reference plane location for horizontal and vertical sampling frequency.
sampling_frequency_vertical Number of pixels per sampling frequency unit by height.
segment_form Specifies whether the image is stored in tiles or strips.
strip_byte_counts Number of image data bytes stored within each strip after compression.
strip_offsets Byte offset of a strip.
strip_rows Number of rows per strip.
target_id Calibration target name, manufacturer, version, etc. See also image_data and target_type. (NISO-image terminology)
target_type Conveys whether the target image is internal to the image file or external. See also image_data and target_id. (NISO-image terminology)
temp_dimensions_note Temporary field for use in upgrading LC legacy data pertaining to the dimension fields of the legacy SOURCE table.
tile_byte_counts Number of image data bytes stored within each tile after compression.
tile_height Tile height in pixels, called TileLength in the NISO image data dictionary.
tile_offsets Byte offset of a tile.
tile_width Tile width in pixels.
tracking_type The type of tracking code, e.g., MAVIS number, actual shelf numbers, bar-code, etc.
tracking_value Shelf number or other identifier for source, e.g., MAVIS number, actual shelf numbers, etc.
watermark Type of watermark used in the file, e.g., Digimarc, Giovanni, Alpha-Tec, StirMark, etc.
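
The pixel-count and sampling-frequency fields above can be combined to recover the physical extent of the scanned area. The following Python sketch shows the arithmetic only; the function name scanned_extent is hypothetical, and the 256 pixels-per-inch figure is an illustrative assumption chosen to give round numbers, not a value from the data dictionary.

```python
def scanned_extent(pixels: int, sampling_frequency: float) -> float:
    """Physical extent covered by `pixels`, in the unit named by sampling_freq_unit.

    Hypothetical helper: divides a pixel count (pixels_horizontal or
    pixels_vertical) by the matching sampling frequency
    (sampling_frequency_horizontal or sampling_frequency_vertical).
    """
    if sampling_frequency <= 0:
        raise ValueError("sampling frequency must be positive")
    return pixels / sampling_frequency


# Assumed example: a 1024 x 768 image sampled at 256 pixels per inch.
width_in = scanned_extent(1024, 256)
height_in = scanned_extent(768, 256)
print(width_in, height_in)  # 4.0 3.0
```

The result is only meaningful when sampling_freq_unit is a real unit; a value of "unknown" leaves the physical extent undetermined.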
Notes from February 2002 meeting at Harvard. (1) We may wish for more metadata to describe compression, e.g., to track the quantization table for JPEG. Some felt that it might be useful to make a parallel set to those proposed for audio compression: codecName: The name and version (or subtype) of the compression algorithm used; codecCreatorApplication: The name of the application used to apply the codec; codecCreatorApplicationVersion: The version of the application used to apply the codec; codecQuality [or type]: Indication whether the codec is lossy or lossless.
(2) We may wish for more metadata to describe file formats. The proposed audio set has these elements/attributes:
formatSpecification [may rename as formatName]: The official name for the file format taken from format documentation; formatSpecificationVersion: The version of the format specified in formatName; formatSpecificationType [flavor?]: Added application specific information.
(3) The discussion about describing a source for an image was left unresolved. LC has a placeholder here; the group suggested waiting for domain experts to develop something for paintings, photos, etc. Examples might include:
type: reflection/transmission; medium (e.g., watercolor, printing ink); support (e.g., paper); and note.
TEXT
encoding Character encoding scheme used in the text document, e.g., ASCII, UTF-8, Unicode, etc.
markup Type of markup language used to mark up the document, e.g., SGML, XML, HTML, etc.
note Additional information or comments about the text object.
Notes from February 2002 meeting at Harvard. At the meeting, the following suggestions for added elements or attributes were made: language_name; language_code; transliteration_scheme; directionality (left to right); and dtd_schema.
VIDEO
analog_digi_flag Indicator of how this entity is described in the METS document instance.
aspect_ratio The desired aspect ratio of the image on screen, e.g., 4:3, etc. Some files produced for display on non-square-pixel monitors have a desired aspect ratio that differs from the ratio of horizontal to vertical pixels.
bits_per_sample The number of bits of sample depth, e.g., 8, 24, etc.
calibration_ext_int Indicator of whether the calibration information is contained within the file or stored externally.
calibration_location Temporary location of the calibration file if it is external, e.g., a URL.
calibration_type Type of calibration used.
closed_captioning_note Information about closed captioning in this source item.
closed_captioning_type Type of closed captioning.
color_burst Indicates presence or absence of color burst signal.
codec_creator_app The name of the application used to apply the codec, e.g., SoundForge.
codec_creator_app_version The version of the application used to apply the codec, e.g., 5.2.
codec_name The name and version (or subtype) of the compression algorithm used, e.g., Fraunhofer xyz ["xyz" is a placeholder for versioning information].
codec_quality Indication whether the codec is lossy or lossless.
condition Narrative description of the physical condition of the item, e.g., sticky shed tape, deformed by slots in reel, etc.
data_rate Data rate of the video in an MPEG or other compressed file, expressed in Mbps, e.g., 8, 12, 15, etc.
data_rate_mode Indicator that the data rate of the video is fixed or variable.
dimensions_depth Depth of the object expressed in the unit of measure indicated in dimensions_unit, e.g., 12.
dimensions_diameter Diameter of any circular object expressed in the unit of measure indicated in dimensions_unit, e.g., 3.5, 5, 7.
dimensions_height Height of the object expressed in the unit of measure indicated in dimensions_unit, e.g., 23.
dimensions_note Description of odd-shaped objects that cannot be described using the standard dimensions fields.
dimensions_unit Unit of measurement of the source object, e.g., inches.
dimensions_width Width of any non-circular object expressed in the unit of measure indicated in dimensions_unit, e.g., 3.5, 5, 7.
disposition What became of the source item, e.g., reshelved on shelf number 1234, discarded, loaned to XYZ organization, destroyed, etc.
dtv_aspect_ratio Aspect ratio of digital video source item expressed as ratio, e.g., 4:3, 16:9, etc.
dtv_note Note about digital video source item.
dtv_resolution Resolution of digital video source item expressed as horizontal lines.
dtv_scan Indication whether digital video source item is scanned in an interlaced or progressive mode.
duration Elapsed time of the entire file, expressed using ISO 8601 syntax; see http://www.w3.org/TR/NOTE-datetime.
frame_rate The number of frames per second at which the video source item was captured.
gauge Gauge or width of source tape, including indication of unit of measure, e.g., 8 mm, 0.5 inch, 0.25 inch, etc. MAVIS codes exist.
generation Generation of physical source item which was digitized, e.g., studio master, preservation tape copy, photostat copy, etc. MAVIS codes exist.
length Length of source open-reel tape recording, including indication of unit of measure, e.g., 700 feet, 1200 feet, etc.
note Additional information about the video source item.
num_sample_frames The number of frames within a video file.
number_carriers The number of carriers (reels, cassettes) needed to house the video source item.
phys_format Name for the physical format of the source, e.g., Videotape, Film Reel, etc.
pixels_horizontal The horizontal size of a frame in picture elements.
pixels_vertical The vertical size of a frame in picture elements.
reflective_layer The substrate of an optical disk.
sampling The video sampling format (in terms of luminance and chrominance), e.g., 4:2:0, 4:2:2, 4:4:4, etc.
signal_format Signal format of the video source item. Analog-source examples include composite monochrome, NTSC (composite color), PAL, SECAM, and analog component. Digital-source examples include digital component and others. [Future action: identification of set of specific SMPTE and ITU terms/codes for use in this field.]
sound Indicator of the presence of sound in the video file. If the value "yes" is selected, then the video file will also be associated with an instance of audioMD (metadata for audio files) or audiophysrcMD (metadata for physical-form audio source items).
stock_brand Manufacturer and stock number for source recording, e.g., Scotch XYZ, Ampex ABC, etc. MAVIS codes exist.
tape_thickness The thickness of a tape.
temp_dimensions_note Temporary field for use in upgrading LC legacy data pertaining to the dimension fields of the legacy SOURCE table.
timecode_record_method Method for recording timecode on the video source item, e.g., longitudinal, vertical interval, etc.
timecode_type Type of timecode recorded on video source item, e.g., SMPTE dropframe, SMPTE nondropframe, etc.
time_stamp Exact location of calibration tones within a file.
tracking_type The type of tracking code, e.g., MAVIS number, actual shelf numbers, bar-code, etc.
tracking_value Shelf number or other identifier for source, e.g., MAVIS number, actual shelf numbers, etc.
videodisc_type Identification of whether this videodisc recording is constant linear velocity (CLV) or constant angular velocity (CAV).
videotape_type General type of videotape format, e.g., 2-inch quadruplex, 1-inch type C, VHS, Betacam SP, etc. Complementary to stock_brand.
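
The aspect_ratio entry above notes that the desired display ratio can differ from the ratio of horizontal to vertical pixels. A small Python sketch of that relationship follows. The function name pixel_aspect_ratio is hypothetical, and the 720 x 480 frame size is an assumed example rather than a value from the data dictionary.

```python
from fractions import Fraction


def pixel_aspect_ratio(aspect_ratio: str, pixels_horizontal: int,
                       pixels_vertical: int) -> Fraction:
    """Shape of a single pixel implied by the display and storage ratios.

    Hypothetical helper: `aspect_ratio` is a 'W:H' string as in the
    aspect_ratio field. A result of 1 means square pixels.
    """
    width, height = (int(part) for part in aspect_ratio.split(":"))
    display_ratio = Fraction(width, height)
    storage_ratio = Fraction(pixels_horizontal, pixels_vertical)
    return display_ratio / storage_ratio


print(pixel_aspect_ratio("4:3", 640, 480))  # 1
print(pixel_aspect_ratio("4:3", 720, 480))  # 8/9
```

Fractions are used instead of floating-point numbers so that the square-pixel case compares exactly equal to 1.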

(3/25/02)
Library of Congress
Comments: AV Prototype Coordinator ([email protected])
( August 31, 2010 )