Sustainability of Digital Formats: Planning for Library of Congress Collections |
|
![]() |
|
Introduction | Sustainability Factors | Content Categories | Format Descriptions | Contact |
Full name | NLS modification to ANSI/NISO Digital Talking Book specification to include AMR-WB+ Codec |
---|---|
Description | Modification of DTB (NISO Digital Talking Book Standard, ANSI/NISO Z39.86-2002), which defines the format and content of the electronic file set that comprises a digital talking book (DTB) and establishes a limited set of requirements for DTB playback devices. The Library of Congress specification extends the NISO standard by adding the option to use AMR-WB+ compressed audio coding. |
Production phase | This bundle of files is likely to be used primarily as a middle-state format, with dissemination to end-users managed through publishers or aggregators who provide in a form appropriate for viewers/players that enforce limitations on access and use consistent with terms imposed by copyright holders. |
Relationship to other formats | |
Contains | OEBPS_1_0, OEBPS, Open Ebook Forum Publication Structure 1.0.1 |
Defined via | XML_DTD, XML Document Type Definition (DTD) |
Contains | AMR-WB+, AMR-WB+, Extended Adaptive Multi-Rate - Wideband Speech Codec |
Modification of | DTB_2002, DTB (Digital Talking Book), 2002. Main specification omits AMR-WB+ speech codec |
LC experience or existing holdings | The National Library Service for the Blind and Print Disabled (NLS), which is part of the Library of Congress, is developing this standard and produces books to this specification. See Current Strategic Business Plan for the Implementation of Digital Systems (dated December 2006). Available via Internet Archive capture from July 24, 2017. |
---|---|
LC preference | Suggested format for digital talking books. A DTB or DTB_2002_NLS incorporating the full text of a work is among the acceptable XML-based formats for textual works, assuming no encryption or DRM is applied. |
Disclosure | Open standard |
---|---|
Documentation | For the DTB: ANSI/NISO Z39.86-2002. Specifications for the Digital Talking Book, ISSN: 1041-5653. Information about the extension: Current Strategic Business Plan for the Implementation of Digital Systems, dated December 2006 and available via Internet Archive capture from July 24, 2017. See also AMR-WB+ for links to the audio codec specification. |
Adoption | The National Library Service for the Blind and Print Disabled (NLS) is using the extended version of the DTB standard for all production of talking books. As of April 2009, none of the NLS talking books included full text and there were no plans to produce books with full text. |
Licensing and patents | None |
Transparency | See information on the encoding formats employed for the files that comprise a DTB file set: XML, AMR-WB+. XML rates highly for transparency. For the audio, the master WAV format file used in the production process is more transparent. |
Self-documentation | The Package file can include Dublin Core metadata and an extended set of elements intended to record information about rights and the provenance and generation of the talking book from a source text. |
External dependencies | None. The format is designed to support effective use of special hardware and software players for the visually impaired, but does not require them. |
Technical protection considerations | None in relation to the sets of files that comply with the ANSI/NISO Z39.86-2002 specification. Digital Talking Books from NLS will be distributed to end users via mechanisms that do impose technical protections. Hence it is probable that LC will need to receive such files direct from publisher or aggregator by a special transmittal process rather than by harvesting as if an end user. |
Sound | |
---|---|
Normal rendering | Good support. |
Fidelity (high audio resolution) | Moderate quality (appropriate for speech) from AMR-WB+ encoding. |
Multiple channels | DTB Players are not required to support multiple channels, but must recognize stereo and render at least as monaural. |
Support for user-defined sounds, samples, and patches | Not investigated at this time. |
Functionality beyond normal rendering | Synchronization with text transcription. |
Text | |
Normal rendering | Good support. |
Integrity of document structure | The logical structure of a document is an important part of a DTB with textual content. See DAISY Structure Guidelines. Available via Internet Archive capture from February 20, 2013. |
Integrity of layout and display | This standard focuses on the textual content, the logical structure, and the synchronization of text with audio of the text being read. Layout is not of primary significance in rendering for the visually impaired. |
Support for mathematics, formulae, etc. | Not supported |
Functionality beyond normal rendering | Supports embedding of audio, images, and synchronization of text with audio of the text being read. |
Tag | Value | Note |
---|---|---|
Filename extension | opf |
For the required Package file. Documented in NISO standard. |
Filename extension | xml |
For textual content files. Documented in NISO standard. |
Filename extension | ncx |
For navigation control files. Documented in NISO standard. |
Filename extension | amr |
For AMR-WB+ audio files. Presumed to be the case; The File Extension Source provides amr as the extension for Adaptive Multi-Rate Speech Codec files . |
Filename extension | aac |
For AAC_MP4 audio files. Documented in NISO standard. |
Filename extension | mp3 |
For MP3 audio files. Documented in NISO standard. |
Filename extension | wav |
For WAVE_LPCM audio files. Documented in NISO standard. |
Pronom PUID | See note. | PRONOM has no corresponding entry as of April 2022. |
Wikidata Title ID | Q1151676 |
See https://www.wikidata.org/wiki/Q1151676. |
General | In addition to requiring the AMR-WB+ audio encoding using prescribed parameters and wrapped in the 3GP file format, the NLS Specification for the Construction of Digital Talking Books specifies particular usage of the ANSI/NISO Z39.86:2002 standard. Required or specified features include, for example, file-naming and metadata conventions, audio-clips for spoken title and author name and to support navigation, opening announcements, and positioning of pauses. |
---|---|
History | See DTB_2002. |
|