NAME: Changes to Field 856 (Electronic Location and Access) in the USMARC Bibliographic Format
SOURCE: Library of Congress; Federal Geographic Data Committee
SUMMARY: This paper proposes the addition of three subfields to field 856. Two of the new subfields and one change of name are intended to accommodate location and access information for non-Internet resources, as needed to satisfy the Content Standards for Digital Geospatial Metadata (subfield $j added for Bits per Second; subfield $r added for Settings; subfield $b changed from IP address to Access number). Subfield $v is proposed for hours access method is available, which was requested during the discussion of Proposal No. 94-9 (Changes to the USMARC Bibliographic Format to Accommodate Online Systems and Services). In addition codes are being added to field 856 subfield $2 to align access methods with those specified in the Uniform Resource Locator draft standard.
KEYWORDS: Field 856; Electronic Location and Access (Bibliographic/Holdings)
RELATED: 94-9 (June 1994); DP 78 (June 1994)
STATUS/COMMENTS:
12/2/94 - Forwarded to USMARC Advisory Group for discussion at the February 1995 MARBI meetings.
2/16/95 - Results of USMARC Advisory Group discussion - Approved as amended.
Discussion included both changes proposed in the paper and changes suggested for the field in general. The following changes were approved:
1st indicator: The section "Proposed changes" did not include the addition of value 3 for "Dial-up"; this will be added.
Subfield $b: The phone number needs to be standardized. The MARBI chair will contact NISO to see if there is such a standard. In its absence, the phone number will be input as follows: country code followed by hyphen; area code followed by hyphen; rest of number followed by hyphen; extension preceded by "x" (if applicable). If the latter is not applicable, the phone number ends after the last portion of the number (e.g., 1-202-707-6237; in this case "1" is the country code for U.S.). Local systems will be able to determine which part of the number it needs to dial. Guidelines for inputting phone numbers will be specified in the format. The field needs to be made repeatable. It may be repeated if there is more than one phone number and all the other data in the field is the same. Phone numbers recorded in other USMARC formats will specify the same conventions.
Subfield $j: The data will be recorded with lowest BPS first and highest second, separated by a hyphen. If there is only lowest it is followed by a hyphen; if there is only highest it is preceded by a hyphen.
Subfield $r: Guidelines should explicitly specified the possible values: O (Odd), E (Even), N (None), S (Space), M (Mark).
Subfield $u: "URL:" will be taken out of the example. It will be specified that the subfield is repeatable if all the rest of the data in the field is the same. Subfield $2: The list of codes will be in a code list, rather than in the format.
2/16/95 - Results of final LC review - Agreed with the MARBI decisions.
PROPOSAL NO. 95-1 Changes to Field 856
(Electronic Location and Access)
I. BACKGROUND
Proposal No. 94-9 (Changes to the USMARC Bibliographic Format to
Accommodate Online Systems and Services) and Discussion Paper No.
78 (Location and Access Information for Non-Internet Resources in
USMARC Records) addressed location and access information for
catalog records for electronic resources. 94-9 was intended to
accommodate USMARC records for online systems and services, while
previously only records for electronic data resources (electronic
text, software, data files, bibliographic databases, electronic
graphics files, etc.) were accommodated. The discussion also
revealed a need to align access method with the schemes specified
in the URL draft standard. Discussion Paper No. 78 generally
attempted to accommodate location and access information for non-
Internet resources. It was initiated by the Content Standards for
Digital Geospatial Metadata (CSDGM), although the need had been
previously noted. This standard has mapped its data elements to
MARC, and requires that specific information on dial-up access to
electronic resources be included.
II. ACCESS METHOD
The discussion of 94-9 indicated a need to define codes for access
methods that use value 7 (Method specified in subfield $2) in the
first indicator in field 856 (Electronic Location and Access). It
was suggested that subfield $2 be coordinated with access methods
defined in the Internet Engineering Task Force's Uniform Resource
Locator (URL) draft standard. The following are access schemes
specified in the Internet-Draft URL standard, with their
accompanying definitions. Corresponding USMARC data elements are
given where applicable. (The full URL document is available by
anonymous FTP from: ds.internic.net\internet-drafts\draft-ietf-uri-
url-08.txt.)
ftp File Transfer protocol
The FTP URL scheme is used to designate files and directories
on Internet hosts accessible using the FTP protocol (RFC959).
(Accommodated by 1st indicator, value 1)
http Hypertext Transfer Protocol
The HTTP URL scheme is used to designate Internet resources
accessible using HTTP (HyperText Transfer Protocol).
gopher Gopher protocol
The Gopher URL scheme is used to designate Internet resources
accessible using the Gopher protocol. The base Gopher
protocol is described in RFC 1436 and supports items and
collections of items (directories).
mailto Electronic mail address
The mailto URL scheme is used to designate the Internet
mailing address of an individual or service. No additional
information other than an Internet mailing address is present
or implied.
(Accommodated by 1st indicator, value 0)
news USENET news
The news URL scheme is used to refer to either news groups or
individual articles of USENET news, as specified in RFC 1036.
nntp USENET news using NNTP access
The nntp URL scheme is an alternative method of referencing
news articles, useful for specifying news articles from NNTP
servers (RFC 977).
telnet Reference to interactive sessions
The Telnet URL scheme is used to designate interactive
services that may be accessed by the Telnet protocol.
(Accommodated by 1st indicator, value 2)
wais Wide Area Information Servers
The WAIS URL scheme is used to designate WAIS databases,
searches, or individual documents available from a WAIS
database. The WAIS protocol is described in RFC 1625 [17];
Although the WAIS protocol is based on Z39.50-1988, the WAIS
URL scheme is not intended for use with arbitrary Z39.50
services.
file Host-specific file names
The file URL scheme is used to designate files accessible on
a particular host computer. This scheme, unlike most other URL
schemes, does not designate a resource that is universally
accessible over the Internet.
prospero Prospero Directory Service
The Prospero URL scheme is used to designate resources that
are accessed via the Prospero Directory Service.
Each scheme not given its own value in field 856, 1st indicator,
could be coded as value 7 (Source specified in subfield $2), and
that subfield will contain the specific access method. Field 856
subfield $2 indicates that the access method used is controlled by
an authoritative list maintained at the Library of Congress. The
above values could be used in such a list. The URL document states
that other schemes may be specified by future specifications;
Section 4 of the document describes how new schemes may be
registered with the Internet Assigned Numbers Authority (IANA), and
lists some scheme names that are under development. The USMARC
authoritative list should remain consistent with those registered
with IANA.
III. HOURS HOST IS AVAILABLE
A subfield is needed for hours that the electronic location is
available. This data element may be distinguished from the hours
the electronic resource itself is available, if the record is one
for an online system or service. In that case, as discussed with
Proposal No. 94-9, field 307 (which will replace field 301 (Hours,
etc.)) contains the hours of the resource itself. However, the
particular access method described in field 856 could have
different hours. For instance, "BIOSIS Previews" can be mounted on
the University of California system under GLADIS or on the Harvard
system under HOLLIS, as well as by BRS/After Dark. A record for
the electronic resource BIOSYS Previews could contain separate 856
fields for each host that makes it available, and each could have
different hours.
IV. ACCESS FOR NON-INTERNET RESOURCES
The consensus of participants in the USMARC Advisory Group
discussion of Discussion Paper No. 78 was that field 856 should be
used for both Internet and non-Internet resources, with the access
method indicated in the first indicator (to include a value for
non-Internet, or dial-up access). It was determined that the
telephone number could be recorded in subfield $b (currently IP
address) if it were generalized. Two new subfields would be needed
to accommodate the other data elements needed for dial-up access,
Highest and Lowest bits per second (BPS) used for transmission of
data, and Settings (Number DataBits; Number StopBits; Parity).
Settings could be included with specified syntax, such as hyphens
separating each piece of data. It was determined that if a
resource had both an Internet location and one available through
dial-up access, two 856 fields would be given, with the applicable
information repeated in separate subfields in each field.
The following are definitions for the new data elements, repeated
from Discussion Paper No. 78.
Dialup telephone number: contains the telephone number used
for electronic access. If there are more than one they are
repeated in separate fields. Data could be included in 856$b
if it were generalized. Currently it is IP address; its
definition could be changed to Access number to accommodate
both IP address and telephone number.
Lowest BPS: contains the lowest number of bits per second
used for transmission of data.
Highest BPS: contains the highest number of bits per second
used for transmission of data.
The two above could be contained in 856$j (BPS). The syntax
could be: lowest/highest, e.g. 1200/9600.
Number DataBits: contains the data bits, or number of bits
per character.
Number StopBits: contains the stop bits, or the number of
bits to signal the end of a byte.
Parity: contains the parity checking technique used.
The three above could be contained in 856$r (Settings). The
syntax could be: Parity-Number DataBits-Number StopBits
separated by hyphens, e.g. E-7-1.
V. PROPOSED CHANGES
The following is presented for consideration:
- In the USMARC Holdings/Bibliographic Formats, define the
following subfields in Field 856 (Electronic Location and
Access)
$j Bits Per Second (BPS) (NR)
[Syntax of data <Highest BPS>-<Lowest BPS>
If only lowest given: -<Lowest BPS>
If only highest given: <Highest BPS>
$r Settings (NR)
[Syntax of data <Parity>-<Number DataBits>-<Number
StopBits>]
If only Parity given: <Parity>
If only one of the other two given, leave data blank but
include hyphens, e.g.,
<Parity>--<Number StopBits>
<Parity>-Number DataBits>-
$v Hours access method is available (R)
- Change the definition of 856$b from IP Address to Access
number.
- Include codes for access methods in Field 856$2 consistent
with those defined in the URL draft standard.
See Attachment A for a description of this field if this
proposal is approved.
-------------------------------------------------------------------
ATTACHMENT A
< > indicates addition; [ ] indicates deletion
856 Electronic Location and Access (R)
Indicators
First Access method
0 Email
1 FTP
2 Remote login (Telnet)
< 3 Dial-up>
7 Method specified in subfield 2
Second Undefined
Undefined
Subfield Codes
a Host name (R)
b [IP address] <Access number> (NR)
c Compression information (R)
d Path (R)
f Electronic name (R)
g Electronic name -- End of range (R)
h Processor of request (NR)
i Instruction (R)
< j BPS (NR)>
k Password (NR)
l Logon/login (NR)
m Contact for access assistance (R)
n Name of location of host in subfield a (NR)
o Operating system (NR)
p Port (NR)
q File transfer mode (NR)
< r Settings (NR)>
s File size (R)
t Terminal emulation (R)
u Uniform Resource Locator (R)
< v Hours access method available (R)>
w Record control number (R)
x Nonpublic note (R)
z Public note (R)
2 Access method (NR)
3 Materials specified (NR)
FIELD DEFINITION AND SCOPE
This field contains the information required to locate an
electronic item. The information identifies the electronic
location containing the item or from which it is available. It
also contains information to retrieve the item by the access method
identified in the first indicator position. The information
contained in this field is sufficient to allow for the electronic
transfer of a file, subscription to an electronic journal, or logon
to a library catalog. In some cases, only unique data elements are
recorded which allow the user to access a locator table on a remote
host containing the remaining information needed to access the
item.
Field 856 is repeated when the location data elements vary
(subfields $a, $b, $d) and when more than one access method may be
used. It is also repeated whenever the electronic filename varies
(subfield $f), except for the situation when a single intellectual
item is divided into different parts for online storage or
retrieval. (This field is identical to field 856 (Electronic
Location and Access) defined in the USMARC Format for Bibliographic
Data.)
GUIDELINES FOR APPLYING CONTENT DESIGNATORS
INDICATORS
First Indicator - Access method
The first indicator position contains a value that defines how
the rest of the data in the field will be used. If the
resource is available by more than one method, the field is
repeated with data appropriate to each method. The methods
defined are the main TCP/IP protocols.
The value in the first indicator position determines which
subfields are appropriate for use. For example, when first
indicator value 1 (FTP) is used, subfields $d (Path), $f
(Electronic name), $c (Compression information), and $s (File
size) are appropriate, whereas they would not be with first
indicator value 2 (Remote login (Telnet)).
0 - Email
Value 0 indicates that access to the electronic resource is
through email. This access includes subscribing to an
electronic journal or electronic forum through software
intended to be used by an email system.
1 - FTP
Value 1 indicates that the access to the electronic resource
is through the File Transfer Protocol (FTP). Additional
information in subfields of the record may enable the user
to transfer the resource electronically.
2 - Remote login (Telnet)
Value 2 indicates that access to the electronic resource is
through remote login (Telnet). Additional information in
subfields of the record may enable the user to connect to
the resource electronically.
< 3 - Dial-up
Value 3 indicates that access to the electronic resource is
through dial-up by use of a telephone line. Additional
information in subfields of the record may enable the user
to connect to the resource.>
7 - Method specified in subfield $2
Value 7 indicates that access to the electronic resource is
through a method other than the defined values and for which
an identifying code is given in subfield $2 (Source of
access).
Second Indicator - Undefined
The second indicator position is undefined and contains a
blank ($).
SUBFIELD CODES
$a - Host name
Subfield $a contains the fully qualified domain (host name) of
the electronic location. It contains a network address which
is repeated if there is more than one address for the same
host. The convention for a BITNET address is to add .bitnet.
856 1#$aharvada.harvard.edu$aharvarda.bitnet
$b - [IP address] <Access number>
Subfield $b contains the [Internet Protocol (IP) numeric
address] <access number> associated with a host. <It can
contain the Internet protocol (IP) numeric address if an
Internet resource, or a telephone number if access is through a
telephone line.> This data <may> change[s] frequently and [is]
<may be> generated by the system, rather than statically stored.
856 2#$aanthrax.micro.umn.edu$b128.101.95.23
< 856 3#$b12027072316$j2400/9600$nLibrary of Congress,
Washington, DC$oUNIX$rE-7-1$tvt100 $zRequires
logon and password>
$c - Compression information
Subfield $c contains information about the compression of a
file. If a specific program is required to decompress the file,
it is noted here. The filename in subfield $f may indicate the
type of compression by its extension (portion after "." or first
space). The subfield may be repeated if two compression
programs are used, noting the latest compression first.
856 1#$amaine.maine.edu$cMust be decompressed with
PKUNZIP$fresource.zip
$d - Path
Subfield $d contains the path with directory names where the
file is stored. The filename itself is recorded in subfield $f.
This may be a surrogate path leading the user to the host where
complete and current access information is stored in a locator
table.
856 1#$awuarchive.wustl.edu$d/aii/admin/CAT.games$fmac-
qubic.22.hqx
$f - Electronic name
Subfield $f contains the electronic name as it exists in the
directory indicated in subfield $d, on the host machine in
subfield $a. It may be repeated if a single logical file has
been divided into parts and stored under different names, but
that together constitute a single intellectual item. In all
other cases, a file that may be retrieved under different
filenames contains two 856 fields in the record, each with a
different subfield $f. A filename may include wildcard
characters (*) if applicable (with a subfield $z note explaining
how fields are named). A filename may be case sensitive for
some systems.
856 1#$awuarchive.wustl.edu$dmirrors/info-mac/util$fcolor-
system-icons.hqx
856 0#$akentvm.bitnet$facadlist file1$facadlist
file2$facadlist file3
This subfield also contains the name of the electronic
publication or conference.
856 0#$auicvm.bitnet$fAN2
$g - Electronic name -- End of range
Subfield $g contains the name of the last physical file in a
sequence when the resource is contained in more than one
physical file. This subfield is generally used when the
repetition of subfield $f is not possible.
856 1#$asunx.loc.gov$dLCPP04A$f4A49751$g4A49755
$h - Processor of request
Subfield $h contains the username, or processor of the request;
generally the data which precedes the @ in the host address.
856 0#$auicvm.bitnet$fAN2$hListserv
$i - Instruction
Subfield $i contains an instruction or command needed for the
remote host to process a request.
856 0#$auccvma.bitnet$fIR-L$hListserv$isubscribe
<$j - Bits per second
Subfield $j contains the lowest and highest bits per second used
for transmission of data. Syntax of data should be: <Highest
BPS>-<Lowest BPS>. If only lowest given: -<Lowest BPS>; If only
highest given: <Highest BPS>.
856 0#$b12027072316$j2400/9600$nLibrary of Congress,
Washington, DC$oUNIX$r7-1-E >
$k - Password
Subfield $k contains the password required to access the
resource. An FTP site may require the user to enter an Internet
address or may require a specific password, or a library catalog
may require a password. If a password is required but anything
may be used, this subfield need not be used. This subfield is
used for general use passwords, not for any requiring security.
Textual instructions about passwords are contained in subfield
$z (Public note).
856 1#$aharvarda.harvard.edu$kguest
$l - Logon/login
Subfield $l contains characters needed to logon to a library
catalog or FTP site. Often with anonymous file transfer the
logon is "anonymous." An account number required for login may
also be indicated. This subfield should be used for general use
logins, not for any requiring security.
856 1#$aunmvm.bitnet$lanonymous
$m - Contact for access assistance
Subfield $m contains the contact for access assistance to the
resource at the host specified in subfield $a.
856 2#$agopac.berkeley.edu$mRoy Tennant
$n - Name of location of host in subfield $a
Subfield $n contains the full name of the location of the host
in subfield $a, including its geographical location.
856 2#$apucc.princeton.edu$nPrinceton University, Princeton,
N.J.
$o - Operating system
For informational purposes, operating system for the host name
specified in subfield $a is indicated here. Conventions for
path and filenames may be dependent on the operating system.
For the operating system of the resource itself (i.e., the title
recorded in field 245), rather than the operating system of the
host making it available, field 753 (Technical Details Access to
Computer Files), subfield $c (Operating system) in the USMARC
Format for Bibliographic Data is used in the related
bibliographic record.
856 1#$aseq1.loc.gov$d/pub/soviet.archive
$fk1famine.bkg$nLibrary of Congress,
Washington, D.C.$oUNIX
$p - Port
Subfield $p contains the portion of the address that identifies
a process or service in the host.
856 2#$amadlab.sprl.umich.edu$nUniversity of Michigan Weather
Underground$p3000
$q - File transfer mode
Subfield $q contains the file transfer mode, which determines
how the file is transferred through the network. A normal ASCII
file contains certain characters which are translated between
systems to make the text files more readable. A file with non-
ASCII characters must be transferred using another file transfer
mode.
856 1#$aarchive.cis.ohio-state.edu
$dpub/comp.sources.Unix/volume
10$fcomobj.lisp.10.Z$qbinary
<$r - Settings
Subfield $r contains the settings used for transfer of data.
Included are: Number Data Bits (number of bits per character);
Number Stop Bits (number of bits to signal the end of a byte);
and Parity (parity checking technique used). Syntax of data is:
<Parity>-<Number DataBits>-<Number StopBits>. If only Parity
given: <Parity>. If only one of the other two given, leave data
blank but include hyphens, e.g.,
<Parity>--<Number StopBits>
<Parity>-Number DataBits>-
856 3#$b12027072316$j2400/9600$nLibrary of Congress,
Washington, DC$oUNIX$rE-7-1$tvt100 $zRequires logon and
password>
$s - File size
Subfield $s contains the size of the file as stored under the
filename indicated in subfield $f. It is generally expressed in
terms of bytes. It may be repeated in cases where the filename
is repeated and is recorded directly following the subfield $f
to which it applies. This information would not be given for an
electronic journal, since the field relates to the entire title,
not to particular issues.
856 1#$awuarchive.wustl.edu$dmirrors/info-mac/util$fcolor-
system-icons.hqx$s16874 bytes
856 0#$akeptvm.bitnet$facadlist file1$s34,989 bytes
$facadlist file2$s32,876 bytes$facadlist file3
$s23987 bytes
$t - Terminal emulation
Subfield $t contains the terminal emulation supported when
necessary to specify for remote login (first indicator contains
value 2 (Remote login (Telnet)).
856 2#$amaine.maine.edu$nUniversity of Maine$t3270
$u - Uniform Resource Locator
Subfield $u contains the Uniform Resource Locator (URL), which
provides standard syntax for locating an object using existing
Internet protocols. Field 856 is structured to create a URL
from separate subfields. Subfield $u may be used instead of
those separate subfields or in addition to them. It might be
desirable to include subfield $u and the other subfields if a
user display is desired as well as a URL. The field is repeated
if more than one URL needs to be recorded.
856 1#$uURL: ftp://path.net/pub/docs/urn2urc.ps
<$v - Hours access method available
Subfield $v contains the hours that access to the resource at
the electronic location indicated in this field is available.
If the record is for a system or service the hours it is
available are recorded in field 307 (Hours, etc.). This
subfield is only for the hours at the specific location
indicated in field 856.
856 2#$apac.carl.org$b192.54.81.128$mCARL Situation
Room$mhelp@CARL.org$nCARL Systems Inc., Denver, CO
$v24 hours
856 3#$alocis.loc.gov$b140.147.254.3 $mlconline@loc.gov
$t3270$tline mode (e.g., vt100)$vM-F 6:00 a.m.-21:30 p.m.
USA EST, Sat. 8:30-17:00 USA EST, Sun. 13:00-17:00 USA
EST >
$w - Record control number
Subfield $w contains the system control number of the related
record preceded by the USMARC code, enclosed in parentheses, for
the agency to which the control number applies. The data in the
subfield links field 856 to the USMARC record having the same
data in a control number field. (The source of the code is
Symbols of American Libraries that is maintained by the Library
of Congress.)
$x - Nonpublic note
Subfield $x contains a note relating to the electronic location
of the source identified in the field. The note is written in
a form that is not adequate for public display or contains
processing information about the file at the location specified.
856 1#$awuarchive.wustl.edu$cdecompress with
PKUNZIP.exe$d/mirrors2/win3/games$fatmoids.
zip$xcannot verify because of transfer difficulty
$z - Public note
Subfield $z contains a note relating to the electronic location
of the source identified in the field. The note is written in
a form that is adequate for public display.
$2 - Access method
Subfield $2 contains the access method when the first indicator
position contains value 7 (Method specified in subfield $2).
This subfield may include access methods other than the three
main TCP/IP protocols specified in the first indicator. This
subfield is controlled by an authoritative list maintained at
the Library of Congress. <It contains one of the following
codes:
http Hypertext Transfer Protocol
The HTTP URL scheme is used to designate Internet resources
accessible using HTTP (HyperText Transfer Protocol).
gopher Gopher protocol
The Gopher URL scheme is used to designate Internet resources
accessible using the Gopher protocol. The base Gopher
protocol is described in RFC 1436 and supports items and
collections of items (directories).
news USENET news
The news URL scheme is used to refer to either news groups or
individual articles of USENET news, as specified in RFC 1036.
nntp USENET news using NNTP access
The nntp URL scheme is an alternative method of referencing
news articles, useful for specifying news articles from NNTP
servers (RFC 977).
wais Wide Area Information Servers
The WAIS URL scheme is used to designate WAIS databases,
searches, or individual documents available from a WAIS
database. The WAIS protocol is described in RFC 1625 [17];
Although the WAIS protocol is based on Z39.50-1988, the WAIS
URL scheme is not intended for use with arbitrary Z39.50
services.
file Host-specific file names
The file URL scheme is used to designate files accessible on
a particular host computer. This scheme, unlike most other URL
schemes, does not designate a resource that is universally
accessible over the Internet.
prospero Prospero Directory Service
The Prospero URL scheme is used to designate resources that
are accessed via the Prospero Directory Service. >
$3 - Materials specified
Subfield $3 contains information that specifies the part of the
bibliographic item to which the field applies.
856 0#$3Finding aid to the Edgar F. Kaiser
papers$agopher.berkeley.edu$b128.224.55$kguest
$lanonymous$mReference Services, The Bancroft Library,
University of California, Berkeley, CA 94720, tel.
510-642-6481 $nThe Library, University of California,
Berkeley$oUNIX$p70 $qbinary$s2,394,394,444$xThe
finding aid to the Kaiser papers is currently under
revision. $zFor further information about shipbuilding
files, contact The Bancroft Library Reference
INPUT CONVENTIONS
Embedded Holdings Information
When holdings information is embedded in a USMARC bibliographic
record, no additional 856 field is input. Multiple 856 fields
may be used only when the information does not include other
holdings information fields that must be linked to a specific 856
field for intelligibility. However, subfield $3 may be used to
specify a subset of the item to which the 856 information is
applicable.
RELATED USMARC FIELD OR DOCUMENT
Symbols of American Libraries
USMARC Format for Bibliographic Data: 753 Technical Details
Access to Computer Files
USMARC Format for Bibliographic Data: 856 Electronic Location
and Access
<Uniform Resource Locators (URL)>