PCC Task Force on Multiple Manifestations of Electronic Resources
Interim report
January 7, 2001
Membership: Diane Boehr (NLM), Robert Bremer (OCLC), James Castrataro (Indiana),
Tad Downing (GPO), Ed Glazier (RLG), Beth Guay (Maryland), Ruth Haas (Harvard),
Lynne Howarth (IFLA Review Group liaison), Wayne Jones (MIT, interim chair
Oct-Dec 2000), Judy Knop (ATLA, liaison to CC:DA major/minor task force),
George Prager (Brooklyn Law School), Regina Reynolds (LC/ISSN), John Riemer
(UCLA, chair), Cecilia Sercan (Cornell)
I. Summary/Introduction--What's at stake:
The more that our cataloging policy calls for splitting e-resource
versions onto separate bibliographic records,
--the greater the cost to copy-cataloging operations in sifting through
a proliferating array of bibliographic records to find the "correct" one
to work with and
--the greater the corresponding multiplicity of records that OPAC users
must deal with.
The more that our cataloging policy calls for lumping e-resource
versions onto a single bibliographic record,
--the greater the complexity involved in identifying and using portions
of those records relating to a single version, and
--the greater the complexity involved in identifying and removing data
that is no longer current.
II. First charge:
"Identifying the most common types of versions and reproductions for
textual resources currently available and the most bibliographically
significant characteristics of each."
We identified a typology of the most commonly occurring differences
among manifestations of e-resources, for which cataloging guidance is
needed on the number of records required to accommodate those differences.
- What to do when the only difference is the setting (individual institution
or consortium) in which the resource is available, e.g. the GALILEO
and the OhioLink representation of the same provider's resource.
- What to do when the only difference is the provider of the resource.
- What to do when the difference lies among the particular file formats,
e.g. ascii, html, pdf, etc.
- What to do when the resource provider also is responsible for digitizing
the material.
- What to do when a given resource provider contributes special added
value, such as indexing across time or across multiple titles simultaneously.
- What to do when the difference is in access restriction, e.g. paid-for
versus freely available or available on campus only versus off campus
also.
- What to do when the difference is in "holdings," i.e. the range
of material available.
- What to do when the difference is in "content," e.g. limited full
text or text without images or text + images, etc.
- What to do when the difference is in packaging, e.g. one version
is part of a rather large aggregation and another version is not.
- What to do when (equal amounts of) the material is available in
multiple languages from what amounts to a single source.
- What to do when the difference is in the title the versions have.
- What to do when the difference lies in the formatting of the data,
i.e. discrete serial issues in one version and amalgamation of text
into a database in another.
- What to do when standard identifiers differ, e.g. URLs.
III. Second charge:
"Recommending best practices for each, taking into account both cataloging
staff and workload levels. Consideration should be given to:
- Methods of description that provide for clarity, efficiency, and
low-maintenance
- Differing needs of shared and local catalogs
- The advisability of the single record technique (noting the existence
of the electronic on the record for the print) and its applicability
to monographic resources."
Obviously, one manifestation is not going to be identical to another,
for, if it were, it would be the same thing rather than a "mere" manifestation.
The challenge is in making guidelines about how much has to added to
or changed within a resource for it to become what one might term a different
manifestation.
Under current cataloging rules, if we regard the provider as a publisher,
then we would regard each new provider as a new manifestation; however,
a provider seems to task force members most often to resemble a distributor.
If such a provider were also responsible for having digitized the resource,
it would strike catalog users as rather odd to single out one particular
provider for a separate bibliographic record and to consolidate all other
providers on another record. Who is doing the digitizing can be significant,
when one considers that a digitizer who is also an original publisher
of a print version is increasingly able to add sound or image files and/or
additional articles/data files that would not fit in the print version.
The cases where task force discussion was closely divided were content
differences (#8) and the presence of (parallel) multilingual content
(#10). The degree to which e-version content can vary ranges from provision
of multi-volume/cross-title indexing to subset relationships (some articles/images
present in one version but not in another) to completely different content
between versions. The most significant of the factors in section II are
content differences, and the greater the degree of content difference
the more separate bibliographic records seem warranted. Separate records
are more warranted when multilingual versions available from a single
provider are not all freely available or offered in the same package,
but instead require separate subscriptions.
In its discussions to date, for each of the other cases, a majority
of the task force is inclined to consolidate multiple e-versions on a
single record.
The greater the variety of sources for e-version records (commercial
and noncommercial), the more support and interest there is among task
force members for separate records, for the ease of removing sets of
records from the catalog.
The Task Force is aware of the interim
CONSER policy issued on October 25, 2000, which allows for noting
the existence of multiple e-versions on a single print record but calls
for separate records describing e-versions singly on separate bibliographic
records. If later on it is easier to collapse data than to separate
it, thought will need to be given to which fields in the separate bibliographic
records will facilitate the eventual linking up or consolidation of
data.
There are different and conflicting needs between local catalogs and
shared catalogs. A local catalog is intended to contain descriptions
of those manifestations to which the local population has access. From
the end-user perspective, it is probably clearest if the local catalog
contains as few records as possible for the same bibliographic entity,
regardless of manifestation. That is why in many cases, it is most helpful
to describe the various manifestations in a single record. If this is
done, it must be clear to local staff and local users what the locally
available manifestations are and how to access them.
Records in a shared database, like a regional union catalog or bibliographic
utility have many functions: a source for shared cataloging copy, acquisitions
data, interlibrary loan, and reference and "discovery" both for staff
and end-users. In a master record database like OCLC, if a single record
is used for multiple manifestations at all the holding institutions,
shared cataloging requires detailed review and editing if the local institution
does not own or have access to all the manifestations represented in
the record. If records for consolidated e-versions were to became commonplace,
thought would have to given to how ILL, acquisitions, or reference staff
would clearly identify a version they were interested in borrowing, ordering,
or having access to. In the RLIN/Eureka context, clustering records together
becomes more complex when some descriptions describe one manifestation
and some describe another.
A contributing factor to the complexity may well be the partial integration
of holdings and location characteristics ("362 1 Coverage as of ..." and
contents of 856 fields) into bibliographic records. If users learned
they could depend on finding all manifestations clustered in a set of
records, they'll be more likely to take the time to look for and examine
those individuals in the set. This is the collocating function of the
catalog! In choosing to distinguish the remote electronic secondary manifestations,
we may be hindering that function. We have to find ways to provide this
service to our users.
The paper
Michael Kaplan recently presented to the Library of Congress' Conference
on Bibliographic Control in the New Millennium inspires hope that
we might, with the aid of technology, harmonize the public service
need for unity of display among related e-versions with the expediency
of behind-the-scenes technical processing involving discrete records.
IV. Third charge:
"Defining principles by which determinations about whether to create
single or separate records for versions and reproductions can be made
in order to facilitate future decisions when new types of version are
published (e.g., Palm Pilot editions)."
Some possible criteria on which to base decisions include:
- Cost to the cataloging process of sorting through multiple related
records for the one to use; parallel burdens to the OPAC user in sorting
through multiple hits, and to the union catalog user determining what
information in a record to ignore.
- The highly changeable nature of electronic resources (more volatile,
ephemeral than print serials)
- Content versus carrier the relative importance of each.
- Discernible, significant differences in resource content
- Whether an item can be considered a "separate publication"
- What the user is likely to construe as a single entity, regardless
of individual content differences
- The relative ease of consolidating versus separating records/data
after-the-fact (is it easier to delete irrelevant data?)
- Clarity of data content within a single bibliographic record that
attempts to describe multiple e-version (can one tell easily what is
being said about any given provider?)
- The threshold for a needing a separate record in other environments,
e.g. ISSN assignment
(In current ISSN practice, when rights to the primary product are
licensed or sold to another publisher, the presumption is that this
is a secondary manifestation, which does not warrant a separate ISSN
assignment. The possibility for content difference in this situation
is considered much lower than in the case of a publisher who offers
both a print and a digital version; in the latter situation the two
versions are regarded as different editions or separate products.)
- Degree of overall involvement and/or intellectual responsibility
a provider has in a resource---offering access to it (not much), digitization/publication
(some), provision of added value such as cross-publication searching
and auxiliary background material (a lot)
- When, if at all, monographic resources warrant separate solutions
from those we might recommend for serials/continuing resources
- Whether aggregated resources are a special situation, possibly warranting
a different treatment.
V. Outreach:
We solicited the ideas of people in the bibliographic community by
posting messages in fall 2000 about our task force's work to various
online discussion lists: [SERIALST, DIG_REF, AUTOCAT, OLAC, DIG-LIB,
CORC-L]. Readers of this report with additional ideas and suggestions
are encouraged to submit them to any task force member listed at: http://www.loc.gov/catdir/pcc/tgmuler.html.
A short
article about our task force appeared in the November 2000 issue
of D-Lib Magazine, contributed
by Wayne Jones. At least one task force member has been asked to talk
about the group's work at the next ALA Annual meeting.
The task force's home page is located at: http://www.loc.gov/catdir/pcc/tgmuler.html
Compiled by John J. Riemer
|