Just as standardization of descriptive vocabulary is important for the long-term success of digital museum projects (see Besser, 1999), standardization of metadata is important for the integration of those projects into the web. For users to have access to the full potential of the web, users' computers will need common languages for communicating with each other and with the countless servers, databases, digital libraries, and repositories out there. Many information scientists are working on a "semantic web" that will allow this communication (see for instance W3C, 2002, Mar. 7), but for now it is mostly a dream.
There is a confusing array of data standards and metadata schemas available to museums:
| abbreviation | full name | description |
source
|
| AMICO Data Specification | Art Museum Image Consortium Data Specification |
A list of data categories with accompanying metadata standards for each element; designed for describing items to be included in the AMICO image database.
|
AMICO Library, 2002 |
| CDWA | Categories for the Description of Works of Art |
A core list of data categories necessary for describing museum objects; development funded by the J. Paul Getty Trust, the National Endowment for the Humanities, and the College Art Association.
|
Art Information Task Force, 2000 |
| CHIN Data Dictionaries | Canadian Heritage Information Network's Data Dictionaries for the Humanities, Natural Sciences, and Archaeological Sites |
A list of data categories for describing cultural heritage objects; designed for use by Canadian institutions, especially those participating in the Artefacts Canada database.
|
CHIN, n.d. |
| CIDOC Information Categories |
International Committee for Documentation Guidelines for Museum Object Information
|
A list of data categories for describing museum objects. | CIDOC, 1995 |
| CIMI Dublin Core |
Consortium for the Computer Interchange of Museum Information Guide to Best Practice: Dublin Core
|
Recommendations for implementation of Dublin Core metadata standards in museums. | CIMI, 2000 |
| Dublin Core | Dublin Core Metadata Element Set |
A list of core data categories with metadata standards for each element; designed for describing web-based files. Also includes recommendations for incorporating other metadata standards into the Dublin Core system.
|
Dublin Core Metadata Initiative, 1999, Jul. 2; Dublin Core Metadata Initiative, 2000 |
| EAD | Encoded Archival Description |
An SGML document type definition (DTD) developed as a way for archives and repositories to standardize their online finding aids, inventories, registries, indexes, and other documents.
|
Network Development & MARC Standards Office, 2001 |
| ISAD(G) | General International Standard on Archival Description |
A list of data categories for describing archival materials; an international standard meant to be combined with national standards.
|
Committee on Descriptive Standards, 1999 |
| LC Core | Library of Congress Digital Repository Development Core Metadata Elements |
A list of data categories with metadata standards for each element; designed for use with Library of Congress digital libraries.
|
Library of Congress, 2000 |
| MDA Documentation Checklist | MDA (formerly Museum Documentation Association) Documentation Checklist |
A list of data categories necessary for describing museum objects; designed to prepare museums for registration with the United Kingdom Museums & Galleries Commission.
|
MDA, 2000 |
| MESL Data Dictionary | Museum Educational Site Licensing Project Data Dictionary |
A list of data categories for describing museum objects; designed for a project now superseded by AMICO.
|
J. Paul Getty Trust, 1998, pp. 171-183 |
| OAI Protocol | Open Archives Initiative Protocol for Metadata Harvesting |
An interoperability framework, based on OAI compliance of World Wide Web repositories; OAI-compliant metadata can be expressed as unqualified Dublin Core metadata.
|
OAI, 2001 |
| Object ID | Object ID |
A list of data categories for describing cultural objects as a means of crime prevention; development funded by the J. Paul Getty Trust and promoted by law enforcement agencies and various corporations and organizations.
|
Council for the Prevention of Art Theft, 1999 |
| OWL | Ontology Web Language |
A list of goals for metadata and interoperability.
|
W3C, 2002, Mar. 7 |
| PCC CBR-AV | Program for Cooperative Cataloging Core Bibliographic Record for Audiovisual Materials |
A list of data categories necessary for describing audio or visual materials in the USMARC system.
|
PCC CBR-AV Task Group, 1996 |
| PRESERV Metadata | RLG Preservation Metadata Elements |
A metadata standard for describing digitized images; developed to preserve information about the digitization process, as well as aid in re-creation of defunct image formats.
|
PRESERV, 1998 |
| RDF | Resource Description Framework |
A metadata standard for describing files on the World Wide Web; designed for maximum interoperability between platforms, databases, and applications.
|
W3C, 2002, Mar. 19 |
| REACH Element Set |
Record Export for Art and Cultural Heritage Element Set for Shared Description of Museum Objects
|
A list of data categories for describing museum objects; designed for data-sharing among RLG institutions. | RLG, 1998 |
| SPECTRUM | SPECTRUM |
An "out of the box" program for developing a museum documentation system, designed for United Kingdom museums; conforms to MDA standards.
|
MDA, 2002 |
| USMARC | United States Machine-Readable Cataloging Format |
An exhaustive list of data categories and metadata standards for describing library materials, including objects and other cultural heritage materials.
|
Network Development & MARC Standards Office, 2002, Apr. 3 |
| VRA Core 3.0 | Visual Resources Association Core Categories, Version 3.0 |
A core list of data categories with accompanying metadata standards for each element; for describing items owned by or of interest to members of the Visual Resources Association, especially slide libraries.
|
VRA Data Standards Committee, 2002 |
| Z39.50 |
International Standard, ISO 23950: Information Retrieval (Z39.50): Application Service Definition and Protocol Specification
|
An interoperability framework; originally designed for data-sharing between libraries, but now used by many organizations. | Network Development & MARC Standards Office, 2002, Mar. 14 |
I believe that simplicity will be the key, at least in the beginnings of the semantic web. Most museum staff are not information professionals; many are volunteers. Many museums can devote only tiny portions of their budgets to collection documentation. Until there is an international, cheap, out-of-the-box software system available (like SPECTRUM), the museum world will be better off defining an easy, minimum standard.
That said, I will concentrate on selecting three things for the Drexel Digital Museum: an interoperability standard, core data categories, and a metadata standard.
Interoperability increases the value of the data in individual repositories by allowing easy interlinking of databases. The two major choices are OAI Protocol and RDF.
OAI Protocol is an XML schema providing additional structure to Dublin Core metadata. Although it is being developed for use by archives, CIMI has tested it for museum use and found it simple to implement (Perkins, 2001).
RDF is an XML schema also, but not necessarily dependent on Dublin Core. Furthermore, it is being developed for use by all providers of web-based data and is thus a more universal standard.
Both OAI Protocol and RDF are an "envelope" around metadata, routing queries to the correct namespace, which can translate metadata inside the "envelope" into whatever form is readable by the querying machine. Both are simpler to implement than Z39.50 and are meant to supersede it, though neither has Z39.50's proven track record (Miller, 1999). Because RDF is a more universal standard than OAI Protocol, I would recommend it for the Drexel Digital Museum. However, since the museum director has already invested considerable time in OAI (see Martin, 2002), the museum should follow its protocol. As OAI becomes more common, it will probably become interoperable with RDF.
Various museum groups and other organizations have set forth core categories of data which must be compiled to describe or represent an object. These usually include creator, object type, a description, and several other categories. The data categories an individual museum chooses will depend on its goals and affiliations.
For the Drexel Digital Museum I recommend the following:
- CIDOC Information Categories, because they are an international standard
- AMICO Data Specification, because it is becoming a U.S. standard and because it is a prerequisite for participation in the AMICO consortium, an important educational resource
- Object ID, because it is recommended by national and international law enforcement agencies
VRA Core 3.0 and CDWA are both important standards, which are, nevertheless, almost identical to CIDOC and AMICO. The REACH Element Set could become important if Drexel University joins RLG.
Dublin Core has become a universal metadata standard. OAI Protocol depends on it and RDF assumes that most data providers will use it.
AMICO Data Specification has its own metadata standards for each data category, though the standards are not terribly different from Dublin Core.
PRESERV Metadata store information often ignored or poorly documented in digital projects, such as digitization process and image viewing requirements.
USMARC is a huge and complex standard and is beyond the scope of this paper. Nevertheless, USMARC fields can generally be mapped onto metadata elements of other standards if necessary (see Baca & others, 2001).
For Drexel Digital Museum, I recommend the following:
- Dublin Core, following CIMI Dublin Core best practices as appropriate
- AMICO Data Specification, as a second metadata set
- PRESERV Metadata, for metadata representing technical details of imaging
Currently, items in the Drexel Digital Museum do not have extensive metadata visible to the public (see html code for web display of garment number 3607 a,b, below):
However, Olson & Javins (n.d.) have developed a cataloging template for the Drexel Digital Museum based on the Dublin Core. I assume that this template has been used for cataloging the collection even though the results are not visible at this time.
Marcia Lei Zeng is doing interesting work on museum object metadata, specifically for objects in the Kent State University Historical Fashion Collections. She has designed a data entry interface for cataloging objects (Zeng, 2001, especially http://circ.slis.kent.edu/mze ng/vra3template.htm). Once the data has been entered, the cataloger can choose to have it processed as VRA Core 3.0, USMARC, or Dublin Core. Zeng's documentation is not entirely clear, but it appears that the cataloger may also choose to have the data elements stored without processing according to an outside standard.
This last option seems most sensible to me. Potentially, almost any metadata standard could then be supported by programming a "crosswalk" that would take pieces of data and place them in the proper metadata "slots." For instance, a crosswalk for the title of an object (following Baca & others, 2001):
|
AMICO Data Specification |
Object-Title-Name |
|
CIDOC Information Categories |
Object name |
|
CIMI |
object Title |
|
CDWA |
TITLES OR NAMES (core)
|
|
DC |
Title
|
|
EAD |
<titleproper> (in <eadheader>) |
|
FDA Guide |
Group/Item Identification-Repository Title
|
|
Object ID |
Title
|
|
USMARC |
24Xa Title and Title-Related Information
|
|
VRA Core 2.0 |
W2. Title
|
|
VRA Core 3.0 |
Title |
I have created a rough metadata crosswalk (available as a separate file, crosswalk.xls) for data elements required by the various standards I have recommended.
In my opinion, the specific standards adopted are less important than the act of adopting a standard. As long as a museum has implemented at least one generally recognized standard, it should be able to share data with other museums.
next page: data entry interface requirements
back to project home page
Donny
Smith
May 2002
[email protected] or [email protected]