[ Yahoo! ] options

Text Encoding Initiative

The Text Encoding Initiative (TEI) is one of the oldest Metadata standard. Started in 1987, TEI becomes "an international and interdisciplinary standard that helps libraries, museums, publishers, and individual scholars represent all kinds of literary and linguistic texts for online research and teaching, using an encoding scheme that is maximally expressive and minimally obsolescent." The new TEI Consortium was set up in December 2000 to continue maintaining and developing the TEI standard. TEI mainly focuses on the exchange of textual information, but other formats such as image and sound are also addressed.

One of the main characteristics of TEI is that its descriptive metadata is created as the header (known as TEI header) of the encoded file (known as the TEI text proper). The format of TEI encoding scheme is based on the SGML standard. The TEI header has four major parts: , , , , each of which contains a hierarchical set of description using TEI tags. TEI text also includes a set of tags. All the TEI tags are defined in TEI DTD (Data Type Definition).

When TEI was designed, it was expected that the producers of the textual digital data would create the encoding data for the files at the same time. In fact the TEI is very complicated, and the TEI Guidelines encompass some 1300 pages of information. It requires the metadata producer fully understanding the TEI scheme and tags for particular types of text. To make their work easy, a useful basic tag set was selected from the several hundred SGML elements and defined by the full TEI scheme, which contains the most needed elements. Since many texts marked up according to the TEI guidelines are based on printed books for which AACR2R/MARC catalog records exist, in many cases header data is created or reviewed and revised by librarians.

There is a widely usage of TEI in the world. For example, The Electronic Text Centers at the University of Virginia and the University of Michigan have used TEI Lite to encode their holdings. TEI Website also maintains a list of projects using TEI encoding scheme.

<< Dublin Core FDGC >>



Home� � About Author� � References� � Articles
Contact Us� � Glossary� � Site map

Last updated 12/12/2001 Metadata Gateway.
All Rights Reserved

Hosted by www.Geocities.ws

1