BHL
Archive
This is a read-only archive of the BHL Staff Wiki as it appeared on Sept 21, 2018. This archive is searchable using the search box on the left, but the search may be limited in the results it can provide.

BHL Metadata Requirements

Biodiversity Heritage Library (BHL)
Metadata Schema - Draft 2

Introduction


The metadata schema to be adopted by the BHL must strike a balance between being practical and comprehensive. It must at minimum serve to identify and locate the digitized literature but it should also make provision for additional functions that may become possible in the future, which may mean that sections of the schema remain unused in the early stages of BHL development.

Note: In developing a "BHL Application Profile" it is not necessary to adopt comprehensive external schemas in their entirety. By using namespace declarations it is possible to select just those elements that will be of value to BHL.

This document outlines the functional areas for metadata coverage and the options for existing open data standards that may be adopted, at least in part. Comments and additions are encouraged and should be sent to n.thomson@nhm.ac.uk for incorporation.

Purpose


The purpose of the schema is to provide a data standard for the pooling of data between the partners and to aid the process of creating and managing the BHL. A subset will be used as the entrance point for users.

At this stage, it is not helpful to define the exact schema that the BHL will use - too much evolution has yet to take place based on the assessed needs of users and funders and the level of funding received. However, the functional areas that will require metadata can be defined and an appropriate standard identified from which data elements will be drawn to make up the BHL schema.

Functional areas to be covered


Some level of metadata will be required in several functional areas, outlined below. It is strongly recommended that BHL make use of open standards where these exist, rather than inventing its own to serve the same purpose.

As noted above, not all of every standard needs to be adopted, but through the use of namespace identification, those elements that are of direct use may be imported into a "BHL Application Profile". Even then, not all the elements need to be filled straight away, but by making provision for a rich structure, future services may be developed more easily, as and when time and finance allow.

Given that the BHL is a collaborative project which should have a very long lifespan, all the candidate schemas are XML-based to aid data exchange, aggregation and sustainability.







Functional areas to be covered - summary table


Packaging

COMMENT
Level of granularity to be agreed
OPTIONS SELECTION
METS
CURRENT PREFERRED OPTION
METS

Bibliographic or Descriptive

COMMENT
Export from partner's OPACs and third-party indexes e.g. IK
OPTIONS SELECTION
MODS / MARCXML
CURRENT PREFERRED OPTION
MODS

Technical

COMMENT
Includes requirements for interpreting the files and digital sustainability data
OPTIONS SELECTION
VRA Core / MIX for images. PREMIS for digital sustainability
CURRENT PREFERRED OPTION
MIX
PREMIS

Administrative

COMMENT
Includes rights and publisher agreements
OPTIONS SELECTION
Science Commons / ODRL / XRML / ROMEO / DC:AC
CURRENT PREFERRED OPTION
DC:AC

Identifiers

COMMENT
To enable links with other domains, such as specimen data, sequences and the original documents
OPTIONS SELECTION
LSID or National Library of Australia Persistent Identifier Scheme
CURRENT PREFERRED OPTION
LSID

Workflow support

COMMENT
Includes register of intent flags for "Done", "Priority", "Exception"
OPTIONS SELECTION
BHL-specific / JSTOR
CURRENT PREFERRED OPTION
BHL

Further information