BHL
Archive
This is a read-only archive of the BHL Staff Wiki as it appeared on Sept 21, 2018. This archive is searchable using the search box on the left, but the search may be limited in the results it can provide.

QA Policy

back

Table of Contents

QA Issues for IC Discussion
Scope/Purpose of QA:
Procedural Recommendations:
Common Errors and examples:
Other Issues
Culled notes

QA Issues for IC Discussion

Scope/Purpose of QA:

The purpose of performing Quality Assurance testing on scans is to ensure a consistent level of scan and metadata quality across the various scanning centers and BHL partner libraries and to minimize the loss of intellectual content in the works scanned. As such, it is imperative that all BHL partner libraries perform baseline Quality Assurance on their scans, regardless of scanning vendor or source, using the procedures outlined below. The primary consideration when performing QA on scans is to determine if the digital object created will support the access and data mining needs of the BHL portal, EOL, and other human and machine users of the materials. To be clear, the goal of digitizing for BHL is not for digital preservation or to create true facsimilies of works. For this reason QA will not address scan quality issues related to the user's experience of the item, such as color variation on pages, etc. unless they affect the ability to access or data mine the intellectual content of the work. For our purposes, the determination of what constitutes intellectual content for each item is at the discretion of each institution, based on the guidelines outlined in this policy.

Procedural Recommendations:

QA should always be done with the original object in hand - this is particularly important for items with odd pagination or unpaginated plates. It is most efficient to insert QA into the libraries' workflow immediately after receipt of scanned items. Ideally, procedures will go something like this:

Common Errors and examples:

Issue
Minor Examples
Major Examples: affects intellectual content
Notes
Missing Page(s)
  • blank pgs which misalign page location within the item ex. versos changed to rectos
  • missing tissue that affects page order but text not affected
  • tip-ins
  • page MIA
  • misalignment of centerfolds
  • tissue obscures content in scan
  • adverts and non-meaty content determination will be up to library/subject expert
cropped text
  • edge of a letter cut-off
  • several letters that make OCR on word or phrase impossible

lcontrast / white balance issues or blurry scans
  • does not compromise OCR, but difficult to read on screen
  • readable, but compromises significant parts of OCR text, or compromises OCR of taxonomic information or key words on page.
  • unreadable / illegible to average person
  • when minor color/contrast problems are detected, alert scanning center and request that the cameras be recalibrated
foldouts
  • orientation does not match item, but does not compromise OCR or view-ability / readability
  • orientation is way off, upside down / backwards
  • if color variation is so bad your sp. is now a sp. nov.

skew
  • does not affect OCR
  • affects OCR
  • this is a rare issue
gutter span
  • gutter and portion of next page visible
  • content that spans gutter not addressed as a foldout, content is cropped or otherwise unclear
  • BHLers need to indicate gutter spans as foldouts if necessary

Other Issues




Culled notes