Documentation

Collection Policy

Policy Statement

This document describes the policies governing the acquisition, curation, and management of materials in York University Digital Library. These include digital objects created by YUL and published digital objects that require local hosting and rights management. York University Digital Library makes available collections that support research and scholarship according to the needs of the York University Digital Library Designated Community and in accordance with its Digital Preservation Strategic Plan.

Selection Criteria

For inclusion within York University Digital Library, objects must meet all Level One criteria and a consideration of Level Two criteria.

Level One criteria cover objective standards such as copyright, extent, and format issue.

Each item must:

  • Be in the public domain, or have documented, non-revocable permission granted by the copyright holder.
  • Be intended for non-commercial public viewing, and educational/research and use.
  • Be complete, such as an entire publication, article, etc., and not a "part" such as an abstract, forward, or title page
  • Meet the standards required for long-term digital curation, as described by documentation of York University Digital Library content types
  • Be in a standard format accessible through current file viewers or have a documented conversion path to move the format into a standard format
  • Be intended for permanent storage in York University Digital Library
  • With the exception of self-archiving of published works, items must be unique or novel, a similar or identical digital object should not already exist
  • Strive to meet accessibility standards as described by the Accessibility for Ontarians with Disabilities Act

Level Two criteria cover subjective elements requiring review, assessment, and the professional judgement of the Digital Initiatives Advisory Group in collaboration with subject experts. As an initial consideration and overarching principle, requests should fit within the mandate of YUL's general collection policy, including that of YorkSpace. Materials should also have broad and enduring value, as opposed to, for example, course-specific materials better housed elsewhere or material for which the demand does not go beyond specific or one-time usage. Research and/or administrative value is defined both locally and globally, with local (provincial and national) as a priority. Added value components acting as part of the consideration include degree of integration in an online environment, intellectual control (metadata), improvement of resource sharing, and enhancement of access. Finally, a request may receive merit based on its potential as a strategic opportunity, whether in teaching and learning or partnership and collaboration.

Retention and Evaluation

All objects included in York University Digital Library are intended to be retained permanently upon acceptance into York University Digital Library. York University Digital Library will not be used as a temporary storage facility for digital items.

Acknowledgements

Adapted from and inspired by:

License

CC0

CC0 1.0 Universal (CC0 1.0) Public Domain Dedication

Critical Processes and OAIS Mandatory Responsibilities

Critical Processes and OAIS Mandatory Responsibilities

  1. Introduction
    • This document traces critical processes employed by York University Libraries (YUL) to meet the "mandatory responsibilities" of a digital repository as described in OAIS. This document identifies which processes are necessary for the repository to fulfill its mandatory responsibilities.
  2. OAIS 3.1: "Negotiate for and accept appropriate information from information Producers."
    • YUL has a clearly defined process for negotiating with producers and ensuring that it acquires appropriate information. See the Rights Policy for more information.
  3. OAIS 3.1: "Obtain sufficient control of the information provided to the level needed to ensure Long-Term Preservation."
    • YUL obtains rights from individual producers that give the repository control over all of the information deposited by the producer. The nature and scope of these rights varies by submitter. In cases where the repository takes responsibility for the preservation of information, the rights include provisions for YUL to receive a local copy of the information and host it in perpetuity. In some cases, the repository obtains the right to modify information in order to ensure long-term preservation and accessibility. See the Rights Policy for more information.
  4. OAIS 3.1: "Determine, either by itself or in conjunction with other parties, which communities should become the Designated Community and, therefore, should be able to understand the information provided."
  5. OAIS 3.1: "Ensure that the information to be preserved is Independently Understandable to the Designated Community. In other words, the community should be able to understand the information without needing the assistance of the experts who produced the information."
  6. OAIS 3.1: "Follow documented policies and procedures which ensure that the information is preserved against all reasonable contingencies, and which enable the information to be disseminated as authenticated copies of the original, or as traceable to the original."
    • YUL has policies and procedures for the long-term preservation of information. See the Preservation Implementation Plan, and Definition of AIP for more information about the repository’s ingest, data management, and archival storages processes. AIPs are not deleted as a part of the repository’s normal operations.
    • The repository maintains backups of all content. See the Backup Plan for more details.
    • The repository is an integral component of disaster recovery planning for YUL.
    • YUL negotiates submission policies and procedures with individual producers. See the Rights Policy and the Definition of SIP for more information.
    • The repository has policies and procedures for the dissemination of information to its Designated Community. See the Definition of DIP for more information. To maintain the understandability and accessibility of disseminated information, YUL carries out extensive usability testing and solicits feedback from its Designated Community.
    • To ensure authenticity, each AIP is linked to a specific object and source file by information in the preservation metadata. This information is not visible to the Designated Community in the DIP, but can made available if necessary. The repository’s DIPs are always generated from a single AIP.
  7. OAIS 3.1: "Make the preserved information available to the Designated Community."
    • YUL disseminates the information to its Designated Community through its own user interfaces. See the Definition of DIP for more information about the repository's dissemination process. Depending on the license, access may be restricted to users affiliated with the York University community. See the Access Policy for more information.

Acknowledgements

Adapted from and inspired by:

License

CC0

CC0 1.0 Universal (CC0 1.0) Public Domain Dedication

Metadata Specifications

Metadata Specifications

1. Policy Statement

York University Digital Library requires thorough, well-structured metadata in order to preserve the content, relationships, activities and logical structure of the object.

2. Implementation

  • Each object in YUDL has a MODS (Metadata Object Description Schema) datastream to provide descriptive metadata.
  • Each object in YUDL has a TECHMD_FITS (see Registry of file formats) datastream to provide file identification and characterizationi information.
  • Each object in YUDL has a RELS-EXT (Releationship External) datastream that describes the object's relationship(s) to other object's in the repository.
  • Each object in YUDL has a POLICY (XACML) datastream for AuthZ.
  • YUDL utilizes the PREMIS (Preservation Metadata Implementation Strategy) vocabulary via Islandora PREMIS. PREMIS's data dictionary provides ways of describing objects and processes that are necessary for digital preservation. YUDL makes use of the objects, events and rights entities described in the PREMIS Data Model.

Acknowledgements

Adapted from and inspired by:

License

CC0

CC0 1.0 Universal (CC0 1.0) Public Domain Dedication

Review Cycle for Documentation Policy

Review Cycle for Documentation Policy

1. Policy Statement

The policies surrounding the operations of York University Digital Library as a preservation repository are subject to review and revision on two schedules: an ongoing basis and a cycle of regular review.

  • Ongoing
  • Regular Review
    • York University Libraries' preservation policy and all its related documents will be reviewed every two years in its entirety. This review will be led by the Digital Assets Librarian, in consultation with the Digital Initiatives Advisory Group.
  • Documentation history/versioning
    • All York University Libraries digital preservation policy and documentation is done in the Markdown format.
    • All York University Libraries digital preservation policy and documentation is version controlled using Git, and available in the York University Libraries GitHub Organization.

Acknowledgements

Adapted from and inspired by:

License

CC0

CC0 1.0 Universal (CC0 1.0) Public Domain Dedication

URI Policy

URI Policy

Policy Statement

URIs created by York University Digital Library

  • York University Digital Library uses a systematic convention to generate unambiguously unique identification for digital objects within its repository. This convention will create a stable name or reference to an object that can be permanently associated with that object, regardless of future changes to organizational structure or to digital access protocols.
  • This is in conformance with section 4.2.4 of Metrics for Digital Repository Audit and Certification (CCSDS, June 2009) which states that a compliant repository "shall have and use a convention that generates persistent, unique identifiers for all AIPs" and "its components."
  • This convention will ensure that “each AIP can be unambiguously found in the future” and that "each AIP can be distinguished from all other AIPs in the repository"

Implementation

Islandora object

York University Digital Library canonical URIs are consistently constructed in the following manner:

  • /islandora/object/PID

These URIs are aliased using Islandora Pathauto to the following pattern:

  • [fedora:pid]/[fedora:label]

Example:

  • Photograph: New Woodbine : racehorses train for opening of season
  • Canonical URI: http://digital.library.yorku.ca/islandora/object/yul:88675
  • Aliases URL: http://digital.library.yorku.ca/yul-88675/new-woodbine-racehorses-train-opening-season

Islandora object datastream

York University Digital Library object datastream canonical URIs are consistently constructed in the following manner:

  • /islandora/object/PID/datastream/DATASTREAM_NAME/view
  • /islandora/object/PID/datastream/DATASTREAM_NAME/download
  • [fedora:pid]/[fedora:label]/datastream/DATASTREAM_NAME/view
  • [fedora:pid]/[fedora:label]/datastream/DATASTREAM_NAME/download

Example:

  • Photograph: New Woodbine : racehorses train for opening of season
  • Canonical URI: http://digital.library.yorku.ca/islandora/object/yul:88675/datastream/JPG/view
  • Aliases URL: http://digital.library.yorku.ca/yul-88675/new-woodbine-racehorses-train-opening-season/datastream/JPG/download

Publicly available datastream names

Audio:

  • TN (Thumbnail)
  • PROXY_MP3 (Streaming quality MP3)

Book:

  • TN (Thumbnail)
  • ORIGINAL_PDF (Only for Buddhism Across Boundaries: Buddhist Periodicals and Books from Colonial Burma collection )

Images:

  • TN (Thumbnail)
  • JPG (Medium sized JPG)
  • OCR (OCR'd text)

Metadata:

  • MODS (Descriptive metadata)
  • DC (Descriptive metadata)
  • TECHMD_FITS (Technical metadata)
  • RELS-EXT (Fedora Object to Object Relationship)

Video:

  • TN (Thumbnail)
  • MP4 (Streaming quality MP4)

Web ARChive:

  • TN (Thumbnail)
  • JPG (Medium sized JPEG)
  • WARC_CSV (WARC Index)
  • WARC_FILTERED (WARC filtered)
  • OBJ (Warc)

Acknowledgements

Adapted from and inspired by:

License

CC0

CC0 1.0 Universal (CC0 1.0) Public Domain Dedication

Video - Preservation Action Plan

Introduction

This document describes the preservation plan for video content in the York University Digital Library. Most of the video content content is from the Sound and Moving Image Library and the Clara Thomas Archives and Special Collections. The preservation plan for video content follows from policies and practices described in the Digital Preservation Strategic Plan and the Digital Preservation Implementation Plan. This document explains practical steps that York University Libraries take to preserve the intellectual content of video in digital format. It outlines the basic tools, methods, and standards used for the long-term preservation of video content.

Content Formats

For the preservation of video content, York University Libraries require uncompressed 8 or 10 bit AVI versions of the content, and descriptive metadata. During the ingest process, derivatives are created for streaming. York University Libraries continuously monitors developments in file formats to determine if and when formats require migration (see Environmental Monitoring of Preservation Formats).

SIP Format

Video SIPs (see Definition of SIP) generally consists of an AVI file, and an associated MODS descriptive metadata file.

Analysis on Ingest

Upon ingest, every file in the repository is subject to identification of its file format and validation using FITS. The output of the FITS identification and validation processes are recorded to a techincal metadata datastream (TECHMD_FITS) that is associated with the object in the repository.

Content Excluded

York University Libraries doe not ingest files that are not referenced (either as part of a representation or as associated datastreams) in the associated metadata. As the SIP is retained, these files can later be ingested if necessary.

Format Normalization

There is no format normailziation if the submitted object is an uncompressed 8 or 10 bit AVI.

Metadata Normalization

When necessary, York University Libraries crosswalk descriptive metadata from MODS to Dublin Core. The repository creates preservation metadata for each file. The preservation level, explained in the Digital Preservation Implementation Plan, is applied to each file upon ingest and recorded in the preservation metadata for each file.

Acceptable Formats

For the Full Preservation level for video, currently the acceptable formats are AVI. Video submissions may be MPEG, MP4, OGG, MKV, or Quicktime format, however they will be preserved at the Bit-level Preservation level.

Acknowledgements

Adapted from and inspired by:

License

CC0

CC0 1.0 Universal (CC0 1.0) Public Domain Dedication

Image - Preservation Action Plan

Introduction

This document describes the preservation plan for image content in the York University Digital Library. Most of the image content content is from the Clara Thomas Archives and Special Collections. The preservation plan for image content follows from policies and practices described in the Digital Preservation Strategic Plan and the Digital Preservation Implementation Plan. This document explains practical steps that York University Libraries take to preserve the intellectual content of image in digital format. It outlines the basic tools, methods, and standards used for the long-term preservation of image content.

Content Formats

For the preservation of image content, York University Libraries require uncompressed TIF versions of the content, and descriptive metadata. During the ingest process, derivatives are created for display. York University Libraries continuously monitors developments in file formats to determine if and when formats require migration (see Environmental Monitoring of Preservation Formats).

SIP Format

Image SIPs (see Definition of SIP) generally consist of a TIF file, and an associated MODS descriptive metadata file.

Analysis on Ingest

Upon ingest, every file in the repository is subject to identification of its file format and validation using FITS. The output of the FITS identification and validation processes are recorded to a techincal metadata datastream (TECHMD_FITS) that is associated with the object in the repository.

Content Excluded

York University Libraries do not ingest files that are not referenced (either as part of a representation or as associated datastreams) in the associated metadata. As the SIP is retained, these files can later be ingested if necessary.

Format Normalization

There is no format normailziation if the submitted object is a TIF.

Metadata Normalization

When necessary, York University Libraries crosswalk descriptive metadata from MODS to Dublin Core. The repository creates preservation metadata for each file. The preservation level, explained in the Digital Preservation Implementation Plan, is applied to each file upon ingest and recorded in the preservation metadata for each file.

Acceptable Formats

For the Full Preservation level for image, currently the acceptable formats are TIF. Image submissions may be JPG or PNG format, however they will be preserved at the Bit-level Preservation level.

Acknowledgements

Adapted from and inspired by:

License

CC0

CC0 1.0 Universal (CC0 1.0) Public Domain Dedication

Audio - Preservation Action Plan

Introduction

This document describes the preservation plan for audio content in the York University Digital Library. Most of the audio content content is from the Sound and Moving Image Library and the Clara Thomas Archives and Special Collections. The preservation plan for audio content follows from policies and practices described in the Digital Preservation Strategic Plan and the Digital Preservation Implementation Plan. This document explains practical steps that York University Libraries takes to preserve the intellectual content of audio in digital format. It outlines the basic tools, methods, and standards used for the long-term preservation of audio content.

Content Formats

For the preservation of audio content, York University Libraries require WAV or FLAC versions of the content, and descriptive metadata. During the ingest process, derivatives are created for streaming. York University Libraries continuously monitors developments in file formats to determine if and when formats require migration (see Environmental Monitoring of Preservation Formats).

SIP Format

Audio SIPs (see Definition of SIP) generally consist of a WAV or FLAC file, and an associated MODS descriptive metadata file.

Analysis on Ingest

Upon ingest, every file in the repository is subject to identification of its file format and validation using FITS. The output of the FITS identification and validation processes are recorded to a techincal metadata datastream (TECHMD_FITS) that is associated with the object in the repository.

Content Excluded

York University Libraries do not ingest files that are not referenced (either as part of a representation or as associated datastreams) in the associated metadata. As the SIP is retained, these files can later be ingested if necessary.

Format Normalization

There is no format normailziation if the submitted object is a WAV or FLAC.

Metadata Normalization

When necessary, York University Libraries crosswalk descriptive metadata from MODS to Dublin Core. The repository creates preservation metadata for each file. The preservation level, explained in the Digital Preservation Implementation Plan, is applied to each file upon ingest and recorded in the preservation metadata for each file.

Acceptable Formats

For the Full Preservation level for audio, currently the acceptable formats are WAV and FLAC. Audio submissions may be MP3 format, however they will be preserved at the Bit-level Preservation level.

Acknowledgements

Adapted from and inspired by:

License

CC0

CC0 1.0 Universal (CC0 1.0) Public Domain Dedication

Environmental Monitoring of Preservation Formats

Web Archives - Preservation Action Plan

Introduction

This document describes the preservation plan for web archives content in the York University Digital Library. Most of the web archive content content is York University web sites. The preservation plan for web archives content follows from policies and practices described in the Digital Preservation Strategic Plan and the Digital Preservation Implementation Plan. This document explains practical steps that York University Libraries take to preserve the intellectual content of web archives in digital format. It outlines the basic tools, methods, and standards used for the long-term preservation of web archive content.

Content Formats

For the preservation of web archive content, York University Libraries require WARC versions of the content, descriptive metadata, and if possible, screen captures of the archived site. During the ingest process, derivatives are created for full-text searching of web archives. York University Libraries continuously monitors developments in file formats to determine if and when formats require migration (see Environmental Monitoring of Preservation Formats).

SIP Format

York University Libraries create their own web archive SIPs (see Definition of SIP) using its YUDL Web archiving scripts.

Analysis on Ingest

Upon ingest, every file in the repository is subject to identification of its file format and validation using FITS. The output of the FITS identification and validation processes are recorded to a techincal metadata datastream (TECHMD_FITS) that is associated with the object in the repository.

Content Excluded

York University Libraries do not ingest files that are not referenced (either as part of a representation or as associated datastreams) in the associated metadata. As the SIP is retained, these files can later be ingested if necessary.

Format Normalization

There is no format normailziation if the submitted object is a WARC.

Metadata Normalization

When necessary, York University Libraries crosswalk descriptive metadata from MODS to Dublin Core. The repository creates preservation metadata for each file. The preservation level, explained in the Digital Preservation Implementation Plan, is applied to each file upon ingest and recorded in the preservation metadata for each file.

Acceptable Formats

For the Full Preservation level for web archives, currently the only acceptable format is WARC.

Acknowledgements

Adapted from and inspired by:

License

CC0

CC0 1.0 Universal (CC0 1.0) Public Domain Dedication