Child pages
  • Checkm
Skip to end of metadata
Go to start of metadata

Checkm: A Checksum-based Manifest Format

Name:

Checkm

Version:

0.7 (2010-06-11)

Status:

Beta

Specification:

Checkm: A Checksum-Based Manifest Format

Recent changes:

Differences from previous version

Download:

Not available

More information:

Curation home page

Checkm is a general-purpose text-based file manifest format. Each line of a Checkm manifest is a set of '|'-separated (in version 0.1, whitespace-separated) tokens, the first of which identifies the corresponding digital content by filename or URL. Other tokens identify digest algorithm, checksum, content length, and modification time. Tokens may be left unspecified, the degenerate case being a simple file list. It is up to tools that use the Checkm format to specify any further restrictions on tokens (e.g., allowed defaults and digest algorithms) and on overall manifest completeness and coherence. Checkm is designed to support tools that verify the bit-level integrity of groups of files in support of such things as content fixity, replication, import, and export. A manifest may be single-level or multi-level (hierarchical), the latter being useful, for example, in harvesting material from very large web sites (cf. sitemaps).

Implementations

Checkm (Python)

Code for version 0.3.1:

  • No labels

3 Comments

  1. The specification link is to the 2009 version, and the "differences" link is broken.

  2. Looks like the "differences" list is preserved at the Internet Archive: https://web-beta.archive.org/web/20151023233116/http://www.cdlib.org/uc3/docs/checkm-old-03-diffs.html

    It would be nice to have a clean copy of the 0.7 spec, though.

    Oh, and "Curation home page" redirects to a Confluence page with no info on Checkm.