Checkm: A Checksum-based Manifest Format
Name: Checkm
Version: 0.7 (2010-06-11)
Status: Beta
Specification:
Recent changes:
Download: Not available
More information:
Checkm is a general-purpose, text-based file manifest format. Each line of a Checkm manifest is a set of '|'-separated tokens (whitespace-separated in version 0.1), the first of which identifies the corresponding digital content by filename or URL. The other tokens identify the digest algorithm, checksum, content length, and modification time. Tokens may be left unspecified; the degenerate case is a simple file list. It is up to the tools that use the Checkm format to specify any further restrictions on tokens (e.g., allowed defaults and digest algorithms) and on overall manifest completeness and coherence. Checkm is designed for tools that verify the bit-level integrity of groups of files for purposes such as content fixity, replication, import, and export. A manifest may be single-level or multi-level (hierarchical); the latter is useful, for example, in harvesting material from very large web sites (cf. sitemaps).
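To illustrate the line format described above, here is a minimal Python sketch that splits one manifest line into tokens and checks a file's bytes against it. It is not taken from the specification or from any existing implementation. The token order (identifier, digest algorithm, checksum, length, modification time) is assumed from the order in which the description lists them, the treatment of lines starting with '#' as comments is an assumption, and the names `Entry`, `parse_line`, and `verify` are hypothetical.

```python
import hashlib
from typing import NamedTuple, Optional


class Entry(NamedTuple):
    """One manifest line; field order is an assumption (see lead-in)."""
    source: str                      # filename or URL
    algorithm: Optional[str] = None  # digest algorithm, e.g. "md5"
    digest: Optional[str] = None     # checksum as a hex string
    length: Optional[str] = None     # content length in bytes
    modtime: Optional[str] = None    # modification time, kept as text


def parse_line(line: str) -> Optional[Entry]:
    """Split a '|'-separated line into an Entry.

    Returns None for blank lines and for lines starting with '#'
    (treating '#' lines as comments is an assumption).
    """
    line = line.strip()
    if not line or line.startswith("#"):
        return None
    # Empty tokens are treated as "left unspecified".
    tokens = [t.strip() or None for t in line.split("|")]
    tokens = tokens[:len(Entry._fields)]  # ignore any further tokens
    if tokens[0] is None:
        raise ValueError("first token (filename or URL) is required")
    return Entry(*tokens)


def verify(entry: Entry, data: bytes) -> bool:
    """Check data against whichever tokens the entry specifies."""
    if entry.length is not None and int(entry.length) != len(data):
        return False
    if entry.algorithm and entry.digest:
        # hashlib.new raises ValueError for an unsupported algorithm name.
        actual = hashlib.new(entry.algorithm.lower(), data).hexdigest()
        return actual == entry.digest.lower()
    return True  # nothing further to check (simple file-list entry)


if __name__ == "__main__":
    payload = b"hello"
    line = f"hello.txt | md5 | {hashlib.md5(payload).hexdigest()} | {len(payload)}"
    entry = parse_line(line)
    print(entry.source, verify(entry, payload))
```

A line consisting of only a filename parses to an entry whose other fields are unspecified, which corresponds to the simple file-list case; `verify` then has nothing to check and returns `True`. A real tool would decide for itself which tokens are mandatory and which digest algorithms it accepts.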
Implementations
Checkm (Python)
Code for version 0.3.1:
- Ben O'Steen: http://github.com/benosteen/checkm
3 Comments
Unknown User (dmoles)
The specification link is to the 2009 version, and the "differences" link is broken.
Unknown User (pwillett)
Looking around.....
Unknown User (dmoles)
Looks like the "differences" list is preserved at the Internet Archive: https://web-beta.archive.org/web/20151023233116/http://www.cdlib.org/uc3/docs/checkm-old-03-diffs.html
It would be nice to have a clean copy of the 0.7 spec, though.
Oh, and "Curation home page" redirects to a Confluence page with no info on Checkm.