Fetcher manipulator: SpecificSet

Mark Jordan edited this page Oct 16, 2015 · 11 revisions

Overview

Fetcher manipulator that restricts processing to a specific set of objects. Useful for testing and QA purposes, or for regenerating a subset of Islandora ingest packages from a larger set of input data.

MIK's --limit parameter applies as if this manipulator were absent. If the identifiers listed in the input file match records retrieved within the limit, those records are included in the set processed by MIK; if not, they are excluded. Since the specific set is by definition a limit on how many records are processed, the --limit parameter is not usually used in conjunction with this manipulator.
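The interaction between --limit and the specific set can be illustrated with a minimal Python sketch. This is not MIK's code (MIK is written in PHP); the function name `select_records` and the sample keys are hypothetical, and the sketch assumes that the limit truncates the fetched records first and that the ID filter is applied to what remains, as the paragraph above describes.

```python
from typing import Iterable, List, Optional, Set


def select_records(
    record_keys: Iterable[str],
    wanted: Set[str],
    limit: Optional[int] = None,
) -> List[str]:
    """Return the record keys that would be processed (illustrative only)."""
    keys = list(record_keys)
    # Assumption: the limit is applied as if the manipulator were absent,
    # i.e. it truncates the fetched records before any filtering.
    if limit is not None:
        keys = keys[:limit]
    # Keep only the records whose key appears in the specific set.
    return [key for key in keys if key in wanted]


all_keys = ["10", "20", "30", "40", "50"]  # hypothetical record keys
wanted = {"20", "50"}                      # IDs listed in the input file

no_limit = select_records(all_keys, wanted)
with_limit = select_records(all_keys, wanted, limit=3)
```

Under these assumptions, a limit of 3 drops record "50" even though it is listed in the input file, because it falls outside the first three records retrieved. This is why combining the two is rarely useful.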

Toolchains

Can be used within any toolchain (i.e., is not specific to CONTENTdm CSV, etc.).

Configuration

To register this manipulator in your toolchain, add the following line to the "[MANIPULATORS]" section of your .ini file:

fetchermanipulators[] = "SpecificSet|/tmp/record_ids.txt"

Parameters

This manipulator takes a single parameter: the path to the file containing the identifiers for the records you want MIK to process.

Functionality

The identifiers of the records to process are read from a plain text file containing one ID per line. Each ID must be a value of the field defined as 'record_key' in the .ini file. The path to this file is this manipulator's sole parameter, e.g., fetchermanipulators[] = "SpecificSet|/tmp/record_ids.txt".

The input file should contain a list of CONTENTdm pointers, CSV row IDs, or whatever field is defined in the .ini file as the record_key, one ID per line. Comments are allowed in this file, using a '#' at the start of comment lines.
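As an illustration of the input file format, the following Python sketch parses a sample ID list. It is not MIK's implementation; the function name `parse_id_lines` and the sample IDs are hypothetical. Skipping blank lines and trimming surrounding whitespace are assumptions made for the sketch, since the description above only specifies one ID per line and '#' at the start of comment lines.

```python
from typing import Iterable, List


def parse_id_lines(lines: Iterable[str]) -> List[str]:
    """Collect record IDs from lines of an ID file (illustrative only)."""
    ids: List[str] = []
    for line in lines:
        stripped = line.strip()
        # Assumption: blank lines are ignored.
        if not stripped:
            continue
        # Lines starting with '#' are comments.
        if stripped.startswith("#"):
            continue
        ids.append(stripped)
    return ids


# Hypothetical contents of a file such as /tmp/record_ids.txt
sample = """# Records flagged during QA
112
245
# Re-run after metadata fix
389
"""

ids = parse_id_lines(sample.splitlines())
```

Here `ids` holds the three record keys and neither comment line. To read a real file, the same function could be passed an open file handle, since iterating over a file yields its lines.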
