- Status: Completed
- Type: Specific
- Work Package: WP4
- Participating Institutes: International Institute of Social History (IISG) and Vrije Universiteit Amsterdam (VU)
- Coordinators: Richard Zijdeman (IISG)
- Developers: Albert Meroño Peñuela, Roderick van de Weerdt, Melvin Roest
- End-users: The software is designed for the so called ‘digital’ historian: e.g. someone with basic command line skills.
- Interest Groups: IG-LOD, IG-Curation, and Worflows
Historians often use tabular data as input for their analyses. These dataset often reside on local machines, which hinders the reproducibility and replicability of research, as well as preventing to link their dataset to other useful data. This package is a comprehensive command-line utility for batch conversion of multiple datasets expressed in CSV.
Trough Linked Data conversion CoW aims to enhance the workflow of historical research - and other research working with tabular data - by facilitating the sharing, standardization, and interlinking of datasets (and queries).
Its technical features are:
- Expressive CSVW-compatible schemas based on the Jinja template engine;
- highly efficient implementation leveraging multithreaded and multicore architectures;
- available as a pythonic CLI tool and library;
- supports Python 3
Any CSV dataset is required as input, preferably using UTF-8 encoding.
- Python 3
References to related resources and publications and especially links to related use-cases: