Skip to content
This repository was archived by the owner on Nov 18, 2021. It is now read-only.

Latest commit

 

History

History
57 lines (36 loc) · 2.88 KB

usecase-cow.md

File metadata and controls

57 lines (36 loc) · 2.88 KB

CoW: CSV On the Web

Metadata

  • Status: Completed
  • Type: Specific
  • Work Package: WP4
  • Participating Institutes: International Institute of Social History (IISG) and Vrije Universiteit Amsterdam (VU)
  • Coordinators: Richard Zijdeman (IISG)
  • Developers: Albert Meroño Peñuela, Roderick van de Weerdt, Melvin Roest
  • End-users: The software is designed for the so called ‘digital’ historian: e.g. someone with basic command line skills.
  • Interest Groups: IG-LOD, IG-Curation, and Worflows

Description

What is the research about?

Historians often use tabular data as input for their analyses. These dataset often reside on local machines, which hinders the reproducibility and replicability of research, as well as preventing to link their dataset to other useful data. This package is a comprehensive command-line utility for batch conversion of multiple datasets expressed in CSV.

What problem is hindering the research?

Trough Linked Data conversion CoW aims to enhance the workflow of historical research - and other research working with tabular data - by facilitating the sharing, standardization, and interlinking of datasets (and queries).

Its technical features are:

  • Expressive CSVW-compatible schemas based on the Jinja template engine;
  • highly efficient implementation leveraging multithreaded and multicore architectures;
  • available as a pythonic CLI tool and library;
  • supports Python 3

Data

Any CSV dataset is required as input, preferably using UTF-8 encoding.

Tools

What software and services are involved?

  • Python 3

References

References to related resources and publications and especially links to related use-cases: