Skip to content

Latest commit

 

History

History
191 lines (143 loc) · 7.2 KB

data-formats.md

File metadata and controls

191 lines (143 loc) · 7.2 KB

Compare Data Formats


CSV

Schema

NaN / NA

statistics on tabular data (CSV)

Diff Binary files: Libreoffice and DVCS

To resolve conflits on binary files, you can checkout the remote version and your version with the –theirs’ and –ours’ checkout flags. This allows you to inspect both version, then write back the correct resulting file.

$ git checkout --theirs -- doc/manual.odt
$ mv doc/manual.odt doc/theirs.manual.odt

$ git checkout --ours -- doc/manual.odt
$ mv doc/manual.odt doc/ours.manual.odt

Open and compare theirs.manual.odt’ with ours.manual.odt’, and save back the result into `manual.odt’.

$ git add doc/manual.odt
$ git commit

ELAN

  • A ruby package to handle ELAN files:: MExiCo (short for "Multimodal Experiment Corpora") is a library for the modeling and management of large, heterogeneous data collections from the field of linguistics, psycholinguistics, and related disciplines. Its central organising unit is the Corpus class which allows researchers to bundle resources from an experiment with related background data, conceptual data, and metadata.
  • https://tla.mpi.nl/tools/tla-tools/elan/
  • http://www.mpi.nl/tools/elan/EAF_Annotation_Format.pdf

Python packages




LaTex2PDF

Automate LaTeX

Paper Writing


MAT


Markdown



Pandoc


RDF

RDF == Resource Description Framework


Python with XML


Standards

MetaData


YAML