Here, you can find scripts and datasets for the analysis of cross-linguistic data in the context of the project "Canonical Rate & Language Properties".
Structure of the pipeline:
- LangComplex - analysis of children's CP with respect to the complexity of their language (e.g. different syllable complexity levels);
- Adult_CR_analysis - analysis of adults' CP with respect to the complexity of their language;
- PhonComplex - qualitative analysis of phonetic complexity (i.e. vowels and consonants) based on Maddieson's classification;
- Numeric_Consonants_and_Vowels - quantitative analysis of phonetic complexity (i.e. vowels and consonants).
Data processed from annotated ELAN files into readable xlsx/csv datasets are located in the Data folder. The Raw Data folder contains the original data from OSF used for the analysis.
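As a quick way to see which processed datasets are available, here is a minimal Python sketch. It assumes you run it from the root of the repository, that the processed files sit under the Data folder, and that they have `.csv` or `.xlsx` extensions. The function name `list_datasets` is hypothetical and not part of the pipeline.

```python
from pathlib import Path

# Extensions assumed from the description above; adjust if the folder holds other formats.
DATA_EXTENSIONS = {".csv", ".xlsx"}


def list_datasets(data_dir="Data"):
    """Return the sorted paths of csv/xlsx files found under data_dir (searched recursively)."""
    root = Path(data_dir)
    if not root.is_dir():
        # Folder not found, e.g. the script was not run from the repository root.
        return []
    return sorted(
        p for p in root.rglob("*")
        if p.is_file() and p.suffix.lower() in DATA_EXTENSIONS
    )


if __name__ == "__main__":
    for path in list_datasets():
        print(path)
```

Each listed file could then be opened with a tool of your choice, for example `pandas.read_csv` for csv files or `pandas.read_excel` for xlsx files, if pandas is installed.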
You can either make changes to the analysis pipeline, or simply reproduce the pipeline.
You need to be a collaborator on this GitHub repo. Ask Chiara or Alex to add you.
You may need to download GitHub Desktop and follow this tutorial to clone the current repository. You only need to do this once.
From now on, you can make changes to the code locally on your machine, and they will be tracked automatically. To push your changes to the remote repository on GitHub, click Push origin. The next time you launch the binder, these changes will show up.
To get changes others have made, click Pull origin.
Author: Chiara Semenzin, edits by Alejandrina Cristia