layout |
---|
default |
The Universal Morphology (UniMorph) project is a collaborative effort to improve how NLP handles complex morphology in the world's languages. The goal of UniMorph is to annotate morphological data in a universal schema that allows an inflected word from any language to be defined by its lexical meaning, typically carried by the lemma, and by a rendering of its inflectional form in terms of a bundle of morphological features from our schema. The specification of the schema is described [here]({{ site.baseurl }}/schema) and in [Sylak-Glassman (2016)]({{ site.baseurl }}/doc/unimorph-schema.pdf).
Plus, we're now available in a Python package! pip install unimorph
- SIGMORPHON 2022 Shared Task
- SIGMORPHON 2021 Shared Task
- SIGMORPHON 2020 Shared Task
- SIGMORPHON 2019 Shared Task
- CoNLL–SIGMORPHON 2018 Shared Task
- CoNLL–SIGMORPHON 2017 Shared Task
- SIGMORPHON 2016 Shared Task
The following {{ site.data.languages | size }} languages have been annotated according to the UniMorph schema. Missing parts of speech will be filled in soon.
{% assign sorted_langs = site.data.languages | sort:"name" %}
{% for language in sorted_langs %}
<tr>
<!--td>{% if language.flag %}<span class="flagspan"><img class="flag" src="{{ site.baseurl }}/images/flags/{{ language.flag }}"/></span>{% endif %}</td-->
<td> ↪{{ language.name }}</td>
<td style="font-family: monospace">{{ language.iso }}</td>
<td style="text-align:right">{{ language.forms }}</td>
<td style="text-align:right">{{ language.paradigms }}</td>
<td style="text-align:right">{% if language.nouns %}✔{% endif %}</td>
<td style="text-align:right">{% if language.verbs %}✔{% endif %}</td>
<td style="text-align:right">{% if language.adjectives %}✔{% endif %}</td>
<td>{% case language.source %}
{% when 'ling' %}
<span><a rel="source" href="">ℒ</a></span>
{% when 'surrey' %}
<a rel="source" href="https://oto-manguean.surrey.ac.uk/">Surrey</a>
{% when 'vepkar' %}
<a rel="source" href="http://dictorpus.krc.karelia.ru/">VepKar</a>
{% when nil %}
<span><img class="source" src="{{ site.baseurl }}/images/wiki.png"/></span>
{% endcase %}
</td>
<td style="text-align:right">{% case language.license %}
{% when 'unknown' %}
–
{% when 'lgpllr' %}
<a rel="license" href="https://spdx.org/licenses/LGPLLR.html">LGPLLR</a>
{% when nil %}
<a rel="license" href="http://creativecommons.org/licenses/by-sa/3.0/">
<img class="check" alt="Creative Commons License" src="https://i.creativecommons.org/l/by-sa/3.0/88x31.png" />
</a>
{% endcase %}</td>
<td><div class="arrow"></div></td>
</tr>
<tr>
<td colspan="11">
<div class="detail-box">
<div class="mylist">
<ul class="unstyled">
<li>
<span class="detail1">
<b>Download Data:</b>
</span>
<a href="https://github.com/unimorph/{{ language.iso }}">repo</a>
</li>
<li><span class="detail1"><b>Report Errors:</b></span> <a href="https://github.com/unimorph/{{ language.iso }}/issues">issues</a></li>
</ul>
</div>
<div class="mylist">
<ul class="unstyled">
<li><span class="detail4"><b>2016 Shared Task Splits:</b></span> {% if language.splits.sixteen %}<a href=" https://github.com/ryancotterell/sigmorphon2016/">yes</a>{% else %}no{% endif %}</li>
<li><span class="detail4"><b>2017 Shared Task Splits:</b></span> {% if language.splits.seventeen %}<a href="https://github.com/sigmorphon/conll2017">yes</a>{% else %}no{% endif %}</li>
</ul>
</div>
<div class="mylist">
<ul class="unstyled">
<li><span class="detail2"><b>Typology:</b></span> {{ language.typology }}</li>
<li><span class="detail2"><b>Templatic:</b></span> {{ language.templatic }}</li>
</ul>
</div>
<div class="mylist">
<ul class="unstyled">
<li><span class="detail3"><b>Info:</b></span> <a href="https://en.wikipedia.org/wiki/{{ language.name }}_language">wikipedia</a></li>
<li><span class="detail3"><b>Type:</b></span> {{ language.type }}</li>
</ul>
</div>
</div>
</td>
</tr>
{% endfor %}
Language | ISO 639-3 | Forms | Paradigms | Nouns | Verbs | Adjectives | Source | License |
---|