Skip to content

Latest commit

 

History

History
196 lines (169 loc) · 7.9 KB

index.md

File metadata and controls

196 lines (169 loc) · 7.9 KB
layout
default
<script src="js/jquery.js"></script> <script type="text/javascript"> $(document).ready(function(){ $("#annotated tr:odd").addClass("odd"); $("#annotated tr:not(.odd)").hide(); $("#annotated tr:first-child").show(); $("#annotated tr.odd").click(function(){ $(this).next("tr").toggle(); $(this).find(".arrow").toggleClass("up"); }); }); $(document).ready(function(){ $("#coming tr:odd").addClass("odd"); $("#coming tr:not(.odd)").hide(); $("#coming tr:first-child").show(); $("#coming tr.odd").click(function(){ $(this).next("tr").toggle(); $(this).find(".arrow").toggleClass("up"); }); }); </script>

UniMorph

The Universal Morphology (UniMorph) project is a collaborative effort to improve how NLP handles complex morphology in the world's languages. The goal of UniMorph is to annotate morphological data in a universal schema that allows an inflected word from any language to be defined by its lexical meaning, typically carried by the lemma, and by a rendering of its inflectional form in terms of a bundle of morphological features from our schema. The specification of the schema is described [here]({{ site.baseurl }}/schema) and in [Sylak-Glassman (2016)]({{ site.baseurl }}/doc/unimorph-schema.pdf).

Plus, we're now available in a Python package! pip install unimorph

UniMorph Events

Annotated Languages

The following {{ site.data.languages | size }} languages have been annotated according to the UniMorph schema. Missing parts of speech will be filled in soon.

{% assign sorted_langs = site.data.languages | sort:"name" %}

{% for language in sorted_langs %}
<tr>
<!--td>{% if language.flag %}<span class="flagspan"><img class="flag"     src="{{ site.baseurl }}/images/flags/{{ language.flag }}"/></span>{% endif %}</td-->
<td>&nbsp;&#8618;{{ language.name }}</td>
<td style="font-family: monospace">{{ language.iso }}</td>
<td style="text-align:right">{{ language.forms }}</td>
<td style="text-align:right">{{ language.paradigms }}</td>
<td style="text-align:right">{% if language.nouns %}&#x2714;{% endif %}</td>
<td style="text-align:right">{% if language.verbs %}&#x2714;{% endif %}</td>
<td style="text-align:right">{% if language.adjectives %}&#x2714;{% endif %}</td>
<td>{% case language.source %}
{% when 'ling' %}
 <span><a rel="source" href="">&#8466;</a></span>
{% when 'surrey' %}
 <a rel="source" href="https://oto-manguean.surrey.ac.uk/">Surrey</a>
{% when 'vepkar' %}
<a rel="source" href="http://dictorpus.krc.karelia.ru/">VepKar</a>	
   {% when nil %}
<span><img class="source" src="{{ site.baseurl }}/images/wiki.png"/></span>
{% endcase %}
  </td>
<td style="text-align:right">{% case language.license %}
  {% when 'unknown' %}
  	–
  {% when 'lgpllr' %}
   <a rel="license" href="https://spdx.org/licenses/LGPLLR.html">LGPLLR</a>
  {% when nil %}
   <a rel="license" href="http://creativecommons.org/licenses/by-sa/3.0/">
   	<img class="check" alt="Creative Commons License"  src="https://i.creativecommons.org/l/by-sa/3.0/88x31.png" />
   </a>
{% endcase %}</td>
<td><div class="arrow"></div></td>
</tr>
<tr>
<td colspan="11">
<div class="detail-box">
<div class="mylist">
<ul class="unstyled">
<li>
<span class="detail1">
<b>Download Data:</b>
</span>
<a href="https://github.com/unimorph/{{ language.iso }}">repo</a>
</li>
<li><span class="detail1"><b>Report Errors:</b></span> <a href="https://github.com/unimorph/{{ language.iso }}/issues">issues</a></li>
</ul>
</div>
<div class="mylist">
<ul class="unstyled">
<li><span class="detail4"><b>2016 Shared Task Splits:</b></span> {% if language.splits.sixteen %}<a href=" https://github.com/ryancotterell/sigmorphon2016/">yes</a>{% else %}no{% endif %}</li>
<li><span class="detail4"><b>2017 Shared Task Splits:</b></span> {% if language.splits.seventeen %}<a href="https://github.com/sigmorphon/conll2017">yes</a>{% else %}no{% endif %}</li>  
</ul>
</div>
<div class="mylist">
<ul class="unstyled">
<li><span class="detail2"><b>Typology:</b></span> {{ language.typology }}</li>
<li><span class="detail2"><b>Templatic:</b></span> {{ language.templatic }}</li>
</ul>
</div>
<div class="mylist">
<ul class="unstyled">
<li><span class="detail3"><b>Info:</b></span> <a href="https://en.wikipedia.org/wiki/{{ language.name }}_language">wikipedia</a></li>
<li><span class="detail3"><b>Type:</b></span> {{ language.type }}</li>  
</ul>
</div>
</div>
</td>
</tr>
{% endfor %}
Language ISO 639-3 Forms Paradigms Nouns Verbs Adjectives Source License
<script src="https://code.jquery.com/jquery-3.1.1.slim.min.js" integrity="sha384-A7FZj7v+d/sdmMqp/nOQwliLvUsJfDHW+k9Omg/a/EheAdgtzNs3hpfag6Ed950n" crossorigin="anonymous"></script> <script src="https://cdnjs.cloudflare.com/ajax/libs/tether/1.4.0/js/tether.min.js" integrity="sha384-DztdAPBWPRXSA/3eYEEUWrWCy7G5KFbe8fFjk5JAIxUYHKkDx6Qin1DkWx51bBrb" crossorigin="anonymous"></script> <script src="https://maxcdn.bootstrapcdn.com/bootstrap/4.0.0-alpha.6/js/bootstrap.min.js" integrity="sha384-vBWWzlZJ8ea9aCX4pEW3rVHjgjt7zpkNpZk+02D9phzyeVkE+jo0ieGizqPLForn" crossorigin="anonymous"></script>