This is an Apertium monolingual language package for Armenian. What you can use this language package for:
- Morphological analysis of Armenian
- Morphological generation of Armenian
- Part-of-speech tagging of Armenian
You will need the following software installed:
- lttoolbox (>= 3.5.1)
- apertium (>= 3.6.1)
- vislcg3 (>= 1.3.1)
- hfst (>= 3.15.1)
If this does not make any sense, we recommend you look at: apertium.org.
Given the requirements being installed, you should be able to just run:
$ ./configure
$ make
You can use ./autogen.sh
instead of ./configure
if you're compiling
from source.
If you're doing development, you don't have to install the data, you can use it directly from this directory.
If you are installing this language package as a prerequisite for an Apertium translation pair, then do (typically as root / with sudo):
$ make install
You can use a --prefix
with ./configure
to install as a non-root user,
but make sure to use the same prefix when installing the translation
pair and any other language packages.
If you are in the source directory after running make, the following commands should work based on the dialect that you use.
To run the analyzer on a corpus, use scripts and commands from hyw-corpus.
For Western Armenian, use the following commands.
- Morphological analysis:
echo "ապակիներէն ուրախացայ։" | apertium -d . -f none hyx_hyw-morph
^ապակիներէն/ապակի<n><pl><abl><def>$
^ուրախացայ/ուրախանալ<v><iv><inch><past><pret><p1><sg><indc>$
^։/։<sent>$
- Tagging (analysis + disambiguation):
% echo "ապակիներէն ուրախացայ։" | apertium -d . -f none hyx_hyw-morph
^ապակիներէն/ապակի<n><pl><abl><def>$
^ուրախացայ/ուրախանալ<v><iv><inch><past><pret><p1><sg><indc>$
^։/։<sent>$
- Morphological generation:
% echo "^ապակի<n><pl><abl><def>$ ^ուրախանալ<v><iv><inch><past><pret><p1><sg><indc>$" | apertium -d . -f none hyx_hyw-gener
ապակիներէն ուրախացայ
For Eastern Armenian, use the following commands.
- Morphological analysis:
echo "ապակիներից ուրախացա" | apertium -d . -f none hyx_hye-morph
^ապակիներից/ապակի<n><pl><abl>$
^ուրախացա/ուրախանալ<v><iv><inch><past><pret><p1><sg><indc>$
^։/։<sent>$
- Tagging (analysis + disambiguation):
% echo "ապակիներից ուրախացա։" | apertium -d . -f none hyx_hye-tagger
^ապակիներից/ապակի<n><pl><abl>$
^ուրախացա/ուրախանալ<v><iv><inch><past><pret><p1><sg><indc>$
^։/։<sent>$
- Morphological generation:
% echo "^ապակի<n><pl><abl><def>$ ^ուրախանալ<v><iv><inch><past><pret><p1><sg><indc>$" | apertium -d . -f none hyx_hye-gener
ապակիներէն ուրախացա
lexd_components
- Components of morphotactic dictionaryapertium-hyx.hyx.twol
- Morphophonological rulesapertium-hyx.hyx.rlx
- Constraint Grammar disambiguation rulesapertium-hyx.post-hyx.dix
- Post-generatormodes.xml
- Translation modes
When referencing this work in an academic publication, we ask that you cite the following source:
- Dolatian, Hossep, Daniel Swanson, and Jonathan Washington (2022) A Free/Open-Source Morphological Transducer for Western Armenian. In Proceedings of The Workshop on Processing Language Variation: Digital Armenian (DigitAm) within the 13th Language Resources and Evaluation Conference. Slides and paper.
- https://wiki.apertium.org/wiki/Installation
- https://wiki.apertium.org/wiki/apertium-hyx
- https://wiki.apertium.org/wiki/Using_an_lttoolbox_dictionary
If you need help using this language pair or data, you can contact:
- Mailing list: [email protected]
- IRC:
#apertium
on irc.oftc.net (irc://irc.oftc.net/#apertium)
See also the file AUTHORS
, included in this distribution.