Skip to content

languagetool-org/french-pos-dict

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ff11b57 · Oct 21, 2024

History

48 Commits
Oct 21, 2024
Mar 25, 2024
Oct 21, 2024
Mar 26, 2024
Mar 25, 2024
Oct 21, 2024
Mar 21, 2022
Mar 26, 2024
Mar 25, 2024
Jul 2, 2020
Mar 26, 2024
Apr 12, 2024
Sep 24, 2021
Mar 25, 2024
Sep 24, 2021

Repository files navigation

french-pos-dict

A French part-of-speech dictionary that can be used from Java. This repo contains no Java code, but it contains Morfologik binary files to look up part-of-speech data.

As a developer, consider using LanguageTool instead of this. If you really want to use this directly, though, do it at your own peril.

Also use LanguageTool to export the data in these dictionaries, as documented here.

Workflow

If you simply intend on making changes to the dictionary data (i.e. adding/removing words, changing part-of-speech tags, etc.), you can follow this workflow:

  1. clone this repo and make changes to the dictionary files in ./data, or update the LT files in lt-changes;
  2. push to this repo;
  3. the GitHub Actions workflows will trigger, and build the dictionary files, as well as test them against current LT;
  4. if all tests pass, push a new tag — this will release the new dictionary binaries to SonaType;
  5. update the french-pos-dict version in LT's pom.xml to the new one.

Development

If you want to build the dictionaries locally, you will need to set up your environment as per the instructions of the dict_tools submodule. Once you do, simply running ./dict_tools/scripts/build_tagger_dicts.py (with the appropriate arguments) will build the dictionaries.

The French-specific scripts are legacy, and we do not recommend modifying them extensively. If you intend on working on them, please consider instead rewriting them in a more future-proof language, like Python.