Skip to content

An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic annotation finding application in Natural Language Processing (NLP). This library was formerly part of PyNLPl.

License

Notifications You must be signed in to change notification settings

proycon/foliapy

Repository files navigation

FoLiA Library for Python

https://github.com/proycon/foliapy/actions/workflows/foliapy.yml/badge.svg?branch=master Documentation Status http://applejack.science.ru.nl/lamabadge.php/foliapy Project Status: Active – The project has reached a stable, usable state and is being actively developed. Latest release in the Python Package Index

This Python module provides an extensive library for parsing, creating and otherwise processing documents in the Format for Linguistic Annotation, aka FoLiA. It has been in active development since 2010 and used by numerous Natural Language Processing (NLP) tools.

This library used to be part of PyNLPL (pynlpl.formats.folia), but has been migrated to this standalone library in spring 2019.

Acknowledgement

FoLiA development is funded in the scope of the larger CLARIN-NL project and its successor CLARIAH.

The following modules are available:

  • folia.main - The main library
  • folia.setdefinition - A module for FoLiA Set Definitions
  • folia.fql - Support for the FoLiA Query Language (FQL)

Installation

Download and install the latest stable version directly from the Python Package Index with pip install folia (or pip3 for Python 3.7+ on most systems). For global installations prepend sudo.

Alternatively, clone this repository and run python setup.py install (or python3 setup.py install for Python 3 on most system. Prepend sudo for global installations.

The Python FoLiA library is also included in our LaMachine distribution.

Documentation

API Documentation and tutorials can be found here .

About

An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic annotation finding application in Natural Language Processing (NLP). This library was formerly part of PyNLPl.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published