Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ingesting CONNL-U: besides POS, also XPOS extracted #190

Closed
pirolen opened this issue Jul 1, 2024 · 2 comments
Closed

Ingesting CONNL-U: besides POS, also XPOS extracted #190

pirolen opened this issue Jul 1, 2024 · 2 comments
Assignees
Labels

Comments

@pirolen
Copy link

pirolen commented Jul 1, 2024

Hi, upon ingesting CONNL-U in FLAT from this medieval Slavic UD treebank repo, the converter seems to extract not only the POS column values (from column 4), but also the XPOS (from column 5).

The latter values do not get an annotation scheme assigned by FLAT (understandably).
FLAT does issue a warning about "one or more set definitions" missing.

And then in the GUI, when selecting for Annotation Focus the (correct UD) POS tagset visualization, no Legend gets rendered. Upon selecting the XPOS tagset, its values get rendered well.

Attached is the document in which we encountered this; both as the original input in CONLL-U and also in the converted FoLiA. (In the latter btw provenance info got harder to trace back to specific users, since the reverse proxy name gets filled in as user.)

Many thanks for looking into this!

aninaswonderworker.folia.xml.txt
aninaswonderworker.conllu.txt

@proycon proycon self-assigned this Jul 4, 2024
@proycon proycon added the bug label Jul 5, 2024
@proycon
Copy link
Owner

proycon commented Jul 5, 2024

FLAT does issue a warning about "one or more set definitions" missing.

Right, this warning is about https://raw.githubusercontent.com/proycon/folia/master/setdefinitions/universal-dependencies.foliaset.ttl (which was never defined), not https://raw.githubusercontent.com/proycon/folia/master/setdefinitions/universal-pos.foliaset.ttl . So you can just ignore it if you don't care for the dependency relations.

And then in the GUI, when selecting for Annotation Focus the (correct UD) POS tagset visualization, no Legend gets rendered.

I can reproduce it indeed, that looks like a bug to me, the set is there so I don't see why it can't be visualised. I'll look into it.

@proycon
Copy link
Owner

proycon commented Jul 5, 2024

This should now be fixed in v0.11.5 (docker container also published), the legend and class colours appear again:

screenshot20240705132841

@proycon proycon closed this as completed Jul 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants