-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Lemma me vs I #2
Comments
I'm look at forking this dataset. Are you thinking
In fact, this dataset only seems to have the 4th type. Perhaps it should be expanded to include a few instances of the others as well. |
Yes, that's how I would lemmatize those as well. |
I think this issue can be closed, although perhaps you want to check my work first @amir-zeldes |
Thanks @AngledLuffa - I guess my only comment on this is that on closer inspection, the FEAT |
Yeah looks like it should be |
This should be now fixed (#5). |
Hi and thanks for contributing this interesting dataset - I was wondering why the lemma of 'mine' is 'me' and not 'I'. The lemma of 'hers' seems to be 'she', so wouldn't we expect 'I' as the first person lemma?
Another thing to consider is cross-treebank comparability: in other UD English corpora (GUM, EWT) the lemma of the attributive possessive pronouns is the attributive possessive itself, for the substitutive possessive also itself, and for the other forms it's the subject form, so we get:
If there's no special reason to change these, maybe the lemmas could be changed to match the other corpora? See also UniversalDependencies/docs#517
The text was updated successfully, but these errors were encountered: