Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Lemma me vs I #2

Open
amir-zeldes opened this issue Oct 24, 2019 · 6 comments
Open

Lemma me vs I #2

amir-zeldes opened this issue Oct 24, 2019 · 6 comments

Comments

@amir-zeldes
Copy link

Hi and thanks for contributing this interesting dataset - I was wondering why the lemma of 'mine' is 'me' and not 'I'. The lemma of 'hers' seems to be 'she', so wouldn't we expect 'I' as the first person lemma?

Another thing to consider is cross-treebank comparability: in other UD English corpora (GUM, EWT) the lemma of the attributive possessive pronouns is the attributive possessive itself, for the substitutive possessive also itself, and for the other forms it's the subject form, so we get:

  • I -> I
  • me -> I
  • my -> my
  • mine -> mine

If there's no special reason to change these, maybe the lemmas could be changed to match the other corpora? See also UniversalDependencies/docs#517

@AngledLuffa
Copy link
Contributor

I'm look at forking this dataset. Are you thinking

She wants to drive -> she
Give her the keys -> she
This is her car -> her
It is hers -> hers

In fact, this dataset only seems to have the 4th type. Perhaps it should be expanded to include a few instances of the others as well.

@amir-zeldes
Copy link
Author

Yes, that's how I would lemmatize those as well.

@AngledLuffa
Copy link
Contributor

I think this issue can be closed, although perhaps you want to check my work first @amir-zeldes

@amir-zeldes
Copy link
Author

Thanks @AngledLuffa - I guess my only comment on this is that on closer inspection, the FEAT Case=Gen is kind of strange for the substitutive possessive, since in sentence context it is actually a matrix argument. So in "Mine is nice" I would have said "Mine" is nominative, and the xpos PRP (rather than PRP$) suggests this as well IMO.

@nschneid
Copy link

Yeah looks like it should be Poss=Yes: https://universaldependencies.org/u/feat/Poss.html

@dan-zeman
Copy link
Member

..., the FEAT Case=Gen is kind of strange for the substitutive possessive, ...

This should be now fixed (#5).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants