-
Notifications
You must be signed in to change notification settings - Fork 42
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Inconsistent decisions about SCONJ/ADV and mark/advmod for subordinate-clause-introducers #88
Comments
When is the trickier of the two and I can imagine that it will be more However, where is adverb, and in the above example it is even coreferential with the modified nominal in Sudan. |
@dan-zeman If these can be |
When they are |
@dan-zeman Should that be added to the |
Relatedly, there are instances of instead/ |
I can confirm that the test currently specifically targets |
My knee-jerk reaction to this is to say that distinguishing temporal/conditional 'when' in the syntax tree is unrealistic, and I think that's probably the right thing to say. But paradoxically, this is actually something we could probably distinguish for GUM automatically because we have complete discourse parses distinguishing a relation But since that's not feasible to use for the other English corpora, I'd say we should just make the interrogative ones be |
@amir-zeldes : But if some occurrences of when are |
Yes, the tagging and deprel issue are bundled, I just meant that distinguishing the two cases in EWT is non-trivial so probably not a feasible edit. If anyone does want to try, I'm happy to point out training resources for
or as a query in ANNIS, something like this: |
The ADV/SCONJ split is now implemented in GUM V6.1.0 and live in the UD dev branch, incl. in UD_English-GUMReddit, and in GU ANNIS. I can run the same depedit script on EWT, but I'm not sure if someone is editing the EWT dev branch concurrently. If someone wants me to do this I can make a PR, just let me know. |
@amir-zeldes I pushed some EWT fixes last night but am done now, so have at it! |
* Fixes #88 * Implemented using the following DepEdit script: ``` #no special adverbial status for WH adverb subordinations lemma=/^(when|how|where|while|why|whenever|wherever)$/&func=/mark/&head=/(.*)/&pos=/ADV/ none #1:pos=SCONJ lemma=/^(when|how|where|while|why|whenever|wherever)$/&func=/advmod/&head=/(.*)/ none #1:func=mark;#1:pos=SCONJ;#1:head2=$2:mark #exception for WH adverbs in questions, identified by question mark and not being an advcl func!=/advcl/;lemma=/^(when|how|where|while|why|whenever|wherever)$/&pos=/SCONJ/&func=/mark/&head=/(.*)/;text=/^(\?+)!?$/ #1>#2;#2.*#3 #2:func=advmod;#2:pos=ADV;#2:head2=$2:advmod #exception for 'why not' func=/root/;lemma=/why/&func=/mark/&head=/(.*)/;lemma=/not/\t#1>#2;#2.#3\t#2:func=advmod;#2:pos=ADV;#2:head2=$1:advmod #exception for do support func=/root/;lemma=/^(why|how|when|where)$/&func=/mark/&head=/(.*)/;lemma=/do/\t#1>#2;#2.#3\t#2:func=advmod;#2:pos=ADV;#2:head2=$2:advmod ```
@dan-zeman Should |
I would think that if they have |
I don't think SCONJ can't have pronoun features (in some languages, even words like 'if' inflect to agree with subordinate clause subjects), but in English I think it's neither:
So I would say no PronType, though I don't feel strongly about it. |
I have some doubts about the analysis of I went to France when de Gaulle was president. Here I think that when refers to certain time. You could also say I went to France in the time when de Gaulle was president. and then the time would be mentioned in the matrix clause. Seems elliptical, parallel to what in I give you what you need. vs. I give you the thing that/which you need. For me, an example where when is |
@dan-zeman Are you saying that all temporal clauses are inherently relative because matrix clauses happen at some time? |
@amir-zeldes I'm not sure I want to call them relative but I find it natural to consider the wh- thing there to be a relative adverb. On the other hand, it is difficult to judge for me as a non-native speaker. The situation in English is complicated by the fact that the same word, when, is used both in temporal and in conditional clauses (and in any grayzone cases between the two). If it were Czech, there would be two possible translations of when: kdy vs. když. The former is a temporal adverb, interrogative or relative, and cannot be used in conditional clauses. The latter is a subordinating conjunction and can be used in conditional as well as temporal clauses; I would not tag it as a relative adverb in the temporal ones. This sounds like I'm contradicting myself, but the difference is that in Czech I know that one of the words is |
The analysis of when-clause as adverbial headless relative clauses has already been proposed. In French, the equivalent of when, quand, is also a qu-words as the other relative pronouns. But the frontier between adverbial wh-words and SCONJ is very subtle and I am not sure it is possible to draw a line. In a sense, when is both an adverbial wh-word and an SCONJ, because adverbial wh-words are particular cases of SCONJ. |
I agree that the line is hard to draw, but in many cases you can coordinate when with subjunctions, which may be a reason to analyse it as such: I went to France when, or just before, de Gaulle became president |
Would substitutability of "whenever" be a test for conditional vs. temporal?
This distinction potentially interacts with tense/aspect; not sure if that is what we want. |
I think we have two separate questions here: should it be For the first, I tend to reject Rel, because I think the 'elliptical relative' argument could be applied to other cases we don't want to include, whereas normal relatives don't allow this ellipsis. For example, if we agree that 'if' is
In this example, the word 'condition' functions like 'time' in the other example, but I still think its correct analysis is as
So at least for English, I would say the two cases are different, and that 'when' is more similar to 'if' than it is to 'that' or 'which'. Then about the ability to distinguish POS tags (in English), I would say we should use ADV in main clause interrogatives, or reported interrogatives inside
For the advcl case, I think we can use SCONJ:
Also note that the POS tag can actually disambiguate two constructions:
But if people don't agree on this, I'd much prefer to have only ADV than only SCONJ, since in a main clause SCONJ seems really wrong to me. |
Fixed in #90 |
Reopening this because the current policy produces odd results. First, interrogative determiners are always WDT and DET, not SCONJ. Why should interrogative adverbs be different? The first result from the linked query illustrates this nicely, suggesting the bracketing:
Whereas the structure I would assign is:
Second, "how" can modify an adjective * Moreover, in this construction "how" is the interrogative equivalent of "as", which is ADV/advmod: "I have [[as good] a horse]." |
…y with other similar uses of "how", pending discussion in #88
Agreed, the first "opinions" analysis is wrong for me, since I think it's a free relative. It should be: nmod(opinions, how) And then it's clear that "how" is not |
Actually after reading up on free relatives I don't think it is one, I think it's an interrogative content clause. But this is a complicated discussion so let's do that in a meeting. |
Sure, happy to talk more about it when we meet - FWIW I think a content clause would be acl (the opinion that...), whereas this one conflates two functions, like a "what" free relative (it's short for a separaable "opinions about the way/nmod, how..."). So we prefer the matrix function, and tag "how" as nmod IMO. |
Whew, so today we decided to dispense with SCONJ/mark and always use ADV/advmod for "where", "when", "why", and "how". The best argument in my mind is a sentence like
(interrogative complement clause with predicate "has")—mark(much, how/SCONJ) would be decidedly weird, because the SCONJ usually marks the predicate itself. This brings WH words in complement clauses in line with main clauses (and relative clauses). Implementing this change should be pretty straightforward since we are removing a putative distinction, not adding one. |
Actually we had kept "how/advmod ADJ" (how much etc.) in GUM even after the shift to mark for the independent WH subordinators, so that doesn't need to change. I do wonder whether we have thought this through though: do we have an exhaustive list of the ones that should be advmod? Like, how about "once" or "while"? |
I think we only were talking about WH adverbs. "While" makes sense as SCONJ because it always introduces a clause, right? "Once" can be an adverb (unlike "if", "while", "that") but I'm not sure whether that means we shouldn't call it SCONJ where it introduces a clause. |
If "once" is SCONJ when introducing a clause (i.e. stands where 'if' can stand) then I don't really understand why we would want "when" to be advmod... Not that I feel really strongly about any of this (it's automatable and therefore doesn't mean much), but it feels arbitrary to me. If I'm being honest, I don't think there is a syntactic difference between "if we go" and "when we go". |
Well, I don't think the ADV vs. SCONJ dichotomy for the words we're discussing is a perfect fit for English, but we're stuck with it. :) I could go either way on "once". I just want all the WH-words to be ADV so that "I wonder how you feel", "I wonder how much money you make", "I wonder where you live", "I wonder when it ends", etc. all end up the same, and match the main clause versions of those subordinate clauses ("How do you feel?", "How much money do you make?", "Where do you live?", "When does it end?"). If we wanted to be very particular we could make an argument that "once" has been grammaticalized from an adverb to a subordinator, so it should be ADV/mark in these constructions. But we haven't gone down that road (yet) for prepositions functioning as subordinators. For now probably best to say "once" is SCONJ/mark. |
OK, I can live with that |
Originally posted by @amir-zeldes in #346 (comment) In at least some cases, yes. how, when, Xever, ... |
From above I think we decided "while" is SCONJ. |
Mm, so 'while' stays SCONJ/mark (in all contexts), and when|where|whither|whence|why are ADV/advmod? Is there ever a context in which 'while' is ADV/advmod or one of the others above is SCONJ/mark? |
Not that I know of? They are all WH-words except for "while". |
OK, I can make this happen in GUM, but looking at EWT dev it's currently not this way. Is this change pending in EWT? |
Yes |
* implementation of UniversalDependencies/UD_English-EWT#88 in GUM
Especially with "when" and "where":
Should all of these be
SCONJ
andmark
in UDv2?SCONJ
guidelines mention "when" introducing a clause.Note that in GUM, "where" and "when" are always tagged
SCONJ
, and nearly always attach to a subordinate clause asmark
or head a relative clause.The text was updated successfully, but these errors were encountered: