Evaluation treats multiple categories too leniently #91

danielhers · 2020-05-15T13:36:01Z

Evaluation is span-based: a predicted span counts as correct whenever its set of categories has a non-empty intersection with the gold categories for that span. This is too lenient, because a parser can predict many unary edges or multi-category edges without being penalized for the extra ones: https://github.com/danielhers/ucca/blob/master/ucca/evaluation.py#L102
@omriabnd @nschneid
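
To make the leniency concrete, here is a minimal sketch. It is not the code in `ucca/evaluation.py`; the names `lenient_counts`, `strict_counts` and `f1`, and the representation of an annotation as a dict from span to category set, are assumptions made purely for illustration. It compares the intersection-based match with one possible stricter alternative that scores every (span, category) pair separately.

```python
from typing import Dict, FrozenSet, Tuple

Span = Tuple[int, int]                 # (start, end) token offsets
Labeled = Dict[Span, FrozenSet[str]]   # span -> set of categories on it


def lenient_counts(pred: Labeled, gold: Labeled) -> Tuple[int, int, int]:
    """A span matches if its predicted and gold category sets intersect."""
    matches = sum(1 for span, cats in pred.items()
                  if span in gold and cats & gold[span])
    return matches, len(pred), len(gold)


def strict_counts(pred: Labeled, gold: Labeled) -> Tuple[int, int, int]:
    """Hypothetical stricter variant: score each (span, category) pair."""
    pred_pairs = {(s, c) for s, cats in pred.items() for c in cats}
    gold_pairs = {(s, c) for s, cats in gold.items() for c in cats}
    return len(pred_pairs & gold_pairs), len(pred_pairs), len(gold_pairs)


def f1(matches: int, n_pred: int, n_gold: int) -> float:
    precision = matches / n_pred if n_pred else 0.0
    recall = matches / n_gold if n_gold else 0.0
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


gold = {(0, 1): frozenset({"P"}), (1, 3): frozenset({"D"})}
# The prediction piles extra categories onto each span.
pred = {(0, 1): frozenset({"P", "S", "A"}), (1, 3): frozenset({"D", "C"})}

print(f1(*lenient_counts(pred, gold)))           # 1.0
print(round(f1(*strict_counts(pred, gold)), 3))  # 0.571
```

Under the intersection rule the over-generating prediction gets a perfect score; under the pair-based count its precision drops to 2/5.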

nschneid · 2020-05-17T02:24:35Z

One subtlety is that, because F nodes are moved under the root, we are left with superfluous C nodes:

[F The] [H [P [C service] ] ... [D poor] [U ...] ] [F is]

Should they be removed? I.e.:

[F The] [H [P service ] ... [D poor] [U ...] ] [F is]

Scoring P and C separately here (in an edge-based evaluation) would seem inconsistent with the notion of ignoring where F attaches.

danielhers · 2020-05-18T12:22:05Z

Yes, I think normalization (including C-flattening) should occur again after moving Fs.
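
A rough sketch of that order of operations, on a simplified version of the example above. It assumes a toy tree encoding as `(category, body)` tuples, where `body` is either a token string or a list of child nodes; `move_functions` and `flatten_centers` are hypothetical helpers, not functions from the ucca package, and re-attached Fs are simply appended after the other top-level units.

```python
from typing import List, Tuple, Union

# A node is (category, body); body is a token or a list of child nodes.
Node = Tuple[str, Union[str, List["Node"]]]


def move_functions(top_level: List[Node]) -> List[Node]:
    """Detach every F unit and re-attach it directly under the root."""
    moved: List[Node] = []

    def strip(node: Node) -> Node:
        label, body = node
        if isinstance(body, str):
            return node
        kept = []
        for child in body:
            if child[0] == "F":
                moved.append(child)
            else:
                kept.append(strip(child))
        return (label, kept)

    kept_top = []
    for node in top_level:
        if node[0] == "F":
            moved.append(node)
        else:
            kept_top.append(strip(node))
    return kept_top + moved


def flatten_centers(node: Node) -> Node:
    """Collapse a unit whose only remaining child is a C into that child."""
    label, body = node
    if isinstance(body, str):
        return node
    body = [flatten_centers(child) for child in body]
    if len(body) == 1 and body[0][0] == "C":
        return (label, body[0][1])
    return (label, body)


# Simplified: [H [P [F The] [C service]] [F is] [D poor]]
tree = [("H", [("P", [("F", "The"), ("C", "service")]),
               ("F", "is"),
               ("D", "poor")])]

after_move = move_functions(tree)                     # leaves [P [C service]]
normalized = [flatten_centers(n) for n in after_move]
print(normalized)
```

Running the C-flattening only before the Fs are moved would leave `[P [C service]]` in place, which is why the second pass is needed.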

nschneid · 2020-05-21T02:47:36Z

Should moving all Fs be part of normalization? For structures like [S [F the] [C xyz]] it would make it more transparent that xyz is evoking a scene.

nschneid · 2020-06-30T19:40:07Z

Also: the confusion matrix code should be made consistent with the F-score computation.

omriabnd mentioned this issue May 15, 2020

Action items for benchmarking UCCA UniversalConceptualCognitiveAnnotation/UniversalConceptualCognitiveAnnotation.github.io#1

Open

danielhers mentioned this issue May 22, 2020

Better handling of Functions in evaluation #94

Open
