
Rework ev. #2914

Merged: jaheba merged 9 commits into awslabs:dev from rework-ev on Jun 14, 2023
Conversation

jaheba (Contributor) commented on Jun 12, 2023:

Changes:

  • All metrics now have canonical names; for example, RMSE is now RMSE[mean], indicating that we compare against the mean prediction.
  • Renamed Evaluator to Metric, and Metric to MetricDefinition.
  • Added two classes, MetricCollection and MetricDefinitionCollection, to group the respective objects.
  • Added an evaluate function to run evaluations on metrics end to end.

Usage:

from gluonts.ev import nd, rmse, evaluate

evaluate(nd + rmse, data_batches, axis=1)
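
For illustration, a rough end-to-end sketch of the batch format (the keys "label", "mean", and "0.5" are assumptions made for this example, not the exact contract; metrics read whichever entries they declare as inputs):

import numpy as np

from gluonts.ev import nd, rmse, evaluate

# Hypothetical batches: each maps names to arrays of shape
# (num_series, num_timesteps). RMSE[mean] would read the "mean" entry,
# while ND compares against a point forecast such as the median ("0.5").
data_batches = [
    {
        "label": np.random.rand(8, 24),
        "mean": np.random.rand(8, 24),
        "0.5": np.random.rand(8, 24),
    }
    for _ in range(4)
]

# axis=1 aggregates over time, yielding one value per series.
result = evaluate(nd + rmse, data_batches, axis=1)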

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

Please tag this PR with at least one of these labels to make our release process faster: BREAKING, new feature, bug fix, other change, dev setup

lostella added the BREAKING label on Jun 12, 2023
lostella (Contributor) commented:

Do we need three "layers" for this: MetricDefinition, Metric, and Evaluator? In a sense, the Evaluator appears to be a collection of Metric objects, just like MetricCollection is a collection ("sum") of MetricDefinition objects, so I'm wondering if everything could be more compact.

jaheba (Contributor, Author) commented on Jun 13, 2023:

I think we need it. Otherwise we lose the flexibility that we wanted in the first place.
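
To make the layering concrete, here is a minimal sketch of the split being defended (only the MetricDefinition/Metric naming comes from this PR; the class names, the "label" key, and the bodies are made up for illustration):

from dataclasses import dataclass
from typing import Mapping, Optional

import numpy as np


@dataclass
class SumAbsErrorDefinition:
    """Stateless configuration: plays the role of a MetricDefinition."""

    forecast_type: str = "mean"

    def __call__(self, axis: Optional[int] = None) -> "SumAbsError":
        # Binding an axis turns the definition into a stateful Metric.
        return SumAbsError(forecast_type=self.forecast_type, axis=axis)


@dataclass
class SumAbsError:
    """Stateful accumulator: plays the role of a Metric."""

    forecast_type: str
    axis: Optional[int] = None
    total: float = 0.0

    def update(self, data: Mapping[str, np.ndarray]) -> None:
        # Axis handling is omitted here for brevity.
        self.total += float(np.abs(data["label"] - data[self.forecast_type]).sum())

    def get(self) -> float:
        return self.total

Keeping the stateless definition separate means the same configuration can be instantiated fresh for every evaluation run, which is the flexibility being referred to.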

Comment on a diff hunk:

    A derived metric updates multiple, simpler metrics independently and in
    the end combines their results as defined in `post_process`."""

    evaluators: Dict[str, Metric]
lostella (Contributor): Would metrics be a better name, to avoid confusion?
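
For orientation, a minimal sketch of the derived-metric pattern that the docstring describes (the dict is called metrics here, following the suggestion above; the bodies are illustrative, not the merged code):

from dataclasses import dataclass
from typing import Callable, Dict, Mapping

import numpy as np


@dataclass
class DerivedMetric:
    # Each contained metric accumulates on its own ...
    metrics: Dict[str, "Metric"]
    # ... and post_process combines their final values into one result.
    post_process: Callable[..., np.ndarray]

    def update(self, data: Mapping[str, np.ndarray]) -> None:
        for metric in self.metrics.values():
            metric.update(data)

    def get(self) -> np.ndarray:
        return self.post_process(
            **{name: metric.get() for name, metric in self.metrics.items()}
        )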

Comment on lines 54 to 56:

    def update_all(self, stream: Iterator[Mapping[str, np.ndarray]]) -> None:
        for element in stream:
            self.update(element)
lostella (Contributor): This could return self, maybe.

Comment on lines 112 to 115:

def evaluate(metrics, data_batches, axis=None):
    evaluator = metrics(axis)
    evaluator.update_all(data_batches)
    return evaluator.get()
lostella (Contributor): If update_all returned self, this would be metrics(axis).update_all(data_batches).get().
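
Spelled out, the suggested change would look roughly like this (illustrative, not the merged code):

    def update_all(self, stream: Iterator[Mapping[str, np.ndarray]]) -> "Metric":
        for element in stream:
            self.update(element)
        return self

# evaluate() then collapses into a single chained expression:
def evaluate(metrics, data_batches, axis=None):
    return metrics(axis).update_all(data_batches).get()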

Comment on lines 50 to 54:

    def update(self, data: Mapping[str, np.ndarray]) -> None:
        for metric in self.metrics:
            metric.update(data)

    def update_all(self, stream: Iterator[Mapping[str, np.ndarray]]) -> None:
lostella (Contributor): These methods could use a docstring, I guess.
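
For example (the wording here is illustrative, not from the PR):

    def update(self, data: Mapping[str, np.ndarray]) -> None:
        """Update every contained metric with a single batch of data."""
        for metric in self.metrics:
            metric.update(data)

    def update_all(self, stream: Iterator[Mapping[str, np.ndarray]]) -> None:
        """Consume a stream of batches, updating the metrics with each one."""
        for element in stream:
            self.update(element)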

Comment on a diff hunk:
@dataclass
class MetricCollection:
lostella (Contributor): Could this have Metric as a base class?
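
What that could look like (a hypothetical sketch; the merged code may differ):

@dataclass
class MetricCollection(Metric):
    metrics: List[Metric]

    def update(self, data: Mapping[str, np.ndarray]) -> None:
        # Sharing the Metric interface would let collections nest and be
        # passed anywhere a single metric is expected.
        for metric in self.metrics:
            metric.update(data)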

jaheba merged commit 1c383af into awslabs:dev on Jun 14, 2023 (20 of 21 checks passed).
jaheba deleted the rework-ev branch on Jun 14, 2023 at 13:50.