Fix Rotbaum to handle short series #3065

leica2023 · 2023-11-28T04:50:24Z

Issue #, if available:

Description of changes:
for items with less than forecast_horizon historical steps, jobs failed due to error 'Number of features of the model must match the input. Model n_features_ is 88 and input n_features is 60'. This PR fixes this error.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

Please tag this pr with at least one of these labels to make our release process faster: BREAKING, new feature, bug fix, other change, dev setup

lostella · 2023-11-28T11:39:58Z

Thanks @leica2023! I think it would make sense to add a small test that reproduces the issue, and that would fail before this fix.

More at a high level: in this PR, the make_features method is being fixed to add padding whenever target is too short. This method appears to only be invoked a prediction time: is this going to work as intended? Will the model be trained to understand how to treat those padded series?

lostella · 2023-11-28T11:40:14Z

Also fix the styling issue by applying black

lostella · 2023-11-28T11:17:34Z

src/gluonts/ext/rotbaum/_preprocess.py

@@ -454,7 +454,7 @@ def make_features(self, time_series: Dict, starting_index: int) -> List:
            prefix = [None] * abs(starting_index)
        else:
            prefix = []
-        time_series_window = time_series["target"][starting_index:end_index]
+        time_series_window = time_series["target"] if prefix else time_series["target"][starting_index:end_index]     


I would include this in the previous if else above

Sure, makes sense.

leica2023 · 2023-11-28T20:58:25Z

Thanks @leica2023! I think it would make sense to add a small test that reproduces the issue, and that would fail before this fix.

More at a high level: in this PR, the make_features method is being fixed to add padding whenever target is too short. This method appears to only be invoked a prediction time: is this going to work as intended? Will the model be trained to understand how to treat those padded series?

Correct. The issue happens at prediction time for instances that have history length shorter than context_length. This method is also used in training data preparation, but this part ensures ts is long enough to contribute to training samples.

lostella · 2023-11-30T13:21:28Z

@leica2023 need to also format the test scripts:

black src test

lostella · 2023-11-30T22:00:50Z

src/gluonts/nursery/spliced_binned_pareto/run_model_example.ipynb

@@ -45,7 +45,7 @@
    "import os\n",
    "import pprint\n",
    "import random\n",
-    "from scipy import stats \n",


@leica2023 could you revert the changes to notebooks? No need to run black on them

lostella · 2023-11-30T22:01:38Z

test/ext/rotbaum/test_rotbaum_smoke.py

+        freq=freq,
+    )
+
+    predictor = TreePredictor(


TreePredictor also needs importing?

Added this and reverted ipynb styling.

lostella · 2023-11-30T22:04:05Z

test/ext/rotbaum/test_rotbaum_smoke.py

@@ -59,3 +61,69 @@ def test_rotbaum_smoke(datasets):
    predictor = estimator.train(dataset_train)
    forecasts = list(predictor.predict(dataset_test))
    assert len(forecasts) == len(dataset_test)
+
+
+def test_short_history_item_pred():


@leica2023 would this test fail before the fix contained in the PR? (That is, would it fail if run on the current dev branch)

fixed bug for items with short than H history

e17f49c

lostella reviewed Nov 28, 2023

View reviewed changes

leica2023 added 2 commits November 28, 2023 15:21

added a test case

71257a6

reformat

3c06a8e

leica2023 added 3 commits November 30, 2023 08:54

src test styling

1903ff2

fixed test func

e4f2bdb

added import

d4a28c3

lostella reviewed Nov 30, 2023

View reviewed changes

lostella changed the title ~~fixed bug for items with short than H history~~ Fix Rotbaum to handle short series Nov 30, 2023

lostella added the bug fix (one of pr required labels) label Nov 30, 2023

leica2023 added 3 commits December 1, 2023 09:59

added import

0871ef7

fixed errors

d3ad48d

reformat

08f32c6

lostella added pending v0.13.x backport This contains a fix to be backported to the v0.13.x branch pending v0.14.x backport This contains a fix to be backported to the v0.14.x branch labels Dec 4, 2023

lostella mentioned this pull request Dec 5, 2023

Fix Rotbaum to handle short series #3073

Merged

lostella removed pending v0.13.x backport This contains a fix to be backported to the v0.13.x branch pending v0.14.x backport This contains a fix to be backported to the v0.14.x branch labels Dec 5, 2023

lostella closed this Dec 5, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Rotbaum to handle short series #3065

Fix Rotbaum to handle short series #3065

leica2023 commented Nov 28, 2023

lostella commented Nov 28, 2023 •

edited

Loading

lostella commented Nov 28, 2023

lostella Nov 28, 2023

leica2023 Nov 28, 2023

leica2023 commented Nov 28, 2023

lostella commented Nov 30, 2023

lostella Nov 30, 2023

lostella Nov 30, 2023

leica2023 Dec 1, 2023

lostella Nov 30, 2023

leica2023 Dec 1, 2023

Fix Rotbaum to handle short series #3065

Fix Rotbaum to handle short series #3065

Conversation

leica2023 commented Nov 28, 2023

lostella commented Nov 28, 2023 • edited Loading

lostella commented Nov 28, 2023

lostella Nov 28, 2023

Choose a reason for hiding this comment

leica2023 Nov 28, 2023

Choose a reason for hiding this comment

leica2023 commented Nov 28, 2023

lostella commented Nov 30, 2023

lostella Nov 30, 2023

Choose a reason for hiding this comment

lostella Nov 30, 2023

Choose a reason for hiding this comment

leica2023 Dec 1, 2023

Choose a reason for hiding this comment

lostella Nov 30, 2023

Choose a reason for hiding this comment

leica2023 Dec 1, 2023

Choose a reason for hiding this comment

lostella commented Nov 28, 2023 •

edited

Loading