
Why should the validation or testing dataset use the whole data? #3199

Open
arieffadhlan opened this issue Jun 20, 2024 · 0 comments

arieffadhlan commented Jun 20, 2024

Description

I'm still new to using the DeepAR model. I see that AWS SageMaker makes the following statement, and it looks like GluonTS implements the same approach: "You can create training and test datasets that satisfy this criteria by using the entire dataset (the full length of all time series that are available) as a test set and removing the last prediction_length points from each time series for training."

Why should the testing dataset use the whole data?

For example, suppose I have 1000 data points and want to predict the next 30 days. The training data uses points 1–970. Why does the testing data use points 1–1000?
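To make the split concrete, here is a minimal sketch of what that construction might look like with GluonTS's `ListDataset`; the start date, frequency, and variable names are illustrative assumptions, not values from my actual data:

```python
import numpy as np
from gluonts.dataset.common import ListDataset

prediction_length = 30          # forecast horizon (30 days)
target = np.random.rand(1000)   # hypothetical series of 1000 observations
start = "2021-01-01"            # hypothetical start timestamp
freq = "D"                      # daily frequency

# Training set: the series *without* the last prediction_length points (1-970).
train_ds = ListDataset(
    [{"start": start, "target": target[:-prediction_length]}],
    freq=freq,
)

# Test set: the *entire* series (1-1000). During evaluation the model is
# conditioned on the earlier part of the series and its forecast is compared
# against the final 30 points, which were withheld from training.
test_ds = ListDataset(
    [{"start": start, "target": target}],
    freq=freq,
)
```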

arieffadhlan added the bug (Something isn't working) label on Jun 20, 2024