[ENH] Issue 1641 - Matrix profile-based anomaly detectors: left STAMPi #2091

ferewi · 2024-09-24T12:23:00Z

Reference Issues/PRs

What does this implement/fix? Explain your changes.

This PR implements the LeftSTAMPi Anomaly Detector based on the implementation in TimeEval (https://github.com/TimeEval/TimeEval-algorithms/blob/main/left_stampi/algorithm.py)

The Algorithm can be run in two modes.

Batch mode: Here the whole time series is put in at once. Internally the LeftSTAMPi algorithm is applied incrementally.
Stream mode: Here the Algorithm is initialized on a specified number of data points. The Matrix profile is then calculated incrementally.

Remarks:
The batch mode is implemented in the fit_predict method.
The stream mode is implemented in the fit and predictmethods, where fitis used to init the algorithm and predict is used for imcremental updates. As the predictmethod only accepts np.ndarray as its argument, on every update the new data point, which is a scalar has to be wrapped in a one-element numpy array. This is actually unnecessarry but avoiding this would involve a change to the interface in BaseAnomalyDetector.

Does your contribution introduce a new dependency? If yes, which one?

No.

Any other comments?

See remarks in the implementation description.

PR checklist

For all contributions

I've added myself to the list of contributors. Alternatively, you can use the @all-contributors bot to do this for you.
@all-contributors please add @ferewi for code, doc and test
The PR title starts with either [ENH], [MNT], [DOC], [BUG], [REF], [DEP] or [GOV] indicating whether the PR topic is related to enhancement, maintenance, documentation, bugs, refactoring, deprecation or governance.

For new estimators and functions

I've added the estimator to the online API documentation.
(OPTIONAL) I've added myself as a __maintainer__ at the top of relevant files and want to be contacted regarding its maintenance. Unmaintained files may be removed. This is for the full file, and you should not add yourself if you are just making minor changes or do not want to help maintain its contents.

For developers with write access

(OPTIONAL) I've updated aeon's CODEOWNERS to receive notifications about future changes to these files.

…he documentation

aeon-actions-bot · 2024-09-24T12:23:53Z

Thank you for contributing to `aeon`

I have added the following labels to this PR based on the title: [ $\color{#FEF1BE}{\textsf{enhancement}}$ ].
I have added the following labels to this PR based on the changes made: [ $\color{#6F6E8D}{\textsf{anomaly detection}}$ ]. Feel free to change these if they do not properly represent the PR.

The Checks tab will show the status of our automated tests. You can click on individual test runs in the tab or "Details" in the panel below to see more information if there is a failure.

If our pre-commit code quality check fails, any trivial fixes will automatically be pushed to your PR unless it is a draft.

Don't hesitate to ask questions on the aeon Slack channel if you have any.

PR CI actions

These checkboxes will add labels to enable/disable CI functionality for this PR. This may not take effect immediately, and a new commit may be required to run the new configuration.

Run pre-commit checks for all files
Run all pytest tests and configurations
Run all notebook example tests
Run numba-disabled codecov tests
Stop automatic pre-commit fixes (always disabled for drafts)
Push an empty commit to re-run CI checks

SebastianSchmidl

Thank you for your contribution!

I'm in favor of supporting streaming use cases, and I also like the idea of repeatedly calling predict. However, this violates the convention that no internal representation is updated in predict. We can neither use repeated calls to fit because it is assumed to reset the estimator on the beginning of each call.

I guess, we need to design a new API for streaming. @MatthewMiddlehurst how do you think about this?

Until we have decided on the new API, I would suggest adding leftSTAMPi only with its batch API.

aeon/anomaly_detection/_left_stampi.py

aeon/anomaly_detection/tests/test_left_stampi.py

ferewi · 2024-09-24T16:03:32Z

@CodeLionX Thanks for your comments and suggestions. The failing test-suite made me aware that the approach I took for the streaming case is violating the concept of fit, predictand fit_predict. I'll remove the streaming case for now. An idea for the streaming API might be to have an update method that is allowed to modify the internal representation after the intitial fitting.

Also, could you point me to the documentation of the sklearn and aeon conventions regarding the naming conventions (self.mp_vs self.mp), etc.? (If you have that at hand - I surely can google that myself)

…ter a decision about the streaming API has been made.

ferewi · 2024-09-25T09:09:33Z

Still making changes to fix the failling tests. I will re-request a new review when I am done.

SebastianSchmidl · 2024-09-25T09:37:06Z

Also, could you point me to the documentation of the sklearn and aeon conventions regarding the naming conventions (self.mp_vs self.mp), etc.? (If you have that at hand - I surely can google that myself)

It is sprinkled in this guide: https://scikit-learn.org/dev/developers/develop.html

E.g.

Also it is expected that parameters with trailing _ are not to be set inside the __init__ method. All and only the public attributes set by fit have a trailing _. As a result the existence of parameters with trailing _ is used to check if the estimator has been fitted.

is in Parameters and Init-section

MatthewMiddlehurst · 2024-09-25T11:32:23Z

Thanks for the contribution. Feel free to ask if you have any questions regarding the failures, i.e. we have a tag for estimators which can't be pickled (usually due to dependencies outside of our control).

MatthewMiddlehurst · 2024-09-25T11:33:41Z

Think adding to the base API should be a separate PR yeah, would need to see a proposal but if update or a similar method would work that sounds fine.

…rt to 'fit'.

SebastianSchmidl · 2024-09-25T13:35:39Z

Think adding to the base API should be a separate PR yeah, would need to see a proposal but if update or a similar method would work that sounds fine.

Another question is whether we actually want to introduce streaming algorithms in aeon. Are there already other streaming/online estimators? The current architecture is tailored to the batch-case. Maybe this is a topic for the next dev-meeting.

ferewi · 2024-09-25T13:53:50Z

Then maybe you discuss this in your next dev meeting and if you decide that you want to integrat a streaming api, I am happy to put in a proposal and another PR to implement this for the LeftSTUMPi algorithm.

ferewi · 2024-09-25T14:06:49Z

Thanks for the contribution. Feel free to ask if you have any questions regarding the failures, i.e. we have a tag for estimators which can't be pickled (usually due to dependencies outside of our control).

I guess I figured it out. I tagged the class with 'cant-pickle' and the tests are green now.

SebastianSchmidl

Otherwise, looks good 👍🏼

aeon/anomaly_detection/_left_stampi.py

… hints

ferewi · 2024-09-27T14:02:23Z

@CodeLionX How is the process after the PR is approved? Do I have to do something or is this simply integrated at some point?

MatthewMiddlehurst · 2024-09-27T14:14:48Z

Don't have to do anything, can be merged at any point really. Usually give it some time for other comments though. Not really a massive rush until it's close to release time 🙂

SebastianSchmidl · 2024-09-27T14:15:23Z

We will wait a bit, so that other maintainers / core devs get the chance to object; otherwise, I'll merge it in later.

EDIT: Matthew was faster 🤷🏼

SebastianSchmidl · 2024-09-27T14:17:49Z

@ferewi regarding the online-API, we decided in our dev-meeting that aeon will not support this in the near future. If you have a use case for streaming/online anomaly detection, we can, of course, talk about this again.

Matthew will create an issue to track this.

TonyBagnall · 2024-09-27T17:47:06Z

Fantastic, thanks for this

ferewi · 2024-09-27T21:43:27Z

Cool - I was just interested in how the usual process is :)

Thank you for your help @CodeLionX @MatthewMiddlehurst :)

MatthewMiddlehurst · 2024-11-14T13:27:41Z

@all-contributors add @ferewi for code

This may be duplicated, dont remember if we did this 🙂

allcontributors · 2024-11-14T13:27:53Z

@MatthewMiddlehurst

I've put up a pull request to add @ferewi! 🎉

ferewi added 3 commits September 24, 2024 13:22

added LeftSTAMPi implementation based on the implementation in TimeEval

9c8d7ea

fixed example markup in LeftSTAMPi doctring and added LeftSTAMPi to t…

df8b0fd

…he documentation

updated maintainer name to github username

38916bf

ferewi requested review from SebastianSchmidl and MatthewMiddlehurst as code owners September 24, 2024 12:23

aeon-actions-bot bot added anomaly detection Anomaly detection package enhancement New feature, improvement request or other non-bug code enhancement labels Sep 24, 2024

exclude examples from doctest

df2b0e5

SebastianSchmidl requested changes Sep 24, 2024

View reviewed changes

removed implementation of the streaming mode. Might be added again af…

17826d9

…ter a decision about the streaming API has been made.

Merge branch 'aeon-toolkit:main' into feature_1641_implement-left-stampi

0a5c683

ferewi added 2 commits September 25, 2024 13:34

fixed state modification in 'predict' by moving the initialisation pa…

6e94fd7

…rt to 'fit'.

import stumpy only once

ae5c307

mock stumpy to run unit tests if package not installed

34a87fa

ferewi requested a review from SebastianSchmidl September 25, 2024 14:06

Merge branch 'aeon-toolkit:main' into feature_1641_implement-left-stampi

9781dbd

SebastianSchmidl requested changes Sep 25, 2024

View reviewed changes

aeon/anomaly_detection/_left_stampi.py Outdated Show resolved Hide resolved

SebastianSchmidl mentioned this pull request Sep 26, 2024

[MNT] remove redundant soft dependency checks #2101

Merged

2 tasks

ferewi and others added 3 commits September 26, 2024 10:46

removed obsolete check for stumpy being loaded and added missing type…

c4ac90c

… hints

Automatic pre-commit fixes

3ea5b56

Merge branch 'aeon-toolkit:main' into feature_1641_implement-left-stampi

987aee3

ferewi requested a review from SebastianSchmidl September 26, 2024 08:48

SebastianSchmidl approved these changes Sep 26, 2024

View reviewed changes

TonyBagnall merged commit 5fadd1c into aeon-toolkit:main Sep 27, 2024
14 checks passed

allcontributors bot mentioned this pull request Nov 14, 2024

📝 Add ferewi as a contributor for code #2354

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ENH] Issue 1641 - Matrix profile-based anomaly detectors: left STAMPi #2091

[ENH] Issue 1641 - Matrix profile-based anomaly detectors: left STAMPi #2091

ferewi commented Sep 24, 2024

aeon-actions-bot bot commented Sep 24, 2024

SebastianSchmidl left a comment

ferewi commented Sep 24, 2024

ferewi commented Sep 25, 2024

SebastianSchmidl commented Sep 25, 2024 •

edited

Loading

MatthewMiddlehurst commented Sep 25, 2024

MatthewMiddlehurst commented Sep 25, 2024

SebastianSchmidl commented Sep 25, 2024

ferewi commented Sep 25, 2024

ferewi commented Sep 25, 2024

SebastianSchmidl left a comment

ferewi commented Sep 27, 2024

MatthewMiddlehurst commented Sep 27, 2024

SebastianSchmidl commented Sep 27, 2024 •

edited

Loading

SebastianSchmidl commented Sep 27, 2024 •

edited

Loading

TonyBagnall commented Sep 27, 2024

ferewi commented Sep 27, 2024

MatthewMiddlehurst commented Nov 14, 2024

allcontributors bot commented Nov 14, 2024

[ENH] Issue 1641 - Matrix profile-based anomaly detectors: left STAMPi #2091

[ENH] Issue 1641 - Matrix profile-based anomaly detectors: left STAMPi #2091

Conversation

ferewi commented Sep 24, 2024

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Does your contribution introduce a new dependency? If yes, which one?

Any other comments?

PR checklist

For all contributions

For new estimators and functions

For developers with write access

aeon-actions-bot bot commented Sep 24, 2024

Thank you for contributing to aeon

PR CI actions

SebastianSchmidl left a comment

Choose a reason for hiding this comment

ferewi commented Sep 24, 2024

ferewi commented Sep 25, 2024

SebastianSchmidl commented Sep 25, 2024 • edited Loading

MatthewMiddlehurst commented Sep 25, 2024

MatthewMiddlehurst commented Sep 25, 2024

SebastianSchmidl commented Sep 25, 2024

ferewi commented Sep 25, 2024

ferewi commented Sep 25, 2024

SebastianSchmidl left a comment

Choose a reason for hiding this comment

ferewi commented Sep 27, 2024

MatthewMiddlehurst commented Sep 27, 2024

SebastianSchmidl commented Sep 27, 2024 • edited Loading

SebastianSchmidl commented Sep 27, 2024 • edited Loading

TonyBagnall commented Sep 27, 2024

ferewi commented Sep 27, 2024

MatthewMiddlehurst commented Nov 14, 2024

allcontributors bot commented Nov 14, 2024

Thank you for contributing to `aeon`

SebastianSchmidl commented Sep 25, 2024 •

edited

Loading

SebastianSchmidl commented Sep 27, 2024 •

edited

Loading

SebastianSchmidl commented Sep 27, 2024 •

edited

Loading