the definitions of unsupervised and semi-supervised approaches #6

Open
Pengyu-Li-2002 opened this issue Nov 13, 2024 · 1 comment

@Pengyu-Li-2002

Dear Author,

Thank you for your open-source work; it has significantly advanced the field of time series anomaly detection (TSAD). However, I have a question. In the code, you use the training set for both training and validation and the test set solely for inference, and you treat this approach as semi-supervised. Does this differ from the definitions of unsupervised and semi-supervised approaches commonly used in the TSAD field? I look forward to your response.

Thank you!

@qhliu26
Member

qhliu26 commented Nov 13, 2024

Hello Pengyu,

Thank you for your interest in our work! I think this is common practice in the field. In fact, the test set is the entire time series, and the training set is the initial portion of the time series, which contains no anomalies or very few.

Here I attach some definitions of unsupervised/semi-supervised approaches in TSAD for reference:
(i) Unsupervised: requires no labeled data, so we pass the test set directly to the anomaly detector.
(ii) Semi-supervised: requires labels only for normal instances, so we use the training set to fit/train the model before applying it to the test set.
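
To make the difference between the two settings concrete, here is a minimal sketch. It is not the repository's code: the synthetic series, the 300-point training prefix, and the `zscore_scores` helper are all assumptions chosen for illustration, with a simple z-score detector standing in for a real model.

```python
import numpy as np

def zscore_scores(reference, target):
    """Score each point in `target` by its absolute z-score w.r.t. `reference`.

    Hypothetical stand-in for an anomaly detector: `reference` is the data
    the detector is fit on, `target` is the data being scored.
    """
    mu = reference.mean()
    sigma = reference.std() + 1e-8  # avoid division by zero
    return np.abs(target - mu) / sigma

# Synthetic series (assumption): a noisy sine wave with one injected spike.
rng = np.random.default_rng(0)
series = np.sin(np.linspace(0, 20 * np.pi, 1000)) + 0.1 * rng.standard_normal(1000)
series[800] += 5.0  # injected anomaly, located after the training prefix

train = series[:300]  # initial portion, assumed to be (mostly) anomaly-free
test = series         # the entire time series

# (i) Unsupervised: the detector only ever sees the test set.
unsup_scores = zscore_scores(test, test)

# (ii) Semi-supervised: fit on the normal training prefix, then score the test set.
semi_scores = zscore_scores(train, test)
```

In both cases the scores cover the whole series; the only difference is which data the detector is fit on.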

Let me know if this answers your question, and feel free to offer suggestions!
