TimeMixer-KAN is an innovative model that combines the TimeMixer architecture with Kolmogorov-Arnold Networks (KAN) to enhance time series forecasting, particularly for stock trading. This model, even without fine-tuning, has demonstrated superior performance over the standard TimeMixer model in short-term stock price forecasting.
TimeMixer-KAN is designed to leverage the strengths of both TimeMixer and Kolmogorov-Arnold Networks to process sequential data. By integrating these models, TimeMixer-KAN can effectively handle complex patterns in time series data, making it particularly useful for forecasting tasks in financial markets.
Hybrid Architecture: Combines TimeMixer’s sequential modeling capabilities with the Kolmogorov-Arnold Networks for increased flexibility and performance.
Short-Term Forecasting: Outperforms the standalone TimeMixer model, achieving a validation MSE loss of 0.0012793 on the stock 'AU8U.SI' (not yet fine-tuned).
To train the model, use the following command:
python ./stocks.py --tmkan --train --save_path ./timemixer_kan/
To resume training,
python ./stocks.py --tmkan --train --resume --save_path ./timemixer_kan/
To test the model, use:
python ./stocks.py --tmkan --test --save_path ./timemixer_kan/
This is a small experiment project, and any feedback or suggestions are greatly appreciated! Feel free to comment, open issues, or submit pull requests to help improve the model and its capabilities. I am using both the TimeMixer model concept (https://github.com/kwuking/TimeMixer) and KAN (https://github.com/pg2455/KAN-Tutorial)
- Fine tuning the model including tensorboard and predicted visualization
- Experiment on long and short term forecasting
- More trading stocks for comparison purposes
- MoE haven't done yet
- Add LLM model like Gemini, Llama, etc
- Data Augmentation