Performance Issue - OPTICS #521
Comments
Hello @mallika0613, are you sure that the ccore is used? Which version do you use? Is it possible to see the input data?
I am using Python 3.6. How do I check whether the ccore is used?
@mallika0613, I mean the pyclustering version: which pyclustering version do you use? Have you seen a warning message such as "The pyclustering ccore is not supported for platform..." or something similar? You can start a debugging session and check which method is used for processing in the
I think you are right, the ccore is not being used. But I also don't see the "ccore is not supported" message.
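For readers who want to answer both questions above (installed version, and whether the compiled core can be loaded), here is a minimal sketch. It assumes Python 3.8+ for `importlib.metadata`, and it assumes that the installed pyclustering release exposes `ccore_library.workable()` in `pyclustering.core.wrapper`; if that helper is missing or fails, the function reports `None` instead of guessing. The function name `pyclustering_status` is hypothetical.

```python
from importlib import metadata


def pyclustering_status():
    """Return (version, ccore_workable); either value is None if unknown."""
    try:
        version = metadata.version("pyclustering")
    except metadata.PackageNotFoundError:
        # pyclustering is not installed in this environment.
        return None, None

    try:
        # Assumption: this helper exists in the installed release.
        from pyclustering.core.wrapper import ccore_library
        workable = bool(ccore_library.workable())
    except Exception:
        # Helper missing or the shared library failed to load.
        workable = None

    return version, workable


print(pyclustering_status())
```

A result of `False` or `None` for the second value suggests that the pure Python implementation is being used, which would be consistent with the slow run times reported here.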
@mallika0613, how did you install the library?
I am running it on an AWS instance. I used pip to install the library.
@mallika0613, is there any information about the hardware platform and operating system?
It runs Linux with 488 GB of memory and 64 CPUs.
@mallika0613, what is the CPU architecture (for example, x86 or x86_64)?
It is x86_64.
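The platform details asked for above can be collected in one step. The following is a small sketch that uses only the Python standard library; the helper name `describe_platform` is made up for illustration.

```python
import os
import platform


def describe_platform():
    """Collect the OS name, CPU architecture, pointer width and CPU count."""
    return {
        "os": platform.system(),              # e.g. "Linux"
        "machine": platform.machine(),        # e.g. "x86_64"
        "bits": platform.architecture()[0],   # e.g. "64bit"
        "cpus": os.cpu_count(),               # logical CPUs, may be None
    }


for key, value in describe_platform().items():
    print(f"{key}: {value}")
```

Pasting this output into the issue gives the maintainer the operating system and architecture in a single reply.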
x86_64 is supported. OK, you can try to rebuild the ccore manually:
$ cd pyclustering/ccore
$ make ccore_x64
After that, please check that the ccore is used instead of the Python implementation.
@mallika0613 , in case of |
OK, then I think the installation was successful.
@mallika0613, just to be sure, could you please run the following:
$ make clean
$ make ccore_x64
I followed the steps, but I don't think it is improving the performance.
@mallika0613, clustering speed can be affected by data complexity, that's true. I will investigate the performance issues, but for now I recommend trying other algorithms or other libraries, such as scikit-learn or ELKI.
Sure, thank you. Also, a quick question: is it possible to extract the important features from the model?
@mallika0613, I have reduced the algorithmic complexity, which should help. There is also an additional issue, #379, that should improve performance further once it is done.
I am running the OPTICS algorithm on 50k data points. Since the data is text, it has around 5k features. The program takes a very long time to run. I tried using the ccore, but it doesn't seem to improve things. Is there any way I could improve performance?
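To put the reported sizes in perspective, here is a rough back-of-the-envelope sketch. It assumes a brute-force neighbor search (every pair of points compared once) and a dense matrix of 8-byte floats; neither assumption is confirmed for pyclustering in this thread, so treat the numbers as an upper-bound illustration only. The function name `pairwise_cost` is hypothetical.

```python
def pairwise_cost(n_points, n_features, bytes_per_value=8):
    """Estimate the work for comparing every pair of points once.

    Returns (pairs, feature_ops, dense_bytes), where feature_ops counts
    one per-feature operation for each pair and dense_bytes is the size
    of the input stored as a dense matrix.
    """
    pairs = n_points * (n_points - 1) // 2
    feature_ops = pairs * n_features
    dense_bytes = n_points * n_features * bytes_per_value
    return pairs, feature_ops, dense_bytes


pairs, ops, dense_bytes = pairwise_cost(50_000, 5_000)
print(f"point pairs:        {pairs:,}")        # 1,249,975,000
print(f"per-feature ops:    {ops:,}")          # 6,249,875,000,000
print(f"dense input matrix: {dense_bytes / 1e9:.1f} GB")  # 2.0 GB
```

Under these assumptions the distance work alone is on the order of trillions of per-feature operations, so long run times at 50k points and 5k features are not surprising even with a compiled core.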