Save DBScan Model #650

EricCacciavillani · 2020-11-08T17:27:10Z

First off, thank you so much for writing this package! I've been having a blast with it, and the documentation is amazing!

I found a bug, though: when I try to pickle.load the object back from a pickle file, it results in an error.

annoviko · 2020-11-08T19:07:35Z

Thank you for reporting this. Yes, I recognize the problem. It happens because pyclustering uses C++ code to get maximum performance (I rewrote some of the algorithms in C++), and not all internal data is transferred back to Python, only the results. As a result, the object cannot be restored. I think this is an issue for almost all algorithms that delegate calculation to the C++ implementation.

I will investigate the issue and introduce a fix for all clustering algorithms.

Still, it would be useful to have a code example. I would also like to know the use case, if possible.

EricCacciavillani · 2020-11-08T22:16:29Z

Sorry for responding so late; I was helping a co-worker.

from pyclustering.cluster.dbscan import dbscan
import pickle

# Data that has been PCA-transformed and scaled
scaled = auto_cluster.get_scaled_data()

# Model instance: eps=0.07865, neighbors=8, ccore=True (use the C++ implementation)
dbscan_instance = dbscan(scaled, 0.07865, 8, True)
dbscan_instance.process()

# Save model to file
file_dir = 'DBScan.pkl'
with open(file_dir, 'wb') as handle:
    pickle.dump(dbscan_instance, handle)

# Load model in
with open(file_dir, 'rb') as handle:
    load_model = pickle.load(handle)
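
On versions where the load above fails, one possible workaround is to pickle only the clustering results instead of the algorithm object. The following is a minimal sketch under that assumption, not something proposed in this thread: `save_clustering_result` and `load_clustering_result` are hypothetical helper names, and the inputs are assumed to be plain lists of point indices such as those returned by `dbscan_instance.get_clusters()` and `dbscan_instance.get_noise()`.

```python
import pickle


def save_clustering_result(clusters, noise, path):
    """Pickle only plain-Python results (lists of point indices)."""
    # Copy into built-in lists so nothing tied to the algorithm object is stored.
    result = {
        "clusters": [list(cluster) for cluster in clusters],
        "noise": list(noise),
    }
    with open(path, "wb") as handle:
        pickle.dump(result, handle)


def load_clustering_result(path):
    """Load the dictionary written by save_clustering_result."""
    with open(path, "rb") as handle:
        return pickle.load(handle)
```

With the snippet above this would be called as `save_clustering_result(dbscan_instance.get_clusters(), dbscan_instance.get_noise(), 'DBScanResult.pkl')`. The trade-off is that only the cluster assignments survive, not the configured algorithm itself.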


annoviko · 2020-11-09T21:20:37Z

I have added support for dumping and loading the algorithm. The changes are on the branch 0.10.dev and will be available in the next release, 0.10.1.
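
For readers unfamiliar with pickle's state hooks, here is a rough sketch of the general technique: an object that holds a non-picklable handle can define `__getstate__` and `__setstate__` to drop the handle when dumping and recreate it when loading. This is an illustration only, not pyclustering's actual implementation. `NativeBackedModel` is a hypothetical toy class, and a `threading.Lock` stands in for a pointer into native (C++) memory.

```python
import pickle
import threading


class NativeBackedModel:
    """Toy stand-in for an algorithm object that holds a non-picklable handle."""

    def __init__(self, data, eps):
        self.data = data
        self.eps = eps
        self.clusters = []
        # A lock cannot be pickled; it plays the role of a native handle here.
        self._native_handle = threading.Lock()

    def process(self):
        # Placeholder "clustering": put every point index into one cluster.
        self.clusters = [list(range(len(self.data)))]
        return self

    def __getstate__(self):
        # Dump only plain-Python attributes; leave the handle out.
        state = self.__dict__.copy()
        del state["_native_handle"]
        return state

    def __setstate__(self, state):
        # Restore the plain attributes and create a fresh handle.
        self.__dict__.update(state)
        self._native_handle = threading.Lock()
```

A round trip such as `pickle.loads(pickle.dumps(model))` then succeeds, whereas without the two hooks pickle would raise a `TypeError` on the lock.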

annoviko added the Bug label (Tasks related to found bugs) Nov 8, 2020

annoviko self-assigned this Nov 9, 2020

annoviko added a commit that referenced this issue Nov 9, 2020

#650: Dump and load DBSCAN algorithm using states (that are used by pickle).

0a35173

annoviko closed this as completed Nov 10, 2020
