Trainining Omikuji from scipy.sparse.csr_matrix #55
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
I've adapted an alternative method to train the omikuji model by bypassing disk write for the Python wrapper.
The main work is based on the creation of a new methods in the
lib.rs
file calledload_omikuji_data_set_from_features_labels
.It is designed to take in the three main numpy arrays defining the underlying structure of the
scipy.sparse.csr_matrix
.In other words I map the
scipy.sparse.csr_matrix.{indices, indptr, data}
arrays into Rust vectors, and then I recreate a features matrix together with the labels set, in a way similar to thetrain_on_data
method.