missing `get_minimum_spanning_tree` and similar #57

rtbs-dev · 2023-12-05T15:46:36Z

Very impressive library so far. Just wanted to mention here, unless I'm misreading your API docs, that the Graph object doesn't have an implementation of Prim's or Kruskal's minimum/maximum spanning tree. This is the last thing keeping me on e.g. scipy.sparse.csgraph, and was the first thing I looked for here.

Ideally, I would imagine a slightly more useful MST interface that e.g. defaults to the spanning tree for the whole graph, but could accept an array of node activation flags and an (optional) cost matrix to calculate the MST on that induced subgraph. This is part of a simple way to approximate the steiner tree on those nodes, for instance. If the user doesn't supply a cost matrix, then the metric closure would work (again, if desired...MST on the original graph weights is probably the default).

I did find these, but a number expressly say the tree is not minimal:

spanning_arborescence
spanning_arborescence_kruskal
random_spanning_arborescence_kruskal (very nice! Wilson's algorithm? Is this a uniform-random sample over unweighted trees? Is the MST the mode for weighted edges, like it normally would be?

The text was updated successfully, but these errors were encountered:

LucaCappelletti94 · 2023-12-05T15:48:12Z

What do you mean by but a number expressly say the tree is not minimal?

rtbs-dev · 2023-12-06T03:43:31Z

I mean in the docs. From the second one:

Returns consistent spanning arborescence using Kruskal.
The spanning tree is NOT minimal.

From the third:

The spanning tree is NOT minimal. The given random_state is NOT the root of the tree.

And the first seems to look like the second, and never specifies if the arborescence is minimal over the provided edge weights.

LucaCappelletti94 · 2023-12-06T08:05:58Z

All of these methods will return you the arborescences, which of course have a minimal number of edges. I don't recall whether I implemented one for the weighted case, as I don't have ever needed one. Do you know any good algorithm that scales well?

rtbs-dev · 2023-12-06T20:24:09Z

Sure; for starters, scipy implements minimum_spanning_tree (in fact, everything in the scipy.sparse.csgraph module would be a great thing to include here!)

The source code there has a reference implementation of Kruskal's algorithm (in a weighted setting).

The other option (outside of networkX's many implementations, one of which is Boruvka's algorithm) is graphblas, which would be very fast if done on the matrix, directly, but I can only find a version of Prim's algorithm in a C++ template repo...nothing for python-graphblas.

rtbs-dev · 2023-12-06T20:59:31Z

Note that these are all essentially O(|E|log|V|), so they are considered quite fast already. I think there's an expected-linear-time one, as well, e.g. here. But that would probably be more work than it's worth.

My use-case is typically finding MSTs in a metric closure, so Prim's algorithm runs faster (on dens graphs).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

missing `get_minimum_spanning_tree` and similar #57

missing `get_minimum_spanning_tree` and similar #57

rtbs-dev commented Dec 5, 2023

LucaCappelletti94 commented Dec 5, 2023

rtbs-dev commented Dec 6, 2023

LucaCappelletti94 commented Dec 6, 2023

rtbs-dev commented Dec 6, 2023

rtbs-dev commented Dec 6, 2023

missing get_minimum_spanning_tree and similar #57

missing get_minimum_spanning_tree and similar #57

Comments

rtbs-dev commented Dec 5, 2023

LucaCappelletti94 commented Dec 5, 2023

rtbs-dev commented Dec 6, 2023

LucaCappelletti94 commented Dec 6, 2023

rtbs-dev commented Dec 6, 2023

rtbs-dev commented Dec 6, 2023

missing `get_minimum_spanning_tree` and similar #57

missing `get_minimum_spanning_tree` and similar #57