-
Notifications
You must be signed in to change notification settings - Fork 6.8k
Can anybody provide a kv-store (distributed training) example with cpp package? #12576
Comments
Thanks for submitting the issue @xiaolin-cheng |
@kalyc do we have a C++ example of distributed training? thanks |
@xiaolin-cheng We currently do not have example in C++ to demonstrate the distributed training. We have the support for distributed training in Python. Would you be able to share the use cases where you are planning to use distributed training using C++? I am especially curious about the reasons behind choosing C++ over Python for training. Thanks |
I am implementing an allreduce distributed training algorithm, and wanted to compare with MXNet's native parameter server approach. I did some hacking in the library C++ code, so comparing with C++ ps training would be fair. A quick C++ ps example of even how to use C++ ps APIs would be great. I was lost when I looked into C++ ps APIs and had no idea where to start. Thanks
|
@xiaolin-cheng Thanks for providing the information regarding your use case. The C++ API that are meant to be used as front-end API are available [here] (https://github.com/apache/incubator-mxnet/tree/master/cpp-package/include/mxnet-cpp) Currently, the documentation for these APIs are not getting generated and published. We have a PR #12977 out that will add the documentation to the main site. |
@xiaolin-cheng were you able to implement what you needed with the generated c++ documentation? if you were, you can contribute your code as an example for distributed training with c++ |
@mxnet-label-bot add[Distributed] |
@xiaolin-cheng , i also want to have a distributed-kvstore c++ example, do you have one now ? |
I am OK with python bindings, but not able to figure out how to use cpp package kv-store APIs. There is no working example under mxnet\cpp-package\example. Thanks a lot!
The text was updated successfully, but these errors were encountered: