This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

add symbol.SwapAxis operator, just can do Forward(). #502

Closed
wants to merge 9 commits into from

Conversation

starimpact
Contributor

This is an early version; it can run now, but it hasn't been tested.
This is just a preview version, meant to let you have a look and give me some ideas.

@starimpact
Contributor Author

I am sorry I did not see your comments.

@starimpact
Contributor Author

Thanks for your suggestions.

@starimpact
Contributor Author

I have finished the new code based on your suggestions.
The Backward pass is also completed.
Please check the code for me and give me more suggestions.
Thank you very much. :smile:

@@ -38,15 +38,15 @@ ADD_CFLAGS =
#---------------------------------------------

# whether use CUDA during compile
-USE_CUDA = 0
+USE_CUDA = 1
Member

Please change this back to the default, as most users don't have CUDA.

@tqchen
Member

tqchen commented Nov 6, 2015

Thanks for the contribution. I have made a few comments on the code. In general:

  • Code style: we use the Google C++ code style; you can reproduce the linter check locally with
make lint

@starimpact
Contributor Author

Thanks very much for your suggestions.

…swapaxis files. Some code style changes. Function is ready to go.
@starimpact
Contributor Author

The function is ready to go; please check it.
There may still be a few minor code style problems.
😆 😄 😸 😈

@starimpact
Contributor Author

I have come across a problem:
SwapAxis(..., dim1=2, dim2=3) works well on both CPU and GPU.
SwapAxis(..., dim1=3, dim2=2) only works on CPU, but fails on GPU with the following error:

INFO:root:Start training with [gpu(0)]
[13:46:12] ./dmlc-core/include/dmlc/logging.h:208: [13:46:12] ./mshadow/mshadow/./tensor_blob.h:530: Check failed: (this->shape_.Size()) == (shape.Size()) TBlob.get_with_shape: new and old shape do not match total elements
[13:46:12] ./dmlc-core/include/dmlc/logging.h:208: [13:46:12] src/engine/./threaded_engine.h:295: [13:46:12] ./mshadow/mshadow/./tensor_blob.h:530: Check failed: (this->shape_.Size()) == (shape.Size()) TBlob.get_with_shape: new and old shape do not match total elements
terminate called after throwing an instance of 'dmlc::Error'
  what():  [13:46:12] src/engine/./threaded_engine.h:295: [13:46:12] ./mshadow/mshadow/./tensor_blob.h:530: Check failed: (this->shape_.Size()) == (shape.Size()) TBlob.get_with_shape: new and old shape do not match total elements
Aborted (core dumped)

@starimpact
Contributor Author

std::accumulate is not recognized by nvcc.

@tqchen
Member

tqchen commented Nov 8, 2015

Oh, yep. Maybe creating our own version of the prod function is easier. Then I think it is OK. Thanks.
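For reference, a minimal sketch of what such a hand-rolled product helper could look like (the name ShapeProd and the raw-pointer signature are illustrative assumptions, not the PR's actual code); it avoids std::accumulate so nvcc can compile the header it lives in:

#include <cstddef>

// Hypothetical stand-in for std::accumulate with std::multiplies:
// multiply every axis length of a shape into a single element count.
inline std::size_t ShapeProd(const std::size_t *dims, std::size_t ndim) {
  std::size_t prod = 1;
  for (std::size_t i = 0; i < ndim; ++i) {
    prod *= dims[i];  // accumulate the product of each dimension
  }
  return prod;
}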

@tqchen
Member

tqchen commented Nov 8, 2015

For the error, as it indicates, the shape's size does not match the size of the TBlob. This is likely due to a shape initialization error, either in InferShape or in Shape2Five.
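To illustrate the invariant that check guards, here is a small sketch in plain C++ (not mshadow's actual code, and the 4-d shape is just an assumed example): swapping two axes must leave the total element count unchanged, so a shape produced by InferShape or Shape2Five with a different product of dimensions will trip exactly this error in get_with_shape.

#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

int main() {
  std::vector<std::size_t> in_shape = {1, 16, 30, 62};  // assumed input shape
  std::vector<std::size_t> out_shape = in_shape;
  std::swap(out_shape[2], out_shape[3]);                // swap the last two axes

  std::size_t in_size = 1, out_size = 1;
  for (std::size_t d : in_shape) in_size *= d;
  for (std::size_t d : out_shape) out_size *= d;

  // A buggy shape computation would break this equality and trigger
  // "new and old shape do not match total elements" in TBlob.get_with_shape.
  assert(in_size == out_size);
  return 0;
}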

@tqchen
Member

tqchen commented Nov 8, 2015

The platform-dependent behavior is likely due to some uninitialized memory (variable) that causes nondeterminism, but that is just my guess.

@tqchen
Member

tqchen commented Nov 8, 2015

Please resolve all the issues reported by cpplint; it prints detailed messages.

@tqchen
Member

tqchen commented Nov 8, 2015

For the output, we need to do

exec_c.forward()
out = exec_c.outputs[0].asnumpy()
print out

The error was because we did not call asnumpy to wait for the result, so the system started shutting down before the computation had started.

@starimpact
Contributor Author

I cannot see any comments from cpplint on the swapaxis files. Where are they?

@starimpact
Contributor Author

There are some errors...

=====69/70 cpp-header files passed check=====
src/operator/swapaxis-inl.h: 34 Errors of 4 Categories map={'whitespace': 29, 'runtime': 2, 'readability': 2, 'build': 1}
=====57/58 cpp-soruce files passed check=====
src/operator/swapaxis.cc: 2 Errors of 1 Categories map={'readability': 2}
=====40/40 python files passed check=====
2 files failed lint
make: *** [lint] Error 1

@tqchen
Member

tqchen commented Nov 8, 2015

Hmm, the error messages occur before these, while it scans through the files. You may need to scroll up a bit to see them.

@starimpact
Contributor Author

The CPU works right now, but not the GPU:

import mxnet as mx
import numpy as np

def test2():
    data_in = mx.symbol.Variable('data')
    conv = mx.symbol.Convolution(data=data_in, kernel=(3, 3), num_filter=16)
    datatmp = np.ones((1, 1, 32, 64))
    mxdata = mx.nd.array(datatmp)
    weightmp = np.ones((16, 1, 3, 3))
    mxweight = mx.nd.array(weightmp)
    biastmp = np.zeros(16)
    mxbias = mx.nd.array(biastmp)
    exe_c = conv.bind(ctx=mx.gpu(0), args=[mxdata, mxweight, mxbias])
    exe_c.forward()
    out = exe_c.outputs[0].asnumpy()
    print out

test2()

Error

[14:50:16] ./dmlc-core/include/dmlc/logging.h:208: [14:50:16] ./mshadow/mshadow/./tensor_blob.h:508: Check failed: Device::kDevMask == dev_mask_ && DataType<DType>::kFlag == type_flag_ TBlob.get: device type do not match specified type
[14:50:16] ./dmlc-core/include/dmlc/logging.h:208: [14:50:16] src/engine/./threaded_engine.h:295: [14:50:16] ./mshadow/mshadow/./tensor_blob.h:508: Check failed: Device::kDevMask == dev_mask_ && DataType<DType>::kFlag == type_flag_ TBlob.get: device type do not match specified type
terminate called after throwing an instance of 'dmlc::Error'
  what():  [14:50:16] src/engine/./threaded_engine.h:295: [14:50:16] ./mshadow/mshadow/./tensor_blob.h:508: Check failed: Device::kDevMask == dev_mask_ && DataType<DType>::kFlag == type_flag_ TBlob.get: device type do not match specified type
Aborted (core dumped)

@starimpact
Contributor Author

make lint all passes now!

…support. Change test function name to test_swapaxes.
std::vector<TShape> *out_shape,
std::vector<TShape> *aux_shape) const override {
int input_num = in_shape->size();
if (input_num == 0) {
Member

CHECK_EQ(in_shape->size(), 1);

@tqchen
Member

tqchen commented Nov 8, 2015

Thanks for the good job! I have a few last comments; please address them, and rebase to resolve the conflict with the current master:

http://mxnet.readthedocs.org/en/latest/contribute.html#how-to-resolve-conflict-with-master

The conflict might have something to do with commits you have in mshadow. If that is the case, the best way might be to reset mshadow's version, or to keep a copy of your files and do a clean fork.

@starimpact
Contributor Author

Done, please have a look!

@starimpact
Contributor Author

Oh, sorry, I have not addressed your newest comments yet.

@starimpact
Contributor Author

I think you forgot that I cannot push to your repository.

@starimpact
Contributor Author

Is there another way to pull in your newest commits without forking a second time?

@tqchen
Member

tqchen commented Nov 9, 2015

@starimpact
Contributor Author

Something goes wrong when I rebase onto your mxnet.

@starimpact
Contributor Author

I fetched your mxnet, but my repository gets changed back to an older version when I rebase.
Really strange.

@starimpact
Contributor Author

I have pushed the newest code to my forked mxnet. Can you check it?

@starimpact
Contributor Author

I need a little time to figure out the strange rebase problem.

@starimpact
Contributor Author

Can I do a merge instead?

@tqchen
Member

tqchen commented Nov 9, 2015

The general instructions are here: http://mxnet.readthedocs.org/en/latest/contribute.html#how-to-resolve-conflict-with-master

If you find files with conflicts, edit the files to merge the conflicts, and do a git add as indicated in the instructions.

@starimpact
Contributor Author

The situation is: I have committed a lot in my local repository, and when I rebase onto your newest master, my working directory content is reset to the first commit. What can I do?

@starimpact
Contributor Author

OK, I think I found the way....

@starimpact
Contributor Author

OK, now please check.

@tqchen
Member

tqchen commented Nov 9, 2015

Closed due to #519.

@tqchen tqchen closed this Nov 9, 2015