[Codegen] remove fp16 function override for cuda #4331

yzhliu · 2019-11-14T00:39:09Z

Overriding the fp16 functions without volatile has no effect, because CUDA already provides that functionality.
Overriding them with volatile is not supported by NVRTC at the moment.
I suggest disabling the feature for now and re-enabling it once we find a solution.
See https://discuss.tvm.ai/t/error-fp16-cuda-compilation-error/ for details.

@vinx13 @Hzfengsy @tqchen Please advise.
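
To make the volatile versus non-volatile distinction concrete, here is a minimal host-side C++ sketch. HalfLike and half_max are hypothetical names chosen for illustration; they are not the overrides this PR removes, and the sketch makes no claim about what NVRTC accepts. It only shows the general C++ rule that a volatile object cannot bind to a const reference without the volatile qualifier, which is why a separate volatile overload would be needed at all.

```cpp
#include <cassert>

// Hypothetical stand-in for a half-precision value. This is NOT CUDA's
// __half or TVM's half class; it stores a float so it runs on the host.
struct HalfLike {
  float v;
};

// Non-volatile overload: selected when both operands are ordinary lvalues.
inline float half_max(const HalfLike& a, const HalfLike& b) {
  return a.v > b.v ? a.v : b.v;
}

// Volatile overload: the only viable candidate when an operand is volatile,
// because a volatile object cannot bind to a const HalfLike& parameter.
inline float half_max(const volatile HalfLike& a, const volatile HalfLike& b) {
  const float x = a.v;  // read each volatile operand exactly once
  const float y = b.v;
  return x > y ? x : y;
}
```

With only the first overload present, a call such as half_max(c, d) on volatile operands would fail to compile, which is the gap a volatile override is meant to fill.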

vinx13 · 2019-11-14T01:26:50Z

src/codegen/literal/cuda_half_t.h

@@ -76,7 +77,7 @@ class TVM_ALIGNED(2) half {
  TVM_XINLINE explicit half(const uint8_t& value) { constructor(value); }
  TVM_XINLINE explicit half(const int32_t& value) { constructor(value); }
  TVM_XINLINE explicit half(const uint32_t& value) { constructor(value); }
-  TVM_XINLINE explicit half(const int64_t& value) { constructor(value); }
+  TVM_XINLINE explicit half(const long long& value) { constructor(value); }


vinx13 · Nov 14, 2019

Why this change?

yzhliu · Nov 14, 2019

Do I need to #include <cstdint> for int64_t?

vinx13 · Nov 14, 2019

It's okay to use long long, as the two types are the same for CUDA.
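
The point of the exchange above can be sketched on the host as follows. HalfSketch is a hypothetical stand-in for the half class in the diff, and the static_asserts encode an assumption (64-bit long long, same width as int64_t) that holds on common platforms but is not guaranteed by the C++ standard. Spelling the parameter as long long avoids depending on a header for the int64_t typedef; <cstdint> is included here only to make the width comparison checkable.

```cpp
#include <cassert>
#include <climits>
#include <cstdint>  // used here only to compare against int64_t

// Assumption of this sketch: long long is 64 bits and matches int64_t's width.
static_assert(sizeof(long long) * CHAR_BIT == 64, "long long assumed 64-bit");
static_assert(sizeof(long long) == sizeof(std::int64_t),
              "long long assumed to have the same width as int64_t");

// Hypothetical stand-in for the half class; it stores a float for simplicity.
struct HalfSketch {
  float v;
  // long long is a built-in type, so this signature needs no typedef header.
  explicit HalfSketch(const long long& value)
      : v(static_cast<float>(value)) {}
};
```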

* add volatile override back
* [codegen] remove fp16 function override for cuda

* [TOPI][OP] Support Faster-RCNN Proposal OP on CPU (apache#4297)
  * Support Proposal operator on CPU.
  * PyLint space issue
  * PyLint space issue
  * Pylint singleton-comparison issue
* [QNN][Legalize] Specialize for Platforms without any fast Int8 arithmetic units. (apache#4307)
* fix error when memory_id is VTA_MEM_ID_OUT (apache#4330)
* [CI][DOCKER] Add ONNX runtime dep (apache#4314)
  * [DOCKER] Add ONNX runtime dep
  * Improve ci script
* [QNN] Quantize - Fixing the sequence of lowering. (apache#4316)
* [QNN] Use Int16 upcast in Fallback Conv2D. Fix test names. (apache#4329)
* [doc][fix] fix sphinx parsing for pass infra tutorial (apache#4337)
* change ci image version (apache#4313)
* [Codegen] remove fp16 function override for cuda (apache#4331)
  * add volatile override back
  * [codegen] remove fp16 function override for cuda
* [CI] Set workspace to be per executor (apache#4336)
* [Build][Windows] Fix Windows build by including cctype (apache#4319)
  * Fix build
  * dummy change to retrigger CI
  * dummy change to retrigger ci
  * dummy change to retrigger ci
* Enable hipModuleGetGlobal() (apache#4321)
* [Relay][Pass] Add pass to remove unused functions in relay module (apache#4334)
  * [Relay][Pass] Add pass to remove unused functions in relay module
  * Add tests
  * Fix lint
  * Fix visit order
  * Add pass argument
  * Fix
* Add support for quant. mul operator in tflite frontend (apache#4283)
  A test for qnn_mul has to be added when the qnn elemwise tests (apache#4282) get merged.
* Add topi.nn.fifo_buffer to TVM doc (apache#4343)
* Solve custom model of prelu (apache#4326)
* Deprecate NNVM warning msg (apache#4333)
* [Contrib] Add MKL DNN option (apache#4323)
  * [Contrib] Add MKL DNN
  * update
  * update
* [Relay][Frontend][TF] Fix transpose when axes is not a param (apache#4327)
  * [Relay][Frontend][TF] Use _infer_value_simulated when axes is not a const to Transpose
  * uncomment tests
  * dummy change to retrigger ci
* [RUNTIME] Add device query for AMD GcnArch (apache#4341)
  * add gcnArch query
  * kGcnArch query for cuda is a no-op
* [Test][Relay][Pass] Add test case for lambda lift (apache#4317)
* [Relay][Frontend][ONNX] operator support: DepthToSpace, SpaceToDepth (apache#4271)
* imp module is deprecated (apache#4275)
* [VTA] Bug fix for padded load with large inputs (apache#4293)
  * bug fix for padded load with large inputs
  * Update TensorLoad.scala
  * Update test_vta_insn.py
* fix inconsistent tag name (apache#4134)
* [CodeGen] Add build config option disable_assert to control whether to generate assert (apache#4340)
* Bump up CUDA log version in tophub.py (apache#4347)
* Add check to ensure input file was successfully opened in NNVM deploy code demo (apache#4315)
* [COMMUNITY] Add DISCLAIMER, KEYS for ASF release (apache#4345)
  * [COMMUNITY] Add DISCLAIMER, KEYS for ASF release
  * Add file name spec
* [Relay][VM][Interpreter] Enable first-class constructors in VM and interpreter via eta expansion (apache#4218)
  * Fix constructor pretty printing
  * Make Module::HasDef name consistent with API
  * Add VM constructor compilation via eta expansion
  * Lint
  * Fix CI
  * Fix failing test
  * Address comment
  * Retrigger CI
  * Retrigger CI
* Update dmlc_tvm_commit_id.txt

yzhliu added 2 commits November 13, 2019 16:28

add volatile override back

41d657c

[codegen] remove fp16 function override for cuda

07685b4

yzhliu force-pushed the cu_fp16 branch from 7438a22 to 07685b4 on November 14, 2019 01:15

vinx13 reviewed Nov 14, 2019

tqchen assigned vinx13 Nov 14, 2019

vinx13 approved these changes Nov 14, 2019

vinx13 merged commit cf83d50 into apache:master Nov 14, 2019

vinx13 added the status: accepted label Nov 14, 2019

reminisce mentioned this pull request Nov 14, 2019

Update TVM submodule apache/mxnet#16777

Merged

zxy844288792 pushed a commit to zxy844288792/tvm that referenced this pull request Nov 15, 2019

[Codegen] remove fp16 function override for cuda (apache#4331)

464da21

* add volatile override back
* [codegen] remove fp16 function override for cuda

zxy844288792 pushed a commit to zxy844288792/tvm that referenced this pull request Nov 15, 2019

[Codegen] remove fp16 function override for cuda (apache#4331)

9651abb

* add volatile override back
* [codegen] remove fp16 function override for cuda

yzhliu mentioned this pull request Nov 16, 2019

[RELEASE][DRAFT] TVM v0.6 Release candidate #4259

Closed
