[Relay][VM] Add AllocTensor instruction and better instruction printer #3306
Conversation
runtime::TVMArgsSetter setter(values.data(), codes.data());
size_t arity = 0;
for (Index i = 0; i < arg_count; i++) {
  if (args[i].ptr_->tag == ObjectTag::kDatatype) {
When will this happen?
Ops like concatenate take a tuple of tensors as input, and fusion can sometimes make a fused function take a tuple as input.
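(Roughly, the flattening looks like the sketch below; the types are simplified stand-ins rather than the actual TVM runtime code.)

```cpp
// Minimal sketch, not the actual TVM code: how a tuple (ADT) argument could be
// flattened into individual tensor arguments before calling a packed function.
#include <cassert>
#include <vector>

enum class ObjectTag { kTensor, kDatatype };

struct Obj {
  ObjectTag tag;
  void* tensor = nullptr;    // set when tag == kTensor
  std::vector<Obj*> fields;  // set when tag == kDatatype (e.g. a tuple)
};

// Append the tensor(s) carried by one VM argument to the packed-call arguments.
inline void FlattenArg(const Obj* arg, std::vector<void*>* packed_args) {
  if (arg->tag == ObjectTag::kDatatype) {
    // e.g. the tuple passed to concatenate, or a tuple input of a fused function
    for (const Obj* field : arg->fields) {
      assert(field->tag == ObjectTag::kTensor);  // nested tuples not handled here
      packed_args->push_back(field->tensor);
    }
  } else {
    packed_args->push_back(arg->tensor);
  }
}
```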
LGTM
Sorry for the slow review, Haichen. Could you provide an example of the updated instruction printer? My only concern is that I'm not sure we should introduce duplicate instructions just for readability wins; should we really differentiate static vs. non-static allocation? I am not 100% sure this matches the design I had in mind for memory manifestation and optimization. We could do this for now and change it later when the memory work ships, but I was thinking we should introduce a low-level concept of storage, independent from the allocation of tensors. For example:
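(Something along these lines; just a sketch, the instruction names and fields are illustrative rather than anything final.)

```cpp
// Illustrative sketch of a storage-first design: a raw storage buffer is
// allocated once, and tensors are created as views over it. The names and
// fields here are hypothetical, not from this PR.
#include <cstdint>
#include <vector>
#include <dlpack/dlpack.h>  // DLDataType

using RegName = int64_t;
using Index = int64_t;

struct AllocStorageInsn {  // dst = alloc_storage(size_bytes, alignment)
  RegName dst;
  Index size_bytes;
  Index alignment;
};

struct AllocTensorFromStorageInsn {  // dst = alloc_tensor(storage, offset, shape, dtype)
  RegName dst;
  RegName storage;  // register produced by AllocStorageInsn
  Index offset;
  std::vector<int64_t> shape;
  DLDataType dtype;
};
```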
I guess we could have two low-level instruction variants which take storage, dtype, and shape, but in general I don't like specializing/duplicating, as it increases the number of potential code paths, and the allocation mechanism should have no real differences AFAICT. The rest of the changes look good to me.
@jroesch I don't have a strong argument for adding the new static tensor allocation, but I don't see how adding this new instruction causes any overhead either. I suggest we keep this static alloc instruction for now and later change it to allocating tensors from storage. I can also reserve an opcode for an alloc-storage instruction. What do you think? I'll add examples of the instruction printer.
Updated instruction printer outputs:
Okay, looks good. I plan on moving back to the VM after the tutorial.
apache#3306)
* Update vm print & add AllocTensor instruction
* patch
* fix invoke packed
* update cmake
* tweak move
* update invoke_closure
* lint
* add doc
* tweak
This PR renames the AllocTensor instruction to AllocTensorReg for dynamic shape allocation, and adds an AllocTensor instruction for constant shapes. Currently, every AllocTensor requires a LoadConst instruction before it, which I think significantly increases the number of instructions and reduces readability. Therefore I think it's better to have both AllocTensor and AllocTensorReg instructions.

cc: @jroesch @wweic @zhiics @tqchen
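(For illustration, a rough sketch of the two instruction payloads; the field names here are illustrative and may differ from the merged code.)

```cpp
// Sketch of the two allocation forms (illustrative only). With a const-shape
// AllocTensor, a static allocation is a single instruction; previously it
// needed a LoadConst for the shape plus an allocation that read the shape
// back out of a register.
#include <cstdint>
#include <vector>
#include <dlpack/dlpack.h>  // DLDataType

using RegName = int64_t;

struct AllocTensorInsn {       // static shape, known at compile time
  RegName dst;
  std::vector<int64_t> shape;  // e.g. {1, 224, 224, 3}
  DLDataType dtype;
};

struct AllocTensorRegInsn {  // dynamic shape, computed at runtime
  RegName dst;
  RegName shape_register;    // register holding the shape
  DLDataType dtype;
};
```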