dmlc/tvm sync 20190410 #26

wweic · 2019-04-10T17:07:22Z

Thanks for contributing to TVM! Please refer to guideline https://docs.tvm.ai/contribute/ for useful information and tips. After the pull request is submitted, please request code reviews from Reviewers.

* [Relay, TOPI] Add deformable conv2d * Moved to op level2 * Fix lint * Moved to level2 & bug fix * Update comments * Disabled flaky test of conv2d

* do second order * add comment * better name * use tvm assert all close * refire ci

) This reverts commit f5ca991.

Fix comment bugs and code style

…#2929)

…ed (apache#2860)

…pache#2850) * [FRONTEND][ONNX] Some bug fixes and Shape operator fixed for relay. * * test cases * * ci error

…pache#2864) * [FRONTEND][TENSORFLOW] bug fix for tensorflow official slim models. * * review comments

…apache#2937)

* [DOCKER][FRONTEND] Run DarkNet tests * update tests to pass CI

* Update take * Add special case for canonical simplify and fix test cases * Use lower case for wrap and clip * remove unnecssary lower * Fix mxnet converter for take * fix

* gather_nd added * gather_nd test added * more test added * fix lint * fix build error * fix lint * comments addressed

* error fixed * rename * solve conlicts with master * more test added * fix error * remove test * comment addressed

* Fix bias add default axis * update * Fix canonicalize ops for bias_add

* [Relay][Frontend] Support TF Gather * fix comments

…e#2897 (apache#2950) There are many OpenCL platforms that do not yet support OpenCL 2.0, hence we use 1.2 APIs, some of which are now deprecated. In order to turn off the deprecation warnings (elevated to errors by -Werror) we explicitly disable the 1.2 deprecation warnings. At the point TVM supports minimum version 2.0, this commit can be reverted.

…2819)

…ewer c++(https://en.cppreference.com/w/cpp/utility/functional/unary_function) (apache#2962)

…pache#2664) [AUTOTVM][TOPI] Port x86 NCHWc to AutoTVM for Task Extraction

* [HEADER] ASF header dir=include * [HEADER] ASF Header dir=src * [HEADER] ASF Header -dir=python * [HEADER] ASF header dir=topi * [HEADER] ASF Header dir=nnvm * [HEADER] ASF Header -dir=tutorials * [HEADER] ASF Header dir=tests * [HEADER] ASF Header -dir=docker * fix whitespace * [HEADER] ASF Header -dir=jvm * [HEADER] ASF Header -dir=web * [HEADER] ASF Header --dir=apps * [HEADER] ASF Header --dir=vta * [HEADER] ASF Header -dir=go * temp * [HEADER] ASF Header --dir=rust * [HEADER] Add ASF Header --dir=cmake * [HEADER] ASF Header --dir=docs * [HEADER] Header for Jenkinsfile * [HEADER] ASF Header to toml and md * [HEADER] ASF Header to gradle * Finalize rat cleanup * Fix permission * Fix java test * temporary remove nnvm onnx test

…n in CombineParallelConv2D (apache#2961) * [Relay] InferCorrectLayout for strided_slice * Add min_num_branches option to CombineParallelConv2D * Return undef if original layout contains splitted axes

…apache#2988) * save * lint

lint lint save save add more case save error lint lint commit do lint save fix lint wrap it back as func lint save remove dead comment fix style fix lint Update src/relay/pass/partial_eval.cc Co-Authored-By: MarisaKirisame <[email protected]> Update src/relay/pass/partial_eval.cc Co-Authored-By: MarisaKirisame <[email protected]> Update src/relay/pass/partial_eval.cc Co-Authored-By: MarisaKirisame <[email protected]> Update src/relay/pass/partial_eval.cc Co-Authored-By: MarisaKirisame <[email protected]> Update src/relay/pass/partial_eval.cc Co-Authored-By: MarisaKirisame <[email protected]> Update src/relay/pass/partial_eval.cc Co-Authored-By: MarisaKirisame <[email protected]> address review feedback pe now handle freevar. as a result preserving function is now trivial. test add basic test, implement pretty printing for generic function test lint fix segfault save save do test fix another error address comment commit save address review feedback add test for invalidate, fix error in lookup rename cont to boduy fix error and add regression test fix error, add test case Update src/relay/pass/partial_eval.cc Co-Authored-By: MarisaKirisame <[email protected]> fix lint remove extra line save save

…generating (apache#5962) * Code migration Start (neo-ai#1) * Init commit: Code migration Start * Add loop_state.cc/h * Add ComputeDAG basic test * Split transform_step out & Update more UTs (neo-ai#3) * Split transform_step out * Update GetProducers & GetConsumers * Update UTs * Add UT for CacheReadWrite & Some bug fix * Add search_task, measure and serialization (neo-ai#4) * Add FollowSplit & FollowFusedSplit tests * Update dag.InferBound & its UT * Add search_task, measure and serialization * Update Serialization UT * Add MetaTileRewritePolicy (neo-ai#5) * Add feature * Add cost_model, meta_tile_rewrite_policy * Add MetaTileRewritePolicy basic UT * Basic Python API for State (neo-ai#6) * Add Basic Python API for State * Add UTs for State * Add Python API: Measure & Task (neo-ai#7) * Update the return value of state operation * Add task * Copy measure.py & utils.py * Fix LocalBuilder * Fix LocalRunner * Add ansor.auto_schedule() API; First AutoSchedule working version(neo-ai#8) * Add basic Python support for ansor.auto_schedule * Update AutoSchedule API * Bug fix for get the attach point of a fused iter * Update UT after infer bug fix * Bug fix & Add python serialization API (neo-ai#10) * Delete C++ UT hack since Python is ready * Add ndarray.non_empty * Update Serialization python API * Improve code style, python wrapper and test cases (neo-ai#11) * Update c++ code style and unit test * Update python State wrapper and test cases * fix unit tests * Add RPCRunner & OpenCL/CUDA test (neo-ai#12) * Add RPCRunner & OpenCL search test * Add CUDA search test * Add RPCRunner test * rebase to upstream/master * Add Ansor basic tutorial (neo-ai#13) * Add basic tutorial * migrate feature extraction (neo-ai#14) * Add XGBModel & RPCRunnerWarpper (neo-ai#15) * Add XGBModel & RPCRunnerWarpper * Revert "Add Parallel Granularity Mutation" * Migrate workload_registry.py (neo-ai#16) * add workload registry * update * update * add task scheduler (neo-ai#17) * Add conv2d cuda tutorial with workload registry (neo-ai#18) * add tune_test.py (the old tune_wkl.py) (neo-ai#19) * add tune_test.py (the old tune_wkl.py) * update * fix measure * fix for gpu * Code refine for tune_test.py & Add a pre load callback (neo-ai#20) * Bug fix for tutorials * Add PreLoadMeasuredStates * Add search_callback support for task tuner * Code refine for tune_test.py * Update * Update * Update * Update * Bug fix * Add python custom sketch rule (neo-ai#21) * Add custom sketch rule * Bug fix * Ansor Relay Integration (without layout rewrite) (neo-ai#22) * relay integration * Add tune_op_subgraph.py & Some code clean for tune_network.py (neo-ai#23) * Add single op tune scripts * Add tune subgraph support * Merge all op & all subgraph to one file * Rename file * add explicit_unroll_max_extent (neo-ai#25) * Add Index simplification & API update (neo-ai#26) * Add vectorized cooperative_fetching test * Update math simplify for vectorized CF * File rename * Update tune_network * API update * Update PreLoadMeasuredStates & Some bug fix (neo-ai#27) * Add a threading wrapper to fix the test bug * Set default TVM_USE_AUTO_SCHEDULER to false * Update PreLoadMeasuredStates callback * Add tensorize step for loop_state (neo-ai#31) * Add tensorize step * State python api update (neo-ai#33) * Start to update api * Add compute_dag to state * API update * kernel layout rewrite (neo-ai#28) * kernel layout rewrite * remove some hacks * add defuse_ops pass and move kernel_layout_rewrite pass after fuse_ops pass * set TVM_RELAY_DISABLE_BUILD_CACHE for task extraction and prepare_layout_rewrite * [cache flush] port cache flush to ansor (neo-ai#32) * Improve relay integration (neo-ai#34) * tmp checkpoint * Improve relay integration * Improve relay integration * Fix xgb error & Simplify dispatcher (neo-ai#35) * Rename "MetaTileRewritePolicy" to "SketchPolicy". (neo-ai#36) * Rename "MetaTileRewritePolicy" to "SketchPolicy". * Add a new class for auto_unroll_max_step, storage_offset in StageNode * fix tune_op_subgraph.py * rebase * Migrate all node::make to noderef's construct function (neo-ai#37) * Start to move xxxnode::make to noderef() * Update * Update * Finish transform_step * Finish comute dag & auto schedule * Update * Update * Update * Update * Update * Code refine * Code refine * Code refine * Update * Update * Some lint fix & Recover the double constructor of tvm::PrimExpr (neo-ai#39) * lint fix * clang-format-fix * pylint fix * Update * Recover the double constructor of tvm::PrimExpr * Fix pylint * pylint fix * pylint fix * Add MutateComputeLocation and MutateParallel in evolutionary search (neo-ai#40) * Add MutateComputeLocation and MutateParallel in evolutionary search * fix lint * Improve loop state python API (stage_tensors -> stage_ops) (neo-ai#41) * improve loop state python API (stage_tensors -> stage_ops) * fix * ComputeDAG bug fix & Add Custom TensorCore Matmul Example (neo-ai#42) * Bug Fix * Sample example of Custom TensorCore Matmul * Rever Commits, Start to build minimum Ansor system * Code clean for minimum Ansor system * Bug fix & Delete AccessAnalyzer * Delete attachmap & Code clean * Doc update Update statenode::stages from vector to Array * Headfile update & Python doc update * clang-format fix * pylint fix * Update * Doc update * Update * Bug fix after code merge to the new master * clang-format fix * Update * Update * Update std::vector to Array; Update verbosity setting; Some commemts addressed * std::vector->Array & std::string->String * Add init_state to ComputeDAG * Update * Update some unordered_map to Map * clang-format fix * Comments addressed Delete ReplayAndInferBound Delete ReplaySteps & InferBoundCommon * Lint fix * Update * Update * Update * Update * Update * Update * Update * Update * Update * Rename ansor namespace to auto_schedule * Update * Rename ThreadPool to ParallelFor * Add parallel_for * Remove ThreadPool * Update python/tvm/auto_schedule/auto_schedule.py * trigger CI Co-authored-by: Lianmin Zheng <[email protected]> Co-authored-by: Minmin Sun (孙敏敏) <[email protected]> Co-authored-by: Zhao Wu <[email protected]>

…generating (apache#5962) * Code migration Start (#1) * Init commit: Code migration Start * Add loop_state.cc/h * Add ComputeDAG basic test * Split transform_step out & Update more UTs (#3) * Split transform_step out * Update GetProducers & GetConsumers * Update UTs * Add UT for CacheReadWrite & Some bug fix * Add search_task, measure and serialization (#4) * Add FollowSplit & FollowFusedSplit tests * Update dag.InferBound & its UT * Add search_task, measure and serialization * Update Serialization UT * Add MetaTileRewritePolicy (#5) * Add feature * Add cost_model, meta_tile_rewrite_policy * Add MetaTileRewritePolicy basic UT * Basic Python API for State (#6) * Add Basic Python API for State * Add UTs for State * Add Python API: Measure & Task (#7) * Update the return value of state operation * Add task * Copy measure.py & utils.py * Fix LocalBuilder * Fix LocalRunner * Add ansor.auto_schedule() API; First AutoSchedule working version(#8) * Add basic Python support for ansor.auto_schedule * Update AutoSchedule API * Bug fix for get the attach point of a fused iter * Update UT after infer bug fix * Bug fix & Add python serialization API (#10) * Delete C++ UT hack since Python is ready * Add ndarray.non_empty * Update Serialization python API * Improve code style, python wrapper and test cases (#11) * Update c++ code style and unit test * Update python State wrapper and test cases * fix unit tests * Add RPCRunner & OpenCL/CUDA test (#12) * Add RPCRunner & OpenCL search test * Add CUDA search test * Add RPCRunner test * rebase to upstream/master * Add Ansor basic tutorial (#13) * Add basic tutorial * migrate feature extraction (#14) * Add XGBModel & RPCRunnerWarpper (#15) * Add XGBModel & RPCRunnerWarpper * Revert "Add Parallel Granularity Mutation" * Migrate workload_registry.py (#16) * add workload registry * update * update * add task scheduler (#17) * Add conv2d cuda tutorial with workload registry (#18) * add tune_test.py (the old tune_wkl.py) (#19) * add tune_test.py (the old tune_wkl.py) * update * fix measure * fix for gpu * Code refine for tune_test.py & Add a pre load callback (#20) * Bug fix for tutorials * Add PreLoadMeasuredStates * Add search_callback support for task tuner * Code refine for tune_test.py * Update * Update * Update * Update * Bug fix * Add python custom sketch rule (#21) * Add custom sketch rule * Bug fix * Ansor Relay Integration (without layout rewrite) (#22) * relay integration * Add tune_op_subgraph.py & Some code clean for tune_network.py (#23) * Add single op tune scripts * Add tune subgraph support * Merge all op & all subgraph to one file * Rename file * add explicit_unroll_max_extent (#25) * Add Index simplification & API update (#26) * Add vectorized cooperative_fetching test * Update math simplify for vectorized CF * File rename * Update tune_network * API update * Update PreLoadMeasuredStates & Some bug fix (#27) * Add a threading wrapper to fix the test bug * Set default TVM_USE_AUTO_SCHEDULER to false * Update PreLoadMeasuredStates callback * Add tensorize step for loop_state (#31) * Add tensorize step * State python api update (#33) * Start to update api * Add compute_dag to state * API update * kernel layout rewrite (#28) * kernel layout rewrite * remove some hacks * add defuse_ops pass and move kernel_layout_rewrite pass after fuse_ops pass * set TVM_RELAY_DISABLE_BUILD_CACHE for task extraction and prepare_layout_rewrite * [cache flush] port cache flush to ansor (#32) * Improve relay integration (#34) * tmp checkpoint * Improve relay integration * Improve relay integration * Fix xgb error & Simplify dispatcher (#35) * Rename "MetaTileRewritePolicy" to "SketchPolicy". (#36) * Rename "MetaTileRewritePolicy" to "SketchPolicy". * Add a new class for auto_unroll_max_step, storage_offset in StageNode * fix tune_op_subgraph.py * rebase * Migrate all node::make to noderef's construct function (#37) * Start to move xxxnode::make to noderef() * Update * Update * Finish transform_step * Finish comute dag & auto schedule * Update * Update * Update * Update * Update * Code refine * Code refine * Code refine * Update * Update * Some lint fix & Recover the double constructor of tvm::PrimExpr (#39) * lint fix * clang-format-fix * pylint fix * Update * Recover the double constructor of tvm::PrimExpr * Fix pylint * pylint fix * pylint fix * Add MutateComputeLocation and MutateParallel in evolutionary search (#40) * Add MutateComputeLocation and MutateParallel in evolutionary search * fix lint * Improve loop state python API (stage_tensors -> stage_ops) (#41) * improve loop state python API (stage_tensors -> stage_ops) * fix * ComputeDAG bug fix & Add Custom TensorCore Matmul Example (#42) * Bug Fix * Sample example of Custom TensorCore Matmul * Rever Commits, Start to build minimum Ansor system * Code clean for minimum Ansor system * Bug fix & Delete AccessAnalyzer * Delete attachmap & Code clean * Doc update Update statenode::stages from vector to Array * Headfile update & Python doc update * clang-format fix * pylint fix * Update * Doc update * Update * Bug fix after code merge to the new master * clang-format fix * Update * Update * Update std::vector to Array; Update verbosity setting; Some commemts addressed * std::vector->Array & std::string->String * Add init_state to ComputeDAG * Update * Update some unordered_map to Map * clang-format fix * Comments addressed Delete ReplayAndInferBound Delete ReplaySteps & InferBoundCommon * Lint fix * Update * Update * Update * Update * Update * Update * Update * Update * Update * Rename ansor namespace to auto_schedule * Update * Rename ThreadPool to ParallelFor * Add parallel_for * Remove ThreadPool * Update python/tvm/auto_schedule/auto_schedule.py * trigger CI Co-authored-by: Lianmin Zheng <[email protected]> Co-authored-by: Minmin Sun (孙敏敏) <[email protected]> Co-authored-by: Zhao Wu <[email protected]>

masahi and others added 30 commits April 10, 2019 10:06

[Relay] Add support for TupleGetItem in op fusion (apache#2914)

75fb75a

[Relay, TOPI] Deformable conv2d (apache#2908)

4f2b9cf

* [Relay, TOPI] Add deformable conv2d * Moved to op level2 * Fix lint * Moved to level2 & bug fix * Update comments * Disabled flaky test of conv2d

TVM debugresult dump to Chrome Tracing (apache#2922)

eccb928

[Relay] add test for second order ad (apache#2754)

09bd436

* do second order * add comment * better name * use tvm assert all close * refire ci

Revert "[Relay] add test for second order ad (apache#2754)" (apache#2926

a84b8f0

) This reverts commit f5ca991.

[Tutorial] Cache the test data in tutorial (apache#2923)

a38aa40

[AUTOTVM] Refactor measure build func (apache#2927)

4260e42

Fix intersect of modular set (apache#2904)

d10974a

Fix comment bugs and code style

[Relay, OpFusion] Fix handling TupleGetItem for nested tuples (apache…

5d9412e

…#2929)

Consistent result of DetectLinearEquation() when an empy vars is pass…

7292ae6

…ed (apache#2860)

[FRONTEND][ONNX] Some bug fixes and Shape operator fixed for relay. (a…

5c8b322

…pache#2850) * [FRONTEND][ONNX] Some bug fixes and Shape operator fixed for relay. * * test cases * * ci error

Outdated renaming for flatten in ONNX converter (apache#2843)

f077351

[FRONTEND][TENSORFLOW] bug fix for tensorflow official slim models. (a…

30f0c58

…pache#2864) * [FRONTEND][TENSORFLOW] bug fix for tensorflow official slim models. * * review comments

Fix vcvtph2ps codegen (apache#2925)

189863f

[ARITH] Analyzer CanonicalSimplifier (apache#2891)

083021c

Update schedule_dataflow_rewrite.cc (apache#2934)

c97007a

[TEXPR][PASS] Fix thread all reduce to avoid write after read hazzard (…

987a7d0

…apache#2937)

Fix PRC typo (apache#2939)

25c72c9

[DOCKER][FRONTEND] Run DarkNet tests (apache#2673)

ed8da06

* [DOCKER][FRONTEND] Run DarkNet tests * update tests to pass CI

[Relay] Add foldr1 (apache#2928)

53a4824

[Relay/TOPI][OP] Add clip and wrap mode support in take (apache#2858)

1ae30d7

* Update take * Add special case for canonical simplify and fix test cases * Use lower case for wrap and clip * remove unnecssary lower * Fix mxnet converter for take * fix

[Relay, Quantization] Quantize all fields of concatenate (apache#2913)

bdfedd3

Fix makedirs() condition in contrib. (apache#2942)

95d948d

[Relay][OP] Gather_nd exposed to relay (apache#2945)

c867317

* gather_nd added * gather_nd test added * more test added * fix lint * fix build error * fix lint * comments addressed

Add missing #!/bin/bash directive. (apache#2951)

7accfed

[Bugfix] Bilinear resize bug fix from PR apache#2777 (apache#2857)

1ea177f

* error fixed * rename * solve conlicts with master * more test added * fix error * remove test * comment addressed

[Relay][OP] Fix bias_add default axis (apache#2829)

5ce9d59

* Fix bias add default axis * update * Fix canonicalize ops for bias_add

[Rust] Unify types between bindings and pure Rust impl (apache#2616)

746ac1b

[Relay][Frontend] Support TF Gather (apache#2935)

abf50dc

* [Relay][Frontend] Support TF Gather * fix comments

ehsanmok and others added 19 commits April 10, 2019 10:06

[RUST] Remove empty ty.rs (apache#2958)

3ce857a

fix undefined reference to dlopen, etc (apache#2957)

7f358f8

[TOPI] bitserial_conv2d move to autotvm template and updates (apache#…

64750b1

…2819)

Removed std::unary_function because it is deprecated and removed in n…

e704f66

…ewer c++(https://en.cppreference.com/w/cpp/utility/functional/unary_function) (apache#2962)

[REFACTOR] Remove stale verilog generator (apache#2964)

3493a68

[tvm4j] provide error msg for failure function call (apache#2967)

ac3aa77

[TVM][Bugfix] Fix missing runtime:: (apache#2966)

cd74afe

[WIP][AUTOTVM][TOPI] Port x86 NCHWc to AutoTVM for Task Extraction (a…

db9cf43

…pache#2664) [AUTOTVM][TOPI] Port x86 NCHWc to AutoTVM for Task Extraction

Rustify PackedFunc & Friends (apache#2969)

048db2a

[Relay][RFC][Fix] Rename RelayPrint to AsText (apache#2984)

3c77297

[Relay] InferCorrectLayout for strided_slice & min_num_branches optio…

1d2e92d

…n in CombineParallelConv2D (apache#2961) * [Relay] InferCorrectLayout for strided_slice * Add min_num_branches option to CombineParallelConv2D * Return undef if original layout contains splitted axes

[Relay] Add expr_visitor, fix expr_functor exponential blowup problem (…

b77e3d5

…apache#2988) * save * lint

Update let_list.h (apache#2987)

5d3f7ee

Expose backtrace symbols in Debug mode (apache#3001)

8991f73

add output format to ndk build func (apache#2999)

d0b093a

fix java checkstyle version (apache#2998)

389ff44

Update dmlc_tvm_commit_id

e860938

wweic requested review from yongwww and zhiics April 11, 2019 04:24

zhiics approved these changes Apr 11, 2019

View reviewed changes

yongwww approved these changes Apr 11, 2019

View reviewed changes

wweic merged commit 1c2a22a into neo-ai:dev Apr 11, 2019

wweic deleted the wweic-sync-20190410 branch April 11, 2019 18:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dmlc/tvm sync 20190410 #26

dmlc/tvm sync 20190410 #26

wweic commented Apr 10, 2019

dmlc/tvm sync 20190410 #26

dmlc/tvm sync 20190410 #26

Conversation

wweic commented Apr 10, 2019