Commit 05f3aba
Squashed commit
[Meta Schedule][M3c] Schedule Rules, Mutator & Postprocs (apache#485)
[Meta Schedule][M3c] PostOrderApply (apache#486)
Fix Post Order Apply (apache#490)
[MetaSchedule] Relay Integration (apache#489)
[M3c][Meta Schedule] Add Trace Correctness Test for PostOrderApply (apache#492)
Fix replay trace. (apache#493)
[M3c][Meta Schedule] Implement the Replay Func class. (apache#495)
[PR] Test script for meta-schedule task extraction. Interface to load… (apache#494)
[Meta Schedule Refactor] Get child blocks (apache#500)
Read-at && Write-at (apache#497)
[M3c][Meta Schedule] Measure Callbacks (apache#498)
[Bug] Fix Infinite Loop Caused When Calling Methods Not Overrided In PyClass (apache#496)
[MetaSchedule] Sample-Perfect-Tile (apache#501)
[MetaSchedule] TE Workloads (apache#502)
[TensorIR] GetProducer, GetConsumer (apache#506)
[MetaScheduleRefactor] Annotate&Unannotate (apache#505)
[MetaSchedule] Multi-Level-Tiling & Auto-Inline (apache#503)
[Tests] Add unittests for auto-inline and multi-level-tiling (apache#508)
[Meta Schedule] Minor Fixes (apache#507)
[MetaSchedule] Rewrite Cooperative-Fetching / Unbound-Block / Reduction-Block (apache#509)
[MetaSchedule] Rewrite Parallel-Vectorize-Unroll / Verify-GPU / Disallow-Dynamic-Loops (apache#499)
[Meta Schedule] Add Helper Function & Minor Modification (apache#512)
[MetaSchedule] Test for Rewrite Parallel-Vectorize-Unroll (apache#513)
[Meta Schedule] Feature Extractor & Cost Model (apache#510)
Blockize & Tensorize (apache#514)
Layout Rewriting: Suggest-Index-Map (apache#520)
[MetaSchedule] Parallel-Vectorize-Unroll & Random-Compute-Location (apache#516)
[Meta Schedule] Per-Store-Feature (apache#521)
Add traced schedule for blockize & tensorize (apache#526)
[Meta Schedule] Add XGBoost Model & Random Model (apache#519)
User-Interface: Tune-TIR (apache#525)
User-Interface: Tune-TE (apache#527)
[Minor] More logging on python (apache#528)
Get CUDA tuning working (apache#529)
[MetaSchedule] TensorRT BYOC (apache#518)
[BugFix] LocalBuilder API (apache#531)
[Meta Schedule] Add Cost Model Update Measure Callback (apache#530)
[Bugfix] BuilderInput with default params (apache#532)
[MetaSchedule] Mutator-Tile-Size, Mutate-Parallel, Mutate-Unroll (apache#534)
[Meta Schedule] Evolutionary Search (apache#522)
[BugFix] Remove duplicated definition of MakeMultinomialSampler (apache#535)
[Meta Schedule] Fix some bugs (apache#537)
Initiate Experiments for CPU Performance Alignment with Ansor (apache#538)
[Meta Schedule] Tweak experiment scripts (apache#539)
[Meta Schedule] Initiate experiments on CUDA (apache#540)
[TIR][Schedule] Buffer transform (apache#523)
Auto Tensor Core (apache#524)
Working on Evo Search (apache#542)
[Meta Schedule] Add Replay Tuning Interface (apache#543)
Evolutionary Search on CPU (apache#544)
Misc improvement over the error message (apache#545)
[TIR][Schedule] Software pipelining (apache#533)
[Meta Schedule Refactor] fixing unit tests (apache#547)
[MetaSchedule] Mutator-Compute-Location (apache#548)
Misc Improvement of Evolutionary Search (apache#549)
Hotfix for software pipeline (apache#552)
Misc Improvement (apache#550)
Co-authored-by: Siyuan Feng <[email protected]>
Co-authored-by: Bohan Hou <[email protected]>
Co-authored-by: Hongyi Jin <[email protected]>
Co-authored-by: Ruihang Lai <[email protected]>
Co-authored-by: Junru Shao <[email protected]>
Co-authored-by: Wuwei Lin <[email protected]>
Co-authored-by: Sunghyun Park <[email protected]>
Co-authored-by: Xiyou Zhou <[email protected]>
Squashed commit
[Meta Schedule][M3c] Schedule Rules, Mutator & Postprocs (apache#485)
[Meta Schedule][M3c] PostOrderApply (apache#486)
Fix Post Order Apply (apache#490)
[MetaSchedule] Relay Integration (apache#489)
[M3c][Meta Schedule] Add Trace Correctness Test for PostOrderApply (apache#492)
Fix replay trace. (apache#493)
[M3c][Meta Schedule] Implement the Replay Func class. (apache#495)
[PR] Test script for meta-schedule task extraction. Interface to load… (apache#494)
[Meta Schedule Refactor] Get child blocks (apache#500)
Read-at && Write-at (apache#497)
[M3c][Meta Schedule] Measure Callbacks (apache#498)
[Bug] Fix Infinite Loop Caused When Calling Methods Not Overrided In PyClass (apache#496)
[MetaSchedule] Sample-Perfect-Tile (apache#501)
[MetaSchedule] TE Workloads (apache#502)
[TensorIR] GetProducer, GetConsumer (apache#506)
[MetaScheduleRefactor] Annotate&Unannotate (apache#505)
[MetaSchedule] Multi-Level-Tiling & Auto-Inline (apache#503)
[Tests] Add unittests for auto-inline and multi-level-tiling (apache#508)
[Meta Schedule] Minor Fixes (apache#507)
[MetaSchedule] Rewrite Cooperative-Fetching / Unbound-Block / Reduction-Block (apache#509)
[MetaSchedule] Rewrite Parallel-Vectorize-Unroll / Verify-GPU / Disallow-Dynamic-Loops (apache#499)
[Meta Schedule] Add Helper Function & Minor Modification (apache#512)
[MetaSchedule] Test for Rewrite Parallel-Vectorize-Unroll (apache#513)
[Meta Schedule] Feature Extractor & Cost Model (apache#510)
Blockize & Tensorize (apache#514)
Layout Rewriting: Suggest-Index-Map (apache#520)
[MetaSchedule] Parallel-Vectorize-Unroll & Random-Compute-Location (apache#516)
[Meta Schedule] Per-Store-Feature (apache#521)
Add traced schedule for blockize & tensorize (apache#526)
[Meta Schedule] Add XGBoost Model & Random Model (apache#519)
User-Interface: Tune-TIR (apache#525)
User-Interface: Tune-TE (apache#527)
[Minor] More logging on python (apache#528)
Get CUDA tuning working (apache#529)
[MetaSchedule] TensorRT BYOC (apache#518)
[BugFix] LocalBuilder API (apache#531)
[Meta Schedule] Add Cost Model Update Measure Callback (apache#530)
[Bugfix] BuilderInput with default params (apache#532)
[MetaSchedule] Mutator-Tile-Size, Mutate-Parallel, Mutate-Unroll (apache#534)
[Meta Schedule] Evolutionary Search (apache#522)
[BugFix] Remove duplicated definition of MakeMultinomialSampler (apache#535)
[Meta Schedule] Fix some bugs (apache#537)
Co-authored-by: Siyuan Feng <[email protected]>
Co-authored-by: Bohan Hou <[email protected]>
Co-authored-by: Hongyi Jin <[email protected]>
Co-authored-by: Ruihang Lai <[email protected]>
Co-authored-by: Junru Shao <[email protected]>
Co-authored-by: Wuwei Lin <[email protected]>
Co-authored-by: Sunghyun Park <[email protected]>
Co-authored-by: Xiyou Zhou <[email protected]>
Initiate Experiments for CPU Performance Alignment with Ansor (apache#538)
* ...
* update
* update
* print
* more
[Meta Schedule] Tweak experiment scripts (apache#539)
[Meta Schedule] Initiate experiments on CUDA (apache#540)
* [Meta Schedule] Initiate experiments on CUDA
* ...
* fix boolean printing
Auto Tensor Core (apache#524)
Working on Evo Search (apache#542)
Squashed commit
[Meta Schedule][M3c] Schedule Rules, Mutator & Postprocs (apache#485)
[Meta Schedule][M3c] PostOrderApply (apache#486)
Fix Post Order Apply (apache#490)
[MetaSchedule] Relay Integration (apache#489)
[M3c][Meta Schedule] Add Trace Correctness Test for PostOrderApply (apache#492)
Fix replay trace. (apache#493)
[M3c][Meta Schedule] Implement the Replay Func class. (apache#495)
[PR] Test script for meta-schedule task extraction. Interface to load… (apache#494)
[Meta Schedule Refactor] Get child blocks (apache#500)
Read-at && Write-at (apache#497)
[M3c][Meta Schedule] Measure Callbacks (apache#498)
[Bug] Fix Infinite Loop Caused When Calling Methods Not Overrided In PyClass (apache#496)
[MetaSchedule] Sample-Perfect-Tile (apache#501)
[MetaSchedule] TE Workloads (apache#502)
[TensorIR] GetProducer, GetConsumer (apache#506)
[MetaScheduleRefactor] Annotate&Unannotate (apache#505)
[MetaSchedule] Multi-Level-Tiling & Auto-Inline (apache#503)
[Tests] Add unittests for auto-inline and multi-level-tiling (apache#508)
[Meta Schedule] Minor Fixes (apache#507)
[MetaSchedule] Rewrite Cooperative-Fetching / Unbound-Block / Reduction-Block (apache#509)
[MetaSchedule] Rewrite Parallel-Vectorize-Unroll / Verify-GPU / Disallow-Dynamic-Loops (apache#499)
[Meta Schedule] Add Helper Function & Minor Modification (apache#512)
[MetaSchedule] Test for Rewrite Parallel-Vectorize-Unroll (apache#513)
[Meta Schedule] Feature Extractor & Cost Model (apache#510)
Blockize & Tensorize (apache#514)
Layout Rewriting: Suggest-Index-Map (apache#520)
Co-authored-by: Siyuan Feng <[email protected]>
Co-authored-by: Bohan Hou <[email protected]>
Co-authored-by: Hongyi Jin <[email protected]>
Co-authored-by: Ruihang Lai <[email protected]>
Co-authored-by: Junru Shao <[email protected]>
Co-authored-by: Wuwei Lin <[email protected]>
Co-authored-by: Sunghyun Park <[email protected]>
[MetaSchedule] Parallel-Vectorize-Unroll & Random-Compute-Location (apache#516)
* parallel vectorize unroll & random compute location
* rebased
[Meta Schedule] Per-Store-Feature (apache#521)
[Meta Schedule] Add XGBoost Model & Random Model (apache#519)
* Squashed commit
[Meta Schedule][M3c] Schedule Rules, Mutator & Postprocs (apache#485)
[Meta Schedule][M3c] PostOrderApply (apache#486)
Fix Post Order Apply (apache#490)
[MetaSchedule] Relay Integration (apache#489)
[M3c][Meta Schedule] Add Trace Correctness Test for PostOrderApply (apache#492)
Fix replay trace. (apache#493)
[M3c][Meta Schedule] Implement the Replay Func class. (apache#495)
[PR] Test script for meta-schedule task extraction. Interface to load… (apache#494)
[Meta Schedule Refactor] Get child blocks (apache#500)
Read-at && Write-at (apache#497)
[M3c][Meta Schedule] Measure Callbacks (apache#498)
[Bug] Fix Infinite Loop Caused When Calling Methods Not Overrided In PyClass (apache#496)
[MetaSchedule] Sample-Perfect-Tile (apache#501)
[MetaSchedule] TE Workloads (apache#502)
Co-authored-by: Siyuan Feng <[email protected]>
Co-authored-by: Bohan Hou <[email protected]>
Co-authored-by: Hongyi Jin <[email protected]>
Co-authored-by: Ruihang Lai <[email protected]>
Co-authored-by: Junru Shao <[email protected]>
Co-authored-by: Wuwei Lin <[email protected]>
Co-authored-by: Sunghyun Park <[email protected]>
[TensorIR] GetProducer, GetConsumer (apache#506)
[MetaScheduleRefactor] Annotate&Unannotate (apache#505)
* annotate
* annotate
* lint
* test
* fix
* fix
* fix
[MetaSchedule] Rewrite Cooperative-Fetching / Unbound-Block / Reduction-Block (apache#509)
Fix sttr func & schedule naming.
Fix schedule -> sch.
Add feature extractor.
Fix init.
Add cost model.
Remove unused include.
[MetaSchedule] Rewrite Parallel-Vectorize-Unroll / Verify-GPU / Disallow-Dynamic-Loops (apache#499)
* wip
fix
* revoke change to gallery
* split postprocessors to separate files
* rename attrs
* minor
* minor tweak on utils.h
* refactor disallow-dynamic-loop
* refactor verify_gpu_code
* succesfully give up refactoring parallelize-vectorize-unroll
* python structuring
* unittests
Co-authored-by: Junru Shao <[email protected]>
Fix issues.
Fix init.
Finish random model part.
Finish xgb model.
Minor fix.
Rebase.
Add init.
Await refactor of callback.
Update a bit on the test case.
Move impos.
Minor fix.
More fixes.
Remove unused import.
Fix per store feature test.
Update model save / load.
* Fix model save / load with tar.
* Fix issues.
* Remove dup.
Co-authored-by: Junru Shao <[email protected]>
User-Interface: Tune-TIR (apache#525)
* User-Interface: Tune-TIR
* fix fix fix
User-Interface: Tune-TE (apache#527)
* fix a lot of issues
* Add tune-te
Get CUDA tuning working (apache#529)
[Meta Schedule] Evolutionary Search (apache#522)
* Checkpoint.
Fix cost model comment.
Finish evolutionary seaarch.
Remove extra code.
Fix compile.
Add comments.
Add python part.
Ad test.
Update other files & comments.
* Squashed commit
[Meta Schedule][M3c] Schedule Rules, Mutator & Postprocs (apache#485)
[Meta Schedule][M3c] PostOrderApply (apache#486)
Fix Post Order Apply (apache#490)
[MetaSchedule] Relay Integration (apache#489)
[M3c][Meta Schedule] Add Trace Correctness Test for PostOrderApply (apache#492)
Fix replay trace. (apache#493)
[M3c][Meta Schedule] Implement the Replay Func class. (apache#495)
[PR] Test script for meta-schedule task extraction. Interface to load… (apache#494)
[Meta Schedule Refactor] Get child blocks (apache#500)
Read-at && Write-at (apache#497)
[M3c][Meta Schedule] Measure Callbacks (apache#498)
[Bug] Fix Infinite Loop Caused When Calling Methods Not Overrided In PyClass (apache#496)
[MetaSchedule] Sample-Perfect-Tile (apache#501)
[MetaSchedule] TE Workloads (apache#502)
Co-authored-by: Siyuan Feng <[email protected]>
Co-authored-by: Bohan Hou <[email protected]>
Co-authored-by: Hongyi Jin <[email protected]>
Co-authored-by: Ruihang Lai <[email protected]>
Co-authored-by: Junru Shao <[email protected]>
Co-authored-by: Wuwei Lin <[email protected]>
Co-authored-by: Sunghyun Park <[email protected]>
* [TensorIR] GetProducer, GetConsumer (apache#506)
* [MetaScheduleRefactor] Annotate&Unannotate (apache#505)
* annotate
* annotate
* lint
* test
* fix
* fix
* fix
* [MetaSchedule] Rewrite Cooperative-Fetching / Unbound-Block / Reduction-Block (apache#509)
* Blockize & Tensorize (apache#514)
* Blockize & Tensorize
* Update tensor intrin
* Fix blockized & Recalculate affine flags
* Cleanup utils.cc
* Add test cases of blockize
* Re-enable affine flag checking
* Checkpoint.
Fix cost model comment.
Finish evolutionary seaarch.
Remove extra code.
Fix compile.
Add comments.
Add python part.
Ad test.
Update other files & comments.
Fix random seed bug.
Minor fix.
Fix num-cores.
Add docs.
Check point.
Add max_fail_cnt.
Minor fix.
Minor fix.
Segfault.
Fix pointers to trace.
Test fix.
Remove measure callbacks.
Refactor a bit.
Split function.
Adjust variable name.
Minor fixes.
Add mutator probs to TuneContext.
Add token.
Fix loops.
Remove include.
Add has workload for database.
Add check.
Add concurrent bitmask.
* Fix TuneContext.
* Fix haash & stuff.
* Modifyy shash.
* Remove trace field.
* Minor fix.
* Fix cbmask.
* Fix numbers.
Co-authored-by: Junru Shao <[email protected]>
Co-authored-by: Siyuan Feng <[email protected]>
Co-authored-by: Bohan Hou <[email protected]>
Co-authored-by: Hongyi Jin <[email protected]>
Co-authored-by: Ruihang Lai <[email protected]>
Co-authored-by: Wuwei Lin <[email protected]>
Co-authored-by: Sunghyun Park <[email protected]>
[BugFix] Remove duplicated definition of MakeMultinomialSampler (apache#535)
Tune relay.
Further add interface.
Remove unused import
Fix rebase.
Add task name dispatch.
Add task deduplication.
Rename extract_task to extract_task_from_relay
Remove duplicate function def.
Minor fix.1 parent 0f3892b commit 05f3aba
File tree
5 files changed
+28
-17
lines changed- python/tvm/meta_schedule
- src/tir/schedule
- tests/python/unittest
5 files changed
+28
-17
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
177 | 177 | | |
178 | 178 | | |
179 | 179 | | |
180 | | - | |
| 180 | + | |
181 | 181 | | |
182 | 182 | | |
183 | 183 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
19 | 19 | | |
20 | 20 | | |
21 | 21 | | |
| 22 | + | |
22 | 23 | | |
23 | 24 | | |
24 | 25 | | |
25 | | - | |
| 26 | + | |
26 | 27 | | |
27 | 28 | | |
28 | 29 | | |
| |||
650 | 651 | | |
651 | 652 | | |
652 | 653 | | |
653 | | - | |
| 654 | + | |
654 | 655 | | |
655 | 656 | | |
656 | 657 | | |
657 | 658 | | |
| 659 | + | |
658 | 660 | | |
659 | 661 | | |
660 | 662 | | |
| |||
664 | 666 | | |
665 | 667 | | |
666 | 668 | | |
667 | | - | |
| 669 | + | |
668 | 670 | | |
669 | 671 | | |
670 | 672 | | |
671 | 673 | | |
672 | 674 | | |
673 | 675 | | |
674 | 676 | | |
| 677 | + | |
| 678 | + | |
| 679 | + | |
| 680 | + | |
| 681 | + | |
| 682 | + | |
| 683 | + | |
| 684 | + | |
| 685 | + | |
| 686 | + | |
| 687 | + | |
| 688 | + | |
| 689 | + | |
| 690 | + | |
| 691 | + | |
| 692 | + | |
| 693 | + | |
| 694 | + | |
675 | 695 | | |
676 | 696 | | |
677 | | - | |
| 697 | + | |
678 | 698 | | |
679 | 699 | | |
680 | 700 | | |
| |||
684 | 704 | | |
685 | 705 | | |
686 | 706 | | |
687 | | - | |
| 707 | + | |
688 | 708 | | |
689 | 709 | | |
690 | 710 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
28 | 28 | | |
29 | 29 | | |
30 | 30 | | |
31 | | - | |
32 | | - | |
33 | | - | |
34 | | - | |
35 | | - | |
36 | | - | |
37 | | - | |
38 | | - | |
39 | | - | |
40 | 31 | | |
41 | 32 | | |
42 | 33 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
112 | 112 | | |
113 | 113 | | |
114 | 114 | | |
115 | | - | |
| 115 | + | |
116 | 116 | | |
117 | 117 | | |
118 | 118 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
91 | 91 | | |
92 | 92 | | |
93 | 93 | | |
94 | | - | |
| 94 | + | |
95 | 95 | | |
96 | 96 | | |
97 | 97 | | |
| |||
0 commit comments