Dumping IR after every optimization pass #528

akroviakov · 2023-06-14T17:17:26Z

This PR introduces a new config flag, dump-after-all. When set to 1, it dumps module-level IR before and after optimizations (IR_UNOPT/IR_OPT). When set to 2, it dumps IR before optimizations, after each transformation pass (IR_AFTER_#passCounter_#passName), and after all transformations.

The intent is to make debugging problematic queries easier.

With the new pass builder and pass managers (e.g., FunctionPassManager), transformation passes now run more often but at a finer granularity: earlier a pass ran once for all functions, now it runs once per function. As a result we cannot "summarize" passes (e.g., after GVN, after SROA), because the entire function transformation pipeline runs in full for each function. Expect a very large number of files even for seemingly simple queries when the severity is set to 2.
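The naming scheme and the per-function growth in file count described above can be sketched in plain C++. This is a minimal illustration, not the code from the PR: `dump_file_name` and `expected_dumps` are hypothetical helpers, the pass names are placeholders, and the sketch assumes only function-level passes contribute numbered dumps.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper: builds a dump name following the
// IR_AFTER_#passCounter_#passName pattern from the PR description.
std::string dump_file_name(std::size_t pass_counter, const std::string& pass_name) {
  return "IR_AFTER_" + std::to_string(pass_counter) + "_" + pass_name;
}

// Lists the dumps one compilation would produce at a given level.
// Assumption: the function pipeline runs in full for every function,
// so each function contributes function_passes.size() numbered dumps.
std::vector<std::string> expected_dumps(int level,
                                        std::size_t num_functions,
                                        const std::vector<std::string>& function_passes) {
  std::vector<std::string> files;
  if (level <= 0) {
    return files;  // dumping disabled
  }
  files.push_back("IR_UNOPT");
  if (level >= 2) {
    std::size_t pass_counter = 1;  // the counter starts at 1
    for (std::size_t fn = 0; fn < num_functions; ++fn) {
      for (const std::string& pass : function_passes) {
        files.push_back(dump_file_name(pass_counter++, pass));
      }
    }
  }
  files.push_back("IR_OPT");
  return files;
}
```

Under these assumptions, a module with 3 functions and a 2-pass function pipeline already yields 8 files at level 2, versus 2 files at level 1.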

Example usage is in IntelGPUEnablingTest. For example:
./IntelGPUEnablingTest --gtest_filter=JoinTest.SimpleJoin --dump-after-all=1

ienkovich

It is nice in general to have the ability to dump LLVM IR. The existing IR channel wouldn't be very convenient if we dumped IR after each pass into the same file. On the other hand, it has a nice feature of creating a new file on each test run and it has a static counter so that I can easily find a proper module in logs.
How would these new dumps work in this situation? A simple scenario I'd like to have is to run the same test with different flags and then compare generated IRs.

ienkovich · 2023-06-14T18:30:06Z

omniscidb/Shared/Config.h

@@ -83,6 +83,7 @@ struct CodegenConfig {
  bool null_mod_by_zero = false;
  bool hoist_literals = true;
  bool enable_filter_function = true;
+  short dump_after_all_severity{0};


I'd propose moving it to the debug config structure. Also, rename it to dump_llvm_ir_after_each_pass or similar; a plain "dump" is too generic since we have different IRs on various levels.

ienkovich · 2023-06-14T18:32:50Z

omniscidb/QueryEngine/Compiler/HelperFunctions.cpp

+                                                llvm::Any IR,
+                                                const llvm::PreservedAnalyses&) -> void {
+    std::error_code ec;
+    llvm::raw_fd_ostream os("IR_AFTER_" + std::to_string(pass_counter++) + "_" +


Does it rewrite or append?
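For context on the question above: to my understanding, the two-argument `llvm::raw_fd_ostream(Filename, EC)` constructor creates the file or truncates an existing one, and appending requires passing `llvm::sys::fs::OF_Append` explicitly. The diff fragment is cut off, so which form the PR uses is not visible here. The difference between the two modes can be illustrated with the standard library alone; `write_dump` and `read_all` are hypothetical helpers, not part of the PR.

```cpp
#include <fstream>
#include <sstream>
#include <string>

// Writes `text` to `path`. With append == false an existing file is
// truncated first; with append == true the text is added at the end.
void write_dump(const std::string& path, const std::string& text, bool append) {
  const std::ios_base::openmode mode =
      append ? (std::ios::out | std::ios::app) : (std::ios::out | std::ios::trunc);
  std::ofstream os(path, mode);
  os << text;
}

// Reads the whole file back so the two modes can be compared.
std::string read_all(const std::string& path) {
  std::ifstream is(path);
  std::ostringstream buf;
  buf << is.rdbuf();
  return buf.str();
}
```

With truncation, a second dump to the same name replaces the first. With append, both end up in one file, which is the situation the per-pass file names are meant to avoid.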

ienkovich · 2023-06-14T18:35:47Z

omniscidb/Tests/IntelGPUEnablingTest.cpp

@@ -1001,6 +1001,13 @@ int main(int argc, char* argv[]) {
                     "Dump IR and PTX for all executed queries to file."
                     " Currently only supports single node tests.");

+  desc.add_options()("dump-after-all",


Can we have it in ConfigBuilder so that it would be more widely available? It was supposed to replace all these options in each separate test binary.

kurapov-peter

Looks good, two minor comments

kurapov-peter · 2023-07-05T17:04:02Z

omniscidb/ConfigBuilder/ConfigBuilder.cpp

+  opt_desc.add_options()("dump-after-all",
+                         po::value<short>(&config_->debug.dump_llvm_ir_after_each_pass)
+                             ->default_value(config_->debug.dump_llvm_ir_after_each_pass)
+                             ->implicit_value(0),


I'd expect the implicit value to be a non-zero value because if you specify the option it means you want the dumps to be generated.
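The point about the implicit value can be made concrete with a small model of how a Boost.Program_options-style parser picks the final value: the default applies when the option is absent, the implicit value when the option is given without a value, and the explicit value otherwise. `OptionOccurrence` and `resolve_option` are hypothetical names used only for this sketch; this is not Boost code.

```cpp
// Illustrative model of how an option appeared on the command line.
struct OptionOccurrence {
  bool present;             // was --dump-after-all given at all?
  bool has_explicit_value;  // was it given as --dump-after-all=N?
  short explicit_value;     // N, meaningful only if has_explicit_value
};

// Picks the effective value following default/implicit/explicit precedence.
short resolve_option(const OptionOccurrence& occ,
                     short default_value,
                     short implicit_value) {
  if (!occ.present) {
    return default_value;
  }
  if (!occ.has_explicit_value) {
    return implicit_value;
  }
  return occ.explicit_value;
}
```

In this model, an implicit value of 0 makes a bare `--dump-after-all` indistinguishable from omitting the flag, whereas an implicit value of 1 would enable the before/after dumps.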

kurapov-peter · 2023-07-05T17:06:55Z

omniscidb/QueryEngine/Compiler/HelperFunctions.cpp

+  size_t pass_counter{1};
+  std::string ir_dump_dir{};
+  if (co.dump_llvm_ir_after_each_pass) {
+    work_unit_meta.w_unit_counter++;


What's going to happen in the heterogeneous mode when you potentially have two compilations running simultaneously? Should the work_unit_meta live in the work unit?

akroviakov · 2023-07-07

Yes, the timestamp can be saved in the module for debug purposes and then queried in optimize_ir(). But there are many ways to reach optimize_ir(), so where would you embed the timestamp into the module?
E.g., you could embed it in Executor::compileWorkUnit(), but the CPU codegen path might go through Executor::reduceMultiDeviceResultSets() -> ResultSetReductionJIT::codegen() -> ..., bypassing Executor::compileWorkUnit(). How many other places would then need code that embeds the timestamp? Right now this seems to me to be the least amount of code that does the job, or am I missing something?
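One way to frame the concurrency concern raised in this thread is to hand each compilation its own dump state, with only the id allocation shared and made atomic. The sketch below is an assumption-laden illustration, not what the PR implements: `next_work_unit_id`, `IrDumpContext`, `make_dump_context`, and the `work_unit_N` directory naming are all hypothetical.

```cpp
#include <atomic>
#include <cstddef>
#include <string>

// Hypothetical id allocator: every call returns a distinct id, even when
// several compilations start at the same time on different threads.
std::size_t next_work_unit_id() {
  static std::atomic<std::size_t> counter{0};
  return counter.fetch_add(1, std::memory_order_relaxed) + 1;
}

// Per-compilation dump state, owned by the caller rather than shared.
struct IrDumpContext {
  std::size_t work_unit_id;
  std::size_t pass_counter;  // starts at 1, as pass_counter does in the diff
  std::string dir;           // illustrative directory layout
};

// Creates a fresh context with its own id, pass counter, and directory.
IrDumpContext make_dump_context(const std::string& base_dir) {
  const std::size_t id = next_work_unit_id();
  return IrDumpContext{id, 1, base_dir + "/work_unit_" + std::to_string(id)};
}
```

Because each context carries its own pass counter and directory, two simultaneous compilations would not interleave their numbered dumps. Where such a context would be created in the real codebase is exactly the open question discussed above.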

akroviakov requested a review from kurapov-peter June 14, 2023 17:17

ienkovich reviewed Jun 14, 2023

kurapov-peter suggested changes Jul 5, 2023

akroviakov force-pushed the akroviak/ir_dump branch from 1a4d539 to 037f2ef Compare July 7, 2023 18:21

dumping IR modules after passes

78a2a99

akroviakov force-pushed the akroviak/ir_dump branch from 037f2ef to 78a2a99 Compare August 9, 2023 12:01

kurapov-peter approved these changes Aug 9, 2023

kurapov-peter merged commit ad924b5 into main Aug 10, 2023

kurapov-peter deleted the akroviak/ir_dump branch August 10, 2023 14:13

kurapov-peter mentioned this pull request Aug 16, 2023

Dumping module IR after transformation passes through config flag #370

Closed

akroviakov Jul 7, 2023
