Skip to content

Commit 9fcebdc

Browse files
committed
remove path
1 parent dff30dc commit 9fcebdc

File tree

2 files changed

+4
-5
lines changed

2 files changed

+4
-5
lines changed

benchmark/matmul_fp8/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
# FP8 Matmul Benchmark (8192×8192)
22

3-
This document records the throughput achieved by `benchmark_matmul.py` when multiplying FP8 matrices sized `M = N = 8192` across different `K` dimensions. Each measurement relies on the default autotuning search space bundled with the benchmark. The file lives in `/weka-hg/prod/deepseek/permanent/wanglei/tilelang/benchmark/matmul_fp8`.
3+
This document records the throughput achieved by `benchmark_matmul.py` when multiplying FP8 matrices sized `M = N = 8192` across different `K` dimensions. Each measurement relies on the default autotuning search space bundled with the benchmark.
44

55
## Environment
66

benchmark/matmul_fp8/benchmark_matmul.py

Lines changed: 3 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -219,9 +219,9 @@ def main(
219219
if __name__ == "__main__":
220220
# Parse command-line arguments for matrix dimensions
221221
parser = argparse.ArgumentParser(description="Autotuned MatMul Benchmark")
222-
parser.add_argument("--m", type=int, default=8192, help="Matrix dimension M")
223-
parser.add_argument("--n", type=int, default=8192, help="Matrix dimension N")
224-
parser.add_argument("--k", type=int, default=8192, help="Matrix dimension K")
222+
parser.add_argument("--m", type=int, default=16384, help="Matrix dimension M")
223+
parser.add_argument("--n", type=int, default=16384, help="Matrix dimension N")
224+
parser.add_argument("--k", type=int, default=16384, help="Matrix dimension K")
225225
parser.add_argument(
226226
"--with_roller",
227227
action="store_true",
@@ -237,7 +237,6 @@ def main(
237237

238238
# matmul(...) returns (best_latency, best_config, ref_latency)
239239
best_result = matmul(M, N, K, with_roller)
240-
print(best_result.get_kernel_source())
241240
best_latency = best_result.latency
242241
best_config = best_result.config
243242

0 commit comments

Comments
 (0)