
[torchbench] doctr_reco_predictor fails to run inference on dynamo. #6832

Closed
ysiraichi opened this issue Mar 27, 2024 · 1 comment
ysiraichi (Collaborator) commented Mar 27, 2024

🐛 Bug

Running the upstreamed benchmarking scripts with the following command results in an unexpected error.

python xla/benchmarks/experiment_runner.py \
       --suite-name torchbench \
       --accelerator cuda \
       --xla PJRT \
       --dynamo openxla \
       --test eval \
       --repeat 8 --iterations-per-run 1 \
       --print-subprocess \
       --no-resume -k doctr_reco_predictor
Traceback (most recent call last):
  File "xla/benchmarks/experiment_runner.py", line 945, in <module>
    main()
  File "xla/benchmarks/experiment_runner.py", line 941, in main
    runner.run()
  File "xla/benchmarks/experiment_runner.py", line 61, in run
    self.run_single_config()
  File "xla/benchmarks/experiment_runner.py", line 256, in run_single_config
    metrics, last_output = self.run_once_and_gather_metrics(
  File "xla/benchmarks/experiment_runner.py", line 345, in run_once_and_gather_metrics
    output, _ = loop(iter_fn=self._default_iter_fn)
  File "xla/benchmarks/experiment_runner.py", line 302, in loop
    output, timing, trace = iter_fn(benchmark_experiment, benchmark_model,
  File "xla/benchmarks/experiment_runner.py", line 218, in _default_iter_fn
    output = benchmark_model.model_iter_fn(
  File "torch/_dynamo/eval_frame.py", line 390, in _fn
    return fn(*args, **kwargs)
  File "xla/benchmarks/benchmark_model.py", line 170, in eval
    pred = self.module(*inputs)
  File "torch/nn/modules/module.py", line 1527, in _wrapped_call_impl
    return self._call_impl(*args, **kwargs)
  File "torch/nn/modules/module.py", line 1536, in _call_impl
    return forward_call(*args, **kwargs)
  File "/lib/python3.8/site-packages/doctr/models/recognition/crnn/pytorch.py", line 224, in forward
    out["preds"] = self.postprocessor(logits)
  File "/lib/python3.8/site-packages/doctr/models/recognition/crnn/pytorch.py", line 97, in __call__
    return self.ctc_best_path(logits=logits, vocab=self.vocab, blank=len(self.vocab))
  File "/lib/python3.8/site-packages/doctr/models/recognition/crnn/pytorch.py", line 55, in ctc_best_path
    @staticmethod
  File "torch/_dynamo/eval_frame.py", line 390, in _fn
    return fn(*args, **kwargs)
  File "torch/_dynamo/external_utils.py", line 36, in inner
    return fn(*args, **kwargs)
  File "torch/_functorch/aot_autograd.py", line 917, in forward
    return compiled_fn(full_args)
  File "torch/_functorch/_aot_autograd/utils.py", line 89, in g
    return f(*args)
  File "torch/_functorch/_aot_autograd/runtime_wrappers.py", line 107, in runtime_wrapper
    all_outs = call_func_at_runtime_with_args(
  File "torch/_functorch/_aot_autograd/utils.py", line 113, in call_func_at_runtime_with_args
    out = normalize_as_list(f(args))
  File "torch/_functorch/_aot_autograd/jit_compile_runtime_wrappers.py", line 181, in rng_functionalization_wrapper
    return compiled_fw(args)
  File "torch/_functorch/_aot_autograd/utils.py", line 89, in g
    return f(*args)
  File "torch/_dynamo/backends/torchxla.py", line 36, in fwd
    compiled_graph = bridge.extract_compiled_graph(model, args)
  File "xla/torch_xla/core/dynamo_bridge.py", line 617, in extract_compiled_graph
    xm.mark_step()
  File "xla/torch_xla/core/xla_model.py", line 1056, in mark_step
    torch_xla._XLAC._xla_step_marker(
RuntimeError: Bad StatusOr access: INTERNAL: during context [Unknown]: Seen floating point types of different precisions in %concatenate.10776 = f32[4,64,128]{2,1,0} concatenate(f16[1,64,128]{2,1,0} %reshape.10772, f16[1,64,128]{2,1,0} %reshape.10773, f32[1,64,128]{2,1,0} %reshape.10774, f32[1,64,128]{2,1,0} %reshape.10775), dimensions={0}, but mixed precision is disallowed.
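For context, the HLO above rejects a concatenate whose operands mix f16 and f32. In eager PyTorch the same mix is legal, because torch.cat applies type promotion and upcasts everything to float32; the XLA lowering traced here apparently kept the original per-operand dtypes instead. A plain-PyTorch sketch of that eager behavior (no XLA involved; shapes chosen to mirror the failing concatenate, purely as an illustration):

```python
import torch

# Two half-precision operands and one full-precision operand, mirroring
# the f16/f16/f32 mix seen in the failing HLO concatenate.
half = torch.ones(1, 64, 128, dtype=torch.float16)
full = torch.ones(2, 64, 128, dtype=torch.float32)

# Eager torch.cat promotes the inputs to a common dtype (float32),
# so no error is raised here -- unlike the traced HLO, where the
# operands reach XLA's concatenate with their dtypes unchanged.
out = torch.cat([half, half, full], dim=0)
print(out.dtype)   # torch.float32
print(out.shape)   # torch.Size([4, 64, 128])
```

This suggests a missing dtype-promotion step in the lowering path rather than a problem in the model itself.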

Environment

  • PyTorch Commit: a52b4e22571507abc35c2d47de138497190d2e0a
  • PyTorch/XLA Commit: 84e7feb
  • PyTorch/benchmark Commit: d6015d42d9a1834bc7595c4bd6852562fb80b30b

cc @miladm @JackCaoG @vanbasten23 @zpcore @frgossen @golechwierowicz @cota

zpcore (Collaborator) commented Apr 4, 2024

Same cause as #6831; closing for now.

zpcore closed this as completed Apr 4, 2024