[Task]: RunInference - send failures to dead letter queue #24209
Labels: done & done, ml, P2, python, run-inference, task
What needs to happen?
Right now, if RunInference fails on a batch of inferences, the whole transform fails. For batch pipelines this means the pipeline fails on non-retryable errors (which account for most inference failures); for streaming pipelines it means infinite retries and a stuck pipeline.

We should instead handle failures by passing them to the next step as part of the PredictionResult object, so that users can perform custom error handling (see the sketch below). We should also document this behavior in the PyDoc and on our website.
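For illustration, here is a minimal sketch of the dead-letter pattern built from core Beam tagged outputs. The `run_model` callable stands in for a real model handler, and `InferenceWithDeadLetterFn` and its output tag are hypothetical names, not part of the RunInference API; an alternative shape, closer to what the body above proposes, would carry the error inside the `PredictionResult` itself.

```python
# Hedged sketch: route inference failures to a side output instead of
# failing the transform. `run_model` is a hypothetical stand-in for a
# model handler; RunInference does not expose this behavior yet.
import apache_beam as beam
from apache_beam.ml.inference.base import PredictionResult


class InferenceWithDeadLetterFn(beam.DoFn):
    """Wraps a model call and tags failures instead of raising."""

    FAILED = 'failed_inferences'

    def __init__(self, run_model):
        # run_model: example -> inference result.
        self._run_model = run_model

    def process(self, example):
        try:
            inference = self._run_model(example)
        except Exception as e:  # non-retryable failures go to the DLQ output
            yield beam.pvalue.TaggedOutput(self.FAILED, (example, repr(e)))
            return
        yield PredictionResult(example=example, inference=inference)


with beam.Pipeline() as p:
    results = (
        p
        | beam.Create([1, 2, 3])
        | beam.ParDo(InferenceWithDeadLetterFn(lambda x: 1.0 / (x - 2)))
          .with_outputs(InferenceWithDeadLetterFn.FAILED, main='ok'))

    # Good predictions continue down the main path...
    _ = results.ok | 'LogOk' >> beam.Map(print)
    # ...while failures land in a side output users can send to a DLQ sink.
    _ = results.failed_inferences | 'LogFailed' >> beam.Map(print)
```

Routing failures to a tagged side output keeps the main path typed as PredictionResult while giving streaming pipelines a way to drain bad elements instead of retrying them forever.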
Issue Priority
Priority: 2
Issue Component
Component: run-inference