Sagemaker Localization Timeout #556

@kannappan

Description

SageMaker times out during long-running inference jobs.

Would changing the instance type or the instance count solve this?

Instance Type: ml.m4.4xlarge

20:53:12
10.32.0.2 - - [03/Jan/2019:20:53:11 +0000] "POST /invocations HTTP/1.1" 200 231 "-" "AHC/2.0"

20:53:15
2019/01/03 20:53:14 [error] 55#55: *754210 upstream timed out (110: Connection timed out) while reading response header from upstream, client: 10.32.0.2, server: , request: "POST /invocations HTTP/1.1", upstream: "http://unix:/tmp/gunicorn.sock/invocations", host: "model.aws.local:8080"

20:53:15
10.32.0.2 - - [03/Jan/2019:20:53:14 +0000] "POST /invocations HTTP/1.1" 504 192 "-" "AHC/2.0"

20:53:15
[2019-01-03 20:53:14 +0000] [45] [CRITICAL] WORKER TIMEOUT (pid:3906)
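
For reference, the `WORKER TIMEOUT` line comes from gunicorn, and the 504 comes from nginx giving up on the gunicorn socket. Below is a minimal sketch of where the worker timeout could be raised, assuming the container uses the usual nginx + gunicorn layout that the socket path in the log suggests. The file name, the environment variable names, and the values are assumptions for illustration; none of them are shown in this issue.

```python
# gunicorn.conf.py: hypothetical config file, the real one is not shown in this issue.
import os

# Socket path taken from the nginx error line in the log above.
bind = "unix:/tmp/gunicorn.sock"

# Seconds a worker may stay silent before the gunicorn master kills it,
# which is what "[CRITICAL] WORKER TIMEOUT" reports. Gunicorn's default is 30.
# MODEL_SERVER_TIMEOUT is an assumed environment variable name.
timeout = int(os.environ.get("MODEL_SERVER_TIMEOUT", "120"))

# Number of worker processes. An assumed value, not taken from the issue.
workers = int(os.environ.get("MODEL_SERVER_WORKERS", "2"))
```

If this is the cause, nginx's `proxy_read_timeout` (60 seconds by default) would need to be at least as large as the gunicorn `timeout`, otherwise nginx would still return a 504 before the worker finishes.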
