Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Staging to main: Merge multiple Dockerfiles into a single one #2189

Merged
merged 1 commit into from
Nov 15, 2024

Conversation

miguelgfierro
Copy link
Collaborator

Merge multiple Dockerfiles into a single one

Signed-off-by: Simon Zhao [email protected]
Co-authored-by: Miguel Fierro [email protected]

Description

Related Issues

References

Checklist:

  • I have followed the contribution guidelines and code style for this project.
  • I have added tests covering my contributions.
  • I have updated the documentation accordingly.
  • I have signed the commits, e.g. git commit -s -m "your commit message".
  • This PR is being made to staging branch AND NOT TO main branch.

Merge multiple Dockerfiles into a single one
---------
Signed-off-by: Simon Zhao <[email protected]>
Co-authored-by: Miguel Fierro <[email protected]>
@miguelgfierro
Copy link
Collaborator Author

I checked and the GPU tests that were failing now are working: https://github.com/recommenders-team/recommenders/actions/runs/11831613396
Everything is green.

// https://github.com/devcontainers/features/blob/main/src/anaconda/devcontainer-feature.json
"ghcr.io/devcontainers/features/anaconda:1": {
"version": "2024.06-1"
"build": {
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some of the GPU tests failed. When looking at the AzureML workspace I see this error: Provisioning error
The specified Virtual Machine size in low priority is currently out of capacity. Please retry later, try reducing the Virtual Machine size or number of instances, try using dedicated VMs to improve chances of capacity allocations, or try deploying to a different region.

I'll try to run the tests again, otherwise, I'll try to see if we need new VMs.

Copy link
Collaborator

@SimonYansenZhao SimonYansenZhao Nov 15, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I find the VM size is specified in tests/ci/azureml_tests/submit_groupwise_azureml_pytest.py. Maybe we need to use another one, because the code was written almost 3 years ago.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The VMs are chosen from the portal. Finally it worked. I think it was a momentary error.
Merging.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@SimonYansenZhao the process automatically cancells after 6h. I was trying to find how this limit is defined to reduce it, but I don't know where we are setting this value. Do you know?

Copy link
Collaborator

@SimonYansenZhao SimonYansenZhao Nov 16, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@miguelgfierro miguelgfierro merged commit 65fae79 into main Nov 15, 2024
85 checks passed
github-actions bot pushed a commit that referenced this pull request Nov 15, 2024
Staging to main: Merge multiple Dockerfiles into a single one
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants