Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

errors when i run train.py #8

Open
phchtl opened this issue Oct 19, 2023 · 0 comments
Open

errors when i run train.py #8

phchtl opened this issue Oct 19, 2023 · 0 comments

Comments

@phchtl
Copy link

phchtl commented Oct 19, 2023

Hello, I've been studying your dl2 paper and code recently, and I have a few questions I'd like to ask you! Currently, my environment is Python 2.7 and TensorFlow-GPU 1.15. I've installed the necessary Python packages to run the code, modified the training mode in parameters.py to "sl," and also adjusted these two parameters: JOB_LEN_PATTERN to "Normal" and JOB_ARRIVAL_PATTERN to "Google_Trace." When I attempt to pre-train the model in sl mode, I encounter a deadlock issue. The code seems to occupy GPU resources but doesn't actually utilize the graphics card. What should I do to ensure that the code runs properly? I haven't loaded any other datasets like the Google cluster trace, and I haven't installed any modified versions of MXNet. Thank you very much.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant