Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Input/output error Verify all devices in group 29 are bound to vfio-<bus> or pci-stub and not already in use when trying to start vm #95

Open
lakernel opened this issue May 15, 2022 · 2 comments

Comments

@lakernel
Copy link

lakernel commented May 15, 2022

When i to start my win10 vm i get the error:

Input/output error Verify all devices in group 29 are bound to vfio-<bus> or pci-stub and not already in use

Setup:

  • OS: Debian GNU/Linux 11 (bullseye) x86_64
  • Kernel: 5.10.0-14-amd64
  • CPU: AMD Ryzen 5 5600X (12) @ 3.700GHz
  • GPU1: NVIDIA GeForce RTX 2070 SUPER (main gpu)
    note: i'm trying to use both gpu's with vgpu the error is only present when using gpu1, gpu1 is also used to run gnome desktop
  • GPU2: NVIDIA GeForce GTX 980

Driver :

NVIDIA-Linux-x86_64-460.73.01-grid-vgpu-kvm-v5
with patch: https://github.com/rupansh/vgpu_unlock_5.12

dmesg | grep vfio:

[ 2720.206992] [nvidia-vgpu-vfio] 00eac34d-8b67-401e-9faa-feb8f2456c29: start failed. status: 0x1

systemctl status nvidia-vgpud.service:

May 15 13:57:18 debian nvidia-vgpud[27664]: BAR1 Length: 0x100
May 15 13:57:18 debian nvidia-vgpud[27664]: Frame Rate Limiter enabled: 0x1
May 15 13:57:18 debian nvidia-vgpud[27664]: Number of Displays: 1
May 15 13:57:18 debian nvidia-vgpud[27664]: Max pixels: 1310720
May 15 13:57:18 debian nvidia-vgpud[27664]: Display: width 1280, height 1024
May 15 13:57:18 debian nvidia-vgpud[27664]: License: GRID-Virtual-Apps,3.0
May 15 13:57:18 debian nvidia-vgpud[27664]: PID file unlocked.
May 15 13:57:18 debian nvidia-vgpud[27664]: PID file closed.
May 15 13:57:18 debian nvidia-vgpud[27664]: Shutdown (27664)

systemctl status nvidia-gridd.service:

May 15 13:14:12 debian nvidia-gridd[1266]: vgpu_unlock loaded.
May 15 13:14:12 debian nvidia-gridd[1357]: vgpu_unlock loaded.
May 15 13:14:12 debian nvidia-gridd[1357]: Started (1357)
May 15 13:14:13 debian nvidia-gridd[1357]: Licensing not supported for GPUs in the system
May 15 13:14:13 debian nvidia-gridd[1357]: Failed to handle license change events
May 15 13:14:13 debian nvidia-gridd[1357]: Licensing not supported for GPUs in the system
May 15 13:14:13 debian nvidia-gridd[1357]: Failed to unlock PID file: Bad file descriptor
May 15 13:14:13 debian nvidia-gridd[1357]: Failed to close PID file: Bad file descriptor
May 15 13:14:13 debian nvidia-gridd[1357]: Shutdown (1357)

journalctl -u nvidia-vgpu-mgr:

May 15 13:59:04 debian nvidia-vgpu-mgr[32260]: VgpuStart {
uuid: {00eac34d-8b67-401e-9faa-feb8f2456c29},
config_params: "vgpu_type_id=258",
unknown_410: [182, 127, 0, 0, 0, 9, 0, 0, 1, 0, 0, 0, 0, 9, 0, 0],
}
May 15 13:59:04 debian nvidia-vgpu-mgr[1326]: VgpuStart {
uuid: {00eac34d-8b67-401e-9faa-feb8f2456c29},
config_params: "vgpu_type_id=258",
unknown_410: [182, 127, 0, 0, 0, 9, 0, 0, 1, 0, 0, 0, 0, 9, 0, 0],
}
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: notice: vmiop_env_log: vmiop-env: guest_max_gpfn:0x0
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: notice: vmiop_env_log: (0x0): Received start call from nvidia-vgpu-vfio module: mdev uuid 00eac34d-8b67-401e-9faa-feb8f2456c29 GPU PCI id 00:09:00.0 config params >
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: notice: vmiop_env_log: (0x0): pluginconfig: vgpu_type_id=258
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: notice: vmiop_env_log: Successfully updated env symbols!
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: op_type: 0x20801322 failed.
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: op_type: 0x2080014b failed.
May 15 13:59:04 debian nvidia-vgpu-mgr[32741]: vgpu_unlock loaded.
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: op_type: 0xa0820102 failed.
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: error: vmiop_log: NVOS status 0x56
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: error: vmiop_log: Assertion Failed at 0x9ace12f0:293
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: error: vmiop_log: 11 frames returned by backtrace
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: error: vmiop_log: /lib/x86_64-linux-gnu/libnvidia-vgpu.so(_nv004938vgpu+0x26) [0x7fcf9ad33e76]
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: error: vmiop_log: /lib/x86_64-linux-gnu/libnvidia-vgpu.so(+0x8923a) [0x7fcf9ace023a]
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: error: vmiop_log: /lib/x86_64-linux-gnu/libnvidia-vgpu.so(+0x8a2f0) [0x7fcf9ace12f0]
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: error: vmiop_log: /lib/x86_64-linux-gnu/libnvidia-vgpu.so(+0x8cecf) [0x7fcf9ace3ecf]
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: error: vmiop_log: vgpu(+0x13b89) [0x5642be413b89]
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: error: vmiop_log: vgpu(+0x14c89) [0x5642be414c89]
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: error: vmiop_log: vgpu(+0xeb80) [0x5642be40eb80]
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: error: vmiop_log: vgpu(+0xc336) [0x5642be40c336]
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: error: vmiop_log: vgpu(+0x3b7a) [0x5642be403b7a]
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: error: vmiop_log: /lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xea) [0x7fcf9b230d0a]
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: error: vmiop_log: vgpu(+0x3bbd) [0x5642be403bbd]
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: error: vmiop_log: (0x0): init_device_instance failed for inst 0 with error 1 (error setting vGPU configuration information from RM)
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: error: vmiop_log: (0x0): Initialization: init_device_instance failed error 1
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: error: vmiop_log: display_init failed for inst: 0
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: error: vmiop_env_log: (0x0): vmiope_process_configuration: plugin registration error
May 15 13:59:04 debian nvidia-vgpu-mgr[32740]: error: vmiop_env_log: (0x0): vmiope_process_configuration failed with 0x1f

@stapler117
Copy link

Input/output error Verify all devices in group 29 are bound to vfio-<bus> or pci-stub and not already in use

This particular error has shown up for me when I had a custom profile with an incorrect framebuffer value. Are you using custom profiles?

@lakernel
Copy link
Author

nope i was using a clean install of debian

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants