Does Microsoft.ML.OnnxRuntimeGenAI.Cuda (version 0.4.0) support Phi-3.5 Vision Onnx format? #943

MaxAkbar · 2024-09-28T16:16:01Z

Describe the bug
After migrating Phi-3.5-vision-instruct to Onnx format I am not able to use the NuGet package Microsoft.ML.OnnxRuntimeGenAI.Cuda version 0.4.0 to load the Onnx model. When referencing the folder where the Onnx model is I get an error that file not found.

Microsoft.ML.OnnxRuntimeGenAI.OnnxRuntimeGenAIException: 
'Load model from C:\Users\***\source\repos\models\microsoft\Phi-3.5-vision-instruct-to-onnx\phi-3.5-v-128k-instruct-vision-onnx\ 
failed:Load model C:\Users\***\source\repos\models\microsoft\Phi-3.5-vision-instruct-to-onnx\phi-3.5-v-128k-instruct-vision-onnx\ failed. 
File doesn't exist'

To Reproduce
Steps to reproduce the behavior:

Follow instructions for converting the Phi-3.5-vision-instruct to onnx format.
Create a simple c# console application and load the model:

using Microsoft.ML.OnnxRuntimeGenAI;

string modelPath = @"C:\models\microsoft\Phi-3.5-vision-instruct-onnx";
using Model model = new Model(modelPath);

After running the application you will get an error Microsoft.ML.OnnxRuntimeGenAI.OnnxRuntimeGenAIException
See error in description above.

Expected behavior
The expected behavior is to have the model loaded and be able to run inference.

Desktop (please complete the following information):

OS: Windows 11 Pro
Build: Version 10.0.26120 Build 26120
Browser Edge

Additional context
I have converted the Phi-3.5-mini-instruct to Phi-3.5-mini-instruct-cuda-fp32-onnx and able to run it without any issues.

The text was updated successfully, but these errors were encountered:

kunal-vaishnavi · 2024-09-30T19:56:59Z

The Phi-3 vision and Phi-3.5 vision models are split into three separate ONNX models: a vision component, an embedding component, and a text component. The build.py file in the instructions you linked should create all three components for you.

According to your error, the vision component cannot be found. Can you check your modelPath folder to see if you have any subfolders named vision_init_export, vision_after_export, or vision_after_opt? It's possible that something failed during the export --> optimize --> quantize process for creating the vision component. If the process failed at any point, then the latest vision component is temporarily saved in one of those subfolders before it is finally saved in modelPath. You may need to delete the modelPath folder and then re-run the build.py file with the latest ONNX Runtime version installed so that the process does not fail.

Please note that re-designed ONNX models for Phi-3 vision and Phi-3.5 vision will be published to enable multi-image support.

natke · 2024-10-03T22:10:53Z

Hi @MaxAkbar, did you check your model directory for the files that Kunal described above?

MaxAkbar · 2024-10-03T23:54:58Z

I just noticed that the file sizes are way too small, so something failed during the conversion :(. Has anyone been able to convert the vision into onnx format? I did look at the output but nothing jumped out at me as an error.

I had a thread here about how to convert to onnx: microsoft/Phi-3CookBook#187

natke · 2024-10-07T04:45:27Z

Thank you @MaxAkbar, would you be able to attach the output from the build.py script here, so that we can parse for errors?

kunal-vaishnavi · 2024-11-14T01:44:33Z

The new Phi-3 vision and Phi-3.5 vision ONNX models have now been released. The new models support no-image, single-image, and multi-image scenarios.

MaxAkbar changed the title ~~Does Microsoft.ML.OnnxRuntimeGenAI.Cuda support Phi-3.5 Onnx format?~~ Does Microsoft.ML.OnnxRuntimeGenAI.Cuda (version 0.4.0) support Phi-3.5 Onnx format? Sep 28, 2024

MaxAkbar changed the title ~~Does Microsoft.ML.OnnxRuntimeGenAI.Cuda (version 0.4.0) support Phi-3.5 Onnx format?~~ Does Microsoft.ML.OnnxRuntimeGenAI.Cuda (version 0.4.0) support Phi-3.5 Vision Onnx format? Sep 28, 2024

natke added the waiting-for-customer label Oct 3, 2024

microsoft-github-policy-service bot added the ep:CUDA label Oct 3, 2024

kunal-vaishnavi closed this as completed Nov 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Does Microsoft.ML.OnnxRuntimeGenAI.Cuda (version 0.4.0) support Phi-3.5 Vision Onnx format? #943

Does Microsoft.ML.OnnxRuntimeGenAI.Cuda (version 0.4.0) support Phi-3.5 Vision Onnx format? #943

MaxAkbar commented Sep 28, 2024 •

edited

Loading

kunal-vaishnavi commented Sep 30, 2024

natke commented Oct 3, 2024

MaxAkbar commented Oct 3, 2024 •

edited

Loading

natke commented Oct 7, 2024

kunal-vaishnavi commented Nov 14, 2024

Does Microsoft.ML.OnnxRuntimeGenAI.Cuda (version 0.4.0) support Phi-3.5 Vision Onnx format? #943

Does Microsoft.ML.OnnxRuntimeGenAI.Cuda (version 0.4.0) support Phi-3.5 Vision Onnx format? #943

Comments

MaxAkbar commented Sep 28, 2024 • edited Loading

kunal-vaishnavi commented Sep 30, 2024

natke commented Oct 3, 2024

MaxAkbar commented Oct 3, 2024 • edited Loading

natke commented Oct 7, 2024

kunal-vaishnavi commented Nov 14, 2024

MaxAkbar commented Sep 28, 2024 •

edited

Loading

MaxAkbar commented Oct 3, 2024 •

edited

Loading