arcface model is invalid #91

snnn · 2018-08-30T20:47:19Z

I downloaded the model from:
https://s3.amazonaws.com/onnx-model-zoo/arcface/resnet100/resnet100.onnx

If you open the model, take a look at the second OP: Sub. Its first input, A, is a float tensor, but its second input, B, is a double tensor.

ankkhedia · 2018-08-30T21:24:43Z

Hi @snnn, I tried viewing the above model using Netron and for me the second input B for Sub operator shows up as float tensor in Netron.

snnn · 2018-08-31T04:53:42Z

Hi @ankkhedia

It's float64 ?

prasanthpul · 2018-09-11T05:04:02Z

@snnn is the problem that the type needs to be the same for both?

snnn · 2018-09-11T17:26:08Z

Yes.

prasanthpul · 2018-09-11T17:31:04Z

@ankkhedia can you fix the model?

ankkhedia · 2018-09-11T17:43:09Z

@prasanthpul I will take a look.

prasanthpul · 2018-10-29T23:05:17Z

@ankkhedia any update on this?

ankkhedia · 2018-10-29T23:08:20Z

Hi @prasanthpul Sorry for being late as got pulled into some other things. I will try to prioritise it this week.

ankkhedia · 2018-10-30T21:25:49Z

@prasanthpul @snnn It seems to be error in MXNet-ONNX converter. I have raised an issue with the team apache/mxnet#13044
I will convert and put back new model here when the issue gets fixed.

linkerzhang · 2018-11-02T17:28:37Z

This is not good. We'd remove these models if they're invalid. We can add them back after fixing those issues.

@snnn are there more model issues you saw please? Thank you very much for bringing this up!

linkerzhang · 2018-11-02T17:29:04Z

@ankkhedia

snnn · 2018-11-06T17:23:30Z

In addition to Arcface, there are also problems in:

Resnet18v1
Resnet34v1
Resnet50v1
Resnet101v1
Resnet152v1
vgg16
Vgg16_bn
Vgg19
Vgg19_bn

snnn · 2018-11-10T01:36:33Z

@ankkhedia Any update? Could you please confirm if these models have problems?

Thanks

ankkhedia · 2018-11-12T17:34:12Z

I will check other models. However, Arcface issue has been fixed and I will update the new model.

ankkhedia · 2018-11-14T18:47:37Z

Hi @snnn Could you please point to the problems with the above models you listed so that I can take a look.

snnn · 2018-11-14T19:11:47Z

The inputs to GEMM operator, are not 2D tensors. They have more than 2 dimensions.

ankkhedia · 2018-11-14T19:27:01Z

@snnn This has been discussed in this issue before. #90.
I think there was no good support for GEMM in ONNX when these models were created. ONNX do have some missing operator and are usually mapped to the closest operator in the source framework.

As far as I know, support for GEMM in ONNX-MXNet is either work in progress or has been done. I will post new model if the support has been added.

snnn · 2018-11-14T19:51:24Z

Hi @ankkhedia , do you have an estimated time of completion?

ankkhedia · 2018-11-14T19:54:59Z

@snnn I will have to check with ONNX-MXNet converter team to be able to give a clear ETA.
I will update you on the same.
If the support has not been added, then it depends upon their roadmap on when the support will be complete. The team is working actively to get rigorous operator coverage.

prasanthpul · 2018-11-14T22:29:38Z

@ankkhedia I think your last comment is about the other models. can you confirm whether arcface model has been fixed? Will you be posting a 1.3 version as well?

snnn · 2018-11-14T22:32:25Z

The issue was already there 3 months, but we still don't know when it can be fixed?
From user experience perspective, ONNX user would think ONNX model zoo is low quality. I suggest we either fix it quickly, or delete the malformed models.

prasanthpul · 2018-11-14T22:34:35Z

@snnn lets create separate issue for the other models. this issue is only for arcface.
for the other models, I agree that if we cannot fix them they should be removed for now.

ankkhedia · 2018-11-14T22:35:00Z

@snnn @prasanthpul The model has been fixed and updated in the S3. I checked the model structure with Netron and float64 issue is not there anymore.

prasanthpul · 2018-11-14T22:37:36Z

Thanks @ankkhedia. Looks like only 1.2 (opset7) version is posted. will you be posting 1.3 as well?

snnn · 2018-11-14T22:40:28Z

Hi @ankkhedia , could please verify it?
I got the model from:
'https://s3.amazonaws.com/onnx-model-zoo/arcface/resnet100/resnet100.tar.gz'

It is still wrong.

ankkhedia · 2018-11-14T22:42:18Z

@snnn Sorry for the miss. I uploaded renet100.onnx file. I will change this tar too.

ankkhedia · 2018-11-14T22:55:55Z

@snnn added the latest tar file.

ryanlai2 · 2018-11-15T00:14:08Z

Can we fix ArcFace's README.md so that the table to download the model is correct? The download link was changed to download an OpSet8 model.

Currently, there is only one download link for ArcFace and it's labeled as OpSet 7, v1.2.1. However, the link downloads an OpSet8 v1.3 version of the model.
https://github.com/onnx/models/tree/master/models/face_recognition/ArcFace

ankkhedia · 2018-11-15T00:31:42Z

updated :)

snnn · 2018-11-15T18:20:02Z

Hi @ankkhedia , the old issue is fixed, but we get new one.
For the "relu0" node, its inputs has shape of [1, 64, 112, 112] and [64]. There is no broadcast rule can be applied on them.

snnn · 2018-11-16T21:50:49Z

Hi @ankkhedia , Could you verify issue?

Thanks.

Roshrini · 2018-11-16T23:15:20Z

Hi @snnn, I verified this issue on my end. We are actively working on both Prelu and Gemm issue mentioned and re-upload the models as early as we can. Thanks for reporting this and sorry for the inconvenience it has caused.

ankkhedia · 2018-11-29T21:56:02Z

Hi @snnn There are open PR to fix the above issues with Prelu and GEMM.
I have generated a model after including those fixes https://s3.amazonaws.com/onnx-model-zoo/arcface/resnet100/resnet100_new.onnx
Could you please let me know if this model looks good to you.

We will update the model once the PR are merged.

snnn · 2018-11-30T10:25:11Z

Hi @ankkhedia , thank you for fixing it. I'm having a vacation, with poor internet connection. I'll ask my colleague for help.

snnn · 2018-12-11T20:25:58Z

The problem is solved. Thanks!

snnn · 2019-02-13T18:25:02Z

Hi @ankkhedia , would you please put the new model in https://github.com/onnx/models/tree/master/models/face_recognition/ArcFace ?

ankkhedia · 2019-02-13T18:39:19Z

@snnn I have updated the model in https://s3.amazonaws.com/onnx-model-zoo/arcface/resnet100/resnet100.onnx

Could you please verify?

snnn · 2019-02-13T18:42:56Z

Hi @ankkhedia
https://s3.amazonaws.com/onnx-model-zoo/arcface/resnet100/resnet100.tar.gz is not updated.

ankkhedia · 2019-02-13T18:48:57Z

My bad. Updating the same

snnn · 2019-02-13T18:52:54Z

And this https://s3.amazonaws.com/onnx-model-zoo/arcface/resnet100/resnet100-md5.txt ?

ankkhedia · 2019-02-13T19:00:31Z

@snnn
uploaded resnet100.tar.gz and resnet100-md5.txt now.

snnn · 2019-02-13T19:04:14Z

Perfect. Thanks!

XinyuDu · 2019-02-19T09:01:44Z

@ankkhedia Hi, How can I convert the arcface mxnet model to onnx model without the float64 error? THX!

luan1412167 · 2019-10-08T04:53:49Z

@snnn @ankkhedia I get the error. It may be same as your error. Maybe it as
#91 (comment)
Have Any your experiment help me? Thanks
2019-10-08 11:49:13.612837502 [E:onnxruntime:, sequential_executor.cc:165 Execute] Non-zero status code returned while running PRelu node. Name:'relu0' Status Message: /home/luandd/project_company/face_rec/onnxruntime/onnxruntime/core/providers/cpu/math/element_wise_ops.h:329 void onnxruntime::BroadcastIterator::Init(int64_t, int64_t) axis == 1 || axis == largest was false. Attempting to broadcast an axis by a dimension other than 1. 64 by 112

luan1412167 · 2019-10-10T02:53:30Z

@snnn @ankkhedia have you right model with spatial=1?

sky186 · 2019-12-16T09:10:50Z

@ankkhedia
hello , arcface mxnet to onnx canbe fixed?
how to convert onnx ,is right?
the prelu out not right? because Iwant to convert caffe,but the onnx can be export but is not right?

sky186 · 2019-12-16T09:12:49Z

@luan1412167
hi, now youcan convert mxnet arcface to onnx right ? I fix ,but export model prelu out not right,not to equal mxnet,could you tell me how to convert onnx right ?

HoangTienDuc · 2020-03-18T08:06:29Z

Hi @ankkhedia , the old issue is fixed, but we get new one.
For the "relu0" node, its inputs has shape of [1, 64, 112, 112] and [64]. There is no broadcast rule can be applied on them.

hi @ankkhedia @snnn i also try to convert arcface LResNet100E-IR mxnet to onnx by using convert_onnx.py. Then, it seem that, i got the same error with @snnn when i deploy my model.

onnx runtime error 1: Non-zero status code returned while running PRelu node. Name:'relu0' Status Message: relu0: right operand cannot broadcast on dim 0 LeftShape: {1,64,112,112}, RightShape: {64}

Can you guide me how to fix it?
Thank all off u.

snnn · 2020-03-18T17:23:00Z

see apache/mxnet#17711

@vinitra is fixing it.

snnn mentioned this issue Oct 2, 2018

Duplicate VGG19 models #93

Closed

ankkhedia mentioned this issue Oct 30, 2018

Export to ONNX not working as expected for ArcFace model apache/mxnet#13044

Closed

vandanavk mentioned this issue Nov 21, 2018

ONNX export: Add Flatten before Gemm apache/mxnet#13356

Merged

5 tasks

Roshrini mentioned this issue Nov 29, 2018

onnx export slope for prelu operator corrected apache/mxnet#13460

Closed

4 tasks

snnn closed this as completed Dec 11, 2018

This was referenced Feb 11, 2019

ArcFace model csharp InferenceSession #135

Closed

OnnxRuntime csharp Inference example microsoft/onnxruntime#462

Closed

snnn reopened this Feb 13, 2019

snnn closed this as completed Feb 13, 2019

vinitra-zz mentioned this issue Feb 27, 2020

[ONNX export] Fixing spatial export for batchnorm apache/mxnet#17711

Merged

4 tasks

arcface model is invalid #91

arcface model is invalid #91

Comments

snnn commented Aug 30, 2018

ankkhedia commented Aug 30, 2018

snnn commented Aug 31, 2018

prasanthpul commented Sep 11, 2018

snnn commented Sep 11, 2018

prasanthpul commented Sep 11, 2018

ankkhedia commented Sep 11, 2018

prasanthpul commented Oct 29, 2018

ankkhedia commented Oct 29, 2018

ankkhedia commented Oct 30, 2018

linkerzhang commented Nov 2, 2018

linkerzhang commented Nov 2, 2018

snnn commented Nov 6, 2018

snnn commented Nov 10, 2018

ankkhedia commented Nov 12, 2018

ankkhedia commented Nov 14, 2018

snnn commented Nov 14, 2018

ankkhedia commented Nov 14, 2018

snnn commented Nov 14, 2018

ankkhedia commented Nov 14, 2018

prasanthpul commented Nov 14, 2018

snnn commented Nov 14, 2018

prasanthpul commented Nov 14, 2018

ankkhedia commented Nov 14, 2018 • edited Loading

prasanthpul commented Nov 14, 2018

snnn commented Nov 14, 2018

ankkhedia commented Nov 14, 2018 • edited Loading

ankkhedia commented Nov 14, 2018

ryanlai2 commented Nov 15, 2018 • edited Loading

ankkhedia commented Nov 15, 2018

snnn commented Nov 15, 2018

snnn commented Nov 16, 2018

Roshrini commented Nov 16, 2018

ankkhedia commented Nov 29, 2018

snnn commented Nov 30, 2018

snnn commented Dec 11, 2018

snnn commented Feb 13, 2019

ankkhedia commented Feb 13, 2019 • edited Loading

snnn commented Feb 13, 2019

ankkhedia commented Feb 13, 2019

snnn commented Feb 13, 2019

ankkhedia commented Feb 13, 2019

snnn commented Feb 13, 2019

XinyuDu commented Feb 19, 2019 • edited Loading

luan1412167 commented Oct 8, 2019 • edited Loading

luan1412167 commented Oct 10, 2019

sky186 commented Dec 16, 2019

sky186 commented Dec 16, 2019

HoangTienDuc commented Mar 18, 2020

snnn commented Mar 18, 2020 • edited Loading

ankkhedia commented Nov 14, 2018 •

edited

Loading

ankkhedia commented Nov 14, 2018 •

edited

Loading

ryanlai2 commented Nov 15, 2018 •

edited

Loading

ankkhedia commented Feb 13, 2019 •

edited

Loading

XinyuDu commented Feb 19, 2019 •

edited

Loading

luan1412167 commented Oct 8, 2019 •

edited

Loading

snnn commented Mar 18, 2020 •

edited

Loading