Zipformer Onnx FP16 #1671
Conversation
Signed-off-by: manickavela29 <[email protected]>
Could you describe how you tested it? Does it work on CPU? Also, could you use fp16.onnx as the suffix when fp16 is used?
Signed-off-by: manickavela29 <[email protected]>
I have tested it with an A10 GPU and it is holding up well in sherpa-onnx. I haven't tested this on CPU, but AVX512 has support for sure, and I think AVX2 will also handle it. Generally, onnxruntime has a fallback mechanism: if fp16 is not inherently supported, conversion to fp32 happens implicitly. This is an optional export flag, and I wanted to keep it simple with the existing model filename. But if this is still required, I will modify the filename. FYI,
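(For context, a minimal sketch of how such an fp16 export step can be done with the onnxconverter-common package; the filenames are placeholders and this is not necessarily the exact code in this PR:)

```python
# Sketch: convert an exported fp32 ONNX model to fp16 using
# onnxconverter-common. Filenames below are hypothetical.
import onnx
from onnxconverter_common import float16

model = onnx.load("encoder.onnx")                    # fp32 export
model_fp16 = float16.convert_float_to_float16(model) # cast tensors to fp16
onnx.save(model_fp16, "encoder.fp16.onnx")
```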
Could you test the fp16 model either in icefall with onnx_pretrained.py or in sherpa-onnx on CPU?
Currently, we have
Signed-off-by: manickavela29 <[email protected]>
I ran it once with sherpa-onnx, and it was up and running.
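(As a reference for the CPU check requested above, a hedged sketch of loading the fp16 model with onnxruntime on CPU; the path is a placeholder:)

```python
# Sketch: load the fp16 model on the CPUExecutionProvider and inspect
# its I/O; on CPU, onnxruntime handles ops lacking fp16 kernels via
# internal casts. The filename is hypothetical.
import onnxruntime as ort

sess = ort.InferenceSession(
    "encoder.fp16.onnx",                 # hypothetical path
    providers=["CPUExecutionProvider"],  # force CPU execution
)
for inp in sess.get_inputs():
    print(inp.name, inp.shape, inp.type)
```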
Signed-off-by: manickavela29 <[email protected]>
Thanks! Is it ready to merge?
Yes, completed from my end!
Thank you for your contribution!
Signed-off-by: manickavela29 <[email protected]>
Hi @csukuangfj, @yaozengwei,

This PR exports the zipformer ONNX model in FP16.

The model is trained in mixed precision, so exporting in fp16 shouldn't cause any accuracy loss. Tested with data, and the model accuracy is exactly the same as fp32.

cc: k2-fsa/sherpa-onnx#41, k2-fsa/sherpa-onnx#40
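(A sketch of how the fp32-vs-fp16 accuracy claim could be checked numerically; the filenames, input order, and feature shapes here are assumptions, not taken from this PR:)

```python
# Sketch: compare fp32 and fp16 encoder outputs on the same dummy input.
# Assumes hypothetical filenames and that the first model input is the
# features and the second is the feature lengths.
import numpy as np
import onnxruntime as ort

feats = np.random.randn(1, 100, 80).astype(np.float32)  # dummy fbank features
feats_lens = np.array([100], dtype=np.int64)

def run_encoder(path, x):
    sess = ort.InferenceSession(path, providers=["CPUExecutionProvider"])
    names = [i.name for i in sess.get_inputs()]
    return sess.run(None, {names[0]: x, names[1]: feats_lens})[0]

out_fp32 = run_encoder("encoder.onnx", feats)
out_fp16 = run_encoder("encoder.fp16.onnx", feats.astype(np.float16))
diff = np.abs(out_fp32.astype(np.float32) - out_fp16.astype(np.float32))
print("max abs diff:", diff.max())
```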