Replies: 3 comments 5 replies
-
In https://www.tensorflow.org/lite/performance/model_optimization there is a guide on how to use fp16- and int16-quantized tflite models. However, all of the examples keep the input/output as float32 or uint8 and quantize parameters INSIDE the model. I have never seen a 16-bit input/output model yet, but they could appear in the future.
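For reference, this is roughly what the fp16 post-training quantization flow in that guide looks like; the saved-model path and output filename below are placeholders. Note that the resulting model still exposes float32 input/output tensors, only the weights are stored as float16:

```python
import tensorflow as tf

# Post-training float16 quantization, following the TFLite
# model-optimization guide. "saved_model_dir" is a placeholder path.
converter = tf.lite.TFLiteConverter.from_saved_model("saved_model_dir")
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.target_spec.supported_types = [tf.float16]  # weights become fp16

tflite_fp16_model = converter.convert()

# The converted model keeps float32 input/output tensors;
# the quantization happens inside the model.
with open("model_fp16.tflite", "wb") as f:
    f.write(tflite_fp16_model)
```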
-
I think that fp16 quantization is easier to convert to than 8-bit integer quantization, so it will be a necessary feature. I have an fp16-quantized model, but it failed to run.
Can I use an fp16-quantized model with the build option -DFLOAT16_SUPPORT now?
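One way to check what the pipeline actually has to handle is to inspect the I/O tensor dtypes of the converted model with the TFLite interpreter; a small sketch (the model filename is just an example):

```python
import tensorflow as tf

# Load the fp16-quantized model and inspect its I/O tensor types.
# "model_fp16.tflite" is an example filename.
interpreter = tf.lite.Interpreter(model_path="model_fp16.tflite")
interpreter.allocate_tensors()

for detail in interpreter.get_input_details():
    print("input :", detail["name"], detail["dtype"], detail["shape"])
for detail in interpreter.get_output_details():
    print("output:", detail["name"], detail["dtype"], detail["shape"])

# For models produced by the guide above, these dtypes are still float32;
# a true float16 input/output tensor would need fp16 support in the
# surrounding pipeline.
```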
-
Anyway, FP16 support is merged.
-
Q1: should we support it or not?
(IF Q1==yes)
Q2: should this be "binary16" (an arbitrary binary value whose semantics we do NOT care about), or should we treat it as floating-point numbers?
If Q2 == float, it means that, potentially, tensor_transform will do some arithmetic on it (see the sketch below).
If Q2 == binary, it means that tensor_transform won't be able to touch it.
(IF Q2==float)
Q3: Is there a general consensus on its format that we can follow?
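To illustrate the difference between the two options in Q2, here is a NumPy sketch (not tensor_transform code): if the element type is treated as float, arithmetic such as scaling is well defined; if it is treated as opaque binary16, the same buffer can only be passed through byte-for-byte.

```python
import numpy as np

# A small buffer of IEEE 754 binary16 values.
buf = np.array([0.5, 1.5, -2.0], dtype=np.float16).tobytes()

# Q2 == float: interpret the bytes as float16 and do arithmetic,
# which is what tensor_transform would be allowed to do.
as_float = np.frombuffer(buf, dtype=np.float16)
print(as_float * 2.0)   # [ 1.  3. -4.]

# Q2 == binary: treat the same bytes as an opaque 16-bit payload;
# no arithmetic is meaningful, the data can only be copied through.
as_opaque = np.frombuffer(buf, dtype=np.uint16)
print(as_opaque)        # raw bit patterns: [14336 15872 49152]
```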